Deploy healthcare data foundations in healthcare data solutions

Healthcare data foundations offer ready-to-run data pipelines to help you structure data for analytics and AI/machine learning modeling. You must deploy the healthcare data foundations before you use any other healthcare data solutions capability.

Deploy healthcare data foundations

You can set up the healthcare data foundations capability by using the Setup your solution wizard on the healthcare data solutions home page. This step is required after you deploy your healthcare data solutions environment. For the detailed steps, see Deploy healthcare data foundations.

After deployment, the capability is listed on the healthcare data solutions home page under Data preparation. Select the capability to explore the deployed artifacts.

Artifacts

The following table lists the details for the Fabric environment, the data pipeline, and the notebooks deployed by the healthcare data foundations capability. To learn more about the lakehouses that they use, see Medallion lakehouse design.

Caution

Healthcare data foundations deploy with preconfigured values that are required for healthcare data solutions in Microsoft Fabric. Make sure that you don't modify any of the configuration values for these artifacts.

| Artifact | Type | Description |
| --- | --- | --- |
| healthcare#_environment | Fabric environment | Preconfigures the required Fabric runtime version (Runtime 1.2, with Apache Spark 3.4 and Delta Lake 2.4) and provides the other public and custom libraries that healthcare data solutions in Microsoft Fabric need. The data pipelines use this environment instead of the workspace-level runtime settings. |
| healthcare#_msft_config_notebook | Notebook | Reads and populates the global configuration values from the admin lakehouse. This notebook's parameters are preconfigured during the deployment process. |
| healthcare#_msft_bronze_silver_flatten | Notebook | Flattens the clinical dataset from the ClinicalFhir table in the bronze lakehouse to the respective FHIR resource table and other tables in the silver lakehouse healthcare data model. It flattens only the primary-level fields in the silver lakehouse. Nested or deeper hierarchical structures within the data retain their original structure. Note: Don't run more than one instance of this notebook at a time, because it can cause inconsistent results. |
| healthcare#_msft_fhir_flattening_sample | Notebook | Provides examples of how to access FHIR extension data and use it within a dataframe. FHIR extensions are child elements that provide more information about an element in a FHIR resource. Currently, these extensions are supported as strings within the healthcare data model in the silver lakehouse. |
| healthcare#_msft_fhir_ndjson_bronze_ingestion | Notebook | Ingests FHIR NDJSON data into delta tables within the bronze lakehouse. Note: Don't run more than one instance of this notebook at a time, because it can cause inconsistent results. |
| healthcare#_msft_raw_process_movement | Notebook | Uses the healthcare data solutions library to extract ZIP files and organize them for multiple modalities, which contain various namespaces and file extensions. The notebook moves the files based on the fileOrchestrationConfig.json configuration, which contains mapping details such as modality, modality_format, and extension. It adds a timestamp prefix to the original file names to ensure uniqueness of files across multiple source systems. |
| healthcare#_msft_clinical_data_foundation_ingestion | Data pipeline | Orchestrates the execution of the notebooks and activities required for end-to-end ingestion of clinical data from the unified folder structure in the bronze lakehouse. This data is then moved to the healthcare data model in the silver lakehouse. |
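
The ingestion and flattening behavior described in the table can be illustrated with a minimal Python sketch. This is not the code in the deployed notebooks, which run on Spark and shouldn't be modified. It only assumes that NDJSON input holds one FHIR resource per line, that primary-level fields become columns while nested values keep their structure, and that the extension element is stored as a string. The function names, the sample Patient resources, and the extension URL are hypothetical.

```python
import json


def parse_ndjson(text):
    """Parse NDJSON text: one JSON object per non-empty line."""
    return [json.loads(line) for line in text.splitlines() if line.strip()]


def flatten_primary_level(resource):
    """Promote primary-level fields to columns and leave nested values as-is.

    Assumption for illustration: the extension element is serialized to a
    JSON string, mirroring the statement that extensions are stored as strings.
    """
    row = {}
    for key, value in resource.items():
        if key == "extension":
            row[key] = json.dumps(value)
        else:
            row[key] = value  # nested dicts and lists keep their original structure
    return row


def read_extension(row, url):
    """Return the extension entry with the given url, or None if absent."""
    for ext in json.loads(row.get("extension", "[]")):
        if ext.get("url") == url:
            return ext
    return None


# Hypothetical sample data: two Patient resources, one with an extension.
sample = "\n".join([
    json.dumps({
        "resourceType": "Patient",
        "id": "p1",
        "gender": "female",
        "name": [{"family": "Doe", "given": ["Jane"]}],
        "extension": [
            {"url": "http://example.org/fhir/birth-place", "valueString": "Seattle"}
        ],
    }),
    json.dumps({"resourceType": "Patient", "id": "p2", "gender": "male"}),
])

rows = [flatten_primary_level(resource) for resource in parse_ndjson(sample)]
```

In this sketch, `rows[0]["name"]` is still a nested list, while `rows[0]["extension"]` is a string that has to be parsed (here with `read_extension`) before its values can be used.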

A screenshot displaying the deployed artifacts.
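
The file movement performed by healthcare#_msft_raw_process_movement can be pictured with the following Python sketch. It is an illustration only and doesn't use the healthcare data solutions library. The mapping entries, the destination folder layout, the timestamp format, and the `plan_destination` function are hypothetical; only the mapping keys (modality, modality_format, extension) and the timestamp prefix come from the description above. The real fileOrchestrationConfig.json schema might differ.

```python
from datetime import datetime, timezone
from pathlib import PurePosixPath

# Hypothetical mapping entries; not the deployed fileOrchestrationConfig.json.
FILE_ORCHESTRATION_CONFIG = [
    {"modality": "Clinical", "modality_format": "FHIR-NDJSON", "extension": ".ndjson"},
    {"modality": "Imaging", "modality_format": "DICOM", "extension": ".dcm"},
]


def plan_destination(file_name, namespace, config, now=None):
    """Return the destination path for a file, or None if no mapping matches.

    The original file name gets a timestamp prefix so that files with the
    same name from different source systems don't collide.
    """
    now = now or datetime.now(timezone.utc)
    suffix = PurePosixPath(file_name).suffix.lower()
    for entry in config:
        if entry["extension"] == suffix:
            stamped_name = f"{now.strftime('%Y%m%d%H%M%S')}_{file_name}"
            # Assumed layout: <modality>/<modality_format>/<namespace>/<file>
            return PurePosixPath(
                entry["modality"], entry["modality_format"], namespace, stamped_name
            )
    return None
```

Calling `plan_destination("patients.ndjson", "SourceA", FILE_ORCHESTRATION_CONFIG)` returns a path under `Clinical/FHIR-NDJSON/SourceA/` whose file name starts with the current UTC timestamp. A file with an unmapped extension returns `None`.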

References