You can use the Microsoft Fabric OneLake catalog source to extract metadata from a Microsoft Fabric OneLake source system.
Microsoft Fabric OneLake is the data lake in Microsoft Fabric that stores analytics data.
Extracted metadata
You can extract specific metadata from source systems that you connect to with a Microsoft Fabric OneLake catalog source.
Metadata Command Center extracts files and folders from a Microsoft Fabric OneLake source system.
You can extract workbooks, worksheets, and columns from Microsoft Excel files.
Metadata Command Center extracts the following metadata from a Microsoft Fabric OneLake source system:
•File system
•Flat field
•Flat file
•Folder
•Hierarchical file
•Hierarchical field
File types
The following table lists the structures associated with the file types that you can extract metadata from:
File type | Partition structure
AVRO      | Single partition, multiple partitions, schema merge
CSV       | Single partition, multiple partitions, schema merge
JSON      | Single partition, multiple partitions, schema merge
Parquet   | Single partition, multiple partitions, schema merge
XML       | Single partition, multiple partitions, schema merge
You can also extract metadata from TSV and TXT file types.
You can extract metadata from XML and XSD file formats as XML file objects. Metadata Command Center extracts only elements and attributes from XML and XSD files. If an XML file is larger than 100 KB, Metadata Command Center extracts metadata only from the first 100 KB of the file. For XSD files, Metadata Command Center extracts the complete metadata.
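The size rule for XML and XSD files can be sketched as a small check that you might run against a file inventory before extraction. This is an illustrative sketch, not part of Metadata Command Center: the function name xml_extraction_scope is hypothetical, and the byte value of the limit assumes 1 KB = 1024 bytes, which the documentation does not specify.

```python
from pathlib import PurePosixPath

# Assumption: 1 KB = 1024 bytes. The documentation only states "100 KB".
XML_LIMIT_BYTES = 100 * 1024


def xml_extraction_scope(filename: str, size_bytes: int) -> str:
    """Classify how much of a file is read for metadata extraction.

    Returns "complete", "partial", or "not-xml". Hypothetical helper that
    only mirrors the documented rule; it does not call any product API.
    """
    suffix = PurePosixPath(filename).suffix.lower()
    if suffix == ".xsd":
        # XSD files are always extracted completely.
        return "complete"
    if suffix == ".xml":
        # XML files larger than the limit are read only up to the limit.
        return "partial" if size_bytes > XML_LIMIT_BYTES else "complete"
    return "not-xml"
```

For example, under these assumptions a 200,000-byte orders.xml is classified as "partial", while an XSD file of any size is classified as "complete".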
You can extract metadata from the following Microsoft Excel file types:
•Excel 97-2003 Workbook with XLS extension
•Excel Workbook with XLSX extension
•Excel Macro-Enabled Workbook with XLSM extension
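To check which files in a OneLake folder listing fall under the supported file types above, a lookup by file extension is one possible approach. The sketch below is illustrative only: the helper name supported_file_type is hypothetical, and the extensions for AVRO, CSV, JSON, Parquet, XML, XSD, TSV, and TXT are assumed conventions, because the documentation names explicit extensions only for the Excel types.

```python
from pathlib import PurePosixPath
from typing import Optional

# Assumption: conventional extensions for each documented file type.
# Only the Excel extensions (XLS, XLSX, XLSM) are stated in the documentation.
EXTENSION_TO_TYPE = {
    ".avro": "AVRO",
    ".csv": "CSV",
    ".json": "JSON",
    ".parquet": "Parquet",
    ".xml": "XML",
    ".xsd": "XML",  # XSD files are extracted as XML file objects
    ".tsv": "TSV",
    ".txt": "TXT",
    ".xls": "Excel 97-2003 Workbook",
    ".xlsx": "Excel Workbook",
    ".xlsm": "Excel Macro-Enabled Workbook",
}


def supported_file_type(path: str) -> Optional[str]:
    """Return the documented file type for a path, or None if not listed."""
    suffix = PurePosixPath(path).suffix.lower()
    return EXTENSION_TO_TYPE.get(suffix)
```

Files for which the helper returns None, such as a DOCX file, are not among the file types listed in this section.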
Data profiling for Microsoft Fabric OneLake objects
Configure data profiling to run profiles on the metadata extracted from a Microsoft Fabric OneLake source system. You can view the profiling statistics in Data Governance and Catalog.
You can run data profiles on the following file types:
•Delimited
•AVRO
•CSV
•JSON
•Parquet
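The profiling list is narrower than the list of file types that support metadata extraction. As a rough illustration, the two lists can be compared as sets. The names below are hypothetical, and the sets simply restate the lists in this document.

```python
# File types from the partition structure table in this document.
EXTRACTABLE_TYPES = {"AVRO", "CSV", "JSON", "Parquet", "XML"}

# Objects listed as supporting data profiling in this document.
PROFILABLE_TYPES = {"Delimited", "AVRO", "CSV", "JSON", "Parquet"}


def can_profile(file_type: str) -> bool:
    """Return True if the file type is listed as supporting data profiling."""
    return file_type in PROFILABLE_TYPES


# Types that appear in the extraction table but not in the profiling list.
extract_only = sorted(EXTRACTABLE_TYPES - PROFILABLE_TYPES)
```

With the lists as documented here, extract_only contains only XML, which means XML files appear in the extraction table but are not listed for profiling.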
Compatible connectors
Before you configure a Microsoft Fabric OneLake catalog source, you must connect to the Microsoft Fabric OneLake source system.
Use the Microsoft Fabric OneLake connector to connect to the Microsoft Fabric OneLake source system.
For information about configuring a connection, see Connections.