About the Microsoft SharePoint Online catalog source
You can use the Microsoft SharePoint Online catalog source to extract metadata from the Microsoft SharePoint Online source system.
Microsoft SharePoint Online is a web-based cloud service that integrates with Microsoft Office. It provides a collaboration platform where users can store, share, and manage content.
Extracted metadata
You can extract specific metadata from a Microsoft SharePoint Online source system with a Microsoft SharePoint Online catalog source.
Metadata Command Center extracts the following metadata from a Microsoft SharePoint Online source system:
•Site
•Subsite
•Document library
•Folder
•Hierarchical file
•Flat file
•XML file
•XSD file
•Attribute
•Element
•Hierarchical field
•Flat field
You can extract workbooks, worksheets, and columns from Microsoft Excel files.
The following table lists the partition structures supported for each file type that you can extract metadata from:
File type | Partition structure
AVRO | Single partition, multiple partitions, schema merge
CSV | Single partition, multiple partitions, schema merge
JSON | Single partition, multiple partitions, schema merge
Parquet | Single partition, multiple partitions, schema merge
XML | Single partition, multiple partitions, schema merge
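As an illustration only, the table above can be expressed as a small lookup, for example to check ahead of time whether the files in a document library fall within the supported combinations. This is a minimal sketch, not part of Metadata Command Center: the names PARTITION_STRUCTURES, SUPPORTED_STRUCTURES, and supports_structure are hypothetical, and the data simply mirrors the table.

```python
# Hypothetical helper that mirrors the table above; it is not a product API.
PARTITION_STRUCTURES = ("single partition", "multiple partitions", "schema merge")

# Every listed file type supports all three structures, per the table.
SUPPORTED_STRUCTURES = {
    file_type: set(PARTITION_STRUCTURES)
    for file_type in ("AVRO", "CSV", "JSON", "PARQUET", "XML")
}


def supports_structure(file_type: str, structure: str) -> bool:
    """Return True if the table lists the structure for the given file type."""
    structures = SUPPORTED_STRUCTURES.get(file_type.strip().upper(), set())
    return structure.strip().lower() in structures


if __name__ == "__main__":
    print(supports_structure("Parquet", "Schema merge"))
    print(supports_structure("ORC", "Single partition"))
```

The lookup normalizes case so that "Parquet" and "PARQUET" are treated the same. File types that are absent from the table, such as ORC in the example, return False.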
Excel file types
You can extract metadata from the following Microsoft Excel file types:
•Excel 97-2003 Workbook with XLS extension
•Excel Workbook with XLSX extension
•Excel Macro-Enabled Workbook with XLSM extension
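To check in advance which Excel files match the list above, you could filter file names by extension. The following is a minimal sketch under that assumption. The names SUPPORTED_EXCEL_TYPES and excel_type are hypothetical and are not part of the product, and the sketch assumes that matching on the file extension alone is sufficient.

```python
from pathlib import PurePosixPath
from typing import Optional

# Extensions and labels taken from the list above; hypothetical helper, not a product API.
SUPPORTED_EXCEL_TYPES = {
    ".xls": "Excel 97-2003 Workbook",
    ".xlsx": "Excel Workbook",
    ".xlsm": "Excel Macro-Enabled Workbook",
}


def excel_type(file_name: str) -> Optional[str]:
    """Return the documented Excel type for a file name, or None if it is not listed."""
    suffix = PurePosixPath(file_name).suffix.lower()
    return SUPPORTED_EXCEL_TYPES.get(suffix)


if __name__ == "__main__":
    for name in ("Shared Documents/Budget.XLSX", "legacy.xls", "data.xlsb"):
        print(name, "->", excel_type(name))
```

The comparison is case-insensitive, so Budget.XLSX is treated as an Excel Workbook. An extension that is not in the list, such as .xlsb, returns None.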
Compatible connectors
Before you configure a catalog source, you must connect to the Microsoft SharePoint Online source system.
Use the Microsoft SharePoint Online connector to connect to the Microsoft SharePoint Online source system.
For information about configuring a connection, see Connections.