You can use the Oracle Cloud Object Storage catalog source to extract metadata from an Oracle Cloud Object Storage source system.
Metadata Command Center extracts the following metadata from an Oracle Cloud Object Storage source system:
•OCI Bucket
•Folder
•File
•Flat File
•Hierarchical File
•Flat Field
•Hierarchical Field
•XML File
•XSD File
•Attribute
•Element
•Workbook
•Worksheet
•Column
You can extract workbooks, worksheets, and columns from Microsoft Excel files.
File types
You can extract metadata from the following file types using an Oracle Cloud Object Storage catalog source:
•AVRO
•CSV
•DOC
•DOCX
•JSON
•Parquet
•PDF
•TSV
•TXT
•XML (XML and XSD files)
•ZIP
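As an illustration only, the list above can be expressed as a simple pre-check that tells you whether an object in a bucket is likely to be picked up. This is a minimal sketch, not part of Metadata Command Center: the function name `is_supported_object` is hypothetical, and matching on the file-name extension is an assumption, because the product may identify file types differently. Microsoft Excel extensions are not included because they are not part of this list.

```python
from pathlib import PurePosixPath

# Extensions taken from the file-type list above. XSD is included because the
# list groups XML and XSD files together. Matching by extension is an
# assumption made for this sketch, not documented product behavior.
SUPPORTED_EXTENSIONS = {
    "avro", "csv", "doc", "docx", "json", "parquet",
    "pdf", "tsv", "txt", "xml", "xsd", "zip",
}


def is_supported_object(object_name: str) -> bool:
    """Return True if the object name ends in one of the listed extensions."""
    # Object names use "/" as a separator, so PurePosixPath parses them on
    # any operating system. The suffix is "" when there is no extension.
    suffix = PurePosixPath(object_name).suffix
    return suffix.lstrip(".").lower() in SUPPORTED_EXTENSIONS
```

For example, `is_supported_object("sales/2024/orders.CSV")` is true because the comparison ignores case, while `is_supported_object("images/logo.png")` is false.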
The following table lists the structures associated with the file types that you can extract metadata from:
File type    Partition structure
AVRO         Single partition, multiple partitions, schema merge
CSV          Single partition, multiple partitions, schema merge
JSON         Single partition, multiple partitions, schema merge
Parquet      Single partition, multiple partitions, schema merge
XML          Single partition, multiple partitions, schema merge
You can extract metadata from XML and XSD files as XML file objects. Metadata Command Center extracts only elements and attributes from XML and XSD files. If an XML file is larger than 100 KB, Metadata Command Center extracts metadata from only the first 100 KB of the file. For XSD files, Metadata Command Center extracts the complete metadata.
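The size rule for XML and XSD files can be sketched as a small calculation. This is a hedged illustration rather than product code: the function name `bytes_scanned` is hypothetical, and treating 1 KB as 1024 bytes is an assumption, because the documentation does not state which definition applies.

```python
# Assumption: 1 KB = 1024 bytes. The documentation only says "100 KB".
XML_SCAN_LIMIT_BYTES = 100 * 1024


def bytes_scanned(file_name: str, size_bytes: int) -> int:
    """Estimate how many bytes of an XML or XSD file are read for metadata."""
    name = file_name.lower()
    if name.endswith(".xsd"):
        # XSD files are processed in full.
        return size_bytes
    if name.endswith(".xml"):
        # XML files are processed only up to the first 100 KB.
        return min(size_bytes, XML_SCAN_LIMIT_BYTES)
    raise ValueError(f"not an XML or XSD file: {file_name}")
```

Under these assumptions, a 250 KB XML file yields metadata from its first 102,400 bytes only, so elements and attributes that appear later in the file are not extracted. An XSD file of the same size is read completely.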
You can extract metadata from the following Microsoft Excel file types: