You can use the SFTP File System catalog source to extract metadata from the SFTP File System source.
SFTP (SSH File Transfer Protocol) is a secure file transfer protocol that transfers files over SSH, also known as the Secure Shell protocol.
Extracted metadata
You can extract specific metadata from an SFTP File System with an SFTP File System catalog source.
Metadata Command Center extracts metadata from the following objects:
•File Systems
•Flat Fields
•Flat Files
•Folders from an SFTP File System source
Supported file types
You can extract metadata from the following file types:
•Attribute
•AVRO
•CSV
•Element
•JSON
•Microsoft Excel files
- Excel 97-2003 Workbook with XLS extension
- Excel Workbook with XLSX extension
- Excel Macro-Enabled Workbook with XLSM extension
•Parquet files based on the partition structure
•TSV
•TXT
•XML File
•XSD File
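As a rough illustration of the list above, the following Python sketch maps a remote file path to one of the supported file types by its extension. The extension-to-type mapping and the function name classify_remote_path are assumptions made for this example. Only the Excel extensions are named in this topic, and Metadata Command Center might detect file types differently.

```python
from pathlib import PurePosixPath
from typing import Optional

# Assumed mapping for illustration only. The XLS, XLSX, and XLSM extensions
# come from the list above; the remaining extensions are conventional guesses.
SUPPORTED_EXTENSIONS = {
    ".avro": "AVRO",
    ".csv": "CSV",
    ".json": "JSON",
    ".xls": "Microsoft Excel (97-2003 Workbook)",
    ".xlsx": "Microsoft Excel (Workbook)",
    ".xlsm": "Microsoft Excel (Macro-Enabled Workbook)",
    ".parquet": "Parquet",
    ".tsv": "TSV",
    ".txt": "TXT",
    ".xml": "XML",
    ".xsd": "XSD",
}


def classify_remote_path(remote_path: str) -> Optional[str]:
    """Return the supported file type for an SFTP path, or None if unsupported."""
    # SFTP paths use forward slashes, so parse them as POSIX paths.
    suffix = PurePosixPath(remote_path).suffix.lower()
    return SUPPORTED_EXTENSIONS.get(suffix)
```

A check like this can help you predict which files in a folder are likely to produce metadata before you run an extraction job.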
The following table lists the structures associated with the file types that you can extract metadata from:
File Type    Partition structure
AVRO         Single partition, multiple partitions, schema merge
CSV          Single partition, multiple partitions, schema merge
JSON         Single partition, multiple partitions, schema merge
Parquet      Single partition, multiple partitions, schema merge
XML          Single partition, multiple partitions, schema merge
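This topic does not describe how schema merge is performed. As a general illustration of the idea, the following Python sketch combines the field lists of several partitions into one schema by taking the union of field names in first-seen order. The function name merge_schemas, the partition names, and the field names are hypothetical, and the behavior of Metadata Command Center might differ.

```python
from typing import Dict, List


def merge_schemas(partition_schemas: Dict[str, List[str]]) -> List[str]:
    """Union field names across partitions, keeping first-seen order."""
    merged: List[str] = []
    seen = set()
    for fields in partition_schemas.values():
        for field in fields:
            if field not in seen:
                seen.add(field)
                merged.append(field)
    return merged


# Hypothetical partitions: the second one adds a "currency" field.
partitions = {
    "year=2023": ["id", "amount"],
    "year=2024": ["id", "amount", "currency"],
}
print(merge_schemas(partitions))  # prints ['id', 'amount', 'currency']
```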
You can extract workbooks, worksheets, and columns from Microsoft Excel files.
You can extract metadata from XML and XSD files as XML file objects. Metadata Command Center extracts only elements and attributes from XML and XSD files. If an XML file is larger than 100 KB, Metadata Command Center extracts metadata from only the first 100 KB of the file. For XSD files, Metadata Command Center extracts the complete metadata.
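To see what the 100 KB limit on XML files can mean in practice, the following Python sketch collects element and attribute names from only the first part of an XML document. It is an approximation built on the standard library's incremental parser, not the product's implementation. The function name scan_xml_prefix and the assumption that 1 KB equals 1024 bytes are choices made for this example.

```python
import xml.etree.ElementTree as ET
from typing import Set, Tuple

MAX_XML_BYTES = 100 * 1024  # assumption: 1 KB = 1024 bytes


def scan_xml_prefix(data: bytes, limit: int = MAX_XML_BYTES) -> Tuple[Set[str], Set[str]]:
    """Collect element and attribute names found in the first `limit` bytes."""
    elements: Set[str] = set()
    attributes: Set[str] = set()
    # The pull parser accepts an incomplete document, so a truncated
    # prefix still yields the start tags that were fully read.
    parser = ET.XMLPullParser(events=("start",))
    try:
        parser.feed(data[:limit])
        for _event, elem in parser.read_events():
            elements.add(elem.tag)
            attributes.update(elem.attrib.keys())
    except ET.ParseError:
        # Malformed input: keep whatever was collected before the error.
        pass
    return elements, attributes
```

Elements and attributes that first appear after the cutoff are not reported, so an XML file larger than the limit can produce an incomplete list of elements and attributes.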
Compatible connectors
Before you configure a catalog source, you must connect to the SFTP File System source system.
Use the Advanced SFTP V2 connector to connect to the SFTP File System source system.
For information about configuring a connection, see Connections.