Introduction to Talend Data Integration catalog sources
You can use Metadata Command Center to extract metadata from a source system.
A source system is any system that contains data or metadata. For example, Talend Data Integration is a source system from which you can extract metadata through a Talend Data Integration catalog source with Metadata Command Center. A catalog source is an object that represents and contains metadata from the source system.
Before you extract metadata from a source system, you first create and register a catalog source that represents the source system.
When Metadata Command Center extracts metadata, Data Governance and Catalog displays the extracted metadata and its attributes as technical assets. You can then perform tasks such as analyzing the assets, viewing lineage, and creating links between those assets and their business context.
You can only extract metadata using this catalog source.
Extraction and view process
To extract metadata from a source system, configure the catalog source and run the extraction job in Metadata Command Center. Then view the results in Data Governance and Catalog.
The following image shows the process to extract metadata from a Talend Data Integration source system:
After you verify prerequisites, perform the following tasks to extract metadata from Talend Data Integration:
1Register a catalog source. Create a catalog source object, select the source system, and specify the path to the directory that contains exported Talend projects for metadata extraction.
2Configure the catalog source. Specify the runtime environment, optionally configure parameters for the metadata extraction capability, and add filters for metadata extraction.
3Associate stakeholders. Optionally, associate users with technical assets, giving the users permission to perform actions determined by their roles.
4Run or schedule the catalog source job.
5Optionally, assign a connection to referenced source system assets.
After you run the catalog source job, you view the results in Data Governance and Catalog.
Note: You can view the lineage with object references without performing connection assignment. After connection assignment, you can view lineage with the objects.
About the Talend Data Integration catalog source
You can use the Talend Data Integration catalog source to extract metadata from the Talend Data Integration source system.
Talend Data Integration is an ETL data integration platform that you can use to gain business insights from data.
Extracted metadata
You can use the Talend Data Integration catalog source to extract metadata from a Talend Data Integration source system.
You can extract metadata from Talend projects that contain the following components:
•Cloud. For example, Amazon S3, Snowflake, HDFS.
•Database. For example, relational databases.
•File. For example, XML, Flat, Excel, CSV.
•Processing. For example, map, join, filter.
Metadata Command Center extracts the following metadata from a Talend Data Integration source system: