Data lineage is a visual representation of the flow of data across the systems in your organization. Lineage depicts how the data flows from the system of its origin to the system of its destination.
Data lineage views are available for technical assets in the catalog source. You can view lineage at the catalog source, data set, or data element level.
The lineage at the catalog source level shows how data flows from one catalog source to another. The lineage at the data set and the data element levels show how other technical assets such as files or tables contribute to the selected asset.
If linking catalog sources is available for your catalog source, you can use Metadata Command Center to generate data lineage based on rules or by generating automated lineage with CLAIRE. You can choose source and target catalog sources and objects to link and generate lineage.
To determine whether linking catalog sources is available for your catalog source, navigate to the Configuration tab of the Link Catalog Sources page. The catalog source must appear in the list of source and target catalog sources.
The catalog source level shows how data flows from one catalog source to another with the lineage aggregating data from the data set and data element levels.
To view data lineage at the catalog source level, open a technical asset, click the Lineage tab, and then verify that the level is set to Catalog Source Level.
View lineage at the data set level
The data set level displays individual sets of data in the data flow.
To view data lineage at the catalog source level, open a technical asset, click the Lineage tab, and then verify that the level is set to Catalog Source Level.
The following image shows table-level lineage where the tDBInput_1 subjob uses data from the CUSTOMER reference source table to generate output to the customer_tgt_h2l reference target table before connection assignment:
The following image shows table-level lineage where the tDBInput_1 subjob uses data from the CUSTOMER source table to generate output to the customer_tgt_h2l target table after connection assignment to two Oracle source systems:
View lineage at the data element level
The data element level displays details of the data set level. At the data element level, you can see the input sources for expressions or commands and calculations or transformations on the data.
To view data lineage at the data element level, open a technical asset, click the Lineage tab, and then verify that the level is set to Data Element Level.
The following image shows column-level lineage where the CUST_ID calculation of the tDBInput_1 subjob uses data from the CUST_ID reference column of the CUSTOMER reference source table to generate output to the CUST_ID reference column of the customer_tgt_h2l reference target table before connection assignment:
The following image shows column-level lineage where the CUST_ID calculation of the tDBInput_1 subjob uses data from the CUST_ID column of the CUSTOMER source table to generate output to the CUST_ID column of the customer_tgt_h2l target table after connection assignment to two Oracle source systems:
View referenced source systems
Drill down into the referenced source systems that are referenced by catalog sources.
1Browse to the catalog source that you want to view.
2Click the Hierarchy tab to view the catalog source as a hierarchy.
3Find an asset with the type Reference Catalog Source.
The following image shows an expanded view of a referenced catalog source:
4Click to expand the referenced source to see its components.