You can use Metadata Command Center to extract metadata from a source system.
A source system is any system that contains data or metadata. For example, Oracle is a source system from which you can extract metadata through an Oracle catalog source. A catalog source is an object that represents and contains metadata from the source system.
Before you extract metadata from a source system, you first create and register a catalog source that represents the source system. Then you configure capabilities for the catalog source. A capability is a task that Metadata Command Center can perform, such as metadata extraction, data profiling, data classification, or glossary association.
When Metadata Command Center extracts metadata, Data Governance and Catalog displays the extracted metadata and its attributes as technical assets. You can then perform tasks such as analyzing the assets, viewing lineage, and creating links between those assets and their business context.
The following table describes the capabilities of the catalog source:
| Capability | Description |
|---|---|
| Serverless Runtime Environment | A serverless runtime environment is an advanced serverless deployment solution that doesn't require downloading, installing, configuring, or maintaining a Secure Agent or Secure Agent group. You can use a serverless runtime environment in the same way that you use a Secure Agent when you configure a catalog source. |
| Advanced Programming Language Parsing | Advanced Programming Language Parsing parses the source system code in addition to extracting objects from the source system. |
| Data Profiling and Quality | Data Profiling: Assesses source metadata and analyzes the collected statistics to discover content and structure, such as value distribution, patterns, and data types. Data Quality: Measures the reliability of the data and enables data usage. Data Observability: Identifies anomalies in the characteristics of the data. |
| Data Classification | Data classification is the process of identifying and organizing data into relevant categories based on the functional meaning of the data. Classifying data can help your organization manage risks, compliance, and data security. |
| Relationship Discovery | The relationship discovery capability identifies pairs of similar columns and relationships between tables within a catalog source. |
| Glossary Association | You can associate terms that are in the glossary with technical assets to provide user-friendly business names to technical assets. Glossary Association automatically associates glossary terms with technical assets or recommends glossary terms that you can manually associate with technical assets in Data Governance and Catalog. |
Extraction and view process
To extract metadata from a source system, configure the catalog source and run the catalog source job in Metadata Command Center. Then view the results in Data Governance and Catalog.
The following image shows the process to extract metadata from a source system:
After you verify prerequisites, perform the following tasks to extract metadata from Oracle:
1. Register a catalog source. Create a catalog source object, select the source system, and specify values for connection properties.
2. Configure the catalog source. Specify the runtime environment and configure parameters for metadata extraction. Optionally, add filters to include or exclude source system assets from metadata extraction. You can also configure other capabilities such as data profiling and quality.
3. Optionally, associate stakeholders. Associate users with technical assets, giving the users permission to perform actions determined by their roles.
4. Run or schedule the catalog source job.
5. If the catalog source job generates referenced asset objects, assign a connection to referenced source system assets, and rerun the job to get connection-aware scan results.
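The configuration step mentions filters that include or exclude source system assets. The following Python sketch illustrates the general include/exclude idea only. The shell-style wildcard patterns, the function name `is_extracted`, the sample asset names, and the rule that an exclusion wins over an inclusion are all assumptions made for illustration. They are not the filter syntax or precedence that Metadata Command Center uses.

```python
from fnmatch import fnmatchcase


def is_extracted(asset_path, include=(), exclude=()):
    """Return True if a hypothetical asset path passes the filters.

    Assumptions: an empty include list means "include everything",
    and an exclude match always wins over an include match.
    """
    if any(fnmatchcase(asset_path, pattern) for pattern in exclude):
        return False
    if not include:
        return True
    return any(fnmatchcase(asset_path, pattern) for pattern in include)


# Hypothetical schema-qualified asset names.
assets = ["SALES.ORDERS", "SALES.ORDERS_TMP", "HR.EMPLOYEES"]
selected = [
    asset
    for asset in assets
    if is_extracted(asset, include=["SALES.*"], exclude=["*_TMP"])
]
print(selected)
```

With these sample filters, only assets in the SALES schema that do not end in `_TMP` remain in scope.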
After you run the catalog source job, you view the results in Data Governance and Catalog.
About the Oracle catalog source
You can use the Oracle catalog source to extract metadata from an Oracle source system. You can also run profiles and configure other capabilities, such as data classification.
Configure an Oracle catalog source to extract metadata from an Oracle database hosted on a dedicated server, an AWS RDS Oracle instance, or Oracle Autonomous Database.
Oracle Autonomous Database is a SaaS implementation of Oracle Database on the Oracle Cloud Infrastructure.
Extracted metadata
Metadata Command Center extracts the following objects from an Oracle source system:
• Database
• Schema
• Table
• View
• Materialized View
  Note: Objects of the Materialized View type appear as View in Data Governance and Catalog.
• Column
  - Primary Key
  - Foreign Key
• Stored Procedure
• Function
• Package
• External Table
• Sequence
• Trigger
• Objects created using Oracle database link
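The note about materialized views affects how object counts line up between the source system and the catalog. The following Python sketch shows that effect on a made-up list of extracted object types. Only the Materialized View to View mapping comes from the note. The assumption that every other type keeps its name, and the helper names `displayed_type` and `count_by_displayed_type`, are illustrative.

```python
from collections import Counter

# Only this mapping is stated in the note above. Every other object type
# is assumed to be displayed under its own name.
DISPLAY_TYPE = {"Materialized View": "View"}


def displayed_type(extracted_type):
    """Return the type name under which an extracted object is displayed."""
    return DISPLAY_TYPE.get(extracted_type, extracted_type)


def count_by_displayed_type(extracted_types):
    """Count objects per displayed type."""
    return Counter(displayed_type(t) for t in extracted_types)


# Hypothetical extraction result: one view and two materialized views.
sample = ["Table", "View", "Materialized View", "Materialized View", "Sequence"]
counts = count_by_displayed_type(sample)
print(counts["View"])  # 3
```

When you reconcile counts against the Oracle data dictionary, compare the View count in the catalog with the sum of views and materialized views in the source.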
The Oracle catalog source also supports the following SQL functionality:
• Static SQL
• PL/SQL
• Procedure calls with parameter tracing
• Dynamic SQL with metatables
• COPY_DATA
• EXPORT_DATA
Data profiling for Oracle objects
Configure data profiling to run profiles on the metadata extracted from an Oracle source system. You can either use the Oracle connector to establish a connection to Oracle for profiling data, or use the JDBC V2 connector to enable Secure Agent-based data profiling for Oracle. You can view the profiling statistics in Data Governance and Catalog.
You can run data profiles on the following objects:
• Table
• View
• External table
The data profiling task runs profiles on the following data types:
• CHAR
• DATE
• NUMBER
• NUMBER (p,s)
• TIMESTAMP
• VARCHAR
• VARCHAR2
• NCHAR
• NVARCHAR2
• FLOAT (126)
• TIMESTAMP WITH TIME ZONE
• TIMESTAMP WITH LOCAL TIME ZONE
• INTERVAL DAY TO SECOND
• INTERVAL YEAR TO MONTH
• BINARY_FLOAT
• BINARY_DOUBLE
• FLOAT
• UROWID
• ROWID
Note: Data profiling supports a maximum precision of 38 for the NUMBER data type.
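To check a schema against the list above before you configure profiling, a small helper can flag columns whose types are not profiled. The following Python sketch is one way to do that, not part of the product. The function name `can_profile` and the parsing rules are assumptions, and the type names are written the way Oracle spells them in DDL, for example `BINARY_FLOAT` and `TIMESTAMP WITH TIME ZONE`.

```python
import re

# Data types that the data profiling task runs profiles on, per the list above.
PROFILED_TYPES = {
    "CHAR", "DATE", "NUMBER", "TIMESTAMP", "VARCHAR", "VARCHAR2",
    "NCHAR", "NVARCHAR2", "FLOAT",
    "TIMESTAMP WITH TIME ZONE", "TIMESTAMP WITH LOCAL TIME ZONE",
    "INTERVAL DAY TO SECOND", "INTERVAL YEAR TO MONTH",
    "BINARY_FLOAT", "BINARY_DOUBLE", "UROWID", "ROWID",
}
MAX_NUMBER_PRECISION = 38


def can_profile(declared_type):
    """Check a declared column type such as 'NUMBER(10,2)' against the list."""
    text = declared_type.upper()
    # Drop qualifiers such as (10,2), (6), or (50 BYTE) to get the base type.
    base = " ".join(re.sub(r"\([^)]*\)", " ", text).split())
    if base not in PROFILED_TYPES:
        return False
    if base == "NUMBER":
        # The first integer inside the parentheses is the precision.
        precision = re.search(r"\(\s*(\d+)", text)
        if precision and int(precision.group(1)) > MAX_NUMBER_PRECISION:
            return False
    return True


for column_type in ["NUMBER(10,2)", "TIMESTAMP(6) WITH TIME ZONE", "CLOB"]:
    print(column_type, can_profile(column_type))
```

The helper only compares type names. It does not connect to Oracle or to Metadata Command Center.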
You can perform data profiling on Oracle external tables, run data quality automations for Oracle external tables, and profile catalog sources with Oracle and Oracle external tables.
To create an external table, obtain data from one of the following sources:
• Oracle Loader
• Oracle Data Pump
• Large Object columns (LOBs)
• CSV delimited files
For more information, see the corresponding third-party vendor documentation.
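As an illustration of the CSV case, the following Python sketch builds an Oracle `CREATE TABLE ... ORGANIZATION EXTERNAL` statement that uses the `ORACLE_LOADER` access driver. The table name, column names, directory object `data_dir`, and file name `orders.csv` are hypothetical, and the function `external_table_ddl` is not part of any Informatica or Oracle tool. The access parameters are a minimal set, so check the Oracle documentation before you run the statement against a real database.

```python
def external_table_ddl(table, columns, directory, filename, delimiter=","):
    """Build external table DDL for a delimited file.

    columns is a list of (name, oracle_type) pairs. Identifiers are
    inserted as given, so pass only trusted values.
    """
    column_sql = ",\n".join(
        f"  {name} {oracle_type}" for name, oracle_type in columns
    )
    return (
        f"CREATE TABLE {table} (\n{column_sql}\n)\n"
        "ORGANIZATION EXTERNAL (\n"
        "  TYPE ORACLE_LOADER\n"
        f"  DEFAULT DIRECTORY {directory}\n"
        "  ACCESS PARAMETERS (\n"
        "    RECORDS DELIMITED BY NEWLINE\n"
        f"    FIELDS TERMINATED BY '{delimiter}'\n"
        "  )\n"
        f"  LOCATION ('{filename}')\n"
        ")\n"
        "REJECT LIMIT UNLIMITED"
    )


# Hypothetical table backed by a comma-delimited file.
ddl = external_table_ddl(
    "orders_ext",
    [("order_id", "NUMBER(10)"), ("customer", "VARCHAR2(50)")],
    directory="data_dir",
    filename="orders.csv",
)
print(ddl)
```

The directory name must refer to an Oracle directory object that already exists and that the table owner can read.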
Compatible connectors
Before you configure an Oracle catalog source, you must create a connection to the Oracle source system.
In Administrator, use the Oracle connector to create the connection to the Oracle source system.
For information about configuring a connection, see Connections in the Administrator help.