You can use the Amazon Redshift catalog source to extract metadata from an Amazon Redshift source system.
Amazon Redshift is a cloud-based data warehousing service that you can use to analyze and store data.
Extracted metadata
Metadata Command Center extracts the following metadata from an Amazon Redshift source system:
•Database
•Schema
•External Schema
•Table
•External Table
•View
•Materialized View
Note:
Objects of the Materialized View type appear as View in Data Governance and Catalog.
•Function
•Procedure
•Column
Data profiling for Amazon Redshift objects
Configure data profiling to run profiles on the metadata extracted from an Amazon Redshift source system. You can view the profiling statistics in Data Governance and Catalog.
You can run data profiles on the following objects:
•Table
•View
•Spectrum External Table
You can run profiles on Spectrum external tables that are created using the following file formats:
•PARQUET
•AVRO
•TEXT
•RC
•SEQUENCE
•ORC
Note:
If a Spectrum external table includes columns with string data types, those columns are skipped for data profiling and data quality.
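As a rough illustration of the rules above, the following Python sketch checks whether a Spectrum external table uses a supported file format and separates the columns that would be profiled from those that would be skipped. The function name, the shape of the `columns` argument, and the set of type names treated as string data types are assumptions made for this example; they are not part of Metadata Command Center.

```python
# File formats listed above as supported for profiling Spectrum external tables.
SUPPORTED_FORMATS = frozenset({"PARQUET", "AVRO", "TEXT", "RC", "SEQUENCE", "ORC"})

# Assumption: this document does not spell out which type names count as
# "string data types", so this set is illustrative only.
ASSUMED_STRING_TYPES = frozenset({"CHAR", "VARCHAR", "STRING"})


def plan_spectrum_profile(file_format, columns, string_types=ASSUMED_STRING_TYPES):
    """Return (profiled, skipped) column names for a Spectrum external table.

    columns maps column name -> data type name. Raises ValueError when the
    file format is not in the supported list.
    """
    if file_format.upper() not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported file format: {file_format}")
    profiled, skipped = [], []
    for name, data_type in columns.items():
        # Drop any length or precision suffix, e.g. "VARCHAR(256)" -> "VARCHAR".
        base_type = data_type.split("(")[0].strip().upper()
        if base_type in string_types:
            skipped.append(name)
        else:
            profiled.append(name)
    return profiled, skipped
```

Passing a different `string_types` set lets you adjust the assumption without changing the function.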
The data profiling task runs profiles on the following data types:
•SMALLINT
•INTEGER
•BIGINT
•DECIMAL
•REAL
•DOUBLE PRECISION
•BOOLEAN
•CHAR
•VARCHAR
•DATE
•TIMESTAMP
•STRING
•SUPER
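The list above can be treated as a simple allow-list. The following Python sketch, an illustration rather than the product's own logic, reports which columns of a table have a data type that the profiling task covers. The function names and the input shape are hypothetical, and the only normalization applied is upper-casing, collapsing whitespace, and dropping a length or precision suffix.

```python
import re

# Data types listed above as covered by the data profiling task.
PROFILED_TYPES = frozenset({
    "SMALLINT", "INTEGER", "BIGINT", "DECIMAL", "REAL", "DOUBLE PRECISION",
    "BOOLEAN", "CHAR", "VARCHAR", "DATE", "TIMESTAMP", "STRING", "SUPER",
})


def normalize_type(data_type):
    """Upper-case a type name and drop any "(...)" length or precision suffix."""
    base = data_type.split("(")[0]
    return re.sub(r"\s+", " ", base).strip().upper()


def profiled_columns(columns):
    """Return the names of columns whose data type is in PROFILED_TYPES.

    columns maps column name -> data type name (hypothetical input shape).
    Type aliases are not resolved; only the names in the list are matched.
    """
    return [
        name for name, data_type in columns.items()
        if normalize_type(data_type) in PROFILED_TYPES
    ]
```

A column declared as `decimal(18,2)` is matched as DECIMAL, whereas a type that is not in the list, such as GEOMETRY, is left out of the result.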
Compatible connectors
Before you configure an Amazon Redshift catalog source, you must connect to the Amazon Redshift source system.
Use the Amazon Redshift V2 connector to connect to the Amazon Redshift source system.
For information about configuring a connection, see Connections.