About the Amazon Redshift catalog source

You can use the Amazon Redshift catalog source to extract metadata from an Amazon Redshift source system.

Amazon Redshift is a cloud-based data warehouse service that you can use to store and analyze data.

Extracted metadata

Metadata Command Center extracts the following metadata from an Amazon Redshift source system:

•Database
•Schema
•External Schema
•Table
•External Table
•View
•Materialized View
•Function
•Procedure
•Column

Note:

Objects of the Materialized View type appear as View in Data Governance and Catalog.

Data profiling for Amazon Redshift objects

Configure data profiling to run profiles on the metadata extracted from an Amazon Redshift source system. You can view the profiling statistics in Data Governance and Catalog.

You can run data profiles on the following objects:

•Table
•View
•Spectrum External Table

You can run profiles on Spectrum external tables that are created using the following file formats:

•PARQUET
•AVRO
•TEXT
•RC
•SEQUENCE
•ORC

Note:

If a Spectrum external table includes columns with string data types, those columns are skipped and excluded from data profiling and data quality.
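
The file formats listed above can be encoded as a simple lookup for use as a pre-check before you configure profiling. The following is a minimal sketch only. The names `PROFILABLE_SPECTRUM_FORMATS` and `is_profilable_format` are hypothetical and are not part of Metadata Command Center or any Informatica API. The sketch assumes that you already know the file format of the external table as a plain string.

```python
# Hypothetical helper for illustration; not part of any Informatica API.
# The set mirrors the file formats listed in this section.
PROFILABLE_SPECTRUM_FORMATS = frozenset(
    {"PARQUET", "AVRO", "TEXT", "RC", "SEQUENCE", "ORC"}
)


def is_profilable_format(file_format: str) -> bool:
    """Return True if the format name matches one of the listed formats.

    The comparison ignores case and surrounding whitespace.
    """
    return file_format.strip().upper() in PROFILABLE_SPECTRUM_FORMATS
```

For example, `is_profilable_format("parquet")` returns `True`, while a format name that is not in the list returns `False`.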

The data profiling task runs profiles on the following data types:

•SMALLINT
•INTEGER
•BIGINT
•DECIMAL
•REAL
•DOUBLE PRECISION
•BOOLEAN
•CHAR
•VARCHAR
•DATE
•TIMESTAMP
•STRING
•SUPER
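
To estimate in advance which columns of a table the data profiling task would cover, you can compare the declared column types against the list above. The following is a minimal sketch under stated assumptions. The function names and the column mapping are hypothetical, the type names are matched literally after removing any length or precision suffix, and aliases such as INT4 or NUMERIC are not handled because this section does not state how they are treated.

```python
import re
from typing import Dict, List, Tuple

# Hypothetical helper for illustration; not part of any Informatica API.
# The set mirrors the data types listed in this section.
PROFILED_DATA_TYPES = frozenset({
    "SMALLINT", "INTEGER", "BIGINT", "DECIMAL", "REAL", "DOUBLE PRECISION",
    "BOOLEAN", "CHAR", "VARCHAR", "DATE", "TIMESTAMP", "STRING", "SUPER",
})


def normalize_type(declared_type: str) -> str:
    """Drop a trailing (length) or (precision, scale), then uppercase
    and collapse whitespace, e.g. 'varchar(256)' becomes 'VARCHAR'."""
    base = re.sub(r"\(.*\)", "", declared_type)
    return " ".join(base.upper().split())


def split_columns_by_profiling_support(
    columns: Dict[str, str],
) -> Tuple[List[str], List[str]]:
    """Split column names into (listed, not listed) by declared data type.

    `columns` maps column name to declared type; order is preserved.
    """
    listed: List[str] = []
    not_listed: List[str] = []
    for name, declared_type in columns.items():
        if normalize_type(declared_type) in PROFILED_DATA_TYPES:
            listed.append(name)
        else:
            not_listed.append(name)
    return listed, not_listed
```

A call such as `split_columns_by_profiling_support({"id": "integer", "shape": "geometry"})` places `id` in the first list and `shape` in the second. Treat the result as an approximation based on the literal list in this section, not as a statement of how the profiling task behaves.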

Compatible connectors

Before you configure an Amazon Redshift catalog source, you must connect to the Amazon Redshift source system.

Use the Amazon Redshift V2 connector to connect to the Amazon Redshift source system.

For information about configuring a connection, see Connections.