Amazon Redshift is a cloud-based data warehousing service that you can use to analyze and store data.
Objects extracted
The metadata extraction service extracts the following objects from an Amazon Redshift source system:
•Database
•Schema
•External Schema
•Table
•External Table
•View
•Materialized View
Note: Objects of the Materialized View type appear as View in Data Governance and Catalog.
•Function
•Procedure
•Column
Data profiling for Amazon Redshift objects
Configure data profiling to run profiles on the metadata extracted from an Amazon Redshift source system. You can run data profiles on the following Amazon Redshift objects:
•Table
•View
•Spectrum External Table
You can view the profiling statistics in Data Governance and Catalog. The data profiling task runs profiles on the following data types for Amazon Redshift objects:
•SMALLINT
•INTEGER
•BIGINT
•DECIMAL
•REAL
•DOUBLE PRECISION
•BOOLEAN
•CHAR
•VARCHAR
•DATE
•TIMESTAMP
You can run profiles on Spectrum external tables that are created using the following file formats:
•PARQUET
•AVRO
•TEXT
•RC
•SEQUENCE
•ORC
Note: If the Spectrum external tables include string data type or complex data types, the entire external table is skipped and not considered for data profiling and data quality.
Sampling type
Determine the sample rows on which you want to run the data profiling task. You can choose one of the following sampling types for an Amazon Redshift catalog source:
- All Rows
- Limit N Rows
- Random N Rows
Data Lineage
The following lineage data is available for Amazon Redshift assets:
•From table to view
•Stored Procedures linked to tables or views
•Functions linked to tables or views
•Procedure calling a procedure
•Procedure calling a function
For more information about data lineage, see Data Lineage in the Working With Assets help.