You can use the Amazon Athena catalog source to extract metadata from an Amazon Athena source system.
Amazon Athena is an interactive query service to query and analyze big data in Amazon S3 using standard SQL.
Extracted metadata
You can extract specific metadata from an Amazon Athena source system with an Amazon Athena catalog source.
Metadata Command Center extracts the following objects from an Amazon Athena source system:
•Database
•Schema
•View
•ViewColumn
•External Table
•External Column
•Calculation
•Resource
•Nested Field
You can extract objects with the following complex data types and their nested fields from an Amazon Athena source system:
•Array
•Map
•Struct
Data Governance and Catalog displays objects with complex data types as external columns. The nested fields within arrays appear as elements, while those within maps appear as keys and values.
Data profiling for Amazon Athena objects
Configure data profiling to run profiles on the metadata extracted from an Amazon Athena source system. You can view the profiling statistics in Data Governance and Catalog.
You can run data profiles on the following objects:
•External tables created in the following file formats:
- AVRO
- CSV
- Delta
- JSON
- Parquet
•External columns
The data profiling task runs profiles on the following data types:
•Bigint
•Boolean
•Char
•Date
•Decimal
•Double
•Float
•Int
•Smallint
•String
•Timestamp
•Tinyint
•Varchar
Compatible connectors
Before you configure an Amazon Athena catalog source, you must connect to the Amazon Athena source system.
Use the Amazon Athena connector to connect to the Amazon Athena source system.
For information about configuring a connection, see Connections.