Before you perform data exploration tasks in CLAIRE GPT for Data Governance and Catalog, complete the following tasks:
• Enable the Data Governance and Catalog service for the runtime environment that your organization uses.
• Verify that you extracted metadata from the required source systems into the catalog.
• Configure the following connections on the Connections page of Administrator:
  - Amazon Redshift
  - Databricks
  Note: Before you perform data exploration tasks in CLAIRE GPT, verify that the Databricks server is up and running to avoid delays in responses from CLAIRE GPT.
• Enable the connections you configured earlier on the Secure Agent group from the Enable/Disable Components in runtime environment page of Administrator.
• To get precise data exploration responses, enrich the data catalog and associate assets with accurate business terms and descriptions.
• For accurate and reliable data exploration responses, ensure that the primary and foreign key constraints of tables are properly defined at the schema level of the source system. For Google BigQuery and Microsoft Azure Synapse sources, create and run custom catalog sources in Metadata Command Center to extract and ingest the primary and foreign key constraints and their relationships into Data Governance and Catalog.
  For information about creating a custom catalog source in Metadata Command Center, see Custom metadata integration.
• To get data exploration responses for tables that do not have primary key constraints defined at the schema level of the source system, you must enable data profiling and run the catalog source.
For information about how to set up CLAIRE GPT for users in your organization, see Administer CLAIRE GPT.
Configuring IDMC Data Gateway Service properties
Secure Agent services such as IDMC Data Gateway Service are pluggable microservices that the Secure Agent uses for data processing. To configure Secure Agent service properties, open the Runtime Environments page in Administrator and edit the Secure Agent.
To change or optimize the behavior of the IDMC Data Gateway Service, you can configure the system and custom properties when you edit a Secure Agent.
You can configure the following system properties of the IDMC Data Gateway Service in the System Configuration Details section:
| Type | Name | Description | Value Range | Default Value |
|---|---|---|---|---|
| APP_CFG | idmc_dgs_connectionPoolMaxSize | The maximum number of data source connections cached for a Secure Agent. | Minimum is 10. Maximum is 1000. | 100 |
| APP_CFG | idmc_dgs_connectionPoolCacheExpiryDuration | The maximum duration for which the connection with a data source is kept open when it is unused. Note: The value of the parameter must be in minutes. | Minimum is 10. Maximum is 120. | 30 minutes |
| APP_CFG | idmc_dgs_queryExecutionTimeout | The maximum duration for which a database query is run on a data source. Note: The value of the parameter must be in seconds. | Minimum is 1. There is no maximum value. | 60 seconds |
Note: APP_CFG refers to application configuration properties.
For more information about the system properties you can configure for the IDMC Data Gateway Service, see Secure Agent services.
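Before you enter new values in Administrator, it can help to check them against the documented ranges. The following Python sketch is illustrative only and is not part of any Informatica tool or API: the `PropertyRange` class, the `DOCUMENTED_RANGES` dictionary, and the `validate_property` function are hypothetical names. Only the property names, limits, defaults, and units come from the system properties listed above.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class PropertyRange:
    """Documented limits for one IDMC Data Gateway Service system property."""
    minimum: int
    maximum: Optional[int]  # None means that no maximum is documented
    default: int
    unit: str


# Limits copied from the documented system properties.
# The dictionary itself is a hypothetical helper, not an Informatica artifact.
DOCUMENTED_RANGES = {
    "idmc_dgs_connectionPoolMaxSize": PropertyRange(10, 1000, 100, "connections"),
    "idmc_dgs_connectionPoolCacheExpiryDuration": PropertyRange(10, 120, 30, "minutes"),
    "idmc_dgs_queryExecutionTimeout": PropertyRange(1, None, 60, "seconds"),
}


def validate_property(name: str, value: int) -> List[str]:
    """Return a list of problems; an empty list means the value is in range."""
    limits = DOCUMENTED_RANGES.get(name)
    if limits is None:
        return [f"{name}: not one of the documented system properties"]

    problems = []
    if value < limits.minimum:
        problems.append(
            f"{name}: {value} is below the minimum of {limits.minimum} {limits.unit}"
        )
    if limits.maximum is not None and value > limits.maximum:
        problems.append(
            f"{name}: {value} is above the maximum of {limits.maximum} {limits.unit}"
        )
    return problems


if __name__ == "__main__":
    # Hypothetical values that an administrator plans to enter.
    planned = {
        "idmc_dgs_connectionPoolMaxSize": 250,
        "idmc_dgs_connectionPoolCacheExpiryDuration": 180,  # minutes, out of range
        "idmc_dgs_queryExecutionTimeout": 300,  # seconds
    }
    for prop, val in planned.items():
        for problem in validate_property(prop, val):
            print(problem)
```

The units differ between properties: the cache expiry duration is in minutes and the query execution timeout is in seconds, so the sketch stores the unit next to each range to keep error messages unambiguous.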