Data Governance User Guide > Automate Data Quality Rules > Prerequisites
  

Prerequisites

To run local data quality rules automatically, you must be familiar with resources and scanners in Enterprise Data Catalog, and rules and mappings in Informatica Data Quality.

Enterprise Data Catalog

To automate data quality rules, verify the following prerequisites in Enterprise Data Catalog:
  1. 1. You have installed Enterprise Data Catalog. See the Product Availability Matrix for the supported versions.
  2. 2. To automate the onboarding of data sets and attributes, you must first configure data onboarding rules in the Admin Panel. For more information on creating data onboarding rules, refer to the Configure Data Onboarding Rules topic in the Axon Data Governance 7.3.1 Administrator Guide.
  3. 3. Apply onboarding rules and enable automated onboarding from Axon systems. Based on the configured onboarding rules, Axon includes or excludes onboarding of the data sets and attributes from Enterprise Data Catalog. For more information, see Data Onboarding Rules for a System.
  4. 4. Enterprise Data Catalog can scan the following source databases:
  5. 5. The file sources for which you want to run rules are CSV flat files in native file systems.
If you run rules on Amazon S3 flat file resources that contains data of any data type other than the String data type, the onboarded data from Enterprise Data Catalog to Axon is of String data type.
You cannot run rules on Amazon S3 parquet file resources that contains data of Map data type.
If you run rules on a parquet file that uses the TO_DECIMAL(value) function with high precision enabled, the mapping will not work. For the mapping to be successful, you must use the TO_DECIMAL38(value ,[scale]) function. The value is either string or a numeric and scale is and integer literal between 0 and 28.
Note: If you disable high precision while generating the mappings, the accuracy of the mapping output is affected.
You cannot run rules on Amazon Redshift resources that contain data of TIMESTAMPTZ or TIMESTAMP WITH TIME ZONE data type. Instead, you can use Amazon Redshift resources that contain data Date/Time data type.

Informatica Data Quality

To automate data quality rules, verify the following prerequisites in Informatica Data Quality:
  1. 1. You have installed Informatica Data Quality. See the Product Availability Matrix for the supported versions.
  2. 2. The following services are running on Informatica Data Quality:
  3. 3. Ensure that you configure Data Integration Service with Workflow DB to run the mappings.
  4. 4. You can connect Axon Agent 7.3.1 to Informatica Data Quality 10.5.0 and 10.5.1. For more information, refer to the Configure Access to Informatica Data Quality topic in the Axon Data Governance 7.3.1 Administrator Guide.
  5. Note: You can use parquet files on Amazon S3 as the source database only if you connect Axon Agent to Informatica Data Quality 10.4.1, 10.5.0 and 10.5.1.
  6. 5. Axon connects to the Axon Agent using the HTTP protocol.
  7. 6. The database where Informatica Data Quality stores the data quality scores is a relational database. The Axon Agent connects to this database to retrieve and display data quality scores in Axon.
  8. 7. To run data quality rules on fields, Informatica Data Quality must connect to the sources scanned by Enterprise Data Catalog. Make sure that all critical connection parameters are configured in Informatica Data Quality, such as the password and Support Mixed-Case Identifier option for the resource.
  9. 8. You have enabled rule automation in the Axon Admin Panel. To automate data quality rules, see the Automate Data Quality Rules topic in the Axon Data Governance 7.3.1 Administrator Guide.