The duplicate analysis process uses reference data files called population files to evaluate the identity information in the input data. When you run a mapping with a Deduplicate transformation, the transformation compares the input data to the population file for the country in which the input data originates.
Data Quality downloads the population files to the Secure Agent host machine when you install the Secure Agent. You do not need to take any action to download the files.
You can review the location of the population files in the Administrator service.
Reviewing the Administrator properties for identity population data
You can review and configure the directory to which the Secure Agent downloads identity population files in the Administrator service. You can also review and configure the directories that the deduplication process uses for index and cache data.
1. From the My Services page, select the Administrator service.
2. Choose the Runtime Environments option.
3. Select the Secure Agent that you will use to run mappings with the Deduplicate transformation.
4. Hover over the Actions icon for the Secure Agent, and select the Edit Secure Agent option.
The following image shows the option:
The Secure Agent page appears.
5. Under System Configuration Details, select Data Integration Server in the Service field and select IDQ in the Type field.
The System Configuration Details pane returns a list of properties based on the type that you specified.
6. Review the following properties:
- IdentityReferenceDataLocation. Identifies the location of the population files.
- IdentityReferenceIndexLocation. Identifies the location of the index files that the deduplication process creates at run time.
- IdentityCacheLocation. Identifies the location of the temporary files that the deduplication process creates at run time.
By default, the properties identify directories in the Informatica installation.
7. Optionally, update the property values to suit your system and the mappings that you will run.
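If you change any of the three property values, it can help to confirm that the new directories exist and are accessible on the Secure Agent host machine before you run a mapping. The following Python sketch is one possible way to do that check. It is not part of Data Quality: the function name `check_identity_directories` and the example paths are hypothetical, and the sketch assumes that the account that runs the Secure Agent needs both read and write access to each directory.

```python
import os
from pathlib import Path


def check_identity_directories(locations):
    """Map each property name to a list of problems found for its directory.

    `locations` is a dict of property name -> directory path, copied by hand
    from the System Configuration Details pane. An empty list means that no
    problem was found for that property.
    """
    problems = {}
    for name, raw_path in locations.items():
        path = Path(raw_path)
        issues = []
        if not path.is_dir():
            issues.append("directory does not exist")
        else:
            # Assumption: the agent account needs read and write access.
            if not os.access(path, os.R_OK):
                issues.append("directory is not readable")
            if not os.access(path, os.W_OK):
                issues.append("directory is not writable")
        problems[name] = issues
    return problems


if __name__ == "__main__":
    # Placeholder paths: replace them with the values shown for your agent.
    configured = {
        "IdentityReferenceDataLocation": "/path/to/identity/default",
        "IdentityReferenceIndexLocation": "/path/to/identity/index",
        "IdentityCacheLocation": "/path/to/identity/cache",
    }
    for prop, issues in check_identity_directories(configured).items():
        status = "OK" if not issues else "; ".join(issues)
        print(f"{prop}: {status}")
```

Run the script on the Secure Agent host machine under the same account that runs the agent, so that the access checks reflect the permissions that the deduplication process will have.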