Reference Data Overview

A reference data object contains a set of data values that you can search while performing data quality operations on source data. You can create reference data objects in the Developer tool and the Analyst tool. You can also import reference data objects to the Model repository. The Data Quality Content installer includes reference data objects that you can import.
You can create and edit the following types of reference data:
Reference tables
A reference table contains standard and alternative versions of a set of data values. You add a reference table to a transformation in the Developer tool to verify that source data values are accurate and correctly formatted.
A reference table contains at least two columns. One column contains the standard or preferred version of a string, and the other columns contain alternative versions. When you add a reference table to a transformation, the transformation searches the input port data for values that also appear in the table. You can create tables with any data that is useful to the data project you work on.
Content sets
Content sets are repository and file objects that contain reference data values. When you add a content set to a transformation, the transformation searches the input port data for values that appear in the content set or for strings that match the data patterns defined in the content set.
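The two kinds of search described above can be illustrated with a minimal Python sketch. This is not the Informatica API. The table contents, the pattern, and the names `REFERENCE_TABLE`, `SSN_PATTERN`, `build_lookup`, `standardize`, and `matches_pattern` are all hypothetical, chosen only to show how a table of standard and alternative values and a pattern-based check might behave.

```python
import re

# Hypothetical reference table: the first column holds the standard value,
# the remaining columns hold alternative versions of that value.
REFERENCE_TABLE = [
    ("Street", "St", "Str"),
    ("Avenue", "Ave", "Av"),
    ("Boulevard", "Blvd"),
]

# Hypothetical data pattern of the kind a content set might define:
# three digits, two digits, four digits, separated by hyphens.
SSN_PATTERN = re.compile(r"\d{3}-\d{2}-\d{4}")


def build_lookup(table):
    """Map each standard and alternative value (lowercased) to its standard value."""
    lookup = {}
    for standard, *alternatives in table:
        for value in (standard, *alternatives):
            lookup[value.lower()] = standard
    return lookup


def standardize(value, lookup):
    """Return the standard version of value, or None if the table has no match."""
    return lookup.get(value.strip().lower())


def matches_pattern(value, pattern=SSN_PATTERN):
    """Return True if the whole value matches the data pattern."""
    return pattern.fullmatch(value.strip()) is not None
```

In this sketch, `standardize("Ave", build_lookup(REFERENCE_TABLE))` returns `"Avenue"`, and a value that does not appear in any column returns `None`. The case-insensitive comparison is a design choice of the sketch, not a statement about how the transformations compare values.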
The Data Quality Content installer includes the following types of reference data:
Informatica reference tables
Database tables created by Informatica. You import Informatica reference tables when you import accelerator objects from the Content Installer. The reference tables contain standard and alternative versions of common business terms from several countries. The types of reference information include telephone area codes, postcode formats, first names, Social Security number formats, occupations, and acronyms. You can edit Informatica reference tables.
Informatica content sets
Content sets created by Informatica. You import content sets when you import accelerator objects from the Content Installer. A content set contains different types of reference data that you can use to perform search operations in data quality transformations.
Address reference data files
Reference data files that identify all valid addresses in a country. The Address Validator transformation reads this data. You cannot view, create, or edit address reference data files.
The Content Installer installs files for the countries that you have purchased. Address reference data is current for a defined period, and you must refresh your data regularly, for example every quarter.
Identity population files
Files that contain information on types of personal, household, and corporate identities. The Match transformation and the Comparison transformation use this data to parse potential identities from input fields. You cannot create or edit identity population files.
The Content Installer writes population files to the file system.