Dictionary Files for Data Masking
The Data Masking task uses a set of built-in dictionary files or the custom dictionary files that you create. When you configure a substitution masking operation, you select a dictionary that contains substitute values. The Data Masking task performs a lookup on the dictionary selected and replaces the source data with data from the dictionary.
When you install or upgrade the Secure Agent in a runtime environment, the Data Masking task downloads and saves the dictionary files to the following location:
<Secure Agent installation directory>\apps\Data_Integration_Server\data
You cannot edit or rename these files, but you can change the content within the specified file structure.
Note: The data in the dictionary files is test data.
The Data Masking task downloads the following dictionary files:
- •informatica_mask_address.dic
- •informatica_mask_cc_american_express.dic
- •informatica_mask_cc_diners_club.dic
- •informatica_mask_cc_american_express.dic
- •informatica_mask_cc_diners_club.dic
- •informatica_mask_cc_discover.dic
- •informatica_mask_cc_jcb.dic
- •informatica_mask_cc_master_card.dic
- •informatica_mask_cc_visa.dic
- •informatica_mask_countries.dic
- •informatica_mask_email.dic
- •informatica_mask_female_first_names.dic
- •informatica_mask_first_names.dic
- •informatica_mask_job_position.dic
- •informatica_mask_last_names.dic
- •informatica_mask_male_first_names.dic
- •informatica_mask_states.dic
- •informatica_mask_streets.dic
- •informatica_mask_uk_ni.dic
- •informatica_mask_us_telephone.dic
- •informatica_mask_us_towns.dic
- •informatica_mask_us_ssn.dic
- •informatica_mask_us_zipcode.dic
- •dict.csv
- •defaultValue.xml
If you want to use the custom dictionary, you must add a connection to the custom flat file dictionary. If there are multiple Secure Agents in a runtime environment, you must also copy the custom dictionary file to the following location:
<Secure Agent installation directory>\apps\Data_Integration_Server\data
You can use any of the flat file formats, such as .txt, .dic, and .csv. There is no limit on the maximum number of fields that can be present in a flat file dictionary. All the fields must have a column header, and the fields must be separated by a comma. The structure within the file must contain a sequential number and a value separated by a comma. A file can contain more than two columns. When you configure a substitution rule with custom dictionaries, you can select the dictionary column. To support non-English characters, you can use different code pages from a flat file connection when you configure a substitution rule with custom dictionaries.
The following text is a sample from a flat file dictionary that you can use to mask the credit card numbers:
SNO, CC_JCB
1, 3500-0003-7382-2377
2, 3500-0092-0490-3652
3, 3500-0077-6261-9918
4, 3500-0039-3695-5973
5, 3500-0089-8551-0603
6, 3500-0064-5387-7207
7, 3500-0030-0361-1582
8, 3500-0042-8477-2366