Deduplication objectives

The Objective option on the Deduplication tab defines the type of identity that the Deduplicate transformation analyzes when you run a mapping that contains the transformation. The objective also indicates the types of information that the transformation expects to read for the identity. When you configure the transformation, you map the input fields that contain the identity information to the most appropriate fields on the deduplicate asset.
Each objective supports one or more index key fields. You select an index key on the Deduplication tab in the asset. Some objectives, such as Family and Household, define similar identity types. Even so, the deduplication process applies comparison logic that is specific to each objective.
The set of potential input fields on an objective contains at least one mandatory field and may also identify one or more required fields. To get the most accurate results from the deduplication operations, map each mandatory field on the asset to an appropriate input field on the transformation. Likewise, map at least one required field on the asset to an appropriate input field on the transformation. The mandatory and required input field names indicate the types of information that the input fields must contain.
Each objective also supports a set of input fields that can contain additional data about the identity. Optionally, map any additional asset field to a transformation field that contains the appropriate information. To optimize the analysis of the identity data, ensure that the Deduplicate transformation reads as many of the fields as possible.
For more information about mandatory and required fields on each objective, see Field types and identity objectives.
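
The mapping rule described above (map every mandatory field, and map at least one required field when the objective defines any) can be sketched as a small check. This is only an illustration and not part of the product: `ObjectiveFields`, `check_mapping`, and the field names `Field_A`, `Field_B`, and `Field_C` are hypothetical placeholders, not the actual field definitions of any objective.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ObjectiveFields:
    """Hypothetical field groups for one objective (placeholder names only)."""
    mandatory: frozenset
    required: frozenset


def check_mapping(fields, mapped):
    """Return a list of problems found in a proposed set of mapped fields."""
    problems = []
    # Every mandatory field must be mapped.
    missing = sorted(fields.mandatory - set(mapped))
    if missing:
        problems.append("unmapped mandatory fields: " + ", ".join(missing))
    # If the objective defines required fields, at least one must be mapped.
    if fields.required and not (fields.required & set(mapped)):
        problems.append(
            "map at least one required field: " + ", ".join(sorted(fields.required))
        )
    return problems


# Placeholder field names, assumed for illustration only.
example = ObjectiveFields(
    mandatory=frozenset({"Field_A"}),
    required=frozenset({"Field_B", "Field_C"}),
)

print(check_mapping(example, {"Field_A", "Field_B"}))  # empty list: mapping is acceptable
print(check_mapping(example, {"Field_B"}))             # reports the unmapped mandatory field
```

An empty result means that the mapping satisfies both rules. Additional optional fields are not checked here, because mapping them improves the analysis but is not a requirement.
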
The following table describes the objectives that you can select and identifies the fields that you can select as the index key field:
| Identity objective | Description | Index keys |
|---|---|---|
| Address | Identifies records that share an address. | Address Part 1, Code, Date, Exact, Geocode, Telephone Number |
| Address Code | Identifies records that share an address code. | Address Part 1 |
| Author ISBN | Identifies records that share information about an author who published work with an ISBN. | Email, Exact, Person Name, ISBN10, ISBN13 |
| CC Issuer | Identifies records that share information about a credit card issuer. | Company Name, Credit Card, Exact, Organization Name |
| CC Owner | Identifies records that share information about a credit card holder. | Address Part 1, Credit Card, Date, Email, Exact, Geocode, Person Name, Telephone Number |
| Contact | Identifies records that share a contact at a single organization and location. | Address Part 1, Code, Company Name, Date, Email, Exact, Geocode, Person Name, Organization Name, Telephone Number |
| Corp Entity | Identifies records that share corporate identification data. | Address Part 1, Code, Company Name, Date, Exact, Geocode, Organization Name, Telephone Number |
| Division | Identifies records that share an office location within an organization. | Address Part 1, Code, Company Name, Exact, Geocode, Organization Name, Telephone Number |
| Family | Identifies individuals that belong to the same family. | Address Part 1, Code, Email, Exact, Geocode, Person Name, Telephone Number |
| Fields | Identifies records that share identity data across multiple fields that you select. | Address Part 1, Code, Company Name, Credit Card, Date, Email, Exact, Geocode, ISBN10, ISBN13, Organization Name, Person Name, Product Description, Product Name, Telephone Number, VIN, [Generic field] |
| Generic | Identifies records that share identity information when different types of identity are represented in a single field. Select the objective for records that contain Australia, United Kingdom, or United States data. Note: The Generic field that you select might contain person names, organization names, or addresses. The objective looks for common values across these information types in the field. | [Generic field] |
| Geocode | Identifies records that share geocode data. | Geocode |
| Household | Identifies individuals that belong to the same household. | Address Part 1, Code, Email, Exact, Geocode, Person Name, Telephone Number |
| Individual | Identifies duplicate individuals. | Date, Code, Email, Exact, Person Name |
| Organization | Identifies records that share organization data. | Address Part 1, Code, Company Name, Date, Exact, Geocode, Organization Name, Telephone Number |
| Person Name | Identifies records that share information about a person. | Address Part 1, Code, Date, Email, Exact, Geocode, Person Name, Telephone Number |
| Product | Identifies records that share information about a product. | Code, Company Name, Exact, Organization Name, Product Description, Product Name |
| Publisher ISBN | Identifies records that share information about a publishing company. The information includes ISBN data for published works. | Address Part 1, Company Name, Exact, Geocode, ISBN10, ISBN13, Organization Name |
| Resident | Identifies duplicate individuals at the same address. | Address Part 1, Code, Date, Email, Exact, Geocode, Person Name, Telephone Number |
| VIN Manufacturer | Identifies records that share information about a vehicle manufacturer. | Address Part 1, Company Name, Exact, Geocode, Organization Name, VIN |
| VIN Owner | Identifies records that share information about a vehicle owner. | Address Part 1, Code, Company Name, Date, Email, Exact, Geocode, Organization Name, Person Name, VIN |
| Wide Contact | Identifies records that share a contact at an organization. | Code, Company Name, Email, Exact, Organization Name, Person Name |
| Wide Household | Identifies individuals that belong to the same household. | Address Part 1, Code, Email, Exact, Geocode, Person Name, Telephone Number |
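
When you plan a configuration outside the user interface, it can help to treat the table above as a lookup from objective to supported index keys. The following is a minimal sketch, not a product API: the dictionary copies only a few rows from the table, and the helper names `is_supported` and `objectives_supporting` are hypothetical.

```python
# Index keys per objective, copied from a few rows of the table above.
INDEX_KEYS = {
    "Address": ["Address Part 1", "Code", "Date", "Exact", "Geocode", "Telephone Number"],
    "Address Code": ["Address Part 1"],
    "Geocode": ["Geocode"],
    "Individual": ["Date", "Code", "Email", "Exact", "Person Name"],
    "Product": ["Code", "Company Name", "Exact", "Organization Name",
                "Product Description", "Product Name"],
}


def is_supported(objective, index_key):
    """Return True if the objective lists the index key in this sample."""
    return index_key in INDEX_KEYS.get(objective, [])


def objectives_supporting(index_key):
    """Return the sampled objectives that accept the index key, sorted by name."""
    return sorted(name for name, keys in INDEX_KEYS.items() if index_key in keys)


print(is_supported("Individual", "Person Name"))  # True
print(is_supported("Individual", "Geocode"))      # False
print(objectives_supporting("Geocode"))           # ['Address', 'Geocode']
```

Because the dictionary holds only a sample of the rows, a `False` result for an objective that is missing from the sample does not mean that the product rejects the combination.
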