Duplicate Record Tasks Overview

A duplicate record task identifies the records in a data set that might contain duplicate or redundant information. The task displays the records as a series of clusters. Each cluster identifies a set of records that contain similar or identical data values.

When you work on a duplicate record task, review each cluster and determine whether the records in the cluster are duplicates of each other. If the records are duplicates, define a preferred version of the record that the cluster represents. If the cluster contains a record that is unique, that is, not a duplicate of the other records, move the record to an empty cluster. If a record matches the records in another cluster, you can move the record to that cluster.
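The review steps above can be pictured with a small data model. The following Python sketch is illustrative only and is not the product's API: the Cluster class, the move_record and is_complete helpers, and the sample records are all hypothetical names introduced for this example.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Cluster:
    """Hypothetical stand-in for one cluster in a duplicate record task."""
    records: list = field(default_factory=list)
    preferred: Optional[dict] = None  # preferred version, once a reviewer defines it


def move_record(record: dict, source: Cluster, target: Cluster) -> None:
    """Move a record to another cluster (for example, an empty one if it is unique)."""
    source.records.remove(record)
    target.records.append(record)
    # A record that leaves a cluster can no longer be that cluster's preferred record.
    if source.preferred == record:
        source.preferred = None


def is_complete(clusters: list) -> bool:
    """True when every non-empty cluster has a preferred record defined."""
    return all(c.preferred is not None for c in clusters if c.records)


# Sample review: records 1 and 2 are judged duplicates, record 3 is unique.
ann_1 = {"id": 1, "name": "Ann Lee"}
ann_2 = {"id": 2, "name": "Anne Lee"}
bob = {"id": 3, "name": "Bob Ray"}

cluster = Cluster(records=[ann_1, ann_2, bob])
empty_cluster = Cluster()

move_record(bob, cluster, empty_cluster)   # unique record goes to an empty cluster
cluster.preferred = ann_1                  # reviewer defines the preferred version
done_before = is_complete([cluster, empty_cluster])
empty_cluster.preferred = bob
done_after = is_complete([cluster, empty_cluster])
```

In this sketch the task counts as ready to complete only after both clusters have a preferred record, which mirrors the review order described above.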

Complete the task after you review all of the clusters and define a preferred record for each cluster.

Note: Two or more records are duplicates when they represent the same entity in the source data set. Records can contain similar data and still not represent the same information to the business. Your organization defines the business rules that identify duplicate records.
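As an illustration of the note above, a business rule can be thought of as a test applied to a pair of records. The rule below is a hypothetical example, not one that the product supplies: it assumes that records carry name and date_of_birth fields and treats two records as duplicates only when both values match.

```python
def normalize(text: str) -> str:
    """Lowercase the text and collapse repeated whitespace."""
    return " ".join(text.lower().split())


def same_customer(a: dict, b: dict) -> bool:
    """Hypothetical business rule: same normalized name and same date of birth."""
    return (
        normalize(a["name"]) == normalize(b["name"])
        and a["date_of_birth"] == b["date_of_birth"]
    )


# Similar data, different entities: a parent and child who share a name.
parent = {"name": "John Smith", "date_of_birth": "1961-04-02"}
child = {"name": "John  Smith", "date_of_birth": "1990-11-17"}
# Same entity entered twice with different capitalization.
repeat = {"name": "john smith", "date_of_birth": "1961-04-02"}

parent_vs_child = same_customer(parent, child)
parent_vs_repeat = same_customer(parent, repeat)
```

Under this assumed rule, the parent and child records look alike but are not duplicates, while the two entries for the parent are.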