Developer Transformation Guide > Duplicate Record Exception Transformation > Ports

Ports

The Duplicate Record Exception transformation has multiple groups of input and output ports.

The following figure shows an example of the input and output ports:

The ports include the Data and Control input groups. The output groups are Standard Output and Cluster Data.

Duplicate Record Exception Transformation Input Ports

A Duplicate Record Exception transformation has a Data group and a Control group of input ports.

The Data group contains user-defined ports that receive the source data.

The Control ports receive metadata that the Match transformation adds to the source data. The following table describes the Control ports:

Port	Description
Score	Decimal value between 0 and 1. Identifies the degree of similarity with the record that linked the record to the cluster.
Row_Identifier	A unique identifier for the record.
Cluster_ID	The ID of the match cluster to which the record belongs.

Duplicate Record Exception Transformation Output Ports

The Duplicate Record Exception transformation has multiple output groups. By default, the transformation writes duplicate records to the Standard Output group. The transformation writes potential matches to the Cluster Data group. You can add an output group for unique records.

You can change the record types that the transformation writes to output ports by changing the default settings on the Configuration view.

The following table describes the output ports for the Standard Output group:

Port	Description
Score	Decimal value between 0 and 1. Identifies the degree of similarity between the record and another record in a cluster.
Row_Identifier	A unique identifier for the record.
Cluster_ID	ID of the cluster that the Match transformation assigned the record to.
User-defined ports	The source data fields.

The following table describes the output ports in the Cluster Data group:

Port	Description
Row_Identifier	The unique identifier for the record.
Sequential_Cluster_ID	Identifies the cluster in a Human task. A workflow uses the sequential cluster ID to assign a cluster to an instance of a Human task.
Cluster_ID	Identifies the cluster that the record belongs to. The Match transformation assigns a cluster ID to all records.
Score	Decimal value from 0 to 1. Identifies the degree of similarity with the record that linked the record to the cluster.
Is_Master	String value that indicates if the record is the preferred record in the cluster. By default, the first row in the cluster is the preferred record. Value is Y or N.
Workflow_ID	ID that identifies the workflow for the record in a task. When you run the mapping outside of a workflow, the workflow ID is DummyWorkflowID.
User-defined ports	The source data ports.

Creating Ports

Add each input port to the Data group. When you add the input port, the Developer tool adds an output port with the same name to the Standard Output group, the Cluster Data group, and the Unique Records group.

1. Select the Data input group.

The group is highlighted.

2. Click New (Insert).

The Developer tool adds a field to the Data group, the Standard Output group, the Cluster Data group, and the Unique Records group.

3. Change the name of the field as required.

The Developer tool changes the field name in the other groups.

4. Enter the rest of the ports that you need to add for the data source.