Deduplicate assets > Test deduplicate assets > Testing a deduplicate asset
  

Testing a deduplicate asset

Test a deduplicate asset to verify that the data flows through the asset in the ways that you expect.
    1Open the deduplicate asset.
    2Select the Deduplication tab.
    3 Select a Secure Agent to run the test.
    To refresh the list of active Secure Agents, click the Refresh icon.
    4Enter data values in the test panel, or import the data to test. To import the data, click the Import option in the test panel.
    Consider the following guidelines before you add or import data to the test panel:
    For more information see Rules and guidelines for test data.
    5Optionally, save the asset to preserve the current test data and configuration.
    6Click Test.
    You can sort, search, and filter the test results in the following ways:
    The sort, search, and filter options work together. For example, if you apply a filter to the test data and you enter a value in the Find field, the asset displays any row that meets both the filter and search criteria.
    7Verify that the deduplication process analyzes the test data in the manner that you expect.
    8Optionally, select the Consolidation tab.
    Use the options on the tab to verify that the consolidation process generates preferred records in the manner that you expect.
    The test panel on the Consolidation tab contains the results of any test that you ran on the Deduplication tab.
    Note: If you add one or more fields to the Consolidation tab, bear in mind that several field names are reserved on the deduplicate asset. To read the list of reserved field names, see Configuring the consolidation process.
    9Click Test, and review the results of the test. You can test the consolidation options in row-based mode and field-based mode.

Understanding the test results

When you run a test on the Deduplication or Consolidation tab, the test results include a number of predefined fields. The input data and the test results on the Deduplication tab form the basis of the test that you can perform on the Consolidation tab.
The test results on the Deduplication tab include the following predefined fields:
Cluster ID
Contains the identifier of the cluster to which the input record belongs.
In the deduplication process, a cluster is a set of records whose data values match each other to a degree that exceeds the duplicate threshold. Records in the same set are likely to identify the same identity. A set may contain a single record, as every unique record is a perfect match with itself.
Cluster Size
Contains the number of records in the set to which the current record belongs. When a set contains a unique record, the cluster size is 1.
The test results on the Consolidation tab include the following predefined fields:
Cluster ID
Contains the identifier of the cluster to which the input record belongs. The Cluster ID fields on the Deduplication and Consolidation tabs contain identical information for a given test.
Preferred Record
Contains the values in the preferred record that the test creates for the current input.
The Deduplication and Consolidation tabs also display the mandatory and required fields that apply for the objective and index key that you select. In addition, the Consolidation tab displays all of the fields that the objective can use and any custom field that you add.

Rules and guidelines for test data

You can import data to the test panel in the deduplicate asset and save the test data in the asset configuration.
Consider the following rules and guidelines when you add data to the test panel:
The following rules and guidelines apply to the deduplicate asset: