You can create the following types of data quality assets:
•Cleanse. A cleanse asset performs standardization operations on data.
•Deduplicate. A deduplicate asset evaluates the degrees of similarity between the identities in a record set and assigns records with highly similar identities to clusters. A cluster is a set of records that are likely to represent the same information. A deduplicate asset optionally consolidates the records in each cluster into a preferred version of the records.
•Dictionary. A dictionary is a reference data set that you use to evaluate data in a rule specification.
•Labeler. A labeler asset identifies values in an input field that correspond to types of information that you specify. The asset returns a label for each value and optionally returns a standardized version of the value.
•Parse. A parse asset identifies one or more values in an input string and writes the values to discrete output fields according to the data that they contain.
•Rule Specification. A rule specification is an asset that represents the data requirements of a business rule in logical form.
•Verifier. A verifier asset evaluates the accuracy and deliverability of address records.
To create the assets, select the New page.
When you select the New page, Data Quality displays the options to create a cleanse asset, a dictionary, a deduplicate asset, a parse asset, a labeler asset, a rule specification asset, or a verifier asset.
Select the asset type that you want to create.
Note: For more information about an asset, including instructions on creating an asset, browse to the corresponding publication in the Cloud Data Quality documentation. For example, the Rule Specifications publication includes a topic on Creating a rule specification.