Data Domain Discovery Options in Informatica Developer

You can select the source columns, data domains, and inference options when you create a profile to perform data domain discovery. You can also choose to omit columns from data domain discovery based on their datatypes and data length.

Data Domain Selection in Informatica Developer

The Data Domain Selection options list all the data domains in the data domain glossary. You can search for specific data domains and select the ones to include when you run data domain discovery.
The following table describes the Data Domain Selection options for data domain discovery:
| Option | Description |
| --- | --- |
| Enabled as part of the "Run Profile" action | Includes the data domain discovery options when you run the profile. |
| Name | Data domain name. |
| Description | Description for the data domain. |
| Data Domain Group | Names of the data domain groups to which the data domain belongs. |
| Show data domain group in hierarchy | Lists all data domain groups with the data domains grouped under each data domain group. |

Data Domain Column Selection in Informatica Developer

Use the Column Selection options to choose the columns to include in data domain discovery.
The following table describes the Column Selection options for data domain discovery:
| Option | Description |
| --- | --- |
| Column | Column name. |
| Datatype | Datatype of the column. |
| Precision | Maximum precision for the column. |
| Scale | Scale of the column. |
| Nullable | Indicates a column that can have null values. |
| Description | Description for the column. |

Data Domain Inference Options in Informatica Developer

The inference options determine whether domain discovery must run on column data, column name, or both. You can also specify whether the profile needs to process all rows in the data source and set a minimum conformance percentage.
The following table describes the Inference options for data domain discovery:
| Option | Description |
| --- | --- |
| Override the default inference options | Enables you to change the predefined inference options. |
| Data | Profile runs on column data. |
| Column name | Profile runs on column names. |
| Data and Column name | Profile runs on both column data and column names. |
| All rows | Profile runs on all rows of the data source. |
| Maximum rows to profile | The maximum number of rows the profile can run on. The Developer tool chooses the rows starting from the first row in the source. |
| Minimum conformance percentage | The minimum conformance percentage that the column data must reach to be eligible for a data domain match. The conformance percentage is the number of matching rows divided by the total number of rows, expressed as a percentage. |
Note: The Developer tool considers null values as non-matching rows. A column that contains a high proportion of null values might not be inferred to a data domain unless you specify a low value for the minimum conformance percentage.
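
The conformance calculation described above can be sketched in a few lines of Python. This is an illustration only, not the Developer tool's implementation: the function name `conformance_percentage`, the `max_rows` parameter, and the use of a regular expression as a stand-in for a data domain rule are all assumptions made for the example. It shows how null values, counted as non-matching rows, lower the percentage, and how limiting the profile to the first rows of the source changes the result.

```python
import re
from itertools import islice
from typing import Iterable, Optional


def conformance_percentage(
    values: Iterable[Optional[str]],
    pattern: str,
    max_rows: Optional[int] = None,
) -> float:
    """Percentage of profiled rows whose value matches the rule.

    Hypothetical helper: a regular expression stands in for a data
    domain rule. Null (None) values count as non-matching rows.
    """
    # Take rows starting from the first row, up to max_rows if it is set.
    rows = list(values if max_rows is None else islice(values, max_rows))
    if not rows:
        return 0.0
    rule = re.compile(pattern)
    matching = sum(
        1 for value in rows if value is not None and rule.fullmatch(value)
    )
    return 100.0 * matching / len(rows)


# Sample column: two matching values, two nulls, one non-matching value.
column = ["94105", "10001", None, None, "abc"]
zip_rule = r"\d{5}"  # assumed rule for a hypothetical "ZIP code" data domain

score = conformance_percentage(column, zip_rule)
print(score)        # 40.0
print(score >= 50)  # False: below a 50% minimum conformance percentage
print(score >= 30)  # True: a lower minimum lets the column qualify
```

In this sketch, two of five rows match, so the column reaches 40%. The two nulls count against the column, which is why a lower minimum conformance percentage is needed for a sparsely populated column to qualify.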