Part II: Data Discovery with Informatica Analyst > Data Domain Discovery in Informatica Analyst > Data Domain Discovery Options in Informatica Analyst
  

Data Domain Discovery Options in Informatica Analyst

Use the data domain discovery options to choose the columns, data domains, and inference options for data domain discovery. Inference options include choosing whether you want to run data domain discovery based on a rule on column data, column name, or both.

Data Domain Column Selection in Informatica Analyst

The Columns options lists all the columns in the data source. You can choose the columns you want to run as a part of data domain discovery.
The following table describes the Columns options for data domain discovery:
Option
Description
Name
Column name.
Data Type
Datatype of the column.
Precision
The maximum precision for the column.
Scale
Scale of the column.
Nullable
Indicates a column that can have null values.
Description
The description for the column.

Data Domain Selection in Informatica Analyst

The Data domains options list all the data domains from the data domain glossary. You can choose the data domains you want to run as a part of data domain discovery.
The following table describes the Data domains options for data domain discovery:
Option
Description
Show data domain group in hierarchy
Lists all data domain groups with the data domains grouped under each data domain group.
Name
Data domain name.
Data domain group
The name of the data domain group to which the data domain belongs.
Description
The description for the data domain.

Data Domain Inference Options in Informatica Analyst

Inference options determine whether data domain discovery must run on column data, column name, or both. You can also specify the maximum number of rows the profile can analyze and the minimum conformance percentage for data domain match.
The following table describes the inference options for data domain discovery:
Option
Description
Data
Runs the profile on column data.
Column name
Runs the profile on column titles.
Data and column name
Runs the profile on both column data and column titles.
All Rows
Runs the profile on all rows from the source.
Maximum rows to run a profile on
Maximum number of rows the profile can run on. The Analyst tool chooses the rows starting from the first row in the source.
Minimum Conformance Percentage
The minimum conformance percentage of the column data to be eligible for data domain match. The conformance percentage is the ratio of number of matching rows divided by the total number of rows.
Note: The Analyst tool considers null values as nonmatching rows. Columns containing a high number of null values might not result in data domain inference unless you specify a low value for minimum conformance percentage.
Exclude columns with an approved data domain
Excludes columns with an approved data domain from the data domain inference of the profile run.