Part II: Data Discovery with Informatica Analyst > Enterprise Discovery in Informatica Analyst > Configuration Options for Enterprise Discovery
  

Configuration Options for Enterprise Discovery

The configuration options for enterprise discovery include data domain discovery options, column profile sampling options, and general profile properties such as name and description.
You can choose to run a column profile or a profile to perform data domain discovery. You can also choose to run both column profile and a profile to perform data domain discovery as part of the configuration.

Data Domain Discovery Settings

The data domain discovery settings include choosing whether data domain discovery must run on column data, column name, or both column data and column name. You can choose data domains and specify whether data domain discovery needs to process all the rows in the data source. You can also set a minimum conformance percentage for data domain inference.
The following table describes the data domain discovery settings that you can configure for enterprise discovery in the Analyst tool:
Option
Description
Enable data domain discovery
Performs data domain discovery as part of enterprise discovery.
Run data domain discovery on data
Performs data domain discovery on column data.
Run data domain discovery on column name
Performs data domain discovery on the name of each column.
All rows
Performs data domain discovery on all source rows.
Maximum rows for data domain discovery
The maximum number of rows the profile can run on. The Analyst tool chooses rows starting from the first row in the source.
Minimum conformance percentage
The minimum conformance percentage of the column data to be eligible for a data domain match. The conformance percentage is the ratio of number of matching rows divided by the total number of rows.
Note: The Analyst tool considers null values as nonmatching rows.
Exclude columns with approved data domains
Excludes columns with approved data domains from the data domain inference of the profile run.

Column Profile Sampling Options

The sampling options determine whether the Analyst tool runs a column profile on all rows of the data sources or limited number of rows.
The following table describes the column profile sampling options that you can configure for enterprise discovery:
Option
Description
Enable column profiling
Runs a column profile as part of enterprise discovery.
All Rows
Runs a column profile on all rows in the data source.
First <number> Rows
The number of rows that you want to run the column profile on. The Analyst tool chooses the rows starting from the first row in the data source.
Exclude datatype inference for columns with approved datatypes
Excludes columns with an approved datatype from the datatype inference of the profile run.