Option | Description |
---|---|
Enable data domain discovery | Performs data domain discovery as part of enterprise discovery. |
Run data domain discovery on data | Performs data domain discovery on column data. |
Run data domain discovery on column name | Performs data domain discovery on the name of each column. |
Minimum conformance percentage | The minimum percentage of rows in the data set that must conform for a data domain match. The conformance percentage is the number of matching rows divided by the total number of rows, expressed as a percentage. Note: The Analyst tool considers null values as nonmatching rows. |
Minimum conforming rows | The minimum number of conforming rows in the data set required for a data domain match. |
Exclude null values from data domain discovery | Excludes null values from the data set during data domain discovery. |
Exclude columns with approved data domains | Excludes columns with approved data domains from the data domain inference of the profile run. |
All rows | Performs data domain discovery on all source rows. |
First | The maximum number of rows the profile can run on. The Analyst tool chooses rows starting from the first row in the source. |
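
The conformance percentage and the effect of null values can be illustrated with a small Python sketch. This is not how the Analyst tool implements discovery; the function `conformance`, the rule `is_zip`, and the sample column are hypothetical names chosen for illustration. It assumes that excluding null values removes them from both the matching count and the total row count.

```python
from typing import Callable, Optional, Sequence, Tuple


def conformance(
    values: Sequence[Optional[str]],
    matches_domain: Callable[[str], bool],
    exclude_nulls: bool = False,
) -> Tuple[int, float]:
    """Return (conforming_rows, conformance_percentage) for one column."""
    # Assumption: with exclude_nulls, nulls are dropped before counting.
    # Otherwise they stay in the total and count as nonmatching rows.
    if exclude_nulls:
        rows = [v for v in values if v is not None]
    else:
        rows = list(values)
    conforming = sum(1 for v in rows if v is not None and matches_domain(v))
    percentage = 100.0 * conforming / len(rows) if rows else 0.0
    return conforming, percentage


# Hypothetical column and rule: a value conforms if it is exactly five digits.
column = ["94107", "10001", None, "abc", "60614"]
is_zip = lambda v: len(v) == 5 and v.isdigit()

print(conformance(column, is_zip))                      # (3, 60.0)
print(conformance(column, is_zip, exclude_nulls=True))  # (3, 75.0)
```

In this sample, the null row lowers the conformance percentage from 75 to 60 when it is kept in the data set, while the number of conforming rows stays at 3 in both cases.
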

Option | Description |
---|---|
Enable column profiling | Runs a column profile as part of enterprise discovery. |
Exclude approved data types and data domains from the data type and data domain inference in the subsequent profile runs | Excludes approved data types and data domains from data type and data domain inference in subsequent profile runs. |

Option | Description |
---|---|
All Rows | Runs a column profile on all rows in the data source. |
First <number> Rows | The number of rows that you want to run the column profile on. The Analyst tool chooses the rows starting from the first row in the data source. |

Option | Description |
---|---|
Native | The Analyst tool submits the profile jobs to the Profiling Service Module. The Profiling Service Module then breaks down the profile jobs into a set of mappings. The Data Integration Service runs these mappings and writes the profile results to the profile warehouse. |
Hadoop | The Data Integration Service pushes the profile logic to the Blaze engine on the Hadoop cluster to run profiles. |