You can run a column profile and data domain discovery as part of enterprise discovery in Informatica Analyst.
1. In the Discovery workspace, select New > Profile.
The New Profile wizard appears.
2. Select Enterprise Discovery. Click Next.
The Specify General Properties tab appears.
3. In the Specify General Properties tab, enter a name for the enterprise discovery profile and an optional description. In the Location field, select the project or folder where you want to create the profile. Click Next.
The Select Data Objects tab appears.
4. In the Select Data Objects tab, click Choose.
The Choose Data objects dialog box appears.
5. In the Choose Data objects dialog box, choose one or more data objects to add to the profile. Click Save.
The data objects appear in the Data Objects pane.
6. Click Next.
The Select Resources tab appears.
7. In the Select Resources tab, click Choose to open the Select Resources dialog box.
You can import data from multiple relational data sources.
8. In the Select Resources dialog box, select the connections, schemas, tables, and views that you want to include in the profile. The left pane in the dialog box lists all the internal and external connections, schemas, tables, and views under the Informatica domain. Click Save.
The resources appear in the Resource pane.
9. Click Next.
The Specify Settings tab appears.
10. In the Specify Settings tab, you can configure the column profile options and data domain discovery options. Click Save and Finish to save the enterprise discovery profile, or click Save and Run to run the profile.
You can perform the following tasks in the Specify Settings tab:
- Enable data domain discovery. Click Choose to select the data domains that you want to discover from the Choose Data Domains dialog box. The selected data domains appear in the Data Domains for Data Domain Discovery pane.
- Run data domain discovery on data, on column names, or on both data and column names.
- Select all the rows in the data source, or choose a maximum number of rows to run data domain discovery on.
- Choose a minimum conformance percentage or specify the minimum number of conforming rows for data domain discovery.
- Enable column profile settings and select all rows or the first few rows in the data source for the column profile. You can exclude data type inference for columns with approved data types in the column profile.
- Choose Native, Blaze, or Spark as the run-time environment. If you choose Blaze or Spark, select a Hadoop connection to run the profiles.
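To illustrate how the row limit and the minimum conformance settings interact, the following Python sketch applies them to a single column of sample values. It is only a conceptual illustration, not Informatica code or an Informatica API: the function names, the regular-expression data domain rule, the sample data, and the choice to count nonmatching values such as "N/A" in the percentage are all assumptions made for this example.

```python
import re


def conformance(values, pattern, max_rows=None):
    """Return (conforming_row_count, conformance_percentage) for a column.

    max_rows mimics the "maximum number of rows" option; None means all rows.
    The regex pattern stands in for a hypothetical data domain rule.
    """
    sample = values if max_rows is None else values[:max_rows]
    regex = re.compile(pattern)
    conforming = sum(
        1 for v in sample if v is not None and regex.fullmatch(str(v))
    )
    pct = 100.0 * conforming / len(sample) if sample else 0.0
    return conforming, pct


def matches_domain(values, pattern, min_pct=None, min_rows=None, max_rows=None):
    """Decide whether a column conforms, using one of the two thresholds."""
    conforming, pct = conformance(values, pattern, max_rows)
    if min_pct is not None:
        return pct >= min_pct          # minimum conformance percentage
    if min_rows is not None:
        return conforming >= min_rows  # minimum number of conforming rows
    raise ValueError("specify min_pct or min_rows")


# Hypothetical column values and a hypothetical five-digit ZIP code rule.
zip_codes = ["94105", "10001", "N/A", "60614", "7302"]
ZIP_PATTERN = r"\d{5}"

count, pct = conformance(zip_codes, ZIP_PATTERN)
print(count, pct)
print(matches_domain(zip_codes, ZIP_PATTERN, min_pct=50))
print(matches_domain(zip_codes, ZIP_PATTERN, min_pct=80))
print(matches_domain(zip_codes, ZIP_PATTERN, min_rows=3))
```

In this sketch, three of the five sample values match the rule, so the column passes a 50 percent threshold or a three-row threshold but fails an 80 percent threshold. Limiting the run to the first two rows would raise the measured conformance to 100 percent, which shows why the row limit can change which data domains are inferred.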
You can view the enterprise discovery results under the Summary and Profiles tabs.