Data Domain Discovery Results in Informatica Developer
Data domain discovery results display statistics about columns that match data domains, including conformance criteria for data domain match, and whether column names match data domains.
You can drill down the results further for analysis. You can also verify the results on all the rows of the data source and add the results to a data model from the editor in the Developer tool. You can sort the results based on data domains, data domain groups, and columns. You can export data domain discovery results to a Microsoft Excel file.
The following table describes the data domain discovery results:
Column name | Description |
---|
Name | Name of the data domain, data domain group, or column based on whether you select the Data Domain, Data Domain Group, or Columns view. |
Connection | Name of the connection. |
Status | The inference status of the column. |
% Data Conformance | The minimum conformance percentage of rows required for a data domain match. |
Conforming row count | The minimum number of rows required for a data domain match. |
% Null | The percentage of null values for the column. |
Total Rows | The total number of rows. |
Column Name Match | Indicates whether the column name matches a data domain name. |
Data Domain Groups | The data domain group that the data domain belongs to. |
Documented Data type | Data type declared for the column in the profile object. |
Drilldown | If selected, drills down on rows. |
Verified | Indicates the validation of the data domain match on all rows of the data source. |
Last Run Time | Date and time of the last profile run. |
Viewing by Data Domain Groups
You can view data domain discovery results sorted by data domain groups.
1. Run the profile to view its results.
2. Click Results.
3. Click Data Domain Discovery.
You can view the data domain discovery results in the right panel.
4. Verify that the Data Domain option is selected in the Show field.
5. Select Show data domain group hierarchy to view the results sorted by the data domain groups.
Viewing by Columns
You can view data domain discovery results sorted by source columns that match data domains.
1. Run the profile to view its results.
2. Click Results.
3. Click Data Domain Discovery.
You can view the data domain discovery results in the right panel.
4. Select Columns to view the results sorted by source columns that match data domains.
Verifying the Results
When you run a profile, it analyzes a sample of the data source to infer profile results. You can run the profile on all rows of the source data to verify the inference results.
1. Run the profile to view its results.
2. Click Results.
3. Click Data Domain Discovery.
You can view the data domain discovery results in the right panel.
4. Select a column in the right panel that you want to verify.
5. Right-click the column and select Verify to run the profile on all rows of the data source.
You might see a change in the % Data Conformance value or Conforming row count value after you verify the results.
6. To verify the inference results of multiple columns, select multiple columns. You can then right-click and select Verify All.
Approving Data Domains
If you run data domain discovery in a single data object profile, you can approve the inferred data domains for multiple columns at a time. If you run data domain discovery as a part of enterprise discovery, you can approve the data domain of one source column at a time. To approve the data domains of multiple columns after enterprise discovery, you can open the individual data object profile tasks and approve the data domains.
1. In the Object Explorer view, select a profile.
2. Double-click the profile to open it.
The profile opens in a tab.
3. If you ran a single data object profile, select the Data Domain Discovery view and then select a row. The row contains data domain discovery results for each column.
4. Right-click the row, and select Accept.
The inference status of the data domain changes to Accepted.
5. If you ran enterprise discovery, select the Data Domains view and then select a data domain.
The columns that match the data domain appear in the right panel.
6. Right-click the column that you want to approve and select Accept. You can also select multiple rejected columns and approve them as required.
The inference status of the data domain changes to Accepted.
7. To restore the inferred status of the data domain, right-click the row and click Reset.
Rejecting Data Domains
Informatica Developer displays inferred data domains in the profile results by default. You can reject inferred or approved data domains. You can choose to show or hide the rejected data domains.
1. In the Object Explorer view, select a profile.
2. Double-click the profile to open it.
The profile opens in a tab.
3. In the Data Domain Discovery view, or Data Domains view, select a row.
4. To reject inferred data domains, right-click the row and select Reject.
Informatica Developer greys out the rejected data domain from the data domain discovery results.
5. To hide the rejected data domains, right-click the row and select Hide Rejected.
6. To view the rejected data domains, right-click one of the rows, and select Show Rejected.
Exporting Data Domain Discovery Results from Informatica Developer
When you export the data domain discovery results to an .xlsx file from Informatica Developer, you can save the file either to the server or a specific location in the client machine.
1. Run a profile to perform data domain discovery.
2. Click the Results view.
3. Click the Export Results to File icon.
The Export data to a file dialog box appears.
4. Enter the file name. Optionally, use the default file name.
5. Under Save, choose Save on Client and click Browse to select a location and save the file locally in your computer. By default, Informatica Developer writes the file to a server location set in the Data Integration Service properties of Informatica Administrator.
6. Click OK.