Profile Management
You can create and run profiles, add primary and unique keys to the source, and verify and add constraints to the source. Alternatively, you can import profiles that you create and run in Informatica Developer.
You can run a profile multiple times, and you can edit a profile between runs.
Creating a Primary Key Profile
Create a primary key profile to identify a list of possible primary keys that you can add to tables without keys. Create a primary key profile if the source data does not contain primary and foreign key relationships.
1. Open the project, and click the Discover view.
2. Click the Profile view.
3. Click Actions > New Profile to create a profile.
4. In the New Profile dialog box, enter the profile name and description. Choose to create a primary key profile.
5. Click Next.
6. Select the tables to profile, and click OK.
7. Click Next.
8. On the Sampling Options page, enter the maximum number of columns that you want to include in the primary key.
9. Enter the maximum number of rows that you want to run the profile on.
10. Enter the minimum conformance required, or enter the maximum number of rows that can violate the criteria.
11. Click Finish.
12. Click Actions > Execute.
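To make the sampling options in steps 8 through 10 more concrete, the following Python sketch shows one way to search for candidate keys. It is an illustration only, not how Test Data Manager implements primary key discovery. The function name `candidate_keys`, its parameters, and the sample rows are all hypothetical. The sketch assumes that conformance is the percentage of sampled rows that do not repeat an earlier key value.

```python
from itertools import combinations


def candidate_keys(rows, max_columns=2, max_rows=1000, min_conformance=100.0):
    """Return (columns, conformance) pairs for column combinations that
    qualify as candidate keys in the sampled rows.

    Hypothetical helper: conformance is assumed to be the percentage of
    sampled rows whose key value was not already seen in an earlier row.
    """
    sample = rows[:max_rows]  # limit the number of rows profiled
    if not sample:
        return []
    columns = list(sample[0].keys())
    results = []
    # Try single columns first, then wider combinations up to max_columns.
    for size in range(1, max_columns + 1):
        for combo in combinations(columns, size):
            seen = set()
            violations = 0
            for row in sample:
                key = tuple(row[c] for c in combo)
                if key in seen:
                    violations += 1  # duplicate key value violates uniqueness
                else:
                    seen.add(key)
            conformance = 100.0 * (len(sample) - violations) / len(sample)
            if conformance >= min_conformance:
                results.append((combo, conformance))
    return results


# Hypothetical sample data: "id" is unique, "dept" and "name" repeat.
rows = [
    {"id": 1, "dept": "A", "name": "Ann"},
    {"id": 2, "dept": "A", "name": "Bob"},
    {"id": 3, "dept": "B", "name": "Ann"},
]

print(candidate_keys(rows, max_columns=1))
```

In this sketch, raising `max_columns` widens the search to composite keys, and lowering `min_conformance` lets columns with a few duplicate values appear as candidates.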
Creating an Entity Profile
The entity profile creates a list of possible parent tables that you can define for data subset operations. Create and run entity profiles in the Discover view.
1. Open the project, and click the Discover view.
2. Click the Profile view.
The Profile view shows a list of the profiles in the project.
3. Click Actions > New Profile to create a profile.
4. In the New Profile dialog box, enter the profile name and description. Choose to create an entity profile.
Select primary key and entity profile if the physical database does not contain any primary keys.
5. Click Next.
6. Select the tables to profile, and click OK.
7. Click Next.
8. On the Sampling Options page, enter the maximum number of columns that you want to include in the primary key.
9. Enter the maximum number of rows that you want to run the profile on.
10. Enter the minimum conformance required, or enter the maximum number of rows that can violate the criteria.
The following table describes the entity options that you can update:
Entity Option | Description |
---|---|
Data types used in comparison | Determines how Test Data Manager derives data types to compare in the tables. Choose one of the following options: Inferred data types, which compares the primary key and the foreign key based on the data types inferred by TDM; or Documented data types, which compares the primary key and the foreign key based on the data types defined for the source column. |
Comparison case sensitivity | Indicates if the foreign key comparison is case-sensitive or not. If you choose case-sensitivity, profile performance might be affected. |
Trim values before comparison | Trims leading values, trailing values, or leading and trailing values. |
Minimum conformance percent | Minimum percentage of rows without key violations that the profile allows for foreign key discovery. |
Regenerate Signature | Determines whether the data profiling warehouse should clear the existing object information before the Data Integration Service runs the profile. |
11. Click Save.
12. Click Actions > Execute to run the profile.
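The entity options for case sensitivity, trimming, and minimum conformance percent interact when foreign key candidates are evaluated. The following Python sketch illustrates that interaction under stated assumptions. It is not the Test Data Manager implementation. The function `fk_conformance`, its parameters, and the sample values are hypothetical, and the sketch assumes that trimming removes whitespace.

```python
def fk_conformance(child_values, parent_values, case_sensitive=False, trim="both"):
    """Return the percentage of child values that match a parent key value.

    Hypothetical helper. `trim` is one of "none", "leading", "trailing",
    or "both" and is assumed to strip whitespace before the comparison.
    """
    def normalize(value):
        text = str(value)
        if trim == "leading":
            text = text.lstrip()
        elif trim == "trailing":
            text = text.rstrip()
        elif trim == "both":
            text = text.strip()
        # Fold case only when the comparison is not case-sensitive.
        return text if case_sensitive else text.lower()

    parents = {normalize(v) for v in parent_values}
    children = [normalize(v) for v in child_values]
    if not children:
        return 0.0
    matches = sum(1 for v in children if v in parents)
    return 100.0 * matches / len(children)


# Hypothetical key values: the child column has stray spaces and mixed case.
parent_keys = ["A1", "B2", "C3"]
child_keys = [" a1", "B2 ", "c3", "D4"]

percent = fk_conformance(child_keys, parent_keys)
minimum_conformance_percent = 70.0
print(percent, percent >= minimum_conformance_percent)
```

With the sample values, three of the four child values match after trimming and case folding, so the relationship passes a 70 percent threshold. A case-sensitive comparison of the same values matches only one value.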
Creating a Data Domain Profile
Create and run data domain profiles in the Discover view.
A project must contain policies before you create a data domain profile. The policies contain the data domains that you can use in a profile for data discovery.
1. Open the project and click the Discover view.
2. Click the Profile view.
The Profile view shows a list of the profiles in the project.
3. Click Actions > New Profile to create a new profile.
4. In the New Profile dialog box, enter the profile name and description. Choose to create a data domain profile.
5. Select the tables to profile and click OK.
6. Click Next.
7. In the Select Sampling Options pane, choose whether to add policies or data domains to the profile. When you select a policy, Test Data Manager includes all data domains in the policy.
Test Data Manager returns a list of policies or data domains in the pane.
8. Select the policies or the data domains to profile.
9. In the Sampling panel, select whether to run data discovery on the source data, the column name, or the data and the column name.
You can run a profile for column metadata and then run it again for the source data.
10. Enter the maximum number of rows to profile.
11. Enter the minimum conformance percent.
Some rows might not conform to the data domain expression pattern. Enter the minimum percentage of the profiled rows that must conform.
12. Click Save.
13. Click Actions > Execute.
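Steps 9 through 11 combine three settings: what to inspect (source data, column name, or both), how many rows to sample, and the minimum conformance percent. The following Python sketch shows how these settings might fit together. It is an illustration only. The function `discover_domain`, the regular expressions, and the sample column are hypothetical and do not represent real data domain definitions or the Test Data Manager API.

```python
import re


def discover_domain(column_name, values, name_pattern, data_pattern,
                    mode="both", max_rows=1000, min_conformance=80.0):
    """Report whether a column matches a data domain by name, by data, or both.

    Hypothetical helper. `mode` is "name", "data", or "both". Checks that
    are not requested are reported as None.
    """
    result = {"name_match": None, "data_match": None, "conformance": None}
    if mode in ("name", "both"):
        # Column metadata check: does the column name fit the name pattern?
        result["name_match"] = (
            re.search(name_pattern, column_name, re.IGNORECASE) is not None
        )
    if mode in ("data", "both"):
        sample = values[:max_rows]  # limit the number of rows profiled
        conforming = sum(
            1 for v in sample if re.fullmatch(data_pattern, str(v))
        )
        percent = 100.0 * conforming / len(sample) if sample else 0.0
        result["conformance"] = percent
        result["data_match"] = percent >= min_conformance
    return result


# Hypothetical column: three of four values fit the expression pattern.
ssn_values = ["123-45-6789", "987-65-4321", "n/a", "111-22-3333"]
report = discover_domain(
    "SSN_NUMBER", ssn_values,
    name_pattern=r"ssn", data_pattern=r"\d{3}-\d{2}-\d{4}",
    mode="both", min_conformance=70.0,
)
print(report)
```

In this sketch, 75 percent of the sampled values conform, so the column matches on data at a 70 percent threshold but not at the default 80 percent. Running with `mode="name"` first and `mode="data"` later mirrors running a profile for column metadata and then again for the source data.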
Creating a Column Profile
To perform a data generation operation, you can create and run a column profile.
1. Open a project and click the Discover view.
2. Click the Profile view.
The Profile view shows a list of the profiles in the project.
3. Click Actions > New Profile.
The New Profile dialog box appears.
4. Enter the profile name and description.
5. Select Column.
6. Click Next.
7. To add the tables that you want to run a profile on, click Add.
The Select Tables dialog box appears.
8. Select the tables, and click OK.
The list of selected tables appears.
9. Click Next.
10. To create the column profile, configure the required sampling options.
11. Click Finish.
The profile opens in a tab.
12. To run the profile, click Actions > Execute.
The Execute Profile dialog box appears.
13. Select a connection to run the column profile.
A column profile run fails if you do not specify a schema name in the Microsoft SQL Server database connection.
14. Click Execute, and close the profile.
15. In the project, click Monitor to view the progress of the profile run.
Editing a Profile
You can edit a profile and run it multiple times. When you run the profile, the Data Integration Service overwrites the original profile results.
When you edit a profile, you can change the profile name and description. You can add or remove tables. When you run data domain profiles, you can change the policies and data domains in the profile. When you run primary key profiles and entity profiles, you can change the options.
1. Open the project and click the Discover | Profile view.
2. In the Discover view, click a profile name to open it.
3. Click Edit.
The Edit Profile dialog box appears. The dialog box has the General tab, the Tables tab, and the Relationship Sampling tab or the Sampling tab depending on the profile type.
4. On the General tab, you can edit the name and description of the profile.
You cannot change the profile type.
5. On the Tables tab, add or remove tables that you want to profile.
To add a table, click Add and select the tables that you want to add. To remove a table, select the check box for the table and click Delete.
6. On the Relationship Sampling tab, choose whether to override the default inference options. Edit the options for finding primary keys and foreign keys in the source data.
7. On the Sampling tab, you can change the data domains or policies in the profile. You can choose to run the profile against the source data, the column name, or both data and column name.
8. Click Actions > Execute to run the profile.
Deleting a Profile
You can delete profiles.
1. Open the project and click the Discover view.
2. In the Discover pane, select the profile row, but do not open the profile.
3. Click Actions > Delete to delete the profile.
4. Click Yes to confirm the deletion.