Data Subset Task Flow
To define a data subset, add entities, groups, and templates to an project. Create a data subset plan and add the entities, groups, and templates from the project to the plan. To load data into the subset database, generate and start a workflow from the plan.
Complete the following high-level tasks to create the components for the data subset plan:
- 1. Create a project that contains one or more sources.
- 2. Create an entity to select related parent and child tables to include in the subset database.
- 3. Create filter criteria that defines what data to include in the subset database.
- 4. Optionally, create a group to select unrelated tables to include in the subset database.
- 5. Optionally, create a template that contains the entities and groups.
- 6. Create a data subset plan and add the entities, groups, and templates to it. Define parameters to filter the data in the columns. For example, you can define a parameter to filter a month column to contain only October.
- 7. Generate a workflow from the plan.
- 8. Start the workflow.
- 9. Monitor the progress of the workflow.
- 10. In the target database, verify that the correct data was copied.