Data Subset Task Flow
To define a data subset, add entities and groups to a project. Create a data subset plan and add the entities and groups from the project to the plan. To load data into the subset database, generate and start a workflow from the plan.
Complete the following high-level tasks to create the components for the data subset plan:
- 1. Create a project that contains one or more sources.
- 2. Create an entity to select related parent and child tables to include in the subset database.
- 3. Create filter criteria that defines what data to include in the subset database.
- 4. Optionally, create a group to select unrelated tables to include in the subset database.
- 5. Create a data subset plan and add the entities and groups to it. Define parameters to filter the data in the columns. For example, you can define a parameter to filter a month column to contain only October.
- 6. Generate a workflow from the plan.
- 7. Start the workflow.
- 8. Monitor the progress of the workflow.
- 9. In the target database, verify that the correct data was copied.