Data Subset Overview
You can create a subset of production data if you need a small, targeted, and referentially intact copy of the production data to use in a nonproduction environment. A nonproduction environment might include development, test, or training environments.
For example, you might create a subset of financial data for a specific region or time period.
To create a data subset from relational and flat file sources, you define one or more of the following components:
- Entity
- Defines a set of tables that are related based on physical or logical constraints. Create an entity when you need to copy data from related tables to a subset database. When you create an entity, select parameters to filter data in columns that you want to include in the subset database. You can manually create an entity or use an entity from profile results.
- Group
- Defines a set of unrelated tables. Create a group when you need to copy data from one or more unrelated tables to a subset database.
- Template
- Contains entities and groups. A template must contain at least one entity. Create a template when you want to reuse a set of entities and groups.
After you define the data subset components, add them to a plan. You can generate a workflow from the plan. The workflow loads a subset of the source data into a target database.