Configuring a Data Masking Task
Configure a Data Masking task to apply Data Masking to a source.
To configure a Data Masking task, perform the following steps:
1. Define the Data Masking task.
2. Configure the source.
3. Configure the target.
4. Configure the data subset.
5. Define data masking rules.
6. Schedule the Data Masking task.
When you configure a Data Masking task, you can save your work at any time. You can choose one of the following options:
- Save and continue
- Save and run
- Save and close
Prerequisites
Before you configure a Data Masking task, perform the following tasks:
- Use API version 32.0 or later to perform the Data Masking task.
- Set the EnableSalesForceStagingResponse flag to True in the runtime environment to improve batch processing.
- Disable triggers, validations, lookup filters, and workflow rules on the target object.
- Ensure that the source is synchronized with the target.
- Ensure that all the mandatory fields in the target are present in the source.
- Align the user profiles, and check the user permissions and the visibility of the objects in the target.
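The API version and mandatory-field prerequisites can be checked with a small script before you open the wizard. The following is a minimal sketch, not part of the product: the function names and the sample field names are hypothetical, and it assumes that you have already collected the field names from the source and target by your own means.

```python
from __future__ import annotations

MIN_API_VERSION = (32, 0)  # minimum API version named in the prerequisites


def api_version_supported(version: str) -> bool:
    """Return True if a version string such as '32.0' meets the minimum."""
    major, _, minor = version.partition(".")
    return (int(major), int(minor or 0)) >= MIN_API_VERSION


def missing_mandatory_fields(target_mandatory: set[str], source_fields: set[str]) -> set[str]:
    """Return the mandatory target fields that the source does not contain."""
    return target_mandatory - source_fields


if __name__ == "__main__":
    # Hypothetical field names, for illustration only.
    print(api_version_supported("31.0"))
    print(sorted(missing_mandatory_fields({"Name", "OwnerId"}, {"Name", "Phone"})))
```

An empty result from missing_mandatory_fields means that the source covers every mandatory target field.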
Step 1. Define the Data Masking Task
Create the Data Masking task.
1. Click Task Wizards > Data Masking.
2. Click New.
The Definition page appears.
3. Enter a name and a description for the Data Masking task, and then click Next.
Step 2. Configure the Source
To configure a source, select a connection on the Source page and edit the connection. Alternatively, you can create another connection.
1. On the Source page, select a Salesforce connection.
The source must have a primary key.
2. Optionally, choose to edit the connection or create another connection.
- To edit the connection, select a connection and click View. In the View Connection dialog box, click Edit.
- To create another connection, click New. In the New Connection dialog box, enter the connection information.
3. Choose a single source object or multiple source objects.
- To select a single object from the list, click Single. Select a single source object from the list. You can preview the source object details in the Data Preview section.
- To select many objects, click Multiple.
A list of source objects appears.
Note: Objects that you cannot update do not appear in the list of source objects. For example, you cannot update objects of type isSfIdLookup(), isCreateable(), isupdateable(), or isreferenced(). You cannot update objects with cyclic relationships. For example, you cannot update the User, Profile, Community, and Idea objects.
4. If you choose multiple source objects, perform the following steps:
a. Click Add.
The Select Source Object dialog box appears with a list of objects.
b. Select a source object and click Select.
c. Select the added object and click Add.
The Select Related Objects dialog box appears.
d. Select the related child, parent, and self-reference objects that you want to include in the source objects.
e. To add the related objects, click Select.
You can view and download the schema graph in the DOT format to view the relationships between the related objects.
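After you download the schema graph, you can list the relationships it contains without a graph viewer. The following is a minimal sketch under stated assumptions: it assumes that the DOT file uses plain directed edge statements of the form A -> B, and the sample graph and object names are hypothetical. The exact layout of the file that the task produces is not described here, so adjust the pattern to match what you download.

```python
from __future__ import annotations

import re

# Matches edge statements such as  Account -> Contact;  or  "Account" -> "Contact"
EDGE = re.compile(r'"?([\w.]+)"?\s*->\s*"?([\w.]+)"?')


def read_edges(dot_text: str) -> list[tuple[str, str]]:
    """Return a (from, to) pair for every directed edge found in the DOT text."""
    return EDGE.findall(dot_text)


# Hypothetical graph, for illustration only.
SAMPLE = """
digraph schema {
    Account -> Contact;
    Account -> Opportunity;
    "Account" -> "Account";
}
"""

if __name__ == "__main__":
    for src, dst in read_edges(SAMPLE):
        print(f"{src} -> {dst}")
```

An edge whose two ends are the same object, as in the last sample line, corresponds to a self-reference relationship.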
Step 3. Configure the Target
To configure a target, select a connection on the Target page. The target connection type must be the same as the source connection type.
1. On the Target page, you can choose to perform in-place masking or save the masked data to a different location.
- To perform in-place masking, select Same as source. If the target is the same as the source, the Connection list is disabled. The Same as source check box is selected by default.
- To save masked data to a different location, clear Same as source, and select a connection from the Connection list.
The target object or objects are the same as the source.
2. Optionally, choose to edit the connection or create another connection.
- To edit the connection, select a connection and click View. In the View Connection dialog box, click Edit.
- To create another connection, click New. In the New Connection dialog box, enter the connection information.
3. From the Task Operation list, select the operation that you want to perform.
You can perform update, upsert, and insert operations on a partial sandbox. When you run an in-place masking task, you can perform an update operation. If the target is different from the source, you can perform insert and upsert operations.
4. Select a target field that can link to target records. Select an existing external ID, custom field, or unique field from the list, or create another target field. To create another target field, click Create. To add the target field, click Create.
You can view the target field details and any errors or warnings. You can save the external IDs to perform another upsert operation.
5. If you change the source object or connection properties, click Refresh Fields.
6. If you want to validate the source and target fields, click Validation Reports.
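The relationship between the Same as source option and the Task Operation list can be restated as a lookup. The following is a minimal sketch that only encodes the rule described in step 3. The function names are hypothetical, and the wizard itself enforces the rule, so the code is useful mainly when you plan tasks outside the wizard.

```python
from __future__ import annotations


def allowed_operations(same_as_source: bool) -> list[str]:
    """Return the task operations that the text describes for the target choice."""
    if same_as_source:
        # In-place masking: only the update operation is described.
        return ["update"]
    # Target differs from the source: insert and upsert are described.
    return ["insert", "upsert"]


def check_operation(operation: str, same_as_source: bool) -> None:
    """Raise ValueError if the operation does not fit the target choice."""
    allowed = allowed_operations(same_as_source)
    if operation.lower() not in allowed:
        raise ValueError(
            f"{operation!r} is not available here; choose one of {allowed}"
        )
```

For example, check_operation("upsert", same_as_source=False) returns without error, while check_operation("insert", same_as_source=True) raises ValueError.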
Step 4. Configure the Data Subset
To configure a subset operation, use the Data Filters and Relationship Behavior options on the Data Subset page.
1. On the Data Subset page, click New to create a data filter.
- To create a simple data filter, select an object and an operator. Enter the value you want to use, and click OK.
- To create an advanced data filter, click Advanced. Select an object. Select the field and create the filter expression. Click OK.
If you created simple data filters for the same object, creating an advanced filter merges the simple data filters for the object with the advanced filter. For more information about configuring advanced data filters, see Advanced Data Filters.
You can apply filters to a single object within a task.
Note: Fields that you cannot update do not appear in the filter. You cannot apply filters on the following field types: TEXTAREA (RICH) and TEXT ENCRYPTED.
2. To configure child relationships, click Configure.
The Configure Relationship Behavior dialog box appears.
3. Enable the child records that you want to include in the subset.
4. Click Save.
You can view and download the schema graph in the DOT format to view the relationships between the objects.
5. To estimate the target subset size, click Save and Compute Subset.
You can view the detailed results such as source objects, total number of target rows, total number of subset rows, and subset size.
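The two filter restrictions in this step, one filtered object per task and no filters on certain field types, can be checked on a planned set of filters before you enter them. The following is a minimal sketch: the SimpleFilter class, its attribute names, and the function name are hypothetical and do not correspond to any product API.

```python
from __future__ import annotations

from dataclasses import dataclass

# Field types that the note in this step says cannot be filtered.
UNFILTERABLE_TYPES = {"TEXTAREA (RICH)", "TEXT ENCRYPTED"}


@dataclass(frozen=True)
class SimpleFilter:
    """A planned simple data filter (hypothetical representation)."""
    object_name: str
    field_name: str
    field_type: str
    operator: str
    value: str


def filter_problems(filters: list[SimpleFilter]) -> list[str]:
    """Return a description of every restriction that the planned filters break."""
    problems = []
    objects = {f.object_name for f in filters}
    if len(objects) > 1:
        # Filters apply to a single object within a task.
        problems.append(f"filters span several objects: {sorted(objects)}")
    for f in filters:
        if f.field_type.upper() in UNFILTERABLE_TYPES:
            problems.append(
                f"{f.object_name}.{f.field_name}: type {f.field_type} cannot be filtered"
            )
    return problems
```

An empty list means that the planned filters respect both restrictions.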
Step 5. Define Data Masking Rules
On the Data Masking page, choose the object and select masking rules to assign to each field in the target.
1. On the Data Masking page, select a source object to view the fields.
The task lists the common fields and the missing mandatory fields.
2. To view information about a field in the source object, click Status.
The field data type determines the masking rules that you can apply to it. Fields that you cannot mask do not display a masking rule list.
3. To assign a rule to a common field, select the rule in the Masking Rule list.
If the rule you select requires additional parameters, a Configure button appears next to the rule.
4. To configure the masking rule properties, click Configure.
Each masking rule can have different properties.
5. Configure the masking rule properties and click Save.
When you select a mapplet rule, you must configure input and output fields of the mapplet.
6. To assign the default masking rules to the fields, click Default Assignment. To clear the masking rules assignment, click Clear Assignment.
7. To view and configure expressions for the mandatory fields that are missing in the source, click Missing Mandatory Fields.
8. In the Actions column, click Configure Expression and enter an expression in the expression builder. Click OK.
9. After you configure masking rules for all fields, click Next.
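Because the field data type determines which masking rules are available, it can help to review a planned rule assignment before you click through each field. The following is a minimal sketch: the rule names and data types in RULES_BY_TYPE are hypothetical placeholders, not the product's rule list, and the function names are illustrative only.

```python
from __future__ import annotations

# Hypothetical catalog of rule names per field data type (not the product's list).
RULES_BY_TYPE: dict[str, set[str]] = {
    "string": {"Substitution", "Random"},
    "email": {"Email"},
    "date": {"Random"},
}


def invalid_assignments(field_types: dict[str, str], assignment: dict[str, str]) -> list[str]:
    """Return the fields whose assigned rule is not offered for the field's data type."""
    bad = []
    for field, rule in assignment.items():
        offered = RULES_BY_TYPE.get(field_types.get(field, ""), set())
        if rule not in offered:
            bad.append(field)
    return sorted(bad)


def mandatory_without_expression(missing_mandatory: set[str], expressions: dict[str, str]) -> list[str]:
    """Return the missing mandatory fields that still have no non-empty expression."""
    return sorted(f for f in missing_mandatory if not expressions.get(f, "").strip())
```

Both functions return an empty list when the plan is complete: every assigned rule fits its field type, and every missing mandatory field has an expression.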
Step 6. Schedule the Data Masking Task
Configure when to run the Data Masking task from the Schedule page.
You can run the Data Masking task manually or schedule it. You can schedule the Data Masking task to run at a specific time or at specified time intervals.
Running the Data Masking Task Immediately
You can run a Data Masking task without scheduling it.
1. Click Task Wizards > Data Masking.
The Data Masking Tasks page appears with the list of tasks that you created.
2. Click the Run Now icon for the task you want to run.
Scheduling the Data Masking Task
Configure when to run the Data Masking task from the Schedule page.
1. On the Schedule page, choose whether to run the Data Masking task on a schedule or to run it manually without a schedule.
2. Select the runtime environment that contains the Secure Agent that runs the task.
Note: You cannot use a cloud runtime to run a Data Masking task.
3. If you selected multiple source objects, configure the data subset options. Select a staging connection that runs the data subset operation. Configure the staging connection and start the staging database service.
4. Optionally, select Drop Staging Tables to drop the staging tables even if there are error rows in the task.
5. Select an email notification option.
6. Optionally, configure the advanced options.
7. Optionally, configure the advanced Salesforce options if the API version is 32.0 or later.
8. Click Save and Close.
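The scheduling steps carry two constraints that are easy to miss: a cloud runtime cannot run the task, and a task with multiple source objects needs a staging connection. The following is a minimal sketch that checks both. The ScheduleOptions class and its attribute names are hypothetical and do not represent a product API.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Optional


@dataclass
class ScheduleOptions:
    """Planned Schedule page settings (hypothetical representation)."""
    runtime_is_cloud: bool
    multiple_source_objects: bool
    staging_connection: Optional[str] = None
    drop_staging_tables: bool = False


def schedule_problems(opts: ScheduleOptions) -> list[str]:
    """Return a description of every scheduling constraint that the options break."""
    problems = []
    if opts.runtime_is_cloud:
        # The runtime environment must contain a Secure Agent.
        problems.append("a cloud runtime cannot run a Data Masking task")
    if opts.multiple_source_objects and not opts.staging_connection:
        # The data subset operation runs on a staging connection.
        problems.append("multiple source objects need a staging connection")
    return problems
```

An empty list means that the planned settings satisfy both constraints.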