Tasks > Masking tasks > Configuring a masking task
  

Configuring a masking task

Configure a masking task to apply data masking to a source and to create a data subset.
To configure a masking task, perform the following steps:
  1. 1Define the masking task.
  2. 2Configure the source.
  3. 3Configure the target.
  4. 4Configure the data subset.
  5. 5Define data masking rules.
  6. 6Schedule the masking task.
When you configure a masking task, you can save your work after you enter all required properties. You can choose one of the following options:

Prerequisites

Before you configure a masking task, perform the following tasks:

Step 1. Define the masking task

Create the masking task.
    1Open Data Integration and click New to open the New Asset window.
    2Select Tasks > Masking Task.
    3Click Create.
    The new task window opens on the Definition page.
    4Enter a masking task name and an optional description.
    5Click Browse and select a project or folder location to store the task.
    6Click Next.
    The Source page opens.

Step 2. Configure the source

To configure a source, select a connection on the Source page. You can edit a connection that you select. Alternatively, you can create another connection.
    1On the Source page, select a connection from the list of connections.
    The source must have a primary key.
    2Optional. Choose to edit the connection or create another connection.
    3Choose a single source object or multiple source objects.
    A list of source objects appears.
    Note: Objects that you cannot update do not appear in the list of source objects. For example, you cannot update objects of type isSfIdLookup(), isCreateable(), isupdateable(), or isreferenced(). You cannot update objects with cyclic relationships. For example, you cannot update the User, Profile, Community, and Idea objects.
    4If you choose multiple source objects, perform the following steps:
    1. aClick Add.
    2. The Select Source Object dialog box appears with a list of objects.
    3. bSelect a source object and click Select.
    4. cSelect the added object and click Add.
    5. The Select Related Objects dialog box appears.
    6. dSelect the related child, parent, or self-reference objects that you want to include in the source objects. The objects move to the list of selected objects.
    7. eTo add the related objects, click Select.
    8. You can view and download the schema graph in the DOT format to view the relationships between the related objects.
    5Click Next.
    The Target page opens.

Step 3. Configure the target

To configure a target, select a connection on the Target page. The target connection type must be the same as the source connection type.
    1On the Target page, you can choose to perform inplace masking or save the edited data to a different location.
    The target object or objects are the same as the source.
    2Optional. You can choose to edit the connection or create another connection.
    3From the Task Operation list, select the operation that you want to perform.
    On a partial sandbox, you can perform update, upsert, and insert operations. When you run an inplace masking task, you can perform an update operation. If the target is different from the source, you can perform insert and upsert operations.
    4Select a target field that can link to target records. Select an existing external ID, custom field, or unique field from the list, or create another target field. To create another target field, click Create. To add the target field, click Create.
    You can view the target field details and any errors or warnings. You can save the external IDs to perform another upsert operation.
    5If you change the source object or connection properties, click Refresh Fields.
    6If you want to validate the source and target fields, click Validation Reports.
    7Click Next.
    The Data Filters page opens.

Step 4. Configure the data subset

To configure a subset operation, use the Data Filters and Relationship Behavior options on the Data Filters page. Skip this step if you do not want to create a data subset.
    1On the Data Filters page, click New to create a data filter.
    You can apply filters to a single object within a task.
    Note: Fields that you cannot update do not appear in the filter. You cannot apply filters on the following field types: TEXTAREA (RICH), TEXTAREA (RICH), and TEXT ENCRYPTED.
    2To configure child relationships, click Configure.
    The Configure Relationship Behavior dialog box appears.
    3Enable the child records that you want to include in the subset.
    4Click Save.
    Join Operation displays the number of join operations that are required to create the subset. You can view and download the schema graph in the DOT format to view the relationships between the objects.
    5Optional. Click View to view the sequence in which a task with multiple objects selects objects to create the data subset.
    6Click Next.
    The Masking page opens.

Step 5. Define data masking rules

On the Masking page, choose the object and select masking rules to assign to each field in the target.
    1On the Masking page, select a source object to view the fields.
    The task lists the common fields and the missing mandatory fields.
    2To view information about a field in the source object, click Status.
    The field data type determines the masking rules that you can apply to it. Fields that you cannot mask do not display a masking rule list.
    3To use a dictionary in a masking rule or to configure unique substitution masking, click Configure Connections and add the relational or flat file dictionary or storage connection to the task.
    The Is Unique option does not appear if the task does not include a storage connection. You cannot choose a dictionary in a rule unless you add it to the masking task.
    4To assign a rule to a common field, select the rule in the Masking Rule list.
    If the rule you select requires additional parameters, a Configure button appears next to the rule.
    5To configure the masking rule properties, click Configure.
    Each masking rule can have different properties.
    6Configure the masking rule properties and click Save.
    When you select a mapplet rule, you must configure input and output fields of the mapplet.
    7To assign the default masking rules to the fields, click Default Assignment. To clear the masking rules assignment, click Clear Assignment.
    8To view and configure an expression for the mandatory fields that are missing in the source, click Missing Mandatory Fields.
    9In the Actions column, click Configure Expression and enter an expression in the expression builder. Click OK.
    10After you configure masking rules for all fields, click Next.
    The Schedule page opens.

Step 6. Schedule the masking task

Configure when to run the masking task from the Schedule page.
You can run the masking task manually or schedule it. You can schedule the masking task to run at a specific time or at specified time intervals.

Running the masking task immediately

You can run a masking task without scheduling it.
    1Click Explore to open the Explore page.
    2Choose to browse by projects or assets from the Explore list.
    Select Assets to view a list of all assets. Select Projects to view a list of all projects. You can then select a project to view the assets in the project.
    3You can run a task manually in one of the following ways:
    You can view the progress and status of the job on the My Jobs page. You can also view and manage jobs from the All Jobs or Running Jobs page in Monitor.

Scheduling the masking task

Configure when to run the masking task from the Schedule page.
    1On the Schedule page, choose whether to run the masking task on a schedule or run manually without any schedule.
    2If you choose to run the task on a schedule, choose a schedule from the list or click New to create a new schedule.
    3To run the masking task, select the runtime environment that contains the Secure Agent to run the task.
    Note: You cannot use a cloud runtime environment to run a masking task.
    4Optionally, if the task runs in a serverless runtime environment, configure serverless usage properties.
    5If you selected multiple source objects, configure the data subset options. Select a staging connection that runs the data subset operation. Configure the staging connection and start the staging database service.
    6Optionally, select Drop Staging Tables to drop the staging tables even if there are error rows in the task.
    7Select an email notification option.
    8Optionally, configure the advanced options.
    9Optionally, configure the advanced Salesforce options if the API version is 32.0 and higher.
    10Click Save to save and keep the task open, or click Finish to save and close the task.