When you add a step to a labeler asset, Data Quality prompts you to define a step with a character set or with a dictionary.
The following image shows the options that you configure when you define a step with a dictionary:
The dictionary options include the following properties:
1Labeler mode.
Indicates the type of labeling operations to perform on the input data.
2Add Step option.
Adds a step to the asset. A step describes a labeling operation that a mapping can apply to an input data field.
3Up and Down options.
Moves a step that you select up or down within the step sequence.
4Step sequence.
Defines the order in which a mapping applies each step to the input field at run time. The mapping performs labeling operations in the order that you specify.
5Options name.
Identifies the dictionary that you add to the step.
6Test input field.
Contains the input data that the Secure Agent uses to test the steps in the labeler sequence.
7Test output fields.
Contain the result of the test. The test output field contains a copy of the input field data in which any dictionary value is replaced by the label that you specify.
8Import file option.
Imports data to the test panel.
9Step type.
Identifies the type of step to which the properties apply.
10Dictionary name.
Identifies the dictionary that the step applies to the input data. You select the dictionary.
11Label name.
Specifies the label that the step applies to the characters in an input string. You enter a single character as the label.
The labeling operation returns the label for each character in a matching dictionary value.
12Casing option.
Specifies whether the step considers the character case of the input values that it compares to the dictionary values. For example, the character case may be relevant if you plan to label person names from an input string, as person names begin with an uppercase letter.
Clear the option if the character case is not relevant to the type of data that you plan to label from the inputs.
13Override any later step in the sequence option.
Determines whether the current step output overrides the output from later steps in the sequence.
14Ignore Text option.
Adds a character to ignore during a labeling operation. You can add one or more characters. The labeling operation does not assign the label to any character that matches a character in the dictionary.
The Ignore Test option includes the following properties:
- Search term. Specifies the characters to ignore when you perform a labeling operation.
- Case sensitive. Determines whether the input characters must match the case of the search characters.
- Uppercase. Converts the characters in the input string that match the search term to uppercase.
- Start position. Specifies the character position in the input string at which the asset starts to analyze the characters.
- End position. Specifies the character position in the input string at which the asset ends the analysis.