You use a regular expression-style syntax to define the format of the input values and output tokens that Data Governance and Catalog processes when you use the tokenize data de-identification technique. Read this chapter to understand the syntax and see examples of how to use it.
Note:
Since you only use the tokenization expression to define and match the format of a string, and not for extracting portions of it, the tokenization syntax that you use does not support all of the constructs available in other regular expression syntaxes.
The following are examples of what you cannot use in tokenization expressions:
•Anchors
•Backreferences
•Capture groups
•Limited support for repeated elements
•These shorthand character classes: /d, /D, /h, /H, /s, /S, /v, /V, /w, /W, and .
Follow this guide when constructing your tokenization expression.