Using CLAIRE GPT > Data Quality > Cleanse sensitive credit card data for compliance
  

Cleanse sensitive credit card data for compliance

You’re a data steward at a retail bank that provides a range of financial services, including credit cards and loans. Your bank's credit card data is stored in a specific schema in a data warehouse. For regulatory compliance, you want to ensure that the tables containing sensitive credit card data are consistent and free from common data quality issues such as missing values, duplicates, and outliers.
To achieve this goal, you can take the following approach:
By following this approach, you can cleanse tables containing sensitive credit card data for regulatory compliance purposes.
For more information about data quality cleanse, see Data quality cleanse.
To achieve the goal, perform the following steps:
  1. 1Log in to CLAIRE GPT.
  2. 2Start a conversation to identify the tables that contain credit card details and get an overview of the table.
  3. Enter the following prompt:
    Find tables related to credit card details from Contoso schema and provide an overview of the tables.
    The following image shows the response displaying the table containing credit card details and a detailed overview of the table.
    The response displays a card named 'Tables related to credit card details in Contoso schema'. The CREDIT_CARD_DETAILS table overview shows general information, including name, business description, resource type, owner, number of rows, asset lifecycle, profiled status, created by, and last modified details, along with business context and key columns.
    To view the details of the table, click the card named Tables related to credit card details in Contoso schema.
    You can expand the reasoning to view the analysis process of the discovery agent.
    The following image shows the canvas with a tabular view of the tables containing credit card details: The response displays an open canvas with one asset identified as a table related to credit card details in Contoso schema. The open canvas also shows columns for Description, Created On, Created By, and Key Insights.
    You can click a table name to open it in Data Governance and Catalog and see further details.
  4. 3Assess whether this table contains sensitive data and identify the columns that contain sensitive data.
  5. Enter the following prompt:
    Show me the possible sensitive columns in this table
    The agent identifies the sensitive columns based on the column names and its association with the glossary terms.
    The following image shows the response displaying the sensitive columns in the table: The response displays a list of sensitive table columns such as Card_Holder's_Name, Card_Number, CVV_CVV2, Card_PIN, Expiry_Date, and Issue_Date, along with their descriptions. The response also displays the CREDIT_CARD_DETAILS Asset Overview card.
  6. 4Get recommendations with a detailed plan to cleanse the data in the CREDIT_CARD_DETAILS table.
  7. Enter the following prompt:
    Clean CREDIT_CARD_DETAILS
    The following image shows the response displaying a detailed plan containing data quality cleanse rules for the CREDIT_CARD_DETAILS table: The response displays the cleansing plan for the CREDIT_CARD_DETAILS table. It shows nine sequential steps, including Fetch and Enrich Metadata Attributes from Catalog, Identify Critical Data Elements, Metadata Quality Issues, Statistical Metrics, Generate Python Cleaning Code, Compile Report, and Generate a Mapplet Using the Cleaning Code.
    CLAIRE GPT lists detailed steps to identify critical data elements, fix data quality issues, and generate a reusable mapplet for repeatable data cleansing.
  8. 5If you are happy with the recommended cleansing plan, click Yes, proceed with the plan.
  9. The data quality agent performs the following tasks and creates a card for each of the tasks after you proceed with the cleansing plan:
    1. aIdentifies critical columns in the CREDIT_CARD_DETAILS table.
    2. bClusters columns that are functionally dependent on each other.
    3. cFixes data quality issues such as missing values and duplicates.
    4. dGenerates a sample of the cleansed data.
    5. eGenerates a reusable mapplet.
    6. fGenerates a report for the cleansed data.
    You can click each card to open the canvas and view the exact details of each task.
    The following image shows a sample of the cleansed data: The response displays a tabular view of the clean data sample for the CREDIT_CARD_DETAILS table showing columns for Rule Name, Reason for Fixes, Card_Holder's_Name, Credit_Limit, Card_Number, and Card_Type_Code with rows displaying corresponding data values and cleansing rule details.
You were able to quickly identify tables containing sensitive credit card details, get data quality cleanse rules recommendation, and apply the cleansing plan to the table to remediate data quality issues using simple natural language prompts.