Creating and Running Scorecards Overview
A scorecard is a graphical representation of the valid values for a column, or for the output of a rule, in profile results. Use scorecards to measure and monitor data quality progress over time.
To create a scorecard, you add columns from the profile to a scorecard as metrics, assign weights to metrics, and configure the score thresholds. To run a scorecard, you select the valid values for the metric and run the scorecard to see the scores for the metrics.
Scorecards display the value frequency for columns in a profile as scores. Scores reflect the percentage of valid values for a metric.
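As a rough illustration of how a score relates to valid values, the following Python sketch computes the percentage of valid values in a column. It is not how the tool is implemented. The function name `metric_score`, the sample data, and the set of valid values are all hypothetical, and the tool calculates scores for you when you run a scorecard.

```python
from collections import Counter


def metric_score(values, valid_values):
    """Return the percentage of entries in `values` that appear in `valid_values`."""
    if not values:
        return 0.0
    # Value frequency: how many times each distinct value occurs in the column.
    frequency = Counter(values)
    valid_count = sum(
        count for value, count in frequency.items() if value in valid_values
    )
    return 100.0 * valid_count / len(values)


# Hypothetical sample data for a CustomerTier column (not taken from the tutorial data).
customer_tier = ["Diamond", "Gold", "Silver", "Bronze", "Gld", "Gold", "", "Silver"]
# Hypothetical set of values that the analyst marks as valid.
valid_tiers = {"Diamond", "Gold", "Silver", "Bronze"}

score = metric_score(customer_tier, valid_tiers)
print(f"CustomerTier score: {score:.1f}%")
```

In this sample, six of the eight values are valid, so the sketch reports a score of 75.0%. Changing the set of valid values changes the score, which is the effect of editing the valid values for a metric.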
Story
HypoStores wants to incorporate data from the newly acquired Los Angeles office into its data warehouse. Before merging the data, the company wants to make sure that the data in the different customer tiers and states is analyzed for data quality. You are the analyst responsible for monitoring the progress of the data quality analysis. You want to create a scorecard from the customer tier and state profile columns, configure thresholds for data quality, and view the score trend charts to determine how the scores improve over time.
Objectives
In this lesson, you will complete the following tasks:
1. Create a scorecard from the results of the Profile_LA_Customers_Custom profile to view the scores for the CustomerTier and State columns.
2. Run the scorecard to generate the scores for the CustomerTier and State columns.
3. View the scorecard to see the scores for each column.
4. Edit the scorecard to specify different valid values for the scores.
5. Configure score thresholds and run the scorecard.
6. View score trend charts to determine how scores improve over time.
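The ideas behind tasks 5 and 6 can be pictured with a short Python sketch that classifies a score against thresholds and checks whether scores rise across successive runs. The threshold values, the range labels, the function names, and the run history are assumptions made for illustration. They do not describe the product's defaults or its internal logic.

```python
def classify(score, good_at=90.0, acceptable_at=50.0):
    """Map a score (0-100) to a hypothetical quality range using two thresholds."""
    if score >= good_at:
        return "Good"
    if score >= acceptable_at:
        return "Acceptable"
    return "Unacceptable"


def is_improving(history):
    """Return True if scores never drop between runs and the last run beats the first."""
    if len(history) < 2:
        return False
    never_drops = all(
        later >= earlier for earlier, later in zip(history, history[1:])
    )
    return never_drops and history[-1] > history[0]


# Hypothetical scores for one metric from three successive scorecard runs.
state_history = [62.5, 75.0, 91.0]

for run_number, score in enumerate(state_history, start=1):
    print(f"Run {run_number}: {score:.1f}% ({classify(score)})")

print("Improving over time:", is_improving(state_history))
```

A score trend chart conveys the same information visually: each run adds a point, and the configured thresholds determine the range into which each point falls.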
Prerequisites
Before you start this lesson, verify the following prerequisite:
- You have completed lessons 1 through 5 in this tutorial.
Timing
Set aside 15 minutes to complete the tasks in this lesson.