Metrics
A metric is a column of a data source or output of a rule that is part of a scorecard. When you create a scorecard, you can assign a weight to each metric. Create a metric group to categorize related metrics in a scorecard into a set.
Metric Weights
When you create a scorecard, you can assign a weight to each metric. The default value for a weight is 0.
When you run a scorecard, the Analyst tool calculates the weighted average for each metric group based on the metric score and weight you assign to each metric.
For example, you assign a weight of W1 to metric M1, and you assign a weight of W2 to metric M2. The Analyst tool uses the following formula to calculate the weighted average:
(M1 X W1 + M2 X W2) / (W1 + W2)
Value of Data Quality
A measure of data quality in the source data is critical information in the management of the data assets in the organization. The cost of invalid data in metrics represented in a scorecard helps organizations derive value in monitoring data quality of the source data. As a data analyst, you might want to associate a value, such as a currency unit or any custom unit, to metrics and metric groups. You can then run the scorecard to view the total cost of invalid data in the source data.
You can define the cost unit for a metric based on the business needs. You can also configure a variable or fixed cost for each metric when you create a scorecard or edit it.
Fixed Cost
Fixed cost is a constant value that you can assign to a metric in a scorecard. You can choose a predefined cost unit or create a custom cost unit that meets the business needs.
Variable Cost
Variable cost is a value that you assign to a metric based on the values in a numeric column of a data source. The Data Integration Service calculates the variable cost for the metric based on the column or virtual column that you assign to the cost.
Example
As a mortgage loan officer, you need to provide your customers with payment books so that the customers can submit the mortgage payments. You can use a scorecard to measure the accuracy of your customer addresses to ensure the delivery of the payment books. You might want to set the variable cost to the Monthly Payment Amount column for the Address Accuracy metric. Run the scorecard to compute the total cost that the mortgage organization loses if customers did not pay the monthly amount on time.
Defining Thresholds
You can set thresholds for each score in a scorecard. A threshold specifies the range in percentage of bad data that is acceptable for columns in a record. You can set thresholds for good, acceptable, or unacceptable ranges of data. You can define thresholds for each column when you add columns to a scorecard, or when you edit a scorecard.
Complete one of the following prerequisite tasks before you define thresholds for columns in a scorecard:
- •Open a profile and add columns from the profile to the scorecard in the Add to Scorecard dialog box.
- •Optionally, click a scorecard in the Library workspace and select Actions > Edit to edit the scorecard in the Edit Scorecard dialog box.
1. In the Add to Scorecard dialog box or the Edit Scorecard dialog box, select each metric in the Metrics pane.
2. In the Metric Thresholds pane, enter the thresholds that represent the upper bound of the unacceptable range and the lower bound of the good range.
You can set thresholds for up to two decimal places.
3. Click Next or Save.