Column Profile Concepts Overview
A column profile determines the characteristics of columns in a data source, such as value frequency, percentages, and patterns.
Column profiling discovers the following facts about data:
- The number of unique and null values in each column, expressed as a number and a percentage.
- The patterns of data in each column and the frequencies with which these patterns occur.
- Statistics about the column values, such as the maximum and minimum lengths of values and the first and last values in each column.
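The facts listed above can be illustrated with a minimal Python sketch. It is not the profiling engine itself, only an approximation under stated assumptions: `profile_column` and `value_pattern` are hypothetical helper names, "unique" is counted as distinct non-null values, a pattern is formed by mapping digits to `9` and letters to `X`, and the first and last values are taken in row order.

```python
import re
from collections import Counter


def value_pattern(value):
    """Hypothetical pattern notation: digits become 9, letters become X."""
    text = re.sub(r"[0-9]", "9", str(value))
    return re.sub(r"[A-Za-z]", "X", text)


def profile_column(values):
    """Return basic profile facts for one column given as a list (None = null)."""
    total = len(values)
    non_null = [v for v in values if v is not None]
    null_count = total - len(non_null)
    unique_count = len(set(non_null))  # assumption: unique = distinct non-null values
    lengths = [len(str(v)) for v in non_null]

    def pct(count):
        # Percentage of all rows, rounded to two decimal places.
        return round(100.0 * count / total, 2) if total else 0.0

    return {
        "row_count": total,
        "null_count": null_count,
        "null_pct": pct(null_count),
        "unique_count": unique_count,
        "unique_pct": pct(unique_count),
        "value_frequency": Counter(non_null),
        "pattern_frequency": Counter(value_pattern(v) for v in non_null),
        "min_length": min(lengths) if lengths else None,
        "max_length": max(lengths) if lengths else None,
        "first_value": non_null[0] if non_null else None,  # assumption: row order
        "last_value": non_null[-1] if non_null else None,
    }


if __name__ == "__main__":
    sample = ["A1", "B2", None, "A1", "abc"]
    for name, fact in profile_column(sample).items():
        print(name, fact)
```

A real profile would typically compute these facts for every selected column, and possibly on a sample of the rows rather than the full data source.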
When you create a profile, use column profile options to select the columns to profile, set data sampling options, and set drill-down options.
A rule is business logic that defines conditions applied to source data when you run a profile. You can add a rule to the profile to validate data.
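As a rough illustration of a rule, the sketch below treats it as a condition evaluated against each source value, which splits the column into valid and invalid values. The names `apply_rule` and `is_five_digit_code` are hypothetical, and the five-digit check is an invented example condition, not a rule supplied by the product.

```python
import re


def is_five_digit_code(value):
    """Example condition (assumed): the value consists of exactly five digits."""
    return value is not None and re.fullmatch(r"[0-9]{5}", str(value)) is not None


def apply_rule(values, rule):
    """Apply a rule to every value and return (valid, invalid) lists."""
    valid = [v for v in values if rule(v)]
    invalid = [v for v in values if not rule(v)]
    return valid, invalid


if __name__ == "__main__":
    codes = ["12345", "1234", None, "98765"]
    valid, invalid = apply_rule(codes, is_five_digit_code)
    print("valid:", valid)
    print("invalid:", invalid)
```

Modeling the rule as a plain function keeps the condition separate from the data, so the same rule can be applied to any column that should satisfy it.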
Create scorecards to periodically review data quality. Create a scorecard before and after you apply rules to a profile so that you can compare graphical representations of the valid values in each column.