Tabular Assets
You can view tabular assets such as relational tables, CSV files, and dimension tables in Enterprise Data Catalog. You can view details about tabular assets, check the data quality of tabular assets, rate and review tabular assets, and ask questions about tabular assets in the Overview tab.
A tabular asset is any asset that contains data that you can view in a tabular format.
The following types of assets are examples of tabular assets:
- •Business intelligence dimension, logical, and presentation tables
- •CSV, JSON, and XML files
- •Hive tables
- •Relational tables, views, and synonyms
- •Salesforce objects
The Overview tab shows different details for different types of tabular assets. For tabular assets in resources that support profiling, the Overview tab shows the system attributes, custom attributes, composite data domains, and profile results that include column names, business terms, data types, and data domains. For other tabular assets, the Overview view shows the system attributes, custom attributes, and links.
If the columns or fields in a tabular asset have profile results, you can check the quality of the column or field data from the Overview view.
The following image shows the sample Overview tab for a relational table:
- 1. Add a business title to the table.
- 2. Path to the table.
- 3. Follow the table.
Tabular Asset Overview
The Overview tab for a tabular asset displays the details, such as the description, people associated with the asset, related business terms, composite data domains, system attributes, and column or link details.
Enterprise Data Catalog displays the following details for a tabular asset:
- Description
- The Description section displays the tabular asset description. Use the Edit Properties icon () to edit or add description for the asset. You can enter rich text as the description for the asset. Use the text formatting options to format the rich text. To add more descriptive context to the asset, you can also add multiple sections under the Description section. For example, you create and configure a rich text type custom attribute called Additional Description for the tabular asset in the Catalog Administrator. Enterprise Data Catalog displays the custom attribute as an another section under the Description section in the Overview tab.
- Sample Columns
The Overview tab displays the sample columns for tabular assets, such as view columns, table columns, CSV fields, XML file fields, and JSON file fields, other semi-structured sources, and for sources that you can run the column profile on.
The Sample Columns section displays the following details for the sample columns:
- - Name. Displays the column or field name. To view the column or field asset details, click the column name. The column asset details appear in the Overview tab.
- - Business title. Displays the business title associated with the column. Hover the mouse over the business title to view the name and confidence score. You can accept or reject the business title.
- - Data Domains. Displays the data domains associated with the column. Hover the mouse over a data domain to view the Name Match, Data Match, Proximity Match, System Suggested, and Assigned by details. The inferred data domains appear in orange color and the curated data domains appear in green color.
- - Show All. To view all the columns in the asset in the Columns tab, click the Show All or click Columns.
- Reviews
- The Reviews section displays the reviews and ratings for a tabular asset. By default, the section displays a text box and five star rating scale next to the text box. To review the tabular asset, enter the review in the text box and rate the tabular asset, and then click Submit.
- The section displays the review that you provided at the top of the list of the existing reviews. You can mark any review as helpful if you find the review useful.
- The following image displays the Reviews section in the Overview tab:
- The section displays a maximum of five latest reviews. To view all the reviews, click Show All. The Reviews tab displays the complete list of reviews about the tabular asset, average ratings of the asset, and total user ratings on the tabular asset.
The following image displays the Reviews tab for a tabular asset:
- In the Reviews tab, you can review the tabular asset and filter the reviews based on the ratings.
- You can also sort the list based on the following options:
- - Recent Reviews. Sorts the reviews list based on the latest reviews.
- - Voted Most Helpful. Sorts the reviews list based on reviews that are voted as most helpful.
- - Ratings. Sorts the reviews list based on reviews with highest ratings.
- For more information about reviews, see Rate and Review Asset.
- Questions
- The Questions section displays the questions that are asked about the tabular asset. To ask a question, enter the question in the text box, and then click Submit. The section displays the question that you ask at the top of the existing questions. The questions include the answer and delete options. A user who has knowledge about the tabular asset can answer the questions asked. You can also mark an answer as helpful if you find the answer useful.
The following image displays the Questions section in the Overview tab:
- The section displays a maximum of five latest questions that are asked. To view all the questions, click Show All. The Questions tab displays the complete list of questions asked about the tabular asset.
The following image displays the Questions tab for a tabular asset:
- You can filter the questions based on the following options:
- - My Questions. Displays all the questions that you asked about the tabular asset.
- - Answered Questions. Displays all the questions that have answers.
- - Unanswered Questions. Displays all the questions without answers.
- For more information about questions, see Asset Questions.
- People
- The People section displays the users associated with the tabular asset. The section by default displays the data owner and data steward fields. The section also displays the followers of the tabular asset. Use the Edit Properties icon () to add or change the users associated with the glossary. The additional user fields that you see in the section appear after the catalog administrator creates and configures a custom attribute of User data type for a tabular asset in the Catalog Administrator.
- Custom Attributes
- The properties that the Custom Attributes section displays depends upon the different types of custom attributes the catalog administrator creates and configures for a tabular asset in the Catalog Administrator. The section displays custom attributes of data types, such as the date, string, boolean, decimal, integer, and URL. Use the Edit Properties icon () to add or change values of the properties in the section.
- System Attributes
The Overview tab displays the System Attributes section for tabular assets. The system attributes that are shown by default vary based on the parent resource type. For example, the default system attributes for a relational table include comments, data source UUID, native type, and resource name.
You can change the system attributes that are displayed in the Overview tab when you configure search filters in the Application Configuration dialog box.
- Composite Data Domains
- The Overview tab displays the Composite Data Domains section if tabular assets in resources have associated composite data domains. The section displays all the composite data domains that are inferred on the tabular asset. To view the columns that have data domains associated with the composite data domain, click the filter icon next to the composite data domain name. If the composite data domain is inferred on multiple tabular assets, you can hover over the composite data domain name link and click View Instances to view other tables on which the composite data domain is inferred. You can also click the composite data domain name link to display the Overview tab for the composite data domain.
The following image shows the Composite Data Domain section in the Overview tab of a tabular asset:
- Business Terms
- The Related Business Terms section displays custom attributes of data types, such as BusinessTerm, Glossary, AxonGlossary. The section displays related terms when the catalog administrator creates and configures the custom attribute for the tabular asset in the Catalog Administrator. You can associate multiple business terms as custom attribute values with the asset. Use the Edit Properties () to add, remove, or change the business term values.
- Business Classifications
- The Business Classification section displays all the asset classifications based on custom attributes of reference data types. The custom attributes of a reference data type depend on how the custom attributes are defined in Informatica Business Glossary or Informatica Axon. For example, if you have a table called Sales Details and you want to classify the table based on a region. To classify the table, you can assign a custom attribute of reference data type called APAC to the table. Use the Edit Properties () to add, remove, or change the business classification values.
- Links
- The Overview tab displays the Links section for tabular assets in resources that do not support profiling. The Links section displays the child assets of the tabular asset. For example, if you view the Overview for a pivot table, the Links section displays the axes in the table. If you view a presentation table, the Links section displays the presentation columns and derived presentation columns.
Viewing Column Details
The Columns tab displays all the columns for tabular assets, such as view columns, table columns, synonym columns, public synonym columns, and for sources that you can run the column profile on.
On the Columns tab, you can view the following details for each column in the tabular asset:
- Name
- Displays column names. By default, the columns list appears in the ascending order based on the column position in the data source. To sort columns in ascending or descending order based on the source position, click the arrow () icon. To sort columns in alphabetical order by name, click the Name heading on the Columns tab. To sort columns in reverse alphabetical order, click the column heading again. To view the column asset details on the Overview tab, click the column name link.
- Business Title
- Displays the business title associated with the column. You can also view the recommended business terms if you want to associate a business term as a business title with the column.
- Data domains
- Displays the data domains associated with the column. Hover the mouse over a data domain to view the Name Match, Data Match, Proximity Match, System Suggested, and Assigned by details. The inferred data domains appear in orange color and the curated data domains appear in green color. The Name Match displays Yes when the column name matches with the data domain. The Proximity Match displays a score that is a ratio of the number of proximal data domains discovered in the data asset to the number of configured proximal data domains for an inferred data domain.
- Null Distinct Non-Distinct %
- Displays the percentage of null, distinct, and non-distinct values in a column. Hover the mouse over the horizontal bar chart to view the number rows that contain null, distinct, and non-distinct values.
- Source Data Type Inferred Data Types
- Displays the documented data type and inferred data types of the column. Hover the mouse to view the list of documented data type and inferred data types with percentages.
The following image displays the sample Columns tab:
Viewing the Rules and Scorecards Information
The Data Quality tab displays the details of the rules and scorecards for a tabular asset. You can view the column results for a rule and scorecard to measure data quality of the data source.
The Rules tab displays the following details:
- •Name. Displays the rule name. Hover the mouse to view the description of the rule asset. You can view the value frequency for output columns for a particular rule. In the value frequency section, you can view the distribution of the values across the data asset, the maximum and minimum values in the data asset, the distribution of null, distinct, and non-distinct values in the data asset, and a list of values and their frequency in a tabular form. You can view the details of all the output columns associated with a rule. To view the rule asset details, click the rule name. The rule asset details appear in the Overview tab.
- •Null Distinct Non-distinct%: Displays the percentage of null, distinct, and non-distinct values in the column. You can view the number of rows that contain null, distinct, and non-distinct values.
- •Input Column. Displays the number of columns on which the rule is applied. You can view a list of the columns for a particular rule asset.
- •Value. Displays the minimum and maximum values in the rule output.
- •Length. Displays the minimum and maximum length of the values in the rule output.
- •Data Type. Displays the data type of the rule. Hover the mouse over a data type to view the details of the documented data type or inferred data type.
The following image displays the Rules section in the Data Quality tab:
The Scorecards tab displays the following details:
- •Name. Displays the scorecard name. To view the scorecard asset details, click the scorecard name. The scorecard asset details appear in the Overview tab.
- •Total Metric Groups. Displays the total number of metric groups in the scorecard.
- •Metric Groups. Displays the name of the metric groups. You can view the percentage of values for each metric group. You can apply filters to view good, acceptable, or unacceptable range of data for each metric group. You can also apply filters to choose metric groups associated with a scorecard asset. You can view scorecards in the ascending or descending order of date.
- •Tables Applied. Displays the tables associated with the scorecard.
The following image displays the Scorecard section in the Data Quality tab:
Checking the Data Quality of Tabular Assets
You can check the data quality of tabular assets in resources that support profiling such as relational tables, relational views, CSV files, JSON files, XML files, and Salesforce objects. If the assets have profile results, Enterprise Data Catalog shows the distribution of data in a separate Sample Columns view. To display the detailed Sample Columns view, click Show All in the Sample Column section.
The following image shows the sample of the Columns view for a relational table with profile results:
- 1. Bar charts that show the percentage of null values, distinct values, and non-distinct values in each field.
- 2. Colums view that displays profile results, data domains, business title, and column names.
The bar chart for each column or field shows the percentage of null values, distinct values, and non-distinct values in the column or field. To view the number of rows associated with each percentage, hover the cursor over the bar chart.
Viewing the Unique Key Details
A unique key is a column or combination of columns that uniquely identifies a row in a data source. Unique key discovery identifies the unique keys that meet a specific confidence score.
The Keys tab displays the details of the unique keys for assets such as tables and files.
The Unique Keys section displays the following details:
- •Key Name. Displays the name of the key.
- •Columns. Displays columns that are identified as the unique keys. To view the column details, click the column name. The column asset details appear in the Overview tab.
- •Description. Displays the key description.
- •Confidence Score. Displays the score assigned to the unique key. The confidence score is calculated by an internal algorithm based on the uniqueness of data in the columns, number of null values and the position of discovered columns in the source data object.
- •Type. Displays the type of unique key. Unique keys can be documented, inferred, accepted, or rejected. You can curate unique key results in the Enterprise Data Catalog. Hover the mouse over the key name to accept or reject the columns inferred as the unique key. The inferred type appears in orange color, accepted type appears in green color and rejected type appears in red color. You can click Undo Accept or Undo Reject to reset the unique key results to the inferred state. Hover over () to view the details of the person who accepted or rejected the inferred unique key results.
You can click show tab to view documented, inferred, accepted, rejected, or all types of unique key results. The show tab displays all types of unique key results by default. You can enter a key name or column name in the Find tab to search the unique key inference. You can discover maximum of five unique key each table in the source data object. You can search documented or inferred keys in the Enterprise Data Catalog.
Technical Preview Notice: Effective in version 10.4.0, Unique key feature is available for technical preview.