Based on your requirements, you can enable or disable the auto-acceptance of recommended glossary terms, set a threshold limit for auto-acceptance, ignore the prefix and suffix of a data element while making recommendations to associate business glossary terms, and use abbreviation and synonym definitions from a lookup table to associate glossary terms.
When you enable Glossary Association for a catalog source, you can configure the following options on the Glossary Association tab:
Enable auto-acceptance
Select this option to automatically accept recommended glossary terms and assign them as business names to data elements extracted from the source system. The associated glossary term is selected on the basis of the confidence score threshold that you specify. If you don't select this option, you can manually accept or reject recommended glossary terms in Data Governance and Catalog.
Confidence score threshold for auto-acceptance
Specify a percentage from 80 to 100 inclusive to set a threshold limit based on which the glossary association capability automatically accepts recommended glossary terms.
For example, consider a cust_nm column in an Oracle source system and a Customer Name glossary term. When you configure the Oracle catalog source, select the Enable auto-acceptance option for glossary association, set the confidence score threshold to 92%, and run the catalog source job. If the Customer Name glossary term matches the cust_nm column with a score of 92% or more, then the Glossary Association capability assigns the Customer Name business name to the cust_nm column in Data Governance and Catalog.
Enable below-threshold recommendations
If you enable auto-acceptance, you can select this option to receive glossary association recommendations below the auto-acceptance threshold. The recommendations are generated based on the confidence score threshold that you specify.
Confidence score threshold for recommendations
If you enable auto-acceptance, specify a percentage from 80 to the selected auto-acceptance threshold to set a confidence score limit based on which the glossary association capability makes recommendations. You can accept or reject the recommended glossary terms that fall within this range in Data Governance and Catalog.
If you disable auto-acceptance, specify a percentage from 80 to 100 inclusive to set a confidence score limit based on which the glossary association capability makes recommendations. Accept or reject the recommendations in Data Governance and Catalog.
Assign business names and descriptions
Select this option to automatically assign business names and descriptions to technical assets. You can then choose to retain existing assignments and only assign business names and descriptions to assets that don't have assignments, or allow overwrite of existing assignments.
By default, existing assignments are retained.
Ignore specified keywords
Specify prefix and suffix values that you want to ignore from data elements during glossary association. Prefix and suffix values are case insensitive. If you don't select this option, the Glossary Association capability considers entire data elements while making recommendations to associate business glossary terms with your technical assets.
Define the glossary association scope
Choose specific top-level business glossary assets to associate with technical assets. Selecting a top-level asset selects its child assets as well. Select Top-level Glossary Assets and specify the assets on the Select Assets page.
After you run the catalog source job, the following glossary associations are removed:
•All glossary associations that are not in scope based on the selected glossaries.
•All corresponding glossary associations if you deleted the top-level glossary assets that were previously defined in the glossary association scope.
Note: If the top-level glossary asset that you previously defined in the glossary association scope is no longer a top-level asset, the association is retained.
Use abbreviation and synonym definitions
Enable this option to use a lookup table with abbreviations and synonyms to improve glossary association accuracy. To upload a lookup synonym file with the abbreviations and synonyms, click Select.
When you use abbreviation and synonym definitions, the glossary association capability finds accurate matches based on the lookup table and increases the confidence score of the glossary terms. This enables CLAIRE to prioritize glossary terms that match your defined synonyms, reducing false positives and enhancing confidence score for automated associations.
For example, without the lookup table for glossary association, TRX_ID might match 'Transfer ID' or 'Training ID', and INV_DT could match both 'Inventory Date' and 'Invoice Date'. If you enable glossary association with a lookup table, the algorithm boosts the confidence score for the matches and assigns the names with the correct business glossary terms, such as 'Transaction ID' and 'Invoice Date'.