The November 2024 release of Data Governance and Catalog includes the following new features and enhancements.
Metadata access control
This release introduces metadata access control, which provides options for enhanced control over how users interact with the assets in Data Governance and Catalog and Data Marketplace. As an administrator, you can define access policies in Metadata Command Center that allow you to have greater control over:
•The assets that users can interact with.
•Individual aspects of an asset, providing different levels of access to different users.
•The assets for which users can assume stakeholder responsibilities.
Note: After you upgrade to the November 2024 version, Data Governance and Catalog and Data Marketplace will continue to work the same as the previous version. However, privileges and permissions for all assets and users in your organization that are defined in Administrator will be converted to access policies in Metadata Command Center. Administrators need to review the new access policies in Metadata Command Center when the upgrade is complete.
For information about metadata access control, see Access control in the Administration help.
Share a saved search
If you are the owner of a saved search, you can now share the search with several users, user groups, and user roles in your organization. If you have the manage access control privilege, you can also transfer ownership of a saved search to a different recipient if the existing owner is inactive.
When recipients receive the saved search, they can view or edit the search based on the permissions assigned to them by the search owner. When recipients receive a saved search in a dashboard widget, they only have the view privilege of the search.
The following image shows how you can share a saved search:
For more information about how to share a saved search, see the Working With Assets help in Data Governance and Catalog.
Share dashboards
If you are the owner of a dashboard, you can now share single or multiple dashboards with several users, user groups, and user roles in your organization. If you have manage access control privilege, you can also transfer ownership of a dashboard to a different recipient if the existing owner is inactive.
When recipients receive the dashboard, they can view or edit the dashboard based on the permissions assigned to them by the dashboard owner.
The following image shows how you can share a dashboard:
For more information about how to share a dashboard, see the Introduction and Getting Started help in Data Governance and Catalog.
Export assets from catalog source
You can now export all assets or specific assets directly from the catalog source page using the Export all or specific assets option.
You can refine and export your data using additional filters.
The following image shows how you can export assets from catalog source:
For more information about how to export technical assets from catalog source, see the Working With Assets help in Data Governance and Catalog.
Bulk export and import enhancements
This release includes the following enhancements to bulk export and import:
•When you export assets, the exported file contains a new column that displays the details of all the inherited and assigned stakeholders of the exported assets.
•You can export or import assets in the .csv format along with .xls format using the export and import APIs.
•When you create or update a data quality rule template using bulk import, you can now specify the primary and secondary glossary assets that apply to the rule template. The bulk import template and the exported file for data quality rule template contains two new fields called Primary Glossary and Secondary Glossary.
•When you update a data quality rule occurrence with new scores using bulk import, the data quality rule occurrence page displays the total runs along with the respective scores, total rows, failed rows, date, and timestamp.
•You can add and delete glossaries and data element classifications associated with a manual data element using the bulk import template for manual data elements.
•While creating a data quality rule occurrence using bulk import, the Technical Rule Reference field is mandatory if you select Informatica Cloud Data Quality as the measuring method. This field is not mandatory for other measuring methods.
Previously, the Technical Rule Reference field was optional for all measuring methods.
For more information about how you can bulk import assets, see the Bulk Import Assets help in Data Governance and Catalog.
Data quality score aggregation for Business Area and Legal Entity assets
View aggregate and individual data quality scores for Business Area and Legal Entity assets on the Data Quality tab of these asset pages.
For more information, see the Asset Details in Data Governance and Catalog.
Search improvements
Specify the search method
You can now specify the search method that you want the system to use when you search for assets. You can select one of the following methods to search for assets in Data Governance and Catalog:
- Keyword. Allows you to use a keyword or phrase that might be used in an asset name or description. You can also enable the Flexible toggle on this search method to find matches even when you misspell, interchange, or miss a keyword in a search phrase.
- Query. Allows you to use a search query for a more precise and refined set of results.
- Adaptive. The system automatically uses the appropriate search method based on your input. You can use a keyword, phrase, or a search query to find the assets.
The following image shows the search bar in Data Governance and Catalog:
For more information about how you can search for assets, see the Working With Assets in Data Governance and Catalog.
Find bar for dropdown menus
You can use now use the Find bar in the dropdown menu options to directly type and search among the menu options in Data Governance and Catalog .
The following image shows the Find bar in dropdown menu options:
Filters assets in a widget based on custom attributes
You can filter assets in a widget based on custom attributes. You can now choose to display custom attributes of assets using the custom attribute option in the Select Display Columns list or group assets by custom attributes using the Group By option in widgets.
Exact total count of search results
You can now view the exact count of your search results.
Previously, if the total search results exceeded 10000 assets, Data Governance and Catalog would display "10K+ assets" instead of showing the exact count of assets.
Improved search precision using wildcard entry
When you search for assets using a search query that uses a wildcard entry *, the system now displays only the assets whose names or descriptions start with the search keyword that you entered.
Earlier, the system would display even the assets where the search keyword that you entered was part of a larger keyword or phrase.
For more information about how you can search for assets, see the Working With Assets in Data Governance and Catalog.
View process maps for Process assets
You can now view a visual representation of a process flow on the newly added Process Maps tab of a process asset. A process flow comprises of the Process type assets that have a is Predecessor to relationship with one another.
The following image shows the process map for a process asset:
For more information about process maps, see the Asset Details in Data Governance and Catalog.
Generate data observability events for specific data
Receive data observability event notifications for data that is relevant for you. The administrator can apply filters on catalog sources in Metadata Command Center to specify the metrics, sensitivity of the anomaly detection algorithm, and detection rules to apply on profiled data.
For information about configuring data observability, see Configuring data observability events in the Administration help module of Metadata Command Center.
Configure asset tabs on the Browse page
On the Browse page, you can now specify the tabs that you want to display and also reorder the displayed tabs.
The following image shows the Tab Settings dialog box where you can configure asset tabs on the Browse page:
Informatica QuickLook browser extension available in new languages
The Informatica QuickLook browser extension is now translated to the Brazilian Portuguese, French, and Spanish languages.
Generate automated lineage with CLAIRE
You can now link catalog sources and generate lineage automatically with CLAIRE. You can choose to automatically accept CLAIRE-generated lineage recommendations or manually accept them. CLAIRE-generated lineage recommendations are automatically accepted based on a threshold limit.
Stakeholders of the source and target catalog sources can reject the auto-accepted and manually accepted catalog source links generated by CLAIRE in Data Governance and Catalog.
Note: You can generate lineage automatically with CLAIRE only if your organization administrator allows the use of CLAIRE generative AI services.
For information about automated lineage generation, see Link catalog sources in the Administration help.
For more information about curation of the generated catalog source links, see Linked lineage in Data Governance and Catalog help.
New catalog sources
This release includes the following new catalog sources:
•Oracle Data Integrator
•SAP Datasphere (Preview)
•SAP Analytics Cloud (Preview)
•SAS Base Programs
To enable and configure this catalog source, you need assistance from Informatica Professional Services. For more information, contact your account representative.
For more information, see the Oracle Data Integrator, SAP Datasphere, SAP Analytics Cloud, and SAS Base Programs catalog source help.
Enhanced catalog sources
Databricks
This release includes the following enhancements:
- You can view notebooks that reference Unity Catalog tables in the lineage.
- You can extract Databricks Unity Catalog lineage from system tables.
- You can extract the following complex data types along with their nested fields from Databricks Delta Lake source systems:
▪ Map
▪ Struct
▪ Array
For more information, see the Databricks catalog source help.
Google BigQuery
A metadata extraction job extracts information about partitions and clusters.
For more information, see the Google BigQuery catalog source help.
Informatica Intelligent Cloud Services
You can perform connection assignment to the following endpoint catalog sources:
- Microsoft Fabric Data Lakehouse
- Microsoft Fabric Warehouse
For more information, see the Informatica Intelligent Cloud Services catalog source help.
SAP BW/4HANA
You can perform connection assignment to SAP HANA endpoint catalog sources.
For more information, see the Catalog Source Configuration help.
Snowflake
This release includes the following enhancements:
- You can extract metadata from stored procedures that use Snowpark Python language. (Preview)
- You can use the Client Credentials authentication type for metadata extraction.
For more information, see the Snowflake catalog source help.
Microsoft Power BI
This release includes the following enhancements:
- You can view lineage data from Microsoft Fabric Lakehouse and Microsoft Fabric Data Warehouse source systems.
- You can extract metadata from dataset tables.
For more information, see the Catalog Source Configuration help.
Microsoft Azure Data Factory
You can now process the Filter and Fail activity types.
For more information, see the Microsoft Azure Data Factory help.
Informatica PowerCenter
You can view partitioned files in an IBM Db2 for LUW reference source system that you connect to as an endpoint catalog source.
For more information, see the Catalog Source Configuration help.
Configure additional data capabilities
You can configure the following additional data capabilities on catalog sources:
•Glossary association and data classification capabilities on Informatica PowerCenter catalog sources.
•Glossary association, relationship discovery, and data classification capabilities on the following catalog sources:
- Microsoft Fabric Data Warehouse
- PostgreSQL
- QlikSense
- QlikView
•Profiling and data quality tasks on the following catalog sources:
- IBM Db2 for LUW
- IBM Db2 for z/OS
- Hadoop Distributed File System
•Data observability on the following catalog sources:
- Apache Hive
- MariaDB
- MySQL
- Microsoft Fabric Data Lakehouse
- Microsoft Fabric Data Warehouse
- Microsoft Fabric OneLake
- PostgreSQL
- SAP Business Warehouse (SAP BW)
- SAP HANA Database
- Teradata Database
For more information about catalog sources, see the Catalog Source Configuration help.
Profiling enhancements
Microsoft SQL Server
You can run profiles on tables with more than 5000 columns.
Salesforce
You can run profiles on the following data types:
- Picklist
- Text Area
Amazon Redshift
You can run profiles on tables and external tables with the string data type and on columns with the super data type.
Microsoft Fabric OneLake
You can run profiles on objects created in the following file formats:
- AVRO
- CSV
- JSON
- Parquet
For more information, see the Catalog Source Configuration help.
Configure external secrets manager authentication for data profiling and data quality jobs
You can configure the following catalog sources to run data profiling and data quality jobs using Azure Key Vault authentication:
•Google Cloud Storage
•JDBC
•Oracle
•PostgreSQL
•Microsoft Azure SQL Server
•Microsoft Azure Synapse
•MySQL
•MariaDB
For information about how to configure Secrets Manager in Administrator, see Organization Administration in the Administration help.
Incremental metadata extraction
You can now run incremental metadata extraction jobs on the following catalog sources:
•Amazon S3
•Snowflake
A full metadata extraction extracts all objects from the source to the catalog. An incremental metadata extraction considers only the changed and new objects since the last successful catalog source job run. Incremental metadata extraction doesn’t remove deleted objects from the catalog and doesn’t extract metadata of code-based objects.
For more information about Amazon S3 and Snowflake catalog sources, see the Amazon S3 and Snowflake help.
Synchronize metadata from Data Integration ELT mappings
The Informatica Intelligent Cloud Services catalog source and IDMC metadata now synchronize metadata from ELT (Extract Load Transform) mappings in Data Integration.
For information about IDMC metadata, see IDMC Metadata in the Administration help.
For information about the Informatica Intelligent Cloud Services catalog source, see the Informatica Intelligent Cloud Services catalog source help.
Column header detection for CSV files
The detection of column headers in CSV files includes the following improvements:
•Duplicate data element names are suffixed with #<number>.
•Detects headers automatically for delimited files.
•Empty header values are extracted as UnknownColumn<position>.
For information about catalog sources, see the Catalog Source Configuration help.