Before you create a catalog source, ensure that you have the information required to connect to the source system.
Perform the following tasks:
•Assign the required permissions.
•Configure permissions to the service account that you use to access the Google Cloud Platform project.
•Configure a connection to the Google Cloud Storage source system in Administrator.
•Optionally, if you want to identify pairs of similar columns and relationships between tables within a catalog source, import a relationship inference model.
Verify permissions
To extract metadata and to configure other capabilities that a catalog source might include, you need account access and permissions on the source system. The permissions required might vary depending on the capability.
Permissions to extract metadata
Ensure that you have the required permissions to enable metadata extraction.
Grant the Storage Object Viewer role to the service account that you use to access the Google Cloud Platform project.
Permissions to run data profiles
Ensure that you have the required permissions to run profiles.
Verify that the administrator performs the following tasks:
•Grants the Storage Object Viewer role to the service account that you use to access the Google Cloud Platform project.
•Grants the storage.objects.create permission to the Google Cloud Storage bucket where the profiling results are stored temporarily.
Permissions to perform data classification
You can perform data classification with the permissions required to perform metadata extraction.
Permissions to perform relationship discovery
You can perform relationship discovery with the permissions required to perform metadata extraction.
Permissions to perform glossary association
You can perform glossary association with the permissions required to perform metadata extraction.
Create a connection
Create a Google Cloud Storage connection object in Administrator with the connection details of the Google Cloud Storage source system.
1In Administrator, select Connections.
2Click New Connection.
3In the Connection Details section, enter the following connection details:
Connection property
Description
Connection Name
Name of the connection.
Each connection name must be unique within the organization. Connection names can contain alphanumeric characters, spaces, and the following special characters: _ . + -,
Maximum length is 255 characters.
Description
Description of the connection. Maximum length is 4000 characters.
4Select the Google Cloud Storage V2 connection type.
5Enter properties specific to the Google Cloud Storage connection:
Property
Description
Runtime Environment
The execution platform that runs tasks. The runtime environment is either the Agent based environment or a serverless runtime environment.
Service Account ID
The account ID that is used to generate the service account email address and a unique ID.
Service Account Key
Private key of the Google Cloud platform service account.
Project ID
The ID of the Google Cloud platform project that you want to access.
Private Key ID
Private key ID of the Google Cloud platform service account.
Client ID
The ID of the Google Cloud platform service account.
Is Encrypted File
Specifies whether the file is encrypted.
Bucket Name
Bucket name of the stored objects from which you want to extract metadata.
6Click Test Connection.
7Click Save.
Import a relationship inference model
Import a relationship inference model if you want to configure the relationship discovery capability. You can either import a predefined relationship inference model, or import a model file from your local machine.
1In Metadata Command Center, click Explore on the navigation panel.
2Expand the menu and select Relationship Inference Model. The following image shows the Explore page with the Relationship Inference Model menu:
3Select one of the following options:
- Import Predefined Content. Imports a predefined relationship inference model called Column Similarity Model v1.0.
- Import. Imports the predefined relationship inference model from your local machine. Select this if you previously imported predefined content into your local machine and the inference model is stored on the machine.
To import a file, click Choose File in the Import Relationship Inference Model window and navigate to the model file on your local machine. You can also drag and drop the file.
The imported models appear in the list of relationship inference models on the Relationship Discovery tab.