Catalog Source Configuration > Google BigQuery SQL Script > Before you begin
  

Before you begin

Before you create a catalog source, ensure that you have the information required to connect to the source system.
Perform the following tasks:

Verify permissions

To extract metadata and to configure other capabilities that a catalog source might include, you need account access and permissions on the source system. The permissions required might vary depending on the capability.

Permissions for metadata extraction

To extract Google BigQuery SQL Script metadata, you need account access and permissions to the source system.
Verify that you have the READ permission to access the folder containing scripts.
To assign permissions to extract metadata, choose one of the following role options:
The bigquery.tables.getData permission is needed to query the __TABLES__ table from a dataset to get information such as description, ID, and last modified date. The bigquery.jobs.create permission is needed to run queries on the dataset.

Create a connection

Create a Google BigQuery connection object in Administrator with the connection details of the Google BigQuery source system.
    1In Administrator, select Connections.
    2Click New Connection.
    3In the Connection Details section, enter the following connection details:
    Connection property
    Description
    Connection Name
    Name of the connection.
    Each connection name must be unique within the organization. Connection names can contain alphanumeric characters, spaces, and the following special characters: _ . + -,
    Maximum length is 255 characters.
    Description
    Description of the connection. Maximum length is 4000 characters.
    4Select the Google BigQuery V2 connection type.
    5Enter properties specific to the Google BigQuery connection:
    Property
    Description
    Runtime Environment
    The execution platform that runs tasks. The runtime environment is either a Secure Agent or a serverless runtime environment.
    Authentication Type
    Select the Service Account authentication type to access Google BigQuery and configure the authentication-specific parameters.
    Service Account Email
    The client_email value from the Google service account key JSON file.
    Service Account Key
    The private_key value from the Google service account key JSON file.
    Project ID
    The project_id value from the Google service account key JSON file.
    If you have created multiple projects with the same service account, enter the ID of the project that contains the dataset that you want to connect to.
    Connection mode
    The mode that you want to use to read data from or write data to Google BigQuery. Choose the Hybrid connection mode for optimum data profiling results.
    Use Legacy SQL for Custom Query
    Select this option to use a legacy SQL to define a custom query. If you clear this option, use a standard SQL to define a custom query. Clear this option for optimum data profiling results.
    Retry Strategy Section
    Enable or disable retry. When you read data from Google BigQuery in staging mode, you can configure the retry strategy when the Google BigQuery V2 connection fails to connect to the Google BigQuery source.
    6Click Test Connection.
    7Click Save.

Create endpoint catalog sources for connection assignment

An endpoint catalog source represents a source system that the catalog source references. Before you perform connection assignment, create endpoint catalog sources and run the catalog source jobs.
You can then perform connection assignment to reference source systems and run connection-aware scans to view complete lineage with source system objects.