
Targets for Databricks

Add a Target transformation to write data to a target.
When you add a Target transformation to a mapping, you define the target connection, target objects, and target properties related to the Databricks connection type.
The following table lists the target properties that are supported by SQL warehouse, all-purpose cluster, and job cluster:
Property | SQL warehouse (1) | All-purpose cluster (2) | Job cluster (3)
Target Type - Single Object | Yes | Yes | Yes
Target Type - Parameter | Yes | Yes | Yes
Object - Existing | Yes | Yes | Yes
Object - Create New at Runtime | Yes | Yes | Yes
Create target - Object Name, Table Location, Database Name | Yes | Yes | Yes
Create target - Table Properties | Yes | No | No
Operation - Insert, Update, Upsert, Delete | Yes | No | Yes
Operation - Data Driven | Yes | No | No
Target Database Name | Yes | No | Yes
Target Table Name | Yes | No | Yes
Update Override Query | Yes | No | No
Write Disposition - Append, Truncate | Yes | No | Yes
Write Disposition - Truncate Always | Yes | No | No
Update Mode - Update as update, Update else insert | Yes | No | Yes
Staging Location | Yes | No | Yes
Pre SQL | Yes | No | No
Post SQL | Yes | No | No
DTM Staging File Size | Yes | No | No
Job Timeout | No | No | Yes
Job Status Poll Interval | No | No | Yes
DB REST API Timeout | No | No | Yes
DB REST API Retry Interval | No | No | Yes
Forward Rejected Rows | Yes | No | Yes

(1) The Secure Agent connects to the SQL warehouse at design time and at runtime.
(2) The Secure Agent connects to the all-purpose cluster to import the metadata at design time.
(3) The Secure Agent connects to the job cluster to run the mappings.

Target properties for Databricks

In a mapping, you can configure a Target transformation to represent a Databricks object.
The following table describes the Databricks properties that you can configure in a Target transformation:
Property
Description
Connection
Name of the target connection. Select a target connection or click New Parameter to define a new parameter for the target connection.
Target Type
Target type. Select one of the following types:
  • - Single Object.
  • - Parameter. Select Parameter to define the target type when you configure the task.
Object
Name of the target object.
Create Target
Creates a target.
Enter a name for the target object and select the source fields that you want to use. By default, all source fields are used. You can select an existing target object or create a new target object at runtime.
You cannot parameterize the target at runtime.
Operation
Defines the type of operation to be performed on the target table.
Select from the following list of operations:
  • - Insert (Default)
  • - Update
  • - Upsert
  • - Delete
  • - Data Driven (1)
When you use the upsert operation, you must set the Update Mode in the target details to Update Else Insert.
If the key column receives a null value from the source, the following actions take place for the different operations:
  • - Update. Skips the operation and does not update the row.
  • - Delete. Skips the operation and does not delete the row.
  • - Upsert. Inserts a new row instead of updating the existing row.
Update Columns
The fields to use as temporary primary key columns when you update, upsert, or delete data on the Databricks Delta target tables. When you select more than one update column, the mapping task uses the AND operator with the update columns to identify matching rows.
Applies to update, upsert, delete, and data driven operations.
Data Driven Condition (1)
Flags rows for an insert, update, delete, or reject operation based on the expressions that you define.
For example, the following IIF statement flags a row for reject if the ID field is null. Otherwise, it flags the row for update:
IIF(ISNULL(ID), DD_REJECT, DD_UPDATE)
Required if you select the data driven operation.
(1) Applies only to mappings in advanced mode.
The following table describes the Databricks advanced properties that you can configure in a Target transformation:
Advanced Property
Description
Target Database Name (1)
Overrides the database name provided in the connection and the database selected in the metadata browser for existing targets.
Note: You cannot override the database name when you create a new target at runtime.
Target Table Name (1)
Overrides the table name at runtime for existing targets.
Update Override Query
Overrides the default update query that the agent generates for the update operation with the query that you specify in this field.
Use the merge command for the update operation.
Write Disposition
Overwrites or adds data to the existing data in a table. You can select from the following options:
  • - Append. Appends data to the existing data in the table even if the table is empty.
  • - Truncate. Overwrites the existing data in the table. Applies only to the insert operation and non-empty sources.
  • - Truncate Always. Overwrites the existing data in the table. Applies to insert, update, upsert, and delete target operations for empty and non-empty sources.
Note: You cannot use Truncate Always if you do not specify the database name in the connection properties and when you create a new target at runtime.
Update Mode (1)
Defines how rows are updated in the target tables. Select from the following options:
  • - Update As Update: Rows matching the selected update columns are updated in the target.
  • - Update Else Insert: Rows matching the selected update columns are updated in the target. Rows that don't match are appended to the target.
Staging Location
Relative directory path to store the staging files.
  • - If Databricks is hosted on the AWS platform, use the path relative to the Amazon S3 staging bucket.
  • - If Databricks is hosted on the Azure platform, use the path relative to the Azure Data Lake Storage Gen2 staging filesystem name.
Note: When you use Unity Catalog, you must provide a pre-existing location on your cloud storage in the Staging Location field.
Pre SQL
The pre-SQL command to run before the agent writes to Databricks.
For example, if you want to assign a sequence object to a primary key field of the target table before you write data to the table, specify a pre-SQL statement.
You can specify multiple pre-SQL commands, each separated by a semicolon.
Post SQL
The post-SQL command to run after the agent completes the write operation.
For example, if you want to alter the table created by using the create target option and assign constraints to the table after you write data to the table, specify a post-SQL statement.
You can specify multiple post-SQL commands, each separated by a semicolon. For a sample set of pre-SQL and post-SQL commands, see the example that follows this table.
DTM Staging File Size
The size of the flat file that Data Integration creates locally in a temporary folder to stage the data before writing to Databricks.
Default is 50 MB.
Job Timeout (1)
The maximum time in seconds for the Spark job to complete processing.
If the job is not completed within the time specified, the job terminates and the mapping fails.
If the job timeout is not specified, the mapping shows success or failure based on the job completion.
Job Status Poll Interval (1)
The interval in seconds at which the Secure Agent checks the job completion status.
Default is 30 seconds.
DB REST API Timeout (1)
The maximum time in seconds for which the Secure Agent retries the REST API calls to Databricks when there is a network connection error or the REST endpoint returns a 5xx HTTP error code.
Default is 10 minutes.
DB REST API Retry Interval (1)
The time interval in seconds at which the Secure Agent retries the REST API call when there is a network connection error or the REST endpoint returns a 5xx HTTP error code.
This value does not apply to the job status REST API. Use the Job Status Poll Interval value for the job status REST API.
Default is 30 seconds.
Forward Rejected Rows
Determines whether the transformation passes rejected rows to the next transformation or drops rejected rows. By default, the agent forwards rejected rows to the next transformation.
(1) Doesn't apply to mappings in advanced mode.
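The following commands are a minimal sketch of how you can chain multiple statements with semicolons in the Pre SQL and Post SQL fields. The sales.orders table, its columns, and the retention logic are hypothetical; replace them with objects that exist in your Databricks workspace.
Pre SQL:
DELETE FROM sales.orders WHERE load_date < date_sub(current_date(), 7); ALTER TABLE sales.orders SET TBLPROPERTIES ('delta.autoOptimize.optimizeWrite' = 'true')
Post SQL:
ALTER TABLE sales.orders ALTER COLUMN order_id SET NOT NULL; OPTIMIZE sales.orders
In this sketch, the pre-SQL statements remove rows older than seven days and enable optimized writes before the load, and the post-SQL statements add a NOT NULL constraint and compact the table after the write operation completes.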

Create a target table at runtime

You can use an existing target or create a target to hold the results of a mapping. If you choose to create the target, the agent creates the target when you run the task if it does not already exist.
You can create both managed and external tables for mappings and mappings in advanced mode.
To specify the target properties, perform the following tasks:
  1. Select the Target transformation in the mapping.
  2. To specify the target, click the Target tab.
  3. Select the target connection.
  4. For the target type, choose Single Object or Parameter.
  5. Specify the target object or parameter.
  6. To specify a target object, perform the following tasks:
     a. Click Select and choose a target object. You can select an existing target object or create a new target object at runtime.
     b. To create a target object at runtime, select Create New at Runtime.
     c. In the Object Name field, enter the name of the target table that you want to create. Specify the table name in lowercase.
     d. In the Table Location field, enter the location of the target table data.
        The table location is relative to the data bucket or data filesystem name specified in the connection. An external table is created if you specify the table location.
        For Unity Catalog, specify a pre-existing location to create a new target at runtime.
        When you use a personal staging location, specify a pre-existing table location with an absolute path to the directory to create a new target at runtime.
     e. In the Database Name field, specify the Databricks database name.
        The database name that you specify in the connection properties takes precedence.
        Note: If you do not specify the Database Name when you create a new target or in the Database connection attribute, the default database is used to create the new target, irrespective of the database name specified in the SQL Warehouse JDBC URL.
     f. Specify the Table Properties to optimize the table configuration settings. You can use table properties to tag tables with information that is not tracked by SQL queries. For the list of table properties and table options, see the Databricks documentation. The sketch after these steps shows how these settings relate to a Databricks table definition.
     g. Click OK.
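For illustration, the following Databricks SQL statement is a rough sketch of the kind of external table that these settings correspond to. It is not the exact statement that the agent generates, and the database name, table name, columns, location path, and table property shown here are hypothetical.
CREATE TABLE my_database.customer_target (
  customer_id INT,
  customer_name STRING,
  city STRING
)
USING DELTA
LOCATION 's3://<data bucket>/customer_target'  -- hypothetical path; Table Location is resolved relative to the bucket or filesystem in the connection
TBLPROPERTIES ('delta.autoOptimize.optimizeWrite' = 'true');
Here, my_database corresponds to the Database Name field, customer_target to the Object Name field, the LOCATION clause to the Table Location field, and the TBLPROPERTIES clause to the Table Properties field.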

Rules and guidelines for create target at runtime

When you configure a mapping with the Create New at Runtime option, consider the following rules:

Override the update operation

You can specify an update override to override the update query that the Secure Agent generates for the update operation.
When you configure an update override, the Secure Agent stages the data in files, loads the staged data into a temporary table, and then uses the merge query that you specify to load the data from the temporary table into the Databricks target table. The syntax that you specify for the update query must be supported by Databricks.
Specify the update override query in the following format:
MERGE INTO <Target table name> AS A
USING :TU AS B
ON A.<Column1> = B.<Column1>
WHEN MATCHED THEN UPDATE SET A.<Column2> = B.<Column2>, A.<Column3> = B.<Column3>, ... A.<ColumnN> = B.<ColumnN>
where :TU represents the incoming data source for the Target transformation. The Secure Agent replaces :TU with a temporary table name at run time and does not validate the update query.
Note: When you specify the name of the target table, you must provide a fully qualified name in the format <Catalog_Name>.<Database_Name>.<Target_TableName>.
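For example, the following query, which assumes a hypothetical main.default.customer target table and hypothetical column names, updates the customer name and city of the rows whose customer_id matches the incoming data:
MERGE INTO main.default.customer AS A
USING :TU AS B
ON A.customer_id = B.customer_id
WHEN MATCHED THEN UPDATE SET A.customer_name = B.customer_name, A.city = B.city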
When you configure an update override in a mapping to write to Databricks, consider the following rules and guidelines:

Passthrough partitioning

You can configure partitioning to optimize the mapping performance at run time when you write data to Databricks targets.
The partition type controls how the agent distributes data among partitions at partition points. You can define the partition type as passthrough partitioning. The agent distributes rows of target data based on the number of threads that you define for the partition.

Determine the order of processing for multiple targets

When you configure a mapping to write to multiple targets in a single pipeline, you can configure each target for any of the write operations. The order of the target operations is not deterministic.
However, if you want the target operations to be processed in a specific order, such as delete, update, and insert, you need to set certain properties either in the Secure Agent or in the task properties.

Set -DEnableSingleCommit=true in the Secure Agent properties

Perform the following tasks to set the property for the Secure Agent:
  1. Open Administrator and select Runtime Environments.
  2. Select the Secure Agent for which you want to set the property.
  3. On the upper-right corner of the page, click Edit.
  4. In the System Configuration Details section, select the Type as DTM for the Data Integration Service.
  5. Edit the JVM options and set the property to -DEnableSingleCommit=true.

Set the EnableSingleCommit property in the task properties

Perform the following tasks to set the property in the task:
  1. On the Runtime Options tab in the mapping task properties, navigate to the Advanced Session Properties section.
  2. From the Session Property Name list, select Custom Properties, and set the Session Property Value to Yes.

Optimize the staging performance of a mapping

Data Integration, by default, creates a flat file locally in a temporary folder to stage the data before writing to Databricks. You can set Data Integration to optimize the staging performance of the write operation on a Linux machine.
If you do not set the staging property, Data Integration performs staging without the optimized settings, which might impact the performance of the task.
You can optimize the staging performance only when you use the SQL warehouse to run the mappings.

Setting the staging property

You can optimize the mapping performance of the write operation on Linux.
You need to set the following staging property in the agent properties:
INFA_DTM_STAGING_ENABLED_CONNECTORS
Perform the following tasks to set the staging property for Tomcat in the Secure Agent properties:
  1. In Administrator, click Runtime Environments.
  2. Edit the Secure Agent for which you want to set the property.
  3. In the System Configuration Details section, select the Service as Data Integration Server and the type as Tomcat.
  4. Set the value of the INFA_DTM_STAGING_ENABLED_CONNECTORS property to the plugin ID of Databricks Connector.
     You can find the plugin ID in the manifest file located in the following directory: <Secure Agent installation directory>/downloads/<Databricks Delta package>/CCIManifest
When you run the mapping, a flat file is created in the following directory on your Linux machine:
/tmp/DELTA_stage
You can check the session log to verify the staging. If the data is staged through the flat file successfully, Data Integration logs the following message in the session log:
The INFA_DTM_STAGING is successfully enabled to use the flat file to create local staging files.

Rules and guidelines

Consider the following guidelines when you enable the staging property for the write operation: