Property | Description |
---|---|
Name | Name of the Data Preparation Service. The name is not case sensitive and must be unique within the domain. It cannot exceed 128 characters or begin with @. It also cannot contain spaces or the following special characters: ` ~ % ^ * + = { } \ ; : ' " / ? . , < > \| ! ( ) ] [ |
Description | Description of the Data Preparation Service. The description cannot exceed 765 characters. |
Location | Location of the Data Preparation Service in the Informatica domain. You can create the service within a folder in the domain. |
License | License object with the data lake option that allows the use of the Data Preparation Service. |
Node Assignment | Type of node in the Informatica domain on which the Data Preparation Service runs. Select Single Node if a single service process runs on one node. Select Primary and Backup Nodes if a service process is enabled on each node for high availability. With Primary and Backup Nodes, only one process runs at any given time, and the other processes remain on standby. Whether the Primary and Backup Nodes option is available depends on the license configuration. Default is Single Node. |
Node | Name of the node on which the Data Preparation Service runs. |
Backup Nodes | If your license includes high availability, nodes on which the service can run if the primary node is unavailable. |
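
The naming rules in the Name property can be checked before you create the service. The following is a minimal sketch, not part of the product: the function name `validate_service_name` and the `existing_names` argument are hypothetical, and the check covers only the rules stated in the table above.

```python
# Special characters that the Name property does not allow (from the table above).
FORBIDDEN_CHARS = set("`~%^*+={}\\;:'\"/?.,<>|!()][")


def validate_service_name(name, existing_names=()):
    """Return a list of rule violations; an empty list means the name passes.

    `existing_names` is a hypothetical list of names already used in the domain.
    """
    problems = []
    if len(name) > 128:
        problems.append("exceeds 128 characters")
    if name.startswith("@"):
        problems.append("begins with @")
    if " " in name:
        problems.append("contains spaces")
    bad = sorted(set(name) & FORBIDDEN_CHARS)
    if bad:
        problems.append("contains forbidden characters: " + " ".join(bad))
    # Names are not case sensitive, so compare in a case-insensitive way.
    if name.casefold() in {n.casefold() for n in existing_names}:
        problems.append("not unique within the domain")
    return problems


if __name__ == "__main__":
    print(validate_service_name("DPS_01"))
    print(validate_service_name("dps.v1", existing_names=["DPS_01"]))
```

Returning a list of problems rather than raising on the first one lets a caller report every violation at once.
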

Property | Description |
---|---|
HTTP Port | Port number for the HTTP connection to the Data Preparation Service. |
Enable Secure Communication | Use a secure connection to connect to the Data Preparation Service. If you enable secure communication, you must set all required HTTPS properties, including the keystore and truststore properties. |
HTTPS Port | Port number for the HTTPS connection to the Data Preparation Service. |
Keystore File | Path and file name of the keystore file that contains the keys and certificates required for HTTPS communication. |
Keystore Password | Password for the keystore file. |
Truststore File | Path and file name of the truststore file that contains the authentication certificates for the HTTPS connection. |
Truststore Password | Password for the truststore file. |
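
The dependency between Enable Secure Communication and the other HTTPS properties can be expressed as a simple pre-flight check. This is a sketch under stated assumptions: the properties are held in a plain dictionary keyed by the labels in the table above, the function name `missing_https_properties` is hypothetical, and all five HTTPS-related properties are treated as required, which the table implies but does not list explicitly.

```python
# Assumption: these are the properties that must be set when secure
# communication is enabled. The table names the keystore and truststore
# properties; including the HTTPS port is an assumption.
REQUIRED_WHEN_SECURE = [
    "HTTPS Port",
    "Keystore File",
    "Keystore Password",
    "Truststore File",
    "Truststore Password",
]


def missing_https_properties(props):
    """Return the HTTPS properties that are unset while secure communication is on."""
    if not props.get("Enable Secure Communication"):
        return []
    return [key for key in REQUIRED_WHEN_SECURE if not props.get(key)]


if __name__ == "__main__":
    example = {
        "HTTP Port": 8099,  # illustrative value only
        "Enable Secure Communication": True,
        "HTTPS Port": 8443,  # illustrative value only
        "Keystore File": "/path/to/keystore.jks",  # illustrative path only
    }
    print(missing_https_properties(example))
```
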

Property | Description |
---|---|
Database Type | Type of database to use for the Data Preparation repository. Only MySQL is supported. |
Host Name | Host name of the machine that hosts the Data Preparation repository database. |
Port Number | Port number for the database. |
Database User Name | Database user account to use to connect to the Data Preparation repository. |
Database User Password | Password for the Data Preparation repository database user account. |
Schema Name | Schema or database name of the Data Preparation repository database. |

Property | Description |
---|---|
Solr Port | Port number for the Apache Solr server used to provide data preparation recommendations. |

Property | Description |
---|---|
Local Storage Location | Directory for data preparation file storage on the node on which the Data Preparation Service runs. |
HDFS Connection | HDFS connection for data preparation file storage. |
HDFS Storage Location | HDFS location for data preparation file storage. If the connection to the local storage fails, the Data Preparation Service recovers data preparation files from the HDFS location. |
Hadoop Authentication Mode | Security mode of the Hadoop cluster for data preparation storage. If the Hadoop cluster uses Kerberos authentication, you must set the required Hadoop security properties for the cluster. |
HDFS Service Principal Name | Service Principal Name (SPN) for the data preparation Hadoop cluster. Specify the service principal name in the following format: user/_HOST@REALM. |
Hadoop Impersonation User Name | User name to use in Hadoop impersonation as set in core-site.xml. |
SPN Keytab File for User Impersonation | Path and file name of the SPN keytab file for the user account to impersonate when connecting to the Hadoop cluster. The keytab file must be in a directory on the machine where the Data Preparation Service runs. |
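
The `user/_HOST@REALM` format required for the HDFS Service Principal Name can be checked for shape before you enter it. The sketch below is illustrative only: the function name `parse_spn` and the pattern are assumptions, and the check confirms only that the value has three non-empty parts separated by `/` and `@`. It does not verify that the principal exists in the Kerberos realm.

```python
import re

# Three non-empty parts: user, host (or the _HOST placeholder), and realm.
SPN_PATTERN = re.compile(r"(?P<user>[^/@\s]+)/(?P<host>[^/@\s]+)@(?P<realm>[^/@\s]+)")


def parse_spn(spn):
    """Split a service principal name into its user, host, and realm parts.

    Raises ValueError if the value does not follow the user/_HOST@REALM shape.
    """
    match = SPN_PATTERN.fullmatch(spn)
    if match is None:
        raise ValueError(f"expected user/_HOST@REALM, got: {spn!r}")
    return match.groupdict()


if __name__ == "__main__":
    # EXAMPLE.COM is a placeholder realm, not a value from the product.
    print(parse_spn("hdfs/_HOST@EXAMPLE.COM"))
```
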

Property | Description |
---|---|
Hadoop Distribution | Hadoop distribution used for the data lake Hadoop cluster. Select Cloudera or HortonWorks. |