Intelligent Data Lake Administrator Guide > Data Preparation Service > Creating and Managing the Data Preparation Service
  

Creating and Managing the Data Preparation Service

Use the Administrator tool to create and manage the Data Preparation service. When you change a service property, you must recycle the service or disable and then enable the service for the changes to take affect.

Creating the Data Preparation Service

Use the service creation wizard in the Administrator tool to create the service.
    1. In the Administrator tool, click the Managetab.
    2. Click the Services and Nodes view.
    3. In the Domain Navigator, select the domain.
    4. Click Actions > New > Data Preparation Service.
    5. On the New Data Preparation Service - Step 1 of 6 page, enter the following properties:
    Property
    Description
    Name
    Name of the Data Preparation service. The name is not case sensitive and must be unique within the domain. It cannot exceed 128 characters or begin with @. It also cannot contain spaces or the following special characters: ` ~ % ^ * + = { } \ ; : ' " / ? . , < > | ! ( ) ] [
    Description
    Description of the Data Preparation service. The description cannot exceed 765 characters.
    Location
    Location of the Data Preparation Service in the Informatica domain. You can create the service within a folder in the domain.
    License
    License object with the data lake option that allows the use of the Data Preparation Service.
    Node Assignment
    Type of node in the Informatica domain on which the Data Preparation Service runs. Select Single Node if a single service process runs on the node or Primary and Backup Nodes if a service process is enabled on each node for high availability. However, only a single process runs at any given time, and the other processes maintain standby status.
    The Primary and Backup Nodes option will be available for selection based on the license configuration.
    Default is Single Node.
    Node
    Name of the node on which the Data Preparation Service runs.
    Backup Nodes
    If your license includes high availability, nodes on which the service can run if the primary node is unavailable.
    6. Click Next.
    7. On the New Data Preparation Service - Step 2 of 6 page, enter the following properties:
    Property
    Description
    HTTP Port
    Port number for the HTTP connection to the Data Preparation Service.
    Enable Secure Communication
    Use a secure connection to connect to the Data Preparation Service. If you enable secure communication, you must set all required HTTPS properties, including the keystore and truststore properties.
    HTTPS Port
    Port number for the HTTPS connection to the Data Preparation Service.
    Keystore File
    Path and the file name of keystore file that contains key and certificates required for HTTPS communication.
    Keystore Password
    Password for the keystore file.
    Truststore File
    Path and the file name of truststore file that contains authentication certificates for the HTTPS connection.
    Truststore Password
    Password for the truststore file.
    8. Click Next.
    9. On the New Data Preparation Service - Step 3 of 6 page, enter the following properties:
    Property
    Description
    Database Type
    Type of database to use for the Data Preparation repository. Only MySQL database is supported.
    Host Name
    Host name of the machine that hosts the Data Preparation repository database.
    Port Number
    Port number for the database.
    Database User Name
    Database user account to use to connect to the Data Preparation repository.
    Database User Password
    Password for the Data Preparation repository database user account.
    Schema Name
    Schema or database name of the Data Preparation repository database.
    10. Click Next.
    11. On the New Data Preparation Service - Step 4 of 6 page, enter the following properties:
    Property
    Description
    Solr Port
    Port number for the Apache Solr server used to provide data preparation recommendations.
    12. Click Next.
    13. On the New Data Preparation Service - Step 5 of 6 page, enter the following properties:
    Property
    Description
    Local Storage Location
    Directory for data preparation file storage on the node on which the Data Preparation Service runs.
    HDFS Connection
    HDFS connection for data preparation file storage.
    HDFS Storage Location
    HDFS location for data preparation file storage. If the connection to the local storage fails, the Data Preparation Service recovers data preparation files from the HDFS location.
    Hadoop Authentication Mode
    Security mode of the Hadoop cluster for data preparation storage. If the Hadoop cluster uses Kerberos authentication, you must set the required Hadoop security properties for the cluster.
    HDFS Service Principal Name
    Service Principal Name (SPN) for the data preparation Hadoop cluster. Specify the service principal name in the following format: user/_HOST@REALM.
    Hadoop Impersonation User Name
    User name to use in Hadoop impersonation as set in core-site.xml.
    SPN Keytab File for User Impersonation
    Path and file name of the SPN keytab file for the user account to impersonate when connecting to the Hadoop cluster. The keytab file must be in a directory on the machine where the Data Preparation Service runs.
    14. Click Next.
    15. On the New Data Preparation Service - Step 6 of 6 page, enter the following properties:
    Property
    Description
    Hadoop Distribution
    Hadoop distribution used for the data lake Hadoop cluster. Select Cloudera or HortonWorks.
    16. Click Finish.
Note: After you create the Data Preparation Service, ensure that you complete the following steps:
  1. 1. In the Administrator tool, click Actions > Create Repository to create the repository contents.
  2. 2. Click Actions > Enable Service.

Enabling, Disabling, and Recycling the Data Preparation Service

You can enable, disable, and recycle the service from the Administrator tool.
    1. In the Administrator tool, click the Manage tab > Services and Nodes view.
    2. In the Domain Navigator, select the service.
    3. On the Actions tab, select one of the following options:
    1. a. Enable Service to enable the service.
    2. b. Disable Service to disable the service.
    3. Choose the mode to disable the service in. Optionally, you can choose to specify whether the action was planned or unplanned, and enter comments about the action. If you complete these options, the information appears in the Events and Command History panels in the Domain view on the Manage tab.
    4. c. Recycle Service to recycle the service.

Editing the Data Preparation Service

To edit the Data Preparation Service, select the service in the Domain Navigator and click the Properties view. You can change the properties while the service is running, but you must restart the service for the properties to take effect.
To edit the Data Preparation Service:
    1. To edit specific properties, click the pencil icon in the selected properties area.
    2. In the Edit Properties window, edit the required fields.
    3. Click OK.
    4. Click Actions > Recycle Service.
    5. In the Recycle Service window, select the required options.
    6. Click OK to restart the service.

Deleting the Data Preparation Service

Only users with ADMIN or WRITE permissions for the Data Preparation Service can delete the service.
To delete the Data Preparation Service:
    1. On the Manage tab, select the Services and Nodes view.
    2. In the Domain Navigator, select the Data Preparation Service.
    3. Disable the Data Preparation Service by clicking Actions > Disable Service .
    4. To delete the Data Preparation Service, click Actions > Delete.