Step 5. Set Up User Impersonation
When a user performs tasks in Enterprise Data Preparation, the Data Integration Service connects to Hadoop services to access the data lake and perform the operation on behalf of the user. The Data Integration Service uses impersonation to pass the user account to Hadoop.
Configure user impersonation for Enterprise Data Preparation based on the user access requirements of your organization.
You can configure user impersonation in the following ways:
- Using the Enterprise Data Preparation user account to connect to Hadoop services
  You can configure the Data Integration Service to impersonate the user account logged in to the Enterprise Data Preparation application. The user account logged in to Enterprise Data Preparation must have user or group authorization to connect to the Hadoop services. To use this option, you must configure the Data Integration Service to use operating system profiles. You must assign the Enterprise Data Preparation user account a default operating system profile with the Use the logged in user as Hadoop Impersonation User option selected.
- Using an authorized user account to connect to Hadoop services
  If the Enterprise Data Preparation user accounts do not have authorization to connect to Hadoop, you can configure the Data Integration Service to impersonate a specific authorized user. The impersonated user account must have authorization to connect to the data lake and the Hadoop services. To use this option, you must configure the Data Integration Service to use operating system profiles. The operating system profile must have the Use the specified user as Hadoop Impersonation User option selected and must specify the authorized user account to impersonate.
- Using the Hive connection user account to connect to Hadoop services
  If user access to the data lake does not require authorization or if all users have the same authorization, you do not need to set up operating system profiles to connect to the Hadoop services. The Data Integration Service connects to Hadoop services using the user account specified in the Hive connection object for Enterprise Data Preparation.
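The choice among the three options above can be summarized as a small decision function. The following Python sketch is illustrative only: the enum, the function name, and both parameters are hypothetical and are not part of any Informatica API. It simply encodes the conditions described in the list.

```python
from enum import Enum


class ImpersonationMode(Enum):
    # Labels paraphrase the three options in this step; they are not product identifiers.
    LOGGED_IN_USER = "operating system profile, logged in user"
    SPECIFIED_USER = "operating system profile, specified authorized user"
    HIVE_CONNECTION_USER = "no operating system profile, Hive connection user"


def choose_impersonation_mode(access_is_uniform: bool,
                              users_authorized_in_hadoop: bool) -> ImpersonationMode:
    """Hypothetical helper that mirrors the three options described above.

    access_is_uniform: True if data lake access needs no authorization,
        or if all users have the same authorization.
    users_authorized_in_hadoop: True if the Enterprise Data Preparation
        user accounts are themselves authorized to connect to Hadoop.
    """
    if access_is_uniform:
        # No per-user distinction is needed, so the Hive connection user suffices.
        return ImpersonationMode.HIVE_CONNECTION_USER
    if users_authorized_in_hadoop:
        # Each logged in user can be passed through to Hadoop directly.
        return ImpersonationMode.LOGGED_IN_USER
    # Otherwise, impersonate one specific authorized account.
    return ImpersonationMode.SPECIFIED_USER


if __name__ == "__main__":
    print(choose_impersonation_mode(False, True).value)
```

Only the first two modes require operating system profiles on the Data Integration Service. The third relies solely on the Hive connection object.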
Configuring the User Account in the Hive Connection
The Hive connection object defines the Data Integration Service user account that can impersonate Enterprise Data Preparation users. Set the user name in the Hive connection object you use to connect to the data lake.
1. In the Administrator tool, click Manage Connections.
2. In the Navigator, select the Hive connection to the data lake.
3. In the properties view, edit the Common Attribute to Both the Modes section.
The Edit Common Attribute to Both the Modes dialog box appears.
4. Set the user name and password for the Data Integration Service user account you want to use to impersonate Enterprise Data Preparation users.
The user account must be the user account specified for user impersonation in the Hadoop cluster.
5. Click OK.
For more information about connection objects and the Hive connection properties, see the Informatica Administrator Guide.
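Step 4 requires that the account is the one specified for user impersonation in the Hadoop cluster. On Apache Hadoop, this is typically expressed through the hadoop.proxyuser.<user>.hosts and hadoop.proxyuser.<user>.groups properties in core-site.xml. The following Python sketch generates those property entries for review. The account name infa_dis, the host name, and the group name are hypothetical placeholders, and your Hadoop distribution may manage these settings through its own administration console instead.

```python
import xml.etree.ElementTree as ET


def proxyuser_properties(user: str, hosts: str, groups: str) -> dict:
    """Build the Hadoop proxy-user property names for one impersonating account."""
    return {
        f"hadoop.proxyuser.{user}.hosts": hosts,    # hosts the account may connect from
        f"hadoop.proxyuser.{user}.groups": groups,  # groups whose members it may impersonate
    }


def to_core_site_fragment(props: dict) -> str:
    """Render the properties as <property> elements for core-site.xml."""
    rendered = []
    for name, value in props.items():
        prop = ET.Element("property")
        ET.SubElement(prop, "name").text = name
        ET.SubElement(prop, "value").text = value
        rendered.append(ET.tostring(prop, encoding="unicode"))
    return "\n".join(rendered)


if __name__ == "__main__":
    # All three values below are hypothetical examples, not product defaults.
    props = proxyuser_properties("infa_dis", "dis-host.example.com", "edp_users")
    print(to_core_site_fragment(props))
```

The user name passed to this helper should match the user name set in the Hive connection object. Restricting hosts and groups to specific values rather than a wildcard limits which users the account can impersonate.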