Installation and Upgrade Guide > Part II: Install and Configure for Amazon EMR > Configuration for Amazon EMR > Configure the Data Integration Service
  

Configure the Data Integration Service

Configure Data Integration Service properties for the Hadoop environment.
You can configure the Data Integration Service in the Administrator tool.
The following table describes the Hadoop properties for the Data Integration Service:
Property
Description
Informatica Home Directory on Hadoop
The Big Data Management home directory on every data node created by the Hadoop RPM install. Default is /opt/Informatica.
Hadoop Distribution Directory
The directory containing a collection of Hive and Hadoop JARS on the cluster from the RPM Install locations. The directory contains the minimum set of JARS required to process Informatica mappings in a Hadoop environment. Type /<Big Data Management installation directory>/Informatica/services/shared/hadoop/<Hadoop distribution name>_<version number>.
For example:
/opt/Informatica/services/shared/hadoop/amazon_emr_5.0.0
Data Integration Service Hadoop Distribution Directory
The Hadoop distribution directory on the Data Integration Service node.
Type /<Informatica installation directory>/Informatica/services/shared/hadoop/<Hadoop distribution name>_<version number>.
For example:
../../services/shared/hadoop/amazon_emr_5.0.0
Note: The contents of the Data Integration Service Hadoop distribution directory must be identical to Hadoop distribution directory on the data nodes.