Installation and Upgrade Guide > Part II: Install and Configure for Amazon EMR > Configuration for Amazon EMR > Developer Tool Tasks for Amazon EMR
  

Developer Tool Tasks for Amazon EMR

Perform tasks on the machine that hosts the Developer tool. The tasks that you complete enable the Developer tool and the Data Integration Service to connect to the Hadoop environment.

Create Connections

Define a Hadoop connection to run a mapping in the Hadoop environment.
  1. 1. Log in to the Developer tool.
  2. 2. Click Window > Preferences.
  3. 3. Select Informatica > Connections.
  4. 4. Expand the domain in the Available Connections list.
  5. 5. Select the Cluster connection type from the list and click Add.
  6. The New Cluster Connection dialog box appears.
  7. 6. Enter the general properties for the connection.
  8. 7. Click Next.
  9. 8. Enter the hadoop cluster properties and the common properties for the Hadoop connection.
  10. 9. Click Next.
  11. 10. Enter the Hive pushdown configuration properties and the Hive configuration.
  12. 11. Click Next.
  13. 12. Enter properties for the Blaze engine.
  14. 13. Enter properties for the Spark engine.
  15. 14. Click Finish.
For information about connection properties, see Connection Property Reference.

Update Files on the Developer Tool

Update developerCore.ini on the machine that hosts the Developer tool.

developerCore.ini

Edit developerCore.ini to enable communication between the Developer tool and the Hadoop cluster. You can find developerCore.ini in the following directory: <Informatica installation directory>\clients\DeveloperClient
Add the following property:
-DINFA_HADOOP_DIST_DIR=hadoop\<distribution><version>
The change takes effect when you restart the Developer tool.