Installation and Upgrade Guide > Part VII: Install and Configure for MapR > Hadoop Configuration Manager for MapR > Run the Configuration Manager in Console Mode

Run the Configuration Manager in Console Mode

Run the Hadoop Configuration Manager in console mode to configure the Informatica domain for Big Data Management.

1. On the machine where the Data Integration Service runs, open the command line.

2. Go to the following directory: <Informatica installation directory>/tools/BDMUtil

3. Run BDMConfig.sh.

4. Choose option 3 to configure for MapR.

5. In the Distribution Folder Selection section, choose the directory of the MapR distribution that you want to configure.

Note: The distribution might be stored in a directory that uses a different distribution version number.

6. In the Connection Type section, select 1 to configure MapR through SSH.

Configuration through SSH requires SSH connections from the Informatica domain to the machines that host the NameNode, YARN ResourceManager, and Hive client. Informatica recommends that you use an SSH connection without a password or have sshpass or Expect installed. If you do not use one of these methods, you must enter the password each time the utility downloads a file from the Hadoop cluster.

7. For the NameNode host name field, enter the MapR Resource Manager host name.

8. Enter the SSH user ID.

9. Enter the password for the SSH user ID, or press enter if you use an SSH connection without a password.

10. Enter the location for the core-site.xml file on the Hadoop cluster.

11. Enter the location for the mapr-clusters.conf file on the Hadoop cluster.

The Hadoop Configuration Manger connects to the name node and downloads the following files: core-site.xml and mapr-clusters.conf.

12. Enter the YARN Resource Manager host name.

Note: YARN Resource Manager was formerly known as JobTracker.

13. Enter the SSH user ID.

14. Enter the password for the SSH user ID, or press enter if you use an SSH connection without a password.

15. Enter the directory for the mapred-site.xml file on the Hadoop cluster.

16. Enter the directory for the yarn-site.xml file on the Hadoop cluster.

The utility connects to the JobTracker and downloads the following files: mapred-site.xml and yarn-site.xml.

17. Enter the Hive client host name.

18. Enter the SSH user ID.

19. Enter the password for the SSH user ID, or press enter if you use an SSH connection without a password.

20. Enter the directory for the hive-site.xml file on the Hadoop cluster.

The configuration manager connects to the Hive client and downloads the following file: hive-site.xml.

21. Optionally, You can choose to configure the HBase server.

Option	Description
1	No. Select this option if you do not want to configure the HBase server.
2	Yes. Select this option to configure the HBase server.

If you select Yes, enter the following information to configure the HBase server:

a. Enter the HBase server host name.
b. Enter the SSH user ID.
c. Enter the password for the SSH user ID, or press enter if you use an SSH connection without a password.
d. Enter the directory for the hbase-site.xml file on the Hadoop cluster.

22. In the Create Connections section, select the connection type to create Big Data Management connections:

Option	Description
1. Hive	Create a Hive connection to access Hive as a source or target.
2. HDFS	Create an HDFS connection to read data from or write data to the HDFS file system on a Hadoop cluster.
3. Hadoop	Create a Hadoop connection to run mappings in the Hadoop environment.
4. HBase	Create an HBase connection to access HBase.
5. Select all	Create all four types of connection.

Press the number that corresponds to your choice.

23. The Domain Information section displays the domain name and the node name. Enter the following additional information about Informatica domain.

a. Enter the domain user name.
b. Enter the domain password.
c. Enter the Data Integration Service name.
d. If the Hadoop cluster uses Kerberos authentication, enter the following information:

▪ Hadoop Kerberos service principal name
▪ Hadoop Kerberos keytab location. Location of the keytab on the Data Integration Service machine.

Note: After you enter the Data Integration Service name, the utility tests the domain connection, and then recycles the Data Integration Service.

24. In the Connection Details section, provide the connection properties.

Based on the type of connection you choose to create, the utility requires different properties.

The Hadoop Configuration Manager creates the connections.

25. The Hadoop Configuration Manager reports a summary of its operations, including whether connection creation succeeded and the location of log files.

The Hadoop Configuration Manager creates the following file in the <Informatica installation directory>/tools/BDMUtil directory:

ClusterConfig.properties.<timestamp>: Contains details about the properties fetched from the Hadoop cluster, including cluster node names, to provide templates for connection creation commands. To use these connection creation templates to create connections to the Hadoop cluster, edit the connection name, domain username and password in the generated commands.