Configuring High Availability
When you use the Hadoop Configuration Manager to configure Big Data Management, it enables Big Data Management to read from and write to a highly available Hadoop cluster.
A highly available Hadoop cluster can provide uninterrupted access to the JobTracker, name node, and ResourceManager in the cluster. The JobTracker is the service within Hadoop that assigns MapReduce jobs on the cluster. The name node tracks file data across the cluster. The ResourceManager tracks resources and schedules applications in the cluster.
The Hadoop Configuration Manager configures the Data Integration Service to read from and write to a highly available cluster on the following distributions:
- •Azure HDInsight
- •Cloudera CDH
- •Hortonworks HDP
- •IBM BigInsights