Configure Hive Connector to download the distribution-specific Hive libraries

You must configure Hive Connector to download the distribution-specific Hive third-party libraries. The Informatica Hive third-party script and the Informatica Hive third-party properties files are part of the Hive Connector package in the Secure Agent installation.

Distributions applicable for Hive mappings

You can utilize the following distribution versions when you use Hive Connector to run mappings:

Distributions applicable for Hive mappings in advanced mode

You can utilize the following distribution versions when Hive Connector runs on the advanced cluster:

Perform the following tasks to download the distribution-specific Hive third-party libraries before you use Hive Connector:
  1. Run the script to copy the third-party libraries to the Secure Agent location. Ensure that you have full permissions to the directories where the Hive libraries are copied.
     The script is interactive. When prompted, specify the job type and the Hadoop cluster that you want to use.
  2. Add the runtime DTM property, INFA_HADOOP_DISTRO_NAME, and set its value to the distribution that you want to use.
  3. Restart the Secure Agent.
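
The permission requirement in step 1 can be checked before you run the script. The following is a minimal sketch, not part of the Hive Connector package: has_full_access is a hypothetical helper, and TARGET_DIR is a placeholder for whichever directory the libraries are copied to in your installation.

```shell
#!/bin/sh
# has_full_access DIR
# Returns 0 when DIR exists and the current user can read, write, and traverse it.
has_full_access() {
    dir="$1"
    [ -d "$dir" ] && [ -r "$dir" ] && [ -w "$dir" ] && [ -x "$dir" ]
}

# TARGET_DIR is a placeholder (assumption); point it at the directory
# where the Hive libraries are copied in your environment.
TARGET_DIR="${TARGET_DIR:-/tmp}"

if has_full_access "$TARGET_DIR"; then
    echo "OK: full access to $TARGET_DIR"
else
    echo "Missing permissions on $TARGET_DIR" >&2
fi
```

Run the check as the same operating system user that runs the Secure Agent, because the test reflects the permissions of the calling user.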

Run the script on a Linux system

You can run the Hive third-party script either in the Secure Agent installation directory or outside the Secure Agent installation directory. The script and the Informatica Hive third-party property files are part of the Hive Connector package that is located in the Secure Agent installation directory.
When you run the Hive third-party script, you need to specify the Hadoop distribution that you want to use.
Based on the distribution you select, the Hadoop distribution directory is created in deploy_to_main/distros/Parsers/.
Note: In mappings, the CDH_6.1 option applies to Cloudera CDH 6.1, Cloudera CDP 7.1 private cloud, and Cloudera CDW 7.2 public cloud. In mappings in advanced mode, CDH_6.1 applies only to Cloudera CDH 6.1. In mappings, the EMR_5.20 option applies only to Amazon EMR 5.20, EMR 6.3, and EMR 6.4. In mappings in advanced mode, EMR_5.20 applies to EMR_6.1, EMR_6.2, and EMR_6.3.
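
The note can be read as a lookup from job type and cluster version to the option that you enter at the prompt. The sketch below encodes only the combinations stated in the note. The function name distro_option and the labels it accepts ("mappings", "advanced", "CDP 7.1", and so on) are chosen for this illustration and are not values that the script itself understands.

```shell
#!/bin/sh
# distro_option MODE PLATFORM
#   MODE:     "mappings" or "advanced" (labels chosen for this sketch)
#   PLATFORM: cluster label such as "CDP 7.1" (labels chosen for this sketch)
# Prints the option named in the note, or returns 1 when the note
# does not cover the combination.
distro_option() {
    case "$1:$2" in
        "mappings:CDH 6.1"|"mappings:CDP 7.1"|"mappings:CDW 7.2")
            echo "CDH_6.1" ;;
        "advanced:CDH 6.1")
            echo "CDH_6.1" ;;
        "mappings:EMR 5.20"|"mappings:EMR 6.3"|"mappings:EMR 6.4")
            echo "EMR_5.20" ;;
        "advanced:EMR 6.1"|"advanced:EMR 6.2"|"advanced:EMR 6.3")
            echo "EMR_5.20" ;;
        *)
            return 1 ;;
    esac
}
```

For example, distro_option mappings "CDP 7.1" prints CDH_6.1, while distro_option advanced "CDP 7.1" returns a non-zero status because the note does not list that combination.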

Run the script in the Secure Agent installation directory

Run the script in the Secure Agent installation directory when the Secure Agent has Internet access to download the third-party libraries.
    1. Go to the following Secure Agent installation directory where the Informatica Hive third-party script is located:
       <Secure Agent installation directory>/downloads/package-hiveadapter.<version>/package/hive/thirdparty/informatica.hiveadapter/scripts/
    2. From the terminal, run the following command from the scripts folder:
       sh downloadHiveLibs.sh
    3. When prompted, select the job type for which you want to run the script: Data Integration or CDI Advanced Mode.
    4. When prompted, specify the Hadoop distribution that you want to use.
       The third-party libraries are copied to the following directories based on the integration option you selected in step 3:
    5. Only for the HDInsight_4.0 or HDP_3.1 Hadoop distribution, perform the following tasks:
       a. Navigate to the directory where the Hive third-party libraries are available on the cluster machine.
          For example, the /usr/hdp/ directory on the cluster machine.
       b. Manually download and copy the Hive third-party libraries to the directory displayed in the prompt.
          The prompt also displays a list of the Hive third-party libraries that you need to download.
       The script fails to download a few Hive third-party libraries for the HDInsight_4.0 or HDP_3.1 Hadoop distribution because these repositories are private in Cloudera.
       For more information about private repositories in Cloudera, see Transition to private repositories.
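
Steps 1 and 2 can be combined in a small wrapper. This is a sketch under stated assumptions: AGENT_HOME is a hypothetical environment variable that stands for <Secure Agent installation directory>, find_scripts_dir is a hypothetical helper, and a shell glob stands in for <version>. If several package versions are installed, the glob returns the first match in sort order, which may not be the one you want.

```shell
#!/bin/sh
# find_scripts_dir AGENT_HOME
# Prints the first directory that matches the documented path pattern.
# The "*" stands in for <version>; returns 1 when nothing matches.
find_scripts_dir() {
    for d in "$1"/downloads/package-hiveadapter.*/package/hive/thirdparty/informatica.hiveadapter/scripts; do
        if [ -d "$d" ]; then
            echo "$d"
            return 0
        fi
    done
    return 1
}

# AGENT_HOME is an assumption, not a variable that the Secure Agent sets.
# Nothing runs unless you export it yourself.
if [ -n "${AGENT_HOME:-}" ] && scripts_dir=$(find_scripts_dir "$AGENT_HOME"); then
    # The script is interactive: answer the prompts described in steps 3 and 4.
    cd "$scripts_dir" && sh downloadHiveLibs.sh
fi
```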

Run the script outside the Secure Agent installation directory

Run the script outside the Secure Agent installation directory when the Secure Agent cannot download the third-party libraries because it has no Internet access or because of other network restrictions.
    1. Go to the following Secure Agent installation directory where the Informatica Hive third-party script is located:
       <Secure Agent installation directory>/downloads/package-hiveadapter.<version>/package/hive/thirdparty/informatica.hiveadapter/scripts/
    2. Copy the scripts folder outside the Secure Agent installation directory.
    3. From the terminal, run the following command from the scripts folder:
       sh downloadHiveLibs.sh
    4. When prompted, select the job type for which you want to run the script: Data Integration or CDI Advanced Mode.
    5. When prompted, specify the Hadoop distribution that you want to use.
       The third-party libraries are copied to the following directories based on the integration option you selected in step 4:
    6. Manually copy the generated directories with the third-party libraries to the Secure Agent installation location.
    7. Only for the HDInsight_4.0 or HDP_3.1 Hadoop distribution, perform the following tasks:
       a. Navigate to the directory where the Hive third-party libraries are available on the cluster machine.
          For example, the /usr/hdp/ directory on the cluster machine.
       b. Manually download and copy the Hive third-party libraries to the directory displayed in the prompt.
          The prompt also displays a list of the Hive third-party libraries that you need to download.
       The script fails to download a few Hive third-party libraries for the HDInsight_4.0 or HDP_3.1 Hadoop distribution because these repositories are private in Cloudera.
       For more information about private repositories in Cloudera, see Transition to private repositories.
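
The two manual copy operations in this procedure (step 2 and step 6) follow the same pattern, so one helper can cover both. The sketch below is illustrative only: copy_dir is a hypothetical function, and the paths in the usage comments are placeholders that you must replace with the locations from your own environment.

```shell
#!/bin/sh
# copy_dir SRC DEST_PARENT
# Copies the directory SRC, with its contents, into DEST_PARENT.
# Creates DEST_PARENT if it does not exist. Pass SRC without a trailing slash.
copy_dir() {
    [ -d "$1" ] || { echo "Not a directory: $1" >&2; return 1; }
    mkdir -p "$2" && cp -R "$1" "$2"/
}

# Usage sketch (placeholders, not real paths):
#   Step 2: copy_dir "<path to the scripts folder>" "<working directory>"
#   Step 6: copy_dir "<generated directory>" "<Secure Agent installation location>"
```

Because the Secure Agent machine has no Internet access in this scenario, the working directory typically sits on a different machine, and the copy in step 6 may instead go through a file transfer tool of your choice.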

Set the custom property for the Data Integration Service

In the Secure Agent properties, add the INFA_HADOOP_DISTRO_NAME property for the DTM and set its value to the distribution version that you want to use.
    1. Open Administrator and select Runtime Environments.
    2. Select the Secure Agent for which you want to configure the DTM property.
    3. In the upper-right corner of the page, click Edit.
    4. Add the following DTM properties in the Custom Configuration section:

Restart the Secure Agent

After you complete the configuration and set the properties, restart the Secure Agent for the changes to take effect.
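
On Linux, the restart can be scripted. The sketch below assumes that the agent control script is located at <Secure Agent installation directory>/apps/agentcore/infaagent and that it accepts the shutdown and startup arguments. This topic does not state either detail, so confirm both against your installation before you rely on the sketch. restart_agent is a hypothetical helper name.

```shell
#!/bin/sh
# restart_agent AGENT_HOME
# Assumption: AGENT_HOME/apps/agentcore/infaagent is the control script
# and it accepts "shutdown" and "startup".
restart_agent() {
    core="$1/apps/agentcore"
    [ -x "$core/infaagent" ] || { echo "infaagent not found in $core" >&2; return 1; }
    # Run in a subshell so that the caller's working directory is unchanged.
    ( cd "$core" && ./infaagent shutdown && ./infaagent startup )
}
```

The sketch starts the agent immediately after the shutdown command returns. If the agent takes time to stop in your environment, wait until it has stopped before you start it again.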