Installation and Configuration Guide > Part III: Run the Big Data Suite Installer > Create a Domain and Install All Big Data Products > Begin the Install
  

Begin the Install

This task includes installer prompts to begin the installation. You will provide basic information such as acceptance of terms, installation option, and the installation directory.
When you complete the preliminary tasks, you will continue with the installer prompts and will provide information to configure the domain.

Run the Installer

Perform the following steps to run the installer:
    1. Log in to the machine with a system user account.
    2. Close all other applications.
    3. On a shell command line, run the install.sh file from the root directory.
    The installer displays the message to verify that the locale environment variables are set.
    4. If the environment variables are not set, press n to exit the installer and set them as required.
    If the environment variables are set, press y to continue.
    5. Press 1 to install Informatica Big Data Suite Products.
    The installer displays different options based on the platform you are installing on.
    The following options appear:
    1. a. Press 1 to run the Pre-Installation System Check Tool.
    2. For more information about running the Pre-Installation (i10Pi) System Check Tool, see Run the Pre-Installation (i10Pi) System Check Tool.
    3. b. Press 2 to run the Informatica Kerberos SPN Format Generator.
    4. For more information about running the Informatica Kerberos SPN Format Generator, see Running the SPN Format Generator on Linux.
    5. c. Press 3 to run the installer.
The Welcome section appears.

Accept Terms and Conditions

    1. Read the terms and conditions for Informatica installation and the product usage toolkit and select I agree to the terms and conditions.
    Informatica DiscoveryIQ is a product usage tool that sends routine reports on data usage and system statistics to Informatica. Informatica DiscoveryIQ uploads data to Informatica 15 minutes after you install and configure Informatica domain. Thereafter, the domain sends the data every 30 days. You can choose to disable usage statistics from the Administrator tool.
    1. a. Press 1 if you do not want to accept the terms and conditions.
    2. b. Press 2 to accept the terms and conditions.
    2. Version 10.2.1 is for big data products only, such as Big Data Management and Big Data Quality. This version does not support non-big data products, such as PowerCenter or Informatica Data Quality.
    1. a. Press 1 and type quit to quit the installation.
    2. b. Press 2 to continue the installation.
    If you choose to not accept the terms and condition, the installer prompts you to accept the terms and conditions.
The Component Selection sections appears.

Choose the Installation Option

After you accept terms and conditions, you can install Informatica domain services supporting Big Data Management services, Enterprise Data Catalog, and Enterprise Data Lake.
    1. Press 3 to install Informatica Enterprise Data Lake.
    When you select this option, you can choose to install Only Enterprise Data Lake, Enterprise Data Catalog and Enterprise Data Lake, and Informatica domain services, supporting Big Data Management services, Enterprise Data Catalog, and Enterprise Data Lake.
    2. Select whether current version of the Informatica domain services is installed on the node.
    1. a. Press 1 if current version of the Informatica domain services is not installed on the node.
    2. b. Press 2 if current version of the Informatica domain services is installed on the node.
    3. Select the Hadoop cluster type for Enterprise Data Catalog.
    1. a. Press 1 to select External cluster type.
    2. b. Press 2 to select Embedded cluster type.
    3. If you choose to run on an internal cluster, the installer creates a cluster to run the services.
    4. Select whether you have read and accepted terms and conditions to use Java SE Development Kit software.
    1. a. Press 1 to not accept the terms and conditions to use Java SE Development Kit software.
    2. b. Press 2 to accept the terms and conditions to use Java SE Development Kit software.
The Installation Prerequisites section displays the installation requirements. Verify that all requirements are met before you continue the installation.

Tune the Application Service

After you review the installation prerequisites, you can choose to tune the application services for better performance based on the deployment type in your environment. If you do not tune now, you can tune the services later through infacmd.
    1. Select if you want the installer to tune the application services.
    Prompt
    Description
    Select if you want the installer to tune the application services.
    Select if you want to tune the services.
    1 - No
    2 - Yes
    Select no if you do not want to tune the services.
    Select yes if you want to tune the services.
    If you are joining the node to existing domain, ensure the deployment type you select here is same deployment type as the gateway nodes.
    2. Select the deployment type associated with the Informatica environment.
    Prompt
    Description
    1. Sandbox
    Choose this option if the environment is used for proof of concepts or as a sandbox environment with minimal users. Sandbox environments are typically configured with 16 cores, 32 GB RAM, and about 50 GB disk space.
    2. Basic
    Choose this option if the environment is used for low volume processing environments with low levels of concurrency. Basic environments are typically single- or multi-node setups configured with 24 cores, 64 GB RAM, and about 100 GB disk space.
    3. Standard
    Choose this option if the environment is used for high volume processing but with low levels of concurrency. Standard environments are typically multi-node setups configured with 64 GB RAM, more than 100 GB disk space per node, and 48 cores across nodes.
    4. Advanced
    Choose this option if the environment is used for high volume processing with high levels of concurrency. Advanced environments are typically multi-node setups configured with 128 GB RAM, more than 100 GB disk space per node, and 96 cores across nodes.
    3. Select whether you want to change the deployment type or continue with the current deployment selection.
    1. a. Press 1 to change the deployment type.
    2. b. Press 2 to continue with the current deployment selection.
The License and Installation Directory section appears.

Specify the Installation Directory

After you verify the installation prerequisites, you can specify the installation directory.
    1. Enter the path to the license key file and press Enter.
    2. Enter the installation directory.
    The directory names in the path must not contain spaces or the following special characters: @|* $ # ! % ( ) { } [ ] , ; ' Default is /home/toolinst.
    Note: Informatica recommends using alphanumeric characters in the installation directory path. If you use a special character such as á or €, unexpected results might occur at run time.
    3. Select whether you want to run the pre-validation utility.
    1. a. Press 1 to skip the pre-validation utility.
    2. b. Press 2 to run the pre-validation utility.
    The utility helps you validate the prerequisites to install Enterprise Data Catalog in an embedded cluster. The utility also validates the Informatica domain, cluster hosts, and the Hadoop cluster services configuration.
If you choose to skip the pre-validation utility, the Pre-Installation Summary section appears. Review the installation summary.
If choose to run the pre-validation utility, the Embedded Hadoop Cluster Pre-Validaion section appears.

Prepare Pre-validation for the External Cluster

If you chose to use external Hadoop cluster, you can configure the external Hadoop cluster.
    1. Select if there is passwordless Secure Shell connection between the Informatica domain and the cluster nodes.
    1. a. Press 1 to secure the connection between the Informatica domain and the cluster nodes.
    2. b. Press 2 if you do not want to secure the connection between the Informatica domain and the cluster nodes.
    2. If you chose to secure the connection between the Informatica domain and the cluster nodes, enter the secure connection information.
    The table describes the information you can enter to secure the connection:
    Property
    Description
    Host SSH user
    User to connect to other hosts from the domain host.
    SSH host names
    Cluster nodes that you want to connect using passwordless Secure shell.
    3. Select whether the cluster uses Kerberos authentication.
    1. a. Press 1 if the Hadoop cluster uses Kerberos authentication.
    2. b. Press 2 if the Hadoop cluster does not use Kerberos authentication.
    3. If you chose that the cluster uses Kerberos authentication, enter the Kerberos information.
      Property
      Description
      Keytab Location
      Location of the merged user and host keytab file.
      KDC Domain Name
      Name of the KDC domain.
    4. Select if you want to validate the cluster configuration.
    1. a. Press 1 to validate the cluster configuration.
    2. b. Press 2 if you do not want to validate the cluster configuration.
    5. Select if you want the installer to automatically configure the cluster.
    1. a. Press 1 if you do not want the installer to automatically configure the cluster. If you do not choose this option, you cannot validate the cluster configuration.
    2. b. Press 2 if you want the installer to automatically configure the cluster.
    6. If you chose to validate the cluster configuration, you can select the cluster type to configure.
    The following table describes the options you can select the cluster type to configure:
    Option
    Description
    Cloudera
    Select to create a cluster configuration for a Cloudera cluster.
    Hortonworks
    Select to create a cluster configuration for a Hortonworks cluster.
    Azure HDInsight
    Select to create a cluster configuration for a Azure HDInsight cluster.
    7. Enter the information to configure the cluster.
    The following table describes the properties you need to set for configuring the cluster.
    Option
    Description
    Cluster Hadoop distribution URL
    URL to access the Hadoop cluster.
    Cluster Hadoop distribution URL user
    User name to access the Hadoop cluster.
    Cluster Hadoop distribution URL password
    Password to access the Hadoop cluster.
    YARN Queue Name
    The YARN scheduler queue name used by the Blaze engine that specifies available resources on a cluster.
    Service cluster name
    Name of the service cluster.

Perform Pre-validation for the Embedded Hadoop Cluster

If you chose to use embedded Hadoop cluster, you can configure the embedded Hadoop cluster.
    1. Select whether you want to enable Kerberos authentication for the cluster.
    1. a. Press 1 to configure the Hadoop cluster to run on a network that does not use Kerberos authentication.
    2. b. Press 2 to configure the Hadoop cluster to run on a network that uses Kerberos authentication.
    3. If you chose to run on a network that uses Kerberos authentication, enter the Kerberos information.
      Property
      Description
      Keytab Location
      Location of the merged user and host keytab file.
      Kerberos Configuration File
      Location of the Kerberos configuration file.
    2. Enter the gateway user name and press Enter. Default is root.
    3. Enter the Informatica Hadoop cluster gateway hostname in the following format: <hostname>.<FQDN> and press Enter.
    4. Enter the list of comma-separated Informatica Hadoop cluster nodes as shown in the following format: <hostname>.<FQDN>, <hostname1>.<FQDN>, <hostname2>.<FQDN> and press Enter.
    5. Enter the Informatica Hadoop cluster gateway port and press Enter. Default is 8080.
    Make sure that you do not configure Oracle with port 8080 on the same machine where Informatica Cluster Service runs.
    6. Enter the path to the working directory and press Enter. The path indicates the location where you want to mount the Informatica Cluster Service.
    The installer starts the pre-validation utility.
The Pre-Installation Summary section appears. Review the installation summary.