Big Data
This section describes new Big Data features in version 9.6.1.
Data Types in a Hive Environment
You can push high precision Decimal data types to a Hive environment that uses Hive 0.11 and above.
If the mapping is not enabled for high precision, the Data Integration Service converts all decimal values to double values.
If the mapping is enabled for high precision, the Data Integration Service converts decimal values with a precision greater than 28 to double values.
For more information, see the Informatica 9.6.1 Big Data Edition User Guide.
Hive Connection Properties
In the Hive connection, you specify the following properties:
- •Enter advanced Hive or Hadoop properties to configure or override Hive or Hadoop cluster properties in hive-site.xml on the machine on which the Data Integration Service runs.
- •Enter the user name of the user that the Data Integration Service impersonates to run mappings on the Hadoop cluster.
For more information, see the Informatica 9.6.1 Big Data Edition User Guide.
User Authentication
You can enable the Data Integration Service to run mapping and workflow jobs on a Hadoop cluster that uses Kerberos authentication. The Hadoop cluster authenticates the SPN of the Data Integration Service user account to run mapping and workflow jobs on the Hadoop cluster. To enable another user to run jobs on the Hadoop cluster, you can configure the SPN of the Data Integration Service user account to impersonate another user account.
For more information, see the Informatica 9.6.1 Big Data Edition User Guide.
Mappings on Hadoop Distributions
You can enable mappings to run on the following Hadoop distributions:
- •Cloudera CDH 5.0
- •Hortonworks HDP 2.0
- •MapR 3.1
- •Pivotal HD 1.1
For more information, see the Informatica 9.6.1 Big Data Edition Installation and Configuration Guide.