User Guide > Connections > HDFS Connection
  

HDFS Connection

Create an HDFS connection to write data to an HDFS binary or sequence file.
You can create and manage an HDFS connection in the Developer tool or through infacmd.

General Properties

The following table describes the general connection properties for the HDFS connection:
Property
Description
Name
The name of the connection. The name is not case sensitive and must be unique within the domain. You can change this property after you create the connection. The name cannot exceed 128 characters, contain spaces, or contain the following special characters:
~ ` ! $ % ^ & * ( ) - + = { [ } ] | \ : ; " ' < , > . ? /
ID
The string that the Data Integration Service uses to identify the connection. The ID is not case sensitive. It must be 255 characters or less and must be unique in the domain. You cannot change this property after you create the connection. Default value is the connection name.
Description
The description of the connection. Enter a string that you can use to identify the connection. The description cannot exceed 4,000 characters.
Location
The domain where you want to create the connection. Select the domain name.
Type
The connection type. Select Hadoop File System.

Connection Properties

The following table describes the connection properties for the HDFS connection:
Property
Description
User Name
User name to access HDFS.
NameNode URI
The URI to access HDFS.
Use the following format to specify the NameNode URI in Cloudera and Hortonworks distributions:
hdfs://<namenode>:<port>
Where
  • - <namenode> is the host name or IP address of the NameNode.
  • - <port> is the port that the NameNode listens for remote procedure calls (RPC).
Use the maprfs:/// format to specify the NameNode URI in MapR distribution.
Cluster Configuration
The name of the cluster configuration associated with the Hadoop environment.
For more information about cluster configuration, see the Informatica Big Data Management Administrator Guide.

Creating an HDFS Connection using Informatica Command Line Program

You can use the infacmd command line program to create an HDFS connection. Access the command from the <Informatica installation directory>/clients/DeveloperClient/infacmd directory.
On UNIX, run the following command:
sh infacmd.sh createConnection
On Windows, run the following command:
infacmd.bat createConnection
Enter connection options in the following format:
... -o option_name='value' option_name='value' ...
For example, run the following command to create an HDFS connection on UNIX:
sh infacmd.sh createConnection -dn <domain name> -un <domain user> -pd <domain password> -cn <connection name> -cid <connection id> -ct HadoopFileSystem -o NameNodeURL='hdfs://<namenode>:<port>' USERNAME='username' clusterConfigId='ConfId'
For more information about the CreateConnection command and HDFS connection options, see the Informatica Command Reference Guide.