Hadoop Files Connector > Synchronization Tasks with Hadoop Files > Hadoop Files Sources in Synchronization Tasks
  

Hadoop Files Sources in Synchronization Tasks

When you configure a synchronization task to use a Hadoop Files source, you can configure the source properties.
The source properties appear on the Source page of the Synchronization Task wizard when you specify a Hadoop Files connection.
The general properties display the name and description of the Hadoop Files source. Configure the source properties for the source object.
The following table describes the Hadoop Files source properties:
Property
Description
Connection
Name of the connection.
Source Type
Select Single.
Source Object
Select the file from which you want to read data. Though selecting a source object is mandatory, the agent ignores this object. The agent processes the source object specified in File Path in advanced source properties.
You can also configure advanced source properties when you schedule the synchronization task. Advanced source properties appear on the Schedule page of the Synchronization Task wizard.
The following table describes the Hadoop Files advanced source properties:
Advanced Property
Description
File Path
Mandatory. Location of the file or directory from which you want to read data. Maximum length is 255 characters. If the path is a directory, all the files in the directory must have the same file format.
If the file or directory is in HDFS, enter the path without the node URI. For example, /user/lib/testdir specifies the location of a directory in HDFS. The path must not contain more than 512 characters.
If the file or directory is in the local system, enter the fully qualified path. For example, /user/testdir specifies the location of a directory in the local system.
File Format
File format. Default is Binary.
Compression Format
Optional. Compression format of the source files. Select one of the following options:
  • - AUTO
  • - DEFLATE
  • - gzip
  • - bzip2
Select AUTO to detect the compression format based on the source file extension.