Property | Description |
---|---|
File Protocol | Choose Local File to scan the files in the local Windows or Linux machine. |
Path | Specify the absolute path of the file from where you want to import the metadata into the catalog. Make sure that the absolute path that you enter in this field is available on the machine where you have installed Enterprise Data Catalog and all the cluster nodes. |
Test Connection | Click Test Connection to verify the connection to the specified location. |
Property | Description |
---|---|
File Protocol | Choose SFTP to scan the files on a remote Linux machine. |
User Name | Enter the user name to access the Linux machine. |
Password | Enter the password to access the Linux machine. |
Host | Specify the host name or IP address of the Linux machine. |
Port | Specify the port number on which the SFTP file protocol is configured on the machine. The default value is 22. |
Path | Specify the network or shared path of the file from where you want to import the metadata into the catalog. |
Test Connection | Click Test Connection to verify the connection to the specified location. |
Property | Description |
---|---|
File Protocol | Choose SMB/CIFS to scan the files located on a remote Windows machine. |
User Name | Enter the user name to access the Windows machine. |
Password | Enter the password to access the Windows machine. |
Physical Location | Applies to flat files. Enter the absolute path of the shared directory on the Windows machine. For example, assume that the files are located in the c:\user1\1021\SMB\test_files\test1 and c:\user1\1021\SMB\test_files\test2 folders. You map the c:\user1\1021\SMB location which results in the \\SMB\ shared directory. In this scenario, enter \\SMB\ in the Physical Location field. For more information about how Enterprise Data Catalog uses the physical location field for lineage, see Appendix F - Lineage between PowerCenter Resources and Flat Files. |
Host | Specify the host name or IP address of the machine. |
Path | Specify the network or shared path from where you want to import the metadata into the catalog. For example, enter \\SMB\test_files\ in the Path field. |
Test Connection | Click Test Connection to verify the connection to the specified location. |
Property | Description |
---|---|
Enable Source Metadata | Extracts and ingests metadata from the data source. |
File Types | Select any or all of the following file types from which you want to extract metadata:
Note: You can select Specific File Types option in the dialog box to select files under all the categories. |
Treat Files Without Extension As | Select one of the following options to identify files without an extension:
|
Enter File Delimiter | You can specify one of the following delimiters:
Verify that you enclose the delimiter in single quotes. For example, '$'. Use a comma to separate multiple delimiters. For example, '$','%','&' If you specify a multi-character delimiter, do not specify any other delimiter for the scanner. When you run a file system resource with a multi-character delimiter, the scanner considers the files that contain the delimiter as valid CSV files and the remaining files as unstructured files while processing the files. The following resources do not support multi-character delimiter for CSV files in Enterprise Data Catalog:
|
Other File Types | Extracts basic file metadata such as size of the file, path to the file, and time stamp information from other file types. |
First Level Directory | Specify a directory or a list of directories under the source directory. If you leave this option blank, Enterprise Data Catalog imports all the files from the specified source directory. To specify a directory or a list of directories, you can perform the following steps:
Note: If you are selecting multiple directories, you must separate the directories using a semicolon (;). |
Include Subdirectory | Select this option to import all the files in the subdirectories under the source directory. |
Case Sensitive | Specifies that the resource is configured for case sensitivity. Select one of the following values:
The default value is True. |
Memory | The memory required to run the scanner job. Select one of the following values based on the data set size imported:
Note: For more information about the memory values, see the Tuning Enterprise Data Catalog Performance article on How To-Library Articles tab in the Informatica Doc Portal |
JVM Options | JVM parameters that you can set to configure scanner container. Use the following arguments to configure the parameters:
|
Track Data Source Changes | View metadata source change notifications in Enterprise Data Catalog. |