Application Service Guide > Data Integration Service Grid > Troubleshooting a Grid
  

Troubleshooting a Grid

I enabled a Data Integration Service that runs on a grid, but one of the service processes failed to start.
When you enable a Data Integration Service that runs on a grid, a service process starts on each node in the grid that has the service role. A service process might fail to start for the following reasons:
A job failed to run on a Data Integration Service grid. Which logs do I review?
If the Data Integration Service grid is configured to run jobs in the service process or in separate local processes, review the following logs in this order:
  1. 1. Job log accessible from the Monitor tab.
  2. Includes log events about how the DTM instance runs the job.
  3. 2. Data Integration Service log accessible from the Service view of the Logs tab.
  4. Includes log events about service configuration, processing, and failures.
If the Data Integration Service grid is configured to run jobs in separate remote processes, additional components write log files. Review the following logs in this order:
  1. 1. Job log accessible from the Monitor tab.
  2. Includes log events about how the DTM instance runs the job.
  3. 2. Data Integration Service log accessible from the Service view of the Logs tab.
  4. Includes log events about service configuration, processing, and failures. The Data Integration Service log includes the following message which indicates the host name and port number of the master compute node:
    INFO: [GRIDCAL_0204] The Integration Service [<MyDISName>] elected a new master compute node [<HostName>:<PortNumber>].
  5. 3. Master compute node log accessible in the cadi_services_0.log file located in the log directory configured for the master compute node.
  6. Includes log events written by the Service Manager on the master compute node about managing the grid of compute nodes and orchestrating worker service process requests. The master compute node logs are not accessible from the Administrator tool.
  7. 4. Resource Manager Service log accessible from the Service view of the Logs tab.
  8. Includes log events about service configuration and processing and about nodes with the compute role that register with the service.
  9. 5. Container management log accessible from the Domain view of the Logs tab. Select Container Management for the category.
  10. Includes log events about how the Service Manager manages containers on nodes with the compute role.
A mapping that ran in a separate remote process has an incomplete log file.
When a mapping runs on a Data Integration Service grid configured to run jobs in separate remote processes, the Data Integration Service writes two files for the mapping log. The worker service process that optimizes and compiles the mapping on the service node writes log events to one log file. The DTM process that runs the mapping on the compute node writes log events to another log file. When you access the mapping log, the Data Integration Service consolidates the two files into a single log file.
A mapping log might be incomplete for the following reasons:
To find the temporary DTM log file on the worker compute node, find the following message in the first section of the mapping log:
INFO: [GCL_5] The grid task [gtid-1443479776986-1-79777626-99] cluster logs can be found at [./1443479776986/taskletlogs/gtid-1443479776986-1-79777626-99].
The listed directory is a subdirectory of the following default log directory configured for the worker compute node:
<Informatica installation directory>/logs/<node name>/dtmLogs/