Data Integration Service
The Data Integration Service receives requests from Informatica client tools to run integration, profile, and data preparation jobs. It writes results to different databases, and it writes run-time metadata to the Model repository. When you create the service, you need to associate another application service with it.
The following table lists the dependencies for products, services, and databases that are associated with the Data Integration Service.
Dependency | Summary |
---|
Products | The following products use the Data Integration Service: - - Big Data Management
- - Big Data Parsing
- - Big Data Quality
- - Big Data Streaming
- - Enterprise Data Catalog
- - Enterprise Data Lake
|
Services | The Data Integration Service requires a direct association with the following service: - - Model Repository Service
|
Databases | The Data Integration Service uses the following databases: - - Data object cache. Stores cached logical data objects and virtual tables.
- - Profiling warehouse. Stores profiling information, such as profile and scorecard results.
- - Workflow database. Stores run-time metadata for workflows.
|
Installer | You can create the Data Integration Service when you run the installer. |
Rules and Guidelines
Consider the following rules and guidelines for Data Integration Service creation:
- •If you plan to use operating system profiles for Big Data Management, you must create a dedicated Data Integration Service for Enterprise Data Catalog. Enterprise Data Catalog does not support operating system profiles.
- •Informatica recommends creating a dedicated Data Integration Service for Enterprise Data Lake.
Data Object Cache Database Requirements
The data object cache database stores cached logical data objects and virtual tables for the Data Integration Service. You specify the data object cache database connection when you create the Data Integration Service.
The data object cache database supports the following database types:
- •IBM DB2 UDB
- •Microsoft SQL Server
- •Oracle
Allow 200 MB of disk space for the database.
Note: Ensure that you install the database client on the machine on which you want to run the Data Integration Service.
IBM DB2 Database Requirements
Use the following guidelines when you set up the repository on IBM DB2:
- •Verify that the database user account has CREATETAB and CONNECT privileges.
- •Informatica does not support IBM DB2 table aliases for repository tables. Verify that table aliases have not been created for any tables in the database.
- •Set the tablespace pageSize parameter to 32768 bytes.
- •Set the NPAGES parameter to at least 5000. The NPAGES parameter determines the number of pages in the tablespace.
Microsoft SQL Server Database Requirements
Use the following guidelines when you set up the repository on Microsoft SQL Server:
- •Verify that the database user account has CONNECT and CREATE TABLE privileges.
Oracle Database Requirements
Use the following guidelines when you set up the repository on Oracle:
- • Verify that the database user has the following privileges:
- - CREATE INDEX
- - CREATE SESSION
- - CREATE SYNONYM
- - CREATE TABLE
- - CREATE VIEW
- - DROP TABLE
- - INSERT INTO TABLE
- - UPDATE TABLE
- •Informatica does not support Oracle public synonyms for repository tables. Verify that public synonyms have not been created for any tables in the database.
Profiling Warehouse Requirements
The profiling warehouse database stores profiling and scorecard results. You specify the profiling warehouse connection when you create the Data Integration Service.
The profiling warehouse supports the following database types:
- •IBM DB2 UDB
- •Microsoft SQL Server
- •Oracle
Allow 10 GB of disk space for the database.
Note: Ensure that you install the database client on the machine on which you want to run the Data Integration Service. You can specify a JDBC connection or Hive connection as a profiling warehouse connection for IBM DB2 UDB, Microsoft SQL Server, and Oracle database types.
For more information about configuring the database, see the documentation for your database system.
IBM DB2 Database Requirements
Use the following guidelines when you set up the repository on IBM DB2:
- •The database user account must have the CREATETAB, CONNECT, CREATE VIEW, and CREATE FUNCTION privileges.
- •Informatica does not support IBM DB2 table aliases for repository tables. Verify that table aliases have not been created for any tables in the database.
- •Set the tablespace pageSize parameter to 32768 bytes.
- •Set the NPAGES parameter to at least 5000. The NPAGES parameter determines the number of pages in the tablespace.
Note: Informatica does not support the partitioned database environment for IBM DB2 databases when you use a JDBC connection as the profiling warehouse connection.
Microsoft SQL Server Database Requirements
Use the following guidelines when you set up the repository on Microsoft SQL Server:
- •The database user account must have the CONNECT, CREATE TABLE, CREATE VIEW, and CREATE FUNCTION privileges.
Oracle Database Requirements
Use the following guidelines when you set up the repository on Oracle:
- • Verify that the database user has the following privileges:
- - ALTER TABLE
- - CREATE ANY INDEX
- - CREATE PROCEDURE
- - CREATE SESSION
- - CREATE TABLE
- - CREATE VIEW
- - DROP TABLE
- - UPDATE TABLE
- •Informatica does not support Oracle public synonyms for repository tables. Verify that public synonyms have not been created for any tables in the database.
- •Set the tablespace parameter.
- •Set the following parameters to the Informatica recommended values:
Parameter | Recommended Value |
---|
open_cursors | 3000 |
Sessions | 1000 |
Processes | 1000 |
Workflow Database Requirements
The Data Integration Service stores run-time metadata for workflows in the workflow database. Before you create the workflow database, set up a database and database user account for the workflow database.
You specify the workflow database connection when you create the Data Integration Service.
The workflow database supports the following database types:
- •IBM DB2 UDB
- •Microsoft SQL Server
- •Oracle
Allow 200 MB of disk space for the database.
Note: Ensure that you install the database client on the machine on which you want to run the Data Integration Service.
IBM DB2 Database Requirements
Use the following guidelines when you set up the repository on IBM DB2:
Microsoft SQL Server Database Requirements
Use the following guidelines when you set up the repository on Microsoft SQL Server:
Oracle Database Requirements
Use the following guidelines when you set up the repository on Oracle: