Deployment Type | Description |
|---|---|
Sandbox | Used for proof of concepts or as a sandbox with minimal users. |
Basic | Used for low volume processing with low levels of concurrency. |
Standard | Used for high volume processing with low levels of concurrency. |
Advanced | Used for high volume processing with high levels of concurrency. |
Deployment Criteria | Sandbox | Basic | Standard | Advanced |
|---|---|---|---|---|
Total data volume | 2-10 GB | 10 GB - 2 TB | 2 TB - 50 TB | 50 TB+ |
Number of nodes in the Hadoop environment | 2 - 5 | 5 - 25 | 25 - 100 | 100+ |
Number of developers | 2 | 5 | 10 | 50 |
Number of concurrent jobs in the Hadoop environment | < 10 | < 250 | < 500 | 2000+ |
Number of Model repository objects | <1000 | < 5000 | 5000 - 20000 | 20000+ |
Number of deployed applications | < 10 | < 25 | < 100 | < 500 |
Number of objects per deployed application | < 10 | < 50 | < 100 | < 100 |
Service | Sandbox | Basic | Standard | Advanced |
|---|---|---|---|---|
Analyst Service | 768 MB | 1 GB | 2 GB | 4 GB |
Content Management Service | 1 GB | 2 GB | 4 GB | 4 GB |
Data Integration Service | 640 MB | 2 GB | 4 GB | 6 GB |
Model Repository Service | 1 GB | 1 GB | 2 GB | 4 GB |
Resource Manager Service | 512 MB | 512 MB | 2 GB | 4 GB |
Search Service | 768 MB | 1 GB | 2 GB | 4 GB |
Run-time Environment | Sandbox | Basic | Standard | Advanced |
|---|---|---|---|---|
Native | 10 | 10 | 15 | 30 |
Hadoop | 10 | 500 | 1000 | 2000 |
Property | Description |
|---|---|
spark.driver.memory | The driver process memory that the Spark engine uses to run mapping jobs. |
spark.executor.memory | The amount of memory that each executor process uses to run tasklets on the Spark engine. |
spark.executor.cores | The number of cores that each executor process uses to run tasklets on the Spark engine. |
spark.sql.shuffle.partitions | The number of partitions that the Spark engine uses to shuffle data to process joins or aggregations in a mapping job. |
Property | Sandbox | Basic | Standard | Advanced |
|---|---|---|---|---|
spark.driver.memory | 1 GB | 2 GB | 4 GB | 4 GB |
spark.executor.memory | 2 GB | 4 GB | 6 GB | 6 GB |
spark.executor.cores | 2 | 2 | 2 | 2 |
spark.sql.shuffle.partitions | 100 | 400 | 1500 | 3000 |
Property | Description | Value |
|---|---|---|
infagrid.orch.scheduler.oop.container.pref.memory | The amount of memory that the Blaze engine uses to run tasklets. | 5120 |
infagrid.orch.scheduler.oop.container.pref.vcore | The number of DTM instances that run on the Blaze engine. | 4 |
infagrid.tasklet.dtm.buffer.block.size | The amount of buffer memory that a DTM instance uses to move a block of data in a tasklet. | 6553600 |
* The tuned properties do not depend on the deployment type. | ||
Option | Argument | Description |
|---|---|---|
-DomainName -dn | domain_name | Required. Name of the Informatica domain. You can set the domain name with the -dn option or the environment variable INFA_DEFAULT_DOMAIN. If you set a domain name with both methods, the -dn option takes precedence. |
-UserName -un | user_name | Required if the domain uses Native or LDAP authentication. User name to connect to the domain. You can set the user name with the -un option or the environment variable INFA_DEFAULT_DOMAIN_USER. If you set a user name with both methods, the -un option takes precedence. Optional if the domain uses Kerberos authentication. To run the command with single sign-on, do not set the user name. If you set the user name, the command runs without single sign-on. |
-Password -pd | password | Required if you specify the user name. Password for the user name. The password is case sensitive. You can set a password with the -pd option or the environment variable INFA_DEFAULT_DOMAIN_PASSWORD. If you set a password with both methods, the password set with the -pd option takes precedence. |
-SecurityDomain -sdn | security_domain | Required if the domain uses LDAP authentication. Optional if the domain uses native authentication or Kerberos authentication. Name of the security domain to which the domain user belongs. You can set a security domain with the -sdn option or the environment variable INFA_DEFAULT_SECURITY_DOMAIN. If you set a security domain name with both methods, the -sdn option takes precedence. The security domain name is case sensitive. If the domain uses native or LDAP authentication, the default is Native. If the domain uses Kerberos authentication, the default is the LDAP security domain created during installation. The name of the security domain is the same as the user realm specified during installation. |
-ResilienceTimeout -re | timeout_period_in_seconds | Optional. Amount of time in seconds that infacmd attempts to establish or re-establish a connection to the domain. You can set the resilience timeout period with the -re option or the environment variable INFA_CLIENT_RESILIENCE_TIMEOUT. If you set the resilience timeout period with both methods, the -re option takes precedence. |
-Size -s | tuning_size_name | Required. The deployment type that represents big data processing requirements based on concurrency and volume. You can enter Sandbox, Basic, Standard, or Advanced. |
-ServiceNames -sn | service_names | Optional. List of services configured in the Informatica domain. Separate each service name with a comma. You can tune the following services:
Default is none. |
-BlazeConnectionNames -bcn | connection_names | Optional. List of Hadoop connections configured in the Informatica domain. For each Hadoop connection, the command tunes Blaze configuration properties in the Hadoop connection. Separate each Hadoop connection name with a comma. Default is none. |
-SparkConnectionNames -scn | connection_names | Optional. List of Hadoop connections configured in the Informatica domain. For each Hadoop connection, the command tunes Spark configuration properties in the Hadoop connection. Separate each Hadoop connection name with a comma. Default is none. |
-All -a | yes_or_no | Optional. Enter yes to apply recommended settings to all Analyst Services, Content Management Services, Data Integration Services, Model Repository Services, Resource Manager Services, Search Services, and Hadoop connections in the Informatica domain. Enter no to apply the recommended settings only to the services and Hadoop connections that you specify. Default is no. |