Deployment Types
A deployment type represents big data processing requirements based on concurrency and volume.
The deployment type defines the amount of resources that application services and run-time engines require to function efficiently, and how resources should be allocated between the application services and run-time engines. The deployment types are Sandbox, Basic, Standard, and Advanced.
The following table describes the deployment types:
Deployment Type | Description |
---|
Sandbox | Used for proof of concepts or as a sandbox with minimal users. |
Basic | Used for low volume processing with low levels of concurrency. |
Standard | Used for high volume processing with low levels of concurrency. |
Advanced | Used for high volume processing with high levels of concurrency. |
Each deployment type is described using deployment criteria. The deployment criteria is a set of characteristics that are common to each deployment type. Use the deployment criteria to help you understand the deployment type that best fits the environment that you use for big data processing.
The following table defines the deployment criteria:
Deployment Criteria | Sandbox | Basic | Standard | Advanced |
---|
Total data volume | 2-10 GB | 10 GB - 2 TB | 2 TB - 50 TB | 50 TB+ |
Number of nodes in the Hadoop environment | 2 - 5 | 5 - 25 | 25 - 100 | 100+ |
Number of developers | 2 | 5 | 10 | 50 |
Number of concurrent jobs in the Hadoop environment | < 10 | < 250 | < 500 | 2000+ |
Number of Model repository objects | <1000 | < 5000 | 5000 - 20000 | 20000+ |
Number of deployed applications | < 10 | < 25 | < 100 | < 500 |
Number of objects per deployed application | < 10 | < 50 | < 100 | < 100 |
For example, you estimate that your environment handles an average of 400 concurrent jobs and a data volume of 35 TB. According to the deployment criteria, the deployment type that best describes your environment is Standard.