| Did this page help you? Yes No Tell us about it... |
The Amazon Elastic MapReduce (Amazon EMR) default configuration settings are appropriate for most workloads. However, based on your job flow’s specific memory and processing requirements, you might want to modify the configuration settings.
For example, if your job flow tasks are memory-intensive, you can use fewer tasks per core node and reduce your job tracker heap size. A predefined bootstrap action is available to configure your job flow on startup. For more information, see the Configure Memory-Intensive Workloads bootstrap action.
The following tables list the recommended configuration settings for each Amazon EC2 instance type. The default configurations for the cc1.4xlarge, cc2.8xlarge, and cg1.4xlarge instances are sufficient for memory-intensive workloads; therefore, the recommended configuration settings for these instances are not listed.
m1.small
| Parameter | Value |
|---|---|
| HADOOP_JOBTRACKER_HEAPSIZE | 512 |
| HADOOP_NAMENODE_HEAPSIZE | 512 |
| HADOOP_TASKTRACKER_HEAPSIZE | 256 |
| HADOOP_DATANODE_HEAPSIZE | 128 |
| mapred.child.java.opts | -Xmx512m |
| mapred.tasktracker.map.tasks.maximum | 2 |
| mapred.tasktracker.reduce.tasks.maximum | 1 |
m1.large
| Parameter | Value |
|---|---|
| HADOOP_JOBTRACKER_HEAPSIZE | 3072 |
| HADOOP_NAMENODE_HEAPSIZE | 1024 |
| HADOOP_TASKTRACKER_HEAPSIZE | 512 |
| HADOOP_DATANODE_HEAPSIZE | 512 |
| mapred.child.java.opts | -Xmx1024m |
| mapred.tasktracker.map.tasks.maximum | 3 |
| mapred.tasktracker.reduce.tasks.maximum | 1 |
m1.xlarge
| Parameter | Value |
|---|---|
| HADOOP_JOBTRACKER_HEAPSIZE | 9216 |
| HADOOP_NAMENODE_HEAPSIZE | 3072 |
| HADOOP_TASKTRACKER_HEAPSIZE | 512 |
| HADOOP_DATANODE_HEAPSIZE | 512 |
| mapred.child.java.opts | -Xmx1024m |
| mapred.tasktracker.map.tasks.maximum | 8 |
| mapred.tasktracker.reduce.tasks.maximum | 3 |
c1.medium
| Parameter | Value |
|---|---|
| HADOOP_JOBTRACKER_HEAPSIZE | 768 |
| HADOOP_NAMENODE_HEAPSIZE | 512 |
| HADOOP_TASKTRACKER_HEAPSIZE | 256 |
| HADOOP_DATANODE_HEAPSIZE | 128 |
| mapred.child.java.opts | -Xmx512m |
| mapred.tasktracker.map.tasks.maximum | 2 |
| mapred.tasktracker.reduce.tasks.maximum | 1 |
c1.xlarge
| Parameter | Value |
|---|---|
| HADOOP_JOBTRACKER_HEAPSIZE | 2048 |
| HADOOP_NAMENODE_HEAPSIZE | 1024 |
| HADOOP_TASKTRACKER_HEAPSIZE | 512 |
| HADOOP_DATANODE_HEAPSIZE | 512 |
| mapred.child.java.opts | -Xmx512m |
| mapred.tasktracker.map.tasks.maximum | 7 |
| mapred.tasktracker.reduce.tasks.maximum | 2 |
m2.xlarge
| Parameter | Value |
|---|---|
| HADOOP_JOBTRACKER_HEAPSIZE | 4096 |
| HADOOP_NAMENODE_HEAPSIZE | 2048 |
| HADOOP_TASKTRACKER_HEAPSIZE | 512 |
| HADOOP_DATANODE_HEAPSIZE | 512 |
| mapred.child.java.opts | -Xmx3072m |
| mapred.tasktracker.map.tasks.maximum | 3 |
| mapred.tasktracker.reduce.tasks.maximum | 1 |
m2.2xlarge
| Parameter | Value |
|---|---|
| HADOOP_JOBTRACKER_HEAPSIZE | 8192 |
| HADOOP_NAMENODE_HEAPSIZE | 4096 |
| HADOOP_TASKTRACKER_HEAPSIZE | 1024 |
| HADOOP_DATANODE_HEAPSIZE | 1024 |
| mapred.child.java.opts | -Xmx4096m |
| mapred.tasktracker.map.tasks.maximum | 6 |
| mapred.tasktracker.reduce.tasks.maximum | 2 |
m2.4xlarge
| Parameter | Value |
|---|---|
| HADOOP_JOBTRACKER_HEAPSIZE | 8192 |
| HADOOP_NAMENODE_HEAPSIZE | 8192 |
| HADOOP_TASKTRACKER_HEAPSIZE | 1024 |
| HADOOP_DATANODE_HEAPSIZE | 1024 |
| mapred.child.java.opts | -Xmx4096m |
| mapred.tasktracker.map.tasks.maximum | 14 |
| mapred.tasktracker.reduce.tasks.maximum | 4 |