Amazon Elastic MapReduce
Developer Guide (API Version 2009-11-30)

Document History

The following table describes the important changes to the documentation since the last release of Amazon Elastic MapReduce (Amazon EMR).

API version: 2009-03-31.

Latest documentation update: April 9, 2012.

| Change | Description | Release Date |
| --- | --- | --- |
| AMI 2.0.5 | Enhancements to performance and other updates. For details, see AMI Versions Supported in Amazon EMR. | April 19, 2012 |
| Pig 0.9.2 | Amazon Elastic MapReduce supports Pig 0.9.2. Pig 0.9.2 adds support for user-defined functions written in Python and other improvements. For more information, go to Pig 0.9.2 Patches. | April 9, 2012 |
| Pig versioning | Amazon Elastic MapReduce supports the ability to specify the Pig version when launching a job flow. For more information, go to Pig Configuration. | April 9, 2012 |
| Hive 0.7.1.4 | Amazon Elastic MapReduce supports Hive 0.7.1.4. For more information, go to Hive Configuration. | April 9, 2012 |
| AMI 1.0.1 | Updates sources.list to the new location of the Lenny distribution in archive.debian.org. | April 3, 2012 |
| Hive 0.7.1.3 | Support for Hive 0.7.1.3, which adds the dynamodb.retry.duration variable for configuring the timeout duration for retrying Hive queries. This version of Hive also supports setting the Amazon DynamoDB endpoint from within the Hive command-line application. | March 13, 2012 |
| Support for IAM in the console | Support for AWS Identity and Access Management (IAM) in the Amazon EMR console. Improvements for S3DistCp and support for Hive 0.7.1.2 are also included. | February 28, 2012 |
| Support for CloudWatch Metrics | Support for monitoring job flow metrics and setting alarms on metrics. | January 31, 2012 |
| Support for S3DistCp | Support for distributed copy using S3DistCp. | January 19, 2012 |
| Support for Amazon DynamoDB | Support for exporting and querying data stored in Amazon DynamoDB. | January 18, 2012 |
| AMI 2.0.2 and Hive 0.7.1.1 | Support for Amazon EMR AMI 2.0.2 and Hive 0.7.1.1. | January 17, 2012 |
| Cluster Compute Eight Extra Large (cc2.8xlarge) | Support for Cluster Compute Eight Extra Large (cc2.8xlarge) instances in job flows. | December 21, 2011 |
| Hadoop 0.20.205 | Support for Hadoop 0.20.205. For more information, see Supported Hadoop Versions. | December 11, 2011 |
| Pig 0.9.1 | Support for Pig 0.9.1. For more information, see Supported Pig Versions. | December 11, 2011 |
| AMI versioning | You can now specify which version of the Amazon EMR AMI to use to launch your job flow. All Amazon EC2 instances in the job flow will be initialized with the AMI version that you specify. For more information, see Specify the Amazon EMR AMI Version. | December 11, 2011 |
| Amazon EMR job flows on Amazon Virtual Private Cloud (Amazon VPC) | You can now launch Amazon EMR job flows inside your Amazon Virtual Private Cloud (Amazon VPC) for greater control over network configuration and access. For more information, see Running Job Flows on an Amazon VPC. | December 11, 2011 |
| Spot Instances | Added support for launching job flow instance groups as Spot Instances. For more information, see Lowering Costs with Spot Instances. | August 19, 2011 |
| Hive 0.7.1 | Added support for Hive 0.7.1. For more information, see Supported Hive Versions. | July 25, 2011 |
| Termination Protection | Support for a new Termination Protection feature. For more information, see Protecting a Job Flow from Termination. | April 14, 2011 |
| Tagging | Support for Amazon EC2 tagging. For more information, see Using Tagging. | March 9, 2011 |
| IAM Integration | Support for AWS Identity and Access Management. For more information, see AWS Identity and Access Management (IAM) and Configuring User Permissions. | February 21, 2011 |
| Elastic IP Support | Support for Elastic IP addresses. For more information, see Elastic IP Address and Using Elastic IP Addresses. | February 21, 2011 |
| Environment Configuration | Expanded sections on Environment Configuration and Performance Tuning. For more information, see Performance Tuning and Environment Configuration. | February 21, 2011 |
| Distributed Cache | For more information on using DistributedCache to upload files and libraries, see Using Distributed Cache. | February 21, 2011 |
| How to build modules using Amazon Elastic MapReduce (Amazon EMR) | For more information, see Building Binaries Using Amazon EMR. | February 21, 2011 |
| Comparison of job flow types | For more information, see Appendix: Compare Job Flow Types. | February 21, 2011 |
| Amazon S3 multipart upload | Support for Amazon S3 multipart upload through the AWS Java SDK. For more information, see Multipart Upload. | January 6, 2010 |
| Hive 0.7 | Support for Hive 0.7 and concurrent versions of Hive 0.5 and Hive 0.7 on the same cluster. Note: You need to update the Elastic MapReduce Command Line Interface to resize running job flows and modify instance groups. For more information, see Hive Configuration. | December 8, 2010 |
| JDBC Drivers for Hive | Support for JDBC with Hive 0.5 and Hive 0.7. For more information, see Using the Hive JDBC Driver. | December 8, 2010 |
| Support HPC | Support for cluster compute instances. For more information, see Amazon EC2 Instances. | November 14, 2010 |
| Bootstrap Actions | Expanded content and samples for bootstrap actions. For more information, see Bootstrap Actions. | November 14, 2010 |
| Cascading job flows | Description of Cascading job flow support. For more information, see How to Create a Cascading Job Flow and Cascading. | November 14, 2010 |
| Resize Running Job Flow | Support for resizing a running job flow. The new task and core node types replace the slave node type. For more information, see Architectural Overview of Amazon EMR, Resizeable Running Job Flows, and Resizing Running Job Flows. | October 19, 2010 |
| Appendix: Configuration Options | Expanded information on configuration options available in Amazon EMR. For more information, refer to Hadoop Configuration. | October 19, 2010 |
| Guide revision | This release features a reorganization of the Amazon EMR Developer Guide. | October 19, 2010 |