Amazon launches Hadoop data crunching service

Amazon launches Hadoop data crunching service

Summary: Amazon on Thursday announced a new cloud computing service that uses Hadoop, a free software framework, to crunch tons of data. The service, called Amazon Elastic MapReduce, is designed for businesses, researchers and analysts trying to conduct data intensive number crunching (statement).

SHARE:
3

Amazon on Thursday announced a new cloud computing service that uses Hadoop, a free software framework, to crunch tons of data. 

The service, called Amazon Elastic MapReduce, is designed for businesses, researchers and analysts trying to conduct data intensive number crunching (statement). Hadoop, which is used by companies like Google and Yahoo, is being pushed into the enterprise data center by startups like Cloudera. 

Clarification: Google invented and uses the MapReduce technology, but it doesn't use Hadoop, an open-source implementation of MapReduce. At least it doesn't use it broadly. It has its own in-house version.

Amazon's Hadoop framework runs on the company's Elastic Compute Cloud (EC2) and Simple Storage Service (S3). The general idea is that customers can use MapReduce to pay by the sip as they do things like index the web, data mine, conduct financial analysis, simulation and bioinformatics research. In a must read analysis, Dana Gardner reckons that Amazon's move could be a game changer for business intelligence

In a statement, Amazon said:

Amazon Elastic MapReduce creates data processing job flows that are executed by Hadoop software on the web-scale infrastructure of Amazon EC2. The service automatically launches and configures the number and type of Amazon EC2 instances specified by customers. It then kicks off a Hadoop implementation of the MapReduce programming model, which loads large amounts of user input data from Amazon S3 and then subdivides it for parallel processing using Amazon EC2 instances. As processing completes, data is re-combined and reduced into a final solution, and the results deposited back into Amazon S3. Users can configure, manipulate, and monitor job flows through web service APIs or via the AWS Management Console.

That roughly translates to: Bring your data mining to us.

MapReduce is a separate service and here's the pricing in the U.S.:

Topics: CXO, Amazon, Cloud, IT Employment

Kick off your day with ZDNet's daily email newsletter. It's the freshest tech news and opinion, served hot. Get it.

Talkback

3 comments
Log in or register to join the discussion
  • Making MapReduce Mainstream

    Hadoop is great for teams of developers to crunch data. However, the "tipping point" in bringing the value of MapReduce to companies is by providing enterprise-class database features such as security, tight SQL integration, and more. Aster Data offers nCluster on the Amazon cloud that combines the functionality of a database to manage the data along with the power of SQL and MapReduce to query it. Customers can store, manage, query and mine data in one platform realizing the full benefits of Amazon's Cloud.
    http://www.asterdata.com/blog/index.php/category/mapreduce/

    swooledge
  • RE: Amazon launches Hadoop data crunching service

    This is awesome news!
    -Des

    http://antiagingview.blogspot.com/
    desmondhaynes
  • RE: Amazon launches Hadoop data crunching service

    It has been an expected move from Amazon to introduce
    MapReduce service. I am glad Amazon team listens to
    the community. I think the readers of this post may
    want to learn about CloudBerry Explorer freeware
    product http://cloudberrylab.com/ that helps managing
    Amazon S3 and CloudFront services.
    cloudberryman