Revolution Analytics brings in-Hadoop R to Cloudera's CDH

Revolution Analytics brings in-Hadoop R to Cloudera's CDH

Summary: Revolution Analytics brings R programming to data stored in Cloudera's Hadoop distro (CDH), instead of the other way around.

TOPICS: Big Data

Revolution Analytics, the corporate benefactor-in-chief of the R programming language, is now supporting the integration of R inside Cloudera's Distribution Including Apache Hadoop (CDH), versions 3.0 and 4.0.

Specifically, the upcoming version 7.0 of the Revolution R Enterpise distribution and its ScaleR algorithms will run inside CDH3 and CDH4, eliminating the need to move data out of CDH and into the conventional storage that R code would otherwise require.  Avoiding such data movement will allow predictive analytics functionality implemented in R to execute more immediately and quickly, allowing for more real-time analysis and generally better performance even in non-real-time applications.

This integration results from Cloudera's partnership with Revolution Analytics, one of many such partnerships Cloudera has brokered across the analytics industry.  It also marks yet another scenario wherein Hadoop's Distributed File System (HDFS) is becoming a standard storage medium with which many data engines, query processors, and programming languages feel they must interoperate.

Topic: Big Data

Andrew Brust

About Andrew Brust

Andrew J. Brust has worked in the software industry for 25 years as a developer, consultant, entrepreneur and CTO, specializing in application development, databases and business intelligence technology.

Kick off your day with ZDNet's daily email newsletter. It's the freshest tech news and opinion, served hot. Get it.


1 comment
Log in or register to join the discussion
  • very cool

    This is a huge improvement on the Machine Learning front, much nicer than hacking Mahout code.