Oracle will bundle a tweaked version of Cloudera's Hadoop distribution with the Oracle Big Data Appliance.
The announcement, made on Tuesday, sees Oracle choose a Hadoop distribution that was snubbed by storage rival EMC, which went with MapR's technology instead.
"The blending of Oracle Big Data Appliance with [the Cloudera Distribution Including Apache Hadoop] and Cloudera Manager is a natural and highly complementary fit," Mike Olson, co-founder and chief executive of Cloudera, said in a statement. "Cloudera is delighted to be teaming with Oracle to deliver the first engineered system designed to make Apache Hadoop work together with Oracle's product stack to address the most demanding analytic workloads."
Cloudera is the main commercial distributor of Apache Hadoop, the open-source technology for processing massive amounts of data. Hadoop is based on techniques described by Google in its MapReduce and Google File System papers, and much of its development took place at Yahoo.
Oracle announced the Oracle Big Data Appliance at Oracle OpenWorld in San Francisco in October. The appliance is designed for organisations that want to analyse and generate insights from vast amounts of data. At the time, it was widely assumed Oracle would develop its own Hadoop distribution, but Tuesday's announcement suggests the database specialist has opted for a partnership instead.
The device is composed of 18 Oracle Sun servers with a combined total of 864GB of memory, 216 CPU cores and 648TB of raw disk storage. It has 40Gbps InfiniBand connectivity between servers and to other Oracle appliances, and 10Gbps connectivity to the rest of the datacentre.
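For a rough sense of what each server contributes, the combined figures can be divided by the server count. The short Python sketch below does that arithmetic. It assumes the resources are spread evenly across the 18 servers, which the announcement does not state, and the names `SERVERS`, `TOTALS` and `per_server` are illustrative only.

```python
# Combined totals quoted for the appliance.
SERVERS = 18
TOTALS = {"memory_gb": 864, "cpu_cores": 216, "raw_disk_tb": 648}


def per_server(totals, servers):
    """Divide each combined figure by the server count.

    Assumption: every server is configured identically, so an even
    split is meaningful. This is not confirmed by the announcement.
    """
    return {name: value / servers for name, value in totals.items()}


if __name__ == "__main__":
    for name, value in per_server(TOTALS, SERVERS).items():
        print(f"{name}: {value:g} per server")
```

On that assumption, each server would account for 48GB of memory, 12 CPU cores and 36TB of raw disk.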
The appliance will be bundled with the Cloudera Distribution Including Apache Hadoop (CDH); Cloudera Manager; R, an open-source programming language for statistical computing; Oracle NoSQL Database Community Edition; and the Oracle HotSpot Java Virtual Machine.
Cloudera also has a partnership with Dell.