Pentaho, a longstanding player in open source business intelligence applications, notes that Hadoop and several top NoSQL databases are licensed under the Apache License. Pentaho's Kettle open source project, otherwise known as Pentaho Data Integration Community Edition, is devoted to "operationalizing" big data.
Some of the big data capabilities in Kettle that will be open sourced include "the ability to input, output, manipulate and report on data using the following Hadoop and NoSQL stores: Cassandra, Hadoop HDFS, Hadoop MapReduce, Hadapt, HBase, Hive, HPCC Systems and MongoDB," the company announced.
Traditional relational databases and data tools are insufficient for handling big datasets.
One exec had this to say about the open source move:
"In order to obtain broader market adoption of big data technology including Hadoop and NoSQL, Pentaho is open sourcing its data integration product under the free Apache license. This will foster success and productivity for developers, analysts and data scientists giving them one tool for data integration and access to discovery and visualization," said Matt Casters, founder and chief architect of Pentaho's Kettle Project.