Cloudera leads for Hadoop: EMC and Microsoft lurk in the wings

Cloudera leads for Hadoop: EMC and Microsoft lurk in the wings

Summary: While Cloudera, along with core Apache and Hortonworks, have the lion's share of current European Hadoop use, proprietary vendors could be the elephants in the room.


The first wave of Apache-based distributions of open-source framework Hadoop still dominate in Europe, but newer versions from proprietary vendors are starting to figure strongly in firms' adoption plans.

With Hadoop rapidly becoming the standard for big-data analytics, Cloudera leads the field with 41 percent of the framework's use in Europe, followed by core Apache code on 30 percent, Hortonworks on 18 percent and MapR on nine percent.

However, EMC and Microsoft, which do not register in current adoption figures, emerge as viable candidates for future Hadoop projects, according to the research from data-integration company Syncsort.

Some nine percent of firms cite EMC's Greenplum HD as an Hadoop distribution they are considering using in future plans, with seven percent listing Microsoft's HDInsight Service. The numbers saying they are considering Cloudera and core Apache for future plans drop to 29 percent and 15 percent respectively.

Hortonworks and MapR do better when it comes to firms' future intentions, with figures of 23 percent and 15 percent. IBM's InfoSphere BigInsights stays constant, recording two percent for current use and as a distribution being considered for future use.

The research suggests more than 60 percent of the 300 organisations questioned are already experimenting with Hadoop or have been using it for more than a year. Roughly the same number say they have Hadoop projects due to start shortly.

Only 24 percent say they have no intention of using the open-source data analytics system in the near future.

The common barriers to Hadoop adoption are uncertainty and ignorance, cited by almost half of organisations, followed by concerns about connecting with other apps, lack of skills and unanticipated costs.

Drivers for Hadoop adoption

No single use seems to be driving Hadoop adoption, but marketing intelligence appears to be behind many of the first projects to be tackled, particularly in the retail and online industries.

In the financial services sector, Hadoop's role has shifted from large-scale data storage to credit and market risk applications.

A bare majority of firms fall into the categories of either being in the early phase of Hadoop adoption and working on prototyping and experimentation, or are already using Hadoop as an enterprise data-management platform.

Only about a quarter of companies, 27 percent, think they have a good data strategy and the right tools, but they still outstrip the 14 percent of organisations with no data strategy, or the 16 percent that admit to be struggling with the tools to integrate data.

Topics: Big Data, EMC, Enterprise Software, IBM, Microsoft

Kick off your day with ZDNet's daily email newsletter. It's the freshest tech news and opinion, served hot. Get it.


Log in or register to join the discussion
  • Cloudera Leads for Hadoop in Europe

    Almost an unnecessary study, because Hadoop distribution choice has little to do with the ultimate analytics solution; Hadoop distribution is just part of the solution in the vast majority of cases. E.g., no streaming for Hadoop, no visualization, limited analytics compared to R and SAS. In short, buy a distribution and a bunch of services and you can get started, but it will not ultimately get you to "big data." Sorry (EMC Greenplum) Pivotal HD and Microsoft, but less than 10% of "considering" isn't particularly hopeful, and Intel is nowhere despite all the brouhaha earlier this year.
  • Survey Source and Number of Respondents?

    Toby, I appreciate the article, but you're missing key information to validate the data -- who did this survey? how was it carried out and how many people responded to it? So many vendors and analyst firms are running these right now, it is important to know where the info is coming from in order to understand how to interpret the results.
  • Questionable Journalism

    Toby, why didn't you site your source was a survey at a vendor conference of non-random sample. This is not scientific nor proper survery research both in approach, sample size and distribution. You should have more respect for the nature of your profession.

    "In a recent survey conducted by data processing software vendor Syncsort on 300 attendees of two Big Data conferences in Europe, it was revealed that Cloudera is the widely-adopted Hadoop distribution platform in Europe."