Microsoft's Hadoop roadmap reveals new big data deliverables

By | February 27, 2012, 9:04am PST

Summary: A Microsoft roadmap slide sheds some additional details on the scope of the work the company is doing to integrate Hadoop with Windows Azure and Windows Server.

When it comes to big data, Microsoft has more in the works than just Windows Azure and Windows Server versions of the Hadoop big-data framework. The company is working on a number of supplementary tools and technologies that it plans to roll out aggressively in the first half of this year, according to a roadmap shared with me by one of my contacts.

Microsoft announced last fall that the company was partnering with Hortonworks to create Windows Azure and Windows Server Hadoop distributions. Company officials also shared via a short video on Microsoft’s Channel 9 that Microsoft was looking beyond those distributions themselves and was working on ways to integrate bidirectionally with the Hadoop file system and Hadoop tools like Sqoop and Flume. Microsoft officials have used the codename “Isotope” to refer to coming suite of Microsoft products and utilities that will support Hadoop on Windows Azure and Windows Server.

According to the roadmap slide below, Microsoft is planning to deliver the final version of Hadoop on Azure on March 30. (We knew this was supposed to happen some time in March, but didn’t have the exact target date until now.) At that same time, Microsoft also is planning to launch a tech preview of Hadoop on Windows Server, the roadmap says, with the final version of that offering targeted for release on June 29. (Again, we knew the target was June for Hadoop on Windows Server, but didn’t know of a specific date.)

(click on the slide to enlarge)

The exact delivery dates aren’t all that interesting to me, as they’re probably still somewhat subject to change (given that this slide is dated December 2011). Here’s what I found to be far more interesting in this slide:

See that mention of “BigTop” that is under the Hadoop on Azure GA (general availability) item? That’s a real surprising blast from the past — at least for this Microsoft codename watcher. “BigTop” was a Microsoft project I last wrote about back in 2004. BigTop, from what I was told, was all about helping developers create a set of loosely coupled, distributed operating-systems components in a relatively rapid way. Last I had heard Microsoft killed its BigTop incubation effort, but it seems at least some of the technologies from it are alive and well if it’s mentioned as a part of the current roadmap.

Update: Thanks to reader Andrew Bayer (@abayer), who alerted me to the “other” BigTop — and the far more likely reference on this slide. There’s an Apache BigTop that’s all about interoperability tests and packaging. I’m 99% sure this is the BigTop meant by the slide and not the old Microsoft BigTop.

Also under the Hadoop on Azure GA item is a .Net/Common Language Runtime (CLR)/C# framework for Hadoop programming. I’m hearing second-hand that there’s no real C# support there yet, beyond a single sample which can be used in a limited capacity. So it will be interesting to see how/if this materializes by March.

On the Hadoop on Server side, there’s another mention of Active Directory (AD) integration — something Microsoft execs alluded to in the aforementioned Channel 9 Isotope video. The roadmap slide also notes System Center (SC) integration is coming, too. And by the time the product is generally available in late June, it looks like support for the “R” statistical graphics and computation language will be incorporated, as will availability of “secure HDFS” (Hadoop File System).

I’ve asked Microsoft officials if they’ve got any more details to share about their Hadoop plans. No word back so far.

Kick off your day with ZDNet's daily e-mail newsletter. It's the freshest tech news and opinion, served hot. Get it.

Topics

Mary Jo has covered the tech industry for more than 25 years for a variety of publications and Web sites, and is a frequent guest on radio, TV and podcasts, speaking about all things Microsoft-related. She is the author of Microsoft 2.0: How Microsoft plans to stay relevant in the post-Gates era (John Wiley & Sons, 2008).

Disclosure

Mary-Jo Foley

Freelance journalist/blogger Mary Jo Foley has nothing to disclose. WYSIWYG (what you see is what you get). I do not own Microsoft stock or stock in any of its partners or competitors. I have no business ventures that are sponsored by/funded by Microsoft or any of its partners or competitors.

Biography

Mary-Jo Foley

Mary Jo Foley has covered the tech industry for 25 years for a variety of publications, including ZDNet, eWeek and Baseline. She has kept close tabs on Microsoft strategy, products and technologies for the past 10 years. In the late 1990s, she penned the award-winning "At The Evil Empire" column for ZDNet, and more recently the Microsoft Watch blog for Ziff Davis.

Got a tip? Send her an email with your rants, rumors, tips and tattles. Confidentiality guaranteed.

Related Discussions on TechRepublic

Did you know you can take part in these discussions with your ZDNet membership?
3
Comments

Join the conversation!

Just In

Protocol Buffer support? Cool
pnewhook 1st Mar
I'm thinking that's Google's Protocol Buffers (http://code.google.com/p/protobuf/) being referred to. If that gets proper .Net/Windows support, that'd be a big win for distributed system design.
-1 Votes
+ -
Beyond my ken.
MSFTWorshipper 27th Feb
I couldn't understand anything in the article because where was the iPhone?
0 Votes
+ -
Never mind,
jorwell Updated - 28th Feb
as long as your data is big, it doesn't matter if you understand it or not.
0 Votes
+ -
I'm thinking that's Google's Protocol Buffers (http://code.google.com/p/protobuf/) being referred to. If that gets proper .Net/Windows support, that'd be a big win for distributed system design.

Join the conversation!

Formatting +
BB Codes - Note: HTML is not supported in forums
  • [b] Bold [/b]
  • [i] Italic [/i]
  • [u] Underline [/u]
  • [s] Strikethrough [/s]
  • [q] "Quote" [/q]
  • [ol][*] 1. Ordered List [/ol]
  • [ul][*] · Unordered List [/ul]
  • [pre] Preformat [/pre]
  • [quote] "Blockquote" [/quote]
ie8 fix
Click Here

The best of ZDNet, delivered

ZDNet Newsletters

Get the best of ZDNet delivered straight to your inbox

Facebook Activity

White Papers, Webcasts, & Resources
ie8 fix