ie8 fix
Click Here

Virtually Speaking

Dan Kusnetzky, Paula Rooney and Ken Hess

Comments on NY Times Big Data article

By | December 28, 2011, 3:15am PST

Summary: The concept of Big Data is breaking through into the New York Times. The article only scratches the surface, however.

During my morning news scan, I came across “For Start-Ups That Aim at Giants, Sorting the Data Cloud Is the Next Big Thing” by Malia Wollan. Although the article starts off as a general treatment of Big Data and why it is becoming an increasingly important area of technology, it then turns into a review of Splunk and a few profiles of Splunk customers. This, of course, doesn’t tell the whole story.

In my article “What is “Big Data?,” I pointed out that “Big Data” is a catch phrase that first appeared in the high performance/technical computing segment of the overall IT market. Suppliers of many different types of virtualization technology have been presenting why their product or service is a critical requirement for a potential customer interested in deploying a “Big Data” application.

Some of these suppliers are speaking about the use of a distributed cache mechanism, one supplier calls this “memory virtualization,” in order to allow a large number of systems to access a huge, rapidly changing data sent. The key here is rapidly changing. The data sets in question are updated so frequently that a traditional database engine could not possibly keep up.

Other suppliers, such as DataStax and MapR, are focused one level higher up the stack. They’re presenting their version of Apache Hadoop, a popular framework for “distributed processing of large data sets across clusters of computers.”

Still others focus on how Big Data applicaitons can be mapped onto storage virtualization solutions. The goal of these suppliers is making sure that this huge amount of rapidly changing data can somehow make it safely to more tradtional storage systems.

After speaking with quite a number of these suppliers, it becomes clear that each is working from a different, rather self-serving definition of what the concept entails. In an attempt to come up with something a bit more useful, I described Big Data as:

In simplest terms, the phrase refers to the tools, processes and procedures allowing an organization to create, manipulate, and manage very large data sets and storage facilities.

Does your organization have a need for “Big Data” applications? What tools are you using?

My thanks go out to Malia Wollan for bringing the topic up in the New York Times. 

Kick off your day with ZDNet's daily e-mail newsletter. It's the freshest tech news and opinion, served hot. Get it.

Topics

Daniel Kusnetzky is a distinguished analyst and the founder of the Kusnetzky Group LLC.

Disclosure

Dan Kusnetzky

The Kusnetzky Group LLC is an independent technology industry research firm that focuses on system software, virtualization and cloud computing technology.

Dan's opinions are based upon research, personal experiences and actual use of technology. They are not based upon the relationships the company may or may not have with suppliers, end user organizations, the media, consultants or other analysts.

Dan's research is available on a subscription basis through the Kusnetzky Group LLC. Dan's attendance at industry events or at client meetings may be sponsored by the client. Clients may provide hardware or software for testing prior to the publication of analysis that includes that product. Clients may also provide shirts, jackets, coffee cups, folders, backpacks, pens and other event chotchkies. While nice, these don't effect Dan's opinions or insight about those clients or their products.

Biography

Dan Kusnetzky

Daniel Kusnetzky, Analyst and Founder of Kusnetzky Group LLC, is responsible for research, publications, and operations. Mr. Kusnetzky has been involved with information technology since the late 1970s. Mr. Kusnetzky has been responsible for research operations at the 451 Group; corporate and marketing strategy for Open-Xchange; system software and virtualization research at IDC; and program and product management at Digital Equipment Corporation.; Today, Mr. Kusnetzky focuses on system software, virtualization technology and cloud computing.

3
Comments

Join the conversation!

Just In

Definition of Big Data
bsiscovick 11th Jan
Hi Dan, our firm, IA Ventures (www.iaventures.com) is 100% focused on investing in early stage Big Data companies. We've spent countless hours contemplating the term Big Data. In our experience, Big Data refers to extracting insight from complex data. Complex data does not only mean 'large amounts of data', but refers to data that exhibits some combination of the following characteristics: massive scale, unstructured and/or real time.

Of course, there is no 'right' definition of Big Data - but figured I'd share the way we think about it at IA Ventures.
0 Votes
+ -
Good article Dan. While there is still some confusion about articulating and defining Big Data, there is no doubt that Managing large volumes of data is the next big IT wave and it is going to be really big business for the next few years starting 2012 as it goes from R&D mode to production mode.
-Trendwise
www.TrenwiseAnalytics.com
Great article Dan. It is worth mentioning the HPCC Systems platform as a great fit for all data models. HPCC is a mature platform and provides for a data delivery engine together with a data transformation and linking system equivalent to Hadoop. The main advantages over other alternatives are the real-time delivery of data queries and the extremely powerful ECL language programming model. Also, the ROI for HPCC is significantly better than Hadoop in that it requires less nodes and less programmers. Visit http://hpccsystems.com
0 Votes
+ -
Definition of Big Data
bsiscovick 11th Jan
Hi Dan, our firm, IA Ventures (www.iaventures.com) is 100% focused on investing in early stage Big Data companies. We've spent countless hours contemplating the term Big Data. In our experience, Big Data refers to extracting insight from complex data. Complex data does not only mean 'large amounts of data', but refers to data that exhibits some combination of the following characteristics: massive scale, unstructured and/or real time.

Of course, there is no 'right' definition of Big Data - but figured I'd share the way we think about it at IA Ventures.

Join the conversation!

Formatting +
BB Codes - Note: HTML is not supported in forums
  • [b] Bold [/b]
  • [i] Italic [/i]
  • [u] Underline [/u]
  • [s] Strikethrough [/s]
  • [q] "Quote" [/q]
  • [ol][*] 1. Ordered List [/ol]
  • [ul][*] · Unordered List [/ul]
  • [pre] Preformat [/pre]
  • [quote] "Blockquote" [/quote]
ie8 fix

The best of ZDNet, delivered

ZDNet Newsletters

Get the best of ZDNet delivered straight to your inbox

Facebook Activity

White Papers, Webcasts, & Resources
ie8 fix