Google explains how Compute Engine fits in big data pipeline

Google explains how Compute Engine fits in big data pipeline

Summary: After the debut of Compute Engine, Google product managers explain how it fits into the company's big data strategy overall.

TOPICS: Google

SAN FRANCISCO -- For anyone familiar with big data right now, this is a no-brainer: managing massive amounts of big data is hard.

But Google product manager Ju-kay Kwek posited that there is a better way to solve this problem: leveraging Google's expertise to put your big data to work for you.

See also: Google launches Amazon Web Services killer, but lacks maturity, options Google Drive upgrades make it more appealing for business Google intros Compute Engine infrastructure service

Naturally, this would be the advertised argument during a Google I/O 2012 panel discussion about turning big data into a competitive advantage. Nevertheless, with the unveiling of the Google Compute Engine Infrastructure-as-a-Service platform earlier on Thursday, it's time to hear more about what Google is planning to do in this field.

Kwek cited a statistic from IDC that predicts the big data landscape will grow from being worth $3.2 billion in 2010 to $16.9 billion by 2015, equalling roughly 40 percent CAGR market growth. Based on that alone, it's also a no-brainer that Google would want to put all of the data it already has (and then some) to better use in the enterprise market.

Data is being seen as a core business asset, and increasingly a lot of business data (i.e. social, CRM) is out in the cloud. Kwek remarked that many new things are possible (and only available) by using the cloud, such as unique algorithms and scalability.

However, Kwek acknowledged that it's tough for enterprise customers to capture all of the data they generate, and scaling traditional business infrastructures for big data is equally difficult.

In terms of what this big data actually looks like, Kwek outlined some common characteristics: structured/unstructured/semi-structured, millions (if not billions) of rows, and it's too large to process or store on a single machine.

Google product manager Navneet Joneja presented some of the cloud solutions that Google is offering to tackle big data, explaining that Google is more focused on the solution rather than on the infrastructure.

Along with Google BigQuery, which was designed to handle massive datasets with billions of rows, two examples include Google Cloud Storage with unlimited data at up to 5TB per object, high redundancy and simple sharing, as well as the Google App Engine, a scalable application development and execution environment.

Like connecting Android devices just by using a single Google account login, Joneja explained that this cloud data analytics pipeline should work together seamlessly. However, he did acknowledge that any of these products should work just as fine alone.


Topic: Google

Kick off your day with ZDNet's daily email newsletter. It's the freshest tech news and opinion, served hot. Get it.


Log in or register to join the discussion
  • Hope Google wins this battle in the cloud

    Amazon is the leader in the cloud data storage. Google still has long way to go in order to beat them. Still, it would be interesting to follow this battle.

    - Sara
  • New legislation

    Any company big enough to need big data should be investigated by the Anti-Trust Division.
  • Google Compute: imitation instead of innovation (as usual)

    Google's "Compute Engine" cloud service is a blatant attempt to copy Amazon's EC2 service, and to an extent also copying Microsoft's Azure.

    The corporation could have come up with a name more imaginative and less similar to Amazon's EC2 (Elastic Computing, but presumably the similarity is a deliberate attempt by Google to portray their GC as an alternative to Amazon's EC2. Yet, because Google carries an anti--Microsoft chip on its shoulder, unlike EC2 Google's offering won't even offer the Windows servers that most companies actually use and need -- it's a joke.

    Can you guess what Google's strategy us for trying to make their service competitive against Amazon's? No, not some clever innovation as fanboys would like to think. Google's big new idea is simply to undercut the competition on price. As always, Google's entire strategy is dependent on the cash from search, their one great innovation -- albeit old news a decade ago.

    Thus, Google Compute is another classic example of imitation instead of innovation. Google's has become "cheap" in both senses of the word.
    Tim Acheson
  • @ Tim Acheson

    Tim, not sure where the beef is with Google Compute Engine or Google for that matter. Maybe your a bitter person, however let's take a step back for just a second. Amazon's cloud offering isn't called "EC2." It's actually called Amazon Web Services. EC2 is one of many instance types of the AWS offerings. Therefore, I really don't see how Google Compute Engine has any resemblance? Even if Amazon were EC2, how does compute engine relate to Elastic Compute Cloud? I work in the cloud industry and naming conventions are always similar for brand awareness.

    Not innovative? Google has been in cloud computing forever. This is nothing new. It is very innovative to have another IaaS player in the realm. It drives competition and innovation. I would argue that if AWS were the only market player would be a disservice to the industry. It's all about choice and if you know about some of the disadvantages of AWS you would probably understand why it's important to have more players.

    I'm just saying. Take the emotion out of your post and put it into context. Geez.
    Alex Alvord