Google BigQuery: Self-service cloud data analysis, from your iPad or desktop

Google BigQuery: Self-service cloud data analysis, from your iPad or desktop

Summary: Google made its BigQuery service publicly available last month. So I decided to put it through its paces, and compare it to Microsoft’s Excel and PowerPivot.

SHARE:

 |  Image 4 of 11

  • BigQuery data is stored in tables, much as in a relational database.  Tables, in turn, are stored within datasets.  Datasets serve as a unit of security, allowing for sharing with specific users or the overall public.  Google supplies the publicdata:samples dataset which is added to every BigQuery project.  This allows you to examine and query tables right away.

    At the left side of the screen, you can drill down on a dataset to see the tables it contains.  Select one of those tables and its schema will appear on the right-hand side of the console.  The github_timeline table’s schema is displayed in this figure.  Notice that its label on the left side of the screen appears in bold text with a red bar beside it.  The “Click to preview table data” link does what it says, but you can also write your own SQL queries.
     

  • To run a Query, type it in, then click the “RUN QUERY” button or just tap Ctrl-Enter on your keyboard.  While the query is running, the query text area is disabled and the elapsed query time clock runs up, right next to the “Query running” label.

    BigQuery does not permit “SELECT *”-style queries; instead, you must specify all column names.  And although you’ll be querying large datasets, you will want to keep your result sets small.  To do that, make use of aggregating queries (using aggregate functions and GROUP BY) and/or the LIMIT n clause at the end of your query as was done here (i.e. “LIMIT 200” appears at the end of the query).

    Tables are identified using a syntax of datasetname.tablename.  If you reference any table from the samples dataset, you’ll need to use the “publicdata:” prefix before the “samples” dataset name.

  • BigQuery isn’t just for big desktop and laptop computers.  It runs very well on the iPad, for example, as shown here.  And BigQuery is hip to tablet use too: rotate your iPad from landscape to portrait and the page rendering adjusts, showing you several extra rows of data (illustrated nicely here).

  • Thumbnail 1
  • Thumbnail 2
  • Thumbnail 3
  • Thumbnail 4
  • Thumbnail 5
  • Thumbnail 6
  • Thumbnail 7
  • Thumbnail 8
  • Thumbnail 9
  • Thumbnail 10
  • Thumbnail 11

Topic: Enterprise Software

Kick off your day with ZDNet's daily email newsletter. It's the freshest tech news and opinion, served hot. Get it.

Related Stories

Talkback

3 comments
Log in or register to join the discussion
  • Data privacy?

    BigQuery provides big-data analytics in a completely hosted offering. Hope there is no issue on the front of data privacy. Any idea?

    - Lisa
    http://www.hireamobileappdeveloper.com/
    SaraParker23
  • BigQuery or PowerPivot

    What exactly was this article about?

    So I can load data into BigQuery using CSV, export it out as CSV, import the CSV into Excel, then use PowerPivot to actually do analytics.

    Why not just go from CSV directly into Excel? What value is BigQuery bringing to the process; other than turning over my data to Google and giving me a query interface (but not a visualization interface) that will work on an iPad?
    Marc Jellinek
    • PowerPivot or BigQuery

      @Mark, my goal was to show you how BigQuery works, and to contrast that with how you might do similar work in Excel. I wasn't suggesting that you use one and then move the output to the other, although that would work.

      When you ask "Why not just go from CSV directly into Excel?", I suppose that is one of the questions I wanted to provoke you to think about. Would you rather use something like PowerPivot + Excel on your desktop, or would you prefer to stay cloud + browser (and SQL) based and use BigQuery? What's your take? Does the cloud trump the desktop + Excel?
      andrewbrust