Linux and Open Source

Dana Blankenhorn & Paula Rooney

Can the Google index go open source without killing it?

By Dana Blankenhorn | July 19, 2010, 5:08am PDT

Summary

Google, Yahoo, Microsoft and others should start negotiating on a statement of principles, explained in code, under which commercial search engines will operate. Governments would be assured that all engines were playing by rules, and they weren’t gaming the system for their own ends.

Topics

Blogger Info

Dana Blankenhorn

Biography

Dana Blankenhorn

Dana Blankenhorn

Dana Blankenhorn has been a business journalist for nearly 25 years and has covered the online world professionally since 1985. He founded the Interactive Age Daily for CMP Media, and has written for the Chicago Tribune, Advertising Age's "NetMarketing" supplement, and dozens of other publications over the years.

Paula Rooney

Biography

Paula Rooney

Paula Rooney

Paula Rooney has covered the technology industry for more than 15 years, starting with semiconductor design and mini-computer systems at EDN News and later focused on PC software companies including Microsoft, Lotus, Oracle, Red Hat, Novell and other open source and commercial software companies for CRN and PCWeek. She received a silver award from the American Society of Business Publication Editors in 2005 for her profile on Linus Torvalds and edited and co-authored "Partnering With Microsoft," a book about Microsoft's channel published by CMP Publishing in 2004. Rooney graduated from the Columbia University Graduate School of Journalism in 1997. In her off time, she enjoys scuba diving, sailing, sun worshipping, running and reading. She resides on the shores of Scituate, Massachusetts.

CNET’s Tom Krazit has offered a great run-down of recent moves aimed at forcing Google to give up its search algorithm, and Google’s pushback.

Google open sources more code than anyone else, but it has never revealed how its search engine works.

This is done, it says, to protect the Net from spammers and scammers and others who might game the results. Even with this secrecy, the scams persist. A vast underground industry exists to perpetrate them.

But those algorithms have become so important, critics insist, that Google has gained an editorial power over the whole Internet with them, and so they should be opened to scrutiny.

The controversy picked up steam when a Google Brazil executive told Tom Foremski open sourcing the index might be a good idea. Foremski ran with it, saying that a single index would save an incredible amount of bandwidth.

Giving away Google’s secret source is a bridge too far for me. I don’t think it would eliminate search engine competition in any case, as Foremski seems to believe. He notes some “unknown robot” is currently dragging more bandwidth from him than Google and Yahoo combined.

Still, there is a public interest in all this, and I am in a compromising mood this morning, so let me offer a modest proposal.

An open standard. Google, Yahoo, Microsoft and others should start negotiating on a statement of principles, explained in code, under which commercial search engines will operate.

There would be plenty of room for differentiation under that standard. It would be in every engine’s interests to have such differentiation, for precisely the reasons Google states. But governments would be assured that all engines were playing by rules, and they weren’t gaming the system for their own ends.

I wouldn’t even insist on a transparent process for the development of this standard. It could be done through a series of secret meetings on a high mountaintop somewhere, and government would not need to participate.

Just code some assurances you’re not mucking with the results and go about your business. You’re not being evil anyway. Now make sure your competitors aren’t being evil either.

Kick off your day with ZDNet's daily e-mail newsletter. It's the freshest tech news and opinion, served hot. Get it.

Dana Blankenhorn has been a business journalist for 30 years, a tech freelancer since 1983.

Disclosure

Dana Blankenhorn

Dana Blankenhorn has been a journalist, writer and part-time futurist for over 30 years.

At the present moment I run only a personal blog in addition to my ZDNet open source blog.

DanaBlankenhorn.Com has the subtitle The War Against Oil. In the past I have used it to write about political history, e-commerce, personal matters, some ideas related to open source, and The World of Always On, which is the idea of using sensors, motes and RFID to turn WiFi links into platforms for applications which live in the air.

My IRA account at Schwab holds a few tech shares, most notably some Intel and Applied Materials, but there are no open source companies in it. I don’t even own any CBS stock.

Biography

Dana Blankenhorn

Dana Blankenhorn has been a business journalist for nearly 25 years and has covered the online world professionally since 1985. He founded the Interactive Age Daily for CMP Media, and has written for the Chicago Tribune, Advertising Age's "NetMarketing" supplement, and dozens of other publications over the years.

Related Discussions on TechRepublic

Did you know you can take part in these discussions with your ZDNet membership?

Talkback Most Recent of 14 Talkback(s)

  • Google will never reveal how its search engine works
    Google already proved that they are an evil company. They are also proven liars. In addition, their employees can manipulate searches. Their creepy search engine is equivalent to a despicable villain's ray gun. Why would they hand that over?
    ZDNet Gravatar
    iPad-awan
    (Edited: 07/19/2010 08:07 AM)
  • ZDNet Blogger

    I disagree
    @iPad-awan I have seen many companies with immense, albeit temporary, power operate in my years in this business. Google has been among the gentlest in its use of power. Compared with Microsoft or Apple or IBM, they're meekness personified.
    ZDNet Gravatar
    DanaBlankenhorn
    07/19/2010 11:04 AM
  • Well put
    @DanaBlankenhorn

    Well put
    I challenge anyone to find behavior from Google that in any way remotely compares to the behavior from Microsoft which is documented in the following link from the U.S. justice department.

    http://www.albion.com/microsoft/findings.html
    ZDNet Gravatar
    dfolk2
    07/19/2010 11:34 AM
  • RE: Can the Google index go open source without killing it?
    @DanaBlankenhorn Yeah, the way they introduced Buzz was - well, it wasn't so gentle/meek. They just flipped a switch and suddenly everybody had to deal with it.

    I'm sure that the people themselves inside the company can be gentle/meek, but that rarely translates to the corporate image.

    In addition, you totally missed the point - "iPad-awan" wasn't talking about gentleness or meekness at all, you just pulled that out of nowhere to knock down a straw man. "Evil" people and "liars" can certainly work under the guise of meekness. That makes them cunning - it does not make them right.

    The most dangerous people are not those who have clearly evil personalities - they will be caught soon enough. The most dangerous people are those who do wrong under the cover of a good personality.
    ZDNet Gravatar
    CobraA1
    07/20/2010 09:57 AM
  • RE: Can the Google index go open source without killing it?
    @DanaBlankenhorn

    Wrong Google.

    This is the Google that monitors your emails in Gmail to see what you're interested in? That keeps search requests for the lifetime of their servers? That monitors everything that happens in Chrome?

    All in the name of targetted ads, Google is a legal spam company. Nothing friendly about them.
    ZDNet Gravatar
    Cyberjester
    07/27/2010 12:39 AM
  • RE: Can the Google index go open source without killing it?
    "Google open sources more code than anyone else": That's not true. They took a lot of Java code from Apache foundation, they took the Linux kernel code, and they did what pleased them with it. Their rate of return to the community is very small in comparison on what they took. At the end, their OS shares less an less with Linux, and their java is not Java anymore. Who is the winner ?
    ZDNet Gravatar
    atari_z
    (Edited: 07/19/2010 04:00 PM)
  • RE: Can the Google index go open source without killing it?
    @atari_z

    What exactly does Google owe you? You ask: Who is the winner? How is that Google's problem? They have a business to run. The winner is the person getting search results. No one using Google cares one bit about code that came from the Apache foundation.
    ZDNet Gravatar
    croberts
    07/20/2010 08:45 AM
  • ZDNet Blogger

    No secret sauce...
    Dana, thanks for your input. But I'm not asking Google to share its secret sauce, which is its analysis/algorithm. All the robots are collecting the same open information - what's on a web site. Why duplicate that effort 20 times (on my site 20 robots visit daily using 45% of my bandwidth.)

    A central open index could be collected by one "openbot" and accessible to any company. That should be cheaper than collecting all that data individually. The analysis of that data is where the secret sauce is.
    ZDNet Gravatar
    foremski
    07/19/2010 08:48 PM
  • ZDNet Blogger

    RE: Can the Google index go open source without killing it?
    @foremski Thanks for writing. I was confused a little by the point where you noted some "unknown" robot more bandwidth than the Google and Yahoo robots combined.

    Ever figure out who the bad robot was?
    ZDNet Gravatar
    DanaBlankenhorn
    07/20/2010 12:00 PM
  • RE: Can the Google index go open source without killing it?
    @foremski

    Interesting idea.

    I doubt it would happen, and here's why.

    Just a little personal anecdote. I was living in shared housing whilst in uni. I rent a bedroom, share facilities. We each buy our own food. There was, no kidding, 8 bottles of exactly the same milk in the fridge, 8 cartons of margerine, 8 loaves of bread. I moved into a house with some friends.

    There's now 2 people in this house, with 2 bottles of the same milk, 2 cartons of the same margerine, 2 loaves of the same bread.

    No-one trusts each other enough to co-operate. If a company has an idea to change it's robot, it won't share it to benefit everyone, it'll slap 50 patents on it. You won't get to Google's level by sharing. As much as it would benefit everyone, it just wouldn't happen.
    ZDNet Gravatar
    Cyberjester
    07/27/2010 12:44 AM
  • Why would you hand over your money maker
    The search engine is Google's primary source of income, why would they open source it? For one thing, they do have the largest bandwidth on the planet, the other is that advertising through them is how they make money, which is why everything they offer is "free" for use.
    ZDNet Gravatar
    Maarek
    07/20/2010 09:50 AM
  • What is the business value in doing this?
    Can the author add some addendum to the article to tell us what value the business will get by doing this? That is the part I don't really understand. How is it a win-win-win for everyone?
    ZDNet Gravatar
    gudushi.kutushi
    07/20/2010 11:25 AM
  • ZDNet Blogger

    Credibility
    @gudushi.kutushi Having the basic rules be transparent gives everyone the assurance no one is gaming the system. This benefits all players. The biggest player benefits most.
    ZDNet Gravatar
    DanaBlankenhorn
    07/20/2010 12:01 PM
  • RE: Can the Google index go open source without killing it?
    @DanaBlankenhorn

    You're assuming the players are all playing fair to start with. :P

    There's a game, can't remember which. It's like the police offer. Two criminals are caught, each is given an offer. Turn in the other one and your sentence will be lessened. If they both confess and burn the other, their sentence increases. If one burns the other, their sentence is decreased. If neither confess, their sentence stays the same.

    Same = 1
    Decrease = 0.5
    Increase = 3

    It's much more beneficial for both of them to remain silent. But the majority of people in this game will burn the other. We're much too selfish to play fair, and because of which we're too paranoid to trust the other, thinking they'll do us in before we can do them in.
    ZDNet Gravatar
    Cyberjester
    07/27/2010 12:47 AM

Talkback - Tell Us What You Think

advertisement

The best of ZDNet, delivered

ZDNet Newsletters

Get the best of ZDNet delivered straight to your inbox

Facebook Activity

White Papers, Webcasts, & Resources
advertisement
Click Here