Advancing artificial intelligence: Microsoft deploys corgis to beat Google on imaging

Advancing artificial intelligence: Microsoft deploys corgis to beat Google on imaging

Summary: Microsoft says Project Adam has given it the world's best image classifier.


The race to advance artificial intelligence using cheap networked PCs kitted up to mimic the human brain has a new challenger: Project Adam, Microsoft's attempt at using deep learning to improve natural language processing, computer vision, and speech recognition.

Microsoft says it's taken a big step in creating true artificial intelligence (AI) with Project Adam — a deep neural network built on commodity hardware and adept at categorising different breeds of corgi.

The groundwork for Project Adam was laid by a 2012 project by Google, which saw the search giant demonstrate a network of 16,000 computers could teach itself to identify cat images drawn from YouTube.

"The machine-learning models we have trained in the past have been very tiny, especially in comparison to the size of our brain in terms of connections between neurons," Trishul Chilimbi, a Microsoft researcher behind Project Adam who's also been working on Bing, said.

"What the Google work had indicated is that if you train a larger model on more data, you do better on hard AI tasks like image classification."

Instead of Google's cats, Microsoft researchers put Adam to work identifying different breeds of the Queen Elizabeth II's preferred hound, the corgi, using 14 million images from ImageNet, an image database divided into 22,000 categories.

According to Microsoft, thanks to Project Adam's network of two billion connections, it's created the world's best image classifier that it says is "50 times faster" than Google's effort, more than twice as accurate, and requiring 30 times fewer machines.

The project aims to capture the potential of "hierarchical representation learning using big data", according to Microsoft. As it explains in a video, the technology could allow users to photograph food to immediately discover its nutritional information, or be put to work helping to detect diseases earlier.

Chilimbi said the "sweet spot" for the number of layers in a deep neural network is six — which is close to the human visual cortex. After that, each additional layer delivers smaller returns.

So the project's approach to learning the difference between different corgi breeds would be broken down into layers — for example, the dog's shape, followed by another layer that learns textures and fur, then another focussed on body parts such as the shapes of ears and eyes. The fourth layer would learn complex body parts while the fifth would be dedicated to "high level recognisable concepts" like a dog's face.

"The reason it's interesting is that each layer of this neural network learns automatically a higher-level feature based on the layer below it. The top-level layer learns high-level concepts like plants, written text, or shiny objects. It seems that you come to a point where there’s diminishing returns to going another level deep. Biologically, it seems the right thing, as well," said Chilimbi.

According to Chilimbi, it's still a mystery how deep neural networks can figure out to break down an image into levels of its features, for example, after being told that an image is a Pembroke Welsh corgi.

"There's no instruction that we provide for that. You just have training algorithms saying, 'This is the image, this is the label.' It automatically figures out these hierarchical features. That's still a deep, mysterious, not well understood process. But then, nature has had several million years to work her magic in shaping the brain, so it shouldn't be surprising that we will need time to slowly unravel the mysteries."

Read more on artificial intelligence

Topics: Big Data, Emerging Tech

Liam Tung

About Liam Tung

Liam Tung is an Australian business technology journalist living a few too many Swedish miles north of Stockholm for his liking. He gained a bachelors degree in economics and arts (cultural studies) at Sydney's Macquarie University, but hacked (without Norse or malicious code for that matter) his way into a career as an enterprise tech, security and telecommunications journalist with ZDNet Australia. These days Liam is a full time freelance technology journalist who writes for several publications.

Kick off your day with ZDNet's daily email newsletter. It's the freshest tech news and opinion, served hot. Get it.


Log in or register to join the discussion
  • There is no one "simulating the human brain"

    they have yet to even model the basic neurological function of a tardigrade, let alone a higher primate.

    This kind of thing is just the worst hype imaginable. The efforts of the industry to make artificial intelligence more intelligent is laudable, but a little more humility would be appropriate. We're not doing the things we're claiming we're doing.
  • Microsoft going to the dogs!

    I know, grin, but Liam certainly missed a "golden click bait" blog title, wouldn't everyone agree.

    BTW, interesting article. However, this neural net facial recognition technology is really being driven by the need to identify and categorize human faces for the NSA and other Five Eyes security databases. And "they" are getting very good at it.
    • or so we hope

      because if NSA is not really good at recognizing faces, your face (or mine for that matter) may get linked to a complete stranger :-)

      I wonder, though, if the algorithm is good enough to tell the dogs of the same breed apart?
    • Vegas

      I believe the Las Vegas casinos already have and are using a similar technology. Wouldn't surprise me if the NSA has already tapped into their database.
  • Advancing artificial intelligence: Microsoft deploys corgis to beat Google

    Microsoft has been working on natural language technology for a couple of years. There have been times when I wanted to know what an object was so this may help. But I'm also a bit cautious about the technology as well.
  • Good ol Microsoft

    Always chasing the dog.

    When was the last time they were innovative?