Google AI researchers find strange new reason to play Jeopardy!

Scientists at Google's AI unit tested a deep neural network on clues from the popular game show Jeopardy!. But unlike IBM's Watson triumph, this was less about the answers and more about the strange way the computer reformulates the question.

When IBM's Watson computer beat two world champions at the game show Jeopardy! in 2011, it was a moment to marvel at how a machine could comprehend the language of a question and mine its vast memory for an appropriate response.

Google scientists have found another use for Jeopardy! questions, one that has less to do with understanding human speech and more to do with how computers communicate with one another.

And this week, they've released that work as an open-source software tool, available on GitHub to anyone using Google's TensorFlow framework for machine learning.

"Active Question Answering," or Active QA, as the TensorFlow package is called, will reformulate a given English-language question into multiple different re-wordings, and find the variant that does best at retrieving an answer from a database.

The system was developed by feeding Jeopardy! clues into a "reinforcement learning" neural network. The network got better and better at re-wording questions as it was rewarded for successfully retrieving the right answer.
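To make the reward idea concrete, here is a minimal Python sketch. It is not the Active QA code. It assumes the reward is a token-overlap F1 score between the returned answer and the known answer, and it swaps in a hypothetical keyword-matching `toy_answerer` for a real question answering system. The function names and the first candidate question are invented for illustration; the second candidate is the reformulation quoted in the paper.

```python
from collections import Counter


def token_f1(predicted: str, gold: str) -> float:
    """Token-overlap F1 between a returned answer and the known answer."""
    pred_tokens = predicted.lower().split()
    gold_tokens = gold.lower().split()
    if not pred_tokens or not gold_tokens:
        return 0.0
    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)


def toy_answerer(question: str) -> str:
    """Hypothetical stand-in for the answering system: a keyword lookup."""
    q = question.lower()
    if "wrote" in q and "peace" in q:
        return "Leo Tolstoy"
    return "unknown"


def best_reformulation(candidates, gold):
    """Score every rewording by its reward and return (reward, question)."""
    scored = [(token_f1(toy_answerer(c), gold), c) for c in candidates]
    return max(scored, key=lambda pair: pair[0])


candidates = [
    "Who influenced Gandhi?",  # invented rewording, for contrast
    "What is name gandhi gandhi influence wrote peace peace?",
]
reward, question = best_reformulation(candidates, "Leo Tolstoy")
print(reward, question)
```

In this toy setup the odd-sounding rewording wins simply because it carries the keywords the stand-in answerer responds to, which is the same kind of pressure the reward places on the real network.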

Google AI authors, in the blog post on the project, note that their famous corporate mission is to "organize the world's information." In keeping with that, they "envision that this research will help us design systems that provide better and more interpretable answers, and hope it will help others develop systems that can interact with the world using natural language."

In the original paper, Ask The Right Questions: Active Question Reformulation With Reinforcement Learning, presented this past spring at the International Conference on Learning Representations, Google AI researchers Christian Buck, Jannis Bulian, Massimiliano Ciaramita, Wojciech Gajewski, Andrea Gesmundo, Neil Houlsby, and Wei Wang built upon principles of machine translation. They interpreted the task of training a computer to reformulate clues from Jeopardy! as being akin to foreign language translation. The goal was to paraphrase the Jeopardy! clues in a syntax that improves querying of a database.

For example, given a clue like "Gandhi was deeply influenced by this count who wrote 'War and Peace'," the neural network had to learn to put that clue into the form of a question that would produce the correct answer, which is Leo Tolstoy. (The Jeopardy! questions were drawn from a 2017 project, called SearchQA, built by researchers at New York University and Carnegie Mellon. That dataset was, in turn, assembled by crawling the website "J! Archive," a fan site for the show.)


A diagram of the Active QA operation: a Jeopardy! clue is reformulated into a new question and submitted to the BiDAF database; a convolutional neural network then ranks the returned answers to find the best one, which serves as the reward for training the rephrasing.

The Active QA package includes a customized version of Google's TensorFlow code for machine translation. It's based on Google research in 2014 on what are called "sequence to sequence" neural networks for translating between, say, English and French.

The code package also includes a so-called question answering system, the actual database that retrieves answers to the questions put to it by Active QA. This is based on "BiDAF," a deep learning system for answering questions that was developed in 2017 by researchers at the Allen Institute for Artificial Intelligence and the University of Washington.

What's most significant, perhaps, in the paper and in this new toolkit, is that the deep neural network is not learning how to come up with well-phrased natural-language speech, nor is it learning much about asking questions in the typical sense that humans mean it. It's not like The Washington Post's robot journalist, impersonating human writing.

Rather, Active QA is learning tricks that improve how to search a database, and the results often sound like gibberish to a human ear. For example, the authors note that the above clue about Gandhi ("Gandhi was deeply influenced by this count who wrote 'War and Peace'") was reformulated by Active QA as "What is name gandhi gandhi influence wrote peace peace?"

In another instance, the original Jeopardy! clue, "During the Tertiary Period, India plowed into Eurasia & this highest mountain range was formed," was refashioned as "What is name were tertiary period in india plowed eurasia?" That query succeeded in returning the correct answer: the Himalayas. Numerous examples, many having the same weird patterns of awkward grammar and repeated words, are offered in the appendix at the back of the paper.

While it's doggerel as far as natural language goes, the authors see the computer-constructed phrases as a real advance in query skills. The Active QA neural net wasn't just slightly modifying the original clues; it was discovering, on its own, techniques that have long been around in the science of information retrieval, such as "stemming," where a verb, say, is changed from its conjugated form to its root form.
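For readers unfamiliar with stemming, the following is a deliberately crude Python sketch of the idea. It is a hand-written suffix stripper invented for illustration (the name `toy_stem` and its rules are assumptions), not the behavior Active QA learned and not a standard algorithm such as the Porter stemmer, which handles many more cases.

```python
def toy_stem(word: str) -> str:
    """Strip one common English suffix, keeping at least three letters."""
    word = word.lower()
    # Longer suffixes are tried first so "ing" wins over a bare "s".
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


clue = "India plowed into Eurasia and this mountain range was formed"
print(" ".join(toy_stem(w) for w in clue.split()))
```

Even rules this simple map "plowed" and "formed" to "plow" and "form," so a query and a stored passage can match despite differing verb forms. Irregular forms such as "wrote" pass through unchanged, which is one reason real stemmers are more elaborate.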

"Sometimes," they write, "AQA learns to generate semantically nonsensical, novel, surface term variants; e.g., it might transform the adjective dense to densey." The "only justification for this," they conclude, is that it does a good job "exploiting" the way the BiDAF database has encoded the answers.

As the authors put it, "It seems quite remarkable then that AQA is able to learn non-trivial reformulation policies ... One can think of the policy as a language for formulating questions that the agent has developed while engaging in a machine-machine communication with the environment."

The day may not be far off when bots will do more of the Googling than people.
