Business

Google to improve image text recognition?

Google has filed a patent for the recognition and use of text contained in images and videos.

Written by David Meyer, Contributor Jan. 7, 2008 at 2:39 p.m. PT

Google has filed a patent for the recognition and use of text contained in images and videos.

The application, made in June 2007, was published on Thursday and covers "methods, systems and apparatus including computer program products for using extracted image text," according to Google.

"In one implementation, a computer-implemented method is provided," reads the abstract for the application. "The method includes receiving an input of one or more image-search terms and identifying keywords from the received one or more image search terms. The method also includes searching a collection of keywords including keywords extracted from image text, retrieving an image associated with extracted image text corresponding to one or more of the image-search terms, and presenting the image."

Google, proprietor of the most widely used image search facility on the Internet and video site, YouTube, has much to gain from correctly interpreting text within images and video. Such a capability could, for example, be used to create more accurate keywords, automated file tagging and the identification of where a picture was taken based on signage in the background.

However, on Monday a company spokesperson offered Google's standard reply to questions regarding patent applications. "We file patent applications on a variety of ideas that our employees come up with," said the spokesperson. "Some of those ideas later mature into real products or services, some don't. Prospective product announcements should not necessarily be inferred from our patent applications."

The patent application is not the first time Google has delved into the world of optical character recognition, a technology currently used mostly for scanning documents into word-processor-friendly formats.

In September 2006 the company helped debug an old OCR engine called Tesseract -- originally developed by Hewlett-Packard -- and released it as open source. At the time, Google also quietly mentioned that it was eager to hire "top-notch OCR engineers."

David Meyer of ZDNet UK reported from London.

Editorial standards

Show Comments

Google to improve image text recognition?

Related

ChatGPT vs. Microsoft Copilot vs. Gemini: Which is the best AI chatbot?

Samsung's flagship tablet looks better than ever. And it's only $399 for Memorial Day

I just ordered the cheapest Surface Pro option - why I (probably) won't regret it