Google algorithm busts CAPTCHA with 99.8 percent accuracy

Summary:Google engineers have defeated CAPTCHA thanks to a Street View algorithm designed to decipher blurry street addresses.

reCAPTCHA's hardest puzzles
Image: Google

Next time you want to post a comment on a blog or use an online contact form, chances are you'll be confronted by a puzzle asking you to read some blurry and distorted text. Known as CAPTCHA, theses challenges are supposed to only be solvable by humans, in order to prevent unwanted bots from using web services.

However, their days as a human-only pursuit could be numbered: Google has built its own automated system that can beat CAPTCHAs with 99.8 percent accuracy.

The algorithm developed by Google researchers is being used by its Street View team to improve Google Maps, by helping to recognising characters in natural or blurry images — for example, the house numbers captured by the Street View cars in the course of gathering imagery for the mapping service.

According to the company, the algorithm can now accurately recognise 90 percent of street numbers, meaning Google Maps users looking for a particular building are likely to get a more specific result.    

But, given the nature of that challenge, it turns out that the algorithm is also well-suited to solving CAPTCHA puzzles designed to fox spammers using bots for services like Gmail. As Google's engineers explain in a recently published paper, the algorithm has 99.8 percent accuracy rate when trying to decipher the hardest puzzles created by Google's own CAPTCHA service, reCAPTCHA.

The algorithm would be highly-prized by spammers, who are on the hunt for ways to automatically pass CAPTCHA puzzles.

streetview
Street View images that can be solved by the algorithm. Image: Google

While (optical character recognition) OCR technology is fairly mature, apparently reading characters from photographs is a "hard problem" to solve, according to Google, whose researchers have overcome it with the use of a "deep convolutional neural network that operates directly on the image pixels".

Despite the 99.8 percent accuracy rate of the algorithm, Google says reCAPTCHA isn't broken or ineffective, partly due to an update to the service last year, which added "advanced risk analysis techniques". The system considers the user's engagement with it before, during, and after they interact with it. Using this approach helps it determine whether a potential user is likely to be human or not, before deciding how difficult a puzzle to serve up.

Topics: Security, Google

About

Liam Tung is an Australian business technology journalist living a few too many Swedish miles north of Stockholm for his liking. He gained a bachelors degree in economics and arts (cultural studies) at Sydney's Macquarie University, but hacked (without Norse or malicious code for that matter) his way into a career as an enterprise tech, s... Full Bio

Kick off your day with ZDNet's daily email newsletter. It's the freshest tech news and opinion, served hot. Get it.

Related Stories

The best of ZDNet, delivered

You have been successfully signed up. To sign up for more newsletters or to manage your account, visit the Newsletter Subscription Center.
Subscription failed.