Google boffins beat own Captchas

The StreetView numbers game

A group of Google scientists working on extracting numbers from StreetView images has discovered that their technology can also match humans at solving captchas.

The aim, according to their research paper (published on arXiv), was to automatically extract accurate street numbers from StreetView images so as to improve Google Maps location information.

Prior work, the researchers write, extracted individual digits from an image, identified each one, and then reassembled the whole street number. This, however, is inefficient, so the group led by Ian Goodfellow focussed on taking an entire image and identifying all the digits in it.

Testing their model on Google's StreetView House Numbers dataset (which contains 200,000 numbers), the researchers found they were able to match human accuracy of 98 per cent with “95.64 per cent coverage”.

To achieve that accuracy, the researchers spent six days training their model on Google's DistBelief neural network framework. The trained model was then applied to all the house numbers Google holds – well into the tens of millions. The less constrained dataset reduced coverage to 89 per cent while holding accuracy at the 98 per cent “equal to a human” threshold.
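For readers wondering how "coverage" relates to accuracy, here is a minimal sketch of one common reading of the term: the model only answers when its confidence clears a threshold, and coverage is the largest share of inputs it can answer while accuracy on those answers stays at or above the target. The function name `coverage_at_accuracy` and the toy data are illustrative assumptions, not taken from the paper or from Google's code.

```python
from typing import List, Tuple


def coverage_at_accuracy(
    predictions: List[Tuple[float, bool]], target_accuracy: float
) -> float:
    """Largest fraction of inputs that can be answered while accuracy
    on the answered inputs stays >= target_accuracy.

    Each prediction is (confidence, was_correct). Hypothetical helper,
    written for illustration only.
    """
    # Answer the most confident predictions first.
    ranked = sorted(predictions, key=lambda p: p[0], reverse=True)
    total = len(ranked)
    best = 0.0
    correct = 0
    for index, (confidence, was_correct) in enumerate(ranked):
        correct += was_correct
        answered = index + 1
        # A confidence threshold cannot split tied scores, so only
        # evaluate once every prediction with this confidence is included.
        end_of_tie = answered == total or ranked[answered][0] < confidence
        if end_of_tie and correct / answered >= target_accuracy:
            best = answered / total
    return best


# Toy data: five predictions, one of them wrong at confidence 0.80.
toy = [(0.99, True), (0.95, True), (0.90, True), (0.80, False), (0.60, True)]
print(coverage_at_accuracy(toy, 0.98))
print(coverage_at_accuracy(toy, 0.75))
```

On the toy data, demanding 98 per cent accuracy means the model has to stop before the wrong answer, so it covers only three of five inputs. Relaxing the target to 75 per cent lets it answer everything. The same trade-off would explain why a harder, less constrained dataset lowers coverage at a fixed accuracy.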

Google's Street View number recognition

Getting numbers out of images is easy, says Google

The same model was then tested against Google's reCAPTCHA puzzle, achieving 99.8 per cent accuracy. The researchers write that while this doesn't render Captchas useless, “the utility of distorted text as a reverse turing test by itself is significantly diminished”.

As Google's Vinay Shet writes in a blog post, “the act of typing in the answer to a distorted image should not be the only factor when it comes to determining a human versus a machine”, and Google itself is reducing its “dependence on text distortions as the main differentiator between human and machine,” using Captchas instead to “perform advanced risk analysis”. ®
