Google boffins beat own Captchas

The StreetView numbers game

A group of Google scientists working on extracting numbers from StreetView images has discovered that their technology can also match humans at solving captchas.

The aim, according to their research paper (published on arXiv), was to automatically extract accurate street number data from StreetView images so as to improve Google Maps location information.

Prior work, the researchers write, extracted individual digits from an image, identified each one, and then reassembled the whole street number. This, however, is inefficient, so the group led by Ian Goodfellow focussed on taking an entire image and identifying all the numbers in it.

Testing their model on Google's StreetView House Numbers dataset (which contains 200,000 numbers), the researchers found they were able to match human accuracy of 98 per cent with “95.64 per cent coverage”.

To achieve that accuracy, the researchers spent six days training Google's DistBelief neural network modelling framework. The trained model was then applied to all the house numbers Google holds – well into the tens of millions. The less constrained dataset reduced coverage to 89 per cent while holding accuracy at the 98 per cent “equal to a human” threshold.
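
For readers wondering how accuracy and coverage trade off, here is a minimal sketch. It assumes that "coverage" means the share of images the model is confident enough to transcribe on its own, with the rest left for a human. That reading is an assumption, as are the function name and the sample data, which are made up for illustration and are not taken from the paper.

```python
from typing import List, Tuple


def coverage_at_accuracy(preds: List[Tuple[float, bool]], target: float) -> float:
    """Largest fraction of inputs that can be kept (most confident first)
    while accuracy on the kept inputs stays at or above `target`.

    `preds` holds (confidence, was_correct) pairs. This is an illustrative
    helper, not the researchers' code; ties in confidence are not treated
    specially.
    """
    ranked = sorted(preds, key=lambda p: p[0], reverse=True)
    best = 0.0
    correct = 0
    for kept, (_, ok) in enumerate(ranked, start=1):
        correct += ok  # True counts as 1, False as 0
        if correct / kept >= target:
            best = kept / len(ranked)
    return best


# Made-up predictions: (model confidence, transcription was right)
sample = [(0.99, True), (0.95, True), (0.90, True), (0.60, False), (0.55, True)]
print(coverage_at_accuracy(sample, 0.9))  # prints 0.6
```

In this toy case the model can answer three of the five images unaided before its accuracy drops below 90 per cent, so coverage at that accuracy is 60 per cent. The figures in the article would be read the same way under this assumption: 98 per cent accuracy on the 95.64 per cent of images the model chooses to answer.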

Image: Google's Street View number recognition. Getting numbers out of images is easy, says Google

The same model was then tested against Google's reCAPTCHA puzzle, achieving 99.8 per cent accuracy. The researchers write that while this doesn't render Captchas useless, “the utility of distorted text as a reverse turing test by itself is significantly diminished”.

As Google's Vinay Shet writes in a blog post, “the act of typing in the answer to a distorted image should not be the only factor when it comes to determining a human versus a machine”. Google itself is reducing its “dependence on text distortions as the main differentiator between human and machine,” using Captchas instead to “perform advanced risk analysis”. ®
