Feeds

Googlebooks crusade captures CAPTCHA king

Fights spam. Pumps OCR

Boost IT visibility and business value

Google has acquired reCAPTCHA, a free CAPTCHA service that also serves as a means of digitizing printed books and newspapers. Among other things, the Mountain View web giant is looking to juice its ever-controversial library-scanning Book Search project.

Google announced the acquisition this morning with a post to the Official Google Blog, and it couldn't help but trumpet the news with, yes, a CAPTCHA:

Google Acquires ReCaptcha

"The image above is a CAPTCHA — you can read it, but computers have a harder time interpreting the letters. We tried to make it hard for computers to recognize because we wanted to give humans the scoop first, but we're happy to announce to everybody now that Google has acquired reCAPTCHA, a company that provides CAPTCHAs to help protect more than 100,000 websites from spam and fraud," the post reads.

But its not just spam and fraud protection that interests the Mountain View Chocolate Factory. ReCAPTCHA is also a way for Google to improve the OCR (optical character recognition) technology it uses to digitize printed materials for both its Book Search and News Archive Search services.

In providing websites with CAPTCHAs - visual Turing tests that separate humans from machines - reCAPTCHA often includes text scanned from books and newspapers that can't be read with OCR. It pairs this unknown text with a recognized word or phrase. Website visitors are asked to read both words, and if they get the known word correct, ReCaptchas can assume they also read the unknown text correctly.

ReCAPTCHA - a Pittsburgh, Pennsylvania-based outfit that spun off from research originated at Carnegie Mellon University - is currently helping the New York Times to digitize its archive.

Luis von Ahn, the reCAPTCHA founder who co-authored Google's blog post, is one of the Carnegie Mellon researchers who coined the term CAPTCHA, short for Completely Automated Public Turing test to tell Computers and Humans Apart. ReCAPTCHAs first hit the web in 2007, and Ahn founded the company in 2008. The Carnege Mellon assistant computer science professor has not responded to our request for comment.

"Google is the best fit for reCAPTCHA," reads a canned statement from von Ahn tucked into a press release. "From the very start, people often assumed the project was connected to Google, so it only makes sense that reCAPTCHA Inc. ultimately would find a home within Google."

Von Ahn will remain on the Carnegie Mellon computer science faculty, but he will also work at Google's Pittsburgh engineering office, which is on the university's campus. In the press release, he indicated that reCAPTCHA aleady has close ties with Google. In 2006, the company licensed an Ahn-developed game for use in its Google Image Labeler. Terms of Google's acquisiton were not disclosed. ®

Build a business case: developing custom apps

More from The Register

next story
BBC goes offline in MASSIVE COCKUP: Stephen Fry partly muzzled
Auntie tight-lipped as major outage rolls on
iPad? More like iFAD: We reveal why Apple fell into IBM's arms
But never fear fanbois, you're still lapping up iPhones, Macs
Nadella: Apps must run on ALL WINDOWS – PCs, slabs and mobes
Phone egg, meet desktop chicken - your mother
White? Male? You work in tech? Let us guess ... Twitter? We KNEW it!
Grim diversity numbers dumped alongside Facebook earnings
Microsoft: We're making ONE TRUE WINDOWS to rule us all
Enterprise, Windows still power firm's shaky money-maker
HP, Microsoft prove it again: Big Business doesn't create jobs
SMEs get lip service - what they need is dinner at the Club
ITC: Seagate and LSI can infringe Realtek patents because Realtek isn't in the US
Land of the (get off scot) free, when it's a foreign owner
Dude, you're getting a Dell – with BITCOIN: IT giant slurps cryptocash
1. Buy PC with Bitcoin. 2. Mine more coins. 3. Goto step 1
There's NOTHING on TV in Europe – American video DOMINATES
Even France's mega subsidies don't stop US content onslaught
prev story

Whitepapers

Top three mobile application threats
Prevent sensitive data leakage over insecure channels or stolen mobile devices.
Implementing global e-invoicing with guaranteed legal certainty
Explaining the role local tax compliance plays in successful supply chain management and e-business and how leading global brands are addressing this.
Top 8 considerations to enable and simplify mobility
In this whitepaper learn how to successfully add mobile capabilities simply and cost effectively.
Application security programs and practises
Follow a few strategies and your organization can gain the full benefits of open source and the cloud without compromising the security of your applications.
The Essential Guide to IT Transformation
ServiceNow discusses three IT transformations that can help CIO's automate IT services to transform IT and the enterprise.