Feeds

Yahoo! mocks Google Privacy Theatre

Less-nonsensical anonymization

Protecting against web application threats using SSL

Analysis The privacy gap between Yahoo! and Google is greater than you think. It's not just that Yahoo! will anonymize user search data 6 months before Google anonymizes user search data. It's that Yahoo! anonymization is less nonsensical than Google anonymization.

Today, as we dutifully reported, Yahoo! said it would anonymize user search data within a mere 90 days (with exceptions for fraud, security, and legal obligations). It even agreed to extend this unprecedented policy to page views, page clicks, ad views, and ad clicks.

Of course, anonymization is a meaningless word. But it would seem that Yahoo!'s use of the term isn't nearly as misleading as Google's. When Yahoo! says it will anonymize log data, it intends to:

  • Delete the final octet of the user's IP address
  • Run the user's Yahoo! ID through a one-way secret hash and delete the last 50 per cent of the hashed identifier
  • Run the user's cookie identifiers through a one-way secret hash
  • Filter all personally identifiable information - such as credit card numbers, social security numbers, and non-popular names - from search queries

In its lust for targeted advertising and who knows what else, Yahoo! has stopped short of true anonymization: deleting IPs, IDs, and cookie info entirely. Recreating this data isn't beyond the realm of possibility. But at Google, recreation is trivial.

The Mountain View Chocolate Factory says it will - at some unspecified point in the future - anonymize user data after nine months. But it takes some additional liberties with the word "anonymize".

With its nine-month anonymiztion, Google intends to "change some of the bits" in the user IPs stored on its servers. But that's it. The plan would leave cookie data alone.

And that means IPs are easily restored.

Google may erase certain IP bits on your nine-month-old search queries, but those bits will remain intact on newer queries - and both sets of queries will carry the same cookie info. Recovering the missing bits on older data is one-step process.

After 18 months, Google does alter cookie data - in some unspecified way. And the company argues that users have the power to scrub their own cookies before then. "We have focused on IP addresses, because we recognize that users cannot control IP addresses in logs," the company has told us. "On the other hand, users can control their cookies.

"When a user clears cookies, s/he will effectively break any link between the cleared cookie and our raw IP logs once those logs hit the 9-month anonymization point. Moreover, we are still continuing to focus on ways to help users exert better controls over their cookies."

Of course, most users don't even know what a cookie is.

Plus, Google has not said it will disassociate search queries from your Google ID - required for using Google services such as Gmail or Google Docs and Spreadsheets.

In September, Google also said it might tweak its nine-month policies. But today, in the email, the ad broker provided no update. At the moment, it's unclear when Google will even begin its nine-month IP doctoring.

But the company wants you to know it takes privacy very seriously. "We aim to strike the appropriate balance between protecting our users' privacy and offering them benefits of data retention, such as better security measures and new innovations," it said.

It did not mention advertising.

Yes, Yahoo! is balancing as well. But the wounded web portal has gone significantly further than Google to protect its users from hacks, subpoenas, and, yes, national security letters. The rub is that Yahoo! handles about 20 per cent of US search traffic - and Google commands 70. ®

The next step in data security

More from The Register

next story
New 'Cosmos' browser surfs the net by TXT alone
No data plan? No WiFi? No worries ... except sluggish download speed
'Windows 9' LEAK: Microsoft's playing catchup with Linux
Multiple desktops and live tiles in restored Start button star in new vids
iOS 8 release: WebGL now runs everywhere. Hurrah for 3D graphics!
HTML 5's pretty neat ... when your browser supports it
Mathematica hits the Web
Wolfram embraces the cloud, promies private cloud cut of its number-cruncher
Google extends app refund window to two hours
You now have 120 minutes to finish that game instead of 15
Intel: Hey, enterprises, drop everything and DO HADOOP
Big Data analytics projected to run on more servers than any other app
Mozilla shutters Labs, tells nobody it's been dead for five months
Staffer's blog reveals all as projects languish on GitHub
SUSE Linux owner Attachmate gobbled by Micro Focus for $2.3bn
Merger will lead to mainframe and COBOL powerhouse
prev story

Whitepapers

Providing a secure and efficient Helpdesk
A single remote control platform for user support is be key to providing an efficient helpdesk. Retain full control over the way in which screen and keystroke data is transmitted.
WIN a very cool portable ZX Spectrum
Win a one-off portable Spectrum built by legendary hardware hacker Ben Heck
Saudi Petroleum chooses Tegile storage solution
A storage solution that addresses company growth and performance for business-critical applications of caseware archive and search along with other key operational systems.
Protecting users from Firesheep and other Sidejacking attacks with SSL
Discussing the vulnerabilities inherent in Wi-Fi networks, and how using TLS/SSL for your entire site will assure security.
Security for virtualized datacentres
Legacy security solutions are inefficient due to the architectural differences between physical and virtual environments.