Feeds

Microsoft finally cuts Bing data retention time to six months

Anonymise this!

Internet Security Threat Report 2014

Microsoft has finally slashed the amount of time it keeps some online search query data to just six months, over a year after it declared it would make the change if the likes of Google and Yahoo! agreed to play ball.

The company’s privacy chief Peter Cullen said late yesterday that Microsoft planned to implement the changes to its data retention policy over the next 12 to 18 months.

“We will delete the entire Internet Protocol address associated with search queries at six months rather than at 18 months,” he said.

“This new and significant step will be incorporated into our existing privacy practices, which already provide strong protections for Bing users.”

In December 2008 Microsoft said it supported the Article 29 Working Party’s guidelines for anonymisation on the web, before adding that such rules could only be adopted if they were introduced industry-wide.

The Article 29 Working Party is a group of European Union bureaucrats who have been pushing to get search engine firms to purge their user records after six months.

Under Microsoft’s previous policy, the software vendor claimed it took steps to “de-identify” the data by cutting it loose from account information that could uncover the person who performed the search in Bing.

However, the remaining data were left to languish online for 18 months before MS droids finally deleted the IP address, dumped the de-identified cookie ID and any other cross-session IDs associated with the query.

Cullen said Microsoft had no plans to change the fundamentals of that policy. However, the firm will start to delete IP addresses associated with Bing search queries after the data has been available online for six months.

All of which isn't a million miles away from Google’s current lukewarmish approach to anonymising an individual’s search data online.

Redmond will similarly leave the de-identified cookie and cross-session IDs intact, but after 18 months it claimed it will suck all the data out of the intertubes for good.

In September 2008 Google agreed to half the amount of time it retained IP addresses and user data garnered from search query logs.

At the time, the internet kingpin said it would anonymise IP addresses on its server logs after nine months “to address regulatory concerns to take another step to improve privacy for our users”.

But Mountain View later admitted to El Reg that it would only "change some of the bits" in the user IPs stored in its server logs, while leaving the all-important cookie data alone.

“There are many good reasons to retain and review search data. Studying trends in search queries enables us to improve the quality of our results, protect against fraud and maintain a secure and viable business,” said Cullen, who echoed Google’s previous justification for keeping the data online.

“But consumer privacy can and must be preserved. For our part, Microsoft continues to examine our practices to ensure we strike the right balance and achieving [sic] both goals.” ®

Intelligent flash storage arrays

More from The Register

next story
Webcam hacker pervs in MASS HOME INVASION
You thought you were all alone? Nope – change your password, says ICO
You really need to do some tech support for Aunty Agnes
Free anti-virus software, expires, stops updating and p0wns the world
USB coding anarchy: Consider all sticks licked
Thumb drive design ruled by almighty buck
Attack reveals 81 percent of Tor users but admins call for calm
Cisco Netflow a handy tool for cheapskate attackers
Privacy bods offer GOV SPY VICTIMS a FREE SPYWARE SNIFFER
Looks for gov malware that evades most antivirus
Patch NOW! Microsoft slings emergency bug fix at Windows admins
Vulnerability promotes lusers to domain overlords ... oops
prev story

Whitepapers

Choosing cloud Backup services
Demystify how you can address your data protection needs in your small- to medium-sized business and select the best online backup service to meet your needs.
Getting started with customer-focused identity management
Learn why identity is a fundamental requirement to digital growth, and how without it there is no way to identify and engage customers in a meaningful way.
High Performance for All
While HPC is not new, it has traditionally been seen as a specialist area – is it now geared up to meet more mainstream requirements?
Choosing a cloud hosting partner with confidence
Download Choosing a Cloud Hosting Provider with Confidence to learn more about cloud computing - the new opportunities and new security challenges.
Storage capacity and performance optimization at Mizuno USA
Mizuno USA turn to Tegile storage technology to solve both their SAN and backup issues.