Feeds

Google Apps sics crawlers on public docs and sheets

Beware what you publish

Security for virtualized datacentres

Google will soon allow search engines to crawl and index documents, spreadsheets, and presentations published to the web via its online office suite, Google Apps.

On Friday, in a letter to Google Apps users, the web giant informed users the change would arrive "in a few weeks." This was confirmed by a Google spokeswoman in an email to The Reg, who pointed out that on the Google Apps "help center" site, the company says the change is no more than a fortnight away.

"We will be launching a change for published docs. The change will allow published docs that are linked to from a public website to be crawled and indexed, which means they can appear in search results you see on Google.com and other search engines," Google says.

This only applies to files explicitly published using the suite's "publish as web page" or "publish/embed" options and linked to from a public webpage. This does not apply to files shared via the "Allow anyone with the link to view (no sign-in required)" option, which provides for document sharing without links to the public web.

Google warns that if you don't want your publicly-published documents crawled, you can de-publish them. Instructions for de-publishing are here.

At the help center, one Google Apps user has asked if - in light of the change - the company could provide a clear indication of which apps are public and which are not. "I think this makes it very important that you bring back the indication on the docs listing of those files that are published," the user says. "Maybe a separate label/folder of published docs/spreadsheets?"

Indeed, as it stands, Google Apps master view does not tell you which docs are publicly published and which aren't. ®

Choosing a cloud hosting partner with confidence

More from The Register

next story
New 'Cosmos' browser surfs the net by TXT alone
No data plan? No WiFi? No worries ... except sluggish download speed
'Windows 9' LEAK: Microsoft's playing catchup with Linux
Multiple desktops and live tiles in restored Start button star in new vids
iOS 8 release: WebGL now runs everywhere. Hurrah for 3D graphics!
HTML 5's pretty neat ... when your browser supports it
Mathematica hits the Web
Wolfram embraces the cloud, promies private cloud cut of its number-cruncher
Google extends app refund window to two hours
You now have 120 minutes to finish that game instead of 15
Intel: Hey, enterprises, drop everything and DO HADOOP
Big Data analytics projected to run on more servers than any other app
Mozilla shutters Labs, tells nobody it's been dead for five months
Staffer's blog reveals all as projects languish on GitHub
SUSE Linux owner Attachmate gobbled by Micro Focus for $2.3bn
Merger will lead to mainframe and COBOL powerhouse
iOS 8 Healthkit gets a bug SO Apple KILLS it. That's real healthcare!
Not fit for purpose on day of launch, says Cupertino
prev story

Whitepapers

Providing a secure and efficient Helpdesk
A single remote control platform for user support is be key to providing an efficient helpdesk. Retain full control over the way in which screen and keystroke data is transmitted.
WIN a very cool portable ZX Spectrum
Win a one-off portable Spectrum built by legendary hardware hacker Ben Heck
Saudi Petroleum chooses Tegile storage solution
A storage solution that addresses company growth and performance for business-critical applications of caseware archive and search along with other key operational systems.
Protecting users from Firesheep and other Sidejacking attacks with SSL
Discussing the vulnerabilities inherent in Wi-Fi networks, and how using TLS/SSL for your entire site will assure security.
Security for virtualized datacentres
Legacy security solutions are inefficient due to the architectural differences between physical and virtual environments.