The Register® — Biting the hand that feeds IT

Feeds

Scroogle busted again after Google tweak

Private Google scraper on fourth life

Regcast training : Hyper-V 3.0, VM high availability and disaster recovery

Update Scroogle is once again offline after Google made yet another change to the auxiliary web interface the not-for-profit uses to serve up a privacy-friendly version of Mountain View's search engine.

"Here we go again," Scroogle founder Daniel Brandt says in a post to the site. "We regret to announce that Google changed their output format once again." As he looks for a solution, Brandt tells users to "check back in a day or two." He did not immediately respond to a request for comment.

Brandt and Scroogle (www.scroogle.org) have been scraping Google search results since 2002, letting privacy-conscious netizens search Google without being tracked by the web giant. But this past May, after Google removed an interface page where Brandt was scraping results, the service went offline. It returned the next day, when Brandt tapped a different interface, but this interface soon disappeared as well.

Brandt recovered again. But now Scroogle is down for a third time. "The last time this happened was in July, and we were down for five days," he says. "During that time we looked for the simplest remaining Google format we could find, reprogrammed our parser, and ended up with something that worked. However, the file we fetched from Google was three times more bloated for the same information, as compared to the previous format we used, and we are still not happy about this.

"Now it looks like even more bloat. We have to take a closer look at the new format and see if we can program around it."

According to Brandt, his service requires a Google interface that serves up Google results in a relatively simple format. Scroogle is run entirely on donations. There are no ads.

Scroogle originally scraped results from an interface at google.com/ie, which Google built for use inside the sidebar offered by Internet Explorer 6. But as Google killed its support for IE6, it snuffed this interface. Brandt replicated the setup through another page — google.com/search — by adding an IE parameter ("&output=ie") to the url. But then this option vanished as well. And so on.

Google has never attempted to shutdown Scroogle directly. But historically, it seems, there have been conflicting options within the company over how to treat Brandt and Scroogle. Nowadays, the site itself does turn up in Google's search results. ®

Update

Scroogle is now back online. "I have to tolerate more bloat, and there are occasional screw-ups in the parsing...But it is usable," Brandt tells us. "Google's coding gets more complex as the files get more bloated. I'm not sure it's worth trying to clean up what I'm showing now, because I have little faith that Scroogle can last much longer."

Cloud storage: Lower cost and increase uptime

Anonymous Coward

Yes

It's news because plenty of people find it to be a valuable service.

25
1

@MacRat: Fail indeed

Because scroogle is not "some random website." Scroogle is probably the best known, long established Google scraper. It also makes a strong point of being a non-commercial protest/public interest site regarding individual privacy on the Internet. Whether you think they are completely full of themselves or not, they are far more relevant than "some random website."

-d

11
1

Ttir,amcla/od

"Some random website can't scrap from Google.

Why is this news?"

This post made me wish there was a Let Me Scroogle That For You site.

Honestly, how uninformed do you need to be about a subject to stop yourself making a post about it...

4
0

More from The Register

Thanks, NSA: Amazon sales of Orwell's 1984 rise 9,500%
Citizens of Oceania bone up on the new reality
Microsoft to open Windows Stores inside 600 Best Buy locations
Product showcases 'must be seen to be believed'
 breaking news
Author Iain (M) Banks falls to cancer at 59
Misses the release of his final work
 breaking news
What did the Lehman Brothers implosion look like to a techie?
Insider tells all about the Gnab Gib at Lehmans
It's official: 'tweet' an English word – not just in the avian sense
If the Oxford English Dictionary says it is so, then it is so
 breaking news
The only Waze is Google: Ad giant tipped to gobble map app 'for $1.3bn'
Pac-Man-satnav-ish upstart in bidding war with Apple, Facebook
 breaking news
1-in-10 e-tomes 'are self-published'... most are 'rubbish' says book ed
Publishing man scoffs at go-it-alone writers, ursines still fouling in forests
 breaking news
Facebook RSS reader said to uncloak June 20
Secret event scooped by Scottish developer?