Feeds

Scroogle busted again after Google tweak

Private Google scraper on fourth life

Boost IT visibility and business value

Update Scroogle is once again offline after Google made yet another change to the auxiliary web interface the not-for-profit uses to serve up a privacy-friendly version of Mountain View's search engine.

"Here we go again," Scroogle founder Daniel Brandt says in a post to the site. "We regret to announce that Google changed their output format once again." As he looks for a solution, Brandt tells users to "check back in a day or two." He did not immediately respond to a request for comment.

Brandt and Scroogle (www.scroogle.org) have been scraping Google search results since 2002, letting privacy-conscious netizens search Google without being tracked by the web giant. But this past May, after Google removed an interface page where Brandt was scraping results, the service went offline. It returned the next day, when Brandt tapped a different interface, but this interface soon disappeared as well.

Brandt recovered again. But now Scroogle is down for a third time. "The last time this happened was in July, and we were down for five days," he says. "During that time we looked for the simplest remaining Google format we could find, reprogrammed our parser, and ended up with something that worked. However, the file we fetched from Google was three times more bloated for the same information, as compared to the previous format we used, and we are still not happy about this.

"Now it looks like even more bloat. We have to take a closer look at the new format and see if we can program around it."

According to Brandt, his service requires a Google interface that serves up Google results in a relatively simple format. Scroogle is run entirely on donations. There are no ads.

Scroogle originally scraped results from an interface at google.com/ie, which Google built for use inside the sidebar offered by Internet Explorer 6. But as Google killed its support for IE6, it snuffed this interface. Brandt replicated the setup through another page — google.com/search — by adding an IE parameter ("&output=ie") to the url. But then this option vanished as well. And so on.

Google has never attempted to shutdown Scroogle directly. But historically, it seems, there have been conflicting options within the company over how to treat Brandt and Scroogle. Nowadays, the site itself does turn up in Google's search results. ®

Update

Scroogle is now back online. "I have to tolerate more bloat, and there are occasional screw-ups in the parsing...But it is usable," Brandt tells us. "Google's coding gets more complex as the files get more bloated. I'm not sure it's worth trying to clean up what I'm showing now, because I have little faith that Scroogle can last much longer."

Next gen security for virtualised datacentres

More from The Register

next story
6 Obvious Reasons Why Facebook Will Ban This Article (Thank God)
Clampdown on clickbait ... and El Reg is OK with this
No, thank you. I will not code for the Caliphate
Some assignments, even the Bongster decline must
Fast And Furious 6 cammer thrown in slammer for nearly three years
Man jailed for dodgy cinema recording of Hollywood movie
Caught red-handed: UK cops, PCSOs, specials behaving badly… on social media
No Mr Fuzz, don't ask a crime victim to be your pal on Facebook
Barnes & Noble: Swallow a Samsung Nook tablet, please ... pretty please
Novelslab finally on sale with ($199 - $20) price tag
Ballmer leaves Microsoft board to spend more time with his b-balls
From Clippy to Clippers: Hi, I see you're running an NBA team now ...
Video of US journalist 'beheading' pulled from social media
Yanked footage featured British-accented attacker and US journo James Foley
Assange™: Hey world, I'M STILL HERE, ignore that Snowden guy
Press conference: ME ME ME ME ME ME ME (cont'd pg 94)
Call of Duty daddy considers launching own movie studio
Activision Blizzard might like quality control of a CoD film
prev story

Whitepapers

Implementing global e-invoicing with guaranteed legal certainty
Explaining the role local tax compliance plays in successful supply chain management and e-business and how leading global brands are addressing this.
5 things you didn’t know about cloud backup
IT departments are embracing cloud backup, but there’s a lot you need to know before choosing a service provider. Learn all the critical things you need to know.
Why and how to choose the right cloud vendor
The benefits of cloud-based storage in your processes. Eliminate onsite, disk-based backup and archiving in favor of cloud-based data protection.
Top 8 considerations to enable and simplify mobility
In this whitepaper learn how to successfully add mobile capabilities simply and cost effectively.
High Performance for All
While HPC is not new, it has traditionally been seen as a specialist area – is it now geared up to meet more mainstream requirements?