The Register® — Biting the hand that feeds IT

Comments on: Google free again to pump out porn thumbnails

Hoist by their own petard 

Posted Thursday 17th May 2007 10:45 GMT

Google says it honours crawler controls such as robots.txt. If Perfect 10 was serious about preventing their images appearing on Google they should have taken the obvious step of putting crawler control files on their website.

It appears that they didn't do that. If they had they'd have sued Google for telling porkies about robot exclusion and won the case.

Of course, installing controls would have also hidden their site from surfers cruising Google for pr0n. This explains why they probably didn't install them.

To me this case sounds like pure greed by Perfect 10. They were trying to have their cake and eat it too.

mod re-write .htaccess type solution thing 

Posted Thursday 17th May 2007 11:35 GMT

Not 100% sure about this but I think that server trickery allows for a certain level of control to what content is fetched by a third party server or bot.

Can one, for example, include a rule that any image that is requested by Google be replaced by one depicting test shouting 'yo! keep yer hands off our pr0n!'?

Trickery will get you nowhere 

Posted Thursday 17th May 2007 12:18 GMT

"server trickery allows for a certain level of control"

A server can return something different when it detects it is googlebot that is making the request. But google occasionally checks anonymously to ensure that the server is returning the same content to googlebot as it returns to ordinary browsers -- and at that time, google will rightly demote the site for misrepresentation.

robots.txt 

Posted Thursday 17th May 2007 12:45 GMT

You're thinking of robots.txt which tells search engine crawlers like Google's where they're not welcome. Though .htaccess can be used (and indeed, should always be used) to block bots that ignore robots.txt among many, many other features like preventing image hotlinking, blocking by referrer, etc. etc.

Facts, get them before commenting 

Posted Saturday 19th May 2007 13:58 GMT

The case has nothing to do with robots.txt. Perfect 10 sued because all the pictures on google that are on the image search are pirated images. I am sure perfect 10 wouldn't mind google sending traffic its way, however, all the traffic google would create goes to some sites that stole the pictures and are giving the pictures away for free. And I am sure they sent notices, and never removed them.

ok, well, yeah 

Posted Wednesday 23rd May 2007 16:35 GMT

"The case has nothing to do with robots.txt."

The article seems to suggest otherwise.

Another thought, if I may.

Porn sites in the US are now required to keep a record / proof of the models they publish pics / vids of. Secondary producers (eg. affiliates) are also required to hold this info (I think).

Shouldn't Google to the same then for each XXX pic is displays?

Don’t Miss