Feeds

Copyfraud: Poisoning the public domain

How web giants are stealing the future of knowledge

High performance access to file storage

Special report The public domain is the greatest resource in human history: eventually all knowledge will become part of it. Its riches serve all mankind, but it faces a new threat. Vast libraries of public domain works are being plundered by claims of "copyright". It's called copyfraud - and we'll discover how large corporations like Google, Yahoo, and Amazon have structured their businesses to assist it and profit from it.

Copyfraud first came to my attention nearly two years ago in my scholarly research. As Google Books began releasing massive numbers of newly scanned public domain books, Japanese Studies scholars excitedly announced their discoveries. I found links to books by one of the first American journalists to arrive in Japan, Lafcadio Hearn. Scans of his books from the 1890s onward - with their eccentric letterpress typography and gold foil stamped leather covers - give us the details of presentation and are the minutiae that launched a thousand dissertations. This is just the sort of material that excites scholars like me; we examine his books to see how the West first came to understand Japan.

But Hearn's books also have commercial value as public domain works. Hearn's book of traditional Japanese ghost stories, Kwaidan was released as a film in 1964 and was nominated for an Academy Award; a modern film homage appeared in 2007. Hearn profited by recording, translating, and publishing a book of traditional folk tales. When his book lapsed into the public domain, it was used again as source material for a copyrighted commercial derivative work on film. This recycling of ancient oral tradition by Hearn helped propel a generation of big-budget horror films, becoming a profit centre for movie studios in both America and Japan.

Upon hearing the news of Hearn's work appearing on Google Books, I searched for Hearn's classic Glimpses of an Unfamiliar Japan. Although a freely downloadable scan of Volume 2 was available, only a limited preview edition of Volume 1 from Kessinger Publishing was available, under a 2004 "copyright". The preview edition was marked "Copyrighted material" on every page. Many pages were omitted, the book is nothing but an excerpt.

Another public domain classic disappears. But why?

Kessinger made the document useless to scholars, to force them to purchase the full hardcopy edition for $25. Links on the Google Books page directed purchasers to the Kessinger edition on Amazon.com and other online booksellers. Scholars were outraged. These works are clearly in the public domain, dating back to the 1890s and beyond.

When questioned, Google said it "must err on the side of caution... until we have determined that the book has entered the public domain." But with the sheer volume of ebooks being submitted by outside publishers, there are obvious delays in clearing rights. Some publishers have exploited this gap, providing copyfraud editions where no free edition was available.

Google suggested waiting patiently, as surely these missing volumes would eventually be scanned and added to its free online library. But the practice began to spread. Other publishers followed Kessinger's model and several more copyfraud editions of Volume 1 followed. Nearly two years later, a freely downloadable scan of Volume 1 finally appeared. Some publishers had used the two year gap to profit.

We can see the publishers' motivation. Their business model has been ruined. There is a long tradition of small publishers selling facsimiles of rare public domain books. They were a precursor of the print-on-demand online model, collecting and curating large libraries of obscure, disreputable, and uncommercial works on topics like occultism, homeopathy, military history, and crackpot cancer cures, and selling them to niche markets. But with the advent of Google Books, their market has evaporated: the books are no longer scarce. So they fought back the only way they knew how, by exploiting the gaps in Google's book coverage.

High performance access to file storage

More from The Register

next story
Android engineer: We DIDN'T copy Apple OR follow Samsung's orders
Veep testifies for Samsung during Apple patent trial
One year on: diplomatic fail as Chinese APT gangs get back to work
Mandiant says past 12 months shows Beijing won't call off its hackers
EFF: Feds plan to put 52 MILLION FACES into recognition database
System would identify faces as part of biometrics collection
Big Content goes after Kim Dotcom
Six studios sling sueballs at dead download destination
Alphadex fires back at British Gas with overcharging allegation
Brit colo outfit says it paid for 347KVA, has been charged for 1940KVA
Jack the RIPA: Blighty cops ignore law, retain innocents' comms data
Prime minister: Nothing to see here, go about your business
Singapore decides 'three strikes' laws are too intrusive
When even a prurient island nation thinks an idea is dodgy it has problems
Banks slap Olympus with £160 MEEELLION lawsuit
Scandal hit camera maker just can't shake off its past
France bans managers from contacting workers outside business hours
«Email? Mais non ... il est plus tard que six heures du soir!»
Reprieve for Weev: Court disowns AT&T hacker's conviction
Appeals court strikes down landmark sentence
prev story

Whitepapers

Mainstay ROI - Does application security pay?
In this whitepaper learn how you and your enterprise might benefit from better software security.
Five 3D headsets to be won!
We were so impressed by the Durovis Dive headset we’ve asked the company to give some away to Reg readers.
3 Big data security analytics techniques
Applying these Big Data security analytics techniques can help you make your business safer by detecting attacks early, before significant damage is done.
The benefits of software based PBX
Why you should break free from your proprietary PBX and how to leverage your existing server hardware.
Mobile application security study
Download this report to see the alarming realities regarding the sheer number of applications vulnerable to attack, as well as the most common and easily addressable vulnerability errors.