Feeds

It came from the vaults! Google seeks to open the library

Digitization project raises hopes and concerns

  • alert
  • submit to reddit

The essential guide to IT transformation

In what could be a historic move in the history of the internet, Google has announced arrangements with Harvard University, and a handful of public libraries, to digitize parts of their valuable collections and make them available over the public web. Yahoo!, Grokker and Microsoft are working on similar ventures.

Google Print Library, as it's called, will take many years to complete its first phase, and like the others, faces tremendous hurdles. Copyright and licensing issues remain a huge obstacle; the ontological expertise remains the domain of information professionals; and as a monopoly gateway to the world's information, no private corporation can expect to evade regulatory concerns. And lazy governments, both central and local, could find use it as an excuse to axe what commitments they have to making high quality information available. Any of these issues could hobble the venture, providing a service that's as useful as the fake cardboard book-props one can buy by the yard to fill an empty study bookcase. But as a statement of intent, such ventures deserve to be taken seriously.

Google will co-operate in scanning and digitizing works with major academic libraries and make them searchable. The results will be displayed using Google Print - which uses DRM to restrict the viewing and printing of copyright material - and display links to either commercial booksellers such as Amazon.com or, using Open Worldcat metadata, provide information where to find it at your local library. Initial partners include Harvard, with 15 million books, Oxford's Bodleian Library, Stanford and Michigan University, where the scanning of seven million books is expected to take six years. Google won't at first offer advertisements on Print Library, although there's plenty of scope for this to change. For example: Do you want fries with your burgher?.

At ResourceShelf, Gary Price has a roundup of other digitization projects, and librarian Steve Cohen offers a few notes of caution. Google will need to improve on the brute force text search algorithms it uses today, he notes, and "libraries should be pushing their own materials through their websites rather than having to 'rely' on Google to do so".

The promise of universal access to data repeated over a decade of internet hype has not been fulfilled, and the role of librarians as information professionals has been consistently undervalued - something, we suspect, to do with the adolescent hostility to expertise that characterizes so much internet evangelism. Which in turn, probably has a lot to do with the internet's libertarian backgrounds. Whether the private sector succeeds after a decade of failure in overcoming copyright interests remains to be seen, and whether it can be trusted to do so is another question. We'll certain need the librarians, to keep the Microsofts and Googles both honest and effective. ®

Next gen security for virtualised datacentres

More from The Register

next story
6 Obvious Reasons Why Facebook Will Ban This Article (Thank God)
Clampdown on clickbait ... and El Reg is OK with this
No, thank you. I will not code for the Caliphate
Some assignments, even the Bongster decline must
Kaspersky backpedals on 'done nothing wrong, nothing to fear' blather
Founder (and internet passport fan) now says privacy is precious
TROLL SLAYER Google grabs $1.3 MEEELLION in patent counter-suit
Chocolate Factory hits back at firm for suing customers
Mozilla's 'Tiles' ads debut in new Firefox nightlies
You can try turning them off and on again
Primetime precrime? Minority Report TV series 'being developed'
I have to know. I have to find out what happened to my life
Sit tight, fanbois. Apple's '$400' wearable release slips into early 2015
Sources: time to put in plenty of clock-watching for' iWatch
prev story

Whitepapers

5 things you didn’t know about cloud backup
IT departments are embracing cloud backup, but there’s a lot you need to know before choosing a service provider. Learn all the critical things you need to know.
Implementing global e-invoicing with guaranteed legal certainty
Explaining the role local tax compliance plays in successful supply chain management and e-business and how leading global brands are addressing this.
Backing up Big Data
Solving backup challenges and “protect everything from everywhere,” as we move into the era of big data management and the adoption of BYOD.
Consolidation: The Foundation for IT Business Transformation
In this whitepaper learn how effective consolidation of IT and business resources can enable multiple, meaningful business benefits.
High Performance for All
While HPC is not new, it has traditionally been seen as a specialist area – is it now geared up to meet more mainstream requirements?