Feeds

British Library wants taxpayer to gobble the web

Cost? We don't know

The Essential Guide to IT Transformation

British Library wants to archive the UK web, creating an invaluable national treasure trove of porn, celebrity trivia gossip and Daily Mail comments. But it admits it can't put a figure on the project - which looks like becoming a huge, open-ended commitment for the taxpayer.

Today the Library stepped up the pressure for the law to be changed, allowing copyright libraries to create copies of web material for research purposes of other copyright holders material. Five statutory libraries already have permission to make printed material available. Now the British Library says it wants the Web too.

"It's not a request for additional funding," a BL spokesperson said, but they couldn't say how much the creeping mission would end up costing us. At first, the BL won't archive every Tweet, but do an annual crawl, with some sites such as No 10 Downing Street archived more often. That would cost 220TB of data, it reckons about £4,000 in storage.

But that would barely make a dimple in a replica of UK web output, now that so many non-web chat areas have migrated to a home between angle brackets. The BL acknowledges there are eight million sites.

What, we wondered, was the point of archiving every single "Ashlee Cole iz a slag" typed into a browser?

"It may be that somebody wants to look back and research celebrity and this could be important to their research," we were told.

No doubt. But every Tweet and comment?

It was cheaper, the spokesman assured us, than employing a curator to choose between the best Ashley/Cheryl comments (for example).

Ah, right. So the mechanics dictate the curation policy.

But it was also fairer, he added, because the neutral, objective web bot couldn't be accused of bias. Even in momentous national conversations as the Cole divorce.

There are plenty of comments flying around this morning wondering why public money should be required to archive more than a handful of websites. Especially with Brewster Kahle's Archive.Org, which is privately funded.

At first the library told us the public was unaware that websites disappear without some part of the British state keeping a copy - an interesting claim. I've never met anyone who thinks all websites are preserved by some silent, omniscient backup programme.

Then the Library told us that the private sector couldn't be trusted to do the job, because future funding couldn't be assured. But with the British state in the red to the tune of £180bn this year, a defecit larger than Greece's in GDP terms (12.8 per cent), and frontline services such as nurses facing the chop, it's questionable whether anyone wants prefers to keep a copy of those Mail comments instead. ®

Build a business case: developing custom apps

More from The Register

next story
iPad? More like iFAD: We reveal why Apple fell into IBM's arms
But never fear fanbois, you're still lapping up iPhones, Macs
Amazon says Hachette should lower ebook prices, pay authors more
Oh yeah ... and a 30% cut for Amazon to seal the deal
Philip K Dick 'Nazi alternate reality' story to be made into TV series
Amazon Studios, Ridley Scott firm to produce The Man in the High Castle
Nintend-OH NO! Sorry, Mario – your profits are in another castle
Red-hatted mascot, red-colored logo, red-stained finance books
Sonos AXES support for Apple's iOS4 and 5
Want to use your iThing? You can't - it's too old
Joe Average isn't worth $10 a year to Mark Zuckerberg
The Social Network deflates the PC resurgence with mobile-only usage prediction
Chips are down at Broadcom: Thousands of workers laid off
Cellphone baseband device biz shuttered
Feel free to BONK on the TUBE, says Transport for London
Plus: Almost NOBODY uses pay-by-bonk on buses - Visa
Twitch rich as Google flicks $1bn hitch switch, claims snitch
Gameplay streaming biz and search king refuse to deny fresh gobble rumors
Stick a 4K in them: Super high-res TVs are DONE
4,000 pixels is niche now... Don't say we didn't warn you
prev story

Whitepapers

Implementing global e-invoicing with guaranteed legal certainty
Explaining the role local tax compliance plays in successful supply chain management and e-business and how leading global brands are addressing this.
Boost IT visibility and business value
How building a great service catalog relieves pressure points and demonstrates the value of IT service management.
Why and how to choose the right cloud vendor
The benefits of cloud-based storage in your processes. Eliminate onsite, disk-based backup and archiving in favor of cloud-based data protection.
The Essential Guide to IT Transformation
ServiceNow discusses three IT transformations that can help CIO's automate IT services to transform IT and the enterprise.
Maximize storage efficiency across the enterprise
The HP StoreOnce backup solution offers highly flexible, centrally managed, and highly efficient data protection for any enterprise.