Feeds

British Library wants taxpayer to gobble the web

Cost? We don't know

High performance access to file storage

British Library wants to archive the UK web, creating an invaluable national treasure trove of porn, celebrity trivia gossip and Daily Mail comments. But it admits it can't put a figure on the project - which looks like becoming a huge, open-ended commitment for the taxpayer.

Today the Library stepped up the pressure for the law to be changed, allowing copyright libraries to create copies of web material for research purposes of other copyright holders material. Five statutory libraries already have permission to make printed material available. Now the British Library says it wants the Web too.

"It's not a request for additional funding," a BL spokesperson said, but they couldn't say how much the creeping mission would end up costing us. At first, the BL won't archive every Tweet, but do an annual crawl, with some sites such as No 10 Downing Street archived more often. That would cost 220TB of data, it reckons about £4,000 in storage.

But that would barely make a dimple in a replica of UK web output, now that so many non-web chat areas have migrated to a home between angle brackets. The BL acknowledges there are eight million sites.

What, we wondered, was the point of archiving every single "Ashlee Cole iz a slag" typed into a browser?

"It may be that somebody wants to look back and research celebrity and this could be important to their research," we were told.

No doubt. But every Tweet and comment?

It was cheaper, the spokesman assured us, than employing a curator to choose between the best Ashley/Cheryl comments (for example).

Ah, right. So the mechanics dictate the curation policy.

But it was also fairer, he added, because the neutral, objective web bot couldn't be accused of bias. Even in momentous national conversations as the Cole divorce.

There are plenty of comments flying around this morning wondering why public money should be required to archive more than a handful of websites. Especially with Brewster Kahle's Archive.Org, which is privately funded.

At first the library told us the public was unaware that websites disappear without some part of the British state keeping a copy - an interesting claim. I've never met anyone who thinks all websites are preserved by some silent, omniscient backup programme.

Then the Library told us that the private sector couldn't be trusted to do the job, because future funding couldn't be assured. But with the British state in the red to the tune of £180bn this year, a defecit larger than Greece's in GDP terms (12.8 per cent), and frontline services such as nurses facing the chop, it's questionable whether anyone wants prefers to keep a copy of those Mail comments instead. ®

High performance access to file storage

More from The Register

next story
Audio fans, prepare yourself for the Second Coming ... of Blu-ray
High Fidelity Pure Audio – is this what your ears have been waiting for?
Dropbox defends fantastically badly timed Condoleezza Rice appointment
'Nothing is going to change with Dr. Rice's appointment,' file sharer promises
Nokia offers 'voluntary retirement' to 6,000+ Indian employees
India's 'predictability and stability' cited as mobe-maker's tax payment deadline nears
It may be ILLEGAL to run Heartbleed health checks – IT lawyer
Do the right thing, earn up to 10 years in clink
France bans managers from contacting workers outside business hours
«Email? Mais non ... il est plus tard que six heures du soir!»
Adrian Mole author Sue Townsend dies at 68
RIP Blighty's best-selling author of the 1980s
Zucker punched: Google gobbles Facebook-wooed Titan Aerospace
Up, up and away in my beautiful balloon flying broadband-bot
Analysts: Bright future for smartphones, tablets, wearables
There's plenty of good money to be made if you stay out of the PC market
Jeff Bezos reveals Amazon's brutal scale in annual letter
Bit-flipping retail mogul seems hybrid of Ford and Rockefeller
prev story

Whitepapers

Mainstay ROI - Does application security pay?
In this whitepaper learn how you and your enterprise might benefit from better software security.
Five 3D headsets to be won!
We were so impressed by the Durovis Dive headset we’ve asked the company to give some away to Reg readers.
3 Big data security analytics techniques
Applying these Big Data security analytics techniques can help you make your business safer by detecting attacks early, before significant damage is done.
The benefits of software based PBX
Why you should break free from your proprietary PBX and how to leverage your existing server hardware.
Mobile application security study
Download this report to see the alarming realities regarding the sheer number of applications vulnerable to attack, as well as the most common and easily addressable vulnerability errors.