Feeds

Holy crap! EMC gives Vatican Library 2.8PB to store manuscripts

NOT on the cloud, we note....

Internet Security Threat Report 2014

The Vatican Library is losing its walls. Its 89,000 historic manuscripts are being made available online for access by scholars world-wide courtesy of EMC.

The library, properly known as the Vatican Apostolic Library, is located in the Vatican City and is one of the oldest libraries in the world, established formally in 1475 but thought to have functioned for a long time before that. The library's function is to be a resource for scholars researching history, law, philosophy, science and theology.

The Abyss of Hell

The Abyss of Hell by Sandro Botticelli in the Vatican Library

It stores some 89,000 manuscripts, including 8,900 incunabula, manuscripts printed before 1501 in Europe. The Vatican collection of these is the fourth largest in the world. It also holds some 5,000 Greek manuscripts, with authors such Homer, Sophocles, Plato, and Hippocrates, and which include New Testament manuscripts. Many of these Greek documents are decorated with Byzantine miniatures. Its most famous book is possibly the Codex Vaticanus Graecus 1209, the earliest almost complete bible which dates from the 4th century.

A great many manuscripts in the library have not been formally catalogued and so much remains to be discovered, such as, possibly, unknown historical texts by Aristotle or Cicero. Digitisation will aid this because more scholars will be able to access the library's resources.

The first digitisation plans were announced in 2012, and involved a million and a half pages of material, collaboration with the UK's Bodleian Library, and a grant of £2 million from the Polonsky Foundation in London. Now it's up to 40 million pages in the first 3-year phase of a 9-year project.

EMC Vatican Library Video still

EMC Vatican LIbrary Digitisation video

A manuscript or books could have up to 500 pages, with each page needing 150MB of storage. Altogether, according to a spokesperson in an EMC video about the project, 45PB of storage will be needed.

This first phase of the project lasts three years and involves 40 million pages. EMC Italy is sponsoring this phase as part of an Information Heritage Initiative. It's contributing 2.8PB of Atmos, Isilon, and VNX storage arrays, Networker backup software and DataDomain deduplicating backup storage, and working with its Italian partner DEDAGROUP ICT Network.

Gianni Camisa, the group's managing director, provided a quote for EMC's release which perhaps suffered from translation issues; "This is a highly complex project of immense cultural value. We are pleased to offer our expertise around dematerialisation to a complex project of such historical significance." Digitisation might be a better translation of the Italian word that gave rise to dematerialisation.

The Amos storage will be used for long term conservation with the Isilon arrays used for items needing fast access. Documents will be stored in an ISO-certifiable digital format to ensure, EMC says, future availability.

At the moment a maximum of 200 scholars at a time can physically be in and use the Vatican Library. By digitising the manuscripts this can be increased to a much higher number and the manuscripts themselves, many extremely old and very fragile, will not have to handled and put at risk. The Vatican Library will be able to better preserve its heritage by becoming a library without walls. ®

Beginner's guide to SSL certificates

More from The Register

next story
Docker's app containers are coming to Windows Server, says Microsoft
MS chases app deployment speeds already enjoyed by Linux devs
'Hmm, why CAN'T I run a water pipe through that rack of media servers?'
Leaving Las Vegas for Armenia kludging and Dubai dune bashing
'Urika': Cray unveils new 1,500-core big data crunching monster
6TB of DRAM, 38TB of SSD flash and 120TB of disk storage
Facebook slurps 'paste sites' for STOLEN passwords, sprinkles on hash and salt
Zuck's ad empire DOESN'T see details in plain text. Phew!
SDI wars: WTF is software defined infrastructure?
This time we play for ALL the marbles
Windows 10: Forget Cloudobile, put Security and Privacy First
But - dammit - It would be insane to say 'don't collect, because NSA'
Oracle hires former SAP exec for cloudy push
'We know Larry said cloud was gibberish, and insane, and idiotic, but...'
prev story

Whitepapers

Forging a new future with identity relationship management
Learn about ForgeRock's next generation IRM platform and how it is designed to empower CEOS's and enterprises to engage with consumers.
Why cloud backup?
Combining the latest advancements in disk-based backup with secure, integrated, cloud technologies offer organizations fast and assured recovery of their critical enterprise data.
Win a year’s supply of chocolate
There is no techie angle to this competition so we're not going to pretend there is, but everyone loves chocolate so who cares.
High Performance for All
While HPC is not new, it has traditionally been seen as a specialist area – is it now geared up to meet more mainstream requirements?
Intelligent flash storage arrays
Tegile Intelligent Storage Arrays with IntelliFlash helps IT boost storage utilization and effciency while delivering unmatched storage savings and performance.