Feeds

Sun packs 150 billion web pages into meat locker

Getting your arms around the internet

Next gen security for virtualised datacentres

If you believe the Gospel According to Robert J. Cringley, Google pilfered its top-secret modular data center from the Internet Archive.

In a now-famous 2005 online expose, Cringley puts Google co-founder Larry Page at a pitch meeting where the Internet Archive's Bruce Baumgart considers the advantages of stuffing a full-fledged data center into a shipping container. The Archive's "Petabyte Box" presentation is dated November 8, 2003, and on December 30, Google filed for a patent describing its own containerized data center.

Less than four years later, the patent was granted. And according to one former employee, it's now the norm for Google to erect its ultra-hot data centers by piecing together intermodal shipping containers pre-packed with servers and cooling equipment. Inside the Mountain View Chocolate Factory, Page and company call it Project Will Power.

The Internet Archive eventually built the Petabyte Box - though it shrunk the name a bit and stopped short of actually packing its compact contraption into a shipping container. A PetaBox planted at the San Francisco Presido has long hosted the Archive's Wayback Machine - an 150 billion-page web history dating back to 1996.

Wayback Machine - the container

Internet history in a box

Now, more than five years after first pitching the idea, the outfit that launched the container revolution has finally containerized itself. This morning, deep inside the sun-splashed Santa Clara campus of Sun Microsystems, Internet Archive founder Brewster Kahle cut the ribbon on a single Sun Modular Datacenter housing the entire Wayback Machine. That's thirteen years of archived web pages packed into a container significantly smaller than your living room.

"At a metaphysical level, what we're doing today is reconceptualizing what a computer is...We're reconceptualizing what a library is," said Kahle, the MIT-trained computer scientist who sold his Alexa web-ranking engine to Amazon before birthing the not-for-profit Internet Archive.

Wayback Machine - Greg Papadopoulos and Brewster Kahle

Inside the Wayback Machine with Sun's Greg Papadopoulos and the Internet Archive's Brewster Kahle

"You can actually take a tour of this data center and ask 'How big is the web?' You can ask 'How much does it weigh?' These are things you can actually wrap your hands around in a very literal way."

Secure remote control for conventional and virtual desktops

Next page: In the beginning...

More from The Register

next story
HP busts out new ProLiant Gen9 servers
Think those are cool? Wait till you get a load of our racks
Shoot-em-up: Sony Online Entertainment hit by 'large scale DDoS attack'
Games disrupted as firm struggles to control network
Like condoms, data now comes in big and HUGE sizes
Linux Foundation lights a fire under storage devs with new conference
Community chest: Storage firms need to pay open-source debts
Samba implementation? Time to get some devs on the job
Silicon Valley jolted by magnitude 6.1 quake – its biggest in 25 years
Did the earth move for you at VMworld – oh, OK. It just did. A lot
Forrester says it's time to give up on physical storage arrays
The physical/virtual storage tipping point may just have arrived
prev story

Whitepapers

5 things you didn’t know about cloud backup
IT departments are embracing cloud backup, but there’s a lot you need to know before choosing a service provider. Learn all the critical things you need to know.
Implementing global e-invoicing with guaranteed legal certainty
Explaining the role local tax compliance plays in successful supply chain management and e-business and how leading global brands are addressing this.
Backing up Big Data
Solving backup challenges and “protect everything from everywhere,” as we move into the era of big data management and the adoption of BYOD.
Consolidation: The Foundation for IT Business Transformation
In this whitepaper learn how effective consolidation of IT and business resources can enable multiple, meaningful business benefits.
High Performance for All
While HPC is not new, it has traditionally been seen as a specialist area – is it now geared up to meet more mainstream requirements?