Sun packs 150 billion web pages into meat locker

Getting your arms around the internet

If you believe the Gospel According to Robert X. Cringely, Google pilfered its top-secret modular data center from the Internet Archive.

In a now-famous 2005 online exposé, Cringely puts Google co-founder Larry Page at a pitch meeting where the Internet Archive's Bruce Baumgart considers the advantages of stuffing a full-fledged data center into a shipping container. The Archive's "Petabyte Box" presentation is dated November 8, 2003, and on December 30 of that year, Google filed for a patent describing its own containerized data center.

Less than four years later, the patent was granted. And according to one former employee, it's now the norm for Google to erect its ultra-hot data centers by piecing together intermodal shipping containers pre-packed with servers and cooling equipment. Inside the Mountain View Chocolate Factory, Page and company call it Project Will Power.

The Internet Archive eventually built the Petabyte Box - though it shrank the name a bit and stopped short of actually packing its compact contraption into a shipping container. A PetaBox planted at the San Francisco Presidio has long hosted the Archive's Wayback Machine - a 150 billion-page web history dating back to 1996.

Internet history in a box

Now, more than five years after first pitching the idea, the outfit that launched the container revolution has finally containerized itself. This morning, deep inside the sun-splashed Santa Clara campus of Sun Microsystems, Internet Archive founder Brewster Kahle cut the ribbon on a single Sun Modular Datacenter housing the entire Wayback Machine. That's thirteen years of archived web pages packed into a container significantly smaller than your living room.

"At a metaphysical level, what we're doing today is reconceptualizing what a computer is... We're reconceptualizing what a library is," said Kahle, the MIT-trained computer scientist who sold his Alexa web-ranking engine to Amazon before birthing the not-for-profit Internet Archive.

Inside the Wayback Machine with Sun's Greg Papadopoulos and the Internet Archive's Brewster Kahle

"You can actually take a tour of this data center and ask 'How big is the web?' You can ask 'How much does it weigh?' These are things you can actually wrap your hands around in a very literal way."
