Sun packs 150 billion web pages into meat locker

Getting your arms around the internet

If you believe the Gospel According to Robert X. Cringely, Google pilfered its top-secret modular data center from the Internet Archive.

In a now-famous 2005 online exposé, Cringely puts Google co-founder Larry Page at a pitch meeting where the Internet Archive's Bruce Baumgart considers the advantages of stuffing a full-fledged data center into a shipping container. The Archive's "Petabyte Box" presentation is dated November 8, 2003, and on December 30, Google filed for a patent describing its own containerized data center.

Less than four years later, the patent was granted. And according to one former employee, it's now the norm for Google to erect its ultra-hot data centers by piecing together intermodal shipping containers pre-packed with servers and cooling equipment. Inside the Mountain View Chocolate Factory, Page and company call it Project Will Power.

The Internet Archive eventually built the Petabyte Box - though it shrank the name a bit and stopped short of actually packing its compact contraption into a shipping container. A PetaBox planted at the San Francisco Presidio has long hosted the Archive's Wayback Machine - a 150 billion-page web history dating back to 1996.

Internet history in a box

Now, more than five years after first pitching the idea, the outfit that launched the container revolution has finally containerized itself. This morning, deep inside the sun-splashed Santa Clara campus of Sun Microsystems, Internet Archive founder Brewster Kahle cut the ribbon on a single Sun Modular Datacenter housing the entire Wayback Machine. That's thirteen years of archived web pages packed into a container significantly smaller than your living room.

"At a metaphysical level, what we're doing today is reconceptualizing what a computer is... We're reconceptualizing what a library is," said Kahle, the MIT-trained computer scientist who sold his Alexa web-ranking engine to Amazon before birthing the not-for-profit Internet Archive.

Inside the Wayback Machine with Sun's Greg Papadopoulos and the Internet Archive's Brewster Kahle

"You can actually take a tour of this data center and ask 'How big is the web?' You can ask 'How much does it weigh?' These are things you can actually wrap your hands around in a very literal way."
