Feeds

Web 0.2 archivists save Geocities from deletion

Preserving history one hideous webpage at a time

Security for virtualized datacentres

A group of web preservationists called the Archive Team is trying to save most of Geocities for the ages before Yahoo! erases the beloved old-school web-hosting service from the face of the internet.

In honor of the dearly departing web host, we'll continue in a more suitable format:

Welcome to my Geocities story!!!

  This news is under construction!   

Archive Team boss Jason Scott recently detailed on his blog about the team's newest project to download Geocities for posterity after Yahoo!'s announcement that it's pulling the plug on the web community "later this year."

The group's stated goal is to save websites or data that's in danger of being lost - and certainly Geocities is a resource worthy of preservation if there ever was one. Nearly two decades worth of blinking text, animated gifs, fanfiction, and broken links are at risk of disappearing with the blink of the eye. This is the personal internet young, raw and blemished - before big blogging services and social networking sites arrived to completely homogenize the space.

From Scott's webpage:

We've been downloading at an enormous rate, probably along the lines of a gigabyte a half-hour of Geocities, through all our different vectors.

Because we're talking literally millions of files with an average size of 1 to 30 kilobytes, it becomes harder and harder to get a "big picture" view of everything we've grabbed, but after 48 hours of work, Archive Team has saved over 200,000 Geocities sites. We're now pulling in new sites at the rate of something like 5 a second. Is that fast enough? We'll see, won't we.



Scott wrote that the team believes that it's sucked up nearly every site on Geocities from 1999 and before - at least those that still exist. Unfortunately, the Archive Team found that Yahoo apparently quietly purged a lot of Geocities "neighborhoods" (subdomains like http://www.geocities.com/RainForest/) completely, including WallStreet and NorthPole. Poor Santa probably never knew what hit him.

Thoroughly archiving Geocities is the team's current priority, Scott wrote. Making the data available takes a back seat.

"People who have been talking about copyright and stuff seem to think I'm going to sell it or take credit or some crap," Scott wrote. He added that there's no plans on releasing the data, but he'll "make sure people can get it, somehow."

Check out the Archive Team here, or even offer some help on their noble project. ®

This page best viewed with Netscape

Beginner's guide to SSL certificates

More from The Register

next story
Bono apologises for iTunes album dump
Megalomania, generosity and FEAR of irrelevance drove group to Apple deal
Facebook, Apple: LADIES! Why not FREEZE your EGGS? It's on the company!
No biological clockwatching when you work in Silicon Valley
Arab States make play for greater government control of the internet
Nerds told to get lost in last-minute power grab bid at UN meeting
Apple SILENCES Bose, YANKS headphones from stores
The, er, Beats go on after noise-cancelling spat
Doctor Who's Flatline: Cool monsters, yes, but utterly limp subplots
We know what the Doctor does, stop going on about it already
Zippy one-liners, broken promises: Doctor Who on the Orient Express
Series finally hits stride, but Clara's U-turn is baffling
Don't bother telling people if you lose their data, say Euro bods
You read that right – with the proviso that it's encrypted
America's super-secret X-37B plane returns to Earth after nearly TWO YEARS aloft
674 days in space for US Air Force's mystery orbital vehicle
10 Top Tips For PRs Considering Whether To Phone The Register
You'll Read These And LOL Even Though They're Serious
prev story

Whitepapers

Forging a new future with identity relationship management
Learn about ForgeRock's next generation IRM platform and how it is designed to empower CEOS's and enterprises to engage with consumers.
Win a year’s supply of chocolate
There is no techie angle to this competition so we're not going to pretend there is, but everyone loves chocolate so who cares.
Why cloud backup?
Combining the latest advancements in disk-based backup with secure, integrated, cloud technologies offer organizations fast and assured recovery of their critical enterprise data.
High Performance for All
While HPC is not new, it has traditionally been seen as a specialist area – is it now geared up to meet more mainstream requirements?
Saudi Petroleum chooses Tegile storage solution
A storage solution that addresses company growth and performance for business-critical applications of caseware archive and search along with other key operational systems.