Feeds

Web 0.2 archivists save Geocities from deletion

Preserving history one hideous webpage at a time

The essential guide to IT transformation

A group of web preservationists called the Archive Team is trying to save most of Geocities for the ages before Yahoo! erases the beloved old-school web-hosting service from the face of the internet.

In honor of the dearly departing web host, we'll continue in a more suitable format:

Welcome to my Geocities story!!!

  This news is under construction!   

Archive Team boss Jason Scott recently detailed on his blog about the team's newest project to download Geocities for posterity after Yahoo!'s announcement that it's pulling the plug on the web community "later this year."

The group's stated goal is to save websites or data that's in danger of being lost - and certainly Geocities is a resource worthy of preservation if there ever was one. Nearly two decades worth of blinking text, animated gifs, fanfiction, and broken links are at risk of disappearing with the blink of the eye. This is the personal internet young, raw and blemished - before big blogging services and social networking sites arrived to completely homogenize the space.

From Scott's webpage:

We've been downloading at an enormous rate, probably along the lines of a gigabyte a half-hour of Geocities, through all our different vectors.

Because we're talking literally millions of files with an average size of 1 to 30 kilobytes, it becomes harder and harder to get a "big picture" view of everything we've grabbed, but after 48 hours of work, Archive Team has saved over 200,000 Geocities sites. We're now pulling in new sites at the rate of something like 5 a second. Is that fast enough? We'll see, won't we.



Scott wrote that the team believes that it's sucked up nearly every site on Geocities from 1999 and before - at least those that still exist. Unfortunately, the Archive Team found that Yahoo apparently quietly purged a lot of Geocities "neighborhoods" (subdomains like http://www.geocities.com/RainForest/) completely, including WallStreet and NorthPole. Poor Santa probably never knew what hit him.

Thoroughly archiving Geocities is the team's current priority, Scott wrote. Making the data available takes a back seat.

"People who have been talking about copyright and stuff seem to think I'm going to sell it or take credit or some crap," Scott wrote. He added that there's no plans on releasing the data, but he'll "make sure people can get it, somehow."

Check out the Archive Team here, or even offer some help on their noble project. ®

This page best viewed with Netscape

Gartner critical capabilities for enterprise endpoint backup

More from The Register

next story
6 Obvious Reasons Why Facebook Will Ban This Article (Thank God)
Clampdown on clickbait ... and El Reg is OK with this
Mozilla's 'Tiles' ads debut in new Firefox nightlies
You can try turning them off and on again
No, thank you. I will not code for the Caliphate
Some assignments, even the Bongster decline must
Barnes & Noble: Swallow a Samsung Nook tablet, please ... pretty please
Novelslab finally on sale with ($199 - $20) price tag
Banking apps: Handy, can grab all your money... and RIDDLED with coding flaws
Yep, that one place you'd hoped you wouldn't find 'em
TROLL SLAYER Google grabs $1.3 MEEELLION in patent counter-suit
Chocolate Factory hits back at firm for suing customers
Primetime precrime? Minority Report TV series 'being developed'
I have to know. I have to find out what happened to my life
Netflix swallows yet another bitter pill, inks peering deal with TWC
Net neutrality crusader once again pays up for priority access
prev story

Whitepapers

Top 10 endpoint backup mistakes
Avoid the ten endpoint backup mistakes to ensure that your critical corporate data is protected and end user productivity is improved.
Implementing global e-invoicing with guaranteed legal certainty
Explaining the role local tax compliance plays in successful supply chain management and e-business and how leading global brands are addressing this.
Backing up distributed data
Eliminating the redundant use of bandwidth and storage capacity and application consolidation in the modern data center.
The essential guide to IT transformation
ServiceNow discusses three IT transformations that can help CIOs automate IT services to transform IT and the enterprise
Next gen security for virtualised datacentres
Legacy security solutions are inefficient due to the architectural differences between physical and virtual environments.