Feeds

Tumblr tumbles down, stays there for 24 hours

Maintenance was planned, cluster f*ck less so

Security for virtualized datacentres

Tumblr has blamed a database problem for an outage that left the popular microblogging service largely unavailable for more than 24 hours.

The service was restored on Monday afternoon, allowing users to resume their postings of pictures of cats and musings on life, as is the local custom.

In an update, Tumblr founder David Karp apologised for the outage, which he blamed on a technical glitch.

Yesterday afternoon, during planned maintenance that was not intended to interrupt service, an issue arose that took down a critical database cluster. This brought down our entire network while our engineers worked feverishly to restore these databases and bring your blogs back online.

Karp admitted the Sunday/Monday outage was just the most serious glitch in a larger series of service problems that the micro-blogging platform has experienced of late. He said that the site had quadrupled its engineering team and was in the process of rolling out a more distributed architecture in a bid to make it more robust.

Website availability problems in the absence of a denial of service attack can usually be traced back to one of three problems or some combination thereof: insufficient bandwidth, poor code or not enough server horsepower to cope with demand. Failure to run a distributed system with well designed failover and backup invites trouble, especially for high demand sites, as Tumblr discovered this week.

Last month 4chan and Tumblr users engaged in an online spat that escalated to involve denial of service attacks on each side. Tumblr came off worse in the rumpus, which was apparently triggered by accusations that Tumblr users were stealing jokes from 4chan without crediting the source. ®

Choosing a cloud hosting partner with confidence

More from The Register

next story
New 'Cosmos' browser surfs the net by TXT alone
No data plan? No WiFi? No worries ... except sluggish download speed
'Windows 9' LEAK: Microsoft's playing catchup with Linux
Multiple desktops and live tiles in restored Start button star in new vids
iOS 8 release: WebGL now runs everywhere. Hurrah for 3D graphics!
HTML 5's pretty neat ... when your browser supports it
Mathematica hits the Web
Wolfram embraces the cloud, promies private cloud cut of its number-cruncher
Google extends app refund window to two hours
You now have 120 minutes to finish that game instead of 15
Intel: Hey, enterprises, drop everything and DO HADOOP
Big Data analytics projected to run on more servers than any other app
Mozilla shutters Labs, tells nobody it's been dead for five months
Staffer's blog reveals all as projects languish on GitHub
SUSE Linux owner Attachmate gobbled by Micro Focus for $2.3bn
Merger will lead to mainframe and COBOL powerhouse
iOS 8 Healthkit gets a bug SO Apple KILLS it. That's real healthcare!
Not fit for purpose on day of launch, says Cupertino
prev story

Whitepapers

Providing a secure and efficient Helpdesk
A single remote control platform for user support is be key to providing an efficient helpdesk. Retain full control over the way in which screen and keystroke data is transmitted.
WIN a very cool portable ZX Spectrum
Win a one-off portable Spectrum built by legendary hardware hacker Ben Heck
Saudi Petroleum chooses Tegile storage solution
A storage solution that addresses company growth and performance for business-critical applications of caseware archive and search along with other key operational systems.
Protecting users from Firesheep and other Sidejacking attacks with SSL
Discussing the vulnerabilities inherent in Wi-Fi networks, and how using TLS/SSL for your entire site will assure security.
Security for virtualized datacentres
Legacy security solutions are inefficient due to the architectural differences between physical and virtual environments.