Amazon: Some data won't be recovered after cloud outage

Post mortem wait

Amazon says that about 0.07 per cent of the EBS storage volumes in the East Region of its infrastructure cloud are not "fully recoverable" following the extended outage that hit the service last Thursday.

The company has yet to fully explain the cause of the outage, but it still plans to publish a "post mortem" on the incident. "We are digging deeply into the root causes of this event," the company says in a post to its Amazon Web Services status dashboard.

In the early hours Pacific time on Thursday, Amazon said on its status page that it was investigating connectivity issues with its EC2 (Elastic Compute Cloud) service, which provides on-demand access to processing power via the web. The outage brought down many websites that run atop the service, including Quora, Sencha, Reddit, and FourSquare. According to one of the brief status messages from Amazon, the problem began with a "network event" that caused the service to re-mirror a large number of Elastic Block Store volumes in its East Region.

Amazon divides its "infrastructure cloud" into multiple geographic regions, and it guarantees 99.95 per cent availability within each region if you're using multiple "availability zones". Some regions – including the East Region, served up from Northern Virginia – are divided into these ostensibly separate zones, and Amazon has always said that these zones are "insulated" from each other's failures. But the East Region outage spread across multiple zones.
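To put the 99.95 per cent figure in perspective, the arithmetic can be sketched as below. This is a rough illustration only: the 365-day measurement period is an assumption made here, not a detail given in this story, and the function name is hypothetical.

```python
def allowed_downtime_hours(availability_pct: float, period_hours: float = 365 * 24) -> float:
    """Hours of downtime permitted in a period at a given availability percentage.

    period_hours defaults to a 365-day year, which is an assumption for
    illustration and not taken from Amazon's terms.
    """
    return period_hours * (1 - availability_pct / 100)


if __name__ == "__main__":
    # 99.95 per cent availability over an assumed 365-day year
    print(round(allowed_downtime_hours(99.95), 2))  # prints 4.38
```

On that assumption, the guarantee leaves room for a little under four and a half hours of downtime a year.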

On Sunday, the company said that a "majority" of affected EBS volumes had been restored, but that it needed more time to restore data for some customers. But on Monday, it announced that some volumes would not be restored. "We have completed our remaining recovery efforts and though we've recovered nearly all of the stuck volumes, we've determined that a small number of volumes (0.07% of the volumes in our US-East Region) will not be fully recoverable," the company said.

It is in the process of contacting these customers.

For many – including Thorsten von Eicken, CTO of RightScale, an EC2 management service, and the employees of Scalr, an open source platform similar to RightScale – one of the chief problems is that Amazon has so far provided very little information about the outage. Amazon has never said how its "availability zones" are designed. We await the post mortem with bated breath. ®

Update: This story has been updated to provide more detail on Amazon's uptime guarantee for EC2.
