The Register® — Biting the hand that feeds IT

Feeds

Amazon's weekend cloud outage highlights EBS problems

The red-headed stepchild of Bezos & Co's cloud just can't keep up

Free ESG report : Seamless data management with Avere FXT

Problems in the Amazon cloud over the weekend crushed apps like Vine, websites like Airbnb, and numerous other services that depend on Bezos & Co's hulking cloud, and the problems were due to a familiar culprit – Elastic Block Store (EBS).

EBS is a network-attached block level storage service for Amazon EC2 instances. Amazon says it is "suited for applications that require a database, file system, or access to raw block level storage," – in other words, everything.

Sunday's failure marked the third significant outage in two years to come about from EBS failures, and brought to mind the characterization of EBS as "a barrel of laughs in terms of performance and reliability" by a former Reddit sysadmin after a major outage in April 2011.

The problems on Sunday were acknowledged by Amazon in a post to the company's status dashboard at 1:22pm Pacific Time, when the company said it was "investigating degraded performance for some volumes in a single [Availability Zone] in the US-EAST-1 Region."

Amazon found that the problem was a network issue that led to elevated EBS-related API error rates in a single region. "The networking device was removed from service and we are performing a forensic investigation to understand how it failed," the company wrote.

Besides the 2011 incident, EBS also went down in December 2012. In the wake of that outage, one EBS-reliant company named Awe.sm wrote that "to maintain high uptime, we have stopped trusting EBS." Awe.sm added that in its experience, input-output rates on EBS volumes were poor, that when it fails it tends to fail across an entire data center cluster, and that if it goes down when connected to an image when running Ubuntu it fails severely.

Given the outage during the weekend just gone, cloud-first businesses might want to start looking at EBS and working out how to design their systems around potential failures in Amazon's data center hubs. ®

5 ways to reduce advertising network latency

Whitepapers

5 ways to reduce advertising network latency
Implementing the tactics laid out in this whitepaper can help reduce your overall advertising network latency.
Supercharge your infrastructure
Fusion­‐io has developed a shared storage solution that provides new performance management capabilities required to maximize flash utilization.
Avere FXT with FlashMove and FlashMirror
This ESG Lab validation report documents hands-on testing of the Avere FXT Series Edge Filer with the AOS 3.0 operating environment.
Reg Reader Research: SaaS based Email and Office Productivity Tools
Read this Reg reader report which provides advice and guidance for SMBs towards the use of SaaS based email and Office productivity tools.
Email delivery: 4 steps to get more email to the inbox
This whitepaper lists some steps and information that will give you the best opportunity to achieve an amazing sender reputation.

More from The Register

next story
Dedupe-dedupe, dedupe-dedupe-dedupe: Flashy clients crowd around Permabit diamond
3 of the top six flash vendors are casing the OEM dedupe tech, claims analyst
Disk-pushers, get reel: Even GOOGLE relies on tape
Prepare to be beaten by your old, cheap rival
Dragons' Den star's biz Outsourcery sends yet more millions up in smoke
Telly moneybags went into the cloud and still nobody's making any profit
Hong Kong's data centres stay high and dry amid Typhoon Usagi
180 km/h winds kill 25 in China, but the data centres keep humming
Microsoft lures punters to hybrid storage cloud with free storage arrays
Spend on Azure, get StorSimple box at the low, low price of $0
WD unveils new MyBook line: External drives now bigger... and CHEAP
Less than £0.04/GB, but it loses the Thunderbolt speed
VMware vSAN test pilots: Don't panic but there's a chance of DATA LOSS
AHCI SATA controller won't play nice with Virtzilla's robo-storage beta
prev story