Feeds

Amazon web services customers vent spleen

Cloud giant says sorry for outage and resulting issues

7 Elements of Radically Simple OS Migration

Vulnerabilities in Amazon's web services that were exposed after lightning hit power supplies at the weekend have led to stinging criticism from some customers.

The bolt knocked out the utility and back-up generators in Dublin, causing a blackout which took down the Elastic Cloud Compute (EC2) and Relational Database Services (RDS).

Efforts to bring EC2 back online were delayed as the Elastic Block Storage (EBS) servers required manual operations before they could restore customer volumes, while making extra copies of data sucked up capacity, meaning it needed to find extra juice elsewhere.

Amazon said on Monday it would resolve the process in 48 hours but wrote to customers yesterday informing them it had discovered an error in EBS software which "incorrectly deleted" one or more blocks when cleaning snapshots.

"The root cause was a software error that caused the snapshot references to a subset of blocks to be missed during the reference counting process," the company said.

Snapshots containing the missing blocks were disabled and copies of affected snapshots have replaced the empty blocks.

"We apologise for any potential impact this might have on your applications," Amazon said.

On its services health board today, the US firm described its EC2 services as still having connectivity issues.

One customer said his data contained significant numbers of "trashed blocks". Fortunately he had migrated to a virtual server at another hosting firm, so he deleted the snapshots and breathed "a sigh of relief", but warned it could have been worse if he had relied on EBS snapshot for backups.

"This just goes to confirm my own assessment, which is that AWS is not suitable for small-scale deployments. The economies of scale and price/performance just don't work at the low end.

"They are much more suitable for large scale deployments where service provision and backups can be split across multiple availability zones. It also serves as a reminder not to put all one's eggs into one basket," he said.

Another agreed, "There is clearly a massive defect in the multi-availability zone products."

The master database instances hosted in areas unaffected by the Dublin incident were "taken down by their slaves being hosted inside the affected zone", said another source.

One customer summarised the situation as "if you stored your backup in the cloud too, [Amazon] hosed it".

Best practices for enterprise data

More from The Register

next story
Sysadmin Day 2014: Quick, there's still time to get the beers in
He walked over the broken glass, killed the thugs... and er... reconnected the cables*
VMware builds product executables on 50 Mac Minis
And goes to the Genius Bar for support
Multipath TCP speeds up the internet so much that security breaks
Black Hat research says proposed protocol will bork network probes, flummox firewalls
Auntie remains MYSTIFIED by that weekend BBC iPlayer and website outage
Still doing 'forensics' on the caching layer – Beeb digi wonk
Microsoft's Euro cloud darkens: US FEDS can dig into foreign servers
They're not emails, they're business records, says court
Microsoft says 'weird things' can happen during Windows Server 2003 migrations
Fix coming for bug that makes Kerberos croak when you run two domain controllers
Cisco says network virtualisation won't pay off everywhere
Another sign of strain in the Borg/VMware relationship?
prev story

Whitepapers

7 Elements of Radically Simple OS Migration
Avoid the typical headaches of OS migration during your next project by learning about 7 elements of radically simple OS migration.
Implementing global e-invoicing with guaranteed legal certainty
Explaining the role local tax compliance plays in successful supply chain management and e-business and how leading global brands are addressing this.
Consolidation: The Foundation for IT Business Transformation
In this whitepaper learn how effective consolidation of IT and business resources can enable multiple, meaningful business benefits.
Solving today's distributed Big Data backup challenges
Enable IT efficiency and allow a firm to access and reuse corporate information for competitive advantage, ultimately changing business outcomes.
A new approach to endpoint data protection
What is the best way to ensure comprehensive visibility, management, and control of information on both company-owned and employee-owned devices?