Feeds

Amazon cloud still on fritz after 36 hours

'All hands on deck'

Internet Security Threat Report 2014

Amazon's cloud is still on the fritz, a day and a half after the company first reported connection problems, latency issues, and increased error rates across the service. But on Friday morning, the company said that full service should be restored for a "majority" of users by the afternoon Pacific time.

"We continue to see progress in recovering volumes, and have heard many additional customers confirm that they're recovering. Our current estimate is that the majority of volumes will be recovered over the next 5 to 6 hours," the company said in a post to its Amazon Web Services status page.

In some cases, Amazon said, it will take longer to restore data. With these volumes, the company is having to restore backups it made to its own S3 online storage service on Thursday.

The problems began in the early hours of Thursday morning Pacific time. At 1:41 am, Amazon said on its status page that it was investigating connectivity issues with its EC2 (Elastic Compute Cloud) service, which provides on-demand access to processing power via the web. The outage brought down several websites that run atop the service, including Quora, Sencha, Reddit, and FourSquare.

The outage also affected Amazon's Elastic Block Store, Relational Database Service, and Elastic Beanstalk services. And according to one post from the company, it all began with a "networking event" that triggered a large amount of re-mirroring of EBS volumes in the "East region" of Amazon Web Services. Amazon divides its so-called infrastructure cloud service into multiple geographic regions, and it guarantees 99.95 per cent availability within each region.

Some regions, including the East region, are divided into multiple "availability zones". For years, Amazon has said that these zones are "insulated" from each other's failures. But yesterday's outage spread across zones in the East region. Amazon has never said how these zones are designed. It's unclear whether they're locations in separate data centers or not.

"We can assure you that all-hands are on deck to recover as quickly as possible," the company said late last night. ®

Update

Amazon has now said that a majority of volumes have indeed been restored. "These volumes were recovered by ~1:30pm PDT," the company said at 2:15pm Pacific time. "We mentioned that a 'smaller number of volumes will require a more time consuming process to recover, and we anticipate that those will take longer to recover.' We're now starting to work on those.'"

Beginner's guide to SSL certificates

More from The Register

next story
Docker's app containers are coming to Windows Server, says Microsoft
MS chases app deployment speeds already enjoyed by Linux devs
'Hmm, why CAN'T I run a water pipe through that rack of media servers?'
Leaving Las Vegas for Armenia kludging and Dubai dune bashing
SDI wars: WTF is software defined infrastructure?
This time we play for ALL the marbles
'Urika': Cray unveils new 1,500-core big data crunching monster
6TB of DRAM, 38TB of SSD flash and 120TB of disk storage
Facebook slurps 'paste sites' for STOLEN passwords, sprinkles on hash and salt
Zuck's ad empire DOESN'T see details in plain text. Phew!
Windows 10: Forget Cloudobile, put Security and Privacy First
But - dammit - It would be insane to say 'don't collect, because NSA'
Oracle hires former SAP exec for cloudy push
'We know Larry said cloud was gibberish, and insane, and idiotic, but...'
Symantec backs out of Backup Exec: Plans to can appliance in Jan
Will still provide support to existing customers
prev story

Whitepapers

Forging a new future with identity relationship management
Learn about ForgeRock's next generation IRM platform and how it is designed to empower CEOS's and enterprises to engage with consumers.
Why cloud backup?
Combining the latest advancements in disk-based backup with secure, integrated, cloud technologies offer organizations fast and assured recovery of their critical enterprise data.
Win a year’s supply of chocolate
There is no techie angle to this competition so we're not going to pretend there is, but everyone loves chocolate so who cares.
High Performance for All
While HPC is not new, it has traditionally been seen as a specialist area – is it now geared up to meet more mainstream requirements?
Intelligent flash storage arrays
Tegile Intelligent Storage Arrays with IntelliFlash helps IT boost storage utilization and effciency while delivering unmatched storage savings and performance.