Feeds

Amazon cloud still on fritz after 36 hours

'All hands on deck'

Secure remote control for conventional and virtual desktops

Amazon's cloud is still on the fritz, a day and a half after the company first reported connection problems, latency issues, and increased error rates across the service. But on Friday morning, the company said that full service should be restored for a "majority" of users by the afternoon Pacific time.

"We continue to see progress in recovering volumes, and have heard many additional customers confirm that they're recovering. Our current estimate is that the majority of volumes will be recovered over the next 5 to 6 hours," the company said in a post to its Amazon Web Services status page.

In some cases, Amazon said, it will take longer to restore data. With these volumes, the company is having to restore backups it made to its own S3 online storage service on Thursday.

The problems began in the early hours of Thursday morning Pacific time. At 1:41 am, Amazon said on its status page that it was investigating connectivity issues with its EC2 (Elastic Compute Cloud) service, which provides on-demand access to processing power via the web. The outage brought down several websites that run atop the service, including Quora, Sencha, Reddit, and FourSquare.

The outage also affected Amazon's Elastic Block Store, Relational Database Service, and Elastic Beanstalk services. And according to one post from the company, it all began with a "networking event" that triggered a large amount of re-mirroring of EBS volumes in the "East region" of Amazon Web Services. Amazon divides its so-called infrastructure cloud service into multiple geographic regions, and it guarantees 99.95 per cent availability within each region.

Some regions, including the East region, are divided into multiple "availability zones". For years, Amazon has said that these zones are "insulated" from each other's failures. But yesterday's outage spread across zones in the East region. Amazon has never said how these zones are designed. It's unclear whether they're locations in separate data centers or not.

"We can assure you that all-hands are on deck to recover as quickly as possible," the company said late last night. ®

Update

Amazon has now said that a majority of volumes have indeed been restored. "These volumes were recovered by ~1:30pm PDT," the company said at 2:15pm Pacific time. "We mentioned that a 'smaller number of volumes will require a more time consuming process to recover, and we anticipate that those will take longer to recover.' We're now starting to work on those.'"

Secure remote control for conventional and virtual desktops

More from The Register

next story
Ellison: Sparc M7 is Oracle's most important silicon EVER
'Acceleration engines' key to performance, security, Larry says
Linux? Bah! Red Hat has its eye on the CLOUD – and it wants to own it
CEO says it will be 'undisputed leader' in enterprise cloud tech
Oracle SHELLSHOCKER - data titan lists unpatchables
Database kingpin lists 32 products that can't be patched (yet) as GNU fixes second vuln
Ello? ello? ello?: Facebook challenger in DDoS KNOCKOUT
Gets back up again after half an hour though
Hey, what's a STORAGE company doing working on Internet-of-Cars?
Boo - it's not a terabyte car, it's just predictive maintenance and that
prev story

Whitepapers

A strategic approach to identity relationship management
ForgeRock commissioned Forrester to evaluate companies’ IAM practices and requirements when it comes to customer-facing scenarios versus employee-facing ones.
Storage capacity and performance optimization at Mizuno USA
Mizuno USA turn to Tegile storage technology to solve both their SAN and backup issues.
High Performance for All
While HPC is not new, it has traditionally been seen as a specialist area – is it now geared up to meet more mainstream requirements?
Beginner's guide to SSL certificates
De-mystify the technology involved and give you the information you need to make the best decision when considering your online security options.
Security for virtualized datacentres
Legacy security solutions are inefficient due to the architectural differences between physical and virtual environments.