Feeds

Windows Azure Compute cloud goes TITSUP PLANET-WIDE

Looks like a distributed system, breaks like a single tenant

Remote control for virtualized desktops

Microsoft's Windows Azure cloud was hit by a worldwide partial compute outage today, calling into question how effectively Redmond has partitioned its service.

The problems emerged at 2.35AM UTC, and were still ongoing as of 10.20PM UTC the same day, according to the company's service dashboard.

"Manual actions to perform Swap Deployment operations on Cloud Services may error, which will then restrict Service Management functions," the company said.

Every single Azure region – a geographically distant and independent set of data centers – was affected, but for posterity that included: West US, West Europe, Southeast Asia, South Central US, North Europe, North Central US, East Asia, and East US.

"We are taking all necessary steps to mitigate this incident for the affected hosted services as soon as possible. Further updates will be published within 2 hours to keep you apprised of the situation. We apologize for any inconvenience this causes our customers," the company wrote at 10PM UTC.

Swap Deployment operations let developers initiate a virtual IP address swap between staging and production environments for services. Swap Deployment is an asynchronous operation that interacts with an Azure management service. Though not a main component of the IaaS cloud, an outage would be irritating for some heavy users, and a global outage is likely to damage confidence in Microsoft's ability to manage services at scale.

WindowsAzureFail

Dashboard dashed ... a global failure is the absolute worst thing that can happen to a cloud

Alongside a global fail to a sub-component of Compute, the Azure cloud's Website feature also reported a global problem with "FTP data access" which began at 7PM UTC, suggesting a cascading fail from some part of the problem that downed Swap Deployment.

The antithesis of cloud computing is a problem cropping up that affects all regions simultaneously, and yet this marks the second time in under a year that Microsoft has had a concurrent global fail.

Last time we had a Blue Sky of Death it was due to a lapsed security certificate which downed all worldwide Windows Azure storage services. This time a much more minor component of the cloud has gone down, but the fact it has failed globally is a severe indictment against the partitioning policies Microsoft may have put in place. ®

Remote control for virtualized desktops

More from The Register

next story
Azure TITSUP caused by INFINITE LOOP
Fat fingered geo-block kept Aussies in the dark
NASA launches new climate model at SC14
75 days of supercomputing later ...
Yahoo! blames! MONSTER! email! OUTAGE! on! CUT! CABLE! bungle!
Weekend woe for BT as telco struggles to restore service
You think the CLOUD's insecure? It's BETTER than UK.GOV's DATA CENTRES
We don't even know where some of them ARE – Maude
BOFH: WHERE did this 'fax-enabled' printer UPGRADE come from?
Don't worry about that cable, it's part of the config
Want to STUFF Facebook with blatant ADVERTISING? Fine! But you must PAY
Pony up or push off, Zuck tells social marketeers
Astro-boffins start opening universe simulation data
Got a supercomputer? Want to simulate a universe? Here you go
prev story

Whitepapers

Why and how to choose the right cloud vendor
The benefits of cloud-based storage in your processes. Eliminate onsite, disk-based backup and archiving in favor of cloud-based data protection.
A strategic approach to identity relationship management
ForgeRock commissioned Forrester to evaluate companies’ IAM practices and requirements when it comes to customer-facing scenarios versus employee-facing ones.
How to determine if cloud backup is right for your servers
Two key factors, technical feasibility and TCO economics, that backup and IT operations managers should consider when assessing cloud backup.
Reg Reader Research: SaaS based Email and Office Productivity Tools
Read this Reg reader report which provides advice and guidance for SMBs towards the use of SaaS based email and Office productivity tools.
Protecting against web application threats using SSL
SSL encryption can protect server‐to‐server communications, client devices, cloud resources, and other endpoints in order to help prevent the risk of data loss and losing customer trust.