Deleted cloud in second fall from sky

More off-demand computing from FlexiScale

XCalibre's FlexiScale cloud has disappeared from the heavens. Again.

In late August, an engineer with the UK-based hosting outfit accidentally deleted the company's high-profile compute cloud - which offers on-demand storage, processing, and network bandwidth a la Amazon Web Services - and now XCalibre is working to resolve a "core network failure" that has kept some customers off-line for as much as twenty-four hours.

According to XCalibre CEO Tony Lucas, the outage hit at about 5pm UK time on Wednesday, when the cloud experienced "a near simultaneous switch failure" in the switches that connect the storage to the processing nodes. "That is relatively easy to fix, though you do have to take everything down and restart it again," Lucas tells The Reg. "But because of a software limitation in a particular piece of software we use...which only allows you to do one job at a time, so when we have to restart hundreds and hundreds of servers, it takes some time."

It's no secret that FlexiScale relies on Virtual Iron, the virtualization manager based on the open-source Xen hypervisor.

Lucas says that some customers were back up and running by 9pm UK time yesterday. But others are still waiting for their bit of cloud to reappear. "Every single server was restarted by 7:45 this morning [UK time], but there is a network bug that a number of them are still having issues with. We're going through them one-by-one and we're down to a handful - somewhere in the teens."

Lucas is intent on beefing up his architecture so this sort of thing doesn't happen in the future. But that's twice in two months. At the end of August, that engineer accidentally deleted the cloud's main storage volume, and XCalibre needed several days to rebuild it.

And in the midst of the latest outage, some customers are peeved. "I am angry, very angry, so yes there's some vitriol in here, I was hoping that sleeping on it would dull that, but being that all my servers are still down it hasn't," says someone who calls himself Flish.

"I didn't have to wait very long for the next outage," says RichText. "Fortunately, I'm only testing things out at the moment. Does anyone actually use Flexiscale for anything mission-critical?"

A good question. There are drawbacks to putting your apps in the sky. In recent months, we've also seen plenty of downtime from Amazon Web Services - and Google Apps too. ®
