Rotten routers caused Intermedia service crash, says CEO

Wanna qualify for SLA credits, got 72 hours

The service blackout at third-party Microsoft Exchange hosting biz Intermedia was caused by glitches in core routing kit, the CEO has confirmed.

As revealed by El Reg on Tuesday, Intermedia went down in the early afternoon, UK time, and remained down for some hours, with customers forced to vent their spleen on Twitter because they could not contact the company.

Phil Koen, CEO at Intermedia, sent us some comments - which were posted on his blog - saying that normal service delivery has resumed, "there has been no data loss and there were no security breaches".

He revealed that on 28 August the company encountered an "issue with our core routers" and "implemented a fix" that rectified the fault a day later, or so Intermedia thought.

But by yesterday morning, "further anomalies" were observed, said Koen:

"Attempts at intervention were unsuccessful, and corruption in our core routing table progressed to the edge routers in all our US data centres".

"This created significant packet loss between the edge and core in each data centre and it prevented delivery of service to our customers. In addition, as our communication systems reside in the same data centres, our ability to communicate with customers and partners was disrupted".

Customers on both sides of the pond - folk in the UK claimed to be affected by the outage too - were united in their anger over the lack of communication from Intermedia, and leapt onto Twitter to complain.

Koen said the network and services were brought online at 3.30pm Eastern Standard Time (EST) and the final server-related issues were sorted out by 6.30 EST.

Intermedia will now complete the Reason For Outrage report, take lessons from this to "improve stability and resilience" and increase the "responsiveness and robustness of customer notification tools and systems", said the CEO.

"Although we were successful in notifying many of our customers about the issues via alternate email addresses, text messages and HostPilot, not all customers were reached," he added.

Customers have 72 hours to log a call to qualify for SLA credits, though as our readers pointed out yesterday, this does not cover any potential financial losses caused by a dip in productivity or lost business. ®
