Feeds

HDS drive array failure suspected in bank giant's ATM outage

Red faces all round after service goes titsup

Boost IT visibility and business value

Did a Hitachi Data Systems USP-V array controller failure cause the Barclay's ATM outage yesterday?

Yesterday, to its great embarrassment, Barclays' ATM network in the south of England crashed at 1pm, together with a lot of its online banking facilities. Functions were not restored until 4.30pm or later, and thousands of people were caused great inconvenience through not being able to get cash or manage their bank accounts online.

Barclays said it was due to a hardware failure at its data centre in Gloucester, which serves its ATM network south of the Wash. Various reports on the BBC, The Sun, The Mail and elsewhere said that a hardware component of a drive array had failed and that engineers were replacing cards.

What drive array was this? One that was involved in storing data relevant to cash machine operations and online banking? Also, given that the Gloucester data centre has a history of computing system failures (see here, here, and here) why wasn't there an adequate fallback mechanism in place?

We know that, in 2008, Barclays ordered a large, high-end USP-V storage array from Hitachi Data Systems, as part of a 4-year storage-on-demand contract for its Gloucester data centre. It was to provide storage for mainframe and Unix systems. The capacity would rise to 1PB and would start coming online in February this year. There was a separate mid-range AMS storage array supplied by HDS which provided file-based storage for Windows servers through a NetApp NAS head.

Apparently HDS had a similar USP-V contract in a separate part of the data centre.

Under the new contract there was a penalty clause for downtime with the penalty increasing as the downtime increases.

Some of the accounts of the Gloucester data centre's history of ATM crashes show that the mainframe system is involved in ATM operations and this indicates that the USP-V system could be the failed drive array in yesterday's outage.

This was confirmed by a source familiar with the situation from another IT supplier, who also said that HP/EDS have the maintenance contract for the affected system.

HDS recently announced failover clustering facilities for the USP-V. If a USP-V controller in a cluster fails then operations are automatically picked up by a second USP-V controller. Without such a High Availability Manager arrangement, a failed USP-V controller can cause the storage array behind it to be inaccessible until the controller is repaired.

Bastiaan van Amstel, the senior EMEA PR manager for HDS, said, regarding the outage: "A lot of due diligence is happening at the moment and, before it is completed nothing can be said." He added: "Many vendors are involved in the IT at Barclays." ®

The essential guide to IT transformation

More from The Register

next story
The Return of BSOD: Does ANYONE trust Microsoft patches?
Sysadmins, you're either fighting fires or seen as incompetents now
Microsoft: Azure isn't ready for biz-critical apps … yet
Microsoft will move its own IT to the cloud to avoid $200m server bill
Oracle reveals 32-core, 10 BEEELLION-transistor SPARC M7
New chip scales to 1024 cores, 8192 threads 64 TB RAM, at speeds over 3.6GHz
Docker kicks KVM's butt in IBM tests
Big Blue finds containers are speedy, but may not have much room to improve
US regulators OK sale of IBM's x86 server biz to Lenovo
Now all that remains is for gov't offices to ban the boxes
Gartner's Special Report: Should you believe the hype?
Enough hot air to carry a balloon to the Moon
Flash could be CHEAPER than SAS DISK? Come off it, NetApp
Stats analysis reckons we'll hit that point in just three years
Dell The Man shrieks: 'We've got a Bitcoin order, we've got a Bitcoin order'
$50k of PowerEdge servers? That'll be 85 coins in digi-dosh
prev story

Whitepapers

5 things you didn’t know about cloud backup
IT departments are embracing cloud backup, but there’s a lot you need to know before choosing a service provider. Learn all the critical things you need to know.
Implementing global e-invoicing with guaranteed legal certainty
Explaining the role local tax compliance plays in successful supply chain management and e-business and how leading global brands are addressing this.
Build a business case: developing custom apps
Learn how to maximize the value of custom applications by accelerating and simplifying their development.
Rethinking backup and recovery in the modern data center
Combining intelligence, operational analytics, and automation to enable efficient, data-driven IT organizations using the HP ABR approach.
Next gen security for virtualised datacentres
Legacy security solutions are inefficient due to the architectural differences between physical and virtual environments.