Feeds

'We don't use UPS. If we did we'd have huge UPSs and tiny computers'

When the heat is on, the last thing a supercomputer needs is a big battery

Choosing a cloud hosting partner with confidence

The heatwave-driven outage at the VLSCI supercomputing facility last week could have been worse than it was, with power cuts also a risk, the facility has confirmed.

A senior systems administrator at VLSCI, Chris Samuel, has discussed the outage and the lessons learned with The Register.

While the reason for the shutdown was heat, Samuel said there were also concerns that the heatwave might lead to a power cut. Melburnians were warned last week that as the heatwave dragged on (and air-conditioners laboured to cope) that there might be cuts.

There were some cuts, but they didn't affect the VLSCI, which is a good thing, because there isn't a backup. As he told us, power cuts are always a concern: “we don't use UPS for the computer systems – we would end up with huge UPSs and tiny computer systems.”

“That said, we've always been very lucky with power around this area … it might be because of our proximity to [Melbourne] hospitals.”

As we wrote yesterday, the incoming water temperature ended up exceeding the specification for the facility. The cooling is a closed system (thanks also to the commenter who also noticed this).

The VLSCI setup, Samuel explained has one coolant loop from the roof into a buffer tank. From there, the water is fed to CDUs – coolant distribution units – where they dump the heat from the machines. Inside the machine rooms, there are three closed loops: one each for two Blue Gene/Q racks, and a third for the water cooled rear rack doors for the other machines.

The water is then circulated to the chillers on the roof, “and the cycle begins again”, he said. In the extreme heat, the roof temperatures meant that the chillers were delivering water that Samuel explained “was getting close to the threshold for the racks, and was still climbing.”

Avoca was the most affected system, simply because it's so much more powerful than the Merri or Barcoo machines: “Even though it's far more power efficient than the Intel systems, its combined heat generating capability is huge – it dumps far more heat into the water than both the Intel systems combined.” ®

New hybrid storage solutions

Whitepapers

Providing a secure and efficient Helpdesk
A single remote control platform for user support is be key to providing an efficient helpdesk. Retain full control over the way in which screen and keystroke data is transmitted.
WIN a very cool portable ZX Spectrum
Win a one-off portable Spectrum built by legendary hardware hacker Ben Heck
Storage capacity and performance optimization at Mizuno USA
Mizuno USA turn to Tegile storage technology to solve both their SAN and backup issues.
High Performance for All
While HPC is not new, it has traditionally been seen as a specialist area – is it now geared up to meet more mainstream requirements?
Security and trust: The backbone of doing business over the internet
Explores the current state of website security and the contributions Symantec is making to help organizations protect critical data and build trust with customers.