Swiss boffins slap together homegrown zBox4 supercomputer
Overnighter yields 54 teraflops with flashy storage
Magic Quadrant for Enterprise Backup/Recovery
Students and a few wandering professors at the Institute for Theoretical Physics at the University of Zurich had a perfect weekend when they scraped together the funds to buy the parts for their zBox4 homegrown supercomputer and assembled the two-rack beast.
This is the fourth generation of homegrown machines that the ITP students have put together, and it represents a serious upgrade over the zBox3 system it replaces, including a new "platter" design where the motherboards and storage are mounted as well as new custom shelving that looks a tad bit like baker's shelves such as those made famous by Google in its early data centers.
Like many supercomputer upgrades, the zBox4 upgrade was stalled a bit by the later-than-expected delivery by Intel of the Xeon E5 processors, which were expected in late 2011 but which came out in the spring of 2012. The specs of the machine are detailed here, and by El Reg's calculations, the 192-node machine with 3,072 cores should deliver around 54 teraflops, considerably more than the 576-core zBox3 machine based on Core2 processors and with only 1.3TB of aggregate main memory.
The interesting bit about the zBox4 machine is what the boffins opted for in terms of hardware components. To start with, the system has 192 of Super Micro's X9DRT-IBQF motherboards, which have on-board QDR InfiniBand ports.
They chose the eight-core E5-2660 processor to slap into these boards, which is the fastest 95 watt part that Intel sells with all eight cores fired up and which delivers 140.8 gigaflops of peak gigaflops at double precision. Each node was configured with 64GB of main memory, for the expected 4GB per core that is the average out there in HPC Land.
Each node also has a 128GB Vertex 4 flash-based SSD, which is the skinniest of the OCZ units in the Vertex 4 family (and therefore the least expensive). That yields 12.3TB of main memory and 24.6TB of flash storage across the cluster.
The nodes are lashed together with QLogic QDR InfiniBand switches from Intel in a 2:1 fat tree configuration with three core switches and nine leaf switches. There are Gigabit Ethernet switches that link to ports on the nodes for management.
All of the nodes run Scientific Linux 6.3, the clone of Red Hat Enterprise Linux 6.3 with math libraries and other tunings for HPC workloads. The Swiss boffins are using Slurm as their queuing system.
The whole shebang will burn around 44 kilowatts under full load, and costs under $750,000, or about $13,888 per teraflops. The entire machine was built in under 24 hours.
The zBox4 system will link into an existing homegrown Lustre file system with 684TB of capacity that has support for 10 Gigabit Ethernet or QDR InfiniBand links into the storage and that takes up 48 racks of space with its 342 1.5TB disks and 171 2TB disks. ®
COMMENTS
That wasn't quite 24h..
They're Swiss, which means that (a) someone planned this well beforehand and (b) there is probably some custom engineering in there which will have taken a bit more time to prepare (the alu sheets they use as mounting surface are a good example of that).
Having said that, the ETH does some awesome stuff, so I'll probably hop in in a few days and have a look at this thing in the umm, flesh metal. I'm interested in how they keep this thing cool..
Did anyone see ..
.. *any* sign of anti-static measures taken?
Just curious..
Homegrown?
It looks to me like they went to the shops with 3/4 million dollars and bought some high end off the shelf components. Nice kit, nice machine and all that but where is the home grown part?

IT infrastructure monitoring strategies
Requirements Checklist for Choosing a Cloud Backup and Recovery Service Provider
Data control in the cloud
Cloud based data management
Enabling efficient data center monitoring