Fujitsu peddles three-way hybrid super to Nagoya University

Mash up to push up above 3.66 petaflops – eventually

Nagoya University is the latest academic institute in Japan to take a slice of the design behind the monstrous 10.51 petaflops K supercomputer and put it on its campus to run applications.

And in an interesting twist, the new machine is a hybrid Sparc64-Xeon-Xeon Phi cluster that will eventually push up into the multi-petaflops performance level.

Remind me what a K supercomputer is

The number-crunching power of the K super is not what makes it monstrous, at least not compared to hybrid supers marrying GPU or x86 coprocessors to CPUs to goose their performance.

The K machine was built by Fujitsu for the Japanese government and is housed at the Rikagaku Kenkyusho (Riken) research lab in Kobe, Japan. The machine, which was ranked at the pinnacle of the Top500 supercomputer rankings for a short time, uses Fujitsu's eight-core, 2GHz Sparc64-VIIIfx processors and the Tofu 6D mesh/torus interconnect to link them together. The K super has 22,032 blade servers, and this, not its aggregate floating point performance, is what makes it a monster.

What is the university building?

The new Nagoya machine will use a PrimeHPC FX10, which is an upgraded, K-compatible box built on sixteen-core Sparc64-IXfx processors. The FX10 scales up to 1,024 racks and a maximum of 23 petaflops; it has not yet been upgraded to use the sixteen-core "Athena" Sparc64-X processor, which is used in Fujitsu's Sparc M series commercial server line.

These Athena machines were launched in January in Japan, and in April Oracle also decided to resell the boxes to its commercial customers. Oracle has shown little interest in the traditional HPC market and does not resell the PrimeHPC FX10 machines or clusters based on Fujitsu's Primergy Xeon boxes, either.

This is good for Fujitsu, which needs to recoup substantial investments it has made in Sparc64 processors and the Tofu interconnect.

The wonder, of course, is why Fujitsu has not created Xeon blade servers that can plug into the Tofu interconnect. This could be particularly useful for certain kinds of workloads that run well on a torus. It would be very interesting to see Fujitsu plug Tofu into PCI-Express 3.0 ports and thus break it free from a tight link with the Sparc64 processors.

Cray broke free of the Opteron's HyperTransport point-to-point interconnect with the "Aries" Dragonfly interconnect, giving it better options to lash together CPUs, FPGAs, GPU coprocessors like Nvidia's Tesla cards, and x86 coprocessors like Intel's Xeon Phi cards.

What's in the big red box?

The initial Nagoya hybrid supercomputer being built by Fujitsu will have 384 PrimeHPC FX10 server nodes, which works out to 6,144 cores and around 90 teraflops of floating point crunching. This FX10 box is linked to a cluster of Primergy CX400 tray servers that feature two-socket Xeon E5 nodes – a total of 552 of them.
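For readers who want to check the sums, here is a minimal sketch of the FX10 arithmetic. It assumes one sixteen-core socket per FX10 node, which the figures above imply but do not state, and it treats "around 90 teraflops" as exactly 90, so the per-node number is only a rough guide. The function name is made up for illustration.

```python
# Figures quoted in the article.
FX10_NODES = 384
FX10_PEAK_TFLOPS = 90.0  # "around 90 teraflops", so approximate

# Assumption (not stated in the article): one sixteen-core socket per node.
CORES_PER_NODE = 16


def fx10_totals(nodes=FX10_NODES, cores_per_node=CORES_PER_NODE,
                peak_tflops=FX10_PEAK_TFLOPS):
    """Return (total cores, implied peak gigaflops per node)."""
    total_cores = nodes * cores_per_node
    gflops_per_node = peak_tflops * 1000 / nodes
    return total_cores, gflops_per_node


cores, gflops_per_node = fx10_totals()
print(f"{cores} cores, roughly {gflops_per_node:.0f} gigaflops per node")
```

The core count comes out at the 6,144 quoted above; the per-node figure is only as precise as the rounded 90 teraflops it is derived from.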

The CX400 setup allows for up to 84 nodes to be crammed into a rack. And pushing the hybrid nature up another notch, 184 of the CX400 nodes have an Intel Xeon Phi coprocessor, each delivering about a teraflops of double-precision math.

All told, this initial hybrid machine is rated at 561.4 teraflops of aggregate peak theoretical performance across its three computing units. And, incidentally, some of the Xeon nodes are running ScaleMP's vSMP aggregation hypervisor, turning them into a virtual SMP box for running large, shared-memory applications.
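The 561.4 teraflops total can be split roughly across the three computing units. The sketch below is back-of-envelope only: it takes the FX10 partition as exactly 90 teraflops and each Xeon Phi card as exactly 1 teraflops, both of which are rounded figures, and assigns whatever is left to the Xeon E5 nodes. Fujitsu has not published this breakdown here, and the function name is illustrative.

```python
# Figures quoted in the article; the FX10 and Phi numbers are rounded.
TOTAL_PEAK_TFLOPS = 561.4
FX10_PEAK_TFLOPS = 90.0   # "around 90 teraflops"
PHI_CARDS = 184
PHI_TFLOPS_EACH = 1.0     # "about a teraflops"
XEON_NODES = 552


def implied_breakdown(total=TOTAL_PEAK_TFLOPS, fx10=FX10_PEAK_TFLOPS,
                      phi_cards=PHI_CARDS, phi_each=PHI_TFLOPS_EACH,
                      xeon_nodes=XEON_NODES):
    """Return (Phi teraflops, residual Xeon teraflops, Xeon teraflops per node)."""
    phi_total = phi_cards * phi_each
    xeon_total = total - fx10 - phi_total
    return phi_total, xeon_total, xeon_total / xeon_nodes


phi_total, xeon_total, xeon_per_node = implied_breakdown()
print(f"Phi: {phi_total:.1f} TF, Xeon: {xeon_total:.1f} TF "
      f"({xeon_per_node:.2f} TF per two-socket node)")
```

On these assumptions the Xeon E5 nodes would account for a bit over half the total, at around half a teraflops per two-socket node.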

This is not a particularly powerful machine by modern standards, but Nagoya says it plans to upgrade the hybrid machine to 3.66 petaflops of compute capacity.

The university did not say how it would accomplish this, but putting Xeon Phi coprocessors on all of the Xeon nodes would only add another 368 teraflops, so that isn't it.
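The shortfall is easy to quantify. The sketch below assumes every Xeon node that lacks a coprocessor gets exactly one Xeon Phi card rated at 1 teraflops (the article says "about" a teraflops) and that nothing else in the machine changes. The function name is hypothetical.

```python
# Figures quoted in the article.
INITIAL_PEAK_TFLOPS = 561.4
TARGET_PEAK_TFLOPS = 3660.0  # 3.66 petaflops
XEON_NODES = 552
PHI_NODES = 184
PHI_TFLOPS_EACH = 1.0        # "about a teraflops", so approximate


def phi_upgrade_gap(initial=INITIAL_PEAK_TFLOPS, target=TARGET_PEAK_TFLOPS,
                    xeon_nodes=XEON_NODES, phi_nodes=PHI_NODES,
                    phi_each=PHI_TFLOPS_EACH):
    """Return (nodes without a Phi, teraflops added, teraflops still missing)."""
    bare_nodes = xeon_nodes - phi_nodes
    added = bare_nodes * phi_each
    shortfall = target - (initial + added)
    return bare_nodes, added, shortfall


bare_nodes, added, shortfall = phi_upgrade_gap()
print(f"{bare_nodes} nodes gain a card, adding {added:.0f} TF; "
      f"{shortfall:.1f} TF still to find")
```

Even with a card in every Xeon node, the machine would sit below one petaflops, leaving well over 2.7 petaflops to come from somewhere else.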

Nagoya has installed several parallel machines based on Fujitsu M9000 big iron boxes and its FX1 single-socket servers as well as Opteron-based clusters in the past, and it stands to reason that the future system it installs will continue to be hybrid, spreading work across Sparc, Xeon, and Xeon Phi nodes.

In effect, the cheap Xeon Phi FLOPS mean Nagoya can afford to indulge in relatively expensive Sparc64 FLOPS and not have to port some of its older applications. But then again, the rest of the box could be built with mostly Xeon and Xeon Phi chips with really large Sparc jobs being pushed out to the K-compatible super. This is probably the most cost-effective tactic provided there is spare capacity on the K machine.

The Nagoya hybrid super will use Fujitsu's Technical Computing Suite, a set of compilers and cluster libraries, to run applications. It will also put the Fujitsu Exabyte File System (FEFS), a variant of the Lustre parallel file system announced in November 2011, on a 6PB storage cluster to feed data to the computing beast. ®
