Cell supers rule the Green 500 HPC rankings
Just don't ask what the price would be for a petaflops
IBM has taken high performance computing very seriously in the past decade, and it has also been in the vanguard of energy efficiency - and boy does it show on the Green 500 list. The top 20 machines on the list are made by IBM, and they average 402.4 megaflops per watt in computing efficiency. Of the 186 machines in the Top 500 and Green 500 lists that bear the IBM label, the machines are averaging 134.5 megaflops per watt.
Across the whole Green 500 list, the average machine consumes 400.9 kilowatts of power and delivers 98.6 megaflops per watt. This is terrible by comparison, and it reflects the fact that some very heavy-hitting but long-in-the-tooth machines are still on the list. The least energy efficient machine is an Itanium machine installed at Lawrence Livermore National Lab that is rated at 19.9 teraflops, but which burns 4.9 megawatts of juice and therefore yields only a little more than 4 megaflops per watt.
Number 499 on the Green 500 list is the Earth Simulator, a massively parallel vector machine built by the Japanese government that sat at the top of the performance list for many years. While Earth Simulator was ground-breaking when it delivered 35.9 teraflops of sustained performance, those vector processors sure do eat juice. Earth Simulator consumes 3.2 megawatts and yields a paltry 11.2 megaflops per watt.
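The megaflops-per-watt figures quoted for these two laggards follow directly from the sustained performance and power numbers. Here is a minimal sketch of that arithmetic; the function name is made up for illustration, and the inputs are simply the figures quoted above.

```python
def megaflops_per_watt(teraflops: float, megawatts: float) -> float:
    """Return efficiency in megaflops per watt.

    Hypothetical helper, not part of any Green 500 tooling.
    """
    megaflops = teraflops * 1_000_000  # 1 teraflops = 1,000,000 megaflops
    watts = megawatts * 1_000_000      # 1 megawatt = 1,000,000 watts
    return megaflops / watts


# Figures as quoted in the article
itanium = megaflops_per_watt(19.9, 4.9)          # Lawrence Livermore Itanium box
earth_simulator = megaflops_per_watt(35.9, 3.2)  # Earth Simulator

print(f"Itanium machine: {itanium:.1f} megaflops per watt")
print(f"Earth Simulator: {earth_simulator:.1f} megaflops per watt")
```

The same function reproduces, to within rounding, the other efficiency figures in this piece wherever both a teraflops rating and a power draw are given.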
In terms of other vendors and their rankings on the Green 500 list, Silicon Graphics comes in with a pair of Altix ICE x64 blade clusters at numbers 21 and 22 on the list that deliver 240 and 233 megaflops per watt, respectively. The interesting thing here is that the slightly more efficient box, installed at oil giant Total, is rated at 106 teraflops and comes in at number 17 on the Top 500 list, while the only moderately less efficient box is the new "Pleiades" Altix ICE cluster installed at NASA's Ames Research Center.
That machine delivers 487 teraflops of sustained performance on the Linpack test at 233 megaflops per watt of power efficiency. While that is half the efficiency of the Cell or Opteron-Cell hybrid machines, the Altix architecture would seem to scale performance and power consumption linearly - something SGI is sure to emphasize. Moreover, with the advent of "Nehalem" Xeon boxes early next year, it is not hard to envision the Altix ICE machines being as power efficient as Cell-based boxes.
The most power efficient Cray box on the Green 500 list is the new "Franklin" XT4 massively parallel Opteron cluster running at Lawrence Berkeley National Laboratory, which is number seven on the Top 500 list at 266 teraflops. However, Franklin burns 1.15 megawatts of juice, yielding 231.6 megaflops per watt of computing efficiency.
And while Sun Microsystems is justifiably happy to have the "Ranger" Opteron-InfiniBand cluster installed at the University of Texas, the cluster, which is ranked at number six on the Top 500 list with 433.2 teraflops of sustained Linpack performance, needs 2 megawatts to run and drops to number 30 on the Green 500 list, at 216.6 megaflops per watt. Once again, with Intel's Nehalem chips, Sun should be able to get its power efficiency up there near the 500 megaflops per watt barrier, but it seems unlikely that UT is going to do a box swap.
And for all IBM's talk about the density and power efficiency of the new iDataPlex blade servers, these custom-built boxes are nothing to write home about in terms of power efficiency on HPC workloads. The two iDataPlex boxes installed at NASA's Goddard Space Flight Center are rated at 192.1 megaflops per watt using 2.5 GHz quad-core Xeon processors. Even if Nehalem doubles the bang for the watt, iDataPlex is not going to be as power efficient as Cell-based machines.
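For the curious, the Nehalem what-if is easy to rough out. The sketch below assumes a flat doubling of efficiency (the hypothetical floated above, not a measured figure) and uses the 402.4 megaflops per watt average of the top 20 machines as a stand-in for Cell-based boxes, which is also an assumption. The names are invented for illustration.

```python
# Efficiency figures in megaflops per watt, as quoted in the article
TOP_20_AVERAGE = 402.4  # assumed proxy for Cell-based machines
IDATAPLEX = 192.1       # NASA Goddard iDataPlex boxes
RANGER = 216.6          # Sun "Ranger" at the University of Texas


def projected_efficiency(current: float, gain: float = 2.0) -> float:
    """Scale today's efficiency by an assumed generational gain.

    The default gain of 2.0 is the hypothetical doubling, nothing more.
    """
    return current * gain


idataplex_nehalem = projected_efficiency(IDATAPLEX)
ranger_nehalem = projected_efficiency(RANGER)

for name, value in [("iDataPlex", idataplex_nehalem), ("Ranger", ranger_nehalem)]:
    verdict = "above" if value > TOP_20_AVERAGE else "below"
    print(f"{name}: {value:.1f} megaflops per watt, {verdict} the top 20 average")
```

Under those assumptions, a doubled iDataPlex still lands short of the top 20 average, while a doubled Ranger would clear it.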
The most power efficient box from Hewlett-Packard on the Green 500 list is a cluster of BL460c and BL2x220 blade servers (the latter puts two server nodes on a single blade) that delivers 218 megaflops per watt while burning 327 kilowatts to run. This box is located at the Joint Supercomputer Center in Russia, and is rated at 71.3 teraflops; it is number 35 on the Top 500 list and number 27 on the Green 500 list. HP is clearly banking on two-node Nehalem blades to get it back into the power efficiency game in the HPC racket in 2009. ®