Feeds

ClearSpeed plots 1 TeraFlop floating point pizza box

1U rocket ship

Internet Security Threat Report 2014

IDF Floating point whiz ClearSpeed continues to round out its software play ahead of several major product upgrades.

ClearSpeed ships floating point accelerator cards that slot into x86-based systems. The hardware delivers a major performance boost while only consuming around 30 watts of power. Customers in the high performance computing market - organizations such as national labs and oil and gas firms - have shown the most interest in ClearSpeed's kit to date.

ClearSpeed has tried to make it as easy as possible to write software for its chips. Admittedly, however, pushing multi-threaded software onto a unique architecture comes with some challenges.

To that end, ClearSpeed announced this week at the Intel Developer Forum that its CSXL software library will now "provide plug and play acceleration for the most commonly used 64bit level 3 BLAS and LAPACK functions that underpin the foundations of the vast majority of scientific and engineering applications." This builds on ClearSpeed's work to boost application performance while using standard libraries and without requiring changes to the underlying code.

ClearSpeed has also opened a new software developer community, which has its web presence here. The company hopes the site will serve as a meeting point for developers working on HPC applications.

Lastly, ClearSpeed talked up a research program conducted in conjunction with Intel around crafting software for servers that have both standard x86 chips and accelerators.

ClearWare

While the software bits and pieces are crucial, it's hardware that usually gets Reg readers' mouths watering. Along those lines, ClearSpeed again teamed with Intel at IDF to show off a 16U cluster that notched more than one TeraFlop of performance on the Linpack benchmark. The cluster relied on ClearSpeed's Advance e620 hardware, which can provide up to 80 GigaFlops of peak double precision floating point performance and more than one GigaFlop per watt of sustained Linpack performance.

"The entire cluster had a maximum power consumption of less than 7KW and completed the benchmark in just 14 minutes, half the time required by the non-accelerated system," ClearSpeed said. "The energy used to achieve this TeraFLOP performance was approximately 1.5KWh, costing a mere 15 cents assuming a cost of 10 cents per KWh. With the latest quad core Intel processors, the same performance and energy profile could be compressed into just 10 rack units and cost less than $150,000."

But really, a single TeraFlop in 10U is for the weak.

So, ClearSpeed is expected to show a demonstration unit at the November Supercomputing conference in which it delivers one TeraFlop of acceleration via a single 1U appliance-like system. Based on our chat with ClearSpeed CEO Tom Beese, we think this leap forward will come via much smaller accelerator modules that can fit into things such as blade servers.

Intel spent much of IDF bragging about the floating point performance that we'll see with its "Nehalem" processors in 2008. According to Beese, these general purpose Xeon chips will likely show very competitive results versus ClearSpeed's own product.

"We are very clear that we're not meant to be seen as an alternative to a CPU," Beese said. "We are always meant to be complementary to a CPU. In those terms, it's important to remember that our performance per watt will always be much more efficient than a CPU."

So, basically, there's only so much floating point performance you can squeeze out of even a super-charged Xeon server. Anyone that needs more juice in the same amount of space will have to pick up an accelerator like that from ClearSpeed. This is a crucial proposition for companies or labs that can no longer afford to build out their data center space and for those with little extra power to spare.

Beese said that ClearSpeed will likely roll out a major revision of its products within the next year. That hardware should again help the company leap way past even Intel's speediest chips in terms of floating performance.

ClearSpeed has been at this acceleration game for a long time and now seems to benefit from the broad market interest in co-processors. We're seeing a lot of work being done with FPGAs, graphics chips and tweaked multi-core CPUs aimed to handle very specific software loads, usually in the HPC and media markets.

If nothing else, ClearSpeed can claim a nice lead in this space from a software standpoint, as many of the other accelerator makers struggle to teach coders the ways of their gear. ®

Beginner's guide to SSL certificates

More from The Register

next story
Docker's app containers are coming to Windows Server, says Microsoft
MS chases app deployment speeds already enjoyed by Linux devs
'Hmm, why CAN'T I run a water pipe through that rack of media servers?'
Leaving Las Vegas for Armenia kludging and Dubai dune bashing
'Urika': Cray unveils new 1,500-core big data crunching monster
6TB of DRAM, 38TB of SSD flash and 120TB of disk storage
Facebook slurps 'paste sites' for STOLEN passwords, sprinkles on hash and salt
Zuck's ad empire DOESN'T see details in plain text. Phew!
SDI wars: WTF is software defined infrastructure?
This time we play for ALL the marbles
Windows 10: Forget Cloudobile, put Security and Privacy First
But - dammit - It would be insane to say 'don't collect, because NSA'
Oracle hires former SAP exec for cloudy push
'We know Larry said cloud was gibberish, and insane, and idiotic, but...'
Symantec backs out of Backup Exec: Plans to can appliance in Jan
Will still provide support to existing customers
prev story

Whitepapers

Forging a new future with identity relationship management
Learn about ForgeRock's next generation IRM platform and how it is designed to empower CEOS's and enterprises to engage with consumers.
Why cloud backup?
Combining the latest advancements in disk-based backup with secure, integrated, cloud technologies offer organizations fast and assured recovery of their critical enterprise data.
Win a year’s supply of chocolate
There is no techie angle to this competition so we're not going to pretend there is, but everyone loves chocolate so who cares.
High Performance for All
While HPC is not new, it has traditionally been seen as a specialist area – is it now geared up to meet more mainstream requirements?
Intelligent flash storage arrays
Tegile Intelligent Storage Arrays with IntelliFlash helps IT boost storage utilization and effciency while delivering unmatched storage savings and performance.