The Register® — Biting the hand that feeds IT

Comments on: AMD's new Firestream chip tops 1 teraflop

At Some Point, A Network of PS3s Will Solve the Big Bang 

Posted Tuesday 17th June 2008 21:10 GMT

IT Angle

Seriously, the GPUs our there are beginning to look like super computer nodes.

Er, 9250? 

Posted Tuesday 17th June 2008 22:56 GMT

Coat

That'd make it a Radeon, right?

(Hmm. Stupid server won't accept "9250?" as a title.)

are these (some useful flavour of) IEEE754 floats, or just wide fixed-points, or what? 

Posted Tuesday 17th June 2008 23:35 GMT

Does anyone know?

1TF 

Posted Wednesday 18th June 2008 02:20 GMT

Thumb Up

Sure be handy when modelling and the polygons get out of control...

Ya, so? - Cloud Computing 

Posted Wednesday 18th June 2008 05:25 GMT

Boffin

"Seriously, the GPUs our there are beginning to look like super computer nodes."

Not surprising, as they have been for quite a long time.

To put this into perspective, the original super computers from Cray had a computational speed of 100 megaflops.

This "video card" is 10,000 times faster and it's sitting idle in the millions of computers around the world who have the hardware installed but aren't playing a 3d game at the moment.

As these chips become generally more integrated into the average desktop environment, how do you think the world will use the computational power they provide?

Sukoku?

Useful, yes 

Posted Wednesday 18th June 2008 06:06 GMT

For the FireStream 9170, it used IEEE 754 floats, but with some minor limitations on parts of the standard; i.e., the result of a division might not be the most accurate one possible.

I think the same will be true of the 9250. However, the 9170 was supposed to have shipped last month, and there seems nothing on the web about that having actually happened. The 9250 is an improvement; I do hope it will in fact be available for purchase in the third quarter of this year, as claimed.

Floats 

Posted Wednesday 18th June 2008 06:41 GMT

GPU's use IEEE754 floats.

I wonder what the double precision performance is like, specially compared to the latest nVidia GPGPU.

NVIDIA don't say whether their performance figures are double or single precision and AMD specify they are single precision only.

For the vast majority of the HPC crowd I suspect double precision performance is more important.

Cloud? 

Posted Wednesday 18th June 2008 09:07 GMT

Paris Hilton

Cloud computing would surely require us to be able to have faster connections than 512k/s for 18 hours a day...

Paris, because we could all benefit if she shared her processing power...

Single precision float is often enough 

Posted Wednesday 18th June 2008 09:28 GMT

It all depends on what you are doing - there are many applications where single precision is sufficient. Or with a bit of thought the computations can be re-arranged to avoid requiring extreme precision e.g. avoid taking small differences between two large numbers.

In the old days when FPUs were about 1/1000 (or less) of todays speed and C would only compile to double precision floating point ops I had to resort to writing a small set of C-callable assembly language single precision routines for vector arithmetic.

It was worth it, reduced run times from 4 hours to just over 1hr !

@Vendicar 

Posted Wednesday 18th June 2008 09:32 GMT

Thumb Up

"As these chips become generally more integrated into the average desktop environment, how do you think the world will use the computational power they provide?"

Stanford University's Folding at Home project, using public distributed computing to unscramble the structure of bio-molecules, has been using networks of GPU cards and PS3s for quite a while. Not surprisingly, they are now getting far more teraflops from this small group of machines than from the much larger group of CPU donors.

@Antony Riley - Reading difficulty. 

Posted Wednesday 18th June 2008 09:51 GMT

"I wonder what the double precision performance is like"

"The 9250 includes double-precision floating point hardware performing at more than 200 gigaflops."

Ahh, POVRay! 

Posted Wednesday 18th June 2008 10:22 GMT

Wouldn't it be grand if some über-hacker could great a POVRay fork that used nVidia GPUs? (Hint! Hint!)

Of course, that would require that nVidia does actually support double-precision FP. Years ago, when Intel developed the first SSE instruction set, the POVRay maintainers stated that experiments had shown that single-precision FP just couldn't produce renders worth the disk-space they took up.

Anyway, POVRay development seems a bit moribund these days. Come on guys! I demand that you spend all your free hours writing code, free of charge, so that freeloading dilettantes like myself can produce clumsy, amateurish pictures that they never actually finish!

Oh wait, maybe I understand their reluctance, now... :-(

@Stuart Van Onselen - Further reading difficulties.. 

Posted Wednesday 18th June 2008 12:38 GMT

"Of course, that would require that nVidia does actually support double-precision FP."

From the Tesla article...

"Key to the release today of the Tesla-10 Series processor is the presence of 64-bit, double-precision floating point support."

Although you may have been referring to normal graphics card?

Tesla, too, now? 

Posted Wednesday 18th June 2008 13:18 GMT

I'm glad to hear that the Tesla-10 has double-precision. Previous versions didn't, which was why I was waiting for the FireStream 9170 from AMD, which did have double-precision support.

@JonB 

Posted Wednesday 18th June 2008 13:43 GMT

Yer darn tootin' I was thinking of "normal" GPUs. I don't think my budget _quite_ stretches to a rack of high-end nVidia HPC kit (In fact, these days I need bank finance to buy a stick of chewing gum.)

Indeed, what's been getting me excited over the last few years is the thought that a consumer-grade piece of hardware can theoretically outperform a warehouse full of the old Crays. The mind boggles!

Nice for number crunchers 

Posted Wednesday 18th June 2008 14:11 GMT

Boffin

Whilst people like myself drool over such power, whither then the average desktop? Even fancier rendering for Windows Vista?

Lastly:- single-precision is useless for computational fluid dynamics; the truncation error must be small, since the overall error increases with every iteration.

@Antony 

Posted Wednesday 18th June 2008 14:28 GMT

The Tesla 10 numbers are for single precision. Only 1/8 of the 240 shaders are capable of double precision, so 30 of them, while the other 210 are single precision only. So since they went from 128 shaders in Tesla 8 to 240 in Tesla 10 at not that much higher clock rate and just about doubled performance, it has to only be for single precision. Double precision would be a lot lower, quite likely a bit lower than the ratio ATI/AMD is claiming on the 9250.

@ Stuart Van Onselen 

Posted Wednesday 18th June 2008 18:27 GMT

Sorry, but don't expect to be running POV-RAY on one of these any time soon.

GPU's don't have a general purpose instruction set capable of running the complex logic of a raytracer. They're more like DSP's, they excell at predictable repeated operations on large data sets.

@ Matthew Saroff 

Posted Wednesday 18th June 2008 20:05 GMT

Black Helicopters

What if this universe in which we live is purely a simulation running on a couple of PS3s in some other-dimensional kid's basement?

Real Power is Virtual ........ STOP PRESS. 

Posted Thursday 19th June 2008 08:14 GMT

.... Rewind to the End BEGIN Again. Start ... Run Weighty Spin Waits.

"What if this universe in which we live is purely a simulation running on a couple of PS3s in some other-dimensional kid's basement?" .... By Anonymous Coward

Posted Wednesday 18th June 2008 20:05 GMT

School them well then would be Wise, AC, for that is an Awesome Power which Cannot Permit Abuse for that Destroys IT and the Abuser.

@Gerhardt 

Posted Thursday 19th June 2008 08:19 GMT

Paris Hilton

If your error is increasing with every iteration, stop iterating. Or did you mean "with every timestep"?

Paris, because we've all seen the condition of her boundary.

@AC, deep philosophical question 

Posted Thursday 19th June 2008 23:03 GMT

"What if this universe in which we live is purely a simulation running on a couple of PS3s in some other-dimensional kid's basement?" .... By Anonymous Coward

In which case I wonder if the universe would still exist if the PS3's were turned off? The only purpose in running the simulation would be for the other-dimensional kid to see the results.

The Mandelbrot Set exists anyway, the only purpose in running WinFract is so that we may see it.

Don’t Miss

Amazon logo 75Amazon cloud heads for Asian sky

Steals MS thunder with Redmondian SDK

EMCData Domain-besotted EMC dumps Quantum

Not the girl for me, even after lending her $100m

Intel logo teaserBrace of Intel SSDs imminent

Low- or high-end as you prefer

AMD unmasks Opterons of servers future

Faces for 'Magny Cours' and 'San Marino'