Feeds

AMD plots single thread boost with x86 extensions

Bulldozer injection

Build a business case: developing custom apps

Still intent on improving single thread software performance, AMD has outlined some planned additions to the x86 instruction set that will appear in chips shipping in 2009.

The SSE5 extensions should make life easier on software developers and lead to rather dramatic performance gains. In particular, AMD expects the tweaks to boost code used in the high performance computing, multimedia and security arenas. Customers will see the extensions in AMD's new "Bulldozer" core-based chips that arrive in 2009, first for PC chips and then for server chips.

AMD and Intel have turned to adding more cores per chip to improve their products' performance rather than amping up GHz as in the past. This shift, caused by heat issues, means that developers need to write more complex multi-threaded software than can spread well across all of the cores. The software industry, however, is moving relatively slowly with these efforts, leaving tons of single-threaded code that could use a performance aid one way or another.

According to AMD, the new extensions will bring a couple of major breakthroughs.

For one, AMD will follow the RISC crowd with support for 3-Operand Instructions - up from two. So, unlike in the past where you would do A plus B and then have to store the result of the operation in A or B, developers can now store the result in a third location. This should reduce the total number of instructions needed to perform certain tasks and require less effort on the part of developers to keep track of registers.

The support for 3-Operand Instructions allows AMD to roll out a "fused multiply accumulate" instruction as well. This melds multiplication and addition to permit "iterative calculations with one instruction."

"It is basically taking two consecutive operations that occur very often in sequence and just making them a single operation in the instruction set," said Michael Frank, an AMD fellow.

With the extensions, AMD has seen up to a 5x performance gain in AES (Advanced Encryption Standard) encryption and a 30 per cent boost for DCT (discrete cosine transform), a mathematical operation used with audio and video codecs.

AMD has released the specification for the new instructions here.

Boost IT visibility and business value

More from The Register

next story
The Return of BSOD: Does ANYONE trust Microsoft patches?
Sysadmins, you're either fighting fires or seen as incompetents now
Microsoft: Azure isn't ready for biz-critical apps … yet
Microsoft will move its own IT to the cloud to avoid $200m server bill
Shoot-em-up: Sony Online Entertainment hit by 'large scale DDoS attack'
Games disrupted as firm struggles to control network
Cutting cancer rates: Data, models and a happy ending?
How surgery might be making cancer prognoses worse
Silicon Valley jolted by magnitude 6.1 quake – its biggest in 25 years
Did the earth move for you at VMworld – oh, OK. It just did. A lot
VMware's high-wire balancing act: EVO might drag us ALL down
Get it right, EMC, or there'll be STORAGE CIVIL WAR. Mark my words
prev story

Whitepapers

Implementing global e-invoicing with guaranteed legal certainty
Explaining the role local tax compliance plays in successful supply chain management and e-business and how leading global brands are addressing this.
Endpoint data privacy in the cloud is easier than you think
Innovations in encryption and storage resolve issues of data privacy and key requirements for companies to look for in a solution.
Scale data protection with your virtual environment
To scale at the rate of virtualization growth, data protection solutions need to adopt new capabilities and simplify current features.
Boost IT visibility and business value
How building a great service catalog relieves pressure points and demonstrates the value of IT service management.
High Performance for All
While HPC is not new, it has traditionally been seen as a specialist area – is it now geared up to meet more mainstream requirements?