Original URL: http://www.theregister.co.uk/2012/10/07/adapteva_epiphany_parallel_computing_kickstarter/

This supercomputing board can be yours for $99. Here's how

Adapteva's parallel dash for community cash

By Timothy Prickett Morgan

Posted in HPC, 7th October 2012 22:40 GMT

Feature Adapteva, an upstart RISC processor and co-processor designer that has tried to break into the big-time with its Epiphany chips for the past several years, is sick and tired of the old way of trying to get design wins to fund its future development.

So it has started up a community project called Parallella that seeks to get users to pay for development directly through crowdfunding via Kickstarter.

"We're going to build a community around parallel computing," Andreas Olofsson, CEO and co-founder of Adapteva, tells El Reg. "It will be kind of like the Raspberry Pi, but with real performance."

He is quick to add that he has nothing against Raspberry Pi, but rather that a hybrid architecture, marrying ARM processors and Epiphany massively parallel RISC coprocessors, is the way to go.

A community of enthusiasts monkeying around with hardware might be able to sustain the development of current and future Epiphany chips if the Kickstarter plan pans out. The initial target price to get a board is $99 compared to the $35 for a Raspberry Pi board, which is a little high but not unreasonably so.

Vibrant communities sprang up around Beagle boards and Arduino kits, too, so there is some precedence for this. Perhaps more significantly, the Parallella community approach is as reasonable and sensible as begging for money from venture capitalists and trying to go up against Intel with its Xeon and Xeon Phi coprocessors or AMD with its FirePro GPU coprocessors or Nvidia with its Tesla GPU coprocessors. And it is less demeaning, too. Provided it actually raises the necessary funds.

Risc-y business

Adapteva was founded in February 2008, and as Olofsson, which designed digital signal processors at Analog Devices for a decade, jokingly explained to El Reg, "I got the RISC memo from 1980 and I paid attention."

The idea with RISC was to have chips with relatively simple instructions and to do complex things by combining operations in quick succession; the theory was that a simple RISC chip could get more work done than a CISC processor, and there are not enough pixels in the world to settle that argument in this story.

Suffice it to say that Olofsson and his compatriots at Adapteva – Roman Trogan, director of hardware development, Oleg Raikhman, director of software development, and Yaniv Sapir, director of application development, who all hail from Analog Devices and worked on the TigerSHARC DSPs – believe that for parallel computing to take off, devices have to be simple, cheap, and accessible. And so they designed the Epiphany line of processors to be used as coprocessors as well to make them more accessible and therefore useful.

Adapteva's Epiphany-IV chip

Adapteva's Epiphany-IV chip

But the problem that Adapteva is chasing is much larger than providing cheap parallel computing for hobbyists. The company wants to be at the forefront of exascale computing, and to do so by providing the cheapest and most energy-efficient floating point operations on the planet.

"We have been out there for four years now, and we see that the pickup for parallel processing is too slow," says Olofsson. "There are too many gatekeepers, and too many people can't afford the $10,000 startup fee for a reference board to run tests and do development."

The Parallella Kickstarter funding program is about changing that, with users being given an older generation Epiphany board if they help fund the development of the future ones.

Before we get into that program, we need to talk about the Epiphany chips. They have their own instruction set, although Olofsson says he was inspired by MIPS and ARM RISC processors as well as the DSPs that he and his co-founders know so well. And like the massively multicore processors from Tilera, the idea behind the Adapteva chips is to take hoards of very modest RISC processors and lash them together with an on-chip interconnect.

Olofsson took this approach not because he loves minimalist core designs out of a three-decade old textbook, but because this is the only approach that will fit into the thermal envelope that will limit exascale-class systems.

Adapteva sees the parallel challenge

Adapteva sees the parallel computing challenge as its opportunity

Like most people in the processor and coprocessor chip rackets, Adapteva thinks the future of computing is both parallel and heterogeneous. T be even more specific, the company believes that you need a clean slate approach on the coprocessors because this is the only way to get the coprocessors, which will do most of the heavy lifting on compute, to be much more efficient than the usual suspect processors we are used to on our desktops, inside out handhelds, and in our data centers.

The Epiphany core has a mere 35 instructions – yup, that is RISC alright – and the current Epiphany-IV has a dual-issue core with 64 registers and delivers 50 gigaflops per watt. It has one arithmetic logic unit (ALU) and one floating point unit and a 32KB static RAM on the other side of those registers.

Each core also has a router that has four ports that can be extended out to a 64x64 array of cores for a total of 4,096 cores. The currently shipping Epiphany-III chip is implemented in 65 nanometer processors and sports 16 cores, and the Epiphany-IV is implemented in 28 nanometer processes and offers 64 cores.

Block diagram of the Epiphany chip

Block diagram of the Epiphany chip

The secret sauce in the Epiphany design is the memory architecture, which allows any core to access the SRAM of any other core on the die. This SRAM is mapped as a single address space across the cores, greatly simplifying memory management. Each core has a direct memory access (DMA) unit that can prefetch data from external flash memory.

The initial design didn't even have main memory or external peripherals, if you can believe it, and used an LVDS I/O port with 8GB/sec of bandwidth to move data on and off the chip from processors. The 32-bit address space is broken into 4,096 1MB chunks, one potentially for each core that could in theory be crammed onto a single die if process shrinking continues.

Scaling up to exascale

The Epiphany design is meant to scale to around 1GHz with 64 cores on a die and deliver performance at around 25 milliwatts per core. This Epiphany-IV chip, which was previewed last October, is currently running at 800MHz and delivers that 50 gigaflops per watt.

"It has the best energy efficiency in the world," brags Olofsson, "even better than the GPUs." Chips came back from the fab – Not Taiwan Semiconductor Manufacturing Corp, but rather the AMD foundry spinout, GlobalFoundries – in July of this year and the processors "are yielding well."

How the Epiphany chip stacks up to FPGAs, CPUs, GPUs, and DSPs

How the Epiphany chip stacks up to FPGAs, CPUs, GPUs, and DSPs

The Epiphany chip runs plain old ANSI C and C++, adheres to IEEE floating point math, and uses OpenCL interfaces to offload parallel processing from a CPU. Nothing crazy or special. When you marry it to an x86 or ARM core, it looks like a math coprocessor. A very tiny one, at only 10 square millimeters without its package on.

Or roughly 1/30th the size of a GPU at 1/30th the floating point performance but running at 1/70th the power of the GPU (as you can see from the table above). The benefit, says Olofsson, is that you can get the programming efficiency of a DSP or CPU with better power efficiency than is provided by a GPU coprocessor.

And that is why Adapteva, which Olofsson funded out of his pocket for the first two years and then in the past two years with $2.35m in debt and Series A equity funding, thinks it can be a player in the race to exascale computing.

Adapteva's ambitious goals with the Epiphany chip

Adapteva's ambitious goals with the Epiphany chip

Looking ahead, Adapteva has set its sights on two different processors for the exascale timeframe in 2018. One is an entry coprocessor with 1,000 cores on a die that delivers 2 teraflops of performance in a 2 watt thermal envelope, and the other is a massive chip with 64,000 cores with 1MB of SRAM per core that can deliver 100 teraflops of floating point coprocessing at 100 watts. Both chips will deliver 1 teraflops per watt using 7 nanometer wafer baking processes, if all goes well.

Which is where we come into the picture with the Parallella community effort.

Adapteva wants to raise cash to fund development as well as to seed a community of developers for its coprocessors that will in turn feed back inputs into the hardware designs. It isn't quite open source hardware, but it is about as close as you can get.

A Feast for Epiphany

The Kickstarter program that Adapteva has started, which you can participate in here, took a bit longer to get out the door than expected because Kickstarter changed its hardware project rules just before Adapteva was prepped to launch. But it is here, and you can kick in dough for the Epiphany cause - and if you kick in enough dough, you get gear to tool around with.

The first level of support is a Parallella supporter level, where you donate $15 or more, and the funds go directly to development of future Epiphany chips. At the Parallella maker level, you fork over $99 or more and you will get a Parallella board with an Epiphany-III board, which includes a dual-core ARM Cortex-A9 processor, and a software stack to help you code up software for it.

At the Parallella pair level, which has a minimum pledge of $199, you get two of the Parallella boards so you can do compression/decompression, encoding/decoding, transmission/receiving applications without having to borrow someone else's board. The goal is to raise $750,000 in funding at the first level.

If you are eager to get your hands on a 64-core Epiphany-IV chip, and if Adapteva reaches its stretch goal of $3m in pledges, then if you pledge $199 or more at the 64-core level, you will get a single board with the Epiphany-IV.

If Adapteva doesn't reach the $3m level, you'll get two Epiphany boards with the ARM coprocessors. Pony up $499 for developer level and you will get a unique serial number on your Epiphany-III board and you get early access to the Parallella SDK. And cough up $5,000 and you are at the early access level and you get the developer stack three months ahead of everyone else.

As El Reg goes to press, Adapteva's Kickstarter project has 1,593 backers and has raised $201,445. The company has to reach its $750,000 funding goal by October 27 at 6 PM Eastern. ®