The Register® — Biting the hand that feeds IT

Feeds

Boffins fawn over dirt cheap server clusters

Fast array of wimpy nodes

SaaS data loss: The problem you didn’t know you had

A team of researchers at Carnegie Mellon University have been studying how they can make cheap, low-powered, and relatively unimpressive server nodes gang up and do more work than the two-socket x64 server that is the workhorse of the IT industry. They have come up with an approach called FAWN, which is short for Fast Array of Wimpy Nodes.

Last week at the ACM's Symposium on Operating Systems Principles, Carnegie Mellon researchers working with Intel Labs presented a paper (pdf) on the FAWN concept, demonstrating where a combination of wimpy server nodes built on motherboards that usually end up in inexpensive PCs or homegrown media servers can do more of the Web 2.0-style query work per unit of energy than more powerful boxes.

This is actually the second paper that the FAWN project has put out this year; the first one (pdf), introducing the work done by David Andersen, Jason Franklin, Amar Phanishayee, Lawrence Tan, and Vijay Vasudevan of Carnegie Mellon and Michael Kaminsky of Intel Labs.

This is by no means the first time that researchers, hyperscale data centers and IT suppliers have taken a look at clustering low-powered and relatively wimpy server nodes together to aggregate large amounts of computing, memory, and I/O capacity together. But the FAWN project researchers are trying to push the envelope - the power envelope, that is - and push it down as low as it can practically go.

To demonstrate what an array of FAWN machines can do, The Carnegie Mellon researchers say they have created a key value storage cluster that is similar in concept to Amazon's Dynamo and the open source projects Memcached (which is championed these days by Facebook, among others) and Voldemort (a distributed key value storage system, which is a database but not a relational or object database and which is used by LinkedIn). The prototype FAWN machines - the project is in its third generation - are the kinds of things you could build in your living room on a fairly modest budget.

The first generation wimpy nodes consisted of a cluster of eight baby PCs in beige boxes networked with a cheap switch, and the second generation mounted 14 boards together on a bare frame and didn't even bother with a chassis. The picture shown is of the second generation FAWN setup, but according to the paper published at SOSP, the latest cluster has 21 nodes, each using a 500 MHz Atom processor, 256 MB of main memory, and 4 GB of CompactFlash storage.

This is not, by any stretch of the imagination, a powerful PC. In fact, this is wimpy even for a wimpy PC. But according to the CMU researchers, here is the key thing that has Google Network Appliance kicking in money for the FAWN project alongside Intel: Each one of those wimpy nodes consumers under 5 watts of juice as it is running at near peak performance processing queries and retrieving data from the FAWN distributed store (FAWN-DS).

A node can do 1,300 256-byte queries per second, according to the paper and process 364 queries per joule of energy. This, say the techies, is two orders of magnitude better bang per joule than a regular server can deliver.

Carnegie Mellon FAWN Cluster

The second-generation FAWN Project cluster

One of the key factors behind the wimpy nodes doing so well is that on such a node, processing, main memory and flash memory speeds are more in synch than they are on a modern x64 or RISC server. Because CPUs are revving so much faster than the I/O devices that feed them, they are often tapping their feet, waiting for data.

To compensate for this, modern CPUs have layered on all kinds of features - speculative execution, out-of-order execution, superscalar execution, simultaneous multithreading, branch prediction and the like as well as two or three layers of cache - that try to make up for the big gap between CPUs and their I/O. The problem is, these features not only cost money, they also consume a lot of power. So you get the CPUs to do the work, but at a big cost.

Without naming any names, the Carnegie Mellon researchers say that a quad-core superscalar processor running at several gigahertz can process approximately 100 million instructions per joule, but these in-order, relatively stupid chips used in the wimpy nodes can deliver one billion instructions per joule while running at a much lower frequency.

To make some comparisons, the researchers took a single server node based on an Intel desktop quad-core Q6700 processor and put 2 GB of memory on it as well as a Mtron Mobi solid state disk. This machine was set up to run Linux with a tickless kernel (2.6.27) and all of the power management features were optimized. The machine consumed 64 watts when idle, and from 83 watts to 90 watts when it was loaded up doing query work.

The Intel server node was able to process 4.771 random 256 byte reads, providing an efficiency rating of 52 queries per joule. The 21-node FAWN cluster idled at 83 watts, and peaked at 99 watts during puts and 91 watts during gets. This is 36,000 queries against a 20 GB dataset, which is what gives you the 364 queries per joule (including the power drawn from the switch linking the nodes). Nodes based on the desktop mobo that Carnegie Mellon tested using disk drives instead of SSD did awful, as you would expect, delivering only 17 queries per joule.

The techies at Carnegie Mellon and Intel are not nuts. They are not suggesting that there is no place for disk, but rather that you have to have the technology reflect the dataset size and query rate you are trying to deliver, and you have to make choices.

CMU FAWN Tradeoffs

The tradeoffs between queries and data set size

As their research shows, there is a trade-off that seems to be unavoidable for now, because disk drives are more capacious than flash drives, which are fatter than main memory. And that is this: If you want to query large data sets, you need to use disks and that means you can't hit the high query rates of a FAWN array using only main memory (possible if you datasets are really tiny) or flash. That means you will pay a lot more for servers, and they will be a lot less efficient. ®

Steps to Take Before Choosing a Business Continuity Partner

Latest Comments

This is hilarious

Wow, I'm groundbreaking. I've been running a FAWN for years. An array of old beige boxes, obsolete for desktop purposes but great for running a web server, a mail server, a file server a backup server, a little fist sized print server appliance, the list goes on.

I wasn't sure if I was being cost effective and green by recycling obsolete gear or if I was horribly behind the times and needed to spend $20K to consolidate all this onto one higher powered server supporting half a dozen virtual machines. Now I find out I'm on the cutting edge, proving that server virtualization is a shell game with no payoff. Ultimately you have to provide the processor and RAM to support the same amount of processing and making it denser will always cost more and run hotter. Time to go sell all those server and virtualization stocks.

0
0

Beowolf anyone?

This has been done. Its called Beowolf. It was a NASA project where they took a mass of old machines that were going to be discarded and built computing clusters with them. This is not a new idea.

Matt Bryant, check supermicro for atom based servers. Newegg was selling them for about $359.00 US for a box with 2GB of RAM and a harddrive. You can buy them without the drive and boot them from USB thumbdrives.

SUPERMICRO SYS-5015A-H 1U Barebone Server Intel 945GC Intel Atom 330 Dual-Core 1.6GHz processor

rackmount (1U) only 14 inches deep. very nice solid little box. I've been deploying other similar units for years.

0
0

RE: not much use for bursty workloads

True, but not every task in the enterprise is customer-facing, there's a lot of batch stuff still going on in the back-office. And even if your requirement has bursty workloads, there are technologies out there to allow you to power-up and power-down servers either to schedule or demand (last time I checked, Corrie did follow a predictable schedule, along with a predictably mind-numbing storyline). I've long nagged our vendor rep for some Atom-powered blades that will fit the same chassis as the x64 blades as we have some tasks which would suit this kind of low-CPU-power-but-continual-grind computing.

As an example, we have a rack of old DL320s with Celeron CPUs. They were bought years ago for a project where we needed a bank of backup servers for some Linux clusters, but our commercial backup software of the time didn't cover Linux. Instead, one of our Linux gurus (and he was the real item, sandals and all!) wrote a simple backup program that sits on top of standard x86 Red Hat, and CPU power is not really required. At the time, the simplest and cheapest servers were the DL320s. I would love to replace them with blades, but the current IBM and hp x64 blades we use are overkill for the task. Atom-based blades or servers, especially if we could boot them from flash or USB keys, would be perfect as we could just port the current Linux code, whereas anything like T2+ would be both too expensive and require a code re-write (and there's no Linux support on T2+) or a new backup app. Unfortunately, there is little chance of management allowing us to build a DIY FAWN rack for the job as they insist everything has to be covered by support contracts.

So, I'm looking at hp and IBM to step up and make me some Atom blades that will fit the generic chassis. Come on, guys, it can't be that hard?

0
0

More from The Register

SCO vs. IBM battle resumes over ownership of Unix
Zombie lawsuit back and wants to suck the brains out of Linux
 breaking news
You don't need phone lines or cable for ANYTHING, says Dish
The satellite-dish man can sort you out with phone and broadband over the air too
 breaking news
What's HP got under wraps? Looks awfully flash and tape shaped
What happens in Vegas won't stay there - we've got the details
Microsoft borks botnet takedown in Citadel snafu
Stupid Redmond kicked over our honeypots, wail white hats
IBM's $1bn layoffs latest: Now axe swings in US, Canada - reports
Union claims 121 storage bods canned after dismal sales
NetApp musters muscular cluster bluster for ONTAP busters
Storage array OS overhauled to juggle more nodes, go down on you, er, less
HP adds 'Haswell' Xeon E3s to entry ProLiant servers
Gussies up MicroServer for SMBs, adds baby switches