Feeds

MIT scientists craft a storage system fit for THE ENTIRE UNIVERSE

BlueDBM tackles storage network gremlins with FPGAs

The Essential Guide to IT Transformation

Distributed file systems may be cheap to run, but their performance can be atrocious when the network becomes saturated, and some boffins are hoping to change this so to better simulate our universe.

MIT researchers have tried to solve the network saturation problems bought about by SSD-loaded distributed storage systems with a new approach named BlueDBM, and hope the approach will give scientists a boost when running complex simulations.

One potential application of the BlueDBM system is speeding a University of Washington simulation of the universe.

"Scientists need to query this rather enormous dataset to track which particles are interacting with which other particles, but running those kind of queries is time-consuming,” MIT Sang-Woo Jun told MIT News. "We hope to provide a real-time interface that scientists can use to look at the information more easily."

This tech sees the boffins sit field-programmable gate arrays (FPGA) between the host computer and the storage, and lash them together via their own network. The result is a low-latency, high bandwidth, scalable storage system that has an order of magnitude greater performance than Microsoft's rival "CORFU" system [PDF].

The secret to this performance increase is the combination of PCIe-based flash storage with a storage controller implemented on an FPGA that is linked to all other controllers by multi-gigabit low latency serial links with a SERialize/DESerializer (SERDES) function that is implemented directly within each FPGA.

By doing this "each node is able to access remote storage with negligible performance degradation," they write. "Not only does the controller-to-controller network provide pooling of storage capacity, but it also allows combining the throughput of all nodes on the network, resulting in linear throughput scaling with more nodes."

When the MIT boffins evaluated a four-node implementation of the BlueDBM system they found it had a zippy network with an average packet latency of around 0.5 microseconds.

"Considering that the typical latency of a flash read is several tens of microseconds requests in our network can, in theory, traverse dozens of nodes before the network latency becomes a significant portion of the storage read latency," they write in the Scalable Multi-Access Flash Store for Big Data Analytics paper [PDF].

From he perspective of an end-user, the prototype BlueDBM system has "has an average latency to client applications of about 70 microseconds, which is an order of magnitude lower than existing distributed flash systems," they write.

More information on the system will be available in late February, when the researchers present their paper at the FPGA 2014 summit in Monterrey, California. ®

The Essential Guide to IT Transformation

More from The Register

next story
Sysadmin Day 2014: Quick, there's still time to get the beers in
He walked over the broken glass, killed the thugs... and er... reconnected the cables*
Auntie remains MYSTIFIED by that weekend BBC iPlayer and website outage
Still doing 'forensics' on the caching layer – Beeb digi wonk
Microsoft says 'weird things' can happen during Windows Server 2003 migrations
Fix coming for bug that makes Kerberos croak when you run two domain controllers
Multipath TCP speeds up the internet so much that security breaks
Black Hat research says proposed protocol will bork network probes, flummox firewalls
Cisco says network virtualisation won't pay off everywhere
Another sign of strain in the Borg/VMware relationship?
Forrester says Australia, not China, is next boom market for cloud
It's cloudy but fine down under, analyst says
prev story

Whitepapers

Implementing global e-invoicing with guaranteed legal certainty
Explaining the role local tax compliance plays in successful supply chain management and e-business and how leading global brands are addressing this.
Boost IT visibility and business value
How building a great service catalog relieves pressure points and demonstrates the value of IT service management.
Why and how to choose the right cloud vendor
The benefits of cloud-based storage in your processes. Eliminate onsite, disk-based backup and archiving in favor of cloud-based data protection.
The Essential Guide to IT Transformation
ServiceNow discusses three IT transformations that can help CIO's automate IT services to transform IT and the enterprise.
Maximize storage efficiency across the enterprise
The HP StoreOnce backup solution offers highly flexible, centrally managed, and highly efficient data protection for any enterprise.