Original URL: http://www.theregister.co.uk/2008/07/18/facebook_cassandra_microsoft/
Microsoft, Facebook, Google box clever on really big systems
The enemy of my enemy...
Posted in Applications, 18th July 2008 17:02 GMT
Free whitepaper – Ensuring high service levels in cloud computing
Facebook's decision to release under open source (http://code.google.com/p/the-cassandra-project/) a large-scale data management (http://www.slideshare.net/jhammerb/data-presentations-cassandra-sigmod/) project similar to - and inspired by - Google's BigTable (http://labs.google.com/papers/bigtable.html) has received backing from an unusual quarter: Microsoft.
Data center futures architect and distinguished database developer James Hamilton (http://www.mvdirona.com/jrh/Work/), has complemented the pimply faced social network for releasing what he said (http://perspectives.mvdirona.com/2008/07/12/FacebookReleasesCassandraAsOpenSource.aspx) "looks like a well-engineered system."
Hamilton spent ten years at IBM working on DB2 before joining Microsoft in 1997 to work on SQL Server, and recently collaborated (http://www.nowpublishers.com/product.aspx?product=DBS&doi=1900000002) with fellow database guru Michael Stonebraker on an examination of future database architectures.
Stonebraker put the cat among the pigeons by slagging off (http://www.theregister.co.uk/2008/01/24/stonebraker_dewitt_mapreduce/) Google's MapReduce (http://labs.google.com/papers/mapreduce.html) database tool earlier this year - he called it a step backwards. Stonebraker also noted there are limitations in BigTable and its open-source equivalent Hbase (http://hadoop.apache.org/).
Conspiracy theorists might find it interesting that Stonebraker's co-author David de Witt joined (http://www.theregister.co.uk/2008/04/28/microsoft_hires_dewitt/) Microsoft in April to head up a new research effort into large databases. They should also remember Microsoft is an investor in Facebook.
Adding insult to injury, Facebook has put its project - called Cassandra - up on Google Code. Cassandra is not alone on Google code. Another BigTable clone called Hypertable (http://www.hypertable.org/about.html) was set up on Google Code (http://code.google.com/p/hypertable/wiki/SourceCode?tm=4) earlier this year.
Like BigTable, Cassandra is designed to get round the limitations of traditional relational databases in large-scale, online applications.
Cassandra is the work of a Facebook team led by Jeff Hammerbacher (http://jeffhammerbacher.com/), an ex-Harvard student who was recruited by Facebook chief Mark Zuckerberg from troubled Wall Street bank Bear Stearns.
Hammerbacher is now reported (http://venturebeat.com/2008/07/10/rumor-jeff-hammerbacher-a-key-early-facebook-employee-is-leaving/) to have become the latest Faceboook employee to have served his notice. Can we expect this Facebook youf to show up in well-remunerated style at Redmond this Fall?
And, will all this prompt Google to put BigTable - or even MapReduce - into open source? Given it decided to release its Protocol Buffers (http://www.theregister.co.uk/2008/07/14/dziuba_google_protocol_buffer/page2.html) technology to open source this week - its not beyond the bounds of possibility.
There is a precedent: Facebook released Thrift, its clone of Protocol Buffers, to open source last year.
The games continue.®
