Database gurus slammed for Google post

MapReduce: a "major step backwards"

A database pioneer and an honored computer science professor have come under heavy fire for issuing a strong critique of Google's MapReduce technology for processing large volumes of unstructured data.

Ingres inventor and Postgres architect Mike Stonebraker and his colleague, University of Wisconsin computer science professor David DeWitt, have been accused of "not getting" data in the clouds, while others have demanded the duo retract what's been branded a "highly inaccurate article".

Stonebraker and DeWitt had criticized MapReduce and slammed moves to introduce MapReduce into the academic curriculum.

They called MapReduce a major step backwards because it is "sub-optimal", lacks the features commonly associated with database management systems (DBMS), and is incompatible with "all of the tools DBMS users have come to depend on". They also said that it is not "novel". They concluded that MapReduce ignores many of the developments in parallel DBMS technology over the last 25 years.

Their joint blog post drew fire from bloggers and a barrage of commentators coming out in support of MapReduce, including a detailed riposte that claimed DeWitt and Stonebraker don't know what they are talking about.

The gist of the counter-argument is that MapReduce can't be compared to a relational DBMS because it is a technique for dealing with large amounts of unstructured data rather than the formal tabular data held in a relational DBMS. Google reckons it processes 20PB of unstructured data a day using MapReduce.
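For readers unfamiliar with the technique, the shape of a MapReduce job can be sketched in a few lines. This is a single-process illustration only, not Google's implementation: the function names (map_phase, shuffle, reduce_phase, word_count) and the sample documents are made up for this example, and a real system would spread the map and reduce steps across many machines.

```python
from collections import defaultdict
from itertools import chain


def map_phase(document):
    """Emit a (word, 1) pair for every word in one free-text document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Group emitted values by key, the step that sits between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Collapse all values seen for one key into a single result."""
    return key, sum(values)


def word_count(documents):
    """Run map, shuffle and reduce in sequence over a list of documents."""
    mapped = chain.from_iterable(map_phase(doc) for doc in documents)
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


# Hypothetical input: plain text with no schema, tables or indexes.
documents = ["the cloud stores data", "the data is unstructured"]
counts = word_count(documents)
print(counts)
```

Note that nothing here declares a schema or builds an index, which is roughly where the two camps part ways: critics see those omissions as a step backwards, defenders see them as the point when the input is raw text.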

The almost complete lack of support for the view put forward by DeWitt and Stonebraker suggests they might well have misunderstood MapReduce's role in modern data processing.

Given the eminent background of both academics, though, this is surprising. DeWitt has researched large parallel DBMS since the 1980s and, in addition to his pioneering work on Ingres and Postgres, Stonebraker is currently active in the large DBMS area with his new company Vertica.

DeWitt has published more than 100 technical papers and been honored for his contributions to database systems, having started in the mid-1970s on a NASA- and DARPA-funded project looking at a scalable object-relational system for managing very large geo-spatial data sets.

Requests for an interview have yet to be answered.®
