Feeds

NoSQL hopeful cozies up to Hadoop data-muncher

Big data love-in

Next gen security for virtualised datacentres

NoSQL data store CouchDB has become Hadoop’s latest convert with delivery of a connector tying together the two big-data architectures.

CouchDB user Couchbase has announced a certified Couchbase Hadoop Connector, developed with Hadoop shop Cloudera.

The connector potentially simplifies movement of data between the Couchbase Server, which Couchbase says is "powered" by CouchDB, and the Cloudera Distribution including Hadoop (CDH). Couchbase uses capabilities of CouchDB such as mobile and sync. Both CouchDB and Hadoop, meanwhile, are Apache Software Foundation (ASF) projects.

The connector does this using Sqoop, a plug-in that's an Apache incubator project. Sqoop can stream data from the Couchbase system to the Cloudera Hadoop distribution.

According to Couchbase, the Sqoop addition will enable both consistent application performance and heavy MapReduce data-crunching of data sets.

The plug-in targets web applications such as ads targeting, where low-latency is needed along with high throughput.

The Hadoop Connector was certified via Cloudera’s Certified Technology program.

Couchbase wasn't the only one cozying up to Hadoop this week. Open-source enterprise data integration start-up Talend announced that version 5 of its suite features enhanced support for Hadoop's data warehouse Hive, the Pig data analysis tool and Sqoop.

Couchbase is closely aligned with the NoSQL crowd. It uses and supports the CouchDB document store with employees such as Jan Lenhardt still contributing to the CouchDB Apache project. Couchbase also utilizes Memcached in its Membase Server. The company's customers include the BBC and Zynga.

CouchDB recently made the news when it was dropped from Canonical's Ubuntu One service after Canonical tried and failed to make the document store scale to millions of users and databases over a period of three years.

Cloudera is home to Hadoop founder Doug Cutting. Hadoop was inspired by Google’s MapReduce for large-scale data-munching.

Cloudera was first to deliver productisation and support for Hadoop, but was this year joined by Yahoo! spin out Hortonworks. The latter has this year worked with Microsoft to develop a Hadoop plug-in for Microsoft’s SQL Server, a move that has ended the life of Microsoft’s own big data cruncher, Dryad. ®

This article has been updated to clarify Couchbase's use and support for CouchDB.

Build a business case: developing custom apps

More from The Register

next story
Why has the web gone to hell? Market chaos and HUMAN NATURE
Tim Berners-Lee isn't happy, but we should be
Mozilla's 'Tiles' ads debut in new Firefox nightlies
You can try turning them off and on again
Microsoft boots 1,500 dodgy apps from the Windows Store
DEVELOPERS! DEVELOPERS! DEVELOPERS! Naughty, misleading developers!
'Stop dissing Google or quit': OK, I quit, says Code Club co-founder
And now a message from our sponsors: 'STFU or else'
Apple promises to lift Curse of the Drained iPhone 5 Battery
Have you tried turning it off and...? Never mind, here's a replacement
Uber, Lyft and cutting corners: The true face of the Sharing Economy
Casual labour and tired ideas = not really web-tastic
Linux turns 23 and Linus Torvalds celebrates as only he can
No, not with swearing, but by controlling the release cycle
prev story

Whitepapers

Gartner critical capabilities for enterprise endpoint backup
Learn why inSync received the highest overall rating from Druva and is the top choice for the mobile workforce.
Implementing global e-invoicing with guaranteed legal certainty
Explaining the role local tax compliance plays in successful supply chain management and e-business and how leading global brands are addressing this.
Rethinking backup and recovery in the modern data center
Combining intelligence, operational analytics, and automation to enable efficient, data-driven IT organizations using the HP ABR approach.
Consolidation: The Foundation for IT Business Transformation
In this whitepaper learn how effective consolidation of IT and business resources can enable multiple, meaningful business benefits.
Next gen security for virtualised datacentres
Legacy security solutions are inefficient due to the architectural differences between physical and virtual environments.