Feeds

NoSQL hopeful cozies up to Hadoop data-muncher

Big data love-in

Boost IT visibility and business value

NoSQL data store CouchDB has become Hadoop’s latest convert with delivery of a connector tying together the two big-data architectures.

CouchDB user Couchbase has announced a certified Couchbase Hadoop Connector, developed with Hadoop shop Cloudera.

The connector potentially simplifies movement of data between the Couchbase Server, which Couchbase says is "powered" by CouchDB, and the Cloudera Distribution including Hadoop (CDH). Couchbase uses capabilities of CouchDB such as mobile and sync. Both CouchDB and Hadoop, meanwhile, are Apache Software Foundation (ASF) projects.

The connector does this using Sqoop, a plug-in that's an Apache incubator project. Sqoop can stream data from the Couchbase system to the Cloudera Hadoop distribution.

According to Couchbase, the Sqoop addition will enable both consistent application performance and heavy MapReduce data-crunching of data sets.

The plug-in targets web applications such as ads targeting, where low-latency is needed along with high throughput.

The Hadoop Connector was certified via Cloudera’s Certified Technology program.

Couchbase wasn't the only one cozying up to Hadoop this week. Open-source enterprise data integration start-up Talend announced that version 5 of its suite features enhanced support for Hadoop's data warehouse Hive, the Pig data analysis tool and Sqoop.

Couchbase is closely aligned with the NoSQL crowd. It uses and supports the CouchDB document store with employees such as Jan Lenhardt still contributing to the CouchDB Apache project. Couchbase also utilizes Memcached in its Membase Server. The company's customers include the BBC and Zynga.

CouchDB recently made the news when it was dropped from Canonical's Ubuntu One service after Canonical tried and failed to make the document store scale to millions of users and databases over a period of three years.

Cloudera is home to Hadoop founder Doug Cutting. Hadoop was inspired by Google’s MapReduce for large-scale data-munching.

Cloudera was first to deliver productisation and support for Hadoop, but was this year joined by Yahoo! spin out Hortonworks. The latter has this year worked with Microsoft to develop a Hadoop plug-in for Microsoft’s SQL Server, a move that has ended the life of Microsoft’s own big data cruncher, Dryad. ®

This article has been updated to clarify Couchbase's use and support for CouchDB.

The essential guide to IT transformation

More from The Register

next story
Munich considers dumping Linux for ... GULP ... Windows!
Give a penguinista a hug, the Outlook's not good for open source's poster child
The Return of BSOD: Does ANYONE trust Microsoft patches?
Sysadmins, you're either fighting fires or seen as incompetents now
Intel's Raspberry Pi rival Galileo can now run Windows
Behold the Internet of Things. Wintel Things
Microsoft cries UNINSTALL in the wake of Blue Screens of Death™
Cache crash causes contained choloric calamity
Eat up Martha! Microsoft slings handwriting recog into OneNote on Android
Freehand input on non-Windows kit for the first time
Time to move away from Windows 7 ... whoa, whoa, who said anything about Windows 8?
Start migrating now to avoid another XPocalypse – Gartner
You'll find Yoda at the back of every IT conference
The piss always taking is he. Bastard the.
prev story

Whitepapers

5 things you didn’t know about cloud backup
IT departments are embracing cloud backup, but there’s a lot you need to know before choosing a service provider. Learn all the critical things you need to know.
Implementing global e-invoicing with guaranteed legal certainty
Explaining the role local tax compliance plays in successful supply chain management and e-business and how leading global brands are addressing this.
Build a business case: developing custom apps
Learn how to maximize the value of custom applications by accelerating and simplifying their development.
Rethinking backup and recovery in the modern data center
Combining intelligence, operational analytics, and automation to enable efficient, data-driven IT organizations using the HP ABR approach.
Next gen security for virtualised datacentres
Legacy security solutions are inefficient due to the architectural differences between physical and virtual environments.