Feeds

NoSQL hopeful cozies up to Hadoop data-muncher

Big data love-in

Remote control for virtualized desktops

NoSQL data store CouchDB has become Hadoop’s latest convert with delivery of a connector tying together the two big-data architectures.

CouchDB user Couchbase has announced a certified Couchbase Hadoop Connector, developed with Hadoop shop Cloudera.

The connector potentially simplifies movement of data between the Couchbase Server, which Couchbase says is "powered" by CouchDB, and the Cloudera Distribution including Hadoop (CDH). Couchbase uses capabilities of CouchDB such as mobile and sync. Both CouchDB and Hadoop, meanwhile, are Apache Software Foundation (ASF) projects.

The connector does this using Sqoop, a plug-in that's an Apache incubator project. Sqoop can stream data from the Couchbase system to the Cloudera Hadoop distribution.

According to Couchbase, the Sqoop addition will enable both consistent application performance and heavy MapReduce data-crunching of data sets.

The plug-in targets web applications such as ads targeting, where low-latency is needed along with high throughput.

The Hadoop Connector was certified via Cloudera’s Certified Technology program.

Couchbase wasn't the only one cozying up to Hadoop this week. Open-source enterprise data integration start-up Talend announced that version 5 of its suite features enhanced support for Hadoop's data warehouse Hive, the Pig data analysis tool and Sqoop.

Couchbase is closely aligned with the NoSQL crowd. It uses and supports the CouchDB document store with employees such as Jan Lenhardt still contributing to the CouchDB Apache project. Couchbase also utilizes Memcached in its Membase Server. The company's customers include the BBC and Zynga.

CouchDB recently made the news when it was dropped from Canonical's Ubuntu One service after Canonical tried and failed to make the document store scale to millions of users and databases over a period of three years.

Cloudera is home to Hadoop founder Doug Cutting. Hadoop was inspired by Google’s MapReduce for large-scale data-munching.

Cloudera was first to deliver productisation and support for Hadoop, but was this year joined by Yahoo! spin out Hortonworks. The latter has this year worked with Microsoft to develop a Hadoop plug-in for Microsoft’s SQL Server, a move that has ended the life of Microsoft’s own big data cruncher, Dryad. ®

This article has been updated to clarify Couchbase's use and support for CouchDB.

Choosing a cloud hosting partner with confidence

More from The Register

next story
Microsoft to bake Skype into IE, without plugins
Redmond thinks the Object Real-Time Communications API for WebRTC is ready to roll
Microsoft promises Windows 10 will mean two-factor auth for all
Sneak peek at security features Redmond's baking into new OS
Mozilla: Spidermonkey ATE Apple's JavaScriptCore, THRASHED Google V8
Moz man claims the win on rivals' own benchmarks
FTDI yanks chip-bricking driver from Windows Update, vows to fight on
Next driver to battle fake chips with 'non-invasive' methods
PEAK APPLE: iOS 8 is least popular Cupertino mobile OS in all of HUMAN HISTORY
'Nerd release' finally staggers past 50 per cent adoption
DEATH by PowerPoint: Microsoft warns of 0-day attack hidden in slides
Might put out patch in update, might chuck it out sooner
Ubuntu 14.10 tries pulling a Steve Ballmer on cloudy offerings
Oi, Windows, centOS and openSUSE – behave, we're all friends here
Was ist das? Eine neue Suse Linux Enterprise? Ausgezeichnet!
Version 12 first major-number Suse release since 2009
prev story

Whitepapers

Why cloud backup?
Combining the latest advancements in disk-based backup with secure, integrated, cloud technologies offer organizations fast and assured recovery of their critical enterprise data.
Getting started with customer-focused identity management
Learn why identity is a fundamental requirement to digital growth, and how without it there is no way to identify and engage customers in a meaningful way.
Reg Reader Research: SaaS based Email and Office Productivity Tools
Read this Reg reader report which provides advice and guidance for SMBs towards the use of SaaS based email and Office productivity tools.
Intelligent flash storage arrays
Tegile Intelligent Storage Arrays with IntelliFlash helps IT boost storage utilization and effciency while delivering unmatched storage savings and performance.
Getting ahead of the compliance curve
Learn about new services that make it easy to discover and manage certificates across the enterprise and how to get ahead of the compliance curve.