NoSQL hopeful cozies up to Hadoop data-muncher

Big data love-in

NoSQL data store CouchDB has become Hadoop’s latest convert with delivery of a connector tying together the two big-data architectures.

CouchDB user Couchbase has announced a certified Couchbase Hadoop Connector, developed with Hadoop shop Cloudera.

The connector potentially simplifies movement of data between the Couchbase Server, which Couchbase says is "powered" by CouchDB, and the Cloudera Distribution including Hadoop (CDH). Couchbase uses capabilities of CouchDB such as mobile and sync. Both CouchDB and Hadoop, meanwhile, are Apache Software Foundation (ASF) projects.

The connector does this using Sqoop, a plug-in that's an Apache incubator project. Sqoop can stream data from the Couchbase system to the Cloudera Hadoop distribution.

According to Couchbase, the Sqoop addition will enable both consistent application performance and heavy MapReduce data-crunching.

The plug-in targets web applications such as ad targeting, where low latency is needed along with high throughput.

The Hadoop Connector was certified via Cloudera’s Certified Technology program.

Couchbase wasn't the only one cozying up to Hadoop this week. Open-source enterprise data integration start-up Talend announced that version 5 of its suite features enhanced support for Hadoop's data warehouse Hive, the Pig data analysis tool and Sqoop.

Couchbase is closely aligned with the NoSQL crowd. It uses and supports the CouchDB document store, with employees such as Jan Lehnardt still contributing to the CouchDB Apache project. Couchbase also uses Memcached in its Membase Server. The company's customers include the BBC and Zynga.

CouchDB recently made the news when it was dropped from Canonical's Ubuntu One service after Canonical tried and failed to make the document store scale to millions of users and databases over a period of three years.

Cloudera is home to Hadoop founder Doug Cutting. Hadoop was inspired by Google’s MapReduce for large-scale data-munching.

Cloudera was first to deliver productisation and support for Hadoop, but was joined this year by Yahoo! spin-out Hortonworks. The latter has since worked with Microsoft to develop a Hadoop plug-in for Microsoft’s SQL Server, a move that has ended the life of Microsoft’s own big data cruncher, Dryad. ®

This article has been updated to clarify Couchbase's use and support for CouchDB.
