Feeds

Big Data open-source duo united under Apache

Hadoop and Cassandra search together

5 things you didn’t know about cloud backup

As big-data hook ups go, they don't get much bigger: NoSQL and distributed computing pin ups Cassandra and Hadoop have been united by the Apache Software Foundation.

ASF has released Apache Cassandra 0.6, adding support for its Hadoop project. Both Cassandra and Hadoop are ASF projects, with Cassandra only graduating from Apache's early phase incubator phase in February.

The union will allow users to run analytics queries using the Hadoop map reduce framework against data held inside Cassandra.

Hadoop is an open-source project party based on Google's MapReduce technology that found large-scale use inside Yahoj!. Cassandra is one of a family of NoSQL systems that started life as a way to store and serve frequently accessed data in massive systems spanning tens of thousands of servers and millions of users. The idea is NoSQL is faster and its architecturally easier to construct that using a traditional relational database system in these environments.

The Cassandra NoSQL technology started at Facebook and became an ASF incubator project in 2009. Users include Digg, Cisco WebEx, Rackspace, Reddit, and Twitter.

As more data has been put into NoSQL systems. it has inevitably followed that those running them want to query it rather than simply use NoSQL as a holding pen for things like Facebook status updates, Tweets, or Digg posts.

Earlier this week, Gear6 announced that it's adding native query capabilities to its Memecached distribution, to create what it called a "NoSQL-like store".

Other improvement in Cassandra 0.6 include integrated row cache to eliminate the need for a separate caching layer, which ASF said would help simplify architectures, and a 30 per cent across-the-board increase in speed to handle increasing write loads by big customers. ®

Gartner critical capabilities for enterprise endpoint backup

More from The Register

next story
Why has the web gone to hell? Market chaos and HUMAN NATURE
Tim Berners-Lee isn't happy, but we should be
Apple promises to lift Curse of the Drained iPhone 5 Battery
Have you tried turning it off and...? Never mind, here's a replacement
'Stop dissing Google or quit': OK, I quit, says Code Club co-founder
And now a message from our sponsors: 'STFU or else'
Microsoft boots 1,500 dodgy apps from the Windows Store
DEVELOPERS! DEVELOPERS! DEVELOPERS! Naughty, misleading developers!
Linux turns 23 and Linus Torvalds celebrates as only he can
No, not with swearing, but by controlling the release cycle
Scratched PC-dispatch patch patched, hatched in batch rematch
Windows security update fixed after triggering blue screens (and screams) of death
This is how I set about making a fortune with my own startup
Would you leave your well-paid job to chase your dream?
prev story

Whitepapers

Best practices for enterprise data
Discussing how technology providers have innovated in order to solve new challenges, creating a new framework for enterprise data.
Implementing global e-invoicing with guaranteed legal certainty
Explaining the role local tax compliance plays in successful supply chain management and e-business and how leading global brands are addressing this.
Advanced data protection for your virtualized environments
Find a natural fit for optimizing protection for the often resource-constrained data protection process found in virtual environments.
How modern custom applications can spur business growth
Learn how to create, deploy and manage custom applications without consuming or expanding the need for scarce, expensive IT resources.
High Performance for All
While HPC is not new, it has traditionally been seen as a specialist area – is it now geared up to meet more mainstream requirements?