
Cloudera Hadoop plugs trunk into Netezza iron

Stuffed elephant meets data warehousing


Cloudera – the commercial Hadoop outfit – has teamed with data analytics maven Netezza to build a connector between its stuffed elephant distro and Netezza's TwinFin data warehousing appliances.

Due at the end of the third quarter, the connector will allow users to move data from a Netezza appliance to the Cloudera Distribution for Hadoop (CDH) – and vice versa.

Based on research papers describing Google’s proprietary infrastructure, the open source Hadoop is a way of crunching massive amounts of data across a network of distributed machines. Named after the yellow stuffed elephant belonging to the son of project founder Doug Cutting, the platform underpins net services offered by everyone from Yahoo! and Facebook and Twitter to Microsoft. Yes, Microsoft.

Meanwhile, Netezza's TwinFin blade servers offer a customized PostgreSQL database. Like other data warehouses, they're built for running ad hoc SQL queries against epic data sets.

"One thing we have seen at Cloudera is substantial existing use of Netezza's product in our big enterprise accounts," Cloudera CEO Mike Olson tells The Reg. "And they're looking at Hadoop as a complement to the existing Netezza use. We view ourselves as another piece of the puzzle, solving a different problem: complex data and hard core exhaustive analytics, more exotic algorithms running over complex data at scale."

But Olson also stresses that after it's crunched by Hadoop, users will be able to move data back to the Netezza appliance for additional exploration. "Enterprises want to take structured data – customer and transaction data – and combine it with all the unstructured data coming off their websites...that might not fit into a tabular schema well."

Hadoop, for instance, might be used to crunch data relating to user behavior on a website. "What we call Web 2.0 sites have users that move around on their site, post status updates, interact with other individuals. All of that activity is captured in web logs that can't easily be digested using existing relational systems. Hadoop can look at all that activity, identify individual users, digest their behavior, and begin to make predictions about behavior," Olson continues.

"But these companies want to combine what these users do with who they are, but that information is often in a system like Netezza's."
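
The pattern Olson describes can be sketched in a few lines of Python. This is only an illustration, not the Cloudera connector or any Hadoop API: the log format, the `CUSTOMERS` table, and every function name here are made up for the example. It counts events per user in a map-and-reduce style, then joins those counts with structured customer records of the kind a warehouse would hold.

```python
from collections import defaultdict

# Hypothetical web log lines in the form "user_id action"
LOG_LINES = [
    "u1 post_status",
    "u2 view_page",
    "u1 view_page",
    "u1 send_message",
]

# Hypothetical structured customer records, standing in for warehouse rows
CUSTOMERS = {
    "u1": {"name": "Alice", "plan": "paid"},
    "u2": {"name": "Bob", "plan": "free"},
}

def map_phase(line):
    """Emit one (user_id, 1) pair for each log line."""
    user_id, _action = line.split()
    yield user_id, 1

def reduce_phase(pairs):
    """Sum the emitted counts for each user_id."""
    totals = defaultdict(int)
    for key, value in pairs:
        totals[key] += value
    return dict(totals)

def join_with_customers(activity, customers):
    """Attach each user's event count to the matching customer record."""
    return {
        uid: {**customers[uid], "events": count}
        for uid, count in activity.items()
        if uid in customers
    }

pairs = [pair for line in LOG_LINES for pair in map_phase(line)]
activity = reduce_phase(pairs)
profile = join_with_customers(activity, CUSTOMERS)
print(profile)
```

In a real deployment the map and reduce steps would run across many machines and the customer table would live in the warehouse; the sketch only shows the shape of the data flow.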

"A lot of people assumed that Hadoop was innately competitive with Netezza and other players," says Olson. "But that's not what we're seeing. We're seeing an appetite for this new technology [Hadoop] to solve problems with data – but it has to work well with existing and expanding investments in [data warehousing]."

Last month, Cloudera teamed with Oracle-tools shop Quest Software to build a Hadoop connector for Oracle. It's due in Q3 as well. ®
