Feeds

Cloudera promises 'Google-like' Big Data dream in minutes

Hadoop shop automates so you don't have to

Security for virtualized datacentres

Updated Cloudera has delivered a "substantial" update to its open source Hadoop distribution.

On Wednesday, Cloudera rolled out Cloudera Enterprise 3.5, two months after shipping a major upgrade to its Hadoop distribution called Cloudera Distribution of Apache Hadoop (CDH) 3.0.

Whereas CDH 3.0 expanded Cloudera's Hadoop stack from three components to 10, the idea behind Cloudera Enterprise 3.5 is to make that distro easier to manage and deploy for IT shops outside the ranks of Hadoop's super-users like Facebook, Yahoo!, and LinkedIn. Cloudera provides services and support for such mainstream users.

The changes will let you install and configure a full Google-like infrastructure "in a couple of minutes", product vice president Charles Zedlewski told The Register. "We have done a substantial update." Hadoop is based on Google's GFS and MapReduce platforms.

Zedlewski said Cloudera Enterprise 3.5 automates configuration changes, service restarts, and the addition and removal of hardware. There's also an Activity Monitor that consolidates user activity across components to provide both a real-time and historical view of user activities and jobs.

"[We have] expanded the capability of the management suite from monitoring and discovery of issues, to diagnostic problems, to automating changes, and setting long-term changes," Zedlewski said.

Cloudera has also enhanced the Hadoop Resource and Authorization Manager, facilitating rollbacks and improving security with LDAP systems.

Hadoop is an architecture for crunching huge amounts of data using a network of distributed servers. Nutch web crawler creator Doug Cutting based the platform on research papers describing Google GFS and MapReduce., and it is now an Apache Software Foundation (ASF) project.

Today, Cloudera also released a free "Express" edition of its Service and Configuration Manager module used in Cloudera Enterprise 3.5 that will automate the installation and configuration of Hadoop on a cluster of up to 50 nodes. Meanwhile, the company has also donated code for its packaging and testing suite to ASF, under a project called Bigtop. The idea is to help improve packaging and interoperability testing for Hadoop and related modules.

Among Bigtop's initial committers is Canonical, chief commercial steward of Ubuntu. Cloudera has supported packaging for Ubuntu Linux for the last two-and-a-half years.

Zedlewski said Cloudera will add a further three or four modules to the current stack, among them a compression algorithm that leverages Google's Snappy to speed up data import and export.

This will be added in an update to CDH in the next month or so, Zedlewski said. Other components are due in "the CDH 4 timeframe", he said, while Cloudera is also looking at enhancements around high-availability features in the core Hadoop module. Zedlewski would not provide a date for CDH 4, but he said "work is well underway". ®

This article has been updated to clarify Cloudera has delivered Cloudera Enterprise 3.5.

Security and trust: The backbone of doing business over the internet

More from The Register

next story
Phones 4u slips into administration after EE cuts ties with Brit mobe retailer
More than 5,500 jobs could be axed if rescue mission fails
JINGS! Microsoft Bing called Scots indyref RIGHT!
Redmond sporran metrics get one in the ten ring
Driving with an Apple Watch could land you with a £100 FINE
Bad news for tech-addicted fanbois behind the wheel
Murdoch to Europe: Inflict MORE PAIN on Google, please
'Platform for piracy' must be punished, or it'll kill us in FIVE YEARS
Phones 4u website DIES as wounded mobe retailer struggles to stay above water
Founder blames 'ruthless network partners' for implosion
Sony says year's losses will be FOUR TIMES DEEPER than thought
Losses of more than $2 BILLION loom over troubled Japanese corp
Radio hams can encrypt, in emergencies, says Ofcom
Consultation promises new spectrum and hints at relaxed licence conditions
Why Oracle CEO Larry Ellison had to go ... Except he hasn't
Silicon Valley's veteran seadog in piratical Putin impression
Big Content Australia just blew a big hole in its credibility
AHEDA's research on average content prices did not expose methodology, so appears less than rigourous
prev story

Whitepapers

Secure remote control for conventional and virtual desktops
Balancing user privacy and privileged access, in accordance with compliance frameworks and legislation. Evaluating any potential remote control choice.
WIN a very cool portable ZX Spectrum
Win a one-off portable Spectrum built by legendary hardware hacker Ben Heck
Storage capacity and performance optimization at Mizuno USA
Mizuno USA turn to Tegile storage technology to solve both their SAN and backup issues.
High Performance for All
While HPC is not new, it has traditionally been seen as a specialist area – is it now geared up to meet more mainstream requirements?
The next step in data security
With recent increased privacy concerns and computers becoming more powerful, the chance of hackers being able to crack smaller-sized RSA keys increases.