Feeds

Hadoop Daddy Doug Cutting says there's an elephant in the room

The actual Hadoop elephant the project is named after, and a need for better security

Internet Security Threat Report 2014

Not many soft toys make their mark on the worlds of business and technology, but yesterday The Reg beheld “Hadoop”, the toy elephant that gave its name to the big data tool about which we've all heard so much.

Hadoop the toy elephant appeared at CeBIT Australia in the company of Doug Cutting, one of the co-creators of Hadoop the big data tool.

Cutting brought Hadoop along to the show, propping him on the lectern during his keynote and tucking him into his blazer during our chat. Hadoop once belonged to Cutting's then-five-year-old son, who invented its name. Cutting liked it so much he adopted the moniker for the software.

It didn't take much prompting to get Hadoop out into the open air, but Cutting devised a rule stating Hadoop has to be photographed in the company of two others. So the shot below is not your correspondent starf*cking*, but complying with Cutting's wishes.

Hadoop the Elephant, Simon Sharwood and Hadoop co-creator Doug Cutting

Your correspondent, Hadoop the Elephant and Hadoop co-creator Doug Cutting

If you look closely at Hadoop, the elephant, it has a small hole on its leg. Cutting says Hadoop, the software, also has a hole in its list of security features. Making Hadoop more robust is therefore something Cutting expects he, and the Hadoop community will prioritise in coming months.

Another thing Cloudera expects to prioritise is training: Cutting said there's more demand for Hadoop than there are people capable of using it. He added that he doesn't think the world necessarily needs “Data Scientists”, a term he thinks is reasonable as it describes a role that's not that of a conventional statistician. But he also said Hadoop and general big data skills don't need an entirely new class of worker. “Spreadsheets let people who were not in IT do interesting computations and go to their boss and say: 'Look! I took this sales data and I did this to it and I have this cool idea about what to do next'.”

“That really opened up computing to a much larger percentage of [workers inside] a company. We are at a similar point now where more people could be using the tools, but they need to learn the new capabilities.”

Once they do learn Hadoop, Cuttting said he expects they'll exercise it with on-premises Hadoop clusters. While elastic cloud servers appear a natural way to deploy Hadoop, he said the majority of Cloudera customers find it makes more sense to acquire and operate their own kit rather than picking servers off the shelf as and when required.

Scale isn't the reason, he said. Price is. “If you're running something 24 by 7 and it is pretty large, it is cheaper to run it yourself.” Cloudera, he therefore opined, may not have picked the very best name as it turns out the cloud era is coming a little later than it imagined, at least so far as its products are concerned.

Cutting welcomed Intel's recent and colossal investment in Cloudera and defended it by pointing out it will not lead to the company being trapped in orbit around Chipzilla because “it is not an exlcusive deal. We are not required to leave anything else behind. We can do another deal with AMD or ARM,” but must prioritise work on Intel platforms before heading in other directions. ®

Bootnote

*Some readers might be thinking this correspondent has a thing for shots with technology industry celebrity soft toys. To those advancing that theory, the shot below really is Steve Wozniak. Not an Ewok.

Choosing a cloud hosting partner with confidence

More from The Register

next story
Preview redux: Microsoft ships new Windows 10 build with 7,000 changes
Latest bleeding-edge bits borrow Action Center from Windows Phone
Google opens Inbox – email for people too thick to handle email
Print this article out and give it to someone tech-y if you get stuck
Microsoft promises Windows 10 will mean two-factor auth for all
Sneak peek at security features Redmond's baking into new OS
FTDI yanks chip-bricking driver from Windows Update, vows to fight on
Next driver to battle fake chips with 'non-invasive' methods
UNIX greybeards threaten Debian fork over systemd plan
'Veteran Unix Admins' fear desktop emphasis is betraying open source
Entity Framework goes 'code first' as Microsoft pulls visual design tool
Visual Studio database diagramming's out the window
Google+ goes TITSUP. But WHO knew? How long? Anyone ... Hello ...
Wobbly Gmail, Contacts, Calendar on the other hand ...
prev story

Whitepapers

Why cloud backup?
Combining the latest advancements in disk-based backup with secure, integrated, cloud technologies offer organizations fast and assured recovery of their critical enterprise data.
A strategic approach to identity relationship management
ForgeRock commissioned Forrester to evaluate companies’ IAM practices and requirements when it comes to customer-facing scenarios versus employee-facing ones.
Security for virtualized datacentres
Legacy security solutions are inefficient due to the architectural differences between physical and virtual environments.
Reg Reader Research: SaaS based Email and Office Productivity Tools
Read this Reg reader report which provides advice and guidance for SMBs towards the use of SaaS based email and Office productivity tools.
New hybrid storage solutions
Tackling data challenges through emerging hybrid storage solutions that enable optimum database performance whilst managing costs and increasingly large data stores.