The Register® — Biting the hand that feeds IT

Feeds

Hadoop Hive stung into action, swarms around SQL

More relational, more useful to humans, we're promised

Ensure Ease of Recovery with Asigra’s Agentless Software

Hortonworks has unveiled the Stinger Initiative, a project to make Hadoop’s Hive data warehouse friendlier with SQL and faster.

Hortonworks has also unveiled two accompanying Hadoop projects, which it’s submitted to the Apache Software Foundation (ASF) in the hope they become community-supported projects. They are a runtime called Tez and a sign-in and authentication system called Gateway. Both Tez and Gateway are ASF incubator projects. You can read more about them here.

Hadoop services startup Hortonworks said Stinger would “enhance Hive with more SQL and better performance” for what it called “human-time use cases”.

Translated, Stinger should make Hive friendlier and faster to use in data querying and analytics normally undertaken by SQL and relational tools.

Hive, like the rest of the Hadoop architecture, has thrived on crunching batches of data – Hadoop is a open-source implementation of Google’s MapReduce and a NoSQL system.

However, the NoSQL crowds realised they need to make their architectures work better with SQL-like tools used by businesses in the real world.

The standard SQL interface for Hive was HiveQL, but it doesn't match the latest SQL standard - and support for HiveQL is not widespread, so banking your data infrastructure on it is a bit of a gamble. ASF's HiveQL project web page is depricated, and simply points you to the HiveQL programming manual.

According to Hortonworks, Stinger will make Hive “a more suitable tool for the decision support queries people want to perform on Hadoop”.

This means the addition of analytics features such as the OVER clause, support for subqueries in WHERE and aligning Hive’s type system with the standard SQL model.

The plan is to speed up Hive, too. There’s a new executing engine to increase the number of records per second Hive can process, a new columnar file format to provide “a more modern, efficient and high performing” means to store Hive data, and the Tez runtime framework to speed up workload speeds by eliminating unnecessary talks and synchronization barriers and that reads and writes to HDFS.

A preview of Stinger is planned ahead of the Hadoop Summit in Amsterdam in March. ®

Requirements Checklist for Choosing a Cloud Backup and Recovery Service Provider

If someone wants a relational DB...

... then why not use one from the start? Why would you use a system that sort of maybe not quite at a push supports the relational model?

4
0
Anonymous Coward

Re: If someone wants a relational DB...

Oracle.

0
0

More from The Register

Bjarne Again: Hallelujah for C++
Plus: Now officially OK to admit you never used STL algorithms
Interwebs taunt Sir Jony over Apple eye candy makeover
Hey Ive, Ive... add more unicorns, willya?
SCO vs. IBM battle resumes over ownership of Unix
Zombie lawsuit back and wants to suck the brains out of Linux
Red Hat to ditch MySQL for MariaDB in RHEL 7
So long, Oracle! Don't let the door hit you on the way out
Shy? Socially inadequate? Fiddling with your phone could help
App 'tells the brutal truth' about social inadequates' chatup lines
Java EE 7 melds HTML5 with enterprise apps
New release arrives with GlassFish, NetBeans support
 breaking news
'Office Facebook' firm Tibbr wants you to PAY for mobe-meetings app
Great idea. Punters won't cough for it though
 breaking news
The only Waze is Google: Ad giant tipped to gobble map app 'for $1.3bn'
Pac-Man-satnav-ish upstart in bidding war with Apple, Facebook
 breaking news
PM Cameron calls for modern, programmable computers! (We think)
IT education musings to G8 chiefs to mystify IT industry
Apple at WWDC: Sleek new iOS, death of the big cats, pint-sized Mac Pro
CEO Cook: 'The biggest change to iOS since the introduction of the iPhone'