Feeds

Hadoop Hive stung into action, swarms around SQL

More relational, more useful to humans, we're promised

HP ProLiant Gen8: Integrated lifecycle automation

Hortonworks has unveiled the Stinger Initiative, a project to make Hadoop’s Hive data warehouse friendlier with SQL and faster.

Hortonworks has also unveiled two accompanying Hadoop projects, which it’s submitted to the Apache Software Foundation (ASF) in the hope they become community-supported projects. They are a runtime called Tez and a sign-in and authentication system called Gateway. Both Tez and Gateway are ASF incubator projects. You can read more about them here.

Hadoop services startup Hortonworks said Stinger would “enhance Hive with more SQL and better performance” for what it called “human-time use cases”.

Translated, Stinger should make Hive friendlier and faster to use in data querying and analytics normally undertaken by SQL and relational tools.

Hive, like the rest of the Hadoop architecture, has thrived on crunching batches of data – Hadoop is a open-source implementation of Google’s MapReduce and a NoSQL system.

However, the NoSQL crowds realised they need to make their architectures work better with SQL-like tools used by businesses in the real world.

The standard SQL interface for Hive was HiveQL, but it doesn't match the latest SQL standard - and support for HiveQL is not widespread, so banking your data infrastructure on it is a bit of a gamble. ASF's HiveQL project web page is depricated, and simply points you to the HiveQL programming manual.

According to Hortonworks, Stinger will make Hive “a more suitable tool for the decision support queries people want to perform on Hadoop”.

This means the addition of analytics features such as the OVER clause, support for subqueries in WHERE and aligning Hive’s type system with the standard SQL model.

The plan is to speed up Hive, too. There’s a new executing engine to increase the number of records per second Hive can process, a new columnar file format to provide “a more modern, efficient and high performing” means to store Hive data, and the Tez runtime framework to speed up workload speeds by eliminating unnecessary talks and synchronization barriers and that reads and writes to HDFS.

A preview of Stinger is planned ahead of the Hadoop Summit in Amsterdam in March. ®

The Power of One eBook: Top reasons to choose HP BladeSystem

More from The Register

next story
Apple fanbois SCREAM as update BRICKS their Macbook Airs
Ragegasm spills over as firmware upgrade kills machines
HIDDEN packet sniffer spy tech in MILLIONS of iPhones, iPads – expert
Don't panic though – Apple's backdoor is not wide open to all, guru tells us
Mozilla fixes CRITICAL security holes in Firefox, urges v31 upgrade
Misc memory hazards 'could be exploited' - and guess what, one's a Javascript vuln
NO MORE ALL CAPS and other pleasures of Visual Studio 14
Unpicking a packed preview that breaks down ASP.NET
Captain Kirk sets phaser to SLAUGHTER after trying new Facebook app
William Shatner less-than-impressed by Zuck's celebrity-only app
Cheer up, Nokia fans. It can start making mobes again in 18 months
The real winner of the Nokia sale is *drumroll* ... Nokia
EU dons gloves, pokes Google's deals with Android mobe makers
El Reg cops a squint at investigatory letters
Chrome browser has been DRAINING PC batteries for YEARS
Google is only now fixing ancient, energy-sapping bug
prev story

Whitepapers

Designing a Defense for Mobile Applications
Learn about the various considerations for defending mobile applications - from the application architecture itself to the myriad testing technologies.
How modern custom applications can spur business growth
Learn how to create, deploy and manage custom applications without consuming or expanding the need for scarce, expensive IT resources.
Reducing security risks from open source software
Follow a few strategies and your organization can gain the full benefits of open source and the cloud without compromising the security of your applications.
Boost IT visibility and business value
How building a great service catalog relieves pressure points and demonstrates the value of IT service management.
Consolidation: the foundation for IT and business transformation
In this whitepaper learn how effective consolidation of IT and business resources can enable multiple, meaningful business benefits.