Feeds

On Open Source data warehousing

Greenplum position

Security for virtualized datacentres

YI greeted with some scepticism the initial launch of Greenplum earlier this year (here and here) and was unconvinced about the future of an Open Source data warehousing model. Nevertheless, the company has pushed ahead with its plans and has made a number of significant advances.

To begin with, it has simplified its nomenclature, which was previously confusing. Now there is just Bizgres, which is the Open Source platform and, sometime this autumn, there will be Bizgres MPP (massively parallel processing), which will be Greenplum's high performance appliance (that is, with integrated hardware) and is a chargeable version of Bizgres.

The latest version of Bizgres is 0.7, and it won't be until the MPP release that the product will come out of beta. That said, Greenplum has already introduced a number of new features into Postgres (upon which Bizgres is based), most notably through the introduction of an installer suite, data partitioning and bitmap scans (which act in a fashion similar to bitmap vectors in Oracle). At this stage partitioning capability is limited, being based on constraints only, but the company plans to expand these facilities in later releases. It also intends to introduce bitmap indexing. In the Bizgres MPP release the company will be adding an optimiser.

The other notable move forward is in terms of the partnerships that Greenplum has been forming. Bizgres is now bundled with both an ETL (extract, transform and load) tool and a business intelligence front-end. Both of these are also Open Source products; in the first case from Kinetic Research (this is a Java-based product) and in the second, from JasperSoft. The other partnership that has been announced is with O'Reilly Connection, which will provide the equivalent of "Linked-In", but for engineers and developers interested in progressing this Open Source approach to data warehousing. In other words, this provides a way for developers working in this area to identify and communicate with each other.

This is all very encouraging and Greenplum reports a lot of interest in what it is doing, even though, at the time of writing, it has yet to gain any customers. Customers, of course, are the key: with reference sites the company will be much better placed to take its vision into the market. The difficulty is that even having customers does not necessarily lead to references: some companies shun IT-type publicity while, in any case, implementing a data warehouse is not a short-term exercise, so it may be a while before Greenplum can point to real customer benefits.

On the other hand, Greenplum is much more pro-active in its marketing than some of the other new vendors in this space and, obviously, the Open Source message carries a premium. All of this is encouraging but the jury is still out as to the future of Open Source data warehousing.

© IT-Analysis.com

Internet Security Threat Report 2014

More from The Register

next story
ONE MILLION people already running Windows 10
A third of them are doing it in VMs, but early feedback focuses on frippery
Netscape Navigator - the browser that started it all - turns 20
It was 20 years ago today, Marc Andreeesen taught the band to play
Sway: Microsoft's new Office app doesn't have an Undo function
Content aggregation, meet the workplace ... oh
Sign off my IT project or I’ll PHONE your MUM
Honestly, it’s a piece of piss
Do Moan! MONSTER 6-day EMAIL OUTAGE hits Domain Monster
Customers freaked out by frightful service
Return of the Jedi – Apache reclaims web server crown
.london, .hamburg and .公司 - that's .com in Chinese - storm the web server charts
NetWare sales revive in China thanks to that man Snowden
If it ain't Microsoft, it's in fashion behind the Great Firewall
prev story

Whitepapers

Forging a new future with identity relationship management
Learn about ForgeRock's next generation IRM platform and how it is designed to empower CEOS's and enterprises to engage with consumers.
Win a year’s supply of chocolate
There is no techie angle to this competition so we're not going to pretend there is, but everyone loves chocolate so who cares.
Why cloud backup?
Combining the latest advancements in disk-based backup with secure, integrated, cloud technologies offer organizations fast and assured recovery of their critical enterprise data.
High Performance for All
While HPC is not new, it has traditionally been seen as a specialist area – is it now geared up to meet more mainstream requirements?
Saudi Petroleum chooses Tegile storage solution
A storage solution that addresses company growth and performance for business-critical applications of caseware archive and search along with other key operational systems.