Feeds

On Open Source data warehousing

Greenplum position

Choosing a cloud hosting partner with confidence

YI greeted with some scepticism the initial launch of Greenplum earlier this year (here and here) and was unconvinced about the future of an Open Source data warehousing model. Nevertheless, the company has pushed ahead with its plans and has made a number of significant advances.

To begin with, it has simplified its nomenclature, which was previously confusing. Now there is just Bizgres, which is the Open Source platform and, sometime this autumn, there will be Bizgres MPP (massively parallel processing), which will be Greenplum's high performance appliance (that is, with integrated hardware) and is a chargeable version of Bizgres.

The latest version of Bizgres is 0.7, and it won't be until the MPP release that the product will come out of beta. That said, Greenplum has already introduced a number of new features into Postgres (upon which Bizgres is based), most notably through the introduction of an installer suite, data partitioning and bitmap scans (which act in a fashion similar to bitmap vectors in Oracle). At this stage partitioning capability is limited, being based on constraints only, but the company plans to expand these facilities in later releases. It also intends to introduce bitmap indexing. In the Bizgres MPP release the company will be adding an optimiser.

The other notable move forward is in terms of the partnerships that Greenplum has been forming. Bizgres is now bundled with both an ETL (extract, transform and load) tool and a business intelligence front-end. Both of these are also Open Source products; in the first case from Kinetic Research (this is a Java-based product) and in the second, from JasperSoft. The other partnership that has been announced is with O'Reilly Connection, which will provide the equivalent of "Linked-In", but for engineers and developers interested in progressing this Open Source approach to data warehousing. In other words, this provides a way for developers working in this area to identify and communicate with each other.

This is all very encouraging and Greenplum reports a lot of interest in what it is doing, even though, at the time of writing, it has yet to gain any customers. Customers, of course, are the key: with reference sites the company will be much better placed to take its vision into the market. The difficulty is that even having customers does not necessarily lead to references: some companies shun IT-type publicity while, in any case, implementing a data warehouse is not a short-term exercise, so it may be a while before Greenplum can point to real customer benefits.

On the other hand, Greenplum is much more pro-active in its marketing than some of the other new vendors in this space and, obviously, the Open Source message carries a premium. All of this is encouraging but the jury is still out as to the future of Open Source data warehousing.

© IT-Analysis.com

Internet Security Threat Report 2014

More from The Register

next story
Preview redux: Microsoft ships new Windows 10 build with 7,000 changes
Latest bleeding-edge bits borrow Action Center from Windows Phone
Google opens Inbox – email for people too thick to handle email
Print this article out and give it to someone tech-y if you get stuck
Microsoft promises Windows 10 will mean two-factor auth for all
Sneak peek at security features Redmond's baking into new OS
UNIX greybeards threaten Debian fork over systemd plan
'Veteran Unix Admins' fear desktop emphasis is betraying open source
Entity Framework goes 'code first' as Microsoft pulls visual design tool
Visual Studio database diagramming's out the window
Google+ goes TITSUP. But WHO knew? How long? Anyone ... Hello ...
Wobbly Gmail, Contacts, Calendar on the other hand ...
DEATH by PowerPoint: Microsoft warns of 0-day attack hidden in slides
Might put out patch in update, might chuck it out sooner
prev story

Whitepapers

Choosing cloud Backup services
Demystify how you can address your data protection needs in your small- to medium-sized business and select the best online backup service to meet your needs.
Forging a new future with identity relationship management
Learn about ForgeRock's next generation IRM platform and how it is designed to empower CEOS's and enterprises to engage with consumers.
Security for virtualized datacentres
Legacy security solutions are inefficient due to the architectural differences between physical and virtual environments.
Reg Reader Research: SaaS based Email and Office Productivity Tools
Read this Reg reader report which provides advice and guidance for SMBs towards the use of SaaS based email and Office productivity tools.
Storage capacity and performance optimization at Mizuno USA
Mizuno USA turn to Tegile storage technology to solve both their SAN and backup issues.