Feeds

The case for open source ETL

What you see is what you get

Secure remote control for conventional and virtual desktops

Comment As far as I have been able to discover, there are four open source ETL (extract, transform and load) tools on the market. Somewhat surprisingly, two of them are homonyms: KETL and Kettle, the other two being Enhydra Octopus and CloverETL.

Kettle is based on an ETTL paradigm, the extra ‘T’ standing for transport (which seems an unnecessary complication) and wins the prize for the product with the most sense of humour as it has four components that are variously named Spoon, Pan, Chef and Kitchen.

The most interesting question is where the market for open source ETL is. Looking at the products one would have to assume that they are mostly in the same space as the Sesame Software product that I discussed recently. That is, they are aimed at developers that know what they are doing and do not need (and do not get) a graphical drag-and-drop style product. The exception is Kettle, which looks much more like an PowerCenter or DataStage.

Another difference in open source products can be in implementation. Kinetic Networks, for example, the developers of KETL, reckons that you may need some implementation assistance with its product. In part, this is a result of the product’s origins: it was originally developed for in-house use and in conjunction with professional services engagements, so it is not really surprising if there are aspects of the product that have not been automated for the open source market yet.

In general, what you see is what you get with open source products, though there are add-on products for both of the homonymous products. In the case of Kettle one of the partners offers an SAP connector while Kinetic Networks has a number of options that it offers in conjunction with KETL, notably an MPP (massively parallel processing) option for improved performance, a data profiling extension and a clickstream capability.

As far as I can see there are no options (apart from support) available with either Enhydra Octopus or CloverETL, the latter being a product that generates Java. This, despite its attractions (especially for ISVs), is still a relatively rare capability: ETL Solutions has a product that generates Java and ETI plans to, but otherwise this is not generally available, so it represents a potential market for CloverETL that is not available to its open source counterparts.

Enhydra Octopus is distinguished by the fact that it has different companies offering support for the product in Europe, Japan and the United States whereas the other products only have limited support options (USA for KETL, Austria/Belgium for Kettle and the Czech Republic for Enhydra Octopus).

In other words, each of the products has something different going for it, though none of them will trouble the likes of Informatica, IBM or Ab Initio.

Copyright © 2005, IT-Analysis.com

The essential guide to IT transformation

More from The Register

next story
The Return of BSOD: Does ANYONE trust Microsoft patches?
Sysadmins, you're either fighting fires or seen as incompetents now
Munich considers dumping Linux for ... GULP ... Windows!
Give a penguinista a hug, the Outlook's not good for open source's poster child
Intel's Raspberry Pi rival Galileo can now run Windows
Behold the Internet of Things. Wintel Things
Linux Foundation says many Linux admins and engineers are certifiable
Floats exam program to help IT employers lock up talent
Microsoft cries UNINSTALL in the wake of Blue Screens of Death™
Cache crash causes contained choloric calamity
Eat up Martha! Microsoft slings handwriting recog into OneNote on Android
Freehand input on non-Windows kit for the first time
prev story

Whitepapers

Implementing global e-invoicing with guaranteed legal certainty
Explaining the role local tax compliance plays in successful supply chain management and e-business and how leading global brands are addressing this.
Top 10 endpoint backup mistakes
Avoid the ten endpoint backup mistakes to ensure that your critical corporate data is protected and end user productivity is improved.
Top 8 considerations to enable and simplify mobility
In this whitepaper learn how to successfully add mobile capabilities simply and cost effectively.
Rethinking backup and recovery in the modern data center
Combining intelligence, operational analytics, and automation to enable efficient, data-driven IT organizations using the HP ABR approach.
Reg Reader Research: SaaS based Email and Office Productivity Tools
Read this Reg reader report which provides advice and guidance for SMBs towards the use of SaaS based email and Office productivity tools.