Feeds

The case for open source ETL

What you see is what you get

Beginner's guide to SSL certificates

Comment As far as I have been able to discover, there are four open source ETL (extract, transform and load) tools on the market. Somewhat surprisingly, two of them are homonyms: KETL and Kettle, the other two being Enhydra Octopus and CloverETL.

Kettle is based on an ETTL paradigm, the extra ‘T’ standing for transport (which seems an unnecessary complication) and wins the prize for the product with the most sense of humour as it has four components that are variously named Spoon, Pan, Chef and Kitchen.

The most interesting question is where the market for open source ETL is. Looking at the products one would have to assume that they are mostly in the same space as the Sesame Software product that I discussed recently. That is, they are aimed at developers that know what they are doing and do not need (and do not get) a graphical drag-and-drop style product. The exception is Kettle, which looks much more like an PowerCenter or DataStage.

Another difference in open source products can be in implementation. Kinetic Networks, for example, the developers of KETL, reckons that you may need some implementation assistance with its product. In part, this is a result of the product’s origins: it was originally developed for in-house use and in conjunction with professional services engagements, so it is not really surprising if there are aspects of the product that have not been automated for the open source market yet.

In general, what you see is what you get with open source products, though there are add-on products for both of the homonymous products. In the case of Kettle one of the partners offers an SAP connector while Kinetic Networks has a number of options that it offers in conjunction with KETL, notably an MPP (massively parallel processing) option for improved performance, a data profiling extension and a clickstream capability.

As far as I can see there are no options (apart from support) available with either Enhydra Octopus or CloverETL, the latter being a product that generates Java. This, despite its attractions (especially for ISVs), is still a relatively rare capability: ETL Solutions has a product that generates Java and ETI plans to, but otherwise this is not generally available, so it represents a potential market for CloverETL that is not available to its open source counterparts.

Enhydra Octopus is distinguished by the fact that it has different companies offering support for the product in Europe, Japan and the United States whereas the other products only have limited support options (USA for KETL, Austria/Belgium for Kettle and the Czech Republic for Enhydra Octopus).

In other words, each of the products has something different going for it, though none of them will trouble the likes of Informatica, IBM or Ab Initio.

Copyright © 2005, IT-Analysis.com

Remote control for virtualized desktops

More from The Register

next story
PEAK APPLE: iOS 8 is least popular Cupertino mobile OS in all of HUMAN HISTORY
'Nerd release' finally staggers past 50 per cent adoption
Microsoft to bake Skype into IE, without plugins
Redmond thinks the Object Real-Time Communications API for WebRTC is ready to roll
Microsoft promises Windows 10 will mean two-factor auth for all
Sneak peek at security features Redmond's baking into new OS
Mozilla: Spidermonkey ATE Apple's JavaScriptCore, THRASHED Google V8
Moz man claims the win on rivals' own benchmarks
Yes, Virginia, there IS a W3C HTML5 standard – as of now, that is
You asked for it! You begged for it! Then you gave up! And now it's HERE!
FTDI yanks chip-bricking driver from Windows Update, vows to fight on
Next driver to battle fake chips with 'non-invasive' methods
DEATH by PowerPoint: Microsoft warns of 0-day attack hidden in slides
Might put out patch in update, might chuck it out sooner
Ubuntu 14.10 tries pulling a Steve Ballmer on cloudy offerings
Oi, Windows, centOS and openSUSE – behave, we're all friends here
prev story

Whitepapers

Cloud and hybrid-cloud data protection for VMware
Learn how quick and easy it is to configure backups and perform restores for VMware environments.
Forging a new future with identity relationship management
Learn about ForgeRock's next generation IRM platform and how it is designed to empower CEOS's and enterprises to engage with consumers.
High Performance for All
While HPC is not new, it has traditionally been seen as a specialist area – is it now geared up to meet more mainstream requirements?
Intelligent flash storage arrays
Tegile Intelligent Storage Arrays with IntelliFlash helps IT boost storage utilization and effciency while delivering unmatched storage savings and performance.
Website security in corporate America
Find out how you rank among other IT managers testing your website's vulnerabilities.