Feeds

The evolution of integration platforms

MDM + DIP = IMP

Choosing a cloud hosting partner with confidence

Comment Once upon a time there were ETL (extract, transform and load) tools and then, quite separately, products for data cleansing and matching started to appear.

However, it took some time before the vendors of the former realised the synergies that existed with the latter. The (partial) exception was Prism, which developed its Quality Manager product as a tool primarily for monitoring the process of data cleansing as opposed to the actual activity of cleansing. However, when Ardent bought Prism, it did so to get its hands on what is now DataStage TX, rather than for Quality Manager.

Nevertheless, it was the descendants of Ardent (subsequently Informix, then Ascential and now IBM) that were the first to recognise the benefits of an integrated platform comprising both ETL and data quality capabilities, which by this time had been augmented by data profiling and analysis.

This trend has continued ever since with, in turn, SAS, Informatica and Business Objects all acquiring major data quality vendors. These companies have been relatively late to follow what is now IBM's lead, but they have done so for a reason in that SAS and Informatica, in particular, have concentrated on metadata management and a unified (as opposed to merely integrated) environment in the first instance and then acquired complementary software, whereas Ascential/IBM did it (is doing it) the other way around.

However, this isn't the end of the story, because data integration has also expanded into, first, support for semi-structured data (SWIFT messages, EDI and so forth) and, more recently, unstructured data. In addition, EII (enterprise information integration) and federated query capability has been added into the mix, with Informatica embedding source code from Composite Software, Business Objects acquiring Medience (recently launched as BusinessObjects Data Federator), IBM integrating with WebSphere Information Integrator, Synopsis building its own facilities, and so on.

This is, effectively, the state of play today (or soon will be) in terms of data integration platforms: ETL, EII, data quality and metadata management in a unified platform. However, things do not stand still and the next step will be to include master data management (MDM) within this platform. As usual, some vendors (notably IBM and SAS) will include their own capabilities while others, at least initially, will partner with third parties. For example, Purisma already runs in conjunction with Business Objects' platform.

Now, if you include MDM along with data integration I think that extends beyond what most people think of as data integration and we might prefer to call that an Information Management Platform (IMP). So, I believe that we will see the extension of data integration platforms (DIP) into IMPs.

Having come to this conclusion, I am planning research for the remainder of this year that will mirror this trend: we have already published an ETL report this year, and an MDM report (written by Harriet Fryman) will shortly be available, while I am starting work on a Data Quality report, which I will follow with a renewed look at the EII/Data Federation market. Finally, I will put all of that together into a Platforms report that will cover both DIPs and IMPs.

Copyright © 2006, IT-Analysis.com

A short biography of Phillip Howard is here.

Business security measures using SSL

More from The Register

next story
'Windows 9' LEAK: Microsoft's playing catchup with Linux
Multiple desktops and live tiles in restored Start button star in new vids
Not appy with your Chromebook? Well now it can run Android apps
Google offers beta of tricky OS-inside-OS tech
New 'Cosmos' browser surfs the net by TXT alone
No data plan? No WiFi? No worries ... except sluggish download speed
Greater dev access to iOS 8 will put us AT RISK from HACKERS
Knocking holes in Apple's walled garden could backfire, says securo-chap
NHS grows a NoSQL backbone and rips out its Oracle Spine
Open source? In the government? Ha ha! What, wait ...?
Google extends app refund window to two hours
You now have 120 minutes to finish that game instead of 15
Intel: Hey, enterprises, drop everything and DO HADOOP
Big Data analytics projected to run on more servers than any other app
iOS 8 release: WebGL now runs everywhere. Hurrah for 3D graphics!
HTML 5's pretty neat ... when your browser supports it
prev story

Whitepapers

Providing a secure and efficient Helpdesk
A single remote control platform for user support is be key to providing an efficient helpdesk. Retain full control over the way in which screen and keystroke data is transmitted.
Saudi Petroleum chooses Tegile storage solution
A storage solution that addresses company growth and performance for business-critical applications of caseware archive and search along with other key operational systems.
Security and trust: The backbone of doing business over the internet
Explores the current state of website security and the contributions Symantec is making to help organizations protect critical data and build trust with customers.
Reg Reader Research: SaaS based Email and Office Productivity Tools
Read this Reg reader report which provides advice and guidance for SMBs towards the use of SaaS based email and Office productivity tools.
Security for virtualized datacentres
Legacy security solutions are inefficient due to the architectural differences between physical and virtual environments.