Feeds

What the Hell is IBM Information Integrator?

It's a database thang

  • alert
  • submit to reddit

Top 5 reasons to deploy VMware with Tegile

Briefing Note Yesterday IBM announced its new Information Integrator family of products, writes Phil Howard. Ultimately this will consist of three offerings (although in the longer term the three products will probably converge), based on SQL, an object oriented API and an XML API respectively. However, the last of these, which will use XQuery, has not been announced yet, as it is awaiting the final definition of the XQuery standard.

The two products that are announced (in beta) are Information Integrator, previously known by the code name of Xperanto; and Information Integrator for Content, where the former is the relational product and the latter is designed to provide integration in content management environments. In practice, the latter represents a re-positioning of IBM's Enterprise Information Portal (a subset of WebSphere Portal, which is really the company's enterprise information portal) for accessing mainly IBM content repositories together with other content and data sources.

The Content product is not really new, so I want to focus on Information Integrator, which is now available for beta testing and which is scheduled for general availability in mid-summer. In today's article I will discuss the details of Information Integrator and tomorrow I will consider the circumstances under which it will be most appropriate to look at Information Integrator as opposed to alternative technologies such as ETL (extract, transform and load) tools. I will also consider some of the environments in which use of Information Integrator may be most beneficial.

Information Integrator (which is in version 8.1 to align it with the latest release of DB2) is based primarily upon the facilities of DB2, SQL and DataJoiner. The basic concept is predicated upon a federated database approach in which multiple heterogeneous databases appear to the user as if they were a single database.

Both Microsoft and IBM have espoused this approach for some time, while Oracle has preferred to concentrate upon centralisation. However, the downside of centralisation is that you have to rip out and replace existing databases, with all the pain that that entails.

Microsoft, meanwhile, has relatively limited support for federated databases in SQL Server 2000 and, even then, it tends to be limited to SQL Server support, whereas IBM has taken a more agnostic approach, supporting all sorts of relational databases within a federation. It is not hard to say, therefore, that IBM is the market leader in this space.

However, it is also important to realise that Information Integrator is not limited to accessing relational data sources - it can also access XML, flat files, Microsoft Excel, ODBC, Web and other content stores and so on, although updates and replication are limited to relational sources in the first release. Thus (for those of you who know the product) the full capabilities of DataJoiner have not been implemented in this release.

There are some key features of Information Integrator that should be mentioned. In particular, you can query data wherever it resides, as if it was at a single location, with a single view across all the relevant data sources. The product supports queries by caching query tables across federated sources, while the optimiser will validate the SQL used against the source database and will automatically compensate if the relevant syntax is not supported on the remote database. Other features of the federation capabilities of the product include the ability to publish the results of a query to a message queue and to compose, transform and validate XML documents.

In terms of updates, I have already mentioned replication and Information Integrator effectively acts as a replication server, initially supporting Oracle, Informix, Microsoft, Sybase and Teradata databases, as well as DB2. Functions are flexible with support for both one to many and many to one topologies; table-based or transaction-based data movement, which may be dependent on whether you have batch or online requirements; and latency which may be scheduled, interval-based or continuous.

While a brief article such as this can give no more than a flavour of a product like Information Integrator (and Bloor Research will be publishing a full report on the product in due course), it should be clear that in the right environment Information Integrator has much to offer. What those environments might be, I will discuss tomorrow.

© IT-Analysis.com

Beginner's guide to SSL certificates

More from The Register

next story
It's Big, it's Blue... it's simply FABLESS! IBM's chip-free future
Or why the reversal of globalisation ain't gonna 'appen
'Hmm, why CAN'T I run a water pipe through that rack of media servers?'
Leaving Las Vegas for Armenia kludging and Dubai dune bashing
Microsoft and Dell’s cloud in a box: Instant Azure for the data centre
A less painful way to run Microsoft’s private cloud
Facebook slurps 'paste sites' for STOLEN passwords, sprinkles on hash and salt
Zuck's ad empire DOESN'T see details in plain text. Phew!
Windows 10: Forget Cloudobile, put Security and Privacy First
But - dammit - It would be insane to say 'don't collect, because NSA'
CAGE MATCH: Microsoft, Dell open co-located bit barns in Oz
Whole new species of XaaS spawning in the antipodes
prev story

Whitepapers

Cloud and hybrid-cloud data protection for VMware
Learn how quick and easy it is to configure backups and perform restores for VMware environments.
A strategic approach to identity relationship management
ForgeRock commissioned Forrester to evaluate companies’ IAM practices and requirements when it comes to customer-facing scenarios versus employee-facing ones.
High Performance for All
While HPC is not new, it has traditionally been seen as a specialist area – is it now geared up to meet more mainstream requirements?
Three 1TB solid state scorchers up for grabs
Big SSDs can be expensive but think big and think free because you could be the lucky winner of one of three 1TB Samsung SSD 840 EVO drives that we’re giving away worth over £300 apiece.
Security for virtualized datacentres
Legacy security solutions are inefficient due to the architectural differences between physical and virtual environments.