Feeds

A brief look at Trillium's Discovery 5.0

Beyond profiling

Intelligent flash storage arrays

I recently received an update from Trillium Software on its latest release (5.0) of Discovery, the data profiling and analysis product it acquired from Avellino.

Even though the product is all about profiling and analysis there are actually no new profiling and analysis capabilities (apart from ease of use and similar things – including selective profiling, which allows you to profile only what you want) in the product.

Why is this? Simply that there is only so much profiling that you can do, and Discovery does it all. This, of course, raises the point that sooner or later (probably sooner) other products will be in the same position. So how can Trillium keep ahead of the trailing pack?

Trillium started to answer this question in its previous release with support for user-defined business rules and it has extended this capability further in its latest version, so that you can now apply business rules to do things such as create calculated columns or concatenate fields.

Now, if you think about it, these are transformations in the same way that ETL (extract, transform and load) applies transformations. And since Discovery can read data from source systems and output it again to a target this means that Discovery could actually be used for ETL purposes.

Of course, there is no job stream definition or scheduling (though the latter could be implemented externally via the command line capability that has been introduced in this release), and Discovery is more aimed at business users than the developers that do the bulk of the work with conventional ETL tools but, for simple jobs, it is certainly capable of being used in that way. Note, however, that this is not a target market for Trillium: it would not want to upset its partners in the data integration space.

Business rules have also been upgraded with the ability to apply them, dynamically, to external data. That is, you can apply business rules without having to pull the (external)data into the Discovery Metabase (see below). In fact this, in a rather different way highlights the general thrust of release 5.0, which is very much focused on the external environment. Also, there are enhanced abilities to embed profiling in third party environments, extended integration with Trillium’s data cleansing and matching software, and an email notification capability, amongst others.

Finally, and this is not part of the new release per se, Trillium told me that it still gets grief over its metabase (where it stores all its data and metadata), because it does not run on top of SQL Server or Oracle or some other popular relational database. Let me state my opinion here firmly: this is a very good thing.

The point about the information in the Discovery metabase is that it does not change: you may enter new data and you may purge old data but there is very little, if any, updating of data. In other words a database designed for transactional processing is not appropriate. In fact, Trillium uses BerkeleyDB from Sleepycat Software for its metabase, which is entirely appropriate—it should provide better performance with less overhead and reduced administration—which is as good a set of arguments as it gets for not using a relational database.

Copyright © 2006, IT-Analysis.com

Top 5 reasons to deploy VMware with Tegile

More from The Register

next story
Preview redux: Microsoft ships new Windows 10 build with 7,000 changes
Latest bleeding-edge bits borrow Action Center from Windows Phone
Google opens Inbox – email for people too thick to handle email
Print this article out and give it to someone tech-y if you get stuck
Microsoft promises Windows 10 will mean two-factor auth for all
Sneak peek at security features Redmond's baking into new OS
UNIX greybeards threaten Debian fork over systemd plan
'Veteran Unix Admins' fear desktop emphasis is betraying open source
Entity Framework goes 'code first' as Microsoft pulls visual design tool
Visual Studio database diagramming's out the window
Google+ goes TITSUP. But WHO knew? How long? Anyone ... Hello ...
Wobbly Gmail, Contacts, Calendar on the other hand ...
DEATH by PowerPoint: Microsoft warns of 0-day attack hidden in slides
Might put out patch in update, might chuck it out sooner
prev story

Whitepapers

Choosing cloud Backup services
Demystify how you can address your data protection needs in your small- to medium-sized business and select the best online backup service to meet your needs.
Forging a new future with identity relationship management
Learn about ForgeRock's next generation IRM platform and how it is designed to empower CEOS's and enterprises to engage with consumers.
Security for virtualized datacentres
Legacy security solutions are inefficient due to the architectural differences between physical and virtual environments.
Reg Reader Research: SaaS based Email and Office Productivity Tools
Read this Reg reader report which provides advice and guidance for SMBs towards the use of SaaS based email and Office productivity tools.
Storage capacity and performance optimization at Mizuno USA
Mizuno USA turn to Tegile storage technology to solve both their SAN and backup issues.