Shrinking primary databases

Clear out the redundant data to make clear space

Stripping out old records from primary databases could pay off big in reclaimed disk capacity, faster database operations and quicker backups. Clearpace has technology to do this and to archive the extracted records in de-duplicated form on cheap SATA arrays.

UK-based HP reseller 2e2 will resell and use Clearpace's NParchive product to take rarely accessed records out of Oracle and similar databases and store them in de-duplicated form in a separate archive. If you have a 500GB Oracle database it can strip out, say, 200GB of data that hasn't been accessed for weeks, de-duplicate it and store it in compressed form in a SATA disk archive. There it occupies 10-50GB of space and is still accessible at disk speed - obviously not as fast as the original raw data but you get it pretty quickly.

The primary database is now a 300GB one; its processes run faster and its DR (disaster recovery) and development copies are smaller too. Multiply the 200GB across the primary plus, say, one DR and one development copy and you're saving up to 600GB of disk capacity.
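The sums behind that example can be sketched in a few lines. This is a back-of-envelope illustration, not a Clearpace sizing tool: the function name and the three-copy count (primary, one DR copy, one development copy) are assumptions made to arrive at the 600GB figure, and the archive footprint uses the 10-50GB range quoted above.

```python
# Back-of-envelope arithmetic for the 500GB Oracle example.
# Assumption: the database exists as three copies (primary, DR, development).

def archive_savings(primary_gb, archived_gb, copies, archive_footprint_gb):
    """Return (remaining primary size, gross GB saved, net GB saved)."""
    remaining_gb = primary_gb - archived_gb
    gross_saved_gb = archived_gb * copies  # every copy shrinks by the same amount
    net_saved_gb = gross_saved_gb - archive_footprint_gb  # the archive takes some space back
    return remaining_gb, gross_saved_gb, net_saved_gb


best_case = archive_savings(500, 200, copies=3, archive_footprint_gb=10)
worst_case = archive_savings(500, 200, copies=3, archive_footprint_gb=50)
```

On these assumptions the gross saving is 600GB either way, and the net saving after paying for the SATA archive lands between 550GB and 590GB.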

Clearpace's NParchive stores data as either unique original data or pointers to it, in a tree-like pattern to reflect the original records. It can be searched with SQL from common tools offered by Business Objects, Crystal Reports, COGNOS and others. It can ingest data from different primary databases - SQL Server, Oracle, DB2 or whatever - and store it in a tamper-proof way.
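To make the "unique data or pointers" idea concrete, here is a minimal sketch of value-level de-duplication. It is not Clearpace's format: the class and method names are hypothetical, and it uses a flat value table rather than the tree-like layout described above. Each distinct value is stored once and every record is kept as a tuple of pointers into that table.

```python
# Minimal sketch of pointer-based de-duplication (hypothetical, not NParchive's design).

class DedupArchive:
    def __init__(self):
        self._values = []   # each unique value, stored exactly once
        self._index = {}    # value -> its position in _values
        self._records = []  # each record is a tuple of pointers into _values

    def _pointer(self, value):
        # Reuse the existing slot if this value has been seen before.
        if value not in self._index:
            self._index[value] = len(self._values)
            self._values.append(value)
        return self._index[value]

    def add(self, record):
        self._records.append(tuple(self._pointer(v) for v in record))

    def get(self, position):
        # Rebuild the original record by following its pointers.
        return tuple(self._values[p] for p in self._records[position])

    def unique_count(self):
        return len(self._values)


archive = DedupArchive()
archive.add(("London", "GBP", "closed"))
archive.add(("London", "GBP", "open"))
```

The two records hold six field values between them, but only four distinct values are stored; the repeated "London" and "GBP" cost one pointer each. Old database records tend to repeat values heavily, which is where a scheme like this gets its reduction.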

ESG boss Steve Duplessie blogged on deduplication recently. He talked about data in primary storage that's rarely accessed and doesn't change, saying: "This is the stage where we next want to apply a massive reduction in the copies of data we have. It's still 'primary' storage, but by applying de-dupe here we can probably chop 50 per cent or more of our overall capacity off at the knees.

"The next step is to figure out how to slide the de-dupe lever closer to the point of creation, and the biggest value point is going to be [with this data]."

Clearpace has technology to vacuum up this data from primary databases, shrinking them and speeding them up, and to stuff it in de-duplicated and compressed form, perhaps twenty times smaller - possibly more - into a single searchable disk-based archive. It's not de-duplicating the structured database information in situ but it is reducing primary databases in size and preserving old data in a much-reduced form. The de-dupe lever has been slid closer to the creation point. That sounds pretty good. ®

Whitepaper

Clearpace has teamed up with HP to produce a free whitepaper on this subject, called The Six R's of Application Archiving. You can download it from Reg Whitepapers.
