Feeds

Shrinking primary databases

Clear out the redundant data to make clear space

Protecting against web application threats using SSL

Stripping out old records from primary databases could pay big bucks in terms of reclaimed disk capacity, faster database operations and backups. Clearpace has technology to do this and archive the extracted records in de-duplicated form on cheap SATA arrays.

UK-based HP reseller 2e2 will resell and use Clearpace's NParchive product to take rarely accessed records out of Oracle and similar databases and store them in de-duplicated form in a separate archive. If you have a 500GB Oracle database it can strip out, say, 200GB of data that hasn't been accessed for weeks, de-duplicate it and store it in compressed form in a SATA disk archive. There it occupies 10-50GB of space and is still accessible at disk speed - obviously not as fast as the original raw data but you get it pretty quickly.

The primary database is now a 300GB one; its processes run faster and its DR (disaster recovery) and development copies are smaller. With this multiplier you're saving up to 600GB of primary capacity.

Clearpace's NParchive stores data as either unique original data or pointers to it, in a tree-like pattern to reflect the original records. It can be searched with SQL from common tools offered by Business Objects, Crystal Reports, COGNOS and others. It can ingest data from different primary databases - SQL Server, Oracle, DB2 or whatever - and store them in a tamper-proof way.

ESG boss Steve Duplessie blogged on deduplication recently. He talked about data in primary storage that's rarely accessed and doesn't change, saying: "This is the stage where we next want to apply a massive reduction in the copies of data we have. It's still 'primary' storage, but by applying de-dupe here we can probably chop 50 per cent or more of our overall capacity off at the knees.

"The next step is to figure out how to slide the de-dupe lever closer to the point of creation, and the biggest value point is going to be [with this data]."

Clearpace has technology to vacuum up this data from primary databases, shrinking and speeding them, and stuffing it in de-duplicated and compressed form, perhaps twenty times smaller - possibly more - into a single searchable disk-based archive. It's not de-duplicating the structured database information in situ but it is reducing primary databases in size and preserving old data in a much-reduced form. The de-dupe lever has been slid closer to the creation point. That sounds pretty good. ®

Whitepaper

Clearpace has teamed up with HP to produce a free whitepaper on this subject, called The Six R's of Application Archiving. You can download it from Reg Whitepapers.

Choosing a cloud hosting partner with confidence

More from The Register

next story
Wanna keep your data for 1,000 YEARS? No? Hard luck, HDS wants you to anyway
Combine Blu-ray and M-DISC and you get this monster
Google+ GOING, GOING ... ? Newbie Gmailers no longer forced into mandatory ID slurp
Mountain View distances itself from lame 'network thingy'
US boffins demo 'twisted radio' mux
OAM takes wireless signals to 32 Gbps
Apple flops out 2FA for iCloud in bid to stop future nude selfie leaks
Millions of 4chan users howl with laughter as Cupertino slams stable door
Students playing with impressive racks? Yes, it's cluster comp time
The most comprehensive coverage the world has ever seen. Ever
Run little spreadsheet, run! IBM's Watson is coming to gobble you up
Big Blue's big super's big appetite for big data in big clouds for big analytics
Seagate's triple-headed Cerberus could SAVE the DISK WORLD
... and possibly bring us even more HAMR time. Yay!
prev story

Whitepapers

Secure remote control for conventional and virtual desktops
Balancing user privacy and privileged access, in accordance with compliance frameworks and legislation. Evaluating any potential remote control choice.
WIN a very cool portable ZX Spectrum
Win a one-off portable Spectrum built by legendary hardware hacker Ben Heck
Storage capacity and performance optimization at Mizuno USA
Mizuno USA turn to Tegile storage technology to solve both their SAN and backup issues.
High Performance for All
While HPC is not new, it has traditionally been seen as a specialist area – is it now geared up to meet more mainstream requirements?
The next step in data security
With recent increased privacy concerns and computers becoming more powerful, the chance of hackers being able to crack smaller-sized RSA keys increases.