Shrinking primary databases

Clear out the redundant data to make clear space

Stripping out old records from primary databases could pay off handsomely in reclaimed disk capacity and in faster database operations and backups. Clearpace has technology to do this and archive the extracted records in de-duplicated form on cheap SATA arrays.

UK-based HP reseller 2e2 will resell and use Clearpace's NParchive product to take rarely accessed records out of Oracle and similar databases and store them in de-duplicated form in a separate archive. If you have a 500GB Oracle database it can strip out, say, 200GB of data that hasn't been accessed for weeks, de-duplicate it and store it in compressed form in a SATA disk archive. There it occupies 10-50GB of space and is still accessible at disk speed - obviously not as fast as the original raw data but you get it pretty quickly.
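As a rough illustration of that workflow, here is a minimal Python sketch that splits records by last-access time and compresses the stale ones for an archive tier. It is not Clearpace's method: the record layout, the four-week cutoff, the function names and the use of JSON plus zlib are all assumptions made for the example.

```python
import json
import zlib
from datetime import datetime, timedelta


def split_by_last_access(records, now, max_idle=timedelta(weeks=4)):
    """Partition records into (active, stale) by how long they have sat idle.

    The four-week default is an assumption; the article only says "weeks".
    """
    active, stale = [], []
    for rec in records:
        if now - rec["last_access"] > max_idle:
            stale.append(rec)
        else:
            active.append(rec)
    return active, stale


def pack_archive(stale):
    """Serialise stale records to JSON and compress them with zlib."""
    serialisable = [
        {**rec, "last_access": rec["last_access"].isoformat()} for rec in stale
    ]
    return zlib.compress(json.dumps(serialisable).encode("utf-8"), 9)


# Hypothetical sample data: one recently used record, two stale ones.
now = datetime(2024, 1, 1)
records = [
    {"id": 1, "status": "SHIPPED", "last_access": now - timedelta(days=2)},
    {"id": 2, "status": "SHIPPED", "last_access": now - timedelta(weeks=10)},
    {"id": 3, "status": "SHIPPED", "last_access": now - timedelta(weeks=30)},
]

active, stale = split_by_last_access(records, now)
blob = pack_archive(stale)

# The archive stays readable: decompress and parse to get the records back.
restored = json.loads(zlib.decompress(blob))
```

Only record 1 stays in the primary set; records 2 and 3 end up in the compressed blob and can be read back from it.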

The primary database is now a 300GB one; its processes run faster and its DR (disaster recovery) and development copies are smaller. Count the production database plus those two copies and the 200GB saving multiplies: you're saving up to 600GB of primary capacity.
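The sums are easy to check. This short Python sketch assumes the 600GB figure comes from three full copies of the database (production, DR and development), which is one reading of the numbers rather than something Clearpace has spelled out; the function name is made up for the example.

```python
def capacity_savings(primary_gb, archived_gb, copies):
    """Return (remaining primary size, total primary capacity saved).

    `copies` counts every full copy of the database that shrinks,
    assumed here to be production, DR and development.
    """
    remaining = primary_gb - archived_gb
    saved = archived_gb * copies
    return remaining, saved


remaining_gb, saved_gb = capacity_savings(primary_gb=500, archived_gb=200, copies=3)

# Reduction ratio implied by a 10-50GB archive footprint for 200GB of data.
ratio_low = 200 / 50   # archive takes 50GB
ratio_high = 200 / 10  # archive takes 10GB
```

With these inputs the primary database drops to 300GB, 600GB is saved across the three copies, and the archive is between four and twenty times smaller than the data it holds.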

Clearpace's NParchive stores data as either unique original data or pointers to it, in a tree-like pattern to reflect the original records. It can be searched with SQL from common tools offered by Business Objects, Crystal Reports, COGNOS and others. It can ingest data from different primary databases - SQL Server, Oracle, DB2 or whatever - and store them in a tamper-proof way.
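To give a feel for the "unique data or pointers" idea, here is a toy Python sketch of value-level de-duplication. It is a flat illustration under our own assumptions, not NParchive's tree structure or file format; the class name, its methods and the sample rows are all hypothetical.

```python
class DedupStore:
    """Toy store: each distinct value is kept once, rows hold pointers to it."""

    def __init__(self):
        self.values = []  # unique values, each stored exactly once
        self.index = {}   # value -> its position in self.values
        self.rows = []    # each row is a tuple of pointers into self.values

    def _pointer(self, value):
        # Store the value the first time it is seen, then reuse its position.
        if value not in self.index:
            self.index[value] = len(self.values)
            self.values.append(value)
        return self.index[value]

    def add(self, row):
        self.rows.append(tuple(self._pointer(v) for v in row))

    def get(self, i):
        # Rebuild the original row by following its pointers.
        return tuple(self.values[p] for p in self.rows[i])


store = DedupStore()
store.add(("2008-01-03", "London", "SHIPPED"))
store.add(("2008-01-03", "London", "RETURNED"))
store.add(("2008-01-04", "London", "SHIPPED"))

cells_stored = sum(len(r) for r in store.rows)  # 9 cells across three rows
unique_values = len(store.values)               # only 5 distinct values kept
```

Nine cells shrink to five stored values plus small integer pointers, and every row can still be rebuilt exactly. The more repetitive the archived records, the bigger the win.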

ESG boss Steve Duplessie blogged on deduplication recently. He talked about data in primary storage that's rarely accessed and doesn't change, saying: "This is the stage where we next want to apply a massive reduction in the copies of data we have. It's still 'primary' storage, but by applying de-dupe here we can probably chop 50 per cent or more of our overall capacity off at the knees.

"The next step is to figure out how to slide the de-dupe lever closer to the point of creation, and the biggest value point is going to be [with this data]."

Clearpace has technology to vacuum up this data from primary databases, shrinking and speeding them, and stuffing it in de-duplicated and compressed form, perhaps twenty times smaller - possibly more - into a single searchable disk-based archive. It's not de-duplicating the structured database information in situ but it is reducing primary databases in size and preserving old data in a much-reduced form. The de-dupe lever has been slid closer to the creation point. That sounds pretty good. ®

Whitepaper

Clearpace has teamed up with HP to produce a free whitepaper on this subject, called The Six R's of Application Archiving. You can download it from Reg Whitepapers.
