What kind of Big Data is yours? Is it data bauxite, data aluminium ... or data Dreamliner?

And can it beat a psychic octopus

Internet Security Threat Report 2014

Analysis Data is valuable. There, we’ve said it, do you feel better? The question is, has data as an information currency - and an entity in and of itself - become inherently more valuable?

Now that we have real time transactional big data analytics to enrich our lives, does this mean that the 1s and 0s inside every binary now somehow equate to a higher cost per pound, dollar and kilo than before? Is one type of data more valuable than another?

Yes, obviously, you could say that unstructured data is worth less than structured data because we have already applied some taxonomy and categorisation to it, but is this wholly fair and correct?

Structured data is post-production processed data most of the time. Unstructured data is (potentially) still full of hidden treasure and just as valuable; we just haven’t mined, brined and refined it yet.

Those pushing the Big Data agenda are intrinsically arguing unstructured data is more valuable and they are the ones who can help you unlock its value.

“They” being pushers of analytics tools, MapReduce and NoSQL-databases, plus, increasingly, relational technologies and platforms that bridge into NoSQL and MapReduce.

Surely there must be some way to push this madness back into its box. Some kind of Economist-style Big Mac Index; the alpha nerds' idea of a “fun” exercise to ascribe universal value to currencies based on the gold standard of the US dollar.

Alas, there is no “international data value scale”, but surely there needs to be one. It’s something Gartner periodically speculates about, while "data evangelists" at big systems companies bang on about the “value” of data.

What we really need is a gauge and hierarchy of some kind. God forbid, even a Magic Quadrant would have been a start. (OK, maybe that's going too far.)

It's all the same when you squint at it

The trouble is, any two datasets are essentially heterogeneous. This core truism means that market valuation of data is not always appropriate. Is there some way of assessing the inner worth of data so that we can treat it with appropriate worth and gravitas and potentially even trade with it?

Business analytics company SAS UK & Ireland may have come closest to the mark by commissioning its Data On The Balance Sheet study that was carried out by the Centre for Economics and Business Research (Cebr) from June last year. The report discussed the economic arguments for placing “data as an asset” on the company balance sheet and in the wider financial reporting framework.

“Data does not have a physical presence and therefore may be considered to have an infinite life when compared alongside physical assets. However, data can depreciate quickly if it is readily outdated. Nevertheless, data must be accounted for," says the report.

As SAS reminds us, data is an “intangible private asset” in terms of the modern balance sheet. Or, in longer form, “data as an asset is defined as any resource controlled by a company which generates future economic benefits and has an associated cost or value which can be reliably measured.”

So much for economics, what does the industry think?

“Data itself has some value, but it's like an uncut gem. Should there be a scale? Certainly yes for things that are time critical. We should also be able to measure how trustworthy data is (for things like online reviews). But I've not seen any customer place a physical dollar value on data,” said Matt Smith, chief technology officer for the Northern EMEA region at Software AG.

Somebody who’s actually put a value on data is the UK chief technology officer of Rackspace, Nigel Beighton. He therefore disagrees with Smith.

“If data is lost or stolen it has a direct value in terms of governmental fines and the cost of regenerating it. Secondly, I don’t think any retailer today would NOT know the incremental value of greater data accuracy down to the penny and the pound,” he said.

“When I worked at lastminute.com we knew the exact cash value of search returns for any given type of journey request on the site. The trick is knowing what you want to do with any piece of data, then the value of it (or some notional meaningful significance) is easy to understand. If data can not be ascribed to a specific task or process, then yes, essentially it remains valueless until that point,” added Beighton.

Beighton wouldn’t be pushed on what actual values lastminute.com put on its data.

But the realm we seem to be moving into is rather than a value on the generic zeros and ones, value comes from the information that that zeros and ones based on where they hail from or the subject matter they cover or the insights they reveal.

Adding value to processed data = £££

Matt Ballantine, founder of Stamp London, a digital consultancy, goes along with this analysis. According to Ballantine, the value of the data comes from how it’s used.

“Bauxite, for example, sells at around $45/ton and aluminum for around $2,000 a ton. In the form of a Boeing 787 Dreamliner - OK, I'm taking a few liberties to make the point - it’s about $1.8m per ton. Where's the value? Processing, design and manufacture,” Ballantine says.

He adds a reality check for all those big-data zealots and tech companies who think they’ve got analysis cracked using their clustered servers and machine learning.

“There was lots of PR buzz this week surrounding the fact that Microsoft Cortana accurately predicted the results of 15 out of 16 World Cup matches. That's great. But Paul the Octopus managed 12 out of 14, which statistically, ain't that far off. Perhaps the real value of big data analytics is only marginally better than a psychic octopus," he notes.

Do we need a data value scale then? Or should we rest easy in the knowledge that data needs a purpose before it can be measured to a certain value scale?

Let’s resurrect Paul the Octopus from the dead, point him at a pile of binary numbers and work from there. ®

Internet Security Threat Report 2014

More from The Register

next story
Docker's app containers are coming to Windows Server, says Microsoft
MS chases app deployment speeds already enjoyed by Linux devs
IBM storage revenues sink: 'We are disappointed,' says CEO
Time to put the storage biz up for sale?
'Hmm, why CAN'T I run a water pipe through that rack of media servers?'
Leaving Las Vegas for Armenia kludging and Dubai dune bashing
'Urika': Cray unveils new 1,500-core big data crunching monster
6TB of DRAM, 38TB of SSD flash and 120TB of disk storage
Facebook slurps 'paste sites' for STOLEN passwords, sprinkles on hash and salt
Zuck's ad empire DOESN'T see details in plain text. Phew!
SDI wars: WTF is software defined infrastructure?
This time we play for ALL the marbles
Windows 10: Forget Cloudobile, put Security and Privacy First
But - dammit - It would be insane to say 'don't collect, because NSA'
prev story


Forging a new future with identity relationship management
Learn about ForgeRock's next generation IRM platform and how it is designed to empower CEOS's and enterprises to engage with consumers.
Cloud and hybrid-cloud data protection for VMware
Learn how quick and easy it is to configure backups and perform restores for VMware environments.
Three 1TB solid state scorchers up for grabs
Big SSDs can be expensive but think big and think free because you could be the lucky winner of one of three 1TB Samsung SSD 840 EVO drives that we’re giving away worth over £300 apiece.
Reg Reader Research: SaaS based Email and Office Productivity Tools
Read this Reg reader report which provides advice and guidance for SMBs towards the use of SaaS based email and Office productivity tools.
Security for virtualized datacentres
Legacy security solutions are inefficient due to the architectural differences between physical and virtual environments.