The Register® — Biting the hand that feeds IT

Feeds

Microsoft opens trenchcoat, reveals 'in-memory' Big Data column

Just the (100 billion) facts, man

Customer Success Testimonial: Recovery is Everything

If there’s one thing scarier than the big data tsunami, tech vendors tell us, then it’s tech vendors getting left out of the big-data conversation.

Microsoft is the latest software maker to crowbar itself into the debate on big data, this time claiming a place at the table on in-memory databases.

According to a blog here and flagged here, in-memory database technologies are reaching a tipping point and will become common during the next 5-10 years.

And guess what? Microsoft is poised to exploit this. SQL Server Technical Fellow Dave Campbell writes:

"Microsoft has been investing in, and shipping, in-memory database technologies for some time."

Campbell identified “in-memory” in the Microsoft world as a column-based storage engine in Word and Excel. This has now shipped with the newly released SQL Server 2012 as the xVelocity in-memory analytics engine that's part of SQL Server Analysis Services.

Campbell claimed a 200 times performance gain for one SQL Server 2012 customer “through the use of this new in-memory optimized columnstore index type.”

Microsoft’s man promised more from Redmond’s labs.

“Microsoft is also investing in other in-memory database technologies which will ship as the technology and opportunities mature,” he said. He didn’t reveal details but said that this includes an in-memory database solution in the company’s lab “and building our real-world scenarios to demonstrate the potential.”

“One such scenario, based upon one of Microsoft’s online services businesses, contains a fact table of 100 billion rows. In this scenario we can perform three calculations per fact – 300 billion calculations in total, with a query response time of 1/3 of a second. There are no user defined aggregations in this implementation; we actually scan over the compressed column store in real time,” he said.

When it comes to in-memory, database giant Oracle at least has some legitimacy. Years before Big Data was a blob on the horizon, Larry Ellison’s database beast swallowed tiny TimesTen in 2005. TimesTen uses replication and access techniques to keep data in the memory of a system rather than write to disk cache.

Last week, Ellison’s great white whale SAP revived its own HANA in-memory platform, announcing a $337m data base adoption program and $155m SAP HANA Real-Time Fund for startups and entrepreneurs to develop real-time apps.

SAP Ventures, the ERP giant’s venture-capital wing, has also joined Toshiba, Juniper Network and others putting $50m into flash array start-up Violin Memory.

This might account for Microsoft's claims.

While Oracle might be the in-memory leader, though, it isn't above a little shameless bandwagon-jumping when it needs to. The database giant in January 2011 was laying some tenuous claims of its own this time on NoSQL.

"Is Berkeley DB a 'NoSQL' solution today?" Oracle asked here of the embedded database it bought in 2006.

"Nope. Could Berkeley DB grow into a NoSQL solution? Absolutely" - given the right changes. ®

Customer Success Testimonial: Recovery is Everything

Anonymous Coward

Quite recently...

... someone asked for a "sqlite-type nosql" solution. Because, you know, databases are hot shit. Or because the data was all about nested objects and such. But for how much data? "50-100 objects". Ah. Use the built-in serialisation and drop the result in a file, there's a good developer.

We have computing power that approaches "free", there's yet another database hype going on, and this time we have so much data it's not funny any longer. What it all means? Most of it is crud; how much truly interesting stuff is there to be found on facebook, google+, and so on? Not that much. But we like to believe that with enough massaging the lead turns into gold.

How many users of various "alternative databases" actually need that? Quite a lot fewer than're actually trying to use the things. Nothing wrong with using whatever tool works and doesn't have nasty side effects like subtly corrupting your data, but that doesn't automatically make for a "best" or even "good enough" solution. Sometimes it's good to remember that there's still problems around best solved by using a key/value store without any additional "NoSQL" or "XML" or other buzzword-du-jour layers. What was it again, the master shows in the choosing?

4
0
Anonymous Coward

Re: Quite recently...

There are no alternative databases here... just researching/exploiting (decades later) alternative storage and indexing structures. Think of it: the columnar storage in SQL Server 2012 is built on top of regular row-oriented structures but exploits column compression along with smart ordering. For this new thing just try to be a bit more creative: in memory databases doesn't necessarily have drop any ACID property, nor they have to use b-trees...

Typical NoSQL thingamajigs simply want to store-fast-think-later. It's not that by storing things by keys/values make the conceptual data model disappear. Sooner of later people will have to analyze the stored data and being so close and tied to a (simple) storage mechanism immediately fires back. They'll have to understand the real data model behind property bags and they'll need a powerful language to be able to make requests. Maybe a language that combines the power of algebra and calculus. So they'll end up building new MaybeSQL fads until they realize that it would have been better to start off with a proper RDBMS, and learn about Data Dependent Routing techniques, instead of using map-reduce to do basic things like hash-joins.

0
0

More from The Register

Nuke plants to rely on PDP-11 code UNTIL 2050!
Programmers and their walking sticks converge in Canada
Bjarne Again: Hallelujah for C++
Plus: Now officially OK to admit you never used STL algorithms
Interwebs taunt Sir Jony over Apple eye candy makeover
Hey Ive, Ive... add more unicorns, willya?
SCO vs. IBM battle resumes over ownership of Unix
Zombie lawsuit back and wants to suck the brains out of Linux
Red Hat to ditch MySQL for MariaDB in RHEL 7
So long, Oracle! Don't let the door hit you on the way out
Shy? Socially inadequate? Fiddling with your phone could help
App 'tells the brutal truth' about social inadequates' chatup lines
Java EE 7 melds HTML5 with enterprise apps
New release arrives with GlassFish, NetBeans support
 breaking news
'Office Facebook' firm Tibbr wants you to PAY for mobe-meetings app
Great idea. Punters won't cough for it though
 breaking news
The only Waze is Google: Ad giant tipped to gobble map app 'for $1.3bn'
Pac-Man-satnav-ish upstart in bidding war with Apple, Facebook
 breaking news
PM Cameron calls for modern, programmable computers! (We think)
IT education musings to G8 chiefs to mystify IT industry