Feeds

'Red Hat for stats' goes toe-to-toe with SAS

Analyze this

High performance access to file storage

Revolution Analytics – the company that launched last year to be the "Red Hat for stats", providing an extended version of the open source R programming language and runtime – is going directly after analytics juggernaut SAS Institute with its latest release: R Enterprise 4.2.

With the updated release, Revolution's Enterprise R can read and write data in the proprietary data format used by SAS tools, which has been around for decades and which is called the SAS File Format. Jeff Erhardt, who was a heavy R user when he worked at chip makers Advanced Micro Devices and Spansion and who has been chief operating officer at Revolution since the company came out with its open core version of R last June, says that the file format has allowed SAS to effectively lock customers into using its tools.

By supporting the SAS file format natively, customers can continue to keep their data in the same format as they are accustomed to, but can add new users to do new analytics work using the R Enterprise tool, bypassing the need to buy more SAS licenses.

Customers who move to the R Enterprise tool can also convert the SAS files to a binary "big data" format created by the company called XDF, which is loosely based on NoSQL database principles. This XDF format, which was announced last August with R Enterprise 4.0. That 4.0 code also had better multithreading capabilities than the earlier releases of the R Enterprise engine, allowing it to make better use of processor cores and threads than the open source version of the R engine.

The update last summer also included a clustering feature based on remote procedure call (RPC) to cluster multiple servers together to parallelize the R engine and therefore speed up data crunching for analytics work. "You'll be able to get the same results in SAS," boasts Erhardt, "but now we can do it much faster and at a fraction of the cost."

Erhardt says that on tests that Revolution Analytics has done pitting its souped up R engine (the components of which are not available as open source code) against the SAS tools, running linear regressions, logistic regressions, and cross tabulations, the supported R Enterprise licenses cost no more than half what it would cost to buy SAS and would deliver about twice the speed on a given set of hardware. In some cases, Erhardt claims the performance difference in favor of its R Enterprise engine can be an order of magnitude, and that it runs better on commodity x64 iron.

SAS has been running its code on Hewlett-Packard's Neoview data warehouse appliance, which is based on Intel's Itanium processors and which HP euthanizedtwo weeks ago. HP only had a few dozen customers using this Neoview appliance, which was based on the Integrity systems running the NonStop kernel and parallel database.That product was discontinued a week after HP and Microsoft launched database appliances based on HP's x64-based ProLiant servers and Microsoft's Windows Server 2008 and SQL Server 2008 database.

The core SAS 9.X Foundation tools can be deployed on AIX, HP-UX, Solaris, Linux, Windows, and OpenVMS. Thus far, HP, Microsoft, and SAS have not announced a new appliance based on the EDW or related Windows-based data warehousing appliances, but this could be in the works.

In any event, Revolution Analytics is going after SAS both directly by allowing its R implementation to use SAS files and indirectly through a deal called the SAS to R Challenge. Under this deal, between now and March 31, Revolution R will convert "representative SAS code" to R code free of charge and run it against SAS data to demonstrate "terabyte-class data analyses" to prove that its code is faster at crunching data than the SAS original. You can apply to the challenge here.

The R Enterprise 4.2 update is now fully capable on both Windows and Linux. In the prior releases, the big data XDF format, which is wrapped up in a feature called RevoScale R, was only available on Windows boxes, and the Web services integration to hook data analytics into other applications was only available on Linux. R Enterprise runs on 32-bit and 64-bit Windows XP and 7 desktops as well as servers and on Red Hat Enterprise Linux 5; RHE 6 is not yet supported.

R Enterprise 4.2 is available now, and costs $1,000 per workstation and $25,000 per server (that price is for a two-socket machine using six-core processors). The server version has the Web services and clustering features. Prior to the addition of the XDF big data option, the server version cost $15,000 per machine, so the NoSQL-ish format costs $10,000.

There are over two million R users worldwide, and over 2,500 open source plug ins have been created for the open source R engine by academics, quants, and others. There is a big installed base of users who want more scalability than the open source R can deliver, and now Revolution Analytics needs to get some bigtime partners to help it push its wares against SAS and IBM. ®

High performance access to file storage

More from The Register

next story
Windows 8.1, which you probably haven't upgraded to yet, ALREADY OBSOLETE
Pre-Update versions of new Windows version will no longer support patches
Android engineer: We DIDN'T copy Apple OR follow Samsung's orders
Veep testifies for Samsung during Apple patent trial
OpenSSL Heartbleed: Bloody nose for open-source bleeding hearts
Bloke behind the cockup says not enough people are helping crucial crypto project
Microsoft lobs pre-release Windows Phone 8.1 at devs who dare
App makers can load it before anyone else, but if they do they're stuck with it
Half of Twitter's 'active users' are SILENT STALKERS
Nearly 50% have NEVER tweeted a word
Windows XP still has 27 per cent market share on its deathbed
Windows 7 making some gains on XP Death Day
Internet-of-stuff startup dumps NoSQL for ... SQL?
NoSQL taste great at first but lacks proper nutrients, says startup cloud whiz
US taxman blows Win XP deadline, must now spend millions on custom support
Gov't IT likened to 'a Model T with a lot of things on top of it'
Microsoft TIER SMEAR changes app prices whether devs ask or not
Some go up, some go down, Redmond goes silent
prev story

Whitepapers

Securing web applications made simple and scalable
In this whitepaper learn how automated security testing can provide a simple and scalable way to protect your web applications.
Five 3D headsets to be won!
We were so impressed by the Durovis Dive headset we’ve asked the company to give some away to Reg readers.
HP ArcSight ESM solution helps Finansbank
Based on their experience using HP ArcSight Enterprise Security Manager for IT security operations, Finansbank moved to HP ArcSight ESM for fraud management.
The benefits of software based PBX
Why you should break free from your proprietary PBX and how to leverage your existing server hardware.
Mobile application security study
Download this report to see the alarming realities regarding the sheer number of applications vulnerable to attack, as well as the most common and easily addressable vulnerability errors.