Feeds

EMC picks Greenplum Data Computing appliance

Sweet

Internet Security Threat Report 2014

EMC has launched its Greenplum Data Computing Appliance (DCA), promising twice the performance of Oracle's Exadata system.

The DCA is an online analytical processing (OLAP) engine for looking at business transaction data, mining it, and getting information out of it that can better describe customer behaviour, to help mobile phone suppliers reduce customer churn, losing them to competing suppliers, for example.

It uses Greenplum's massively parallel processing, shared-nothing architecture. A single rack has 16 segment servers inside, each using two Intel Xeon E5670, 6-core, 2.93GHz processors, making 192 Intel cores in total. The rack also has two redundant servers for co-ordination operations; they don't do data mining work.

There can be up to 24 racks, totalling 4,608 data mining cores. A DCA rack has 36TB of usable uncompressed disk space, using 600GB drives. EMC says there is 144TB with compression. The amount of compression varies with the type of data and EMC is using a generalised 4X compression factor.

The DCA is an integrated IT stack system, including database, compute, storage and network resources in a single product. It is available in half-rack, full-rack, and multiple-rack appliance configurations, and scales up to 3.46PB with compression.

Describing it, and not being ironic, Greenplum founder and EMC data products division CTO Luke Lonnergan said: "We don’t need anything esoteric."

He says we are entering an era of big data and massively parallel systems are needed to ingest it, mine (digest) it, and spit out results fast.

Customers can integrate it with EMC's Data Domain deduplicated backup, recovery and replication technologies, for data protection. Replication can also be provided by EMC's RecoverPoint product for disaster recovery.

This hardware runs v4.0 of the Greenplum database and EMC promises "the fastest data loading and best price/performance in the data warehousing industry". One DCA rack can ingest data at 10TB/hour, twice as fast, EMC says, as Oracle's Exadata system, and five times faster than Netezza and Teradata products. Performance scales linearly and a 24-rack system would theoretically ingest data at 240TB/hour.

Lonnergan said: "The strength of the appliance model is that it lands on the floor tested and configured at the point of manufacture, the weakness has been that many of these products are infrastructure islands.

"The DCA can be deployed and operated as a stand-alone Appliance, turn it on and data goes in while decisions come out, but you can connect it to an EMC array if you choose, replicate it with RecoverPoint and back it up to Data Domain.

"You’re now storing the data on your production arrays, getting long distance continuous remote replication with bookmarking and backing it up to deduplication storage with built-in integrity checking and bandwidth-optimised replication… it’s no longer an island in your data centre. It’s part of the infrastructure."

The Greenplum 4.0 database is shipping and available separately as software-only, to be run on X86 hardware, such as, EMC suggests, the Virtual Computing Environment (VCE) coalition Vblock infrastructure packages. The DCA product is available immediately. Pricing was not revealed. ®

Top 5 reasons to deploy VMware with Tegile

More from The Register

next story
Docker's app containers are coming to Windows Server, says Microsoft
MS chases app deployment speeds already enjoyed by Linux devs
Intel, Cisco and co reveal PLANS to keep tabs on WORLD'S MACHINES
Connecting everything to everything... Er, good idea?
SDI wars: WTF is software defined infrastructure?
This time we play for ALL the marbles
'Urika': Cray unveils new 1,500-core big data crunching monster
6TB of DRAM, 38TB of SSD flash and 120TB of disk storage
Facebook slurps 'paste sites' for STOLEN passwords, sprinkles on hash and salt
Zuck's ad empire DOESN'T see details in plain text. Phew!
'Hmm, why CAN'T I run a water pipe through that rack of media servers?'
Leaving Las Vegas for Armenia kludging and Dubai dune bashing
Windows 10: Forget Cloudobile, put Security and Privacy First
But - dammit - It would be insane to say 'don't collect, because NSA'
Oracle hires former SAP exec for cloudy push
'We know Larry said cloud was gibberish, and insane, and idiotic, but...'
prev story

Whitepapers

Forging a new future with identity relationship management
Learn about ForgeRock's next generation IRM platform and how it is designed to empower CEOS's and enterprises to engage with consumers.
Win a year’s supply of chocolate
There is no techie angle to this competition so we're not going to pretend there is, but everyone loves chocolate so who cares.
Why cloud backup?
Combining the latest advancements in disk-based backup with secure, integrated, cloud technologies offer organizations fast and assured recovery of their critical enterprise data.
High Performance for All
While HPC is not new, it has traditionally been seen as a specialist area – is it now geared up to meet more mainstream requirements?
Saudi Petroleum chooses Tegile storage solution
A storage solution that addresses company growth and performance for business-critical applications of caseware archive and search along with other key operational systems.