Teradata pumps data warehouses with six-core Xeons

Flashy BI box cures performance anxiety

  • alert
  • submit to reddit

Security for virtualized datacentres

Teradata might be the pioneer of data warehousing on cheap x64 server clusters and the use of appliance packaging to tune machines and their software to attack specific workloads, but Oracle and IBM want to eat Teradata's lunch. And its breakfast and dinner, too. That means Teradata has to keep upgrading its hardware and database software and partnering to bring more functionality onto its data warehouse and analytics appliances, making them more useful to the customers who shell out big bucks for them.

At the Partners user group conference in San Diego today, Teradata launched a completely refreshed lineup of entry, midrange, and high-performance data warehousing and analytics appliances, and also tossed in a new flash-heavy extreme performance machine to take on Oracle's new Exadata X2-8 appliances and IBM's Smart Analytics System (SAS) appliances as well as the Netezza appliances that will soon become part of the Big Blue product catalog unless someone swoops in and tries to steal away Netezza. (This means you, NEC or Dell.) EMC is also putting the heat on Teradata with its own Greenplum Data Computing Appliances, and has much deeper pockets than a free-standing Greenplum could ever hope to have.

In a way, the fact that Oracle, IBM, and EMC are stepping up their game into BI and analytics is good for Teradata because these companies have much bigger marketing budgets than Teradata and any expansion of awareness is good for all players. With Teradata boasting a market capitalization of $6.4bn and an acquisition cost of at least $10bn, Teradata is too expensive for Oracle, IBM, or EMC to acquire without doing a merger or taking out some big loans. And that means Teradata, which had $1.82bn in sales and $288m in net income in the trailing four quarters, is going to have to compete against bigger firms that may not share its deep expertise but which have plenty of data center customers who trust their iron and software just the same.

First and foremost, Teradata has to stay in the performance game, and that means getting current on new Xeon-based server iron a bit faster than it has done thus far as well as giving up and playing the TPC-H benchmarketing game to prove the bang and the bang for the buck that its x64 clusters running the eponymous database software can deliver for customers.

Today, Teradata is delivering new server nodes for its clusters based on the "Westmere-EP" Xeon 5600 processors, which are the six-shooter x64 processors that Intel debuted way back in March of this year. The new six-core chips offer roughly 40 to 50 per cent more raw performance than the quad-core Xeon 5500 processors used across the Teradata appliance lineup. And Teradata says that with the new Teradata 13.10 clustered database, also announced today alongside the new Westmere-EP iron, has been tuned not only for the Xeon 5600s but also to take full advantage of the HyperThreading simultaneous multithreading (SMT) in the chips and therefore each two-socket node in the Teradata clusters makes full use of its 24 threads.

The performance gains depend on what Teradata appliance customers buy, and in some cases, the addition of flash disks as well as new data compression techniques, combined with the CPU performance bump, are yielding big performance increases for Teradata machines.

As El Reg previously divulged, Teradata resells tweaked versions of Dell PowerEdge servers as well as LSI and EMC storage in its various appliances.

At the bottom of the refreshed Teradata lineup is the Data Mart Appliance 560, a single-rack appliance based on two-socket server nodes using Intel's six-core 2.93 GHz Xeon 5670 processors and supporting up to 48 GB of DDR3 main memory. (The spec sheet says DDR2, but that is not possible.) The DMA 560 has four hot-swappable 10K RPM SAS disks with 600 GB of capacity, and has various network adapters to link it to the outside world as well as ESCON and FICON adapters for linking the appliance and its database to IBM mainframes.

Up to three 24-drive storage trays can be added to the box, using either 300 GB or 600 GB disks in 2.5-inch form factors, providing either a 5.8 TB or 11.7 TB user data capacity for customers. (Teradata recommends RAID 1 mirroring, but does not require it, RAID 0 striping and RAID 5 data protection are also supported in the disk controllers embedded in the servers.) The cluster runs Novell's SUSE Linux Enterprise Server 10, the Teradata 13.10 database, and has a management console that runs on Microsoft's Windows Server 2003. This DMA 560 machine is being positioned not only as an entry data mart box, but also as a BI application test and development machine.

The flagship product in the Teradata lineup is the Active Enterprise Data Warehouse 5650, which is a multi-rack solution that scales up to 86 PB of user data capacity in the warehouses. Teradata says that by upgrading to the new processor nodes and the Teradata 13.10 database, customers using the current 5600H nodes will see around a 43 per cent performance boost per node. Teradata is putting 300 GB and 450 GB Fibre Channel disks spinning at 15K RPM in the Storage 6844 arrays it peddles for the data warehouse racks.

The EDW 5650 warehouse comes in two flavors. If customers want to mix and match the new nodes with prior nodes, they have to buy the 5650C server nodes, which only have one six-core processor and 48 GB per node plus the BYNET V4 proprietary point-to-point, fault-tolerant interconnect that Teradata uses in many of its warehouses and support for up to 11.2 TB of capacity per node without compression (13.8 TB with 30 per cent compression) with 100 drives per node. The 5650H nodes have two Xeon X5670 processors and up to 96 GB of memory and 188 disk drives per node.

That works out to 21 TB of user capacity per node with 450 GB drives and 26 TB with typical data compression rates. In a standard configuration, the EDW 5650 data warehouses can scale to 1,024 nodes, but if you need more crunching, Teradata can push it up to 4,096 nodes. That 86 PB maximum is for 4,096 nodes without any compression.

The EDW 5650 can support SUSE Linux Enterprise Server 10 or Windows Server 2003; both have to be at 64-bit version levels. No word on when SLES 11 and Windows Server 2008 R2 will be supported.

Providing a secure and efficient Helpdesk

Next page: Extreme data

More from The Register

next story
IBM storage revenues sink: 'We are disappointed,' says CEO
Time to put the storage biz up for sale?
'Hmm, why CAN'T I run a water pipe through that rack of media servers?'
Leaving Las Vegas for Armenia kludging and Dubai dune bashing
Facebook slurps 'paste sites' for STOLEN passwords, sprinkles on hash and salt
Zuck's ad empire DOESN'T see details in plain text. Phew!
Windows 10: Forget Cloudobile, put Security and Privacy First
But - dammit - It would be insane to say 'don't collect, because NSA'
CAGE MATCH: Microsoft, Dell open co-located bit barns in Oz
Whole new species of XaaS spawning in the antipodes
VMware's tool to harden virtual networks: a spreadsheet
NSX security guide lands in intriguing format
prev story


Forging a new future with identity relationship management
Learn about ForgeRock's next generation IRM platform and how it is designed to empower CEOS's and enterprises to engage with consumers.
Why and how to choose the right cloud vendor
The benefits of cloud-based storage in your processes. Eliminate onsite, disk-based backup and archiving in favor of cloud-based data protection.
Three 1TB solid state scorchers up for grabs
Big SSDs can be expensive but think big and think free because you could be the lucky winner of one of three 1TB Samsung SSD 840 EVO drives that we’re giving away worth over £300 apiece.
Reg Reader Research: SaaS based Email and Office Productivity Tools
Read this Reg reader report which provides advice and guidance for SMBs towards the use of SaaS based email and Office productivity tools.
Security for virtualized datacentres
Legacy security solutions are inefficient due to the architectural differences between physical and virtual environments.