Articles about mapreduce

The Register breaking news

MapR cranks out updated Hadoop data muncher

There are a slew of companies that want to be the Red Hat for open source Hadoop data chewing, making money by beefing it up and selling support for the collection of programs. MapR Technologies, which came out of stealth mode in May, has some proprietary extensions to Hadoop, but all of the goodies being added with MapR …
The Register breaking news

Oracle rolls its own NoSQL and Hadoop

OpenWorld There's no shortage of ego at Oracle, as evidenced by the effusion of confidence behind the company's OpenWorld announcement of the not-so-humbly named Big Data Appliance. And then there were the o'erweening keynote presentations by some of the software giant and systems player's top brass on Monday, which included a montage …
The Register breaking news

Revolution speeds stats on Hadoop clusters

Revolution Analytics, the company that is extending R, the open source statistical programming language, with proprietary extensions, is making available a free set of extensions that allow its R engine to run atop Hadoop clusters. Now statisticians that are familiar with R can do analysis on unstructured data stored in the …
graph up

Teradata update chews up, spits out columns

Data warehousing pioneer Teradata is turning up the dial on its eponymous parallel database to 14, adding in slew of new features that include the ability to process columnar data as well as the more standard-row-based chewing in relational databases. The company has also released an appliance running the hybrid row/column …
The Register breaking news

New customers drive Teradata to new heights

When announcing its financial performance for its most recent quarter, data-warehousing pioneer Teradata proved that it's continuing to do well despite heavy pressure from IBM, Oracle, EMC, and HP – and also, like its rivals, that its still trying to figure out just how large of an opportunity business analytics on unstructured …
The Register breaking news

Google's epic graph cruncher mimicked with open source

Unlike Facebook or Yahoo!, Google is loath to open source its back-end software. For many, this is a sore point, as the search giant has built its famously distributed infrastructure atop countless open source tools fashioned outside the walls of the Googleplex. But Mountain View does give back in less-direct ways. In some …
Cade Metz, 30 Jun 2011
hands waving dollar bills in the air

Cloudera promises 'Google-like' Big Data dream in minutes

Updated Cloudera has delivered a "substantial" update to its open source Hadoop distribution. On Wednesday, Cloudera rolled out Cloudera Enterprise 3.5, two months after shipping a major upgrade to its Hadoop distribution called Cloudera Distribution of Apache Hadoop (CDH) 3.0. Whereas CDH 3.0 expanded Cloudera's Hadoop stack from …
Gavin Clarke, 29 Jun 2011
The Register breaking news

Ex-Google engineer dubs Goofrastructure 'truly obsolete'

A former Google engineer who worked on a library at the heart of "nearly every Java server at Google" has dubbed the company's much-ballyhooed backend software "well and truly obsolete". In a blog post published earlier this week, Dhanji R. Prasanna announced that he had resigned from the company, and though he praised Google …
Cade Metz, 8 Jun 2011
The Register breaking news

App Engine: Google's deepest secrets as a service

Google will never open source its back end. You'll never run the Google File System or Google MapReduce or Google BigTable on your own servers. Except on the rarest of occasions, the company won't even discuss the famously distributed software that underpins its sweeping collection of web services. But if you like, you can …
Cade Metz, 7 Jun 2011
fingers pointing at man

Platform wants to out-map, out-reduce Hadoop

Chewing on big data using the MapReduce protocol, and the open source Hadoop stack that implements it, is all the rage these days. But there is more than one way to stuff an elephant. The Hadoop tool created by Yahoo! (and named after a stuffed elephant) is now managed by the Apache Software Foundation, and it is the tool of …
The Register breaking news

The Brangelina of Big Data: Cassandra mates with Hadoop

Think of it as the Brangelina of Big Data. DataStax, an open-source startup based in Northern California, has combined Cassandra, the distributed database developed at Facebook, with Hadoop, the epic-number-crunching platform based on Google's backend infrastructure. Known as "Brisk", this Big Data mashup was unveiled on …
Cade Metz, 23 Mar 2011

Microsoft answers Google MapReduce with 'Dryad' beta

Microsoft has released a beta version of its Dryad platform, a Windows-happy answer to Google's MapReduce distributed number crunching platform. Yes, there's already a burgeoning open source version of MapReduce: the Apache Hadoop project. And at least one Microsoft unit is both using Hadoop and actually contributing code to …
Cade Metz, 20 Dec 2010
fingers pointing at man

Steve Jobs embraces Google's bastard grid child

Apple has embraced Hadoop, the open source distributed-computing platform based on Google's famously proprietary backend infrastructure. According to a recent Apple job listing entitled "Senior Software Engineer - Hadoop", the company is using or planning to use the entire Hadoop stack, from the HDFS file system and the Hadoop …
Cade Metz, 1 Dec 2010
DVD it in many colours

Yahoo! invites world of boffins into 4,000-node Hadoop cluster

Yahoo! has opened up its Hadoop research cluster to computer science boffins at four additional US universities, including Stanford, the University of Washington, the University of Michigan, and Purdue. The company's M45 cluster — a Hadoop setup spanning 4,000 processors and 1.5-petabyte of disk space inside a data center at …
Cade Metz, 4 Nov 2010

Google Percolator – global search jolt sans MapReduce comedown

Google Caffeine — the revamped search infrastructure recently rolled out across Google's worldwide network of data centers — is based on a distributed data-processing system known as Percolator. Designed by Google and, until now, jealously guarded by Google, Percolator is a platform for "incremental processing" — a means of …
Cade Metz, 24 Sep 2010

Google ops czar condemns multi-core extremists

Google is the modern data poster-child for parallel computing. It's famous for splintering enormous calculations into tiny pieces that can then be processed across an epic network of machines. But when it comes to spreading workloads across multi-core processors, the company has called for a certain amount of restraint. With a …
Cade Metz, 17 Sep 2010

Google search index splits with MapReduce

Exclusive Google Caffeine — the remodeled search infrastructure rolled out across Google's worldwide data center network earlier this year — is not based on MapReduce, the distributed number-crunching platform that famously underpinned the company's previous indexing system. As the likes of Yahoo!, Facebook, and Microsoft work to …
Cade Metz, 9 Sep 2010
arrow pointing up

Google Caffeine jolts worldwide search machine

Google has completed the roll-out of its next-generation search infrastructure, the indexing system overhaul known as "Caffeine." According to the company, the new setup provides "50 per cent fresher" results than its previous system. Mountain View rolled out a public test of the Caffeine index in a single data center last …
Cade Metz, 9 Jun 2010

Create a news alert about mapreduce, or find more stories about mapreduce.

Biting the hand that feeds IT © 1998–2018