Distributed data processing with Hadoop, Part 2: Going further

Posted by Scott_Ruecker on Jun 4, 2010 4:34 AM EDT
IBM/developerWorks; By M. Tim Jones
Mail this story
Print this story

The first article in this series showed how to use Hadoop in a single-node cluster. This article continues with a more advanced setup that uses multiple nodes for parallel processing. It demonstrates the various node types required for multinode clusters and explores MapReduce functionality in a parallel environment. This article also digs into the management aspects of Hadoop -- both command line and Web based.

Full Story

  Nav
» Read more about: Story Type: Tutorial; Groups: IBM, Linux

« Return to the newswire homepage

This topic does not have any threads posted yet!

You cannot post until you login.