NoSQL: Running Ruby Map/Reduce with Apache Hadoop



Here I demonstrate, with repeatable steps, how to fire up a Hadoop cluster on Amazon EC2, load data onto HDFS (the Hadoop Distributed File System), write map and reduce scripts in Ruby, and use them to run a map-reduce job on your Hadoop cluster. You will not need to ssh into the cluster, as all tasks are run from your local machine.
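To give a feel for what a Ruby map-reduce script can look like, here is a minimal word-count sketch. It is not taken from the linked article: it assumes the job is run through Hadoop Streaming, where the mapper and reducer read lines on standard input and write tab-separated key/value pairs to standard output. The names `map_stream` and `reduce_stream`, and the single-file `map`/`reduce` dispatch, are illustrative choices only.

```ruby
#!/usr/bin/env ruby
# Illustrative word-count mapper and reducer in the Hadoop Streaming style.
# Assumption: input and output are plain text lines, with key and value
# separated by a tab. These are not the scripts from the linked article.

# Mapper: emit "word<TAB>1" for every word found on every input line.
def map_stream(input, output)
  input.each_line do |line|
    line.downcase.scan(/[a-z0-9']+/).each do |word|
      output.puts "#{word}\t1"
    end
  end
end

# Reducer: the input is expected to be sorted by key, so all lines for
# the same word are adjacent and can be summed in a single pass.
def reduce_stream(input, output)
  current = nil
  total = 0
  input.each_line do |line|
    word, count = line.chomp.split("\t", 2)
    next if word.nil? || count.nil? # skip blank or malformed lines
    if word != current
      output.puts "#{current}\t#{total}" unless current.nil?
      current = word
      total = 0
    end
    total += count.to_i
  end
  output.puts "#{current}\t#{total}" unless current.nil?
end

# Pick the role from the first command-line argument ("map" or "reduce").
# With no argument the file only defines the two methods.
case ARGV[0]
when "map"    then map_stream($stdin, $stdout)
when "reduce" then reduce_stream($stdin, $stdout)
end
```

Saved as, say, `wordcount.rb` (a hypothetical filename), the script can be checked locally before going near a cluster with `cat input.txt | ruby wordcount.rb map | sort | ruby wordcount.rb reduce`; the `sort` stands in for the shuffle step that Hadoop performs between the map and reduce phases.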

tags: hadoop, mapreduce

via NoSQL databases