h3. Setting up your cluster: * "Running Hadoop On Ubuntu Linux (Single-Node Cluster)":http://www.michael-noll.com/wiki/Running_Hadoop_On_Ubuntu_Linux_(Single-Node_Cluster) and "unning Hadoop On Ubuntu Linux (Multi-Node Cluster).":http://www.michael-noll.com/wiki/Running_Hadoop_On_Ubuntu_Linux_(Multi-Node_Cluster) * "Running Hadoop MapReduce on Amazon EC2 and S3":http://developer.amazonwebservices.com/connect/entry.jspa?externalID=873 * "Hadoop Overview by Doug Cutting":http://video.google.com/videoplay?docid=-4912926263813234341 - the founder of the Hadoop project. 49m * "Cluster Computing and Map|Reduce":http://www.youtube.com/results?search_query=cluster+computing+and+mapreduce ** "Lecture 1: Overview":http://www.youtube.com/watch?v=yjPBkvYh-ss ** "Lecture 2 (technical): Map|Reduce":http://www.youtube.com/watch?v=-vD6PUdf3Js ** "Lecture 3 (technical): GFS (Google File System)":http://www.youtube.com/watch?v=5Eib_H_zCEY ** "Lecture 4 (theoretical): Canopy Clustering":http://www.youtube.com/watch?v=1ZDybXl212Q ** "Lecture 5 (theoretical): Breadth-First Search":http://www.youtube.com/watch?v=BT-piFBP4fE * http://www.cloudera.com/hadoop-training ** "Thinking at Scale":http://www.cloudera.com/hadoop-training-thinking-at-scale ** "Mapreduce and HDFS":http://www.cloudera.com/hadoop-training-mapreduce-hdfs ** "A Tour of the Hadoop Ecosystem":http://www.cloudera.com/hadoop-training-ecosystem-tour ** "Programming with Hadoop":http://www.cloudera.com/hadoop-training-programming-with-hadoop ** "Hadoop and Hive: introduction":http://www.cloudera.com/hadoop-training-hive-introduction ** "Hadoop and Hive: tutorial":http://www.cloudera.com/hadoop-training-hive-tutorial ** "Hadoop and Pig: Introduction":http://www.cloudera.com/hadoop-training-pig-introduction ** "Hadoop and Pig: Tutorial":http://www.cloudera.com/hadoop-training-pig-tutorial ** "Mapreduce Algorithms":http://www.cloudera.com/hadoop-training-mapreduce-algorithms ** "Exercise: Getting started with Hadoop":http://www.cloudera.com/hadoop-training-exercise-getting-started-with-hadoop ** "Exercise: Writing mapreduce programs":http://www.cloudera.com/hadoop-training-exercise-writing-mapreduce-programs --------------------------------------------------------------------------- * "Hadoop Wiki: Hadoop Streaming":http://wiki.apache.org/hadoop/HadoopStreaming * "Hadoop Docs: Hadoop Streaming":http://hadoop.apache.org/common/docs/current/streaming.html