Distributed Systems Lab.
Main Contact: Assistant Prof. Pietro Michiardi
Institut Eurecom,
2229, route des Cretes,
BP 193, F-06560 Sophia-Antipolis (FRANCE)
Tel: +33.(0)4.93.00.81.45
FAX: +33.(0)4.93.00.82.00
email: Pietro dot Michiardi at eurecom.fr
Summer School on Cloud Computing: Challenges and opportunities
Franco-German Summer School 2011 for Junior Scientists
17 – 22.7.2011
Lecture Title: Hadoop MapReduce in Practice
Speaker: Prof. Pietro Michiardi
Contributors: Dr. Antonio Barbuzzi, Mario Pastorelli
This page supports the Lecture on practical exercises on Hadoop
MapReduce. In the following, you will find links to the lecture
itself, a link to a broader tutorial covering also the theory
behind MapReduce, and some basic software requirements to work
with the exercises.
The tutorial and this lecture have been profoundly influenced by
two "must-have" books on Hadoop and MapReduce:
- Tom White, Hadoop, The Definitive Guide,
Y!Press, O'Reilly
- Jimmy Lin, Chris Dyer, Data-Intensive Text
Processing with MapReduce, Morgan Claypool ed.
Please note that the structure of the following slides follows the
vairious chapters of the above books. Moreover, several other
presentations and tutorial have been used to complement the book
material.
Use the following links to download the lecture slides and the
full tutorial:
Software setup:
In order to complete all the excercises for the course, you need to download and install java sdk and eclipse. You need also to download Hadoop 0.20.203.0.
Links:
- Java download page:
[Link]
- Hadoop download page (hadoop-0.20.203.0):
[Link]
- Eclipse download page:
[Link]
Additional documentation for the laboratory:
The following [Link] contains a
"cheat-sheet" to help students with common commands on Hadoop, and
with additional information on the Summer School Cluster we prepared.