CSci 4409. Programming for Parallel Architecture: Resources
Articles and other reading resources
Required reading is marked in light green.
Programming resources and software downloads
and HDFS lecture by Aaron Kimball, the first engineer
with Hadoop by Aaron Kimball (key Hadoop terminology and
approaches) Terminology you need to know: job, task,
JobTracker, TaskTracker, NameNode, mapper, reducer, InputSplit,
RecordReader, RecordWriter, Partitioner. You also need
to know main Java interfaces used by Hadoop. Also, you need to
know what Hadoop streaming refers to.
- A wikipedia article
on Hadoop: quite detailed and helpful.
package with downloadable examples (the classical Hadoop
wordcount example done in Clojure).
- Hadoop + Clojure lecture
by Stuart Sierra (work done with Tim Dysinger)
- November 9:
Training: MapReduce Algorithms by
- Apache Hadoop tutorial (Java), including two versions of the wordcount example.
The views and opinions expressed in this page are strictly those of the page author. The contents of this page have not been reviewed or approved by the University of Minnesota.