Hadoop: The Definitive Guide, Fourth Edition: http://shop.oreilly.com/product/0636920033448.do
Code and Data: http://hadoopbook.com/code.html
Download ncdc weather dataset: https://gist.github.com/rehevkor5/2e407950ca687b36fc54
Building and Running:
Reading Notes
- Chapter 1: Meet Hadoop
- Chapter 3: HDFS
- Chapter 10: Setting Up a Hadoop Cluster
- Chapter 11: Administering Hadoop
Skipped sections
- Chapter 2 MapReduce
- Chapter 3 HDFS
- HDFS Federation
- HDFS High Availability
- The Java Interface
- Data Flow, Coherency Model