Intro to Hadoop and MapReduce

4
Join & Subscribe
Udacity
Free Online Course
English
4 weeks long
selfpaced

Overview

The Apache™ Hadoop® project develops open-source software for reliable, scalable, distributed computing. Learn the fundamental principles behind it, and how you can use its power to make sense of your Big Data.

Syllabus

  • Big Data
    • What is Big Data?,The problems big data creates.,How Apache Hadoop addresses these problems.
  • HDFS and MapReduce
    • Discover how HDFS distributes data over multiple computers.,Learn how MapReduce enables analyzing datasets in parallel across multiple machines.
  • MapReduce code
    • Write your own MapReduce code.
  • MapReduce Design Patterns
    • Use common patterns for MapReduce programs to analyze Udacity forum data.

Taught by

Ian Wrigley and Sarah Sproehnle