Here is a great overview of Hadoop for the beginner.
Hadoop is most often associated with big data.
A look at the different Hadoop solutions such as Clouder, Hortonworks, MapR and Intel.
Hear Pythian’s CTO Alex Gorbachev discuss these tools and the overall Hadoop ecosystem.
Mike Olson, Chief Strategy Officer and Co-Founder at Cloudera, explains Apache Spark’s origins, its rise in popularity in the open source community, and how Spark is primed to replace MapReduce as the general processing engine in Hadoop.
Here’s a great ten minute video from Hortonworks explaining the purpose of Apache Storm for Hadoop.
Programming thousands of machines is no easy task.
One approach pioneered by Google is known as MapReduce.
MapReduce provides a programming model that simplifies programming thousands of machines by breaking down distributed programs into two steps: map, and reduce.
Udacity has made a large portion of their Introduction to Hadoop and MapReduce available for free on YouTube.
72 videos of complete big data bliss
Rafael Coss, manager Big Data Enablement for IBM, explains Hive defined in 3 minutes.
Rafael Coss, manager Big Data Enablement for IBM, defines Pig in 3 minutes.