Best resources to learn and understand Hadoop

Here are some best resource to learn and understand Hadoop. Tutorials Free videos - MapR Academia Udacity course Hortonworks Sandbox Hadoop Ecosystem Running Hadoop Map-Reduce Hadoop Screencasts Reza Shiftehfar's blog I Reza Shiftehfar's blog II Reza Shiftehfar's blog III Reza Shiftehfar's blog IV Reza Shiftehfar's blog V Reza Shiftehfar's blog VI Reza Shiftehfar's blog ...

How to Run a Simple Apache Spark App in CDH 5

Getting started with Spark (now shipping inside CDH 5) is easy using this simple example. Apache Spark is a general-purpose, cluster computing framework that, like MapReduce in Apache Hadoop, offers powerful abstractions for processing large datasets. For various reasons pertaining to ...

Using Scala To Work With Hadoop

Cloudera has a great toolkit to work with Hadoop.  Specifically it is focused on building distributed systems and services on top of the Hadoop Ecosystem. http://cloudera.github.io/cdk/docs/0.2.0/cdk-data/guide.html And the examples are in Scala!!!! Here is how you you work with generic stuff on the ...
1 7 8 9 10 11 13