Best resources to learn and understand Hadoop
Here are some of the best resources to learn and understand Hadoop.
Tutorials
Free videos - MapR Academia
Udacity course
Hortonworks Sandbox
Hadoop Ecosystem
Running Hadoop Map-Reduce
Hadoop Screencasts
Reza Shiftehfar's blog I
Reza Shiftehfar's blog II
Reza Shiftehfar's blog III
Reza Shiftehfar's blog IV
Reza Shiftehfar's blog V
Reza Shiftehfar's blog VI
Reza Shiftehfar's blog ...
5 Areas Where Big Data is Doing Well
It's not often that an IT innovation is as disruptive as Big Data. The technology has brought a paradigm shift in how we approach, store, use and monetize data assets. Every path-breaking innovation has its share of hype, expectations and "Big ...
5 Big Data Use Cases To Watch
We hear a lot about big data's ability to deliver usable insights -- but what does this mean exactly?
It's often unclear how enterprises are using big-data technologies beyond proof-of-concept projects. Some of this might be a byproduct of corporate secrecy. ...
5 technologies that will help big data cross the chasm
We’re on the cusp of a real turning point for big data. Its applications are becoming clearer, its tools are getting easier and its architectures are maturing in a hurry. It’s no longer just about log files, clickstreams and tweets. ...
Snappy compression with Pig and native MapReduce
This assumes you have installed Hadoop on your cluster. If not, please follow http://code.google.com/p/hadoop-snappy/
This is the machine config of my cluster nodes, though the steps below should also work with your own installation and machine configs:
pkommireddi@pkommireddi-wsl:/tools/hadoop/pig-0.9.1/lib$ uname -a
Linux pkommireddi-wsl 2.6.32-37-generic #81-Ubuntu SMP Fri ...
Cloudera, MongoDB partner to mash up NoSQL, Hadoop
Hadoop specialist Cloudera announced a strategic partnership with MongoDB this week that will allow Cloudera customers to store Hadoop data in their NoSQL MongoDB databases. The move is a huge win for MongoDB, which is quickly emerging as one of ...
Will Big Data Analytics Replace BI?
John Leonard of Computing.co.uk recently asked, “In 10 years’ time will big data analytics have replaced traditional relational business intelligence systems? That was one of the questions that Computing asked UK IT decision-makers in the quantitative survey that formed part of our recent big data ...
Bringing the Best of Apache Hive 0.13 to CDH Users
More than 300 bug fixes and stable features in Apache Hive 0.13 have already been backported into CDH 5.0.0.
Last week, the Hive community voted to release Hive 0.13. We’re excited about the continued efforts and progress in the project and ...
Apache Ambari 1.5.1 is Released!
The Apache Ambari community proudly released version 1.5.1. This is the result of constant, concerted collaboration among the Ambari project’s many members. This release represents the work of over 30 individuals over 5 months and, combined with the Ambari 1.5.0 release, ...
Using Apache Hadoop and Impala together with MySQL for data analysis
Apache Hadoop is commonly used for data analysis. It loads data quickly and scales well. In a previous post I showed how to integrate MySQL with Hadoop. In this post I will show how to export a table from ...






