Best resources to learn and understand Hadoop
Here are some of the best resources to learn and understand Hadoop.
Tutorials
Free videos - MapR Academia
Udacity course
Hortonworks Sandbox
Hadoop Ecosystem
Running Hadoop Map-Reduce
Hadoop Screencasts
Reza Shiftehfar's blog I
Reza Shiftehfar's blog II
Reza Shiftehfar's blog III
Reza Shiftehfar's blog IV
Reza Shiftehfar's blog V
Reza Shiftehfar's blog VI
Reza Shiftehfar's blog ...
Splunk Extends Analysis To NoSQL Databases
Splunk Enterprise gets multisite support, improved visualization. Hunk adds analysis for Accumulo, Cassandra, MongoDB, and Neo4j.
Splunk keeps rolling along, well ahead of an open-source threat that some thought might flatten it. The company last week sprinted ahead yet again, introducing ...
Cascading 3.0 Future-Proofs Data-Centric Application Development on Hadoop
Concurrent, Inc., the company behind Cascading, an open source application development framework for building data applications on Hadoop, has announced Cascading 3.0, which CEO Gary Nakamura says will give enterprises the flexibility to build their data-oriented applications on Hadoop once, and ...
5 Areas Where Big Data is Doing Well
It's not often that an IT innovation is as disruptive as Big-Data. The technology has brought a paradigm shift in our approach, storage, usage and monetization of data assets. Every path-breaking innovation has its share of hype, expectations and "Big ...
Splunk’s Hunk 6.1 enables faster analytics for Hadoop and NoSQL Data Stores
Splunk Inc. announced version 6.1 of Hunk: Splunk Analytics for Hadoop and NoSQL Data Stores. Hunk 6.1 makes it even faster and easier to turn raw, unstructured data in Hadoop and NoSQL data stores into business insights. Accelerated reports in ...
5 Big Data Use Cases To Watch
We hear a lot about big data's ability to deliver usable insights -- but what does this mean exactly?
It's often unclear how enterprises are using big-data technologies beyond proof-of-concept projects. Some of this might be a byproduct of corporate secrecy. ...
5 technologies that will help big data cross the chasm
We’re on the cusp of a real turning point for big data. Its applications are becoming clearer, its tools are getting easier and its architectures are maturing in a hurry. It’s no longer just about log files, clickstreams and tweets. ...
5 tips to get started with big data
Everyone seems to be talking about "big data" these days. Do you wonder what you’re missing out on? Let’s take a look at how you can get started with Big Data.
Learn what it is, and what it is not. While ...
6 big data trends in 2014
Data are being generated by every device imaginable. Big data are arriving from multiple sources at an alarming velocity, volume, variety and veracity.
It is estimated that 2.5 quintillion bytes of data are created each day—so much that 90 percent of ...
Snappy compression with Pig and native MapReduce
This assumes you have installed Hadoop on your cluster; if not, please follow http://code.google.com/p/hadoop-snappy/
This is the machine configuration of my cluster nodes, though the steps that follow should also work with your own installation and machine configuration.
pkommireddi@pkommireddi-wsl:/tools/hadoop/pig-0.9.1/lib$ uname -a
Linux pkommireddi-wsl 2.6.32-37-generic #81-Ubuntu SMP Fri ...






