The Hadoop Ecosystem: HDFS, YARN, Hive, Pig, HBase and growing

Hadoop is the leading open-source software framework for scalable, reliable, distributed computing. With the world producing data in the zettabyte range, there is a growing need for cheap, scalable, reliable and fast computing to process and make sense ...

07 December 2015 Analytics, Cassandra, Cloud Computing, Cloudera, Couchbase, Google, Hadoop News, Hadoop Tutorials, HBase, Hive, Impala, MapReduce News, MongoDB News, NoSQL News, Pig, Predictive Analytics, Splunk

Snappy compression with Pig and native MapReduce

This assumes you have installed Hadoop on your cluster; if not, please follow the instructions at http://code.google.com/p/hadoop-snappy/. This is the machine config of my cluster nodes, though the steps that follow should also work with your own installation and machine configs: pkommireddi@pkommireddi-wsl:/tools/hadoop/pig-0.9.1/lib$ uname -a Linux pkommireddi-wsl 2.6.32-37-generic #81-Ubuntu SMP Fri ...

03 May 2014 Analytics, Big Data, Cloudera, Hadoop News, Hadoop Tutorials, HBase, Hive, Impala, MapReduce News, MongoDB News, NoSQL News, Pig, Predictive Analytics

Apache Tez 0.3 Released

The Apache Tez community has voted to release version 0.3 of the software. Apache™ Tez is a replacement for MapReduce that provides a powerful framework for executing a complex topology of tasks. Tez 0.3.0 is an important release towards making the software ...

13 March 2014 Analytics, Big Data, Cassandra, Hadoop News, Hadoop Tutorials, HBase, Hive, Impala, MapReduce News, MongoDB News, NoSQL News, Pig, Predictive Analytics, Splunk

Introduction to Apache Hive and Pig

Apache Hive is a framework that sits on top of Hadoop for running ad-hoc queries on data stored in Hadoop. Hive supports HiveQL, which is similar to SQL but does not support the full set of SQL constructs. Hive converts the HiveQL query into ...

29 January 2014 Big Data, Cloudera, Couchbase, Hadoop News, HBase, Hive, Impala, MapReduce News, MongoDB News, NoSQL News, Pig

When to use Pig Latin versus Hive SQL?

Once your big data is loaded into Hadoop, what’s the best way to use that data? You’ll need some way to filter and aggregate the data, and then apply the results to something useful. Collecting terabytes and petabytes of web ...

25 January 2014 Big Data, Hadoop News, Hadoop Tutorials, HBase, Hive, MapReduce News, NoSQL News, Pig

Free Cloudera Impala Book

Get the Cloudera Impala book for free, in PDF format, from the Cloudera website, in association with the Strata Conference and Hadoop World. See the link below for the book info from the publisher as well as the link to download ...

21 January 2014 Big Data, Cassandra, Cloudera, Hadoop News, Hadoop Tutorials, HBase, Hive, Impala, MapReduce News, MongoDB News, NoSQL News, Pig, Predictive Analytics