5 tips to get started with big data
Everyone seems to be talking about "big data" these days. Do you wonder what you’re missing out on? Let’s take a look at how you can get started with Big Data.
Learn what it is, and what it is not. While ...
6 big data trends in 2014
Data are being generated by every device imaginable. Big data are arriving from multiple sources at an alarming velocity, volume, variety and veracity.
It is estimated that 2.5 quintillion bytes of data are created each day—so much that 90 percent of ...
Snappy compression with Pig and native MapReduce
Assuming you have installed Hadoop on your cluster, if not please follow http://code.google.com/p/hadoop-snappy/
This is the machine config of my cluster nodes, though the steps that follow could be followed with your installation/machine configs
pkommireddi@pkommireddi-wsl:/tools/hadoop/pig-0.9.1/lib$ uname -a
Linux pkommireddi-wsl 2.6.32-37-generic #81-Ubuntu SMP Fri ...
A New Python Client for Impala
The new Python client for Impala will bring smiles to Pythonistas!
As a data scientist, I love using the Python data stack. I also love using Impala to work with very large data sets. But things that take me out of ...
Cloudera, MongoDB partner to mash up NoSQL, Hadoop
Hadoop specialist Cloudera announced a strategic partnership with MongoDB this week that will allow Cloudera customers to store Hadoop data in their NoSQL MongoDB databases. The move is a huge win for MongoDB, which is quickly emerging as one of ...
Will Big Data Analytics Replace BI?
John Leonard of Computing.co.uk recently asked, “In 10 years’ time will big data analytics have replaced traditional relational business intelligence systems? That was one of the questions that Computing asked UK IT decision-makers in the quantitative survey that formed part of our recent big data ...
Intro to Machine Learning
Machine learning is sub set of artificial intelligence and it is study of systems that can learn from data. A machine learning system could be trained. Core of machine learning deals with representation and generalization.
Machine learning is a "Field of ...
Apache Ambari 1.5.1 is Released!
Apache Ambari community proudly released version 1.5.1. This is the result of constant, concerted collaboration among the Ambari project’s many members. This release represents the work of over 30 individuals over 5 months and, combined with the Ambari 1.5.0 release, ...
Hadoop or Warehousing, or Both?
One of the thornier questions facing enterprise executives in these days of broad infrastructural change is how to deal with Big Data. On the surface, it may seem like a no-brainer: No matter how big the data load becomes, there ...
Top 7 Tips to Succeed with Big Data
Today all the businesses are focusing and investing on big data Analytics to offer reliable services and to get profits. Big data is playing vital role in making the better business decisions by enabling data scientists and other users to ...






