5 Areas Where Big Data is Doing Well
It's not often that an IT innovation is as disruptive as Big-Data. The tecnology has brought a paradigm shift in our approach, storage, usage and monetization of data assets. Every path-breaking innovation has its share of hype, expectations and "Big ...
7 Facts About Hadoop That You Should Know
Where there is Big Data, there is Hadoop and vice versa. With Big Data analytics becoming as big as they have, Hadoop has become a mainstay in the technology industry.
Hereare a few facts that you should keep in mind when ...
Splunk’s Hunk 6.1enables faster analytics for Hadoop and NoSQL Data Stores
Splunk Inc. announced version 6.1 of Hunk: Splunk Analytics for Hadoop and NoSQL Data Stores. Hunk 6.1 makes it even faster and easier to turn raw, unstructured data in Hadoop and NoSQL data stores into business insights. Accelerated reports in ...
5 Big Data Use Cases To Watch
We hear a lot about big data's ability to deliver usable insights -- but what does this mean exactly?
It's often unclear how enterprises are using big-data technologies beyond proof-of-concept projects. Some of this might be a byproduct of corporate secrecy. ...
5 technologies that will help big data cross the chasm
We’re on the cusp of a real turning point for big data. Its applications are becoming clearer, its tools are getting easier and its architectures are maturing in a hurry. It’s no longer just about log files, clickstreams and tweets. ...
Snappy compression with Pig and native MapReduce
Assuming you have installed Hadoop on your cluster, if not please follow http://code.google.com/p/hadoop-snappy/
This is the machine config of my cluster nodes, though the steps that follow could be followed with your installation/machine configs
pkommireddi@pkommireddi-wsl:/tools/hadoop/pig-0.9.1/lib$ uname -a
Linux pkommireddi-wsl 2.6.32-37-generic #81-Ubuntu SMP Fri ...
A New Python Client for Impala
The new Python client for Impala will bring smiles to Pythonistas!
As a data scientist, I love using the Python data stack. I also love using Impala to work with very large data sets. But things that take me out of ...
Will Big Data Analytics Replace BI?
John Leonard of Computing.co.uk recently asked, “In 10 years’ time will big data analytics have replaced traditional relational business intelligence systems? That was one of the questions that Computing asked UK IT decision-makers in the quantitative survey that formed part of our recent big data ...
Bringing the Best of Apache Hive 0.13 to CDH Users
More than 300 bug fixes and stable features in Apache Hive 0.13 have already been backported into CDH 5.0.0.
Last week, the Hive community voted to release Hive 0.13. We’re excited about the continued efforts and progress in the project and ...
Intro to Machine Learning
Machine learning is sub set of artificial intelligence and it is study of systems that can learn from data. A machine learning system could be trained. Core of machine learning deals with representation and generalization.
Machine learning is a "Field of ...






