Kafka 0.9 and MapR Streams put streaming data in the spotlight

As the Internet of Things (IoT) heats up as a topic, and as big data continues on its path to maturity, streaming data technology, which sits squarely at the intersection of the two, is having its shining moment. How appropriate, ...

BI Professionals Spend 50-90% of Their Time ‘Cleaning’ Raw Data for Analytics

Last year, the NYT shined a light on big data’s “janitor” problem – that data scientists and business intelligence pros spend too much time cleaning data rather than evaluating it. But how big an issue is it, really? Xplenty just wrapped a commissioned study of ...

MapR 5.0 Hadoop supports real-time applications

With a focus on real-time applications, MapR Technologies took the wraps off the 5.0 version of its Hadoop distribution. "Eighteen percent of our customers have 50 or more applications running on a single cluster," says Jack Norris, chief marketing officer ...

Couchbase NoSQL Database Goes Mobile

NoSQL database vendor Couchbase Inc. is seeking to let developers build always-available, data-driven apps for mobile users wherever they are, even when they don't have connectivity to the cloud. The Couchbase Mobile suite was released yesterday, featuring three components, including an ...

Snappy compression with Pig and native MapReduce

This assumes you have installed Hadoop on your cluster; if not, please follow http://code.google.com/p/hadoop-snappy/. This is the machine config of my cluster nodes, though the steps that follow should also work with your own installation/machine configs:

pkommireddi@pkommireddi-wsl:/tools/hadoop/pig-0.9.1/lib$ uname -a
Linux pkommireddi-wsl 2.6.32-37-generic #81-Ubuntu SMP Fri ...