Kafka 0.9 and MapR Streams put streaming data in the spotlight
As the Internet of Things (IoT) heats up as a topic, and as big data continues along on its path to maturity, streaming data technology, which fits perfectly in the intersection of the two, is having its shining moment.
How appropriate, ...
BI Professionals Spend 50-90% of Their Time âCleaningâ Raw Data for Analytics
Last year, the NYT shined a light on big dataâs âjanitorâ problem â that data scientists and business intelligence pros spend too much time cleaning, not evaluating data. But how big of an issue is it, really?
Xplenty just wrapped a commissioned study of ...
MapR 5.0 Hadoop supports real-time applications
With a focus on real-time applications, MapR Technologies took the wraps off the 5.0 version of its Hadoop distribution.
"Eighteen percent of our customers have 50 or more applications running on a single cluster," says Jack Norris, chief marketing officer ...
Couchbase NoSQL Database Goes Mobile
NoSQL database vendor Couchbase Inc. is seeking to let developers build always-available, data-driven apps for mobile users wherever they are, even when they don't have connectivity to the cloud.
The Couchbase Mobile suite was released yesterday, featuring three components, including an ...
Snappy compression with Pig and native MapReduce
Assuming you have installed Hadoop on your cluster, if not please follow http://code.google.com/p/hadoop-snappy/
This is the machine config of my cluster nodes, though the steps that follow could be followed with your installation/machine configs
pkommireddi@pkommireddi-wsl:/tools/hadoop/pig-0.9.1/lib$ uname -a
Linux pkommireddi-wsl 2.6.32-37-generic #81-Ubuntu SMP Fri ...
Configure Eclipse for MapReduce
1. Download load eclipse Europa or Indigo
2. Download Hadoop eclipse plugin eg: hadoop-eclipse-plugin-1.0.3.jar
3. Copy jar in eclipse plugin folder
4. Open eclipse
5. Add Map/Reduce server
6. Add New DFS Location
Location name: localhost
Map/Reduce Master:
port: 9001
DFS Master
port: 9000
Finish
7. New -> others -> Map/Reducer Project
-> ...
How MapRâs M7 Platform Improves NoSQL and Hadoop
The M7 Edition. Sounds like a high performance sports car, doesnât it? In reality, M7 is MapRâs enterprise-grade platform that provides its own unique brand of high-performance, dependability and ease of use to both NoSQL and Hadoop applications. M7 removes ...






