Configure Eclipse for MapReduce
1. Download load eclipse Europa or Indigo
2. Download Hadoop eclipse plugin eg: hadoop-eclipse-plugin-1.0.3.jar
3. Copy jar in eclipse plugin folder
4. Open ...
A guide to NoSQL offerings
Amazon Web Services: DynamoDB is a NoSQL database service that makes it simple and cost-effective to store and retrieve any ...
Apache Mahout is moving on from MapReduce
Apache Mahout, a machine learning library for Hadoop since 2009, is joining the exodus away from MapReduce. The project’s community has decided ...
Apache Falcon-Data Governance for Hadoop
Apache Falcon is a data governance engine that defines, schedules, and monitors data management policies. Falcon allows Hadoop administrators to ...
Google Launches BigQuery Streaming For Real-Time, Big-Data Analytics
BigQuery, Google’s cloud-based tool for quickly analyzing very large datasets, is getting a massive price cut today (up to 85 percent). ...
Pivotal juices Hadoop with in-memory database and SQL querying
Pivotal, an EMC/VMware spin-off that has big plans to deliver big data analytics through platform as a service, has whisked ...
How to Contribute to HBase and Hadoop2
By Nick Dimiduk
In case you haven’t heard, Hadoop2 is on the way! There are loads more new features than I ...
HBase BlockCache Showdown
The HBase BlockCache is an important structure for enabling low latency reads. As of HBase 0.96.0, there are no less ...
How-to Implement Role-based Security in Impala using Apache Sentry
Apache Sentry (incubating) is the Apache Hadoop ecosystem tool for role-based access control (RBAC). In this how-to, I will demonstrate ...
What Can GPFS on Hadoop Do For You?
The Hadoop Distributed File System (HDFS) is considered a core component of Hadoop, but it’s not an essential one. Lately, ...






