Snappy compression with Pig and native MapReduce
This assumes you have already installed Hadoop on your cluster; if not, please follow http://code.google.com/p/hadoop-snappy/
This is the machine configuration of my cluster nodes, though the steps that follow should also work with your own installation and machine configuration
pkommireddi@pkommireddi-wsl:/tools/hadoop/pig-0.9.1/lib$ uname -a
Linux pkommireddi-wsl 2.6.32-37-generic #81-Ubuntu SMP Fri ...
Configure Eclipse for MapReduce
1. Download Eclipse Europa or Indigo
2. Download the Hadoop Eclipse plugin, e.g. hadoop-eclipse-plugin-1.0.3.jar
3. Copy the JAR into the Eclipse plugins folder
4. Open Eclipse
5. Add Map/Reduce server
6. Add New DFS Location
Location name: localhost
Map/Reduce Master:
port: 9001
DFS Master
port: 9000
Finish
7. New -> Other -> Map/Reduce Project
-> ...
How MapR's M7 Platform Improves NoSQL and Hadoop
The M7 Edition. Sounds like a high-performance sports car, doesn't it? In reality, M7 is MapR's enterprise-grade platform that provides its own unique brand of high performance, dependability and ease of use to both NoSQL and Hadoop applications. M7 removes ...
Anatomy of a MapReduce Job
Hadoop Ecosystem and MapReduce
There is an extensive list of products and projects that either extend Hadoop's functionality or expose some existing capability in new ways. For example, executing SQL-like queries on top of Hadoop has spawned several products. Facebook started ...
HBase Architecture
HBase – The Basics:
HBase is an open-source, NoSQL, distributed, non-relational, versioned, multi-dimensional, column-oriented store, modeled after Google BigTable, that runs on top of HDFS. "NoSQL" is a broad term meaning that the database isn't an RDBMS which ...
Introduction to Impala
Impala is significant in the Hadoop ecosystem because of its:
Scalability
Flexibility
Efficiency
What's Impala?
Impala is…
Interactive SQL: Impala is typically 5 to 65 times faster than Hive, because it reduces the response time to just seconds, not minutes.
Nearly ANSI-92 standard and compatible with ...
Hadoop Interview Questions – MapReduce
Looking for Hadoop interview questions that are frequently asked by employers?
What is MapReduce?
It is a framework or a programming model that is used for processing large data sets over clusters of computers using distributed programming.
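To make the programming model concrete, here is a minimal single-process sketch in plain Python. It is not Hadoop's API and does no distribution; the function names (map_phase, shuffle, reduce_phase, word_count) are hypothetical and chosen only for illustration. It shows the shape of the computation: a map step emits key/value pairs, the pairs are grouped by key, and a reduce step combines the values for each key.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group values by key, standing in for the framework's shuffle step."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values collected for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence on an in-memory input."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


counts = word_count(["big data big clusters", "data moves"])
print(counts)
```

In a real cluster the map and reduce calls would run on different machines and the grouping would happen over the network; here everything runs in one process so the data flow is easy to follow.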
What are 'maps' and 'reduces'?
'Maps' ...






