HBase Architecture
HBase - The Basics:
HBase is an open-source, distributed, non-relational (NoSQL), versioned, multi-dimensional, column-oriented store modeled after Google BigTable that runs on top of HDFS. "NoSQL" is a broad term meaning that the database isn't an RDBMS which ...
Introduction to Impala
Impala is significant in the Hadoop ecosystem because of its:
Scalability
Flexibility
Efficiency
What's Impala?
Impala is...
Interactive SQL: Impala is typically 5 to 65 times faster than Hive, reducing response times to seconds rather than minutes.
Nearly ANSI-92 standard and compatible with ...
Free Cloudera Impala Book
Get the Cloudera Impala book in PDF format for free from the Cloudera website, in association with the Strata Conference and Hadoop World. See the link below for the book info from the publisher as well as the link to download ...
Hadoop Cluster Commissioning and Decommissioning Nodes
To add new nodes to the cluster:
1. Add the network addresses of the new nodes to the include file.
hdfs-site.xml
<property>
<name>dfs.hosts</name>
<value>/<hadoop-home>/conf/includes</value>
<final>true</final>
</property>
mapred-site.xml
<property>
<name>mapred.hosts</name>
<value>/<hadoop-home>/conf/includes</value>
<final>true</final>
</property>
Datanodes that are permitted to connect to the namenode are specified in a
file whose name is specified by the dfs.hosts property.
Includes file ...
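To make the include-file mechanism above concrete, here is a minimal Python sketch, not Hadoop's own code. It reads the dfs.hosts property from a configuration string, loads the include file it names, and checks whether a host would be permitted. The function names, host names, and temporary path are hypothetical and exist only for illustration. The sketch assumes the include file lists hosts separated by whitespace or newlines, and that an empty include list means every host is permitted.

```python
import tempfile
import xml.etree.ElementTree as ET
from pathlib import Path


def get_property(config_xml, name):
    """Return the <value> of the named <property>, or None if it is absent."""
    root = ET.fromstring(config_xml)
    for prop in root.iter("property"):
        if prop.findtext("name") == name:
            return prop.findtext("value")
    return None


def read_include_file(path):
    """Return the set of hosts listed in an include file.

    Assumption: hosts are separated by whitespace or newlines.
    """
    return set(Path(path).read_text().split())


def is_permitted(host, include_hosts):
    """Assumption: an empty include list means all hosts are permitted."""
    return not include_hosts or host in include_hosts


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        # Hypothetical include file with two new nodes.
        include_path = Path(tmp) / "includes"
        include_path.write_text("node1.example.com\nnode2.example.com\n")

        # Same shape as the hdfs-site.xml fragment, wrapped in a root element.
        hdfs_site = (
            "<configuration><property>"
            "<name>dfs.hosts</name>"
            f"<value>{include_path}</value>"
            "<final>true</final>"
            "</property></configuration>"
        )

        hosts = read_include_file(get_property(hdfs_site, "dfs.hosts"))
        for candidate in ("node1.example.com", "node9.example.com"):
            print(candidate, "permitted:", is_permitted(candidate, hosts))
```

The same lookup works for mapred.hosts in mapred-site.xml, since both properties point at the same include file in the configuration above.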
BigData TechCon-Learn HOW TO Master Big Data, Mar 31-Apr 2, Boston
Big Data TechCon, March 31-April 2, Boston, is the "how-to" big data event. Use code BIGDATA for a $200 discount. www.bigdatatechcon.com
Plan now to attend Big Data TechCon, March 31-April 2 in Boston, to learn HOW-TO accommodate the terabytes and petabytes of data ...
3 Tools Companies Can Use to Harness the Power of Big Data
To the individual user, Big Data might simply mean a new 3-terabyte hard drive, which can be acquired for a hundred bucks or so. But real Big Data projects require clusters of servers, vast amounts of storage, and specialized software ...
5 lessons we learned about data science in 2013
Most people know what marketing executives do every day. They try to catch people's attention through email, ads, tweets, and press releases. As for data scientists, well, their work is not nearly as well understood.
That's been slowly changing this year ...
Big Data In 2014: 6 Bold Predictions
How will big data evolve in 2014? The future is anyone's guess, of course, but we thought we'd compile a tasty holiday assortment of prognostications from executives working in the big data trenches. So without further delay, here they are ...
The 3 most common ways data junkies are using Hadoop
Just a few weeks ago, Apache Hadoop 2.0 was declared generally available, a huge milestone for the Hadoop market as it unlocks the vision of interacting with stored data in unprecedented ways. Hadoop remains the typical underpinning technology of "Big Data," ...
A Quick Guide To Choosing The Right Way To Use Hadoop
By Dan Woods. The market for Hadoop and related products is one of the most active in all of enterprise software. I've developed a simple framework that can help quickly explain the differences in the way Hadoop distributions are created ...






