Hadoop Cluster Commissioning and Decommissioning Nodes
To add new nodes to the cluster:
1. Add the network addresses of the new nodes to the include file.
hdfs-site.xml
<property>
  <name>dfs.hosts</name>
  <value>/<hadoop-home>/conf/includes</value>
  <final>true</final>
</property>
mapred-site.xml
<property>
  <name>mapred.hosts</name>
  <value>/<hadoop-home>/conf/includes</value>
  <final>true</final>
</property>
Datanodes that are permitted to connect to the namenode are listed in a
file whose path is given by the dfs.hosts property.
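Step 1 can be sketched in code. The example below is only an illustration under stated assumptions: it assumes the include file is plain text with one hostname per line, and the function name `add_hosts` and the hostnames used with it are hypothetical, not part of Hadoop.

```python
from pathlib import Path


def add_hosts(include_path, new_hosts):
    """Append hostnames that are not yet listed in the include file.

    Assumption: the include file holds whitespace-separated hostnames,
    written here as one per line. Returns the hostnames actually added.
    """
    path = Path(include_path)
    text = path.read_text() if path.exists() else ""
    existing = set(text.split())

    added = []
    for host in new_hosts:
        host = host.strip()
        if host and host not in existing:
            existing.add(host)
            added.append(host)

    if added:
        # Start on a fresh line if the file does not end with a newline.
        prefix = "" if text == "" or text.endswith("\n") else "\n"
        with path.open("a") as f:
            f.write(prefix + "\n".join(added) + "\n")
    return added
```

Calling `add_hosts("/path/to/includes", ["node3.example.com"])` would append the host only if it is missing, so running it twice leaves the file unchanged the second time. The path passed in should be the one configured in dfs.hosts and mapred.hosts.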
Includes file ...
Hive Interview Questions
What is Hive?
Hive is a data warehouse system for Hadoop that facilitates easy data summarization, ad-hoc queries, and the analysis of large datasets stored in Hadoop-compatible file systems.
Hive was originally developed at Facebook. It’s now a Hadoop subproject with ...
Big Data TechCon: Learn HOW TO Master Big Data, Mar 31-Apr 2, Boston
Big Data TechCon, March 31-April 2, Boston, is the “how-to” big data event. Use code BIGDATA for $200 discount. www.bigdatatechcon.com
Plan now to attend Big Data TechCon, March 31-April 2 in Boston, to learn HOW-TO accommodate the terabytes and petabytes of data ...
5 Steps To Master Big Data and Predictive Analytics
As recently as the past two years, one of the seminal issues regarding Big Data was storage, especially with respect to the exponential growth and size of unstructured data that did not fit into databases (e.g., video feeds, PowerPoint presentations). ...
YARN and MapReduce 2.0 elevate big data Hadoop and scheduled processing
Are you ready to take advantage of the multi-application functionality of Hadoop with MapReduce 2.0, or as it is more affectionately known, YARN? According to Arun Murthy, a big-data architect and a co-founder of Hortonworks, YARN substantially improves the functionality ...
3 Tools Companies Can Use to Harness the Power of Big Data
To the individual user, Big Data might simply mean a new 3-terabyte hard drive, which can be acquired for a hundred bucks or so. But real Big Data projects require clusters of servers, vast amounts of storage, and specialized software ...
5 lessons we learned about data science in 2013
Most people know what marketing executives do every day. They try to catch people’s attention through email, ads, tweets, and press releases. As for data scientists, well, their work is not nearly as well understood.
That’s been slowly changing this year ...
Big Data In 2014: 6 Bold Predictions
How will big data evolve in 2014? The future is anyone's guess, of course, but we thought we'd compile a tasty holiday assortment of prognostications from executives working in the big data trenches. So without further delay, here they are ...
Big Data 2.0: the next generation of Big Data
In the last few years we have seen Big Data generate a lot of buzz along with the launch of several successful big data products. The big data ecosystem has now reached a tipping point where the basic infrastructural capabilities ...
The 3 most common ways data junkies are using Hadoop
Just a few weeks ago, Apache Hadoop 2.0 was declared generally available, a huge milestone for the Hadoop market as it unlocks the vision of interacting with stored data in unprecedented ways. Hadoop remains the typical underpinning technology of “Big Data,” ...






