Hadoop Cluster Interview Questions
Which are the three modes in which Hadoop can be run?
The three modes in which Hadoop can be run are:
1. Standalone (local) mode
2. Pseudo-distributed mode
3. Fully distributed mode
What are the features of standalone (local) mode?
In standalone mode there are ...
Hadoop Resources
History of Hadoop
Spotlight on the early history of Hadoop
The history of Hadoop: From 4 nodes to the future of data
Big Ideas: Demystifying Hadoop
What is MapReduce?
"Cluster Computing and MapReduce Lecture" series on YouTube
http://www.youtube.com/watch?v=yjPBkvYh-ss
http://www.youtube.com/watch?v=-vD6PUdf3Js
http://www.youtube.com/watch?v=5Eib_H_zCEY
http://www.youtube.com/watch?v=1ZDybXl212Q
http://www.youtube.com/watch?v=BT-piFBP4fE
http://labs.google.com/papers/mapreduce.html
http://code.google.com/edu/parallel/mapreduce-tutorial.html
What is Hadoop?
http://radar.oreilly.com/2012/02/what-is-apache-hadoop.html
http://gigaom.com/cloud/what-it-really-means-when-someone-says-hadoop
http://www.ibm.com/developerworks/data/library/techarticle/dm-1209hadoopbigdata/
What is HDFS?
The paper covers most of ...
When to use Pig Latin versus Hive SQL?
Once your big data is loaded into Hadoop, what’s the best way to use that data? You’ll need some way to filter and aggregate the data, and then apply the results to something useful. Collecting terabytes and petabytes of web ...
Hadoop FS Shell Commands
Hadoop file system (fs) shell commands are used to perform various file operations, such as copying files, changing permissions, viewing the contents of a file, changing file ownership, and creating directories.
The syntax of fs shell command is
hadoop fs <args>
All the ...
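As a rough illustration of the `hadoop fs <args>` syntax above, the following Python sketch builds a few common invocations and prints them. The paths, file names, and the `demo` user are hypothetical placeholders, and the helper names `fs_command` and `run_all` are invented for this example. Commands are only printed by default; they are executed only if `dry_run` is turned off and a `hadoop` binary is found on the PATH.

```python
import shlex
import shutil
import subprocess


def fs_command(*args):
    """Build the argument list for one `hadoop fs <args>` invocation."""
    return ["hadoop", "fs", *args]


# Hypothetical paths and owner, chosen only for illustration.
EXAMPLES = [
    fs_command("-mkdir", "-p", "/user/demo/input"),                   # create a directory
    fs_command("-put", "local.txt", "/user/demo/input"),              # copy a local file into HDFS
    fs_command("-ls", "/user/demo/input"),                            # list directory contents
    fs_command("-cat", "/user/demo/input/local.txt"),                 # view file contents
    fs_command("-chmod", "644", "/user/demo/input/local.txt"),        # change permissions
    fs_command("-chown", "demo:demo", "/user/demo/input/local.txt"),  # change ownership
]


def run_all(commands, dry_run=True):
    """Print each command; run it only when dry_run is False and hadoop is installed."""
    for cmd in commands:
        print(shlex.join(cmd))
        if not dry_run and shutil.which("hadoop"):
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    run_all(EXAMPLES)
```

Each printed line can also be typed directly into a terminal on a machine with a configured Hadoop client.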
Data Export from Hadoop MapReduce to Database
Hadoop has become a huge part of the data warehouse at most companies. It is used for a variety of use cases: search and web indexing, machine learning, analytics and reporting, and so on. Most organizations are building Hadoop clusters in addition ...
Hadoop Interview Questions – MapReduce
Looking for Hadoop interview questions that are frequently asked by employers?
What is MapReduce?
It is a framework and programming model for processing large data sets across clusters of computers using distributed programming.
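The programming model can be sketched in a few lines of plain Python. This is a single-process word count that only imitates the map, shuffle, and reduce steps; it does not use Hadoop itself, and the function names are chosen for this example rather than taken from any Hadoop API.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle: group emitted values by key, as a framework would between the two phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: collapse all values for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    """Run map over every line, group by word, then reduce each group."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["Hadoop stores data", "Hadoop processes data"]))
```

On a real cluster the map and reduce calls would run in parallel on different machines, with the framework handling the grouping and data movement between them.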
What are 'maps' and 'reduces'?
'Maps' ...
YARN and MapReduce 2.0 elevates big data Hadoop and scheduled processing
Are you ready to take advantage of the multi-application functionality of Hadoop with MapReduce 2.0, or as it is more affectionately known, YARN? According to Arun Murthy, a big-data architect and a co-founder of Hortonworks, YARN substantially improves the functionality ...
Should DBAs learn Hadoop?
I’m a DBA with a working knowledge of Oracle, SQL Server, and MySQL. I have been reading more and more about big data. Should I learn Hadoop?
First, thank you for emailing your question. The mere nature of your question tells ...
Has Hadoop Entered the Mainstream?
After all the hype around Apache Hadoop, we are starting to see some successful implementations that highlight its benefits. However, we do not yet see the kind of enterprise adoption typically associated with a technology that promises to manage high ...
How to tell if your database is outdated-5 warning signs
A company's database is an important component of business intelligence and helps to integrate the various departments within the enterprise. For years, companies relied on databases with ETL (extract, transform, load) processing to extract, clean and integrate data into the ...






