What Can GPFS on Hadoop Do For You?
The Hadoop Distributed File System (HDFS) is considered a core component of Hadoop, but it’s not an essential one. Lately, IBM has been talking up the benefits of hooking Hadoop up to the General Parallel File System (GPFS). IBM has ...
Apache Tez 0.3 Released
The Apache Tez community has voted to release 0.3 of the software.
Apache™ Tez is a replacement of MapReduce that provides a powerful framework for executing a complex topology of tasks. Tez 0.3.0 is an important release towards making the software ...
Avoiding Split Brainedness in HA Hadoop Clusters
The US Patent Office recently granted Zettaset a patent for the underlying technology in its Hadoop high availability that prevents a "split-brain" situation where multiple master nodes think they're in control of the Hadoop cluster. It's a feather in the ...
Hadoop YARN adds more application threads for big data users
Even Hadoop's most enthusiastic proponents might admit that its marriage to MapReduce has limited what the open source technology can do. But with the advent of Hadoop 2 and its key component, the Hadoop YARN resource manager, the distributed processing ...
Data transfer between MySql and Cassandra using Sqoop
Sqoop is a tool designed to transfer data between Hadoop and relational databases. You can use Sqoop to import data from a relational database management system (RDBMS) such as MySQL or Oracle into the Hadoop Distributed File System (HDFS), transform ...
SQL is what’s next for Hadoop
Today all the companies are trying to let users run SQL queries from inside Hadoop as it is open-source software framework. Companies are using Hive and HiveQL languages in Hadoop implementation but Hive is mainly depends on MapReduce. Business intelligence ...
How MapR’s M7 Platform Improves NoSQL and Hadoop
The M7 Edition. Sounds like a high performance sports car, doesn’t it? In reality, M7 is MapR’s enterprise-grade platform that provides its own unique brand of high-performance, dependability and ease of use to both NoSQL and Hadoop applications. M7 removes ...
MongoDB NoSQL Database Interview Questions
MongoDB is the best free open source NoSQL document oriented database. If you are preparing for the technical interview on MongoDB NoSQL database, must prepare the following MongoDB NoSQL database interview questions. These MongoDB NoSQL database interview questions cover basic ...
6 things to make your Big Data project succeed
Some things that you can do to actually make the Big Data project you take on succeed. Â The first thing you need to do is stop trying to make 'Big Data' succeed and instead start focusing on how you educate ...
Integrating Hadoop into Business Intelligence and Data Warehousing
Information from SAS and TDWI Research
The purpose of this report is to accelerate users’ understanding of the many new products and practices based on Hadoop technologies that have emerged in recent years. While Hadoop usage is a minority practice today, ...






