How to Run a Simple Apache Spark App in CDH 5
Getting started with Spark (now shipping inside CDH 5) is easy using this simple example.
Apache Spark is a general-purpose, cluster computing framework that, like MapReduce in Apache Hadoop, offers powerful abstractions for processing large datasets. For various reasons pertaining to ...
Hadoop or Warehousing, or Both?
One of the thornier questions facing enterprise executives in these days of broad infrastructural change is how to deal with Big Data. On the surface, it may seem like a no-brainer: No matter how big the data load becomes, there ...
Top 30 Big Data Companies to watch in 2025
. The Big Data space is heating up ā to the point that many pundits already see it as the over-hyped heir to "cloud." The hype may be a bit much, but Big Data is already living up to its ...
Cassandra-Database Solution for modern day applications?
Cassandra is a one stop choice for data driven organizations dealing with real-time Big Data operations for their core functionalities. Now what makes it so dear to the developers and organizations dealing huge databases is a bunch of features that ...
Top 7 Tips to Succeed with Big Data
Today all the businesses are focusing and investing on big data Analytics to offer reliable services and to get profits. Big data is playing vital role in making the better business decisions by enabling data scientists and other users to ...
10 Big Data Predictions for 2014
Big data was seen as one of the biggest buzzwords of 2013 and companies are spending a lot on Big data analytics.Ā The storage and analysis of large and/or complex data sets using a series of techniques including, but not ...
10 Big Data Analytics Use Cases for Healthcare IT
Big data means a lot of things to a lot of different people, but what is becoming increasingly clear as the largest market players strategies start to unfold, big data is about real-time analysis and data driven decision-making. Now Big ...
Oracle Launches NoSQL Database 3.0
Oracle NoSQL DatabaseĀ 3.0 has been released by Oracle, adding improvements to enable the security, usability, and performance required for enterprise development and IT needs. Available in two versions, the Oracle NoSQL Database 3.0 Enterprise Edition and Oracle NoSQL Database 3.0 ...
How to Contribute to HBase and Hadoop2
By Nick Dimiduk
In case you havenāt heard, Hadoop2 is on the way! There are loads more new features than I can begin to enumerate, including lots of interesting enhancements to HDFS for online applications like HBase. One of the most ...
Apache Tez 0.3 Released
The Apache Tez community has voted to release 0.3 of the software.
Apache⢠Tez is a replacement of MapReduce that provides a powerful framework for executing a complex topology of tasks. Tez 0.3.0 is an important release towards making the software ...






