12 Very Important Tools For Hadoop Users
When it comes to Big Data analysis, Hadoop is at the forefront of things. Almost every company worth mentioning is looking for people specialised in Hadoop. These tools manage various aspects of big data analysis using Hadoop.
1. Ambari
The Apache Ambari ...
Enjoy machine learning with Mahout on Hadoop
"Mahout" is a Hindi term for a person who rides an elephant. The elephant, in this case, is Hadoop -- and Mahout is one of the many projects that can sit on top of Hadoop, although you do not always ...
How Accurate is Mahout for Summing Numbers?
A question was recently posted on the Mahout mailing list suggesting that the Mahout math library was "unwashed" because it didn't use Kahan summation. Â My feeling is that this complaint is not founded and Mahout is considerably more washed than ...
Apache Spark is now part of MapR’s Hadoop distribution
Hadoop vendor MapR is getting in early on the Apache Spark action, too, announcing on Thursday that it’s adding the Spark stack to its Hadoop distribution as part of a partnership with Spark startup Databricks (Ion Stoica, the co-founder and CEO of ...
Apache Mahout is moving on from MapReduce
Apache Mahout, a machine learning library for Hadoop since 2009, is joining the exodus away from MapReduce. The project’s community has decided to rework Mahout to support the increasingly popular Apache Spark in-memory data-processing framework, as well as the H2O engine for ...






