Data Science – What’s the big deal about it?
Thomas Davenport, an American academic who writes for Harvard Business Review, once called Data Scientist “the Sexiest Job of the 21st Century”. But why is there so much hype and mythos around Data Scientists and Data Science?
The ...
Why Extended Attributes are Coming to HDFS
Extended attributes in HDFS will facilitate at-rest encryption for Project Rhino, but they have many other uses, too.
Many mainstream Linux filesystems implement extended attributes, which let you associate metadata with a file or directory beyond common “fixed” attributes like filesize, ...
How to Create a Database in MongoDB
MongoDB is one of the “NoSQL” database solutions used to store and query big data. Longtime SQL developers might find Mongo a bit counterintuitive. With normal relational databases, you create a database, then tables, and then insert your ...
VMware Updates Big Data Extensions with Hadoop 2 Support
VMware Inc. updated its Big Data Extensions (BDE) for its vSphere virtualization platform, including support for Hadoop 2.
BDE's set of integrated management tools -- built into vSphere -- helps organizations deploy, run and manage Hadoop. With BDE, vSphere users can ...
12 Very Important Tools For Hadoop Users
When it comes to Big Data analysis, Hadoop is at the forefront of things. Almost every company worth mentioning is looking for people specialised in Hadoop. These tools manage various aspects of big data analysis using Hadoop.
1. Ambari
The Apache Ambari ...
40 Best Data Visualization Tools and Software List for 2025
Data visualizations are everywhere today. Whether they are used to impress potential investors, report on progress, or visualize concepts for customer segments, data visualizations are a valuable tool in a variety of settings. When it ...
Facebook HydraBase adds reliability to Hadoop’s HBase
Facebook's becoming almost as notable for its adventures with open source projects as it is for its social network of more than 1 billion users. The company's latest experiment: revising one of Hadoop's key components to make it more reliable ...
Altiscale Hadoop-as-a-Service Delivers Apache Hive 0.13
Altiscale, Inc., a leading innovator in Hadoop-as-a-Service (HaaS) solutions, has announced the availability of Apache Hive™ 0.13 on its HaaS platform, just weeks after its general software release to the industry. For data scientists and businesses that rely on insights ...
Actian, HP Vertica Join SQL-On-Hadoop Bandwagon
Actian on Tuesday joined the long list of companies that have introduced a way to support SQL access and querying on top of Hadoop. The announcement comes just a week after HP upgraded SQL-on-Hadoop functionality it introduced late last year ...
Can Super-Fast Apache Spark Light Up Hadoop?
it the Hadoop Swiss Army knife of cluster computing frameworks. The Apache Software Foundation just rolled out Apache Spark v1.0, which it's calling a "super-fast, open-source, large-scale data processing and advanced analytics engine."
That's a ...






