9 Questions to Ask Before Kicking off any Big Data Project
What do you get when you combine rebranded analytics systems, a minefield of consultants turned “big data experts,” and insanely expensive “big data servers” that look suspiciously similar to commodity machines?
You get: the most complicated space for any business or ...
Apache Parquet paves the way for better Hadoop data storage
Apache Parquet, which provides columnar storage in Hadoop, is now a top-level Apache Software Foundation (ASF)-sponsored project, paving the way for its more advanced use in the Hadoop ecosystem.
Already adopted by Netflix and Twitter, Parquet began in 2013 as a ...
Hadoop update 2.7.0 Released
The Apache Hadoop community is happy to announce the release of Apache Hadoop 2.7.0! We want to express our gratitude to every contributor, reviewer and committer.
The Hadoop community fixed 923 JIRAs in total as part of the 2.7.0 release. Of ...
5 NoSQL Predictions for 2015
2014 has been another interesting year in Big Data with long-awaited IPOs, massive fund-raising and increased awareness around the data management market. So, what does this mean for the future of Enterprise NoSQL and the impact on enterprises across industries ...
AWS joins Azure and Watson in bringing machine learning to big data
AMAZON WEB SERVICES (AWS) has announced that it will soon offer machine learning as an option to customers.
The technology is the same creepy stuff that uses algorithms to recommend things that you might also like when you browse Mazon's retail ...
Apache Tajo brings data warehousing to Hadoop
Organizations that want to extract more intelligence from their Hadoop deployments might find help from the relatively little known Tajo open source data warehouse software, which the Apache Software Foundation has pronounced as ready for commercial use.
The new version of ...
4 Hot Big Data Analytics Roles in IT Industry
TimesJobs data shows the IT industry is now moving away from the generic data scientist role to specific big data-related roles to build their analytics solution portfolios.
Big data is the IT industry's new buzzword. Industries have been talking about how ...
MapReduce for C: Run Native Code in Hadoop
Google announced the release of MapReduce for C (MR4C), an open source framework that allows you to run native code in Hadoop.
MR4C was originally developed at Skybox Imaging to facilitate large scale satellite image processing and geospatial data science. We ...
MongoDB 3.0 NoSQL database integrates WiredTiger engine and Ops Manager tool
MongoDB has announced the latest release of the NoSQL database, incorporating the WiredTiger storage engine for greater performance, and introducing Ops Manager, a new application for managing MongoDB deployments.
MongoDB 3.0 is set to be generally available in March, when the ...
3 useful tools for big data log analysis
When looking around the data center, it's difficult to ignore the potential in all of the big data available from infrastructure systems. There are server and application logs, data from network and storage taps, and metadata from databases and applications. ...






