The Hadoop Ecosystem: HDFS, YARN, Hive, Pig, HBase and growing

Hadoop is the leading open-source software framework developed for scalable, reliable and distributed computing. With the world producing data in the zettabyte range, there is a growing need for cheap, scalable, reliable and fast computing to process and make sense ...

How to become a Data Scientist for Free

By Nir Goldstein, ReSkill. Statistical analysis and data mining were the top skills that got people hired in 2014, based on a LinkedIn analysis of 330 million LinkedIn member profiles. We live in an increasingly data-driven world, and businesses are aggressively ...

Things You Should Know About Big Data

People using Big Data need to know certain things about it, especially the technologies and architecture, and need to understand the biggest challenges of implementing Big Data processes in their storage environments. After its arrival on the scene, for some time ...

7 Ways to Get Ready for the Big Data of the Future

Data science is in the midst of transformation, with Big Data technologies starting to significantly encroach on the market share of traditional RDBMSs (relational database management systems). Spending worldwide on Big Data is forecast to hit $114 billion by 2018, ...

Apache Drill 1.0 is Now Generally Available

Today, we are extremely excited and proud to announce the general availability (GA) of Apache Drill 1.0, as part of the MapR Distribution. Congratulations to the Drill community on this significant milestone and achievement! Incubated in September 2012 as an Apache ...

Apache Parquet paves the way for better Hadoop data storage

Apache Parquet, which provides columnar storage in Hadoop, is now a top-level Apache Software Foundation (ASF)-sponsored project, paving the way for its more advanced use in the Hadoop ecosystem. Already adopted by Netflix and Twitter, Parquet began in 2013 as a ...

AWS joins Azure and Watson in bringing machine learning to big data

AMAZON WEB SERVICES (AWS) has announced that it will soon offer machine learning as an option to customers. The technology is the same creepy stuff that uses algorithms to recommend things that you might also like when you browse Amazon's retail ...