The Top 5 Hadoop Distributions

A new report by Forrester Research’s big data analysts says that adopting Hadoop is “mandatory” for any organization that wishes to do advanced analytics and get actionable insights on their data. Forrester estimates that between 60% and 73% of data that ...

Data Lake Showdown: Object Store or HDFS?

The explosion of data is causing people to rethink their long-term storage strategies. Most agree that distributed systems, one way or another, will be involved. But when it comes down to picking the distributed system–be it a file-based system like ...

4 Considerations When Choosing a Hadoop Distribution

Choosing the right Hadoop distribution can be a tricky process. Many businesses looking to adopt Hadoop in their data infrastructure have a hard time figuring out what really differentiates one distribution from another. With so many options available, it’s easy ...

Apache Parquet paves the way for better Hadoop data storage

Apache Parquet, which provides columnar storage in Hadoop, is now a top-level Apache Software Foundation (ASF)-sponsored project, paving the way for its more advanced use in the Hadoop ecosystem. Already adopted by Netflix and Twitter, Parquet began in 2013 as a ...

Apache Flink: Possible replacement for Hadoop?

Earlier this week, Apache Software Foundation unveiled its latest Top Level Project (TLP), Flink. Flink is just one of the many data processing tools that have emerged since the Java-based distributed computing platform Hadoop has increased adoption in the enterprise space. Flink is ...