Data Lake Showdown: Object Store or HDFS?

The explosion of data is causing people to rethink their long-term storage strategies. Most agree that distributed systems, one way or another, will be involved. But when it comes down to picking the distributed system–be it a file-based system like ...

4 Considerations When Choosing a Hadoop Distribution

Choosing the right Hadoop distribution can be a tricky process. Many businesses looking to adopt Hadoop in their data infrastructure have a hard time figuring out what really differentiates one distribution from another. With so many options available, it’s easy ...

LinkedIn fills another SQL-on-Hadoop niche

Social networks generate colossal amounts of data that have come to defy the use of conventional data-processing tools, so it's no wonder their engineering teams have built their own toolsets -- such as Facebook and its machine-learning tools. Enter LinkedIn, now ...

MapR 5.0 Hadoop supports real-time applications

With a focus on real-time applications, MapR Technologies took the wraps off the 5.0 version of its Hadoop distribution. "Eighteen percent of our customers have 50 or more applications running on a single cluster," says Jack Norris, chief marketing officer ...

7 Ways to Get Ready for the Big Data of the Future

Data science is in the midst of transformation, with Big Data technologies starting to significantly encroach on the market share of traditional RDBMSs (relational database management systems). Spending worldwide on Big Data is forecast to hit $114 billion by 2018, ...

How to Beat Your Competition Using Analytics in Insurance

AXA, LV= and Co-operative Insurance divulge how you can beat your competition using analytics in insurance. Competition in insurance is heating up, and analytics is widely recognised as a key tool for gaining an edge. According to AXA, LV= and Co-operative Insurance; ...

Apache Drill 1.0 is Now Generally Available

Today, we are extremely excited and proud to announce the general availability (GA) of Apache Drill 1.0, as part of the MapR Distribution. Congratulations to the Drill community on this significant milestone and achievement! Incubated in September 2012 as an Apache ...

9 Questions to Ask Before Kicking off any Big Data Project

What do you get when you combine rebranded analytics systems, a minefield of consultants turned “big data experts,” and insanely expensive “big data servers” that look suspiciously similar to commodity machines? You get: the most complicated space for any business or ...

Apache Parquet paves the way for better Hadoop data storage

Apache Parquet, which provides columnar storage in Hadoop, is now a top-level Apache Software Foundation (ASF)-sponsored project, paving the way for its more advanced use in the Hadoop ecosystem. Already adopted by Netflix and Twitter, Parquet began in 2013 as a ...