Stinger and Tez: A primer
What is Stinger?
The Stinger initiative aims to redesign Hive to make it what people want today: Hive is currently used for large batch jobs and works great in that sense; but people also want interactive queries, and Hive is too ...
The Hadoop Ecosystem: HDFS, Yarn, Hive, Pig, HBase and growing
Hadoop is the leading open-source software framework developed for scalable, reliable and distributed computing. With the world producing data in the zettabyte range there is a growing need for cheap, scalable, reliable and fast computing to process and make sense ...
5 Questions Enterprises Should Ask When Selecting a NoSQL Database
By Barry Perkins
With the need for more flexibility when it comes to defining and handling large amounts of data, NoSQL has emerged as a feasible alternative to relational databases.
NoSQL databases enable better application development productivity, greater ability to scale dynamically ...
Big Data Analytics and its Importance
When we talk about Big Data, some people say that it’s just talk and no action, while on the other side another set of people claim that Big Data delivers usable insights. My ...
The famed MongoDB document database and its benefits
Over the past few years, there have been talks about data management and administration of databases. It is indeed important for data to be managed properly for the best results. Businesses get to enjoy lots of benefits if they manage ...
Big Data trends for 2015 Infographic
Here is DataRPM's infographic on its analysis of big data and its predictions for 2015.
NoSQL, NewSQL, or RDBMS: How To Choose
When should you choose a NoSQL or NewSQL option versus a conventional relational database management system? Here are 10 telltale traits that will help you make the right choice.
Today's databases are not only expected to be flexible enough to handle ...
Altiscale Hadoop-as-a-Service Delivers Apache Hive 0.13
Altiscale, Inc., a leading innovator in Hadoop-as-a-Service (HaaS) solutions, has announced the availability of Apache Hive™ 0.13 on its HaaS platform, just weeks since its general software release to the industry. For data scientists and businesses that rely on insights ...
6 big data trends in 2014
Data are being generated by every device imaginable. Big data are arriving from multiple sources at an alarming velocity, volume, variety and veracity.
It is estimated that 2.5 quintillion bytes of data are created each day—so much that 90 percent of ...
Bringing the Best of Apache Hive 0.13 to CDH Users
More than 300 bug fixes and stable features in Apache Hive 0.13 have already been backported into CDH 5.0.0.
Last week, the Hive community voted to release Hive 0.13. We’re excited about the continued efforts and progress in the project and ...






