Stinger and Tez: A primer
What is Stinger?
The Stinger initiative aims to redesign Hive to make it what people want today: Hive is currently used for large batch jobs and works great in that sense; but people also want interactive queries, and Hive is too ...
Top 10 Biggest Big Data Companies by Revenue
Everything about the big data market is big, especially the rate at which the market is growing. According to the Wikibon 2015 Big Data Market Shares report, big data market revenues grew by 22 percent last year alone. Here’s a ...
Big Data 101: A Beginner’s Guide To Big Data terminology
Data science can be confusing enough without all of the complicated lingo and jargon. For many, the terms NoSQL, DaaS and Neural Networking instill nothing more than the hesitant thought, “this sounds data-related.” It can be difficult to tell a ...
Top 10 Most in-demand Internet of Things skills
The Internet of Things (IoT) is in the midst of an explosion, as more connected devices proliferate. But there's not enough talent with the right skills to manage and execute on IoT projects. In fact, insufficient staffing and lack of ...
5 Enterprise Alternatives to Hadoop
Hadoop's progression from a large scale, batch oriented analytics tool to an ecosystem full of vendors, applications, tools and services has coincided with the rise of the big data market.
While Hadoop has become almost synonymous with the market in which ...
5 Steps for Securing Your Data In Hadoop
Data security remains a top concern for data professionals. To help organizations put up a best defense, Reiner Kappenberger, senior executive focused on big data and Hadoop at HPE Seecurity-Data Security offers five steps on how to best secure data ...
Data Science – The MUST KNOW to become a successful Data Scientist!
Data Science / Data Analytics / Business analytics is all about analyzing the data, which is getting generated through multiple sources. Sources range from traditional databases to satellite signals to sensors in Internet of Things, and the list will go ...
30 Coolest Big Data Business Analytics Vendors
Working with big data remains one of the biggest IT challenges that businesses, government agencies and other organizations face today. It's also one of the biggest opportunities for IT vendors and for solution and strategic service providers. The big data ...
Overview of HADOOP from its very Basic till a Career Path
That cute little yellow elephant that often pops up in your window is the symbol of a technology named as Hadoop. This is a java-based programming framework which helps in processing large sets of data. Today with the rise of ...
How is fault tolerance handled in Spark streaming?
Spark Streaming components
Data model
All data is modeled as RDDs, built by design with lineage of deterministic operations, i.e. any re-computation always leads to the same result. Essentially the same process (however with a different mechanism) as in Hadoop's fault-tolerance for ...






