How to become a Data Scientist for Free

By Nir Goldstein, ReSkill Statistical analysis and data mining were the top skills that got people hired in 2014 based on LinkedIn analysis of 330 million LinkedIn member profiles. We live in an increasingly data driven world, and businesses are aggressively ...

Splunk’s Hunk 6.1enables faster analytics for Hadoop and NoSQL Data Stores

Splunk Inc. announced  version 6.1 of Hunk: Splunk Analytics for Hadoop and NoSQL Data Stores. Hunk 6.1 makes it even faster and easier to turn raw, unstructured data in Hadoop and NoSQL data stores into business insights. Accelerated reports in ...

How to Run a Simple Apache Spark App in CDH 5

Getting started with Spark (now shipping inside CDH 5) is easy using this simple example. Apache Spark is a general-purpose, cluster computing framework that, like MapReduce in Apache Hadoop, offers powerful abstractions for processing large datasets. For various reasons pertaining to ...

Apache Spark is now part of MapR’s Hadoop distribution

Hadoop vendor MapR is getting in early on the Apache Spark action, too, announcing on Thursday that it’s adding the Spark stack to its Hadoop distribution as part of a partnership with Spark startup Databricks (Ion Stoica, the co-founder and CEO of ...

Integrating Hadoop into Business Intelligence and Data Warehousing

Information from SAS and TDWI Research The purpose of this report is to accelerate users’ understanding of the many new products and practices based on Hadoop technologies that have emerged in recent years. While Hadoop usage is a minority practice today, ...