How to become a Data Scientist for Free
By Nir Goldstein, ReSkill
Statistical analysis and data mining were the top skills that got people hired in 2014 based on LinkedIn analysis of 330 million LinkedIn member profiles. We live in an increasingly data driven world, and businesses are aggressively ...
6 sparkling features of Apache Spark!
What is Apache Spark? Why there is a serious buzz going-on about this? If you are into BigData analytics business then, should you really care about Spark? Hope this post will help to answer some of these questions which might have ...
Splunk’s Hunk 6.1enables faster analytics for Hadoop and NoSQL Data Stores
Splunk Inc. announced version 6.1 of Hunk: Splunk Analytics for Hadoop and NoSQL Data Stores. Hunk 6.1 makes it even faster and easier to turn raw, unstructured data in Hadoop and NoSQL data stores into business insights. Accelerated reports in ...
How to Run a Simple Apache Spark App in CDH 5
Getting started with Spark (now shipping inside CDH 5) is easy using this simple example.
Apache Spark is a general-purpose, cluster computing framework that, like MapReduce in Apache Hadoop, offers powerful abstractions for processing large datasets. For various reasons pertaining to ...
Apache Spark is now part of MapR’s Hadoop distribution
Hadoop vendor MapR is getting in early on the Apache Spark action, too, announcing on Thursday that it’s adding the Spark stack to its Hadoop distribution as part of a partnership with Spark startup Databricks (Ion Stoica, the co-founder and CEO of ...
How to use Cassandra CQL in your Java Application
Cassandra Query Language (CQL) is a SQL (Structured Query Language)-like language for querying Cassandra. Cassandra's data model is a partitioned row store with tunable consistency. Rows are organized into tables; the first component of a table's primary key is the ...
Why Apache Spark is a Crossover Hit for Data Scientists
Spark is a compelling multi-purpose platform for use cases that span investigative, as well as operational, analytics.
Data science is a broad church. I am a data scientist — or so I’ve been told — but what I do is actually ...
MariaDB-An Open Source Alternative and Replacement of MySQL
MySQL was the best open source database in the market before Sun Microsystems purchased it. After this, Oracle purchased Sun Microsystems and MySQL came in the hands of Oracle. After this purchase a fair number of MySQL users started looking ...
Why NoSQL became MORE SQL
One of the key points I raised was about how many folks were just slapping on Big Data badges to the same old same old, another was that Map Reduce really doesn't work they way traditional IT estates behave which ...
Integrating Hadoop into Business Intelligence and Data Warehousing
Information from SAS and TDWI Research
The purpose of this report is to accelerate users’ understanding of the many new products and practices based on Hadoop technologies that have emerged in recent years. While Hadoop usage is a minority practice today, ...






