big-dataIndustry hype around big data continued in full force in 2014. And, vying for a piece of what IDC expects to be a $32.4 billion market by 2017, a number of big data startups either burst onto the scene or continued to fine-tune their strategies.

Ranging from companies whose big data solutions are helping doctors better treat cancer, to those whose platforms promise to drastically simplify big data search and analytics, here are 10 of the coolest big data startups of the year (so far).

1. SumAll

SumAll offers an online analytics platform that lets users visualize data from 42 (and counting) social media and e-commerce sites, including Facebook, Twitter, eBay and Instagram, in one intuitive, interactive chart. The company rolled out several stand-out new features this year, including an alert system for Twitter that pings users when they get, for instance, a certain number of retweets or mentions (good or bad).

Also cool about SumAll is the fact that it allocated 10 percent of its ownership to a non-profit, SumAll.org, which aims to use data analytics for “social good.”

2. Luminoso

CEO: Dr. Catherine Havasi

It’s been a big year so far for Luminoso, a Cambridge, Mass.-based startup specializing in text analytics.

As a spin-out of the Massachusetts Institute of Technology (MIT) Media Lab, Luminoso leverages what it calls “the world’s first cloud-based, massively multilingual, scalable solution” for understanding and analyzing text. The purpose of the platform, Luminoso says, is to turn big data into big insights, helping organizations understand how customers really feel about their company or product by deriving meaning from even the most indirect language or subtlest of hints.

Luminoso garnered a lot of attention this year when it was selected by Sony to fuel the tech giant’s “One Stadium Live,” an online portal compiling all World Cup-related social media content from Twitter, Facebook, and Google+. The startup in June also nabbed a $6.5 million in Series A funding.

3. Flatiron Health

Co-Founders: Nat Turner and Zach Weinberg

Founded in 2012 by former Google employees Nat Turner and Zach Weinberg, Flatiron Health is harnessing the power of big data analytics to help doctors better understand, and treat, one of the world’s most complex diseases: cancer.

Flatiron Health, based in New York, is developing OncologyCloud, an advanced data platform that is 100 percent focused on oncology. The idea is that the platform aggregates and transforms critical data from electronic medical records and billing systems in real time to provide a comprehensive view of each patient’s experience in the oncology office.

4. Domo

CEO: Josh James

Aiming to bring the right information, at the right time, to enterprise users’ fingertips, Domo offers a cloud-based platform designed to give users real-time access to data scattered throughout different sources via a single dashboard.

Founded in 2011, Domo says its platform can quickly derive structured and unstructured data from almost any source, whether a spreadsheet, a database, or a social media site.

5. Alpine Data Labs

Founded in 2010, San Francisco-based Alpine Data Labs offers a platform that lets users create analytical queries using a simple and familiar drag-and-drop approach. The platform works with both Hadoop-based data sources and traditional relational databases. It also has built-in collaboration features that let team members, no matter how far, work together on a single predictive analytics model.

Alpine Data Labs in November raised $16 million in Series B venture funding, bringing its total funding to $23.5 million.

6. Altiscale

Altiscale, formed in 2012 by former Yahoo CTO Raymie Stata, is a big data startup that prides itself on having developed the industry’s first cloud service to run Apache Hadoop, the open-source platform for managing big data.

7. Tamr

Another hot big data startup to emerge from MIT is Tamr, a Cambridge, Mass.-based company promising to dramatically reduce the time and effort required to connect and enrich data sources.

Tamr formally launched in the spring of this year and is backed by a number of investors, including Google Ventures and New Enterprise Associates. The Tamr platform leverages machine learning algorithms to identify data sources, understand the relationships between them, and then curate massive amounts of siloed data.

8. Cloudera

The Palo Alto, Calif.-based company has been busy so far in 2014. In addition to rolling out the latest version of its Hadoop platform — Cloudera Enterprise 5 – Cloudera in April bulked up its Connect Partner Program with new resources, including training and certification, for its more than 900 solution provider partners.

In June, Cloudera announced the closing of a $900 million round in funding, and named Kim Stevenson, corporate vice president and CIO at Intel, to its board of directors.

9. DataGravity

DataGravity describes itself as an early stage company with a mission to turn data into information. The Nashua, N.H.-based company, expected to launch its first product sometime this year, says it’s developing a technology that will transform stored data into easily digestible information without the need for complex software packages.

10. Elasticsearch

Elasticsearch has generated a lot of interest over the past year and a half with its open-source search solution that’s purpose-built for big data.

In what it’s dubbed the “most advanced search and analytics engine” on the plant, Elasticsearch’s platform is designed to quickly help users search through massive amounts of data, “scrub” that data clean, and then visualize that data through a number of analytics tools.

Elasticsearch says it’s seeing massive growth in the market. In June, when the company closed a $70 million funding round, Elasticsearch said the adoption of its three core products — Elasticsearch, Logstash and Kibana — has grown three-fold, climbing to more than 8 million total downloads. Source