Making Hadoop Simple On-premises and in the Cloud

An increasing number of enterprises are either currently using or are planning to use cloud deployment models to expand their IT infrastructure for big data. We find that organizations are looking for an open and flexible platform that enables them ...

Apache Flink: Possible replacement for Hadoop?

Earlier this week, Apache Software Foundation unveiled its latest Top Level Project (TLP), Flink. Flink is just one of the many data processing tools that have emerged since the Java-based distributed computing platform Hadoop has increased adoption in the enterprise space. Flink is ...

Hadoop predictions for 2015

Hadoop adoption and innovation is moving forward at a fast pace, playing a critical role in today's data economy. But, how fast and far will Hadoop go heading into 2015? Prediction 1: Hadooponomics makes enterprise adoption mandatory. The jury is in. Hadoop has been ...

120 Companies Hiring Hadoop Developers

There has been a consistent growth in the demand for people who know Big Data, Hadoop and related technologies. Demand is basically coming from two sectors, one is where data processing environment is already mature like Data warehousing, Data Integration ...

Data Science –What’s the big deal about it?

Thomas Davenport, an American academic and publisher for Harvard Business Review, once said that Data Scientist is “the Sexiest Job of the 21st Century”. But why is there such a big hype and mythos about Data Scientists and Data Science? The ...

Why Extended Attributes are Coming to HDFS

Extended attributes in HDFS will facilitate at-rest encryption for Project Rhino, but they have many other uses, too. Many mainstream Linux filesystems implement extended attributes, which let you associate metadata with a file or directory beyond common “fixed” attributes like filesize, ...

Can Super-Fast Apache Spark Light Up Hadoop?

it the Hadoop Swiss Army knife of cluster computing frameworks. The Apache Software Foundation just rolled out Apache Spark v1.0, which it's calling a "super-fast, open-source, large-scale Relevant Products/Services data Relevant Products/Services processing and advanced analytics Relevant Products/Services engine." That's a ...
1 7 8 9 10 11 12