Splunk Extends Analysis To NoSQL Databases

Splunk Enterprise gets multisite support, improved visualization. Hunk adds analysis for Accumulo, Cassandra, MongoDB, and Neo4j. Splunk keeps rolling along, well ahead of an open-source threat that some thought might flatten it. The company last week sprinted ahead yet again, introducing ...

Snappy compression with Pig and native MapReduce

Assuming you have installed Hadoop on your cluster, if not please follow http://code.google.com/p/hadoop-snappy/ This is the machine config of my cluster nodes, though the steps that follow could be followed with your installation/machine configs pkommireddi@pkommireddi-wsl:/tools/hadoop/pig-0.9.1/lib$ uname -a Linux pkommireddi-wsl 2.6.32-37-generic #81-Ubuntu SMP Fri ...

Apache Falcon-Data Governance for Hadoop

Apache Falcon is a data governance engine that defines, schedules, and monitors data management policies. Falcon allows Hadoop administrators to centrally define their data pipelines, and then Falcon uses those definitions to auto-generate workflows in Apache Oozie. InMobi is one of ...

Avoiding Split Brainedness in HA Hadoop Clusters

The US Patent Office recently granted Zettaset a patent for the underlying technology in its Hadoop high availability that prevents a "split-brain" situation where multiple master nodes think they're in control of the Hadoop cluster. It's a feather in the ...

Integrating Hadoop into Business Intelligence and Data Warehousing

Information from SAS and TDWI Research The purpose of this report is to accelerate users’ understanding of the many new products and practices based on Hadoop technologies that have emerged in recent years. While Hadoop usage is a minority practice today, ...
1 3 4 5 6