How to build a single-node Hadoop/R system

RDataMining.com recently published the tutorial “Building an R Hadoop System”, with step-by-step procedures for installing Hadoop, R, and RHadoop (including the rmr2 package) on a standard Mac system. (The same procedures will likely work on any Linux-based system as well, with minor tweaks.) Because Hadoop is configured in standalone mode on a single machine, you don’t have to worry about inter-node communication or distributing software across the nodes of a multi-node cluster. The whole setup takes about 30 minutes, after which you can start on the MapReduce in R tutorial from the Revolution Analytics GitHub repository.
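If you want a feel for what a single-machine MapReduce job does before installing anything, here is a minimal conceptual sketch in plain Python. It is not Hadoop's or rmr2's API: the names `map_reduce`, `word_count_mapper`, and `word_count_reducer` are hypothetical, chosen only for illustration. It simply runs the map, shuffle, and reduce phases in one process, which is roughly the situation a standalone setup puts you in (no network, no cluster).

```python
from collections import defaultdict


def map_reduce(records, mapper, reducer):
    """Run map, shuffle, and reduce in a single process (illustrative only)."""
    groups = defaultdict(list)
    # Map phase: each input record yields zero or more (key, value) pairs.
    for record in records:
        for key, value in mapper(record):
            # Shuffle phase: collect all values that share a key.
            groups[key].append(value)
    # Reduce phase: collapse each key's list of values into one result.
    return {key: reducer(key, values) for key, values in groups.items()}


def word_count_mapper(line):
    # Emit (word, 1) for every whitespace-separated word in the line.
    for word in line.lower().split():
        yield word, 1


def word_count_reducer(word, counts):
    # Sum the 1s emitted for this word.
    return sum(counts)


lines = ["hadoop and r", "r and rhadoop"]
counts = map_reduce(lines, word_count_mapper, word_count_reducer)
print(counts)
```

On a real cluster the shuffle step moves data between machines; in a single-node setup everything stays local, which is why the installation is so much simpler.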























