Getting a local test instance of Hadoop looks like a bit of a bear to configure, after consulting the following very clear, but still very complicated references:
- http://www.slideshare.net/ChicagoHUG/getting-started-with-r-hadoop-chug-20120815
- http://home.mit.bme.hu/~ikocsis/notes/2013/10/28/rhadoop-sandbox-with-the-cloudera-quickstart-vm/
- http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/
- https://hadoop.apache.org/docs/r1.2.1/single_node_setup.html
Are there recommended VMs that contain properly configured hadoop-streaming.jar and RHadoop?