I use the docker image from cloudera, but it seems the configuration not quite right. Because I do this:
hadoop jar /usr/lib/hadoop*/contrib/streaming/hadoop-streaming*cdh*.jar \
-mapper mapper -reducer reducer \
-file mapper -file reducer \
-input input -output output
I got this all the time:
18/03/14 02:34:33 INFO ipc.Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8032. Already tried 9 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
This is how I did prior to running the process above.
- Increase Docker memory into 8GB
- Start the container, by running this in the host
docker run -p 7180:7180 \
--hostname=quickstart.cloudera --privileged=true \
-t -i cloudera/quickstart:latest \
/usr/bin/docker-quickstart
Start the manager /home/cloudera/cloudera-manager --express
Open cloudera manager to start HDFS
- Upload sample input into HDFS