I have a MapReduce job written in Python using the MRJob library. The job takes around 30 minutes to complete on my local machine. When I run the same job on EMR, I see no output for a long time (about 1 hour), and I eventually had to kill the job. Jobs that finish quickly on my local machine run fine on EMR. I tried increasing the timeout, but the task tracker still kills the reducer once the timeout expires. I get the following error message:
`Task attempt_201301181027_0001_r_000000_0 failed to report status for 3600 seconds.Killing!`
EMR creates 4 mappers and 1 reducer for my job. The reducer code I have written loops over the entire data set many times; the data set is around 11-12 MB. I expected the job to finish in about the same time as on my local machine, but it does not. What is the solution?