From the error, it is clear that the cluster is terminating after step 1.
This happens because the botocore package is deprecated.
A solution to this could be:
- Start a persistent cluster
- Use that cluster's ID to run the mrjob job on EMR
Commands:
mrjob create-cluster
Make sure you have configured the cluster settings in your mrjob.conf
file. The above command creates a persistent cluster and prints its cluster ID.
python3 MovieSimilarities.py -r emr --cluster-id "your-cluster-id" --items=ml-100k/u.item ml-100k/u.data > sims2t.txt
Replace "your-cluster-id" with the ID of the cluster you created so the job runs on that cluster.
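If you would rather not copy the cluster ID by hand, the two commands above can be chained from a small script. This is only a rough sketch, not a tested recipe: it assumes the mrjob command is on your PATH, that mrjob create-cluster prints the new cluster ID on stdout, and that the script and data paths match the ones used above. The function names (build_job_command, create_cluster, run_on_persistent_cluster) are made up for illustration.

```python
import subprocess
import sys


def build_job_command(cluster_id, script="MovieSimilarities.py",
                      items="ml-100k/u.item", data="ml-100k/u.data"):
    """Build the same job command as above, targeting an existing cluster."""
    return [
        sys.executable, script,
        "-r", "emr",
        "--cluster-id", cluster_id,
        f"--items={items}",
        data,
    ]


def create_cluster():
    """Run `mrjob create-cluster` and return the cluster ID it prints.

    Assumption: the ID is the only thing written to stdout
    (log messages are left on stderr and shown as usual).
    """
    result = subprocess.run(
        ["mrjob", "create-cluster"],
        check=True, stdout=subprocess.PIPE, text=True,
    )
    return result.stdout.strip()


def run_on_persistent_cluster(output_path="sims2t.txt"):
    """Create a persistent cluster, then run the job on it."""
    cluster_id = create_cluster()
    print(f"Using cluster {cluster_id}", file=sys.stderr)
    # Redirect the job's stdout to a file, like `> sims2t.txt` above.
    with open(output_path, "w") as out:
        subprocess.run(build_job_command(cluster_id), check=True, stdout=out)
    return cluster_id
```

Calling run_on_persistent_cluster() from the directory containing MovieSimilarities.py should do the same thing as running the two commands manually. Keep in mind that a persistent cluster stays up after the job finishes, so you need to terminate it yourself when you are done.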