I'm trying to update a running EMR cluster with pip install on all the slave machines. How can I do that?
I can't do it with a bootstrap step because it is a long running EMR and I can't take it down.
The EMR cluster is running Spark & Yarn, so I would normally use spark slaves.sh, but I can't find that script on the master node. Is it installed in a place I haven't found? Or is there some way to install it?
I've seen other questions that say use yarn distributed-shell, but I can't find a working example of how to do that.
BTW, the cluster is using EMR 4.8.0, Spark 1.6.1, I believe.