I've got a hadoop cluster that I'm doing data analytics on using Numpy, SciPy, and Pandas. I'd like to be able to submit my hadoop jobs as a zip/tar file using the '--file ' argument to a command. That zip file should have EVERYTHING that my python program needs to execute such that no matter what node my script executes on in the cluster, I won't run into an ImportError at runtime.
Due to company policies, installing these libraries on every node isn't exactly feasible, especially for exploratory/agile development. I do have pip and virtualenv installed to create sandboxes as needed though.
I've looked at zipimport and python packaging but none of that seems to fulfill my needs/I'm having difficulty using these tools.
Has anyone had any luck doing this? I can't seem to find any success stories online.
Thanks!