I'm using MRJob on machine A
to launch MapReduce jobs on machines B_0
thru B_10
. The job has dependencies that require it to be run not with the default /bin/python
(i.e. the output of which python
on machine A
) but with /path/to/weird/python
, which exists on the B
's but not on A
.
How do I tell mrjob
to use /bin/python
locally to launch the job, but /path/to/weird/python
to run it on the B
's once it's in the Hadoop cluster?
The --interpreter
argument seems to determine the interpreter for both local and Hadoop. Is there another option to specify them individually?
Or is there some reason that the interpreter used must lie at the same path on both machines?