I have downloaded the graphframes
package (from here) and saved it on my local disk. Now, I would like to use it. So, I use the following command:
IPYTHON_OPTS="notebook --no-browser" pyspark --num-executors=4 --name gorelikboris_notebook_1 --py-files ~/temp/graphframes-0.1.0-spark1.5.jar --jars ~/temp/graphframes-0.1.0-spark1.5.jar --packages graphframes:graphframes:0.1.0-spark1.5
All the pyspark functionality works as expected, except for the new graphframes
package: whenever I try to import graphframes
, I get an ImportError
. When I examine sys.path
, I can see the following two paths:
/tmp/spark-1eXXX/userFiles-9XXX/graphframes_graphframes-0.1.0-spark1.5.jar
and /tmp/spark-1eXXX/userFiles-9XXX/graphframes-0.1.0-spark1.5.jar
, however these files don't exist. Moreover, the /tmp/spark-1eXXX/userFiles-9XXX/
directory is empty.
What am I missing?