I am submitting PySpark jobs to the cluster through Livy. Currently, the dependent Python packages such as NumPy, Pandas, and Keras are installed on every datanode. Can these packages instead be stored centrally in HDFS, and how would I configure Livy and PySpark to read them from HDFS rather than from the local installation on each datanode?
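For reference, this is roughly the kind of submission I have in mind (only a sketch, assuming the environment is packed into an archive with something like conda-pack and uploaded to HDFS; the Livy URL, HDFS paths, and archive/alias names below are placeholders, not my actual setup):

```python
# Sketch: submit a Livy batch that ships a packed Python environment from HDFS
# instead of relying on packages installed on each datanode.
# All URLs, paths, and names are placeholders.
import json
import requests

LIVY_URL = "http://livy-server:8998/batches"  # placeholder Livy endpoint

payload = {
    "file": "hdfs:///user/me/jobs/my_job.py",  # PySpark script to run
    # Packed environment (e.g. built with conda-pack) stored centrally in HDFS;
    # YARN unpacks the archive on each node under the alias after '#'.
    "archives": ["hdfs:///user/me/envs/pyspark_env.tar.gz#environment"],
    "conf": {
        # Point the driver and executors at the Python inside the unpacked archive
        "spark.yarn.appMasterEnv.PYSPARK_PYTHON": "./environment/bin/python",
        "spark.executorEnv.PYSPARK_PYTHON": "./environment/bin/python",
    },
    "name": "pyspark-hdfs-env-test",
}

resp = requests.post(
    LIVY_URL,
    data=json.dumps(payload),
    headers={"Content-Type": "application/json"},
)
print(resp.status_code, resp.json())
```

Is something along these lines the right direction, or is there a better way to make Livy/PySpark pick the dependencies up from HDFS?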
Did you find a solution to your question? – Bleser Jun 10 '19 at 07:01