3

Is there a way to use pip install or something like that to install packages on an existing Dataproc cluster? Or will I need to re-create the cluster and set the packages in PIP_PACKAGES?

bad_coder
Danilo

1 Answer

0

There is no automatic way to install packages on all nodes of an existing cluster. Depending on your use case, you may have to SSH into the master and each worker machine and install the packages there by hand. It is usually easier to create a new cluster and let an initialization action install the packages (this is where PIP_PACKAGES comes in).
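To make the two options concrete, here is a rough sketch in Python that only builds the gcloud command lines and does not run anything. It assumes the public pip-install.sh initialization action in the regional goog-dataproc-initialization-actions bucket, the default node naming (`<cluster>-m` for the master, `<cluster>-w-N` for workers), and that `sudo pip install` targets the right Python on your image. The function names, cluster name, region, zone, and package list are placeholders I made up for illustration, so adjust them to your setup.

```python
import shlex


def create_cluster_command(cluster, region, packages):
    """Build a gcloud command that creates a NEW cluster with pip packages installed.

    Assumes the public pip-install.sh initialization action, which reads
    the space-separated PIP_PACKAGES metadata value.
    """
    init_action = (
        f"gs://goog-dataproc-initialization-actions-{region}/python/pip-install.sh"
    )
    return [
        "gcloud", "dataproc", "clusters", "create", cluster,
        "--region", region,
        "--initialization-actions", init_action,
        "--metadata", "PIP_PACKAGES=" + " ".join(packages),
    ]


def ssh_install_commands(cluster, zone, num_workers, packages):
    """Build one `gcloud compute ssh` command per node of an EXISTING cluster.

    Assumes default node names: <cluster>-m and <cluster>-w-0, -w-1, ...
    """
    nodes = [f"{cluster}-m"] + [f"{cluster}-w-{i}" for i in range(num_workers)]
    # Assumption: the system pip is the one your jobs use; this varies by image.
    remote = "sudo pip install " + " ".join(shlex.quote(p) for p in packages)
    return [
        ["gcloud", "compute", "ssh", node, "--zone", zone, "--command", remote]
        for node in nodes
    ]


if __name__ == "__main__":
    pkgs = ["pandas==1.0.5", "scipy"]  # placeholder package list
    print(shlex.join(create_cluster_command("demo", "us-central1", pkgs)))
    for cmd in ssh_install_commands("demo", "us-central1-a", 2, pkgs):
        print(shlex.join(cmd))
```

You can paste the printed commands into a shell, or pass each list to `subprocess.run(cmd, check=True)`. Keep in mind that with the SSH route, any node added later (for example by autoscaling) will not have the packages, which is one more reason the initialization action tends to be the safer choice.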

Gaurangi Saxena
  • I just want to add a documentation link https://cloud.google.com/dataproc/docs/tutorials/python-configuration – DenisOgr Oct 31 '20 at 22:51