1

Is there any example / documentation which shows how to implement skflow(tf.contrib.learn) in multi-machine and multi-GPUs environment?

I want to implement KMeansClustring using skflow(tf.contrib.learn) in a multi-machine and multi-GPUs environment. How should I proceed?

Rahul
  • 645
  • 1
  • 9
  • 21

1 Answers1

1

Ideally you'd be running a cluster manager (Kubernetes, Mesos), otherwise setting up multi-machine training is rather difficult.

For a core TensorFlow example, see here (under the docker/kubernetes directories): https://github.com/tensorflow/ecosystem

We'll update the examples there to use both Estimator and multi-GPU setups soon.