Questions tagged [kubeflow-katib]
4 questions
1
vote
0 answers
Getting file from GCP bucket in Katib
I am setting up a YAML template for Katib, which will follow the same optimizing script and process but with different inputs, so I would like to get the file from a bucket given the path to the file itself.
I used it on a different pipeline with…

Pedro Zaterka
- 21
- 4
1
vote
0 answers
How to prevent OOMKilled when running parallel KATIB trials with unbalanced resource requirements?
I've got very unbalanced (exponential) memory requirements for different Katib trials. When running smaller trials it is perfectly fine to run 16 in parallel on my 4 node cluster - but when the larger ones run they use up a lot of memory and I get…

Romeo Kienzler
- 3,373
- 3
- 36
- 58
0
votes
0 answers
Delete attempt of Kubernetes resource reports not found, even though it can be listed with "kubectl get"
I am running Kubeflow pipeline on a single node Rancher K3S cluster. Katib is deployed to create training jobs (Kind: TFJob) along with experiments (a CRD).
I can list the experiment resources with kubectl get experiments -n . However,…

Mark S
- 1
0
votes
1 answer
How to mount PVC to a (katib) Job specification?
I'd like to mount a PVC to a (katib) Job specification but can't find anything in the documentation nor any example?
I'm pretty sure that this should be possible as a Job is orchestrating pods and pods can do so. Or am I missing something?
Please…

Romeo Kienzler
- 3,373
- 3
- 36
- 58