Questions tagged [google-cloud-ai-platform-pipelines]

42 questions
15
votes
3 answers

Kubeflow vs Vertex AI Pipelines

I was exploring Kubeflow Pipelines and Vertex AI Pipelines. From what I understand, Vertex AI Pipelines is a managed version of Kubeflow Pipelines, so one doesn't need to deploy a full-fledged Kubeflow instance. In that respect, pricing aside, Vertex…
4
votes
1 answer

How to schedule repeated runs of a custom training job in Vertex AI

I have packaged my training code as a Python package and am able to run it as a custom training job on Vertex AI. Now I want to schedule this job to run, say, every 2 weeks and re-train the model. The Scheduling settings in the…
3
votes
1 answer

Autoscaling VertexAI pipeline components

I am exploring VertexAI pipelines and understand that it is a managed alternative to, say, AI Platform pipelines (where you have to deploy a GKE cluster to be able to run Kubeflow pipelines). What I am not clear on is whether VertexAI will autoscale…
3
votes
0 answers

Minimal permissions to submit a kubeflow pipeline

Issue: I am trying to figure out what the minimum permissions are to run the following code: pipeline = Pipeline(...) client = kfp.Client(host=...) client.create_run_from_pipeline_func( pipeline.build(), …
sixtyfootersdude
3
votes
2 answers

Connecting to AI Platform Pipelines using the Kubeflow Pipelines SDK with Python and PyCharm

Using Windows 10 and Python 3.9 with the PyCharm IDE, I am trying to list the pipelines using the following code: import kfp client = kfp.Client(host='.......bb92ea05-dot-us-central2.pipelines.googleusercontent.com') client.list_pipelines() I know…
3
votes
1 answer

GCP AI Platform - Pipelines - Clusters - Does not have minimum availability

I can't create pipelines. I can't even load the samples / tutorials on the AI Platform Pipelines Dashboard because it doesn't seem to be able to proxy to whatever it needs to. An error occurred Error occured while trying to proxy to: ... I looked…
Sterls
2
votes
0 answers

Internal error on batch prediction job. Reproduction conditions are unidentified

I trained a tabular forecasting model and ran batch prediction jobs with Vertex AI. Sometimes the following error occurred 30-60 minutes after the jobs started: Internal error occurred. Please retry in a few minutes. If you still…
2
votes
1 answer

Google cloud object detection model training error

I have a problem training a computer vision model in Google Cloud, and I am sure that the problem is related to the GPU. I know that Google says that by default you have 1 GPU, but the training fails with this error message: "The request for 8 K80 accelerators…
2
votes
1 answer

Unable to view Vertex AI pipeline node logs

I created a Vertex AI pipeline to perform a simple ML flow of creating a dataset, training a model on it, and then predicting on the test set. There is a Python function-based component (train-logistic-model) where I train the model. However, in the…
2
votes
2 answers

Specify machine type for a single TFX pipeline component in Vertex AI

I'm using TFX to build an AI Pipeline on Vertex AI. I've followed this tutorial to get started, then I adapted the pipeline to my own data which has over 100M rows of time series data. A couple of my components get killed midway because of memory…
Simon Corcos
1
vote
2 answers

AI Platform Pipeline 2.0.0 deployment not visible on GCP Console

On GCP, I created a new Kubernetes 1.26 cluster and deployed Kubeflow Pipelines 2.0.0 on it. The deployment went fine, but on the AI Platform Pipeline page I don't see the new deployment. Due to this, I am unable to open the newly deployed 2.0.0 kfp…
1
vote
1 answer

Scheduling Vertex AI Pipeline - Error 503

I successfully trained and deployed a pipeline in Vertex AI using Kubeflow for a retrieval model. Now I want to schedule this pipeline to run every 8 minutes. Here's my code: from kfp.v2.google.client import AIPlatformClient api_client =…
1
vote
0 answers

Kubeflow pipeline fails in GCP - using cluster with Kubeflow pipeline integration

I am using Kubeflow v2 to compile my script and uploading the resulting YAML file to Kubeflow. The runs are not succeeding and give me the error below: FileNotFoundError: [Errno 2] No such file or directory:…
1
vote
0 answers

How do you specify multiple values for the inputTensorName key in INPUTMETADATA spec in Vertex Explainable AI for a Functional API model?

I want to add explanation to my model running in Vertex AI using the Vertex AI SDK. I get a silent error when running the batch prediction using ModelBatchPredictOp, where the ModelBatchPredictOp node runs infinitely. Here is my ModelBatchPredictOp…
1
vote
1 answer

Error when trying to use CustomPythonPackageTrainingJobRunOp in VertexAI pipeline

I am using the Google Cloud pipeline component CustomPythonPackageTrainingJobRunOp in a VertexAI pipeline. I have been able to run this package successfully as a CustomTrainingJob before. I can see multiple (11) error messages in the logs but the…