Questions tagged [google-cloud-vertex-ai]

Usage questions relating to Google Cloud Platform's Vertex AI: https://cloud.google.com/vertex-ai/docs

692 questions
2
votes
0 answers

Cannot use tensorboard with Vertex AI Custom job

I'm trying to launch a custom training job using Vertex AI through XManager. When running custom jobs with TensorBoard enabled, I get a TensorBoard instance under Experiments -> TensorBoard instances and a button on the custom job page that says OPEN…
2
votes
0 answers

Internal error on batch prediction job. Reproduction conditions are unidentified

I trained a tabular forecasting model and ran batch prediction jobs with Vertex AI. Sometimes the following error occurred 30-60 minutes after the jobs started: Internal error occurred. Please retry in a few minutes. If you still…
2
votes
2 answers

connect vertex ai endpoint through .Net

Is there any way to connect to a Google Cloud Platform Vertex AI endpoint through .NET code? I am new to GCP Vertex AI. Any help is really appreciated.
tt0206
2
votes
1 answer

Vertex AI 504 Errors in batch job - How to fix/troubleshoot

We have a Vertex AI model that takes a relatively long time to return a prediction. When hitting the model endpoint with one instance, things work fine. But batch jobs of, say, 1000 instances end up with around 150 of them returning 504 errors (upstream request…
2
votes
1 answer

Run a Vertex AI two tower model locally

I have successfully trained a Two Tower model on Google Vertex AI as per the guide here. I would now like to download the model and try some inference locally on my own machine. I have been battling with various errors for a while and am now stuck…
2
votes
0 answers

Vertex AI Pipeline is not using the GPU

I am building a customized pipeline with the following step: trainer_task = (trainer(download_task.output).set_cpu_request("16").set_memory_request("60G"). add_node_selector_constraint('cloud.google.com/gke-accelerator',…
2
votes
2 answers

Vertex AI batch prediction location

When I start a batch prediction job on Google Cloud's Vertex AI, I have to specify a Cloud Storage bucket location. If I provide the bucket location 'my_bucket/prediction/', the prediction files are stored in something like:…
2
votes
1 answer

Specifying Machine Type in Vertex AI Pipeline

I have a pipeline component defined like this: data_task = run_ssd_data_op( labels_path=input_labels, data_config=config_task.outputs["output_data_config"], training_config=config_task.outputs["output_training_config"], …
James
2
votes
1 answer

How can I pass parameters to a Vertex AI Platform Pipeline?

I have created a Vertex AI pipeline similar to this. The pipeline has a reference to a CSV file, so if this CSV file changes, the pipeline needs to be recreated. Is there any way to pass a new CSV as a parameter to the pipeline when it is re-run?…
schoon
2
votes
1 answer

Google Vertex AI image AutoML classification when an important image feature is text inside the image

I'd like to do image classification. In my dataset, although image features (colors, shapes, etc.) are a strong component of this classification, some categories of images will be hard to distinguish without interpreting the text inside…
2
votes
1 answer

Google Cloud Vertex AI Notebook Scheduled Runs Aren't Running Code?

I've followed their instructions on how to set up a managed Jupyter notebook and schedule a run, and I tossed in some pretty standard parameters and my bucket. After setting up the schedule, however, the run just comes out as "Failed", and when I…
2
votes
0 answers

Use a model trained by Google Cloud Vertex AI accelerated with TRT on Jetson Nano

I am trying to standardize our deployment workflow for machine vision systems, and we were thinking of the following workflow. To create a prototype, we followed that workflow. So, there is no problem…
2
votes
2 answers

Why is my GCP Vertex pipeline api_endpoint not right?

My API_ENDPOINT is set to europe-west1-aiplatform.googleapis.com. I define a pipeline: def pipeline(project: str = PROJECT_ID, region: str = REGION, api_endpoint: str = API_ENDPOINT): When I run it: job =…
2
votes
1 answer

Vertex AI Endpoints scales to 0 before increasing number of replicas

I have an endpoint in us-east which serves a custom imported model (docker image). This endpoint uses min replicas = 1 and max replicas = 100. Sometimes, Vertex AI will require the model to scale from 1 to 2. However, there seems to be an issue…
2
votes
1 answer

Is it possible to output a list of artifacts of the same type using kubeflow pipelines?

I'm trying to output multiple confusion matrices from a kubeflow pipeline component, like in this example that only outputs one: def eval_model( test_set: Input[Dataset], xgb_model: Input[Model], metrics: Output[ClassificationMetrics], …