I am attempting to tune a Vertex AI language model. When I click 'Start Tuning', the page loads for about a second and then stops. I also receive an "Internal error occurred" message when attempting to train from the console.
After following Google's recommended troubleshooting tips by running this command:
PROJECT_ID=(I put my project id here)
curl \
-X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
https://europe-west4-aiplatform.googleapis.com/ui/projects/${PROJECT_ID}/locations/europe-west4/datasets \
-d '{
"display_name": "test-name1",
"metadata_schema_uri": "gs://google-cloud-aiplatform/schema/dataset/metadata/image_1.0.0.yaml",
"saved_queries": [{
"display_name": "saved_query_name",
"problem_type": "IMAGE_CLASSIFICATION_MULTI_LABEL"
}]
}'
and then trying again, tuning still doesn't work.
I then tried to start the tuning job from the command line instead:
PROJECT_ID=(I put my project id here)
DATASET_URI=(Here I put my valid dataset uri)
OUTPUT_DIR=(I put my output dir here)
curl \
-X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json; charset=utf-8" \
"https://europe-west4-aiplatform.googleapis.com/v1/projects/${PROJECT_ID}/locations/europe-west4/pipelineJobs?pipelineJobId=tune-large-model-$(date +%Y%m%d%H%M%S)" -d \
$'{
"displayName": "DisplayName",
"runtimeConfig": {
"gcsOutputDirectory": "'${OUTPUT_DIR}'",
"parameterValues": {
"project": "'${PROJECT_ID}'",
"model_display_name": "ModelName",
"dataset_uri": "'${DATASET_URI}'",
"location": "us-central1",
"large_model_reference": "text-bison@001",
"train_steps": 500
}
},
"templateUri": "https://us-kfp.pkg.dev/ml-pipeline/large-language-model-pipelines/tune-large-model/v1.0.0"
}'
This command returned the following output:
{
"error": {
"code": 500,
"message": "Internal error encountered.",
"status": "INTERNAL"
}
}
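To rule out shell quoting in the request body as a cause, the payload can be built with a heredoc and checked locally before it is sent. The following is only a minimal sketch: the project, bucket, and dataset values are made-up placeholders, `build_payload` is a hypothetical helper name, and it assumes `python3` is installed for the JSON check. It does not by itself explain the 500 response.

```shell
#!/usr/bin/env bash
# Placeholder values (assumptions for illustration, not real resources).
PROJECT_ID="my-project"
DATASET_URI="gs://my-bucket/train.jsonl"
OUTPUT_DIR="gs://my-bucket/output"

# Hypothetical helper: prints the request body. The unquoted heredoc lets the
# variables expand without closing and reopening the surrounding quotes.
build_payload() {
  cat <<EOF
{
  "displayName": "DisplayName",
  "runtimeConfig": {
    "gcsOutputDirectory": "${OUTPUT_DIR}",
    "parameterValues": {
      "project": "${PROJECT_ID}",
      "model_display_name": "ModelName",
      "dataset_uri": "${DATASET_URI}",
      "location": "us-central1",
      "large_model_reference": "text-bison@001",
      "train_steps": 500
    }
  },
  "templateUri": "https://us-kfp.pkg.dev/ml-pipeline/large-language-model-pipelines/tune-large-model/v1.0.0"
}
EOF
}

# Check locally that the body parses as JSON before sending anything.
if build_payload | python3 -m json.tool > /dev/null; then
  echo "payload is valid JSON"
else
  echo "payload is NOT valid JSON" >&2
fi
```

If the check passes, the body could be passed to the same curl request with `-d "$(build_payload)"` in place of the inline `$'{...}'` string.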