I'm new to GCP so pardon for prehaps asking/missing something obvious here.
I'm trying to deploy and create a version resource on GCP with a custom pytorch model. Everything have been working fine until now until I try to create the a new version of the model. Then I keep getting: INVALID_ARGUMENT: Machine type is not available on this endpoint.
I've tried switching between different types from their list here without luck. What am I missing?
Here's the script I run to deploy:
MODEL_NAME='test_iris'
MODEL_VERSION='v1'
RUNTIME_VERSION='2.4'
MODEL_CLASS='model.PyTorchIrisClassifier'
PYTORCH_PACKAGE='gs://${BUCKET_NAME}/packages/torch-1.8.1+cpu-cp37-cp37m-linux_x86_64.whl'
DIST_PACKAGE='gs://${BUCKET_NAME}/models/Test_model-0.1.tar.gz'
GCS_MODEL_DIR='models/'
REGION="europe-west1"
# Creating model on AI platform
gcloud alpha ai-platform models create ${MODEL_NAME}\
--region=europe-west1 --enable-logging \
--enable-console-logging
gcloud beta ai-platform versions create ${MODEL_VERSION} --model=${MODEL_NAME} \
--origin=gs://${BUCKET_NAME}/${GCS_MODEL_DIR} \
--python-version=3.7 \
--machine-type=mls1-c4-m2\
--runtime-version=${RUNTIME_VERSION} \
--package-uris=${DIST_PACKAGE},${PYTORCH_PACKAGE} \
--prediction-class=${MODEL_CLASS}
Thanks!