My GPU quota has been increased on GCP and it shows as available, but for some reason cannot access it.
I tried various regions, also deleted the previous instance with the smaller quota within the same project. However, anytime I try the new quota it throws an error of exceeding the quota. I was able to recreate an instance with the previous, smaller quota.
Here is the code: gcloud compute instances create $INSTANCE_NAME --zone=$ZONE --image-family=$IMAGE_FAMILY --image-project=my-project --maintenance-policy=TERMINATE --accelerator="type=nvidia-tesla-p4,count=2” --machine-type=$INSTANCE_TYPE --boot-disk-size=200GB --metadata="install-nvidia-driver=True”
The quota can be also be requested via GCP Console as described in other post. This is not a question on how to apply for a quota increase, but rather why after getting Global GPU quota one cannot access it in any of the regions.