Training a classification model on one V100 GPU (Standard_NC6s_v3) on Azure ML take couples of days which is extremely slow. Has anybody experienced that before?
I suspect that the data loading might be the bottleneck. However, I tried to copy the data from Azure blob storage into compute cluster docker container. However, it didn't change anything.