I'm training YOLO on custom dataset(using Alexey AB implementation of Darknet). It has 3 classes of images where class 1 has 45k images, and remaining two have around 1k images.
After training it for 6k iterations, the loss is in between 1.5 and 2. However, when i tried running it on a video, it is only detecting class 1.
I would like to know what is the reason for this, is it because of the imbalance in the no of images in the dataset? Is there a way to solve this problem?