I'm trying to build an object detection model based on Keras RetinaNet with a ResNet-152 backbone. I have followed every tutorial I could find that explains how to do it. So far I have trained the model down to a loss of 0.81, which I thought was an excellent result.
Here's an example of how I drew the bounding boxes in LabelImg:
However, when I ran prediction on new images (4 images), the results were really weird. Here is the result for each image:
Filename, boxes, scores, labels
Did I do something wrong when creating the dataset? I ask because I only tweaked a few things in the code:
- I commented out the "setup_gpu(gpu)" call
- I changed the default backbone from "resnet50" to "resnet152" in the argument section of train.py
For the code, I used https://github.com/fizyr/keras-retinanet
Edit:
These are examples of what I get when I run prediction on my own training data. Although one of the detections is correct, its confidence score is very low; it doesn't even reach 0.5.
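To make clear what I mean by a detection "not reaching 0.5", here is a minimal sketch of the kind of score filtering I have in mind. It is not the repository's code: `filter_detections` is a hypothetical helper, the numbers are made up for illustration, and the assumption that unused detection slots are padded with -1 is my understanding of the inference output, not something I have verified.

```python
from typing import List, Sequence, Tuple

Detection = Tuple[List[float], float, int]


def filter_detections(
    boxes: Sequence[Sequence[float]],
    scores: Sequence[float],
    labels: Sequence[int],
    score_threshold: float = 0.5,
) -> List[Detection]:
    """Keep only detections whose score is at least score_threshold.

    Entries with a negative label are treated as padding and skipped
    (assumption: unused detection slots are filled with -1).
    """
    kept: List[Detection] = []
    for box, score, label in zip(boxes, scores, labels):
        if label < 0:
            continue  # padding slot, not a real detection
        if score >= score_threshold:
            kept.append((list(box), float(score), int(label)))
    return kept


# Made-up values shaped like one image's output: boxes, scores, labels.
boxes = [[10, 20, 110, 220], [15, 25, 100, 200], [-1, -1, -1, -1]]
scores = [0.42, 0.08, -1.0]
labels = [0, 0, -1]

print(filter_detections(boxes, scores, labels, score_threshold=0.5))
print(filter_detections(boxes, scores, labels, score_threshold=0.3))
```

With a threshold of 0.5 nothing survives in this made-up case, which is the situation I am seeing: the box can be in the right place while the score stays too low to be reported.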