I am currently implementing a multi-class semantic segmentation pipeline for my custom dataset. At the end of training/evaluation, I will get the masks for my classes/objects. From those output masks, I need to estimate the pose of the objects belonging to one particular class.
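
To make the question concrete: if "pose" only meant the 2D position and in-plane orientation of the object (an assumption for this sketch, it may not cover what I eventually need), a baseline can be computed directly from the predicted label map with NumPy. The names `mask_pose_2d`, `label_map`, and `class_id` are placeholders, not part of my actual pipeline.

```python
import numpy as np

def mask_pose_2d(label_map, class_id):
    """Return (centroid_xy, angle_rad) for the pixels labelled class_id, or None.

    Assumes a 2D integer label map (H x W) and that "pose" means the
    centroid plus the orientation of the principal axis, in image coordinates.
    """
    ys, xs = np.nonzero(label_map == class_id)
    if xs.size < 2:
        return None  # class absent, or too few pixels to define an axis

    pts = np.stack([xs, ys], axis=1).astype(np.float64)
    centroid = pts.mean(axis=0)

    # PCA on the pixel coordinates: the eigenvector with the largest
    # eigenvalue of the covariance matrix is the major axis of the mask.
    cov = np.cov((pts - centroid).T)
    eigvals, eigvecs = np.linalg.eigh(cov)  # eigenvalues in ascending order
    major = eigvecs[:, -1]

    # An axis has no direction, so fold the angle into [0, pi).
    angle = np.arctan2(major[1], major[0]) % np.pi
    return centroid, angle
```

This treats all pixels of the class as one object, so it would need connected-component or instance separation first if several objects of that class appear in one image.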
My plan for proceeding is to use either keypoint detection (similar to human pose estimation) or panoptic segmentation. My question is: am I on the right track? If so,
- Which would be the better option?
- Can you link an implementation you have come across for a similar problem?
Thanks in advance!