I am new to programming and have recently been trying to learn object detection using YOLOv5. I can perform the detection of a custom object, but I am struggling to record the frames in which the objects were detected.
My goal is to compare the frames detected by my model to the frames where the object is already annotated framewise. As in, I annotate the frames with my object for "X.mp4", using VIA 3.0(http://www.robots.ox.ac.uk/~vgg/software/via/demo/via_video_annotator.html) and when the same video is run through my model, it returns the frames with the objects, for me to compare.
Ideally, I would want my program to return the frames and the time (in mins and secs) of the video where objects were detected.
Apologies if my question is unclear and is very silly. As mentioned before, any help is appreciated.
Thanks