I trained a YOLOv7 model on a custom dataset and converted it to ONNX. The input of the model on Netron reads "Float32(1,3,640,640)" which I understand. The output, however, is unclear to me as other tutorials mentioned there should be 6 elements representing bounding box position and size (xywh objectness and class number) but this model outputs 7 elements with an extra 0 as follows:
Float32(concatoutput_dim_0,7)
The output (sample data):
0, 24.744838, 50, 24.744838, 70.46938, 1 ,1, 0, 40.495939, 30.95939, 40.495939, 123.2848439, 1, 1, ... (Each 7 values start with zero, and the second and fourth are almost always equal). Does the 0 mean (dimension 0)?
Successfully converted yolov7 to onnx, and pre-processed input image. Output is unclear as to why it has a 7th element.