I'm currently writing my BA Thesis about Hardware and Frameworks for AI-Inference. In my research, I looked up TensorRT and find ab table which I don't really understand.
Sadly there is no real explanation for this except the title. I understand that there are different CNN models with different number of layers and that till to a certain point adding up Layers it will result in an increase of accuracy but to much layers also can result in errors.
But I don't understand how it is related to fp32 and int 8 and what this table is trying to tell me. It would be nice if someone could help me out here. Also I don't really know what they mean with "retraining".
Thanks for any answer