I am using stable diffusion code to train a diffusion model. Their paper mentioned FID scores assessed on 5000 samples using 100 DDIM steps. The git repository does not have the implementation for how they computed the FID score over 5000 samples. I plan on using the torch metrics FID to compute it. What is not clear to me how to do it for random set of 5000 images in pytorch lightning run? My question is how to report FID score metric for my training and validation epochs?
I have never used generative AI metrics before.