link: https://www.kaggle.com/c/diabetic-retinopathy-detection/discussion/15617
Github: https://github.com/sveitser/kaggle_diabetic
Hello, I am new to CNNs and recently I am studying this solution. The author drew a table of the networks his group designed. The units, filter and stride all make sense to me, but I just don't know what the "size" means. Is it more likely to mean the batch size or the image size?
I thought it should be image size at first, but there are two reasons it should not:
As they described in their report, they just cropped the original imgaes to 128x128,256x256,512x512 pixels and didn't do any other iamge preprocessing.
After reading their codes (from the Github link) , I found their setting for the InputLayer is:
(InputLayer, {'shape': (None, 3, cnf['w'], cnf['h'])}),
which confirms the description in their competition report.
Therefore, I think the input size should be 3x128x128, instead of 448.
Here are my questions:
1.If the input image size is not 448, what does 448 mean?
2.If it means batch size, why would they choose 448? and
3.why would they let the batch size decrease(basically /2) to 224 111 56 27 13 6 2 from the 1st layer to the 19 layer?