Generally, input scale matters, and converting to grayscale certainly matters. The details depend on the training data: if the training data contains the object at the same scale you use at test time, the change might not make a big difference; if not, it will. Deep learning models are mostly not invariant to changes in the input. CNNs show some invariance to translation, but that is about it. Rotation, scaling, color distortion, brightness, etc. all hurt performance if those variations were not part of the training data.
The paper https://arxiv.org/abs/2106.06057, published at IJCNN 2022, investigates classifiers on rotated and scaled images from simple datasets like MNIST (handwritten digits) and shows that performance deteriorates considerably. Other papers have shown the same thing.
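You can probe this yourself with a small experiment. Below is a minimal sketch (PyTorch/torchvision, assuming both are installed) that evaluates a classifier on the plain MNIST test set and then on rotated and rescaled versions of it. The `model` variable is a placeholder for any classifier you have already trained on standard upright MNIST; the specific rotation angle and scale factor are arbitrary choices for illustration. The exact numbers will vary, but you should see accuracy drop sharply on the transformed inputs, consistent with the paper's findings.

```python
# Sketch: measure how a trained MNIST classifier degrades under
# rotations and scalings it never saw during training.
# Assumes `model` is an already-trained classifier on upright,
# unscaled MNIST digits (training code not shown).
import torch
from torch.utils.data import DataLoader
from torchvision import datasets, transforms

def accuracy(model, transform, device="cpu"):
    """Top-1 accuracy on the MNIST test set under a given transform."""
    ds = datasets.MNIST("data", train=False, download=True, transform=transform)
    loader = DataLoader(ds, batch_size=256)
    model.eval()
    correct = 0
    with torch.no_grad():
        for x, y in loader:
            x, y = x.to(device), y.to(device)
            correct += (model(x).argmax(dim=1) == y).sum().item()
    return correct / len(ds)

base = transforms.ToTensor()                     # inputs as seen in training
rotated = transforms.Compose([
    transforms.RandomRotation((45, 45)),         # fixed 45-degree rotation
    transforms.ToTensor(),
])
scaled = transforms.Compose([
    transforms.RandomAffine(degrees=0, scale=(0.5, 0.5)),  # shrink to 50%
    transforms.ToTensor(),
])

# Example usage, given a trained `model`:
# for t, name in [(base, "original"), (rotated, "rotated"), (scaled, "scaled")]:
#     print(f"{name}: {accuracy(model, t):.3f}")
```

The flip side of this experiment is the standard remedy: if you add the same rotations and scalings as data augmentation during training, the gap largely closes, because the network then sees those variations as part of the data distribution.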