We are working on a term project and we are using MNIST training set. Although our classifier works well on MNIST test set (>94% accuracy), its performance is significantly low on our prepared dataset. Details of our prepared dataset as follows:
- We created 28x28 images using paint program.
- The background of the images we prepared is black, digits are drawn with white (same as MNIST).
- When we check a MNIST image and our prepared image, they look really same.
Regarding the pixel values, we try different combinations and
- We map all pixel values from (0, 255) to (0,1) continuous.
- We map all pixel values from (0, 255) to (0, 1) binary where only digit pixels equal to 1.
The best score on our prepared dataset is approximately 70% whereas MNIST test set performance is always high (>94%). Moreover, classifier makes some very strange mistakes such as it predicts 3 --true digit is 0.
Anyone familiar with the MNIST? I think problem is related to pixel values, but I didn't figure out why it happens. When I use imshow
both of the images look exactly the same.