EasyOCR not recognizing simple numbers

Question

I am trying to analyze a page footer in a video and retrieve the current page number. I got the frame collection working but I am struggling on reading the page number itself, using EasyOCR.

I already tried using pytesseract, but that doesnt work well. I have misinterpreted numbers: 10 gets recognized as 113, 6 as 41 and so on. Overall its very inconsistent, even though I format my input image correctly with grayscale, threshholding and cropping (only analyzing the pagenumber area of the footer).

Here is the code:

def getPageNumberTest(path, psm):
    image = cv2.imread(path)
    height = len(image)
    width = len(image[0])
    # the height of the footer
    footerHeight = 90 # int(height / 15.5)
    # retrieve only the footer from the image
    cropped = image[height-footerHeight:height,0:width]        

    results = reader.readtext(cropped)

Which gives me the following output:

Is there a setting I am missing? Is there a way to instruct EasyOCR to look for numbers only? Any help or hint is appreciated!

EDIT:

After some fiddling around with some optimizations of the number-images, I am now back to the beginning, not optimizing the images at all. All thats left is the conversion to gray-scale and a resize.

This is what a normal input looks like:

But the results are:

Which is weird, because for most numbers (especially for single digits) this works flawlessly, yielding certainties of over 95%...

I tried deblurring, threshholding, denoising with cv2.filter2D(), blurring,...

When I use threshholding, for example, my output looks like this (ignoring the "1", same applies for the single digit "1"):

I had a look into pattern matching, which isnt an option because I don't know the pagenumber shape beforehand...

Please add unmodified original image to your question and also try with higher resolution image if possible. — B200011011, Dec 08 '20 at 12:37
I edited my question. I cant get a higher resolution because the pages are taken from a video... — Leon Bohmann, Dec 09 '20 at 17:34
What is your kernel numbers, in the cv2, to have the last image? — Dr.jacky, Jan 21 '22 at 13:15

score 0 · Answer 1 · edited Jan 07 '22 at 00:27

0

txt = pytesseract.image_to_string(final_image, config='--psm 13 --oem 3 -c tessedit_char_whitelist=0123456789')

edited Jan 07 '22 at 00:27

Flair

2,609
1
29
41

answered Jan 04 '22 at 06:46

frank

1

1

Your answer could be improved with additional supporting information. Please [edit] to add further details, such as citations or documentation, so that others can confirm that your answer is correct. You can find more information on how to write good answers [in the help center](/help/how-to-answer). – Community Jan 04 '22 at 08:53

score 0 · Answer 2 · answered Aug 16 '22 at 12:35

0

According to my tests, PaddleOCR works better than easyOCR in most scenes.

answered Aug 16 '22 at 12:35

jun huang

1
1

Your answer could be improved with additional supporting information. Please [edit] to add further details, such as citations or documentation, so that others can confirm that your answer is correct. You can find more information on how to write good answers [in the help center](/help/how-to-answer). – Community Aug 19 '22 at 14:15

EasyOCR not recognizing simple numbers

2 Answers2