pytesseract: convert pictures of 7-segment numbers to text

Question

I'm trying to convert pictures like this: 7-segment into text with pytesseract:

7-segment

I tried different PSM modes and a whitelist with only 0123456789, but the best output of pytesseract was '5' instead of '125'.

Is there a way to configure pytesseract in way that it can convert my pictures? Or are there any extensions?

Thank you.

import pytesseract
from PIL import Image, ImageTk

img = Image.open('test.png')

text = pytesseract.image_to_string(img, config=("-c tessedit_char_whitelist=0123456789 --psm 7"))

print(text)

score 0 · Answer 1 · answered Nov 30 '21 at 06:54

0

Read and follow docs
Use letsgodigital as ocr "language"

answered Nov 30 '21 at 06:54

user898678

2,994
2
18
17

Unfortunatelly the letsgodigital language doesn‘t work for me. Many values are wrong. :( – plexx1337 Dec 01 '21 at 07:32
It works perfectly on the image you provided (well if you read docs too.) – user898678 Dec 01 '21 at 14:28

extragen · Answer 2 · 2023-06-30T19:38:54.673

0

here it is a better trained models
copy any of model or all inside your tesseract folder C:\Program Files\Tesseract-OCR\tessdata
configurate tesseract to use model -l ssd, txt = pytesseract.image_to_string(img, config="-l ssd")

edited Jun 30 '23 at 19:38

answered Jun 30 '23 at 19:16

extragen

153
1
7

pytesseract: convert pictures of 7-segment numbers to text

2 Answers2