Is there any OCR that can be trained for new symbols?

Question

Is there any free/open source OCR available that can be trained for new symbols and can also output the coordinates of symbol found in the target image? I have read that tesseract OCR can be trained, but can it give me coordinates after OCR? any example? I need the code/steps to train a ocr using an image that contains one sybmol. There are around 20 symbols each in one image to be trained. and then use the trained OCR to detect those sybmols in the target image and if found, then give coordinates too.

nguyenq · Accepted Answer · 2017-04-21T13:33:12.440

4

You can train Tesseract to recognize new symbols. The hocr format contains the coordinates of the recognized words.

https://github.com/tesseract-ocr/tesseract/wiki/Training-Tesseract

http://vietocr.sourceforge.net/training.html

https://github.com/tesseract-ocr/tesseract/wiki/Command-Line-Usage#hocr-output

edited Apr 21 '17 at 13:33

answered Jun 05 '11 at 03:14

nguyenq

8,212
1
16
16

Do you have any more detail on this? The links are old/dead. – Tiago Apr 17 '17 at 10:21

Is there any OCR that can be trained for new symbols?

1 Answers1