what OCR algorithm does Microsoft Office Document Imaging use? Can I use it without installing office 2003?

Asked Apr 28 '11 at 14:38

Active May 11 '15 at 01:59

Viewed 853 times

Related to this question, I found a program, JOCR that has good results. It turns out it just uses Microsoft Office Document Imaging to do the OCR for it using COM. That works for me - I can spend some time trying to get it to work from Python (where I need it).

My questions are: what OCR algorithm do they use? Is it possible to get it (or one like it - tesseract and ocropus seem to only work with scanned images or pictures, but MODI OCR works great with on-screen text) elsewhere? Is it possible to install it on a computer without installing all of Office 2003?

edited May 23 '17 at 11:58

Community

asked Apr 28 '11 at 14:38

Claudiu

224,032
165
485
680

Office is propriatary and so is their OCR most likely. Have you tried looking into tesseract? Also I don't see how this question relates to python? – Jakob Guldberg Aaes Jan 24 '22 at 15:29

what OCR algorithm does Microsoft Office Document Imaging use? Can I use it without installing office 2003?

0 Answers0