From your experience, what is the most accurate open-source Optical Character Recognition (OCR) library/software to read Japanese text?
I just tried nhocr, and its error rate is over 2% even on an extremely clean, high-definition document.
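For anyone who wants to compare engines on the same footing: the post does not say how the 2% figure was measured, so the following is only a minimal sketch of one common approach, the character error rate (edit distance between the OCR output and a hand-checked reference, divided by the reference length). The function names and sample strings are hypothetical, not part of nhocr or any other OCR package.

```python
def edit_distance(reference: str, hypothesis: str) -> int:
    """Levenshtein distance: minimum insertions, deletions and substitutions."""
    # prev[j] holds the distance between the reference prefix seen so far
    # and the first j characters of the hypothesis.
    prev = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        curr = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            cost = 0 if ref_char == hyp_char else 1
            curr.append(min(
                prev[j] + 1,         # deletion from the reference
                curr[j - 1] + 1,     # insertion into the reference
                prev[j - 1] + cost,  # substitution (or match)
            ))
        prev = curr
    return prev[-1]


def character_error_rate(reference: str, hypothesis: str) -> float:
    """Edit distance normalised by the length of the reference text."""
    if not reference:
        raise ValueError("reference text must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


if __name__ == "__main__":
    # Hypothetical sample: ground truth vs. OCR output with one wrong character.
    truth = "日本語のテキスト"
    ocr_output = "日本話のテキスト"
    print(f"CER: {character_error_rate(truth, ocr_output):.3f}")
```

Python strings are sequences of Unicode code points, so each kanji or kana counts as one character here, which is usually what you want for Japanese text.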
Based on the lack of answers, it sounds like nhocr IS the most accurate open-source OCR for Japanese.
I have had some R&D experience with ABBYY's solution, FineReader Engine. It was version 8.1 at the time, and I am not up to date with their newest revisions, but back then it was simply the best I could find for our handheld scanner product. I highly recommend it.
BTW, you can get a free end-user version of the ABBYY OCR package when purchasing a XEROX PE220 printer, which comes bundled with it. That printer was on my desk for several years. There must be other printers that ship with it too. Xerox was betting on their OCR as the best as well.