Tesseract image to text

Question

I use this code:

Bitmap image = new Bitmap(Application.StartupPath + "\\" + "1111.jpg");
tessnet2.Tesseract ocr = new tessnet2.Tesseract();
// ocr.SetVariable("tessedit_char_whitelist", "0123456789"); // If digit only
ocr.Init(null, "eng", false); // To use correct tessdata
List<tessnet2.Word> result = ocr.DoOCR(image, Rectangle.Empty);
foreach (tessnet2.Word word in result)
    MessageBox.Show(String.Format("{0} : {1}", word.Confidence, word.Text));

But answer is: 100: ~ This load image:

enter image description here

Why answer is "100: ~" ?

As far as I've experienced `Tesseract` sucks fairly seriously, I would not use such an unreliable library. — King King, Dec 09 '13 at 12:46
Before passing your image to tesseract, try to binarize it first, it helps a lot — OuSs, Feb 17 '14 at 13:43
@KingKing Tesseract is very stupid about preprocessing of the input image, but works pretty well if the image is adjusted first. It needs to be rotated to horizontal, blown up in resolution, and have low-frequency brightness changes removed before being run through the engine. — endolith, May 24 '14 at 14:33

score 1 · Answer 1 · answered Feb 21 '14 at 08:02

1

The size of your text is too small. So the result is "~" for the scanned text. If you use a bigger font (like 14 or 16) tessnet2 work really good.

answered Feb 21 '14 at 08:02

a.h

11
1

Tesseract image to text

1 Answers1