Is there any open source OCR library written in .NET, or written in any language but can be used in an ASP.NET application? Or is there any open source OCR API available in the market for image to tabular formats ?
Asked
Active
Viewed 1.5k times
5 Answers
3
There is Tesseract and although it has no native .net bindings, it's pretty complete.
The "one to watch" is OCRopus - A Google-sponsored effort that isn't ready for production work yet.

Oli
- 235,628
- 64
- 220
- 299
-
1OCROpus can use the Tesseract engine, and does some layout analysis and other image pre-processing. – sventechie Dec 04 '09 at 19:16
1
Tesseract OCR was developed by HP and open sourced, I believe. I'm not sure how good the quality is.

paxdiablo
- 854,327
- 234
- 1,573
- 1,953
1
As the others are writing Tesseract is a good open source OCR implementation.
Eventhough it is good it is only an OCR engine. It does not do layout detection. If your input is not pure text you could have to figure out the layout yourself.

Morten Holdflod Møller
- 897
- 1
- 6
- 9