Questions tagged [ocr]

Optical Character Recognition, usually abbreviated to OCR, is the mechanical or electronic translation of scanned images of handwritten, typewritten or printed text into machine-encoded text. The following topics, although some being distinct fields of application, are also commonly referred to as OCR: Handwritten Text Recognition (HTR), Optical Word Recognition (OWR), Intelligent Character Recognition (ICR), Intelligent Word Recognition (IWR).

Optical character recognition, usually abbreviated to OCR, is the mechanical or electronic translation of scanned images of handwritten, typewritten or printed text into machine-encoded text. It is widely used to convert books and documents into electronic files, to computerize a record-keeping system in an office, or to publish the text on a website.

OCR @Wikipedia

Frequently-asked questions:

6124 questions
16
votes
7 answers

Need good OCR for printed source code listing, any ideas?

At my work, I sometimes have to take some printed source code and manually type the source code into a text editor. Do not ask why. Obviously typing it up takes a long time and always extra time to debug typing errors (oops missed a "$" sign…
Trevor Boyd Smith
  • 18,164
  • 32
  • 127
  • 177
16
votes
2 answers

free OCR processing API in PHP/jQuery/JavaScript

Is there any free PHP OCR API, that is well documented in PHP, jQuery or JavaScript? I'm programming a web app for iOS and I'd like to utilize the new photo upload API in iOS 6 safari to use for OCR image processing. Thanks a ton! :) (Edit: Oh and a…
user1935281
16
votes
2 answers

iOS Tesseract OCR Image Preperation

I would like to implement an OCR application that would recognize text from Photos. I succeeded in Compiling and Integration the Tesseract Engine in iOS, I succeeded in getting reasonable detection when photographing clear documents (or a photoshot…
alandalusi
  • 1,145
  • 4
  • 18
  • 39
16
votes
1 answer

Image processing for OCR with leptonica (inverse color text)

I am trying to process the following image with leptonica to extract text with tesseract. Original Image: Tesseract on the original image yields this: i s l D2J1FiiE-l191x1iitmwii9 uhiaiislz-2 Q ~37 Bottom linez With a little time! you can learn…
jasonlfunk
  • 5,159
  • 4
  • 29
  • 39
16
votes
1 answer

How to convert an image into character segments?

Often in the process of OCR, an image file is essentially cut into segments, and each character is recgnised as a segment each. For instance, has to be transformed to something like Also, is there any algorithm for Asian languages like Telugu…
tuxnani
  • 3,634
  • 6
  • 21
  • 33
15
votes
5 answers

How to extract a stroke from a Chinese character

I've been trying many times to create an algorithm to extract stroke information from Chinese characters. I've tried various methods but none was very satisfying, probably because of my limited knowledge of graphics algorithms in…
laurent
  • 88,262
  • 77
  • 290
  • 428
15
votes
1 answer

Difference between Tesseract 3 and Tesseract 4?

What are the major differences between Tesseract 3 and Tesseract 4 ? And why should I choose one over the other ?
F.Lin
  • 333
  • 1
  • 3
  • 12
15
votes
3 answers

Open-source OCR library for Arabic

I was looking around for an OCR library - optimally it would be open-source - that I could use on some Arabic pdfs. Googling it didn't result in anything useful. I was wondering if anyone knows a related OCR library or even one that works on related…
Mohammed
  • 2,693
  • 2
  • 17
  • 8
15
votes
6 answers

Google Mobile Vision library not downloading

I am trying to implement the Google Mobile Vision TextRecogniser API into my app, to read text off a given Image. When I try to use the feature, I get this error: W/DynamiteModule: Local module descriptor class for…
user7541761
15
votes
3 answers

OCR for known font

im searching for an OCR lib, that can be parameterized with a font, because I always know it and I believe the recognition results will be lots better this way. Does anyone know ?
Paul
  • 241
  • 3
  • 4
15
votes
1 answer

Why am I getting "tiff page 1 not found" Lebtonica warning in Tesseract?

I just started using Tesseract. I am following the instructions described here. I have created a test image like this: training/text2image --text=test.txt --outputbase=eng.Arial.exp0 --font='Arial' --fonts_dir=/usr/share/fonts Now I want to train…
15
votes
2 answers

Translate a picture to text on Android?

I know that google goggles will translate a picture to text. What I want to do is allow the user to take a picture (no problem), translate that picture to text(?) and then do some processing on the text (no problem). Is there any API on android that…
Keith
  • 1,119
  • 2
  • 12
  • 23
15
votes
2 answers

OCR algorithm improvement

I'm creating an OCR based on Java. My objective is to extract text from a video file (post-processing). It has been a difficult search, trying to find free, open-source OCR that works purely on Java. I found Tess4J to be the only popular option,…
metsburg
  • 2,021
  • 1
  • 20
  • 32
15
votes
3 answers

Image cleaning before OCR application

I have been experimenting with PyTesser for the past couple of hours and it is a really nice tool. Couple of things I noticed about the accuracy of PyTesser: File with icons, images and text - 5-10% accurate File with only text(images and icons…
zenCoder
  • 740
  • 4
  • 13
  • 31
15
votes
1 answer

Real TIme Image Processing (OCR)

I am planing to develop an app like Word Lens. Can any one suggest a good library that I can use? or any one explain technology behind the Word Lens App? is it reel time image matching or OCR? I know some image processing library like OpenCv,…
Ab'initio
  • 5,368
  • 4
  • 28
  • 40