Questions tagged [ocr]

Optical Character Recognition, usually abbreviated to OCR, is the mechanical or electronic translation of scanned images of handwritten, typewritten or printed text into machine-encoded text. The following topics, although some being distinct fields of application, are also commonly referred to as OCR: Handwritten Text Recognition (HTR), Optical Word Recognition (OWR), Intelligent Character Recognition (ICR), Intelligent Word Recognition (IWR).

Optical character recognition, usually abbreviated to OCR, is the mechanical or electronic translation of scanned images of handwritten, typewritten or printed text into machine-encoded text. It is widely used to convert books and documents into electronic files, to computerize a record-keeping system in an office, or to publish the text on a website.

OCR @Wikipedia

Frequently-asked questions:

6124 questions
19
votes
1 answer

Preprocessing poorly scanned handwritten digits

I have a few thousands of PDF files containing B&W images (1bit) from digitalized paper forms. I'm trying to OCR some fields, but sometime the writing is too faint: I've just learned about morphological transforms. They are really cool!!! I feel…
Paulo Scardine
  • 73,447
  • 11
  • 124
  • 153
19
votes
8 answers

Get correct image orientation by Google Cloud Vision api (TEXT_DETECTION)

I tried Google Cloud Vision api (TEXT_DETECTION) on 90 degrees rotated image. It still can return recognized text correctly. (see image below) That means the engine can recognize text even the image is 90, 180, 270 degrees rotated. However the…
Jack Fan
  • 2,143
  • 3
  • 18
  • 25
19
votes
2 answers

Handwriting recognition API's for android applications

Is there any good Handwriting recognition API to help develop application on Android Platform? Google has released (around July 2012 http://www.google.com/insidesearch/features/search/handwritinginput/index.html) a feature to use handwriting for…
liramu
  • 191
  • 1
  • 1
  • 4
18
votes
3 answers

Convert Non-Searchable Pdf to Searchable Pdf in Windows Python

Need a solution to convert a PDF file where every page is image and a page can either contains text, table or combination of both to a searchable pdf. I have used ABBY FineReader Online which is doing the job perfectly well but I am looking for a…
Rahul Agarwal
  • 4,034
  • 7
  • 27
  • 51
18
votes
4 answers

What's the best way to ocr as much text as possible from video game screenshots?

I'm trying to use the tesseract ocr tool to extract ocr text from video games(I'm pre processing screenshots and passing them to command line tool tsv output and parsing that). I'd like to use it for test automation not unlike selenium web testing.…
Roman A. Taycher
  • 18,619
  • 19
  • 86
  • 141
18
votes
1 answer

chinese character recognition using Tesseract OCR

I have been using Tesseract 3.0.2 OCR SDK for image text extraction. But if I use Chinese text images and pass through OCR then Tesseract doesn't provide me the Chinese characters instead of that I am getting numeric and english characters. But I…
Nishant Tyagi
  • 9,893
  • 3
  • 40
  • 61
18
votes
5 answers

Select only specific parts of the image

I am completely new in "Optical character recognition" and need some more information and advices because I was not able to find is it possible to do what I need. I have the following task: An images are given as input - they will have the same…
gotqn
  • 42,737
  • 46
  • 157
  • 243
18
votes
3 answers

Stroke Width Transform (SWT) implementation (Python)

Can anyone describe how can i implement SWT in python using opencv or simplecv ?
pylover
  • 7,670
  • 8
  • 51
  • 73
18
votes
5 answers

Can I do a "string contains X" with a percentage accuracy in python?

I need to do some OCR on a large chunk of text and check if it contains a certain string but due to the inaccuracy of the OCR I need it to check if it contains something like a ~85% match for the string. For example I may OCR a chunk of text to make…
Jacxel
  • 1,634
  • 8
  • 22
  • 33
18
votes
3 answers

what's the best image input type for tesseract?

I'm using tesseract on a project and want to know the best image input type for tesseract to give the best output. Is Binary&TIFF the best input or there's something else?
chostDevil
  • 1,041
  • 5
  • 17
  • 24
17
votes
5 answers

Scoreboard digit recognition using OpenCV

I am trying to extract numbers from a typical scoreboard that you would find at a high school gym. I have each number in a digital "alarm clock" font and have managed to perspective correct, threshold and extract a given digit from the video…
pyromanfo
  • 353
  • 2
  • 9
17
votes
5 answers

Batch OCR Program for PDFs

This has been asked before, but I don't really know if the answers help me. Here is my problem: I got a bunch of (10,000 or so) pdf files. Some were text files that were saved using adobe's print feature (so their text is perfect and I don't want…
Aquat33nfan
  • 195
  • 1
  • 1
  • 3
17
votes
12 answers

(-215:Assertion failed) !_src.empty() in function 'cv::cvtColor' with cv::imread

I am trying to recognize text from an image to then have the text outputted; however, this error spits out: Traceback (most recent call last): File "C:/Users/Benji's Beast/AppData/Local/Programs/Python/Python37-32/imageDet.py", line 41, in…
Benji
  • 197
  • 1
  • 1
  • 5
17
votes
2 answers

Can I use OCR to detect font style (bold, italic)?

I am interested in using OCR to extract bold and italic words from a simple text. For example, if I input a clear image with text like so: "The quick brown fox jumps over the lazy dog." I would like to get an output like so: bold("brown", "jumps"),…
vamin
  • 2,178
  • 6
  • 26
  • 30
17
votes
3 answers

Open source OCR

I'm looking for an open source OCR library that runs on Linux. I need this to work for PNGs and PDFs. Mostly I would like to interface this library from java or ruby. Any idea if there is anything available? Regards.
Chris
  • 171
  • 1
  • 1
  • 3