Questions tagged [ocr]

Optical Character Recognition, usually abbreviated to OCR, is the mechanical or electronic translation of scanned images of handwritten, typewritten or printed text into machine-encoded text. The following topics, although some being distinct fields of application, are also commonly referred to as OCR: Handwritten Text Recognition (HTR), Optical Word Recognition (OWR), Intelligent Character Recognition (ICR), Intelligent Word Recognition (IWR).

Optical character recognition, usually abbreviated to OCR, is the mechanical or electronic translation of scanned images of handwritten, typewritten or printed text into machine-encoded text. It is widely used to convert books and documents into electronic files, to computerize a record-keeping system in an office, or to publish the text on a website.

OCR @Wikipedia

Frequently-asked questions:

Simple Digit Recognition OCR in OpenCV-Python

6124 questions

votes

1 answer

Preprocessing poorly scanned handwritten digits

I have a few thousands of PDF files containing B&W images (1bit) from digitalized paper forms. I'm trying to OCR some fields, but sometime the writing is too faint: I've just learned about morphological transforms. They are really cool!!! I feel…

python opencv ocr

asked Jul 10 '17 at 21:04

Paulo Scardine

73,447
11
124
153

votes

8 answers

Get correct image orientation by Google Cloud Vision api (TEXT_DETECTION)

I tried Google Cloud Vision api (TEXT_DETECTION) on 90 degrees rotated image. It still can return recognized text correctly. (see image below) That means the engine can recognize text even the image is 90, 180, 270 degrees rotated. However the…

ocr google-cloud-platform google-cloud-vision

asked Dec 22 '16 at 14:36

Jack Fan

2,143
3
18
25

votes

2 answers

Handwriting recognition API's for android applications

Is there any good Handwriting recognition API to help develop application on Android Platform? Google has released (around July 2012 http://www.google.com/insidesearch/features/search/handwritinginput/index.html) a feature to use handwriting for…

android ocr handwriting-recognition

asked Oct 25 '12 at 07:07

liramu

votes

3 answers

Convert Non-Searchable Pdf to Searchable Pdf in Windows Python

Need a solution to convert a PDF file where every page is image and a page can either contains text, table or combination of both to a searchable pdf. I have used ABBY FineReader Online which is doing the job perfectly well but I am looking for a…

python python-3.x pdf ocr

asked Aug 21 '18 at 12:57

Rahul Agarwal

4,034
7
27
51

votes

4 answers

What's the best way to ocr as much text as possible from video game screenshots?

I'm trying to use the tesseract ocr tool to extract ocr text from video games(I'm pre processing screenshots and passing them to command line tool tsv output and parsing that). I'd like to use it for test automation not unlike selenium web testing.…

python automated-tests ocr tesseract ui-automation

asked May 04 '18 at 07:49

Roman A. Taycher

18,619
19
86
141

votes

1 answer

chinese character recognition using Tesseract OCR

I have been using Tesseract 3.0.2 OCR SDK for image text extraction. But if I use Chinese text images and pass through OCR then Tesseract doesn't provide me the Chinese characters instead of that I am getting numeric and english characters. But I…

iphone ios ocr tesseract

asked May 16 '13 at 07:41

Nishant Tyagi

9,893
3
40
61

votes

5 answers

Select only specific parts of the image

I am completely new in "Optical character recognition" and need some more information and advices because I was not able to find is it possible to do what I need. I have the following task: An images are given as input - they will have the same…

php ocr

asked Dec 24 '12 at 13:47

gotqn

42,737
46
157
243

votes

3 answers

Stroke Width Transform (SWT) implementation (Python)

Can anyone describe how can i implement SWT in python using opencv or simplecv ?

python opencv computer-vision ocr simplecv

asked Jun 20 '12 at 09:06

pylover

7,670
8
51
73

votes

5 answers

Can I do a "string contains X" with a percentage accuracy in python?

I need to do some OCR on a large chunk of text and check if it contains a certain string but due to the inaccuracy of the OCR I need it to check if it contains something like a ~85% match for the string. For example I may OCR a chunk of text to make…

python string comparison ocr

asked Jun 01 '12 at 11:09

Jacxel

1,634
8
22
33

votes

3 answers

what's the best image input type for tesseract?

I'm using tesseract on a project and want to know the best image input type for tesseract to give the best output. Is Binary&TIFF the best input or there's something else?

image-processing ocr tesseract

asked Apr 17 '12 at 14:17

chostDevil

1,041
5
17
24

votes

5 answers

Scoreboard digit recognition using OpenCV

I am trying to extract numbers from a typical scoreboard that you would find at a high school gym. I have each number in a digital "alarm clock" font and have managed to perspective correct, threshold and extract a given digit from the video…

image-processing opencv computer-vision ocr

asked Nov 09 '11 at 19:16

pyromanfo

votes

5 answers

Batch OCR Program for PDFs

This has been asked before, but I don't really know if the answers help me. Here is my problem: I got a bunch of (10,000 or so) pdf files. Some were text files that were saved using adobe's print feature (so their text is perfect and I don't want…

pdf ocr batch-processing

asked May 17 '11 at 04:36

Aquat33nfan

votes

12 answers

(-215:Assertion failed) !_src.empty() in function 'cv::cvtColor' with cv::imread

I am trying to recognize text from an image to then have the text outputted; however, this error spits out: Traceback (most recent call last): File "C:/Users/Benji's Beast/AppData/Local/Programs/Python/Python37-32/imageDet.py", line 41, in…

python opencv ocr tesseract python-tesseract

asked Dec 26 '18 at 01:18

Benji

votes

2 answers

Can I use OCR to detect font style (bold, italic)?

I am interested in using OCR to extract bold and italic words from a simple text. For example, if I input a clear image with text like so: "The quick brown fox jumps over the lazy dog." I would like to get an output like so: bold("brown", "jumps"),…

ocr font-face tesseract

asked Mar 02 '11 at 04:17

vamin

2,178
6
26
30

votes

3 answers

Open source OCR

I'm looking for an open source OCR library that runs on Linux. I need this to work for PNGs and PDFs. Mostly I would like to interface this library from java or ruby. Any idea if there is anything available? Regards.

java ruby linux pdf ocr

asked Mar 01 '11 at 07:55

Chris

Prev 1 2 3

…

99 100 Next