Questions tagged [pytesser]

PyTesser is an Optical Character Recognition module for Python. It takes as input an image or image file and outputs a string.

PyTesser is an Optical Character Recognition module for Python. It takes as input an image or image file and outputs a string.

PyTesser uses the Tesseract OCR engine, converting images to an accepted format and calling the Tesseract executable as an external script. A Windows executable is provided along with the Python scripts. The scripts should work in other operating systems as well.

http://code.google.com/p/pytesser/

105 questions
1
vote
2 answers

python import : ModuleNotFoundError: No module named 'pytesser'

Hi I am new to python and OCR. I am trying to implement OCR. I have installed all the dependencies which are required for OCR. Also important thing is I am using Python provided by Anaconda custom. So when I tried to import pytesser it shows me…
nilkash
  • 7,408
  • 32
  • 99
  • 176
1
vote
1 answer

How to install pytesser on Debian?

I want to use image_to_string in my python script for reading numbers from image. i import everything from pytesser from pytesser import * i read this question and i installed pytesseract pip install pytesseract and tesseract-ocr apt-get…
parik
  • 2,313
  • 12
  • 39
  • 67
1
vote
2 answers

Not Found Error in Pytesser3

import pytesser3 import Image a = Image.open("C:\\Users\\dell\\Desktop\\test.png") b = pytesser3.image_to_string(a) print(b) I tried to make a simple OCR program but whenever I run it I get the following error. I have even uninstalled pytesser3…
antisycop
  • 93
  • 2
  • 10
1
vote
1 answer

Trying Tesseract on Windows CMD

I'm having trouble using Tesseract-OCR with the pytesseract Python wrapper. I figured that the problem might come from Tesseract itself, not from the wrapper. So I tried Tesseract in CMD : C:\Users\Thomas\Desktop>tesseract.exe 'blabla.jpg'…
1
vote
0 answers

Localise and recognise text in image file and save them into text file or .xls file

I have an image file which contains characters and numbers in tabular form and I want to write code in python by which, The content of the file can be recognized and save it into a text file in the same order as it is an image file. is like this.…
vishal gupta
  • 31
  • 1
  • 7
1
vote
2 answers

Pytesseract doesn't accept pyautogui screenshot, Windows, Python 3.6

What I'm trying to do is to make a screenshot of a number with pyautogui and tranform the number to a string with pytesseract. The code: import pyautogui import time import PIL from PIL import Image import…
Rubeus
  • 11
  • 1
  • 4
1
vote
1 answer

HEROKU Error opening data file /app/vendor/tesseract-ocr/tessdata/eng.traineddata

I have a Django app which is deployed in Heroku. I'm trying to read text from image using pytesseract .I can run this app in localhost without problem but in heroku its showing an error Error opening data file…
user7500581
1
vote
0 answers

How can I hide the console window when I run tesseract with pytesser

I'm new on Python. I'm working in an OCR project. I'm using Python 2.7.12 on Windows 7. I have installed tesseract in the path "C:\Program Files (x86)\Tesseract-OCR" I found the pytesser library v0.0.2 here:…
Alex Ortega
  • 45
  • 11
1
vote
1 answer

PyTesseract not working

After installing PyTesseract, I went into my Mac's terminal to make sure it works. I got it to output into a text file perfectly find, but after running this code in Enthought Canopy: import pytesseract from PIL import Image print…
Nelson
  • 922
  • 1
  • 9
  • 23
1
vote
0 answers

text extraction failing from image pytesser

Issue with text extraction. I am new at using image extraction in python. from pytesser import * import Image file = 'C:\\Image1.bmp' image = Image.open(file) # Open image object using PIL print "1",image_to_string(image) # Run tesseract.exe…
smi
  • 31
  • 5
1
vote
1 answer

Installing pytesser on Ubuntu 14.04

I want to use pytesser OCR and I would like to make same changes in the system so that I can import it as a module from anywhere in the system. I tried using advice given on (Installing pytesser) but It's not working for me.
Balraj Parmar
  • 13
  • 1
  • 4
1
vote
0 answers

image_to_string() causes WindowsError

When I try to perform ocr on an image to get a string using Pytesseract: import pytesseract import Image im = Image.open(str('C:\\1host.jpg')) im.load() print pytesseract.image_to_string(im) The following error occurs: D:\Python27\python.exe…
1
vote
1 answer

Getting started with Python OCR on windows?

I have never used python before, and I am not sure where to start. My goal is to take image data, of numbers and multicolored background, and reliably get the correct characters identified. I looked into the tools necessary for this and I found the…
1
vote
1 answer

pytesser - next line of text in image?

I'm using pytesser on simple images with plain text. It works great! However, in python it prints each line of text on a new line. But the string it outputs has no "\n" or new line delimiters that I can pull out. How does it print each new line of…
Ciaran
  • 478
  • 1
  • 8
  • 23
1
vote
2 answers

Error using pytesseract

I am using pytesseract to convert images to text. I successfully installed pytesseract with pip command. But when i run the script, it shows me error : No module named Tesseract. These are my codes : from tesseract import image_to_string image =…
Jay Mehta
  • 1,511
  • 15
  • 20