Questions tagged [pytesser]

PyTesser is an Optical Character Recognition module for Python. It takes as input an image or image file and outputs a string.

PyTesser is an Optical Character Recognition module for Python. It takes as input an image or image file and outputs a string.

PyTesser uses the Tesseract OCR engine, converting images to an accepted format and calling the Tesseract executable as an external script. A Windows executable is provided along with the Python scripts. The scripts should work in other operating systems as well.

http://code.google.com/p/pytesser/

105 questions
0
votes
1 answer

image to text in python

from PIL import Image import pytesseract import cv2 import cv2 img= cv2.imread('D:/ss.png') pytesseract.pytesseract.tesseract_cmd="C:\Program Files (x86)\Tesseract- OCR\tesseract.exe' tex =…
RAVI RK
  • 5
  • 3
0
votes
0 answers

Error in pytesseract importing

Getting following error in pytesseract importing. Code: from pytesseract import image_to_string from PIL import Image tessdata_dir_config='--tessdata-dir "C:\\Program Files (x86)\\Tesseract-OCR\\tessdata"' im = Image.open(r'E:\\mini…
0
votes
0 answers

how to increase the processing speed?

I am working on a small project that is reading text from images. It is taking almost 10 hrs for 20k images now, running on Ubuntu 14.0 Below is a part of the code that is taking quite a lot of time. Please advice /help in increasing the speed or…
Shri
  • 89
  • 2
  • 8
0
votes
2 answers

How to increase likeliness of image recognition with pytesseract

I'm trying to convert this list of images I have to text. The images are fairly small but VERY readable (15x160, with only grey text and a white background) I can't seem to get pytesseract to read the image properly. I tried to increase the size…
bake
  • 55
  • 1
  • 9
0
votes
1 answer

Cannot extract text from image using pytesseract

I am writing a python script to extract image from a file using pytesseract. But when I run the script I am getting the below error. from PIL import Image from pytesseract import image_to_string import os tessdata_dir_config = '--tessdata-dir…
Ranit Das
  • 64
  • 1
  • 11
0
votes
1 answer

OCR - pytesseract issue

I'm trying to apply below code: import cv2 import numpy as np import pytesseract from PIL import Image # Path of working folder on Disk src_path = "C:/TEST/" def get_string(img_path): # Read image with opencv img = cv2.imread(img_path) …
Pretbc
  • 11
  • 4
0
votes
0 answers

ModuleNotFoundError: No module named 'pytesser'

I have successfully installed pytessearct(up-to-date version) and the environment path(c:\users\hp\appdata\local\programs\python\python36-32\lib\site-packages) is set but when I am trying to use/import pytesseract library it is showing no module…
0
votes
1 answer

Extract text from image using Pytesser

I have installed Pytesser to extract text from image from PIL import Image from pytesser import * print image_to_string the above commands are working but when i type the code from pytesser import * from PIL…
Anmol Narang
  • 532
  • 5
  • 10
0
votes
1 answer

Python Ocr Licence Plate Recognition

I tried converting the image into gray with adaptive threshold and Thesh_Binary_Inv gray = cv2.cvtColor(img1,cv2.COLOR_BGR2GRAY) adapt1 = cv2.adaptiveThreshold(gray,130,cv2.ADAPTIVE_THRESH_GAUSSIAN_C,cv2.THRESH_BINARY_INV,153,40) and trying to get…
Raaj Vamsy
  • 3
  • 2
  • 6
0
votes
0 answers

CAPTCHA recognition algorithm on Python

I'm trying to design a CAPTCHA recognition algorithm using Python, OpenCV and Tesseract. The problem is the CAPTCHA digits are misaligned and randomly clustered within the image. Pytesseract fails and returns empty lists most of the time. The data…
0
votes
1 answer

pytesser set path to pattern files

Working at the university, I am experiencing the problem of not being able to change config files like "digits" in the tessdata as I do not have admin rights. So I want to run everything from my home, creating patterns, configs and training data…
mamamamama
  • 101
  • 1
  • 15
0
votes
3 answers

python FileNotFoundError when using pytesseract

I trying to capture a part of the current screen to detect some number on the screen, but when the code run got this error: Traceback (most recent call last): File "C:/Users/Administrator/PycharmProjects/bot/detect_num.py", line 12, in
AlphaWolf
  • 395
  • 2
  • 7
  • 16
0
votes
1 answer

FileNotFoundError: [WinError 2] The system cannot find the file specified while using pytesseract python-3.x

I am using pytesseract to read image as text in python. Following is my code: from PIL import Image from pytesseract import image_to_string import os.path if (os.path.exists('image.png')): filename = 'image.png' image =…
muazfaiz
  • 4,611
  • 14
  • 50
  • 88
0
votes
1 answer

Python 2.7 PyTesseract AttributeError: 'PixelAccess' object has no attribute 'split'

Traceback (most recent call last): File "C:\Python27\Stuff\imagetotext.py", line 9, in i = pytesseract.image_to_string(img) File "C:\Python27\lib\site-packages\pytesseract\pytesseract.py", line 143, in image_to_string if…
user6374210
0
votes
0 answers

inaccurate result using pytesserract

This image_1 image_2 The problem is that the result is not accurate ,the result should be as follows >>>>>>>>> "Prv:2.700 0p:2,70o Lo1:31.5&5 (mg: —5 Hi:2,71D val: 8.55" Does anybody know how to optimize the image to achieve the best result…
Okta
  • 3
  • 5