Questions tagged [pytesser]

PyTesser is an Optical Character Recognition module for Python. It takes as input an image or image file and outputs a string.

PyTesser is an Optical Character Recognition module for Python. It takes as input an image or image file and outputs a string.

PyTesser uses the Tesseract OCR engine, converting images to an accepted format and calling the Tesseract executable as an external script. A Windows executable is provided along with the Python scripts. The scripts should work in other operating systems as well.

http://code.google.com/p/pytesser/

105 questions
1
vote
1 answer

pytesseract shows " 'str' object has no attribute 'save' " error

when i run the following code for pytesseract >>> import pytesseract >>> import Image >>> print pytesseract.image_to_string("plate.png") it shows the below error Traceback (most recent call last): File "", line 1, in print…
jeena rita
  • 13
  • 1
  • 1
  • 4
0
votes
0 answers

Facing OCR multilanguage problem, how to ignore other langs from extracting

I am tryin to OCR an image which contains multiple language(English & Hindi) when I OCR it with py-tesseract the Hindi language words are translated into English words resulting in meaningless strings, so is there anyway to detect only English…
0
votes
1 answer

pip install tesserocr fails by giving error in Windows "Tesseract library not found in LIBPATH: []"

I am trying to install tesserocr and I am getting the below error when I do pip install tesserocr Here is the setup which I have done. Want to know what is the issue and how can I resolve it ? Downloaded the latest from…
VBha
  • 17
  • 3
0
votes
0 answers

Why isn't pytesseract recognizing this image?

It can read other images fine, it just can not read this one enter image description here import numpy as np from pytesseract import pytesseract import cv2 import numpy as np from PIL import Image import os path_to_tesseract = r'C:\Program…
twitterL9
  • 23
  • 4
0
votes
0 answers

Configuring tesseract giving file not found error

I am extracting text from image(jpg file) using pytesseract. The extracted text has extra spaces. Actual image text - Aditya Output text - A di ty a I am using the below code: text =…
0
votes
1 answer

How to get coordinates of the overall bounding box of a text image?

original image img = cv2.imread('eng2.png') d = pytesseract.image_to_data(img, output_type=Output.DICT) n_boxes = len(d['level']) for i in range(n_boxes): (x, y, w, h) = (d['left'][i], d['top'][i], d['width'][i], d['height'][i]) …
sha25
  • 23
  • 6
0
votes
1 answer

Pytesseract Not Recognising Text

I am trying to use Pytesseract to read the digits from the following image: Low Resolution Image Unfortunately, the program is not returning with any solution, even after using greyscale, thresholding, noise detection or canny edge detection. When…
0
votes
0 answers

pytesseract Erase table borders as delicately as possible

I am using pytesseract and opencv to erase the borders of an image. to extract text from image using tesseract This is the source code I wrote based on this post. What's the way to remove all lines and borders in image(keep texts)…
IronJo
  • 31
  • 5
0
votes
2 answers

How can I make pytesseract read slahed 0 correctly

I am trying to read the phone number on the image. Since the image is very clear, I didn't apply any preprocessing yet pytesseract fails to recognize 0 correctly sometimes. I tried to train on similar font but it gives the same result. An example…
0
votes
1 answer

I want to extract table information from OCR Data

I want to extract table information from OCR data, I have raw text and it's text. I tried pytesseract but couldn't find the actual Implementation. Here is an image: https://drive.google.com/open?id=1CGJwbmf5snoXvwlQAsRAxIRRixbT_Q8l I tried this:…
user10227621
0
votes
0 answers

This line of code fixed a pytesseract path issue I was having but I don't understand how it works, can someone explain how it works

So I was trying to run some code using pytesseract, and I got this error: raise TesseractNotFoundError() TesseractNotFoundError: tesseract is not installed or it's not in your path There was a post on this site that provided the solution, I added…
Jed Bartlet
  • 1,963
  • 2
  • 11
  • 12
0
votes
1 answer

WindowsError: [Error 2] The system cannot find the file specified pytesseract

I'm trying to run this python code, but it keeps throwing the same error. I'm using pytesseract on a windows machine (coding in pycharm) to scan an image. I made a project on cloud9 that haevily used pytesseract in amazon web services and everything…
user9774586
0
votes
0 answers

Python image to string converting none of modules works

I am using module like pytesseract, textract, pyocr, all with Image modul. After opening .jpg file I can display the file but I just can't read the text from the file. I use three kinds of modules nad all th etime I had the same error: The system…
Cris
  • 11
  • 5
0
votes
1 answer

How to read text from image if image contain only alphabet or number

I need to get the text from image but in my image it only has one number it could be anything between 1-9 i need to get that number. i am using pytesseract to do this but on reading it shows empty text. the below is my image: below is what i am…
Sachhya
  • 1,260
  • 1
  • 9
  • 16
0
votes
1 answer

Image to Text Pytesseract Error

import pytesseract from PIL import Image, ImageEnhance, ImageFilter pytesseract.pytesseract.tesseract_cmd="C:\Program Files (x86)\Tesseract- OCR\tesseract.exe" im = Image.open("d:\ss.png") # the second one im =…
RAVI RK
  • 5
  • 3