Questions tagged [python-tesseract]

Python-tesseract is a Python wrapper for Tesseract OCR that reads conventional image files (JPG, GIF, PNG, TIFF, etc.) and returns their text, structured data about that text (such as bounding boxes and confidences), or a searchable PDF.

Tesseract is advertised as the most accurate open-source OCR engine available. It was developed at HP Labs between 1985 and 1995 and then remained dormant until 2006, when Google revived the project.

For more information, please see the Python-tesseract page or the Tesseract page.
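
As a rough illustration of the three kinds of output described above, here is a minimal sketch. It assumes the `pytesseract` and `Pillow` packages are installed and that the Tesseract executable is on the PATH; the helper names `tesseract_available` and `ocr_outputs` are hypothetical and chosen only for this example.

```python
import shutil


def tesseract_available(binary="tesseract"):
    """Return True if the Tesseract executable can be found on PATH."""
    return shutil.which(binary) is not None


def ocr_outputs(image_path, lang="eng"):
    """Return (text, layout data dict, PDF bytes) for a single image file."""
    # Imported lazily so the availability check above still works
    # when Pillow or pytesseract is not installed.
    from PIL import Image
    import pytesseract

    with Image.open(image_path) as img:
        # Plain recognized text.
        text = pytesseract.image_to_string(img, lang=lang)
        # Boxes, confidences and other per-element data as a dict of lists.
        data = pytesseract.image_to_data(
            img, lang=lang, output_type=pytesseract.Output.DICT
        )
        # Searchable PDF returned as bytes.
        pdf_bytes = pytesseract.image_to_pdf_or_hocr(img, lang=lang, extension="pdf")
    return text, data, pdf_bytes
```

A typical call would first check `tesseract_available()` and then run `ocr_outputs("receipt.png")`, where `receipt.png` is a placeholder file name. Many of the questions under this tag come down to the executable not being found, which is why the check is kept separate from the OCR call.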

1664 questions
0
votes
1 answer

jupyter notebook won't launch due to "Library not loaded" error

Jupyter notebook always launched with no problem. Until yesterday... I tried to pip install pytesseract then went off to do something else and now when I try to start jupyter notebook, this is what I get every single time: File…
biogeek
  • 561
  • 1
  • 4
  • 13
0
votes
1 answer

How to extract the data from an image using Python

Hi, I am a newbie to OCR models. I have 2000 receipt JPEG images and am trying to extract the data from them, but I get an error. How can I do this? I tried this: from PIL import Image import glob image_list = [] for filename in…
0
votes
1 answer

pytesseract error in reading screenshot on clipboard

I am using pytesseract in Python code to read a [screenshot on clipboard]. It gives me the following error. PS D:\Git\NOTEMAN> python nsu63w.py Exception in Tkinter callback Traceback (most recent call last): File…
0
votes
0 answers

How do you detect bubble letter numbers using pytesseract

I'm using pytesseract and OpenCV to detect numbers from a game in near real time. However, it is very inconsistent. I've changed a lot of settings and refined the image a lot, yet 90 percent of the time it still doesn't work,…
Bob Stone
  • 90
  • 1
  • 13
0
votes
0 answers

How to extract background image and text from an image

I have an image and am trying to separate the background image and text. For text I have used pytesseract and it gives me all the data. Now my aim is to translate this text and place it back on the image. For that I need the background image and…
Megha Bhamare
  • 77
  • 1
  • 10
0
votes
0 answers

how to improve tesseract-OCR for this image

In reality I have a simple image but the result is always very bad. I tried to binarize but tesseract-OCR should already do it. One result is for example: 'T MRR F 4 4 G 0 4 2 3 0 0 G'. The image is: The original is:
stefanodv
  • 463
  • 3
  • 11
  • 20
0
votes
0 answers

reading handwritten input to identify a student

I teach. I would like to solve the following problem: Collect student tests and have an OCR read input from the test and detect which exam belongs to which student. I've been using pytesseract pretty successfully to solve this problem. I give each…
Curtis Pro
  • 19
  • 3
0
votes
1 answer

Extract text in multiple background from image

I have multiple images with different backgrounds. I need to ignore the background and extract the number from each image. Ex: after testing, I have this result: Because of the background color, it's very difficult to extract the text. I'm using this code: image =…
0
votes
1 answer

Extract digit from noisy image

I want to extract text from an image taken by a mobile phone camera. First, I try to convert the image to greyscale using this code: imgg = Image.open('originale.jpg').convert('LA') Second, I try to threshold the grey…
solo_s
  • 1
  • 3
0
votes
0 answers

Tesseract OCR error, probably because of traineddata

I am using the pytesseract wrapper with legacy Tesseract (oem 0). This is the code I use to extract text from an image: try: # extracting ocr data from image ocr_data = pytesseract.image_to_data( img, lang="eng",…
M Asad Ali
  • 106
  • 7
0
votes
2 answers

How can I get Tesseract on Windows?

I am working on a text recognition solution and I need to use Tesseract on Windows. Is there a command to check whether it's already installed? If not, how can I get it?
0
votes
1 answer

tesseract does not work although I installed it and added it to the Path

I have installed tesseract 64 bit from this page. I also added "C:\Program Files\Tesseract-OCR" to the system path: I went to "System Properties", then the "Advanced" tab, then clicked "Environment Variables...", then added "C:\Program…
MRM
  • 1,099
  • 2
  • 12
  • 29
0
votes
2 answers

How to use the webcam to capture an image and extract the information on it using Python?

I was trying to capture an image from the webcam and extract the text on it using Python. Here is the code: import cv2 import matplotlib.pyplot as plt import numpy as np import pytesseract from PIL import Image from…
Leon S. Kennedy
  • 159
  • 3
  • 11
0
votes
1 answer

Failed to load module pytesseract

I am trying to extract text from an image, following this link https://www.simplifiedpython.net/how-to-extract-text-from-image-in-python/ So far I have installed pytesseract and the Image library using this command: pip install…
Syntax Rommel
  • 932
  • 2
  • 16
  • 40
0
votes
1 answer

Trouble getting tesseract to work in Python

I'm having some trouble running code that uses tesseract in Jupyter Notebook or PyCharm. I suspect it is a problem with the installation on Windows 7, but I'm not sure what I am doing wrong. I've tried many different things, from pip…