Questions tagged [paddleocr]

57 questions
0
votes
0 answers

Error while using PaddleOCR: list index out of range

I tried to use PaddleOCR for rading arabic texts. I followed the instructions given here: https://learnopencv.com/optical-character-recognition-using-paddleocr/ When I switch the language to engl. i get also an error. here is the code for reading…
Kassy
  • 1
  • 1
0
votes
0 answers

Alignment Issue for Document Text Extraction

I am using PaddleOCR to extract text from documents such as payslip and print the result. Corrently the code will print the words in the correct line and order however, i am struggling to get the words printed in roughly the same alignment, this is…
0
votes
0 answers

PaddleOCR installation fails due to dependency conflict with paddlepaddle

The latest paddleocr seems incompatible with the latest paddlepaddle due to conflicting protobuf dependencies. Has anyone else been able to fix this issue?
0
votes
1 answer

PaddleOCR installation failure

when I am trying to install PaddleOCR, I'm running into an installation error while trying to install pymupdf. I'm currently on Python 3.8.16 error: command 'swig' failed with exit status 1 [end of output] note: This error originates from a…
0
votes
1 answer

Is There a Way for PaddleOCR to Read PIL Images from Memory?

Previously I was using pytesseract and it was too slow for me so I switched to paddleOCR. Then I find out that PaddleOCR can't seem to recognize PIL Image in RAM Memory. Then I tried to turn it into lists, ndarrays and so on, everything that…
pockspocky
  • 38
  • 3
0
votes
0 answers

Why is PaddleOCR CRNN rec_algorithom recognizing text with no match at all?

Here, I am using DB as the detection algorithm and CRNN as the recognition algorithm. The texts I want to detect are English words. The texts I am getting from it are as follows- 'text':…
0
votes
1 answer

PaddleOCR model not detecting angled text even though use_cls is true

I am running a pre-trained PaddleOCR model (detection + recognition + cls) in Sagemaker following this tutorial. The inference works and I am able to successfully read words from images and get their confidences and bounding boxes. I am creating my…
219CID
  • 340
  • 5
  • 15
0
votes
0 answers

Why is pyinstaller not analyzing hidden import 'paddle.fluid.proto.framework_pb2'

I have a python script that uses paddleOCR which runs fine. Using pyinstaller, the exe is generate successfully but when the exe is run, it generates this error: Traceback (most recent call last): File "mri_ocr_core.py", line 1, in File…
Wai
  • 33
  • 3
0
votes
0 answers

Correct spelling errors OCR results

I would need to correct some OCR results done by PaddleOCR. For example if I have as a result "Ilive inNew York" I would like to get "I live in New York". What library can I use to correct these spelling errors?
user192837
  • 78
  • 5
0
votes
0 answers

How to install paddleOCR (CPU version) for CentOS 7?

I have been trying to install paddleOCR CPU version for centos7. While running the program file, I encountered an error given below. Error First, we installed virtual environment with python version 3.8.1 followed by the commands given below- I…
0
votes
1 answer

Failed to build lanms-neo Polygon3

I have been trying to install paddleocr but the code exits with this error. I have tried multiple versions of VS Build Tools but the problem persists. Can someone help me solve this error, I have copied the error and output that the code exits…
0
votes
1 answer

How to split the pandas dataframe column result?

I try to split the text column in df['text'], but paddleocr output the text with conf, so I don't know how to separate it into two part, I tried using lstrip but didnt work. The result as follow: ID Text 0 …
lan
  • 1
0
votes
0 answers

PaddleOCR FileNotFoundError: [WinError 2] The system cannot find the file specified

I am working on making License Plate Module for the below project and I am getting FileNotFoundError: [WinError 2] The system cannot find the file specified when running the project. It works fine using an interactive window Project…
Mike Lud
  • 11
  • 1
0
votes
0 answers

Why do we need to convert PaddleOCR to PyTorch model?

I can run PaddleOCR model directly but why do I need to convert it PyTorch model? Since It has not had many resources, I didn't find any article or blog describing why we need to convert.
Rakibul Islam
  • 325
  • 1
  • 3
  • 13
0
votes
0 answers

Colab crashing while running Paddle OCR

I used the Googl Colab to extract the tables from the images using Paddle OCR. It was working fine till last week. But from Monday onwards, the colab is crashing when I run the code to finding layout. Not sure what causes this issue. Any…
Pravin
  • 241
  • 2
  • 14