Questions tagged [document-layout-analysis]
23 questions
0
votes
0 answers
I want to extract information from pdf
I have a pdf in which two order number is mentioned on different page I have to check order number is the same or not. I have a little idea about document layout analysis. Anyone can help me with how can i do this? I have to match different things…

rahul verma
- 1
- 2
0
votes
0 answers
Identify the orientation of page
Image of portrait page viewed horizontally
Hello suppose we have this photo of a page(or many pages) ,
you rotate if 90,180 degree anticlockwise
I want to write code in order to detect the position of the page number to find out the orientation of…

Kebab Sharif
- 1
- 1
0
votes
0 answers
OpenCV checkbox detection and if checked
I'm working on a scanned document to verify if a checkbox is checked or not.
The form contains 3 checkboxes and are being checked off by humans.
example1
example2
I am having difficulties isolating the 3 checkboxes and determine if it is checked or…

Bigfoot48
- 11
- 3
0
votes
1 answer
How to split scanned image into three separate images?
Input image given here
I have a scanned image of doctor's prescription, and I want to use python code to split the image into three separate sections, as shown in image below.
Cropped1 image
Cropped2 image
Cropped3 image
What I have so…

Monika Choudhary
- 11
- 4
0
votes
0 answers
Extracting numeric table data from an image using Python
I want to extract numeric data from an image of a table (png/jpeg/etc.) using Python. I don't mind if it's some deep learning algorithm but it doesn't have to be if there is already an existing library.
I've tried various script that I found online.…

tpoh
- 261
- 3
- 11
0
votes
0 answers
how to find an image among text in python?
ok, so here is my probleme, i have a lot of jpg files, that is a text about plant and all stuff, and there is images with it that i'm not interested in, so i would like to detect them, then to fill the image with white, so text recognition that i'll…

Armand Marcelli
- 5
- 5
0
votes
1 answer
Word coordinates with pdfminer
I am trying to write a pdf layout analyzer in python. I have been able to produce some fairly decent results using Pandas for the work. However, so far I have been using data generated by JPedal, but the coordinates produced seem to contain some…

root
- 76,608
- 25
- 108
- 120
-1
votes
1 answer
Draw a rectangle around a string of words using pytesseract
Here is my image:
I can recognize the words:
I need to check if there is a line with specific text in the image and highlight this line with a rectangle.
For example. I check if there is "times, it was the worst". And then I expect to see:
How…

Ivan
- 163
- 7