Questions tagged [document-layout-analysis]

23 questions
0
votes
0 answers

I want to extract information from pdf

I have a pdf in which two order number is mentioned on different page I have to check order number is the same or not. I have a little idea about document layout analysis. Anyone can help me with how can i do this? I have to match different things…
0
votes
0 answers

Identify the orientation of page

Image of portrait page viewed horizontally Hello suppose we have this photo of a page(or many pages) , you rotate if 90,180 degree anticlockwise I want to write code in order to detect the position of the page number to find out the orientation of…
0
votes
0 answers

OpenCV checkbox detection and if checked

I'm working on a scanned document to verify if a checkbox is checked or not. The form contains 3 checkboxes and are being checked off by humans. example1 example2 I am having difficulties isolating the 3 checkboxes and determine if it is checked or…
0
votes
1 answer

How to split scanned image into three separate images?

Input image given here I have a scanned image of doctor's prescription, and I want to use python code to split the image into three separate sections, as shown in image below. Cropped1 image Cropped2 image Cropped3 image What I have so…
0
votes
0 answers

Extracting numeric table data from an image using Python

I want to extract numeric data from an image of a table (png/jpeg/etc.) using Python. I don't mind if it's some deep learning algorithm but it doesn't have to be if there is already an existing library. I've tried various script that I found online.…
tpoh
  • 261
  • 3
  • 11
0
votes
0 answers

how to find an image among text in python?

ok, so here is my probleme, i have a lot of jpg files, that is a text about plant and all stuff, and there is images with it that i'm not interested in, so i would like to detect them, then to fill the image with white, so text recognition that i'll…
0
votes
1 answer

Word coordinates with pdfminer

I am trying to write a pdf layout analyzer in python. I have been able to produce some fairly decent results using Pandas for the work. However, so far I have been using data generated by JPedal, but the coordinates produced seem to contain some…
root
  • 76,608
  • 25
  • 108
  • 120
-1
votes
1 answer

Draw a rectangle around a string of words using pytesseract

Here is my image: I can recognize the words: I need to check if there is a line with specific text in the image and highlight this line with a rectangle. For example. I check if there is "times, it was the worst". And then I expect to see: How…
Ivan
  • 163
  • 7
1
2