I am trying to extract text from an Aadhaar card (ID) using Tesseract OCR, but I am getting incomplete results. For example, it does not detect 'Government of India', which is printed at the top of the card, and in some cases it misses the name and gender.

I have tried to recover the complete text by applying these image preprocessing steps:

  1. Grayscale conversion
  2. Adaptive thresholding, with either of the following calls:

cv2.adaptiveThreshold(gray_image, 255, cv2.ADAPTIVE_THRESH_MEAN_C, cv2.THRESH_BINARY, 9, 20)

cv2.adaptiveThreshold(gray_image, 255, cv2.ADAPTIVE_THRESH_GAUSSIAN_C, cv2.THRESH_BINARY, 11,2)
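
To make the roles of the block size and the constant concrete, here is a pure-Python illustration of the mean variant: each pixel is compared against the mean of its block-size neighbourhood minus the constant. This is only a sketch of the idea, not OpenCV's implementation (which differs in rounding and speed); the function name `adaptive_mean_threshold` and the edge-clamping border handling are assumptions made for illustration.

```python
def adaptive_mean_threshold(gray, block_size, c, max_value=255):
    """Binarize a 2D list of grey values with a local-mean threshold.

    Hypothetical helper for illustration only; it mimics the idea behind
    cv2.ADAPTIVE_THRESH_MEAN_C with cv2.THRESH_BINARY, not its exact output.
    """
    if block_size < 3 or block_size % 2 == 0:
        raise ValueError("block_size must be an odd number >= 3")

    height, width = len(gray), len(gray[0])
    radius = block_size // 2
    result = []
    for y in range(height):
        row = []
        for x in range(width):
            total = 0
            for dy in range(-radius, radius + 1):
                for dx in range(-radius, radius + 1):
                    # Clamp coordinates so border pixels reuse the nearest edge value.
                    yy = min(max(y + dy, 0), height - 1)
                    xx = min(max(x + dx, 0), width - 1)
                    total += gray[yy][xx]
            # Local threshold: neighbourhood mean minus the constant c.
            threshold = total / (block_size * block_size) - c
            row.append(max_value if gray[y][x] > threshold else 0)
        result.append(row)
    return result
```

A larger constant lowers the local threshold, so more pixels turn white; a larger block size averages over a wider area, which smooths out uneven lighting but can wash out small text.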

After applying thresholding, I get almost the complete text, but the output also contains some incorrect text that is not on the ID card.
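
One way such spurious words might be reduced is to drop low-confidence results. The sketch below assumes the `pytesseract` wrapper is being used (the question does not say which binding is in play) and that `image_to_data` is called with `output_type=Output.DICT`; the helper names and the cutoff of 60 are hypothetical and would need tuning on real cards.

```python
def keep_confident_words(data, min_conf=60.0):
    """Return the words whose confidence is at least min_conf.

    `data` is expected to look like the dict produced by
    pytesseract.image_to_data(..., output_type=Output.DICT), with
    parallel "text" and "conf" lists. Hypothetical helper.
    """
    words = []
    for text, conf in zip(data["text"], data["conf"]):
        # Confidence may arrive as a string or a number depending on the
        # pytesseract version; non-word rows carry a confidence of -1.
        if text.strip() and float(conf) >= min_conf:
            words.append(text)
    return words


def ocr_confident_words(image, min_conf=60.0):
    """Run Tesseract on an image and keep only confident words."""
    # Imported here so the filter above can be used without pytesseract installed.
    import pytesseract
    from pytesseract import Output

    data = pytesseract.image_to_data(image, output_type=Output.DICT)
    return keep_confident_words(data, min_conf)
```

The filter only hides noise; it does not recover text that Tesseract failed to detect in the first place.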

There is another problem: the block size and the constant have to be chosen by the developer, and the same values will not work across different images.
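
A possible way to avoid hard-coding the block size is to derive it from the image dimensions. The sketch below is a heuristic under stated assumptions, not an established rule: the helper names, the divisor of 32, and the constant of 10 are all hypothetical starting points. The only firm requirement reflected here is that `cv2.adaptiveThreshold` needs an odd block size greater than 1.

```python
def odd_block_size(width, height, divisor=32, minimum=3):
    """Return an odd neighbourhood size scaled to the shorter image side.

    Hypothetical heuristic: roughly 1/divisor of the shorter side,
    bumped up to the next odd number if needed.
    """
    size = max(minimum, min(width, height) // divisor)
    return size if size % 2 == 1 else size + 1


def binarize(gray_image, divisor=32, c=10):
    """Adaptive-threshold a grayscale image with a size-dependent block."""
    # Imported here so odd_block_size can be used without OpenCV installed.
    import cv2

    height, width = gray_image.shape[:2]
    block = odd_block_size(width, height, divisor)
    return cv2.adaptiveThreshold(
        gray_image, 255, cv2.ADAPTIVE_THRESH_GAUSSIAN_C,
        cv2.THRESH_BINARY, block, c,
    )
```

This ties the block size to resolution, so a scan and a phone photo of the same card get comparable neighbourhoods, but the constant would still need to be validated on a sample of real images.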

Please let me know if you have any ideas on how to solve this problem.

Christoph Rackwitz
Groot