Hi, I am trying to read a PDF file in Python.
One piece of the text, shown in the picture below, is being read as "METER READING DATES: 04 8 2TO05 7 2".
Below is my code:
import pdf2image
import pytesseract
from pytesseract import Output

# pdf_path holds the path to the PDF file (set elsewhere)
poppler_path = r'C:\poppler-0.68.0\bin'
images = pdf2image.convert_from_path(pdf_path, poppler_path=poppler_path)
pil_im = images[0]  # assuming that we're interested in the first page only
ocr_dict = pytesseract.image_to_data(pil_im, lang='eng', output_type=Output.DICT)
text = " ".join(ocr_dict['text'])
print(text)
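One thing that may be worth ruling out is the join itself: `ocr_dict['text']` also contains empty entries for blocks, paragraphs and lines, so joining everything with spaces loses the line structure and adds stray spaces. The sketch below rebuilds the text line by line from the `block_num`, `par_num`, `line_num` and `conf` lists that `image_to_data` returns. The helper name `lines_from_ocr` and the `min_conf` threshold are only illustrative, and this would not correct characters that Tesseract misrecognized in the first place.

```python
def lines_from_ocr(ocr_dict, min_conf=0):
    """Group OCR words into text lines (hypothetical helper, not part of pytesseract)."""
    lines = {}  # insertion-ordered: (block, paragraph, line) -> list of words
    for i, word in enumerate(ocr_dict["text"]):
        word = word.strip()
        if not word:
            continue  # skip the empty entries emitted for non-word levels
        # conf may be a str or a number depending on the pytesseract version
        if float(ocr_dict["conf"][i]) < min_conf:
            continue
        key = (
            ocr_dict["block_num"][i],
            ocr_dict["par_num"][i],
            ocr_dict["line_num"][i],
        )
        lines.setdefault(key, []).append(word)
    return [" ".join(words) for words in lines.values()]


# Small hand-made dictionary in the same shape as Output.DICT (made-up values)
sample = {
    "text": ["", "METER", "READING", "", "DATES:", "  "],
    "conf": ["-1", "96", "95", "-1", 91, "-1"],
    "block_num": [1, 1, 1, 1, 1, 1],
    "par_num": [1, 1, 1, 1, 1, 1],
    "line_num": [0, 1, 1, 0, 2, 2],
}
for line in lines_from_ocr(sample):
    print(line)
```

With the real data this would be called as `lines_from_ocr(ocr_dict)`, so each printed line can be compared directly against the picture.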
Now my question is: how can I read the text properly, as it appears in the picture? Thanks in advance.