0

When i try to parse the pdf, i can't get the content of pdf but getting random symbols and characters. What is the reason behind it? This should give the proper text. I have tried using PyPDF2 also still can not get the text.

filename = "test2.pdf"
with fitz.open(filename) as f:
    for p in f:
        print("\n\n")
        print(p.get_text(sort=True))

Result : enter image description here This type of result i get.

0 Answers0