I am trying to extract data from an invoice. I found invoice2data will do that job. I have pip installed invoice2data.
from invoice2data import extract_data
This is getting imported.
result = extract_data('sample.pdf')
When I run the above line it is showing that
OSError: pdftotext not installed. Can be downloaded from https://poppler.freedesktop.org/
When I try pip installing pdftotext it was showing virtual c++ 14.0 is required.I installed it using build tools. Again it was showing the same error. So I downloaded the files from https://pypi.org/project/pdftotext/ and pasted the extracted files in my anaconda/Lib/sitepackages directory. Now when I try to pip install pdftotext it is showing Requirement already satisfied: pdftotext in c:\users\vicky\anaconda3\lib\site-packages (2.1.2)
Now, when I try to extract data from the pdf it is again showing the same error that pdftotext is not installed. How can I overcome this error or is there any other package that will satisfy my requirement?
Thanks in advance.