I am trying to use pdftotext to convert .pdf files to text for further processing in Python, but I have run into the following problem.
It works for some .pdf files, but for others the output is wrong and looks like this:
(0)
(0)
(0)
(0)
(0)
(0)
000 0000000 0000000000 0000000 00000 000 00
000000000 0000 0000 0000000 00000000000 00000000
000000 000 0000000 000000.
000 000000 0000000 00000000 0000000 0 00000
00000 00 0000000 000000.
When I look at it, it seems that each 0 stands for exactly one character of the original text.
So my question is: what could be going wrong, and how can I fix the output of pdftotext?
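
In case it helps to separate the affected files from the good ones, here is a minimal Python sketch that flags output like the sample above. It assumes the pdftotext output has already been read into a string. The names `zero_fraction` and `looks_zeroed` and the 0.9 threshold are illustrative assumptions, not part of pdftotext, and the check only detects the symptom; it does not repair the text.

```python
def zero_fraction(text: str) -> float:
    """Return the share of letters/digits in `text` that are the character '0'."""
    # Ignore whitespace and punctuation so "(0)" and "000." count only their zeros.
    alnum = [ch for ch in text if ch.isalnum()]
    if not alnum:
        return 0.0
    return sum(ch == "0" for ch in alnum) / len(alnum)


def looks_zeroed(text: str, threshold: float = 0.9) -> bool:
    """Heuristic: True when almost every letter/digit is '0' (threshold is arbitrary)."""
    return zero_fraction(text) >= threshold


sample = "(0)\n000 0000000 0000000000 0000000 00000 000 00\n000000 000 0000000 000000."
print(looks_zeroed(sample))                            # True
print(looks_zeroed("This is normal extracted text."))  # False
```

Files flagged this way could then be set aside for a different extraction route while the rest go through the normal pipeline.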