There is a pdf document,I want to convert it to xml or html.
Since the pdf document contains some tables,when it have converted to xml or html,I can not know which is table data and which is text.
I want to get tables data to store the database.
Can xpdf or mupdf make it?
Thanks.