I need to display html formatted pdfs in the browser and select, copy and edit the formatted html.
So far I tried the pdftohtml command line utility and the pdf.js platform from mozilla. I just can't seem to do both tasks with one utility very well. For instance, I can display the pdf extremly accurate with pdf.js but I can't generate a formatted text layer, the overlay that is created for selecting text only has positioning styles but no font styles.
What would be the best approach to solve this problem?