I'm looking for a way to extract text and the position of that text from a PDF with a "text layer". My goal is to show a PDF with the extracted text as a layer and to have the user select certain lines as areas of interest.
pdftotext only shows me the text in rows, but without position information. I checked TET from PDFlib but they don't have a trial version and it doesn't seem like the libraries are actively maintained anymore.
The program should work on Linux