Some PDF contains text with "strange" encoding. E.g. there http://www.iwb.ch/media/Unternehmen/Dokumente/inserat_leiter_pm.pdf If I copy the text for example in acrobat reader and paste somewhere I don't get the same characters as I see. PdfBox has also problem with extracting this text.
My question is how can I detect in PdfBox that some fonts are using this "strange" encoding? I don't need to decode them only detect.