I have PDF file in Arabic that has text with font Type3 when I extract text using PDFBox some characters are empty and their font equals null? I want to know what is the problem?
code:
protected void processTextPosition(TextPosition text) {
String character=text.getCharacter(); // is empty
String font=text.getFont().getBaseFont(); // equal null
}
stream produced with iText: ( dJ� v{d W�cG�)Tj
I speak about these question marks, why do I get the characters in this format?
These question marks appeared in my stream as "SOH-STX-ETX-EOT", not one character. The character inside PDF is shown as 'd' and 'J'!