Generating ToUnicode CMaps (Programmatically or Visually)

Asked Apr 15 '20 at 04:14

Active Apr 15 '20 at 04:36

Viewed 148 times

I have several problematic PDFs, which I am attempting to convert to PDF/A-1a.

These documents use embedded CID font subsets with Identity-H encoding, generated with Acrobat Distiller 20.0. I have searched for tools that run OCR on the rendered output, either to generate the missing ToUnicode CMaps automatically or, at the very least, to offer high-probability candidates for the user to choose from, but I have not found any.

The glyphs are clearly legible. If OCR is not the preferred way to build ToUnicode CMaps for these files, is there a common visual utility (or method) for assigning code points to the glyphs? I tried extracting the fonts into FontForge, but that failed (FontForge reports that the PDFs are unreadable).
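For reference, the output of either approach is small: a ToUnicode CMap is a plain-text stream that maps character codes to UTF-16BE values. Below is a minimal sketch of the serialization step only, assuming a CID-to-text mapping has already been obtained somehow (OCR candidates, manual selection, or otherwise). The function name `build_tounicode_cmap` and the sample mapping are hypothetical, not taken from any tool, and the sketch assumes 2-byte codes as used with Identity-H.

```python
def build_tounicode_cmap(cid_to_text):
    """Serialize a {cid: unicode_text} dict as ToUnicode CMap source.

    Assumes 2-byte character codes (Identity-H), so each CID must fit
    in the range 0x0000-0xFFFF.
    """
    lines = [
        "/CIDInit /ProcSet findresource begin",
        "12 dict begin",
        "begincmap",
        "/CIDSystemInfo << /Registry (Adobe) /Ordering (UCS) /Supplement 0 >> def",
        "/CMapName /Adobe-Identity-UCS def",
        "/CMapType 2 def",
        "1 begincodespacerange",
        "<0000> <FFFF>",
        "endcodespacerange",
    ]
    entries = sorted(cid_to_text.items())
    # A single bfchar block may hold at most 100 entries, so emit in chunks.
    for start in range(0, len(entries), 100):
        chunk = entries[start:start + 100]
        lines.append(f"{len(chunk)} beginbfchar")
        for cid, text in chunk:
            if not 0 <= cid <= 0xFFFF:
                raise ValueError(f"CID out of 2-byte range: {cid}")
            # Destination is UTF-16BE in hex; this also covers ligatures
            # (several characters) and surrogate pairs.
            dst = text.encode("utf-16-be").hex().upper()
            lines.append(f"<{cid:04X}> <{dst}>")
        lines.append("endbfchar")
    lines += [
        "endcmap",
        "CMapName currentdict /CMap defineresource pop",
        "end",
        "end",
    ]
    return "\n".join(lines) + "\n"


# Hypothetical mapping: real CIDs depend on how each subset was built.
sample = {3: " ", 36: "A", 200: "fi"}
cmap_source = build_tounicode_cmap(sample)
```

The resulting text would still need to be written into the PDF as a stream object and referenced from the Type0 font dictionary's `/ToUnicode` entry, which requires a PDF library and is not shown here.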

Thank you!

edited Apr 15 '20 at 04:36

asked Apr 15 '20 at 04:14

Kadaj Nakamura

PDF/A-1a is required. – Kadaj Nakamura Apr 15 '20 at 20:03
Can you post the file in question? Or a similar file with the same issue? – Ryan Apr 15 '20 at 20:40


0 Answers