How to recognize International Phonetic Alphabet (IPA) characters during OCR

Question

When running OCR on a dictionary PDF with Document AI, the text often includes IPA characters, e.g. ʷ, ə, etc. Is there a way to get them recognized correctly, such as setting a certain language hint? Currently ʷ is recognized as w and ə as a.

score 1 · Accepted Answer · answered Jun 15 '23 at 15:53

Document AI only recognizes IPA characters that also belong to one of its supported languages.

However, this could be a useful feature, so I filed a feature request for it on the Public Issue Tracker: https://issuetracker.google.com/287464641
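Until that is supported, one rough way to see whether any IPA symbols survived OCR is to scan the returned text for characters in the IPA-related Unicode blocks. This is only a minimal sketch and not part of Document AI: the names `find_ipa_chars` and `ipa_report` are hypothetical, and the list of blocks is an assumption about which ranges matter for your dictionary. It also cannot recover a ʷ that was already turned into w; it only tells you which pages kept IPA characters and which lost them.

```python
import unicodedata

# Unicode blocks that hold most IPA-specific symbols (assumed selection).
IPA_BLOCKS = (
    (0x0250, 0x02AF),  # IPA Extensions, e.g. ə (U+0259)
    (0x02B0, 0x02FF),  # Spacing Modifier Letters, e.g. ʷ (U+02B7)
    (0x1D00, 0x1D7F),  # Phonetic Extensions
    (0x1D80, 0x1DBF),  # Phonetic Extensions Supplement
)


def find_ipa_chars(text):
    """Return the distinct characters of `text` that fall in IPA_BLOCKS,
    sorted by code point."""
    return sorted(
        {ch for ch in text if any(lo <= ord(ch) <= hi for lo, hi in IPA_BLOCKS)}
    )


def ipa_report(text):
    """Return (character, code point, Unicode name) for each IPA character found."""
    return [
        (ch, f"U+{ord(ch):04X}", unicodedata.name(ch, "UNKNOWN"))
        for ch in find_ipa_chars(text)
    ]


if __name__ == "__main__":
    # Hypothetical OCR output strings: one kept the IPA symbols, one lost them.
    for sample in ("kʷə", "kwa"):
        print(repr(sample), ipa_report(sample))
```

Running this over the text of each page (for example the `document.text` field of the processed document) gives a quick list of pages where the IPA transcription was flattened to plain Latin letters and may need manual correction.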

Holt Skinner
