I have a TTF file that contains Unicode code points and their corresponding glyphs, as the figure shows:
The red box is the Unicode code point, and the character above it is the corresponding glyph. How can I extract the Unicode code points from the font file?
A Unicode font maps characters to glyphs. The process is described in this SO question: How does a Unicode character get mapped to a glyph in a font?
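To list which code points a font provides glyphs for, one option in Java is to load the file with `java.awt.Font.createFont` and probe each code point with `Font.canDisplay(int)`. The following is only a minimal sketch: the class name `FontCodePoints`, the helper `supported`, and the choice to scan the Basic Multilingual Plane Private Use Area (U+E000 to U+F8FF) are assumptions made for illustration, and `canDisplay` reports what Java's font machinery considers displayable rather than giving raw access to the font's `cmap` table.

```java
import java.awt.Font;
import java.io.File;
import java.util.ArrayList;
import java.util.List;
import java.util.function.IntPredicate;

final class FontCodePoints {
    // Collects every code point in [first, last] for which hasGlyph returns true.
    // Taking a predicate (hypothetical design choice) keeps the scan independent of AWT.
    static List<Integer> supported(IntPredicate hasGlyph, int first, int last) {
        List<Integer> result = new ArrayList<>();
        for (int cp = first; cp <= last; cp++) {
            if (hasGlyph.test(cp)) {
                result.add(cp);
            }
        }
        return result;
    }

    public static void main(String[] args) throws Exception {
        // args[0] is assumed to be the path to the .ttf file.
        Font font = Font.createFont(Font.TRUETYPE_FONT, new File(args[0]));
        // Scan the BMP Private Use Area; widen the range to cover other blocks.
        List<Integer> found = supported(c -> font.canDisplay(c), 0xE000, 0xF8FF);
        for (int cp : found) {
            System.out.printf("U+%04X%n", cp);
        }
    }
}
```

Running it as `java FontCodePoints.java yourfont.ttf` would print one code point per line. This only tells you which code points have glyphs, not what those glyphs look like.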
If a font maps a character to a glyph that doesn't look like what the character should be, there is no way to find out which other character the displayed glyph is meant to represent (short of doing OCR on a rendered bitmap of the glyph).
In your case, the Java character (and Unicode codepoint) U+EDBC is in a Unicode Private Use Area:
In Unicode, a Private Use Area (PUA) is a range of code points that, by definition, will not be assigned characters by the Unicode Consortium. [...] The code points in these areas cannot be considered as standardized characters in Unicode itself. They are intentionally left undefined so that third parties may define their own characters without conflicting with Unicode Consortium assignments.
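As a quick way to confirm that a code point such as U+EDBC falls in a Private Use Area, Java's `Character.getType` reports the general category `PRIVATE_USE` for such code points. This is a small sketch; the class name `PrivateUseCheck` and the method `isPrivateUse` are names invented for this example.

```java
final class PrivateUseCheck {
    // True when the code point has Unicode general category Co (private use).
    // This covers the BMP Private Use Area and the two supplementary ones.
    static boolean isPrivateUse(int codePoint) {
        return Character.getType(codePoint) == Character.PRIVATE_USE;
    }

    public static void main(String[] args) {
        int[] samples = {0xEDBC, 'A', 0xF0000};
        for (int cp : samples) {
            System.out.printf("U+%04X private use: %b%n", cp, isPrivateUse(cp));
        }
    }
}
```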
That means that there is not even an intended standard meaning for these characters. It is possible that there is some documentation for this font where you may find the meaning of the codepoints.
If not, your only option is to create your own mapping table from the characters used on the web page to the standard Unicode codepoints that you believe are the closest representation of the glyphs that the font shows.
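Such a hand-built table could be applied along the following lines. This is a sketch under stated assumptions: the class `PuaRemapper`, the method `remap`, and above all the two table entries are hypothetical placeholders, since the real pairs have to come from looking at the glyphs of your font.

```java
import java.util.Map;

final class PuaRemapper {
    // HYPOTHETICAL entries for illustration only: replace them with the pairs
    // you determine by inspecting the glyphs the font actually draws.
    static final Map<Integer, Integer> EXAMPLE_TABLE = Map.of(
            0xEDBC, (int) '7',
            0xEDBD, (int) '3');

    // Replaces every code point found in the table; all others pass through unchanged.
    static String remap(String text, Map<Integer, Integer> table) {
        StringBuilder out = new StringBuilder();
        text.codePoints().forEach(cp -> out.appendCodePoint(table.getOrDefault(cp, cp)));
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(remap("price: \uEDBC\uEDBD", EXAMPLE_TABLE));
    }
}
```

Iterating over code points rather than `char` values keeps the sketch correct for text that also contains supplementary characters.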