I'm looking into encoding some binary data as text, using single graphical units. Having already used all the ones available in Java with UTF-16, I'd like to expand my "pool".

I recently discovered grapheme clusters, where several elements/chars are combined to form a single graphical unit.

For example: "‍♂️", which holds:

  • base character
  • skintone modifier
  • ligature character (zero width joiner)
  • male sign
  • variation selector
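
To make the structure above concrete, here is a small Java sketch that assembles such a sequence from its code points and prints how many UTF-16 code units and code points it takes. The base character (U+1F926) and the light skin tone (U+1F3FB) are assumptions picked only for illustration; the class and method names are made up too.

```java
public class GraphemeDemo {
    // Builds one emoji sequence from five code points.
    // The base character and the skin tone are assumed for illustration.
    static String build() {
        int[] codePoints = {
            0x1F926, // base character (assumed: PERSON FACEPALMING)
            0x1F3FB, // skin tone modifier (assumed: light skin tone)
            0x200D,  // zero width joiner
            0x2642,  // male sign
            0xFE0F   // variation selector-16
        };
        return new String(codePoints, 0, codePoints.length);
    }

    public static void main(String[] args) {
        String s = build();
        // Supplementary code points take two UTF-16 code units each.
        System.out.println("UTF-16 code units: " + s.length());
        System.out.println("code points: " + s.codePointCount(0, s.length()));
        // List every code point with its Unicode name.
        s.codePoints().forEach(cp ->
            System.out.printf("U+%04X %s%n", cp, Character.getName(cp)));
    }
}
```

So a single graphical unit can be a whole run of `char`s in a Java `String`, which is why `length()` says little about what is displayed.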

(more here or here)

I wonder whether there is a comprehensive list of all the possible combinations, or at least a specification that clearly states which emoji can be combined, how, and how long a combination can be at most.

elect

1 Answer

Well, the obvious starting point is Unicode itself: https://unicode.org

It does not provide a single, comprehensive list, as far as I know, but several lists for groups of symbols, e.g.

Emojis without modifiers: https://unicode.org/emoji/charts/full-emoji-list.html

Emojis with modifiers: https://unicode.org/emoji/charts/full-emoji-modifiers.html
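
If you need something machine-readable rather than HTML charts, the Unicode site also publishes plain-text data files for emoji sequences under https://unicode.org/Public/emoji/ (for example `emoji-zwj-sequences.txt`). A minimal Java sketch for turning one line of such a file into a string, assuming each data line has the shape `code points ; type ; description # comment`; the class and method names are hypothetical, and range lines (`XXXX..YYYY`) are simply skipped:

```java
import java.util.Arrays;

public class EmojiSequenceLine {
    // Parses one line of an emoji sequence data file.
    // Returns the sequence as a String, or null for blank lines,
    // comment lines and code point ranges.
    static String parse(String line) {
        // Drop the trailing comment, if any.
        int hash = line.indexOf('#');
        String data = (hash >= 0 ? line.substring(0, hash) : line).trim();
        if (data.isEmpty()) {
            return null;
        }
        // The first field holds space-separated hex code points.
        String first = data.split(";")[0].trim();
        if (first.contains("..")) {
            return null; // ranges are not handled in this sketch
        }
        int[] codePoints = Arrays.stream(first.split("\\s+"))
                .mapToInt(hex -> Integer.parseInt(hex, 16))
                .toArray();
        return new String(codePoints, 0, codePoints.length);
    }

    public static void main(String[] args) {
        String line = "1F468 200D 2764 FE0F 200D 1F468 ; RGI_Emoji_ZWJ_Sequence ; "
                + "couple with heart: man, man # E2.0";
        String s = parse(line);
        System.out.println(s + " has " + s.codePointCount(0, s.length()) + " code points");
    }
}
```

Running this over every line of a downloaded file would give you the listed sequences and their lengths, but only for the sequences that file actually contains.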

Hulk