Cyrillic script in Unicode
As of Unicode version 15.1, Cyrillic script is encoded across several blocks:
- Cyrillic: U+0400–U+04FF, 256 characters
- Cyrillic Supplement: U+0500–U+052F, 48 characters
- Cyrillic Extended-A: U+2DE0–U+2DFF, 32 characters
- Cyrillic Extended-B: U+A640–U+A69F, 96 characters
- Cyrillic Extended-C: U+1C80–U+1C8F, 9 characters
- Cyrillic Extended-D: U+1E030–U+1E08F, 63 characters
- Phonetic Extensions: U+1D2B, U+1D78, 2 Cyrillic characters
- Combining Half Marks: U+FE2E–U+FE2F, 2 Cyrillic characters
The Cyrillic script | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Slavic letters | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Non-Slavic letters | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Archaic or unused letters | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
The characters in the range U+0400–U+045F are basically the characters from ISO 8859-5 moved upward by 864 positions. The next characters in the Cyrillic block, range U+0460–U+0489, are historical letters, some of which are still used for Church Slavonic. The characters in the range U+048A–U+04FF and the complete Cyrillic Supplement block (U+0500-U+052F) are additional letters for various languages that are written with Cyrillic script. Two characters are in the Phonetic Extensions block: U+1D2B ᴫ CYRILLIC LETTER SMALL CAPITAL EL from the Uralic Phonetic Alphabet and U+1D78 ᵸ MODIFIER LETTER CYRILLIC EN for transcribing nasal vowels.
Unicode includes few precomposed accented Cyrillic letters; the others can be combined by adding U+0301 ("combining acute accent") after the accented vowel (e.g., е́ у́ э́); see below.
Several diacritical marks not specific to Cyrillic can be used with Cyrillic text, including:
- in Combining Diacritical Marks block U+0300–U+036F.
- U+0301 ◌́ COMBINING ACUTE ACCENT (as common Cyrillic stress mark).To input an accented letter with acute accent: for the letter R (for example), digit R0301 (without space between letter and number), then select 0301 only and press Alt + X = Ŕ.
- U+0300 ◌̀ COMBINING ACUTE GRAVE (as stress mark in Bulgarian).
- U+0303 ◌̃ COMBINING TILDE (in non Slavic languages)
- U+0304 ◌̄ COMBINING MACRON (in non Slavic languages)
- U+0306 ◌̆ COMBINING BREVE (with й but also other letters in non Slavic languages)
- U+0307 ◌̇ COMBINING DOT ABOVE (in transliterations of other writing systems)
- U+0308 ◌̈ COMBINING DIAERESIS (in non Slavic languages)
- U+030A ◌̊ COMBINING RING ABOVE (in non Slavic languages)
- U+030B ◌̋ COMBINING DOUBLE ACUTE (in non Slavic languages)
- U+030C ◌̌ COMBINING CARON (in non Slavic languages)
- U+030F ◌̏ COMBINING DOUBLE GRAVE (with ѷ in old spelling)
- U+0311 ◌̑ COMBINING INVERTED BREVE (in 19th century Aleut alphabet)
- U+0323 ◌̣ COMBINING DOT BELOW (in transliterations of other writing systems)
- U+0328 ◌̨ COMBINING OGONEK (in 19th century Lithuanian or Polish cyrillic alphabets)
- U+0331 ◌̱ COMBINING MACRON BELOW (in transliterations of other writing systems)
- U+033E ◌̾ COMBINING VERTICAL TILDE (in 19t century Polish cyrillic alphabet)
- in Combining Diacritical Marks for Symbols block U+20D0–U+20F0
- U+20DD ◌⃝ COMBINING ENCLOSING CIRCLE (as Cyrillic ten thousands sign).
In the table below, small letters are ordered according to their Unicode numbers; capital letters are placed immediately before the corresponding small letters. Standard Unicode names and canonical decompositions are included.