List_of_Unicode_characters

List of Unicode characters

List of Unicode characters

Add article description


As of Unicode version 15.1, there are 149,878 characters with code points, covering 161 modern and historical scripts, as well as multiple symbol sets. This article includes the 1,062 characters in the Multilingual European Character Set 2 (MES-2) subset, and some additional related characters.

Unicode logo

Character reference overview

HTML and XML provide ways to reference Unicode characters when the characters themselves either cannot or should not be used. A numeric character reference refers to a character by its Universal Character Set/Unicode code point, and a character entity reference refers to a character by a predefined name.

A numeric character reference uses the format

&#nnnn;

or

&#xhhhh;

where nnnn is the code point in decimal form, and hhhh is the code point in hexadecimal form. The x must be lowercase in XML documents. The nnnn or hhhh may be any number of digits and may include leading zeros. The hhhh may mix uppercase and lowercase, though uppercase is the usual style.

In contrast, a character entity reference refers to a character by the name of an entity which has the desired character as its replacement text. The entity must either be predefined (built into the markup language) or explicitly declared in a Document Type Definition (DTD). The format is the same as for any entity reference:

&name;

where name is the case-sensitive name of the entity. The semicolon is required.

Because numbers are harder for humans to remember than names, character entity references are most often written by humans, while numeric character references are most often produced by computer programs.[1]

Control codes

65 characters, including DEL. All belong to the common script.

More information Code, Decimal ...

Footnotes:

1 Control-C has typically been used as a "break" or "interrupt" key.
2 Control-D has been used to signal "end of file" for text typed in at the terminal on Unix / Linux systems. Windows, DOS, and older minicomputers used Control-Z for this purpose.
3 Control-G is an artifact of the days when teletypes were in use. Important messages could be signalled by striking the bell on the teletype. This was carried over on PCs by generating a buzz sound.
4 Line feed is used for "end of line" in text files on Unix / Linux systems.
5 Carriage Return (accompanied by line feed) is used as "end of line" character by Windows, DOS, and most minicomputers other than Unix- / Linux-based systems
6 Control-O has been the "discard output" key. Output is not sent to the terminal, but discarded, until another Control-o is typed.
7 Control-Q has been used to tell a host computer to resume sending output after it was stopped by Control-S.
8 Control-S has been used to tell a host computer to postpone sending output to the terminal. Output is suspended until restarted by the Control-Q key.
9 Control-U was originally used by Digital Equipment Corporation computers to cancel the current line of typed-in text. Other manufacturers used Control-X for this purpose.
10 Control-X was commonly used to cancel a line of input typed in at the terminal.
11 Control-Z has commonly been used on minicomputers, Windows and DOS systems to indicate "end of file" either on a terminal or in a text file. Unix / Linux systems use Control-D to indicate end-of-file at a terminal.

Latin script

The Unicode Standard (version 15.1) classifies 1,481 characters as belonging to the Latin script.

Basic Latin

95 characters; the 52 alphabet characters belong to the Latin script. The remaining 43 belong to the common script.
The 33 characters classified as ASCII Punctuation & Symbols are also sometimes referred to as ASCII special characters. Often only these characters (and not other Unicode punctuation) are what is meant when an organization says a password "requires punctuation marks".

More information Code, Glyph ...

Latin-1 Supplement

96 characters; the 62 letters, and two ordinal indicators belong to the Latin script. The remaining 32 belong to the common script.

More information Code, Glyph ...

Latin Extended-A

128 characters; all belong to the Latin script.

More information Code, Glyph ...

Latin Extended-B

208 characters; all belong to the Latin script; 33 in the MES-2 subset.

More information Code, Glyph ...

Latin Extended Additional

256 characters; all belong to the Latin script; 23 in the MES-2 subset.

More information Code, Glyph ...

Additional Latin Extended

Phonetic scripts

IPA Extensions

96 characters; all belong to the Latin script; three in the MES-2 subset.

More information Code, Glyph ...

Spacing modifier letters

80 characters; 15 in the MES-2 subset.

More information Code, Glyph ...

Phonetic Extensions

Combining marks

More information Code, Glyph ...

Greek and Coptic

144 code points; 135 assigned characters; 85 in the MES-2 subset.

More information Code, Glyph ...

Greek Extended

For polytonic orthography. 256 code points; 233 assigned characters, all in the MES-2 subset (#670 – 902).

Greek Extended[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+1F0x
U+1F1x
U+1F2x
U+1F3x Ἷ
U+1F4x
U+1F5x
U+1F6x
U+1F7x
U+1F8x
U+1F9x
U+1FAx
U+1FBx ᾿
U+1FCx
U+1FDx
U+1FEx
U+1FFx
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Cyrillic

256 characters; 191 in the MES-2 subset.

More information Code, Glyph ...

Cyrillic supplements

Armenian

Armenian[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+053x Ա Բ Գ Դ Ե Զ Է Ը Թ Ժ Ի Լ Խ Ծ Կ
U+054x Հ Ձ Ղ Ճ Մ Յ Ն Շ Ո Չ Պ Ջ Ռ Ս Վ Տ
U+055x Ր Ց Ւ Փ Ք Օ Ֆ ՙ ՚ ՛ ՜ ՝ ՞ ՟
U+056x ՠ ա բ գ դ ե զ է ը թ ժ ի լ խ ծ կ
U+057x հ ձ ղ ճ մ յ ն շ ո չ պ ջ ռ ս վ տ
U+058x ր ց ւ փ ք օ ֆ և ֈ ։ ֊ ֍ ֎ ֏
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Semitic languages

Arabic

Arabic[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+060x  ؀   ؁   ؂   ؃   ؄   ؅  ؆ ؇ ؈ ؉ ؊ ؋ ، ؍ ؎ ؏
U+061x ؐ ؑ ؒ ؓ ؔ ؕ ؖ ؗ ؘ ؙ ؚ ؛ ALM ؝ ؞ ؟
U+062x ؠ ء آ أ ؤ إ ئ ا ب ة ت ث ج ح خ د
U+063x ذ ر ز س ش ص ض ط ظ ع غ ػ ؼ ؽ ؾ ؿ
U+064x ـ ف ق ك ل م ن ه و ى ي ً ٌ ٍ َ ُ
U+065x ِ ّ ْ ٓ ٔ ٕ ٖ ٗ ٘ ٙ ٚ ٛ ٜ ٝ ٞ ٟ
U+066x ٠ ١ ٢ ٣ ٤ ٥ ٦ ٧ ٨ ٩ ٪ ٫ ٬ ٭ ٮ ٯ
U+067x ٰ ٱ ٲ ٳ ٴ ٵ ٶ ٷ ٸ ٹ ٺ ٻ ټ ٽ پ ٿ
U+068x ڀ ځ ڂ ڃ ڄ څ چ ڇ ڈ ډ ڊ ڋ ڌ ڍ ڎ ڏ
U+069x ڐ ڑ ڒ ړ ڔ ڕ ږ ڗ ژ ڙ ښ ڛ ڜ ڝ ڞ ڟ
U+06Ax ڠ ڡ ڢ ڣ ڤ ڥ ڦ ڧ ڨ ک ڪ ګ ڬ ڭ ڮ گ
U+06Bx ڰ ڱ ڲ ڳ ڴ ڵ ڶ ڷ ڸ ڹ ں ڻ ڼ ڽ ھ ڿ
U+06Cx ۀ ہ ۂ ۃ ۄ ۅ ۆ ۇ ۈ ۉ ۊ ۋ ی ۍ ێ ۏ
U+06Dx ې ۑ ے ۓ ۔ ە ۖ ۗ ۘ ۙ ۚ ۛ ۜ  ۝  ۞ ۟
U+06Ex ۠ ۡ ۢ ۣ ۤ ۥ ۦ ۧ ۨ ۩ ۪ ۫ ۬ ۭ ۮ ۯ
U+06Fx ۰ ۱ ۲ ۳ ۴ ۵ ۶ ۷ ۸ ۹ ۺ ۻ ۼ ۽ ۾ ۿ
Notes
1.^ As of Unicode version 15.1
2.^ Unicode code point U+0673 is deprecated as of Unicode version 6.0

Hebrew

Hebrew[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+059x ֑  ֒  ֓  ֔  ֕  ֖  ֗  ֘  ֙  ֚  ֛  ֜  ֝  ֞  ֟ 
U+05Ax ֠  ֡  ֢  ֣  ֤  ֥  ֦  ֧  ֨  ֩  ֪  ֫  ֬  ֭  ֮  ֯ 
U+05Bx ְ  ֱ  ֲ  ֳ  ִ  ֵ  ֶ  ַ  ָ  ֹ  ֺ  ֻ  ּ  ֽ  ־ ֿ 
U+05Cx ׀ ׁ  ׂ  ׃ ׄ  ׅ  ׆ ׇ 
U+05Dx א ב ג ד ה ו ז ח ט י ך כ ל ם מ ן
U+05Ex נ ס ע ף פ ץ צ ק ר ש ת ׯ
U+05Fx װ ױ ײ ׳ ״
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Syriac

Syriac[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+070x ܀ ܁ ܂ ܃ ܄ ܅ ܆ ܇ ܈ ܉ ܊ ܋ ܌ ܍ SAM
U+071x ܐ ܑ ܒ ܓ ܔ ܕ ܖ ܗ ܘ ܙ ܚ ܛ ܜ ܝ ܞ ܟ
U+072x ܠ ܡ ܢ ܣ ܤ ܥ ܦ ܧ ܨ ܩ ܪ ܫ ܬ ܭ ܮ ܯ
U+073x ܰ ܱ ܲ ܳ ܴ ܵ ܶ ܷ ܸ ܹ ܺ ܻ ܼ ܽ ܾ ܿ
U+074x ݀ ݁ ݂ ݃ ݄ ݅ ݆ ݇ ݈ ݉ ݊ ݍ ݎ ݏ
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Mandaic

Samaritan

Thaana

Thaana[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+078x ހ ށ ނ ރ ބ ޅ ކ އ ވ މ ފ ދ ތ ލ ގ ޏ
U+079x ސ ޑ ޒ ޓ ޔ ޕ ޖ ޗ ޘ ޙ ޚ ޛ ޜ ޝ ޞ ޟ
U+07Ax ޠ ޡ ޢ ޣ ޤ ޥ ަ ާ ި ީ ު ޫ ެ ޭ ޮ ޯ
U+07Bx ް ޱ
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Brahmic (Indic) scripts

The range from U+0900 to U+0DFF includes Devanagari, Bengali script, Gurmukhi, Gujarati script, Odia alphabet, Tamil script, Telugu script, Kannada script, Malayalam script, and Sinhala script.

Devanagari

Devanagari[1]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+090x
U+091x
U+092x
U+093x ि
U+094x
U+095x
U+096x
U+097x ॿ
Notes
1.^ As of Unicode version 15.1

Bengali and Assamese

Bengali[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+098x
U+099x
U+09Ax
U+09Bx ি
U+09Cx
U+09Dx
U+09Ex
U+09Fx
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Gurmukhi

Gurmukhi[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+0A0x
U+0A1x
U+0A2x
U+0A3x ਿ
U+0A4x
U+0A5x
U+0A6x
U+0A7x
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Gujarati

Gujarati[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+0A8x
U+0A9x
U+0AAx
U+0ABx િ
U+0ACx
U+0ADx
U+0AEx
U+0AFx ૿
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Oriya

Oriya[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+0B0x
U+0B1x
U+0B2x
U+0B3x ି
U+0B4x
U+0B5x
U+0B6x
U+0B7x
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Tamil

Tamil[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+0B8x
U+0B9x
U+0BAx
U+0BBx ி
U+0BCx
U+0BDx
U+0BEx
U+0BFx
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Telugu

Telugu[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+0C0x
U+0C1x
U+0C2x
U+0C3x ి
U+0C4x
U+0C5x
U+0C6x
U+0C7x ౿
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Kannada

Kannada[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+0C8x
U+0C9x
U+0CAx
U+0CBx ಿ
U+0CCx
U+0CDx
U+0CEx
U+0CFx      
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Malayalam

Malayalam[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+0D0x
U+0D1x
U+0D2x
U+0D3x ി
U+0D4x     
U+0D5x
U+0D6x
U+0D7x ൿ
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Sinhala

Sinhala[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+0D8x
U+0D9x
U+0DAx
U+0DBx
U+0DCx
U+0DDx
U+0DEx
U+0DFx
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Other Brahmic scripts

Other Brahmic and Indic scripts in Unicode include:

Other South and Central Asian writing systems

Southeast Asian writing systems

Georgian

Georgian[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+10Ax
U+10Bx
U+10Cx
U+10Dx
U+10Ex
U+10Fx
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

African scripts

Ge'ez/Ethiopic script

Ethiopic[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+120x
U+121x
U+122x
U+123x
U+124x
U+125x
U+126x
U+127x
U+128x
U+129x
U+12Ax
U+12Bx
U+12Cx
U+12Dx
U+12Ex
U+12Fx
U+130x
U+131x
U+132x
U+133x
U+134x
U+135x
U+136x
U+137x
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Other African scripts

American scripts

Unified Canadian Aboriginal Syllabics

Unified Canadian Aboriginal Syllabics[1]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+140x
U+141x
U+142x
U+143x
U+144x
U+145x
U+146x
U+147x
U+148x
U+149x
U+14Ax
U+14Bx
U+14Cx
U+14Dx
U+14Ex
U+14Fx
U+150x
U+151x
U+152x
U+153x
U+154x
U+155x
U+156x
U+157x
U+158x
U+159x
U+15Ax
U+15Bx
U+15Cx
U+15Dx
U+15Ex
U+15Fx
U+160x
U+161x
U+162x
U+163x
U+164x
U+165x
U+166x
U+167x
Notes
1.^ As of Unicode version 15.1

Other American scripts

Mongolian

Mongolian[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+180x FVS
1
FVS
2
FVS
3
MVS FVS
4
U+181x
U+182x
U+183x
U+184x
U+185x
U+186x
U+187x
U+188x
U+189x
U+18Ax
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Unicode symbols

More information Code, Glyph ...

General Punctuation

112 code points; 111 assigned characters; 24 in the MES-2 subset.

General Punctuation[1][2][3]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+200x NQ
 SP 
MQ
 SP 
EN
 SP 
EM
 SP 
 3/M 
SP
 4/M 
SP
 6/M 
SP
F
 SP 
P
 SP 
TH
 SP 
H
 SP 
ZW
 SP 
ZW
 NJ 
 ZW 
J
 LRM   RLM 
U+201x  NB 
U+202x L
 SEP 
P
 SEP 
 LRE   RLE   PDF   LRO   RLO   NNB 
SP
U+203x
U+204x
U+205x MM
  SP  
U+206x  WJ   ƒ()    ×     ,     +    LRI   RLI   FSI   PDI  I
 SS 
A
 SS 
I
 AFS 
A
 AFS 
NA
 DS 
NO
 DS 
Notes
1.^ As of Unicode version 15.1
2.^ Grey area indicates non-assigned code point
3.^ Unicode code points U+206A - U+206F are deprecated as of Unicode version 3.0

Superscripts and Subscripts

Superscripts and Subscripts[1][2][3]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+207x
U+208x
U+209x
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points
3.^ Refer to the Latin-1 Supplement Unicode block for characters ¹ (U+00B9), ² (U+00B2) and ³ (U+00B3)

Currency Symbols

Currency Symbols[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+20Ax
U+20Bx
U+20Cx
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Letterlike Symbols

Letterlike Symbols[1]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+210x
U+211x
U+212x
U+213x
U+214x
Notes
1.^ As of Unicode version 15.1

Number Forms

Number Forms[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+215x
U+216x
U+217x
U+218x
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Arrows

Arrows[1]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+219x
U+21Ax
U+21Bx
U+21Cx
U+21Dx
U+21Ex
U+21Fx
Notes
1.^ As of Unicode version 15.1

Mathematical symbols

Mathematical Operators[1]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+220x
U+221x
U+222x
U+223x
U+224x
U+225x
U+226x
U+227x
U+228x
U+229x
U+22Ax
U+22Bx
U+22Cx
U+22Dx
U+22Ex
U+22Fx
Notes
1.^ As of Unicode version 15.1

Miscellaneous Technical

Miscellaneous Technical[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+230x
U+231x
U+232x
U+233x
U+234x
U+235x
U+236x
U+237x
U+238x
U+239x
U+23Ax
U+23Bx
U+23Cx
U+23Dx
U+23Ex
U+23Fx
Notes
1.^ As of Unicode version 15.1
2.^ Unicode code points U+2329 and U+232A are deprecated as of Unicode version 5.2

Control Pictures

Control Pictures[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+240x
U+241x
U+242x
U+243x  
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Optical Character Recognition

Optical Character Recognition[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+244x
U+245x
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Enclosed Alphanumerics

Enclosed Alphanumerics[1]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+246x
U+247x
U+248x
U+249x
U+24Ax
U+24Bx
U+24Cx
U+24Dx
U+24Ex
U+24Fx
Notes
1.^ As of Unicode version 15.1

Box Drawing

Box Drawing[1]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+250x
U+251x
U+252x
U+253x
U+254x
U+255x
U+256x
U+257x
Notes
1.^ As of Unicode version 15.1

Block Elements

More information Code, Glyph ...

Geometric Shapes

More information Code, Glyph ...

Miscellaneous Symbols

Miscellaneous Symbols[1]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+260x
U+261x
U+262x
U+263x
U+264x
U+265x
U+266x
U+267x
U+268x
U+269x
U+26Ax
U+26Bx
U+26Cx
U+26Dx
U+26Ex
U+26Fx
Notes
1.^ As of Unicode version 15.1

Symbols for Legacy Computing

Symbols for Legacy Computing[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+1FB0x 🬀 🬁 🬂 🬃 🬄 🬅 🬆 🬇 🬈 🬉 🬊 🬋 🬌 🬍 🬎 🬏
U+1FB1x 🬐 🬑 🬒 🬓 🬔 🬕 🬖 🬗 🬘 🬙 🬚 🬛 🬜 🬝 🬞 🬟
U+1FB2x 🬠 🬡 🬢 🬣 🬤 🬥 🬦 🬧 🬨 🬩 🬪 🬫 🬬 🬭 🬮 🬯
U+1FB3x 🬰 🬱 🬲 🬳 🬴 🬵 🬶 🬷 🬸 🬹 🬺 🬻 🬼 🬽 🬾 🬿
U+1FB4x 🭀 🭁 🭂 🭃 🭄 🭅 🭆 🭇 🭈 🭉 🭊 🭋 🭌 🭍 🭎 🭏
U+1FB5x 🭐 🭑 🭒 🭓 🭔 🭕 🭖 🭗 🭘 🭙 🭚 🭛 🭜 🭝 🭞 🭟
U+1FB6x 🭠 🭡 🭢 🭣 🭤 🭥 🭦 🭧 🭨 🭩 🭪 🭫 🭬 🭭 🭮 🭯
U+1FB7x 🭰 🭱 🭲 🭳 🭴 🭵 🭶 🭷 🭸 🭹 🭺 🭻 🭼 🭽 🭾 🭿
U+1FB8x 🮀 🮁 🮂 🮃 🮄 🮅 🮆 🮇 🮈 🮉 🮊 🮋 🮌 🮍 🮎 🮏
U+1FB9x 🮐 🮑 🮒 🮔 🮕 🮖 🮗 🮘 🮙 🮚 🮛 🮜 🮝 🮞 🮟
U+1FBAx 🮠 🮡 🮢 🮣 🮤 🮥 🮦 🮧 🮨 🮩 🮪 🮫 🮬 🮭 🮮 🮯
U+1FBBx 🮰 🮱 🮲 🮳 🮴 🮵 🮶 🮷 🮸 🮹 🮺 🮻 🮼 🮽 🮾 🮿
U+1FBCx 🯀 🯁 🯂 🯃 🯄 🯅 🯆 🯇 🯈 🯉 🯊
U+1FBDx  
U+1FBEx  
U+1FBFx 🯰 🯱 🯲 🯳 🯴 🯵 🯶 🯷 🯸 🯹
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Dingbats

More information Code, Result ...

East Asian writing systems

CJK Symbols and Punctuation

CJK Symbols and Punctuation[1]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+300x ID
 SP 
U+301x
U+302x
U+303x   
Notes
1.^ As of Unicode version 15.1

Hiragana

Hiragana[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+304x
U+305x
U+306x
U+307x
U+308x
U+309x
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Katakana

Katakana[1]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+30Ax
U+30Bx
U+30Cx
U+30Dx
U+30Ex
U+30Fx
Notes
1.^ As of Unicode version 15.1

Bopomofo

Bopomofo[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+310x
U+311x
U+312x
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Hangul Jamo and Compatibility Jamo

Hangul Jamo[1]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+110x
U+111x
U+112x
U+113x
U+114x
U+115x  HC 
F
U+116x  HJ 
F
U+117x
U+118x
U+119x
U+11Ax
U+11Bx
U+11Cx
U+11Dx
U+11Ex
U+11Fx
Notes
1.^ As of Unicode version 15.1
2. : Hangul jamo with a green background are modern-usage characters which can be converted into precomposed Hangul syllables under Unicode normalization form NFC.
Hangul jamo with a white background are used for archaic Korean only, and there are no corresponding precomposed Hangul syllables.
"Conjoining Jamo Behavior" (PDF). The Unicode Standard. March 2020.
Hangul Compatibility Jamo[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+313x
U+314x
U+315x
U+316x   HF  
U+317x
U+318x
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Kanbun

Kanbun[1]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+319x
Notes
1.^ As of Unicode version 15.1

Enclosed CJK Letters and Months

Enclosed CJK Letters and Months[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+320x
U+321x
U+322x
U+323x
U+324x
U+325x
U+326x
U+327x
U+328x
U+329x
U+32Ax
U+32Bx
U+32Cx
U+32Dx
U+32Ex
U+32Fx
Notes
1.^ As of Unicode version 15.1
2.^ Grey area indicates non-assigned code point

CJK Compatibility

CJK Compatibility[1]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+330x
U+331x
U+332x
U+333x
U+334x
U+335x
U+336x
U+337x
U+338x
U+339x
U+33Ax
U+33Bx
U+33Cx
U+33Dx
U+33Ex
U+33Fx
Notes
1.^ As of Unicode version 15.1

CJK Compatibility Forms

CJK Compatibility Forms[1]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+FE3x ︿
U+FE4x
Notes
1.^ As of Unicode version 15.1

CJK Unified Ideographs

CJK Radicals

Other East Asian writing systems

Alphabetic Presentation Forms

Alphabetic Presentation Forms[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+FB0x
U+FB1x  
U+FB2x
U+FB3x
U+FB4x
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Ancient and historic scripts

Shavian

Shavian[1]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+1045x 𐑐 𐑑 𐑒 𐑓 𐑔 𐑕 𐑖 𐑗 𐑘 𐑙 𐑚 𐑛 𐑜 𐑝 𐑞 𐑟
U+1046x 𐑠 𐑡 𐑢 𐑣 𐑤 𐑥 𐑦 𐑧 𐑨 𐑩 𐑪 𐑫 𐑬 𐑭 𐑮 𐑯
U+1047x 𐑰 𐑱 𐑲 𐑳 𐑴 𐑵 𐑶 𐑷 𐑸 𐑹 𐑺 𐑻 𐑼 𐑽 𐑾 𐑿
Notes
1.^ As of Unicode version 15.1

Notational systems

Braille

Music

Shorthand

Sutton SignWriting

Emoji

Alchemical symbols

Alchemical Symbols[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+1F70x 🜀 🜁 🜂 🜃 🜄 🜅 🜆 🜇 🜈 🜉 🜊 🜋 🜌 🜍 🜎 🜏
U+1F71x 🜐 🜑 🜒 🜓 🜔 🜕 🜖 🜗 🜘 🜙 🜚 🜛 🜜 🜝 🜞 🜟
U+1F72x 🜠 🜡 🜢 🜣 🜤 🜥 🜦 🜧 🜨 🜩 🜪 🜫 🜬 🜭 🜮 🜯
U+1F73x 🜰 🜱 🜲 🜳 🜴 🜵 🜶 🜷 🜸 🜹 🜺 🜻 🜼 🜽 🜾 🜿
U+1F74x 🝀 🝁 🝂 🝃 🝄 🝅 🝆 🝇 🝈 🝉 🝊 🝋 🝌 🝍 🝎 🝏
U+1F75x 🝐 🝑 🝒 🝓 🝔 🝕 🝖 🝗 🝘 🝙 🝚 🝛 🝜 🝝 🝞 🝟
U+1F76x 🝠 🝡 🝢 🝣 🝤 🝥 🝦 🝧 🝨 🝩 🝪 🝫 🝬 🝭 🝮 🝯
U+1F77x 🝰 🝱 🝲 🝳 🝴 🝵 🝶 🝻 🝼 🝽 🝾 🝿
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Game symbols

Mahjong Tiles

Mahjong Tiles[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+1F00x 🀀 🀁 🀂 🀃 🀄 🀅 🀆 🀇 🀈 🀉 🀊 🀋 🀌 🀍 🀎 🀏
U+1F01x 🀐 🀑 🀒 🀓 🀔 🀕 🀖 🀗 🀘 🀙 🀚 🀛 🀜 🀝 🀞 🀟
U+1F02x 🀠 🀡 🀢 🀣 🀤 🀥 🀦 🀧 🀨 🀩 🀪 🀫
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Domino Tiles

Domino Tiles[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+1F03x 🀰 🀱 🀲 🀳 🀴 🀵 🀶 🀷 🀸 🀹 🀺 🀻 🀼 🀽 🀾 🀿
U+1F04x 🁀 🁁 🁂 🁃 🁄 🁅 🁆 🁇 🁈 🁉 🁊 🁋 🁌 🁍 🁎 🁏
U+1F05x 🁐 🁑 🁒 🁓 🁔 🁕 🁖 🁗 🁘 🁙 🁚 🁛 🁜 🁝 🁞 🁟
U+1F06x 🁠 🁡 🁢 🁣 🁤 🁥 🁦 🁧 🁨 🁩 🁪 🁫 🁬 🁭 🁮 🁯
U+1F07x 🁰 🁱 🁲 🁳 🁴 🁵 🁶 🁷 🁸 🁹 🁺 🁻 🁼 🁽 🁾 🁿
U+1F08x 🂀 🂁 🂂 🂃 🂄 🂅 🂆 🂇 🂈 🂉 🂊 🂋 🂌 🂍 🂎 🂏
U+1F09x 🂐 🂑 🂒 🂓
Notes
  1. ^ As of Unicode version 15.1
  2. ^ Grey areas indicate non-assigned code points

Playing Cards

Playing Cards[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+1F0Ax 🂠 🂡 🂢 🂣 🂤 🂥 🂦 🂧 🂨 🂩 🂪 🂫 🂬 🂭 🂮
U+1F0Bx 🂱 🂲 🂳 🂴 🂵 🂶 🂷 🂸 🂹 🂺 🂻 🂼 🂽 🂾 🂿
U+1F0Cx 🃁 🃂 🃃 🃄 🃅 🃆 🃇 🃈 🃉 🃊 🃋 🃌 🃍 🃎 🃏
U+1F0Dx 🃑 🃒 🃓 🃔 🃕 🃖 🃗 🃘 🃙 🃚 🃛 🃜 🃝 🃞 🃟
U+1F0Ex 🃠 🃡 🃢 🃣 🃤 🃥 🃦 🃧 🃨 🃩 🃪 🃫 🃬 🃭 🃮 🃯
U+1F0Fx 🃰 🃱 🃲 🃳 🃴 🃵
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Chess Symbols

Chess Symbols[1][2]
Official Unicode Consortium code chart (PDF)
 0123456789ABCDEF
U+1FA0x 🨀 🨁 🨂 🨃 🨄 🨅 🨆 🨇 🨈 🨉 🨊 🨋 🨌 🨍 🨎 🨏
U+1FA1x 🨐 🨑 🨒 🨓 🨔 🨕 🨖 🨗 🨘 🨙 🨚 🨛 🨜 🨝 🨞 🨟
U+1FA2x 🨠 🨡 🨢 🨣 🨤 🨥 🨦 🨧 🨨 🨩 🨪 🨫 🨬 🨭 🨮 🨯
U+1FA3x 🨰 🨱 🨲 🨳 🨴 🨵 🨶 🨷 🨸 🨹 🨺 🨻 🨼 🨽 🨾 🨿
U+1FA4x 🩀 🩁 🩂 🩃 🩄 🩅 🩆 🩇 🩈 🩉 🩊 🩋 🩌 🩍 🩎 🩏
U+1FA5x 🩐 🩑 🩒 🩓
U+1FA6x 🩠 🩡 🩢 🩣 🩤 🩥 🩦 🩧 🩨 🩩 🩪 🩫 🩬 🩭
Notes
1.^ As of Unicode version 15.1
2.^ Grey areas indicate non-assigned code points

Special areas and format characters

See also


References

  1. Carey, Patrick (2015). New perspectives on XML : comprehensive. Sasha Vodnik (3rd ed.). p. 36. ISBN 978-1-285-07582-2. OCLC 904969019.
  2. Deprecated as of Unicode version 5.2.0 "U+0149 Latin small letter n preceded by apostrophe was encoded for use in Afrikaans. The character is deprecated, and its use is strongly discouraged. In nearly all cases it is better represented by a sequence of an apostrophe followed by “n”." pg. 208

Share this article:

This article uses material from the Wikipedia article List_of_Unicode_characters, and is written by contributors. Text is available under a CC BY-SA 4.0 International License; additional terms may apply. Images, videos and audio are available under their respective licenses.