Unicode has defined over 149,000 characters for use in computer-based environments. Many of them represent digits, and some others represent numbers that require more than one digit. This report displays a selection of them in a variety of scripts, some ancient or obscure. Because Unicode can handle over one million characters, the Unicode Consortium is not currently constrained by space limitations, and more scripts are likely to be added in the future.
In this report, nearly every script that supports at least the digits 1 through 9 is included. However, any script that does not support these is omitted. Although Unicode has symbols for many fractions, they are not shown in this report, in part because their support varies greatly from one script to another.
Wikipedia has an article about the classifications of Unicode numerals. Also, the present author has a report that covers various Unicode letters and digits intended for mathematical use.
Throughout this report, each code point — the number the Unicode uses to represent a character — is shown in hexadecimal rather than decimal notation. Those less than 1000 are shown with leading zeroes, as is the custom in Unicode documentation.
Few fonts support all of these characters. In general, the Consortium selects the lowest available numbers as code points whenever it defines new characters. Hence, characters with low numbers, having been established earlier, are more likely to be supported than those with high numbers. On some displays, an unimplemented character will be indicated with an image such as the following, to stand for the character whose code point is for example 26E84:
Many scripts support the decimal digits zero through nine, with few if any other integers. Their listings are divided into two pages:
Here is sample line from one of those tables:
– 0 – | – 1 – | – 2 – | – 3 – | – 4 – | – 5 – | – 6 – | – 7 – | – 8 – | – 9 – | ||
1C00-1C4F Lepcha | ᱀ 1C40 |
᱁ 1C41 |
᱂ 1C42 |
᱃ 1C43 |
᱄ 1C44 |
᱅ 1C45 |
᱆ 1C46 |
᱇ 1C47 |
᱈ 1C48 |
᱉ 1C49 |
Several scripts support the twenty digits of vigesimal notation (zero through nineteen). Sample line:
16E40-16E9F Medefaidrin | 𖺀 0: 16E80 |
𖺁 1: 16E81 |
𖺂 2: 16E82 |
𖺃 3: 16E83 |
𖺄 4: 16E84 |
𖺅 5: 16E85 |
𖺆 6: 16E86 |
𖺇 7: 16E87 |
𖺈 8: 16E88 |
𖺉 9: 16E89 |
𖺊 10: 16E8A |
𖺋 11: 16E8B |
𖺌 12: 16E8C |
𖺍 13: 16E8D |
𖺎 14: 16E8E |
𖺏 15: 16E8F |
𖺐 16: 16E90 |
𖺑 17: 16E91 |
𖺒 18: 16E92 |
𖺓 19: 16E93 |
In each cell, the number before the colon is the value of the digit, in decimal. After the colon is the code point, in hexadecimal.
Displayed similarly are some other scripts that may include multiples of 10, 100, 1000, and so forth. Sample:
102E0-102FF Coptic Epact Numbers | 𐋡 1: 102E1 |
𐋢 2: 102E2 |
𐋣 3: 102E3 |
𐋤 4: 102E4 |
𐋥 5: 102E5 |
𐋦 6: 102E6 |
𐋧 7: 102E7 |
𐋨 8: 102E8 |
𐋩 9: 102E9 |
𐋪 10: 102EA |
𐋫 20: 102EB |
𐋬 30: 102EC |
𐋭 40: 102ED |
𐋮 50: 102EE |
𐋯 60: 102EF |
𐋰 70: 102F0 |
𐋱 80: 102F1 |
𐋲 90: 102F2 |
|
𐋳 100: 102F3 |
𐋴 200: 102F4 |
𐋵 300: 102F5 |
𐋶 400: 102F6 |
𐋷 500: 102F7 |
𐋸 600: 102F8 |
𐋹 700: 102F9 |
𐋺 800: 102FA |
𐋻 900: 102FB |
The following scripts are among those anticipated to be added for Unicode 16.0:
10D40-10D8F Garay | digits 0-9: 10D40-10D49 |
116D0-116FF Myanmar Extended-C | Pao digits 0-9: 116D0-116D9 |
Eastern Pwo Karen digits 0-9: 116DA-116E3 | |
11BC0-11BFF Sunuwar | digits 0-9: 11BF0-11BF9 |
16100-1613F Gurung Khema | digits 0-9: 16130-16139 |
16D40-16D7F Kirat Rai | digits 0-9: 16D70-16D79 |
1E5D0-1E5FF Ol Onal | digits 0-9: 1E5F1-1E5FA |
The present author has not been able to find a Unicode treatment of Babylonian cuneiform numerals, but they would be a valuable addition to this report. Still, similar markings are to be found in Unicode's Cuneiform Numbers and Punctuation chart.