Network Working Group                                  P. Faltstrom, Ed.
Internet-Draft                                                     Cisco
Intended status: Standards Track                        October 23, 2006
Expires: April 26, 2007


                     The Unicode Codepoints and IDN
                 draft-faltstrom-idnabis-tables-01.txt

Status of this Memo

   By submitting this Internet-Draft, each author represents that any
   applicable patent or other IPR claims of which he or she is aware
   have been or will be disclosed, and any of which he or she becomes
   aware will be disclosed, in accordance with Section 6 of BCP 79.

   Internet-Drafts are working documents of the Internet Engineering
   Task Force (IETF), its areas, and its working groups.  Note that
   other groups may also distribute working documents as
   Internet-Drafts.

   Internet-Drafts are draft documents valid for a maximum of six months
   and may be updated, replaced, or obsoleted by other documents at any
   time.  It is inappropriate to use Internet-Drafts as reference
   material or to cite them other than as "work in progress."

   The list of current Internet-Drafts can be accessed at
   http://www.ietf.org/ietf/1id-abstracts.txt.

   The list of Internet-Draft Shadow Directories can be accessed at
   http://www.ietf.org/shadow.html.

   This Internet-Draft will expire on April 26, 2007.

Copyright Notice

   Copyright (C) The Internet Society (2006).

Abstract

   This document specifies the codepoints in the Unicode tables that are
   safe for use in the standards for Internationalized Domain Names,
   IDN.


Faltstrom                Expires April 26, 2007                 [Page 1]

Internet-Draft             Unicode Codepoints               October 2006


Table of Contents

   1.  Introduction
   2.  Classes of Codepoints in Unicode
     2.1.  Classes of Code Points in Unicode 5.0
     2.2.  Problematic classes
       2.2.1.  Lu (Input)
       2.2.2.  Lt (Input)
       2.2.3.  Lo (Maybe)
       2.2.4.  Mn (Possibly not)
       2.2.5.  Mc (Maybe)
       2.2.6.  Me (Possibly not)
       2.2.7.  Nd (Maybe)
       2.2.8.  Nl (Possibly not)
       2.2.9.  Pd (Exclude)
       2.2.10. Po (Exclude)
   3.  Codepoint blocks in Unicode
     3.1.  Unicode Blocks
     3.2.  Notes
       3.2.1.  Note [1]
       3.2.2.  Note [2]
       3.2.3.  Note [3]
       3.2.4.  Note [4]
       3.2.5.  Note [5]
       3.2.6.  Note [6]
       3.2.7.  Note [7]
       3.2.8.  Note [8]
       3.2.9.  Note [9]
       3.2.10. Note [10]
       3.2.11. Note [11]
       3.2.12. Note [12]
   4.  Individual codepoints in Unicode
     4.1.  0000-007F Basic Latin
     4.2.  0080-00FF Latin-1 Supplement
     4.3.  0100-017F Latin Extended-A
     4.4.  0180-024F Latin Extended-B
     4.5.  0250-02AF IPA Extensions
     4.6.  02B0-02FF Spacing Modifier Letters
     4.7.  0300-036F Combining Diacritical Marks
     4.8.  0370-03FF Greek and Coptic
     4.9.  0400-04FF Cyrillic
     4.10. 0530-058F Armenian
     4.11. 0590-05FF Hebrew
     4.12. 0600-06FF Arabic
     4.13. 0700-074F Syriac
     4.14. 0750-077F Arabic supplement
     4.15. 0780-07BF Thaana
     4.16. 07C0-07FF NKo
     4.17. 0900-097F Devanagari
     4.18. 0980-09FF Bengali
     4.19. 0A00-0A7F Gurmukhi
     4.20. 0A80-0AFF Gujarati
     4.21. 0B00-0B7F Oriya
     4.22. 0B80-0BFF Tamil
     4.23. 0C00-0C7F Telugu
     4.24. 0C80-0CFF Kannada
     4.25. 0D00-0D7F Malayalam
     4.26. 0D80-0DFF Sinhala
     4.27. 0E00-0E7F Thai
     4.28. 0E80-0EFF Lao
     4.29. 0F00-0FFF Tibetan
   5.  Outstanding Issues
   6.  IANA Considerations
   7.  Security Considerations
   8.  Contributors
   9.  Acknowledgements
   10. References
     10.1. Normative References
     10.2. Informative References
   Author's Address
   Intellectual Property and Copyright Statements

1.  Introduction

   RFC 4690 [RFC4690] describes an inclusion-based approach for
   selecting the codepoints from The Unicode Standard [Unicode5] that
   should be included in the list of codepoints that may be used in
   Internationalized Domain Names.  Specifically, RFC 4690 [RFC4690]
   says the following:

      The IAB has concluded that there is a consensus within the broader
      community that lists of code points should be specified by the use
      of an inclusion-based mechanism (i.e., identifying the characters
      that are permitted), rather than by excluding a small number of
      characters from the total Unicode set as Stringprep [RFC3454] and
      Nameprep [RFC3491] do today.  That conclusion should be reviewed
      by the IETF community and action taken as appropriate.

   This document reviews the collections of codepoints in Unicode from
   two perspectives, those of character classes and those associated
   with individual characters and code blocks, in order to identify
   groups of characters that should clearly be included in IDNs, those
   that should clearly not be included, and those that still raise open
   issues, sometimes because there are complex trade-offs involved and
   sometimes just due to lack of sufficient information.  It is based on
   Unicode 5.0, rather than the earlier Unicode 3.2, in order to take
   advantage of the expanded character repertoire and better definitions
   in the newer version.

   This document is part of a series of documents that, together,
   constitute a preliminary proposal for updating the IDNA standards to
   resolve issues uncovered in recent years, cover a broader range of
   scripts, and provide for migration to newer versions of Unicode.  See
   [IDNA-issues] for a broader discussion.

2.  Classes of Codepoints in Unicode

   The Unicode Standard [Unicode5] classifies codepoints into a number
   of different classes.  The table in Section 2.1 lists, for each
   Unicode block, the classes that occur in it, and notes in the
   "Discussion" column whether the characters are appropriate to
   include.  Classes not identified in this chart, such as "Cc" (control
   characters), are not permitted in IDNs.

2.1.  Classes of Code Points in Unicode 5.0

| Discussion        | C1 | Start | End   | Block                                  | Lu | Ll | Lt | Lo | Mc | Nd | Mn | Classes
+-------------------+----+-------+-------+----------------------------------------+----+----+----+----+----+----+----+----------------------------------------
| Include           |    | 0000  | 007F  | Basic Latin                            | X  | X  |    |    |    | X  |    | Ps Lu Cc Pd Nd Sm Pc Po Zs Ll Sk Pe Sc
| Include           |    | 0080  | 00FF  | Latin-1 Supplement                     | X  | X  |    |    |    |    |    | Pf Lu Cc So Sm Po No Pi Cf Zs Ll Sk Sc
| Include           |    | 0100  | 017F  | Latin Extended-A                       | X  | X  |    |    |    |    |    | Lu Ll
| Include           |    | 0180  | 024F  | Latin Extended-B                       | X  | X  | X  | X  |    |    |    | Lo Lu Ll Lt
| Note [1]          |    | 0250  | 02AF  | IPA Extensions                         |    | X  |    | X  |    |    |    | Lo Ll
| No                | No | 02B0  | 02FF  | Spacing Modifier Letters               |    |    |    |    |    |    |    | Lm Sk
| Note [2]          | Mn | 0300  | 036F  | Combining Diacritical Marks            |    |    |    |    |    |    | X  | Mn
| Include           |    | 0370  | 03FF  | Greek and Coptic                       | X  | X  |    |    |    |    |    | Lm Lu Sm Ll Sk Po
| Include           |    | 0400  | 04FF  | Cyrillic                               | X  | X  |    |    |    |    | X  | Lu Me Mn So Ll
| Include           |    | 0500  | 052F  | Cyrillic Supplement                    | X  | X  |    |    |    |    |    | Lu Ll
| Include           |    | 0530  | 058F  | Armenian                               | X  | X  |    |    |    |    |    | Lm Lu Pd Ll Po
| Note [3]          |    | 0590  | 05FF  | Hebrew                                 |    |    |    | X  |    |    | X  | Lo Mn Po
| Note [3]          |    | 0600  | 06FF  | Arabic                                 |    |    |    | X  |    | X  | X  | Lm So Nd Po Lo Cf Mn Me Sc
| Note [3]          |    | 0700  | 074F  | Syriac                                 |    |    |    | X  |    |    | X  | Lo Mn Cf Po
| Note [3]          |    | 0750  | 077F  | Arabic Supplement                      |    |    |    | X  |    |    |    | Lo
| Note [3]          |    | 0780  | 07BF  | Thaana                                 |    |    |    | X  |    |    | X  | Lo Mn
| Note [3]          |    | 07C0  | 07FF  | NKo                                    |    |    |    | X  |    | X  | X  | Lm Lo So Mn Nd Po
| Note [4]          |    | 0900  | 097F  | Devanagari                             |    |    |    | X  | X  | X  | X  | Lo Mc Mn Nd Po
| Note [4]          |    | 0980  | 09FF  | Bengali                                |    |    |    | X  | X  | X  | X  | Lo No So Mc Mn Nd Sc
| Note [4]          |    | 0A00  | 0A7F  | Gurmukhi                               |    |    |    | X  | X  | X  | X  | Lo Mc Mn Nd
| Note [4]          |    | 0A80  | 0AFF  | Gujarati                               |    |    |    | X  | X  | X  | X  | Lo Mc Mn Nd Sc
| Note [4]          |    | 0B00  | 0B7F  | Oriya                                  |    |    |    | X  | X  | X  | X  | Lo So Mc Mn Nd
| Note [4, 5]       |    | 0B80  | 0BFF  | Tamil                                  |    |    |    | X  | X  | X  | X  | Lo No So Mc Mn Nd Sc
| Note [4]          |    | 0C00  | 0C7F  | Telugu                                 |    |    |    | X  | X  | X  | X  | Lo Mn Mc Nd
| Note [4]          |    | 0C80  | 0CFF  | Kannada                                |    |    |    | X  | X  | X  | X  | Lo So Mn Mc Nd
| Note [4]          |    | 0D00  | 0D7F  | Malayalam                              |    |    |    | X  | X  | X  | X  | Lo Mn Mc Nd
| Note [4]          |    | 0D80  | 0DFF  | Sinhala                                |    |    |    | X  | X  |    | X  | Lo Mn Mc Po
| Note [4]          |    | 0E00  | 0E7F  | Thai                                   |    |    |    | X  |    | X  | X  | Lm Lo Mn Nd Po Sc
| Note [4]          |    | 0E80  | 0EFF  | Lao                                    |    |    |    | X  |    | X  | X  | Lm Lo Mn Nd
| Note [6]          |    | 0F00  | 0FFF  | Tibetan                                |    |    |    | X  | X  | X  | X  | Ps Mc So Nd Po Lo No Mn Pe
| Note [7]          |    | 1000  | 109F  | Myanmar                                |    |    |    | X  | X  | X  | X  | Lo Mn Mc Nd Po
| Include           |    | 10A0  | 10FF  | Georgian                               | X  |    |    | X  |    |    |    | Lm Lo Lu Po
| Note [8]          |    | 1100  | 11FF  | Hangul Jamo                            |    |    |    | X  |    |    |    | Lo
| Include           |    | 1200  | 137F  | Ethiopic                               |    |    |    | X  |    |    | X  | Lo No So Mn Po
| Include           |    | 1380  | 139F  | Ethiopic Supplement                    |    |    |    | X  |    |    |    | Lo So
| Note [9]          |    | 13A0  | 13FF  | Cherokee                               |    |    |    | X  |    |    |    | Lo
| Include           |    | 1400  | 167F  | Unified Canadian Aboriginal Syllabics  |    |    |    | X  |    |    |    | Lo Po
| Note [10]         |    | 1680  | 169F  | Ogham                                  |    |    |    | X  |    |    |    | Ps Lo Zs Pe
| Include           |    | 16A0  | 16FF  | Runic                                  |    |    |    | X  |    |    |    | Nl Lo Po
| Note [10]         |    | 1700  | 171F  | Tagalog                                |    |    |    | X  |    |    | X  | Lo Mn
| Include           |    | 1720  | 173F  | Hanunoo                                |    |    |    | X  |    |    | X  | Lo Mn Po
| Include           |    | 1740  | 175F  | Buhid                                  |    |    |    | X  |    |    | X  | Lo Mn
| Include           |    | 1760  | 177F  | Tagbanwa                               |    |    |    | X  |    |    | X  | Lo Mn
| Note [8]          |    | 1780  | 17FF  | Khmer                                  |    |    |    | X  | X  | X  | X  | Lm Lo No Mn Mc Cf Nd Sc Po
| Include           |    | 1800  | 18AF  | Mongolian                              |    |    |    | X  |    | X  | X  | Lm Lo Pd Zs Mn Nd Po
| Note [11]         |    | 1900  | 194F  | Limbu                                  |    |    |    | X  | X  | X  | X  | Lo So Mc Mn Nd Po
| Note [11]         |    | 1950  | 197F  | Tai Le                                 |    |    |    | X  |    |    |    | Lo
| Note [11]         |    | 1980  | 19DF  | New Tai Lue                            |    |    |    | X  | X  | X  |    | Lo Mc Nd Po
| Note [8]          | No | 19E0  | 19FF  | Khmer Symbols                          |    |    |    |    |    |    |    | So
| Note [11]         |    | 1A00  | 1A1F  | Buginese                               |    |    |    | X  | X  |    | X  | Lo Mc Mn Po
| Note [11]         |    | 1B00  | 1B7F  | Balinese                               |    |    |    | X  | X  | X  | X  | Lo So Mc Mn Nd Po
| Note [9, 11]      |    | 1D00  | 1D7F  | Phonetic Extensions                    |    | X  |    |    |    |    |    | Lm Ll
| Note [9, 11]      |    | 1D80  | 1DBF  | Phonetic Extensions Supplement         |    | X  |    |    |    |    |    | Lm Ll
| No                | Mn | 1DC0  | 1DFF  | Combining Diacritical Marks Supplement |    |    |    |    |    |    | X  | Mn
| Note [11]         |    | 1E00  | 1EFF  | Latin Extended Additional              | X  | X  |    |    |    |    |    | Lu Ll
| Note [11]         |    | 1F00  | 1FFF  | Greek Extended                         | X  | X  | X  |    |    |    |    | Lu Ll Sk Lt
| No                | No | 2000  | 206F  | General Punctuation                    |    |    |    |    |    |    |    | Ps Pf Pd Sm Pc Zp Po Pi Zs Cf Pe Zl
| No                |    | 2070  | 209F  | Superscripts and Subscripts            |    | X  |    |    |    |    |    | Lm Ps No Sm Ll Pe
| No                | No | 20A0  | 20CF  | Currency Symbols                       |    |    |    |    |    |    |    | Sc
| No                | Mn | 20D0  | 20FF  | Combining                              |    |    |    |    |    |    | X  | Me Mn
| No                |    | 2100  | 214F  | Letterlike Symbols                     | X  | X  |    | X  |    |    |    | Lo Lu So Sm Ll
| No                |    | 2150  | 218F  | Number Forms                           | X  | X  |    |    |    |    |    | Nl Lu No Ll
| No                | No | 2190  | 21FF  | Arrows                                 |    |    |    |    |    |    |    | So Sm
| No                | No | 2200  | 22FF  | Mathematical Operators                 |    |    |    |    |    |    |    | Sm
| No                | No | 2300  | 23FF  | Miscellaneous Technical                |    |    |    |    |    |    |    | Ps So Sm Pe
| No                | No | 2400  | 243F  | Control Pictures                       |    |    |    |    |    |    |    | So
| No                | No | 2440  | 245F  | Optical Character Recognition          |    |    |    |    |    |    |    | So
| No                | No | 2460  | 24FF  | Enclosed Alphanumerics                 |    |    |    |    |    |    |    | No So
| No                | No | 2500  | 257F  | Box Drawing                            |    |    |    |    |    |    |    | So
| No                | No | 2580  | 259F  | Block Elements                         |    |    |    |    |    |    |    | So
| No                | No | 25A0  | 25FF  | Geometric Shapes                       |    |    |    |    |    |    |    | So Sm
| No                | No | 2600  | 26FF  | Miscellaneous Symbols                  |    |    |    |    |    |    |    | So Sm
| No                | No | 2700  | 27BF  | Dingbats                               |    |    |    |    |    |    |    | Ps No So Pe
| No                | No | 27C0  | 27EF  | Miscellaneous Mathematical Symbols-A   |    |    |    |    |    |    |    | Ps Sm Pe
| No                | No | 27F0  | 27FF  | Supplemental Arrows-A                  |    |    |    |    |    |    |    | Sm
| No                | No | 2800  | 28FF  | Braille Patterns                       |    |    |    |    |    |    |    | So
| No                | No | 2900  | 297F  | Supplemental Arrows-B                  |    |    |    |    |    |    |    | Sm
| No                | No | 2980  | 29FF  | Miscellaneous Mathematical Symbols-B   |    |    |    |    |    |    |    | Ps Sm Pe
| No                | No | 2A00  | 2AFF  | Supplemental Mathematical Symbols      |    |    |    |    |    |    |    | Sm
| No                | No | 2B00  | 2BFF  | Miscellaneous Symbols and Arrows       |    |    |    |    |    |    |    | So
| Note [11]         |    | 2C00  | 2C5F  | Glagolitic                             | X  | X  |    |    |    |    |    | Lu Ll
| Note [11]         |    | 2C60  | 2C7F  | Latin Extended-C                       | X  | X  |    |    |    |    |    | Lu Ll
| Include           |    | 2C80  | 2CFF  | Coptic                                 | X  | X  |    |    |    |    |    | Lu No So Ll Po
| Note [11]         |    | 2D00  | 2D2F  | Georgian Supplement                    |    | X  |    |    |    |    |    | Ll
| Include           |    | 2D30  | 2D7F  | Tifinagh                               |    |    |    | X  |    |    |    | Lm Lo
| Include           |    | 2D80  | 2DDF  | Ethiopic Extended                      |    |    |    | X  |    |    |    | Lo
| No                | No | 2E00  | 2E7F  | Supplemental Punctuation               |    |    |    |    |    |    |    | Pf Pd Pi Po
| No                | No | 2E80  | 2EFF  | CJK Radicals Supplement                |    |    |    |    |    |    |    | So
| No, Note [12]     | No | 2F00  | 2FDF  | Kangxi Radicals                        |    |    |    |    |    |    |    | So
| No, Note [12]     | No | 2FF0  | 2FFF  | Ideographic Description Characters     |    |    |    |    |    |    |    | So
| No                |    | 3000  | 303F  | CJK Symbols and Punctuation            |    |    |    | X  |    |    | X  | Ps Lm Pd So Po Nl Lo Zs Mn Pe
| Include           |    | 3040  | 309F  | Hiragana                               |    |    |    | X  |    |    | X  | Lm Lo Mn Sk
| Include           |    | 30A0  | 30FF  | Katakana                               |    |    |    | X  |    |    |    | Lm Lo Pd Po
| Note [11, 12]     |    | 3100  | 312F  | Bopomofo                               |    |    |    | X  |    |    |    | Lo
| Note [8]          |    | 3130  | 318F  | Hangul Compatibility Jamo              |    |    |    | X  |    |    |    | Lo
| No                | No | 3190  | 319F  | Kanbun                                 |    |    |    |    |    |    |    | No So
| Note [11, 12]     |    | 31A0  | 31BF  | Bopomofo Extended                      |    |    |    | X  |    |    |    | Lo
| No, Note [12]     | No | 31C0  | 31EF  | CJK Strokes                            |    |    |    |    |    |    |    | So
| Note [12]         |    | 31F0  | 31FF  | Katakana Phonetic Extensions           |    |    |    | X  |    |    |    | Lo
| No                | No | 3200  | 32FF  | Enclosed CJK Letters and Months        |    |    |    |    |    |    |    | No So
| No                | No | 3300  | 33FF  | CJK Compatibility                      |    |    |    |    |    |    |    | So
| Note [12]         |    | 3400  | 4DBF  | CJK Unified Ideographs Extension A     |    |    |    | X  |    |    |    | Lo
| No                | No | 4DC0  | 4DFF  | Yijing Hexagram Symbols                |    |    |    |    |    |    |    | So
| Note [12]         |    | 4E00  | 9FFF  | CJK Unified Ideographs                 |    |    |    | X  |    |    |    | Lo
| Note [11]         |    | A000  | A48F  | Yi Syllables                           |    |    |    | X  |    |    |    | Lm Lo
| No, Note [11, 12] | No | A490  | A4CF  | Yi Radicals                            |    |    |    |    |    |    |    | So
| No                | No | A700  | A71F  | Modifier Tone Letters                  |    |    |    |    |    |    |    | Lm Sk
| No                | No | A720  | A7FF  | Latin Extended-D                       |    |    |    |    |    |    |    | Sk
| Note [11]         |    | A800  | A82F  | Syloti Nagri                           |    |    |    | X  | X  |    | X  | Lo So Mn Mc
| Note [11]         |    | A840  | A87F  | Phags-pa                               |    |    |    | X  |    |    |    | Lo Po
| Note [8]          |    | AC00  | D7AF  | Hangul Syllables                       |    |    |    | X  |    |    |    | Lo
| No                | No | D800  | DB7F  | High Surrogates                        |    |    |    |    |    |    |    | Cs
| No                | No | DB80  | DBFF  | High Private Use Surrogates            |    |    |    |    |    |    |    | Cs
| No                | No | DC00  | DFFF  | Low Surrogates                         |    |    |    |    |    |    |    | Cs
| No                | No | E000  | F8FF  | Private Use                            |    |    |    |    |    |    |    | Co
| No                |    | F900  | FAFF  | CJK Compatibility Ideographs           |    |    |    | X  |    |    |    | Lo
| No                |    | FB00  | FB4F  | Alphabetic Presentation Forms          |    | X  |    | X  |    |    | X  | Lo Mn Sm Ll
| No                |    | FB50  | FDFF  | Arabic Presentation Forms-A            |    |    |    | X  |    |    |    | Ps Lo So Pe Sc
| No                | Mn | FE00  | FE0F  | Variation Selectors                    |    |    |    |    |    |    | X  | Mn
| No                | No | FE10  | FE1F  | Vertical Forms                         |    |    |    |    |    |    |    | Ps Pe Po
| No                | Mn | FE20  | FE2F  | Combining Half Marks                   |    |    |    |    |    |    | X  | Mn
| No                | No | FE30  | FE4F  | CJK Compatibility Forms                |    |    |    |    |    |    |    | Ps Pd Pe Pc Po
| No                | No | FE50  | FE6F  | Small Form Variants                    |    |    |    |    |    |    |    | Ps Pd Sm Pe Sc Po
| No                |    | FE70  | FEFF  | Arabic Presentation Forms-B            |    |    |    | X  |    |    |    | Lo Cf
| No                |    | FEFF  | FEFF  | Specials                               |    |    |    |    |    |    |    |
| No                |    | FF00  | FFEF  | Halfwidth and Fullwidth Forms          | X  | X  |    | X  |    | X  |    | Lm Ps Lu Pd So Nd Sm Pc Po Lo Ll Pe Sk Sc
| No                | No | FFF0  | FFFF  | Specials                               |    |    |    |    |    |    |    | So Cf
| Note [10]         |    | 10000 | 1007F | Linear B Syllabary                     |    |    |    | X  |    |    |    | Lo
| Note [10]         |    | 10080 | 100FF | Linear B Ideograms                     |    |    |    | X  |    |    |    | Lo
| Note [10]         | No | 10100 | 1013F | Aegean Numbers                         |    |    |    |    |    |    |    | No So Po
| Note [10]         | No | 10140 | 1018F | Ancient Greek Numbers                  |    |    |    |    |    |    |    | Nl No So
| Note [10]         |    | 10300 | 1032F | Old Italic                             |    |    |    | X  |    |    |    | Lo No
| Note [10]         |    | 10330 | 1034F | Gothic                                 |    |    |    | X  |    |    |    | Nl Lo
| Note [10]         |    | 10380 | 1039F | Ugaritic                               |    |    |    | X  |    |    |    | Lo Po
| Note [10]         |    | 103A0 | 103DF | Old Persian                            |    |    |    | X  |    |    |    | Nl Lo Po
| Note | | 10 | 10 | Deser | X | X | | | | | | Lu | | [10] | | 40 | 44 | et | | | | | | | | Ll | | | | 
0 | F | | | | | | | | | | | No, | | 10 | 10 | Shavi | | | | X | | | | Lo | | Note | | 45 | 47 | an | | | | | | | | | | [10] | | 0 | F | | | | | | | | | | | No, | | 10 | 10 | Osman | | | | X | | X | | Lo | | Note | | 48 | 4A | ya | | | | | | | | Nd | | [10] | | 0 | F | | | | | | | | | | | Note | | 10 | 10 | Cypri | | | | X | | | | Lo | | [10] | | 80 | 83 | ot | | | | | | | | | | | | 0 | F | Syll | | | | | | | | | | | | | | abary | | | | | | | | | | Note | | 10 | 10 | Phoen | | | | X | | | | Lo | | [10] | | 90 | 91 | ician | | | | | | | | No | | | | 0 | F | | | | | | | | | Po | Faltstrom Expires April 26, 2007 [Page 19] Internet-Draft Unicode Codepoints October 2006 | Note | | 10 | 10 | Kharo | | | | X | | | X | Lo | | [10] | | A0 | A5 | shthi | | | | | | | | No | | | | 0 | F | | | | | | | | | Mn | | | | | | | | | | | | | | Po | | Note | | 12 | 12 | Cunei | | | | X | | | | Lo | | [10] | | 00 | 3F | form | | | | | | | | | | | | 0 | F | | | | | | | | | | | Note | N | 12 | 12 | Cunei | | | | | | | | Nl | | [10] | o | 40 | 47 | form | | | | | | | | Po | | | | 0 | F | Numb | | | | | | | | | | | | | | ers a | | | | | | | | | | | | | | ndPun | | | | | | | | | | | | | | ctuat | | | | | | | | | | | | | | ion | | | | | | | | | | No | N | 1D | 1D | Byzan | | | | | | | | So | | | o | 00 | 0F | tine | | | | | | | | | | | | 0 | F | Musi | | | | | | | | | | | | | | cal | | | | | | | | | | | | | | Sym | | | | | | | | | | | | | | bols | | | | | | | | | | No | | 1D | 1D | Music | | | | | X | | X | Cf | | | | 10 | 1F | al | | | | | | | | Mn | | | | 0 | F | Symb | | | | | | | | Mc | | | | | | ols | | | | | | | | So | | Note | M | 1D | 1D | Ancie | | | | | | | X | Mn | | [10] | n | 20 | 24 | nt | | | | | | | | So | | | | 0 | F | Gree | | | | | | | | | | | | | | kMusi | | | | | | | | | | | | | | cal | | | | | | | | | | | | | | Not | | | | | | | | | | | | | | ation | | | | | | | | | | No | N | 1D | 1D | Tai | | | | | | | | So | | | o | 30 | 35 | Xuan | | | | | | | | | | | | 0 | F | Jing | 
| | | | | | | | | | | | | Symbo | | | | | | | | | | | | | | ls | | | | | | | | | | Note | N | 1D | 1D | Count | | | | | | | | No | | [10] | o | 36 | 37 | ing | | | | | | | | | | | | 0 | F | Rod | | | | | | | | | | | | | | Nume | | | | | | | | | | | | | | rals | | | | | | | | | Faltstrom Expires April 26, 2007 [Page 20] Internet-Draft Unicode Codepoints October 2006 | No | | 1D | 1D | Mathe | X | X | | | | X | | Lu | | | | 40 | 7F | matic | | | | | | | | Nd | | | | 0 | F | al | | | | | | | | Sm | | | | | | Alp | | | | | | | | Ll | | | | | | hanum | | | | | | | | | | | | | | eric | | | | | | | | | | | | | | S | | | | | | | | | | | | | | ymbol | | | | | | | | | | | | | | s | | | | | | | | | | No | | 20 | 2A | CJK | | | | X | | | | Lo | | | | 00 | 6D | Unifi | | | | | | | | | | | | 0 | F | ed | | | | | | | | | | | | | | Ideo | | | | | | | | | | | | | | graph | | | | | | | | | | | | | | s Ext | | | | | | | | | | | | | | ensio | | | | | | | | | | | | | | n B | | | | | | | | | | No | | 2F | 2F | CJK | | | | X | | | | Lo | | | | 80 | A1 | Compa | | | | | | | | | | | | 0 | F | tibil | | | | | | | | | | | | | | ity | | | | | | | | | | | | | | Ide | | | | | | | | | | | | | | ograp | | | | | | | | | | | | | | hs Su | | | | | | | | | | | | | | pplem | | | | | | | | | | | | | | ent | | | | | | | | | | No | N | E0 | E0 | Tags | | | | | | | | Cf | | | o | 00 | 07 | | | | | | | | | | | | | 0 | F | | | | | | | | | | | No | M | E0 | E0 | Varia | | | | | | | X | Mn | | | n | 10 | 1E | tion | | | | | | | | | | | | 0 | F | Sele | | | | | | | | | | | | | | ctors | | | | | | | | | | | | | | Sup | | | | | | | | | | | | | | pleme | | | | | | | | | | | | | | nt | | | | | | | | | | No | N | F0 | FF | Suppl | | | | | | | | Co | | | o | 00 | FF | ement | | | | | | | | | | | | 0 | F | ary | | | | | | | | | | | | | | Pri | | | | | | | | | | | | | | vate | | | | | | | | | | | | | | UseAr | | | | | | | | | | | | | | ea-A | | | | | | | | | Faltstrom Expires April 26, 2007 [Page 21] Internet-Draft 
   | No   | N | 10 | 10 | Suppl |   |   |   |    |    |    |    | Co   |
   |      | o | 00 | FF | ement |   |   |   |    |    |    |    |      |
   |      |   | 00 | FF | ary   |   |   |   |    |    |    |    |      |
   |      |   |    |    | Pri   |   |   |   |    |    |    |    |      |
   |      |   |    |    | vate  |   |   |   |    |    |    |    |      |
   |      |   |    |    | UseAr |   |   |   |    |    |    |    |      |
   |      |   |    |    | ea-B  |   |   |   |    |    |    |    |      |
   +------+---+----+----+-------+---+---+---+----+----+----+----+------+

2.2.  Problematic classes

2.2.1.  Lu (Input)

   The class "Lu" is for Letter, Uppercase.  As the DNS is case
   insensitive, all uppercase letters are to be case folded to lower
   case.  This implies that codepoints of class "Lu" will never appear
   when a domain name is looked up, in a zone file, or in similar
   contexts.  In order to preserve as much symmetry as possible with
   ASCII domain names and the original DNS rules [RFC1035], codepoints
   of class "Lu" should be allowed in Input (before case folding),
   according to the algorithm described in IDNAbis [idnabis].

2.2.2.  Lt (Input)

   The class "Lt" is for Letter, Titlecase.  As with class "Lu",
   codepoints from this class should be available only in Input.

2.2.3.  Lo (Maybe)

   The class "Lo" is for Letter, Other.  This class includes codepoints
   that in general should be allowed in IDNs.

2.2.4.  Mn (Possibly not)

   The class "Mn" is for Mark, Nonspacing.  This class includes
   codepoints that in general should not be allowed in IDNs, but some
   scripts have codepoints in this class that should be allowed if
   possible, since they are critical for writing the language.

2.2.5.  Mc (Maybe)

   The class "Mc" is for Mark, Spacing Combining.  This class includes
   codepoints that in general should not be allowed in IDNs, but some
   scripts have codepoints in this class that should be carefully
   considered as exceptions.

2.2.6.  Me (Possibly not)

   The class "Me" is for Mark, Enclosing.

2.2.7.  Nd (Maybe)

   The class "Nd" is for Number, Decimal Digit.  This class includes
   codepoints that in general should be allowed in IDNs.

2.2.8.  Nl (Possibly not)

   The class "Nl" is for Number, Letter.  This class includes
   codepoints that in general should not be allowed in IDNs.

2.2.9.  Pd (Exclude)

   The class "Pd" is for Punctuation, Dash.  This class includes
   codepoints that in general should not be allowed in IDNs, but it
   also contains '-' (U+002D HYPHEN-MINUS), which is allowed by the
   basic DNS specifications and cannot now be disallowed.

2.2.10.  Po (Exclude)

   The class "Po" is for Punctuation, Other.  This class includes
   codepoints that in general should not be allowed in IDNs.

3.  Codepoint blocks in Unicode

   The following table identifies the classes of codepoints that exist
   within each block of codepoints in The Unicode Standard.  It also
   notes whether each block is to be included in IDNs.

3.1.  Unicode Blocks

3.2.  Notes

3.2.1.  Note [1]

   A small number of IPA Extensions are used as extended Latin
   characters in some languages, and hence need to be segregated from
   this table and, if possible, made available in IDNs.

3.2.2.  Note [2]

   The Unicode character class Mn (Mark, Nonspacing) does not
   distinguish between elements that are only meaningful in
   right-to-left scripts and elements that are only meaningful in
   left-to-right scripts.  This is not adequate given other issues,
   such as those discussed in [IDNA-bidi].

3.2.3.  Note [3]

   Resolution of Note [2] is needed for adequate treatment.
   Significant cases have been identified in Dhivehi and Yiddish
   [IDNA-bidi], and more are believed to exist in other languages
   written with right-to-left scripts, in which the current version of
   Stringprep rejects legitimate constructs.  A major philosophical
   conflict arises here between combining and precomposed characters.

3.2.4.  Note [4]

   Shaping issues require special attention for correct rendering and
   interpretation of characters.
   Precomposition is an issue, as are some presentation forms that are
   defined in Unicode.

3.2.5.  Note [5]

   These characters raise the issues noted in Note [4] and some
   additional issues with combining characters.

3.2.6.  Note [6]

   Verticality has not been fully considered in the DNS context.

3.2.7.  Note [7]

   Subject of significant controversy at present.  This document
   recommends against adding these characters to the DNS list until
   the controversy is resolved.

3.2.8.  Note [8]

   Requires advice of expert community.  It is not clear whether some
   issues with Unicode definitions for these characters and the
   associated script have been fully resolved.

3.2.9.  Note [9]

   Important language community, but characters have significant
   potential for confusion.

3.2.10.  Note [10]

   Script not believed to be in current use, even by research
   communities relevant to IDN.

3.2.11.  Note [11]

   Requires more information and advice from expert community.

3.2.12.  Note [12]

   CJK (and related) radicals, compatibility, composing, and Chinese
   phonetic characters are believed to be inappropriate on the basis of
   JET and CDNC work (see RFC 4713 [RFC4713]).

4.  Individual codepoints in Unicode

   If, instead, we examine the individual Unicode codepoints between
   U+0000 and U+0FFF, retaining the decisions made based on the classes
   above, we get the following result.

4.1.  0000-007F Basic Latin

   +----------+--------+--------+-------+------------------------+
   | Include? | Code   | NFKC   | Class | Name                   |
   +----------+--------+--------+-------+------------------------+
   | Exclude  | U+0000 | U+0000 | Cc    |                        |
   | Exclude  | U+0001 | U+0001 | Cc    |                        |
   | Exclude  | U+0002 | U+0002 | Cc    |                        |
   | Exclude  | U+0003 | U+0003 | Cc    |                        |
   | Exclude  | U+0004 | U+0004 | Cc    |                        |
   | Exclude  | U+0005 | U+0005 | Cc    |                        |
   | Exclude  | U+0006 | U+0006 | Cc    |                        |
   | Exclude  | U+0007 | U+0007 | Cc    |                        |
   | Exclude  | U+0008 | U+0008 | Cc    |                        |
   | Exclude  | U+0009 | U+0009 | Cc    |                        |
   | Exclude  | U+000A | U+000A | Cc    |                        |
   | Exclude  | U+000B | U+000B | Cc    |                        |
   | Exclude  | U+000C | U+000C | Cc    |                        |
   | Exclude  | U+000D | U+000D | Cc    |                        |
   | Exclude  | U+000E | U+000E | Cc    |                        |
   | Exclude  | U+000F | U+000F | Cc    |                        |
   | Exclude  | U+0010 | U+0010 | Cc    |                        |
   | Exclude  | U+0011 | U+0011 | Cc    |                        |
   | Exclude  | U+0012 | U+0012 | Cc    |                        |
   | Exclude  | U+0013 | U+0013 | Cc    |                        |
   | Exclude  | U+0014 | U+0014 | Cc    |                        |
   | Exclude  | U+0015 | U+0015 | Cc    |                        |
   | Exclude  | U+0016 | U+0016 | Cc    |                        |
   | Exclude  | U+0017 | U+0017 | Cc    |                        |
   | Exclude  | U+0018 | U+0018 | Cc    |                        |
   | Exclude  | U+0019 | U+0019 | Cc    |                        |
   | Exclude  | U+001A | U+001A | Cc    |                        |
   | Exclude  | U+001B | U+001B | Cc    |                        |
   | Exclude  | U+001C | U+001C | Cc    |                        |
   | Exclude  | U+001D | U+001D | Cc    |                        |
   | Exclude  | U+001E | U+001E | Cc    |                        |
   | Exclude  | U+001F | U+001F | Cc    |                        |
   | Exclude  | U+0020 | U+0020 | Zs    | SPACE                  |
   | Exclude  | U+0021 | U+0021 | Po    | EXCLAMATION MARK       |
   | Exclude  | U+0022 | U+0022 | Po    | QUOTATION MARK         |
   | Exclude  | U+0023 | U+0023 | Po    | NUMBER SIGN            |
   | Exclude  | U+0024 | U+0024 | Sc    | DOLLAR SIGN            |
   | Exclude  | U+0025 | U+0025 | Po    | PERCENT SIGN           |
   | Exclude  | U+0026 | U+0026 | Po    | AMPERSAND              |
   | Exclude  | U+0027 | U+0027 | Po    | APOSTROPHE             |
   | Exclude  | U+0028 | U+0028 | Ps    | LEFT PARENTHESIS       |
   | Exclude  | U+0029 | U+0029 | Pe    | RIGHT PARENTHESIS      |
   | Exclude  | U+002A | U+002A | Po    | ASTERISK               |
   | Exclude  | U+002B | U+002B | Sm    | PLUS SIGN              |
   | Exclude  | U+002C | U+002C | Po    | COMMA                  |
   | Exclude  | U+002D | U+002D | Pd    | HYPHEN-MINUS           |
   | Exclude  | U+002E | U+002E | Po    | FULL STOP              |
   | Exclude  | U+002F | U+002F | Po    | SOLIDUS                |
   | Maybe    | U+0030 | U+0030 | Nd    | DIGIT ZERO             |
   | Maybe    | U+0031 | U+0031 | Nd    | DIGIT ONE              |
   | Maybe    | U+0032 | U+0032 | Nd    | DIGIT TWO              |
   | Maybe    | U+0033 | U+0033 | Nd    | DIGIT THREE            |
   | Maybe    | U+0034 | U+0034 | Nd    | DIGIT FOUR             |
   | Maybe    | U+0035 | U+0035 | Nd    | DIGIT FIVE             |
   | Maybe    | U+0036 | U+0036 | Nd    | DIGIT SIX              |
   | Maybe    | U+0037 | U+0037 | Nd    | DIGIT SEVEN            |
   | Maybe    | U+0038 | U+0038 | Nd    | DIGIT EIGHT            |
   | Maybe    | U+0039 | U+0039 | Nd    | DIGIT NINE             |
   | Exclude  | U+003A | U+003A | Po    | COLON                  |
   | Exclude  | U+003B | U+003B | Po    | SEMICOLON              |
   | Exclude  | U+003C | U+003C | Sm    | LESS-THAN SIGN         |
   | Exclude  | U+003D | U+003D | Sm    | EQUALS SIGN            |
   | Exclude  | U+003E | U+003E | Sm    | GREATER-THAN SIGN      |
   | Exclude  | U+003F | U+003F | Po    | QUESTION MARK          |
   | Exclude  | U+0040 | U+0040 | Po    | COMMERCIAL AT          |
   | Input    | U+0041 | U+0041 | Lu    | LATIN CAPITAL LETTER A |
   | Input    | U+0042 | U+0042 | Lu    | LATIN CAPITAL LETTER B |
   | Input    | U+0043 | U+0043 | Lu    | LATIN CAPITAL LETTER C |
   | Input    | U+0044 | U+0044 | Lu    | LATIN CAPITAL LETTER D |
   | Input    | U+0045 | U+0045 | Lu    | LATIN CAPITAL LETTER E |
   | Input    | U+0046 | U+0046 | Lu    | LATIN CAPITAL LETTER F |
   | Input    | U+0047 | U+0047 | Lu    | LATIN CAPITAL LETTER G |
   | Input    | U+0048 | U+0048 | Lu    | LATIN CAPITAL LETTER H |
   | Input    | U+0049 | U+0049 | Lu    | LATIN CAPITAL LETTER I |
   | Input    | U+004A | U+004A | Lu    | LATIN CAPITAL LETTER J |
   | Input    | U+004B | U+004B | Lu    | LATIN CAPITAL LETTER K |
   | Input    | U+004C | U+004C | Lu    | LATIN CAPITAL LETTER L |
   | Input    | U+004D | U+004D | Lu    | LATIN CAPITAL LETTER M |
   | Input    | U+004E | U+004E | Lu    | LATIN CAPITAL LETTER N |
   | Input    | U+004F | U+004F | Lu    | LATIN CAPITAL LETTER O |
   | Input    | U+0050 | U+0050 | Lu    | LATIN CAPITAL LETTER P |
   | Input    | U+0051 | U+0051 | Lu    | LATIN CAPITAL LETTER Q |
   | Input    | U+0052 | U+0052 | Lu    | LATIN CAPITAL LETTER R |
   | Input    | U+0053 | U+0053 | Lu    | LATIN CAPITAL LETTER S |
   | Input    | U+0054 | U+0054 | Lu    | LATIN CAPITAL LETTER T |
   | Input    | U+0055 | U+0055 | Lu    | LATIN CAPITAL LETTER U |
   | Input    | U+0056 | U+0056 | Lu    | LATIN CAPITAL LETTER V |
   | Input    | U+0057 | U+0057 | Lu    | LATIN CAPITAL LETTER W |
   | Input    | U+0058 | U+0058 | Lu    | LATIN CAPITAL LETTER X |
   | Input    | U+0059 | U+0059 | Lu    | LATIN CAPITAL LETTER Y |
   | Input    | U+005A | U+005A | Lu    | LATIN CAPITAL LETTER Z |
   | Exclude  | U+005B | U+005B | Ps    | LEFT SQUARE BRACKET    |
   | Exclude  | U+005C | U+005C | Po    | REVERSE SOLIDUS        |
   | Exclude  | U+005D | U+005D | Pe    | RIGHT SQUARE BRACKET   |
   |          | U+005E | U+005E |       | CIRCUMFLEX ACCENT      |
   | Exclude  | U+005F | U+005F | Pc    | LOW LINE               |
   |          | U+0060 | U+0060 |       | GRAVE ACCENT           |
   | Include  | U+0061 | U+0061 | Ll    | LATIN SMALL LETTER A   |
   | Include  | U+0062 | U+0062 | Ll    | LATIN SMALL LETTER B   |
   | Include  | U+0063 | U+0063 | Ll    | LATIN SMALL LETTER C   |
   | Include  | U+0064 | U+0064 | Ll    | LATIN SMALL LETTER D   |
   | Include  | U+0065 | U+0065 | Ll    | LATIN SMALL LETTER E   |
   | Include  | U+0066 | U+0066 | Ll    | LATIN SMALL LETTER F   |
   | Include  | U+0067 | U+0067 | Ll    | LATIN SMALL LETTER G   |
   | Include  | U+0068 | U+0068 | Ll    | LATIN SMALL LETTER H   |
   | Include  | U+0069 | U+0069 | Ll    | LATIN SMALL LETTER I   |
   | Include  | U+006A | U+006A | Ll    | LATIN SMALL LETTER J   |
   | Include  | U+006B | U+006B | Ll    | LATIN SMALL LETTER K   |
   | Include  | U+006C | U+006C | Ll    | LATIN SMALL LETTER L   |
   | Include  | U+006D | U+006D | Ll    | LATIN SMALL LETTER M   |
   | Include  | U+006E | U+006E | Ll    | LATIN SMALL LETTER N   |
   | Include  | U+006F | U+006F | Ll    | LATIN SMALL LETTER O   |
   | Include  | U+0070 | U+0070 | Ll    | LATIN SMALL LETTER P   |
   | Include  | U+0071 | U+0071 | Ll    | LATIN SMALL LETTER Q   |
   | Include  | U+0072 | U+0072 | Ll    | LATIN SMALL LETTER R   |
   | Include  | U+0073 | U+0073 | Ll    | LATIN SMALL LETTER S   |
   | Include  | U+0074 | U+0074 | Ll    | LATIN SMALL LETTER T   |
   | Include  | U+0075 | U+0075 | Ll    | LATIN SMALL LETTER U   |
   | Include  | U+0076 | U+0076 | Ll    | LATIN SMALL LETTER V   |
   | Include  | U+0077 | U+0077 | Ll    | LATIN SMALL LETTER W   |
   | Include  | U+0078 | U+0078 | Ll    | LATIN SMALL LETTER X   |
   | Include  | U+0079 | U+0079 | Ll    | LATIN SMALL LETTER Y   |
   | Include  | U+007A | U+007A | Ll    | LATIN SMALL LETTER Z   |
   | Exclude  | U+007B | U+007B | Ps    | LEFT CURLY BRACKET     |
   | Exclude  | U+007C | U+007C | Sm    | VERTICAL LINE          |
   | Exclude  | U+007D | U+007D | Pe    | RIGHT CURLY BRACKET    |
   | Exclude  | U+007E | U+007E | Sm    | TILDE                  |
   | Exclude  | U+007F | U+007F | Cc    |                        |
   +----------+--------+--------+-------+------------------------+

4.2.  0080-00FF Latin-1 Supplement

   +----------+--------+--------+-------+------------------------------+
   | Include? | Code   | NFKC   | Class | Name                         |
   +----------+--------+--------+-------+------------------------------+
   | Exclude  | U+0080 | U+0080 | Cc    |                              |
   | Exclude  | U+0081 | U+0081 | Cc    |                              |
   | Exclude  | U+0082 | U+0082 | Cc    |                              |
   | Exclude  | U+0083 | U+0083 | Cc    |                              |
   | Exclude  | U+0084 | U+0084 | Cc    |                              |
   | Exclude  | U+0085 | U+0085 | Cc    |                              |
   | Exclude  | U+0086 | U+0086 | Cc    |                              |
   | Exclude  | U+0087 | U+0087 | Cc    |                              |
   | Exclude  | U+0088 | U+0088 | Cc    |                              |
   | Exclude  | U+0089 | U+0089 | Cc    |                              |
   | Exclude  | U+008A | U+008A | Cc    |                              |
   | Exclude  | U+008B | U+008B | Cc    |                              |
   | Exclude  | U+008C | U+008C | Cc    |                              |
   | Exclude  | U+008D | U+008D | Cc    |                              |
   | Exclude  | U+008E | U+008E | Cc    |                              |
   | Exclude  | U+008F | U+008F | Cc    |                              |
   | Exclude  | U+0090 | U+0090 | Cc    |                              |
   | Exclude  | U+0091 | U+0091 | Cc    |                              |
   | Exclude  | U+0092 | U+0092 | Cc    |                              |
   | Exclude  | U+0093 | U+0093 | Cc    |                              |
   | Exclude  | U+0094 | U+0094 | Cc    |                              |
   | Exclude  | U+0095 | U+0095 | Cc    |                              |
   | Exclude  | U+0096 | U+0096 | Cc    |                              |
   | Exclude  | U+0097 | U+0097 | Cc    |                              |
   | Exclude  | U+0098 | U+0098 | Cc    |                              |
   | Exclude  | U+0099 | U+0099 | Cc    |                              |
   | Exclude  | U+009A | U+009A | Cc    |                              |
   | Exclude  | U+009B | U+009B | Cc    |                              |
   | Exclude  | U+009C | U+009C | Cc    |                              |
   | Exclude  | U+009D | U+009D | Cc    |                              |
   | Exclude  | U+009E | U+009E | Cc    |                              |
   | Exclude  | U+009F | U+009F | Cc    |                              |
   | Exclude  | U+00A0 | U+0020 | Zs    | SPACE                        |
   | Exclude  | U+00A1 | U+00A1 | Po    | INVERTED EXCLAMATION MARK    |
   | Exclude  | U+00A2 | U+00A2 | Sc    | CENT SIGN                    |
   | Exclude  | U+00A3 | U+00A3 | Sc    | POUND SIGN                   |
   | Exclude  | U+00A4 | U+00A4 | Sc    | CURRENCY SIGN                |
   | Exclude  | U+00A5 | U+00A5 | Sc    | YEN SIGN                     |
   | Exclude  | U+00A6 | U+00A6 | So    | BROKEN BAR                   |
   | Exclude  | U+00A7 | U+00A7 | So    | SECTION SIGN                 |
   | Exclude  | U+00A8 | U+0020 | Mn Zs | SPACE                        |
   | Exclude  | U+00A9 | U+00A9 | So    | COPYRIGHT SIGN               |
   | Include  | U+00AA | U+0061 | Ll    | LATIN SMALL LETTER A         |
   | Exclude  | U+00AB | U+00AB | Pi    | LEFT-POINTING DOUBLE ANGLE   |
   |          |        |        |       | QUOTATION MARK               |
   | Exclude  | U+00AC | U+00AC | Sm    | NOT SIGN                     |
   | Exclude  | U+00AD | U+00AD | Cf    | SOFT HYPHEN                  |
   | Exclude  | U+00AE | U+00AE | So    | REGISTERED SIGN              |
   | Exclude  | U+00AF | U+0020 | Mn Zs | SPACE                        |
   | Exclude  | U+00B0 | U+00B0 | So    | DEGREE SIGN                  |
   | Exclude  | U+00B1 | U+00B1 | Sm    | PLUS-MINUS SIGN              |
   | Maybe    | U+00B2 | U+0032 | Nd    | DIGIT TWO                    |
   | Maybe    | U+00B3 | U+0033 | Nd    | DIGIT THREE                  |
   | Exclude  | U+00B4 | U+0020 | Mn Zs | SPACE                        |
   | Include  | U+00B5 | U+03BC | Ll    | GREEK SMALL LETTER MU        |
   | Exclude  | U+00B6 | U+00B6 | So    | PILCROW SIGN                 |
   | Exclude  | U+00B7 | U+00B7 | Po    | MIDDLE DOT                   |
   | Exclude  | U+00B8 | U+0020 | Mn Zs | SPACE                        |
   | Maybe    | U+00B9 | U+0031 | Nd    | DIGIT ONE                    |
   | Include  | U+00BA | U+006F | Ll    | LATIN SMALL LETTER O         |
   | Exclude  | U+00BB | U+00BB | Pf    | RIGHT-POINTING DOUBLE ANGLE  |
   |          |        |        |       | QUOTATION MARK               |
   | Exclude  | U+00BC | U+0031 | Nd Sm | DIGIT ONE                    |
   | Exclude  | U+00BD | U+0031 | Nd Sm | DIGIT ONE                    |
   | Exclude  | U+00BE | U+0033 | Nd Sm | DIGIT THREE                  |
   | Exclude  | U+00BF | U+00BF | Po    | INVERTED QUESTION MARK       |
   | Input    | U+00C0 | U+00C0 | Lu    | LATIN CAPITAL LETTER A WITH  |
   |          |        |        |       | GRAVE                        |
   | Input    | U+00C1 | U+00C1 | Lu    | LATIN CAPITAL LETTER A WITH  |
   |          |        |        |       | ACUTE                        |
   | Input    | U+00C2 | U+00C2 | Lu    | LATIN CAPITAL LETTER A WITH  |
   |          |        |        |       | CIRCUMFLEX                   |
   | Input    | U+00C3 | U+00C3 | Lu    | LATIN CAPITAL LETTER A WITH  |
   |          |        |        |       | TILDE                        |
   | Input    | U+00C4 | U+00C4 | Lu    | LATIN CAPITAL LETTER A WITH  |
   |          |        |        |       | DIAERESIS                    |
   | Input    | U+00C5 | U+00C5 | Lu    | LATIN CAPITAL LETTER A WITH  |
   |          |        |        |       | RING ABOVE                   |
   | Input    | U+00C6 | U+00C6 | Lu    | LATIN CAPITAL LETTER AE      |
   | Input    | U+00C7 | U+00C7 | Lu    | LATIN CAPITAL LETTER C WITH  |
   |          |        |        |       | CEDILLA                      |
   | Input    | U+00C8 | U+00C8 | Lu    | LATIN CAPITAL LETTER E WITH  |
   |          |        |        |       | GRAVE                        |
   | Input    | U+00C9 | U+00C9 | Lu    | LATIN CAPITAL LETTER E WITH  |
   |          |        |        |       | ACUTE                        |
   | Input    | U+00CA | U+00CA | Lu    | LATIN CAPITAL LETTER E WITH  |
   |          |        |        |       | CIRCUMFLEX                   |
   | Input    | U+00CB | U+00CB | Lu    | LATIN CAPITAL LETTER E WITH  |
   |          |        |        |       | DIAERESIS                    |
   | Input    | U+00CC | U+00CC | Lu    | LATIN CAPITAL LETTER I WITH  |
   |          |        |        |       | GRAVE                        |
   | Input    | U+00CD | U+00CD | Lu    | LATIN CAPITAL LETTER I WITH  |
   |          |        |        |       | ACUTE                        |
   | Input    | U+00CE | U+00CE | Lu    | LATIN CAPITAL LETTER I WITH  |
   |          |        |        |       | CIRCUMFLEX                   |
   | Input    | U+00CF | U+00CF | Lu    | LATIN CAPITAL LETTER I WITH  |
   |          |        |        |       | DIAERESIS                    |
   | Input    | U+00D0 | U+00D0 | Lu    | LATIN CAPITAL LETTER ETH     |
   | Input    | U+00D1 | U+00D1 | Lu    | LATIN CAPITAL LETTER N WITH  |
   |          |        |        |       | TILDE                        |
   | Input    | U+00D2 | U+00D2 | Lu    | LATIN CAPITAL LETTER O WITH  |
   |          |        |        |       | GRAVE                        |
   | Input    | U+00D3 | U+00D3 | Lu    | LATIN CAPITAL LETTER O WITH  |
   |          |        |        |       | ACUTE                        |
   | Input    | U+00D4 | U+00D4 | Lu    | LATIN CAPITAL LETTER O WITH  |
   |          |        |        |       | CIRCUMFLEX                   |
   | Input    | U+00D5 | U+00D5 | Lu    | LATIN CAPITAL LETTER O WITH  |
   |          |        |        |       | TILDE                        |
   | Input    | U+00D6 | U+00D6 | Lu    | LATIN CAPITAL LETTER O WITH  |
   |          |        |        |       | DIAERESIS                    |
   | Exclude  | U+00D7 | U+00D7 | Sm    | MULTIPLICATION SIGN          |
   | Input    | U+00D8 | U+00D8 | Lu    | LATIN CAPITAL LETTER O WITH  |
   |          |        |        |       | STROKE                       |
   | Input    | U+00D9 | U+00D9 | Lu    | LATIN CAPITAL LETTER U WITH  |
   |          |        |        |       | GRAVE                        |
   | Input    | U+00DA | U+00DA | Lu    | LATIN CAPITAL LETTER U WITH  |
   |          |        |        |       | ACUTE                        |
   | Input    | U+00DB | U+00DB | Lu    | LATIN CAPITAL LETTER U WITH  |
   |          |        |        |       | CIRCUMFLEX                   |
   | Input    | U+00DC | U+00DC | Lu    | LATIN CAPITAL LETTER U WITH  |
   |          |        |        |       | DIAERESIS                    |
   | Input    | U+00DD | U+00DD | Lu    | LATIN CAPITAL LETTER Y WITH  |
   |          |        |        |       | ACUTE                        |
   | Input    | U+00DE | U+00DE | Lu    | LATIN CAPITAL LETTER THORN   |
   | Include  | U+00DF | U+00DF | Ll    | LATIN SMALL LETTER SHARP S   |
   | Include  | U+00E0 | U+00E0 | Ll    | LATIN SMALL LETTER A WITH    |
   |          |        |        |       | GRAVE                        |
   | Include  | U+00E1 | U+00E1 | Ll    | LATIN SMALL LETTER A WITH    |
   |          |        |        |       | ACUTE                        |
   | Include  | U+00E2 | U+00E2 | Ll    | LATIN SMALL LETTER A WITH    |
   |          |        |        |       | CIRCUMFLEX                   |
   | Include  | U+00E3 | U+00E3 | Ll    | LATIN SMALL LETTER A WITH    |
   |          |        |        |       | TILDE                        |
   | Include  | U+00E4 | U+00E4 | Ll    | LATIN SMALL LETTER A WITH    |
   |          |        |        |       | DIAERESIS                    |
   | Include  | U+00E5 | U+00E5 | Ll    | LATIN SMALL LETTER A WITH    |
   |          |        |        |       | RING ABOVE                   |
   | Include  | U+00E6 | U+00E6 | Ll    | LATIN SMALL LETTER AE        |
   | Include  | U+00E7 | U+00E7 | Ll    | LATIN SMALL LETTER C WITH    |
   |          |        |        |       | CEDILLA                      |
   | Include  | U+00E8 | U+00E8 | Ll    | LATIN SMALL LETTER E WITH    |
   |          |        |        |       | GRAVE                        |
   | Include  | U+00E9 | U+00E9 | Ll    | LATIN SMALL LETTER E WITH    |
   |          |        |        |       | ACUTE                        |
   | Include  | U+00EA | U+00EA | Ll    | LATIN SMALL LETTER E WITH    |
   |          |        |        |       | CIRCUMFLEX                   |
   | Include  | U+00EB | U+00EB | Ll    | LATIN SMALL LETTER E WITH    |
   |          |        |        |       | DIAERESIS                    |
   | Include  | U+00EC | U+00EC | Ll    | LATIN SMALL LETTER I WITH    |
   |          |        |        |       | GRAVE                        |
   | Include  | U+00ED | U+00ED | Ll    | LATIN SMALL LETTER I WITH    |
   |          |        |        |       | ACUTE                        |
   | Include  | U+00EE | U+00EE | Ll    | LATIN SMALL LETTER I WITH    |
   |          |        |        |       | CIRCUMFLEX                   |
   | Include  | U+00EF | U+00EF | Ll    | LATIN SMALL LETTER I WITH    |
   |          |        |        |       | DIAERESIS                    |
   | Include  | U+00F0 | U+00F0 | Ll    | LATIN SMALL LETTER ETH       |
   | Include  | U+00F1 | U+00F1 | Ll    | LATIN SMALL LETTER N WITH    |
   |          |        |        |       | TILDE                        |
   | Include  | U+00F2 | U+00F2 | Ll    | LATIN SMALL LETTER O WITH    |
   |          |        |        |       | GRAVE                        |
   | Include  | U+00F3 | U+00F3 | Ll    | LATIN SMALL LETTER O WITH    |
   |          |        |        |       | ACUTE                        |
   | Include  | U+00F4 | U+00F4 | Ll    | LATIN SMALL LETTER O WITH    |
   |          |        |        |       | CIRCUMFLEX                   |
   | Include  | U+00F5 | U+00F5 | Ll    | LATIN SMALL LETTER O WITH    |
   |          |        |        |       | TILDE                        |
   | Include  | U+00F6 | U+00F6 | Ll    | LATIN SMALL LETTER O WITH    |
   |          |        |        |       | DIAERESIS                    |
   | Exclude  | U+00F7 | U+00F7 | Sm    | DIVISION SIGN                |
   | Include  | U+00F8 | U+00F8 | Ll    | LATIN SMALL LETTER O WITH    |
   |          |        |        |       | STROKE                       |
   | Include  | U+00F9 | U+00F9 | Ll    | LATIN SMALL LETTER U WITH    |
   |          |        |        |       | GRAVE                        |
   | Include  | U+00FA | U+00FA | Ll    | LATIN SMALL LETTER U WITH    |
   |          |        |        |       | ACUTE                        |
   | Include  | U+00FB | U+00FB | Ll    | LATIN SMALL LETTER U WITH    |
   |          |        |        |       | CIRCUMFLEX                   |
   | Include  | U+00FC | U+00FC | Ll    | LATIN SMALL LETTER U WITH    |
   |          |        |        |       | DIAERESIS                    |
   | Include  | U+00FD | U+00FD | Ll    | LATIN SMALL LETTER Y WITH    |
   |          |        |        |       | ACUTE                        |
   | Include  | U+00FE | U+00FE | Ll    | LATIN SMALL LETTER THORN     |
   | Include  | U+00FF | U+00FF | Ll    | LATIN SMALL LETTER Y WITH    |
   |          |        |        |       | DIAERESIS                    |
   +----------+--------+--------+-------+------------------------------+

4.3.  0100-017F Latin Extended-A

   +----------+--------+--------+-------+------------------------------+
   | Include? | Code   | NFKC   | Class | Name                         |
   +----------+--------+--------+-------+------------------------------+
   | Input    | U+0100 | U+0100 | Lu    | LATIN CAPITAL LETTER A WITH  |
   |          |        |        |       | MACRON                       |
   | Include  | U+0101 | U+0101 | Ll    | LATIN SMALL LETTER A WITH    |
   |          |        |        |       | MACRON                       |
   | Input    | U+0102 | U+0102 | Lu    | LATIN CAPITAL LETTER A WITH  |
   |          |        |        |       | BREVE                        |
   | Include  | U+0103 | U+0103 | Ll    | LATIN SMALL LETTER A WITH    |
   |          |        |        |       | BREVE                        |
   | Input    | U+0104 | U+0104 | Lu    | LATIN CAPITAL LETTER A WITH  |
   |          |        |        |       | OGONEK                       |
   | Include  | U+0105 | U+0105 | Ll    | LATIN SMALL LETTER A WITH    |
   |          |        |        |       | OGONEK                       |
   | Input    | U+0106 | U+0106 | Lu    | LATIN CAPITAL LETTER C WITH  |
   |          |        |        |       | ACUTE                        |
   | Include  | U+0107 | U+0107 | Ll    | LATIN SMALL LETTER C WITH    |
   |          |        |        |       | ACUTE                        |
   | Input    | U+0108 | U+0108 | Lu    | LATIN CAPITAL LETTER C WITH  |
   |          |        |        |       | CIRCUMFLEX                   |
   | Include  | U+0109 | U+0109 | Ll    | LATIN SMALL LETTER C WITH    |
   |          |        |        |       | CIRCUMFLEX                   |
   | Input    | U+010A | U+010A | Lu    | LATIN CAPITAL LETTER C WITH  |
   |          |        |        |       | DOT ABOVE                    |
   | Include  | U+010B | U+010B | Ll    | LATIN SMALL LETTER C WITH    |
   |          |        |        |       | DOT ABOVE                    |
   | Input    | U+010C | U+010C | Lu    | LATIN CAPITAL LETTER C WITH  |
   |          |        |        |       | CARON                        |
   | Include  | U+010D | U+010D | Ll    | LATIN SMALL LETTER C WITH    |
   |          |        |        |       | CARON                        |
   | Input    | U+010E | U+010E | Lu    | LATIN CAPITAL LETTER D WITH  |
   |          |        |        |       | CARON                        |
   | Include  | U+010F | U+010F | Ll    | LATIN SMALL LETTER D WITH    |
   |          |        |        |       | CARON                        |
   | Input    | U+0110 | U+0110 | Lu    | LATIN CAPITAL LETTER D WITH  |
   |          |        |        |       | STROKE                       |
   | Include  | U+0111 | U+0111 | Ll    | LATIN SMALL LETTER D WITH    |
   |          |        |        |       | STROKE                       |
   | Input    | U+0112 | U+0112 | Lu    | LATIN CAPITAL LETTER E WITH  |
   |          |        |        |       | MACRON                       |
   | Include  | U+0113 | U+0113 | Ll    | LATIN SMALL LETTER E WITH    |
   |          |        |        |       | MACRON                       |
   | Input    | U+0114 | U+0114 | Lu    | LATIN CAPITAL LETTER E WITH  |
   |          |        |        |       | BREVE                        |
   | Include  | U+0115 | U+0115 | Ll    | LATIN SMALL LETTER E WITH    |
   |          |        |        |       | BREVE                        |
   | Input    | U+0116 | U+0116 | Lu    | LATIN CAPITAL LETTER E WITH  |
   |          |        |        |       | DOT ABOVE                    |
   | Include  | U+0117 | U+0117 | Ll    | LATIN SMALL LETTER E WITH    |
   |          |        |        |       | DOT ABOVE                    |
   | Input    | U+0118 | U+0118 | Lu    | LATIN CAPITAL LETTER E WITH  |
   |          |        |        |       | OGONEK                       |
   | Include  | U+0119 | U+0119 | Ll    | LATIN SMALL LETTER E WITH    |
   |          |        |        |       | OGONEK                       |
   | Input    | U+011A | U+011A | Lu    | LATIN CAPITAL LETTER E WITH  |
   |          |        |        |       | CARON                        |
   | Include  | U+011B | U+011B | Ll    | LATIN SMALL LETTER E WITH    |
   |          |        |        |       | CARON                        |
   | Input    | U+011C | U+011C | Lu    | LATIN CAPITAL LETTER G WITH  |
   |          |        |        |       | CIRCUMFLEX                   |
   | Include  | U+011D | U+011D | Ll    | LATIN SMALL LETTER G WITH    |
   |          |        |        |       | CIRCUMFLEX                   |
   | Input    | U+011E | U+011E | Lu    | LATIN CAPITAL LETTER G WITH  |
   |          |        |        |       | BREVE                        |
   | Include  | U+011F | U+011F | Ll    | LATIN SMALL LETTER G WITH    |
   |          |        |        |       | BREVE                        |
   | Input    | U+0120 | U+0120 | Lu    | LATIN CAPITAL LETTER G WITH  |
   |          |        |        |       | DOT ABOVE                    |
   | Include  | U+0121 | U+0121 | Ll    | LATIN SMALL LETTER G WITH    |
   |          |        |        |       | DOT ABOVE                    |
   | Input    | U+0122 | U+0122 | Lu    | LATIN CAPITAL LETTER G WITH  |
   |          |        |        |       | CEDILLA                      |
   | Include  | U+0123 | U+0123 | Ll    | LATIN SMALL LETTER G WITH    |
   |          |        |        |       | CEDILLA                      |
   | Input    | U+0124 | U+0124 | Lu    | LATIN CAPITAL LETTER H WITH  |
   |          |        |        |       | CIRCUMFLEX                   |
   | Include  | U+0125 | U+0125 | Ll    | LATIN SMALL LETTER H WITH    |
   |          |        |        |       | CIRCUMFLEX                   |
   | Input    | U+0126 | U+0126 | Lu    | LATIN CAPITAL LETTER H WITH  |
   |          |        |        |       | STROKE                       |
   | Include  | U+0127 | U+0127 | Ll    | LATIN SMALL LETTER H WITH    |
   |          |        |        |       | STROKE                       |
   | Input    | U+0128 | U+0128 | Lu    | LATIN CAPITAL LETTER I WITH  |
   |          |        |        |       | TILDE                        |
   | Include  | U+0129 | U+0129 | Ll    | LATIN SMALL LETTER I WITH    |
   |          |        |        |       | TILDE                        |
   | Input    | U+012A | U+012A | Lu    | LATIN CAPITAL LETTER I WITH  |
   |          |        |        |       | MACRON                       |
   | Include  | U+012B | U+012B | Ll    | LATIN SMALL LETTER I WITH    |
   |          |        |        |       | MACRON                       |
   | Input    | U+012C | U+012C | Lu    | LATIN CAPITAL LETTER I WITH  |
   |          |        |        |       | BREVE                        |
   | Include  | U+012D | U+012D | Ll    | LATIN SMALL LETTER I WITH    |
   |          |        |        |       | BREVE                        |
   | Input    | U+012E | U+012E | Lu    | LATIN CAPITAL LETTER I WITH  |
   |          |        |        |       | OGONEK                       |
   | Include  | U+012F | U+012F | Ll    | LATIN SMALL LETTER I WITH    |
   |          |        |        |       | OGONEK                       |
   | Input    | U+0130 | U+0130 | Lu    | LATIN CAPITAL LETTER I WITH  |
   |          |        |        |       | DOT ABOVE                    |
   | Include  | U+0131 | U+0131 | Ll    | LATIN SMALL LETTER DOTLESS I |
   | Input    | U+0132 | U+0049 | Lu    | LATIN CAPITAL LETTER I       |
   | Include  | U+0133 | U+0069 | Ll    | LATIN SMALL LETTER I         |
   | Input    | U+0134 | U+0134 | Lu    | LATIN CAPITAL LETTER J WITH  |
   |          |        |        |       | CIRCUMFLEX                   |
   | Include  | U+0135 | U+0135 | Ll    | LATIN SMALL LETTER J WITH    |
   |          |        |        |       | CIRCUMFLEX                   |
   | Input    | U+0136 | U+0136 | Lu    | LATIN CAPITAL LETTER K WITH  |
   |          |        |        |       | CEDILLA                      |
   | Include  | U+0137 | U+0137 | Ll    | LATIN SMALL LETTER K WITH    |
   |          |        |        |       | CEDILLA                      |
   | Include  | U+0138 | U+0138 | Ll    | LATIN SMALL LETTER KRA       |
   | Input    | U+0139 | U+0139 | Lu    | LATIN CAPITAL LETTER L WITH  |
   |          |        |        |       | ACUTE                        |
   | Include  | U+013A | U+013A | Ll    | LATIN SMALL LETTER L WITH    |
   |          |        |        |       | ACUTE                        |
   | Input    | U+013B | U+013B | Lu    | LATIN CAPITAL LETTER L WITH  |
   |          |        |        |       | CEDILLA                      |
   | Include  | U+013C | U+013C | Ll    | LATIN SMALL LETTER L WITH    |
   |          |        |        |       | CEDILLA                      |
   | Input    | U+013D | U+013D | Lu    | LATIN CAPITAL LETTER L WITH  |
   |          |        |        |       | CARON                        |
   | Include  | U+013E | U+013E | Ll    | LATIN SMALL LETTER L WITH    |
   |          |        |        |       | CARON                        |
   | Exclude  | U+013F | U+004C | Lu Po | LATIN CAPITAL LETTER L       |
   | Exclude  | U+0140 | U+006C | Ll Po | LATIN SMALL LETTER L         |
   | Input    | U+0141 | U+0141 | Lu    | LATIN CAPITAL LETTER L WITH  |
   |          |        |        |       | STROKE                       |
   | Include  | U+0142 | U+0142 | Ll    | LATIN SMALL LETTER L WITH    |
   |          |        |        |       | STROKE                       |
   | Input    | U+0143 | U+0143 | Lu    | LATIN CAPITAL LETTER N WITH  |
   |          |        |        |       | ACUTE                        |
   | Include  | U+0144 | U+0144 | Ll    | LATIN SMALL LETTER N WITH    |
   |          |        |        |       | ACUTE                        |
   | Input    | U+0145 | U+0145 | Lu    | LATIN CAPITAL LETTER N WITH  |
   |          |        |        |       | CEDILLA                      |
   | Include  | U+0146 | U+0146 | Ll    | LATIN SMALL LETTER N WITH    |
   |          |        |        |       | CEDILLA                      |
   | Input    | U+0147 | U+0147 | Lu    | LATIN CAPITAL LETTER N WITH  |
   |          |        |        |       | CARON                        |
   | Include  | U+0148 | U+0148 | Ll    | LATIN SMALL LETTER N WITH    |
   |          |        |        |       | CARON                        |
   | Exclude  | U+0149 | U+02BC | Ll Lm | MODIFIER LETTER APOSTROPHE   |
   | Input    | U+014A | U+014A | Lu    | LATIN CAPITAL LETTER ENG     |
   | Include  | U+014B | U+014B | Ll    | LATIN SMALL LETTER ENG       |
   | Input    | U+014C | U+014C | Lu    | LATIN CAPITAL LETTER O WITH  |
   |          |        |        |       | MACRON                       |
   | Include  | U+014D | U+014D | Ll    | LATIN SMALL LETTER O WITH    |
   |          |        |        |       | MACRON                       |
   | Input    | U+014E | U+014E | Lu    | LATIN CAPITAL LETTER O WITH  |
   |          |        |        |       | BREVE                        |
   | Include  | U+014F | U+014F | Ll    | LATIN SMALL LETTER O WITH    |
   |          |        |        |       | BREVE                        |
   | Input    | U+0150 | U+0150 | Lu    | LATIN CAPITAL LETTER O WITH  |
   |          |        |        |       | DOUBLE ACUTE                 |
   | Include  | U+0151 | U+0151 | Ll    | LATIN SMALL LETTER O WITH    |
   |          |        |        |       | DOUBLE ACUTE                 |
   | Input    | U+0152 | U+0152 | Lu    | LATIN CAPITAL LIGATURE OE    |
   | Include  | U+0153 | U+0153 | Ll    | LATIN SMALL LIGATURE OE      |
   | Input    | U+0154 | U+0154 | Lu    | LATIN CAPITAL LETTER R WITH  |
   |          |        |        |       | ACUTE                        |
   | Include  | U+0155 | U+0155 | Ll    | LATIN SMALL LETTER R WITH    |
   |          |        |        |       | ACUTE                        |
   | Input    | U+0156 | U+0156 | Lu    | LATIN CAPITAL LETTER R WITH  |
   |          |        |        |       | CEDILLA                      |
   | Include  | U+0157 | U+0157 | Ll    | LATIN SMALL LETTER R WITH    |
   |          |        |        |       | CEDILLA                      |
   | Input    | U+0158 | U+0158 | Lu    | LATIN CAPITAL LETTER R WITH  |
   |          |        |        |       | CARON                        |
   | Include  | U+0159 | U+0159 | Ll    | LATIN SMALL LETTER R WITH    |
   |          |        |        |       | CARON                        |
   | Input    | U+015A | U+015A | Lu    | LATIN CAPITAL LETTER S WITH  |
   |          |        |        |       | ACUTE                        |
   | Include  | U+015B | U+015B | Ll    | LATIN SMALL LETTER S WITH    |
   |          |        |        |       | ACUTE                        |
   | Input    | U+015C | U+015C | Lu    | LATIN CAPITAL LETTER S WITH  |
   |          |        |        |       | CIRCUMFLEX                   |
   | Include  | U+015D | U+015D | Ll    | LATIN SMALL LETTER S WITH    |
   |          |        |        |       | CIRCUMFLEX                   |
   |
Input | U+015E | U+015E | Lu | LATIN CAPITAL LETTER S WITH | | | | | | CEDILLA | | Include | U+015F | U+015F | Ll | LATIN SMALL LETTER S WITH | | | | | | CEDILLA | | Input | U+0160 | U+0160 | Lu | LATIN CAPITAL LETTER S WITH | | | | | | CARON | | Include | U+0161 | U+0161 | Ll | LATIN SMALL LETTER S WITH | | | | | | CARON | | Input | U+0162 | U+0162 | Lu | LATIN CAPITAL LETTER T WITH | | | | | | CEDILLA | | Include | U+0163 | U+0163 | Ll | LATIN SMALL LETTER T WITH | | | | | | CEDILLA | | Input | U+0164 | U+0164 | Lu | LATIN CAPITAL LETTER T WITH | | | | | | CARON | | Include | U+0165 | U+0165 | Ll | LATIN SMALL LETTER T WITH | | | | | | CARON | | Input | U+0166 | U+0166 | Lu | LATIN CAPITAL LETTER T WITH | | | | | | STROKE | | Include | U+0167 | U+0167 | Ll | LATIN SMALL LETTER T WITH | | | | | | STROKE | | Input | U+0168 | U+0168 | Lu | LATIN CAPITAL LETTER U WITH | | | | | | TILDE | | Include | U+0169 | U+0169 | Ll | LATIN SMALL LETTER U WITH | | | | | | TILDE | | Input | U+016A | U+016A | Lu | LATIN CAPITAL LETTER U WITH | | | | | | MACRON | | Include | U+016B | U+016B | Ll | LATIN SMALL LETTER U WITH | | | | | | MACRON | | Input | U+016C | U+016C | Lu | LATIN CAPITAL LETTER U WITH | | | | | | BREVE | | Include | U+016D | U+016D | Ll | LATIN SMALL LETTER U WITH | | | | | | BREVE | | Input | U+016E | U+016E | Lu | LATIN CAPITAL LETTER U WITH | | | | | | RING ABOVE | | Include | U+016F | U+016F | Ll | LATIN SMALL LETTER U WITH | | | | | | RING ABOVE | | Input | U+0170 | U+0170 | Lu | LATIN CAPITAL LETTER U WITH | | | | | | DOUBLE ACUTE | | Include | U+0171 | U+0171 | Ll | LATIN SMALL LETTER U WITH | | | | | | DOUBLE ACUTE | | Input | U+0172 | U+0172 | Lu | LATIN CAPITAL LETTER U WITH | | | | | | OGONEK | Faltstrom Expires April 26, 2007 [Page 36] Internet-Draft Unicode Codepoints October 2006 | Include | U+0173 | U+0173 | Ll | LATIN SMALL LETTER U WITH | | | | | | OGONEK | | Input | U+0174 | U+0174 | Lu | LATIN CAPITAL LETTER W WITH | | | | | | CIRCUMFLEX | | 
Include | U+0175 | U+0175 | Ll | LATIN SMALL LETTER W WITH | | | | | | CIRCUMFLEX | | Input | U+0176 | U+0176 | Lu | LATIN CAPITAL LETTER Y WITH | | | | | | CIRCUMFLEX | | Include | U+0177 | U+0177 | Ll | LATIN SMALL LETTER Y WITH | | | | | | CIRCUMFLEX | | Input | U+0178 | U+0178 | Lu | LATIN CAPITAL LETTER Y WITH | | | | | | DIAERESIS | | Input | U+0179 | U+0179 | Lu | LATIN CAPITAL LETTER Z WITH | | | | | | ACUTE | | Include | U+017A | U+017A | Ll | LATIN SMALL LETTER Z WITH | | | | | | ACUTE | | Input | U+017B | U+017B | Lu | LATIN CAPITAL LETTER Z WITH | | | | | | DOT ABOVE | | Include | U+017C | U+017C | Ll | LATIN SMALL LETTER Z WITH | | | | | | DOT ABOVE | | Input | U+017D | U+017D | Lu | LATIN CAPITAL LETTER Z WITH | | | | | | CARON | | Include | U+017E | U+017E | Ll | LATIN SMALL LETTER Z WITH | | | | | | CARON | | Include | U+017F | U+0073 | Ll | LATIN SMALL LETTER S | +----------+--------+--------+-------+------------------------------+ 4.4. 0180-024F Latin Extended-B +----------+--------+--------+-------+------------------------------+ | Include? 
| Code | NFKC | Class | Name | +----------+--------+--------+-------+------------------------------+ | Include | U+0180 | U+0180 | Ll | LATIN SMALL LETTER B WITH | | | | | | STROKE | | Input | U+0181 | U+0181 | Lu | LATIN CAPITAL LETTER B WITH | | | | | | HOOK | | Input | U+0182 | U+0182 | Lu | LATIN CAPITAL LETTER B WITH | | | | | | TOPBAR | | Include | U+0183 | U+0183 | Ll | LATIN SMALL LETTER B WITH | | | | | | TOPBAR | | Input | U+0184 | U+0184 | Lu | LATIN CAPITAL LETTER TONE | | | | | | SIX | | Include | U+0185 | U+0185 | Ll | LATIN SMALL LETTER TONE SIX | | Input | U+0186 | U+0186 | Lu | LATIN CAPITAL LETTER OPEN O | | Input | U+0187 | U+0187 | Lu | LATIN CAPITAL LETTER C WITH | | | | | | HOOK | | Include | U+0188 | U+0188 | Ll | LATIN SMALL LETTER C WITH | | | | | | HOOK | Faltstrom Expires April 26, 2007 [Page 37] Internet-Draft Unicode Codepoints October 2006 | Input | U+0189 | U+0189 | Lu | LATIN CAPITAL LETTER AFRICAN | | | | | | D | | Input | U+018A | U+018A | Lu | LATIN CAPITAL LETTER D WITH | | | | | | HOOK | | Input | U+018B | U+018B | Lu | LATIN CAPITAL LETTER D WITH | | | | | | TOPBAR | | Include | U+018C | U+018C | Ll | LATIN SMALL LETTER D WITH | | | | | | TOPBAR | | Include | U+018D | U+018D | Ll | LATIN SMALL LETTER TURNED | | | | | | DELTA | | Input | U+018E | U+018E | Lu | LATIN CAPITAL LETTER | | | | | | REVERSED E | | Input | U+018F | U+018F | Lu | LATIN CAPITAL LETTER SCHWA | | Input | U+0190 | U+0190 | Lu | LATIN CAPITAL LETTER OPEN E | | Input | U+0191 | U+0191 | Lu | LATIN CAPITAL LETTER F WITH | | | | | | HOOK | | Include | U+0192 | U+0192 | Ll | LATIN SMALL LETTER F WITH | | | | | | HOOK | | Input | U+0193 | U+0193 | Lu | LATIN CAPITAL LETTER G WITH | | | | | | HOOK | | Input | U+0194 | U+0194 | Lu | LATIN CAPITAL LETTER GAMMA | | Include | U+0195 | U+0195 | Ll | LATIN SMALL LETTER HV | | Input | U+0196 | U+0196 | Lu | LATIN CAPITAL LETTER IOTA | | Input | U+0197 | U+0197 | Lu | LATIN CAPITAL LETTER I WITH | | | | | | STROKE | | 
Input | U+0198 | U+0198 | Lu | LATIN CAPITAL LETTER K WITH | | | | | | HOOK | | Include | U+0199 | U+0199 | Ll | LATIN SMALL LETTER K WITH | | | | | | HOOK | | Include | U+019A | U+019A | Ll | LATIN SMALL LETTER L WITH | | | | | | BAR | | Include | U+019B | U+019B | Ll | LATIN SMALL LETTER LAMBDA | | | | | | WITH STROKE | | Input | U+019C | U+019C | Lu | LATIN CAPITAL LETTER TURNED | | | | | | M | | Input | U+019D | U+019D | Lu | LATIN CAPITAL LETTER N WITH | | | | | | LEFT HOOK | | Include | U+019E | U+019E | Ll | LATIN SMALL LETTER N WITH | | | | | | LONG RIGHT LEG | | Input | U+019F | U+019F | Lu | LATIN CAPITAL LETTER O WITH | | | | | | MIDDLE TILDE | | Input | U+01A0 | U+01A0 | Lu | LATIN CAPITAL LETTER O WITH | | | | | | HORN | | Include | U+01A1 | U+01A1 | Ll | LATIN SMALL LETTER O WITH | | | | | | HORN | | Input | U+01A2 | U+01A2 | Lu | LATIN CAPITAL LETTER OI | | Include | U+01A3 | U+01A3 | Ll | LATIN SMALL LETTER OI | Faltstrom Expires April 26, 2007 [Page 38] Internet-Draft Unicode Codepoints October 2006 | Input | U+01A4 | U+01A4 | Lu | LATIN CAPITAL LETTER P WITH | | | | | | HOOK | | Include | U+01A5 | U+01A5 | Ll | LATIN SMALL LETTER P WITH | | | | | | HOOK | | Input | U+01A6 | U+01A6 | Lu | LATIN LETTER YR | | Input | U+01A7 | U+01A7 | Lu | LATIN CAPITAL LETTER TONE | | | | | | TWO | | Include | U+01A8 | U+01A8 | Ll | LATIN SMALL LETTER TONE TWO | | Input | U+01A9 | U+01A9 | Lu | LATIN CAPITAL LETTER ESH | | Include | U+01AA | U+01AA | Ll | LATIN LETTER REVERSED ESH | | | | | | LOOP | | Include | U+01AB | U+01AB | Ll | LATIN SMALL LETTER T WITH | | | | | | PALATAL HOOK | | Input | U+01AC | U+01AC | Lu | LATIN CAPITAL LETTER T WITH | | | | | | HOOK | | Include | U+01AD | U+01AD | Ll | LATIN SMALL LETTER T WITH | | | | | | HOOK | | Input | U+01AE | U+01AE | Lu | LATIN CAPITAL LETTER T WITH | | | | | | RETROFLEX HOOK | | Input | U+01AF | U+01AF | Lu | LATIN CAPITAL LETTER U WITH | | | | | | HORN | | Include | U+01B0 | U+01B0 | Ll | LATIN SMALL LETTER U 
WITH | | | | | | HORN | | Input | U+01B1 | U+01B1 | Lu | LATIN CAPITAL LETTER UPSILON | | Input | U+01B2 | U+01B2 | Lu | LATIN CAPITAL LETTER V WITH | | | | | | HOOK | | Input | U+01B3 | U+01B3 | Lu | LATIN CAPITAL LETTER Y WITH | | | | | | HOOK | | Include | U+01B4 | U+01B4 | Ll | LATIN SMALL LETTER Y WITH | | | | | | HOOK | | Input | U+01B5 | U+01B5 | Lu | LATIN CAPITAL LETTER Z WITH | | | | | | STROKE | | Include | U+01B6 | U+01B6 | Ll | LATIN SMALL LETTER Z WITH | | | | | | STROKE | | Input | U+01B7 | U+01B7 | Lu | LATIN CAPITAL LETTER EZH | | Input | U+01B8 | U+01B8 | Lu | LATIN CAPITAL LETTER EZH | | | | | | REVERSED | | Include | U+01B9 | U+01B9 | Ll | LATIN SMALL LETTER EZH | | | | | | REVERSED | | Include | U+01BA | U+01BA | Ll | LATIN SMALL LETTER EZH WITH | | | | | | TAIL | | Maybe | U+01BB | U+01BB | Lo | LATIN LETTER TWO WITH STROKE | | Input | U+01BC | U+01BC | Lu | LATIN CAPITAL LETTER TONE | | | | | | FIVE | | Include | U+01BD | U+01BD | Ll | LATIN SMALL LETTER TONE FIVE | | Include | U+01BE | U+01BE | Ll | LATIN LETTER INVERTED | | | | | | GLOTTAL STOP WITH STROKE | | Include | U+01BF | U+01BF | Ll | LATIN LETTER WYNN | Faltstrom Expires April 26, 2007 [Page 39] Internet-Draft Unicode Codepoints October 2006 | Maybe | U+01C0 | U+01C0 | Lo | LATIN LETTER DENTAL CLICK | | Maybe | U+01C1 | U+01C1 | Lo | LATIN LETTER LATERAL CLICK | | Maybe | U+01C2 | U+01C2 | Lo | LATIN LETTER ALVEOLAR CLICK | | Maybe | U+01C3 | U+01C3 | Lo | LATIN LETTER RETROFLEX CLICK | | Input | U+01C4 | U+0044 | Lu | LATIN CAPITAL LETTER D | | Include | U+01C5 | U+0044 | Lu Ll | LATIN CAPITAL LETTER D | | Include | U+01C6 | U+0064 | Ll | LATIN SMALL LETTER D | | Input | U+01C7 | U+004C | Lu | LATIN CAPITAL LETTER L | | Include | U+01C8 | U+004C | Lu Ll | LATIN CAPITAL LETTER L | | Include | U+01C9 | U+006C | Ll | LATIN SMALL LETTER L | | Input | U+01CA | U+004E | Lu | LATIN CAPITAL LETTER N | | Include | U+01CB | U+004E | Lu Ll | LATIN CAPITAL LETTER N | | Include | U+01CC | 
U+006E | Ll | LATIN SMALL LETTER N | | Input | U+01CD | U+01CD | Lu | LATIN CAPITAL LETTER A WITH | | | | | | CARON | | Include | U+01CE | U+01CE | Ll | LATIN SMALL LETTER A WITH | | | | | | CARON | | Input | U+01CF | U+01CF | Lu | LATIN CAPITAL LETTER I WITH | | | | | | CARON | | Include | U+01D0 | U+01D0 | Ll | LATIN SMALL LETTER I WITH | | | | | | CARON | | Input | U+01D1 | U+01D1 | Lu | LATIN CAPITAL LETTER O WITH | | | | | | CARON | | Include | U+01D2 | U+01D2 | Ll | LATIN SMALL LETTER O WITH | | | | | | CARON | | Input | U+01D3 | U+01D3 | Lu | LATIN CAPITAL LETTER U WITH | | | | | | CARON | | Include | U+01D4 | U+01D4 | Ll | LATIN SMALL LETTER U WITH | | | | | | CARON | | Input | U+01D5 | U+01D5 | Lu | LATIN CAPITAL LETTER U WITH | | | | | | DIAERESIS AND MACRON | | Include | U+01D6 | U+01D6 | Ll | LATIN SMALL LETTER U WITH | | | | | | DIAERESIS AND MACRON | | Input | U+01D7 | U+01D7 | Lu | LATIN CAPITAL LETTER U WITH | | | | | | DIAERESIS AND ACUTE | | Include | U+01D8 | U+01D8 | Ll | LATIN SMALL LETTER U WITH | | | | | | DIAERESIS AND ACUTE | | Input | U+01D9 | U+01D9 | Lu | LATIN CAPITAL LETTER U WITH | | | | | | DIAERESIS AND CARON | | Include | U+01DA | U+01DA | Ll | LATIN SMALL LETTER U WITH | | | | | | DIAERESIS AND CARON | | Input | U+01DB | U+01DB | Lu | LATIN CAPITAL LETTER U WITH | | | | | | DIAERESIS AND GRAVE | | Include | U+01DC | U+01DC | Ll | LATIN SMALL LETTER U WITH | | | | | | DIAERESIS AND GRAVE | | Include | U+01DD | U+01DD | Ll | LATIN SMALL LETTER TURNED E | | Input | U+01DE | U+01DE | Lu | LATIN CAPITAL LETTER A WITH | | | | | | DIAERESIS AND MACRON | Faltstrom Expires April 26, 2007 [Page 40] Internet-Draft Unicode Codepoints October 2006 | Include | U+01DF | U+01DF | Ll | LATIN SMALL LETTER A WITH | | | | | | DIAERESIS AND MACRON | | Input | U+01E0 | U+01E0 | Lu | LATIN CAPITAL LETTER A WITH | | | | | | DOT ABOVE AND MACRON | | Include | U+01E1 | U+01E1 | Ll | LATIN SMALL LETTER A WITH | | | | | | DOT ABOVE AND MACRON | | Input | 
U+01E2 | U+01E2 | Lu | LATIN CAPITAL LETTER AE WITH | | | | | | MACRON | | Include | U+01E3 | U+01E3 | Ll | LATIN SMALL LETTER AE WITH | | | | | | MACRON | | Input | U+01E4 | U+01E4 | Lu | LATIN CAPITAL LETTER G WITH | | | | | | STROKE | | Include | U+01E5 | U+01E5 | Ll | LATIN SMALL LETTER G WITH | | | | | | STROKE | | Input | U+01E6 | U+01E6 | Lu | LATIN CAPITAL LETTER G WITH | | | | | | CARON | | Include | U+01E7 | U+01E7 | Ll | LATIN SMALL LETTER G WITH | | | | | | CARON | | Input | U+01E8 | U+01E8 | Lu | LATIN CAPITAL LETTER K WITH | | | | | | CARON | | Include | U+01E9 | U+01E9 | Ll | LATIN SMALL LETTER K WITH | | | | | | CARON | | Input | U+01EA | U+01EA | Lu | LATIN CAPITAL LETTER O WITH | | | | | | OGONEK | | Include | U+01EB | U+01EB | Ll | LATIN SMALL LETTER O WITH | | | | | | OGONEK | | Input | U+01EC | U+01EC | Lu | LATIN CAPITAL LETTER O WITH | | | | | | OGONEK AND MACRON | | Include | U+01ED | U+01ED | Ll | LATIN SMALL LETTER O WITH | | | | | | OGONEK AND MACRON | | Input | U+01EE | U+01EE | Lu | LATIN CAPITAL LETTER EZH | | | | | | WITH CARON | | Include | U+01EF | U+01EF | Ll | LATIN SMALL LETTER EZH WITH | | | | | | CARON | | Include | U+01F0 | U+01F0 | Ll | LATIN SMALL LETTER J WITH | | | | | | CARON | | Input | U+01F1 | U+0044 | Lu | LATIN CAPITAL LETTER D | | Include | U+01F2 | U+0044 | Lu Ll | LATIN CAPITAL LETTER D | | Include | U+01F3 | U+0064 | Ll | LATIN SMALL LETTER D | | Input | U+01F4 | U+01F4 | Lu | LATIN CAPITAL LETTER G WITH | | | | | | ACUTE | | Include | U+01F5 | U+01F5 | Ll | LATIN SMALL LETTER G WITH | | | | | | ACUTE | | Input | U+01F6 | U+01F6 | Lu | LATIN CAPITAL LETTER HWAIR | | Input | U+01F7 | U+01F7 | Lu | LATIN CAPITAL LETTER WYNN | | Input | U+01F8 | U+01F8 | Lu | LATIN CAPITAL LETTER N WITH | | | | | | GRAVE | Faltstrom Expires April 26, 2007 [Page 41] Internet-Draft Unicode Codepoints October 2006 | Include | U+01F9 | U+01F9 | Ll | LATIN SMALL LETTER N WITH | | | | | | GRAVE | | Input | U+01FA | U+01FA | Lu | LATIN 
CAPITAL LETTER A WITH | | | | | | RING ABOVE AND ACUTE | | Include | U+01FB | U+01FB | Ll | LATIN SMALL LETTER A WITH | | | | | | RING ABOVE AND ACUTE | | Input | U+01FC | U+01FC | Lu | LATIN CAPITAL LETTER AE WITH | | | | | | ACUTE | | Include | U+01FD | U+01FD | Ll | LATIN SMALL LETTER AE WITH | | | | | | ACUTE | | Input | U+01FE | U+01FE | Lu | LATIN CAPITAL LETTER O WITH | | | | | | STROKE AND ACUTE | | Include | U+01FF | U+01FF | Ll | LATIN SMALL LETTER O WITH | | | | | | STROKE AND ACUTE | | Input | U+0200 | U+0200 | Lu | LATIN CAPITAL LETTER A WITH | | | | | | DOUBLE GRAVE | | Include | U+0201 | U+0201 | Ll | LATIN SMALL LETTER A WITH | | | | | | DOUBLE GRAVE | | Input | U+0202 | U+0202 | Lu | LATIN CAPITAL LETTER A WITH | | | | | | INVERTED BREVE | | Include | U+0203 | U+0203 | Ll | LATIN SMALL LETTER A WITH | | | | | | INVERTED BREVE | | Input | U+0204 | U+0204 | Lu | LATIN CAPITAL LETTER E WITH | | | | | | DOUBLE GRAVE | | Include | U+0205 | U+0205 | Ll | LATIN SMALL LETTER E WITH | | | | | | DOUBLE GRAVE | | Input | U+0206 | U+0206 | Lu | LATIN CAPITAL LETTER E WITH | | | | | | INVERTED BREVE | | Include | U+0207 | U+0207 | Ll | LATIN SMALL LETTER E WITH | | | | | | INVERTED BREVE | | Input | U+0208 | U+0208 | Lu | LATIN CAPITAL LETTER I WITH | | | | | | DOUBLE GRAVE | | Include | U+0209 | U+0209 | Ll | LATIN SMALL LETTER I WITH | | | | | | DOUBLE GRAVE | | Input | U+020A | U+020A | Lu | LATIN CAPITAL LETTER I WITH | | | | | | INVERTED BREVE | | Include | U+020B | U+020B | Ll | LATIN SMALL LETTER I WITH | | | | | | INVERTED BREVE | | Input | U+020C | U+020C | Lu | LATIN CAPITAL LETTER O WITH | | | | | | DOUBLE GRAVE | | Include | U+020D | U+020D | Ll | LATIN SMALL LETTER O WITH | | | | | | DOUBLE GRAVE | | Input | U+020E | U+020E | Lu | LATIN CAPITAL LETTER O WITH | | | | | | INVERTED BREVE | | Include | U+020F | U+020F | Ll | LATIN SMALL LETTER O WITH | | | | | | INVERTED BREVE | | Input | U+0210 | U+0210 | Lu | LATIN CAPITAL LETTER R WITH | | | | | | 
DOUBLE GRAVE | Faltstrom Expires April 26, 2007 [Page 42] Internet-Draft Unicode Codepoints October 2006 | Include | U+0211 | U+0211 | Ll | LATIN SMALL LETTER R WITH | | | | | | DOUBLE GRAVE | | Input | U+0212 | U+0212 | Lu | LATIN CAPITAL LETTER R WITH | | | | | | INVERTED BREVE | | Include | U+0213 | U+0213 | Ll | LATIN SMALL LETTER R WITH | | | | | | INVERTED BREVE | | Input | U+0214 | U+0214 | Lu | LATIN CAPITAL LETTER U WITH | | | | | | DOUBLE GRAVE | | Include | U+0215 | U+0215 | Ll | LATIN SMALL LETTER U WITH | | | | | | DOUBLE GRAVE | | Input | U+0216 | U+0216 | Lu | LATIN CAPITAL LETTER U WITH | | | | | | INVERTED BREVE | | Include | U+0217 | U+0217 | Ll | LATIN SMALL LETTER U WITH | | | | | | INVERTED BREVE | | Input | U+0218 | U+0218 | Lu | LATIN CAPITAL LETTER S WITH | | | | | | COMMA BELOW | | Include | U+0219 | U+0219 | Ll | LATIN SMALL LETTER S WITH | | | | | | COMMA BELOW | | Input | U+021A | U+021A | Lu | LATIN CAPITAL LETTER T WITH | | | | | | COMMA BELOW | | Include | U+021B | U+021B | Ll | LATIN SMALL LETTER T WITH | | | | | | COMMA BELOW | | Input | U+021C | U+021C | Lu | LATIN CAPITAL LETTER YOGH | | Include | U+021D | U+021D | Ll | LATIN SMALL LETTER YOGH | | Input | U+021E | U+021E | Lu | LATIN CAPITAL LETTER H WITH | | | | | | CARON | | Include | U+021F | U+021F | Ll | LATIN SMALL LETTER H WITH | | | | | | CARON | | Input | U+0220 | U+0220 | Lu | LATIN CAPITAL LETTER N WITH | | | | | | LONG RIGHT LEG | | Include | U+0221 | U+0221 | Ll | LATIN SMALL LETTER D WITH | | | | | | CURL | | Input | U+0222 | U+0222 | Lu | LATIN CAPITAL LETTER OU | | Include | U+0223 | U+0223 | Ll | LATIN SMALL LETTER OU | | Input | U+0224 | U+0224 | Lu | LATIN CAPITAL LETTER Z WITH | | | | | | HOOK | | Include | U+0225 | U+0225 | Ll | LATIN SMALL LETTER Z WITH | | | | | | HOOK | | Input | U+0226 | U+0226 | Lu | LATIN CAPITAL LETTER A WITH | | | | | | DOT ABOVE | | Include | U+0227 | U+0227 | Ll | LATIN SMALL LETTER A WITH | | | | | | DOT ABOVE | | Input | U+0228 | 
U+0228 | Lu | LATIN CAPITAL LETTER E WITH | | | | | | CEDILLA | | Include | U+0229 | U+0229 | Ll | LATIN SMALL LETTER E WITH | | | | | | CEDILLA | | Input | U+022A | U+022A | Lu | LATIN CAPITAL LETTER O WITH | | | | | | DIAERESIS AND MACRON | Faltstrom Expires April 26, 2007 [Page 43] Internet-Draft Unicode Codepoints October 2006 | Include | U+022B | U+022B | Ll | LATIN SMALL LETTER O WITH | | | | | | DIAERESIS AND MACRON | | Input | U+022C | U+022C | Lu | LATIN CAPITAL LETTER O WITH | | | | | | TILDE AND MACRON | | Include | U+022D | U+022D | Ll | LATIN SMALL LETTER O WITH | | | | | | TILDE AND MACRON | | Input | U+022E | U+022E | Lu | LATIN CAPITAL LETTER O WITH | | | | | | DOT ABOVE | | Include | U+022F | U+022F | Ll | LATIN SMALL LETTER O WITH | | | | | | DOT ABOVE | | Input | U+0230 | U+0230 | Lu | LATIN CAPITAL LETTER O WITH | | | | | | DOT ABOVE AND MACRON | | Include | U+0231 | U+0231 | Ll | LATIN SMALL LETTER O WITH | | | | | | DOT ABOVE AND MACRON | | Input | U+0232 | U+0232 | Lu | LATIN CAPITAL LETTER Y WITH | | | | | | MACRON | | Include | U+0233 | U+0233 | Ll | LATIN SMALL LETTER Y WITH | | | | | | MACRON | | Include | U+0234 | U+0234 | Ll | LATIN SMALL LETTER L WITH | | | | | | CURL | | Include | U+0235 | U+0235 | Ll | LATIN SMALL LETTER N WITH | | | | | | CURL | | Include | U+0236 | U+0236 | Ll | LATIN SMALL LETTER T WITH | | | | | | CURL | | Exclude | U+0237 | U+0237 | Cn | LATIN SMALL LETTER DOTLESS J | | Exclude | U+0238 | U+0238 | Cn | LATIN SMALL LETTER DB | | | | | | DIGRAPH | | Exclude | U+0239 | U+0239 | Cn | LATIN SMALL LETTER QP | | | | | | DIGRAPH | | Exclude | U+023A | U+023A | Cn | LATIN CAPITAL LETTER A WITH | | | | | | STROKE | | Exclude | U+023B | U+023B | Cn | LATIN CAPITAL LETTER C WITH | | | | | | STROKE | | Exclude | U+023C | U+023C | Cn | LATIN SMALL LETTER C WITH | | | | | | STROKE | | Exclude | U+023D | U+023D | Cn | LATIN CAPITAL LETTER L WITH | | | | | | BAR | | Exclude | U+023E | U+023E | Cn | LATIN CAPITAL LETTER T WITH | 
| | | | | DIAGONAL STROKE | | Exclude | U+023F | U+023F | Cn | LATIN SMALL LETTER S WITH | | | | | | SWASH TAIL | | Exclude | U+0240 | U+0240 | Cn | LATIN SMALL LETTER Z WITH | | | | | | SWASH TAIL | | Exclude | U+0241 | U+0241 | Cn | LATIN CAPITAL LETTER GLOTTAL | | | | | | STOP | | Exclude | U+0242 | U+0242 | Cn | | | Exclude | U+0243 | U+0243 | Cn | | | Exclude | U+0244 | U+0244 | Cn | | Faltstrom Expires April 26, 2007 [Page 44] Internet-Draft Unicode Codepoints October 2006 | Exclude | U+0245 | U+0245 | Cn | | | Exclude | U+0246 | U+0246 | Cn | | | Exclude | U+0247 | U+0247 | Cn | | | Exclude | U+0248 | U+0248 | Cn | | | Exclude | U+0249 | U+0249 | Cn | | | Exclude | U+024A | U+024A | Cn | | | Exclude | U+024B | U+024B | Cn | | | Exclude | U+024C | U+024C | Cn | | | Exclude | U+024D | U+024D | Cn | | | Exclude | U+024E | U+024E | Cn | | | Exclude | U+024F | U+024F | Cn | | +----------+--------+--------+-------+------------------------------+ 4.5. 0250-02AF IPA Extensions +----------+--------+--------+-------+------------------------------+ | Include? 
| Code | NFKC | Class | Name | +----------+--------+--------+-------+------------------------------+ | Include | U+0250 | U+0250 | Ll | LATIN SMALL LETTER TURNED A | | Include | U+0251 | U+0251 | Ll | LATIN SMALL LETTER ALPHA | | Include | U+0252 | U+0252 | Ll | LATIN SMALL LETTER TURNED | | | | | | ALPHA | | Include | U+0253 | U+0253 | Ll | LATIN SMALL LETTER B WITH | | | | | | HOOK | | Include | U+0254 | U+0254 | Ll | LATIN SMALL LETTER OPEN O | | Include | U+0255 | U+0255 | Ll | LATIN SMALL LETTER C WITH | | | | | | CURL | | Include | U+0256 | U+0256 | Ll | LATIN SMALL LETTER D WITH | | | | | | TAIL | | Include | U+0257 | U+0257 | Ll | LATIN SMALL LETTER D WITH | | | | | | HOOK | | Include | U+0258 | U+0258 | Ll | LATIN SMALL LETTER REVERSED | | | | | | E | | Include | U+0259 | U+0259 | Ll | LATIN SMALL LETTER SCHWA | | Include | U+025A | U+025A | Ll | LATIN SMALL LETTER SCHWA | | | | | | WITH HOOK | | Include | U+025B | U+025B | Ll | LATIN SMALL LETTER OPEN E | | Include | U+025C | U+025C | Ll | LATIN SMALL LETTER REVERSED | | | | | | OPEN E | | Include | U+025D | U+025D | Ll | LATIN SMALL LETTER REVERSED | | | | | | OPEN E WITH HOOK | | Include | U+025E | U+025E | Ll | LATIN SMALL LETTER CLOSED | | | | | | REVERSED OPEN E | | Include | U+025F | U+025F | Ll | LATIN SMALL LETTER DOTLESS J | | | | | | WITH STROKE | | Include | U+0260 | U+0260 | Ll | LATIN SMALL LETTER G WITH | | | | | | HOOK | | Include | U+0261 | U+0261 | Ll | LATIN SMALL LETTER SCRIPT G | Faltstrom Expires April 26, 2007 [Page 45] Internet-Draft Unicode Codepoints October 2006 | Include | U+0262 | U+0262 | Ll | LATIN LETTER SMALL CAPITAL G | | Include | U+0263 | U+0263 | Ll | LATIN SMALL LETTER GAMMA | | Include | U+0264 | U+0264 | Ll | LATIN SMALL LETTER RAMS HORN | | Include | U+0265 | U+0265 | Ll | LATIN SMALL LETTER TURNED H | | Include | U+0266 | U+0266 | Ll | LATIN SMALL LETTER H WITH | | | | | | HOOK | | Include | U+0267 | U+0267 | Ll | LATIN SMALL LETTER HENG WITH | | | | | | HOOK | | 
Include | U+0268 | U+0268 | Ll | LATIN SMALL LETTER I WITH | | | | | | STROKE | | Include | U+0269 | U+0269 | Ll | LATIN SMALL LETTER IOTA | | Include | U+026A | U+026A | Ll | LATIN LETTER SMALL CAPITAL I | | Include | U+026B | U+026B | Ll | LATIN SMALL LETTER L WITH | | | | | | MIDDLE TILDE | | Include | U+026C | U+026C | Ll | LATIN SMALL LETTER L WITH | | | | | | BELT | | Include | U+026D | U+026D | Ll | LATIN SMALL LETTER L WITH | | | | | | RETROFLEX HOOK | | Include | U+026E | U+026E | Ll | LATIN SMALL LETTER LEZH | | Include | U+026F | U+026F | Ll | LATIN SMALL LETTER TURNED M | | Include | U+0270 | U+0270 | Ll | LATIN SMALL LETTER TURNED M | | | | | | WITH LONG LEG | | Include | U+0271 | U+0271 | Ll | LATIN SMALL LETTER M WITH | | | | | | HOOK | | Include | U+0272 | U+0272 | Ll | LATIN SMALL LETTER N WITH | | | | | | LEFT HOOK | | Include | U+0273 | U+0273 | Ll | LATIN SMALL LETTER N WITH | | | | | | RETROFLEX HOOK | | Include | U+0274 | U+0274 | Ll | LATIN LETTER SMALL CAPITAL N | | Include | U+0275 | U+0275 | Ll | LATIN SMALL LETTER BARRED O | | Include | U+0276 | U+0276 | Ll | LATIN LETTER SMALL CAPITAL | | | | | | OE | | Include | U+0277 | U+0277 | Ll | LATIN SMALL LETTER CLOSED | | | | | | OMEGA | | Include | U+0278 | U+0278 | Ll | LATIN SMALL LETTER PHI | | Include | U+0279 | U+0279 | Ll | LATIN SMALL LETTER TURNED R | | Include | U+027A | U+027A | Ll | LATIN SMALL LETTER TURNED R | | | | | | WITH LONG LEG | | Include | U+027B | U+027B | Ll | LATIN SMALL LETTER TURNED R | | | | | | WITH HOOK | | Include | U+027C | U+027C | Ll | LATIN SMALL LETTER R WITH | | | | | | LONG LEG | | Include | U+027D | U+027D | Ll | LATIN SMALL LETTER R WITH | | | | | | TAIL | | Include | U+027E | U+027E | Ll | LATIN SMALL LETTER R WITH | | | | | | FISHHOOK | | Include | U+027F | U+027F | Ll | LATIN SMALL LETTER REVERSED | | | | | | R WITH FISHHOOK | Faltstrom Expires April 26, 2007 [Page 46] Internet-Draft Unicode Codepoints October 2006 | Include | U+0280 | U+0280 | Ll | 
LATIN LETTER SMALL CAPITAL R | | Include | U+0281 | U+0281 | Ll | LATIN LETTER SMALL CAPITAL | | | | | | INVERTED R | | Include | U+0282 | U+0282 | Ll | LATIN SMALL LETTER S WITH | | | | | | HOOK | | Include | U+0283 | U+0283 | Ll | LATIN SMALL LETTER ESH | | Include | U+0284 | U+0284 | Ll | LATIN SMALL LETTER DOTLESS J | | | | | | WITH STROKE AND HOOK | | Include | U+0285 | U+0285 | Ll | LATIN SMALL LETTER SQUAT | | | | | | REVERSED ESH | | Include | U+0286 | U+0286 | Ll | LATIN SMALL LETTER ESH WITH | | | | | | CURL | | Include | U+0287 | U+0287 | Ll | LATIN SMALL LETTER TURNED T | | Include | U+0288 | U+0288 | Ll | LATIN SMALL LETTER T WITH | | | | | | RETROFLEX HOOK | | Include | U+0289 | U+0289 | Ll | LATIN SMALL LETTER U BAR | | Include | U+028A | U+028A | Ll | LATIN SMALL LETTER UPSILON | | Include | U+028B | U+028B | Ll | LATIN SMALL LETTER V WITH | | | | | | HOOK | | Include | U+028C | U+028C | Ll | LATIN SMALL LETTER TURNED V | | Include | U+028D | U+028D | Ll | LATIN SMALL LETTER TURNED W | | Include | U+028E | U+028E | Ll | LATIN SMALL LETTER TURNED Y | | Include | U+028F | U+028F | Ll | LATIN LETTER SMALL CAPITAL Y | | Include | U+0290 | U+0290 | Ll | LATIN SMALL LETTER Z WITH | | | | | | RETROFLEX HOOK | | Include | U+0291 | U+0291 | Ll | LATIN SMALL LETTER Z WITH | | | | | | CURL | | Include | U+0292 | U+0292 | Ll | LATIN SMALL LETTER EZH | | Include | U+0293 | U+0293 | Ll | LATIN SMALL LETTER EZH WITH | | | | | | CURL | | Include | U+0294 | U+0294 | Ll | LATIN LETTER GLOTTAL STOP | | Include | U+0295 | U+0295 | Ll | LATIN LETTER PHARYNGEAL | | | | | | VOICED FRICATIVE | | Include | U+0296 | U+0296 | Ll | LATIN LETTER INVERTED | | | | | | GLOTTAL STOP | | Include | U+0297 | U+0297 | Ll | LATIN LETTER STRETCHED C | | Include | U+0298 | U+0298 | Ll | LATIN LETTER BILABIAL CLICK | | Include | U+0299 | U+0299 | Ll | LATIN LETTER SMALL CAPITAL B | | Include | U+029A | U+029A | Ll | LATIN SMALL LETTER CLOSED | | | | | | OPEN E | | Include | U+029B | U+029B 
| Ll | LATIN LETTER SMALL CAPITAL G | | | | | | WITH HOOK | | Include | U+029C | U+029C | Ll | LATIN LETTER SMALL CAPITAL H | | Include | U+029D | U+029D | Ll | LATIN SMALL LETTER J WITH | | | | | | CROSSED-TAIL | | Include | U+029E | U+029E | Ll | LATIN SMALL LETTER TURNED K | | Include | U+029F | U+029F | Ll | LATIN LETTER SMALL CAPITAL L | Faltstrom Expires April 26, 2007 [Page 47] Internet-Draft Unicode Codepoints October 2006 | Include | U+02A0 | U+02A0 | Ll | LATIN SMALL LETTER Q WITH | | | | | | HOOK | | Include | U+02A1 | U+02A1 | Ll | LATIN LETTER GLOTTAL STOP | | | | | | WITH STROKE | | Include | U+02A2 | U+02A2 | Ll | LATIN LETTER REVERSED | | | | | | GLOTTAL STOP WITH STROKE | | Include | U+02A3 | U+02A3 | Ll | LATIN SMALL LETTER DZ | | | | | | DIGRAPH | | Include | U+02A4 | U+02A4 | Ll | LATIN SMALL LETTER DEZH | | | | | | DIGRAPH | | Include | U+02A5 | U+02A5 | Ll | LATIN SMALL LETTER DZ | | | | | | DIGRAPH WITH CURL | | Include | U+02A6 | U+02A6 | Ll | LATIN SMALL LETTER TS | | | | | | DIGRAPH | | Include | U+02A7 | U+02A7 | Ll | LATIN SMALL LETTER TESH | | | | | | DIGRAPH | | Include | U+02A8 | U+02A8 | Ll | LATIN SMALL LETTER TC | | | | | | DIGRAPH WITH CURL | | Include | U+02A9 | U+02A9 | Ll | LATIN SMALL LETTER FENG | | | | | | DIGRAPH | | Include | U+02AA | U+02AA | Ll | LATIN SMALL LETTER LS | | | | | | DIGRAPH | | Include | U+02AB | U+02AB | Ll | LATIN SMALL LETTER LZ | | | | | | DIGRAPH | | Include | U+02AC | U+02AC | Ll | LATIN LETTER BILABIAL | | | | | | PERCUSSIVE | | Include | U+02AD | U+02AD | Ll | LATIN LETTER BIDENTAL | | | | | | PERCUSSIVE | | Include | U+02AE | U+02AE | Ll | LATIN SMALL LETTER TURNED H | | | | | | WITH FISHHOOK | | Include | U+02AF | U+02AF | Ll | LATIN SMALL LETTER TURNED H | | | | | | WITH FISHHOOK AND TAIL | +----------+--------+--------+-------+------------------------------+ 4.6. 02B0-02FF Spacing Modifier Letters +----------+--------+--------+-------+------------------------------+ | Include? 
| Code | NFKC | Class | Name | +----------+--------+--------+-------+------------------------------+ | Include | U+02B0 | U+0068 | Ll | LATIN SMALL LETTER H | | Include | U+02B1 | U+0266 | Ll | LATIN SMALL LETTER H WITH | | | | | | HOOK | | Include | U+02B2 | U+006A | Ll | LATIN SMALL LETTER J | | Include | U+02B3 | U+0072 | Ll | LATIN SMALL LETTER R | | Include | U+02B4 | U+0279 | Ll | LATIN SMALL LETTER TURNED R | | Include | U+02B5 | U+027B | Ll | LATIN SMALL LETTER TURNED R | | | | | | WITH HOOK | Faltstrom Expires April 26, 2007 [Page 48] Internet-Draft Unicode Codepoints October 2006 | Include | U+02B6 | U+0281 | Ll | LATIN LETTER SMALL CAPITAL | | | | | | INVERTED R | | Include | U+02B7 | U+0077 | Ll | LATIN SMALL LETTER W | | Include | U+02B8 | U+0079 | Ll | LATIN SMALL LETTER Y | | Exclude | U+02B9 | U+02B9 | Lm | MODIFIER LETTER PRIME | | Exclude | U+02BA | U+02BA | Lm | MODIFIER LETTER DOUBLE PRIME | | Exclude | U+02BB | U+02BB | Lm | MODIFIER LETTER TURNED COMMA | | Exclude | U+02BC | U+02BC | Lm | MODIFIER LETTER APOSTROPHE | | Exclude | U+02BD | U+02BD | Lm | MODIFIER LETTER REVERSED | | | | | | COMMA | | Exclude | U+02BE | U+02BE | Lm | MODIFIER LETTER RIGHT HALF | | | | | | RING | | Exclude | U+02BF | U+02BF | Lm | MODIFIER LETTER LEFT HALF | | | | | | RING | | Exclude | U+02C0 | U+02C0 | Lm | MODIFIER LETTER GLOTTAL STOP | | Exclude | U+02C1 | U+02C1 | Lm | MODIFIER LETTER REVERSED | | | | | | GLOTTAL STOP | | | U+02C2 | U+02C2 | | MODIFIER LETTER LEFT | | | | | | ARROWHEAD | | | U+02C3 | U+02C3 | | MODIFIER LETTER RIGHT | | | | | | ARROWHEAD | | | U+02C4 | U+02C4 | | MODIFIER LETTER UP ARROWHEAD | | | U+02C5 | U+02C5 | | MODIFIER LETTER DOWN | | | | | | ARROWHEAD | | Exclude | U+02C6 | U+02C6 | Lm | MODIFIER LETTER CIRCUMFLEX | | | | | | ACCENT | | Exclude | U+02C7 | U+02C7 | Lm | CARON | | Exclude | U+02C8 | U+02C8 | Lm | MODIFIER LETTER VERTICAL | | | | | | LINE | | Exclude | U+02C9 | U+02C9 | Lm | MODIFIER LETTER MACRON | | Exclude | U+02CA | 
U+02CA | Lm | MODIFIER LETTER ACUTE ACCENT | | Exclude | U+02CB | U+02CB | Lm | MODIFIER LETTER GRAVE ACCENT | | Exclude | U+02CC | U+02CC | Lm | MODIFIER LETTER LOW VERTICAL | | | | | | LINE | | Exclude | U+02CD | U+02CD | Lm | MODIFIER LETTER LOW MACRON | | Exclude | U+02CE | U+02CE | Lm | MODIFIER LETTER LOW GRAVE | | | | | | ACCENT | | Exclude | U+02CF | U+02CF | Lm | MODIFIER LETTER LOW ACUTE | | | | | | ACCENT | | Exclude | U+02D0 | U+02D0 | Lm | MODIFIER LETTER TRIANGULAR | | | | | | COLON | | Exclude | U+02D1 | U+02D1 | Lm | MODIFIER LETTER HALF | | | | | | TRIANGULAR COLON | | | U+02D2 | U+02D2 | | MODIFIER LETTER CENTRED | | | | | | RIGHT HALF RING | | | U+02D3 | U+02D3 | | MODIFIER LETTER CENTRED LEFT | | | | | | HALF RING | | | U+02D4 | U+02D4 | | MODIFIER LETTER UP TACK | Faltstrom Expires April 26, 2007 [Page 49] Internet-Draft Unicode Codepoints October 2006 | | U+02D5 | U+02D5 | | MODIFIER LETTER DOWN TACK | | | U+02D6 | U+02D6 | | MODIFIER LETTER PLUS SIGN | | | U+02D7 | U+02D7 | | MODIFIER LETTER MINUS SIGN | | Exclude | U+02D8 | U+0020 | Mn Zs | SPACE | | Exclude | U+02D9 | U+0020 | Mn Zs | SPACE | | Exclude | U+02DA | U+0020 | Mn Zs | SPACE | | Exclude | U+02DB | U+0020 | Mn Zs | SPACE | | Exclude | U+02DC | U+0020 | Mn Zs | SPACE | | Exclude | U+02DD | U+0020 | Mn Zs | SPACE | | | U+02DE | U+02DE | | MODIFIER LETTER RHOTIC HOOK | | | U+02DF | U+02DF | | MODIFIER LETTER CROSS ACCENT | | Include | U+02E0 | U+0263 | Ll | LATIN SMALL LETTER GAMMA | | Include | U+02E1 | U+006C | Ll | LATIN SMALL LETTER L | | Include | U+02E2 | U+0073 | Ll | LATIN SMALL LETTER S | | Include | U+02E3 | U+0078 | Ll | LATIN SMALL LETTER X | | Include | U+02E4 | U+0295 | Ll | LATIN LETTER PHARYNGEAL | | | | | | VOICED FRICATIVE | | | U+02E5 | U+02E5 | | MODIFIER LETTER EXTRA-HIGH | | | | | | TONE BAR | | | U+02E6 | U+02E6 | | MODIFIER LETTER HIGH TONE | | | | | | BAR | | | U+02E7 | U+02E7 | | MODIFIER LETTER MID TONE BAR | | | U+02E8 | U+02E8 | | MODIFIER LETTER LOW TONE 
BAR |
|          | U+02E9 | U+02E9 |       | MODIFIER LETTER EXTRA-LOW    |
|          |        |        |       | TONE BAR                     |
|          | U+02EA | U+02EA |       | MODIFIER LETTER YIN          |
|          |        |        |       | DEPARTING TONE MARK          |
|          | U+02EB | U+02EB |       | MODIFIER LETTER YANG         |
|          |        |        |       | DEPARTING TONE MARK          |
|          | U+02EC | U+02EC |       | MODIFIER LETTER VOICING      |
|          | U+02ED | U+02ED |       | MODIFIER LETTER UNASPIRATED  |
| Exclude  | U+02EE | U+02EE | Lm    | MODIFIER LETTER DOUBLE       |
|          |        |        |       | APOSTROPHE                   |
|          | U+02EF | U+02EF |       | MODIFIER LETTER LOW DOWN     |
|          |        |        |       | ARROWHEAD                    |
|          | U+02F0 | U+02F0 |       | MODIFIER LETTER LOW UP       |
|          |        |        |       | ARROWHEAD                    |
|          | U+02F1 | U+02F1 |       | MODIFIER LETTER LOW LEFT     |
|          |        |        |       | ARROWHEAD                    |
|          | U+02F2 | U+02F2 |       | MODIFIER LETTER LOW RIGHT    |
|          |        |        |       | ARROWHEAD                    |
|          | U+02F3 | U+02F3 |       | MODIFIER LETTER LOW RING     |
|          | U+02F4 | U+02F4 |       | MODIFIER LETTER MIDDLE GRAVE |
|          |        |        |       | ACCENT                       |
|          | U+02F5 | U+02F5 |       | MODIFIER LETTER MIDDLE       |
|          |        |        |       | DOUBLE GRAVE ACCENT          |
|          | U+02F6 | U+02F6 |       | MODIFIER LETTER MIDDLE       |
|          |        |        |       | DOUBLE ACUTE ACCENT          |
|          | U+02F7 | U+02F7 |       | MODIFIER LETTER LOW TILDE    |
|          | U+02F8 | U+02F8 |       | MODIFIER LETTER RAISED COLON |
|          | U+02F9 | U+02F9 |       | MODIFIER LETTER BEGIN HIGH   |
|          |        |        |       | TONE                         |
|          | U+02FA | U+02FA |       | MODIFIER LETTER END HIGH     |
|          |        |        |       | TONE                         |
|          | U+02FB | U+02FB |       | MODIFIER LETTER BEGIN LOW    |
|          |        |        |       | TONE                         |
|          | U+02FC | U+02FC |       | MODIFIER LETTER END LOW TONE |
|          | U+02FD | U+02FD |       | MODIFIER LETTER SHELF        |
|          | U+02FE | U+02FE |       | MODIFIER LETTER OPEN SHELF   |
|          | U+02FF | U+02FF |       | MODIFIER LETTER LOW LEFT     |
|          |        |        |       | ARROW                        |
+----------+--------+--------+-------+------------------------------+

4.7. 0300-036F Combining Diacritical Marks

+----------+--------+--------+-------+------------------------------+
| Include?
| Code | NFKC | Class | Name | +----------+--------+--------+-------+------------------------------+ | Possibly | U+0300 | U+0300 | Mn | COMBINING GRAVE ACCENT | | not | | | | | | Possibly | U+0301 | U+0301 | Mn | COMBINING ACUTE ACCENT | | not | | | | | | Possibly | U+0302 | U+0302 | Mn | COMBINING CIRCUMFLEX ACCENT | | not | | | | | | Possibly | U+0303 | U+0303 | Mn | COMBINING TILDE | | not | | | | | | Possibly | U+0304 | U+0304 | Mn | COMBINING MACRON | | not | | | | | | Possibly | U+0305 | U+0305 | Mn | COMBINING OVERLINE | | not | | | | | | Possibly | U+0306 | U+0306 | Mn | COMBINING BREVE | | not | | | | | | Possibly | U+0307 | U+0307 | Mn | COMBINING DOT ABOVE | | not | | | | | | Possibly | U+0308 | U+0308 | Mn | COMBINING DIAERESIS | | not | | | | | | Possibly | U+0309 | U+0309 | Mn | COMBINING HOOK ABOVE | | not | | | | | | Possibly | U+030A | U+030A | Mn | COMBINING RING ABOVE | | not | | | | | | Possibly | U+030B | U+030B | Mn | COMBINING DOUBLE ACUTE | | not | | | | ACCENT | | Possibly | U+030C | U+030C | Mn | COMBINING CARON | | not | | | | | | Possibly | U+030D | U+030D | Mn | COMBINING VERTICAL LINE | | not | | | | ABOVE | Faltstrom Expires April 26, 2007 [Page 51] Internet-Draft Unicode Codepoints October 2006 | Possibly | U+030E | U+030E | Mn | COMBINING DOUBLE VERTICAL | | not | | | | LINE ABOVE | | Possibly | U+030F | U+030F | Mn | COMBINING DOUBLE GRAVE | | not | | | | ACCENT | | Possibly | U+0310 | U+0310 | Mn | COMBINING CANDRABINDU | | not | | | | | | Possibly | U+0311 | U+0311 | Mn | COMBINING INVERTED BREVE | | not | | | | | | Possibly | U+0312 | U+0312 | Mn | COMBINING TURNED COMMA ABOVE | | not | | | | | | Possibly | U+0313 | U+0313 | Mn | COMBINING COMMA ABOVE | | not | | | | | | Possibly | U+0314 | U+0314 | Mn | COMBINING REVERSED COMMA | | not | | | | ABOVE | | Possibly | U+0315 | U+0315 | Mn | COMBINING COMMA ABOVE RIGHT | | not | | | | | | Possibly | U+0316 | U+0316 | Mn | COMBINING GRAVE ACCENT BELOW | | not | | | | | | Possibly | 
U+0317 | U+0317 | Mn | COMBINING ACUTE ACCENT BELOW | | not | | | | | | Possibly | U+0318 | U+0318 | Mn | COMBINING LEFT TACK BELOW | | not | | | | | | Possibly | U+0319 | U+0319 | Mn | COMBINING RIGHT TACK BELOW | | not | | | | | | Possibly | U+031A | U+031A | Mn | COMBINING LEFT ANGLE ABOVE | | not | | | | | | Possibly | U+031B | U+031B | Mn | COMBINING HORN | | not | | | | | | Possibly | U+031C | U+031C | Mn | COMBINING LEFT HALF RING | | not | | | | BELOW | | Possibly | U+031D | U+031D | Mn | COMBINING UP TACK BELOW | | not | | | | | | Possibly | U+031E | U+031E | Mn | COMBINING DOWN TACK BELOW | | not | | | | | | Possibly | U+031F | U+031F | Mn | COMBINING PLUS SIGN BELOW | | not | | | | | | Possibly | U+0320 | U+0320 | Mn | COMBINING MINUS SIGN BELOW | | not | | | | | | Possibly | U+0321 | U+0321 | Mn | COMBINING PALATALIZED HOOK | | not | | | | BELOW | | Possibly | U+0322 | U+0322 | Mn | COMBINING RETROFLEX HOOK | | not | | | | BELOW | | Possibly | U+0323 | U+0323 | Mn | COMBINING DOT BELOW | | not | | | | | | Possibly | U+0324 | U+0324 | Mn | COMBINING DIAERESIS BELOW | | not | | | | | | Possibly | U+0325 | U+0325 | Mn | COMBINING RING BELOW | | not | | | | | Faltstrom Expires April 26, 2007 [Page 52] Internet-Draft Unicode Codepoints October 2006 | Possibly | U+0326 | U+0326 | Mn | COMBINING COMMA BELOW | | not | | | | | | Possibly | U+0327 | U+0327 | Mn | COMBINING CEDILLA | | not | | | | | | Possibly | U+0328 | U+0328 | Mn | COMBINING OGONEK | | not | | | | | | Possibly | U+0329 | U+0329 | Mn | COMBINING VERTICAL LINE | | not | | | | BELOW | | Possibly | U+032A | U+032A | Mn | COMBINING BRIDGE BELOW | | not | | | | | | Possibly | U+032B | U+032B | Mn | COMBINING INVERTED DOUBLE | | not | | | | ARCH BELOW | | Possibly | U+032C | U+032C | Mn | COMBINING CARON BELOW | | not | | | | | | Possibly | U+032D | U+032D | Mn | COMBINING CIRCUMFLEX ACCENT | | not | | | | BELOW | | Possibly | U+032E | U+032E | Mn | COMBINING BREVE BELOW | | not | | | | | | Possibly | 
U+032F | U+032F | Mn | COMBINING INVERTED BREVE | | not | | | | BELOW | | Possibly | U+0330 | U+0330 | Mn | COMBINING TILDE BELOW | | not | | | | | | Possibly | U+0331 | U+0331 | Mn | COMBINING MACRON BELOW | | not | | | | | | Possibly | U+0332 | U+0332 | Mn | COMBINING LOW LINE | | not | | | | | | Possibly | U+0333 | U+0333 | Mn | COMBINING DOUBLE LOW LINE | | not | | | | | | Possibly | U+0334 | U+0334 | Mn | COMBINING TILDE OVERLAY | | not | | | | | | Possibly | U+0335 | U+0335 | Mn | COMBINING SHORT STROKE | | not | | | | OVERLAY | | Possibly | U+0336 | U+0336 | Mn | COMBINING LONG STROKE | | not | | | | OVERLAY | | Possibly | U+0337 | U+0337 | Mn | COMBINING SHORT SOLIDUS | | not | | | | OVERLAY | | Possibly | U+0338 | U+0338 | Mn | COMBINING LONG SOLIDUS | | not | | | | OVERLAY | | Possibly | U+0339 | U+0339 | Mn | COMBINING RIGHT HALF RING | | not | | | | BELOW | | Possibly | U+033A | U+033A | Mn | COMBINING INVERTED BRIDGE | | not | | | | BELOW | | Possibly | U+033B | U+033B | Mn | COMBINING SQUARE BELOW | | not | | | | | | Possibly | U+033C | U+033C | Mn | COMBINING SEAGULL BELOW | | not | | | | | | Possibly | U+033D | U+033D | Mn | COMBINING X ABOVE | | not | | | | | Faltstrom Expires April 26, 2007 [Page 53] Internet-Draft Unicode Codepoints October 2006 | Possibly | U+033E | U+033E | Mn | COMBINING VERTICAL TILDE | | not | | | | | | Possibly | U+033F | U+033F | Mn | COMBINING DOUBLE OVERLINE | | not | | | | | | Possibly | U+0340 | U+0300 | Mn | COMBINING GRAVE ACCENT | | not | | | | | | Possibly | U+0341 | U+0301 | Mn | COMBINING ACUTE ACCENT | | not | | | | | | Possibly | U+0342 | U+0342 | Mn | COMBINING GREEK PERISPOMENI | | not | | | | | | Possibly | U+0343 | U+0313 | Mn | COMBINING COMMA ABOVE | | not | | | | | | Possibly | U+0344 | U+0308 | Mn | COMBINING DIAERESIS | | not | | | | | | Possibly | U+0345 | U+0345 | Mn | COMBINING GREEK | | not | | | | YPOGEGRAMMENI | | Possibly | U+0346 | U+0346 | Mn | COMBINING BRIDGE ABOVE | | not | | | | | | 
Possibly | U+0347 | U+0347 | Mn | COMBINING EQUALS SIGN BELOW | | not | | | | | | Possibly | U+0348 | U+0348 | Mn | COMBINING DOUBLE VERTICAL | | not | | | | LINE BELOW | | Possibly | U+0349 | U+0349 | Mn | COMBINING LEFT ANGLE BELOW | | not | | | | | | Possibly | U+034A | U+034A | Mn | COMBINING NOT TILDE ABOVE | | not | | | | | | Possibly | U+034B | U+034B | Mn | COMBINING HOMOTHETIC ABOVE | | not | | | | | | Possibly | U+034C | U+034C | Mn | COMBINING ALMOST EQUAL TO | | not | | | | ABOVE | | Possibly | U+034D | U+034D | Mn | COMBINING LEFT RIGHT ARROW | | not | | | | BELOW | | Possibly | U+034E | U+034E | Mn | COMBINING UPWARDS ARROW | | not | | | | BELOW | | Possibly | U+034F | U+034F | Mn | COMBINING GRAPHEME JOINER | | not | | | | | | Possibly | U+0350 | U+0350 | Mn | COMBINING RIGHT ARROWHEAD | | not | | | | ABOVE | | Possibly | U+0351 | U+0351 | Mn | COMBINING LEFT HALF RING | | not | | | | ABOVE | | Possibly | U+0352 | U+0352 | Mn | COMBINING FERMATA | | not | | | | | | Possibly | U+0353 | U+0353 | Mn | COMBINING X BELOW | | not | | | | | | Possibly | U+0354 | U+0354 | Mn | COMBINING LEFT ARROWHEAD | | not | | | | BELOW | | Possibly | U+0355 | U+0355 | Mn | COMBINING RIGHT ARROWHEAD | | not | | | | BELOW | Faltstrom Expires April 26, 2007 [Page 54] Internet-Draft Unicode Codepoints October 2006 | Possibly | U+0356 | U+0356 | Mn | COMBINING RIGHT ARROWHEAD | | not | | | | AND UP ARROWHEAD BELOW | | Possibly | U+0357 | U+0357 | Mn | COMBINING RIGHT HALF RING | | not | | | | ABOVE | | Exclude | U+0358 | U+0358 | Cn | COMBINING DOT ABOVE RIGHT | | Exclude | U+0359 | U+0359 | Cn | COMBINING ASTERISK BELOW | | Exclude | U+035A | U+035A | Cn | COMBINING DOUBLE RING BELOW | | Exclude | U+035B | U+035B | Cn | COMBINING ZIGZAG ABOVE | | Exclude | U+035C | U+035C | Cn | COMBINING DOUBLE BREVE BELOW | | Possibly | U+035D | U+035D | Mn | COMBINING DOUBLE BREVE | | not | | | | | | Possibly | U+035E | U+035E | Mn | COMBINING DOUBLE MACRON | | not | | | | | | Possibly | 
U+035F | U+035F | Mn    | COMBINING DOUBLE MACRON      |
| not      |        |        |       | BELOW                        |
| Possibly | U+0360 | U+0360 | Mn    | COMBINING DOUBLE TILDE       |
| not      |        |        |       |                              |
| Possibly | U+0361 | U+0361 | Mn    | COMBINING DOUBLE INVERTED    |
| not      |        |        |       | BREVE                        |
| Possibly | U+0362 | U+0362 | Mn    | COMBINING DOUBLE RIGHTWARDS  |
| not      |        |        |       | ARROW BELOW                  |
| Possibly | U+0363 | U+0363 | Mn    | COMBINING LATIN SMALL LETTER |
| not      |        |        |       | A                            |
| Possibly | U+0364 | U+0364 | Mn    | COMBINING LATIN SMALL LETTER |
| not      |        |        |       | E                            |
| Possibly | U+0365 | U+0365 | Mn    | COMBINING LATIN SMALL LETTER |
| not      |        |        |       | I                            |
| Possibly | U+0366 | U+0366 | Mn    | COMBINING LATIN SMALL LETTER |
| not      |        |        |       | O                            |
| Possibly | U+0367 | U+0367 | Mn    | COMBINING LATIN SMALL LETTER |
| not      |        |        |       | U                            |
| Possibly | U+0368 | U+0368 | Mn    | COMBINING LATIN SMALL LETTER |
| not      |        |        |       | C                            |
| Possibly | U+0369 | U+0369 | Mn    | COMBINING LATIN SMALL LETTER |
| not      |        |        |       | D                            |
| Possibly | U+036A | U+036A | Mn    | COMBINING LATIN SMALL LETTER |
| not      |        |        |       | H                            |
| Possibly | U+036B | U+036B | Mn    | COMBINING LATIN SMALL LETTER |
| not      |        |        |       | M                            |
| Possibly | U+036C | U+036C | Mn    | COMBINING LATIN SMALL LETTER |
| not      |        |        |       | R                            |
| Possibly | U+036D | U+036D | Mn    | COMBINING LATIN SMALL LETTER |
| not      |        |        |       | T                            |
| Possibly | U+036E | U+036E | Mn    | COMBINING LATIN SMALL LETTER |
| not      |        |        |       | V                            |
| Possibly | U+036F | U+036F | Mn    | COMBINING LATIN SMALL LETTER |
| not      |        |        |       | X                            |
+----------+--------+--------+-------+------------------------------+

4.8. 0370-03FF Greek and Coptic

+----------+--------+--------+-------+------------------------------+
| Include?
| Code | NFKC | Class | Name | +----------+--------+--------+-------+------------------------------+ | Exclude | U+0370 | U+0370 | Cn | | | Exclude | U+0371 | U+0371 | Cn | | | Exclude | U+0372 | U+0372 | Cn | | | Exclude | U+0373 | U+0373 | Cn | | | Exclude | U+0374 | U+02B9 | Lm | MODIFIER LETTER PRIME | | | U+0375 | U+0375 | | GREEK LOWER NUMERAL SIGN | | Exclude | U+0376 | U+0376 | Cn | | | Exclude | U+0377 | U+0377 | Cn | | | Exclude | U+0378 | U+0378 | Cn | | | Exclude | U+0379 | U+0379 | Cn | | | Exclude | U+037A | U+0020 | Mn Zs | SPACE | | Exclude | U+037B | U+037B | Cn | | | Exclude | U+037C | U+037C | Cn | | | Exclude | U+037D | U+037D | Cn | | | Exclude | U+037E | U+003B | Po | SEMICOLON | | Exclude | U+037F | U+037F | Cn | | | Exclude | U+0380 | U+0380 | Cn | | | Exclude | U+0381 | U+0381 | Cn | | | Exclude | U+0382 | U+0382 | Cn | | | Exclude | U+0383 | U+0383 | Cn | | | Exclude | U+0384 | U+0020 | Mn Zs | SPACE | | Exclude | U+0385 | U+0020 | Mn Zs | SPACE | | Input | U+0386 | U+0386 | Lu | GREEK CAPITAL LETTER ALPHA | | | | | | WITH TONOS | | Exclude | U+0387 | U+00B7 | Po | MIDDLE DOT | | Input | U+0388 | U+0388 | Lu | GREEK CAPITAL LETTER EPSILON | | | | | | WITH TONOS | | Input | U+0389 | U+0389 | Lu | GREEK CAPITAL LETTER ETA | | | | | | WITH TONOS | | Input | U+038A | U+038A | Lu | GREEK CAPITAL LETTER IOTA | | | | | | WITH TONOS | | Exclude | U+038B | U+038B | Cn | | | Input | U+038C | U+038C | Lu | GREEK CAPITAL LETTER OMICRON | | | | | | WITH TONOS | | Exclude | U+038D | U+038D | Cn | | | Input | U+038E | U+038E | Lu | GREEK CAPITAL LETTER UPSILON | | | | | | WITH TONOS | | Input | U+038F | U+038F | Lu | GREEK CAPITAL LETTER OMEGA | | | | | | WITH TONOS | | Include | U+0390 | U+0390 | Ll | GREEK SMALL LETTER IOTA WITH | | | | | | DIALYTIKA AND TONOS | | Input | U+0391 | U+0391 | Lu | GREEK CAPITAL LETTER ALPHA | | Input | U+0392 | U+0392 | Lu | GREEK CAPITAL LETTER BETA | Faltstrom Expires April 26, 2007 [Page 56] Internet-Draft Unicode 
Codepoints October 2006 | Input | U+0393 | U+0393 | Lu | GREEK CAPITAL LETTER GAMMA | | Input | U+0394 | U+0394 | Lu | GREEK CAPITAL LETTER DELTA | | Input | U+0395 | U+0395 | Lu | GREEK CAPITAL LETTER EPSILON | | Input | U+0396 | U+0396 | Lu | GREEK CAPITAL LETTER ZETA | | Input | U+0397 | U+0397 | Lu | GREEK CAPITAL LETTER ETA | | Input | U+0398 | U+0398 | Lu | GREEK CAPITAL LETTER THETA | | Input | U+0399 | U+0399 | Lu | GREEK CAPITAL LETTER IOTA | | Input | U+039A | U+039A | Lu | GREEK CAPITAL LETTER KAPPA | | Input | U+039B | U+039B | Lu | GREEK CAPITAL LETTER LAMDA | | Input | U+039C | U+039C | Lu | GREEK CAPITAL LETTER MU | | Input | U+039D | U+039D | Lu | GREEK CAPITAL LETTER NU | | Input | U+039E | U+039E | Lu | GREEK CAPITAL LETTER XI | | Input | U+039F | U+039F | Lu | GREEK CAPITAL LETTER OMICRON | | Input | U+03A0 | U+03A0 | Lu | GREEK CAPITAL LETTER PI | | Input | U+03A1 | U+03A1 | Lu | GREEK CAPITAL LETTER RHO | | Exclude | U+03A2 | U+03A2 | Cn | | | Input | U+03A3 | U+03A3 | Lu | GREEK CAPITAL LETTER SIGMA | | Input | U+03A4 | U+03A4 | Lu | GREEK CAPITAL LETTER TAU | | Input | U+03A5 | U+03A5 | Lu | GREEK CAPITAL LETTER UPSILON | | Input | U+03A6 | U+03A6 | Lu | GREEK CAPITAL LETTER PHI | | Input | U+03A7 | U+03A7 | Lu | GREEK CAPITAL LETTER CHI | | Input | U+03A8 | U+03A8 | Lu | GREEK CAPITAL LETTER PSI | | Input | U+03A9 | U+03A9 | Lu | GREEK CAPITAL LETTER OMEGA | | Input | U+03AA | U+03AA | Lu | GREEK CAPITAL LETTER IOTA | | | | | | WITH DIALYTIKA | | Input | U+03AB | U+03AB | Lu | GREEK CAPITAL LETTER UPSILON | | | | | | WITH DIALYTIKA | | Include | U+03AC | U+03AC | Ll | GREEK SMALL LETTER ALPHA | | | | | | WITH TONOS | | Include | U+03AD | U+03AD | Ll | GREEK SMALL LETTER EPSILON | | | | | | WITH TONOS | | Include | U+03AE | U+03AE | Ll | GREEK SMALL LETTER ETA WITH | | | | | | TONOS | | Include | U+03AF | U+03AF | Ll | GREEK SMALL LETTER IOTA WITH | | | | | | TONOS | | Include | U+03B0 | U+03B0 | Ll | GREEK SMALL LETTER UPSILON | | | | | | 
WITH DIALYTIKA AND TONOS | | Include | U+03B1 | U+03B1 | Ll | GREEK SMALL LETTER ALPHA | | Include | U+03B2 | U+03B2 | Ll | GREEK SMALL LETTER BETA | | Include | U+03B3 | U+03B3 | Ll | GREEK SMALL LETTER GAMMA | | Include | U+03B4 | U+03B4 | Ll | GREEK SMALL LETTER DELTA | | Include | U+03B5 | U+03B5 | Ll | GREEK SMALL LETTER EPSILON | | Include | U+03B6 | U+03B6 | Ll | GREEK SMALL LETTER ZETA | | Include | U+03B7 | U+03B7 | Ll | GREEK SMALL LETTER ETA | | Include | U+03B8 | U+03B8 | Ll | GREEK SMALL LETTER THETA | | Include | U+03B9 | U+03B9 | Ll | GREEK SMALL LETTER IOTA | | Include | U+03BA | U+03BA | Ll | GREEK SMALL LETTER KAPPA | | Include | U+03BB | U+03BB | Ll | GREEK SMALL LETTER LAMDA | Faltstrom Expires April 26, 2007 [Page 57] Internet-Draft Unicode Codepoints October 2006 | Include | U+03BC | U+03BC | Ll | GREEK SMALL LETTER MU | | Include | U+03BD | U+03BD | Ll | GREEK SMALL LETTER NU | | Include | U+03BE | U+03BE | Ll | GREEK SMALL LETTER XI | | Include | U+03BF | U+03BF | Ll | GREEK SMALL LETTER OMICRON | | Include | U+03C0 | U+03C0 | Ll | GREEK SMALL LETTER PI | | Include | U+03C1 | U+03C1 | Ll | GREEK SMALL LETTER RHO | | Include | U+03C2 | U+03C2 | Ll | GREEK SMALL LETTER FINAL | | | | | | SIGMA | | Include | U+03C3 | U+03C3 | Ll | GREEK SMALL LETTER SIGMA | | Include | U+03C4 | U+03C4 | Ll | GREEK SMALL LETTER TAU | | Include | U+03C5 | U+03C5 | Ll | GREEK SMALL LETTER UPSILON | | Include | U+03C6 | U+03C6 | Ll | GREEK SMALL LETTER PHI | | Include | U+03C7 | U+03C7 | Ll | GREEK SMALL LETTER CHI | | Include | U+03C8 | U+03C8 | Ll | GREEK SMALL LETTER PSI | | Include | U+03C9 | U+03C9 | Ll | GREEK SMALL LETTER OMEGA | | Include | U+03CA | U+03CA | Ll | GREEK SMALL LETTER IOTA WITH | | | | | | DIALYTIKA | | Include | U+03CB | U+03CB | Ll | GREEK SMALL LETTER UPSILON | | | | | | WITH DIALYTIKA | | Include | U+03CC | U+03CC | Ll | GREEK SMALL LETTER OMICRON | | | | | | WITH TONOS | | Include | U+03CD | U+03CD | Ll | GREEK SMALL LETTER UPSILON | | | | 
| | WITH TONOS | | Include | U+03CE | U+03CE | Ll | GREEK SMALL LETTER OMEGA | | | | | | WITH TONOS | | Exclude | U+03CF | U+03CF | Cn | | | Include | U+03D0 | U+03B2 | Ll | GREEK SMALL LETTER BETA | | Include | U+03D1 | U+03B8 | Ll | GREEK SMALL LETTER THETA | | Input | U+03D2 | U+03A5 | Lu | GREEK CAPITAL LETTER UPSILON | | Input | U+03D3 | U+038E | Lu | GREEK CAPITAL LETTER UPSILON | | | | | | WITH TONOS | | Input | U+03D4 | U+03AB | Lu | GREEK CAPITAL LETTER UPSILON | | | | | | WITH DIALYTIKA | | Include | U+03D5 | U+03C6 | Ll | GREEK SMALL LETTER PHI | | Include | U+03D6 | U+03C0 | Ll | GREEK SMALL LETTER PI | | Include | U+03D7 | U+03D7 | Ll | GREEK KAI SYMBOL | | Input | U+03D8 | U+03D8 | Lu | GREEK LETTER ARCHAIC KOPPA | | Include | U+03D9 | U+03D9 | Ll | GREEK SMALL LETTER ARCHAIC | | | | | | KOPPA | | Input | U+03DA | U+03DA | Lu | GREEK LETTER STIGMA | | Include | U+03DB | U+03DB | Ll | GREEK SMALL LETTER STIGMA | | Input | U+03DC | U+03DC | Lu | GREEK LETTER DIGAMMA | | Include | U+03DD | U+03DD | Ll | GREEK SMALL LETTER DIGAMMA | | Input | U+03DE | U+03DE | Lu | GREEK LETTER KOPPA | | Include | U+03DF | U+03DF | Ll | GREEK SMALL LETTER KOPPA | | Input | U+03E0 | U+03E0 | Lu | GREEK LETTER SAMPI | | Include | U+03E1 | U+03E1 | Ll | GREEK SMALL LETTER SAMPI | | Input | U+03E2 | U+03E2 | Lu | COPTIC CAPITAL LETTER SHEI | Faltstrom Expires April 26, 2007 [Page 58] Internet-Draft Unicode Codepoints October 2006 | Include | U+03E3 | U+03E3 | Ll | COPTIC SMALL LETTER SHEI | | Input | U+03E4 | U+03E4 | Lu | COPTIC CAPITAL LETTER FEI | | Include | U+03E5 | U+03E5 | Ll | COPTIC SMALL LETTER FEI | | Input | U+03E6 | U+03E6 | Lu | COPTIC CAPITAL LETTER KHEI | | Include | U+03E7 | U+03E7 | Ll | COPTIC SMALL LETTER KHEI | | Input | U+03E8 | U+03E8 | Lu | COPTIC CAPITAL LETTER HORI | | Include | U+03E9 | U+03E9 | Ll | COPTIC SMALL LETTER HORI | | Input | U+03EA | U+03EA | Lu | COPTIC CAPITAL LETTER GANGIA | | Include | U+03EB | U+03EB | Ll | COPTIC SMALL LETTER 
GANGIA |
| Input    | U+03EC | U+03EC | Lu    | COPTIC CAPITAL LETTER SHIMA  |
| Include  | U+03ED | U+03ED | Ll    | COPTIC SMALL LETTER SHIMA    |
| Input    | U+03EE | U+03EE | Lu    | COPTIC CAPITAL LETTER DEI    |
| Include  | U+03EF | U+03EF | Ll    | COPTIC SMALL LETTER DEI      |
| Include  | U+03F0 | U+03BA | Ll    | GREEK SMALL LETTER KAPPA     |
| Include  | U+03F1 | U+03C1 | Ll    | GREEK SMALL LETTER RHO       |
| Include  | U+03F2 | U+03C2 | Ll    | GREEK SMALL LETTER FINAL     |
|          |        |        |       | SIGMA                        |
| Include  | U+03F3 | U+03F3 | Ll    | GREEK LETTER YOT             |
| Input    | U+03F4 | U+0398 | Lu    | GREEK CAPITAL LETTER THETA   |
| Include  | U+03F5 | U+03B5 | Ll    | GREEK SMALL LETTER EPSILON   |
| Exclude  | U+03F6 | U+03F6 | Sm    | GREEK REVERSED LUNATE        |
|          |        |        |       | EPSILON SYMBOL               |
| Input    | U+03F7 | U+03F7 | Lu    | GREEK CAPITAL LETTER SHO     |
| Include  | U+03F8 | U+03F8 | Ll    | GREEK SMALL LETTER SHO       |
| Input    | U+03F9 | U+03A3 | Lu    | GREEK CAPITAL LETTER SIGMA   |
| Input    | U+03FA | U+03FA | Lu    | GREEK CAPITAL LETTER SAN     |
| Include  | U+03FB | U+03FB | Ll    | GREEK SMALL LETTER SAN       |
| Exclude  | U+03FC | U+03FC | Cn    | GREEK RHO WITH STROKE SYMBOL |
| Exclude  | U+03FD | U+03FD | Cn    | GREEK CAPITAL REVERSED       |
|          |        |        |       | LUNATE SIGMA SYMBOL          |
| Exclude  | U+03FE | U+03FE | Cn    | GREEK CAPITAL DOTTED LUNATE  |
|          |        |        |       | SIGMA SYMBOL                 |
| Exclude  | U+03FF | U+03FF | Cn    | GREEK CAPITAL REVERSED       |
|          |        |        |       | DOTTED LUNATE SIGMA SYMBOL   |
+----------+--------+--------+-------+------------------------------+

4.9. 0400-04FF Cyrillic

+----------+--------+--------+-------+------------------------------+
| Include?
| Code | NFKC | Class | Name | +----------+--------+--------+-------+------------------------------+ | Input | U+0400 | U+0400 | Lu | CYRILLIC CAPITAL LETTER IE | | | | | | WITH GRAVE | | Input | U+0401 | U+0401 | Lu | CYRILLIC CAPITAL LETTER IO | | Input | U+0402 | U+0402 | Lu | CYRILLIC CAPITAL LETTER DJE | | Input | U+0403 | U+0403 | Lu | CYRILLIC CAPITAL LETTER GJE | | Input | U+0404 | U+0404 | Lu | CYRILLIC CAPITAL LETTER | | | | | | UKRAINIAN IE | Faltstrom Expires April 26, 2007 [Page 59] Internet-Draft Unicode Codepoints October 2006 | Input | U+0405 | U+0405 | Lu | CYRILLIC CAPITAL LETTER DZE | | Input | U+0406 | U+0406 | Lu | CYRILLIC CAPITAL LETTER | | | | | | BYELORUSSIAN-UKRAINIAN I | | Input | U+0407 | U+0407 | Lu | CYRILLIC CAPITAL LETTER YI | | Input | U+0408 | U+0408 | Lu | CYRILLIC CAPITAL LETTER JE | | Input | U+0409 | U+0409 | Lu | CYRILLIC CAPITAL LETTER LJE | | Input | U+040A | U+040A | Lu | CYRILLIC CAPITAL LETTER NJE | | Input | U+040B | U+040B | Lu | CYRILLIC CAPITAL LETTER TSHE | | Input | U+040C | U+040C | Lu | CYRILLIC CAPITAL LETTER KJE | | Input | U+040D | U+040D | Lu | CYRILLIC CAPITAL LETTER I | | | | | | WITH GRAVE | | Input | U+040E | U+040E | Lu | CYRILLIC CAPITAL LETTER | | | | | | SHORT U | | Input | U+040F | U+040F | Lu | CYRILLIC CAPITAL LETTER DZHE | | Input | U+0410 | U+0410 | Lu | CYRILLIC CAPITAL LETTER A | | Input | U+0411 | U+0411 | Lu | CYRILLIC CAPITAL LETTER BE | | Input | U+0412 | U+0412 | Lu | CYRILLIC CAPITAL LETTER VE | | Input | U+0413 | U+0413 | Lu | CYRILLIC CAPITAL LETTER GHE | | Input | U+0414 | U+0414 | Lu | CYRILLIC CAPITAL LETTER DE | | Input | U+0415 | U+0415 | Lu | CYRILLIC CAPITAL LETTER IE | | Input | U+0416 | U+0416 | Lu | CYRILLIC CAPITAL LETTER ZHE | | Input | U+0417 | U+0417 | Lu | CYRILLIC CAPITAL LETTER ZE | | Input | U+0418 | U+0418 | Lu | CYRILLIC CAPITAL LETTER I | | Input | U+0419 | U+0419 | Lu | CYRILLIC CAPITAL LETTER | | | | | | SHORT I | | Input | U+041A | U+041A | Lu | CYRILLIC CAPITAL 
LETTER KA | | Input | U+041B | U+041B | Lu | CYRILLIC CAPITAL LETTER EL | | Input | U+041C | U+041C | Lu | CYRILLIC CAPITAL LETTER EM | | Input | U+041D | U+041D | Lu | CYRILLIC CAPITAL LETTER EN | | Input | U+041E | U+041E | Lu | CYRILLIC CAPITAL LETTER O | | Input | U+041F | U+041F | Lu | CYRILLIC CAPITAL LETTER PE | | Input | U+0420 | U+0420 | Lu | CYRILLIC CAPITAL LETTER ER | | Input | U+0421 | U+0421 | Lu | CYRILLIC CAPITAL LETTER ES | | Input | U+0422 | U+0422 | Lu | CYRILLIC CAPITAL LETTER TE | | Input | U+0423 | U+0423 | Lu | CYRILLIC CAPITAL LETTER U | | Input | U+0424 | U+0424 | Lu | CYRILLIC CAPITAL LETTER EF | | Input | U+0425 | U+0425 | Lu | CYRILLIC CAPITAL LETTER HA | | Input | U+0426 | U+0426 | Lu | CYRILLIC CAPITAL LETTER TSE | | Input | U+0427 | U+0427 | Lu | CYRILLIC CAPITAL LETTER CHE | | Input | U+0428 | U+0428 | Lu | CYRILLIC CAPITAL LETTER SHA | | Input | U+0429 | U+0429 | Lu | CYRILLIC CAPITAL LETTER | | | | | | SHCHA | | Input | U+042A | U+042A | Lu | CYRILLIC CAPITAL LETTER HARD | | | | | | SIGN | | Input | U+042B | U+042B | Lu | CYRILLIC CAPITAL LETTER YERU | | Input | U+042C | U+042C | Lu | CYRILLIC CAPITAL LETTER SOFT | | | | | | SIGN | | Input | U+042D | U+042D | Lu | CYRILLIC CAPITAL LETTER E | Faltstrom Expires April 26, 2007 [Page 60] Internet-Draft Unicode Codepoints October 2006 | Input | U+042E | U+042E | Lu | CYRILLIC CAPITAL LETTER YU | | Input | U+042F | U+042F | Lu | CYRILLIC CAPITAL LETTER YA | | Include | U+0430 | U+0430 | Ll | CYRILLIC SMALL LETTER A | | Include | U+0431 | U+0431 | Ll | CYRILLIC SMALL LETTER BE | | Include | U+0432 | U+0432 | Ll | CYRILLIC SMALL LETTER VE | | Include | U+0433 | U+0433 | Ll | CYRILLIC SMALL LETTER GHE | | Include | U+0434 | U+0434 | Ll | CYRILLIC SMALL LETTER DE | | Include | U+0435 | U+0435 | Ll | CYRILLIC SMALL LETTER IE | | Include | U+0436 | U+0436 | Ll | CYRILLIC SMALL LETTER ZHE | | Include | U+0437 | U+0437 | Ll | CYRILLIC SMALL LETTER ZE | | Include | U+0438 | U+0438 | Ll | CYRILLIC 
SMALL LETTER I | | Include | U+0439 | U+0439 | Ll | CYRILLIC SMALL LETTER SHORT | | | | | | I | | Include | U+043A | U+043A | Ll | CYRILLIC SMALL LETTER KA | | Include | U+043B | U+043B | Ll | CYRILLIC SMALL LETTER EL | | Include | U+043C | U+043C | Ll | CYRILLIC SMALL LETTER EM | | Include | U+043D | U+043D | Ll | CYRILLIC SMALL LETTER EN | | Include | U+043E | U+043E | Ll | CYRILLIC SMALL LETTER O | | Include | U+043F | U+043F | Ll | CYRILLIC SMALL LETTER PE | | Include | U+0440 | U+0440 | Ll | CYRILLIC SMALL LETTER ER | | Include | U+0441 | U+0441 | Ll | CYRILLIC SMALL LETTER ES | | Include | U+0442 | U+0442 | Ll | CYRILLIC SMALL LETTER TE | | Include | U+0443 | U+0443 | Ll | CYRILLIC SMALL LETTER U | | Include | U+0444 | U+0444 | Ll | CYRILLIC SMALL LETTER EF | | Include | U+0445 | U+0445 | Ll | CYRILLIC SMALL LETTER HA | | Include | U+0446 | U+0446 | Ll | CYRILLIC SMALL LETTER TSE | | Include | U+0447 | U+0447 | Ll | CYRILLIC SMALL LETTER CHE | | Include | U+0448 | U+0448 | Ll | CYRILLIC SMALL LETTER SHA | | Include | U+0449 | U+0449 | Ll | CYRILLIC SMALL LETTER SHCHA | | Include | U+044A | U+044A | Ll | CYRILLIC SMALL LETTER HARD | | | | | | SIGN | | Include | U+044B | U+044B | Ll | CYRILLIC SMALL LETTER YERU | | Include | U+044C | U+044C | Ll | CYRILLIC SMALL LETTER SOFT | | | | | | SIGN | | Include | U+044D | U+044D | Ll | CYRILLIC SMALL LETTER E | | Include | U+044E | U+044E | Ll | CYRILLIC SMALL LETTER YU | | Include | U+044F | U+044F | Ll | CYRILLIC SMALL LETTER YA | | Include | U+0450 | U+0450 | Ll | CYRILLIC SMALL LETTER IE | | | | | | WITH GRAVE | | Include | U+0451 | U+0451 | Ll | CYRILLIC SMALL LETTER IO | | Include | U+0452 | U+0452 | Ll | CYRILLIC SMALL LETTER DJE | | Include | U+0453 | U+0453 | Ll | CYRILLIC SMALL LETTER GJE | | Include | U+0454 | U+0454 | Ll | CYRILLIC SMALL LETTER | | | | | | UKRAINIAN IE | | Include | U+0455 | U+0455 | Ll | CYRILLIC SMALL LETTER DZE | | Include | U+0456 | U+0456 | Ll | CYRILLIC SMALL LETTER | | | | | | 
BYELORUSSIAN-UKRAINIAN I | | Include | U+0457 | U+0457 | Ll | CYRILLIC SMALL LETTER YI | Faltstrom Expires April 26, 2007 [Page 61] Internet-Draft Unicode Codepoints October 2006 | Include | U+0458 | U+0458 | Ll | CYRILLIC SMALL LETTER JE | | Include | U+0459 | U+0459 | Ll | CYRILLIC SMALL LETTER LJE | | Include | U+045A | U+045A | Ll | CYRILLIC SMALL LETTER NJE | | Include | U+045B | U+045B | Ll | CYRILLIC SMALL LETTER TSHE | | Include | U+045C | U+045C | Ll | CYRILLIC SMALL LETTER KJE | | Include | U+045D | U+045D | Ll | CYRILLIC SMALL LETTER I WITH | | | | | | GRAVE | | Include | U+045E | U+045E | Ll | CYRILLIC SMALL LETTER SHORT | | | | | | U | | Include | U+045F | U+045F | Ll | CYRILLIC SMALL LETTER DZHE | | Input | U+0460 | U+0460 | Lu | CYRILLIC CAPITAL LETTER | | | | | | OMEGA | | Include | U+0461 | U+0461 | Ll | CYRILLIC SMALL LETTER OMEGA | | Input | U+0462 | U+0462 | Lu | CYRILLIC CAPITAL LETTER YAT | | Include | U+0463 | U+0463 | Ll | CYRILLIC SMALL LETTER YAT | | Input | U+0464 | U+0464 | Lu | CYRILLIC CAPITAL LETTER | | | | | | IOTIFIED E | | Include | U+0465 | U+0465 | Ll | CYRILLIC SMALL LETTER | | | | | | IOTIFIED E | | Input | U+0466 | U+0466 | Lu | CYRILLIC CAPITAL LETTER | | | | | | LITTLE YUS | | Include | U+0467 | U+0467 | Ll | CYRILLIC SMALL LETTER LITTLE | | | | | | YUS | | Input | U+0468 | U+0468 | Lu | CYRILLIC CAPITAL LETTER | | | | | | IOTIFIED LITTLE YUS | | Include | U+0469 | U+0469 | Ll | CYRILLIC SMALL LETTER | | | | | | IOTIFIED LITTLE YUS | | Input | U+046A | U+046A | Lu | CYRILLIC CAPITAL LETTER BIG | | | | | | YUS | | Include | U+046B | U+046B | Ll | CYRILLIC SMALL LETTER BIG | | | | | | YUS | | Input | U+046C | U+046C | Lu | CYRILLIC CAPITAL LETTER | | | | | | IOTIFIED BIG YUS | | Include | U+046D | U+046D | Ll | CYRILLIC SMALL LETTER | | | | | | IOTIFIED BIG YUS | | Input | U+046E | U+046E | Lu | CYRILLIC CAPITAL LETTER KSI | | Include | U+046F | U+046F | Ll | CYRILLIC SMALL LETTER KSI | | Input | U+0470 | U+0470 | Lu | 
CYRILLIC CAPITAL LETTER PSI | | Include | U+0471 | U+0471 | Ll | CYRILLIC SMALL LETTER PSI | | Input | U+0472 | U+0472 | Lu | CYRILLIC CAPITAL LETTER FITA | | Include | U+0473 | U+0473 | Ll | CYRILLIC SMALL LETTER FITA | | Input | U+0474 | U+0474 | Lu | CYRILLIC CAPITAL LETTER | | | | | | IZHITSA | | Include | U+0475 | U+0475 | Ll | CYRILLIC SMALL LETTER | | | | | | IZHITSA | | Input | U+0476 | U+0476 | Lu | CYRILLIC CAPITAL LETTER | | | | | | IZHITSA WITH DOUBLE GRAVE | | | | | | ACCENT | Faltstrom Expires April 26, 2007 [Page 62] Internet-Draft Unicode Codepoints October 2006 | Include | U+0477 | U+0477 | Ll | CYRILLIC SMALL LETTER | | | | | | IZHITSA WITH DOUBLE GRAVE | | | | | | ACCENT | | Input | U+0478 | U+0478 | Lu | CYRILLIC CAPITAL LETTER UK | | Include | U+0479 | U+0479 | Ll | CYRILLIC SMALL LETTER UK | | Input | U+047A | U+047A | Lu | CYRILLIC CAPITAL LETTER | | | | | | ROUND OMEGA | | Include | U+047B | U+047B | Ll | CYRILLIC SMALL LETTER ROUND | | | | | | OMEGA | | Input | U+047C | U+047C | Lu | CYRILLIC CAPITAL LETTER | | | | | | OMEGA WITH TITLO | | Include | U+047D | U+047D | Ll | CYRILLIC SMALL LETTER OMEGA | | | | | | WITH TITLO | | Input | U+047E | U+047E | Lu | CYRILLIC CAPITAL LETTER OT | | Include | U+047F | U+047F | Ll | CYRILLIC SMALL LETTER OT | | Input | U+0480 | U+0480 | Lu | CYRILLIC CAPITAL LETTER | | | | | | KOPPA | | Include | U+0481 | U+0481 | Ll | CYRILLIC SMALL LETTER KOPPA | | Exclude | U+0482 | U+0482 | So | CYRILLIC THOUSANDS SIGN | | Possibly | U+0483 | U+0483 | Mn | COMBINING CYRILLIC TITLO | | not | | | | | | Possibly | U+0484 | U+0484 | Mn | COMBINING CYRILLIC | | not | | | | PALATALIZATION | | Possibly | U+0485 | U+0485 | Mn | COMBINING CYRILLIC DASIA | | not | | | | PNEUMATA | | Possibly | U+0486 | U+0486 | Mn | COMBINING CYRILLIC PSILI | | not | | | | PNEUMATA | | Exclude | U+0487 | U+0487 | Cn | | | Possibly | U+0488 | U+0488 | Me | COMBINING CYRILLIC HUNDRED | | not | | | | THOUSANDS SIGN | | Possibly | U+0489 | U+0489 
| Me | COMBINING CYRILLIC MILLIONS | | not | | | | SIGN |
| Input | U+048A | U+048A | Lu | CYRILLIC CAPITAL LETTER SHORT I WITH TAIL |
| Include | U+048B | U+048B | Ll | CYRILLIC SMALL LETTER SHORT I WITH TAIL |
| Input | U+048C | U+048C | Lu | CYRILLIC CAPITAL LETTER SEMISOFT SIGN |
| Include | U+048D | U+048D | Ll | CYRILLIC SMALL LETTER SEMISOFT SIGN |
| Input | U+048E | U+048E | Lu | CYRILLIC CAPITAL LETTER ER WITH TICK |
| Include | U+048F | U+048F | Ll | CYRILLIC SMALL LETTER ER WITH TICK |
| Input | U+0490 | U+0490 | Lu | CYRILLIC CAPITAL LETTER GHE WITH UPTURN |
| Include | U+0491 | U+0491 | Ll | CYRILLIC SMALL LETTER GHE WITH UPTURN |
| Input | U+0492 | U+0492 | Lu | CYRILLIC CAPITAL LETTER GHE WITH STROKE |
| Include | U+0493 | U+0493 | Ll | CYRILLIC SMALL LETTER GHE WITH STROKE |
| Input | U+0494 | U+0494 | Lu | CYRILLIC CAPITAL LETTER GHE WITH MIDDLE HOOK |
| Include | U+0495 | U+0495 | Ll | CYRILLIC SMALL LETTER GHE WITH MIDDLE HOOK |
| Input | U+0496 | U+0496 | Lu | CYRILLIC CAPITAL LETTER ZHE WITH DESCENDER |
| Include | U+0497 | U+0497 | Ll | CYRILLIC SMALL LETTER ZHE WITH DESCENDER |
| Input | U+0498 | U+0498 | Lu | CYRILLIC CAPITAL LETTER ZE WITH DESCENDER |
| Include | U+0499 | U+0499 | Ll | CYRILLIC SMALL LETTER ZE WITH DESCENDER |
| Input | U+049A | U+049A | Lu | CYRILLIC CAPITAL LETTER KA WITH DESCENDER |
| Include | U+049B | U+049B | Ll | CYRILLIC SMALL LETTER KA WITH DESCENDER |
| Input | U+049C | U+049C | Lu | CYRILLIC CAPITAL LETTER KA WITH VERTICAL STROKE |
| Include | U+049D | U+049D | Ll | CYRILLIC SMALL LETTER KA WITH VERTICAL STROKE |
| Input | U+049E | U+049E | Lu | CYRILLIC CAPITAL LETTER KA | | | | | | WITH
STROKE |
| Include | U+049F | U+049F | Ll | CYRILLIC SMALL LETTER KA WITH STROKE |
| Input | U+04A0 | U+04A0 | Lu | CYRILLIC CAPITAL LETTER BASHKIR KA |
| Include | U+04A1 | U+04A1 | Ll | CYRILLIC SMALL LETTER BASHKIR KA |
| Input | U+04A2 | U+04A2 | Lu | CYRILLIC CAPITAL LETTER EN WITH DESCENDER |
| Include | U+04A3 | U+04A3 | Ll | CYRILLIC SMALL LETTER EN WITH DESCENDER |
| Input | U+04A4 | U+04A4 | Lu | CYRILLIC CAPITAL LIGATURE EN GHE |
| Include | U+04A5 | U+04A5 | Ll | CYRILLIC SMALL LIGATURE EN GHE |
| Input | U+04A6 | U+04A6 | Lu | CYRILLIC CAPITAL LETTER PE WITH MIDDLE HOOK |
| Include | U+04A7 | U+04A7 | Ll | CYRILLIC SMALL LETTER PE WITH MIDDLE HOOK |
| Input | U+04A8 | U+04A8 | Lu | CYRILLIC CAPITAL LETTER ABKHASIAN HA |
| Include | U+04A9 | U+04A9 | Ll | CYRILLIC SMALL LETTER ABKHASIAN HA |
| Input | U+04AA | U+04AA | Lu | CYRILLIC CAPITAL LETTER ES WITH DESCENDER |
| Include | U+04AB | U+04AB | Ll | CYRILLIC SMALL LETTER ES WITH DESCENDER |
| Input | U+04AC | U+04AC | Lu | CYRILLIC CAPITAL LETTER TE WITH DESCENDER |
| Include | U+04AD | U+04AD | Ll | CYRILLIC SMALL LETTER TE WITH DESCENDER |
| Input | U+04AE | U+04AE | Lu | CYRILLIC CAPITAL LETTER STRAIGHT U |
| Include | U+04AF | U+04AF | Ll | CYRILLIC SMALL LETTER STRAIGHT U |
| Input | U+04B0 | U+04B0 | Lu | CYRILLIC CAPITAL LETTER STRAIGHT U WITH STROKE |
| Include | U+04B1 | U+04B1 | Ll | CYRILLIC SMALL LETTER STRAIGHT U WITH STROKE |
| Input | U+04B2 | U+04B2 | Lu | CYRILLIC CAPITAL LETTER HA WITH DESCENDER |
| Include | U+04B3 | U+04B3 | Ll | CYRILLIC SMALL LETTER HA WITH DESCENDER |
| Input | U+04B4 | U+04B4 | Lu | CYRILLIC CAPITAL LIGATURE TE | | | | | |
TSE |
| Include | U+04B5 | U+04B5 | Ll | CYRILLIC SMALL LIGATURE TE TSE |
| Input | U+04B6 | U+04B6 | Lu | CYRILLIC CAPITAL LETTER CHE WITH DESCENDER |
| Include | U+04B7 | U+04B7 | Ll | CYRILLIC SMALL LETTER CHE WITH DESCENDER |
| Input | U+04B8 | U+04B8 | Lu | CYRILLIC CAPITAL LETTER CHE WITH VERTICAL STROKE |
| Include | U+04B9 | U+04B9 | Ll | CYRILLIC SMALL LETTER CHE WITH VERTICAL STROKE |
| Input | U+04BA | U+04BA | Lu | CYRILLIC CAPITAL LETTER SHHA |
| Include | U+04BB | U+04BB | Ll | CYRILLIC SMALL LETTER SHHA |
| Input | U+04BC | U+04BC | Lu | CYRILLIC CAPITAL LETTER ABKHASIAN CHE |
| Include | U+04BD | U+04BD | Ll | CYRILLIC SMALL LETTER ABKHASIAN CHE |
| Input | U+04BE | U+04BE | Lu | CYRILLIC CAPITAL LETTER ABKHASIAN CHE WITH DESCENDER |
| Include | U+04BF | U+04BF | Ll | CYRILLIC SMALL LETTER ABKHASIAN CHE WITH DESCENDER |
| Input | U+04C0 | U+04C0 | Lu | CYRILLIC LETTER PALOCHKA |
| Input | U+04C1 | U+04C1 | Lu | CYRILLIC CAPITAL LETTER ZHE WITH BREVE |
| Include | U+04C2 | U+04C2 | Ll | CYRILLIC SMALL LETTER ZHE WITH BREVE |
| Input | U+04C3 | U+04C3 | Lu | CYRILLIC CAPITAL LETTER KA WITH HOOK |
| Include | U+04C4 | U+04C4 | Ll | CYRILLIC SMALL LETTER KA WITH HOOK |
| Input | U+04C5 | U+04C5 | Lu | CYRILLIC CAPITAL LETTER EL WITH TAIL |
| Include | U+04C6 | U+04C6 | Ll | CYRILLIC SMALL LETTER EL WITH TAIL |
| Input | U+04C7 | U+04C7 | Lu | CYRILLIC CAPITAL LETTER EN WITH HOOK |
| Include | U+04C8 | U+04C8 | Ll | CYRILLIC SMALL LETTER EN WITH HOOK |
| Input | U+04C9 | U+04C9 | Lu | CYRILLIC CAPITAL LETTER EN WITH TAIL |
| Include | U+04CA | U+04CA | Ll | CYRILLIC SMALL LETTER EN WITH TAIL |
| Input | U+04CB | U+04CB | Lu | CYRILLIC CAPITAL
LETTER | | | | | | KHAKASSIAN CHE |
| Include | U+04CC | U+04CC | Ll | CYRILLIC SMALL LETTER KHAKASSIAN CHE |
| Input | U+04CD | U+04CD | Lu | CYRILLIC CAPITAL LETTER EM WITH TAIL |
| Include | U+04CE | U+04CE | Ll | CYRILLIC SMALL LETTER EM WITH TAIL |
| Exclude | U+04CF | U+04CF | Cn | |
| Input | U+04D0 | U+04D0 | Lu | CYRILLIC CAPITAL LETTER A WITH BREVE |
| Include | U+04D1 | U+04D1 | Ll | CYRILLIC SMALL LETTER A WITH BREVE |
| Input | U+04D2 | U+04D2 | Lu | CYRILLIC CAPITAL LETTER A WITH DIAERESIS |
| Include | U+04D3 | U+04D3 | Ll | CYRILLIC SMALL LETTER A WITH DIAERESIS |
| Input | U+04D4 | U+04D4 | Lu | CYRILLIC CAPITAL LIGATURE A IE |
| Include | U+04D5 | U+04D5 | Ll | CYRILLIC SMALL LIGATURE A IE |
| Input | U+04D6 | U+04D6 | Lu | CYRILLIC CAPITAL LETTER IE WITH BREVE |
| Include | U+04D7 | U+04D7 | Ll | CYRILLIC SMALL LETTER IE WITH BREVE |
| Input | U+04D8 | U+04D8 | Lu | CYRILLIC CAPITAL LETTER SCHWA |
| Include | U+04D9 | U+04D9 | Ll | CYRILLIC SMALL LETTER SCHWA |
| Input | U+04DA | U+04DA | Lu | CYRILLIC CAPITAL LETTER SCHWA WITH DIAERESIS |
| Include | U+04DB | U+04DB | Ll | CYRILLIC SMALL LETTER SCHWA WITH DIAERESIS |
| Input | U+04DC | U+04DC | Lu | CYRILLIC CAPITAL LETTER ZHE WITH DIAERESIS |
| Include | U+04DD | U+04DD | Ll | CYRILLIC SMALL LETTER ZHE WITH DIAERESIS |
| Input | U+04DE | U+04DE | Lu | CYRILLIC CAPITAL LETTER ZE WITH DIAERESIS |
| Include | U+04DF | U+04DF | Ll | CYRILLIC SMALL LETTER ZE WITH DIAERESIS |
| Input | U+04E0 | U+04E0 | Lu | CYRILLIC CAPITAL LETTER ABKHASIAN DZE |
| Include | U+04E1 | U+04E1 | Ll | CYRILLIC SMALL LETTER ABKHASIAN DZE |
| Input | U+04E2 | U+04E2 | Lu | CYRILLIC CAPITAL LETTER I | | | | | |
WITH MACRON |
| Include | U+04E3 | U+04E3 | Ll | CYRILLIC SMALL LETTER I WITH MACRON |
| Input | U+04E4 | U+04E4 | Lu | CYRILLIC CAPITAL LETTER I WITH DIAERESIS |
| Include | U+04E5 | U+04E5 | Ll | CYRILLIC SMALL LETTER I WITH DIAERESIS |
| Input | U+04E6 | U+04E6 | Lu | CYRILLIC CAPITAL LETTER O WITH DIAERESIS |
| Include | U+04E7 | U+04E7 | Ll | CYRILLIC SMALL LETTER O WITH DIAERESIS |
| Input | U+04E8 | U+04E8 | Lu | CYRILLIC CAPITAL LETTER BARRED O |
| Include | U+04E9 | U+04E9 | Ll | CYRILLIC SMALL LETTER BARRED O |
| Input | U+04EA | U+04EA | Lu | CYRILLIC CAPITAL LETTER BARRED O WITH DIAERESIS |
| Include | U+04EB | U+04EB | Ll | CYRILLIC SMALL LETTER BARRED O WITH DIAERESIS |
| Input | U+04EC | U+04EC | Lu | CYRILLIC CAPITAL LETTER E WITH DIAERESIS |
| Include | U+04ED | U+04ED | Ll | CYRILLIC SMALL LETTER E WITH DIAERESIS |
| Input | U+04EE | U+04EE | Lu | CYRILLIC CAPITAL LETTER U WITH MACRON |
| Include | U+04EF | U+04EF | Ll | CYRILLIC SMALL LETTER U WITH MACRON |
| Input | U+04F0 | U+04F0 | Lu | CYRILLIC CAPITAL LETTER U WITH DIAERESIS |
| Include | U+04F1 | U+04F1 | Ll | CYRILLIC SMALL LETTER U WITH DIAERESIS |
| Input | U+04F2 | U+04F2 | Lu | CYRILLIC CAPITAL LETTER U WITH DOUBLE ACUTE |
| Include | U+04F3 | U+04F3 | Ll | CYRILLIC SMALL LETTER U WITH DOUBLE ACUTE |
| Input | U+04F4 | U+04F4 | Lu | CYRILLIC CAPITAL LETTER CHE WITH DIAERESIS |
| Include | U+04F5 | U+04F5 | Ll | CYRILLIC SMALL LETTER CHE WITH DIAERESIS |
| Exclude | U+04F6 | U+04F6 | Cn | CYRILLIC CAPITAL LETTER GHE WITH DESCENDER |
| Exclude | U+04F7 | U+04F7 | Cn | CYRILLIC SMALL LETTER GHE WITH DESCENDER |
| Input | U+04F8 | U+04F8 | Lu | CYRILLIC
CAPITAL LETTER YERU | | | | | | WITH DIAERESIS |
| Include | U+04F9 | U+04F9 | Ll | CYRILLIC SMALL LETTER YERU WITH DIAERESIS |
| Exclude | U+04FA | U+04FA | Cn | |
| Exclude | U+04FB | U+04FB | Cn | |
| Exclude | U+04FC | U+04FC | Cn | |
| Exclude | U+04FD | U+04FD | Cn | |
| Exclude | U+04FE | U+04FE | Cn | |
| Exclude | U+04FF | U+04FF | Cn | |
+----------+--------+--------+-------+------------------------------+

4.10. 0530-058F Armenian

+----------+--------+--------+-------+------------------------------+
| Include? | Code | NFKC | Class | Name |
+----------+--------+--------+-------+------------------------------+
| Exclude | U+0530 | U+0530 | Cn | |
| Input | U+0531 | U+0531 | Lu | ARMENIAN CAPITAL LETTER AYB |
| Input | U+0532 | U+0532 | Lu | ARMENIAN CAPITAL LETTER BEN |
| Input | U+0533 | U+0533 | Lu | ARMENIAN CAPITAL LETTER GIM |
| Input | U+0534 | U+0534 | Lu | ARMENIAN CAPITAL LETTER DA |
| Input | U+0535 | U+0535 | Lu | ARMENIAN CAPITAL LETTER ECH |
| Input | U+0536 | U+0536 | Lu | ARMENIAN CAPITAL LETTER ZA |
| Input | U+0537 | U+0537 | Lu | ARMENIAN CAPITAL LETTER EH |
| Input | U+0538 | U+0538 | Lu | ARMENIAN CAPITAL LETTER ET |
| Input | U+0539 | U+0539 | Lu | ARMENIAN CAPITAL LETTER TO |
| Input | U+053A | U+053A | Lu | ARMENIAN CAPITAL LETTER ZHE |
| Input | U+053B | U+053B | Lu | ARMENIAN CAPITAL LETTER INI |
| Input | U+053C | U+053C | Lu | ARMENIAN CAPITAL LETTER LIWN |
| Input | U+053D | U+053D | Lu | ARMENIAN CAPITAL LETTER XEH |
| Input | U+053E | U+053E | Lu | ARMENIAN CAPITAL LETTER CA |
| Input | U+053F | U+053F | Lu | ARMENIAN CAPITAL LETTER KEN |
| Input | U+0540 | U+0540 | Lu | ARMENIAN CAPITAL LETTER HO |
| Input | U+0541 | U+0541 | Lu | ARMENIAN CAPITAL LETTER JA |
| Input | U+0542 | U+0542 | Lu | ARMENIAN CAPITAL LETTER GHAD |
| Input | U+0543 | U+0543 | Lu | ARMENIAN CAPITAL LETTER CHEH |
| Input | U+0544 | U+0544 | Lu | ARMENIAN CAPITAL LETTER MEN |
| Input | U+0545 | U+0545 | Lu | ARMENIAN CAPITAL LETTER YI |
|
Input | U+0546 | U+0546 | Lu | ARMENIAN CAPITAL LETTER NOW |
| Input | U+0547 | U+0547 | Lu | ARMENIAN CAPITAL LETTER SHA |
| Input | U+0548 | U+0548 | Lu | ARMENIAN CAPITAL LETTER VO |
| Input | U+0549 | U+0549 | Lu | ARMENIAN CAPITAL LETTER CHA |
| Input | U+054A | U+054A | Lu | ARMENIAN CAPITAL LETTER PEH |
| Input | U+054B | U+054B | Lu | ARMENIAN CAPITAL LETTER JHEH |
| Input | U+054C | U+054C | Lu | ARMENIAN CAPITAL LETTER RA |
| Input | U+054D | U+054D | Lu | ARMENIAN CAPITAL LETTER SEH |
| Input | U+054E | U+054E | Lu | ARMENIAN CAPITAL LETTER VEW |
| Input | U+054F | U+054F | Lu | ARMENIAN CAPITAL LETTER TIWN |
| Input | U+0550 | U+0550 | Lu | ARMENIAN CAPITAL LETTER REH |
| Input | U+0551 | U+0551 | Lu | ARMENIAN CAPITAL LETTER CO |
| Input | U+0552 | U+0552 | Lu | ARMENIAN CAPITAL LETTER YIWN |
| Input | U+0553 | U+0553 | Lu | ARMENIAN CAPITAL LETTER PIWR |
| Input | U+0554 | U+0554 | Lu | ARMENIAN CAPITAL LETTER KEH |
| Input | U+0555 | U+0555 | Lu | ARMENIAN CAPITAL LETTER OH |
| Input | U+0556 | U+0556 | Lu | ARMENIAN CAPITAL LETTER FEH |
| Exclude | U+0557 | U+0557 | Cn | |
| Exclude | U+0558 | U+0558 | Cn | |
| Exclude | U+0559 | U+0559 | Lm | ARMENIAN MODIFIER LETTER LEFT HALF RING |
| Exclude | U+055A | U+055A | Po | ARMENIAN APOSTROPHE |
| Exclude | U+055B | U+055B | Po | ARMENIAN EMPHASIS MARK |
| Exclude | U+055C | U+055C | Po | ARMENIAN EXCLAMATION MARK |
| Exclude | U+055D | U+055D | Po | ARMENIAN COMMA |
| Exclude | U+055E | U+055E | Po | ARMENIAN QUESTION MARK |
| Exclude | U+055F | U+055F | Po | ARMENIAN ABBREVIATION MARK |
| Exclude | U+0560 | U+0560 | Cn | |
| Include | U+0561 | U+0561 | Ll | ARMENIAN SMALL LETTER AYB |
| Include | U+0562 | U+0562 | Ll | ARMENIAN SMALL LETTER BEN |
| Include | U+0563 | U+0563 | Ll | ARMENIAN SMALL LETTER GIM |
| Include | U+0564 | U+0564 | Ll | ARMENIAN SMALL LETTER DA |
| Include | U+0565 | U+0565 | Ll
| ARMENIAN SMALL LETTER ECH |
| Include | U+0566 | U+0566 | Ll | ARMENIAN SMALL LETTER ZA |
| Include | U+0567 | U+0567 | Ll | ARMENIAN SMALL LETTER EH |
| Include | U+0568 | U+0568 | Ll | ARMENIAN SMALL LETTER ET |
| Include | U+0569 | U+0569 | Ll | ARMENIAN SMALL LETTER TO |
| Include | U+056A | U+056A | Ll | ARMENIAN SMALL LETTER ZHE |
| Include | U+056B | U+056B | Ll | ARMENIAN SMALL LETTER INI |
| Include | U+056C | U+056C | Ll | ARMENIAN SMALL LETTER LIWN |
| Include | U+056D | U+056D | Ll | ARMENIAN SMALL LETTER XEH |
| Include | U+056E | U+056E | Ll | ARMENIAN SMALL LETTER CA |
| Include | U+056F | U+056F | Ll | ARMENIAN SMALL LETTER KEN |
| Include | U+0570 | U+0570 | Ll | ARMENIAN SMALL LETTER HO |
| Include | U+0571 | U+0571 | Ll | ARMENIAN SMALL LETTER JA |
| Include | U+0572 | U+0572 | Ll | ARMENIAN SMALL LETTER GHAD |
| Include | U+0573 | U+0573 | Ll | ARMENIAN SMALL LETTER CHEH |
| Include | U+0574 | U+0574 | Ll | ARMENIAN SMALL LETTER MEN |
| Include | U+0575 | U+0575 | Ll | ARMENIAN SMALL LETTER YI |
| Include | U+0576 | U+0576 | Ll | ARMENIAN SMALL LETTER NOW |
| Include | U+0577 | U+0577 | Ll | ARMENIAN SMALL LETTER SHA |
| Include | U+0578 | U+0578 | Ll | ARMENIAN SMALL LETTER VO |
| Include | U+0579 | U+0579 | Ll | ARMENIAN SMALL LETTER CHA |
| Include | U+057A | U+057A | Ll | ARMENIAN SMALL LETTER PEH |
| Include | U+057B | U+057B | Ll | ARMENIAN SMALL LETTER JHEH |
| Include | U+057C | U+057C | Ll | ARMENIAN SMALL LETTER RA |
| Include | U+057D | U+057D | Ll | ARMENIAN SMALL LETTER SEH |
| Include | U+057E | U+057E | Ll | ARMENIAN SMALL LETTER VEW |
| Include | U+057F | U+057F | Ll | ARMENIAN SMALL LETTER TIWN |
| Include | U+0580 | U+0580 | Ll | ARMENIAN SMALL LETTER REH |
| Include | U+0581 | U+0581 | Ll | ARMENIAN SMALL LETTER CO |
| Include | U+0582 | U+0582 | Ll | ARMENIAN SMALL LETTER YIWN |
| Include | U+0583 | U+0583 | Ll | ARMENIAN SMALL LETTER
PIWR |
| Include | U+0584 | U+0584 | Ll | ARMENIAN SMALL LETTER KEH |
| Include | U+0585 | U+0585 | Ll | ARMENIAN SMALL LETTER OH |
| Include | U+0586 | U+0586 | Ll | ARMENIAN SMALL LETTER FEH |
| Include | U+0587 | U+0565 | Ll | ARMENIAN SMALL LETTER ECH |
| Exclude | U+0588 | U+0588 | Cn | |
| Exclude | U+0589 | U+0589 | Po | ARMENIAN FULL STOP |
| Exclude | U+058A | U+058A | Pd | ARMENIAN HYPHEN |
| Exclude | U+058B | U+058B | Cn | |
| Exclude | U+058C | U+058C | Cn | |
| Exclude | U+058D | U+058D | Cn | |
| Exclude | U+058E | U+058E | Cn | |
| Exclude | U+058F | U+058F | Cn | |
+----------+--------+--------+-------+------------------------------+

4.11. 0590-05FF Hebrew

+------------+--------+--------+-------+----------------------------+
| Include? | Code | NFKC | Class | Name |
+------------+--------+--------+-------+----------------------------+
| Exclude | U+0590 | U+0590 | Cn | |
| Possibly not | U+0591 | U+0591 | Mn | HEBREW ACCENT ETNAHTA |
| Possibly not | U+0592 | U+0592 | Mn | HEBREW ACCENT SEGOL |
| Possibly not | U+0593 | U+0593 | Mn | HEBREW ACCENT SHALSHELET |
| Possibly not | U+0594 | U+0594 | Mn | HEBREW ACCENT ZAQEF QATAN |
| Possibly not | U+0595 | U+0595 | Mn | HEBREW ACCENT ZAQEF GADOL |
| Possibly not | U+0596 | U+0596 | Mn | HEBREW ACCENT TIPEHA |
| Possibly not | U+0597 | U+0597 | Mn | HEBREW ACCENT REVIA |
| Possibly not | U+0598 | U+0598 | Mn | HEBREW ACCENT ZARQA |
| Possibly not | U+0599 | U+0599 | Mn | HEBREW ACCENT PASHTA |
| Possibly not | U+059A | U+059A | Mn | HEBREW ACCENT YETIV |
| Possibly not | U+059B | U+059B | Mn | HEBREW ACCENT TEVIR |
| Possibly not | U+059C | U+059C | Mn | HEBREW ACCENT GERESH |
| Possibly not | U+059D | U+059D | Mn | HEBREW ACCENT GERESH MUQDAM |
| Possibly |
U+059E | U+059E | Mn | HEBREW ACCENT GERSHAYIM | | not | | | | |
| Possibly not | U+059F | U+059F | Mn | HEBREW ACCENT QARNEY PARA |
| Possibly not | U+05A0 | U+05A0 | Mn | HEBREW ACCENT TELISHA GEDOLA |
| Possibly not | U+05A1 | U+05A1 | Mn | HEBREW ACCENT PAZER |
| Exclude | U+05A2 | U+05A2 | Cn | HEBREW ACCENT ATNAH HAFUKH |
| Possibly not | U+05A3 | U+05A3 | Mn | HEBREW ACCENT MUNAH |
| Possibly not | U+05A4 | U+05A4 | Mn | HEBREW ACCENT MAHAPAKH |
| Possibly not | U+05A5 | U+05A5 | Mn | HEBREW ACCENT MERKHA |
| Possibly not | U+05A6 | U+05A6 | Mn | HEBREW ACCENT MERKHA KEFULA |
| Possibly not | U+05A7 | U+05A7 | Mn | HEBREW ACCENT DARGA |
| Possibly not | U+05A8 | U+05A8 | Mn | HEBREW ACCENT QADMA |
| Possibly not | U+05A9 | U+05A9 | Mn | HEBREW ACCENT TELISHA QETANA |
| Possibly not | U+05AA | U+05AA | Mn | HEBREW ACCENT YERAH BEN YOMO |
| Possibly not | U+05AB | U+05AB | Mn | HEBREW ACCENT OLE |
| Possibly not | U+05AC | U+05AC | Mn | HEBREW ACCENT ILUY |
| Possibly not | U+05AD | U+05AD | Mn | HEBREW ACCENT DEHI |
| Possibly not | U+05AE | U+05AE | Mn | HEBREW ACCENT ZINOR |
| Possibly not | U+05AF | U+05AF | Mn | HEBREW MARK MASORA CIRCLE |
| Possibly not | U+05B0 | U+05B0 | Mn | HEBREW POINT SHEVA |
| Possibly not | U+05B1 | U+05B1 | Mn | HEBREW POINT HATAF SEGOL |
| Possibly not | U+05B2 | U+05B2 | Mn | HEBREW POINT HATAF PATAH |
| Possibly not | U+05B3 | U+05B3 | Mn | HEBREW POINT HATAF QAMATS |
| Possibly not | U+05B4 | U+05B4 | Mn | HEBREW POINT HIRIQ |
| Possibly not | U+05B5 | U+05B5 | Mn | HEBREW POINT TSERE |
| Possibly not | U+05B6 | U+05B6 | Mn | HEBREW POINT SEGOL |
| Possibly | U+05B7
| U+05B7 | Mn | HEBREW POINT PATAH | | not | | | | |
| Possibly not | U+05B8 | U+05B8 | Mn | HEBREW POINT QAMATS |
| Possibly not | U+05B9 | U+05B9 | Mn | HEBREW POINT HOLAM |
| Exclude | U+05BA | U+05BA | Cn | |
| Possibly not | U+05BB | U+05BB | Mn | HEBREW POINT QUBUTS |
| Possibly not | U+05BC | U+05BC | Mn | HEBREW POINT DAGESH OR MAPIQ |
| Possibly not | U+05BD | U+05BD | Mn | HEBREW POINT METEG |
| Exclude | U+05BE | U+05BE | Po | HEBREW PUNCTUATION MAQAF |
| Possibly not | U+05BF | U+05BF | Mn | HEBREW POINT RAFE |
| Exclude | U+05C0 | U+05C0 | Po | HEBREW PUNCTUATION PASEQ |
| Possibly not | U+05C1 | U+05C1 | Mn | HEBREW POINT SHIN DOT |
| Possibly not | U+05C2 | U+05C2 | Mn | HEBREW POINT SIN DOT |
| Exclude | U+05C3 | U+05C3 | Po | HEBREW PUNCTUATION SOF PASUQ |
| Possibly not | U+05C4 | U+05C4 | Mn | HEBREW MARK UPPER DOT |
| Exclude | U+05C5 | U+05C5 | Cn | HEBREW MARK LOWER DOT |
| Exclude | U+05C6 | U+05C6 | Cn | HEBREW PUNCTUATION NUN HAFUKHA |
| Exclude | U+05C7 | U+05C7 | Cn | HEBREW POINT QAMATS QATAN |
| Exclude | U+05C8 | U+05C8 | Cn | |
| Exclude | U+05C9 | U+05C9 | Cn | |
| Exclude | U+05CA | U+05CA | Cn | |
| Exclude | U+05CB | U+05CB | Cn | |
| Exclude | U+05CC | U+05CC | Cn | |
| Exclude | U+05CD | U+05CD | Cn | |
| Exclude | U+05CE | U+05CE | Cn | |
| Exclude | U+05CF | U+05CF | Cn | |
| Maybe | U+05D0 | U+05D0 | Lo | HEBREW LETTER ALEF |
| Maybe | U+05D1 | U+05D1 | Lo | HEBREW LETTER BET |
| Maybe | U+05D2 | U+05D2 | Lo | HEBREW LETTER GIMEL |
| Maybe | U+05D3 | U+05D3 | Lo | HEBREW LETTER DALET |
| Maybe | U+05D4 | U+05D4 | Lo | HEBREW LETTER HE |
| Maybe | U+05D5 | U+05D5 | Lo | HEBREW LETTER VAV |
| Maybe | U+05D6 | U+05D6 | Lo | HEBREW LETTER ZAYIN |
| Maybe | U+05D7 | U+05D7 | Lo | HEBREW LETTER HET |
| Maybe |
U+05D8 | U+05D8 | Lo | HEBREW LETTER TET |
| Maybe | U+05D9 | U+05D9 | Lo | HEBREW LETTER YOD |
| Maybe | U+05DA | U+05DA | Lo | HEBREW LETTER FINAL KAF |
| Maybe | U+05DB | U+05DB | Lo | HEBREW LETTER KAF |
| Maybe | U+05DC | U+05DC | Lo | HEBREW LETTER LAMED |
| Maybe | U+05DD | U+05DD | Lo | HEBREW LETTER FINAL MEM |
| Maybe | U+05DE | U+05DE | Lo | HEBREW LETTER MEM |
| Maybe | U+05DF | U+05DF | Lo | HEBREW LETTER FINAL NUN |
| Maybe | U+05E0 | U+05E0 | Lo | HEBREW LETTER NUN |
| Maybe | U+05E1 | U+05E1 | Lo | HEBREW LETTER SAMEKH |
| Maybe | U+05E2 | U+05E2 | Lo | HEBREW LETTER AYIN |
| Maybe | U+05E3 | U+05E3 | Lo | HEBREW LETTER FINAL PE |
| Maybe | U+05E4 | U+05E4 | Lo | HEBREW LETTER PE |
| Maybe | U+05E5 | U+05E5 | Lo | HEBREW LETTER FINAL TSADI |
| Maybe | U+05E6 | U+05E6 | Lo | HEBREW LETTER TSADI |
| Maybe | U+05E7 | U+05E7 | Lo | HEBREW LETTER QOF |
| Maybe | U+05E8 | U+05E8 | Lo | HEBREW LETTER RESH |
| Maybe | U+05E9 | U+05E9 | Lo | HEBREW LETTER SHIN |
| Maybe | U+05EA | U+05EA | Lo | HEBREW LETTER TAV |
| Exclude | U+05EB | U+05EB | Cn | |
| Exclude | U+05EC | U+05EC | Cn | |
| Exclude | U+05ED | U+05ED | Cn | |
| Exclude | U+05EE | U+05EE | Cn | |
| Exclude | U+05EF | U+05EF | Cn | |
| Maybe | U+05F0 | U+05F0 | Lo | HEBREW LIGATURE YIDDISH DOUBLE VAV |
| Maybe | U+05F1 | U+05F1 | Lo | HEBREW LIGATURE YIDDISH VAV YOD |
| Maybe | U+05F2 | U+05F2 | Lo | HEBREW LIGATURE YIDDISH DOUBLE YOD |
| Exclude | U+05F3 | U+05F3 | Po | HEBREW PUNCTUATION GERESH |
| Exclude | U+05F4 | U+05F4 | Po | HEBREW PUNCTUATION GERSHAYIM |
| Exclude | U+05F5 | U+05F5 | Cn | |
| Exclude | U+05F6 | U+05F6 | Cn | |
| Exclude | U+05F7 | U+05F7 | Cn | |
| Exclude | U+05F8 | U+05F8 | Cn | |
| Exclude | U+05F9 | U+05F9 | Cn | |
| Exclude | U+05FA | U+05FA | Cn | |
| Exclude | U+05FB | U+05FB | Cn | |
| Exclude | U+05FC | U+05FC | Cn | |
|
Exclude | U+05FD | U+05FD | Cn | |
| Exclude | U+05FE | U+05FE | Cn | |
| Exclude | U+05FF | U+05FF | Cn | |
+------------+--------+--------+-------+----------------------------+

4.12. 0600-06FF Arabic

+----------+--------+--------+-------+------------------------------+
| Include? | Code | NFKC | Class | Name |
+----------+--------+--------+-------+------------------------------+
| Exclude | U+0600 | U+0600 | Cf | ARABIC NUMBER SIGN |
| Exclude | U+0601 | U+0601 | Cf | ARABIC SIGN SANAH |
| Exclude | U+0602 | U+0602 | Cf | ARABIC FOOTNOTE MARKER |
| Exclude | U+0603 | U+0603 | Cf | ARABIC SIGN SAFHA |
| Exclude | U+0604 | U+0604 | Cn | |
| Exclude | U+0605 | U+0605 | Cn | |
| Exclude | U+0606 | U+0606 | Cn | |
| Exclude | U+0607 | U+0607 | Cn | |
| Exclude | U+0608 | U+0608 | Cn | |
| Exclude | U+0609 | U+0609 | Cn | |
| Exclude | U+060A | U+060A | Cn | |
| Exclude | U+060B | U+060B | Cn | AFGHANI SIGN |
| Exclude | U+060C | U+060C | Po | ARABIC COMMA |
| Exclude | U+060D | U+060D | Po | ARABIC DATE SEPARATOR |
| Exclude | U+060E | U+060E | So | ARABIC POETIC VERSE SIGN |
| Exclude | U+060F | U+060F | So | ARABIC SIGN MISRA |
| Possibly not | U+0610 | U+0610 | Mn | ARABIC SIGN SALLALLAHOU ALAYHE WASSALLAM |
| Possibly not | U+0611 | U+0611 | Mn | ARABIC SIGN ALAYHE ASSALLAM |
| Possibly not | U+0612 | U+0612 | Mn | ARABIC SIGN RAHMATULLAH ALAYHE |
| Possibly not | U+0613 | U+0613 | Mn | ARABIC SIGN RADI ALLAHOU ANHU |
| Possibly not | U+0614 | U+0614 | Mn | ARABIC SIGN TAKHALLUS |
| Possibly not | U+0615 | U+0615 | Mn | ARABIC SMALL HIGH TAH |
| Exclude | U+0616 | U+0616 | Cn | |
| Exclude | U+0617 | U+0617 | Cn | |
| Exclude | U+0618 | U+0618 | Cn | |
| Exclude | U+0619 | U+0619 | Cn | |
| Exclude | U+061A | U+061A | Cn | |
| Exclude | U+061B | U+061B | Po | ARABIC SEMICOLON |
| Exclude | U+061C | U+061C
| Cn | |
| Exclude | U+061D | U+061D | Cn | |
| Exclude | U+061E | U+061E | Cn | ARABIC TRIPLE DOT PUNCTUATION MARK |
| Exclude | U+061F | U+061F | Po | ARABIC QUESTION MARK |
| Exclude | U+0620 | U+0620 | Cn | |
| Maybe | U+0621 | U+0621 | Lo | ARABIC LETTER HAMZA |
| Maybe | U+0622 | U+0622 | Lo | ARABIC LETTER ALEF WITH MADDA ABOVE |
| Maybe | U+0623 | U+0623 | Lo | ARABIC LETTER ALEF WITH HAMZA ABOVE |
| Maybe | U+0624 | U+0624 | Lo | ARABIC LETTER WAW WITH HAMZA ABOVE |
| Maybe | U+0625 | U+0625 | Lo | ARABIC LETTER ALEF WITH HAMZA BELOW |
| Maybe | U+0626 | U+0626 | Lo | ARABIC LETTER YEH WITH HAMZA ABOVE |
| Maybe | U+0627 | U+0627 | Lo | ARABIC LETTER ALEF |
| Maybe | U+0628 | U+0628 | Lo | ARABIC LETTER BEH |
| Maybe | U+0629 | U+0629 | Lo | ARABIC LETTER TEH MARBUTA |
| Maybe | U+062A | U+062A | Lo | ARABIC LETTER TEH |
| Maybe | U+062B | U+062B | Lo | ARABIC LETTER THEH |
| Maybe | U+062C | U+062C | Lo | ARABIC LETTER JEEM |
| Maybe | U+062D | U+062D | Lo | ARABIC LETTER HAH |
| Maybe | U+062E | U+062E | Lo | ARABIC LETTER KHAH |
| Maybe | U+062F | U+062F | Lo | ARABIC LETTER DAL |
| Maybe | U+0630 | U+0630 | Lo | ARABIC LETTER THAL |
| Maybe | U+0631 | U+0631 | Lo | ARABIC LETTER REH |
| Maybe | U+0632 | U+0632 | Lo | ARABIC LETTER ZAIN |
| Maybe | U+0633 | U+0633 | Lo | ARABIC LETTER SEEN |
| Maybe | U+0634 | U+0634 | Lo | ARABIC LETTER SHEEN |
| Maybe | U+0635 | U+0635 | Lo | ARABIC LETTER SAD |
| Maybe | U+0636 | U+0636 | Lo | ARABIC LETTER DAD |
| Maybe | U+0637 | U+0637 | Lo | ARABIC LETTER TAH |
| Maybe | U+0638 | U+0638 | Lo | ARABIC LETTER ZAH |
| Maybe | U+0639 | U+0639 | Lo | ARABIC LETTER AIN |
| Maybe | U+063A | U+063A | Lo | ARABIC LETTER GHAIN |
| Exclude | U+063B | U+063B | Cn | |
| Exclude | U+063C | U+063C | Cn | |
| Exclude | U+063D | U+063D | Cn | |
| Exclude | U+063E | U+063E | Cn | |
| Exclude | U+063F | U+063F | Cn | |
| Exclude | U+0640 | U+0640 | Lm | ARABIC
TATWEEL |
| Maybe | U+0641 | U+0641 | Lo | ARABIC LETTER FEH |
| Maybe | U+0642 | U+0642 | Lo | ARABIC LETTER QAF |
| Maybe | U+0643 | U+0643 | Lo | ARABIC LETTER KAF |
| Maybe | U+0644 | U+0644 | Lo | ARABIC LETTER LAM |
| Maybe | U+0645 | U+0645 | Lo | ARABIC LETTER MEEM |
| Maybe | U+0646 | U+0646 | Lo | ARABIC LETTER NOON |
| Maybe | U+0647 | U+0647 | Lo | ARABIC LETTER HEH |
| Maybe | U+0648 | U+0648 | Lo | ARABIC LETTER WAW |
| Maybe | U+0649 | U+0649 | Lo | ARABIC LETTER ALEF MAKSURA |
| Maybe | U+064A | U+064A | Lo | ARABIC LETTER YEH |
| Possibly not | U+064B | U+064B | Mn | ARABIC FATHATAN |
| Possibly not | U+064C | U+064C | Mn | ARABIC DAMMATAN |
| Possibly not | U+064D | U+064D | Mn | ARABIC KASRATAN |
| Possibly not | U+064E | U+064E | Mn | ARABIC FATHA |
| Possibly not | U+064F | U+064F | Mn | ARABIC DAMMA |
| Possibly not | U+0650 | U+0650 | Mn | ARABIC KASRA |
| Possibly not | U+0651 | U+0651 | Mn | ARABIC SHADDA |
| Possibly not | U+0652 | U+0652 | Mn | ARABIC SUKUN |
| Possibly not | U+0653 | U+0653 | Mn | ARABIC MADDAH ABOVE |
| Possibly not | U+0654 | U+0654 | Mn | ARABIC HAMZA ABOVE |
| Possibly not | U+0655 | U+0655 | Mn | ARABIC HAMZA BELOW |
| Possibly not | U+0656 | U+0656 | Mn | ARABIC SUBSCRIPT ALEF |
| Possibly not | U+0657 | U+0657 | Mn | ARABIC INVERTED DAMMA |
| Possibly not | U+0658 | U+0658 | Mn | ARABIC MARK NOON GHUNNA |
| Exclude | U+0659 | U+0659 | Cn | ARABIC ZWARAKAY |
| Exclude | U+065A | U+065A | Cn | ARABIC VOWEL SIGN SMALL V ABOVE |
| Exclude | U+065B | U+065B | Cn | ARABIC VOWEL SIGN INVERTED SMALL V ABOVE |
| Exclude | U+065C | U+065C | Cn | ARABIC VOWEL SIGN DOT BELOW |
| Exclude | U+065D | U+065D | Cn | ARABIC REVERSED DAMMA |
| Exclude |
U+065E | U+065E | Cn | ARABIC FATHA WITH TWO DOTS |
| Exclude | U+065F | U+065F | Cn | |
| Maybe | U+0660 | U+0660 | Nd | ARABIC-INDIC DIGIT ZERO |
| Maybe | U+0661 | U+0661 | Nd | ARABIC-INDIC DIGIT ONE |
| Maybe | U+0662 | U+0662 | Nd | ARABIC-INDIC DIGIT TWO |
| Maybe | U+0663 | U+0663 | Nd | ARABIC-INDIC DIGIT THREE |
| Maybe | U+0664 | U+0664 | Nd | ARABIC-INDIC DIGIT FOUR |
| Maybe | U+0665 | U+0665 | Nd | ARABIC-INDIC DIGIT FIVE |
| Maybe | U+0666 | U+0666 | Nd | ARABIC-INDIC DIGIT SIX |
| Maybe | U+0667 | U+0667 | Nd | ARABIC-INDIC DIGIT SEVEN |
| Maybe | U+0668 | U+0668 | Nd | ARABIC-INDIC DIGIT EIGHT |
| Maybe | U+0669 | U+0669 | Nd | ARABIC-INDIC DIGIT NINE |
| Exclude | U+066A | U+066A | Po | ARABIC PERCENT SIGN |
| Exclude | U+066B | U+066B | Po | ARABIC DECIMAL SEPARATOR |
| Exclude | U+066C | U+066C | Po | ARABIC THOUSANDS SEPARATOR |
| Exclude | U+066D | U+066D | Po | ARABIC FIVE POINTED STAR |
| Maybe | U+066E | U+066E | Lo | ARABIC LETTER DOTLESS BEH |
| Maybe | U+066F | U+066F | Lo | ARABIC LETTER DOTLESS QAF |
| Possibly not | U+0670 | U+0670 | Mn | ARABIC LETTER SUPERSCRIPT ALEF |
| Maybe | U+0671 | U+0671 | Lo | ARABIC LETTER ALEF WASLA |
| Maybe | U+0672 | U+0672 | Lo | ARABIC LETTER ALEF WITH WAVY HAMZA ABOVE |
| Maybe | U+0673 | U+0673 | Lo | ARABIC LETTER ALEF WITH WAVY HAMZA BELOW |
| Maybe | U+0674 | U+0674 | Lo | ARABIC LETTER HIGH HAMZA |
| Maybe | U+0675 | U+0627 | Lo | ARABIC LETTER ALEF |
| Maybe | U+0676 | U+0648 | Lo | ARABIC LETTER WAW |
| Maybe | U+0677 | U+06C7 | Lo | ARABIC LETTER U |
| Maybe | U+0678 | U+064A | Lo | ARABIC LETTER YEH |
| Maybe | U+0679 | U+0679 | Lo | ARABIC LETTER TTEH |
| Maybe | U+067A | U+067A | Lo | ARABIC LETTER TTEHEH |
| Maybe | U+067B | U+067B | Lo | ARABIC LETTER BEEH |
| Maybe | U+067C | U+067C | Lo | ARABIC LETTER TEH WITH RING |
| Maybe | U+067D | U+067D | Lo | ARABIC
LETTER TEH WITH THREE | | | | | | DOTS ABOVE DOWNWARDS |
| Maybe | U+067E | U+067E | Lo | ARABIC LETTER PEH |
| Maybe | U+067F | U+067F | Lo | ARABIC LETTER TEHEH |
| Maybe | U+0680 | U+0680 | Lo | ARABIC LETTER BEHEH |
| Maybe | U+0681 | U+0681 | Lo | ARABIC LETTER HAH WITH HAMZA ABOVE |
| Maybe | U+0682 | U+0682 | Lo | ARABIC LETTER HAH WITH TWO DOTS VERTICAL ABOVE |
| Maybe | U+0683 | U+0683 | Lo | ARABIC LETTER NYEH |
| Maybe | U+0684 | U+0684 | Lo | ARABIC LETTER DYEH |
| Maybe | U+0685 | U+0685 | Lo | ARABIC LETTER HAH WITH THREE DOTS ABOVE |
| Maybe | U+0686 | U+0686 | Lo | ARABIC LETTER TCHEH |
| Maybe | U+0687 | U+0687 | Lo | ARABIC LETTER TCHEHEH |
| Maybe | U+0688 | U+0688 | Lo | ARABIC LETTER DDAL |
| Maybe | U+0689 | U+0689 | Lo | ARABIC LETTER DAL WITH RING |
| Maybe | U+068A | U+068A | Lo | ARABIC LETTER DAL WITH DOT BELOW |
| Maybe | U+068B | U+068B | Lo | ARABIC LETTER DAL WITH DOT BELOW AND SMALL TAH |
| Maybe | U+068C | U+068C | Lo | ARABIC LETTER DAHAL |
| Maybe | U+068D | U+068D | Lo | ARABIC LETTER DDAHAL |
| Maybe | U+068E | U+068E | Lo | ARABIC LETTER DUL |
| Maybe | U+068F | U+068F | Lo | ARABIC LETTER DAL WITH THREE DOTS ABOVE DOWNWARDS |
| Maybe | U+0690 | U+0690 | Lo | ARABIC LETTER DAL WITH FOUR DOTS ABOVE |
| Maybe | U+0691 | U+0691 | Lo | ARABIC LETTER RREH |
| Maybe | U+0692 | U+0692 | Lo | ARABIC LETTER REH WITH SMALL V |
| Maybe | U+0693 | U+0693 | Lo | ARABIC LETTER REH WITH RING |
| Maybe | U+0694 | U+0694 | Lo | ARABIC LETTER REH WITH DOT BELOW |
| Maybe | U+0695 | U+0695 | Lo | ARABIC LETTER REH WITH SMALL V BELOW |
| Maybe | U+0696 | U+0696 | Lo | ARABIC LETTER REH WITH DOT BELOW AND DOT ABOVE |
| Maybe | U+0697 | U+0697 | Lo | ARABIC LETTER REH WITH TWO DOTS ABOVE |
| Maybe | U+0698 | U+0698 | Lo
| ARABIC LETTER JEH | | Maybe | U+0699 | U+0699 | Lo | ARABIC LETTER REH WITH FOUR | | | | | | DOTS ABOVE | | Maybe | U+069A | U+069A | Lo | ARABIC LETTER SEEN WITH DOT | | | | | | BELOW AND DOT ABOVE | | Maybe | U+069B | U+069B | Lo | ARABIC LETTER SEEN WITH | | | | | | THREE DOTS BELOW | | Maybe | U+069C | U+069C | Lo | ARABIC LETTER SEEN WITH | | | | | | THREE DOTS BELOW AND THREE | | | | | | DOTS ABOVE | | Maybe | U+069D | U+069D | Lo | ARABIC LETTER SAD WITH TWO | | | | | | DOTS BELOW | | Maybe | U+069E | U+069E | Lo | ARABIC LETTER SAD WITH THREE | | | | | | DOTS ABOVE | | Maybe | U+069F | U+069F | Lo | ARABIC LETTER TAH WITH THREE | | | | | | DOTS ABOVE | | Maybe | U+06A0 | U+06A0 | Lo | ARABIC LETTER AIN WITH THREE | | | | | | DOTS ABOVE | | Maybe | U+06A1 | U+06A1 | Lo | ARABIC LETTER DOTLESS FEH | | Maybe | U+06A2 | U+06A2 | Lo | ARABIC LETTER FEH WITH DOT | | | | | | MOVED BELOW | | Maybe | U+06A3 | U+06A3 | Lo | ARABIC LETTER FEH WITH DOT | | | | | | BELOW | | Maybe | U+06A4 | U+06A4 | Lo | ARABIC LETTER VEH | | Maybe | U+06A5 | U+06A5 | Lo | ARABIC LETTER FEH WITH THREE | | | | | | DOTS BELOW | | Maybe | U+06A6 | U+06A6 | Lo | ARABIC LETTER PEHEH | | Maybe | U+06A7 | U+06A7 | Lo | ARABIC LETTER QAF WITH DOT | | | | | | ABOVE | Faltstrom Expires April 26, 2007 [Page 78] Internet-Draft Unicode Codepoints October 2006 | Maybe | U+06A8 | U+06A8 | Lo | ARABIC LETTER QAF WITH THREE | | | | | | DOTS ABOVE | | Maybe | U+06A9 | U+06A9 | Lo | ARABIC LETTER KEHEH | | Maybe | U+06AA | U+06AA | Lo | ARABIC LETTER SWASH KAF | | Maybe | U+06AB | U+06AB | Lo | ARABIC LETTER KAF WITH RING | | Maybe | U+06AC | U+06AC | Lo | ARABIC LETTER KAF WITH DOT | | | | | | ABOVE | | Maybe | U+06AD | U+06AD | Lo | ARABIC LETTER NG | | Maybe | U+06AE | U+06AE | Lo | ARABIC LETTER KAF WITH THREE | | | | | | DOTS BELOW | | Maybe | U+06AF | U+06AF | Lo | ARABIC LETTER GAF | | Maybe | U+06B0 | U+06B0 | Lo | ARABIC LETTER GAF WITH RING | | Maybe | U+06B1 | U+06B1 | Lo | ARABIC LETTER 
NGOEH | | Maybe | U+06B2 | U+06B2 | Lo | ARABIC LETTER GAF WITH TWO | | | | | | DOTS BELOW | | Maybe | U+06B3 | U+06B3 | Lo | ARABIC LETTER GUEH | | Maybe | U+06B4 | U+06B4 | Lo | ARABIC LETTER GAF WITH THREE | | | | | | DOTS ABOVE | | Maybe | U+06B5 | U+06B5 | Lo | ARABIC LETTER LAM WITH SMALL | | | | | | V | | Maybe | U+06B6 | U+06B6 | Lo | ARABIC LETTER LAM WITH DOT | | | | | | ABOVE | | Maybe | U+06B7 | U+06B7 | Lo | ARABIC LETTER LAM WITH THREE | | | | | | DOTS ABOVE | | Maybe | U+06B8 | U+06B8 | Lo | ARABIC LETTER LAM WITH THREE | | | | | | DOTS BELOW | | Maybe | U+06B9 | U+06B9 | Lo | ARABIC LETTER NOON WITH DOT | | | | | | BELOW | | Maybe | U+06BA | U+06BA | Lo | ARABIC LETTER NOON GHUNNA | | Maybe | U+06BB | U+06BB | Lo | ARABIC LETTER RNOON | | Maybe | U+06BC | U+06BC | Lo | ARABIC LETTER NOON WITH RING | | Maybe | U+06BD | U+06BD | Lo | ARABIC LETTER NOON WITH | | | | | | THREE DOTS ABOVE | | Maybe | U+06BE | U+06BE | Lo | ARABIC LETTER HEH | | | | | | DOACHASHMEE | | Maybe | U+06BF | U+06BF | Lo | ARABIC LETTER TCHEH WITH DOT | | | | | | ABOVE | | Maybe | U+06C0 | U+06C0 | Lo | ARABIC LETTER HEH WITH YEH | | | | | | ABOVE | | Maybe | U+06C1 | U+06C1 | Lo | ARABIC LETTER HEH GOAL | | Maybe | U+06C2 | U+06C2 | Lo | ARABIC LETTER HEH GOAL WITH | | | | | | HAMZA ABOVE | | Maybe | U+06C3 | U+06C3 | Lo | ARABIC LETTER TEH MARBUTA | | | | | | GOAL | | Maybe | U+06C4 | U+06C4 | Lo | ARABIC LETTER WAW WITH RING | | Maybe | U+06C5 | U+06C5 | Lo | ARABIC LETTER KIRGHIZ OE | | Maybe | U+06C6 | U+06C6 | Lo | ARABIC LETTER OE | | Maybe | U+06C7 | U+06C7 | Lo | ARABIC LETTER U | Faltstrom Expires April 26, 2007 [Page 79] Internet-Draft Unicode Codepoints October 2006 | Maybe | U+06C8 | U+06C8 | Lo | ARABIC LETTER YU | | Maybe | U+06C9 | U+06C9 | Lo | ARABIC LETTER KIRGHIZ YU | | Maybe | U+06CA | U+06CA | Lo | ARABIC LETTER WAW WITH TWO | | | | | | DOTS ABOVE | | Maybe | U+06CB | U+06CB | Lo | ARABIC LETTER VE | | Maybe | U+06CC | U+06CC | Lo | ARABIC LETTER FARSI YEH 
| | Maybe | U+06CD | U+06CD | Lo | ARABIC LETTER YEH WITH TAIL | | Maybe | U+06CE | U+06CE | Lo | ARABIC LETTER YEH WITH SMALL | | | | | | V | | Maybe | U+06CF | U+06CF | Lo | ARABIC LETTER WAW WITH DOT | | | | | | ABOVE | | Maybe | U+06D0 | U+06D0 | Lo | ARABIC LETTER E | | Maybe | U+06D1 | U+06D1 | Lo | ARABIC LETTER YEH WITH THREE | | | | | | DOTS BELOW | | Maybe | U+06D2 | U+06D2 | Lo | ARABIC LETTER YEH BARREE | | Maybe | U+06D3 | U+06D3 | Lo | ARABIC LETTER YEH BARREE | | | | | | WITH HAMZA ABOVE | | Exclude | U+06D4 | U+06D4 | Po | ARABIC FULL STOP | | Maybe | U+06D5 | U+06D5 | Lo | ARABIC LETTER AE | | Possibly | U+06D6 | U+06D6 | Mn | ARABIC SMALL HIGH LIGATURE | | not | | | | SAD WITH LAM WITH ALEF | | | | | | MAKSURA | | Possibly | U+06D7 | U+06D7 | Mn | ARABIC SMALL HIGH LIGATURE | | not | | | | QAF WITH LAM WITH ALEF | | | | | | MAKSURA | | Possibly | U+06D8 | U+06D8 | Mn | ARABIC SMALL HIGH MEEM | | not | | | | INITIAL FORM | | Possibly | U+06D9 | U+06D9 | Mn | ARABIC SMALL HIGH LAM ALEF | | not | | | | | | Possibly | U+06DA | U+06DA | Mn | ARABIC SMALL HIGH JEEM | | not | | | | | | Possibly | U+06DB | U+06DB | Mn | ARABIC SMALL HIGH THREE DOTS | | not | | | | | | Possibly | U+06DC | U+06DC | Mn | ARABIC SMALL HIGH SEEN | | not | | | | | | Exclude | U+06DD | U+06DD | Cf | ARABIC END OF AYAH | | Possibly | U+06DE | U+06DE | Me | ARABIC START OF RUB EL HIZB | | not | | | | | | Possibly | U+06DF | U+06DF | Mn | ARABIC SMALL HIGH ROUNDED | | not | | | | ZERO | | Possibly | U+06E0 | U+06E0 | Mn | ARABIC SMALL HIGH UPRIGHT | | not | | | | RECTANGULAR ZERO | | Possibly | U+06E1 | U+06E1 | Mn | ARABIC SMALL HIGH DOTLESS | | not | | | | HEAD OF KHAH | | Possibly | U+06E2 | U+06E2 | Mn | ARABIC SMALL HIGH MEEM | | not | | | | ISOLATED FORM | | Possibly | U+06E3 | U+06E3 | Mn | ARABIC SMALL LOW SEEN | | not | | | | | Faltstrom Expires April 26, 2007 [Page 80] Internet-Draft Unicode Codepoints October 2006 | Possibly | U+06E4 | U+06E4 | Mn | ARABIC SMALL HIGH 
MADDA | | not | | | | | | Exclude | U+06E5 | U+06E5 | Lm | ARABIC SMALL WAW | | Exclude | U+06E6 | U+06E6 | Lm | ARABIC SMALL YEH | | Possibly | U+06E7 | U+06E7 | Mn | ARABIC SMALL HIGH YEH | | not | | | | | | Possibly | U+06E8 | U+06E8 | Mn | ARABIC SMALL HIGH NOON | | not | | | | | | Exclude | U+06E9 | U+06E9 | So | ARABIC PLACE OF SAJDAH | | Possibly | U+06EA | U+06EA | Mn | ARABIC EMPTY CENTRE LOW STOP | | not | | | | | | Possibly | U+06EB | U+06EB | Mn | ARABIC EMPTY CENTRE HIGH | | not | | | | STOP | | Possibly | U+06EC | U+06EC | Mn | ARABIC ROUNDED HIGH STOP | | not | | | | WITH FILLED CENTRE | | Possibly | U+06ED | U+06ED | Mn | ARABIC SMALL LOW MEEM | | not | | | | | | Maybe | U+06EE | U+06EE | Lo | ARABIC LETTER DAL WITH | | | | | | INVERTED V | | Maybe | U+06EF | U+06EF | Lo | ARABIC LETTER REH WITH | | | | | | INVERTED V | | Maybe | U+06F0 | U+06F0 | Nd | EXTENDED ARABIC-INDIC DIGIT | | | | | | ZERO | | Maybe | U+06F1 | U+06F1 | Nd | EXTENDED ARABIC-INDIC DIGIT | | | | | | ONE | | Maybe | U+06F2 | U+06F2 | Nd | EXTENDED ARABIC-INDIC DIGIT | | | | | | TWO | | Maybe | U+06F3 | U+06F3 | Nd | EXTENDED ARABIC-INDIC DIGIT | | | | | | THREE | | Maybe | U+06F4 | U+06F4 | Nd | EXTENDED ARABIC-INDIC DIGIT | | | | | | FOUR | | Maybe | U+06F5 | U+06F5 | Nd | EXTENDED ARABIC-INDIC DIGIT | | | | | | FIVE | | Maybe | U+06F6 | U+06F6 | Nd | EXTENDED ARABIC-INDIC DIGIT | | | | | | SIX | | Maybe | U+06F7 | U+06F7 | Nd | EXTENDED ARABIC-INDIC DIGIT | | | | | | SEVEN | | Maybe | U+06F8 | U+06F8 | Nd | EXTENDED ARABIC-INDIC DIGIT | | | | | | EIGHT | | Maybe | U+06F9 | U+06F9 | Nd | EXTENDED ARABIC-INDIC DIGIT | | | | | | NINE | | Maybe | U+06FA | U+06FA | Lo | ARABIC LETTER SHEEN WITH DOT | | | | | | BELOW | | Maybe | U+06FB | U+06FB | Lo | ARABIC LETTER DAD WITH DOT | | | | | | BELOW | | Maybe | U+06FC | U+06FC | Lo | ARABIC LETTER GHAIN WITH DOT | | | | | | BELOW | | Exclude | U+06FD | U+06FD | So | ARABIC SIGN SINDHI AMPERSAND | Faltstrom Expires April 26, 2007 [Page 
81] Internet-Draft Unicode Codepoints October 2006 | Exclude | U+06FE | U+06FE | So | ARABIC SIGN SINDHI | | | | | | POSTPOSITION MEN | | Maybe | U+06FF | U+06FF | Lo | ARABIC LETTER HEH WITH | | | | | | INVERTED V | +----------+--------+--------+-------+------------------------------+ 4.13. 0700-074F Syriac +------------+--------+--------+-------+----------------------------+ | Include? | Code | NFKC | Class | Name | +------------+--------+--------+-------+----------------------------+ | Exclude | U+0700 | U+0700 | Po | SYRIAC END OF PARAGRAPH | | Exclude | U+0701 | U+0701 | Po | SYRIAC SUPRALINEAR FULL | | | | | | STOP | | Exclude | U+0702 | U+0702 | Po | SYRIAC SUBLINEAR FULL STOP | | Exclude | U+0703 | U+0703 | Po | SYRIAC SUPRALINEAR COLON | | Exclude | U+0704 | U+0704 | Po | SYRIAC SUBLINEAR COLON | | Exclude | U+0705 | U+0705 | Po | SYRIAC HORIZONTAL COLON | | Exclude | U+0706 | U+0706 | Po | SYRIAC COLON SKEWED LEFT | | Exclude | U+0707 | U+0707 | Po | SYRIAC COLON SKEWED RIGHT | | Exclude | U+0708 | U+0708 | Po | SYRIAC SUPRALINEAR COLON | | | | | | SKEWED LEFT | | Exclude | U+0709 | U+0709 | Po | SYRIAC SUBLINEAR COLON | | | | | | SKEWED RIGHT | | Exclude | U+070A | U+070A | Po | SYRIAC CONTRACTION | | Exclude | U+070B | U+070B | Po | SYRIAC HARKLEAN OBELUS | | Exclude | U+070C | U+070C | Po | SYRIAC HARKLEAN METOBELUS | | Exclude | U+070D | U+070D | Po | SYRIAC HARKLEAN ASTERISCUS | | Exclude | U+070E | U+070E | Cn | | | Exclude | U+070F | U+070F | Cf | SYRIAC ABBREVIATION MARK | | Maybe | U+0710 | U+0710 | Lo | SYRIAC LETTER ALAPH | | Possibly | U+0711 | U+0711 | Mn | SYRIAC LETTER SUPERSCRIPT | | not | | | | ALAPH | | Maybe | U+0712 | U+0712 | Lo | SYRIAC LETTER BETH | | Maybe | U+0713 | U+0713 | Lo | SYRIAC LETTER GAMAL | | Maybe | U+0714 | U+0714 | Lo | SYRIAC LETTER GAMAL | | | | | | GARSHUNI | | Maybe | U+0715 | U+0715 | Lo | SYRIAC LETTER DALATH | | Maybe | U+0716 | U+0716 | Lo | SYRIAC LETTER DOTLESS | | | | | | DALATH RISH | | Maybe | U+0717 | 
U+0717 | Lo | SYRIAC LETTER HE | | Maybe | U+0718 | U+0718 | Lo | SYRIAC LETTER WAW | | Maybe | U+0719 | U+0719 | Lo | SYRIAC LETTER ZAIN | | Maybe | U+071A | U+071A | Lo | SYRIAC LETTER HETH | | Maybe | U+071B | U+071B | Lo | SYRIAC LETTER TETH | | Maybe | U+071C | U+071C | Lo | SYRIAC LETTER TETH | | | | | | GARSHUNI | | Maybe | U+071D | U+071D | Lo | SYRIAC LETTER YUDH | Faltstrom Expires April 26, 2007 [Page 82] Internet-Draft Unicode Codepoints October 2006 | Maybe | U+071E | U+071E | Lo | SYRIAC LETTER YUDH HE | | Maybe | U+071F | U+071F | Lo | SYRIAC LETTER KAPH | | Maybe | U+0720 | U+0720 | Lo | SYRIAC LETTER LAMADH | | Maybe | U+0721 | U+0721 | Lo | SYRIAC LETTER MIM | | Maybe | U+0722 | U+0722 | Lo | SYRIAC LETTER NUN | | Maybe | U+0723 | U+0723 | Lo | SYRIAC LETTER SEMKATH | | Maybe | U+0724 | U+0724 | Lo | SYRIAC LETTER FINAL | | | | | | SEMKATH | | Maybe | U+0725 | U+0725 | Lo | SYRIAC LETTER E | | Maybe | U+0726 | U+0726 | Lo | SYRIAC LETTER PE | | Maybe | U+0727 | U+0727 | Lo | SYRIAC LETTER REVERSED PE | | Maybe | U+0728 | U+0728 | Lo | SYRIAC LETTER SADHE | | Maybe | U+0729 | U+0729 | Lo | SYRIAC LETTER QAPH | | Maybe | U+072A | U+072A | Lo | SYRIAC LETTER RISH | | Maybe | U+072B | U+072B | Lo | SYRIAC LETTER SHIN | | Maybe | U+072C | U+072C | Lo | SYRIAC LETTER TAW | | Maybe | U+072D | U+072D | Lo | SYRIAC LETTER PERSIAN | | | | | | BHETH | | Maybe | U+072E | U+072E | Lo | SYRIAC LETTER PERSIAN | | | | | | GHAMAL | | Maybe | U+072F | U+072F | Lo | SYRIAC LETTER PERSIAN | | | | | | DHALATH | | Possibly | U+0730 | U+0730 | Mn | SYRIAC PTHAHA ABOVE | | not | | | | | | Possibly | U+0731 | U+0731 | Mn | SYRIAC PTHAHA BELOW | | not | | | | | | Possibly | U+0732 | U+0732 | Mn | SYRIAC PTHAHA DOTTED | | not | | | | | | Possibly | U+0733 | U+0733 | Mn | SYRIAC ZQAPHA ABOVE | | not | | | | | | Possibly | U+0734 | U+0734 | Mn | SYRIAC ZQAPHA BELOW | | not | | | | | | Possibly | U+0735 | U+0735 | Mn | SYRIAC ZQAPHA DOTTED | | not | | | | | | Possibly | U+0736 
| U+0736 | Mn | SYRIAC RBASA ABOVE | | not | | | | | | Possibly | U+0737 | U+0737 | Mn | SYRIAC RBASA BELOW | | not | | | | | | Possibly | U+0738 | U+0738 | Mn | SYRIAC DOTTED ZLAMA | | not | | | | HORIZONTAL | | Possibly | U+0739 | U+0739 | Mn | SYRIAC DOTTED ZLAMA | | not | | | | ANGULAR | | Possibly | U+073A | U+073A | Mn | SYRIAC HBASA ABOVE | | not | | | | | | Possibly | U+073B | U+073B | Mn | SYRIAC HBASA BELOW | | not | | | | | | Possibly | U+073C | U+073C | Mn | SYRIAC HBASA-ESASA DOTTED | | not | | | | | Faltstrom Expires April 26, 2007 [Page 83] Internet-Draft Unicode Codepoints October 2006 | Possibly | U+073D | U+073D | Mn | SYRIAC ESASA ABOVE | | not | | | | | | Possibly | U+073E | U+073E | Mn | SYRIAC ESASA BELOW | | not | | | | | | Possibly | U+073F | U+073F | Mn | SYRIAC RWAHA | | not | | | | | | Possibly | U+0740 | U+0740 | Mn | SYRIAC FEMININE DOT | | not | | | | | | Possibly | U+0741 | U+0741 | Mn | SYRIAC QUSHSHAYA | | not | | | | | | Possibly | U+0742 | U+0742 | Mn | SYRIAC RUKKAKHA | | not | | | | | | Possibly | U+0743 | U+0743 | Mn | SYRIAC TWO VERTICAL DOTS | | not | | | | ABOVE | | Possibly | U+0744 | U+0744 | Mn | SYRIAC TWO VERTICAL DOTS | | not | | | | BELOW | | Possibly | U+0745 | U+0745 | Mn | SYRIAC THREE DOTS ABOVE | | not | | | | | | Possibly | U+0746 | U+0746 | Mn | SYRIAC THREE DOTS BELOW | | not | | | | | | Possibly | U+0747 | U+0747 | Mn | SYRIAC OBLIQUE LINE ABOVE | | not | | | | | | Possibly | U+0748 | U+0748 | Mn | SYRIAC OBLIQUE LINE BELOW | | not | | | | | | Possibly | U+0749 | U+0749 | Mn | SYRIAC MUSIC | | not | | | | | | Possibly | U+074A | U+074A | Mn | SYRIAC BARREKH | | not | | | | | | Exclude | U+074B | U+074B | Cn | | | Exclude | U+074C | U+074C | Cn | | | Maybe | U+074D | U+074D | Lo | SYRIAC LETTER SOGDIAN | | | | | | ZHAIN | | Maybe | U+074E | U+074E | Lo | SYRIAC LETTER SOGDIAN | | | | | | KHAPH | | Maybe | U+074F | U+074F | Lo | SYRIAC LETTER SOGDIAN FE | 
+------------+--------+--------+-------+----------------------------+

4.14. 0750-077F Arabic supplement

+----------+--------+--------+-------+------------------------------+
| Include? | Code | NFKC | Class | Name |
+----------+--------+--------+-------+------------------------------+
| Exclude | U+0750 | U+0750 | Cn | ARABIC LETTER BEH WITH THREE DOTS HORIZONTALLY BELOW |
| Exclude | U+0751 | U+0751 | Cn | ARABIC LETTER BEH WITH DOT BELOW AND THREE DOTS ABOVE |
| Exclude | U+0752 | U+0752 | Cn | ARABIC LETTER BEH WITH THREE DOTS POINTING UPWARDS BELOW |
| Exclude | U+0753 | U+0753 | Cn | ARABIC LETTER BEH WITH THREE DOTS POINTING UPWARDS BELOW AND TWO DOTS ABOVE |
| Exclude | U+0754 | U+0754 | Cn | ARABIC LETTER BEH WITH TWO DOTS BELOW AND DOT ABOVE |
| Exclude | U+0755 | U+0755 | Cn | ARABIC LETTER BEH WITH INVERTED SMALL V BELOW |
| Exclude | U+0756 | U+0756 | Cn | ARABIC LETTER BEH WITH SMALL V |
| Exclude | U+0757 | U+0757 | Cn | ARABIC LETTER HAH WITH TWO DOTS ABOVE |
| Exclude | U+0758 | U+0758 | Cn | ARABIC LETTER HAH WITH THREE DOTS POINTING UPWARDS BELOW |
| Exclude | U+0759 | U+0759 | Cn | ARABIC LETTER DAL WITH TWO DOTS VERTICALLY BELOW AND SMALL TAH |
| Exclude | U+075A | U+075A | Cn | ARABIC LETTER DAL WITH INVERTED SMALL V BELOW |
| Exclude | U+075B | U+075B | Cn | ARABIC LETTER REH WITH STROKE |
| Exclude | U+075C | U+075C | Cn | ARABIC LETTER SEEN WITH FOUR DOTS ABOVE |
| Exclude | U+075D | U+075D | Cn | ARABIC LETTER AIN WITH TWO DOTS ABOVE |
| Exclude | U+075E | U+075E | Cn | ARABIC LETTER AIN WITH THREE DOTS POINTING DOWNWARDS ABOVE |
| Exclude | U+075F | U+075F | Cn | ARABIC LETTER AIN WITH TWO DOTS VERTICALLY ABOVE |
| Exclude | U+0760 | U+0760 | Cn | ARABIC LETTER FEH WITH TWO DOTS BELOW |
| Exclude | U+0761 | U+0761 | Cn | ARABIC LETTER FEH WITH THREE DOTS POINTING UPWARDS BELOW |
| Exclude | U+0762 | U+0762 | Cn | ARABIC LETTER KEHEH WITH DOT ABOVE |
| Exclude | U+0763 | U+0763 | Cn | ARABIC LETTER KEHEH WITH THREE DOTS ABOVE |
| Exclude | U+0764 | U+0764 | Cn | ARABIC LETTER KEHEH WITH THREE DOTS POINTING UPWARDS BELOW |
| Exclude | U+0765 | U+0765 | Cn | ARABIC LETTER MEEM WITH DOT ABOVE |
| Exclude | U+0766 | U+0766 | Cn | ARABIC LETTER MEEM WITH DOT BELOW |
| Exclude | U+0767 | U+0767 | Cn | ARABIC LETTER NOON WITH TWO DOTS BELOW |
| Exclude | U+0768 | U+0768 | Cn | ARABIC LETTER NOON WITH SMALL TAH |
| Exclude | U+0769 | U+0769 | Cn | ARABIC LETTER NOON WITH SMALL V |
| Exclude | U+076A | U+076A | Cn | ARABIC LETTER LAM WITH BAR |
| Exclude | U+076B | U+076B | Cn | ARABIC LETTER REH WITH TWO DOTS VERTICALLY ABOVE |
| Exclude | U+076C | U+076C | Cn | ARABIC LETTER REH WITH HAMZA ABOVE |
| Exclude | U+076D | U+076D | Cn | ARABIC LETTER SEEN WITH TWO DOTS VERTICALLY ABOVE |
| Exclude | U+076E | U+076E | Cn | |
| Exclude | U+076F | U+076F | Cn | |
| Exclude | U+0770 | U+0770 | Cn | |
| Exclude | U+0771 | U+0771 | Cn | |
| Exclude | U+0772 | U+0772 | Cn | |
| Exclude | U+0773 | U+0773 | Cn | |
| Exclude | U+0774 | U+0774 | Cn | |
| Exclude | U+0775 | U+0775 | Cn | |
| Exclude | U+0776 | U+0776 | Cn | |
| Exclude | U+0777 | U+0777 | Cn | |
| Exclude | U+0778 | U+0778 | Cn | |
| Exclude | U+0779 | U+0779 | Cn | |
| Exclude | U+077A | U+077A | Cn | |
| Exclude | U+077B | U+077B | Cn | |
| Exclude | U+077C | U+077C | Cn | |
| Exclude | U+077D | U+077D | Cn | |
| Exclude | U+077E | U+077E | Cn | |
| Exclude | U+077F | U+077F | Cn | |
+----------+--------+--------+-------+------------------------------+

4.15. 0780-07BF Thaana

+--------------+--------+--------+-------+-------------------------+
| Include? | Code | NFKC | Class | Name |
+--------------+--------+--------+-------+-------------------------+
| Maybe | U+0780 | U+0780 | Lo | THAANA LETTER HAA |
| Maybe | U+0781 | U+0781 | Lo | THAANA LETTER SHAVIYANI |
| Maybe | U+0782 | U+0782 | Lo | THAANA LETTER NOONU |
| Maybe | U+0783 | U+0783 | Lo | THAANA LETTER RAA |
| Maybe | U+0784 | U+0784 | Lo | THAANA LETTER BAA |
| Maybe | U+0785 | U+0785 | Lo | THAANA LETTER LHAVIYANI |
| Maybe | U+0786 | U+0786 | Lo | THAANA LETTER KAAFU |
| Maybe | U+0787 | U+0787 | Lo | THAANA LETTER ALIFU |
| Maybe | U+0788 | U+0788 | Lo | THAANA LETTER VAAVU |
| Maybe | U+0789 | U+0789 | Lo | THAANA LETTER MEEMU |
| Maybe | U+078A | U+078A | Lo | THAANA LETTER FAAFU |
| Maybe | U+078B | U+078B | Lo | THAANA LETTER DHAALU |
| Maybe | U+078C | U+078C | Lo | THAANA LETTER THAA |
| Maybe | U+078D | U+078D | Lo | THAANA LETTER LAAMU |
| Maybe | U+078E | U+078E | Lo | THAANA LETTER GAAFU |
| Maybe | U+078F | U+078F | Lo | THAANA LETTER GNAVIYANI |
| Maybe | U+0790 | U+0790 | Lo | THAANA LETTER SEENU |
| Maybe | U+0791 | U+0791 | Lo | THAANA LETTER DAVIYANI |
| Maybe | U+0792 | U+0792 | Lo | THAANA LETTER ZAVIYANI |
| Maybe | U+0793 | U+0793 | Lo | THAANA LETTER TAVIYANI |
| Maybe | U+0794 | U+0794 | Lo | THAANA LETTER YAA |
| Maybe | U+0795 | U+0795 | Lo | THAANA LETTER PAVIYANI |
| Maybe | U+0796 | U+0796 | Lo | THAANA LETTER JAVIYANI |
| Maybe | U+0797 | U+0797 | Lo | THAANA LETTER CHAVIYANI |
| Maybe | U+0798 | U+0798 | Lo | THAANA LETTER TTAA |
| Maybe | U+0799 | U+0799 | Lo | THAANA LETTER HHAA |
| Maybe | U+079A | U+079A | Lo | THAANA LETTER KHAA |
| Maybe | U+079B | U+079B | Lo | THAANA LETTER THAALU |
| Maybe | U+079C | U+079C | Lo | THAANA LETTER ZAA |
| Maybe | U+079D | U+079D | Lo | THAANA LETTER SHEENU |
| Maybe | U+079E | U+079E | Lo | THAANA LETTER SAADHU |
| Maybe | U+079F | U+079F | Lo | THAANA LETTER DAADHU |
| Maybe | U+07A0 | U+07A0 | Lo | THAANA LETTER TO |
| Maybe | U+07A1 | U+07A1 | Lo | THAANA LETTER ZO |
| Maybe | U+07A2 | U+07A2 | Lo | THAANA LETTER AINU |
| Maybe | U+07A3 | U+07A3 | Lo | THAANA LETTER GHAINU |
| Maybe | U+07A4 | U+07A4 | Lo | THAANA LETTER QAAFU |
| Maybe | U+07A5 | U+07A5 | Lo | THAANA LETTER WAAVU |
| Possibly not | U+07A6 | U+07A6 | Mn | THAANA ABAFILI |
| Possibly not | U+07A7 | U+07A7 | Mn | THAANA AABAAFILI |
| Possibly not | U+07A8 | U+07A8 | Mn | THAANA IBIFILI |
| Possibly not | U+07A9 | U+07A9 | Mn | THAANA EEBEEFILI |
| Possibly not | U+07AA | U+07AA | Mn | THAANA UBUFILI |
| Possibly not | U+07AB | U+07AB | Mn | THAANA OOBOOFILI |
| Possibly not | U+07AC | U+07AC | Mn | THAANA EBEFILI |
| Possibly not | U+07AD | U+07AD | Mn | THAANA EYBEYFILI |
| Possibly not | U+07AE | U+07AE | Mn | THAANA OBOFILI |
| Possibly not | U+07AF | U+07AF | Mn | THAANA OABOAFILI |
| Possibly not | U+07B0 | U+07B0 | Mn | THAANA SUKUN |
| Maybe | U+07B1 | U+07B1 | Lo | THAANA LETTER NAA |
| Exclude | U+07B2 | U+07B2 | Cn | |
| Exclude | U+07B3 | U+07B3 | Cn | |
| Exclude | U+07B4 | U+07B4 | Cn | |
| Exclude | U+07B5 | U+07B5 | Cn | |
| Exclude | U+07B6 | U+07B6 | Cn | |
| Exclude | U+07B7 | U+07B7 | Cn | |
| Exclude | U+07B8 | U+07B8 | Cn | |
| Exclude | U+07B9 | U+07B9 | Cn | |
| Exclude | U+07BA | U+07BA | Cn | |
| Exclude | U+07BB | U+07BB | Cn | |
| Exclude | U+07BC | U+07BC | Cn | |
| Exclude | U+07BD | U+07BD | Cn | |
| Exclude | U+07BE | U+07BE | Cn | |
| Exclude | U+07BF | U+07BF | Cn | |
+--------------+--------+--------+-------+-------------------------+

4.16. 07C0-07FF NKo

+----------+--------+--------+-------+------+
| Include? | Code | NFKC | Class | Name |
+----------+--------+--------+-------+------+
| Exclude | U+07C0 | U+07C0 | Cn | |
| Exclude | U+07C1 | U+07C1 | Cn | |
| Exclude | U+07C2 | U+07C2 | Cn | |
| Exclude | U+07C3 | U+07C3 | Cn | |
| Exclude | U+07C4 | U+07C4 | Cn | |
| Exclude | U+07C5 | U+07C5 | Cn | |
| Exclude | U+07C6 | U+07C6 | Cn | |
| Exclude | U+07C7 | U+07C7 | Cn | |
| Exclude | U+07C8 | U+07C8 | Cn | |
| Exclude | U+07C9 | U+07C9 | Cn | |
| Exclude | U+07CA | U+07CA | Cn | |
| Exclude | U+07CB | U+07CB | Cn | |
| Exclude | U+07CC | U+07CC | Cn | |
| Exclude | U+07CD | U+07CD | Cn | |
| Exclude | U+07CE | U+07CE | Cn | |
| Exclude | U+07CF | U+07CF | Cn | |
| Exclude | U+07D0 | U+07D0 | Cn | |
| Exclude | U+07D1 | U+07D1 | Cn | |
| Exclude | U+07D2 | U+07D2 | Cn | |
| Exclude | U+07D3 | U+07D3 | Cn | |
| Exclude | U+07D4 | U+07D4 | Cn | |
| Exclude | U+07D5 | U+07D5 | Cn | |
| Exclude | U+07D6 | U+07D6 | Cn | |
| Exclude | U+07D7 | U+07D7 | Cn | |
| Exclude | U+07D8 | U+07D8 | Cn | |
| Exclude | U+07D9 | U+07D9 | Cn | |
| Exclude | U+07DA | U+07DA | Cn | |
| Exclude | U+07DB | U+07DB | Cn | |
| Exclude | U+07DC | U+07DC | Cn | |
| Exclude | U+07DD | U+07DD | Cn | |
| Exclude | U+07DE | U+07DE | Cn | |
| Exclude | U+07DF | U+07DF | Cn | |
| Exclude | U+07E0 | U+07E0 | Cn | |
| Exclude | U+07E1 | U+07E1 | Cn | |
| Exclude | U+07E2 | U+07E2 | Cn | |
| Exclude | U+07E3 | U+07E3 | Cn | |
| Exclude | U+07E4 | U+07E4 | Cn | |
| Exclude | U+07E5 | U+07E5 | Cn | |
| Exclude | U+07E6 | U+07E6 | Cn | |
| Exclude | U+07E7 | U+07E7 | Cn | |
| Exclude | U+07E8 | U+07E8 | Cn | |
| Exclude | U+07E9 | U+07E9 | Cn | |
| Exclude | U+07EA | U+07EA | Cn | |
| Exclude | U+07EB | U+07EB | Cn | |
| Exclude | U+07EC | U+07EC | Cn | |
| Exclude | U+07ED | U+07ED | Cn | |
| Exclude | U+07EE | U+07EE | Cn | |
| Exclude | U+07EF | U+07EF | Cn | |
| Exclude | U+07F0 | U+07F0 | Cn | |
| Exclude | U+07F1 | U+07F1 | Cn | |
| Exclude | U+07F2 | U+07F2 | Cn | |
| Exclude | U+07F3 | U+07F3 | Cn | |
| Exclude | U+07F4 | U+07F4 | Cn | |
| Exclude | U+07F5 | U+07F5 | Cn | |
| Exclude | U+07F6 | U+07F6 | Cn | |
| Exclude | U+07F7 | U+07F7 | Cn | |
| Exclude | U+07F8 | U+07F8 | Cn | |
| Exclude | U+07F9 | U+07F9 | Cn | |
| Exclude | U+07FA | U+07FA | Cn | |
| Exclude | U+07FB | U+07FB | Cn | |
| Exclude | U+07FC | U+07FC | Cn | |
| Exclude | U+07FD | U+07FD | Cn | |
| Exclude | U+07FE | U+07FE | Cn | |
| Exclude | U+07FF | U+07FF | Cn | |
+----------+--------+--------+-------+------+

4.17. 0900-097F Devanagari

+------------+--------+--------+-------+----------------------------+
| Include? | Code | NFKC | Class | Name |
+------------+--------+--------+-------+----------------------------+
| Exclude | U+0900 | U+0900 | Cn | |
| Possibly not | U+0901 | U+0901 | Mn | DEVANAGARI SIGN CANDRABINDU |
| Possibly not | U+0902 | U+0902 | Mn | DEVANAGARI SIGN ANUSVARA |
| Maybe | U+0903 | U+0903 | Mc | DEVANAGARI SIGN VISARGA |
| Maybe | U+0904 | U+0904 | Lo | DEVANAGARI LETTER SHORT A |
| Maybe | U+0905 | U+0905 | Lo | DEVANAGARI LETTER A |
| Maybe | U+0906 | U+0906 | Lo | DEVANAGARI LETTER AA |
| Maybe | U+0907 | U+0907 | Lo | DEVANAGARI LETTER I |
| Maybe | U+0908 | U+0908 | Lo | DEVANAGARI LETTER II |
| Maybe | U+0909 | U+0909 | Lo | DEVANAGARI LETTER U |
| Maybe | U+090A | U+090A | Lo | DEVANAGARI LETTER UU |
| Maybe | U+090B | U+090B | Lo | DEVANAGARI LETTER VOCALIC R |
| Maybe | U+090C | U+090C | Lo | DEVANAGARI LETTER VOCALIC L |
| Maybe | U+090D | U+090D | Lo | DEVANAGARI LETTER CANDRA E |
| Maybe | U+090E | U+090E | Lo | DEVANAGARI LETTER SHORT E |
| Maybe | U+090F | U+090F | Lo | DEVANAGARI LETTER E |
| Maybe | U+0910 | U+0910 | Lo | DEVANAGARI LETTER AI |
| Maybe | U+0911 | U+0911 | Lo | DEVANAGARI LETTER CANDRA O |
| Maybe | U+0912 | U+0912 | Lo | DEVANAGARI LETTER SHORT O |
| Maybe | U+0913 | U+0913 | Lo | DEVANAGARI LETTER O |
| Maybe | U+0914 | U+0914 | Lo | DEVANAGARI LETTER AU |
| Maybe | U+0915 | U+0915 | Lo | DEVANAGARI LETTER KA |
| Maybe | U+0916 | U+0916 | Lo | DEVANAGARI LETTER KHA |
| Maybe | U+0917 | U+0917 | Lo | DEVANAGARI LETTER GA |
| Maybe | U+0918 | U+0918 | Lo | DEVANAGARI LETTER GHA |
| Maybe | U+0919 | U+0919 | Lo | DEVANAGARI LETTER NGA |
| Maybe | U+091A | U+091A | Lo | DEVANAGARI LETTER CA |
| Maybe | U+091B | U+091B | Lo | DEVANAGARI LETTER CHA |
| Maybe | U+091C | U+091C | Lo | DEVANAGARI LETTER JA |
| Maybe | U+091D | U+091D | Lo | DEVANAGARI LETTER JHA |
| Maybe | U+091E | U+091E | Lo | DEVANAGARI LETTER NYA |
| Maybe | U+091F | U+091F | Lo | DEVANAGARI LETTER TTA |
| Maybe | U+0920 | U+0920 | Lo | DEVANAGARI LETTER TTHA |
| Maybe | U+0921 | U+0921 | Lo | DEVANAGARI LETTER DDA |
| Maybe | U+0922 | U+0922 | Lo | DEVANAGARI LETTER DDHA |
| Maybe | U+0923 | U+0923 | Lo | DEVANAGARI LETTER NNA |
| Maybe | U+0924 | U+0924 | Lo | DEVANAGARI LETTER TA |
| Maybe | U+0925 | U+0925 | Lo | DEVANAGARI LETTER THA |
| Maybe | U+0926 | U+0926 | Lo | DEVANAGARI LETTER DA |
| Maybe | U+0927 | U+0927 | Lo | DEVANAGARI LETTER DHA |
| Maybe | U+0928 | U+0928 | Lo | DEVANAGARI LETTER NA |
| Maybe | U+0929 | U+0929 | Lo | DEVANAGARI LETTER NNNA |
| Maybe | U+092A | U+092A | Lo | DEVANAGARI LETTER PA |
| Maybe | U+092B | U+092B | Lo | DEVANAGARI LETTER PHA |
| Maybe | U+092C | U+092C | Lo | DEVANAGARI LETTER BA |
| Maybe | U+092D | U+092D | Lo | DEVANAGARI LETTER BHA |
| Maybe | U+092E | U+092E | Lo | DEVANAGARI LETTER MA |
| Maybe | U+092F | U+092F | Lo | DEVANAGARI LETTER YA |
| Maybe | U+0930 | U+0930 | Lo | DEVANAGARI LETTER RA |
| Maybe | U+0931 | U+0931 | Lo | DEVANAGARI LETTER RRA |
| Maybe | U+0932 | U+0932 | Lo | DEVANAGARI LETTER LA |
| Maybe | U+0933 | U+0933 | Lo | DEVANAGARI LETTER LLA |
| Maybe | U+0934 | U+0934 | Lo | DEVANAGARI LETTER LLLA |
| Maybe | U+0935 | U+0935 | Lo | DEVANAGARI LETTER VA |
| Maybe | U+0936 | U+0936 | Lo | DEVANAGARI LETTER SHA |
| Maybe | U+0937 | U+0937 | Lo | DEVANAGARI LETTER SSA |
| Maybe | U+0938 | U+0938 | Lo | DEVANAGARI LETTER SA |
| Maybe | U+0939 | U+0939 | Lo | DEVANAGARI LETTER HA |
| Exclude | U+093A | U+093A | Cn | |
| Exclude | U+093B | U+093B | Cn | |
| Possibly not | U+093C | U+093C | Mn | DEVANAGARI SIGN NUKTA |
| Maybe | U+093D | U+093D | Lo | DEVANAGARI SIGN AVAGRAHA |
| Maybe | U+093E | U+093E | Mc | DEVANAGARI VOWEL SIGN AA |
| Maybe | U+093F | U+093F | Mc | DEVANAGARI VOWEL SIGN I |
| Maybe | U+0940 | U+0940 | Mc | DEVANAGARI VOWEL SIGN II |
| Possibly not | U+0941 | U+0941 | Mn | DEVANAGARI VOWEL SIGN U |
| Possibly not | U+0942 | U+0942 | Mn | DEVANAGARI VOWEL SIGN UU |
| Possibly not | U+0943 | U+0943 | Mn | DEVANAGARI VOWEL SIGN VOCALIC R |
| Possibly not | U+0944 | U+0944 | Mn | DEVANAGARI VOWEL SIGN VOCALIC RR |
| Possibly not | U+0945 | U+0945 | Mn | DEVANAGARI VOWEL SIGN CANDRA E |
| Possibly not | U+0946 | U+0946 | Mn | DEVANAGARI VOWEL SIGN SHORT E |
| Possibly not | U+0947 | U+0947 | Mn | DEVANAGARI VOWEL SIGN E |
| Possibly not | U+0948 | U+0948 | Mn | DEVANAGARI VOWEL SIGN AI |
| Maybe | U+0949 | U+0949 | Mc | DEVANAGARI VOWEL SIGN CANDRA O |
| Maybe | U+094A | U+094A | Mc | DEVANAGARI VOWEL SIGN SHORT O |
| Maybe | U+094B | U+094B | Mc | DEVANAGARI VOWEL SIGN O |
| Maybe | U+094C | U+094C | Mc | DEVANAGARI VOWEL SIGN AU |
| Possibly not | U+094D | U+094D | Mn | DEVANAGARI SIGN VIRAMA |
| Exclude | U+094E | U+094E | Cn | |
| Exclude | U+094F | U+094F | Cn | |
| Maybe | U+0950 | U+0950 | Lo | DEVANAGARI OM |
| Possibly not | U+0951 | U+0951 | Mn | DEVANAGARI STRESS SIGN UDATTA |
| Possibly not | U+0952 | U+0952 | Mn | DEVANAGARI STRESS SIGN ANUDATTA |
| Possibly not | U+0953 | U+0953 | Mn | DEVANAGARI GRAVE ACCENT |
| Possibly not | U+0954 | U+0954 | Mn | DEVANAGARI ACUTE ACCENT |
| Exclude | U+0955 | U+0955 | Cn | |
| Exclude | U+0956 | U+0956 | Cn | |
| Exclude | U+0957 | U+0957 | Cn | |
| Possibly not | U+0958 | U+0915 | Lo Mn | DEVANAGARI LETTER KA |
| Possibly not | U+0959 | U+0916 | Lo Mn | DEVANAGARI LETTER KHA |
| Possibly not | U+095A | U+0917 | Lo Mn | DEVANAGARI LETTER GA |
| Possibly not | U+095B | U+091C | Lo Mn | DEVANAGARI LETTER JA |
| Possibly not | U+095C | U+0921 | Lo Mn | DEVANAGARI LETTER DDA |
| Possibly not | U+095D | U+0922 | Lo Mn | DEVANAGARI LETTER DDHA |
| Possibly not | U+095E | U+092B | Lo Mn | DEVANAGARI LETTER PHA |
| Possibly not | U+095F | U+092F | Lo Mn | DEVANAGARI LETTER YA |
| Maybe | U+0960 | U+0960 | Lo | DEVANAGARI LETTER VOCALIC RR |
| Maybe | U+0961 | U+0961 | Lo | DEVANAGARI LETTER VOCALIC LL |
| Possibly not | U+0962 | U+0962 | Mn | DEVANAGARI VOWEL SIGN VOCALIC L |
| Possibly not | U+0963 | U+0963 | Mn | DEVANAGARI VOWEL SIGN VOCALIC LL |
| Exclude | U+0964 | U+0964 | Po | DEVANAGARI DANDA |
| Exclude | U+0965 | U+0965 | Po | DEVANAGARI DOUBLE DANDA |
| Maybe | U+0966 | U+0966 | Nd | DEVANAGARI DIGIT ZERO |
| Maybe | U+0967 | U+0967 | Nd | DEVANAGARI DIGIT ONE |
| Maybe | U+0968 | U+0968 | Nd | DEVANAGARI DIGIT TWO |
| Maybe | U+0969 | U+0969 | Nd | DEVANAGARI DIGIT THREE |
| Maybe | U+096A | U+096A | Nd | DEVANAGARI DIGIT FOUR |
| Maybe | U+096B | U+096B | Nd | DEVANAGARI DIGIT FIVE |
| Maybe | U+096C | U+096C | Nd | DEVANAGARI DIGIT SIX |
| Maybe | U+096D | U+096D | Nd | DEVANAGARI DIGIT SEVEN |
| Maybe | U+096E | U+096E | Nd | DEVANAGARI DIGIT EIGHT |
| Maybe | U+096F | U+096F | Nd | DEVANAGARI DIGIT NINE |
| Exclude | U+0970 | U+0970 | Po | DEVANAGARI ABBREVIATION SIGN |
| Exclude | U+0971 | U+0971 | Cn | |
| Exclude | U+0972 | U+0972 | Cn | |
| Exclude | U+0973 | U+0973 | Cn | |
| Exclude | U+0974 | U+0974 | Cn | |
| Exclude | U+0975 | U+0975 | Cn | |
| Exclude | U+0976 | U+0976 | Cn | |
| Exclude | U+0977 | U+0977 | Cn | |
| Exclude | U+0978 | U+0978 | Cn | |
| Exclude | U+0979 | U+0979 | Cn | |
| Exclude | U+097A | U+097A | Cn | |
| Exclude | U+097B | U+097B | Cn | |
| Exclude | U+097C | U+097C | Cn | |
| Exclude | U+097D | U+097D | Cn | DEVANAGARI LETTER GLOTTAL STOP |
| Exclude | U+097E | U+097E | Cn | |
| Exclude | U+097F | U+097F | Cn | |
+------------+--------+--------+-------+----------------------------+

4.18. 0980-09FF Bengali

+----------+--------+--------+-------+------------------------------+
| Include? | Code | NFKC | Class | Name |
+----------+--------+--------+-------+------------------------------+
| Exclude | U+0980 | U+0980 | Cn | |
| Possibly not | U+0981 | U+0981 | Mn | BENGALI SIGN CANDRABINDU |
| Maybe | U+0982 | U+0982 | Mc | BENGALI SIGN ANUSVARA |
| Maybe | U+0983 | U+0983 | Mc | BENGALI SIGN VISARGA |
| Exclude | U+0984 | U+0984 | Cn | |
| Maybe | U+0985 | U+0985 | Lo | BENGALI LETTER A |
| Maybe | U+0986 | U+0986 | Lo | BENGALI LETTER AA |
| Maybe | U+0987 | U+0987 | Lo | BENGALI LETTER I |
| Maybe | U+0988 | U+0988 | Lo | BENGALI LETTER II |
| Maybe | U+0989 | U+0989 | Lo | BENGALI LETTER U |
| Maybe | U+098A | U+098A | Lo | BENGALI LETTER UU |
| Maybe | U+098B | U+098B | Lo | BENGALI LETTER VOCALIC R |
| Maybe | U+098C | U+098C | Lo | BENGALI LETTER VOCALIC L |
| Exclude | U+098D | U+098D | Cn | |
| Exclude | U+098E | U+098E | Cn | |
| Maybe | U+098F | U+098F | Lo | BENGALI LETTER E |
| Maybe | U+0990 | U+0990 | Lo | BENGALI LETTER AI |
| Exclude | U+0991 | U+0991 | Cn | |
| Exclude | U+0992 | U+0992 | Cn | |
| Maybe | U+0993 | U+0993 | Lo | BENGALI LETTER O |
| Maybe | U+0994 | U+0994 | Lo | BENGALI LETTER AU |
| Maybe | U+0995 | U+0995 | Lo | BENGALI LETTER KA |
| Maybe | U+0996 | U+0996 | Lo | BENGALI LETTER KHA |
| Maybe | U+0997 | U+0997 | Lo | BENGALI LETTER GA |
| Maybe | U+0998 | U+0998 | Lo | BENGALI LETTER GHA |
| Maybe | U+0999 | U+0999 | Lo | BENGALI LETTER NGA |
| Maybe | U+099A | U+099A | Lo | BENGALI LETTER CA |
| Maybe | U+099B | U+099B | Lo | BENGALI LETTER CHA |
| Maybe | U+099C | U+099C | Lo | BENGALI LETTER JA |
| Maybe | U+099D | U+099D | Lo | BENGALI LETTER JHA |
| Maybe | U+099E | U+099E | Lo | BENGALI LETTER NYA |
| Maybe | U+099F | U+099F | Lo | BENGALI LETTER TTA |
| Maybe | U+09A0 | U+09A0 | Lo | BENGALI LETTER TTHA |
| Maybe | U+09A1 | U+09A1 | Lo | BENGALI LETTER DDA |
| Maybe | U+09A2 | U+09A2 | Lo | BENGALI LETTER DDHA |
| Maybe | U+09A3 | U+09A3 | Lo | BENGALI LETTER NNA |
| Maybe | U+09A4 | U+09A4 | Lo | BENGALI LETTER TA |
| Maybe | U+09A5 | U+09A5 | Lo | BENGALI LETTER THA |
| Maybe | U+09A6 | U+09A6 | Lo | BENGALI LETTER DA |
| Maybe | U+09A7 | U+09A7 | Lo | BENGALI LETTER DHA |
| Maybe | U+09A8 | U+09A8 | Lo | BENGALI LETTER NA |
| Exclude | U+09A9 | U+09A9 | Cn | |
| Maybe | U+09AA | U+09AA | Lo | BENGALI LETTER PA |
| Maybe | U+09AB | U+09AB | Lo | BENGALI LETTER PHA |
| Maybe | U+09AC | U+09AC | Lo | BENGALI LETTER BA |
| Maybe | U+09AD | U+09AD | Lo | BENGALI LETTER BHA |
| Maybe | U+09AE | U+09AE | Lo | BENGALI LETTER MA |
| Maybe | U+09AF | U+09AF | Lo | BENGALI LETTER YA |
| Maybe | U+09B0 | U+09B0 | Lo | BENGALI LETTER RA |
| Exclude | U+09B1 | U+09B1 | Cn | |
| Maybe | U+09B2 | U+09B2 | Lo | BENGALI LETTER LA |
| Exclude | U+09B3 | U+09B3 | Cn | |
| Exclude | U+09B4 | U+09B4 | Cn | |
| Exclude | U+09B5 | U+09B5 | Cn | |
| Maybe | U+09B6 | U+09B6 | Lo | BENGALI LETTER SHA |
| Maybe | U+09B7 | U+09B7 | Lo | BENGALI LETTER SSA |
| Maybe | U+09B8 | U+09B8 | Lo | BENGALI LETTER SA |
| Maybe | U+09B9 | U+09B9 | Lo | BENGALI LETTER HA |
| Exclude | U+09BA | U+09BA | Cn | |
| Exclude | U+09BB | U+09BB | Cn | |
| Possibly not | U+09BC | U+09BC | Mn | BENGALI SIGN NUKTA |
| Maybe | U+09BD | U+09BD | Lo | BENGALI SIGN AVAGRAHA |
| Maybe | U+09BE | U+09BE | Mc | BENGALI VOWEL SIGN AA |
| Maybe | U+09BF | U+09BF | Mc | BENGALI VOWEL SIGN I |
| Maybe | U+09C0 | U+09C0 | Mc | BENGALI VOWEL SIGN II |
| Possibly not | U+09C1 | U+09C1 | Mn | BENGALI VOWEL SIGN U |
| Possibly not | U+09C2 | U+09C2 | Mn | BENGALI VOWEL SIGN UU |
| Possibly not | U+09C3 | U+09C3 | Mn | BENGALI VOWEL SIGN VOCALIC R |
| Possibly not | U+09C4 | U+09C4 | Mn | BENGALI VOWEL SIGN VOCALIC RR |
| Exclude | U+09C5 | U+09C5 | Cn | |
| Exclude | U+09C6 | U+09C6 | Cn | |
| Maybe | U+09C7 | U+09C7 | Mc | BENGALI VOWEL SIGN E |
| Maybe | U+09C8 | U+09C8 | Mc | BENGALI VOWEL SIGN AI |
| Exclude | U+09C9 | U+09C9 | Cn | |
| Exclude | U+09CA | U+09CA | Cn | |
| Maybe | U+09CB | U+09CB | Mc | BENGALI VOWEL SIGN O |
| Maybe | U+09CC | U+09CC | Mc | BENGALI VOWEL SIGN AU |
| Possibly not | U+09CD | U+09CD | Mn | BENGALI SIGN VIRAMA |
| Exclude | U+09CE | U+09CE | Cn | BENGALI LETTER KHANDA TA |
| Exclude | U+09CF | U+09CF | Cn | |
| Exclude | U+09D0 | U+09D0 | Cn | |
| Exclude | U+09D1 | U+09D1 | Cn | |
| Exclude | U+09D2 | U+09D2 | Cn | |
| Exclude | U+09D3 | U+09D3 | Cn | |
| Exclude | U+09D4 | U+09D4 | Cn | |
| Exclude | U+09D5 | U+09D5 | Cn | |
| Exclude | U+09D6 | U+09D6 | Cn | |
| Maybe | U+09D7 | U+09D7 | Mc | BENGALI AU LENGTH MARK |
| Exclude | U+09D8 | U+09D8 | Cn | |
| Exclude | U+09D9 | U+09D9 | Cn | |
| Exclude | U+09DA | U+09DA | Cn | |
| Exclude | U+09DB | U+09DB | Cn | |
| Possibly not | U+09DC | U+09A1 | Lo Mn | BENGALI LETTER DDA |
|
Possibly | U+09DD | U+09A2 | Lo Mn | BENGALI LETTER DDHA | | not | | | | | | Exclude | U+09DE | U+09DE | Cn | | | Possibly | U+09DF | U+09AF | Lo Mn | BENGALI LETTER YA | | not | | | | | | Maybe | U+09E0 | U+09E0 | Lo | BENGALI LETTER VOCALIC RR | | Maybe | U+09E1 | U+09E1 | Lo | BENGALI LETTER VOCALIC LL | | Possibly | U+09E2 | U+09E2 | Mn | BENGALI VOWEL SIGN VOCALIC L | | not | | | | | | Possibly | U+09E3 | U+09E3 | Mn | BENGALI VOWEL SIGN VOCALIC | | not | | | | LL | | Exclude | U+09E4 | U+09E4 | Cn | | | Exclude | U+09E5 | U+09E5 | Cn | | | Maybe | U+09E6 | U+09E6 | Nd | BENGALI DIGIT ZERO | | Maybe | U+09E7 | U+09E7 | Nd | BENGALI DIGIT ONE | | Maybe | U+09E8 | U+09E8 | Nd | BENGALI DIGIT TWO | | Maybe | U+09E9 | U+09E9 | Nd | BENGALI DIGIT THREE | | Maybe | U+09EA | U+09EA | Nd | BENGALI DIGIT FOUR | | Maybe | U+09EB | U+09EB | Nd | BENGALI DIGIT FIVE | | Maybe | U+09EC | U+09EC | Nd | BENGALI DIGIT SIX | | Maybe | U+09ED | U+09ED | Nd | BENGALI DIGIT SEVEN | | Maybe | U+09EE | U+09EE | Nd | BENGALI DIGIT EIGHT | | Maybe | U+09EF | U+09EF | Nd | BENGALI DIGIT NINE | | Maybe | U+09F0 | U+09F0 | Lo | BENGALI LETTER RA WITH | | | | | | MIDDLE DIAGONAL | | Maybe | U+09F1 | U+09F1 | Lo | BENGALI LETTER RA WITH LOWER | | | | | | DIAGONAL | | Exclude | U+09F2 | U+09F2 | Sc | BENGALI RUPEE MARK | | Exclude | U+09F3 | U+09F3 | Sc | BENGALI RUPEE SIGN | | Exclude | U+09F4 | U+09F4 | No | BENGALI CURRENCY NUMERATOR | | | | | | ONE | Faltstrom Expires April 26, 2007 [Page 95] Internet-Draft Unicode Codepoints October 2006 | Exclude | U+09F5 | U+09F5 | No | BENGALI CURRENCY NUMERATOR | | | | | | TWO | | Exclude | U+09F6 | U+09F6 | No | BENGALI CURRENCY NUMERATOR | | | | | | THREE | | Exclude | U+09F7 | U+09F7 | No | BENGALI CURRENCY NUMERATOR | | | | | | FOUR | | Exclude | U+09F8 | U+09F8 | No | BENGALI CURRENCY NUMERATOR | | | | | | ONE LESS THAN THE | | | | | | DENOMINATOR | | Exclude | U+09F9 | U+09F9 | No | BENGALI CURRENCY DENOMINATOR | | | | | | SIXTEEN | | Exclude 
| U+09FA | U+09FA | So | BENGALI ISSHAR | | Exclude | U+09FB | U+09FB | Cn | | | Exclude | U+09FC | U+09FC | Cn | | | Exclude | U+09FD | U+09FD | Cn | | | Exclude | U+09FE | U+09FE | Cn | | | Exclude | U+09FF | U+09FF | Cn | | +----------+--------+--------+-------+------------------------------+ 4.19. 0A00-0A7F Gurmukhi +--------------+--------+--------+-------+--------------------------+ | Include? | Code | NFKC | Class | Name | +--------------+--------+--------+-------+--------------------------+ | Exclude | U+0A00 | U+0A00 | Cn | | | Possibly not | U+0A01 | U+0A01 | Mn | GURMUKHI SIGN ADAK BINDI | | Possibly not | U+0A02 | U+0A02 | Mn | GURMUKHI SIGN BINDI | | Maybe | U+0A03 | U+0A03 | Mc | GURMUKHI SIGN VISARGA | | Exclude | U+0A04 | U+0A04 | Cn | | | Maybe | U+0A05 | U+0A05 | Lo | GURMUKHI LETTER A | | Maybe | U+0A06 | U+0A06 | Lo | GURMUKHI LETTER AA | | Maybe | U+0A07 | U+0A07 | Lo | GURMUKHI LETTER I | | Maybe | U+0A08 | U+0A08 | Lo | GURMUKHI LETTER II | | Maybe | U+0A09 | U+0A09 | Lo | GURMUKHI LETTER U | | Maybe | U+0A0A | U+0A0A | Lo | GURMUKHI LETTER UU | | Exclude | U+0A0B | U+0A0B | Cn | | | Exclude | U+0A0C | U+0A0C | Cn | | | Exclude | U+0A0D | U+0A0D | Cn | | | Exclude | U+0A0E | U+0A0E | Cn | | | Maybe | U+0A0F | U+0A0F | Lo | GURMUKHI LETTER EE | | Maybe | U+0A10 | U+0A10 | Lo | GURMUKHI LETTER AI | | Exclude | U+0A11 | U+0A11 | Cn | | | Exclude | U+0A12 | U+0A12 | Cn | | | Maybe | U+0A13 | U+0A13 | Lo | GURMUKHI LETTER OO | | Maybe | U+0A14 | U+0A14 | Lo | GURMUKHI LETTER AU | | Maybe | U+0A15 | U+0A15 | Lo | GURMUKHI LETTER KA | | Maybe | U+0A16 | U+0A16 | Lo | GURMUKHI LETTER KHA | | Maybe | U+0A17 | U+0A17 | Lo | GURMUKHI LETTER GA | Faltstrom Expires April 26, 2007 [Page 96] Internet-Draft Unicode Codepoints October 2006 | Maybe | U+0A18 | U+0A18 | Lo | GURMUKHI LETTER GHA | | Maybe | U+0A19 | U+0A19 | Lo | GURMUKHI LETTER NGA | | Maybe | U+0A1A | U+0A1A | Lo | GURMUKHI LETTER CA | | Maybe | U+0A1B | U+0A1B | Lo | GURMUKHI LETTER CHA | | 
Maybe | U+0A1C | U+0A1C | Lo | GURMUKHI LETTER JA | | Maybe | U+0A1D | U+0A1D | Lo | GURMUKHI LETTER JHA | | Maybe | U+0A1E | U+0A1E | Lo | GURMUKHI LETTER NYA | | Maybe | U+0A1F | U+0A1F | Lo | GURMUKHI LETTER TTA | | Maybe | U+0A20 | U+0A20 | Lo | GURMUKHI LETTER TTHA | | Maybe | U+0A21 | U+0A21 | Lo | GURMUKHI LETTER DDA | | Maybe | U+0A22 | U+0A22 | Lo | GURMUKHI LETTER DDHA | | Maybe | U+0A23 | U+0A23 | Lo | GURMUKHI LETTER NNA | | Maybe | U+0A24 | U+0A24 | Lo | GURMUKHI LETTER TA | | Maybe | U+0A25 | U+0A25 | Lo | GURMUKHI LETTER THA | | Maybe | U+0A26 | U+0A26 | Lo | GURMUKHI LETTER DA | | Maybe | U+0A27 | U+0A27 | Lo | GURMUKHI LETTER DHA | | Maybe | U+0A28 | U+0A28 | Lo | GURMUKHI LETTER NA | | Exclude | U+0A29 | U+0A29 | Cn | | | Maybe | U+0A2A | U+0A2A | Lo | GURMUKHI LETTER PA | | Maybe | U+0A2B | U+0A2B | Lo | GURMUKHI LETTER PHA | | Maybe | U+0A2C | U+0A2C | Lo | GURMUKHI LETTER BA | | Maybe | U+0A2D | U+0A2D | Lo | GURMUKHI LETTER BHA | | Maybe | U+0A2E | U+0A2E | Lo | GURMUKHI LETTER MA | | Maybe | U+0A2F | U+0A2F | Lo | GURMUKHI LETTER YA | | Maybe | U+0A30 | U+0A30 | Lo | GURMUKHI LETTER RA | | Exclude | U+0A31 | U+0A31 | Cn | | | Maybe | U+0A32 | U+0A32 | Lo | GURMUKHI LETTER LA | | Possibly not | U+0A33 | U+0A32 | Lo Mn | GURMUKHI LETTER LA | | Exclude | U+0A34 | U+0A34 | Cn | | | Maybe | U+0A35 | U+0A35 | Lo | GURMUKHI LETTER VA | | Possibly not | U+0A36 | U+0A38 | Lo Mn | GURMUKHI LETTER SA | | Exclude | U+0A37 | U+0A37 | Cn | | | Maybe | U+0A38 | U+0A38 | Lo | GURMUKHI LETTER SA | | Maybe | U+0A39 | U+0A39 | Lo | GURMUKHI LETTER HA | | Exclude | U+0A3A | U+0A3A | Cn | | | Exclude | U+0A3B | U+0A3B | Cn | | | Possibly not | U+0A3C | U+0A3C | Mn | GURMUKHI SIGN NUKTA | | Exclude | U+0A3D | U+0A3D | Cn | | | Maybe | U+0A3E | U+0A3E | Mc | GURMUKHI VOWEL SIGN AA | | Maybe | U+0A3F | U+0A3F | Mc | GURMUKHI VOWEL SIGN I | | Maybe | U+0A40 | U+0A40 | Mc | GURMUKHI VOWEL SIGN II | | Possibly not | U+0A41 | U+0A41 | Mn | GURMUKHI VOWEL SIGN U | | 
Possibly not | U+0A42 | U+0A42 | Mn | GURMUKHI VOWEL SIGN UU | | Exclude | U+0A43 | U+0A43 | Cn | | | Exclude | U+0A44 | U+0A44 | Cn | | | Exclude | U+0A45 | U+0A45 | Cn | | | Exclude | U+0A46 | U+0A46 | Cn | | | Possibly not | U+0A47 | U+0A47 | Mn | GURMUKHI VOWEL SIGN EE | Faltstrom Expires April 26, 2007 [Page 97] Internet-Draft Unicode Codepoints October 2006 | Possibly not | U+0A48 | U+0A48 | Mn | GURMUKHI VOWEL SIGN AI | | Exclude | U+0A49 | U+0A49 | Cn | | | Exclude | U+0A4A | U+0A4A | Cn | | | Possibly not | U+0A4B | U+0A4B | Mn | GURMUKHI VOWEL SIGN OO | | Possibly not | U+0A4C | U+0A4C | Mn | GURMUKHI VOWEL SIGN AU | | Possibly not | U+0A4D | U+0A4D | Mn | GURMUKHI SIGN VIRAMA | | Exclude | U+0A4E | U+0A4E | Cn | | | Exclude | U+0A4F | U+0A4F | Cn | | | Exclude | U+0A50 | U+0A50 | Cn | | | Exclude | U+0A51 | U+0A51 | Cn | | | Exclude | U+0A52 | U+0A52 | Cn | | | Exclude | U+0A53 | U+0A53 | Cn | | | Exclude | U+0A54 | U+0A54 | Cn | | | Exclude | U+0A55 | U+0A55 | Cn | | | Exclude | U+0A56 | U+0A56 | Cn | | | Exclude | U+0A57 | U+0A57 | Cn | | | Exclude | U+0A58 | U+0A58 | Cn | | | Possibly not | U+0A59 | U+0A16 | Lo Mn | GURMUKHI LETTER KHA | | Possibly not | U+0A5A | U+0A17 | Lo Mn | GURMUKHI LETTER GA | | Possibly not | U+0A5B | U+0A1C | Lo Mn | GURMUKHI LETTER JA | | Maybe | U+0A5C | U+0A5C | Lo | GURMUKHI LETTER RRA | | Exclude | U+0A5D | U+0A5D | Cn | | | Possibly not | U+0A5E | U+0A2B | Lo Mn | GURMUKHI LETTER PHA | | Exclude | U+0A5F | U+0A5F | Cn | | | Exclude | U+0A60 | U+0A60 | Cn | | | Exclude | U+0A61 | U+0A61 | Cn | | | Exclude | U+0A62 | U+0A62 | Cn | | | Exclude | U+0A63 | U+0A63 | Cn | | | Exclude | U+0A64 | U+0A64 | Cn | | | Exclude | U+0A65 | U+0A65 | Cn | | | Maybe | U+0A66 | U+0A66 | Nd | GURMUKHI DIGIT ZERO | | Maybe | U+0A67 | U+0A67 | Nd | GURMUKHI DIGIT ONE | | Maybe | U+0A68 | U+0A68 | Nd | GURMUKHI DIGIT TWO | | Maybe | U+0A69 | U+0A69 | Nd | GURMUKHI DIGIT THREE | | Maybe | U+0A6A | U+0A6A | Nd | GURMUKHI DIGIT FOUR | | Maybe | 
U+0A6B | U+0A6B | Nd | GURMUKHI DIGIT FIVE | | Maybe | U+0A6C | U+0A6C | Nd | GURMUKHI DIGIT SIX | | Maybe | U+0A6D | U+0A6D | Nd | GURMUKHI DIGIT SEVEN | | Maybe | U+0A6E | U+0A6E | Nd | GURMUKHI DIGIT EIGHT | | Maybe | U+0A6F | U+0A6F | Nd | GURMUKHI DIGIT NINE | | Possibly not | U+0A70 | U+0A70 | Mn | GURMUKHI TIPPI | | Possibly not | U+0A71 | U+0A71 | Mn | GURMUKHI ADDAK | | Maybe | U+0A72 | U+0A72 | Lo | GURMUKHI IRI | | Maybe | U+0A73 | U+0A73 | Lo | GURMUKHI URA | | Maybe | U+0A74 | U+0A74 | Lo | GURMUKHI EK ONKAR | | Exclude | U+0A75 | U+0A75 | Cn | | | Exclude | U+0A76 | U+0A76 | Cn | | | Exclude | U+0A77 | U+0A77 | Cn | | Faltstrom Expires April 26, 2007 [Page 98] Internet-Draft Unicode Codepoints October 2006 | Exclude | U+0A78 | U+0A78 | Cn | | | Exclude | U+0A79 | U+0A79 | Cn | | | Exclude | U+0A7A | U+0A7A | Cn | | | Exclude | U+0A7B | U+0A7B | Cn | | | Exclude | U+0A7C | U+0A7C | Cn | | | Exclude | U+0A7D | U+0A7D | Cn | | | Exclude | U+0A7E | U+0A7E | Cn | | | Exclude | U+0A7F | U+0A7F | Cn | | +--------------+--------+--------+-------+--------------------------+ 4.20. 0A80-0AFF Gujarati +-------------+--------+--------+-------+---------------------------+ | Include? 
| Code | NFKC | Class | Name | +-------------+--------+--------+-------+---------------------------+ | Exclude | U+0A80 | U+0A80 | Cn | | | Possibly | U+0A81 | U+0A81 | Mn | GUJARATI SIGN CANDRABINDU | | not | | | | | | Possibly | U+0A82 | U+0A82 | Mn | GUJARATI SIGN ANUSVARA | | not | | | | | | Maybe | U+0A83 | U+0A83 | Mc | GUJARATI SIGN VISARGA | | Exclude | U+0A84 | U+0A84 | Cn | | | Maybe | U+0A85 | U+0A85 | Lo | GUJARATI LETTER A | | Maybe | U+0A86 | U+0A86 | Lo | GUJARATI LETTER AA | | Maybe | U+0A87 | U+0A87 | Lo | GUJARATI LETTER I | | Maybe | U+0A88 | U+0A88 | Lo | GUJARATI LETTER II | | Maybe | U+0A89 | U+0A89 | Lo | GUJARATI LETTER U | | Maybe | U+0A8A | U+0A8A | Lo | GUJARATI LETTER UU | | Maybe | U+0A8B | U+0A8B | Lo | GUJARATI LETTER VOCALIC R | | Maybe | U+0A8C | U+0A8C | Lo | GUJARATI LETTER VOCALIC L | | Maybe | U+0A8D | U+0A8D | Lo | GUJARATI VOWEL CANDRA E | | Exclude | U+0A8E | U+0A8E | Cn | | | Maybe | U+0A8F | U+0A8F | Lo | GUJARATI LETTER E | | Maybe | U+0A90 | U+0A90 | Lo | GUJARATI LETTER AI | | Maybe | U+0A91 | U+0A91 | Lo | GUJARATI VOWEL CANDRA O | | Exclude | U+0A92 | U+0A92 | Cn | | | Maybe | U+0A93 | U+0A93 | Lo | GUJARATI LETTER O | | Maybe | U+0A94 | U+0A94 | Lo | GUJARATI LETTER AU | | Maybe | U+0A95 | U+0A95 | Lo | GUJARATI LETTER KA | | Maybe | U+0A96 | U+0A96 | Lo | GUJARATI LETTER KHA | | Maybe | U+0A97 | U+0A97 | Lo | GUJARATI LETTER GA | | Maybe | U+0A98 | U+0A98 | Lo | GUJARATI LETTER GHA | | Maybe | U+0A99 | U+0A99 | Lo | GUJARATI LETTER NGA | | Maybe | U+0A9A | U+0A9A | Lo | GUJARATI LETTER CA | | Maybe | U+0A9B | U+0A9B | Lo | GUJARATI LETTER CHA | | Maybe | U+0A9C | U+0A9C | Lo | GUJARATI LETTER JA | | Maybe | U+0A9D | U+0A9D | Lo | GUJARATI LETTER JHA | | Maybe | U+0A9E | U+0A9E | Lo | GUJARATI LETTER NYA | Faltstrom Expires April 26, 2007 [Page 99] Internet-Draft Unicode Codepoints October 2006 | Maybe | U+0A9F | U+0A9F | Lo | GUJARATI LETTER TTA | | Maybe | U+0AA0 | U+0AA0 | Lo | GUJARATI LETTER TTHA | | Maybe | 
U+0AA1 | U+0AA1 | Lo | GUJARATI LETTER DDA | | Maybe | U+0AA2 | U+0AA2 | Lo | GUJARATI LETTER DDHA | | Maybe | U+0AA3 | U+0AA3 | Lo | GUJARATI LETTER NNA | | Maybe | U+0AA4 | U+0AA4 | Lo | GUJARATI LETTER TA | | Maybe | U+0AA5 | U+0AA5 | Lo | GUJARATI LETTER THA | | Maybe | U+0AA6 | U+0AA6 | Lo | GUJARATI LETTER DA | | Maybe | U+0AA7 | U+0AA7 | Lo | GUJARATI LETTER DHA | | Maybe | U+0AA8 | U+0AA8 | Lo | GUJARATI LETTER NA | | Exclude | U+0AA9 | U+0AA9 | Cn | | | Maybe | U+0AAA | U+0AAA | Lo | GUJARATI LETTER PA | | Maybe | U+0AAB | U+0AAB | Lo | GUJARATI LETTER PHA | | Maybe | U+0AAC | U+0AAC | Lo | GUJARATI LETTER BA | | Maybe | U+0AAD | U+0AAD | Lo | GUJARATI LETTER BHA | | Maybe | U+0AAE | U+0AAE | Lo | GUJARATI LETTER MA | | Maybe | U+0AAF | U+0AAF | Lo | GUJARATI LETTER YA | | Maybe | U+0AB0 | U+0AB0 | Lo | GUJARATI LETTER RA | | Exclude | U+0AB1 | U+0AB1 | Cn | | | Maybe | U+0AB2 | U+0AB2 | Lo | GUJARATI LETTER LA | | Maybe | U+0AB3 | U+0AB3 | Lo | GUJARATI LETTER LLA | | Exclude | U+0AB4 | U+0AB4 | Cn | | | Maybe | U+0AB5 | U+0AB5 | Lo | GUJARATI LETTER VA | | Maybe | U+0AB6 | U+0AB6 | Lo | GUJARATI LETTER SHA | | Maybe | U+0AB7 | U+0AB7 | Lo | GUJARATI LETTER SSA | | Maybe | U+0AB8 | U+0AB8 | Lo | GUJARATI LETTER SA | | Maybe | U+0AB9 | U+0AB9 | Lo | GUJARATI LETTER HA | | Exclude | U+0ABA | U+0ABA | Cn | | | Exclude | U+0ABB | U+0ABB | Cn | | | Possibly | U+0ABC | U+0ABC | Mn | GUJARATI SIGN NUKTA | | not | | | | | | Maybe | U+0ABD | U+0ABD | Lo | GUJARATI SIGN AVAGRAHA | | Maybe | U+0ABE | U+0ABE | Mc | GUJARATI VOWEL SIGN AA | | Maybe | U+0ABF | U+0ABF | Mc | GUJARATI VOWEL SIGN I | | Maybe | U+0AC0 | U+0AC0 | Mc | GUJARATI VOWEL SIGN II | | Possibly | U+0AC1 | U+0AC1 | Mn | GUJARATI VOWEL SIGN U | | not | | | | | | Possibly | U+0AC2 | U+0AC2 | Mn | GUJARATI VOWEL SIGN UU | | not | | | | | | Possibly | U+0AC3 | U+0AC3 | Mn | GUJARATI VOWEL SIGN | | not | | | | VOCALIC R | | Possibly | U+0AC4 | U+0AC4 | Mn | GUJARATI VOWEL SIGN | | not | | | | VOCALIC RR 
| | Possibly | U+0AC5 | U+0AC5 | Mn | GUJARATI VOWEL SIGN | | not | | | | CANDRA E | | Exclude | U+0AC6 | U+0AC6 | Cn | | | Possibly | U+0AC7 | U+0AC7 | Mn | GUJARATI VOWEL SIGN E | | not | | | | | Faltstrom Expires April 26, 2007 [Page 100] Internet-Draft Unicode Codepoints October 2006 | Possibly | U+0AC8 | U+0AC8 | Mn | GUJARATI VOWEL SIGN AI | | not | | | | | | Maybe | U+0AC9 | U+0AC9 | Mc | GUJARATI VOWEL SIGN | | | | | | CANDRA O | | Exclude | U+0ACA | U+0ACA | Cn | | | Maybe | U+0ACB | U+0ACB | Mc | GUJARATI VOWEL SIGN O | | Maybe | U+0ACC | U+0ACC | Mc | GUJARATI VOWEL SIGN AU | | Possibly | U+0ACD | U+0ACD | Mn | GUJARATI SIGN VIRAMA | | not | | | | | | Exclude | U+0ACE | U+0ACE | Cn | | | Exclude | U+0ACF | U+0ACF | Cn | | | Maybe | U+0AD0 | U+0AD0 | Lo | GUJARATI OM | | Exclude | U+0AD1 | U+0AD1 | Cn | | | Exclude | U+0AD2 | U+0AD2 | Cn | | | Exclude | U+0AD3 | U+0AD3 | Cn | | | Exclude | U+0AD4 | U+0AD4 | Cn | | | Exclude | U+0AD5 | U+0AD5 | Cn | | | Exclude | U+0AD6 | U+0AD6 | Cn | | | Exclude | U+0AD7 | U+0AD7 | Cn | | | Exclude | U+0AD8 | U+0AD8 | Cn | | | Exclude | U+0AD9 | U+0AD9 | Cn | | | Exclude | U+0ADA | U+0ADA | Cn | | | Exclude | U+0ADB | U+0ADB | Cn | | | Exclude | U+0ADC | U+0ADC | Cn | | | Exclude | U+0ADD | U+0ADD | Cn | | | Exclude | U+0ADE | U+0ADE | Cn | | | Exclude | U+0ADF | U+0ADF | Cn | | | Maybe | U+0AE0 | U+0AE0 | Lo | GUJARATI LETTER VOCALIC | | | | | | RR | | Maybe | U+0AE1 | U+0AE1 | Lo | GUJARATI LETTER VOCALIC | | | | | | LL | | Possibly | U+0AE2 | U+0AE2 | Mn | GUJARATI VOWEL SIGN | | not | | | | VOCALIC L | | Possibly | U+0AE3 | U+0AE3 | Mn | GUJARATI VOWEL SIGN | | not | | | | VOCALIC LL | | Exclude | U+0AE4 | U+0AE4 | Cn | | | Exclude | U+0AE5 | U+0AE5 | Cn | | | Maybe | U+0AE6 | U+0AE6 | Nd | GUJARATI DIGIT ZERO | | Maybe | U+0AE7 | U+0AE7 | Nd | GUJARATI DIGIT ONE | | Maybe | U+0AE8 | U+0AE8 | Nd | GUJARATI DIGIT TWO | | Maybe | U+0AE9 | U+0AE9 | Nd | GUJARATI DIGIT THREE | | Maybe | U+0AEA | U+0AEA | Nd | GUJARATI 
DIGIT FOUR | | Maybe | U+0AEB | U+0AEB | Nd | GUJARATI DIGIT FIVE | | Maybe | U+0AEC | U+0AEC | Nd | GUJARATI DIGIT SIX | | Maybe | U+0AED | U+0AED | Nd | GUJARATI DIGIT SEVEN | | Maybe | U+0AEE | U+0AEE | Nd | GUJARATI DIGIT EIGHT | | Maybe | U+0AEF | U+0AEF | Nd | GUJARATI DIGIT NINE | | Exclude | U+0AF0 | U+0AF0 | Cn | | Faltstrom Expires April 26, 2007 [Page 101] Internet-Draft Unicode Codepoints October 2006 | Exclude | U+0AF1 | U+0AF1 | Sc | GUJARATI RUPEE SIGN | | Exclude | U+0AF2 | U+0AF2 | Cn | | | Exclude | U+0AF3 | U+0AF3 | Cn | | | Exclude | U+0AF4 | U+0AF4 | Cn | | | Exclude | U+0AF5 | U+0AF5 | Cn | | | Exclude | U+0AF6 | U+0AF6 | Cn | | | Exclude | U+0AF7 | U+0AF7 | Cn | | | Exclude | U+0AF8 | U+0AF8 | Cn | | | Exclude | U+0AF9 | U+0AF9 | Cn | | | Exclude | U+0AFA | U+0AFA | Cn | | | Exclude | U+0AFB | U+0AFB | Cn | | | Exclude | U+0AFC | U+0AFC | Cn | | | Exclude | U+0AFD | U+0AFD | Cn | | | Exclude | U+0AFE | U+0AFE | Cn | | | Exclude | U+0AFF | U+0AFF | Cn | | +-------------+--------+--------+-------+---------------------------+ 4.21. 0B00-0B7F Oriya +--------------+--------+--------+-------+--------------------------+ | Include? 
| Code | NFKC | Class | Name | +--------------+--------+--------+-------+--------------------------+ | Exclude | U+0B00 | U+0B00 | Cn | | | Possibly not | U+0B01 | U+0B01 | Mn | ORIYA SIGN CANDRABINDU | | Maybe | U+0B02 | U+0B02 | Mc | ORIYA SIGN ANUSVARA | | Maybe | U+0B03 | U+0B03 | Mc | ORIYA SIGN VISARGA | | Exclude | U+0B04 | U+0B04 | Cn | | | Maybe | U+0B05 | U+0B05 | Lo | ORIYA LETTER A | | Maybe | U+0B06 | U+0B06 | Lo | ORIYA LETTER AA | | Maybe | U+0B07 | U+0B07 | Lo | ORIYA LETTER I | | Maybe | U+0B08 | U+0B08 | Lo | ORIYA LETTER II | | Maybe | U+0B09 | U+0B09 | Lo | ORIYA LETTER U | | Maybe | U+0B0A | U+0B0A | Lo | ORIYA LETTER UU | | Maybe | U+0B0B | U+0B0B | Lo | ORIYA LETTER VOCALIC R | | Maybe | U+0B0C | U+0B0C | Lo | ORIYA LETTER VOCALIC L | | Exclude | U+0B0D | U+0B0D | Cn | | | Exclude | U+0B0E | U+0B0E | Cn | | | Maybe | U+0B0F | U+0B0F | Lo | ORIYA LETTER E | | Maybe | U+0B10 | U+0B10 | Lo | ORIYA LETTER AI | | Exclude | U+0B11 | U+0B11 | Cn | | | Exclude | U+0B12 | U+0B12 | Cn | | | Maybe | U+0B13 | U+0B13 | Lo | ORIYA LETTER O | | Maybe | U+0B14 | U+0B14 | Lo | ORIYA LETTER AU | | Maybe | U+0B15 | U+0B15 | Lo | ORIYA LETTER KA | | Maybe | U+0B16 | U+0B16 | Lo | ORIYA LETTER KHA | | Maybe | U+0B17 | U+0B17 | Lo | ORIYA LETTER GA | | Maybe | U+0B18 | U+0B18 | Lo | ORIYA LETTER GHA | | Maybe | U+0B19 | U+0B19 | Lo | ORIYA LETTER NGA | Faltstrom Expires April 26, 2007 [Page 102] Internet-Draft Unicode Codepoints October 2006 | Maybe | U+0B1A | U+0B1A | Lo | ORIYA LETTER CA | | Maybe | U+0B1B | U+0B1B | Lo | ORIYA LETTER CHA | | Maybe | U+0B1C | U+0B1C | Lo | ORIYA LETTER JA | | Maybe | U+0B1D | U+0B1D | Lo | ORIYA LETTER JHA | | Maybe | U+0B1E | U+0B1E | Lo | ORIYA LETTER NYA | | Maybe | U+0B1F | U+0B1F | Lo | ORIYA LETTER TTA | | Maybe | U+0B20 | U+0B20 | Lo | ORIYA LETTER TTHA | | Maybe | U+0B21 | U+0B21 | Lo | ORIYA LETTER DDA | | Maybe | U+0B22 | U+0B22 | Lo | ORIYA LETTER DDHA | | Maybe | U+0B23 | U+0B23 | Lo | ORIYA LETTER NNA | | Maybe | 
U+0B24 | U+0B24 | Lo | ORIYA LETTER TA | | Maybe | U+0B25 | U+0B25 | Lo | ORIYA LETTER THA | | Maybe | U+0B26 | U+0B26 | Lo | ORIYA LETTER DA | | Maybe | U+0B27 | U+0B27 | Lo | ORIYA LETTER DHA | | Maybe | U+0B28 | U+0B28 | Lo | ORIYA LETTER NA | | Exclude | U+0B29 | U+0B29 | Cn | | | Maybe | U+0B2A | U+0B2A | Lo | ORIYA LETTER PA | | Maybe | U+0B2B | U+0B2B | Lo | ORIYA LETTER PHA | | Maybe | U+0B2C | U+0B2C | Lo | ORIYA LETTER BA | | Maybe | U+0B2D | U+0B2D | Lo | ORIYA LETTER BHA | | Maybe | U+0B2E | U+0B2E | Lo | ORIYA LETTER MA | | Maybe | U+0B2F | U+0B2F | Lo | ORIYA LETTER YA | | Maybe | U+0B30 | U+0B30 | Lo | ORIYA LETTER RA | | Exclude | U+0B31 | U+0B31 | Cn | | | Maybe | U+0B32 | U+0B32 | Lo | ORIYA LETTER LA | | Maybe | U+0B33 | U+0B33 | Lo | ORIYA LETTER LLA | | Exclude | U+0B34 | U+0B34 | Cn | | | Maybe | U+0B35 | U+0B35 | Lo | ORIYA LETTER VA | | Maybe | U+0B36 | U+0B36 | Lo | ORIYA LETTER SHA | | Maybe | U+0B37 | U+0B37 | Lo | ORIYA LETTER SSA | | Maybe | U+0B38 | U+0B38 | Lo | ORIYA LETTER SA | | Maybe | U+0B39 | U+0B39 | Lo | ORIYA LETTER HA | | Exclude | U+0B3A | U+0B3A | Cn | | | Exclude | U+0B3B | U+0B3B | Cn | | | Possibly not | U+0B3C | U+0B3C | Mn | ORIYA SIGN NUKTA | | Maybe | U+0B3D | U+0B3D | Lo | ORIYA SIGN AVAGRAHA | | Maybe | U+0B3E | U+0B3E | Mc | ORIYA VOWEL SIGN AA | | Possibly not | U+0B3F | U+0B3F | Mn | ORIYA VOWEL SIGN I | | Maybe | U+0B40 | U+0B40 | Mc | ORIYA VOWEL SIGN II | | Possibly not | U+0B41 | U+0B41 | Mn | ORIYA VOWEL SIGN U | | Possibly not | U+0B42 | U+0B42 | Mn | ORIYA VOWEL SIGN UU | | Possibly not | U+0B43 | U+0B43 | Mn | ORIYA VOWEL SIGN VOCALIC | | | | | | R | | Exclude | U+0B44 | U+0B44 | Cn | | | Exclude | U+0B45 | U+0B45 | Cn | | | Exclude | U+0B46 | U+0B46 | Cn | | | Maybe | U+0B47 | U+0B47 | Mc | ORIYA VOWEL SIGN E | | Maybe | U+0B48 | U+0B48 | Mc | ORIYA VOWEL SIGN AI | Faltstrom Expires April 26, 2007 [Page 103] Internet-Draft Unicode Codepoints October 2006 | Exclude | U+0B49 | U+0B49 | Cn | | | Exclude | 
U+0B4A | U+0B4A | Cn | | | Maybe | U+0B4B | U+0B4B | Mc | ORIYA VOWEL SIGN O | | Maybe | U+0B4C | U+0B4C | Mc | ORIYA VOWEL SIGN AU | | Possibly not | U+0B4D | U+0B4D | Mn | ORIYA SIGN VIRAMA | | Exclude | U+0B4E | U+0B4E | Cn | | | Exclude | U+0B4F | U+0B4F | Cn | | | Exclude | U+0B50 | U+0B50 | Cn | | | Exclude | U+0B51 | U+0B51 | Cn | | | Exclude | U+0B52 | U+0B52 | Cn | | | Exclude | U+0B53 | U+0B53 | Cn | | | Exclude | U+0B54 | U+0B54 | Cn | | | Exclude | U+0B55 | U+0B55 | Cn | | | Possibly not | U+0B56 | U+0B56 | Mn | ORIYA AI LENGTH MARK | | Maybe | U+0B57 | U+0B57 | Mc | ORIYA AU LENGTH MARK | | Exclude | U+0B58 | U+0B58 | Cn | | | Exclude | U+0B59 | U+0B59 | Cn | | | Exclude | U+0B5A | U+0B5A | Cn | | | Exclude | U+0B5B | U+0B5B | Cn | | | Possibly not | U+0B5C | U+0B21 | Lo Mn | ORIYA LETTER DDA | | Possibly not | U+0B5D | U+0B22 | Lo Mn | ORIYA LETTER DDHA | | Exclude | U+0B5E | U+0B5E | Cn | | | Maybe | U+0B5F | U+0B5F | Lo | ORIYA LETTER YYA | | Maybe | U+0B60 | U+0B60 | Lo | ORIYA LETTER VOCALIC RR | | Maybe | U+0B61 | U+0B61 | Lo | ORIYA LETTER VOCALIC LL | | Exclude | U+0B62 | U+0B62 | Cn | | | Exclude | U+0B63 | U+0B63 | Cn | | | Exclude | U+0B64 | U+0B64 | Cn | | | Exclude | U+0B65 | U+0B65 | Cn | | | Maybe | U+0B66 | U+0B66 | Nd | ORIYA DIGIT ZERO | | Maybe | U+0B67 | U+0B67 | Nd | ORIYA DIGIT ONE | | Maybe | U+0B68 | U+0B68 | Nd | ORIYA DIGIT TWO | | Maybe | U+0B69 | U+0B69 | Nd | ORIYA DIGIT THREE | | Maybe | U+0B6A | U+0B6A | Nd | ORIYA DIGIT FOUR | | Maybe | U+0B6B | U+0B6B | Nd | ORIYA DIGIT FIVE | | Maybe | U+0B6C | U+0B6C | Nd | ORIYA DIGIT SIX | | Maybe | U+0B6D | U+0B6D | Nd | ORIYA DIGIT SEVEN | | Maybe | U+0B6E | U+0B6E | Nd | ORIYA DIGIT EIGHT | | Maybe | U+0B6F | U+0B6F | Nd | ORIYA DIGIT NINE | | Exclude | U+0B70 | U+0B70 | So | ORIYA ISSHAR | | Maybe | U+0B71 | U+0B71 | Lo | ORIYA LETTER WA | | Exclude | U+0B72 | U+0B72 | Cn | | | Exclude | U+0B73 | U+0B73 | Cn | | | Exclude | U+0B74 | U+0B74 | Cn | | | Exclude | U+0B75 | U+0B75 | 
Cn | | | Exclude | U+0B76 | U+0B76 | Cn | | | Exclude | U+0B77 | U+0B77 | Cn | | | Exclude | U+0B78 | U+0B78 | Cn | | Faltstrom Expires April 26, 2007 [Page 104] Internet-Draft Unicode Codepoints October 2006 | Exclude | U+0B79 | U+0B79 | Cn | | | Exclude | U+0B7A | U+0B7A | Cn | | | Exclude | U+0B7B | U+0B7B | Cn | | | Exclude | U+0B7C | U+0B7C | Cn | | | Exclude | U+0B7D | U+0B7D | Cn | | | Exclude | U+0B7E | U+0B7E | Cn | | | Exclude | U+0B7F | U+0B7F | Cn | | +--------------+--------+--------+-------+--------------------------+ 4.22. 0B80-0BFF Tamil +--------------+--------+--------+-------+--------------------------+ | Include? | Code | NFKC | Class | Name | +--------------+--------+--------+-------+--------------------------+ | Exclude | U+0B80 | U+0B80 | Cn | | | Exclude | U+0B81 | U+0B81 | Cn | | | Possibly not | U+0B82 | U+0B82 | Mn | TAMIL SIGN ANUSVARA | | Maybe | U+0B83 | U+0B83 | Lo | TAMIL SIGN VISARGA | | Exclude | U+0B84 | U+0B84 | Cn | | | Maybe | U+0B85 | U+0B85 | Lo | TAMIL LETTER A | | Maybe | U+0B86 | U+0B86 | Lo | TAMIL LETTER AA | | Maybe | U+0B87 | U+0B87 | Lo | TAMIL LETTER I | | Maybe | U+0B88 | U+0B88 | Lo | TAMIL LETTER II | | Maybe | U+0B89 | U+0B89 | Lo | TAMIL LETTER U | | Maybe | U+0B8A | U+0B8A | Lo | TAMIL LETTER UU | | Exclude | U+0B8B | U+0B8B | Cn | | | Exclude | U+0B8C | U+0B8C | Cn | | | Exclude | U+0B8D | U+0B8D | Cn | | | Maybe | U+0B8E | U+0B8E | Lo | TAMIL LETTER E | | Maybe | U+0B8F | U+0B8F | Lo | TAMIL LETTER EE | | Maybe | U+0B90 | U+0B90 | Lo | TAMIL LETTER AI | | Exclude | U+0B91 | U+0B91 | Cn | | | Maybe | U+0B92 | U+0B92 | Lo | TAMIL LETTER O | | Maybe | U+0B93 | U+0B93 | Lo | TAMIL LETTER OO | | Maybe | U+0B94 | U+0B94 | Lo | TAMIL LETTER AU | | Maybe | U+0B95 | U+0B95 | Lo | TAMIL LETTER KA | | Exclude | U+0B96 | U+0B96 | Cn | | | Exclude | U+0B97 | U+0B97 | Cn | | | Exclude | U+0B98 | U+0B98 | Cn | | | Maybe | U+0B99 | U+0B99 | Lo | TAMIL LETTER NGA | | Maybe | U+0B9A | U+0B9A | Lo | TAMIL LETTER CA | | Exclude 
| U+0B9B | U+0B9B | Cn | | | Maybe | U+0B9C | U+0B9C | Lo | TAMIL LETTER JA | | Exclude | U+0B9D | U+0B9D | Cn | | | Maybe | U+0B9E | U+0B9E | Lo | TAMIL LETTER NYA | | Maybe | U+0B9F | U+0B9F | Lo | TAMIL LETTER TTA | | Exclude | U+0BA0 | U+0BA0 | Cn | | | Exclude | U+0BA1 | U+0BA1 | Cn | | Faltstrom Expires April 26, 2007 [Page 105] Internet-Draft Unicode Codepoints October 2006 | Exclude | U+0BA2 | U+0BA2 | Cn | | | Maybe | U+0BA3 | U+0BA3 | Lo | TAMIL LETTER NNA | | Maybe | U+0BA4 | U+0BA4 | Lo | TAMIL LETTER TA | | Exclude | U+0BA5 | U+0BA5 | Cn | | | Exclude | U+0BA6 | U+0BA6 | Cn | | | Exclude | U+0BA7 | U+0BA7 | Cn | | | Maybe | U+0BA8 | U+0BA8 | Lo | TAMIL LETTER NA | | Maybe | U+0BA9 | U+0BA9 | Lo | TAMIL LETTER NNNA | | Maybe | U+0BAA | U+0BAA | Lo | TAMIL LETTER PA | | Exclude | U+0BAB | U+0BAB | Cn | | | Exclude | U+0BAC | U+0BAC | Cn | | | Exclude | U+0BAD | U+0BAD | Cn | | | Maybe | U+0BAE | U+0BAE | Lo | TAMIL LETTER MA | | Maybe | U+0BAF | U+0BAF | Lo | TAMIL LETTER YA | | Maybe | U+0BB0 | U+0BB0 | Lo | TAMIL LETTER RA | | Maybe | U+0BB1 | U+0BB1 | Lo | TAMIL LETTER RRA | | Maybe | U+0BB2 | U+0BB2 | Lo | TAMIL LETTER LA | | Maybe | U+0BB3 | U+0BB3 | Lo | TAMIL LETTER LLA | | Maybe | U+0BB4 | U+0BB4 | Lo | TAMIL LETTER LLLA | | Maybe | U+0BB5 | U+0BB5 | Lo | TAMIL LETTER VA | | Exclude | U+0BB6 | U+0BB6 | Cn | TAMIL LETTER SHA | | Maybe | U+0BB7 | U+0BB7 | Lo | TAMIL LETTER SSA | | Maybe | U+0BB8 | U+0BB8 | Lo | TAMIL LETTER SA | | Maybe | U+0BB9 | U+0BB9 | Lo | TAMIL LETTER HA | | Exclude | U+0BBA | U+0BBA | Cn | | | Exclude | U+0BBB | U+0BBB | Cn | | | Exclude | U+0BBC | U+0BBC | Cn | | | Exclude | U+0BBD | U+0BBD | Cn | | | Maybe | U+0BBE | U+0BBE | Mc | TAMIL VOWEL SIGN AA | | Maybe | U+0BBF | U+0BBF | Mc | TAMIL VOWEL SIGN I | | Possibly not | U+0BC0 | U+0BC0 | Mn | TAMIL VOWEL SIGN II | | Maybe | U+0BC1 | U+0BC1 | Mc | TAMIL VOWEL SIGN U | | Maybe | U+0BC2 | U+0BC2 | Mc | TAMIL VOWEL SIGN UU | | Exclude | U+0BC3 | U+0BC3 | Cn | | | Exclude | 
U+0BC4 | U+0BC4 | Cn | |
| Exclude      | U+0BC5 | U+0BC5 | Cn    |                          |
| Maybe        | U+0BC6 | U+0BC6 | Mc    | TAMIL VOWEL SIGN E       |
| Maybe        | U+0BC7 | U+0BC7 | Mc    | TAMIL VOWEL SIGN EE      |
| Maybe        | U+0BC8 | U+0BC8 | Mc    | TAMIL VOWEL SIGN AI      |
| Exclude      | U+0BC9 | U+0BC9 | Cn    |                          |
| Maybe        | U+0BCA | U+0BCA | Mc    | TAMIL VOWEL SIGN O       |
| Maybe        | U+0BCB | U+0BCB | Mc    | TAMIL VOWEL SIGN OO      |
| Maybe        | U+0BCC | U+0BCC | Mc    | TAMIL VOWEL SIGN AU      |
| Possibly not | U+0BCD | U+0BCD | Mn    | TAMIL SIGN VIRAMA        |
| Exclude      | U+0BCE | U+0BCE | Cn    |                          |
| Exclude      | U+0BCF | U+0BCF | Cn    |                          |
| Exclude      | U+0BD0 | U+0BD0 | Cn    |                          |
| Exclude      | U+0BD1 | U+0BD1 | Cn    |                          |
| Exclude      | U+0BD2 | U+0BD2 | Cn    |                          |
| Exclude      | U+0BD3 | U+0BD3 | Cn    |                          |
| Exclude      | U+0BD4 | U+0BD4 | Cn    |                          |
| Exclude      | U+0BD5 | U+0BD5 | Cn    |                          |
| Exclude      | U+0BD6 | U+0BD6 | Cn    |                          |
| Maybe        | U+0BD7 | U+0BD7 | Mc    | TAMIL AU LENGTH MARK     |
| Exclude      | U+0BD8 | U+0BD8 | Cn    |                          |
| Exclude      | U+0BD9 | U+0BD9 | Cn    |                          |
| Exclude      | U+0BDA | U+0BDA | Cn    |                          |
| Exclude      | U+0BDB | U+0BDB | Cn    |                          |
| Exclude      | U+0BDC | U+0BDC | Cn    |                          |
| Exclude      | U+0BDD | U+0BDD | Cn    |                          |
| Exclude      | U+0BDE | U+0BDE | Cn    |                          |
| Exclude      | U+0BDF | U+0BDF | Cn    |                          |
| Exclude      | U+0BE0 | U+0BE0 | Cn    |                          |
| Exclude      | U+0BE1 | U+0BE1 | Cn    |                          |
| Exclude      | U+0BE2 | U+0BE2 | Cn    |                          |
| Exclude      | U+0BE3 | U+0BE3 | Cn    |                          |
| Exclude      | U+0BE4 | U+0BE4 | Cn    |                          |
| Exclude      | U+0BE5 | U+0BE5 | Cn    |                          |
| Maybe        | U+0BE6 | U+0BE6 | Nd    | TAMIL DIGIT ZERO         |
| Maybe        | U+0BE7 | U+0BE7 | Nd    | TAMIL DIGIT ONE          |
| Maybe        | U+0BE8 | U+0BE8 | Nd    | TAMIL DIGIT TWO          |
| Maybe        | U+0BE9 | U+0BE9 | Nd    | TAMIL DIGIT THREE        |
| Maybe        | U+0BEA | U+0BEA | Nd    | TAMIL DIGIT FOUR         |
| Maybe        | U+0BEB | U+0BEB | Nd    | TAMIL DIGIT FIVE         |
| Maybe        | U+0BEC | U+0BEC | Nd    | TAMIL DIGIT SIX          |
| Maybe        | U+0BED | U+0BED | Nd    | TAMIL DIGIT SEVEN        |
| Maybe        | U+0BEE | U+0BEE | Nd    | TAMIL DIGIT EIGHT        |
| Maybe        | U+0BEF | U+0BEF | Nd    | TAMIL DIGIT NINE         |
| Exclude      | U+0BF0 | U+0BF0 | No    | TAMIL NUMBER TEN         |
| Exclude      | U+0BF1 | U+0BF1 | No    | TAMIL NUMBER ONE HUNDRED |
| Exclude      | U+0BF2 | U+0BF2 | No    | TAMIL NUMBER ONE         |
|              |        |        |       | THOUSAND                 |
| Exclude      | U+0BF3 | U+0BF3 | So    | TAMIL DAY SIGN           |
| Exclude      | U+0BF4 | U+0BF4 | So    | TAMIL MONTH SIGN         |
| Exclude      | U+0BF5 | U+0BF5 | So    | TAMIL YEAR SIGN          |
| Exclude      | U+0BF6 | U+0BF6 | So    | TAMIL DEBIT SIGN         |
| Exclude      | U+0BF7 | U+0BF7 | So    | TAMIL CREDIT SIGN        |
| Exclude      | U+0BF8 | U+0BF8 | So    | TAMIL AS ABOVE SIGN      |
| Exclude      | U+0BF9 | U+0BF9 | Sc    | TAMIL RUPEE SIGN         |
| Exclude      | U+0BFA | U+0BFA | So    | TAMIL NUMBER SIGN        |
| Exclude      | U+0BFB | U+0BFB | Cn    |                          |
| Exclude      | U+0BFC | U+0BFC | Cn    |                          |
| Exclude      | U+0BFD | U+0BFD | Cn    |                          |
| Exclude      | U+0BFE | U+0BFE | Cn    |                          |
| Exclude      | U+0BFF | U+0BFF | Cn    |                          |
+--------------+--------+--------+-------+--------------------------+

4.23. 0C00-0C7F Telugu

+-------------+--------+--------+-------+---------------------------+
| Include?
| Code | NFKC | Class | Name | +-------------+--------+--------+-------+---------------------------+ | Exclude | U+0C00 | U+0C00 | Cn | | | Maybe | U+0C01 | U+0C01 | Mc | TELUGU SIGN CANDRABINDU | | Maybe | U+0C02 | U+0C02 | Mc | TELUGU SIGN ANUSVARA | | Maybe | U+0C03 | U+0C03 | Mc | TELUGU SIGN VISARGA | | Exclude | U+0C04 | U+0C04 | Cn | | | Maybe | U+0C05 | U+0C05 | Lo | TELUGU LETTER A | | Maybe | U+0C06 | U+0C06 | Lo | TELUGU LETTER AA | | Maybe | U+0C07 | U+0C07 | Lo | TELUGU LETTER I | | Maybe | U+0C08 | U+0C08 | Lo | TELUGU LETTER II | | Maybe | U+0C09 | U+0C09 | Lo | TELUGU LETTER U | | Maybe | U+0C0A | U+0C0A | Lo | TELUGU LETTER UU | | Maybe | U+0C0B | U+0C0B | Lo | TELUGU LETTER VOCALIC R | | Maybe | U+0C0C | U+0C0C | Lo | TELUGU LETTER VOCALIC L | | Exclude | U+0C0D | U+0C0D | Cn | | | Maybe | U+0C0E | U+0C0E | Lo | TELUGU LETTER E | | Maybe | U+0C0F | U+0C0F | Lo | TELUGU LETTER EE | | Maybe | U+0C10 | U+0C10 | Lo | TELUGU LETTER AI | | Exclude | U+0C11 | U+0C11 | Cn | | | Maybe | U+0C12 | U+0C12 | Lo | TELUGU LETTER O | | Maybe | U+0C13 | U+0C13 | Lo | TELUGU LETTER OO | | Maybe | U+0C14 | U+0C14 | Lo | TELUGU LETTER AU | | Maybe | U+0C15 | U+0C15 | Lo | TELUGU LETTER KA | | Maybe | U+0C16 | U+0C16 | Lo | TELUGU LETTER KHA | | Maybe | U+0C17 | U+0C17 | Lo | TELUGU LETTER GA | | Maybe | U+0C18 | U+0C18 | Lo | TELUGU LETTER GHA | | Maybe | U+0C19 | U+0C19 | Lo | TELUGU LETTER NGA | | Maybe | U+0C1A | U+0C1A | Lo | TELUGU LETTER CA | | Maybe | U+0C1B | U+0C1B | Lo | TELUGU LETTER CHA | | Maybe | U+0C1C | U+0C1C | Lo | TELUGU LETTER JA | | Maybe | U+0C1D | U+0C1D | Lo | TELUGU LETTER JHA | | Maybe | U+0C1E | U+0C1E | Lo | TELUGU LETTER NYA | | Maybe | U+0C1F | U+0C1F | Lo | TELUGU LETTER TTA | | Maybe | U+0C20 | U+0C20 | Lo | TELUGU LETTER TTHA | | Maybe | U+0C21 | U+0C21 | Lo | TELUGU LETTER DDA | | Maybe | U+0C22 | U+0C22 | Lo | TELUGU LETTER DDHA | | Maybe | U+0C23 | U+0C23 | Lo | TELUGU LETTER NNA | | Maybe | U+0C24 | U+0C24 | Lo | TELUGU LETTER TA 
| | Maybe | U+0C25 | U+0C25 | Lo | TELUGU LETTER THA | | Maybe | U+0C26 | U+0C26 | Lo | TELUGU LETTER DA | | Maybe | U+0C27 | U+0C27 | Lo | TELUGU LETTER DHA | | Maybe | U+0C28 | U+0C28 | Lo | TELUGU LETTER NA | | Exclude | U+0C29 | U+0C29 | Cn | | | Maybe | U+0C2A | U+0C2A | Lo | TELUGU LETTER PA | Faltstrom Expires April 26, 2007 [Page 108] Internet-Draft Unicode Codepoints October 2006 | Maybe | U+0C2B | U+0C2B | Lo | TELUGU LETTER PHA | | Maybe | U+0C2C | U+0C2C | Lo | TELUGU LETTER BA | | Maybe | U+0C2D | U+0C2D | Lo | TELUGU LETTER BHA | | Maybe | U+0C2E | U+0C2E | Lo | TELUGU LETTER MA | | Maybe | U+0C2F | U+0C2F | Lo | TELUGU LETTER YA | | Maybe | U+0C30 | U+0C30 | Lo | TELUGU LETTER RA | | Maybe | U+0C31 | U+0C31 | Lo | TELUGU LETTER RRA | | Maybe | U+0C32 | U+0C32 | Lo | TELUGU LETTER LA | | Maybe | U+0C33 | U+0C33 | Lo | TELUGU LETTER LLA | | Exclude | U+0C34 | U+0C34 | Cn | | | Maybe | U+0C35 | U+0C35 | Lo | TELUGU LETTER VA | | Maybe | U+0C36 | U+0C36 | Lo | TELUGU LETTER SHA | | Maybe | U+0C37 | U+0C37 | Lo | TELUGU LETTER SSA | | Maybe | U+0C38 | U+0C38 | Lo | TELUGU LETTER SA | | Maybe | U+0C39 | U+0C39 | Lo | TELUGU LETTER HA | | Exclude | U+0C3A | U+0C3A | Cn | | | Exclude | U+0C3B | U+0C3B | Cn | | | Exclude | U+0C3C | U+0C3C | Cn | | | Exclude | U+0C3D | U+0C3D | Cn | | | Possibly | U+0C3E | U+0C3E | Mn | TELUGU VOWEL SIGN AA | | not | | | | | | Possibly | U+0C3F | U+0C3F | Mn | TELUGU VOWEL SIGN I | | not | | | | | | Possibly | U+0C40 | U+0C40 | Mn | TELUGU VOWEL SIGN II | | not | | | | | | Maybe | U+0C41 | U+0C41 | Mc | TELUGU VOWEL SIGN U | | Maybe | U+0C42 | U+0C42 | Mc | TELUGU VOWEL SIGN UU | | Maybe | U+0C43 | U+0C43 | Mc | TELUGU VOWEL SIGN VOCALIC | | | | | | R | | Maybe | U+0C44 | U+0C44 | Mc | TELUGU VOWEL SIGN VOCALIC | | | | | | RR | | Exclude | U+0C45 | U+0C45 | Cn | | | Possibly | U+0C46 | U+0C46 | Mn | TELUGU VOWEL SIGN E | | not | | | | | | Possibly | U+0C47 | U+0C47 | Mn | TELUGU VOWEL SIGN EE | | not | | | | | | Possibly | 
U+0C48 | U+0C48 | Mn | TELUGU VOWEL SIGN AI | | not | | | | | | Exclude | U+0C49 | U+0C49 | Cn | | | Possibly | U+0C4A | U+0C4A | Mn | TELUGU VOWEL SIGN O | | not | | | | | | Possibly | U+0C4B | U+0C4B | Mn | TELUGU VOWEL SIGN OO | | not | | | | | | Possibly | U+0C4C | U+0C4C | Mn | TELUGU VOWEL SIGN AU | | not | | | | | | Possibly | U+0C4D | U+0C4D | Mn | TELUGU SIGN VIRAMA | | not | | | | | | Exclude | U+0C4E | U+0C4E | Cn | | Faltstrom Expires April 26, 2007 [Page 109] Internet-Draft Unicode Codepoints October 2006 | Exclude | U+0C4F | U+0C4F | Cn | | | Exclude | U+0C50 | U+0C50 | Cn | | | Exclude | U+0C51 | U+0C51 | Cn | | | Exclude | U+0C52 | U+0C52 | Cn | | | Exclude | U+0C53 | U+0C53 | Cn | | | Exclude | U+0C54 | U+0C54 | Cn | | | Possibly | U+0C55 | U+0C55 | Mn | TELUGU LENGTH MARK | | not | | | | | | Possibly | U+0C56 | U+0C56 | Mn | TELUGU AI LENGTH MARK | | not | | | | | | Exclude | U+0C57 | U+0C57 | Cn | | | Exclude | U+0C58 | U+0C58 | Cn | | | Exclude | U+0C59 | U+0C59 | Cn | | | Exclude | U+0C5A | U+0C5A | Cn | | | Exclude | U+0C5B | U+0C5B | Cn | | | Exclude | U+0C5C | U+0C5C | Cn | | | Exclude | U+0C5D | U+0C5D | Cn | | | Exclude | U+0C5E | U+0C5E | Cn | | | Exclude | U+0C5F | U+0C5F | Cn | | | Maybe | U+0C60 | U+0C60 | Lo | TELUGU LETTER VOCALIC RR | | Maybe | U+0C61 | U+0C61 | Lo | TELUGU LETTER VOCALIC LL | | Exclude | U+0C62 | U+0C62 | Cn | | | Exclude | U+0C63 | U+0C63 | Cn | | | Exclude | U+0C64 | U+0C64 | Cn | | | Exclude | U+0C65 | U+0C65 | Cn | | | Maybe | U+0C66 | U+0C66 | Nd | TELUGU DIGIT ZERO | | Maybe | U+0C67 | U+0C67 | Nd | TELUGU DIGIT ONE | | Maybe | U+0C68 | U+0C68 | Nd | TELUGU DIGIT TWO | | Maybe | U+0C69 | U+0C69 | Nd | TELUGU DIGIT THREE | | Maybe | U+0C6A | U+0C6A | Nd | TELUGU DIGIT FOUR | | Maybe | U+0C6B | U+0C6B | Nd | TELUGU DIGIT FIVE | | Maybe | U+0C6C | U+0C6C | Nd | TELUGU DIGIT SIX | | Maybe | U+0C6D | U+0C6D | Nd | TELUGU DIGIT SEVEN | | Maybe | U+0C6E | U+0C6E | Nd | TELUGU DIGIT EIGHT | | Maybe | U+0C6F | U+0C6F 
| Nd | TELUGU DIGIT NINE |
| Exclude     | U+0C70 | U+0C70 | Cn    |                           |
| Exclude     | U+0C71 | U+0C71 | Cn    |                           |
| Exclude     | U+0C72 | U+0C72 | Cn    |                           |
| Exclude     | U+0C73 | U+0C73 | Cn    |                           |
| Exclude     | U+0C74 | U+0C74 | Cn    |                           |
| Exclude     | U+0C75 | U+0C75 | Cn    |                           |
| Exclude     | U+0C76 | U+0C76 | Cn    |                           |
| Exclude     | U+0C77 | U+0C77 | Cn    |                           |
| Exclude     | U+0C78 | U+0C78 | Cn    |                           |
| Exclude     | U+0C79 | U+0C79 | Cn    |                           |
| Exclude     | U+0C7A | U+0C7A | Cn    |                           |
| Exclude     | U+0C7B | U+0C7B | Cn    |                           |
| Exclude     | U+0C7C | U+0C7C | Cn    |                           |
| Exclude     | U+0C7D | U+0C7D | Cn    |                           |
| Exclude     | U+0C7E | U+0C7E | Cn    |                           |
| Exclude     | U+0C7F | U+0C7F | Cn    |                           |
+-------------+--------+--------+-------+---------------------------+

4.24. 0C80-0CFF Kannada

+-------------+--------+--------+-------+---------------------------+
| Include?    | Code   | NFKC   | Class | Name                      |
+-------------+--------+--------+-------+---------------------------+
| Exclude     | U+0C80 | U+0C80 | Cn    |                           |
| Exclude     | U+0C81 | U+0C81 | Cn    |                           |
| Maybe       | U+0C82 | U+0C82 | Mc    | KANNADA SIGN ANUSVARA     |
| Maybe       | U+0C83 | U+0C83 | Mc    | KANNADA SIGN VISARGA      |
| Exclude     | U+0C84 | U+0C84 | Cn    |                           |
| Maybe       | U+0C85 | U+0C85 | Lo    | KANNADA LETTER A          |
| Maybe       | U+0C86 | U+0C86 | Lo    | KANNADA LETTER AA         |
| Maybe       | U+0C87 | U+0C87 | Lo    | KANNADA LETTER I          |
| Maybe       | U+0C88 | U+0C88 | Lo    | KANNADA LETTER II         |
| Maybe       | U+0C89 | U+0C89 | Lo    | KANNADA LETTER U          |
| Maybe       | U+0C8A | U+0C8A | Lo    | KANNADA LETTER UU         |
| Maybe       | U+0C8B | U+0C8B | Lo    | KANNADA LETTER VOCALIC R  |
| Maybe       | U+0C8C | U+0C8C | Lo    | KANNADA LETTER VOCALIC L  |
| Exclude     | U+0C8D | U+0C8D | Cn    |                           |
| Maybe       | U+0C8E | U+0C8E | Lo    | KANNADA LETTER E          |
| Maybe       | U+0C8F | U+0C8F | Lo    | KANNADA LETTER EE         |
| Maybe       | U+0C90 | U+0C90 | Lo    | KANNADA LETTER AI         |
| Exclude     | U+0C91 | U+0C91 | Cn    |                           |
| Maybe       | U+0C92 | U+0C92 | Lo    | KANNADA LETTER O          |
| Maybe       | U+0C93 | U+0C93 | Lo    | KANNADA LETTER OO         |
| Maybe | U+0C94
| U+0C94 | Lo | KANNADA LETTER AU | | Maybe | U+0C95 | U+0C95 | Lo | KANNADA LETTER KA | | Maybe | U+0C96 | U+0C96 | Lo | KANNADA LETTER KHA | | Maybe | U+0C97 | U+0C97 | Lo | KANNADA LETTER GA | | Maybe | U+0C98 | U+0C98 | Lo | KANNADA LETTER GHA | | Maybe | U+0C99 | U+0C99 | Lo | KANNADA LETTER NGA | | Maybe | U+0C9A | U+0C9A | Lo | KANNADA LETTER CA | | Maybe | U+0C9B | U+0C9B | Lo | KANNADA LETTER CHA | | Maybe | U+0C9C | U+0C9C | Lo | KANNADA LETTER JA | | Maybe | U+0C9D | U+0C9D | Lo | KANNADA LETTER JHA | | Maybe | U+0C9E | U+0C9E | Lo | KANNADA LETTER NYA | | Maybe | U+0C9F | U+0C9F | Lo | KANNADA LETTER TTA | | Maybe | U+0CA0 | U+0CA0 | Lo | KANNADA LETTER TTHA | | Maybe | U+0CA1 | U+0CA1 | Lo | KANNADA LETTER DDA | | Maybe | U+0CA2 | U+0CA2 | Lo | KANNADA LETTER DDHA | | Maybe | U+0CA3 | U+0CA3 | Lo | KANNADA LETTER NNA | | Maybe | U+0CA4 | U+0CA4 | Lo | KANNADA LETTER TA | | Maybe | U+0CA5 | U+0CA5 | Lo | KANNADA LETTER THA | Faltstrom Expires April 26, 2007 [Page 111] Internet-Draft Unicode Codepoints October 2006 | Maybe | U+0CA6 | U+0CA6 | Lo | KANNADA LETTER DA | | Maybe | U+0CA7 | U+0CA7 | Lo | KANNADA LETTER DHA | | Maybe | U+0CA8 | U+0CA8 | Lo | KANNADA LETTER NA | | Exclude | U+0CA9 | U+0CA9 | Cn | | | Maybe | U+0CAA | U+0CAA | Lo | KANNADA LETTER PA | | Maybe | U+0CAB | U+0CAB | Lo | KANNADA LETTER PHA | | Maybe | U+0CAC | U+0CAC | Lo | KANNADA LETTER BA | | Maybe | U+0CAD | U+0CAD | Lo | KANNADA LETTER BHA | | Maybe | U+0CAE | U+0CAE | Lo | KANNADA LETTER MA | | Maybe | U+0CAF | U+0CAF | Lo | KANNADA LETTER YA | | Maybe | U+0CB0 | U+0CB0 | Lo | KANNADA LETTER RA | | Maybe | U+0CB1 | U+0CB1 | Lo | KANNADA LETTER RRA | | Maybe | U+0CB2 | U+0CB2 | Lo | KANNADA LETTER LA | | Maybe | U+0CB3 | U+0CB3 | Lo | KANNADA LETTER LLA | | Exclude | U+0CB4 | U+0CB4 | Cn | | | Maybe | U+0CB5 | U+0CB5 | Lo | KANNADA LETTER VA | | Maybe | U+0CB6 | U+0CB6 | Lo | KANNADA LETTER SHA | | Maybe | U+0CB7 | U+0CB7 | Lo | KANNADA LETTER SSA | | Maybe | U+0CB8 | U+0CB8 | 
Lo | KANNADA LETTER SA | | Maybe | U+0CB9 | U+0CB9 | Lo | KANNADA LETTER HA | | Exclude | U+0CBA | U+0CBA | Cn | | | Exclude | U+0CBB | U+0CBB | Cn | | | Possibly | U+0CBC | U+0CBC | Mn | KANNADA SIGN NUKTA | | not | | | | | | Maybe | U+0CBD | U+0CBD | Lo | KANNADA SIGN AVAGRAHA | | Maybe | U+0CBE | U+0CBE | Mc | KANNADA VOWEL SIGN AA | | Possibly | U+0CBF | U+0CBF | Mn | KANNADA VOWEL SIGN I | | not | | | | | | Maybe | U+0CC0 | U+0CC0 | Mc | KANNADA VOWEL SIGN II | | Maybe | U+0CC1 | U+0CC1 | Mc | KANNADA VOWEL SIGN U | | Maybe | U+0CC2 | U+0CC2 | Mc | KANNADA VOWEL SIGN UU | | Maybe | U+0CC3 | U+0CC3 | Mc | KANNADA VOWEL SIGN | | | | | | VOCALIC R | | Maybe | U+0CC4 | U+0CC4 | Mc | KANNADA VOWEL SIGN | | | | | | VOCALIC RR | | Exclude | U+0CC5 | U+0CC5 | Cn | | | Possibly | U+0CC6 | U+0CC6 | Mn | KANNADA VOWEL SIGN E | | not | | | | | | Maybe | U+0CC7 | U+0CC7 | Mc | KANNADA VOWEL SIGN EE | | Maybe | U+0CC8 | U+0CC8 | Mc | KANNADA VOWEL SIGN AI | | Exclude | U+0CC9 | U+0CC9 | Cn | | | Maybe | U+0CCA | U+0CCA | Mc | KANNADA VOWEL SIGN O | | Maybe | U+0CCB | U+0CCB | Mc | KANNADA VOWEL SIGN OO | | Possibly | U+0CCC | U+0CCC | Mn | KANNADA VOWEL SIGN AU | | not | | | | | | Possibly | U+0CCD | U+0CCD | Mn | KANNADA SIGN VIRAMA | | not | | | | | | Exclude | U+0CCE | U+0CCE | Cn | | Faltstrom Expires April 26, 2007 [Page 112] Internet-Draft Unicode Codepoints October 2006 | Exclude | U+0CCF | U+0CCF | Cn | | | Exclude | U+0CD0 | U+0CD0 | Cn | | | Exclude | U+0CD1 | U+0CD1 | Cn | | | Exclude | U+0CD2 | U+0CD2 | Cn | | | Exclude | U+0CD3 | U+0CD3 | Cn | | | Exclude | U+0CD4 | U+0CD4 | Cn | | | Maybe | U+0CD5 | U+0CD5 | Mc | KANNADA LENGTH MARK | | Maybe | U+0CD6 | U+0CD6 | Mc | KANNADA AI LENGTH MARK | | Exclude | U+0CD7 | U+0CD7 | Cn | | | Exclude | U+0CD8 | U+0CD8 | Cn | | | Exclude | U+0CD9 | U+0CD9 | Cn | | | Exclude | U+0CDA | U+0CDA | Cn | | | Exclude | U+0CDB | U+0CDB | Cn | | | Exclude | U+0CDC | U+0CDC | Cn | | | Exclude | U+0CDD | U+0CDD | Cn | | | Maybe | 
U+0CDE | U+0CDE | Lo | KANNADA LETTER FA |
| Exclude     | U+0CDF | U+0CDF | Cn    |                           |
| Maybe       | U+0CE0 | U+0CE0 | Lo    | KANNADA LETTER VOCALIC RR |
| Maybe       | U+0CE1 | U+0CE1 | Lo    | KANNADA LETTER VOCALIC LL |
| Exclude     | U+0CE2 | U+0CE2 | Cn    |                           |
| Exclude     | U+0CE3 | U+0CE3 | Cn    |                           |
| Exclude     | U+0CE4 | U+0CE4 | Cn    |                           |
| Exclude     | U+0CE5 | U+0CE5 | Cn    |                           |
| Maybe       | U+0CE6 | U+0CE6 | Nd    | KANNADA DIGIT ZERO        |
| Maybe       | U+0CE7 | U+0CE7 | Nd    | KANNADA DIGIT ONE         |
| Maybe       | U+0CE8 | U+0CE8 | Nd    | KANNADA DIGIT TWO         |
| Maybe       | U+0CE9 | U+0CE9 | Nd    | KANNADA DIGIT THREE       |
| Maybe       | U+0CEA | U+0CEA | Nd    | KANNADA DIGIT FOUR        |
| Maybe       | U+0CEB | U+0CEB | Nd    | KANNADA DIGIT FIVE        |
| Maybe       | U+0CEC | U+0CEC | Nd    | KANNADA DIGIT SIX         |
| Maybe       | U+0CED | U+0CED | Nd    | KANNADA DIGIT SEVEN       |
| Maybe       | U+0CEE | U+0CEE | Nd    | KANNADA DIGIT EIGHT       |
| Maybe       | U+0CEF | U+0CEF | Nd    | KANNADA DIGIT NINE        |
| Exclude     | U+0CF0 | U+0CF0 | Cn    |                           |
| Exclude     | U+0CF1 | U+0CF1 | Cn    |                           |
| Exclude     | U+0CF2 | U+0CF2 | Cn    |                           |
| Exclude     | U+0CF3 | U+0CF3 | Cn    |                           |
| Exclude     | U+0CF4 | U+0CF4 | Cn    |                           |
| Exclude     | U+0CF5 | U+0CF5 | Cn    |                           |
| Exclude     | U+0CF6 | U+0CF6 | Cn    |                           |
| Exclude     | U+0CF7 | U+0CF7 | Cn    |                           |
| Exclude     | U+0CF8 | U+0CF8 | Cn    |                           |
| Exclude     | U+0CF9 | U+0CF9 | Cn    |                           |
| Exclude     | U+0CFA | U+0CFA | Cn    |                           |
| Exclude     | U+0CFB | U+0CFB | Cn    |                           |
| Exclude     | U+0CFC | U+0CFC | Cn    |                           |
| Exclude     | U+0CFD | U+0CFD | Cn    |                           |
| Exclude     | U+0CFE | U+0CFE | Cn    |                           |
| Exclude     | U+0CFF | U+0CFF | Cn    |                           |
+-------------+--------+--------+-------+---------------------------+

4.25. 0D00-0D7F Malayalam

+-------------+--------+--------+-------+---------------------------+
| Include?
| Code | NFKC | Class | Name | +-------------+--------+--------+-------+---------------------------+ | Exclude | U+0D00 | U+0D00 | Cn | | | Exclude | U+0D01 | U+0D01 | Cn | | | Maybe | U+0D02 | U+0D02 | Mc | MALAYALAM SIGN ANUSVARA | | Maybe | U+0D03 | U+0D03 | Mc | MALAYALAM SIGN VISARGA | | Exclude | U+0D04 | U+0D04 | Cn | | | Maybe | U+0D05 | U+0D05 | Lo | MALAYALAM LETTER A | | Maybe | U+0D06 | U+0D06 | Lo | MALAYALAM LETTER AA | | Maybe | U+0D07 | U+0D07 | Lo | MALAYALAM LETTER I | | Maybe | U+0D08 | U+0D08 | Lo | MALAYALAM LETTER II | | Maybe | U+0D09 | U+0D09 | Lo | MALAYALAM LETTER U | | Maybe | U+0D0A | U+0D0A | Lo | MALAYALAM LETTER UU | | Maybe | U+0D0B | U+0D0B | Lo | MALAYALAM LETTER VOCALIC | | | | | | R | | Maybe | U+0D0C | U+0D0C | Lo | MALAYALAM LETTER VOCALIC | | | | | | L | | Exclude | U+0D0D | U+0D0D | Cn | | | Maybe | U+0D0E | U+0D0E | Lo | MALAYALAM LETTER E | | Maybe | U+0D0F | U+0D0F | Lo | MALAYALAM LETTER EE | | Maybe | U+0D10 | U+0D10 | Lo | MALAYALAM LETTER AI | | Exclude | U+0D11 | U+0D11 | Cn | | | Maybe | U+0D12 | U+0D12 | Lo | MALAYALAM LETTER O | | Maybe | U+0D13 | U+0D13 | Lo | MALAYALAM LETTER OO | | Maybe | U+0D14 | U+0D14 | Lo | MALAYALAM LETTER AU | | Maybe | U+0D15 | U+0D15 | Lo | MALAYALAM LETTER KA | | Maybe | U+0D16 | U+0D16 | Lo | MALAYALAM LETTER KHA | | Maybe | U+0D17 | U+0D17 | Lo | MALAYALAM LETTER GA | | Maybe | U+0D18 | U+0D18 | Lo | MALAYALAM LETTER GHA | | Maybe | U+0D19 | U+0D19 | Lo | MALAYALAM LETTER NGA | | Maybe | U+0D1A | U+0D1A | Lo | MALAYALAM LETTER CA | | Maybe | U+0D1B | U+0D1B | Lo | MALAYALAM LETTER CHA | | Maybe | U+0D1C | U+0D1C | Lo | MALAYALAM LETTER JA | | Maybe | U+0D1D | U+0D1D | Lo | MALAYALAM LETTER JHA | | Maybe | U+0D1E | U+0D1E | Lo | MALAYALAM LETTER NYA | | Maybe | U+0D1F | U+0D1F | Lo | MALAYALAM LETTER TTA | | Maybe | U+0D20 | U+0D20 | Lo | MALAYALAM LETTER TTHA | | Maybe | U+0D21 | U+0D21 | Lo | MALAYALAM LETTER DDA | | Maybe | U+0D22 | U+0D22 | Lo | MALAYALAM LETTER DDHA | | Maybe | 
U+0D23 | U+0D23 | Lo | MALAYALAM LETTER NNA | | Maybe | U+0D24 | U+0D24 | Lo | MALAYALAM LETTER TA | | Maybe | U+0D25 | U+0D25 | Lo | MALAYALAM LETTER THA | Faltstrom Expires April 26, 2007 [Page 114] Internet-Draft Unicode Codepoints October 2006 | Maybe | U+0D26 | U+0D26 | Lo | MALAYALAM LETTER DA | | Maybe | U+0D27 | U+0D27 | Lo | MALAYALAM LETTER DHA | | Maybe | U+0D28 | U+0D28 | Lo | MALAYALAM LETTER NA | | Exclude | U+0D29 | U+0D29 | Cn | | | Maybe | U+0D2A | U+0D2A | Lo | MALAYALAM LETTER PA | | Maybe | U+0D2B | U+0D2B | Lo | MALAYALAM LETTER PHA | | Maybe | U+0D2C | U+0D2C | Lo | MALAYALAM LETTER BA | | Maybe | U+0D2D | U+0D2D | Lo | MALAYALAM LETTER BHA | | Maybe | U+0D2E | U+0D2E | Lo | MALAYALAM LETTER MA | | Maybe | U+0D2F | U+0D2F | Lo | MALAYALAM LETTER YA | | Maybe | U+0D30 | U+0D30 | Lo | MALAYALAM LETTER RA | | Maybe | U+0D31 | U+0D31 | Lo | MALAYALAM LETTER RRA | | Maybe | U+0D32 | U+0D32 | Lo | MALAYALAM LETTER LA | | Maybe | U+0D33 | U+0D33 | Lo | MALAYALAM LETTER LLA | | Maybe | U+0D34 | U+0D34 | Lo | MALAYALAM LETTER LLLA | | Maybe | U+0D35 | U+0D35 | Lo | MALAYALAM LETTER VA | | Maybe | U+0D36 | U+0D36 | Lo | MALAYALAM LETTER SHA | | Maybe | U+0D37 | U+0D37 | Lo | MALAYALAM LETTER SSA | | Maybe | U+0D38 | U+0D38 | Lo | MALAYALAM LETTER SA | | Maybe | U+0D39 | U+0D39 | Lo | MALAYALAM LETTER HA | | Exclude | U+0D3A | U+0D3A | Cn | | | Exclude | U+0D3B | U+0D3B | Cn | | | Exclude | U+0D3C | U+0D3C | Cn | | | Exclude | U+0D3D | U+0D3D | Cn | | | Maybe | U+0D3E | U+0D3E | Mc | MALAYALAM VOWEL SIGN AA | | Maybe | U+0D3F | U+0D3F | Mc | MALAYALAM VOWEL SIGN I | | Maybe | U+0D40 | U+0D40 | Mc | MALAYALAM VOWEL SIGN II | | Possibly | U+0D41 | U+0D41 | Mn | MALAYALAM VOWEL SIGN U | | not | | | | | | Possibly | U+0D42 | U+0D42 | Mn | MALAYALAM VOWEL SIGN UU | | not | | | | | | Possibly | U+0D43 | U+0D43 | Mn | MALAYALAM VOWEL SIGN | | not | | | | VOCALIC R | | Exclude | U+0D44 | U+0D44 | Cn | | | Exclude | U+0D45 | U+0D45 | Cn | | | Maybe | U+0D46 | 
U+0D46 | Mc | MALAYALAM VOWEL SIGN E | | Maybe | U+0D47 | U+0D47 | Mc | MALAYALAM VOWEL SIGN EE | | Maybe | U+0D48 | U+0D48 | Mc | MALAYALAM VOWEL SIGN AI | | Exclude | U+0D49 | U+0D49 | Cn | | | Maybe | U+0D4A | U+0D4A | Mc | MALAYALAM VOWEL SIGN O | | Maybe | U+0D4B | U+0D4B | Mc | MALAYALAM VOWEL SIGN OO | | Maybe | U+0D4C | U+0D4C | Mc | MALAYALAM VOWEL SIGN AU | | Possibly | U+0D4D | U+0D4D | Mn | MALAYALAM SIGN VIRAMA | | not | | | | | | Exclude | U+0D4E | U+0D4E | Cn | | | Exclude | U+0D4F | U+0D4F | Cn | | | Exclude | U+0D50 | U+0D50 | Cn | | | Exclude | U+0D51 | U+0D51 | Cn | | Faltstrom Expires April 26, 2007 [Page 115] Internet-Draft Unicode Codepoints October 2006 | Exclude | U+0D52 | U+0D52 | Cn | | | Exclude | U+0D53 | U+0D53 | Cn | | | Exclude | U+0D54 | U+0D54 | Cn | | | Exclude | U+0D55 | U+0D55 | Cn | | | Exclude | U+0D56 | U+0D56 | Cn | | | Maybe | U+0D57 | U+0D57 | Mc | MALAYALAM AU LENGTH MARK | | Exclude | U+0D58 | U+0D58 | Cn | | | Exclude | U+0D59 | U+0D59 | Cn | | | Exclude | U+0D5A | U+0D5A | Cn | | | Exclude | U+0D5B | U+0D5B | Cn | | | Exclude | U+0D5C | U+0D5C | Cn | | | Exclude | U+0D5D | U+0D5D | Cn | | | Exclude | U+0D5E | U+0D5E | Cn | | | Exclude | U+0D5F | U+0D5F | Cn | | | Maybe | U+0D60 | U+0D60 | Lo | MALAYALAM LETTER VOCALIC | | | | | | RR | | Maybe | U+0D61 | U+0D61 | Lo | MALAYALAM LETTER VOCALIC | | | | | | LL | | Exclude | U+0D62 | U+0D62 | Cn | | | Exclude | U+0D63 | U+0D63 | Cn | | | Exclude | U+0D64 | U+0D64 | Cn | | | Exclude | U+0D65 | U+0D65 | Cn | | | Maybe | U+0D66 | U+0D66 | Nd | MALAYALAM DIGIT ZERO | | Maybe | U+0D67 | U+0D67 | Nd | MALAYALAM DIGIT ONE | | Maybe | U+0D68 | U+0D68 | Nd | MALAYALAM DIGIT TWO | | Maybe | U+0D69 | U+0D69 | Nd | MALAYALAM DIGIT THREE | | Maybe | U+0D6A | U+0D6A | Nd | MALAYALAM DIGIT FOUR | | Maybe | U+0D6B | U+0D6B | Nd | MALAYALAM DIGIT FIVE | | Maybe | U+0D6C | U+0D6C | Nd | MALAYALAM DIGIT SIX | | Maybe | U+0D6D | U+0D6D | Nd | MALAYALAM DIGIT SEVEN | | Maybe | U+0D6E | U+0D6E | 
Nd | MALAYALAM DIGIT EIGHT |
| Maybe       | U+0D6F | U+0D6F | Nd    | MALAYALAM DIGIT NINE      |
| Exclude     | U+0D70 | U+0D70 | Cn    |                           |
| Exclude     | U+0D71 | U+0D71 | Cn    |                           |
| Exclude     | U+0D72 | U+0D72 | Cn    |                           |
| Exclude     | U+0D73 | U+0D73 | Cn    |                           |
| Exclude     | U+0D74 | U+0D74 | Cn    |                           |
| Exclude     | U+0D75 | U+0D75 | Cn    |                           |
| Exclude     | U+0D76 | U+0D76 | Cn    |                           |
| Exclude     | U+0D77 | U+0D77 | Cn    |                           |
| Exclude     | U+0D78 | U+0D78 | Cn    |                           |
| Exclude     | U+0D79 | U+0D79 | Cn    |                           |
| Exclude     | U+0D7A | U+0D7A | Cn    |                           |
| Exclude     | U+0D7B | U+0D7B | Cn    |                           |
| Exclude     | U+0D7C | U+0D7C | Cn    |                           |
| Exclude     | U+0D7D | U+0D7D | Cn    |                           |
| Exclude     | U+0D7E | U+0D7E | Cn    |                           |
| Exclude     | U+0D7F | U+0D7F | Cn    |                           |
+-------------+--------+--------+-------+---------------------------+

4.26. 0D80-0DFF Sinhala

+----------+--------+--------+-------+------------------------------+
| Include? | Code   | NFKC   | Class | Name                         |
+----------+--------+--------+-------+------------------------------+
| Exclude  | U+0D80 | U+0D80 | Cn    |                              |
| Exclude  | U+0D81 | U+0D81 | Cn    |                              |
| Maybe    | U+0D82 | U+0D82 | Mc    | SINHALA SIGN ANUSVARAYA      |
| Maybe    | U+0D83 | U+0D83 | Mc    | SINHALA SIGN VISARGAYA       |
| Exclude  | U+0D84 | U+0D84 | Cn    |                              |
| Maybe    | U+0D85 | U+0D85 | Lo    | SINHALA LETTER AYANNA        |
| Maybe    | U+0D86 | U+0D86 | Lo    | SINHALA LETTER AAYANNA       |
| Maybe    | U+0D87 | U+0D87 | Lo    | SINHALA LETTER AEYANNA       |
| Maybe    | U+0D88 | U+0D88 | Lo    | SINHALA LETTER AEEYANNA      |
| Maybe    | U+0D89 | U+0D89 | Lo    | SINHALA LETTER IYANNA        |
| Maybe    | U+0D8A | U+0D8A | Lo    | SINHALA LETTER IIYANNA       |
| Maybe    | U+0D8B | U+0D8B | Lo    | SINHALA LETTER UYANNA        |
| Maybe    | U+0D8C | U+0D8C | Lo    | SINHALA LETTER UUYANNA       |
| Maybe    | U+0D8D | U+0D8D | Lo    | SINHALA LETTER IRUYANNA      |
| Maybe    | U+0D8E | U+0D8E | Lo    | SINHALA LETTER IRUUYANNA     |
| Maybe    | U+0D8F | U+0D8F | Lo    | SINHALA LETTER ILUYANNA      |
| Maybe    | U+0D90 | U+0D90 | Lo    | SINHALA LETTER ILUUYANNA     |
| Maybe | U+0D91 | U+0D91 |
Lo | SINHALA LETTER EYANNA | | Maybe | U+0D92 | U+0D92 | Lo | SINHALA LETTER EEYANNA | | Maybe | U+0D93 | U+0D93 | Lo | SINHALA LETTER AIYANNA | | Maybe | U+0D94 | U+0D94 | Lo | SINHALA LETTER OYANNA | | Maybe | U+0D95 | U+0D95 | Lo | SINHALA LETTER OOYANNA | | Maybe | U+0D96 | U+0D96 | Lo | SINHALA LETTER AUYANNA | | Exclude | U+0D97 | U+0D97 | Cn | | | Exclude | U+0D98 | U+0D98 | Cn | | | Exclude | U+0D99 | U+0D99 | Cn | | | Maybe | U+0D9A | U+0D9A | Lo | SINHALA LETTER ALPAPRAANA | | | | | | KAYANNA | | Maybe | U+0D9B | U+0D9B | Lo | SINHALA LETTER MAHAAPRAANA | | | | | | KAYANNA | | Maybe | U+0D9C | U+0D9C | Lo | SINHALA LETTER ALPAPRAANA | | | | | | GAYANNA | | Maybe | U+0D9D | U+0D9D | Lo | SINHALA LETTER MAHAAPRAANA | | | | | | GAYANNA | | Maybe | U+0D9E | U+0D9E | Lo | SINHALA LETTER KANTAJA | | | | | | NAASIKYAYA | | Maybe | U+0D9F | U+0D9F | Lo | SINHALA LETTER SANYAKA | | | | | | GAYANNA | | Maybe | U+0DA0 | U+0DA0 | Lo | SINHALA LETTER ALPAPRAANA | | | | | | CAYANNA | Faltstrom Expires April 26, 2007 [Page 117] Internet-Draft Unicode Codepoints October 2006 | Maybe | U+0DA1 | U+0DA1 | Lo | SINHALA LETTER MAHAAPRAANA | | | | | | CAYANNA | | Maybe | U+0DA2 | U+0DA2 | Lo | SINHALA LETTER ALPAPRAANA | | | | | | JAYANNA | | Maybe | U+0DA3 | U+0DA3 | Lo | SINHALA LETTER MAHAAPRAANA | | | | | | JAYANNA | | Maybe | U+0DA4 | U+0DA4 | Lo | SINHALA LETTER TAALUJA | | | | | | NAASIKYAYA | | Maybe | U+0DA5 | U+0DA5 | Lo | SINHALA LETTER TAALUJA | | | | | | SANYOOGA NAAKSIKYAYA | | Maybe | U+0DA6 | U+0DA6 | Lo | SINHALA LETTER SANYAKA | | | | | | JAYANNA | | Maybe | U+0DA7 | U+0DA7 | Lo | SINHALA LETTER ALPAPRAANA | | | | | | TTAYANNA | | Maybe | U+0DA8 | U+0DA8 | Lo | SINHALA LETTER MAHAAPRAANA | | | | | | TTAYANNA | | Maybe | U+0DA9 | U+0DA9 | Lo | SINHALA LETTER ALPAPRAANA | | | | | | DDAYANNA | | Maybe | U+0DAA | U+0DAA | Lo | SINHALA LETTER MAHAAPRAANA | | | | | | DDAYANNA | | Maybe | U+0DAB | U+0DAB | Lo | SINHALA LETTER MUURDHAJA | | | | | | NAYANNA | | Maybe 
| U+0DAC | U+0DAC | Lo | SINHALA LETTER SANYAKA | | | | | | DDAYANNA | | Maybe | U+0DAD | U+0DAD | Lo | SINHALA LETTER ALPAPRAANA | | | | | | TAYANNA | | Maybe | U+0DAE | U+0DAE | Lo | SINHALA LETTER MAHAAPRAANA | | | | | | TAYANNA | | Maybe | U+0DAF | U+0DAF | Lo | SINHALA LETTER ALPAPRAANA | | | | | | DAYANNA | | Maybe | U+0DB0 | U+0DB0 | Lo | SINHALA LETTER MAHAAPRAANA | | | | | | DAYANNA | | Maybe | U+0DB1 | U+0DB1 | Lo | SINHALA LETTER DANTAJA | | | | | | NAYANNA | | Exclude | U+0DB2 | U+0DB2 | Cn | | | Maybe | U+0DB3 | U+0DB3 | Lo | SINHALA LETTER SANYAKA | | | | | | DAYANNA | | Maybe | U+0DB4 | U+0DB4 | Lo | SINHALA LETTER ALPAPRAANA | | | | | | PAYANNA | | Maybe | U+0DB5 | U+0DB5 | Lo | SINHALA LETTER MAHAAPRAANA | | | | | | PAYANNA | | Maybe | U+0DB6 | U+0DB6 | Lo | SINHALA LETTER ALPAPRAANA | | | | | | BAYANNA | | Maybe | U+0DB7 | U+0DB7 | Lo | SINHALA LETTER MAHAAPRAANA | | | | | | BAYANNA | | Maybe | U+0DB8 | U+0DB8 | Lo | SINHALA LETTER MAYANNA | | Maybe | U+0DB9 | U+0DB9 | Lo | SINHALA LETTER AMBA BAYANNA | | Maybe | U+0DBA | U+0DBA | Lo | SINHALA LETTER YAYANNA | Faltstrom Expires April 26, 2007 [Page 118] Internet-Draft Unicode Codepoints October 2006 | Maybe | U+0DBB | U+0DBB | Lo | SINHALA LETTER RAYANNA | | Exclude | U+0DBC | U+0DBC | Cn | | | Maybe | U+0DBD | U+0DBD | Lo | SINHALA LETTER DANTAJA | | | | | | LAYANNA | | Exclude | U+0DBE | U+0DBE | Cn | | | Exclude | U+0DBF | U+0DBF | Cn | | | Maybe | U+0DC0 | U+0DC0 | Lo | SINHALA LETTER VAYANNA | | Maybe | U+0DC1 | U+0DC1 | Lo | SINHALA LETTER TAALUJA | | | | | | SAYANNA | | Maybe | U+0DC2 | U+0DC2 | Lo | SINHALA LETTER MUURDHAJA | | | | | | SAYANNA | | Maybe | U+0DC3 | U+0DC3 | Lo | SINHALA LETTER DANTAJA | | | | | | SAYANNA | | Maybe | U+0DC4 | U+0DC4 | Lo | SINHALA LETTER HAYANNA | | Maybe | U+0DC5 | U+0DC5 | Lo | SINHALA LETTER MUURDHAJA | | | | | | LAYANNA | | Maybe | U+0DC6 | U+0DC6 | Lo | SINHALA LETTER FAYANNA | | Exclude | U+0DC7 | U+0DC7 | Cn | | | Exclude | U+0DC8 | U+0DC8 | Cn | | | 
Exclude | U+0DC9 | U+0DC9 | Cn | | | Possibly | U+0DCA | U+0DCA | Mn | SINHALA SIGN AL-LAKUNA | | not | | | | | | Exclude | U+0DCB | U+0DCB | Cn | | | Exclude | U+0DCC | U+0DCC | Cn | | | Exclude | U+0DCD | U+0DCD | Cn | | | Exclude | U+0DCE | U+0DCE | Cn | | | Maybe | U+0DCF | U+0DCF | Mc | SINHALA VOWEL SIGN | | | | | | AELA-PILLA | | Maybe | U+0DD0 | U+0DD0 | Mc | SINHALA VOWEL SIGN KETTI | | | | | | AEDA-PILLA | | Maybe | U+0DD1 | U+0DD1 | Mc | SINHALA VOWEL SIGN DIGA | | | | | | AEDA-PILLA | | Possibly | U+0DD2 | U+0DD2 | Mn | SINHALA VOWEL SIGN KETTI | | not | | | | IS-PILLA | | Possibly | U+0DD3 | U+0DD3 | Mn | SINHALA VOWEL SIGN DIGA | | not | | | | IS-PILLA | | Possibly | U+0DD4 | U+0DD4 | Mn | SINHALA VOWEL SIGN KETTI | | not | | | | PAA-PILLA | | Exclude | U+0DD5 | U+0DD5 | Cn | | | Possibly | U+0DD6 | U+0DD6 | Mn | SINHALA VOWEL SIGN DIGA | | not | | | | PAA-PILLA | | Exclude | U+0DD7 | U+0DD7 | Cn | | | Maybe | U+0DD8 | U+0DD8 | Mc | SINHALA VOWEL SIGN | | | | | | GAETTA-PILLA | | Maybe | U+0DD9 | U+0DD9 | Mc | SINHALA VOWEL SIGN KOMBUVA | | Maybe | U+0DDA | U+0DDA | Mc | SINHALA VOWEL SIGN DIGA | | | | | | KOMBUVA | Faltstrom Expires April 26, 2007 [Page 119] Internet-Draft Unicode Codepoints October 2006 | Maybe | U+0DDB | U+0DDB | Mc | SINHALA VOWEL SIGN KOMBU | | | | | | DEKA | | Maybe | U+0DDC | U+0DDC | Mc | SINHALA VOWEL SIGN KOMBUVA | | | | | | HAA AELA-PILLA | | Maybe | U+0DDD | U+0DDD | Mc | SINHALA VOWEL SIGN KOMBUVA | | | | | | HAA DIGA AELA-PILLA | | Maybe | U+0DDE | U+0DDE | Mc | SINHALA VOWEL SIGN KOMBUVA | | | | | | HAA GAYANUKITTA | | Maybe | U+0DDF | U+0DDF | Mc | SINHALA VOWEL SIGN | | | | | | GAYANUKITTA | | Exclude | U+0DE0 | U+0DE0 | Cn | | | Exclude | U+0DE1 | U+0DE1 | Cn | | | Exclude | U+0DE2 | U+0DE2 | Cn | | | Exclude | U+0DE3 | U+0DE3 | Cn | | | Exclude | U+0DE4 | U+0DE4 | Cn | | | Exclude | U+0DE5 | U+0DE5 | Cn | | | Exclude | U+0DE6 | U+0DE6 | Cn | | | Exclude | U+0DE7 | U+0DE7 | Cn | | | Exclude | U+0DE8 | U+0DE8 | Cn | | 
| Exclude  | U+0DE9 | U+0DE9 | Cn    |                              |
| Exclude  | U+0DEA | U+0DEA | Cn    |                              |
| Exclude  | U+0DEB | U+0DEB | Cn    |                              |
| Exclude  | U+0DEC | U+0DEC | Cn    |                              |
| Exclude  | U+0DED | U+0DED | Cn    |                              |
| Exclude  | U+0DEE | U+0DEE | Cn    |                              |
| Exclude  | U+0DEF | U+0DEF | Cn    |                              |
| Exclude  | U+0DF0 | U+0DF0 | Cn    |                              |
| Exclude  | U+0DF1 | U+0DF1 | Cn    |                              |
| Maybe    | U+0DF2 | U+0DF2 | Mc    | SINHALA VOWEL SIGN DIGA      |
|          |        |        |       | GAETTA-PILLA                 |
| Maybe    | U+0DF3 | U+0DF3 | Mc    | SINHALA VOWEL SIGN DIGA      |
|          |        |        |       | GAYANUKITTA                  |
| Exclude  | U+0DF4 | U+0DF4 | Po    | SINHALA PUNCTUATION          |
|          |        |        |       | KUNDDALIYA                   |
| Exclude  | U+0DF5 | U+0DF5 | Cn    |                              |
| Exclude  | U+0DF6 | U+0DF6 | Cn    |                              |
| Exclude  | U+0DF7 | U+0DF7 | Cn    |                              |
| Exclude  | U+0DF8 | U+0DF8 | Cn    |                              |
| Exclude  | U+0DF9 | U+0DF9 | Cn    |                              |
| Exclude  | U+0DFA | U+0DFA | Cn    |                              |
| Exclude  | U+0DFB | U+0DFB | Cn    |                              |
| Exclude  | U+0DFC | U+0DFC | Cn    |                              |
| Exclude  | U+0DFD | U+0DFD | Cn    |                              |
| Exclude  | U+0DFE | U+0DFE | Cn    |                              |
| Exclude  | U+0DFF | U+0DFF | Cn    |                              |
+----------+--------+--------+-------+------------------------------+

4.27. 0E00-0E7F Thai

+-------------+--------+--------+-------+---------------------------+
| Include?
| Code | NFKC | Class | Name | +-------------+--------+--------+-------+---------------------------+ | Exclude | U+0E00 | U+0E00 | Cn | | | Maybe | U+0E01 | U+0E01 | Lo | THAI CHARACTER KO KAI | | Maybe | U+0E02 | U+0E02 | Lo | THAI CHARACTER KHO KHAI | | Maybe | U+0E03 | U+0E03 | Lo | THAI CHARACTER KHO KHUAT | | Maybe | U+0E04 | U+0E04 | Lo | THAI CHARACTER KHO KHWAI | | Maybe | U+0E05 | U+0E05 | Lo | THAI CHARACTER KHO KHON | | Maybe | U+0E06 | U+0E06 | Lo | THAI CHARACTER KHO | | | | | | RAKHANG | | Maybe | U+0E07 | U+0E07 | Lo | THAI CHARACTER NGO NGU | | Maybe | U+0E08 | U+0E08 | Lo | THAI CHARACTER CHO CHAN | | Maybe | U+0E09 | U+0E09 | Lo | THAI CHARACTER CHO CHING | | Maybe | U+0E0A | U+0E0A | Lo | THAI CHARACTER CHO CHANG | | Maybe | U+0E0B | U+0E0B | Lo | THAI CHARACTER SO SO | | Maybe | U+0E0C | U+0E0C | Lo | THAI CHARACTER CHO CHOE | | Maybe | U+0E0D | U+0E0D | Lo | THAI CHARACTER YO YING | | Maybe | U+0E0E | U+0E0E | Lo | THAI CHARACTER DO CHADA | | Maybe | U+0E0F | U+0E0F | Lo | THAI CHARACTER TO PATAK | | Maybe | U+0E10 | U+0E10 | Lo | THAI CHARACTER THO THAN | | Maybe | U+0E11 | U+0E11 | Lo | THAI CHARACTER THO | | | | | | NANGMONTHO | | Maybe | U+0E12 | U+0E12 | Lo | THAI CHARACTER THO | | | | | | PHUTHAO | | Maybe | U+0E13 | U+0E13 | Lo | THAI CHARACTER NO NEN | | Maybe | U+0E14 | U+0E14 | Lo | THAI CHARACTER DO DEK | | Maybe | U+0E15 | U+0E15 | Lo | THAI CHARACTER TO TAO | | Maybe | U+0E16 | U+0E16 | Lo | THAI CHARACTER THO THUNG | | Maybe | U+0E17 | U+0E17 | Lo | THAI CHARACTER THO THAHAN | | Maybe | U+0E18 | U+0E18 | Lo | THAI CHARACTER THO THONG | | Maybe | U+0E19 | U+0E19 | Lo | THAI CHARACTER NO NU | | Maybe | U+0E1A | U+0E1A | Lo | THAI CHARACTER BO BAIMAI | | Maybe | U+0E1B | U+0E1B | Lo | THAI CHARACTER PO PLA | | Maybe | U+0E1C | U+0E1C | Lo | THAI CHARACTER PHO PHUNG | | Maybe | U+0E1D | U+0E1D | Lo | THAI CHARACTER FO FA | | Maybe | U+0E1E | U+0E1E | Lo | THAI CHARACTER PHO PHAN | | Maybe | U+0E1F | U+0E1F | Lo | THAI CHARACTER FO FAN 
|
| Maybe | U+0E20 | U+0E20 | Lo | THAI CHARACTER PHO |
| | | | | SAMPHAO |
| Maybe | U+0E21 | U+0E21 | Lo | THAI CHARACTER MO MA |
| Maybe | U+0E22 | U+0E22 | Lo | THAI CHARACTER YO YAK |
| Maybe | U+0E23 | U+0E23 | Lo | THAI CHARACTER RO RUA |
| Maybe | U+0E24 | U+0E24 | Lo | THAI CHARACTER RU |
| Maybe | U+0E25 | U+0E25 | Lo | THAI CHARACTER LO LING |
| Maybe | U+0E26 | U+0E26 | Lo | THAI CHARACTER LU |
| Maybe | U+0E27 | U+0E27 | Lo | THAI CHARACTER WO WAEN |
| Maybe | U+0E28 | U+0E28 | Lo | THAI CHARACTER SO SALA |
| Maybe | U+0E29 | U+0E29 | Lo | THAI CHARACTER SO RUSI |
| Maybe | U+0E2A | U+0E2A | Lo | THAI CHARACTER SO SUA |
| Maybe | U+0E2B | U+0E2B | Lo | THAI CHARACTER HO HIP |
| Maybe | U+0E2C | U+0E2C | Lo | THAI CHARACTER LO CHULA |
| Maybe | U+0E2D | U+0E2D | Lo | THAI CHARACTER O ANG |
| Maybe | U+0E2E | U+0E2E | Lo | THAI CHARACTER HO NOKHUK |
| Maybe | U+0E2F | U+0E2F | Lo | THAI CHARACTER PAIYANNOI |
| Maybe | U+0E30 | U+0E30 | Lo | THAI CHARACTER SARA A |
| Possibly | U+0E31 | U+0E31 | Mn | THAI CHARACTER MAI |
| not | | | | HAN-AKAT |
| Maybe | U+0E32 | U+0E32 | Lo | THAI CHARACTER SARA AA |
| Possibly | U+0E33 | U+0E4D | Lo Mn | THAI CHARACTER NIKHAHIT |
| not | | | | |
| Possibly | U+0E34 | U+0E34 | Mn | THAI CHARACTER SARA I |
| not | | | | |
| Possibly | U+0E35 | U+0E35 | Mn | THAI CHARACTER SARA II |
| not | | | | |
| Possibly | U+0E36 | U+0E36 | Mn | THAI CHARACTER SARA UE |
| not | | | | |
| Possibly | U+0E37 | U+0E37 | Mn | THAI CHARACTER SARA UEE |
| not | | | | |
| Possibly | U+0E38 | U+0E38 | Mn | THAI CHARACTER SARA U |
| not | | | | |
| Possibly | U+0E39 | U+0E39 | Mn | THAI CHARACTER SARA UU |
| not | | | | |
| Possibly | U+0E3A | U+0E3A | Mn | THAI CHARACTER PHINTHU |
| not | | | | |
| Exclude | U+0E3B | U+0E3B | Cn | |
| Exclude | U+0E3C | U+0E3C | Cn | |
| Exclude | U+0E3D | U+0E3D | Cn | |
| Exclude | U+0E3E | U+0E3E | Cn | |
| Exclude |
U+0E3F | U+0E3F | Sc | THAI CURRENCY SYMBOL BAHT |
| Maybe | U+0E40 | U+0E40 | Lo | THAI CHARACTER SARA E |
| Maybe | U+0E41 | U+0E41 | Lo | THAI CHARACTER SARA AE |
| Maybe | U+0E42 | U+0E42 | Lo | THAI CHARACTER SARA O |
| Maybe | U+0E43 | U+0E43 | Lo | THAI CHARACTER SARA AI |
| | | | | MAIMUAN |
| Maybe | U+0E44 | U+0E44 | Lo | THAI CHARACTER SARA AI |
| | | | | MAIMALAI |
| Maybe | U+0E45 | U+0E45 | Lo | THAI CHARACTER |
| | | | | LAKKHANGYAO |
| Exclude | U+0E46 | U+0E46 | Lm | THAI CHARACTER MAIYAMOK |
| Possibly | U+0E47 | U+0E47 | Mn | THAI CHARACTER MAITAIKHU |
| not | | | | |
| Possibly | U+0E48 | U+0E48 | Mn | THAI CHARACTER MAI EK |
| not | | | | |
| Possibly | U+0E49 | U+0E49 | Mn | THAI CHARACTER MAI THO |
| not | | | | |
| Possibly | U+0E4A | U+0E4A | Mn | THAI CHARACTER MAI TRI |
| not | | | | |
| Possibly | U+0E4B | U+0E4B | Mn | THAI CHARACTER MAI |
| not | | | | CHATTAWA |
| Possibly | U+0E4C | U+0E4C | Mn | THAI CHARACTER |
| not | | | | THANTHAKHAT |
| Possibly | U+0E4D | U+0E4D | Mn | THAI CHARACTER NIKHAHIT |
| not | | | | |
| Possibly | U+0E4E | U+0E4E | Mn | THAI CHARACTER YAMAKKAN |
| not | | | | |
| Exclude | U+0E4F | U+0E4F | Po | THAI CHARACTER FONGMAN |
| Maybe | U+0E50 | U+0E50 | Nd | THAI DIGIT ZERO |
| Maybe | U+0E51 | U+0E51 | Nd | THAI DIGIT ONE |
| Maybe | U+0E52 | U+0E52 | Nd | THAI DIGIT TWO |
| Maybe | U+0E53 | U+0E53 | Nd | THAI DIGIT THREE |
| Maybe | U+0E54 | U+0E54 | Nd | THAI DIGIT FOUR |
| Maybe | U+0E55 | U+0E55 | Nd | THAI DIGIT FIVE |
| Maybe | U+0E56 | U+0E56 | Nd | THAI DIGIT SIX |
| Maybe | U+0E57 | U+0E57 | Nd | THAI DIGIT SEVEN |
| Maybe | U+0E58 | U+0E58 | Nd | THAI DIGIT EIGHT |
| Maybe | U+0E59 | U+0E59 | Nd | THAI DIGIT NINE |
| Exclude | U+0E5A | U+0E5A | Po | THAI CHARACTER ANGKHANKHU |
| Exclude | U+0E5B | U+0E5B | Po | THAI CHARACTER KHOMUT |
| Exclude | U+0E5C | U+0E5C | Cn | |
| Exclude | U+0E5D | U+0E5D | Cn | |
| Exclude | U+0E5E | U+0E5E | Cn | |
| Exclude | U+0E5F | U+0E5F | Cn | |
| Exclude | U+0E60 | U+0E60 | Cn | |
| Exclude | U+0E61 | U+0E61 | Cn | |
| Exclude | U+0E62 | U+0E62 | Cn | |
| Exclude | U+0E63 | U+0E63 | Cn | |
| Exclude | U+0E64 | U+0E64 | Cn | |
| Exclude | U+0E65 | U+0E65 | Cn | |
| Exclude | U+0E66 | U+0E66 | Cn | |
| Exclude | U+0E67 | U+0E67 | Cn | |
| Exclude | U+0E68 | U+0E68 | Cn | |
| Exclude | U+0E69 | U+0E69 | Cn | |
| Exclude | U+0E6A | U+0E6A | Cn | |
| Exclude | U+0E6B | U+0E6B | Cn | |
| Exclude | U+0E6C | U+0E6C | Cn | |
| Exclude | U+0E6D | U+0E6D | Cn | |
| Exclude | U+0E6E | U+0E6E | Cn | |
| Exclude | U+0E6F | U+0E6F | Cn | |
| Exclude | U+0E70 | U+0E70 | Cn | |
| Exclude | U+0E71 | U+0E71 | Cn | |
| Exclude | U+0E72 | U+0E72 | Cn | |
| Exclude | U+0E73 | U+0E73 | Cn | |
| Exclude | U+0E74 | U+0E74 | Cn | |
| Exclude | U+0E75 | U+0E75 | Cn | |
| Exclude | U+0E76 | U+0E76 | Cn | |
| Exclude | U+0E77 | U+0E77 | Cn | |
| Exclude | U+0E78 | U+0E78 | Cn | |
| Exclude | U+0E79 | U+0E79 | Cn | |
| Exclude | U+0E7A | U+0E7A | Cn | |
| Exclude | U+0E7B | U+0E7B | Cn | |
| Exclude | U+0E7C | U+0E7C | Cn | |
| Exclude | U+0E7D | U+0E7D | Cn | |
| Exclude | U+0E7E | U+0E7E | Cn | |
| Exclude | U+0E7F | U+0E7F | Cn | |
+-------------+--------+--------+-------+---------------------------+

4.28.  0E80-0EFF Lao

+--------------+--------+--------+-------+------------------------+
| Include?
| Code | NFKC | Class | Name |
+--------------+--------+--------+-------+------------------------+
| Exclude | U+0E80 | U+0E80 | Cn | |
| Maybe | U+0E81 | U+0E81 | Lo | LAO LETTER KO |
| Maybe | U+0E82 | U+0E82 | Lo | LAO LETTER KHO SUNG |
| Exclude | U+0E83 | U+0E83 | Cn | |
| Maybe | U+0E84 | U+0E84 | Lo | LAO LETTER KHO TAM |
| Exclude | U+0E85 | U+0E85 | Cn | |
| Exclude | U+0E86 | U+0E86 | Cn | |
| Maybe | U+0E87 | U+0E87 | Lo | LAO LETTER NGO |
| Maybe | U+0E88 | U+0E88 | Lo | LAO LETTER CO |
| Exclude | U+0E89 | U+0E89 | Cn | |
| Maybe | U+0E8A | U+0E8A | Lo | LAO LETTER SO TAM |
| Exclude | U+0E8B | U+0E8B | Cn | |
| Exclude | U+0E8C | U+0E8C | Cn | |
| Maybe | U+0E8D | U+0E8D | Lo | LAO LETTER NYO |
| Exclude | U+0E8E | U+0E8E | Cn | |
| Exclude | U+0E8F | U+0E8F | Cn | |
| Exclude | U+0E90 | U+0E90 | Cn | |
| Exclude | U+0E91 | U+0E91 | Cn | |
| Exclude | U+0E92 | U+0E92 | Cn | |
| Exclude | U+0E93 | U+0E93 | Cn | |
| Maybe | U+0E94 | U+0E94 | Lo | LAO LETTER DO |
| Maybe | U+0E95 | U+0E95 | Lo | LAO LETTER TO |
| Maybe | U+0E96 | U+0E96 | Lo | LAO LETTER THO SUNG |
| Maybe | U+0E97 | U+0E97 | Lo | LAO LETTER THO TAM |
| Exclude | U+0E98 | U+0E98 | Cn | |
| Maybe | U+0E99 | U+0E99 | Lo | LAO LETTER NO |
| Maybe | U+0E9A | U+0E9A | Lo | LAO LETTER BO |
| Maybe | U+0E9B | U+0E9B | Lo | LAO LETTER PO |
| Maybe | U+0E9C | U+0E9C | Lo | LAO LETTER PHO SUNG |
| Maybe | U+0E9D | U+0E9D | Lo | LAO LETTER FO TAM |
| Maybe | U+0E9E | U+0E9E | Lo | LAO LETTER PHO TAM |
| Maybe | U+0E9F | U+0E9F | Lo | LAO LETTER FO SUNG |
| Exclude | U+0EA0 | U+0EA0 | Cn | |
| Maybe | U+0EA1 | U+0EA1 | Lo | LAO LETTER MO |
| Maybe | U+0EA2 | U+0EA2 | Lo | LAO LETTER YO |
| Maybe | U+0EA3 | U+0EA3 | Lo | LAO LETTER LO LING |
| Exclude | U+0EA4 | U+0EA4 | Cn | |
| Maybe | U+0EA5 | U+0EA5 | Lo | LAO LETTER LO LOOT |
| Exclude | U+0EA6 | U+0EA6 | Cn | |
| Maybe | U+0EA7 | U+0EA7 | Lo | LAO LETTER
WO |
| Exclude | U+0EA8 | U+0EA8 | Cn | |
| Exclude | U+0EA9 | U+0EA9 | Cn | |
| Maybe | U+0EAA | U+0EAA | Lo | LAO LETTER SO SUNG |
| Maybe | U+0EAB | U+0EAB | Lo | LAO LETTER HO SUNG |
| Exclude | U+0EAC | U+0EAC | Cn | |
| Maybe | U+0EAD | U+0EAD | Lo | LAO LETTER O |
| Maybe | U+0EAE | U+0EAE | Lo | LAO LETTER HO TAM |
| Maybe | U+0EAF | U+0EAF | Lo | LAO ELLIPSIS |
| Maybe | U+0EB0 | U+0EB0 | Lo | LAO VOWEL SIGN A |
| Possibly not | U+0EB1 | U+0EB1 | Mn | LAO VOWEL SIGN MAI KAN |
| Maybe | U+0EB2 | U+0EB2 | Lo | LAO VOWEL SIGN AA |
| Possibly not | U+0EB3 | U+0ECD | Lo Mn | LAO NIGGAHITA |
| Possibly not | U+0EB4 | U+0EB4 | Mn | LAO VOWEL SIGN I |
| Possibly not | U+0EB5 | U+0EB5 | Mn | LAO VOWEL SIGN II |
| Possibly not | U+0EB6 | U+0EB6 | Mn | LAO VOWEL SIGN Y |
| Possibly not | U+0EB7 | U+0EB7 | Mn | LAO VOWEL SIGN YY |
| Possibly not | U+0EB8 | U+0EB8 | Mn | LAO VOWEL SIGN U |
| Possibly not | U+0EB9 | U+0EB9 | Mn | LAO VOWEL SIGN UU |
| Exclude | U+0EBA | U+0EBA | Cn | |
| Possibly not | U+0EBB | U+0EBB | Mn | LAO VOWEL SIGN MAI KON |
| Possibly not | U+0EBC | U+0EBC | Mn | LAO SEMIVOWEL SIGN LO |
| Maybe | U+0EBD | U+0EBD | Lo | LAO SEMIVOWEL SIGN NYO |
| Exclude | U+0EBE | U+0EBE | Cn | |
| Exclude | U+0EBF | U+0EBF | Cn | |
| Maybe | U+0EC0 | U+0EC0 | Lo | LAO VOWEL SIGN E |
| Maybe | U+0EC1 | U+0EC1 | Lo | LAO VOWEL SIGN EI |
| Maybe | U+0EC2 | U+0EC2 | Lo | LAO VOWEL SIGN O |
| Maybe | U+0EC3 | U+0EC3 | Lo | LAO VOWEL SIGN AY |
| Maybe | U+0EC4 | U+0EC4 | Lo | LAO VOWEL SIGN AI |
| Exclude | U+0EC5 | U+0EC5 | Cn | |
| Exclude | U+0EC6 | U+0EC6 | Lm | LAO KO LA |
| Exclude | U+0EC7 | U+0EC7 | Cn | |
| Possibly not | U+0EC8 | U+0EC8 | Mn | LAO TONE MAI EK |
| Possibly not | U+0EC9 | U+0EC9 | Mn | LAO TONE MAI THO |
| Possibly not | U+0ECA | U+0ECA | Mn | LAO TONE MAI TI |
| Possibly not | U+0ECB | U+0ECB | Mn | LAO TONE MAI CATAWA |
| Possibly not | U+0ECC | U+0ECC | Mn | LAO CANCELLATION MARK |
| Possibly not | U+0ECD | U+0ECD | Mn | LAO NIGGAHITA |
| Exclude | U+0ECE | U+0ECE | Cn | |
| Exclude | U+0ECF | U+0ECF | Cn | |
| Maybe | U+0ED0 | U+0ED0 | Nd | LAO DIGIT ZERO |
| Maybe | U+0ED1 | U+0ED1 | Nd | LAO DIGIT ONE |
| Maybe | U+0ED2 | U+0ED2 | Nd | LAO DIGIT TWO |
| Maybe | U+0ED3 | U+0ED3 | Nd | LAO DIGIT THREE |
| Maybe | U+0ED4 | U+0ED4 | Nd | LAO DIGIT FOUR |
| Maybe | U+0ED5 | U+0ED5 | Nd | LAO DIGIT FIVE |
| Maybe | U+0ED6 | U+0ED6 | Nd | LAO DIGIT SIX |
| Maybe | U+0ED7 | U+0ED7 | Nd | LAO DIGIT SEVEN |
| Maybe | U+0ED8 | U+0ED8 | Nd | LAO DIGIT EIGHT |
| Maybe | U+0ED9 | U+0ED9 | Nd | LAO DIGIT NINE |
| Exclude | U+0EDA | U+0EDA | Cn | |
| Exclude | U+0EDB | U+0EDB | Cn | |
| Maybe | U+0EDC | U+0EAB | Lo | LAO LETTER HO SUNG |
| Maybe | U+0EDD | U+0EAB | Lo | LAO LETTER HO SUNG |
| Exclude | U+0EDE | U+0EDE | Cn | |
| Exclude | U+0EDF | U+0EDF | Cn | |
| Exclude | U+0EE0 | U+0EE0 | Cn | |
| Exclude | U+0EE1 | U+0EE1 | Cn | |
| Exclude | U+0EE2 | U+0EE2 | Cn | |
| Exclude | U+0EE3 | U+0EE3 | Cn | |
| Exclude | U+0EE4 | U+0EE4 | Cn | |
| Exclude | U+0EE5 | U+0EE5 | Cn | |
| Exclude | U+0EE6 | U+0EE6 | Cn | |
| Exclude | U+0EE7 | U+0EE7 | Cn | |
| Exclude | U+0EE8 | U+0EE8 | Cn | |
| Exclude | U+0EE9 | U+0EE9 | Cn | |
| Exclude | U+0EEA | U+0EEA | Cn | |
| Exclude | U+0EEB | U+0EEB | Cn | |
| Exclude | U+0EEC | U+0EEC | Cn | |
| Exclude | U+0EED | U+0EED | Cn | |
| Exclude | U+0EEE | U+0EEE | Cn | |
| Exclude | U+0EEF | U+0EEF | Cn | |
| Exclude | U+0EF0 | U+0EF0 | Cn | |
| Exclude | U+0EF1 | U+0EF1 | Cn | |
| Exclude | U+0EF2 | U+0EF2 | Cn | |
| Exclude | U+0EF3 | U+0EF3 | Cn | |
| Exclude | U+0EF4 | U+0EF4 | Cn | |
| Exclude | U+0EF5 | U+0EF5 | Cn | |
| Exclude | U+0EF6 | U+0EF6 | Cn | |
| Exclude | U+0EF7 | U+0EF7 | Cn | |
| Exclude | U+0EF8 | U+0EF8 | Cn | |
| Exclude | U+0EF9 | U+0EF9 | Cn | |
| Exclude | U+0EFA | U+0EFA | Cn | |
| Exclude | U+0EFB | U+0EFB | Cn | |
| Exclude | U+0EFC | U+0EFC | Cn | |
|
Exclude | U+0EFD | U+0EFD | Cn | |
| Exclude | U+0EFE | U+0EFE | Cn | |
| Exclude | U+0EFF | U+0EFF | Cn | |
+--------------+--------+--------+-------+------------------------+

4.29.  0F00-0FFF Tibetan

+----------+--------+--------+-------+------------------------------+
| Include? | Code | NFKC | Class | Name |
+----------+--------+--------+-------+------------------------------+
| Maybe | U+0F00 | U+0F00 | Lo | TIBETAN SYLLABLE OM |
| Exclude | U+0F01 | U+0F01 | So | TIBETAN MARK GTER YIG MGO |
| | | | | TRUNCATED A |
| Exclude | U+0F02 | U+0F02 | So | TIBETAN MARK GTER YIG MGO |
| | | | | -UM RNAM BCAD MA |
| Exclude | U+0F03 | U+0F03 | So | TIBETAN MARK GTER YIG MGO |
| | | | | -UM GTER TSHEG MA |
| Exclude | U+0F04 | U+0F04 | Po | TIBETAN MARK INITIAL YIG MGO |
| | | | | MDUN MA |
| Exclude | U+0F05 | U+0F05 | Po | TIBETAN MARK CLOSING YIG MGO |
| | | | | SGAB MA |
| Exclude | U+0F06 | U+0F06 | Po | TIBETAN MARK CARET YIG MGO |
| | | | | PHUR SHAD MA |
| Exclude | U+0F07 | U+0F07 | Po | TIBETAN MARK YIG MGO TSHEG |
| | | | | SHAD MA |
| Exclude | U+0F08 | U+0F08 | Po | TIBETAN MARK SBRUL SHAD |
| Exclude | U+0F09 | U+0F09 | Po | TIBETAN MARK BSKUR YIG MGO |
| Exclude | U+0F0A | U+0F0A | Po | TIBETAN MARK BKA- SHOG YIG |
| | | | | MGO |
| Exclude | U+0F0B | U+0F0B | Po | TIBETAN MARK INTERSYLLABIC |
| | | | | TSHEG |
| Exclude | U+0F0C | U+0F0B | Po | TIBETAN MARK INTERSYLLABIC |
| | | | | TSHEG |
| Exclude | U+0F0D | U+0F0D | Po | TIBETAN MARK SHAD |
| Exclude | U+0F0E | U+0F0E | Po | TIBETAN MARK NYIS SHAD |
| Exclude | U+0F0F | U+0F0F | Po | TIBETAN MARK TSHEG SHAD |
| Exclude | U+0F10 | U+0F10 | Po | TIBETAN MARK NYIS TSHEG SHAD |
| Exclude | U+0F11 | U+0F11 | Po | TIBETAN MARK RIN CHEN SPUNGS |
| | | | | SHAD |
| Exclude | U+0F12 | U+0F12 | Po | TIBETAN MARK RGYA GRAM SHAD |
| Exclude | U+0F13 | U+0F13 | So | TIBETAN MARK CARET -DZUD |
| | | | | RTAGS ME LONG CAN |
|
Exclude | U+0F14 | U+0F14 | So | TIBETAN MARK GTER TSHEG |
| Exclude | U+0F15 | U+0F15 | So | TIBETAN LOGOTYPE SIGN CHAD |
| | | | | RTAGS |
| Exclude | U+0F16 | U+0F16 | So | TIBETAN LOGOTYPE SIGN LHAG |
| | | | | RTAGS |
| Exclude | U+0F17 | U+0F17 | So | TIBETAN ASTROLOGICAL SIGN |
| | | | | SGRA GCAN -CHAR RTAGS |
| Possibly | U+0F18 | U+0F18 | Mn | TIBETAN ASTROLOGICAL SIGN |
| not | | | | -KHYUD PA |
| Possibly | U+0F19 | U+0F19 | Mn | TIBETAN ASTROLOGICAL SIGN |
| not | | | | SDONG TSHUGS |
| Exclude | U+0F1A | U+0F1A | So | TIBETAN SIGN RDEL DKAR GCIG |
| Exclude | U+0F1B | U+0F1B | So | TIBETAN SIGN RDEL DKAR GNYIS |
| Exclude | U+0F1C | U+0F1C | So | TIBETAN SIGN RDEL DKAR GSUM |
| Exclude | U+0F1D | U+0F1D | So | TIBETAN SIGN RDEL NAG GCIG |
| Exclude | U+0F1E | U+0F1E | So | TIBETAN SIGN RDEL NAG GNYIS |
| Exclude | U+0F1F | U+0F1F | So | TIBETAN SIGN RDEL DKAR RDEL |
| | | | | NAG |
| Maybe | U+0F20 | U+0F20 | Nd | TIBETAN DIGIT ZERO |
| Maybe | U+0F21 | U+0F21 | Nd | TIBETAN DIGIT ONE |
| Maybe | U+0F22 | U+0F22 | Nd | TIBETAN DIGIT TWO |
| Maybe | U+0F23 | U+0F23 | Nd | TIBETAN DIGIT THREE |
| Maybe | U+0F24 | U+0F24 | Nd | TIBETAN DIGIT FOUR |
| Maybe | U+0F25 | U+0F25 | Nd | TIBETAN DIGIT FIVE |
| Maybe | U+0F26 | U+0F26 | Nd | TIBETAN DIGIT SIX |
| Maybe | U+0F27 | U+0F27 | Nd | TIBETAN DIGIT SEVEN |
| Maybe | U+0F28 | U+0F28 | Nd | TIBETAN DIGIT EIGHT |
| Maybe | U+0F29 | U+0F29 | Nd | TIBETAN DIGIT NINE |
| Exclude | U+0F2A | U+0F2A | No | TIBETAN DIGIT HALF ONE |
| Exclude | U+0F2B | U+0F2B | No | TIBETAN DIGIT HALF TWO |
| Exclude | U+0F2C | U+0F2C | No | TIBETAN DIGIT HALF THREE |
| Exclude | U+0F2D | U+0F2D | No | TIBETAN DIGIT HALF FOUR |
| Exclude | U+0F2E | U+0F2E | No | TIBETAN DIGIT HALF FIVE |
| Exclude | U+0F2F | U+0F2F | No | TIBETAN DIGIT HALF SIX |
| Exclude | U+0F30 | U+0F30 | No | TIBETAN DIGIT HALF SEVEN |
| Exclude | U+0F31 | U+0F31 | No
| TIBETAN DIGIT HALF EIGHT |
| Exclude | U+0F32 | U+0F32 | No | TIBETAN DIGIT HALF NINE |
| Exclude | U+0F33 | U+0F33 | No | TIBETAN DIGIT HALF ZERO |
| Exclude | U+0F34 | U+0F34 | So | TIBETAN MARK BSDUS RTAGS |
| Possibly | U+0F35 | U+0F35 | Mn | TIBETAN MARK NGAS BZUNG NYI |
| not | | | | ZLA |
| Exclude | U+0F36 | U+0F36 | So | TIBETAN MARK CARET -DZUD |
| | | | | RTAGS BZHI MIG CAN |
| Possibly | U+0F37 | U+0F37 | Mn | TIBETAN MARK NGAS BZUNG SGOR |
| not | | | | RTAGS |
| Exclude | U+0F38 | U+0F38 | So | TIBETAN MARK CHE MGO |
| Possibly | U+0F39 | U+0F39 | Mn | TIBETAN MARK TSA -PHRU |
| not | | | | |
| Exclude | U+0F3A | U+0F3A | Ps | TIBETAN MARK GUG RTAGS GYON |
| Exclude | U+0F3B | U+0F3B | Pe | TIBETAN MARK GUG RTAGS GYAS |
| Exclude | U+0F3C | U+0F3C | Ps | TIBETAN MARK ANG KHANG GYON |
| Exclude | U+0F3D | U+0F3D | Pe | TIBETAN MARK ANG KHANG GYAS |
| Maybe | U+0F3E | U+0F3E | Mc | TIBETAN SIGN YAR TSHES |
| Maybe | U+0F3F | U+0F3F | Mc | TIBETAN SIGN MAR TSHES |
| Maybe | U+0F40 | U+0F40 | Lo | TIBETAN LETTER KA |
| Maybe | U+0F41 | U+0F41 | Lo | TIBETAN LETTER KHA |
| Maybe | U+0F42 | U+0F42 | Lo | TIBETAN LETTER GA |
| Possibly | U+0F43 | U+0F42 | Lo Mn | TIBETAN LETTER GA |
| not | | | | |
| Maybe | U+0F44 | U+0F44 | Lo | TIBETAN LETTER NGA |
| Maybe | U+0F45 | U+0F45 | Lo | TIBETAN LETTER CA |
| Maybe | U+0F46 | U+0F46 | Lo | TIBETAN LETTER CHA |
| Maybe | U+0F47 | U+0F47 | Lo | TIBETAN LETTER JA |
| Exclude | U+0F48 | U+0F48 | Cn | |
| Maybe | U+0F49 | U+0F49 | Lo | TIBETAN LETTER NYA |
| Maybe | U+0F4A | U+0F4A | Lo | TIBETAN LETTER TTA |
| Maybe | U+0F4B | U+0F4B | Lo | TIBETAN LETTER TTHA |
| Maybe | U+0F4C | U+0F4C | Lo | TIBETAN LETTER DDA |
| Possibly | U+0F4D | U+0F4C | Lo Mn | TIBETAN LETTER DDA |
| not | | | | |
| Maybe | U+0F4E | U+0F4E | Lo | TIBETAN LETTER NNA |
| Maybe | U+0F4F | U+0F4F | Lo | TIBETAN LETTER TA |
| Maybe | U+0F50 | U+0F50 | Lo
| TIBETAN LETTER THA |
| Maybe | U+0F51 | U+0F51 | Lo | TIBETAN LETTER DA |
| Possibly | U+0F52 | U+0F51 | Lo Mn | TIBETAN LETTER DA |
| not | | | | |
| Maybe | U+0F53 | U+0F53 | Lo | TIBETAN LETTER NA |
| Maybe | U+0F54 | U+0F54 | Lo | TIBETAN LETTER PA |
| Maybe | U+0F55 | U+0F55 | Lo | TIBETAN LETTER PHA |
| Maybe | U+0F56 | U+0F56 | Lo | TIBETAN LETTER BA |
| Possibly | U+0F57 | U+0F56 | Lo Mn | TIBETAN LETTER BA |
| not | | | | |
| Maybe | U+0F58 | U+0F58 | Lo | TIBETAN LETTER MA |
| Maybe | U+0F59 | U+0F59 | Lo | TIBETAN LETTER TSA |
| Maybe | U+0F5A | U+0F5A | Lo | TIBETAN LETTER TSHA |
| Maybe | U+0F5B | U+0F5B | Lo | TIBETAN LETTER DZA |
| Possibly | U+0F5C | U+0F5B | Lo Mn | TIBETAN LETTER DZA |
| not | | | | |
| Maybe | U+0F5D | U+0F5D | Lo | TIBETAN LETTER WA |
| Maybe | U+0F5E | U+0F5E | Lo | TIBETAN LETTER ZHA |
| Maybe | U+0F5F | U+0F5F | Lo | TIBETAN LETTER ZA |
| Maybe | U+0F60 | U+0F60 | Lo | TIBETAN LETTER -A |
| Maybe | U+0F61 | U+0F61 | Lo | TIBETAN LETTER YA |
| Maybe | U+0F62 | U+0F62 | Lo | TIBETAN LETTER RA |
| Maybe | U+0F63 | U+0F63 | Lo | TIBETAN LETTER LA |
| Maybe | U+0F64 | U+0F64 | Lo | TIBETAN LETTER SHA |
| Maybe | U+0F65 | U+0F65 | Lo | TIBETAN LETTER SSA |
| Maybe | U+0F66 | U+0F66 | Lo | TIBETAN LETTER SA |
| Maybe | U+0F67 | U+0F67 | Lo | TIBETAN LETTER HA |
| Maybe | U+0F68 | U+0F68 | Lo | TIBETAN LETTER A |
| Possibly | U+0F69 | U+0F40 | Lo Mn | TIBETAN LETTER KA |
| not | | | | |
| Maybe | U+0F6A | U+0F6A | Lo | TIBETAN LETTER FIXED-FORM RA |
| Exclude | U+0F6B | U+0F6B | Cn | |
| Exclude | U+0F6C | U+0F6C | Cn | |
| Exclude | U+0F6D | U+0F6D | Cn | |
| Exclude | U+0F6E | U+0F6E | Cn | |
| Exclude | U+0F6F | U+0F6F | Cn | |
| Exclude | U+0F70 | U+0F70 | Cn | |
| Possibly | U+0F71 | U+0F71 | Mn | TIBETAN VOWEL SIGN AA |
| not | | | | |
| Possibly | U+0F72 | U+0F72 | Mn | TIBETAN VOWEL SIGN I |
| not | | | | |
| Possibly | U+0F73 |
U+0F71 | Mn | TIBETAN VOWEL SIGN AA |
| not | | | | |
| Possibly | U+0F74 | U+0F74 | Mn | TIBETAN VOWEL SIGN U |
| not | | | | |
| Possibly | U+0F75 | U+0F71 | Mn | TIBETAN VOWEL SIGN AA |
| not | | | | |
| Possibly | U+0F76 | U+0FB2 | Mn | TIBETAN SUBJOINED LETTER RA |
| not | | | | |
| Possibly | U+0F77 | U+0FB2 | Mn | TIBETAN SUBJOINED LETTER RA |
| not | | | | |
| Possibly | U+0F78 | U+0FB3 | Mn | TIBETAN SUBJOINED LETTER LA |
| not | | | | |
| Possibly | U+0F79 | U+0FB3 | Mn | TIBETAN SUBJOINED LETTER LA |
| not | | | | |
| Possibly | U+0F7A | U+0F7A | Mn | TIBETAN VOWEL SIGN E |
| not | | | | |
| Possibly | U+0F7B | U+0F7B | Mn | TIBETAN VOWEL SIGN EE |
| not | | | | |
| Possibly | U+0F7C | U+0F7C | Mn | TIBETAN VOWEL SIGN O |
| not | | | | |
| Possibly | U+0F7D | U+0F7D | Mn | TIBETAN VOWEL SIGN OO |
| not | | | | |
| Possibly | U+0F7E | U+0F7E | Mn | TIBETAN SIGN RJES SU NGA RO |
| not | | | | |
| Maybe | U+0F7F | U+0F7F | Mc | TIBETAN SIGN RNAM BCAD |
| Possibly | U+0F80 | U+0F80 | Mn | TIBETAN VOWEL SIGN REVERSED |
| not | | | | I |
| Possibly | U+0F81 | U+0F71 | Mn | TIBETAN VOWEL SIGN AA |
| not | | | | |
| Possibly | U+0F82 | U+0F82 | Mn | TIBETAN SIGN NYI ZLA NAA DA |
| not | | | | |
| Possibly | U+0F83 | U+0F83 | Mn | TIBETAN SIGN SNA LDAN |
| not | | | | |
| Possibly | U+0F84 | U+0F84 | Mn | TIBETAN MARK HALANTA |
| not | | | | |
| Exclude | U+0F85 | U+0F85 | Po | TIBETAN MARK PALUTA |
| Possibly | U+0F86 | U+0F86 | Mn | TIBETAN SIGN LCI RTAGS |
| not | | | | |
| Possibly | U+0F87 | U+0F87 | Mn | TIBETAN SIGN YANG RTAGS |
| not | | | | |
| Maybe | U+0F88 | U+0F88 | Lo | TIBETAN SIGN LCE TSA CAN |
| Maybe | U+0F89 | U+0F89 | Lo | TIBETAN SIGN MCHU CAN |
| Maybe | U+0F8A | U+0F8A | Lo | TIBETAN SIGN GRU CAN RGYINGS |
| Maybe | U+0F8B | U+0F8B | Lo | TIBETAN SIGN GRU MED RGYINGS |
| Exclude | U+0F8C | U+0F8C | Cn | |
| Exclude | U+0F8D | U+0F8D | Cn | |
| Exclude
| U+0F8E | U+0F8E | Cn | |
| Exclude | U+0F8F | U+0F8F | Cn | |
| Possibly | U+0F90 | U+0F90 | Mn | TIBETAN SUBJOINED LETTER KA |
| not | | | | |
| Possibly | U+0F91 | U+0F91 | Mn | TIBETAN SUBJOINED LETTER KHA |
| not | | | | |
| Possibly | U+0F92 | U+0F92 | Mn | TIBETAN SUBJOINED LETTER GA |
| not | | | | |
| Possibly | U+0F93 | U+0F92 | Mn | TIBETAN SUBJOINED LETTER GA |
| not | | | | |
| Possibly | U+0F94 | U+0F94 | Mn | TIBETAN SUBJOINED LETTER NGA |
| not | | | | |
| Possibly | U+0F95 | U+0F95 | Mn | TIBETAN SUBJOINED LETTER CA |
| not | | | | |
| Possibly | U+0F96 | U+0F96 | Mn | TIBETAN SUBJOINED LETTER CHA |
| not | | | | |
| Possibly | U+0F97 | U+0F97 | Mn | TIBETAN SUBJOINED LETTER JA |
| not | | | | |
| Exclude | U+0F98 | U+0F98 | Cn | |
| Possibly | U+0F99 | U+0F99 | Mn | TIBETAN SUBJOINED LETTER NYA |
| not | | | | |
| Possibly | U+0F9A | U+0F9A | Mn | TIBETAN SUBJOINED LETTER TTA |
| not | | | | |
| Possibly | U+0F9B | U+0F9B | Mn | TIBETAN SUBJOINED LETTER |
| not | | | | TTHA |
| Possibly | U+0F9C | U+0F9C | Mn | TIBETAN SUBJOINED LETTER DDA |
| not | | | | |
| Possibly | U+0F9D | U+0F9C | Mn | TIBETAN SUBJOINED LETTER DDA |
| not | | | | |
| Possibly | U+0F9E | U+0F9E | Mn | TIBETAN SUBJOINED LETTER NNA |
| not | | | | |
| Possibly | U+0F9F | U+0F9F | Mn | TIBETAN SUBJOINED LETTER TA |
| not | | | | |
| Possibly | U+0FA0 | U+0FA0 | Mn | TIBETAN SUBJOINED LETTER THA |
| not | | | | |
| Possibly | U+0FA1 | U+0FA1 | Mn | TIBETAN SUBJOINED LETTER DA |
| not | | | | |
| Possibly | U+0FA2 | U+0FA1 | Mn | TIBETAN SUBJOINED LETTER DA |
| not | | | | |
| Possibly | U+0FA3 | U+0FA3 | Mn | TIBETAN SUBJOINED LETTER NA |
| not | | | | |
| Possibly | U+0FA4 | U+0FA4 | Mn | TIBETAN SUBJOINED LETTER PA |
| not | | | | |
| Possibly | U+0FA5 | U+0FA5 | Mn | TIBETAN SUBJOINED LETTER PHA |
| not | | | | |
| Possibly | U+0FA6 | U+0FA6 | Mn | TIBETAN SUBJOINED LETTER BA |
| not |
| | | |
| Possibly | U+0FA7 | U+0FA6 | Mn | TIBETAN SUBJOINED LETTER BA |
| not | | | | |
| Possibly | U+0FA8 | U+0FA8 | Mn | TIBETAN SUBJOINED LETTER MA |
| not | | | | |
| Possibly | U+0FA9 | U+0FA9 | Mn | TIBETAN SUBJOINED LETTER TSA |
| not | | | | |
| Possibly | U+0FAA | U+0FAA | Mn | TIBETAN SUBJOINED LETTER |
| not | | | | TSHA |
| Possibly | U+0FAB | U+0FAB | Mn | TIBETAN SUBJOINED LETTER DZA |
| not | | | | |
| Possibly | U+0FAC | U+0FAB | Mn | TIBETAN SUBJOINED LETTER DZA |
| not | | | | |
| Possibly | U+0FAD | U+0FAD | Mn | TIBETAN SUBJOINED LETTER WA |
| not | | | | |
| Possibly | U+0FAE | U+0FAE | Mn | TIBETAN SUBJOINED LETTER ZHA |
| not | | | | |
| Possibly | U+0FAF | U+0FAF | Mn | TIBETAN SUBJOINED LETTER ZA |
| not | | | | |
| Possibly | U+0FB0 | U+0FB0 | Mn | TIBETAN SUBJOINED LETTER -A |
| not | | | | |
| Possibly | U+0FB1 | U+0FB1 | Mn | TIBETAN SUBJOINED LETTER YA |
| not | | | | |
| Possibly | U+0FB2 | U+0FB2 | Mn | TIBETAN SUBJOINED LETTER RA |
| not | | | | |
| Possibly | U+0FB3 | U+0FB3 | Mn | TIBETAN SUBJOINED LETTER LA |
| not | | | | |
| Possibly | U+0FB4 | U+0FB4 | Mn | TIBETAN SUBJOINED LETTER SHA |
| not | | | | |
| Possibly | U+0FB5 | U+0FB5 | Mn | TIBETAN SUBJOINED LETTER SSA |
| not | | | | |
| Possibly | U+0FB6 | U+0FB6 | Mn | TIBETAN SUBJOINED LETTER SA |
| not | | | | |
| Possibly | U+0FB7 | U+0FB7 | Mn | TIBETAN SUBJOINED LETTER HA |
| not | | | | |
| Possibly | U+0FB8 | U+0FB8 | Mn | TIBETAN SUBJOINED LETTER A |
| not | | | | |
| Possibly | U+0FB9 | U+0F90 | Mn | TIBETAN SUBJOINED LETTER KA |
| not | | | | |
| Possibly | U+0FBA | U+0FBA | Mn | TIBETAN SUBJOINED LETTER |
| not | | | | FIXED-FORM WA |
| Possibly | U+0FBB | U+0FBB | Mn | TIBETAN SUBJOINED LETTER |
| not | | | | FIXED-FORM YA |
| Possibly | U+0FBC | U+0FBC | Mn | TIBETAN SUBJOINED LETTER |
| not | | | | FIXED-FORM RA |
| Exclude | U+0FBD | U+0FBD | Cn | |
| Exclude | U+0FBE
| U+0FBE | So | TIBETAN KU RU KHA |
| Exclude | U+0FBF | U+0FBF | So | TIBETAN KU RU KHA BZHI MIG |
| | | | | CAN |
| Exclude | U+0FC0 | U+0FC0 | So | TIBETAN CANTILLATION SIGN |
| | | | | HEAVY BEAT |
| Exclude | U+0FC1 | U+0FC1 | So | TIBETAN CANTILLATION SIGN |
| | | | | LIGHT BEAT |
| Exclude | U+0FC2 | U+0FC2 | So | TIBETAN CANTILLATION SIGN |
| | | | | CANG TE-U |
| Exclude | U+0FC3 | U+0FC3 | So | TIBETAN CANTILLATION SIGN |
| | | | | SBUB -CHAL |
| Exclude | U+0FC4 | U+0FC4 | So | TIBETAN SYMBOL DRIL BU |
| Exclude | U+0FC5 | U+0FC5 | So | TIBETAN SYMBOL RDO RJE |
| Possibly | U+0FC6 | U+0FC6 | Mn | TIBETAN SYMBOL PADMA GDAN |
| not | | | | |
| Exclude | U+0FC7 | U+0FC7 | So | TIBETAN SYMBOL RDO RJE RGYA |
| | | | | GRAM |
| Exclude | U+0FC8 | U+0FC8 | So | TIBETAN SYMBOL PHUR PA |
| Exclude | U+0FC9 | U+0FC9 | So | TIBETAN SYMBOL NOR BU |
| Exclude | U+0FCA | U+0FCA | So | TIBETAN SYMBOL NOR BU NYIS |
| | | | | -KHYIL |
| Exclude | U+0FCB | U+0FCB | So | TIBETAN SYMBOL NOR BU GSUM |
| | | | | -KHYIL |
| Exclude | U+0FCC | U+0FCC | So | TIBETAN SYMBOL NOR BU BZHI |
| | | | | -KHYIL |
| Exclude | U+0FCD | U+0FCD | Cn | |
| Exclude | U+0FCE | U+0FCE | Cn | |
| Exclude | U+0FCF | U+0FCF | So | TIBETAN SIGN RDEL NAG GSUM |
| Exclude | U+0FD0 | U+0FD0 | Cn | TIBETAN MARK BSKA- SHOG GI |
| | | | | MGO RGYAN |
| Exclude | U+0FD1 | U+0FD1 | Cn | TIBETAN MARK MNYAM YIG GI |
| | | | | MGO RGYAN |
| Exclude | U+0FD2 | U+0FD2 | Cn | |
| Exclude | U+0FD3 | U+0FD3 | Cn | |
| Exclude | U+0FD4 | U+0FD4 | Cn | |
| Exclude | U+0FD5 | U+0FD5 | Cn | |
| Exclude | U+0FD6 | U+0FD6 | Cn | |
| Exclude | U+0FD7 | U+0FD7 | Cn | |
| Exclude | U+0FD8 | U+0FD8 | Cn | |
| Exclude | U+0FD9 | U+0FD9 | Cn | |
| Exclude | U+0FDA | U+0FDA | Cn | |
| Exclude | U+0FDB | U+0FDB | Cn | |
| Exclude | U+0FDC | U+0FDC | Cn | |
| Exclude | U+0FDD | U+0FDD | Cn | |
| Exclude | U+0FDE | U+0FDE | Cn | |
| Exclude
| U+0FDF | U+0FDF | Cn | |
| Exclude | U+0FE0 | U+0FE0 | Cn | |
| Exclude | U+0FE1 | U+0FE1 | Cn | |
| Exclude | U+0FE2 | U+0FE2 | Cn | |
| Exclude | U+0FE3 | U+0FE3 | Cn | |
| Exclude | U+0FE4 | U+0FE4 | Cn | |
| Exclude | U+0FE5 | U+0FE5 | Cn | |
| Exclude | U+0FE6 | U+0FE6 | Cn | |
| Exclude | U+0FE7 | U+0FE7 | Cn | |
| Exclude | U+0FE8 | U+0FE8 | Cn | |
| Exclude | U+0FE9 | U+0FE9 | Cn | |
| Exclude | U+0FEA | U+0FEA | Cn | |
| Exclude | U+0FEB | U+0FEB | Cn | |
| Exclude | U+0FEC | U+0FEC | Cn | |
| Exclude | U+0FED | U+0FED | Cn | |
| Exclude | U+0FEE | U+0FEE | Cn | |
| Exclude | U+0FEF | U+0FEF | Cn | |
| Exclude | U+0FF0 | U+0FF0 | Cn | |
| Exclude | U+0FF1 | U+0FF1 | Cn | |
| Exclude | U+0FF2 | U+0FF2 | Cn | |
| Exclude | U+0FF3 | U+0FF3 | Cn | |
| Exclude | U+0FF4 | U+0FF4 | Cn | |
| Exclude | U+0FF5 | U+0FF5 | Cn | |
| Exclude | U+0FF6 | U+0FF6 | Cn | |
| Exclude | U+0FF7 | U+0FF7 | Cn | |
| Exclude | U+0FF8 | U+0FF8 | Cn | |
| Exclude | U+0FF9 | U+0FF9 | Cn | |
| Exclude | U+0FFA | U+0FFA | Cn | |
| Exclude | U+0FFB | U+0FFB | Cn | |
| Exclude | U+0FFC | U+0FFC | Cn | |
| Exclude | U+0FFD | U+0FFD | Cn | |
| Exclude | U+0FFE | U+0FFE | Cn | |
| Exclude | U+0FFF | U+0FFF | Cn | |
+----------+--------+--------+-------+------------------------------+

5.  Outstanding Issues

   As the listings and discussions above clearly show, the original
   objective of the IDNA work, that of avoiding codepoint-by-codepoint
   decisions in the IETF, is going to be difficult or impossible to
   achieve.  There are many cases in which DNS considerations require
   excluding a character because it carries too much risk of confusion
   or other security problems, while cultural and linguistic
   considerations argue for including it as, e.g., "necessary to write a
   particular language".
   No one is really in a good position to evaluate the tradeoffs in such
   situations.  However, because DNS stability and the integrity of
   references that use it are at stake, and because the characters must
   be processed using Internet application protocols, most of which are
   IETF responsibilities, it seems clear that the IETF is the best (or
   least bad) forum for doing so.

   These are also, probably fortunately, not decisions that must be
   resolved all at once.  In many cases, characters should probably be
   left on a "pending" list until a community of users of the language
   (not outsiders offering opinions) is able to come forward and discuss
   both its requirements for characters in IDNs and the potential issues
   involved in using those characters, especially if the application
   area requires interaction with, or sharing the same DNS zone with,
   other languages or scripts.

   So, for this work to be successful, the IETF will need to define a
   long-term plan for reviewing and adding scripts and characters.  That
   plan might involve an internal IETF effort or delegation to another
   body, but the issue must be addressed if we are going to be able to
   build on this work to move forward.

6.  IANA Considerations

   ...To be supplied.  This work will ultimately require registries of
   characters that are acceptable for use in IDNs.  See Section 5.

7.  Security Considerations

   The security issues associated with this work are discussed in
   [IDNA-issues].

8.  Contributors

   While the listed editors held the pen, this document represents the
   joint work and conclusions of an ad hoc design team.  In addition to
   the editors, the team consisted of Harald Alvestrand, Tina Dam, Cary
   Karp, and John Klensin.

9.  Acknowledgements

10.  References

10.1.
Normative References

   [RFC4690]  Klensin, J., Faltstrom, P., and C. Karp, "Review and
              Recommendations for Internationalized Domain Names
              (IDNs)", RFC 4690, September 2006.

   [Unicode5]
              The Unicode Consortium, "The Unicode Standard, Version
              5.0", Boston, MA, Addison-Wesley, ISBN 0-321-48091-0,
              2007.

   [idnabis]  Klensin, J., "Proposed Issues and Changes for IDNA - An
              Overview", Work in progress draft-klensin-..., October
              2006.

10.2.  Informative References

   [IDNA-bidi]
              Alvestrand, H., Ed. and C. Karp, "An IDNA problem in
              right-to-left scripts", October 2006.

   [IDNA-issues]
              Klensin, J., Ed., "Proposed Issues and Changes for IDNA -
              An Overview", October 2006.

   [RFC1035]  Mockapetris, P., "Domain names - implementation and
              specification", STD 13, RFC 1035, November 1987.

   [RFC3454]  Hoffman, P. and M. Blanchet, "Preparation of
              Internationalized Strings ("stringprep")", RFC 3454,
              December 2002.

   [RFC3491]  Hoffman, P. and M. Blanchet, "Nameprep: A Stringprep
              Profile for Internationalized Domain Names (IDN)",
              RFC 3491, March 2003.

   [RFC4713]  Lee, X., Mao, W., Chen, E., Hsu, N., and J. Klensin,
              "Registration and Administration Recommendations for
              Chinese Domain Names", RFC 4713, October 2006.

Author's Address

   Patrik Faltstrom (editor)
   Cisco Systems

   Email: paf@cisco.com

Full Copyright Statement

   Copyright (C) The Internet Society (2006).

   This document is subject to the rights, licenses and restrictions
   contained in BCP 78, and except as set forth therein, the authors
   retain all their rights.

   This document and the information contained herein are provided on an
   "AS IS" basis and THE CONTRIBUTOR, THE ORGANIZATION HE/SHE REPRESENTS
   OR IS SPONSORED BY (IF ANY), THE INTERNET SOCIETY AND THE INTERNET
   ENGINEERING TASK FORCE DISCLAIM ALL WARRANTIES, EXPRESS OR IMPLIED,
   INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE
   INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED
   WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.

Intellectual Property

   The IETF takes no position regarding the validity or scope of any
   Intellectual Property Rights or other rights that might be claimed to
   pertain to the implementation or use of the technology described in
   this document or the extent to which any license under such rights
   might or might not be available; nor does it represent that it has
   made any independent effort to identify any such rights.  Information
   on the procedures with respect to rights in RFC documents can be
   found in BCP 78 and BCP 79.

   Copies of IPR disclosures made to the IETF Secretariat and any
   assurances of licenses to be made available, or the result of an
   attempt made to obtain a general license or permission for the use of
   such proprietary rights by implementers or users of this
   specification can be obtained from the IETF on-line IPR repository at
   http://www.ietf.org/ipr.

   The IETF invites any interested party to bring to its attention any
   copyrights, patents or patent applications, or other proprietary
   rights that may cover technology that may be required to implement
   this standard.  Please address the information to the IETF at
   ietf-ipr@ietf.org.

Acknowledgment

   Funding for the RFC Editor function is provided by the IETF
   Administrative Support Activity (IASA).