Draft Additional Repertoire for ISO/IEC 10646:2016 (5Th Edition)

Title: Draft additional repertoire for ISO/IEC 10646:2016 (5th edition) DIS Date: 2016-10-26 L2/15-170 WG2 N4728 Source: Michel Suignard, project editor Status: Project Editor's summary of the character repertoire addition as included in the DIS ballot Action: For review by WG2 and UTC experts Distribution: WG2 and UTC Replaces: Status This document presents a summary of all characters that constitute the tentative new repertoire for ISO/IEC 10646 5th edition, with code positions, representative glyphs and character names. Manner of Presentation The character names and code points shown are the same for Unicode and ISO/IEC 10646, including annotations. Note to Reviewers UTC/WG2 Reviewers, please use this document as a description of the result from the disposition of ballot comments for the CD.2. Contents This document lists 8447 characters. The following list shows all 26 blocks (existing or new) to which characters are proposed to be added, or which have been affected by other changes documented here. 0860-086F Syriac Supplement see document L2/15-088 0980-09FF Bengali See document: L2/15-161 L2/15-172 0A80-0AFF Gujarati see document L2/15-103 0D00-0D7F Malayalam See document L2/14-015R L2/14-292 L2/15-045 1CD0-1CFF Vedic Extensions See document: L2/15-160 1DC0-1DFF Combining Diacritical Marks Supplement See document L2/14-285 L2/15-173R 20A0-20CF Currency Symbols See document L2/15-299 2300-23FF Miscellaneous Technical See document L2/15-031 2B00-2BFF Miscellaneous Symbols and Arrows See document L2/15-083 2E00-2E7F Supplemental Punctuation See document L2/15-173 3100-312F Bopomofo See document WG2 N4695 4E00-9FEA CJK Unified Ideographs See document L2/14-228 IRG N2110 Res 45.5 10300-1032F Old Italic See document WG2 N4395 N4669 L2/12-386 L2/15-169 11400-1147F Newa See document L2/14-285 11A00-11A4F Zanabazar Square See document WG2 N4541 L2/14-028 11A50-11AAF Soyombo See document WG2 N4655 L2/15-004 11D00-11D5F Masaram Gondi see document L2/15-090 16FE0-16FFF Ideographic Symbols and Punctuation See document N4525 N4522 1B170-1B2FF Nushu See document WG2 N4472 N4693 N4697 L2/13-160 1E900-1E95F Adlam See document WG2 N4628 L2/14-219 1F100-1F1FF Enclosed Alphanumeric Supplement See document WG2 N4671 1F200-1F2FF Enclosed Ideographic Supplement see document L2/14-278R 1F300-1F5FF Miscellaneous Symbols and Pictographs see document L2/15-054R4 1F680-1F6FF Transport and Map Symbols see document L2/15-054R4 L2/15-195R 1F900-1F9FF Supplemental Symbols and Pictographs see document L2/15-054R4 L2/15-195R 2CBE0-2EBEF CJK Unified Ideographs Extension F See document WG2 N4580 0860 Syriac Supplement 086F 086 Syriac letters Used for writing Suriyani Malayalam, which is also known as Garshuni (Karshoni) and Syriac Malayalam 0 0860 SYRIAC LETTER MALAYALAM NGA 0860 0861 SYRIAC LETTER MALAYALAM JA 0862 SYRIAC LETTER MALAYALAM NYA 1 0863 SYRIAC LETTER MALAYALAM TTA 0864 SYRIAC LETTER MALAYALAM NNA 0861 0865 SYRIAC LETTER MALAYALAM NNNA 0866 SYRIAC LETTER MALAYALAM BHA 2 0867 SYRIAC LETTER MALAYALAM RA 0868 SYRIAC LETTER MALAYALAM LLA 0862 0869 SYRIAC LETTER MALAYALAM LLLA 086A SYRIAC LETTER MALAYALAM SSA 3 0863 4 0864 5 0865 6 0866 7 0867 8 0868 9 0869 A 086A B C D E F Printed: 26-May-2016 3 0980 Bengali 09FF 098 099 09A 09B 09C 09D 09E 09F 0 ঀ ঐ ঠ র $ী ৠ ৰ 0980 0990 09A0 09B0 09C0 09E0 09F0 1 $ঁ ড $ু ৡ ৱ 0981 09A1 09C1 09E1 09F1 2 $ং ঢ ল $ূ $ৢ ৲ 0982 09A2 09B2 09C2 09E2 09F2 3 $ঃ ও ণ $ৃ $ৣ ৳ 0983 0993 09A3 09C3 09E3 09F3 4 ঔ ত $ৄ ৴ 0994 09A4 09C4 09F4 5 অ ক থ ৵ 0985 0995 09A5 09F5 6 আ খ দ শ ০ ৶ 0986 0996 09A6 09B6 09E6 09F6 7 ই গ ধ ষ $ে $ৗ ১ ৷ 0987 0997 09A7 09B7 09C7 09D7 09E7 09F7 8 ঈ ঘ ন স $ৈ ২ ৸ 0988 0998 09A8 09B8 09C8 09E8 09F8 9 উ ঙ হ ৩ ৹ 0989 0999 09B9 09E9 09F9 A ঊ চ প ৪ ৺ 098A 099A 09AA 09EA 09FA B ঋ ছ ফ $ো ৫ ৻ 098B 099B 09AB 09CB 09EB 09FB C ঌ জ ব $় $ৌ ড় ৬ 098C 099C 09AC 09BC 09CC 09DC 09EC 09FC D ঝ ভ ঽ $্ ঢ় ৭ ॰ 099D 09AD 09BD 09CD 09DD 09ED 09FD E ঞ ম $া ৎ ৮ 099E 09AE 09BE 09CE 09EE F এ ট য $ি য় ৯ 098F 099F 09AF 09BF 09DF 09EF 4 Printed: 26-May-2016 0980 Bengali 09E3 The Bengali script is also known as Bangla. In Assam, the 09B3 " <reserved> preferred name of the script is Asamiya or Assamese. The 09B4 " <reserved> Assamese language has also been written historically using 09B5 " <reserved> distinct regional scripts known as Kamrupi. 09B6 শ BENGALI LETTER SHA 09B7 ষ BENGALI LETTER SSA Various signs 09B8 স BENGALI LETTER SA 0980 ঀ BENGALI ANJI 09B9 হ BENGALI LETTER HA = siddham, siddhirastu • used at the beginning of texts as an invocation Various signs 0981 $ঁ BENGALI SIGN CANDRABINDU 09BC $় BENGALI SIGN NUKTA 0982 $ং BENGALI SIGN ANUSVARA • for extending the alphabet to new letters 0983 $ঃ BENGALI SIGN VISARGA 09BD ঽ BENGALI SIGN AVAGRAHA Independent vowels Dependent vowel signs 0985 অ BENGALI LETTER A 09BE $া BENGALI VOWEL SIGN AA 0986 আ BENGALI LETTER AA 09BF $ি BENGALI VOWEL SIGN I 0987 ই BENGALI LETTER I • stands to the left of the consonant 0988 ঈ BENGALI LETTER II 09C0 $ী BENGALI VOWEL SIGN II 0989 উ BENGALI LETTER U 09C1 $ু BENGALI VOWEL SIGN U 098A ঊ BENGALI LETTER UU 09C2 $ূ BENGALI VOWEL SIGN UU 098B ঋ BENGALI LETTER VOCALIC R 09C3 $ৃ BENGALI VOWEL SIGN VOCALIC R 098C ঌ BENGALI LETTER VOCALIC L 09C4 $ৄ BENGALI VOWEL SIGN VOCALIC RR 098D " <reserved> 09C5 " <reserved> 098E " <reserved> 09C6 " <reserved> 098F এ BENGALI LETTER E 09C7 $ে BENGALI VOWEL SIGN E 0990 ঐ BENGALI LETTER AI • stands to the left of the consonant 0991 " <reserved> 09C8 $ৈ BENGALI VOWEL SIGN AI 0992 " <reserved> • stands to the left of the consonant 0993 ও BENGALI LETTER O Two-part dependent vowel signs 0994 ঔ BENGALI LETTER AU These vowel signs have glyph pieces which stand on both sides Consonants of the consonant; they follow the consonant in logical order, 0995 ক BENGALI LETTER KA and should be handled as a unit for most processing. 0996 খ BENGALI LETTER KHA 09CB $ো BENGALI VOWEL SIGN O 0997 গ BENGALI LETTER GA ≡ 09C7 $ে 09BE $া 0998 ঘ BENGALI LETTER GHA 09CC $ৌ BENGALI VOWEL SIGN AU 0999 ঙ BENGALI LETTER NGA ≡ 09C7 $ে 09D7 $ৗ 099A চ BENGALI LETTER CA Virama 099B ছ BENGALI LETTER CHA 09CD $্ BENGALI SIGN VIRAMA 099C জ BENGALI LETTER JA = hasant (Bengali term for halant) 099D ঝ BENGALI LETTER JHA 099E ঞ BENGALI LETTER NYA Additional consonant 099F ট BENGALI LETTER TTA 09CE ৎ BENGALI LETTER KHANDA TA 09A0 ঠ BENGALI LETTER TTHA • a dead consonant form of ta, without implicit 09A1 ড BENGALI LETTER DDA vowel, used in some sequences 09A2 ঢ BENGALI LETTER DDHA Sign 09A3 ণ BENGALI LETTER NNA 09D7 $ৗ BENGALI AU LENGTH MARK 09A4 ত BENGALI LETTER TA 09A5 থ BENGALI LETTER THA Additional consonants 09A6 দ BENGALI LETTER DA 09DC ড় BENGALI LETTER RRA 09A7 ধ BENGALI LETTER DHA ≡ 09A1 ড 09BC $় 09DD ঢ় BENGALI LETTER RHA 09A8 ন BENGALI LETTER NA ≡ ঢ $় 09A9 " <reserved> 09A2 09BC 09DE " <reserved> 09AA প BENGALI LETTER PA 09DF য় BENGALI LETTER YYA 09AB ফ BENGALI LETTER PHA ≡ য $় 09AC ব BENGALI LETTER BA 09AF 09BC = Bengali va, wa Additional vowels for Sanskrit 09AD ভ BENGALI LETTER BHA 09E0 ৠ BENGALI LETTER VOCALIC RR 09AE ম BENGALI LETTER MA 09E1 ৡ BENGALI LETTER VOCALIC LL 09AF য BENGALI LETTER YA 09E2 $ৢ BENGALI VOWEL SIGN VOCALIC L 09B0 র BENGALI LETTER RA 09E3 $ৣ BENGALI VOWEL SIGN VOCALIC LL 09B1 " <reserved> 09B2 ল BENGALI LETTER LA Printed: 26-May-2016 5 09E4 Bengali 09FD Reserved For viram punctuation, use the generic Indic 0964 and 0965. Note that these punctuation marks are referred to as dahri and double dahri in Bangla. 09E4 " <reserved> → 0964 । devanagari danda 09E5 " <reserved> → 0965 ॥ devanagari double danda Digits 09E6 ০ BENGALI DIGIT ZERO 09E7 ১ BENGALI DIGIT ONE 09E8 ২ BENGALI DIGIT TWO 09E9 ৩ BENGALI DIGIT THREE 09EA ৪ BENGALI DIGIT FOUR 09EB ৫ BENGALI DIGIT FIVE 09EC ৬ BENGALI DIGIT SIX 09ED ৭ BENGALI DIGIT SEVEN 09EE ৮ BENGALI DIGIT EIGHT 09EF ৯ BENGALI DIGIT NINE Additions for Assamese 09F0 ৰ BENGALI LETTER RA WITH MIDDLE DIAGONAL = Assamese letter ra 09F1 ৱ BENGALI LETTER RA WITH LOWER DIAGONAL = Assamese letter wa = bengali letter va with lower diagonal (1.0) Currency signs 09F2 ৲ BENGALI RUPEE MARK = taka • historic currency sign 09F3 ৳ BENGALI RUPEE SIGN = Bangladeshi taka Historic symbols for fractional values The use of these signs is not limited to currency, despite the character names. 09F4 ৴ BENGALI CURRENCY NUMERATOR ONE • not in current usage 09F5 ৵ BENGALI CURRENCY NUMERATOR TWO • not in current usage 09F6 ৶ BENGALI CURRENCY NUMERATOR THREE • not in current usage 09F7 ৷ BENGALI CURRENCY NUMERATOR FOUR 09F8 ৸ BENGALI CURRENCY NUMERATOR ONE LESS THAN THE DENOMINATOR 09F9 ৹ BENGALI CURRENCY DENOMINATOR SIXTEEN Sign 09FA ৺ BENGALI ISSHAR = ishvar • represents the name of a deity = svargiya • written before the name of a deceased person Historic currency sign 09FB ৻ BENGALI GANDA MARK Signs 09FC BENGALI LETTER VEDIC ANUSVARA • denotes a Vedic anusvara 09FD ॰ BENGALI ABBREVIATION SIGN 6 Printed: 26-May-2016 0A80 Gujarati 0AFF 0A8 0A9 0AA 0AB 0AC 0AD 0AE 0AF 0 ઐ ઠ ર $ી ૐૠ ૰ 0A90 0AA0 0AB0 0AC0 0AD0 0AE0 0AF0 1 $ઁઑ ડ $ુ ૡ ૱ 0A81 0A91 0AA1 0AC1 0AE1 0AF1 2 $ં ઢ લ $ૂ $ૢ 0A82 0AA2 0AB2 0AC2 0AE2 3 $ઃ ઓ ણ ળ $ૃ $ૣ 0A83 0A93 0AA3 0AB3 0AC3 0AE3 4 ઔ ત $ૄ 0A94 0AA4 0AC4 5 અ ક થ વ $ૅ 0A85 0A95 0AA5 0AB5 0AC5 6 આ ખ દ શ ૦ 0A86 0A96 0AA6 0AB6 0AE6 7 ઇ ગ ધ ષ $ે ૧ 0A87 0A97 0AA7 0AB7 0AC7 0AE7 8 ઈ ઘ ન સ $ૈ ૨ 0A88 0A98 0AA8 0AB8 0AC8 0AE8 9 ઉ ઙ હ $ૉ ૩ ૹ 0A89 0A99 0AB9 0AC9 0AE9 0AF9 A ઊ ચ પ ૪ $ૺ 0A8A 0A9A 0AAA 0AEA 0AFA B ઋ છ ફ $ો ૫ $ૻ 0A8B 0A9B 0AAB 0ACB 0AEB 0AFB C ઌ જ બ $઼ $ૌ ૬ $ૼ 0A8C 0A9C 0AAC 0ABC 0ACC 0AEC 0AFC D ઍ ઝ ભ ઽ $્ ૭ $૽ 0A8D 0A9D 0AAD 0ABD 0ACD 0AED 0AFD E ઞ મ $ા ૮ $૾ 0A9E 0AAE 0ABE 0AEE 0AFE F

Draft Additional Repertoire for ISO/IEC 10646:2016 (5Th Edition)

75 Characters Maximum

Know Your Keyboard Description Key 1,4 Join/Virama/Halant 2

Analysis of Comments for Telugu Script LGR Proposal for the Root Zone Revision: June 30, 2019

The Unicode Standard, Version 4.0--Online Edition

An Introduction to Indic Scripts

Roadmaps to ISO/IEC 10646 and the Unicode Standard

N4185 Preliminary Proposal to Encode Siddham in ISO/IEC 10646

Phonetics of Sgaw Karen in Thailand

A Barrier to Indic-Language Implementation of Unicode Is the Perception That Encoding Order in Unicode Is Equivalent to Lingui

Encoding Diversity for All the World's Languages

World Braille Usage, Third Edition

Iso/Iec Jtc1/Sc2/Wg2 N2383