Wg2 N5125 & L2/19-386

ISO/IEC JTC1/SC2/WG2 N5125 L2/19-386 Title: Proposal to change the code chart font for 21 blocks in Unicode Version 14.0 Author: Ken Lunde Date: 2019-12-02 This document proposes to change the code chart font for 21 blocks—affecting 1,762 characters—that are CJK-related or otherwise include characters that are used primarily in CJK con- texts and are therefore supported in fonts for typical CJK typefaces. The suggested target for this change is Unicode Version 14.0 (2021). The purpose of this font change is four-fold: • Reduce the number of fonts that are necessary for code chart production. • Improve the consistency of the glyphs for similar or related characters by using a uniform typeface design and typeface style, at least for characters whose glyphs do not need to ad- here to a particular specification.. • Simplify code chart font management by covering as many complete blocks as possible using a single font. • Ensure that the font is accessible via open source. The actual font, CJK Symbols, will be open-sourced in both TrueType and OpenType/CFF for- mats under the terms of the SIL Open Font License, Version 1.1, and therefore can be updated to include glyphs for additional blocks, or to add glyphs to blocks that are already supported. This is maintenance work that I plan to take on for the foreseeable future. Code Tables This and subsequent pages include left-to-right, top-to-bottom code tables that serve as a glyph synopsis for the CJK Symbols font. Some of these code tables include a special note at the end that points out a peculiarity or specifies an action that needs to be taken for code chart production. Of the 21 blocks that are covered, 19 are covered in their entirety as of the Unicode Version 13.0 repertoire. The two blocks that include only a subset of their characters have an appro- priate note inked in red. Block names are inked in blue, and serve as links to the perpetually- current code charts. Code Chart Excerpts The final 21 pages of this proposal include the current—Unicode Version 13.0 BETA—code chart excerpts for the 21 affected blocks. These can be used to draw a comparison between the current representative glyphs with those offered in the CJK Symbols font. Spacing Modifier Letters—only U+02EA & U+02EB 0 1 2 3 4 5 6 7 8 9 A B C D E F U+02E ˪ ˫ 1 Enclosed Alphanumerics 0 1 2 3 4 5 6 7 8 9 A B C D E F U+246 ① ② ③ ④ ⑤ ⑥ ⑦ ⑧ ⑨ ⑩ ⑪ ⑫ ⑬ ⑭ ⑮ ⑯ U+247 ⑰ ⑱ ⑲ ⑳ ⑴ ⑵ ⑶ ⑷ ⑸ ⑹ ⑺ ⑻ ⑼ ⑽ ⑾ ⑿ U+248 ⒀ ⒁ ⒂ ⒃ ⒄ ⒅ ⒆ ⒇ ⒈ ⒉ ⒊ ⒋ ⒌ ⒍ ⒎ ⒏ U+249 ⒐ ⒑ ⒒ ⒓ ⒔ ⒕ ⒖ ⒗ ⒘ ⒙ ⒚ ⒛ ⒜ ⒝ ⒞ ⒟ U+24A ⒠ ⒡ ⒢ ⒣ ⒤ ⒥ ⒦ ⒧ ⒨ ⒩ ⒪ ⒫ ⒬ ⒭ ⒮ ⒯ U+24B ⒰ ⒱ ⒲ ⒳ ⒴ ⒵ Ⓐ Ⓑ Ⓒ Ⓓ Ⓔ Ⓕ Ⓖ Ⓗ Ⓘ Ⓙ U+24C Ⓚ Ⓛ Ⓜ Ⓝ Ⓞ Ⓟ Ⓠ Ⓡ Ⓢ Ⓣ Ⓤ Ⓥ Ⓦ Ⓧ Ⓨ Ⓩ U+24D ⓐ ⓑ ⓒ ⓓ ⓔ ⓕ ⓖ ⓗ ⓘ ⓙ ⓚ ⓛ ⓜ ⓝ ⓞ ⓟ U+24E ⓠ ⓡ ⓢ ⓣ ⓤ ⓥ ⓦ ⓧ ⓨ ⓩ ⓪ ⓫ ⓬ ⓭ ⓮ ⓯ U+24F ⓰ ⓱ ⓲ ⓳ ⓴ ⓵ ⓶ ⓷ ⓸ ⓹ ⓺ ⓻ ⓼ ⓽ ⓾ ⓿ Dingbats—only U+2776 through U+2793 0 1 2 3 4 5 6 7 8 9 A B C D E F U+277 ❶ ❷ ❸ ❹ ❺ ❻ ❼ ❽ ❾ ❿ U+278 ➀ ➁ ➂ ➃ ➄ ➅ ➆ ➇ ➈ ➉ ➊ ➋ ➌ ➍ ➎ ➏ U+279 ➐ ➑ ➒ ➓ Ideographic Description Characters 0 1 2 3 4 5 6 7 8 9 A B C D E F U+2FF ⿰ ⿱ ⿲ ⿳ ⿴ ⿵ ⿶ ⿷ ⿸ ⿹ ⿺ ⿻ CJK Symbols and Punctuation 0 1 2 3 4 5 6 7 8 9 A B C D E F U+300 、。〃〄々〆〇〈〉《》「」『』 2 0 1 2 3 4 5 6 7 8 9 A B C D E F U+301 【】〒〓〔〕〖〗〘〙〚〛〜〝〞〟 U+302 〠〡〢〣〤〥〦〧〨〩〪〭〮〯〫〬 U+303 〰〱〲〳〴〵〶〷〸〹〺〻〼〽〾〿 Special Note: The two-em tall glyphs for U+3031 VERTICAL KANA REPEAT MARK and U+3032 VERTICAL KANA REPEAT WITH VOICED SOUND MARK should fit in the code chart proper, but I recommend that they be given separate annotations, as shown below, so that their glyphs do not collide on the subsequent pages that show character names and other metadata: implemented as a glyph that is two-em tall Hiragana 0 1 2 3 4 5 6 7 8 9 A B C D E F U+304 ぁあぃいぅうぇえぉおかがきぎく U+305 ぐけげこごさざしじすずせぜそぞた U+306 だちぢっつづてでとどなにぬねのは U+307 ばぱひびぴふぶぷへべぺほぼぽまみ U+308 むめもゃやゅゆょよらりるれろゎわ U+309 ゐゑをんゔゕゖ゙゚゛゜ゝゞゟ Katakana 0 1 2 3 4 5 6 7 8 9 A B C D E F U+30A ゠ァアィイゥウェエォオカガキギク U+30B グケゲコゴサザシジスズセゼソゾタ U+30C ダチヂッツヅテデトドナニヌネノハ U+30D バパヒビピフブプヘベペホボポマミ U+30E ムメモャヤュユョヨラリルレロヮワ 3 0 1 2 3 4 5 6 7 8 9 A B C D E F U+30F ヰヱヲンヴヵヶヷヸヹヺ・ーヽヾヿ Bopomofo 0 1 2 3 4 5 6 7 8 9 A B C D E F U+310 ㄅㄆㄇㄈㄉㄊㄋㄌㄍㄎㄏ U+311 ㄐㄑㄒㄓㄔㄕㄖㄗㄘㄙㄚㄛㄜㄝㄞㄟ U+312 ㄠㄡㄢㄣㄤㄥㄦㄧㄨㄩㄪㄫㄬㄭㄮㄯ Hangul Compatibility Jamo 0 1 2 3 4 5 6 7 8 9 A B C D E F U+313 ㄱ ㄲ ㄳ ㄴ ㄵ ㄶ ㄷ ㄸ ㄹ ㄺ ㄻ ㄼ ㄽ ㄾ ㄿ U+314 ㅀ ㅁ ㅂ ㅃ ㅄ ㅅ ㅆ ㅇ ㅈ ㅉ ㅊ ㅋ ㅌ ㅍ ㅎ ㅏ U+315 ㅐ ㅑ ㅒ ㅓ ㅔ ㅕ ㅖ ㅗ ㅘ ㅙ ㅚ ㅛ ㅜ ㅝ ㅞ ㅟ U+316 ㅠ ㅡ ㅢ ㅣ ㅤ ㅥ ㅦ ㅧ ㅨ ㅩ ㅪ ㅫ ㅬ ㅭ ㅮ ㅯ U+317 ㅰ ㅱ ㅲ ㅳ ㅴ ㅵ ㅶ ㅷ ㅸ ㅹ ㅺ ㅻ ㅼ ㅽ ㅾ ㅿ U+318 ㆀ ㆁ ㆂ ㆃ ㆄ ㆅ ㆆ ㆇ ㆈ ㆉ ㆊ ㆋ ㆌ ㆍ ㆎ Kanbun—Option A 0 1 2 3 4 5 6 7 8 9 A B C D E F U+319 ㆐ ㆑ ㆒ ㆓ ㆔ ㆕ ㆖ ㆗ ㆘ ㆙ ㆚ ㆛ ㆜ ㆝ ㆞ ㆟ Special Note: The glyphs for this block are shown in superscript-style for consistency with the current code charts. Kanbun—Option B 0 1 2 3 4 5 6 7 8 9 A B C D E F U+319 ㆐ ㆑ ㆒ ㆓ ㆔ ㆕ ㆖ ㆗ ㆘ ㆙ ㆚ ㆛ ㆜ ㆝ ㆞ ㆟ Special Note: The glyphs for this block are shown at full size to reflect current industry prac- tice. The Core Specification text for the Kanbun block would need to be adjusted. 4 Bopomofo Extended 0 1 2 3 4 5 6 7 8 9 A B C D E F U+31A ㆠ ㆡ ㆢ ㆣ ㆤ ㆥ ㆦ ㆧ ㆨ ㆩ ㆪ ㆫ ㆬ ㆭ ㆮ ㆯ U+31B ㆰ ㆱ ㆲ ㆳ ㆴ ㆵ ㆶ ㆷ ㆸ ㆹ ㆺ ㆻ ㆼ ㆽ ㆾ ㆿ CJK Strokes 0 1 2 3 4 5 6 7 8 9 A B C D E F U+31C ㇀㇁㇂㇃㇄㇅㇆㇇㇈㇉㇊㇋㇌㇍㇎㇏ U+31D ㇐㇑㇒㇓㇔㇕㇖㇗㇘㇙㇚㇛㇜㇝㇞㇟ U+31E ㇠㇡㇢㇣ Katakana Phonetic Extensions 0 1 2 3 4 5 6 7 8 9 A B C D E F U+31F ㇰㇱㇲㇳㇴㇵㇶㇷㇸㇹㇺㇻㇼㇽㇾㇿ Enclosed CJK Letters and Months 0 1 2 3 4 5 6 7 8 9 A B C D E F U+320 ㈀㈁㈂㈃㈄㈅㈆㈇㈈㈉㈊㈋㈌㈍㈎㈏ U+321 ㈐㈑㈒㈓㈔㈕㈖㈗㈘㈙㈚㈛㈜㈝㈞ U+322 ㈠㈡㈢㈣㈤㈥㈦㈧㈨㈩㈪㈫㈬㈭㈮㈯ U+323 ㈰㈱㈲㈳㈴㈵㈶㈷㈸㈹㈺㈻㈼㈽㈾㈿ U+324 ㉀㉁㉂㉃㉄㉅㉆㉇㉈㉉㉊㉋㉌㉍㉎㉏ U+325 ㉐㉑㉒㉓㉔㉕㉖㉗㉘㉙㉚㉛㉜㉝㉞㉟ U+326 ㉠㉡㉢㉣㉤㉥㉦㉧㉨㉩㉪㉫㉬㉭㉮㉯ U+327 ㉰㉱㉲㉳㉴㉵㉶㉷㉸㉹㉺㉻㉼㉽㉾㉿ U+328 ㊀㊁㊂㊃㊄㊅㊆㊇㊈㊉㊊㊋㊌㊍㊎㊏ 5 0 1 2 3 4 5 6 7 8 9 A B C D E F U+329 ㊐㊑㊒㊓㊔㊕㊖㊗㊘㊙㊚㊛㊜㊝㊞㊟ U+32A ㊠㊡㊢㊣㊤㊥㊦㊧㊨㊩㊪㊫㊬㊭㊮㊯ U+32B ㊰㊱㊲㊳㊴㊵㊶㊷㊸㊹㊺㊻㊼㊽㊾㊿ U+32C ㋀㋁㋂㋃㋄㋅㋆㋇㋈㋉㋊㋋㋌㋍㋎㋏ U+32D ㋐㋑㋒㋓㋔㋕㋖㋗㋘㋙㋚㋛㋜㋝㋞㋟ U+32E ㋠㋡㋢㋣㋤㋥㋦㋧㋨㋩㋪㋫㋬㋭㋮㋯ U+32F ㋰㋱㋲㋳㋴㋵㋶㋷㋸㋹㋺㋻㋼㋽㋾㋿ Special Note: The glyph for U+327F KOREAN STANDARD SYMBOL adheres to the specifications for that character. CJK Compatibility 0 1 2 3 4 5 6 7 8 9 A B C D E F U+330 ㌀㌁㌂㌃㌄㌅㌆㌇㌈㌉㌊㌋㌌㌍㌎㌏ U+331 ㌐㌑㌒㌓㌔㌕㌖㌗㌘㌙㌚㌛㌜㌝㌞㌟ U+332 ㌠㌡㌢㌣㌤㌥㌦㌧㌨㌩㌪㌫㌬㌭㌮㌯ U+333 ㌰㌱㌲㌳㌴㌵㌶㌷㌸㌹㌺㌻㌼㌽㌾㌿ U+334 ㍀㍁㍂㍃㍄㍅㍆㍇㍈㍉㍊㍋㍌㍍㍎㍏ U+335 ㍐㍑㍒㍓㍔㍕㍖㍗㍘㍙㍚㍛㍜㍝㍞㍟ U+336 ㍠㍡㍢㍣㍤㍥㍦㍧㍨㍩㍪㍫㍬㍭㍮㍯ U+337 ㍰㍱㍲㍳㍴㍵㍶㍷㍸㍹㍺㍻㍼㍽㍾㍿ U+338 ㎀㎁㎂㎃㎄㎅㎆㎇㎈㎉㎊㎋㎌㎍㎎㎏ U+339 ㎐㎑㎒㎓㎔㎕㎖㎗㎘㎙㎚㎛㎜㎝㎞㎟ U+33A ㎠㎡㎢㎣㎤㎥㎦㎧㎨㎩㎪㎫㎬㎭㎮㎯ 6 0 1 2 3 4 5 6 7 8 9 A B C D E F U+33B ㎰㎱㎲㎳㎴㎵㎶㎷㎸㎹㎺㎻㎼㎽㎾㎿ U+33C ㏀㏁㏂㏃㏄㏅㏆㏇㏈㏉㏊㏋㏌㏍㏎㏏ U+33D ㏐㏑㏒㏓㏔㏕㏖㏗㏘㏙㏚㏛㏜㏝㏞㏟ U+33E ㏠㏡㏢㏣㏤㏥㏦㏧㏨㏩㏪㏫㏬㏭㏮㏯ U+33F ㏰㏱㏲㏳㏴㏵㏶㏷㏸㏹㏺㏻㏼㏽㏾㏿ Vertical Forms 0 1 2 3 4 5 6 7 8 9 A B C D E F U+FE1 ︐ ︑ ︒ ︓ ︔ ︕ ︖ ︗ ︘ ︙ CJK Compatibility Forms 0 1 2 3 4 5 6 7 8 9 A B C D E F U+FE3 ︰︱︲︳︴︵︶︷︸︹︺︻︼︽︾︿ U+FE4 ﹀﹁﹂﹃﹄﹅﹆﹇﹈﹉﹊﹋﹌﹍﹎﹏ Small Form Variants 0 1 2 3 4 5 6 7 8 9 A B C D E F U+FE5 ﹐ ﹑ ﹒ ﹔ ﹕ ﹖ ﹗ ﹘ ﹙ ﹚ ﹛ ﹜ ﹝ ﹞ ﹟ U+FE6 ﹠ ﹡ ﹢ ﹣ ﹤ ﹥ ﹦ ﹨ ﹩ ﹪ ﹫ Halfwidth and Fullwidth Forms 0 1 2 3 4 5 6 7 8 9 A B C D E F U+FF0 ！＂＃＄％＆＇（）＊＋，－．／ U+FF1 ０１２３４５６７８９：；＜＝＞？ U+FF2 ＠ＡＢＣＤＥＦＧＨＩＪＫＬＭＮＯ U+FF3 ＰＱＲＳＴＵＶＷＸＹＺ［＼］＾＿ 7 0 1 2 3 4 5 6 7 8 9 A B C D E F U+FF4 ｀ａｂｃｄｅｆｇｈｉｊｋｌｍｎｏ U+FF5 ｐｑｒｓｔｕｖｗｘｙｚ｛｜｝～｟ U+FF6 ｠｡｢｣､･ｦｧｨｩｪｫｬｭｮｯ U+FF7 ｰｱｲｳｴｵｶｷｸｹｺｻｼｽｾｿ U+FF8 ﾀﾁﾂﾃﾄﾅﾆﾇﾈﾉﾊﾋﾌﾍﾎﾏ U+FF9 ﾐﾑﾒﾓﾔﾕﾖﾗﾘﾙﾚﾛﾜﾝﾞﾟ U+FFA ﾠﾡﾢﾣﾤﾥﾦﾧﾨﾩﾪﾫﾬﾭﾮﾯ U+FFB ﾰﾱﾲﾳﾴﾵﾶﾷﾸﾹﾺﾻﾼﾽﾾ U+FFC ￂￃￄￅￆￇￊￋￌￍￎￏ U+FFD ￒￓￔￕￖￗￚￛￜ U+FFE ￠￡￢￣￤￥￦￨￩￪￫￬￭￮ Enclosed Alphanumeric Supplement 0 1 2 3 4 5 6 7 8 9 A B C D E F U+1F10 U+1F11 U+1F12 U+1F13 U+1F14 U+1F15 U+1F16 U+1F17 8 0 1 2 3 4 5 6 7 8 9 A B C D E F U+1F18 U+1F19 U+1F1A U+1F1E U+1F1F Special Note: The glyphs for U+1F10D through U+1F10F and U+1F16D through U+1F16F ad- here to the specifications for those characters.

Wg2 N5125 & L2/19-386

Assessment of Options for Handling Full Unicode Character Encodings in MARC21 a Study for the Library of Congress

The Unicode Standard, Version 4.0--Online Edition

The Unicode Standard, Version 3.0, Issued by the Unicode Consor- Tium and Published by Addison-Wesley

Outline of the Course

Using the Unicode Standard for Linguistic Data: Preliminary Guidelines∗

Section 18.1, Han

10646-2CD US Comment

Unicode Groups

Greek Abcdefghijklmno

Unicodestandard the UNICODE CONSORTIUM .4Vaddison

The Unicode Standard Version 3.0

10.1 Han CJK Unified Ideographs