CJK Unified Ideographs Extension A
Range: 3400–4DBF

This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 14.0.

This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard. See https://www.unicode.org/errata/ for an up-to-date list of errata.

See https://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See https://www.unicode.org/charts/PDF/Unicode-14.0/ for charts showing only the characters added in Unicode 14.0. See https://www.unicode.org/Public/14.0.0/charts/ for a complete archived file of character code charts for Unicode 14.0.

Disclaimer

These charts are provided as the online reference to the character contents of the Unicode Standard, Version 14.0 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 14.0, online at https://www.unicode.org/versions/Unicode14.0.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, #44, #45, and #50, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online.

See https://www.unicode.org/ucd/ and https://www.unicode.org/reports/

A thorough understanding of the information contained in these additional sources is required for a successful implementation.

Copying characters from the character code tables or list of character names is not recommended, because for production reasons the PDF files for the code charts cannot guarantee that the correct character codes will always be copied.

Fonts

The shapes of the reference glyphs used in these code charts are not prescriptive. Considerable variation is to be expected in actual fonts. The particular fonts used in these charts were provided to the Unicode Consortium by a number of different font designers, who own the rights to the fonts. See https://www.unicode.org/charts/fonts.html for a list.

Terms of Use

You may freely use these code charts for personal or internal business uses only. You may not incorporate them either wholly or in part into any product or publication, or otherwise distribute them without express written permission from the Unicode Consortium. However, you may provide links to these charts.

The fonts and font data used in production of these code charts may NOT be extracted, or used in any other way in any product or publication, without permission or license granted by the typeface owner(s).

The Unicode Consortium is not liable for errors or omissions in this file or the standard itself. Information on characters added to the Unicode Standard since the publication of the most recent version of the Unicode Standard, as well as on characters currently being considered for addition to the Unicode Standard can be found on the Unicode web site. See https://www.unicode.org/pending/pending.html and https://www.unicode.org/alloc/Pipeline.html.

Copyright © 1991-2021 Unicode, Inc. All rights reserved.
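Since characters copied from the PDF may not carry the correct codes, a small programmatic check can help before copied text is used. The following is a minimal Python sketch and not part of the Unicode charts: the function names `in_extension_a` and `check_copied` are hypothetical, and the bounds are simply the block range 3400–4DBF given in the title.

```python
# Block bounds taken from the chart title (CJK Unified Ideographs Extension A).
EXT_A_FIRST = 0x3400
EXT_A_LAST = 0x4DBF


def in_extension_a(ch: str) -> bool:
    """Return True if the single character ch lies in the range 3400-4DBF."""
    return EXT_A_FIRST <= ord(ch) <= EXT_A_LAST


def check_copied(ch: str, expected_hex: str) -> bool:
    """Return True if a copied character has the code point the chart lists.

    expected_hex is the code as printed in the chart, for example "3400".
    """
    return ord(ch) == int(expected_hex, 16)


if __name__ == "__main__":
    # Build the character from its code instead of copying it from the PDF.
    ch = chr(0x3400)
    print(f"U+{ord(ch):04X}", in_extension_a(ch), check_copied(ch, "3400"))
```

Building characters with `chr` from the printed code, rather than pasting glyphs, sidesteps the copy problem the disclaimer describes.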
CJK Unified Ideographs Extension A, 3400–3439

Code  Char  Radical-stroke  Sources
3400  㐀  ⼀ 1.4   GKX-0078.01 T6-222C JA-2121
3401  㐁  ⼀ 1.5   G5-3024 T4-2224 K3-2121
3402  㐂  ⼀ 1.5   JA3-2E23
3403  㐃  ⼁ 2.2   K3-2122
3404  㐄  ⼁ 2.2   GKX-0079.02 T6-2130 JA-2123
3405  㐅  ⼃ 4.1   GKX-0081.18 T6-2123 JA-2124
3406  㐆  ⼃ 4.5   G5-3076 TF-216C J4-212D
3407  㐇  ⼄ 5.2   K3-2123
3408  㐈  ⼄ 5.2   K3-2124
3409  㐉  ⼄ 5.2   K3-2125
340A  㐊  ⼄ 5.3   K3-2126
340B  㐋  ⼄ 5.3   K3-2127
340C  㐌  ⼄ 5.4   G3-302B T4-2157 K6-1000 V2-8874
340D  㐍  ⼄ 5.4   K3-2128
340E  㐎  ⼄ 5.4   K3-2129
340F  㐏  ⼄ 5.4   K3-212A
3410  㐐  ⼄ 5.5   K3-212B
3411  㐑  ⼄ 5.5   K3-212C
3412  㐒  ⼄ 5.5   K3-212D
3413  㐓  ⼄ 5.5   K3-212E
3414  㐔  ⼄ 5.5   K3-212F
3415  㐕  ⼄ 5.5   K3-2130
3416  㐖  ⼄ 5.6   G3-3032 T4-2336
3417  㐗  ⼄ 5.6   K3-2131
3418  㐘  ⼄ 5.6   K3-2132
3419  㐙  ⼄ 5.7   K3-2133
341A  㐚  ⼄ 5.7   K3-2134
341B  㐛  ⼄ 5.7   K3-2135
341C  㐜  ⼄ 5.8   G3-3024 T4-2835 K3-2136
341D  㐝  ⼄ 5.8   K3-2137
341E  㐞  ⼄ 5.8   K3-2138
341F  㐟  ⼄ 5.8   K3-2139
3420  㐠  ⼄ 5.8   K3-213A
3421  㐡  ⼄ 5.10  GKX-0084.31 T3-343B
3422  㐢  ⼄ 5.10  K3-213C
3423  㐣  ⼄ 5.10  K3-213B
3424  㐤  ⼄ 5.11  GHZ-10263.07 T3-396D
3425  㐥  ⼄ 5.15  K3-213D
3426  㐦  ⼄ 5.18  K3-213E
3427  㐧  ⼅ 6.3   JA-2125
3428  㐨  ⼅ 6.7   G5-3044 T3-2741
3429  㐩  ⼆ 7.6   GHZ-10023.01 T3-286C
342A  㐪  ⼇ 8.4   JA-2126
342B  㐫  ⼇ 8.4   GHZ-10283.02 T3-2323
342C  㐬  ⼇ 8.5   G5-334D T4-2337 JA4-2132
342D  㐭  ⼇ 8.6   GKX-0089.01 T4-2534 JA-2128
342E  㐮  ⼇ 8.11  GHZ-10291.08 T3-4034 JA4-2133
342F  㐯  ⼇ 8.15  GHZ-10294.02 JA-212A
3430  㐰  ⼈ 9.3   GKX-0092.08 T4-2159 JA-212B
3431  㐱  ⼈ 9.3   G5-313D T3-2175 V2-8875
3432  㐲  ⼈ 9.3   GKX-0093.08 T3-216E K3-213F
3433  㐳  ⼈ 9.3   GKX-0094.03 T3-2171
3434  㐴  ⼈ 9.3   GKX-0094.05 T3-2173
3435  㐵  ⼈ 9.3   GS-2269 H-9277
3436  㐶  ⼈ 9.3   TF-2144 JA-212C
3437  㐷  ⼈ 9.3   G7-2326
3438  㐸  ⼈ 9.4   G7-2321 T6-234E
3439  㐹  ⼈ 9.4   GKX-0094.14 T3-2271

CJK Unified Ideographs Extension A, 343A–346F

Code  Char  Radical-stroke  Sources
343A  㐺  ⼈ 9.4   GKX-0095.22 T4-2231 JA-212D K3-2144
343B  㐻  ⼈ 9.4   G7-2327 T3-2269 V2-8876
343C  㐼  ⼈ 9.4   G5-3156 T3-226A
343D  㐽  ⼈ 9.4   G7-2325
343E  㐾  ⼈ 9.4   G7-2328 TF-2172
343F  㐿  ⼈ 9.4   K3-2142
3440  㑀  ⼈ 9.4   K3-2143 H-96DF
3441  㑁  ⼈ 9.5   GKX-0096.15 T5-2334 JA-212E
3442  㑂  ⼈ 9.5   GKX-0096.19 T3-244A JA-212F
3443  㑃  ⼈ 9.5   G5-316F T3-2447 K3-2145
3444  㑄  ⼈ 9.5   GKX-0098.02 T3-244D
3445  㑅  ⼈ 9.5   GHZ-10136.01 T6-2571 JA-2130
3446  㑆  ⼈ 9.5   GS-2268
3447  㑇  ⼈ 9.5   G7-223F
3448  㑈  ⼈ 9.5   G7-2323
3449  㑉  ⼈ 9.6   GKX-0100.05 T3-2746
344A  㑊  ⼈ 9.6   GKX-0100.20 T5-2525 K3-2146 H-8CF4
344B  㑋  ⼈ 9.6   G3-3122 T4-2539
344C  㑌  ⼈ 9.6   G5-3170 T4-253C K3-2147 H-89D5
344D  㑍  ⼈ 9.6   GKX-0102.11 T3-2745 V2-6E49
344E  㑎  ⼈ 9.6   G7-232A T5-252B
344F  㑏  ⼈ 9.6   GKX-0102.23 T4-253B V2-6E4B
3450  㑐  ⼈ 9.6   G3-307C T4-2538 JA-2131
3451  㑑  ⼈ 9.6   T3-2750
3452  㑒  ⼈ 9.6   JA-2132
3453  㑓  ⼈ 9.6   TF-254A JA-2133
3454  㑔  ⼈ 9.6   G7-2322
3455  㑕  ⼈ 9.6   G7-2324
3456  㑖  ⼈ 9.6   K3-2148
3457  㑗  ⼈ 9.7   G5-3223 T3-2B31
3458  㑘  ⼈ 9.7   GKX-0103.12 T6-2E5A JA-2134
3459  㑙  ⼈ 9.7   G3-3135 T4-2839
345A  㑚  ⼈ 9.7   GKX-0103.29 T3-2B30
345B  㑛  ⼈ 9.7   GS-226C T5-2821
345C  㑜  ⼈ 9.7   G3-313E T4-283A
345D  㑝  ⼈ 9.7   GKX-0105.06 T3-2B2A
345E  㑞  ⼈ 9.7   G5-3226 T4-2837 JA-2135
345F  㑟  ⼈ 9.7   GKX-0105.22 T4-283C
3460  㑠  ⼈ 9.7   G7-232C
3461  㑡  ⼈ 9.7   K3-2149
3462  㑢  ⼈ 9.7   G3-3134
3463  㑣  ⼈ 9.8   GS-226D T5-2B6C K3-214A
3464  㑤  ⼈ 9.9   GKX-0106.15 T3-3449 JA-213A H-93CD
3465  㑥  ⼈ 9.8   G5-3231 T3-2F52
3466  㑦  ⼈ 9.8   G5-323E T4-2B65 JA-2136 K3-214B
3467  㑧  ⼈ 9.8   GHZ-10179.07 T6-3538 JA-2137 K3-214C
3468  㑨  ⼈ 9.8   JA4-215E
3469  㑩  ⼈ 9.8   GS-226F
346A  㑪  ⼈ 9.8   JA4-2156
346B  㑫  ⼈ 9.8   G7-232B V0-3034
346C  㑬  ⼈ 9.8   K3-214D
346D  㑭  ⼈ 9.8   K3-214E
346E  㑮  ⼈ 9.9   G3-3132 T4-3045
346F  㑯  ⼈ 9.9   GKX-0111.13 T3-343E

CJK Unified Ideographs Extension A, 3470–34A8

Code  Char  Radical-stroke  Sources
3470  㑰  ⼈ 9.9   GKX-0111.16 T3-3448
3471  㑱  ⼈ 9.9   GHZ-10198.07 T4-304A K3-2150
3472  㑲  ⼈ 9.9   GKJ-00058 K3-214F
3473  㑳  ⼈ 9.10  G3-3070 T4-3638 K3-2153 H-9BDF
3474  㑴  ⼈ 9.10  G5-3260 T3-3973
3475  㑵  ⼈ 9.10  G5-325C T4-3632 K3-2151
3476  㑶  ⼈ 9.10  G3-3060 T4-3637
3477  㑷  ⼈ 9.10  GKX-0113.06 T6-4655 K3-2152
3478  㑸  ⼈ 9.10  GHZ-10201.02 JA-213B
3479  㑹  ⼈ 9.10  T6-497B JA-213C
347A  㑺  ⼈ 9.10  GHZ-10203.09 T3-3974 H-FA68
347B  㑻  ⼈ 9.11  GKX-0114.13 T3-4035
347C  㑼  ⼈ 9.11  GKX-0114.21 T3-4038
347D  㑽  ⼈ 9.11  GS-226A T3-403B H-89DA
347E  㑾  ⼈ 9.11  G5-3267 T4-3C2C K3-2154 H-8F59
347F  㑿  ⼈ 9.11  G3-3230 T4-3C2D
3480  㒀  ⼈ 9.11  G3-322D T4-3C28 K3-2155
3481  㒁  ⼈ 9.11  G3-3227 T4-3C2A K3-2156
3482  㒂  ⼈ 9.12  G7-232D T5-4457
3483  㒃  ⼈ 9.12  GKX-0116.09 T4-4237 K3-2158
3484  㒄  ⼈ 9.12  G3-3235 T4-4233
3485  㒅  ⼈ 9.12  G5-3270 T4-422E K3-2159
3486  㒆  ⼈ 9.12  G5-326E T3-4578
3487  㒇  ⼈ 9.12  GKX-0117.13 T6-5A73 K3-215A
3488  㒈  ⼈ 9.12  G3-322E T4-422F
3489  㒉  ⼈ 9.12  GKX-0117.22 T3-4573
348A  㒊  ⼈ 9.12  G3-3244 T4-4F59
348B  㒋  ⼈ 9.12  GKX-0117.28 T3-4572
348C  㒌  ⼈ 9.12  GKX-0118.03 T3-456F
348D  㒍  ⼈ 9.12  G5-3272 T3-4577
348E  㒎  ⼈ 9.12  G3-3231 T4-4232
348F  㒏  ⼈ 9.12  K3-215B
3490  㒐  ⼈ 9.12  K3-215C
3491  㒑  ⼈ 9.13  G3-3072 T4-487B
3492  㒒  ⼈ 9.13  G5-3165 T3-4B26 J4-217E
3493  㒓  ⼈ 9.13  G5-3174 T3-4B25 K3-215D H-89DB
3494  㒔  ⼈ 9.13  G3-323F T4-487D K3-215E
3495  㒕  ⼈ 9.13  GKX-0119.14 T3-4B24
3496  㒖  ⼈ 9.13  GHZ-10217.11 T3-4B28 H-8F5D
3497  㒗  ⼈ 9.13  T3-4B2A
3498  㒘  ⼈ 9.13  G3-3162
3499  㒙  ⼈ 9.14  GKX-0119.25 T3-5032 V2-6E65
349A  㒚  ⼈ 9.14  G3-3243 T4-4F5D
349B  㒛  ⼈ 9.14  GKX-0119.36 T5-5359 K3-215F
349C  㒜  ⼈ 9.14  G5-3148 T4-4F5B K3-2160
349D  㒝  ⼈ 9.15  G5-327B T3-543F
349E  㒞  ⼈ 9.15  GKX-0120.19 T3-5753
349F  㒟  ⼈ 9.16  G3-3170 T4-5A67 K3-2162
34A0  㒠  ⼈ 9.16  G3-3248 T4-5A68
34A1  㒡  ⼈ 9.17  G3-3249 T4-5F49
34A2  㒢  ⼈ 9.17  K3-2164
34A3  㒣  ⼈ 9.17  K3-2165
34A4  㒤  ⼈ 9.18  G5-3253 T4-632F K3-2166
34A5  㒥  ⼈ 9.18  GKX-0121.33 T3-5C33 H-89DC
34A6  㒦  ⼈ 9.18  G5-3327 T3-5C31
34A7  㒧  ⼈ 9.19  G5-325D T7-5274 K3-2167
34A8  㒨  ⼈ 9.19  G5-3329 T7-574C

The Unicode Standard 14.0, Copyright © 1991-2021 Unicode, Inc. All rights reserved.
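Each chart entry pairs a code point with a radical-stroke index (1.4 for 3400, for example) and one or more source references (GKX-0078.01, T6-222C and JA-2121 for 3400). The following Python sketch shows one way to split these fields. The names `parse_radical_stroke`, `source_region` and `REGION_BY_PREFIX` are hypothetical, and the letter-to-region table is an assumption based on the usual IRG source prefixes, since this excerpt does not define them; only prefixes that occur in the excerpt are covered.

```python
from typing import Tuple

# Assumed mapping from the first letter of a source reference to the
# submitting region; the excerpt itself does not spell these out.
REGION_BY_PREFIX = {
    "G": "China",
    "H": "Hong Kong SAR",
    "T": "Taiwan",
    "J": "Japan",
    "K": "Republic of Korea",
    "V": "Vietnam",
}


def parse_radical_stroke(index: str) -> Tuple[int, int]:
    """Split an index such as "9.12" into (radical number, additional strokes)."""
    radical, strokes = index.split(".")
    return int(radical), int(strokes)


def source_region(ref: str) -> str:
    """Return the assumed region for a source reference such as "T6-222C"."""
    if ref.startswith("KP"):
        # KP references do not occur in this excerpt and are not covered here.
        return "unknown"
    return REGION_BY_PREFIX.get(ref[0], "unknown")


if __name__ == "__main__":
    # Entry 3400 as listed in the chart: index 1.4, three source references.
    print(parse_radical_stroke("1.4"))
    for ref in ["GKX-0078.01", "T6-222C", "JA-2121"]:
        print(ref, source_region(ref))
```

The index is split as text rather than parsed as a decimal number, because 9.1 and 9.10 denote different stroke counts and would collapse to the same value as floats.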