TYPE: Multiple-Byte Graphic Character Set REGISTRATION NUMBER: 233 DATE of REGISTRATION: 2004-04-13 ESCAPE SEQUENCE: G0

Total Page:16

File Type:pdf, Size:1020Kb

TYPE: Multiple-Byte Graphic Character Set REGISTRATION NUMBER: 233 DATE of REGISTRATION: 2004-04-13 ESCAPE SEQUENCE: G0 TYPE: REGISTRATION NUMBER: 233 Multiple-byte Graphic Character Set DATE OF REGISTRATION: 2004-04-13 ESCAPE SEQUENCE: G0: ESC 2/4 2/8 5/1 G1: ESC 2/4 2/9 5/1 G2: ESC 2/4 2/10 5/1 G3: ESC 2/4 2/11 5/1 C0: - C1: - NAME: Japanese Graphic Character Set for Information Interchange, Plane 1 (Update of ISO-IR 228) DESCRIPTION: A set of 8797 graphic characters contained in 94X94 character cell plane that can be invoked 7-bits or 8-bits double byte environment. This registration added 10 graphic characters to ISO-IR 228 and changed character names of 128 graphic characters for compatibility purpose with ISO/IEC 10646. Also, shapes of 168 graphic characters have been modified. (See attachment for details) SPONSOR: Japanese Industrial Standard Committee (JISC) ORIGIN: JIS X 0213:2000 Plane 1 and JIS X 0213/AMENDMENT 1: 2004 OWNER OF ORIGIN: JISC COPYRIGHT OWNER: JISC FIELD OF UTILIZATION: Information Interchange between Data processing systems (including their related devices) or Data communication Systems. Applicable for both Data and Document processing. NOTE: Background Information on JIS X 0213/Amd 1 is attached to this registration. 表1 漢字集合1面の符号表 1面 1 区 ~ 31 区(1 点 ~ 23 点) b700000000000000000000000 b611111111111111111111111 b500000000000000011111111 b400000001111111100000000 b300011110000111100001111 b201100110011001100110011 b110101010101010101010101 b7 b6 b5 b4 b3 b2 b1 1234567891011121314151617181920212223 0100001 1 !"#$%&'()*+,-./01234567 0100010 2 !"#$%&'()*+,-./01234567 0100011 3 !"#$%&'()*+,-./01234567 0100100 4 !"#$%&'()*+,-./01234567 0100101 5 !"#$%&'()*+,-./01234567 0100110 6 !"#$%&'()*+,-./01234567 0100111 7 !"#$%&'()*+,-./01234567 0101000 8 !"#$%&'()*+,-./01234567 0101001 9 !"#$%&'()*+,-./01234567 0101010 10 !"#$%&'()*+,-./01234567 0101011 11 !"#$%&'()*+,-./01234567 0101100 12 !"#$%&'()*+,-./01234567 0101101 13 !"#$%&'()*+,-./01234567 0101110 14 O "#$%&'()*+,-./01234567 0101111 15 !"#$%&'()*+,-./01234567 0110000 16 亜唖娃阿哀愛挨姶x 葵茜穐悪握渥旭葦2 鯵梓圧斡扱 0110001 17 院陰隠韻吋右宇烏羽) 雨卯鵜窺丑碓臼渦嘘唄欝蔚鰻 0110010 18 押旺横欧殴王翁- 鴬鴎黄岡沖荻億屋憶臆桶牡乙俺卸 0110011 19 魁 5 械海灰界皆絵芥L 開階貝凱劾外咳害崖慨概涯碍 0110100 20 粥刈苅瓦乾侃冠寒刊勘勧巻喚堪姦完官寛干幹患感慣 0110101 21 機帰毅気汽畿祈季稀紀? 規記貴起軌輝飢騎鬼亀偽儀 0110110 22 供侠僑兇競共凶協匡I 叫喬境峡強彊怯恐恭挟教橋況 0110111 23 掘窟沓靴轡窪熊隈P 栗繰桑鍬勲君薫訓群軍郡卦袈= 0111000 24 検権I 犬献研硯絹県肩見謙賢軒遣Z 険顕験鹸元原厳 0111001 25 后喉坑垢好孔孝宏工巧^ 幸広庚康弘恒慌抗拘控攻昂 0111010 26 此頃今困坤墾婚恨懇昏昆根梱混痕紺艮魂些佐g 唆嵯 0111011 27 察拶撮擦札殺i 雑皐% 捌 $ d 皿晒三傘参山惨撒散桟 0111100 28 次滋治爾璽痔磁示而耳自蒔辞汐鹿式識鴫竺軸宍雫七 0111101 29 宗就州修愁拾洲秀秋終繍習臭舟蒐衆襲讐蹴輯週p 酬 0111110 30 勝匠升召u 商唱嘗奨妾娼宵将小少尚庄床廠彰承抄招 0111111 31 拭植殖燭織職色触食} 辱尻伸信侵唇娠寝審心慎振新 表1 漢字集合1面の符号表 (続き) 1面 1 区 ~ 31 区(24 点 ~ 47 点) b7000000001111111111111111 b6111111110000000000000000 b5111111110000000000000000 b4111111110000000011111111 b3000011110000111100001111 b2001100110011001100110011 b1010101010101010101010101 b7 b6 b5 b4 b3 b2 b1 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 0100001 1 89:;<=>?@ABCDEFGHIJKLMNO 0100010 2 89:;<=>?@ABCDEFGHIJKLMNO 0100011 3 89:;<=>?@ABCDEFGHIJKLMNO 0100100 4 89:;<=>?@ABCDEFGHIJKLMNO 0100101 5 89:;<=>?@ABCDEFGHIJKLMNO 0100110 6 89:;<=>?@ABCDEFGHIJKLMNO 0100111 7 89:;<=>?@ABCDEFGHIJKLMNO 0101000 8 89:;<=>?@ABCDEFGHIJKLMNO 0101001 9 89:;<=>?@ABCDEFGHIJKLMNO 0101010 10 89:;<=>?@ABCDEFGHIJKLMNO 0101011 11 89:;<=>?@ABCDEFGHIJKLMNO 0101100 12 89:;<=>?@ABCDEFGHIJKLMNO 0101101 13 89:;<=>?@ABCDEFGHIJKLMNO 0101110 14 89:;<=>?@ABCDEFGHIJKLMNO 0101111 15 89:;<=>?@ABCDEFGHIJKLMNO 0110000 16 宛姐虻% 絢綾鮎或粟袷安庵按暗案闇鞍杏以伊位依偉囲 0110001 17 姥 : 浦瓜閏: 云運雲荏l 叡営嬰影映曳栄永泳洩瑛盈穎 0110010 18 恩温穏音下化仮何伽価佳加可嘉夏嫁家寡科暇果架歌河 0110011 19 蓋街該鎧骸浬馨蛙垣柿蛎鈎劃嚇各廓拡撹格核殻獲確穫 0110100 20 憾換敢柑桓棺款歓汗漢澗潅環甘監看竿管簡緩缶9 肝艦 0110101 21 妓宜戯技擬欺犠疑@ 義蟻誼議掬菊鞠吉吃喫桔橘詰砧杵 0110110 22 狂狭矯胸脅興蕎郷鏡響K 驚仰凝尭暁業局曲極玉桐粁M 0110111 23 係傾刑兄啓圭珪型契形径恵慶慧憩掲携敬景桂渓畦稽系 0111000 24 幻弦減源玄現絃舷言\ 限乎個古呼固姑孤己庫弧戸故枯 0111001 25 晃更杭校` 構江洪浩港溝甲皇硬稿糠紅紘絞綱耕考肯肱 0111010 26 左差査沙瑳砂詐鎖裟坐座挫債催再最哉塞妻宰彩才採栽 0111011 27 燦珊産算纂蚕讃賛酸< 斬暫残仕仔伺使刺司史嗣四士始 0111100 28 叱執失嫉室悉湿漆疾質実蔀篠偲柴芝屡蕊縞舎写射捨赦 0111101 29 集醜什住充十従戎柔汁渋獣縦重銃叔夙宿淑祝縮粛塾熟 0111110 30 掌捷昇昌昭晶松梢樟樵沼消渉湘焼焦照症省硝礁祥称章 0111111 31 晋森榛浸深申疹真神秦紳臣芯薪親診身辛進針震人仁刃 表1 漢字集合1面の符号表 (続き) 1面 1 区 ~ 31 区(48 点 ~ 71 点) b7111111111111111111111111 b6000000000000000011111111 b5111111111111111100000000 b4000000001111111100000000 b3000011110000111100001111 b2001100110011001100110011 b1010101010101010101010101 b7 b6 b5 b4 b3 b2 b1 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 0100001 1 PQRSTUVWXYZ[\]^_`abcdefg 0100010 2 PQRSTUVWXYZ[\]^_`abcdefg 0100011 3 PQRSTUVWXYZ[\]^_`abcdefg 0100100 4 PQRSTUVWXYZ[\]^_`abcdefg 0100101 5 PQRSTUVWXYZ[\]^_`abcdefg 0100110 6 PQRSTUVWXYZ[\]^_`abcdefg 0100111 7 PQRSTUVWXYZ[\]^_`abcdefg 0101000 8 PQRSTUVWXYZ[\]^_`abcdefg 0101001 9 PQRSTUVWXYZ[\]^_`abcdefg 0101010 10 PQRSTUVWXYZ[\]^_`abcdefg 0101011 11 PQRSTUVWXYZ[\]^_`abcdefg 0101100 12 PQRSTUVWXYZ[\]^_`abcdefg 0101101 13 PQRSTUVWXYZ[\]^_`abcdefg 0101110 14 PQRSTUVWXYZ[\]^_`abcdefg 0101111 15 PQRSTUVWXYZ[\]^_`abcdefg 0110000 16 夷委威尉惟意慰易椅為畏異移維緯胃萎衣謂違遺医井亥 0110001 17 頴英衛詠鋭液疫益駅悦謁越閲榎厭円園堰奄宴延怨掩援 0110010 18 火珂禍禾稼箇花苛茄荷華菓蝦課嘩貨0 過霞蚊俄峨我8 0110011 19 覚角赫較郭閣隔革学岳楽額顎掛笠樫橿梶鰍潟割喝恰括 0110100 20 莞観諌貫還鑑間閑関陥韓館舘丸含岸巌玩癌眼岩; 贋雁 0110101 21 黍却客脚虐逆丘久仇休及吸宮弓急救朽求C 泣 D 球究窮 0110110 22 勤均巾錦斤欣欽琴禁禽筋緊芹菌衿襟謹近金吟銀九倶句 0110111 23 経継繋罫茎荊蛍計詣警軽頚鶏芸迎鯨劇戟撃激T 桁傑欠 0111000 24 湖狐糊袴股胡菰虎誇跨鈷雇顧鼓五互伍午呉吾娯後御悟 0111001 25 腔 b 航荒行衡講貢購郊酵鉱砿鋼閤降項香高鴻剛劫号合 0111010 26 歳済災采犀砕砦祭斎細菜裁載際剤在材罪財冴坂阪堺h 0111011 27 姉姿子屍市師志思指支孜斯施旨枝止死氏獅祉私糸紙紫 0111100 28 斜煮社紗者謝車遮蛇邪借勺尺>=爵酌釈錫若寂弱惹主 0111101 29 出術述俊峻春瞬竣舜駿准循旬r 殉淳準潤盾純巡遵醇順 0111110 30 笑粧紹肖菖蒋蕉衝裳訟証詔詳象賞醤鉦鍾鐘障z 上丈丞 0111111 31 塵壬尋甚尽腎~ 迅陣靭笥諏須酢図厨W 吹垂帥推水炊睡 表1 漢字集合1面の符号表 (続き) 1面 1 区 ~ 31 区(72 点 ~ 94 点) b711111111111111111111111 b611111111111111111111111 b500000000111111111111111 b411111111000000001111111 b300001111000011110000111 b200110011001100110011001 b101010101010101010101010 b7 b6 b5 b4 b3 b2 b1 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 0100001 1 hijklmnopqrstuvwxyz{|}~ 0100010 2 hijklmnopqrstuvwxyz{|}~ 0100011 3 hijklmnopqrstuvwxyz{|}~ 0100100 4 hijklmnopqrstuvwxyz{|}~ 0100101 5 hijklmnopqrstuvwxyz{|}~ 0100110 6 hijklmnopqrstuvwxyz{|}~ 0100111 7 hijklmnopqrstuvwxyz{|}~ 0101000 8 hijklmnopqrstuvwxyz{|}~ 0101001 9 hijklmnopqrstuvwxyz{|}~ 0101010 10 hijklmnopqrstuvwxyz{|}~ 0101011 11 hijklmnopqrstuvwxyz{|}~ 0101100 12 hijklmnopqrstuvwxyz{|}~ 0101101 13 hijklmnopqrstuvwxyz{|}~ 0101110 14 hijklmnopqrstuvwxyz{|}~ 0101111 15 hijklmnopqrstuvwxyz{|}a 0110000 16 域育郁磯一壱& 逸稲k 芋 ' 允印咽員因姻引飲( 胤蔭 0110001 17 沿演炎焔煙燕猿縁艶苑薗遠鉛鴛塩於汚甥凹央奥往応 0110010 18 画臥芽蛾賀雅餓駕介会解回塊壊3 快怪悔4 懐戒拐改 0110011 19 活渇滑8 褐轄且鰹叶椛樺z 株兜竃蒲m 鎌噛鴨栢茅萱 0110100 20 頑顔願企伎危喜器基奇嬉寄岐希幾忌揮机旗既期棋棄 0110101 21 E 級糾給旧牛去居巨拒拠挙渠虚許距鋸漁禦魚亨享京 0110110 22 区狗玖矩苦躯駆駈駒具愚虞; 空偶寓遇隅串n 釧 ( 屈 0110111 23 決潔穴結血訣月件倹V 健兼券剣喧圏堅嫌建憲懸拳X 0111000 24 梧檎瑚碁語誤護醐乞鯉交佼侯候倖光公功効勾厚口向 0111001 25 壕拷濠豪轟麹克刻告国穀酷f 黒獄漉腰8 忽惚骨狛込 0111010 26 肴咲崎埼碕鷺作削咋搾昨朔柵窄策索錯桜鮭笹匙冊刷 0111011 27 肢脂至視詞詩試誌諮資賜雌飼歯事似侍児字寺慈持時 0111100 28 取守手朱殊狩珠種腫趣酒首儒受呪寿授樹綬需囚収周 0111101 29 処初所暑曙渚庶緒署書st諸助叙女序徐恕鋤除傷償 0111110 30 乗冗剰城場壌嬢常情擾条| 浄状畳穣蒸譲醸錠嘱埴飾 0111111 31 粋翠衰遂酔錐錘随瑞髄崇嵩数枢趨雛据杉椙菅頗雀裾 表1 漢字集合1面の符号表 (続き) 1面 32 区 ~ 63 区(1 点 ~ 23 点) b700000000000000000000000 b611111111111111111111111 b500000000000000011111111 b400000001111111100000000 b300011110000111100001111 b201100110011001100110011 b110101010101010101010101 b7 b6 b5 b4 b3 b2 b1 1234567891011121314151617181920212223 1000000 32 澄 x 寸世瀬畝是凄制勢姓征性成政整星晴棲栖正清牲 1000001 33 繊羨腺舛船薦+ 賎践選遷銭銑閃鮮前善漸然全禅繕膳 1000010 34 臓蔵贈造促側則即息捉束測足速俗属賊族続卒袖其* 1000011 35 叩但達辰奪脱巽竪N 棚谷狸鱈; 誰丹単嘆坦担探旦? 1000100 36 帖帳庁弔張彫徴懲挑暢朝潮牒町眺聴脹腸蝶調諜超跳 1000101 37 邸 K 釘鼎泥摘L 敵滴的笛適鏑M 哲徹撤轍迭鉄典填天 1000110 38 董蕩藤討謄豆踏逃透鐙陶頭騰闘働動同堂導憧撞洞瞳 1000111 39 如尿韮任妊忍認濡] 祢寧葱猫熱年念捻撚燃粘乃廼之 1001000 40 函箱硲B 肇筈櫨幡肌畑畠八鉢溌発醗髪伐罰抜筏閥鳩 1001001 41 鼻柊` 匹疋髭彦膝菱肘弼必畢筆g 桧姫媛紐百h 俵彪 1001010 42 福腹複覆淵弗払沸仏物鮒分吻噴墳憤扮焚奮粉糞紛雰 1001011 43 法泡烹砲縫胞芳萌y 蜂褒訪豊邦鋒飽鳳鵬乏亡傍剖坊 1001100 44 漫蔓味未魅巳箕岬密蜜湊蓑稔脈妙粍民眠務夢無牟矛 1001101 45 諭輸唯佑優勇友宥幽悠憂揖有柚湧涌猶' 由祐裕誘遊 1001110 46 痢裏裡里離陸律率立葎掠略劉流溜琉留硫粒隆竜龍侶 1001111 47 蓮連錬呂魯櫓炉賂路露労婁廊弄朗楼3 浪漏牢狼篭老 1010000 48 弌丐丕个丱丶丼丿乂乖乘亂亅豫亊舒弍于亞亟亠亢亰 1010001 49 僉僊傳僂僖僞僥僭僣僮價僵儉儁儂儖儕儔儚儡儺儷儼 1010010 50 辧劬劭劼劵勁勍勗勞勣勦飭勠勳勵勸勹匆匈甸匍匐匏 1010011 51 咫哂咤咾咼哘哥哦唏唔哽哮哭哺哢唹啀啣啌售啜啅啖 1010100 52 圈國圍圓團圖嗇圜圦圷圸坎圻址坏坩埀垈坡坿垉垓垠 1010101 53 奸妁妝佞侫妣妲姆姨姜妍姙姚娥娟娑娜娉娚婀婬婉娵 1010110 54 屐屏孱屬屮乢屶屹岌岑岔妛岫岻岶岼岷峅岾峇峙峩峽 1010111 55 廖廣廝廚廛廢廡廨廩廬廱廳廰廴廸廾弃弉彝彜弋弑弖 1011000 56 悄悛悖悗悒悧悋惡悸惠惓悴忰悽惆悵惘慍愕愆惶惷愀 1011001 57 戞戡截戮戰戲戳r 扎扞扣扛扠扨扼抂抉找抒抓抖拔抃 1011010 58 據擒擅擇撻擘擂擱擧舉擠擡抬擣擯攬擶擴擲擺攀擽攘 1011011 59 曄暸曖曚曠昿曦曩曰曵曷朏朖朞朦朧霸朮朿朶杁朸朷 1011100 60 棔棧棕椶椒椄棗棣椥棹棠棯椨椪椚椣椡棆楹楷楜楸楫 1011101 61 檗蘗檻櫃櫂檸檳檬櫞櫑櫟檪櫚櫪櫻欅蘖櫺欒欖鬱欟欸 1011110 62 沺泛泯泙泪洟衍洶洫洽洸洙洵洳洒洌浣涓浤浚浹浙涎 1011111 63 漾漓滷澆潺潸澁澀潯潛濳潭澂潼潘澎澑濂潦澳澣澡澤 表1 漢字集合1面の符号表 (続き) 1面 32 区 ~ 63 区(24 点 ~ 47 点) b7000000001111111111111111 b6111111110000000000000000 b5111111110000000000000000 b4111111110000000011111111 b3000011110000111100001111 b2001100110011001100110011 b1010101010101010101010101 b7 b6 b5 b4 b3 b2 b1 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 1000000 32 生盛精聖声製西誠誓請逝醒青静斉税脆隻席惜戚斥昔析 1000001 33 糎 6 塑岨措曾曽楚狙疏疎礎祖租粗素組蘇訴阻2 鼠僧創 1000010 34 存孫尊損村9 他多太汰詑唾堕妥惰打柁舵楕陀駄騨体堆 1000011 35 淡湛炭短端箪綻耽胆蛋誕鍛団壇弾断暖檀段男談値知地 1000100 36 銚長頂鳥勅F 直朕沈珍賃鎮陳津墜椎G 追 H 痛通塚栂掴 1000101 37 展店添纏甜貼転顛点伝殿澱田電R 吐 S 塗妬T 徒斗杜渡 1000110 38 童胴萄道銅峠鴇匿得徳涜特督禿篤毒独読栃橡凸突椴届 1000111 39 埜嚢悩濃納能脳膿農覗蚤巴把播覇杷波派琶破婆罵芭馬 1001000 40 噺塙蛤隼伴判半反e 帆搬斑板氾汎版犯班畔繁般藩販範 1001001 41 標氷漂瓢票表評E j 描病秒苗錨鋲蒜蛭鰭品彬斌浜k 貧 1001010 42 文聞丙併兵塀幣平弊柄並p 閉陛米頁僻壁癖碧別q F 箆 1001011 43 妨帽忘忙房暴望某棒冒紡肪膨謀貌貿鉾防吠頬北僕卜墨 1001100 44 霧鵡椋婿娘冥名命明盟迷銘鳴姪牝滅免棉綿緬面麺摸模 1001101 45 邑郵雄融夕予余与誉輿預傭幼妖容庸揚揺擁曜楊様洋溶 1001110 46 慮旅虜了亮僚両凌寮料梁涼猟療瞭稜糧良諒遼量陵領力 1001111 47 聾蝋郎六麓禄肋録論倭和話歪賄脇惑枠鷲亙亘鰐詫藁蕨 1010000 48 亳亶从仍仄仆仂仗仞仭仟价伉佚估佛佝佗佇佶侈侏侘佻 1010001 49 儻儿兀兒兌兔兢竸兩兪兮冀冂囘册冉冏冑冓冕冖* 冦冢 1010010 50 匕匚匣匯匱匳匸區卆卅丗卉卍凖卞卩卮夘卻卷厂厖厠厦 1010011 51 啗唸唳啝喙喀咯喊喟啻啾喘喞單啼喃喩喇喨嗚嗅嗟嗄嗜 1010100 52 垳垤垪垰埃埆埔埒埓堊埖埣堋堙堝塲堡塢塋塰毀塒堽塹 1010101 53 娶婢婪媚媼媾嫋嫂媽嫣嫗嫦嫩嫖嫺嫻嬌嬋嬖嬲嫐嬪嬶嬾 1010110 54 峺峭嶌峪崋崕崗嵜崟崛崑崔崢崚崙崘嵌嵒嵎嵋嵬嵳嵶嶇 1010111 55 弩弭弸彁彈彌彎弯彑彖彗彙彡彭彳彷徃徂彿徊很徑徇從 1011000 56 惴惺愃愡惻惱愍愎慇愾愨愧慊愿愼愬愴愽慂慄慳慷慘慙 1011001 57 抔拗拑抻拏拿拆擔拈拜拌拊拂拇抛拉挌拮拱挧挂挈拯拵 1011010 58 攜攅攤攣攫攴攵攷收攸畋效敖敕敍敘敞敝敲數斂斃變斛 1011011 59 杆杞杠杙杣杤枉杰枩杼杪枌枋枦枡枅枷柯枴柬枳柩枸柤
Recommended publications
  • Legacy Character Sets & Encodings
    Legacy & Not-So-Legacy Character Sets & Encodings Ken Lunde CJKV Type Development Adobe Systems Incorporated bc ftp://ftp.oreilly.com/pub/examples/nutshell/cjkv/unicode/iuc15-tb1-slides.pdf Tutorial Overview dc • What is a character set? What is an encoding? • How are character sets and encodings different? • Legacy character sets. • Non-legacy character sets. • Legacy encodings. • How does Unicode fit it? • Code conversion issues. • Disclaimer: The focus of this tutorial is primarily on Asian (CJKV) issues, which tend to be complex from a character set and encoding standpoint. 15th International Unicode Conference Copyright © 1999 Adobe Systems Incorporated Terminology & Abbreviations dc • GB (China) — Stands for “Guo Biao” (国标 guóbiâo ). — Short for “Guojia Biaozhun” (国家标准 guójiâ biâozhün). — Means “National Standard.” • GB/T (China) — “T” stands for “Tui” (推 tuî ). — Short for “Tuijian” (推荐 tuîjiàn ). — “T” means “Recommended.” • CNS (Taiwan) — 中國國家標準 ( zhôngguó guójiâ biâozhün) in Chinese. — Abbreviation for “Chinese National Standard.” 15th International Unicode Conference Copyright © 1999 Adobe Systems Incorporated Terminology & Abbreviations (Cont’d) dc • GCCS (Hong Kong) — Abbreviation for “Government Chinese Character Set.” • JIS (Japan) — 日本工業規格 ( nihon kôgyô kikaku) in Japanese. — Abbreviation for “Japanese Industrial Standard.” — 〄 • KS (Korea) — 한국 공업 규격 (韓國工業規格 hangug gongeob gyugyeog) in Korean. — Abbreviation for “Korean Standard.” — ㉿ — Designation change from “C” to “X” on August 20, 1997. 15th International Unicode Conference Copyright © 1999 Adobe Systems Incorporated Terminology & Abbreviations (Cont’d) dc • TCVN (Vietnam) — Tiu Chun Vit Nam in Vietnamese. — Means “Vietnamese Standard.” • CJKV — Chinese, Japanese, Korean, and Vietnamese. 15th International Unicode Conference Copyright © 1999 Adobe Systems Incorporated What Is A Character Set? dc • A collection of characters that are intended to be used together to create meaningful text.
    [Show full text]
  • International Language Environments Guide
    International Language Environments Guide Sun Microsystems, Inc. 4150 Network Circle Santa Clara, CA 95054 U.S.A. Part No: 806–6642–10 May, 2002 Copyright 2002 Sun Microsystems, Inc. 4150 Network Circle, Santa Clara, CA 95054 U.S.A. All rights reserved. This product or document is protected by copyright and distributed under licenses restricting its use, copying, distribution, and decompilation. No part of this product or document may be reproduced in any form by any means without prior written authorization of Sun and its licensors, if any. Third-party software, including font technology, is copyrighted and licensed from Sun suppliers. Parts of the product may be derived from Berkeley BSD systems, licensed from the University of California. UNIX is a registered trademark in the U.S. and other countries, exclusively licensed through X/Open Company, Ltd. Sun, Sun Microsystems, the Sun logo, docs.sun.com, AnswerBook, AnswerBook2, Java, XView, ToolTalk, Solstice AdminTools, SunVideo and Solaris are trademarks, registered trademarks, or service marks of Sun Microsystems, Inc. in the U.S. and other countries. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. in the U.S. and other countries. Products bearing SPARC trademarks are based upon an architecture developed by Sun Microsystems, Inc. SunOS, Solaris, X11, SPARC, UNIX, PostScript, OpenWindows, AnswerBook, SunExpress, SPARCprinter, JumpStart, Xlib The OPEN LOOK and Sun™ Graphical User Interface was developed by Sun Microsystems, Inc. for its users and licensees. Sun acknowledges the pioneering efforts of Xerox in researching and developing the concept of visual or graphical user interfaces for the computer industry.
    [Show full text]
  • International Language Environments Guide for Oracle® Solaris 11.4
    International Language Environments ® Guide for Oracle Solaris 11.4 Part No: E61001 November 2020 International Language Environments Guide for Oracle Solaris 11.4 Part No: E61001 Copyright © 2011, 2020, Oracle and/or its affiliates. License Restrictions Warranty/Consequential Damages Disclaimer This software and related documentation are provided under a license agreement containing restrictions on use and disclosure and are protected by intellectual property laws. Except as expressly permitted in your license agreement or allowed by law, you may not use, copy, reproduce, translate, broadcast, modify, license, transmit, distribute, exhibit, perform, publish, or display any part, in any form, or by any means. Reverse engineering, disassembly, or decompilation of this software, unless required by law for interoperability, is prohibited. Warranty Disclaimer The information contained herein is subject to change without notice and is not warranted to be error-free. If you find any errors, please report them to us in writing. Restricted Rights Notice If this is software or related documentation that is delivered to the U.S. Government or anyone licensing it on behalf of the U.S. Government, then the following notice is applicable: U.S. GOVERNMENT END USERS: Oracle programs (including any operating system, integrated software, any programs embedded, installed or activated on delivered hardware, and modifications of such programs) and Oracle computer documentation or other Oracle data delivered to or accessed by U.S. Government end users are
    [Show full text]
  • NAME DESCRIPTION Supported Encodings
    Perl version 5.8.6 documentation - Encode::Supported NAME Encode::Supported -- Encodings supported by Encode DESCRIPTION Encoding Names Encoding names are case insensitive. White space in names is ignored. In addition, an encoding may have aliases. Each encoding has one "canonical" name. The "canonical" name is chosen from the names of the encoding by picking the first in the following sequence (with a few exceptions). The name used by the Perl community. That includes 'utf8' and 'ascii'. Unlike aliases, canonical names directly reach the method so such frequently used words like 'utf8' don't need to do alias lookups. The MIME name as defined in IETF RFCs. This includes all "iso-"s. The name in the IANA registry. The name used by the organization that defined it. In case de jure canonical names differ from that of the Encode module, they are always aliased if it ever be implemented. So you can safely tell if a given encoding is implemented or not just by passing the canonical name. Because of all the alias issues, and because in the general case encodings have state, "Encode" uses an encoding object internally once an operation is in progress. Supported Encodings As of Perl 5.8.0, at least the following encodings are recognized. Note that unless otherwise specified, they are all case insensitive (via alias) and all occurrence of spaces are replaced with '-'. In other words, "ISO 8859 1" and "iso-8859-1" are identical. Encodings are categorized and implemented in several different modules but you don't have to use Encode::XX to make them available for most cases.
    [Show full text]
  • Adobe Technical Note #5078: the Adobe-Japan1-6 Character Collection 2
    Adobe Enterprise & Developer Support bc Adobe Technical Note #5078 The Adobe-Japan1-6 Character Collection Introduction The purpose of this document is to define and describe the Adobe-Japan1-6 character collection, which enumerates 23,058 glyphs, and whose designation is derived from the following three /CIDSystemInfo dictionary entries: ● /Registry (Adobe) ● /Ordering (Japan1) ● /Supplement 6 CIDFont resources that reference this character collection must include a /CIDSystemInfo dictionary that matches the /Registry and /Ordering strings shown above. This document is designed for font developers, for the purpose of developing Japanese fonts for use with PostScript products, or for developing OpenType Japanese fonts. It is also useful for application developers and end users who need to know more about the glyphs in this character collection. This document expects that its readers are familiar with the CID-keyed font file format, which is described in Adobe Technical Note #5014, entitled Adobe CMap and CIDFont Files Specification.* A character collection contains the glyphs that are required to develop font products for a specific language, script, or market. Specific encodings are defined through the use of CMap resources that are instantiated as files, and generally reference a subset of the character collection. The character collection that results from each Supplement includes the glyphs associated with all earlier Supplements. For example, Supplement 6 includes all glyphs defined in Supplements 0 through 5. The Adobe-Japan1-6 character collection enumerates 23,058 glyphs, specifically CIDs 0 through 23057, among seven Supplements, designated 0 through 6. Adobe-Japan1-6 completely supports the current JIS (Japanese Industrial Standard) character set standards, and earlier vintages thereof, specifically JIS X 0208:1997, JIS X 0213:2004, and JIS X 0212-1990.
    [Show full text]
  • Pdflib Tutorial 9.0.1
    ABC PDFlib, PDFlib+PDI, PPS A library for generating PDF on the fly PDFlib 9.0.1 Tutorial For use with C, C++, Cobol, COM, Java, .NET, Objective-C, Perl, PHP, Python, REALbasic/Xojo, RPG, Ruby Copyright © 1997–2013 PDFlib GmbH and Thomas Merz. All rights reserved. PDFlib users are granted permission to reproduce printed or digital copies of this manual for internal use. PDFlib GmbH Franziska-Bilek-Weg 9, 80339 München, Germany www.pdflib.com phone +49 • 89 • 452 33 84-0 fax +49 • 89 • 452 33 84-99 If you have questions check the PDFlib mailing list and archive at tech.groups.yahoo.com/group/pdflib Licensing contact: [email protected] Support for commercial PDFlib licensees: [email protected] (please include your license number) This publication and the information herein is furnished as is, is subject to change without notice, and should not be construed as a commitment by PDFlib GmbH. PDFlib GmbH assumes no responsibility or lia- bility for any errors or inaccuracies, makes no warranty of any kind (express, implied or statutory) with re- spect to this publication, and expressly disclaims any and all warranties of merchantability, fitness for par- ticular purposes and noninfringement of third party rights. PDFlib and the PDFlib logo are registered trademarks of PDFlib GmbH. PDFlib licensees are granted the right to use the PDFlib name and logo in their product documentation. However, this is not required. Adobe, Acrobat, PostScript, and XMP are trademarks of Adobe Systems Inc. AIX, IBM, OS/390, WebSphere, iSeries, and zSeries are trademarks of International Business Machines Corporation.
    [Show full text]
  • ISO/IEC 10646:2020 6Th Edition
    This preview is downloaded from www.sis.se. Buy the entire standard via https://www.sis.se/std-80026835 INTERNATIONAL ISO/IEC STANDARD 10646 Sixth edition 2020-12 Information technology — Universal coded character set (UCS) Technologies de l'information — Jeu universel de caractères codés (JUC) Reference number ISO/IEC 10646:2020(E) © ISO/IEC 2020 This preview is downloaded from www.sis.se. Buy the entire standard via https://www.sis.se/std-80026835 ISO/IEC 10646:2020(E) COPYRIGHT PROTECTED DOCUMENT © ISO/IEC 2020 All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting on the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address below or ISO’s member body in the country of the requester. ISO copyright office CP 401 • Ch. de Blandonnet 8 CH-1214 Vernier, Geneva Phone:Website: +41 www.iso.org 22 749 01 11 Email: [email protected] Published in Switzerland ii © ISO/IEC 2020 – All rights reserved This preview is downloaded from www.sis.se. Buy the entire standard via https://www.sis.se/std-80026835 ISO/IEC 10646:2020 (E) CONTENTS 1 Scope ..................................................................................................................................................1 2 Normative references .........................................................................................................................1
    [Show full text]
  • To the BMP and Beyond!
    To the BMP and beyond! Eric Muller Adobe Systems © Adobe Systems - To the BMP and beyond! July 20, 2006 - Slide 1 Content 1. Why Unicode 2. Character model 3. Principles of the Abstract Character Set 4. The characters in 5.0 5. Development of the standard 6. Processing 7. Unicode and other standards 8. Resources © Adobe Systems - To the BMP and beyond! July 20, 2006 - Slide 2 Part I Why Unicode © Adobe Systems - To the BMP and beyond! July 20, 2006 - Slide 3 ASCII 128 characters = 41 supports meaningful exchange of text data very limited: not even adequate for English: Adobe® he said “Hi!” résumé† cañon © Adobe Systems - To the BMP and beyond! July 20, 2006 - Slide 4 Many other standards national or regional standards ISO-Latin-1/9: targets Western Europe JIS: targets Japan platform standards Microsoft code pages Apple: MacRoman, etc. Adobe: PDFDocEncoding, etc. but none for many writing systems! © Adobe Systems - To the BMP and beyond! July 20, 2006 - Slide 5 Unicode enables world-wide interchange of data contains all the major living scripts simple enough to be implemented everywhere supports legacy data and implementation allows a single implementation of a product supports multilingual users and organizations conforms to international standards can serve as the fundation for other standards © Adobe Systems - To the BMP and beyond! July 20, 2006 - Slide 6 Part II Character model © Adobe Systems - To the BMP and beyond! July 20, 2006 - Slide 7 Four layers abstract character set smallest components of written language coded character set adds name and code point character encoding forms representation in computer character encoding schemes byte serialization © Adobe Systems - To the BMP and beyond! July 20, 2006 - Slide 8 Abstract character set character: the smallest component of written language that has semantic value wide variation across scripts alphabetic, syllabary, abugidas, abjad, logographic even within scripts, e.g.
    [Show full text]
  • Uptex — Unicode Version of Ptex with CJK Extensions 286 Tugboat, Volume 34 (2013), No
    TUGboat, Volume 34 (2013), No. 3 285 (option ) and Babel (Latin, Cyrillic, upTEX | Unicode version of pTEX inputenc utf8 with CJK extensions Greek, etc.) packages are available in upLATEX. Takuji Tanaka 1.5 Compatibility with pTEX Abstract Since upTEX keeps the typesetting procedures of pTEX, most macros for p(LA)TEX are expected to upT X is a Unicode extension of ASCII's pT X (a E E work in up(LA)TEX with minimal or no modifications. Japanese-localized TEX). It not only improves Jap- anese support, but also handles Chinese and Ko- 1.6 Limitation rean characters, i.e., Kanji (Hanzi, Hanja), Kana, Current upTEX has several limitations. CJK symbols, and Hangul with Unicode. Moreover, In the area of multilingual support, upTEX han- it can process multilingual typesetting of original dle CJK, Latin, Cyrillic and Greek. However, upTEX LATEX with inputenc and Babel (Latin, Cyrillic, cannot directly treat complex characters, e.g., Ara- Greek, etc.) by switching its \kcatcode tables. This bic, Hindi. paper describes the main features of upTEX. upTEX includes the "-TEX extensions, thanks to Kitagawa [4]. On the other hand, upTEX does not 1 Introduction have pdfTEX extensions yet. 1.1 Motivation The (u)pTEX engine remains based on original T X and written using WEB. It includes two spe- The objective of upTEX [1] is the \Unicodization" of E cial extensions: (1) (u)pT X treats Japanese (CJK) ASCII Corp.'s pTEX [2]. p(LA)TEX is the most pop- E glyph metrics with an extended format of tfm, called ular TEX system in Japan and is widely used as a jfm; (2) (u)pT X defines a special DVI command 255 typesetter for commercial printing.
    [Show full text]
  • (12) United States Patent (10) Patent N0.: US 7,900,143 B2 Xu (45) Date of Patent: Mar
    US007900143B2 (12) United States Patent (10) Patent N0.: US 7,900,143 B2 Xu (45) Date of Patent: Mar. 1, 2011 (54) LARGE CHARACTER SET BROWSER 6,718,519 B1* 4/2004 Taieb ....................... .. 715/542 (75) Inventor: Yueheng Xu, Beaverton, OR (US) OTHER PUBLICATIONS Roger Molesworth, The Open Interchange of Electronic Documents, (73) Assignee: Intel Corporation, Santa Clara, CA Standards & Practices in Electronic Data Interchange, IEEE Col (Us) loquium on May 21, 1991, pp. 1-6.* Apple Computer, Inc., Programming With The Text Encoding Con Notice: Subject to any disclaimer, the term of this version Manager, Technical Publication, Apr. 1999, pp. 1-286. patent is extended or adjusted under 35 K. Lunde, CJKJNF, CJKINF Version 1.3, Jun. 16, 1995, pp. 1-50. USC 154(b) by 2155 days. Zhu, et al., Network Working Group, Request for Comments 1922, “Chinese Character Encoding For Internet Messages,” Mar. 1996, pp. 1-27. (21) Appl. No.: 09/748,895 Searle, “A Brief History of Character Codes in North America, Europe, and East Asia,” Sakamura Laboratory, University Museum, (22) Filed: Dec. 27, 2000 University of Tokyo, 1999, pp. 1-19. Goldsmith, et al., Network Working Group, Request for Comments (65) Prior Publication Data 2152, “A Mail-Safe Transmission Format of Unicode,” May 1997, US 2002/0120654 A1 Aug. 29, 2002 pp. 1-15. Japanese Patent Of?ce, English Language Translation of Inquiry Int. Cl. Letter Dated Jul. 11, 2008, pp. 1-3. (51) Japan Standards Association, “7-bit and 8-bit double byte coded G06F 17/22 (2006.01) extended Kanji sets for information interchange,” JIS X 0213:2000, (52) US.
    [Show full text]
  • 1.2 Design Goals
    The Unicode® Standard Version 12.0 – Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and the publisher was aware of a trade- mark claim, the designations have been printed with initial capital letters or in all capitals. Unicode and the Unicode Logo are registered trademarks of Unicode, Inc., in the United States and other countries. The authors and publisher have taken care in the preparation of this specification, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. © 2019 Unicode, Inc. All rights reserved. This publication is protected by copyright, and permission must be obtained from the publisher prior to any prohibited reproduction. For information regarding permissions, inquire at http://www.unicode.org/reporting.html. For information about the Unicode terms of use, please see http://www.unicode.org/copyright.html. The Unicode Standard / the Unicode Consortium; edited by the Unicode Consortium. — Version 12.0. Includes index. ISBN 978-1-936213-22-1 (http://www.unicode.org/versions/Unicode12.0.0/) 1.
    [Show full text]
  • Unicode による JIS X 0213 実装 ~情報システムにおける日本語処理~
    田丸 健三郎 マイクロソフト イノベーション センター http://www.microsoft.com/japan/mic 1 プラットフォームと相互運用性 文字集合の重要性 文字コードの歴史とマイクロソフト 文字コード…ちょっとその前に JIS? Unicode 2 Shift JIS JIS78 – JIS C 6226:1978 Unicode JIS83 – JIS C 6226:1983 UCS (Universal Character JIS90 – JIS X 0208:1990 Set) JIS97 – JIS X 0208:1997 BMP (Basic Multilingual JIS2000 – JIS X 0213:2000 Plane) JIS2004 – JIS X 0213:2004 Surrogate Pair 補助漢字 – JIS X 0212:1990 8bit, 16bit, 32bit 10646 – ISO/IEC 10646 Wide Character ASCII – ANSI INCITS 4 半角、全角 UI (User Interface) 注意:略称は他の文献などで使われている例と違うことがあります。例えば、JIS78 vs. 78JIS 3 www.w3c.org www.ieee.org http://www.ecma-international.org/ www.ietf.org www.ws-i.org 4 Linux, Tru64 Services Services for Macintosh for UNIX Services for Novell NDS, NetWare UNIX NIS, Novell Active Groupwise, Directory Systems Center Meta- directory Services Interix SQL Host BizTalk Integration Exchange Server Informix, XML/SOAP Smart Phones Web services and PDAs 5 安定した符号化文字集合 は、ICT機器間の相互運 用性確保の根幹となる基 本要件の一つ 6 7 第一世代 (1983 - ) Shift JIS MS-DOS 2 - , MS OS/2 1.x, MS-Windows 2.x, (JIS78 or JIS83) + メーカー拡張 3.0 第二世代(1993 - ) マイクロソフト 標準 キャラクタ セット JIS90, 10646 Windows 3.1, NT 3.1/3.5, 95, NT 3.51/4.0 第三世代 (1998 - ) 補助漢字をUCS拡張 補助漢字 Windows 98, NT 4.0 SP4, 2000, Me, XP 第四世代 (2007 - ) 国語審議会答申に基づく最新規格への対応 Windows Vista, Windows Server 2008, JIS2004 Windows XP (with JIS2004 Pack) 8 1982年12月MS-DOS 2.0説明会にて発表 1983年 MS-DOS 2.0 で対応 半角は 1Byte、全角は2Bytes 1st Byteで半角と全角を判断 2nd Byte に、Delimiter(‘¥’ など)のコードが存在 文字集合は JIS78(または、JIS83)を基本 メーカー拡張文字を収容可能 拡張性に乏しい 9 1992年12月 Version 1.0 完成 1993年5月 Windows 3.1 で対応 符号化方式は、シフト
    [Show full text]