Quick viewing(Text Mode)

ISO/IEC International Standard 10646-1

ISO/IEC International Standard 10646-1

INTERNATIONAL ISO/IEC

This preview is downloaded from www.sis.se.STANDARD Buy entire standard via https://www.sis.se/std-91259910646

First edition 2003-12-15 AMENDMENT 7 2010-07-15

Information technology — Universal Multiple-Octet Coded Character Set (UCS) — AMENDMENT 7: Mandaic, Batak, Brahmi, and other characters

Technologies l'information — Jeu universel de caractères codés sur plusieurs octets (JUC) — AMENDEMENT 7: Mandaique, batak, brahmi, et autres caractères

Reference number ISO/IEC 10646:2003/Amd.7:2010()

© ISO/IEC 2010

ISO/IEC 10646:2003/Amd.7:2010(E)

This preview is downloaded from www.sis.se. Buy the entire standard via https://www.sis.se/std-912599

PDF disclaimer This PDF file may contain embedded typefaces. In accordance with Adobe's licensing policy, this file may printed or viewed but shall not be edited unless the typefaces which are embedded are licensed to and installed on the computer performing the editing. In downloading this file, parties accept therein the responsibility of not infringing Adobe's licensing policy. The ISO Central Secretariat accepts no liability in this area. Adobe is a trademark of Adobe Systems Incorporated. Details of the software products used to create this PDF file can be found in the General Info relative to the file; the PDF-creation parameters were optimized for printing. Every care has been taken to ensure that the file is suitable for use by ISO member bodies. In the unlikely event that a problem relating to it is found, please inform the Central Secretariat at the address given below.

COPYRIGHT PROTECTED DOCUMENT

© ISO/IEC 2010 All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm, without permission in writing from either ISO at the address below or ISO's member body in the country of the requester. ISO copyright office Case postale 56 • CH-1211 Geneva 20 Tel. + 41 22 749 01 11 Fax + 41 22 749 09 47 E-mail [email protected] Web www.iso.org Published in Switzerland

ii © ISO/IEC 2010 – All rights reserved

ISO/IEC 10646:2003/Amd.7:2010(E)

This preview is downloaded from www.sis.se. Buy the entire standard via https://www.sis.se/std-912599

Foreword

ISO (the International Organization for Standardization) and IEC (the International Electrotechnical Commission) form the specialized system for worldwide standardization. National bodies that are members of ISO or IEC participate in the development of International Standards through technical committees established by the respective organization to deal with particular fields of technical activity. ISO and IEC technical committees collaborate in fields of mutual interest. Other international organizations, governmental and non-governmental, in liaison with ISO and IEC, also take part in the work. In the field of information technology, ISO and IEC have established a joint technical committee, ISO/IEC JTC 1.

International Standards are drafted in accordance with the rules given in the ISO/IEC Directives, Part 2.

The main task of the joint technical committee is to prepare International Standards. Draft International Standards adopted by the joint technical committee are circulated to national bodies for voting. Publication as an International Standard requires approval by at least 75 % of the national bodies casting a vote.

Attention is drawn to the possibility that some of the elements of this document may be the subject of patent rights. ISO and IEC shall not be held responsible for identifying any or all such patent rights.

Amendment 7 to ISO/IEC 10646 was prepared by Joint Technical Committee ISO/IEC JTC 1, Information technology, Subcommittee SC 2, Coded character sets.

© ISO/IEC 2010 – All rights reserved iii

This preview is downloaded from www.sis.se. Buy the entire standard via https://www.sis.se/std-912599 This preview is downloaded from www.sis.se. Buy the entire standard via https://www.sis.se/std-912599

ISO/IEC 10646:2003/Amd.7:2010 (E)

Information technology — Universal Multiple-Octet Coded Character Set (UCS) —

AMENDMENT 7: Mandaic, Batak, Brahmi, and other characters

Page 1, Clause 1, Scope Page 25, Clause 29, Named UCS Sequence Identifiers In the list item specifying the supplementary planes, after „(SIP)‟, insert ‘the Tertiary Ideographic (TIP)’. Insert the following additional 23 sequence identifiers In the list item enumerating the plane names replace ‘BMP, SMP, SIP, SSP’ with ‘BMP, SMP, SIP, TIP, SSP’. <02E9, 02E5> MODIFIER LETTER EXTRA-LOW EXTRA- HIGH CONTOUR TONE BAR Page 2, Clause 4, Terms and definitions <00E6, 0300> LATIN SMALL LETTER WITH GRAVE <0254, 0300> LATIN SMALL LETTER OPEN WITH GRAVE <0254, 0301> LATIN SMALL LETTER OPEN O WITH ACUTE Insert the following before the current 4.44 Unpaired <028C, 0300> LATIN SMALL LETTER TURNED V WITH RC-element (as previously amended) and update ac- GRAVE cordingly all following term numbers and cross refer- <028C, 0301> LATIN SMALL LETTER TURNED V WITH ences. ACUTE <0259, 0300> LATIN SMALL LETTER WITH GRAVE 4.44 Tertiary Ideographic Plane (TIP) <0259, 0301> LATIN SMALL LETTER SCHWA WITH ACUTE <025A, 0300> LATIN SMALL LETTER HOOKED SCHWA Plane 03 of Group 00. WITH GRAVE <025A, 0301> LATIN SMALL LETTER HOOKED SCHWA Page 4, Clause 5, General structure of the WITH ACUTE UCS <304B, 309A> LETTER BIDAKUON NGA <304D, 309A> HIRAGANA LETTER BIDAKUON NGI <304F, 309A> HIRAGANA LETTER BIDAKUON NGU After the paragraph starting by „ISO/IEC 10646 defines <3051, 309A> HIRAGANA LETTER BIDAKUON NGE graphic characters and their coded representation‟, <3053, 309A> HIRAGANA LETTER BIDAKUON NGO insert the following paragraph. <30AB, 309A> LETTER BIDAKUON NGA <30AD, 309A> KATAKANA LETTER BIDAKUON NGI The Tertiary Ideographic Plane (TIP), Plane 03 of Group <30AF, 309A> KATAKANA LETTER BIDAKUON NGU 00, is reserved for ideographic characters and is cur- <30B1, 309A> KATAKANA LETTER BIDAKUON NGE rently empty. <30B3, 309A> KATAKANA LETTER BIDAKUON NGO <30BB, 309A> KATAKANA LETTER AINU CE Page 14, Sub-clause 20.3, Format characters <30C4, 309A> KATAKANA LETTER AINU TU <30C8, 309A> KATAKANA LETTER AINU TO Insert the following entry in the list of formats charac- ters: Page 30-1348, Clause 34, Code Tables and list of character names 2D7F TIFINAGH CONSONANT JOINER 1. Modifications to existing blocks

Insert the additional character glyphs and names at the indicated positions in the blocks given below.

© ISO/IEC 2010 – All rights reserved 1 This preview is downloaded from www.sis.se. Buy the entire standard via https://www.sis.se/std-912599

ISO/IEC 10646:2003/Amd.7:2010 (E)

Plane 00 Page 1351, Annex A.1 Oriya In the alphabetical list of keywords in Note 3, add col- lection “1042” to the entries “Hiragana” and “Katakana”. Malayalam Tifinagh Latin Extended- In the alphabetical list of keywords in Note 3, insert the following entries: Arabic Presentation Forms-A Batak 158 Brahmi 1041 These blocks contain new characters and names at the Mandaic 157 following code positions: 0526-527, 0B72-0B77, 0D29, 0D3A, 2D70, 2D7F, Page 1352, Annex A.2.1 3097, A78D-A78E, FBB2-FBC1 In the list of blocks in the BMP, insert the following new 2. New blocks entries:

Insert the following additional blocks. MANDAIC 0840-085F BATAK 1BC0-1BFF Plane 00 Mandaic Page 1353, Annex A.2.2 Batak In the list of blocks in the SMP, insert the following new Plane 01 entries:

Brahmi BRAHMI 11000-1107F Supplement KANA SUPPLEMENT 1B000-1B0FF

These blocks add new characters and names at the Page 1358, Annex B, List of combining cha- following code positions: racters 0840-085B, 085E, 1BC0-1BF3, 1BFC-1BFF, 11000- 1104D, 11052-1106F, 1B000-1B001 Insert the following new entries: 0859 MANDAIC AFFRICATION MARK Page 1349, Annex A.1 085A MANDAIC VOCALIZATION MARK 085B MANDAIC GEMINATION MARK 1BE6 BATAK SIGN TOMPI In the list of collection numbers and names, after 1BE7 BATAK VOWEL SIGN E 1BE8 BATAK VOWEL SIGN PAKPAK E 156 MEETEI MAYEK 1BE9 BATAK VOWEL SIGN EE 1BEA BATAK VOWEL SIGN insert new entries as follows: 1BEB BATAK VOWEL SIGN KARO I 1BEC BATAK VOWEL SIGN O 157 MANDAIC 0840-085F 1BED BATAK VOWEL SIGN KARO O 158 BATAK 1BC0-1BFF 1BEE BATAK VOWEL SIGN 1BEF BATAK VOWEL SIGN U FOR SIMALUNGUN SA after 1BF0 BATAK CONSONANT SIGN NG 1BF1 BATAK CONSONANT SIGN 1040 ENCLOSED IDEOGRAPHIC SUPPLEMENT 1BF2 BATAK PANGOLAT 1BF3 BATAK PANONGONAN 11000 BRAHMI SIGN CANDRABINDU insert new entries as follows: 11001 BRAHMI SIGN ANUSVARA 1041 BRAHMI 11000-1107F 11002 BRAHMI SIGN VISARGA 1042 KANA SUPPLEMENT 1B000-1B0FF 11038 BRAHMI VOWEL SIGN AA 11039 BRAHMI VOWEL SIGN BHATTIPROLU AA 1103A BRAHMI VOWEL SIGN I 1103B BRAHMI VOWEL SIGN II 1103C BRAHMI VOWEL SIGN U 1103D BRAHMI VOWEL SIGN UU

2 © ISO/IEC 2010 – All rights reserved This preview is downloaded from www.sis.se. Buy the entire standard via https://www.sis.se/std-912599

ISO/IEC 10646:2003/Amd.7:2010 (E)

1103E BRAHMI VOWEL SIGN VOCALIC R indicate that the previous character and the following 1103F BRAHMI VOWEL SIGN VOCALIC RR character are part of a bi-consonant cluster. 11040 BRAHMI VOWEL SIGN VOCALIC L 11041 BRAHMI VOWEL SIGN VOCALIC LL 11042 BRAHMI VOWEL SIGN E Page 1379, Annex G 11043 BRAHMI VOWEL SIGN AI 11044 BRAHMI VOWEL SIGN O 11045 BRAHMI VOWEL SIGN AU Insert each of the new character name entries at the 11046 BRAHMI VIRAMA appropriate position, ordered alphabetically by the cha- racter name, in the list of character names in Annex G. Page 1379, Annex F.2, Script-specific format These new names are provided in a machine-readable characters format that is accessible as a link to this document. Click on this highlighted text to access the file contain- Insert the following sub-clause ing the new names. NOTE – The content is also available as a separate viewa- ble file in the same file directory as this document. The file is F.2.6 Tifinagh consonant joiner named: “Am7names.txt”.

TIFINAGH CONSONANT JOINER (2D7F): This cha- racter suppresses an inherent vowel, and functions to

© ISO/IEC 2010 – All rights reserved 3 This preview is downloaded from www.sis.se. Buy the entire standard via https://www.sis.se/std-912599 ISO/IEC 10646:2003/Amd.7:2010 (E) 0500 Cyrillic Supplement 052F

050 051 052

0 Ԁ Ԑ Ԡ

0500 0510 0520

1 ԁ ԑ ԡ

0501 0511 0521

2 Ԃ Ԓ Ԣ

0502 0512 0522

3 ԃ ԓ ԣ

0503 0513 0523

4 Ԅ Ԕ Ԥ

0504 0514 0524

5 ԅ ԕ ԥ

0505 0515 0525

6 Ԇ Ԗ Ԧ

0506 0516 0526

7 ԇ ԗ ԧ

0507 0517 0527

8 Ԉ Ԙ

0508 0518

9 ԉ ԙ

0509 0519

A Ԋ Ԛ

050A 051A

B ԋ ԛ

050B 051B

C Ԍ Ԝ

050C 051C

D ԍ ԝ

050D 051D

E Ԏ Ԟ

050E 051E

F ԏ ԟ

050F 051F

4 © ISO/IEC 2010 – All rights reserved This preview is downloaded from www.sis.se. Buy the entire standard via https://www.sis.se/std-912599 ISO/IEC 10646:2003/Amd.7:2010 (E) 0500 Cyrillic Supplement 0527

Komi letters 0525 ԥ CYRILLIC SMALL LETTER WITH DESCENDER 0500 Ԁ CYRILLIC CAPITAL LETTER • used in modern Abkhaz orthography 0501 ԁ CYRILLIC SMALL LETTER KOMI DE → 04A7 ҧ cyrillic small letter pe with middle 0502 Ԃ CYRILLIC CAPITAL LETTER KOMI hook 0503 ԃ CYRILLIC SMALL LETTER KOMI DJE Azerbaijani letters 0504 Ԅ CYRILLIC CAPITAL LETTER KOMI 0526 Ԧ CYRILLIC CAPITAL LETTER WITH 0505 ԅ CYRILLIC SMALL LETTER KOMI ZJE DESCENDER 0506 Ԇ CYRILLIC CAPITAL LETTER KOMI DZJE 0527 ԧ CYRILLIC SMALL LETTER SHHA WITH 0507 ԇ CYRILLIC SMALL LETTER KOMI DZJE DESCENDER 0508 Ԉ CYRILLIC CAPITAL LETTER KOMI 0509 ԉ CYRILLIC SMALL LETTER KOMI LJE 050A Ԋ CYRILLIC CAPITAL LETTER KOMI 050B ԋ CYRILLIC SMALL LETTER KOMI NJE 050C Ԍ CYRILLIC CAPITAL LETTER KOMI 050D ԍ CYRILLIC SMALL LETTER KOMI SJE 050E Ԏ CYRILLIC CAPITAL LETTER 050F ԏ CYRILLIC SMALL LETTER KOMI TJE Khanty letters 0510 Ԑ CYRILLIC CAPITAL LETTER REVERSED 0511 ԑ CYRILLIC SMALL LETTER REVERSED ZE • also used for Enets 0512 Ԓ CYRILLIC CAPITAL LETTER WITH HOOK 0513 ԓ CYRILLIC SMALL LETTER EL WITH HOOK • also used for Chukchi and Itelmen Mordvin letters 0514 Ԕ CYRILLIC CAPITAL LETTER 0515 ԕ CYRILLIC SMALL LETTER LHA = voiceless l 0516 Ԗ CYRILLIC CAPITAL LETTER 0517 ԗ CYRILLIC SMALL LETTER RHA = voiceless r 0518 Ԙ CYRILLIC CAPITAL LETTER 0519 ԙ CYRILLIC SMALL LETTER YAE Kurdish letters 051A Ԛ CYRILLIC CAPITAL LETTER 051B ԛ CYRILLIC SMALL LETTER QA 051C Ԝ CYRILLIC CAPITAL LETTER 051D ԝ CYRILLIC SMALL LETTER WE Aleut letters 051E Ԟ CYRILLIC CAPITAL LETTER ALEUT 051F ԟ CYRILLIC SMALL LETTER ALEUT KA • used for [] in Aleut Chuvash letters These are obsolete letters formerly used in Jakovlev's Chuvash orthography. 0520 Ԡ CYRILLIC CAPITAL LETTER EL WITH MIDDLE HOOK 0521 ԡ CYRILLIC SMALL LETTER EL WITH MIDDLE HOOK = palatalized l 0522 Ԣ CYRILLIC CAPITAL LETTER WITH MIDDLE HOOK 0523 ԣ CYRILLIC SMALL LETTER EN WITH MIDDLE HOOK = palatalized n Abkhaz letters 0524 Ԥ CYRILLIC CAPITAL LETTER PE WITH DESCENDER

© ISO/IEC 2010 – All rights reserved 5 This preview is downloaded from www.sis.se. Buy the entire standard via https://www.sis.se/std-912599 ISO/IEC 10646:2003/Amd.7:2010 (E) 0840 Mandaic 085F

084 085 Letters 0840 ࡀ MANDAIC LETTER HALQA = a 0 ࡀ ࡐ 0841 ࡁ MANDAIC LETTER AB 0840 0850 0842 ࡂ MANDAIC LETTER AG 0843 ࡃ MANDAIC LETTER AD 1 ࡁ ࡑ 0844 ࡄ MANDAIC LETTER AH 0845 ࡅ MANDAIC LETTER USHENNA 0841 0851 = u 0846 ࡆ MANDAIC LETTER AZ 2 ࡂ ࡒ 0847 ࡇ MANDAIC LETTER IT 0848 ࡈ MANDAIC LETTER ATT 0842 0852 0849 ࡉ MANDAIC LETTER AKSA = i 3 ࡃ ࡓ 084A ࡊ MANDAIC LETTER AK

0843 0853 084B ࡋ MANDAIC LETTER AL 084C ࡌ MANDAIC LETTER AM 084D ࡍ MANDAIC LETTER AN 4 ࡄ ࡔ 084E ࡎ MANDAIC LETTER AS 0844 0854 084F ࡏ MANDAIC LETTER IN 0850 ࡐ MANDAIC LETTER AP 5 ࡅ ࡕ 0851 ࡑ MANDAIC LETTER ASZ 0852 ࡒ MANDAIC LETTER AQ 0845 0855 0853 ࡓ MANDAIC LETTER AR 0854 ࡔ MANDAIC LETTER ASH 6 ࡆ ࡖ 0855 ࡕ MANDAIC LETTER AT 0856 ࡖ MANDAIC LETTER DUSHENNA 0846 0856 = di 0857 ࡗ MANDAIC LETTER KAD 7 ࡇ ࡗ 0858 ࡘ MANDAIC LETTER AIN 0847 0857 Diacritics 0859 ࡙ MANDAIC AFFRICATION MARK 8 ࡈ ࡘ 085A ࡚ MANDAIC VOCALIZATION MARK 085B ࡛ MANDAIC GEMINATION MARK 0848 0858 Punctuation 085E ࡞ MANDAIC PUNCTUATION 9 ࡉ ࡙

0849 0859

A ࡊ ࡚

084A 085A

B ࡋ ࡛

084B 085B

C ࡌ

084C

D ࡍ

084D

E ࡎ ࡞

084E 085E

F ࡏ

084F

6 © ISO/IEC 2010 – All rights reserved This preview is downloaded from www.sis.se. Buy the entire standard via https://www.sis.se/std-912599 ISO/IEC 10646:2003/Amd.7:2010 (E) 0B00 Oriya 0B7F

0B0 0B1 0B2 0B3 0B4 0B5 0B6 0B7

0 ଐ ଠ ର $ୀ ୠ ୰

0B10 0B20 0B30 0B40 0B60 0B70

1 $ଁ ଡ $ୁ ୡ ୱ

0B01 0B21 0B41 0B61 0B71

2 $ଂ ଢ ଲ $ୂ $ୢ ⁄1

0B02 0B22 0B32 0B42 0B62 0B72

3 $ଃ ଓ ଣ ଳ $ୃ $ୣ ⁄1

0B03 0B13 0B23 0B33 0B43 0B63 0B73

4 ଔ ତ $ୄ ⁄3

0B14 0B24 0B44 0B74

5 ଅ କ ଥ ଵ ⁄1

0B05 0B15 0B25 0B35 0B75

6 ଆ ଖ ଦ ଶ $ୖ ୦ ⁄1

0B06 0B16 0B26 0B36 0B56 0B66 0B76

7 ଇ ଗ ଧ ଷ $େ $ୗ ୧ ⁄3

0B07 0B17 0B27 0B37 0B47 0B57 0B67 0B77

8 ଈ ଘ ନ ସ $ୈ ୨

0B08 0B18 0B28 0B38 0B48 0B68

9 ଉ ଙ ହ ୩

0B09 0B19 0B39 0B69

A ଊ ଚ ପ ୪

0B0A 0B1A 0B2A 0B6A

B ଋ ଛ ଫ $ୋ ୫

0B0B 0B1B 0B2B 0B4B 0B6B

C ଌ ଜ ବ $଼ $ୌ ଡ଼ ୬

0B0C 0B1C 0B2C 0B3C 0B4C 0B5C 0B6C

D ଝ ଭ ଽ $୍ ଢ଼ ୭

0B1D 0B2D 0B3D 0B4D 0B5D 0B6D

E ଞ ମ $ା ୮

0B1E 0B2E 0B3E 0B6E

F ଏ ଟ ଯ $ି ୟ ୯

0B0F 0B1F 0B2F 0B3F 0B5F 0B6F

© ISO/IEC 2010 – All rights reserved 7