ISO/IEC JTC1/SC2/WG2 Nxxxx

Universal Multiple-Octet Coded Character Set
International Organization for Standardization
Organisation internationale de normalisation
Международная организация по стандартизации

Doc Type: Document for Ballot
Title: PDAM text for Amendment 1 to ISO/IEC 10646-1:2000
Source: Michel Suignard, Michael Everson, Asmus Freytag
Status: Expert Contribution
Action: For JTC1/SC2 ballot
Date: 2000-11-29

© ISO/IEC PDAM for ISO/IEC 10646-1: 2000/Amd. 1: 2001 (E)

Information technology - Universal Multiple-Octet Coded Character Set (UCS) - Part 1: Architecture and Basic Multilingual Plane

AMENDMENT 1: Mathematical symbols and other characters

Item 1. Mathematical and other characters

1. Modifications to existing tables

Insert the following additional character glyphs and names at the indicated positions in the tables given below in Item 15, the character names replacing the existing entries which read “(This position shall not be used)”.

[EDITOR’S NOTE - In the tables in Item 15, the following code positions indicate the new characters added to those tables:
0363-036F, 03D8-03D9, 03F4-03F6, 048A-048B, 04C5-04C6, 04C9-04CA, 04CD-04CE, 07B1, 17DD, 2047, 204E-2051, 2057, 205F-2062, 2071, 20B0-20B1, 20E4-20E8, 213D-214B, 21F5-21FF, 22F2-22FF, 237C, 239B-23CE, 24EB-24FE, 2596-259F, 25F8-25FF, 2616-2617, 2672-2679, 303B-303D, 3095-3096, 309F, 30A0, 30FF, 3251-325F, 32B1-32BF, A4A2-A4A3, A4B4, A4C1, A4C5, FA30-FA68, FE45-FE46]

2. New tables

After page 279
Insert the following additional character glyphs and names at the indicated positions in the tables given below in Item 15, the new character names replacing the existing entries at the specified code locations which read “(This position shall not be used)”.

Table 144 - Row 04: CYRILLIC SUPPLEMENTARY
Table 145 - Row 17: TAGALOG
Table 146 - Row 17: HANUNOO
Table 147 - Row 17: BUHID
Table 148 - Row 17: TAGBANWA
Table 149 - Row 29: SUPPLEMENTAL ARROWS
Table 150 - Row 29: MISCELLANEOUS MATHEMATICAL SYMBOLS
Table 151 - Row 2A: SUPPLEMENTAL MATHEMATICAL OPERATORS
Table 152 - Row 2A: SUPPLEMENTAL MATHEMATICAL OPERATORS
Table 153 - Row 31: KATAKANA PHONETIC EXTENSIONS
Table 154 - Row AF: CJK COMPATIBILITY IDEOGRAPHS
Table 155 - Row FE: VARIATION SELECTORS

[The following code positions indicate the new characters to be added in new tables:
0500-052F, 1700-171F, 1720-173F, 1740-175F, 1760-177F, 2900-297F, 2980-29FF, 2A00-2A7F, 2A80-2AFF, 31F0-31FF, FE00-FE0F.]

Item 2. Reference to Part 2

Page v, Foreword
Insert the following italicized text after the Part 1 description:
    - Part 2: Supplementary Planes
Remove the text “Additional parts will specify other planes”.

Page 9, clause 14
In the first paragraph, change the text “listed in annex B” to “listed in annex B of each part of ISO/IEC 10646”.

Page 9, clause 14.1
Change the text “(see clause B.1)” to “(see clause B.1 of each part of ISO/IEC 10646)”.

Page 9, clause 14.2
Change the text “listed in clause B.2” to “listed in clause B.2 of each part of ISO/IEC 10646”.

Page 13, clause 24
In the first paragraph, change the text “clause B.1” to “clause B.1 of each part of ISO/IEC 10646”, and change the text “clause B.2” to “clause B.2 of each part of ISO/IEC 10646”.

Item 3. Restriction of code positions

Page 6, figure 2
In the legend “Private use planes 0F, 10, E0 - FF” delete “E0 - FF”, and delete the upward arrows pointing to planes E0 to FF.

Page 8, clause 9
Amend 9.1 to read as follows:

9.1 Planes reserved for future standardization
Planes 11 to FF in Group 00 and all planes in all other groups (i.e. Planes 00 to FF in Groups 01 to 7F) are reserved for future standardization, and thus those code positions shall not be used for any other purpose.
Code positions in these planes do not have a mapping to the UTF-16 form (see Annex C).
NOTE - To ensure continued interoperability between the UTF-16 form and other coded representations of the UCS, it is intended that no characters will ever be allocated to code positions above 0010 FFFF.

In 9.2 (Planes accessible by UTF-16) delete the 2nd paragraph, as follows:
Code positions in Planes 11 to FF of Group 00, or in Planes 00 to FF of other groups, do not have a mapping to the UTF-16 form.

Page 8, clause 10
In 10.2 (Code positions for private use characters) amend the first two paragraphs to read as follows:
The code positions of the 32 groups from Group 60 to Group 7F shall be for private use.
The code positions of Plane 0F and Plane 10, and of the 32 planes from Plane E0 to Plane FF, of Group 00 shall be for private use.

Page 881, annex A
From collection 400, delete “, & E0 - FF”.
Under collection 500, change “PRIVATE USE GROUPS G=60 - 7F” to “(This collection number shall not be used, see Note 2.)”.
Under Note 1, change the text “90, and 91” to “90, 91, 93, 94, 95, and 96”.
Under Note 2, change the text “58, and 59” to “58, 59, and 500”. Also correct the text “international Standards” to “International Standard”.
In the alphabetical list of collection names, add parentheses around collection 500 under “Private use”.

Item 4. Extension of U+ notation

Page 7, clause 6
Change “four-digit form” and “4-digit form” to “four-to-six-digit form” throughout.
In clause 6.5.b change “It is not defined if the first four digits of the eight-digit form are not all zeros” to “It is not defined if the eight-digit form is greater than 0010 FFFF” and append at the end of the text “Leading zeros are suppressed for values greater than 0000 FFFF.”.
Change “{+}xxxx” in the BNF form to “{+}(xxxx | xxxxx | xxxxxx)”.

Item 5. UCS Sequence Identifiers

Page 8, clause 6
After clause 6.5, add a new clause 6.6 (UCS Sequence Identifiers):

“An entity that is represented by a sequence of ‘n’ code positions from the standard, is identified by a UCS Sequence Identifier (USI) having the following form:

    <UID1, UID2, UID3, .. UIDn>

where UID1, UID2, UID3, etc. represent the unique identifiers of the corresponding characters from the standards, in the same sequence as required to represent the identified entity. The syntax for UID1, UID2, UID3, etc. is specified in clause 6.5. A COMMA character (optionally followed by a SPACE character) separates the UIDs. The UCS Sequence Identifier shall include at least two UIDs; it shall begin with a LESS-THAN SIGN and be terminated by a GREATER-THAN SIGN.”

Item 6. Characters reserved for internal processing uses

Page 8, clause 7
Remove the note.

Page 8, clause 8
Replace the last paragraph as follows:

“Code positions 0000 FDD0 to 0000 FDEF, 0000 FFFE, and 0000 FFFF are reserved.
NOTE - Code position 0000 FFFE is reserved for “signature” (see annex H). Code positions 0000 FDD0 to 0000 FDEF, and 0000 FFFF can be used for internal processing uses requiring numeric values which are guaranteed not to be coded characters, such as in terminating tables, or signaling end-of-text. Furthermore, since 0000 FFFF is the largest BMP value, it may also be used as the final value in binary or sequential searching index within the context of the BMP.”

Change the annotations for the characters at 0000 FDD0 - 0000 FDEF, 0000 FFFE, and 0000 FFFF to “(This position is permanently reserved)”.

Item 7. Alternate format characters

Page 8, clause 8
Insert the following paragraph and note after the second paragraph (which ends “reserved for control characters.”):

Code positions 0000 2060 to 0000 206F, 0000 FFF0 to 0000 FFFC, and 000E 0000 to 000E 0FFF are reserved for Alternate Format Characters.
NOTE - Unassigned code positions in that range may be ignored in normal processing and display.

Item 8. Collections for MES

Page 880, annex A.1
In the list of collection numbers and names, after
    271 COMBINING CHARACTERS B-2
insert new entries as follows:
    281 MES-1 see A.4.1 *
    282 MES-2 see A.4.2 *
    283 MES-3A see A.4.3
    284 MES-3B see A.4.4 *

Page 881, annex A.1
In the alphabetical list of keywords, after
    Mathematical operators
insert
    MES 281 282 283 284

Page 884, annex A
At the end of Annex A add a new clause A.4 as follows.

A.4 Other collections within the BMP
The collections specified within this clause are entirely within Plane 00.
NOTE - The acronym MES indicates Multilingual European Subset.

A.4.1 281 MES-1
281 MES-1 is specified by the following ranges of code positions as indicated for each row.

      44 4A 7F 82 A3-A4 A7 AC AF
21    05 16 22 26 5B-5E 90-95 A8
22    00 02-03 06 08-09 0F 11-12 19-1A 1E-1F 27-2B 48 59 60-61 64-65 82-83 95 97
23    02 10 20-21 29-2A
25    00 02 0C 10 14 18 1C 24 2C 34 3C 50-6C 80 84 88 8C 90-93 A0 AC B2 BA BC C4 CA-CB D8-D9
26    3A-3C 40 42 60 63 65-66 6A-6B
FB    01-02
FF    FD

A.4.3 283 MES-3A
283 MES-3A consists of the following collections from A.1 above.

Collection number and name
1 BASIC LATIN
2 LATIN-1 SUPPLEMENT
3 LATIN EXTENDED-A
4 LATIN EXTENDED-B
5 IPA EXTENSIONS
6 SPACING MODIFIER LETTERS
7 COMBINING DIACRITICAL MARKS
8 BASIC GREEK
9 GREEK SYMBOLS AND COPTIC
10 CYRILLIC
11 ARMENIAN
27 BASIC GEORGIAN
30 LATIN EXTENDED ADDITIONAL
31 GREEK EXTENDED
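The row listings in clause A.4 give a row number followed by cell values or cell ranges within that row. As an illustration only, the following Python sketch shows how one such line could be expanded into BMP code positions. It assumes that the first token is the row (high octet) and that every other token is a cell (low octet) or an inclusive cell range, all in hexadecimal; the function name `expand_row` is hypothetical and not part of the standard.

```python
def expand_row(line):
    """Expand one 'row cell cell-range ...' line into BMP code positions.

    Assumption: the first token is the row (high octet); each remaining
    token is a cell (low octet) or an inclusive 'first-last' cell range,
    all written in hexadecimal.
    """
    row_token, *cell_tokens = line.split()
    row = int(row_token, 16)
    if not 0 <= row <= 0xFF:
        raise ValueError(f"bad row: {row_token!r}")
    positions = []
    for token in cell_tokens:
        first, _, last = token.partition("-")
        lo = int(first, 16)
        hi = int(last, 16) if last else lo
        if not 0 <= lo <= hi <= 0xFF:
            raise ValueError(f"bad cell range: {token!r}")
        # Combine the row with every cell in the range.
        positions.extend((row << 8) | cell for cell in range(lo, hi + 1))
    return positions


# Row 21 as listed above.
row_21 = expand_row("21 05 16 22 26 5B-5E 90-95 A8")
print(" ".join(f"{cp:04X}" for cp in row_21))
```

A collection membership test then reduces to building a set from the expanded rows and checking whether a code position is in it.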
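The U+ notation of Item 4 and the UCS Sequence Identifier of Item 5 can be sketched together in Python. This is a non-authoritative sketch under stated assumptions: it handles only the "U+" form with four to six hexadecimal digits (the eight-digit form of clause 6.5 is left out), and the names `format_uid`, `format_usi` and `parse_usi` are hypothetical. The sample sequence is arbitrary.

```python
import re

MAX_CODE_POSITION = 0x10FFFF  # Item 3: nothing is allocated above 0010 FFFF

# Assumption: a UID is written as "U+" followed by 4 to 6 hex digits.
_UID = r"[Uu]\+[0-9A-Fa-f]{4,6}"
# "<", at least two UIDs separated by COMMA and an optional SPACE, ">".
_USI = re.compile(rf"<({_UID}(?:, ?{_UID})+)>")


def format_uid(cp):
    """Return the four-to-six-digit U+ form of a code position."""
    if not 0 <= cp <= MAX_CODE_POSITION:
        raise ValueError(f"code position out of range: {cp:#x}")
    # At least four digits; no leading zeros for values above FFFF.
    return f"U+{cp:04X}"


def format_usi(code_positions):
    """Return a UCS Sequence Identifier for two or more code positions."""
    if len(code_positions) < 2:
        raise ValueError("a USI needs at least two UIDs")
    return "<" + ", ".join(format_uid(cp) for cp in code_positions) + ">"


def parse_usi(text):
    """Parse a UCS Sequence Identifier into a list of code positions."""
    match = _USI.fullmatch(text)
    if match is None:
        raise ValueError(f"not a UCS Sequence Identifier: {text!r}")
    result = []
    for uid in re.split(r", ?", match.group(1)):
        digits = uid[2:]
        if len(digits) > 4 and digits.startswith("0"):
            raise ValueError(f"leading zero not suppressed: {uid!r}")
        cp = int(digits, 16)
        if cp > MAX_CODE_POSITION:
            raise ValueError(f"code position out of range: {uid!r}")
        result.append(cp)
    return result


usi = format_usi([0x0041, 0x0301])  # arbitrary two-element sequence
print(usi, parse_usi(usi) == [0x0041, 0x0301])
```

The parser accepts the separator with or without the optional SPACE, while the formatter always emits it, so that output is uniform.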
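The code position ranges named in Items 3, 6 and 7 can be collected into a small lookup. The Python sketch below is illustrative only: the category labels and the function name `classify` are hypothetical, only the ranges quoted in this text are covered, and the private use planes are taken to be 0F and 10 on the assumption that the Item 3 restriction applies.

```python
# Ranges quoted in this amendment text (inclusive bounds).
PERMANENTLY_RESERVED = [(0xFDD0, 0xFDEF), (0xFFFE, 0xFFFF)]       # Item 6
ALTERNATE_FORMAT = [(0x2060, 0x206F), (0xFFF0, 0xFFFC),
                    (0xE0000, 0xE0FFF)]                            # Item 7
PRIVATE_USE_PLANES = (0x0F, 0x10)   # assumption: Item 3 restriction applied
MAX_UTF16 = 0x10FFFF                # Item 3: highest position UTF-16 reaches


def _in_ranges(cp, ranges):
    return any(lo <= cp <= hi for lo, hi in ranges)


def classify(cp):
    """Return a hypothetical label for a code position.

    Only the ranges quoted in this text are checked; everything else
    is reported as "not covered here".
    """
    if cp < 0:
        raise ValueError("code positions are non-negative")
    if cp > MAX_UTF16:
        return "beyond UTF-16"
    if _in_ranges(cp, PERMANENTLY_RESERVED):
        return "permanently reserved"
    if _in_ranges(cp, ALTERNATE_FORMAT):
        return "alternate format"
    if (cp >> 16) in PRIVATE_USE_PLANES:
        return "private use plane"
    return "not covered here"


for sample in (0xFDD0, 0x2060, 0xE0001, 0xF0000, 0x110000, 0x0041):
    print(f"{sample:06X}", classify(sample))
```

The order of the checks matters only for the upper bound: positions above 0010 FFFF are rejected first, so planes E0 to FF never reach the private use test.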

N2308 2000-11-29