SC2/WG2 N2499 ISO/IEC International Standard Working Draft International Standard 10646 3rd Edition ISO/IEC WD 10646 3rd Edition

2002-08-01

Information technology — Universal Multiple-Octet Coded Character Set (UCS) —
Architecture and Basic Multilingual Plane
Supplementary Planes

Working Draft ISO/IEC 10646:2003

Reserved for final ISO Copyright statement

Contents

1 Scope
2 Conformance
3 Normative references
4 Terms and definitions
5 General structure of the UCS
6 Basic structure and nomenclature
7 General requirements for the UCS
8 The Basic Multilingual Plane
9 Supplementary planes
10 Private use groups, planes, and zones
11 Revision and updating of the UCS
12 Subsets
13 Coded representation forms of the UCS
14 Implementation levels
15 Use of control functions with the UCS
16 Declaration of identification of features
17 Structure of the code tables and lists
18 Block names
19 Characters in bi-directional context
20 Special characters
21 Presentation forms of characters
22 Compatibility characters
23 Order of characters
24 Normalization forms
25 Combining characters
26 Special features of individual scripts
27 Source references for CJK Ideographs
28 Character names and annotations
29 Structure of the Basic Multilingual Plane
30 Structure of the Supplementary Multilingual Plane for Scripts and symbols
31 Structure of the Supplementary Ideographic Plane
32 Supplementary Special-purpose Plane
33 Code tables and lists of character names

Annexes

A Collections of graphic characters for subsets
B List of combining characters
C Transformation format for 16 planes of Group 00 (UTF-16)
D UCS Transformation Format 8 (UTF-8)
E Mirrored characters in Arabic bi-directional context
F Alternate format characters
G Alphabetically sorted list of character names
H The use of “signatures” to identify UCS
J Recommendation for combined receiving/originating devices with internal storage
K Notations of octet value representations
L Character naming guidelines
M Sources of characters
N External references to character repertoires
P Additional information on characters
Q Code mapping table for Hangul syllables
R Names of Hangul syllables
S Procedure for the unification and arrangement of CJK Ideographs
T Language tagging using Tag Characters
U Usage of musical symbols

Foreword

ISO (the International Organization for Standardization) and IEC (the International Electrotechnical Commission) form the specialized system for worldwide standardization. National bodies that are members of ISO or IEC participate in the development of International Standards through technical committees established by the respective organization to deal with particular fields of technical activity. ISO and IEC technical committees collaborate in fields of mutual interest. Other international organizations, governmental and non-governmental, in liaison with ISO and IEC, also take part in the work.

International Standards are drafted in accordance with the rules given in the ISO/IEC Directives, Part 3.

In the field of information technology, ISO and IEC have established a joint technical committee, ISO/IEC JTC1. Draft International Standards adopted by the joint technical committee are circulated to national bodies for voting. Publication as an International Standard requires approval by at least 75% of the national bodies casting a vote.

Attention is drawn to the possibility that some of the elements of this part of ISO/IEC 10646 may be the subject of patent rights. ISO and IEC shall not be held responsible for identifying any or all such patent rights.

International Standard ISO/IEC 10646 was prepared by Joint Technical Committee ISO/IEC JTC1, Information technology, Subcommittee SC 2, Coded Character sets.

This third edition cancels and replaces the previous editions of this International Standard, which were published in two parts: Part 1 second edition (ISO/IEC 10646-1:2000) and Part 2 first edition (ISO/IEC 10646-2:2001). It also incorporates Amendments 1 and 2 to Part 1 and Amendment 1 to Part 2.

Annexes A to D form a normative part of ISO/IEC 10646. Annexes E to U are for information only.
The standard contains material which may only be available to users who obtain their copy in a machine-readable format. That material consists of the following printable files:

CJKUA_SR.txt
CJKC0SR.txt
Allnames.txt

Introduction

ISO/IEC 10646 specifies the Universal Multiple-Octet Coded Character Set (UCS). It is applicable to the representation, transmission, interchange, processing, storage, input and presentation of the written form of the languages (scripts) of the world as well as additional symbols.

ISO/IEC 10646 specifies the overall architecture, the Basic Multilingual Plane (BMP) and the Supplementary Planes of the UCS.

1 Scope

ISO/IEC 10646 specifies the Universal Multiple-Octet Coded Character Set (UCS). It is applicable to the representation, transmission, interchange, processing, storage, input, and presentation of the written form of the languages of the world as well as of additional symbols.

This document:

- specifies the architecture of ISO/IEC 10646,
- defines terms used in ISO/IEC 10646,

[...]

[...]tionally provides details of character properties, processing algorithms, and definitions that are useful to implementers.

NOTE 3 – Previous editions of ISO/IEC 10646 were published in parts: Part 1 specified the architecture and the BMP, Part 2 specified the SMP, SIP and SSP.

2 Conformance

2.1 General

Whenever private use characters are used as specified in ISO/IEC 10646, the characters themselves shall not be covered by these conformance
