WG2 N3275 ISO/IEC International Standard ISO/IEC 10646
Final Committee Draft

Information technology – Universal Multiple-Octet Coded Character Set (UCS)

Technologie de l’information – Jeu universel de caractères codés sur plusieurs octets (JUC)

Second edition, 2008

© ISO/IEC 10646:2007 (E) Final Committee Draft (FCD)

PDF disclaimer

This PDF file may contain embedded typefaces. In accordance with Adobe's licensing policy, this file may be printed or viewed but shall not be edited unless the typefaces which are embedded are licensed to and installed on the computer performing the editing. In downloading this file, parties accept therein the responsibility of not infringing Adobe's licensing policy. The ISO Central Secretariat accepts no liability in this area.

Adobe is a trademark of Adobe Systems Incorporated.

Details of the software products used to create this PDF file can be found in the General Info relative to the file; the PDF-creation parameters were optimized for printing. Every care has been taken to ensure that the file is suitable for use by ISO member bodies. In the unlikely event that a problem relating to it is found, please inform the Central Secretariat at the address given below.

© ISO/IEC 2007

All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm, without permission in writing from either ISO at the address below or ISO's member body in the country of the requester.

ISO copyright office
Case postale 56
CH-1211 Geneva 20
Tel. + 41 22 749 01 11
Fax + 41 22 749 09 47
E-mail [email protected]
Web www.iso.ch

Printed in Switzerland

CONTENTS

Foreword ... 10
Introduction ... 11
1 Scope ... 12
2 Conformance ... 12
2.1 General ... 12
2.2 Conformance of information interchange ... 12
2.3 Conformance of devices ... 13
3 Normative references ... 13
4 Terms and definitions ... 14
5 General structure of the UCS ... 20
6 Basic structure and nomenclature ... 20
6.1 Structure ... 20
6.2 Coding of characters ... 24
6.3 Type of code points ... 24
6.4 Naming of characters ... 26
6.5 Short identifiers for code points (UIDs) ... 26
6.6 UCS Sequence Identifiers ... 27
6.7 Octet sequence identifiers ... 27
7 Revision and updating of the UCS ... 29
8 Subsets ... 29
8.1 Limited subset ... 29
8.2 Selected subset ... 30
9 UCS encoding forms ... 30
9.1 UTF-8 ... 30
9.2 UTF-16 ... 31
9.3 UTF-32 (UCS-4) ... 32
10 UCS Encoding schemes ... 32
10.1 UTF-8 ... 32
10.2 UTF-16BE ... 32
10.3 UTF-16LE ... 32
10.4 UTF-16 ... 32
10.5 UTF-32BE ... 33
10.6 UTF-32LE ... 33
10.7 UTF-32 ... 33
11 Use of control functions with the UCS ... 34
12 Declaration of identification of features ... 35
12.1 Purpose and context of identification ... 35
12.2 Identification of a UCS encoding form ... 35
12.3 Identification of subsets of graphic characters ... 36
12.4 Identification of control function set ... 36
12.5 Identification of the coding system of ISO/IEC 2022 ... 37
13 Structure of the code tables and lists ... 37
14 Block and collection names ... 38
14.1 Block names ... 38
14.2 Collection names ... 38
15 Mirrored characters in bidirectional context ... 38
15.1 Mirrored characters ... 38
15.2 Directionality of bidirectional text ... 38
16 Special characters ... 38
16.1 Space characters ... 39
16.2 Currency symbols ... 39
16.3 Format Characters ... 39
16.4 Ideographic description characters ... 40
16.5 Variation selectors and variation sequences ... 40
17 Presentation forms of characters ... 43
18 Compatibility characters