The Unicode Standard, Version 4.0--Online Edition
Total Page:16
File Type:pdf, Size:1020Kb
This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consor- tium and published by Addison-Wesley. The material has been modified slightly for this online edi- tion, however the PDF files have not been modified to reflect the corrections found on the Updates and Errata page (http://www.unicode.org/errata/). For information on more recent versions of the standard, see http://www.unicode.org/standard/versions/enumeratedversions.html. Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and Addison-Wesley was aware of a trademark claim, the designations have been printed in initial capital letters. However, not all words in initial capital letters are trademark designations. The Unicode® Consortium is a registered trademark, and Unicode™ is a trademark of Unicode, Inc. The Unicode logo is a trademark of Unicode, Inc., and may be registered in some jurisdictions. The authors and publisher have taken care in preparation of this book, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode®, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. Dai Kan-Wa Jiten used as the source of reference Kanji codes was written by Tetsuji Morohashi and published by Taishukan Shoten. Cover and CD-ROM label design: Steve Mehallo, http://www.mehallo.com The publisher offers discounts on this book when ordered in quantity for bulk purchases and special sales. For more information, customers in the U.S. please contact U.S. Corporate and Government Sales, (800) 382-3419, [email protected]. For sales outside of the U.S., please contact International Sales, +1 317 581 3793, [email protected] Visit Addison-Wesley on the Web: http://www.awprofessional.com Library of Congress Cataloging-in-Publication Data The Unicode Standard, Version 4.0 : the Unicode Consortium /Joan Aliprand... [et al.]. p. cm. Includes bibliographical references and index. ISBN 0-321-18578-1 (alk. paper) 1. Unicode (Computer character set). I. Aliprand, Joan. QA268.U545 2004 005.7’2—dc21 2003052158 Copyright © 1991–2003 by Unicode, Inc. All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording or other- wise, without the prior written permission of the publisher or Unicode, Inc. Printed in the United States of America. Published simultaneously in Canada. For information on obtaining permission for use of material from this work, please submit a written request to the Unicode Consortium, Post Office Box 39146, Mountain View, CA 94039-1476, USA, Fax +1 650 693 3010 or to Pearson Education, Inc., Rights and Contracts Department, 75 Arlington Street, Suite 300 Boston, MA 02116, USA, Fax: +1 617 848 7047. ISBN 0-321-18578-1 Text printed on recycled paper 1 2 3 4 5 6 7 8 9 10—CRW—0706050403 First printing, August 2003 Indices I I.1 Unicode Names Index The Unicode Names index contains three types of entries. • Formal character names—all uppercase • Alternative character names (aliases)—all lowercase • Character group names—mixed case (titlecase) Formal character names are unmodified from the character names lists, although the name strings may be indexed by different words in the names. Alternative character names and character group names are occasionally modified slightly to make them understandable out of context (for example, from “Hangul” to “Korean Hangul”). Not every character is indexed. Large groups of similar characters, including CJK ideo- graphs, Korean Hangul syllables, and compatibility characters, are indexed by their charac- ter group names, such as block names, subblocks, alphabet names, relevant standards, or group summaries (for example, “Roman Numerals”). A WITH ACUTE, LATIN CAPITAL LETTER . 00C1 A, COMBINING LATIN SMALL LETTER . .0363 A WITH ACUTE, LATIN SMALL LETTER . 00E1 A, LATIN LETTER SMALL CAPITAL . 1D00 A WITH BREVE, LATIN SMALL LETTER . 0103 a, latin small letter script . .0251 A WITH CARON, LATIN SMALL LETTER. 01CE A, LATIN SMALL LETTER TURNED . .0250 A WITH CIRCUMFLEX, LATIN CAPITAL ABBREVIATION MARK, ARMENIAN . .055F LETTER . 00C2 ABBREVIATION MARK, SYRIAC . .070F A WITH CIRCUMFLEX, LATIN SMALL ABBREVIATION SIGN, DEVANAGARI . .0970 LETTER . 00E2 Abbreviations, Squared Latin. .3371 A WITH DIAERESIS, LATIN CAPITAL Aboriginal Syllabics, Unified Canadian . .1400 LETTER . 00C4 ABOVE RIGHT, COMBINING COMMA . .0315 A WITH DIAERESIS, LATIN SMALL LETTER. 00E4 above, cedilla . .0312 A WITH DOT ABOVE, LATIN SMALL ABOVE, COMBINING ALMOST EQUAL TO . 034C LETTER . 0227 ABOVE, COMBINING ANTICLOCKWISE A WITH DOT BELOW, LATIN SMALL ARROW. 20D4 LETTER . 1EA1 ABOVE, COMBINING BRIDGE . .0346 A WITH DOUBLE GRAVE, LATIN SMALL ABOVE, COMBINING CLOCKWISE LETTER . 0201 ARROW. 20D5 A WITH GRAVE, LATIN CAPITAL LETTER . 00C0 ABOVE, COMBINING COMMA. .0313 A WITH GRAVE, LATIN SMALL LETTER . 00E0 above, combining counterclockwise arrow . 20D4 A WITH HOOK ABOVE, LATIN SMALL ABOVE, COMBINING DOT . .0307 LETTER . 1EA3 ABOVE, COMBINING DOUBLE VERTICAL A WITH INVERTED BREVE, LATIN SMALL LINE. 030E LETTER . 0203 ABOVE, COMBINING FOUR DOTS . .20DC A WITH MACRON, LATIN SMALL LETTER. 0101 ABOVE, COMBINING HOMOTHETIC. 034B A WITH OGONEK, LATIN SMALL LETTER . 0105 ABOVE, COMBINING HOOK. .0309 A WITH RIGHT HALF RING, LATIN SMALL ABOVE, COMBINING LEFT ANGLE . 031A LETTER . 1E9A ABOVE, COMBINING LEFT ARROW . 20D6 A WITH RING ABOVE, LATIN CAPITAL ABOVE, COMBINING LEFT HALF RING . .0351 LETTER . 00C5 ABOVE, COMBINING LEFT HARPOON . 20D0 A WITH RING ABOVE, LATIN SMALL ABOVE, COMBINING LEFT RIGHT LETTER . 00E5 ARROW. 20E1 A WITH RING BELOW, LATIN SMALL ABOVE, COMBINING NOT TILDE . 034A LETTER . 1E01 ABOVE, COMBINING REVERSED COMMA . .0314 A WITH TILDE, LATIN CAPITAL LETTER . 00C3 ABOVE, COMBINING RIGHT ARROW . 20D7 A WITH TILDE, LATIN SMALL LETTER . 00E3 ABOVE, COMBINING RIGHT ARROWHEAD.0350 The Unicode Standard 4.0 8 Aug 03 1407 I.1 Unicode Names Index Indices ABOVE, COMBINING RIGHT HALF RING . .0357 African Letters for Clicks . 01C0 ABOVE, COMBINING RIGHT HARPOON. 20D1 Ainu, Katakana Extensions for . 31F0 ABOVE, COMBINING RING. 030A AIRPLANE . 2708 ABOVE, COMBINING THREE DOTS . 20DB AL-LAKUNA, SINHALA SIGN . .0DCA ABOVE, COMBINING TURNED COMMA. .0312 aldus leaf . 2766 ABOVE, COMBINING VERTICAL LINE . 030D ALEF SYMBOL. 2135 ABOVE, COMBINING WIDE BRIDGE . 20E9 Ali Gali Extensions, Mongolian. 1880 ABOVE, COMBINING X . 033D ALL AROUND-PROFILE . .232E ABOVE, DOT . 02D9 ALL EQUAL TO . 224C above, double dot . .0308 ALL, FOR . 2200 ABOVE, RING . .02DA ALMOST EQUAL TO . 2248 above, v . 030C ALMOST EQUAL TO ABOVE, COMBINING . 034C absolute continuity . 2AA1 ALMOST EQUAL TO, NOT . 2249 absolute value . 007C ALPHA, LATIN SMALL LETTER. 0251 abstract syntax bracket, left . 301A ALPHA, LATIN SMALL LETTER TURNED. 0252 abstract syntax bracket, right. 301B Alphabetic Presentation Forms. .FB00 ACCENT BELOW, COMBINING ACUTE . .0317 Alphanumeric Symbols, Mathematical . 1D400 ACCENT BELOW, COMBINING Alphanumerics, Enclosed . 2460 CIRCUMFLEX . 032D alternating current . 223F ACCENT BELOW, COMBINING GRAVE. .0316 ALTERNATION MARK, PART. 303D ACCENT, ACUTE. 00B4 ALTERNATIVE KEY SYMBOL. 2387 ACCENT, CIRCUMFLEX . 005E ALVEOLAR CLICK, LATIN LETTER . 01C2 ACCENT, COMBINING ACUTE . .0301 AMPERSAND. 0026 ACCENT, COMBINING CIRCUMFLEX. .0302 AMPERSAND, ARABIC SIGN SINDHI . 06FD ACCENT, COMBINING DOUBLE ACUTE . 030B AMPERSAND, TURNED . .214B ACCENT, COMBINING DOUBLE GRAVE . 030F ANCHOR, INTERLINEAR ANNOTATION. .FFF9 ACCENT, COMBINING GRAVE . .0300 AND, CURLY LOGICAL . 22CF ACCENT, DOUBLE ACUTE . .02DD AND, LOGICAL . 2227 ACCENT, GRAVE . .0060 AND, N-ARY LOGICAL . 22C0 ACCENT, MODIFIER LETTER ACUTE . 02CA Ands and Ors, Logical . .2A51 ACCENT, MODIFIER LETTER ANGKHANKHU, THAI CHARACTER . 0E5A CIRCUMFLEX . 02C6 ANGLE . 2220 ACCENT, MODIFIER LETTER CROSS. 02DF ANGLE ABOVE, COMBINING LEFT. .031A ACCENT, MODIFIER LETTER GRAVE . 02CB angle arc . 2222 ACCENT, MODIFIER LETTER LOW ACUTE . 02CF ANGLE BELOW, COMBINING LEFT. 0349 ACCENT, MODIFIER LETTER LOW GRAVE . 02CE ANGLE BRACKET, LEFT . 3008 accent, spacing acute. 00B4 ANGLE BRACKET, LEFT DOUBLE . .300A accent, spacing circumflex . 005E ANGLE BRACKET, LEFT-POINTING . 2329 accent, spacing grave. .0060 ANGLE BRACKET, MATHEMATICAL LEFT. .27E8 accent, swedish grave . 02DF ANGLE BRACKET, MATHEMATICAL LEFT ACCOUNT OF . .2100 DOUBLE. 27EA ACKNOWLEDGE. .0006 ANGLE BRACKET, MATHEMATICAL RIGHT.27E9 ACKNOWLEDGE, NEGATIVE. .0015 ANGLE BRACKET, MATHEMATICAL RIGHT ACKNOWLEDGE, SYMBOL FOR . .2406 DOUBLE. 27EB ACKNOWLEDGE, SYMBOL FOR NEGATIVE .2415 ANGLE BRACKET, RIGHT . 3009 actuarial bend . 20E7 ANGLE BRACKET, RIGHT DOUBLE. .300B ACUTE ACCENT . 00B4 ANGLE BRACKET, RIGHT-POINTING. .232A ACUTE ACCENT BELOW, COMBINING . .0317 ANGLE QUOTATION MARK, LEFT-POINTING ACUTE ACCENT, COMBINING . .0301 DOUBLE. 00AB ACUTE ACCENT, COMBINING DOUBLE . 030B ANGLE QUOTATION MARK, RIGHT-POINTING ACUTE ACCENT, DOUBLE . .02DD DOUBLE. 00BB ACUTE ACCENT, MODIFIER LETTER . 02CA ANGLE QUOTATION MARK, SINGLE ACUTE ACCENT,.