The Unicode Standard, Version 5.0, Provided for Online Access, Content Searching, and Accessibility
Total Page:16
File Type:pdf, Size:1020Kb
Electronic Edition This file is part of the electronic edition of The Unicode Standard, Version 5.0, provided for online access, content searching, and accessibility. It may not be printed. Bookmarks linking to specific chapters or sections of the whole Unicode Standard are available at http://www.unicode.org/versions/Unicode5.0.0/bookmarks.html Purchasing the Book For convenient access to the full text of the standard as a useful reference book, we recommend pur- chasing the printed version. The book is available from the Unicode Consortium, the publisher, and booksellers. Purchase of the standard in book format contributes to the ongoing work of the Uni- code Consortium. Details about the book publication and ordering information may be found at http://www.unicode.org/book/aboutbook.html Joining Unicode You or your organization may benefit by joining the Unicode Consortium: for more information, see Joining the Unicode Consortium at http://www.unicode.org/consortium/join.html This PDF file is an excerpt from The Unicode Standard, Version 5.0, issued by the Unicode Consortiu- mand published by Addison-Wesley. The material has been modified slightly for this electronic edi- ton, however, the PDF files have not been modified to reflect the corrections found on the Updates and Errata page (http://www.unicode.org/errata/). For information on more recent versions of the standard, see http://www.unicode.org/versions/enumeratedversions.html. Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and the publisher was aware of a trade- mark claim, the designations have been printed with initial capital letters or in all capitals. The Unicode® Consortium is a registered trademark, and Unicode™ is a trademark of Unicode, Inc. The Unicode logo is a trademark of Unicode, Inc., and may be registered in some jurisdictions. The authors and publisher have taken care in the preparation of this book, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode®, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. Dai Kan-Wa Jiten, used as the source of reference Kanji codes, was written by Tetsuji Morohashi and published by Taishukan Shoten. Cover and CD-ROM label design: Steve Mehallo, www.mehallo.com The publisher offers excellent discounts on this book when ordered in quantity for bulk purchases or special sales, which may include electronic versions and/or custom covers and content particular to your business, training goals, marketing focus, and branding interests. For more information, please contact U.S. Corporate and Government Sales, (800) 382-3419, [email protected]. For sales outside the United States please contact International Sales, [email protected] Visit us on the Web: www.awprofessional.com Library of Congress Cataloging-in-Publication Data The Unicode Standard / the Unicode Consortium ; edited by Julie D. Allen ... [et al.]. — Version 5.0. p. cm. Includes bibliographical references and index. ISBN 0-321-48091-0 (hardcover : alk. paper) 1. Unicode (Computer character set) I. Allen, Julie D. II. Unicode Consortium. QA268.U545 2007 005.7'22—dc22 2006023526 Copyright © 1991–2007 Unicode, Inc. All rights reserved. Printed in the United States of America. This publication is protected by copy- right, and permission must be obtained from the publisher prior to any prohibited reproduction, storage in a retrieval system, or transmission in any form or by any means, electronic, mechanical, photocopying, recording, or likewise. For information regarding permissions, write to Pearson Edu- cation, Inc., Rights and Contracts Department, 75 Arlington Street, Suite 300, Boston, MA 02116. Fax: (617) 848-7047 ISBN 0-321-48091-0 Text printed in the United States on recycled paper at Courier in Westford, Massachusetts. First printing, October 2006 Indices I I.1 Unicode Names Index The Unicode Names index contains three types of entries. • Formal character names—all uppercase • Alternative character names (aliases)—all lowercase • Character group names—mixed case (titlecase) Formal character names are unmodified from the character names lists, although the name strings may be indexed by different words in the names. Alternative character names and character group names are occasionally modified slightly to make them understandable out of context (for example, from “Hangul” to “Korean Hangul”). Not every character is indexed. Large groups of similar characters, including CJK ideo- graphs, Korean Hangul syllables, and compatibility characters, are indexed by their charac- ter group names, such as block names, subblocks, alphabet names, relevant standards, or group summaries (for example, “Roman Numerals”). A WITH ACUTE, LATIN CAPITAL LETTER . 00C1 A WITH OGONEK, LATIN SMALL LETTER. .0105 A WITH ACUTE, LATIN SMALL LETTER . 00E1 A WITH RIGHT HALF RING, LATIN SMALL A WITH BREVE, LATIN SMALL LETTER . 0103 LETTER. 1E9A A WITH CARON, LATIN SMALL LETTER. .01CE A WITH RING ABOVE, LATIN CAPITAL A WITH CIRCUMFLEX, LATIN CAPITAL LETTER. 00C5 LETTER . 00C2 A WITH RING ABOVE, LATIN SMALL A WITH CIRCUMFLEX, LATIN SMALL LETTER. 00E5 LETTER . 00E2 A WITH RING BELOW, LATIN SMALL A WITH DIAERESIS, LATIN CAPITAL LETTER. 1E01 LETTER . 00C4 A WITH STROKE, LATIN CAPITAL LETTER . 023A A WITH DIAERESIS, LATIN SMALL LETTER. 00E4 A WITH TILDE, LATIN CAPITAL LETTER . 00C3 A WITH DOT ABOVE, LATIN SMALL A WITH TILDE, LATIN SMALL LETTER. 00E3 LETTER . 0227 A, COMBINING LATIN SMALL LETTER . .0363 A WITH DOT BELOW, LATIN SMALL A, LATIN LETTER SMALL CAPITAL . 1D00 LETTER . 1EA1 a, latin small letter script . .0251 A WITH DOUBLE GRAVE, LATIN SMALL A, LATIN SMALL LETTER TURNED . .0250 LETTER . 0201 ABBREVIATION MARK, ARMENIAN . .055F A WITH GRAVE, LATIN CAPITAL LETTER . 00C0 ABBREVIATION MARK, SYRIAC . .070F A WITH GRAVE, LATIN SMALL LETTER . 00E0 ABBREVIATION SIGN, DEVANAGARI . .0970 A WITH HOOK ABOVE, LATIN SMALL Abbreviations, Squared Latin. .3371 LETTER . 1EA3 Aboriginal Syllabics, Unified Canadian . .1400 A WITH INVERTED BREVE, LATIN SMALL ABOVE RIGHT, COMBINING COMMA . .0315 LETTER . 0203 ABOVE RIGHT, COMBINING DOT. .0358 A WITH MACRON, LATIN SMALL LETTER. 0101 above, cedilla . .0312 The Unicode Standard 5.0 – Electronic edition Copyright © 1991–2007 Unicode, Inc. 1180 Indices ABOVE, COMBINING ALMOST EQUAL TO . 034C ACCENT, MODIFIER LETTER ACUTE . 02CA ABOVE, COMBINING ANTICLOCKWISE ACCENT, MODIFIER LETTER CIRCUMFLEX . 02C6 ARROW. 20D4 ACCENT, MODIFIER LETTER CROSS . 02DF ABOVE, COMBINING BRIDGE . .0346 ACCENT, MODIFIER LETTER GRAVE . 02CB ABOVE, COMBINING CLOCKWISE ARROW . 20D5 ACCENT, MODIFIER LETTER LOW ACUTE . 02CF ABOVE, COMBINING COMMA. .0313 ACCENT, MODIFIER LETTER LOW GRAVE . 02CE above, combining counterclockwise arrow. 20D4 accent, spacing acute . .00B4 ABOVE, COMBINING DOT . .0307 accent, spacing circumflex. .005E ABOVE, COMBINING DOUBLE VERTICAL accent, spacing grave . 0060 LINE. 030E accent, swedish grave . 02DF ABOVE, COMBINING FOUR DOTS . .20DC ACCOUNT OF. 2100 ABOVE, COMBINING HOMOTHETIC. 034B ACKNOWLEDGE . 0006 ABOVE, COMBINING HOOK. .0309 ACKNOWLEDGE, NEGATIVE . 0015 ABOVE, COMBINING LEFT ANGLE . 031A ACKNOWLEDGE, SYMBOL FOR. 2406 ABOVE, COMBINING LEFT ARROW . 20D6 ACKNOWLEDGE, SYMBOL FOR NEGATIVE . 2415 ABOVE, COMBINING LEFT HALF RING . .0351 acrophonic symbol three, epidaurean . 205D ABOVE, COMBINING LEFT HARPOON . 20D0 actuarial bend . .20E7 ABOVE, COMBINING LEFT RIGHT ARROW. 20E1 ACUTE ACCENT. .00B4 ABOVE, COMBINING NOT TILDE . 034A ACUTE ACCENT BELOW, COMBINING . 0317 ABOVE, COMBINING REVERSED COMMA . .0314 ACUTE ACCENT, COMBINING. 0301 ABOVE, COMBINING RIGHT ARROW . 20D7 ACUTE ACCENT, COMBINING DOUBLE . .030B ABOVE, COMBINING RIGHT ARROWHEAD.0350 ACUTE ACCENT, DOUBLE. 02DD ABOVE, COMBINING RIGHT HALF RING . .0357 ACUTE ACCENT, MODIFIER LETTER . 02CA ABOVE, COMBINING RIGHT HARPOON. 20D1 ACUTE ACCENT, MODIFIER LETTER LOW . 02CF ABOVE, COMBINING RING. 030A acute accent, spacing . .00B4 ABOVE, COMBINING THREE DOTS . 20DB ACUTE TONE MARK, COMBINING. 0341 ABOVE, COMBINING TURNED COMMA. .0312 ADDAK, GURMUKHI . .0A71 ABOVE, COMBINING VERTICAL LINE . 030D ADDRESSED TO THE SUBJECT. 2101 ABOVE, COMBINING WIDE BRIDGE . 20E9 ADI SHAKTI. 262C ABOVE, COMBINING X . 033D AE, LATIN CAPITAL LETTER . 00C6 ABOVE, COMBINING ZIGZAG . 035B ae, latin capital ligature . 00C6 ABOVE, DOT . 02D9 AE, LATIN LETTER SMALL CAPITAL . 1D01 above, double dot . .0308 AE, LATIN SMALL LETTER. .00E6 ABOVE, RING . 02DA AE, LATIN SMALL LETTER TURNED. 1D02 above, v . 030C ae, latin small ligature . .00E6 absolute continuity . 2AA1 AEGEAN WORD SEPARATOR DOT . 10101 absolute value . 007C AEGEAN WORD SEPARATOR LINE . 10100 abstract syntax bracket, left . 301A AESCULAPIUS, STAFF OF. 2695 abstract syntax bracket, right. 301B AFGHANI SIGN. .060B abzüglich . .2052 African Letters for Clicks . 01C0 AC CURRENT. 23E6 AIN, LATIN LETTER. 1D25 ACCENT BELOW, COMBINING ACUTE . .0317 Ainu, Katakana Extensions for . 31F0 ACCENT BELOW, COMBINING AIRPLANE . 2708 CIRCUMFLEX . 032D AKTIESELSKAB . 214D ACCENT BELOW, COMBINING GRAVE. .0316 AL-LAKUNA, SINHALA SIGN . .0DCA