The Unicode Standard, Version 5.0--Electronic Edition

Total Page:16

File Type:pdf, Size:1020Kb

The Unicode Standard, Version 5.0--Electronic Edition Electronic Edition This file is part of the electronic edition of The Unicode Standard, Version 5.0, provided for online access, content searching, and accessibility. It may not be printed. Bookmarks linking to specific chapters or sections of the whole Unicode Standard are available at http://www.unicode.org/versions/Unicode5.0.0/bookmarks.html Purchasing the Book For convenient access to the full text of the standard as a useful reference book, we recommend pur- chasing the printed version. The book is available from the Unicode Consortium, the publisher, and booksellers. Purchase of the standard in book format contributes to the ongoing work of the Uni- code Consortium. Details about the book publication and ordering information may be found at http://www.unicode.org/book/aboutbook.html Joining Unicode You or your organization may benefit by joining the Unicode Consortium: for more information, see Joining the Unicode Consortium at http://www.unicode.org/consortium/join.html This PDF file is an excerpt from The Unicode Standard, Version 5.0, issued by the Unicode Consortiu- mand published by Addison-Wesley. The material has been modified slightly for this electronic edi- ton, however, the PDF files have not been modified to reflect the corrections found on the Updates and Errata page (http://www.unicode.org/errata/). For information on more recent versions of the standard, see http://www.unicode.org/versions/enumeratedversions.html. Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and the publisher was aware of a trade- mark claim, the designations have been printed with initial capital letters or in all capitals. The Unicode® Consortium is a registered trademark, and Unicode™ is a trademark of Unicode, Inc. The Unicode logo is a trademark of Unicode, Inc., and may be registered in some jurisdictions. The authors and publisher have taken care in the preparation of this book, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode®, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. Dai Kan-Wa Jiten, used as the source of reference Kanji codes, was written by Tetsuji Morohashi and published by Taishukan Shoten. Cover and CD-ROM label design: Steve Mehallo, www.mehallo.com The publisher offers excellent discounts on this book when ordered in quantity for bulk purchases or special sales, which may include electronic versions and/or custom covers and content particular to your business, training goals, marketing focus, and branding interests. For more information, please contact U.S. Corporate and Government Sales, (800) 382-3419, [email protected]. For sales outside the United States please contact International Sales, [email protected] Visit us on the Web: www.awprofessional.com Library of Congress Cataloging-in-Publication Data The Unicode Standard / the Unicode Consortium ; edited by Julie D. Allen ... [et al.]. — Version 5.0. p. cm. Includes bibliographical references and index. ISBN 0-321-48091-0 (hardcover : alk. paper) 1. Unicode (Computer character set) I. Allen, Julie D. II. Unicode Consortium. QA268.U545 2007 005.7'22—dc22 2006023526 Copyright © 1991–2007 Unicode, Inc. All rights reserved. Printed in the United States of America. This publication is protected by copy- right, and permission must be obtained from the publisher prior to any prohibited reproduction, storage in a retrieval system, or transmission in any form or by any means, electronic, mechanical, photocopying, recording, or likewise. For information regarding permissions, write to Pearson Edu- cation, Inc., Rights and Contracts Department, 75 Arlington Street, Suite 300, Boston, MA 02116. Fax: (617) 848-7047 ISBN 0-321-48091-0 Text printed in the United States on recycled paper at Courier in Westford, Massachusetts. First printing, October 2006 Chapter 7 European Alphabetic Scripts 7 Modern European alphabetic scripts are derived from or influenced by the Greek script, which itself was an adaptation of the Phoenician alphabet. A Greek innovation was writing the letters from left to right, which is the writing direction for all the scripts derived from or inspired by Greek. The European alphabetic scripts and additional characters described in this chapter are Latin Cyrillic Georgian Greek Glagolitic Modifier letters Coptic Armenian Combining marks The European scripts are all written from left to right. Many have separate lowercase and uppercase forms of the alphabet. Spaces are used to separate words. Accents and diacritical marks are used to indicate phonetic features and to extend the use of base scripts to addi- tional languages. Some of these modification marks have evolved into small free-standing signs that can be treated as characters in their own right. The Latin script is used to write or transliterate texts in a wide variety of languages. The International Phonetic Alphabet (IPA) is an extension of the Latin alphabet, enabling it to represent the phonetics of all languages. Other Latin phonetic extensions are used for the Uralic Phonetic Alphabet. The Latin alphabet is derived from the alphabet used by the Etruscans, who had adopted a Western variant of the classical Greek alphabet (Section 14.2, Old Italic). Originally it con- tained only 24 capital letters. The modern Latin alphabet as it is found in the Basic Latin block owes its appearance to innovations of scribes during the Middle Ages and practices of the early Renaissance printers. The Cyrillic script was developed in the ninth century and is also based on Greek. Like Latin, Cyrillic is used to write or transliterate texts in many languages. The Georgian and Armenian scripts were devised in the fifth century and are influenced by Greek. Modern Georgian does not have separate uppercase and lowercase forms. The Coptic script was the last stage in the development of Egyptian writing. It represented the adaptation of the Greek alphabet to writing Egyptian, with the retention of forms from Demotic for sounds not adequately represented by Greek letters. Although primarily used The Unicode Standard 5.0 – Electronic edition Copyright © 1991–2007 Unicode, Inc. 226 European Alphabetic Scripts in Egypt from the fourth to the tenth century, it is described in this chapter because of its close relationship to the Greek script. Glagolitic is an early Slavic script related in some ways to both the Greek and the Cyrillic scripts. It was widely used in the Balkans but gradually died out, surviving the longest in Croatia. Like Coptic, however, it still has some modern use in liturgical contexts. This chapter also describes modifier letters and combining marks used with the Latin script and other scripts. The block descriptions for other archaic European alphabetic scripts, such as Gothic, Ogham, Old Italic, and Runic, can be found in Chapter 14, Archaic Scripts. 7.1 Latin The Latin script was derived from the Greek script. Today it is used to write a wide variety of languages all over the world. In the process of adapting it to other languages, numerous extensions have been devised. The most common is the addition of diacritical marks. Fur- thermore, the creation of digraphs, inverse or reverse forms, and outright new characters have all been used to extend the Latin script. The Latin script is written in linear sequence from left to right. Spaces are used to separate words and provide the primary line breaking opportunities. Hyphens are used where lines are broken in the middle of a word. (For more information, see Unicode Standard Annex #14, “Line Breaking Properties.”) Latin letters come in uppercase and lowercase pairs. Languages. Some indication of language or other usage is given for many characters within the names lists accompanying the character charts. Diacritical Marks. Speakers of different languages treat the addition of a diacritical mark to a base letter differently. In some languages, the combination is treated as a letter in the alphabet for the language. In others, such as English, the same words can often be spelled with and without the diacritical mark without implying any difference. Most languages that use the Latin script treat letters with diacritical marks as variations of the base letter, but do not accord the combination the full status of an independent letter in the alphabet. Widely used accented character combinations are provided as single characters to accom- modate interoperation with pervasive practice in legacy encodings. Combining diacritical marks can express these and all other accented letters as combining character sequences. In the Unicode Standard, all diacritical marks are encoded in sequence after the base char- acters to which they apply. For more details, see the subsection “Combining Diacritical Marks” in Section 7.9, Combining Marks, and also Section 2.11, Combining Characters. Alternative Glyphs. Some characters have alternative representations, although they have a common semantic. In such cases, a preferred glyph is chosen to represent the character in the code charts, even though it may not be the form used under all circumstances. Some Copyright © 1991-2007, Unicode, Inc. The Unicode Standard 5.0 – Electronic edition 7.1 Latin 227 Latin examples to illustrate this point are provided in Figure 7-1 and discussed in the text that follows. Figure 7-1. Alternative Glyphs in Latin a a g g @ A U S T W V " C D, L R Common typographical variations of basic Latin letters include the open- and closed-loop forms of the lowercase letters “a” and “g”, as shown in the first example in Figure 7-1.
Recommended publications
  • Armenian Secret and Invented Languages and Argots
    Armenian Secret and Invented Languages and Argots The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters Citation Russell, James R. Forthcoming. Armenian secret and invented languages and argots. Proceedings of the Institute of Linguistics of the Russian Academy of Sciences. Citable link http://nrs.harvard.edu/urn-3:HUL.InstRepos:9938150 Terms of Use This article was downloaded from Harvard University’s DASH repository, and is made available under the terms and conditions applicable to Open Access Policy Articles, as set forth at http:// nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-of- use#OAP 1 ARMENIAN SECRET AND INVENTED LANGUAGES AND ARGOTS. By James R. Russell, Harvard University. Светлой памяти Карена Никитича Юзбашяна посвящается это исследование. CONTENTS: Preface 1. Secret languages and argots 2. Philosophical and hypothetical languages 3. The St. Petersburg Manuscript 4. The Argot of the Felt-Beaters 5. Appendices: 1. Description of St. Petersburg MS A 29 2. Glossary of the Ṙuštuni language 3. Glossary of the argot of the Felt-Beaters of Moks 4. Texts in the “Third Script” of MS A 29 List of Plates Bibliography PREFACE Much of the research for this article was undertaken in Armenia and Russia in June and July 2011 and was funded by a generous O’Neill grant through the Davis Center for Russian and Eurasian Studies at Harvard. For their eager assistance and boundless hospitality I am grateful to numerous friends and colleagues who made my visit pleasant and successful. For their generous assistance in Erevan and St.
    [Show full text]
  • List of Aleph Tables
    List of Aleph Tables Version 22 CONFIDENTIAL INFORMATION The information herein is the property of Ex Libris Ltd. or its affiliates and any misuse or abuse will result in economic loss. DO NOT COPY UNLESS YOU HAVE BEEN GIVEN SPECIFIC WRITTEN AUTHORIZATION FROM EX LIBRIS LTD. This document is provided for limited and restricted purposes in accordance with a binding contract with Ex Libris Ltd. or an affiliate. The information herein includes trade secrets and is confidential. DISCLAIMER The information in this document will be subject to periodic change and updating. Please confirm that you have the most current documentation. There are no warranties of any kind, express or implied, provided in this documentation, other than those expressly agreed upon in the applicable Ex Libris contract. This information is provided AS IS. Unless otherwise agreed, Ex Libris shall not be liable for any damages for use of this document, including, without limitation, consequential, punitive, indirect or direct damages. Any references in this document to third-party material (including third-party Web sites) are provided for convenience only and do not in any manner serve as an endorsement of that third-party material or those Web sites. The third-party materials are not part of the materials for this Ex Libris product and Ex Libris has no liability for such materials. TRADEMARKS "Ex Libris," the Ex Libris bridge , Primo, Aleph, Alephino, Voyager, SFX, MetaLib, Verde, DigiTool, Preservation, URM, Voyager, ENCompass, Endeavor eZConnect, WebVoyage, Citation Server, LinkFinder and LinkFinder Plus, and other marks are trademarks or registered trademarks of Ex Libris Ltd.
    [Show full text]
  • Nl 6 1999-2000
    & ST. SHENOUDA COPTIC NEWSLETTER SUBSCRIBER'S EDITION Quarterly Newsletter Published by the St. Shenouda Center for Coptic Studies 1494 S. Robertson Blvd., Ste. 204, LA, CA 90035 Tel: (310) 271-8329 Fax: (310) 558-1863 Mailing Address: 1701 So. Wooster St. Los Angeles, CA 90035, U.S.A. October, 1999 Volume 6(N.S. 3), No. 1 In This Issue: The Second St. Shenouda Conference of Coptic Studies (4) by Hany N. Takla ............1 Conference Abstracts (2) by Hany N. Takla ...................................................................7 The 7th International Congress of Coptic Studies by Dr. J. van der Vliet......................10 A Tribute to Professor Paul van Moorsel by Dr. Mat Immerzeel ...................................12 News by Hany N. Takla ..................................................................................................14 The Second St. Shenouda Conference of Coptic StudiesNewsletter (August 13 - 14, 1999 - Los Angeles, California) (4) (by Hany N. Takla) Introduction: For a second time in as many years, scholar, Bishop Samuel of Shibin al-Qanatar, the Society held its annual Conference of Coptic Egypt. Notably present was Prof. James Robinson, Studies. This time it was held at, its probable the retired director of the Claremont Institute for permanent future site, the Campus of the CopticChristianity and Antiquity (ICA). University of California, Los Angeles (UCLA). Several of the presenters came from different parts As planned, this gathering brought together several of the United States: Prof. Boulos Ayad Ayad, segments of the population that had the common Boulder Co; Dr. Bastiaan Van Elderen, Grand interest of Coptic Studies. This mixture of the Haven MI; Dr. Fawzy Estafanous, Cleveland OH; young and old, the amateurs and professionals, and Mr.
    [Show full text]
  • The Unicode Cookbook for Linguists: Managing Writing Systems Using Orthography Profiles
    Zurich Open Repository and Archive University of Zurich Main Library Strickhofstrasse 39 CH-8057 Zurich www.zora.uzh.ch Year: 2017 The Unicode Cookbook for Linguists: Managing writing systems using orthography profiles Moran, Steven ; Cysouw, Michael DOI: https://doi.org/10.5281/zenodo.290662 Posted at the Zurich Open Repository and Archive, University of Zurich ZORA URL: https://doi.org/10.5167/uzh-135400 Monograph The following work is licensed under a Creative Commons: Attribution 4.0 International (CC BY 4.0) License. Originally published at: Moran, Steven; Cysouw, Michael (2017). The Unicode Cookbook for Linguists: Managing writing systems using orthography profiles. CERN Data Centre: Zenodo. DOI: https://doi.org/10.5281/zenodo.290662 The Unicode Cookbook for Linguists Managing writing systems using orthography profiles Steven Moran & Michael Cysouw Change dedication in localmetadata.tex Preface This text is meant as a practical guide for linguists, and programmers, whowork with data in multilingual computational environments. We introduce the basic concepts needed to understand how writing systems and character encodings function, and how they work together. The intersection of the Unicode Standard and the International Phonetic Al- phabet is often not met without frustration by users. Nevertheless, thetwo standards have provided language researchers with a consistent computational architecture needed to process, publish and analyze data from many different languages. We bring to light common, but not always transparent, pitfalls that researchers face when working with Unicode and IPA. Our research uses quantitative methods to compare languages and uncover and clarify their phylogenetic relations. However, the majority of lexical data available from the world’s languages is in author- or document-specific orthogra- phies.
    [Show full text]
  • On the Use of Coptic Numerals in Egypt in the 16 Th Century
    ON THE USE OF COPTIC NUMERALS IN EGYPT IN THE 16 TH CENTURY Mutsuo KAWATOKO* I. Introduction According to the researches, it is assumed that the culture of the early Islamic period in Egypt was very similar to the contemporary Coptic (Qibti)/ Byzantine (Rumi) culture. This is most evident in their language, especially in writing. It was mainly Greek and Coptic which adopted the letters deriving from Greek and Demotic. Thus, it was normal in those days for the official documents to be written in Greek, and, the others written in Coptic.(1) Gold, silver and copper coins were also minted imitating Byzantine Solidus (gold coin) and Follis (copper coin) and Sassanian Drahm (silver coin), and they were sometimes decorated with the representation of the religious legends, such as "Allahu", engraved in a blank space. In spite of such situation, around A. H. 79 (698), Caliph 'Abd al-Malik b. Marwan implemented the coinage reformation to promote Arabisation of coins, and in A. H. 87 (706), 'Abd Allahi b. 'Abd al-Malik, the governor- general of Egypt, pursued Arabisation of official documentation under a decree by Caliph Walid b. 'Abd al-Malik.(2) As a result, the Arabic letters came into the immediate use for the coin inscriptions and gradually for the official documents. However, when the figures were involved, the Greek or the Coptic numerals were used together with the Arabic letters.(3) The Abjad Arabic numerals were also created by assigning the numerical values to the Arabic alphabetic (abjad) letters just like the Greek numerals, but they did not spread very much.(4) It was in the latter half of the 8th century that the Indian numerals, generally regarded as the forerunners of the Arabic numerals, were introduced to the Islamic world.
    [Show full text]
  • Unicode and Code Page Support
    Natural for Mainframes Unicode and Code Page Support Version 4.2.6 for Mainframes October 2009 This document applies to Natural Version 4.2.6 for Mainframes and to all subsequent releases. Specifications contained herein are subject to change and these changes will be reported in subsequent release notes or new editions. Copyright © Software AG 1979-2009. All rights reserved. The name Software AG, webMethods and all Software AG product names are either trademarks or registered trademarks of Software AG and/or Software AG USA, Inc. Other company and product names mentioned herein may be trademarks of their respective owners. Table of Contents 1 Unicode and Code Page Support .................................................................................... 1 2 Introduction ..................................................................................................................... 3 About Code Pages and Unicode ................................................................................ 4 About Unicode and Code Page Support in Natural .................................................. 5 ICU on Mainframe Platforms ..................................................................................... 6 3 Unicode and Code Page Support in the Natural Programming Language .................... 7 Natural Data Format U for Unicode-Based Data ....................................................... 8 Statements .................................................................................................................. 9 Logical
    [Show full text]
  • Assessment of Options for Handling Full Unicode Character Encodings in MARC21 a Study for the Library of Congress
    1 Assessment of Options for Handling Full Unicode Character Encodings in MARC21 A Study for the Library of Congress Part 1: New Scripts Jack Cain Senior Consultant Trylus Computing, Toronto 1 Purpose This assessment intends to study the issues and make recommendations on the possible expansion of the character set repertoire for bibliographic records in MARC21 format. 1.1 “Encoding Scheme” vs. “Repertoire” An encoding scheme contains codes by which characters are represented in computer memory. These codes are organized according to a certain methodology called an encoding scheme. The list of all characters so encoded is referred to as the “repertoire” of characters in the given encoding schemes. For example, ASCII is one encoding scheme, perhaps the one best known to the average non-technical person in North America. “A”, “B”, & “C” are three characters in the repertoire of this encoding scheme. These three characters are assigned encodings 41, 42 & 43 in ASCII (expressed here in hexadecimal). 1.2 MARC8 "MARC8" is the term commonly used to refer both to the encoding scheme and its repertoire as used in MARC records up to 1998. The ‘8’ refers to the fact that, unlike Unicode which is a multi-byte per character code set, the MARC8 encoding scheme is principally made up of multiple one byte tables in which each character is encoded using a single 8 bit byte. (It also includes the EACC set which actually uses fixed length 3 bytes per character.) (For details on MARC8 and its specifications see: http://www.loc.gov/marc/.) MARC8 was introduced around 1968 and was initially limited to essentially Latin script only.
    [Show full text]
  • Descriptive Metadata Guidelines for RLG Cultural Materials I Many Thanks Also to These Individuals Who Reviewed the Final Draft of the Document
    ������������������������������� �������������������������� �������� ����������������������������������� ��������������������������������� ��������������������������������������� ���������������������������������������������������� ������������������������������������������������� � ���������������������������������������������� ������������������������������������������������ ����������������������������������������������������������� ������������������������������������������������������� ���������������������������������������������������� �� ���������������������������������������������� ������������������������������������������� �������������������� ������������������� ���������������������������� ��� ���������������������������������������� ����������� ACKNOWLEDGMENTS Many thanks to the members of the RLG Cultural Materials Alliance—Description Advisory Group for their participation in developing these guidelines: Ardie Bausenbach Library of Congress Karim Boughida Getty Research Institute Terry Catapano Columbia University Mary W. Elings Bancroft Library University of California, Berkeley Michael Fox Minnesota Historical Society Richard Rinehart Berkeley Art Museum & Pacific Film Archive University of California, Berkeley Elizabeth Shaw Aziza Technology Associates, LLC Neil Thomson Natural History Museum (UK) Layna White San Francisco Museum of Modern Art Günter Waibel RLG staff liaison Thanks also to RLG staff: Joan Aliprand Arnold Arcolio Ricky Erway Fae Hamilton Descriptive Metadata Guidelines for RLG Cultural Materials i Many
    [Show full text]
  • Materials of the Riga 3Rd International Conference on Hellenic Studies
    Materials of the Riga 3rd International Conference on Hellenic Studies Latvijas Universitāte Humanitāro zinātņu fakultāte Klasiskās filoloģijas katedra Hellēnistikas centrs HELLĒŅU DIMENSIJA Rīgas 3. starptautiskās hellēnistikas konferences materiāli Sastādītāji: Brigita Aleksejeva Ojārs Lāms Ilze Rūmniece Latvijas Universitāte University of Latvia Faculty of Humanities Chair of Classical Philology Centre for Hellenic Studies HELLENIC DIMENSION Materials of the Riga 3rd International Conference on Hellenic Studies Editors: Brigita Aleksejeva Ojārs Lāms Ilze Rūmniece University of Latvia UDK 930(063) He 396 The book is financially supported by the Hellenic Republic Ministry of Culture and Tourism and the University of Latvia Grāmata izdota ar Grieķijas Republikas Kultūras un tūrisma ministrijas un Latvijas Universitātes atbalstu Support for Conference Proceedings by ERAF Project Support for the international cooperation projects and other international cooperation activities in research and technology at the University of Latvia No. 2010/0202/2DP/2.1.1.2.0/10/APIA/VIAA/013 IEGULDĪJUMS TAVĀ NĀKOTNĒ Editorial board: Gunnar de Boel (Belgium) Igor Surikov (Russia) Thanassis Agathos (Greece) Kateřina Loudová (The Czech Republic) Valda Čakare (Latvia) Ojārs Lāms (Latvia) Ilze Rūmniece (Latvia) Nijolė Juchnevičienė (Lithuania) Tudor Dinu (Romania) Language editing Normunds Titāns Translating Rasma Mozere Cover design: Agris Dzilna Layout: Andra Liepiņa © Brigita Aleksejeva, Ojārs Lāms, Ilze Rūmniece, editors, 2012 © University of Latvia, 2012 ISBN 978-9984-45-469-6 CONTENTS / SATURS Introduction 8 Ievads 10 I ANCIENT TIMES SENLAIKI 11 Vassilis Patronis ECONOMIC IDEAS OF ANCIENT GREEK PHILOSOPHERS: ASSESSING THEIR IMPACT ON THE FORMATION OF THE WORLD ECONOMIC THOUGHT 12 Sengrieķu filozofu idejas par ekonomiku: izvērtējot ietekmi uz pasaules ekonomiskās domas veidošanos Nijolė Juchnevičienė HISTORIOGRAPHIC SCIENTIFIC DISCOURSE AND THE TRADITION OF GEOGRAPHY 22 Zinātniski historiogrāfiskais diskurss un ģeogrāfijas tradīcija Igor E.
    [Show full text]
  • Petit Manuel Unix®
    Août 2010 Petit Manuel Unix® Jacques MADELAINE Département d’informatique Université de CAEN 14032 CAEN CEDEX La première édition de ce manuel décrivait SMX un Unix développé à l’INRIA pour la machine française SM90, la deuxième édition une adaptation pour SPIX, un Unix pour SM90 basé sur System V et développé par Bull. Il a été ensuite modifié et corrigé pour SunOS l’Unix de Sun Microsystems, puis pour Solaris. La cinquième version a été adaptée pour tenir compte des particularités du système GNU-Linux. Un chapitre supplémentaire dédié aux accès réseau a été ensuite ajouté. Rappelons que presque toutes les commandes décrites vont fonctionner comme indiqué sur tout système Unix commercial (Solaris, HP-UX, AIX, ...) ou libre (Linux, OpenBSD, FreeBSD, NetBSD, ...). Mes remerciements à Sara Aubry pour sa relecture attentive etàFrançois Girault pour avoir fourni la mise en tableau des commandes d’emacs. Mes remerciements à Davy Gigan pour m’avoir poussé à publier la version html en octobre 2003. 1 INTRODUCTION() INTRODUCTION() 2Petit manuel Unix 2002 INTRODUCTION NOM intro − introduction to the mini manual − introduction au petit manuel DESCRIPTION Ce manuel donne les principales commandes de Unix. Unix est une famille de systèmes d’exploitation ; les commandes décrites existent, sauf précision contraire, sous Linux et Solaris, les deux systèmes disponibles au département. Seules les principales options sont données, reportez-vous au manuel en ligne pour une liste exhaustive.Chaque commande est décrite par trois sections : NOM qui donne le nom de la commande, son nom en anglais (le nom Unix étant un mnémonique anglais ne correspondant pas toujours bien aveclefrançais) et en français.
    [Show full text]
  • ISO/IEC JTC1/SC2/WG2 N 2005 Date: 1999-05-29
    ISO INTERNATIONAL ORGANIZATION FOR STANDARDIZATION ORGANISATION INTERNATIONALE DE NORMALISATION --------------------------------------------------------------------------------------- ISO/IEC JTC1/SC2/WG2 Universal Multiple-Octet Coded Character Set (UCS) -------------------------------------------------------------------------------- ISO/IEC JTC1/SC2/WG2 N 2005 Date: 1999-05-29 TITLE: ISO/IEC 10646-1 Second Edition text, Draft 2 SOURCE: Bruce Paterson, project editor STATUS: Working paper of JTC1/SC2/WG2 ACTION: For review and comment by WG2 DISTRIBUTION: Members of JTC1/SC2/WG2 1. Scope This paper provides a second draft of the text sections of the Second Edition of ISO/IEC 10646-1. It replaces the previous paper WG2 N 1796 (1998-06-01). This draft text includes: - Clauses 1 to 27 (replacing the previous clauses 1 to 26), - Annexes A to R (replacing the previous Annexes A to T), and is attached here as “Draft 2 for ISO/IEC 10646-1 : 1999” (pages ii & 1 to 77). Published and Draft Amendments up to Amd.31 (Tibetan extended), Technical Corrigenda nos. 1, 2, and 3, and editorial corrigenda approved by WG2 up to 1999-03-15, have been applied to the text. The draft does not include: - character glyph tables and name tables (these will be provided in a separate WG2 document from AFII), - the alphabetically sorted list of character names in Annex E (now Annex G), - markings to show the differences from the previous draft. A separate WG2 paper will give the editorial corrigenda applied to this text since N 1796. The editorial corrigenda are as agreed at WG2 meetings #34 to #36. Editorial corrigenda applicable to the character glyph tables and name tables, as listed in N1796 pages 2 to 5, have already been applied to the draft character tables prepared by AFII.
    [Show full text]
  • Unicode Alphabets for L ATEX
    Unicode Alphabets for LATEX Specimen Mikkel Eide Eriksen March 11, 2020 2 Contents MUFI 5 SIL 21 TITUS 29 UNZ 117 3 4 CONTENTS MUFI Using the font PalemonasMUFI(0) from http://mufi.info/. Code MUFI Point Glyph Entity Name Unicode Name E262 � OEligogon LATIN CAPITAL LIGATURE OE WITH OGONEK E268 � Pdblac LATIN CAPITAL LETTER P WITH DOUBLE ACUTE E34E � Vvertline LATIN CAPITAL LETTER V WITH VERTICAL LINE ABOVE E662 � oeligogon LATIN SMALL LIGATURE OE WITH OGONEK E668 � pdblac LATIN SMALL LETTER P WITH DOUBLE ACUTE E74F � vvertline LATIN SMALL LETTER V WITH VERTICAL LINE ABOVE E8A1 � idblstrok LATIN SMALL LETTER I WITH TWO STROKES E8A2 � jdblstrok LATIN SMALL LETTER J WITH TWO STROKES E8A3 � autem LATIN ABBREVIATION SIGN AUTEM E8BB � vslashura LATIN SMALL LETTER V WITH SHORT SLASH ABOVE RIGHT E8BC � vslashuradbl LATIN SMALL LETTER V WITH TWO SHORT SLASHES ABOVE RIGHT E8C1 � thornrarmlig LATIN SMALL LETTER THORN LIGATED WITH ARM OF LATIN SMALL LETTER R E8C2 � Hrarmlig LATIN CAPITAL LETTER H LIGATED WITH ARM OF LATIN SMALL LETTER R E8C3 � hrarmlig LATIN SMALL LETTER H LIGATED WITH ARM OF LATIN SMALL LETTER R E8C5 � krarmlig LATIN SMALL LETTER K LIGATED WITH ARM OF LATIN SMALL LETTER R E8C6 UU UUlig LATIN CAPITAL LIGATURE UU E8C7 uu uulig LATIN SMALL LIGATURE UU E8C8 UE UElig LATIN CAPITAL LIGATURE UE E8C9 ue uelig LATIN SMALL LIGATURE UE E8CE � xslashlradbl LATIN SMALL LETTER X WITH TWO SHORT SLASHES BELOW RIGHT E8D1 æ̊ aeligring LATIN SMALL LETTER AE WITH RING ABOVE E8D3 ǽ̨ aeligogonacute LATIN SMALL LETTER AE WITH OGONEK AND ACUTE 5 6 CONTENTS
    [Show full text]