Phonetic Extensions Supplement Range: 1D80–1DBF

Total Page:16

File Type:pdf, Size:1020Kb

Phonetic Extensions Supplement Range: 1D80–1DBF Phonetic Extensions Supplement Range: 1D80–1DBF This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 14.0 This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard. See https://www.unicode.org/errata/ for an up-to-date list of errata. See https://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See https://www.unicode.org/charts/PDF/Unicode-14.0/ for charts showing only the characters added in Unicode 14.0. See https://www.unicode.org/Public/14.0.0/charts/ for a complete archived file of character code charts for Unicode 14.0. Disclaimer These charts are provided as the online reference to the character contents of the Unicode Standard, Version 14.0 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 14.0, online at https://www.unicode.org/versions/Unicode14.0.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, #44, #45, and #50, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online. See https://www.unicode.org/ucd/ and https://www.unicode.org/reports/ A thorough understanding of the information contained in these additional sources is required for a successful implementation. Copying characters from the character code tables or list of character names is not recommended, because for production reasons the PDF files for the code charts cannot guarantee that the correct character codes will always be copied. Fonts The shapes of the reference glyphs used in these code charts are not prescriptive. Considerable variation is to be expected in actual fonts. The particular fonts used in these charts were provided to the Unicode Consortium by a number of different font designers, who own the rights to the fonts. See https://www.unicode.org/charts/fonts.html for a list. Terms of Use You may freely use these code charts for personal or internal business uses only. You may not incorporate them either wholly or in part into any product or publication, or otherwise distribute them without express written permission from the Unicode Consortium. However, you may provide links to these charts. The fonts and font data used in production of these code charts may NOT be extracted, or used in any other way in any product or publication, without permission or license granted by the typeface owner(s). The Unicode Consortium is not liable for errors or omissions in this file or the standard itself. Information on characters added to the Unicode Standard since the publication of the most recent version of the Unicode Standard, as well as on characters currently being considered for addition to the Unicode Standard can be found on the Unicode web site. See https://www.unicode.org/pending/pending.html and https://www.unicode.org/alloc/Pipeline.html. Copyright © 1991-2021 Unicode, Inc. All rights reserved. 1D80 Phonetic Extensions Supplement 1DBF 1D8 1D9 1DA 1DB 0 ᶀ ᶐ ᶠ ᶰ 1D80 1D90 1DA0 1DB0 1 ᶁ ᶑ ᶡ ᶱ 1D81 1D91 1DA1 1DB1 2 ᶂ ᶒ ᶢ ᶲ 1D82 1D92 1DA2 1DB2 3 ᶃ ᶓ ᶣ ᶳ 1D83 1D93 1DA3 1DB3 4 ᶄ ᶔ ᶤ ᶴ 1D84 1D94 1DA4 1DB4 5 ᶅ ᶕ ᶥ ᶵ 1D85 1D95 1DA5 1DB5 6 ᶆ ᶖ ᶦ ᶶ 1D86 1D96 1DA6 1DB6 7 ᶇ ᶗ ᶧ ᶷ 1D87 1D97 1DA7 1DB7 8 ᶈ ᶘ ᶨ ᶸ 1D88 1D98 1DA8 1DB8 9 ᶉ ᶙ ᶩ ᶹ 1D89 1D99 1DA9 1DB9 A ᶊ ᶚ ᶪ ᶺ 1D8A 1D9A 1DAA 1DBA B ᶋ ᶛ ᶫ ᶻ 1D8B 1D9B 1DAB 1DBB C ᶌ ᶜ ᶬ ᶼ 1D8C 1D9C 1DAC 1DBC D ᶍ ᶝ ᶭ ᶽ 1D8D 1D9D 1DAD 1DBD E ᶎ ᶞ ᶮ ᶾ 1D8E 1D9E 1DAE 1DBE F ᶏ ᶟ ᶯ ᶿ 1D8F 1D9F 1DAF 1DBF The Unicode Standard 14.0, Copyright © 1991-2021 Unicode, Inc. All rights reserved. 1D80 Phonetic Extensions Supplement 1DB0 Latin letters with palatal hook 1D9A ᶚ LATIN SMALL LETTER EZH WITH RETROFLEX HOOK An additional letter with palatal hook is found in another block. Modifier letters → 01AB ƫ latin small letter t with palatal hook Other modifier letters can be found in the Spacing Modifier 1D80 ᶀ LATIN SMALL LETTER B WITH PALATAL HOOK Letters, Phonetic Extensions, as well as Superscripts and 1D81 ᶁ LATIN SMALL LETTER D WITH PALATAL HOOK Subscripts blocks. 1D82 ᶂ LATIN SMALL LETTER F WITH PALATAL HOOK 1D9B ᶛ MODIFIER LETTER SMALL TURNED ALPHA 1D83 ᶃ LATIN SMALL LETTER G WITH PALATAL HOOK ≈ <super> 0252 ɒ 1D84 ᶄ LATIN SMALL LETTER K WITH PALATAL HOOK 1D9C ᶜ MODIFIER LETTER SMALL C 1D85 ᶅ LATIN SMALL LETTER L WITH PALATAL HOOK ≈ <super> 0063 c 1D86 ᶆ LATIN SMALL LETTER M WITH PALATAL HOOK 1D9D ᶝ MODIFIER LETTER SMALL C WITH CURL 1D87 ᶇ LATIN SMALL LETTER N WITH PALATAL HOOK ≈ <super> 0255 ɕ 1D88 ᶈ LATIN SMALL LETTER P WITH PALATAL HOOK 1D9E ᶞ MODIFIER LETTER SMALL ETH 1D89 ᶉ LATIN SMALL LETTER R WITH PALATAL HOOK ≈ <super> 00F0 ð 1D8A ᶊ LATIN SMALL LETTER S WITH PALATAL HOOK 1D9F ᶟ MODIFIER LETTER SMALL REVERSED OPEN E 1D8B ᶋ LATIN SMALL LETTER ESH WITH PALATAL ≈ <super> 025C ɜ HOOK 1DA0 ᶠ MODIFIER LETTER SMALL F 1D8C ᶌ LATIN SMALL LETTER V WITH PALATAL HOOK ≈ <super> 0066 f 1D8D ᶍ LATIN SMALL LETTER X WITH PALATAL HOOK 1DA1 ᶡ MODIFIER LETTER SMALL DOTLESS J WITH 1D8E ᶎ LATIN SMALL LETTER Z WITH PALATAL HOOK STROKE • uppercase is A7C6 Ᶎ ≈ <super> 025F ɟ 1DA2 ᶢ MODIFIER LETTER SMALL SCRIPT G Latin letters with retroflex hook ≈ <super> 0261 ɡ IPA recommends transcribing vowels with r-coloring 1DA3 ᶣ MODIFIER LETTER SMALL TURNED H (rhoticity) with the rhotic hook instead. ≈ <super> 0265 ɥ → 02DE ˞ modifier letter rhotic hook 1DA4 ᶤ MODIFIER LETTER SMALL I WITH STROKE Additional letters with retroflex hook are found in other ≈ <super> 0268 ɨ blocks. 1DA5 ᶥ MODIFIER LETTER SMALL IOTA → 01AE Ʈ latin capital letter t with retroflex hook ≈ <super> 0269 ɩ → 0256 ɖ latin small letter d with tail 1DA6 ᶦ MODIFIER LETTER SMALL CAPITAL I → 026D ɭ latin small letter l with retroflex hook • not for use in UPA → 0273 ɳ latin small letter n with retroflex hook → 1D35 ᴵ modifier letter capital i → 027B ɻ latin small letter turned r with hook ≈ <super> 026A ɪ 027D ɽ latin small letter r with tail 1DA7 ᶧ MODIFIER LETTER SMALL CAPITAL I WITH → STROKE → 0282 ʂ latin small letter s with hook <super> 1D7B ᵻ → 0285 ʅ latin small letter squat reversed esh ≈ 1DA8 ᶨ MODIFIER LETTER SMALL J WITH CROSSED- → 0288 ʈ latin small letter t with retroflex hook TAIL 0290 ʐ latin small letter z with retroflex hook → <super> 029D ʝ latin small letter turned h with fishhook and tail ≈ → 02AF ʯ 1DA9 ᶩ MODIFIER LETTER SMALL L WITH RETROFLEX 1D8F ᶏ LATIN SMALL LETTER A WITH RETROFLEX HOOK HOOK <super> 026D ɭ LATIN SMALL LETTER ALPHA WITH RETROFLEX ≈ 1D90 ᶐ 1DAA ᶪ MODIFIER LETTER SMALL L WITH PALATAL HOOK HOOK 1D91 ᶑ LATIN SMALL LETTER D WITH HOOK AND TAIL ≈ <super> 1D85 ᶅ 1D92 ᶒ LATIN SMALL LETTER E WITH RETROFLEX MODIFIER LETTER SMALL CAPITAL L HOOK 1DAB ᶫ • not for use in UPA 1D93 ᶓ LATIN SMALL LETTER OPEN E WITH RETROFLEX HOOK → 1D38 ᴸ modifier letter capital l <super> 029F ʟ 1D94 ᶔ LATIN SMALL LETTER REVERSED OPEN E WITH ≈ RETROFLEX HOOK 1DAC ᶬ MODIFIER LETTER SMALL M WITH HOOK ≈ <super> 0271 ɱ → 025D ɝ latin small letter reversed open e with MODIFIER LETTER SMALL TURNED M WITH hook 1DAD ᶭ LONG LEG 1D95 ᶕ LATIN SMALL LETTER SCHWA WITH RETROFLEX HOOK ≈ <super> 0270 ɰ 1DAE ᶮ MODIFIER LETTER SMALL N WITH LEFT HOOK → 025A ɚ latin small letter schwa with hook 1D96 ᶖ LATIN SMALL LETTER I WITH RETROFLEX HOOK ≈ <super> 0272 ɲ MODIFIER LETTER SMALL N WITH RETROFLEX 1D97 ᶗ LATIN SMALL LETTER OPEN O WITH 1DAF ᶯ RETROFLEX HOOK HOOK 1D98 ᶘ LATIN SMALL LETTER ESH WITH RETROFLEX ≈ <super> 0273 ɳ HOOK 1DB0 ᶰ MODIFIER LETTER SMALL CAPITAL N 1D99 ᶙ LATIN SMALL LETTER U WITH RETROFLEX • not for use in UPA HOOK → 1D3A ᴺ modifier letter capital n ≈ <super> 0274 ɴ The Unicode Standard 14.0, Copyright © 1991-2021 Unicode, Inc. All rights reserved. 1DB1 Phonetic Extensions Supplement 1DBF 1DB1 ᶱ MODIFIER LETTER SMALL BARRED O ≈ <super> 0275 ɵ 1DB2 ᶲ MODIFIER LETTER SMALL PHI ≈ <super> 0278 ɸ 1DB3 ᶳ MODIFIER LETTER SMALL S WITH HOOK ≈ <super> 0282 ʂ 1DB4 ᶴ MODIFIER LETTER SMALL ESH ≈ <super> 0283 ʃ 1DB5 ᶵ MODIFIER LETTER SMALL T WITH PALATAL HOOK ≈ <super> 01AB ƫ 1DB6 ᶶ MODIFIER LETTER SMALL U BAR ≈ <super> 0289 ʉ 1DB7 ᶷ MODIFIER LETTER SMALL UPSILON ≈ <super> 028A ʊ 1DB8 ᶸ MODIFIER LETTER SMALL CAPITAL U • not for use in UPA → 1D41 ᵁ modifier letter capital u ≈ <super> 1D1C ᴜ 1DB9 ᶹ MODIFIER LETTER SMALL V WITH HOOK ≈ <super> 028B ʋ 1DBA ᶺ MODIFIER LETTER SMALL TURNED V ≈ <super> 028C ʌ 1DBB ᶻ MODIFIER LETTER SMALL Z ≈ <super> 007A z 1DBC ᶼ MODIFIER LETTER SMALL Z WITH RETROFLEX HOOK ≈ <super> 0290 ʐ 1DBD ᶽ MODIFIER LETTER SMALL Z WITH CURL ≈ <super> 0291 ʑ 1DBE ᶾ MODIFIER LETTER SMALL EZH ≈ <super> 0292 ʒ 1DBF ᶿ MODIFIER LETTER SMALL THETA ≈ <super> 03B8 θ The Unicode Standard 14.0, Copyright © 1991-2021 Unicode, Inc.
Recommended publications
  • 8 December 2004 (Revised 10 January 2005) Topic: Unicode Technical Meeting #101, 15 -18 November 2004, Cupertino, California
    To: LSA and UC Berkeley Communities From: Deborah Anderson, UCB representative and LSA liaison Date: 8 December 2004 (revised 10 January 2005) Topic: Unicode Technical Meeting #101, 15 -18 November 2004, Cupertino, California As the UC Berkeley representative and LSA liaison, I am most interested in the proposals for new characters and scripts that were discussed at the UTC, so these topics are the focus of this report. For the full minutes, readers should consult the "Unicode Technical Committee Minutes" web page (http://www.unicode.org/consortum/utc-minutes.html), where the minutes from this meeting will be posted several weeks hence. I. Proposals for New Scripts and Additional Characters A summary of the proposals and the UTC's decisions are listed below. As the proposals discussed below are made public, I will post the URLs on the SEI web page (www.linguistics.berkeley.edu/sei). A. Linguistics Characters Lorna Priest of SIL International submitted three proposals for additional linguistics characters. Most of the characters proposed are used in the orthographies of languages from Africa, Asia, Mexico, Central and South America. (For details on the proposed characters, with a description of their use and an image, see the appendix to this document.) Two characters from these proposals were not approved by the UTC because there are already characters encoded that are very similar. The evidence did not adequately demonstrate that the proposed characters are used distinctively. The two problematical proposed characters were: the modifier straight letter apostrophe (used for a glottal stop, similar to ' APOSTROPHE U+0027) and the Latin small "at" sign (used for Arabic loanwords in an orthography for the Koalib language from the Sudan, similar to @ COMMERCIAL AT U+0040).
    [Show full text]
  • The Unicode Cookbook for Linguists: Managing Writing Systems Using Orthography Profiles
    Zurich Open Repository and Archive University of Zurich Main Library Strickhofstrasse 39 CH-8057 Zurich www.zora.uzh.ch Year: 2017 The Unicode Cookbook for Linguists: Managing writing systems using orthography profiles Moran, Steven ; Cysouw, Michael DOI: https://doi.org/10.5281/zenodo.290662 Posted at the Zurich Open Repository and Archive, University of Zurich ZORA URL: https://doi.org/10.5167/uzh-135400 Monograph The following work is licensed under a Creative Commons: Attribution 4.0 International (CC BY 4.0) License. Originally published at: Moran, Steven; Cysouw, Michael (2017). The Unicode Cookbook for Linguists: Managing writing systems using orthography profiles. CERN Data Centre: Zenodo. DOI: https://doi.org/10.5281/zenodo.290662 The Unicode Cookbook for Linguists Managing writing systems using orthography profiles Steven Moran & Michael Cysouw Change dedication in localmetadata.tex Preface This text is meant as a practical guide for linguists, and programmers, whowork with data in multilingual computational environments. We introduce the basic concepts needed to understand how writing systems and character encodings function, and how they work together. The intersection of the Unicode Standard and the International Phonetic Al- phabet is often not met without frustration by users. Nevertheless, thetwo standards have provided language researchers with a consistent computational architecture needed to process, publish and analyze data from many different languages. We bring to light common, but not always transparent, pitfalls that researchers face when working with Unicode and IPA. Our research uses quantitative methods to compare languages and uncover and clarify their phylogenetic relations. However, the majority of lexical data available from the world’s languages is in author- or document-specific orthogra- phies.
    [Show full text]
  • The Coinage of Akragas C
    ACTA UNIVERSITATIS UPSALIENSIS Studia Numismatica Upsaliensia 6:1 STUDIA NUMISMATICA UPSALIENSIA 6:1 The Coinage of Akragas c. 510–406 BC Text and Plates ULLA WESTERMARK I STUDIA NUMISMATICA UPSALIENSIA Editors: Harald Nilsson, Hendrik Mäkeler and Ragnar Hedlund 1. Uppsala University Coin Cabinet. Anglo-Saxon and later British Coins. By Elsa Lindberger. 2006. 2. Münzkabinett der Universität Uppsala. Deutsche Münzen der Wikingerzeit sowie des hohen und späten Mittelalters. By Peter Berghaus and Hendrik Mäkeler. 2006. 3. Uppsala universitets myntkabinett. Svenska vikingatida och medeltida mynt präglade på fastlandet. By Jonas Rundberg and Kjell Holmberg. 2008. 4. Opus mixtum. Uppsatser kring Uppsala universitets myntkabinett. 2009. 5. ”…achieved nothing worthy of memory”. Coinage and authority in the Roman empire c. AD 260–295. By Ragnar Hedlund. 2008. 6:1–2. The Coinage of Akragas c. 510–406 BC. By Ulla Westermark. 2018 7. Musik på medaljer, mynt och jetonger i Nils Uno Fornanders samling. By Eva Wiséhn. 2015. 8. Erik Wallers samling av medicinhistoriska medaljer. By Harald Nilsson. 2013. © Ulla Westermark, 2018 Database right Uppsala University ISSN 1652-7232 ISBN 978-91-513-0269-0 urn:nbn:se:uu:diva-345876 (http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-345876) Typeset in Times New Roman by Elin Klingstedt and Magnus Wijk, Uppsala Printed in Sweden on acid-free paper by DanagårdLiTHO AB, Ödeshög 2018 Distributor: Uppsala University Library, Box 510, SE-751 20 Uppsala www.uu.se, [email protected] The publication of this volume has been assisted by generous grants from Uppsala University, Uppsala Sven Svenssons stiftelse för numismatik, Stockholm Gunnar Ekströms stiftelse för numismatisk forskning, Stockholm Faith and Fred Sandstrom, Haverford, PA, USA CONTENTS FOREWORDS .........................................................................................
    [Show full text]
  • Unicode Alphabets for L ATEX
    Unicode Alphabets for LATEX Specimen Mikkel Eide Eriksen March 11, 2020 2 Contents MUFI 5 SIL 21 TITUS 29 UNZ 117 3 4 CONTENTS MUFI Using the font PalemonasMUFI(0) from http://mufi.info/. Code MUFI Point Glyph Entity Name Unicode Name E262 � OEligogon LATIN CAPITAL LIGATURE OE WITH OGONEK E268 � Pdblac LATIN CAPITAL LETTER P WITH DOUBLE ACUTE E34E � Vvertline LATIN CAPITAL LETTER V WITH VERTICAL LINE ABOVE E662 � oeligogon LATIN SMALL LIGATURE OE WITH OGONEK E668 � pdblac LATIN SMALL LETTER P WITH DOUBLE ACUTE E74F � vvertline LATIN SMALL LETTER V WITH VERTICAL LINE ABOVE E8A1 � idblstrok LATIN SMALL LETTER I WITH TWO STROKES E8A2 � jdblstrok LATIN SMALL LETTER J WITH TWO STROKES E8A3 � autem LATIN ABBREVIATION SIGN AUTEM E8BB � vslashura LATIN SMALL LETTER V WITH SHORT SLASH ABOVE RIGHT E8BC � vslashuradbl LATIN SMALL LETTER V WITH TWO SHORT SLASHES ABOVE RIGHT E8C1 � thornrarmlig LATIN SMALL LETTER THORN LIGATED WITH ARM OF LATIN SMALL LETTER R E8C2 � Hrarmlig LATIN CAPITAL LETTER H LIGATED WITH ARM OF LATIN SMALL LETTER R E8C3 � hrarmlig LATIN SMALL LETTER H LIGATED WITH ARM OF LATIN SMALL LETTER R E8C5 � krarmlig LATIN SMALL LETTER K LIGATED WITH ARM OF LATIN SMALL LETTER R E8C6 UU UUlig LATIN CAPITAL LIGATURE UU E8C7 uu uulig LATIN SMALL LIGATURE UU E8C8 UE UElig LATIN CAPITAL LIGATURE UE E8C9 ue uelig LATIN SMALL LIGATURE UE E8CE � xslashlradbl LATIN SMALL LETTER X WITH TWO SHORT SLASHES BELOW RIGHT E8D1 æ̊ aeligring LATIN SMALL LETTER AE WITH RING ABOVE E8D3 ǽ̨ aeligogonacute LATIN SMALL LETTER AE WITH OGONEK AND ACUTE 5 6 CONTENTS
    [Show full text]
  • IPA Extensions
    IPA Extensions Unicode Character HTML 4.0 Numerical HTML encoding Name U+0250 ɐ &#592; LATIN SMALL LETTER TURNED A U+0251 ɑ &#593; LATIN SMALL LETTER ALPHA U+0252 ɒ &#594; LATIN SMALL LETTER TURNED ALPHA U+0253 ɓ &#595; LATIN SMALL LETTER B WITH HOOK U+0254 ɔ &#596; LATIN SMALL LETTER OPEN O U+0255 ɕ &#597; LATIN SMALL LETTER C WITH CURL U+0256 ɖ &#598; LATIN SMALL LETTER D WITH TAIL U+0257 ɗ &#599; LATIN SMALL LETTER D WITH HOOK U+0258 ɘ &#600; LATIN SMALL LETTER REVERSED E U+0259 ə &#601; LATIN SMALL LETTER SCHWA U+025A ɚ &#602; LATIN SMALL LETTER SCHWA WITH HOOK U+025B ɛ &#603; LATIN SMALL LETTER OPEN E U+025C ɜ &#604; LATIN SMALL LETTER REVERSED OPEN E U+025D ɝ &#605; LATIN SMALL LETTER REVERSED OPEN E WITH HOOK U+025E ɞ &#606; LATIN SMALL LETTER CLOSED REVERSED OPEN E U+025F ɟ &#607; LATIN SMALL LETTER DOTLESS J WITH STROKE U+0260 ɠ &#608; LATIN SMALL LETTER G WITH HOOK U+0261 ɡ &#609; LATIN SMALL LETTER SCRIPT G U+0262 ɢ &#610; LATIN LETTER SMALL CAPITAL G U+0263 ɣ &#611; LATIN SMALL LETTER GAMMA U+0264 ɤ &#612; LATIN SMALL LETTER RAMS HORN U+0265 ɥ &#613; LATIN SMALL LETTER TURNED H U+0266 ɦ &#614; LATIN SMALL LETTER H WITH HOOK U+0267 ɧ &#615; LATIN SMALL LETTER HENG WITH HOOK U+0268 ɨ &#616; LATIN SMALL LETTER I WITH STROKE U+0269 ɩ &#617; LATIN SMALL LETTER IOTA U+026A ɪ &#618; LATIN LETTER SMALL CAPITAL I U+026B ɫ &#619; LATIN SMALL LETTER L WITH MIDDLE TILDE U+026C ɬ &#620; LATIN SMALL LETTER L WITH BELT U+026D ɭ &#621; LATIN SMALL LETTER L WITH RETROFLEX HOOK U+026E ɮ &#622; LATIN SMALL LETTER LEZH U+026F ɯ &#623; LATIN SMALL
    [Show full text]
  • African Fonts and Open Source
    African fonts and Open Source Denis Moyogo Jacquerye September 17th 2008 ATypI ‘o8 Conference St. Petersburg, Russia, September 2008 1 African fonts and Open Source Denis Moyogo Jacquerye African fonts and Open Source This talk is about: ● African Orthographies (relevance, groups, requirements) ● Technologies for them (Unicode, OpenType) ● Implementation ● Raise awareness and interest ● Case for Open Source ATypI ‘o8 Conference St. Petersburg, Russia, September 2008 2 African fonts and Open Source Denis Moyogo Jacquerye Speaker Denis Moyogo Jacquerye ● Computer Scientist and Linguist ● Africanization consultant ● DejaVu Fonts co-leader ● African Network for Localization (ANLoc) ATypI ‘o8 Conference St. Petersburg, Russia, September 2008 3 African fonts and Open Source Denis Moyogo Jacquerye ANLoc African fonts work part of ANLoc project ● Facilitate localization ● Empowering through ICT ● Network of experts ● Sub-projects: Locales, Keyboards, Fonts, Spell checkers, Terminology, Training, Localization software, Policy. ATypI ‘o8 Conference St. Petersburg, Russia, September 2008 4 African fonts and Open Source Denis Moyogo Jacquerye African languages ● Lots of African languages (over 2000) ● 25 spoken by about half ● 80% don't have orthographies ● 20% do! ● Can emulate! ATypI ‘o8 Conference St. Petersburg, Russia, September 2008 5 African fonts and Open Source Denis Moyogo Jacquerye African languages ● Used every day by most ● Education is mostly in European language ● Used in spoken media ● Interest is rising ATypI ‘o8 Conference St. Petersburg,
    [Show full text]
  • L2/06-036: Proposal to Encode Characters for Ordbok Över Finlands
    ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS 1 FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646TP PT Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.htmlHTU UTH for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.htmlHTU .UTH See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.htmlHTU UTH for latest Roadmaps. A. Administrative 1. Title: Proposal to encode characters for Ordbok över Finlands svenska folkmål in the UCS 2. Requester's name: Therese Leinonen, Klaas Ruppel, Erkki I. Kolehmainen, Caroline Sandström 3. Requester type (Member body/Liaison/Individual contribution): Member Body 4. Submission date: 2006-01-26 5. Requester's reference (if applicable): 6. Choose one of the following: This is a complete proposal: YES (or) More information will be provided later: NO B. Technical – General 1. Choose one of the following: a. This proposal is for a new script (set of characters): NO Proposed name of script: b. The proposal is for addition of character(s) to an existing block: YES Name of the existing block: Phonetic Extensions Supplement 2. Number of characters in proposal: 3 3. Proposed category (select one from below - see section 2.2 of P&P document): A-Contemporary B.1-Specialized (small collection) B.2-Specialized (large collection) X C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols 4.
    [Show full text]
  • 1 Symbols (2286)
    1 Symbols (2286) USV Symbol Macro(s) Description 0009 \textHT <control> 000A \textLF <control> 000D \textCR <control> 0022 ” \textquotedbl QUOTATION MARK 0023 # \texthash NUMBER SIGN \textnumbersign 0024 $ \textdollar DOLLAR SIGN 0025 % \textpercent PERCENT SIGN 0026 & \textampersand AMPERSAND 0027 ’ \textquotesingle APOSTROPHE 0028 ( \textparenleft LEFT PARENTHESIS 0029 ) \textparenright RIGHT PARENTHESIS 002A * \textasteriskcentered ASTERISK 002B + \textMVPlus PLUS SIGN 002C , \textMVComma COMMA 002D - \textMVMinus HYPHEN-MINUS 002E . \textMVPeriod FULL STOP 002F / \textMVDivision SOLIDUS 0030 0 \textMVZero DIGIT ZERO 0031 1 \textMVOne DIGIT ONE 0032 2 \textMVTwo DIGIT TWO 0033 3 \textMVThree DIGIT THREE 0034 4 \textMVFour DIGIT FOUR 0035 5 \textMVFive DIGIT FIVE 0036 6 \textMVSix DIGIT SIX 0037 7 \textMVSeven DIGIT SEVEN 0038 8 \textMVEight DIGIT EIGHT 0039 9 \textMVNine DIGIT NINE 003C < \textless LESS-THAN SIGN 003D = \textequals EQUALS SIGN 003E > \textgreater GREATER-THAN SIGN 0040 @ \textMVAt COMMERCIAL AT 005C \ \textbackslash REVERSE SOLIDUS 005E ^ \textasciicircum CIRCUMFLEX ACCENT 005F _ \textunderscore LOW LINE 0060 ‘ \textasciigrave GRAVE ACCENT 0067 g \textg LATIN SMALL LETTER G 007B { \textbraceleft LEFT CURLY BRACKET 007C | \textbar VERTICAL LINE 007D } \textbraceright RIGHT CURLY BRACKET 007E ~ \textasciitilde TILDE 00A0 \nobreakspace NO-BREAK SPACE 00A1 ¡ \textexclamdown INVERTED EXCLAMATION MARK 00A2 ¢ \textcent CENT SIGN 00A3 £ \textsterling POUND SIGN 00A4 ¤ \textcurrency CURRENCY SIGN 00A5 ¥ \textyen YEN SIGN 00A6
    [Show full text]
  • FUPA) in the UCS Source: Michael Everson and Klaas Ruppel Version: 2.0 Date: 1998-11-02
    Title: Encoding Finno-Ugric Phonetic Alphabet (FUPA) in the UCS Source: Michael Everson and Klaas Ruppel Version: 2.0 Date: 1998-11-02 The Finno-Ugric Phonetic Alphabet (FUPA) 0. Introduction This paper presents a collection of characters, diacritics, and notation marks used in the FUPA scheme. This transcription is and has been used creatively by different scientists in different countries at different times (Lagercrantz probably made the most baroque use of the system); therefore the phonetic or phonematic values of the elements of the FUPA are intentionally left aside here. However, all the users of this scheme have one thing in common: they use the FUPA in a technical way, as described below. We prefer the term ÒFUPAÓ to ÒFUTÓ (Finno- Ugric Transcription) because of its analogy to the IPA (International Phonetic Alphabet). It is not the intention of this paper to encourage exact (or over-exact) phonetic transcription or the extensive use of diacritics. The intention is rather to facilitate the use of the FUPA by the Uralicist community in the context of the Universal Character Set (UCS), by ensuring that a comprehensive analysis of the system leads to the possibility of encoding FUPA texts, past and present, with the UCS. In the exploratory versions of this document, firm advice on how to use FUPA will not be given; when the study is complete, however, advice on which characters in the UCS refer to whch letters used in the FUPA system will be given. An example of this, we believe, will be the advice for the LATIN SMALL LETTER ENG to be used in coded representations of FUPA texts, past and present, for the velar nasal, in preference to the GREEK SMALL LETTER ETA, although a great many printed texts use an eta glyph (Ç) and not an eng glyph (Ë).
    [Show full text]
  • The Brill Typeface User Guide & Complete List of Characters
    The Brill Typeface User Guide & Complete List of Characters Version 2.06, October 31, 2014 Pim Rietbroek Preamble Few typefaces – if any – allow the user to access every Latin character, every IPA character, every diacritic, and to have these combine in a typographically satisfactory manner, in a range of styles (roman, italic, and more); even fewer add full support for Greek, both modern and ancient, with specialised characters that papyrologists and epigraphers need; not to mention coverage of the Slavic languages in the Cyrillic range. The Brill typeface aims to do just that, and to be a tool for all scholars in the humanities; for Brill’s authors and editors; for Brill’s staff and service providers; and finally, for anyone in need of this tool, as long as it is not used for any commercial gain.* There are several fonts in different styles, each of which has the same set of characters as all the others. The Unicode Standard is rigorously adhered to: there is no dependence on the Private Use Area (PUA), as it happens frequently in other fonts with regard to characters carrying rare diacritics or combinations of diacritics. Instead, all alphabetic characters can carry any diacritic or combination of diacritics, even stacked, with automatic correct positioning. This is made possible by the inclusion of all of Unicode’s combining characters and by the application of extensive OpenType Glyph Positioning programming. Credits The Brill fonts are an original design by John Hudson of Tiro Typeworks. Alice Savoie contributed to Brill bold and bold italic. The black-letter (‘Fraktur’) range of characters was made by Karsten Lücke.
    [Show full text]
  • A Multilingual Lexical Database Application with a Structured Interlingua
    SIMuLLDA a Multilingual Lexical Database Application using a Structured Interlingua SIMuLLDA een toepassing van een meertalig lexicaal gegevensbestand met gebruikmaking van een gestructureerde tussentaal (met een samenvatting in het Nederlands) Proefschrift ter verkrijging van de graad van doctor aan de Universiteit Utrecht op het gezag van de Rector Magnificus, Prof. dr. W.H. Gispen, ingevolge het besluit van het College voor Promoties in het openbaar te verdedigen op vrijdag 7 juni 2002 des middags te 4:15 uur door Maarten Janssen geboren op 28 januari 1971 te Nijmegen Promotoren: Prof. dr. H.J. Verkuyl UiL-OTS, Universiteit Utrecht Prof. dr. A. Visser Faculteit Wijsbegeerte, Universiteit Utrecht Contents Preface vii 1 Multilingual Lexical Databases 1 1.1 Multilingual Lexical Databases . 1 1.2 Current Approaches and their Shortcomings . 2 1.2.1 Parallel Wordlists . 2 1.2.2 Hub-and-Spoke Model . 5 1.2.3 WordNet and EuroWordNet . 9 1.2.4 Acquilex et al. 15 1.2.5 Corpus Based Approaches . 19 1.3 Conclusion to Chapter 1 . 21 2 FCA and SIMuLLDA 23 2.1 Formal Concept Analysis . 23 2.1.1 Partial Ordering . 27 2.1.2 Hasse Diagrams . 30 2.2 Connotative Context . 31 2.3 The SIMuLLDA System . 35 2.3.1 Multilinguality . 38 2.3.2 Lexical Gap Filling . 43 2.4 Formal Properties of FCA . 45 2.4.1 FCA and Lattices . 45 2.4.2 Smallest Common Concept . 46 2.4.3 Maximal Filled Sub-Tables . 46 2.4.4 Distributive and Atomic Lattices . 47 2.4.5 Extending Contexts . 48 2.4.6 Models and the Number of Concepts .
    [Show full text]
  • The Unicode Standard, Version 10.0
    Phonetic Extensions Supplement Range: 1D80–1DBF This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 10.0 This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard. See http://www.unicode.org/errata/ for an up-to-date list of errata. See http://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See http://www.unicode.org/charts/PDF/Unicode-10.0/ for charts showing only the characters added in Unicode 10.0. See http://www.unicode.org/Public/10.0.0/charts/ for a complete archived file of character code charts for Unicode 10.0. Disclaimer These charts are provided as the online reference to the character contents of the Unicode Standard, Version 10.0 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 10.0, online at http://www.unicode.org/versions/Unicode10.0.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, #44, and #45, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online. See http://www.unicode.org/ucd/ and http://www.unicode.org/reports/ A thorough understanding of the information contained in these additional sources is required for a successful implementation.
    [Show full text]