UNITIPA Symbol List of the International Phonetic Alphabet

(CC-BY-SA) 2020 IPA Typefaces: Doulos SIL (metatext); unitipa (symbols)

CONSONANTS (PULMONIC)

p
Name: VOICELESS BILABIAL PLOSIVE
IPA name: Lower-case P
IPA number: 101
Unicode name: LATIN SMALL LETTER P
Unicode range: Basic Latin
Hex value: 0070
TIPA code: p
AFII code: E2A2

b
Name: VOICED BILABIAL PLOSIVE
IPA name: Lower-case B
IPA number: 102
Unicode name: LATIN SMALL LETTER B
Unicode range: Basic Latin
Hex value: 0062
TIPA code: b
AFII code: E2A3

t
Name: VOICELESS DENTAL/ALVEOLAR PLOSIVE
IPA name: Lower-case T
IPA number: 103
Unicode name: LATIN SMALL LETTER T
Unicode range: Basic Latin
Hex value: 0074
TIPA code: t
AFII code: E2B0

d
Name: VOICED DENTAL/ALVEOLAR PLOSIVE
IPA name: Lower-case D
IPA number: 104
Unicode name: LATIN SMALL LETTER D
Unicode range: Basic Latin
Hex value: 0064
TIPA code: d
AFII code: E2B1

ʈ
Name: VOICELESS RETROFLEX PLOSIVE
IPA name: Right-tail T
IPA number: 105
Unicode name: LATIN SMALL LETTER T W/ RETROFLEX HOOK
Unicode range: IPA Extensions
Hex value: 0288
TIPA code: \:t
AFII code: E2C7

ɖ
Name: VOICED RETROFLEX PLOSIVE
IPA name: Right-tail D
IPA number: 106
Unicode name: LATIN SMALL LETTER D W/ TAIL
Unicode range: IPA Extensions
Hex value: 0256
TIPA code: \:d
AFII code: E2C8

c
Name: VOICELESS PALATAL PLOSIVE
IPA name: Lower-case C
IPA number: 107
Unicode name: LATIN SMALL LETTER C
Unicode range: Basic Latin
Hex value: 0063
TIPA code: c
AFII code: E2D8

ɟ
Name: VOICED PALATAL PLOSIVE
IPA name: Barred dotless J
IPA number: 108
Unicode name: LATIN SMALL LETTER DOTLESS J W/ STROKE
Unicode range: IPA Extensions
Hex value: 025F
TIPA code: \textbardotlessj
AFII code: E2D9

k
Name: VOICELESS VELAR PLOSIVE
IPA name: Lower-case K
IPA number: 109
Unicode name: LATIN SMALL LETTER K
Unicode range: Basic Latin
Hex value: 006B
TIPA code: k
AFII code: E2DE

ɡ
Name: VOICED VELAR PLOSIVE
IPA name: Opentail G
IPA number: 110
Unicode name: LATIN SMALL LETTER SCRIPT G
Unicode range: IPA Extensions
Hex value: 0261
TIPA code: g
AFII code: E2DF

q
Name: VOICELESS UVULAR PLOSIVE
IPA name: Lower-case Q
IPA number: 111
Unicode name: LATIN SMALL LETTER Q
Unicode range: Basic Latin
Hex value: 0071
TIPA code: q
AFII code: E2E6

ɢ
Name: VOICED UVULAR PLOSIVE
IPA name: Small capital G
IPA number: 112
Unicode name: LATIN LETTER SMALL CAPITAL G
Unicode range: IPA Extensions
Hex value: 0262
TIPA code: \;G
AFII code: E2E7

ʔ
Name: GLOTTAL PLOSIVE
IPA name: Glottal stop
IPA number: 113
Unicode name: LATIN LETTER GLOTTAL STOP
Unicode range: IPA Extensions
Hex value: 0294
TIPA code: P
AFII code: E2ED

m
Name: VOICED BILABIAL NASAL
IPA name: Lower-case M
IPA number: 114
Unicode name: LATIN SMALL LETTER M
Unicode range: Basic Latin
Hex value: 006D
TIPA code: m
AFII code: E2A1

ɱ
Name: VOICED LABIODENTAL NASAL
IPA name: Left-tail M (at right)
IPA number: 115
Unicode name: LATIN SMALL LETTER M W/ HOOK
Unicode range: IPA Extensions
Hex value: 0271
TIPA code: M
AFII code: E2AB

n
Name: VOICED DENTAL/ALVEOLAR NASAL
IPA name: Lower-case N
IPA number: 116
Unicode name: LATIN SMALL LETTER N
Unicode range: Basic Latin
Hex value: 006E
TIPA code: n
AFII code: E2AF

ɳ
Name: VOICED RETROFLEX NASAL
IPA name: Right-tail N
IPA number: 117
Unicode name: LATIN SMALL LETTER N W/ RETROFLEX HOOK
Unicode range: IPA Extensions
Hex value: 0273
TIPA code: \:n
AFII code: E2C6

ɲ
Name: VOICED PALATAL NASAL
IPA name: Left-tail N (at left)
IPA number: 118
Unicode name: LATIN SMALL LETTER N W/ LEFT HOOK
Unicode range: IPA Extensions
Hex value: 0272
TIPA code: \textltailn
AFII code: E2D7

ŋ
Name: VOICED VELAR NASAL
IPA name: Eng
IPA number: 119
Unicode name: LATIN SMALL LETTER ENG
Unicode range: Latin Extended-A
Hex value: 014B
TIPA code: N
AFII code: E2DD

ɴ
Name: VOICED UVULAR NASAL
IPA name: Small capital N
IPA number: 120
Unicode name: LATIN LETTER SMALL CAPITAL N
Unicode range: IPA Extensions
Hex value: 0274
TIPA code: \;N
AFII code: E2E4

ʙ
Name: VOICED BILABIAL TRILL
IPA name: Small capital B
IPA number: 121
Unicode name: LATIN LETTER SMALL CAPITAL B
Unicode range: IPA Extensions
Hex value: 0299
TIPA code: \;B
AFII code: E2F0

r
Name: VOICED DENTAL/ALVEOLAR TRILL
IPA name: Lower-case R
IPA number: 122
Unicode name: LATIN SMALL LETTER R
Unicode range: Basic Latin
Hex value: 0072
TIPA code: r
AFII code: E2C0

ʀ
Name: VOICED UVULAR TRILL
IPA name: Small capital R
IPA number: 123
Unicode name: LATIN LETTER SMALL CAPITAL R
Unicode range: IPA Extensions
Hex value: 0280
TIPA code: \;R
AFII code: E2EA

ⱱ
Name: VOICED LABIODENTAL FLAP
IPA name: Right-hook V
IPA number: 184
Unicode name: LATIN SMALL LETTER V W/ RIGHT HOOK
Unicode range: Latin Extended-C
Hex value: 2C71
TIPA code: N/A
AFII code: N/A

ɾ
Name: VOICED DENTAL/ALVEOLAR TAP
IPA name: Fish-hook R
IPA number: 124
Unicode name: LATIN SMALL LETTER R W/ FISHHOOK
Unicode range: IPA Extensions
Hex value: 027E
TIPA code: R
AFII code: E2C1

ɽ
Name: VOICED RETROFLEX FLAP
IPA name: Right-tail R
IPA number: 125
Unicode name: LATIN SMALL LETTER R W/ TAIL
Unicode range: IPA Extensions
Hex value: 027D
TIPA code: \:r
AFII code: E2CD

ɸ
Name: VOICELESS BILABIAL FRICATIVE
IPA name: Phi
IPA number: 126
Unicode name: LATIN SMALL LETTER PHI
Unicode range: IPA Extensions
Hex value: 0278
TIPA code: F
AFII code: E2A4

β
Name: VOICED BILABIAL FRICATIVE
IPA name: Beta
IPA number: 127
Unicode name: GREEK SMALL LETTER BETA
Unicode range: Greek and Coptic
Hex value: 03B2
TIPA code: B
AFII code: E2A5

f
Name: VOICELESS LABIODENTAL FRICATIVE
IPA name: Lower-case F
IPA number: 128
Unicode name: LATIN SMALL LETTER F
Unicode range: Basic Latin
Hex value: 0066
TIPA code: f
AFII code: E2AC

v
Name: VOICED LABIODENTAL FRICATIVE
IPA name: Lower-case V
IPA number: 129
Unicode name: LATIN SMALL LETTER V
Unicode range: Basic Latin
Hex value: 0076
TIPA code: v
AFII code: E2AD

θ
Name: VOICELESS DENTAL FRICATIVE
IPA name: Theta
IPA number: 130
Unicode name: GREEK SMALL LETTER THETA
Unicode range: Greek and Coptic
Hex value: 03B8
TIPA code: T
AFII code: E2B2

ð
Name: VOICED DENTAL FRICATIVE
IPA name: Eth
IPA number: 131
Unicode name: LATIN SMALL LETTER ETH
Unicode range: Latin-1 Supplement
Hex value: 00F0
TIPA code: D
AFII code: E2B3

s
Name: VOICELESS ALVEOLAR FRICATIVE
IPA name: Lower-case S
IPA number: 132
Unicode name: LATIN SMALL LETTER S
Unicode range: Basic Latin
Hex value: 0073
TIPA code: s
AFII code: E2B6

z
Name: VOICED ALVEOLAR FRICATIVE
IPA name: Lower-case Z
IPA number: 133
Unicode name: LATIN SMALL LETTER Z
Unicode range: Basic Latin
Hex value: 007A
TIPA code: z
AFII code: E2B7

ʃ
Name: VOICELESS POSTALVEOLAR FRICATIVE
IPA name: Esh
IPA number: 134
Unicode name: LATIN SMALL LETTER ESH
Unicode range: IPA Extensions
Hex value: 0283
TIPA code: S
AFII code: E2D0

ʒ
Name: VOICED POSTALVEOLAR FRICATIVE
IPA name: Ezh; Tailed Z
IPA number: 135
Unicode name: LATIN SMALL LETTER EZH
Unicode range: IPA Extensions
Hex value: 0292
TIPA code: Z
AFII code: E2D1

ʂ
Name: VOICELESS RETROFLEX FRICATIVE
IPA name: Right-tail S (at left)
IPA number: 136
Unicode name: LATIN SMALL LETTER S W/ HOOK
Unicode range: IPA Extensions
Hex value: 0282
TIPA code: \:s
AFII code: E2C9

ʐ
Name: VOICED RETROFLEX FRICATIVE
IPA name: Right-tail Z
IPA number: 137
Unicode name: LATIN SMALL LETTER Z W/ RETROFLEX HOOK
Unicode range: IPA Extensions
Hex value: 0290
TIPA code: \:z
AFII code: E2CA

ç
Name: VOICELESS PALATAL FRICATIVE
IPA name: C cedilla
IPA number: 138
Unicode name: LATIN SMALL LETTER C W/ CEDILLA
Unicode range: Latin-1 Supplement
Hex value: 00E7
TIPA code: \c{c}
AFII code: E2DA

ʝ
Name: VOICED PALATAL FRICATIVE
IPA name: Curly-tail J
IPA number: 139
Unicode name: LATIN SMALL LETTER J W/ CROSSED-TAIL
Unicode range: IPA Extensions
Hex value: 029D
TIPA code: J
AFII code: E2F3

x
Name: VOICELESS VELAR FRICATIVE
IPA name: Lower-case X
IPA number: 140
Unicode name: LATIN SMALL LETTER X
Unicode range: Basic Latin
Hex value: 0078
TIPA code: x
AFII code: E2E0

ɣ
Name: VOICED VELAR FRICATIVE
IPA name: Gamma
IPA number: 141
Unicode name: LATIN SMALL LETTER GAMMA
Unicode range: IPA Extensions
Hex value: 0263
TIPA code: G
AFII code: E2E1

χ
Name: VOICELESS UVULAR FRICATIVE
IPA name: Chi
IPA number: 142
Unicode name: GREEK SMALL LETTER CHI
Unicode range: Greek and Coptic
Hex value: 03C7
TIPA code: X
AFII code: E2E8

ʁ
Name: VOICED UVULAR FRICATIVE
IPA name: Inverted small capital R
IPA number: 143
Unicode name: LATIN LETTER SMALL CAPITAL INVERTED R
Unicode range: IPA Extensions
Hex value: 0281
TIPA code: K
AFII code: E2E9

ħ
Name: VOICELESS PHARYNGEAL FRICATIVE
IPA name: Barred H
IPA number: 144
Unicode name: LATIN SMALL LETTER H W/ STROKE
Unicode range: Latin Extended-A
Hex value: 0127
TIPA code: \textcrh
AFII code: E2EB

ʕ
Name: VOICED PHARYNGEAL FRICATIVE/APPROXIMANT
IPA name: Reversed glottal stop
IPA number: 145
Unicode name: LATIN LETTER PHARYNGEAL VOICED FRICATIVE
Unicode range: IPA Extensions
Hex value: 0295
TIPA code: Q
AFII code: E2EC

h
Name: VOICELESS GLOTTAL FRICATIVE
IPA name: Lower-case H
IPA number: 146
Unicode name: LATIN SMALL LETTER H
Unicode range: Basic Latin
Hex value: 0068
TIPA code: h
AFII code: E2EE

ɦ
Name: VOICED GLOTTAL FRICATIVE
IPA name: Hooktop H
IPA number: 147
Unicode name: LATIN SMALL LETTER H W/ HOOK
Unicode range: IPA Extensions
Hex value: 0266
TIPA code: H
AFII code: E2EF

Name: VOICELESS DENTAL/ALVEOLAR LATERAL FRICATIVE
IPA name: Belted L
IPA number: 148
Unicode name: LATIN SMALL LETTER L W/ BELT

Name: VOICED DENTAL/ALVEOLAR LATERAL FRICATIVE
IPA name: L-Ezh ligature
IPA number: 149
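
The "Hex value" and "Unicode name" fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original list: it uses Python's standard `unicodedata` module, the helper name `check_record` is hypothetical, and it assumes that the list's "W/" is an abbreviation of "WITH" in the formal Unicode character names. The sample records are copied from the entries above.

```python
import unicodedata

# A few (hex value, Unicode name as printed in the list) pairs copied from above.
RECORDS = [
    ("0070", "LATIN SMALL LETTER P"),
    ("0288", "LATIN SMALL LETTER T W/ RETROFLEX HOOK"),
    ("025F", "LATIN SMALL LETTER DOTLESS J W/ STROKE"),
    ("2C71", "LATIN SMALL LETTER V W/ RIGHT HOOK"),
    ("03B2", "GREEK SMALL LETTER BETA"),
]


def check_record(hex_value, listed_name):
    """Return (character, matches) for one record.

    `matches` is True when the name Python's Unicode database reports for the
    code point equals the listed name with "W/" expanded to "WITH"
    (the expansion is an assumption about the list's abbreviation).
    """
    char = chr(int(hex_value, 16))          # hex string -> code point -> character
    expected = listed_name.replace("W/", "WITH")
    return char, unicodedata.name(char) == expected


if __name__ == "__main__":
    for hex_value, listed_name in RECORDS:
        char, ok = check_record(hex_value, listed_name)
        print(f"U+{hex_value} {char} {'ok' if ok else 'MISMATCH'}")
```

The same loop can be pointed at every record in the list; a mismatch would indicate either a transcription error in a record or a name that the list abbreviates in some other way.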