Language Specific Peculiarities Document for Georgian As Spoken in Georgia 1. Special Handling of Dialects
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Assessment of Options for Handling Full Unicode Character Encodings in MARC21 a Study for the Library of Congress
1 Assessment of Options for Handling Full Unicode Character Encodings in MARC21 A Study for the Library of Congress Part 1: New Scripts Jack Cain Senior Consultant Trylus Computing, Toronto 1 Purpose This assessment intends to study the issues and make recommendations on the possible expansion of the character set repertoire for bibliographic records in MARC21 format. 1.1 “Encoding Scheme” vs. “Repertoire” An encoding scheme contains codes by which characters are represented in computer memory. These codes are organized according to a certain methodology called an encoding scheme. The list of all characters so encoded is referred to as the “repertoire” of characters in the given encoding schemes. For example, ASCII is one encoding scheme, perhaps the one best known to the average non-technical person in North America. “A”, “B”, & “C” are three characters in the repertoire of this encoding scheme. These three characters are assigned encodings 41, 42 & 43 in ASCII (expressed here in hexadecimal). 1.2 MARC8 "MARC8" is the term commonly used to refer both to the encoding scheme and its repertoire as used in MARC records up to 1998. The ‘8’ refers to the fact that, unlike Unicode which is a multi-byte per character code set, the MARC8 encoding scheme is principally made up of multiple one byte tables in which each character is encoded using a single 8 bit byte. (It also includes the EACC set which actually uses fixed length 3 bytes per character.) (For details on MARC8 and its specifications see: http://www.loc.gov/marc/.) MARC8 was introduced around 1968 and was initially limited to essentially Latin script only. -
How to Edit IPA 1 How to Use SAMPA for Editing IPA 2 How to Use X
version July 19 How to edit IPA When you want to enter the International Phonetic Association (IPA) character set with a computer keyboard, you need to know how to enter each IPA character with a sequence of keyboard strokes. This document describes a number of techniques. The complete SAMPA and RTR mapping can be found in the attached html documents. The main html document (ipa96.html) comes in a pdf-version (ipa96.pdf) too. 1 How to use SAMPA for editing IPA The Speech Assessment Method (SAM) Phonetic Alphabet has been developed by John Wells (http://www.phon.ucl.ac.uk/home/sampa). The goal was to map 176 IPA characters into the range of 7-bit ASCII, which is a set of 96 characters. The principle is to represent a single IPA character by a single ASCII character. This table is an example for five vowels: Description IPA SAMPA script a ɑ A ae ligature æ { turned a ɐ 6 epsilon ɛ E schwa ə @ A visual represenation of a keyboard shows the mapping on screen. The source for the SAMPA mapping used is "Handbook of multimodal an spoken dialogue systems", D Gibbon, Kluwer Academic Publishers 2000. 2 How to use X-SAMPA for editing IPA The multi-character extension to SAMPA has also been developed by John Wells (http://www.phon.ucl.ac.uk/home/sampa/x-sampa.htm). The basic principle used is to form chains of ASCII characters, that represent a single IPA character, e.g. This table lists some examples Description IPA X-SAMPA beta β B small capital B ʙ B\ lower-case B b b lower-case P p p Phi ɸ p\ The X-SAMPA mapping is in preparation and will be included in the next release. -
Building a Universal Phonetic Model for Zero-Resource Languages
Building a Universal Phonetic Model for Zero-Resource Languages Paul Moore MInf Project (Part 2) Interim Report Master of Informatics School of Informatics University of Edinburgh 2020 3 Abstract Being able to predict phones from speech is a challenge in and of itself, but what about unseen phones from different languages? In this project, work was done towards building precisely this kind of universal phonetic model. Using the GlobalPhone language corpus, phones’ articulatory features, a recurrent neu- ral network, open-source libraries, and an innovative prediction system, a model was created to predict phones based on their features alone. The results show promise, especially for using these models on languages within the same family. 4 Acknowledgements Once again, a huge thank you to Steve Renals, my supervisor, for all his assistance. I greatly appreciated his practical advice and reasoning when I got stuck, or things seemed overwhelming, and I’m very thankful that he endorsed this project. I’m immensely grateful for the support my family and friends have provided in the good times and bad throughout my studies at university. A big shout-out to my flatmates Hamish, Mark, Stephen and Iain for the fun and laugh- ter they contributed this year. I’m especially grateful to Hamish for being around dur- ing the isolation from Coronavirus and for helping me out in so many practical ways when I needed time to work on this project. Lastly, I wish to thank Jesus Christ, my Saviour and my Lord, who keeps all these things in their proper perspective, and gives me strength each day. -
Equivalences Between Different Phonetic Alphabets
Equivalences between different phonetic alphabets by Carlos Daniel Hern´andezMena Description IPA Mexbet X-SAMPA IPA Symbol in LATEX Voiceless bilabial plosive p p p p Voiceless dental plosive” t t t d ntextsubbridgeftg Voiceless velar plosive k k k k Voiceless palatalized plosive kj k j k j kntextsuperscriptfjg Voiced bilabial plosive b b b b Voiced bilabial approximant B VB o ntextloweringfntextbetag fl Voiced dental plosive d” d d d ntextsubbridgefdg Voiced dental fricative flD DD o ntextloweringfntextipafn;Dgg Voiced velar plosive g g g g Voiced velar fricative Èfl GG o ntextloweringfntextbabygammag Voiceless palato-alveolar affricate t“S tS tS ntextroundcapftntexteshg Voiceless labiodental fricative f f f f Voiceless alveolar fricative s s s s Voiced alveolar fricative z z z z Voiceless dental fricative” s s [ s d ntextsubbridgefsg Voiced dental fricative” z z [ z d ntextsubbridgefzg Voiceless postalveolar fricative S SS ntextesh Voiceless velar fricative x x x x Voiced palatal fricative J Z jn ntextctj Voiced postalveolar affricate d“Z dZ dZ ntextroundcapfdntextyoghg Voiced bilabial nasal m m m m Voiced alveolar nasal n n n n Voiced labiodental nasal M MF ntextltailm Voiced dental nasal n” n [ n d ntextsubbridgefng Voiced palatalized nasal nj n j n j nntextsuperscriptfjg Voiced velarized nasal nÈ N n G nntextsuperscript fntextbabygammag Voiced palatal nasal ñ n∼ J ntextltailn Voiced alveolar lateral approximant l l l l Voiced dental lateral” l l [ l d ntextsubbridgeflg Voiced palatalized lateral lj l j l j lntextsuperscriptfjg Lowered -
Proposal for the Georgian Script Root Zone LGR
Proposal for the Georgian Script Root Zone LGR LGR Version 2 Date: 2016-11-24 Document version: 1.1 Authors: GEORGIAN SCRIPT GENERATION PANEL 1 General Information/ Overview/ Abstract The purpose of this document is to give an overview of the proposed Georgian LGR in the XML format and the rationale behind the design decisions taken. It includes a discussion of relevant features of the script, the communities or languages using it, the process and methodology used and information on the contributors. The formal specification of the LGR can be found in the accompanying XML document: • Proposed-LGR-Georgian-20160915.xml Labels for testing can be found in the accompanying text document: • Labels-GeorgianScript-20160915.txt 2 Script for which the LGR is proposed ISO 15924 Code: Geor ISO 15924 Key N°: 240 ISO 15924 English Name: Georgian Mkhedruli Latin transliteration of native script name: Mkhedruli Native name of the script: მხედრული Maximal Starting Repertoire (MSR) version: MSR-2 Proposal for a Georgian Script Root Zone LGR Georgian Script GP 3 Background on Script and Principal Languages Using It The Georgian scripts are the three writing systems used to write the Georgian language: Asomtavruli, Nuskhuri and Mkhedruli. Mkhedruli (Georgian: მხედრული) is the current Georgian script and is therefore the standard script for modern Georgian and its related Kartvelian languages, whereas Asomtavruli and Nuskhuri are used only in ceremonial religious texts and iconography. In the following, the term Georgian script is used synonymously with Mkhedruli. Like the two other scripts, Mkhedruli is purely unicameral. Mkhedruli first appears in the 10th century - the oldest Mkhedruli inscription found is dated back to 982 AD. -
Icelandic Phonetic Transcription
A Short Overview of the Icelandic Sound System Pronunciation Variants and Phonetic Transcription IPA Version Eiríkur Rögnvaldsson SÍM 2020 This document was written in December 2019 and January 2020 for the SÍM consortium as a part of the Icelandic National Language Technology Program. The document is made in two versions – one using the IPA transcription system and the other using the X-SAMPA transcription system. This is the IPA version. Both versions begin with a table showing the mappings between the two systems. The document is distributed under the CC BY 4.0 license. 2 1 An Overview of the Icelandic Sound System Icelandic speech sounds can be divided into two main groups; consonants and vowels. Icelandic consonants can be further divided into four classes: plosives (stops), fricatives (and approximants), nasals, and liquids (laterals and trills/taps). Within the vowel group, a further distinction can be made between monophthongs and diphthongs. The following table gives an overview of the phonemes of the Icelandic IPA and X- SAMPA symbol set, grouped by the phoneme classes to which they belong (according to the manner of their articulation). Consonants IPA SAMPA Orthography IPA SAMPA Gloss Plosives p p bera [pɛːra] /pE:ra/ ‘carry’ pʰ p_h pera [pʰɛːra] /p_hE:ra/ ‘pear’ t t dalur [taːlʏr] /ta:lYr/ ‘valley’ tʰ t_h tala [tʰaːla] /t_ha:la/ ‘talk’ c c gera [cɛːra] /cE:ra/ ‘do’ cʰ c_h kæla [cʰaiːla] /c_hai:la/ ‘cool off’ k k galdur [kaltʏr] /kaltYr/ ‘magic’ kʰ k_h kaldur [kʰaltʏr] /k_haltYr/ ‘cold’ Fricatives v v vera [vɛːra] /vE:ra/ ‘be’ -
The Unicode Standard, Version 3.0, Issued by the Unicode Consor- Tium and Published by Addison-Wesley
The Unicode Standard Version 3.0 The Unicode Consortium ADDISON–WESLEY An Imprint of Addison Wesley Longman, Inc. Reading, Massachusetts · Harlow, England · Menlo Park, California Berkeley, California · Don Mills, Ontario · Sydney Bonn · Amsterdam · Tokyo · Mexico City Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and Addison-Wesley was aware of a trademark claim, the designations have been printed in initial capital letters. However, not all words in initial capital letters are trademark designations. The authors and publisher have taken care in preparation of this book, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode®, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. If these files have been purchased on computer-readable media, the sole remedy for any claim will be exchange of defective media within ninety days of receipt. Dai Kan-Wa Jiten used as the source of reference Kanji codes was written by Tetsuji Morohashi and published by Taishukan Shoten. ISBN 0-201-61633-5 Copyright © 1991-2000 by Unicode, Inc. All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording or other- wise, without the prior written permission of the publisher or Unicode, Inc. -
Plant and Fungal Use in Tusheti, Khevsureti, and Pshavi, Sakartvelo
Acta Societatis Botanicorum Poloniae DOI: 10.5586/asbp.3517 ORIGINAL RESEARCH PAPER Publication history Received: 2016-06-29 Accepted: 2016-09-30 Plant and fungal use in Tusheti, Khevsureti, Published: 2016-12-30 and Pshavi, Sakartvelo (Republic of Georgia), Handling editor Łukasz Łuczaj, Institute of Applied Biotechnology and Caucasus Basic Sciences, University of Rzeszów, Poland Rainer W. Bussmann1*, Narel Y. Paniagua-Zambrana2, Shalva Authors’ contributions 3 4 3 3 RWB, NYPZ, SS, ZK, DK, DT, Sikharulidze , Zaal Kikvidze , David Kikodze , David Tchelidze , and KB designed the study; Ketevan Batsatsashvili3, Robert E. Hart1 RWB, NYPZ, SS, ZK, DT, and KB 1 conducted the fieldwork; RHE William L. Brown Center, Missouri Botanical Garden, P.O. Box 299, St. Louis, MO 63166-0299, conducted the main statistical United States 2 analysis; RBU, NYPZ, and RHE Herbario Nacional de Bolivia, Instituto de Ecología-UMSA, Campus Universitario, Cota Cota Calle analyzed the data and wrote 27, La Paz, Bolivia 3 the manuscript; all authors read, Institute of Botany and Bakuriani Alpine Botanical Garden, Ilia State University, Botanikuri 1, corrected, and approved the 0105 Tbilisi, Georgia 4 manuscript 4-D Research Institute, Ilia State University, Cholokasvili 5, 0162 Tbilisi, Georgia * Corresponding author. Email: [email protected] Funding Funding was provided by the William L. Brown Center endowment. Abstract In this study, we documented traditional plant use in Tusheti, Khevsureti, and Competing interests No competing interests have Pshavi and hypothesized that (i) plant use knowledge in general would be higher been declared. in isolated high elevation communities, and that (ii) use of home gardens would be much more restricted to lower elevation settings. -
The Consonant Phonotactics of Georgian
The Consonant Phonotactics of Georgian Published by LOT phone: +31 30 253 6006 3512 JK Utrecht fax: +31 30 253 6000 The Netherlands e-mail: [email protected] http://www.let.uu.nl/lot/ Cover illustration: The oldest specimen of Georgian writing, found in the Bolnisi Sioni Temple (5th century). ISBN 90-76864-24-1 NUGI 941 Copyright © 2002 by Marika Butskhrikidze. All rights reserved. The Consonant Phonotactics of Georgian PROEFSCHRIFT ter verkrijging van de graad van Doctor aan de Universiteit Leiden, op gezag van de Rector Magnificus Dr. D.D. Breimer, hoogleraar in de faculteit der Wiskunde en Natuurwetenschappen en die der Geneeskunde, volgens besluit van het College voor Promoties te verdedigen op woensdag 6 november 2002 te klokke 16.15 uur door Marika Butskhrikidze geboren te Tbilisi (Georgië) in 1971 Promotiecommissie promotores: Prof. dr. C.J. Ewen Prof. dr. H.G. van der Hulst (University of Connecticut) co-promotor: Dr. J.M. van de Weijer referent: Prof. dr. E.V. Hume (Ohio State University) overige leden: Prof. dr. V.J. van Heuven Dr. H.J. Smeets YX=5YQL&HPVP6REOHEV [To my parents] Contents Acknowledgements .......................................................................... xi Abbreviations and Symbols .......................................................... xiii 1 Preliminaries ....................................................................... 1 1.0. Introduction: the ‘problem’ ............................................................................... 1 1.1. Ontological premises ....................................................................................... -
Introductory Materials Akan Twi Asante 1
Humboldt-Universität zu Berlin Sprach- und literaturwissenschaftliche Fakultät Institut für deutsche Sprache und Linguistik MA Historische Linguistik Modul 9: Methoden Linguistischer Datenerhebung Urbane Feldforschung Sommersemester 2018 Dozent: PD Dr. habil. Frank Seifart Verfasserin: Polina Zabolotskikh Matrikelnummer: 595976 Introductory Materials Akan Twi Asante 1. Bibliographical survey This paper is dedicated to the dialect Asante of Akan Twi, one of languages of West Africa. The topic of this small research is usage of phrasal verbs (emu) ye den, (emu) ye duru and (emu) ye hare, which is described in the grammar outlook, written by J. G. Christaller in 1875 and called Grammar of the Asante and Fante Language called Tschi [Chwee, Twi]. This is a brief grammar outlook, describing all main phonetic, morphologic and syntactic aspects of Akan Twi. There is a lack of fundamental linguistic studies of Akan Twi, the majority of the works are either general essays or textbooks for school students, what means there are mostly works one can consult only on basic patterns of verbal conjunction and noun inflection in Akan and its dialects. The ones, used used while writing this paper are: Asante-Twi learners' reference grammar written by David Adu-Amankwah in 2003 and A Comprehensive Course in Twi (Asante) by Florence A. Dolphyne, published in 1996 as well as the Twi, basic course written by James E Redden, and Nana Owusu, issued in 1963. Unfortunately there are only a few references to the dialectal differences between Akan dialects in the mentioned general works, but there is a lack of a thorough research on this topic. -
Georgian Style Guide
Georgian Style Guide Published: December, 2017 Microsoft Georgian Style Guide Table of Contents 1 About this style guide ......................................................................................................................... 4 1.1 Recommended style references .............................................................................................. 4 2 Microsoft voice ...................................................................................................................................... 5 2.1 Choices that reflect Microsoft voice ...................................................................................... 5 2.1.1 Word choice ........................................................................................................................... 6 2.1.2 Words and phrases to avoid ............................................................................................ 8 2.2 Sample Microsoft voice text ..................................................................................................... 9 2.2.1 Address the user to take action .................................................................................... 10 2.2.2 Promote a feature .............................................................................................................. 11 2.2.3 Provide how-to guidelines .............................................................................................. 11 2.2.4 Explanatory text and support ....................................................................................... -
1/2 SAMPA Symbol IPA Equivalent Description # Pause { [Æ] Open Lax
SAMPA-IPA equivalences University of Toronto Romance Phonetics Database SAMPA Symbol IPA Equivalent Description # Pause { [æ] Open lax front vowel @ [ə] Mid central unrounded vowel 1 [ɨ] Close central unrounded vowel 2 [ø] Close-mid front rounded vowel 3 [ɜ] Open-mid central unrounded vowel 6 [ɐ] Open central vowel 6~ [ɐ̃] Open central nasal vowel 9 [œ] Open-mid front rounded vowel 9~ [œ̃] Open-mid front rounded nasal vowel a [a] Open front unrounded vowel a~ [ã] Open front unrounded nasal vowel aj [aj] Open front diphthong aw [aw] Open front diphthong A [ɑ] Open back unrounded vowel b [b] Voiced bilabial stop b: [bː] Geminate voiced bilabial stop d [d] Voiced dental/alveolar stop d: [dː] Geminate voiced dental/alveolar stop D [ð] Voiced interdental fricative dz [dz] Voiced alveolar affricate dz: [dzː] Geminated voiced alveolar affricate dZ [dʒ] Voiced post-alveolar affricate dZ: [dʒː] Geminate Voiced post-alveolar affricate e [e] Close-mid front unrounded vowel e~ [ẽ] Close-mid front nasal unrounded vowel e_X [e̯] Non-syllabic tense mid front vowel ej [ej] Mid front diphthong E [ɛ] Open-mid front unrounded vowel E~ [ɛ̃] Open-mid front unrounded nasal vowel f [f] Voiceless labio-dental fricative g [ɡ] Voiced velar stop g: [ɡː] Geminate voiced velar stop h [h] Voiceless glottal fricative H [ɥ] Rounded palatal glide i [i] Close front unrounded vowel i~ [i]̃ Unrounded tense high front nasal vowel I [ɪ] Close front lax vowel j [j] Unrounded palatal glide jj [ʝ] Voiced palatal fricative 1/2 SAMPA-IPA equivalences University of Toronto Romance