Index Next: Table 03 Submission 12-Mar-1998
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Vowel Quality and Phonological Projection
i Vowel Quality and Phonological Pro jection Marc van Oostendorp PhD Thesis Tilburg University September Acknowledgements The following p eople have help ed me prepare and write this dissertation John Alderete Elena Anagnostop oulou Sjef Barbiers Outi BatEl Dorothee Beermann Clemens Bennink Adams Bo domo Geert Bo oij Hans Bro ekhuis Norb ert Corver Martine Dhondt Ruud and Henny Dhondt Jo e Emonds Dicky Gilb ers Janet Grijzenhout Carlos Gussenhoven Gert jan Hakkenb erg Marco Haverkort Lars Hellan Ben Hermans Bart Holle brandse Hannekevan Ho of Angeliek van Hout Ro eland van Hout Harry van der Hulst Riny Huybregts Rene Kager HansPeter Kolb Emiel Krah mer David Leblanc Winnie Lechner Klarien van der Linde John Mc Carthy Dominique Nouveau Rolf Noyer Jaap and Hannyvan Oosten dorp Paola Monachesi Krisztina Polgardi Alan Prince Curt Rice Henk van Riemsdijk Iggy Ro ca Sam Rosenthall Grazyna Rowicka Lisa Selkirk Chris Sijtsma Craig Thiersch MiekeTrommelen Rub en van der Vijver Janneke Visser Riet Vos Jero en van de Weijer Wim Zonneveld Iwant to thank them all They have made the past four years for what it was the most interesting and happiest p erio d in mylife until now ii Contents Intro duction The Headedness of Syllables The Headedness Hyp othesis HH Theoretical Background Syllable Structure Feature geometry Sp ecication and Undersp ecicati on Skeletal tier Mo del of the grammar Optimality Theory Data Organisation of the thesis Chapter Chapter -
The Festvox Indic Frontend for Grapheme-To-Phoneme Conversion
The Festvox Indic Frontend for Grapheme-to-Phoneme Conversion Alok Parlikar, Sunayana Sitaram, Andrew Wilkinson and Alan W Black Carnegie Mellon University Pittsburgh, USA aup, ssitaram, aewilkin, [email protected] Abstract Text-to-Speech (TTS) systems convert text into phonetic pronunciations which are then processed by Acoustic Models. TTS frontends typically include text processing, lexical lookup and Grapheme-to-Phoneme (g2p) conversion stages. This paper describes the design and implementation of the Indic frontend, which provides explicit support for many major Indian languages, along with a unified framework with easy extensibility for other Indian languages. The Indic frontend handles many phenomena common to Indian languages such as schwa deletion, contextual nasalization, and voicing. It also handles multi-script synthesis between various Indian-language scripts and English. We describe experiments comparing the quality of TTS systems built using the Indic frontend to grapheme-based systems. While this frontend was designed keeping TTS in mind, it can also be used as a general g2p system for Automatic Speech Recognition. Keywords: speech synthesis, Indian language resources, pronunciation 1. Introduction in models of the spectrum and the prosody. Another prob- lem with this approach is that since each grapheme maps Intelligible and natural-sounding Text-to-Speech to a single “phoneme” in all contexts, this technique does (TTS) systems exist for a number of languages of the world not work well in the case of languages that have pronun- today. However, for low-resource, high-population lan- ciation ambiguities. We refer to this technique as “Raw guages, such as languages of the Indian subcontinent, there Graphemes.” are very few high-quality TTS systems available. -
The Unicode Cookbook for Linguists: Managing Writing Systems Using Orthography Profiles
Zurich Open Repository and Archive University of Zurich Main Library Strickhofstrasse 39 CH-8057 Zurich www.zora.uzh.ch Year: 2017 The Unicode Cookbook for Linguists: Managing writing systems using orthography profiles Moran, Steven ; Cysouw, Michael DOI: https://doi.org/10.5281/zenodo.290662 Posted at the Zurich Open Repository and Archive, University of Zurich ZORA URL: https://doi.org/10.5167/uzh-135400 Monograph The following work is licensed under a Creative Commons: Attribution 4.0 International (CC BY 4.0) License. Originally published at: Moran, Steven; Cysouw, Michael (2017). The Unicode Cookbook for Linguists: Managing writing systems using orthography profiles. CERN Data Centre: Zenodo. DOI: https://doi.org/10.5281/zenodo.290662 The Unicode Cookbook for Linguists Managing writing systems using orthography profiles Steven Moran & Michael Cysouw Change dedication in localmetadata.tex Preface This text is meant as a practical guide for linguists, and programmers, whowork with data in multilingual computational environments. We introduce the basic concepts needed to understand how writing systems and character encodings function, and how they work together. The intersection of the Unicode Standard and the International Phonetic Al- phabet is often not met without frustration by users. Nevertheless, thetwo standards have provided language researchers with a consistent computational architecture needed to process, publish and analyze data from many different languages. We bring to light common, but not always transparent, pitfalls that researchers face when working with Unicode and IPA. Our research uses quantitative methods to compare languages and uncover and clarify their phylogenetic relations. However, the majority of lexical data available from the world’s languages is in author- or document-specific orthogra- phies. -
UC Santa Barbara Dotawo: a Journal of Nubian Studies
UC Santa Barbara Dotawo: A Journal of Nubian Studies Title The Consonant System of Abu Jinuk (Kordofan Nubian) Permalink https://escholarship.org/uc/item/3kf38308 Journal Dotawo: A Journal of Nubian Studies, 2(1) Author Alshareef, Waleed Publication Date 2015-06-01 DOI 10.5070/D62110020 License https://creativecommons.org/licenses/by-nc/4.0/ 4.0 Peer reviewed eScholarship.org Powered by the California Digital Library University of California The Consonant System of 155 Abu Jinuk (Kordofan Nubian) Waleed Alshareef 1. Introduction Abu Jinuk is a Kordofan Nubian language mainly spoken in the northwestern Nuba Mountains of Sudan. Kordofan Nubian is a branch of the Nubian language family. According to Rilly,1 Nubian belongs to the northern East Sudanic subgroup which is part of the East Sudanic branch of the Nilo-Saharan phylum. According to the Sultan of the Abu Jinuk tribe, the population in 2010 was 5,896 of whom 3,556 speakers live in the Nuba Moun- tains and 2,340 are scattered in different towns of Sudan.2 Accord- ing to the informants, the people call themselves and their language [d̪εkla] meaning “the great grandfather.” The Arabic term “Abu Ji- nuk,” by which they are known in linguistic literature, is the name of their mountain. By the non-Arab neighboring groups, the Abu Jinuk people are called [εlεk], which means “the explorers.” Abu Jinuk is an undescribed language. No linguistic studies have been devoted to the phonology of this language. Therefore, exam- ining the consonant system of Abu Jinuk is thought to be the first linguistic investigation of this language. -
Phonetics and Phonology in Russian Unstressed Vowel Reduction: a Study in Hyperarticulation
Phonetics and Phonology in Russian Unstressed Vowel Reduction: A Study in Hyperarticulation Jonathan Barnes Boston University (Short title: Hyperarticulating Russian Unstressed Vowels) Jonathan Barnes Department of Modern Foreign Languages and Literatures Boston University 621 Commonwealth Avenue, Room 119 Boston, MA 02215 Tel: (617) 353-6222 Fax: (617) 353-4641 [email protected] Abstract: Unstressed vowel reduction figures centrally in recent literature on the phonetics-phonology interface, in part owing to the possibility of a causal relationship between a phonetic process, duration-dependent undershoot, and the phonological neutralizations observed in systems of unstressed vocalism. Of particular interest in this light has been Russian, traditionally described as exhibiting two distinct phonological reduction patterns, differing both in degree and distribution. This study uses hyperarticulation to investigate the relationship between phonetic duration and reduction in Russian, concluding that these two reduction patterns differ not in degree, but in the level of representation at which they apply. These results are shown to have important consequences not just for theories of vowel reduction, but for other problems in the phonetics-phonology interface as well, incomplete neutralization in particular. Introduction Unstressed vowel reduction has been a subject of intense interest in recent debate concerning the nature of the phonetics-phonology interface. This is the case at least in part due to the existence of two seemingly analogous processes bearing this name, one typically called phonetic, and the other phonological. Phonological unstressed vowel reduction is a phenomenon whereby a given language's full vowel inventory can be realized only in lexically stressed syllables, while in unstressed syllables some number of neutralizations of contrast take place, with the result that only a subset of the inventory is realized on the surface. -
Issue 31.2.Indd
Akanlig-Pare, G../Legon Journal of the Humanities Vol. 31.2 (2020) DOI: https://dx.doi.org/10.4314/ljh.v31i2 .3 Palatalization in Central Bùlì George Akanlig-Pare Senior Lecturer University of Ghana, Legon Email: [email protected] Submitted: May 6, 2020/Accepted: November 20, 2020/Published: January 28, 2021 Abstract Palatalization is a process through which non-palatal consonants acquire palatality, either through a shift in place of articulation from a non-palatal region to the hard palate or through the superimposition of palatal qualities on a non-palatal consonant. In both cases, there is a front, non-low vowel or a palatal glide that triggers the process. In this paper, I examine the palatalization phenomena in Bùlì using Feature Geometry within the non- linear generative phonological framework. I argue that both full and secondary palatalization occur in Buli. The paper further explains that, the long high front vowel /i:/, triggers the formation of a palato-alveolar aff ricate which is realized in the Central dialect of Bùlì, where the Northern and Central dialects retain the derived palatal stop. Keywords: Palatalization, Bùlì, Feature Geometry, synchronic, diachronic Introduction Although linguists generally agree that palatalization is a process through which non-palatal consonants acquire palatality, they diff er in their accounts of the phonological processes that characterize it. As Halle (2005, p.1) states, “… palatalization raises numerous theoretical questions about which there is at present no agreement among phonologists”. Cross linguistic surveys conducted on the process reveal a number of issues that lead to the present state of aff airs. -
Philological Sciences. Linguistics” / Journal of Language Relationship Issue 3 (2010)
Российский государственный гуманитарный университет Russian State University for the Humanities RGGU BULLETIN № 5/10 Scientific Journal Series “Philological Sciences. Linguistics” / Journal of Language Relationship Issue 3 (2010) Moscow 2010 ВЕСТНИК РГГУ № 5/10 Научный журнал Серия «Филологические науки. Языкознание» / «Вопросы языкового родства» Выпуск 3 (2010) Москва 2010 УДК 81(05) ББК 81я5 Главный редактор Е.И. Пивовар Заместитель главного редактора Д.П. Бак Ответственный секретарь Б.Г. Власов Главный художник В.В. Сурков Редакционный совет серии «Филологические науки. Языкознание» / «Вопросы языкового родства» Председатель Вяч. Вс. Иванов (Москва – Лос-Анджелес) М. Е. Алексеев (Москва) В. Блажек (Брно) У. Бэкстер (Анн Арбор) В. Ф. Выдрин (Санкт-Петербург) М. Гелл-Манн (Санта Фе) А. Б. Долгопольский (Хайфа) Ф. Кортландт (Лейден) А. Лубоцкий (Лейден) Редакционная коллегия серии: В. А. Дыбо (главный редактор) Г. С. Старостин (заместитель главного редактора) Т. А. Михайлова (ответственный секретарь) К. В. Бабаев С. Г. Болотов А. В. Дыбо О. А. Мудрак В. Е. Чернов ISSN 1998-6769 © Российский государственный гуманитарный университет, 2010 УДК 81(05) ББК 81я5 Вопросы языкового родства: Международный научный журнал / Рос. гос. гуманитар. ун-т; Рос. Акад. наук. Ин-т языкознания; под ред. В. А. Дыбо. ― М., 2010. ― № 3. ― X + 176 с. ― (Вестник РГГУ: Научный журнал; Серия «Филологические науки. Языко- знание»; № 05/10). Journal of Language Relationship: International Scientific Periodical / Russian State Uni- versity for the Humanities; Russian Academy of Sciences. Institute of Linguistics; Ed. by V. A. Dybo. ― Moscow, 2010. ― Nº 3. ― X + 176 p. ― (RSUH Bulletin: Scientific Periodical; Linguistics Series; Nº 05/10). ISSN 1998-6769 http ://journal.nostratic.ru [email protected] Дополнительные знаки: С. -
Ling 230/503: Articulatory Phonetics and Transcription English Vowels
Ling 230/503: Articulatory Phonetics and Transcription Broad vs. narrow transcription. A narrow transcription is one in which the transcriber records much phonetic detail without attention to the way in which the sounds of the language form a system. A broad transcription omits those details of a narrow transcription which the transcriber feels are not worth recording. Normally these details will be aspects of the speech event which are: (1) predictable or (2) would not differentiate two token utterances of the same type in the judgment of speakers or (3) are presumed not to figure in the systematic phonology of the language. IPA vs. American transcription There are two commonly used systems of phonetic transcription, the International Phonetics Association or IPA system and the American system. In many cases these systems overlap, but in certain cases there are important distinctions. Students need to learn both systems and have to be flexible about the use of symbols. English Vowels Short vowels /ɪ ɛ æ ʊ ʌ ɝ/ ‘pit’ pɪt ‘put’ pʊt ‘pet’ pɛt ‘putt’ pʌt ‘pat’ pæt ‘pert’ pɝt (or pr̩t) Long vowels /i(ː), u(ː), ɑ(ː), ɔ(ː)/ ‘beat’ biːt (or bit) ‘boot’ buːt (or but) ‘(ro)bot’ bɑːt (or bɑt) ‘bought’ bɔːt (or bɔt) Diphthongs /eɪ, aɪ, aʊ, oʊ, ɔɪ, ju(ː)/ ‘bait’ beɪt ‘boat’ boʊt ‘bite’ bɑɪt (or baɪt) ‘bout’ bɑʊt (or baʊt) ‘Boyd’ bɔɪd (or boɪd) ‘cute’ kjuːt (or kjut) The property of length, denoted by [ː], can be predicted based on the quality of the vowel. For this reason it is quite common to omit the length mark [ː]. -
Part 1: Introduction to The
PREVIEW OF THE IPA HANDBOOK Handbook of the International Phonetic Association: A guide to the use of the International Phonetic Alphabet PARTI Introduction to the IPA 1. What is the International Phonetic Alphabet? The aim of the International Phonetic Association is to promote the scientific study of phonetics and the various practical applications of that science. For both these it is necessary to have a consistent way of representing the sounds of language in written form. From its foundation in 1886 the Association has been concerned to develop a system of notation which would be convenient to use, but comprehensive enough to cope with the wide variety of sounds found in the languages of the world; and to encourage the use of thjs notation as widely as possible among those concerned with language. The system is generally known as the International Phonetic Alphabet. Both the Association and its Alphabet are widely referred to by the abbreviation IPA, but here 'IPA' will be used only for the Alphabet. The IPA is based on the Roman alphabet, which has the advantage of being widely familiar, but also includes letters and additional symbols from a variety of other sources. These additions are necessary because the variety of sounds in languages is much greater than the number of letters in the Roman alphabet. The use of sequences of phonetic symbols to represent speech is known as transcription. The IPA can be used for many different purposes. For instance, it can be used as a way to show pronunciation in a dictionary, to record a language in linguistic fieldwork, to form the basis of a writing system for a language, or to annotate acoustic and other displays in the analysis of speech. -
How to Edit IPA 1 How to Use SAMPA for Editing IPA 2 How to Use X
version July 19 How to edit IPA When you want to enter the International Phonetic Association (IPA) character set with a computer keyboard, you need to know how to enter each IPA character with a sequence of keyboard strokes. This document describes a number of techniques. The complete SAMPA and RTR mapping can be found in the attached html documents. The main html document (ipa96.html) comes in a pdf-version (ipa96.pdf) too. 1 How to use SAMPA for editing IPA The Speech Assessment Method (SAM) Phonetic Alphabet has been developed by John Wells (http://www.phon.ucl.ac.uk/home/sampa). The goal was to map 176 IPA characters into the range of 7-bit ASCII, which is a set of 96 characters. The principle is to represent a single IPA character by a single ASCII character. This table is an example for five vowels: Description IPA SAMPA script a ɑ A ae ligature æ { turned a ɐ 6 epsilon ɛ E schwa ə @ A visual represenation of a keyboard shows the mapping on screen. The source for the SAMPA mapping used is "Handbook of multimodal an spoken dialogue systems", D Gibbon, Kluwer Academic Publishers 2000. 2 How to use X-SAMPA for editing IPA The multi-character extension to SAMPA has also been developed by John Wells (http://www.phon.ucl.ac.uk/home/sampa/x-sampa.htm). The basic principle used is to form chains of ASCII characters, that represent a single IPA character, e.g. This table lists some examples Description IPA X-SAMPA beta β B small capital B ʙ B\ lower-case B b b lower-case P p p Phi ɸ p\ The X-SAMPA mapping is in preparation and will be included in the next release. -
Sample File “H” = a Voiceless Alveolar Affricate: “Ts.” the “Ts” of “Hats” Or “Pots.”
Credits front layout and design Victor Raymond cover illustration Giovanna Fregni editorial help SampleChris Davis file ©M. A. R. Barker, 2002 Bednálljan THE SCRIPT OF THE FIRST IMPERIUM By M. A. R. BARKER “Bednálljan Salarvyáni” is a Khíshan language, related to Tsolyáni, Mu’ugalavyáni, and others of the family. It is a member in a long tradition, dating all the way back to Llyáni in far-off Livyánu; yet the stages of this process are anything but clear. One important fact is its close relationship to Irzákh, the tongue of the Dragon Warriors of N’lüss. The language can only tenuously be connected to Bednállja, the small princpality that once occupied the shores of Tamkáde BaySample in what is now file Western Salarvyá. The First Imperium, the empire founded by Queen Nayári of Jakálla, in southern Tsolyánu, was the primary cause for the prominence of both the language and the name: what “Bednálljan Salarvyáni.” Had she and her court not spoken of Bednállja as their original cultural and spiritual “homeland,” the language might well have been called something else entirely. “Bednálljan Salarvyáni” is not a single unified linguistic corpus. There were many dialectical changes during the First Imperium. Time and events have eroded the visibility of many of these: cognates, morphological and syntactic similarities, and sound shifts. What is left is a basic strong relationship, however, as can be seen from tomb inscriptions and historical texts, plus such non-linguistic cultural sequences as pottery, coins, and later records. Perhaps a dozen major dialects emerged from the chaos of the crumbling kingdoms of the Fisherman Kings. -
Wagner 1 Dutch Fricatives Dutch Is an Indo-European Language of The
Wagner 1 Dutch Fricatives Dutch is an Indo-European language of the West Germanic branch, closely related to Frisian and English. It is spoken by about 22 million people in the Netherlands, Belgium, Aruba, Suriname, and the Netherlands Antilles, nations in which the Dutch language also has an official status. Additionally, Dutch is still spoken in parts of Indonesia as a result of colonial rule, but it has been replaced by Afrikaans in South Africa. In the Netherlands, Algemeen Nederlands is the standard form of the language taught in schools and used by the government. It is generally spoken in the western part of the country, in the provinces of Nord and Zuid Holland. The Netherlands is divided into northern and southern regions by the Rijn river and these regions correspond to the areas where the main dialects of Dutch are distinguished from each other. My consultant, Annemarie Toebosch, is from the village of Bemmel in the Netherlands. Her dialect is called Betuws Dutch because Bemmel is located in the Betuwe region between two branches of the Rijn river. Toebosch lived in the Netherlands until the age of 25 when she relocated to the United States for graduate school. Today she is a professor of linguistics at the University of Michigan-Flint and speaks only English in her professional career. For the past 18 months she has been speaking Dutch more at home with her son, but in the 9 years prior to that, she only spoke Dutch once or twice a week with her parents and other relatives who remain in the Netherlands.