Qt0n17f74x.Pdf
Total Page:16
File Type:pdf, Size:1020Kb

Load more
Recommended publications
-
Recognition of Nasalized and Non- Nasalized Vowels
RECOGNITION OF NASALIZED AND NON- NASALIZED VOWELS Submitted By: BILAL A. RAJA Advised by: Dr. Carol Espy Wilson and Tarun Pruthi Speech Communication Lab, Dept of Electrical & Computer Engineering University of Maryland, College Park Maryland Engineering Research Internship Teams (MERIT) Program 2006 1 TABLE OF CONTENTS 1- Abstract 3 2- What Is Nasalization? 3 2.1 – Production of Nasal Sound 3 2.2 – Common Spectral Characteristics of Nasalization 4 3 – Expected Results 6 3.1 – Related Previous Researches 6 3.2 – Hypothesis 7 4- Method 7 4.1 – The Task 7 4.2 – The Technique Used 8 4.3 – HMM Tool Kit (HTK) 8 4.3.1 – Data Preparation 9 4.3.2 – Training 9 4.3.3 – Testing 9 4.3.4 – Analysis 9 5 – Results 10 5.1 – Experiment 1 11 5.2 – Experiment 2 11 6 – Conclusion 12 7- References 13 2 1 - ABSTRACT When vowels are adjacent to nasal consonants (/m,n,ng/), they often become nasalized for at least some part of their duration. This nasalization is known to lead to changes in perceived vowel quality. The goal of this project is to verify if it is beneficial to first recognize nasalization in vowels and treat the three groups of vowels (those occurring before nasal consonants, those occurring after nasal consonants, and the oral vowels which are vowels that are not adjacent to nasal consonants) separately rather than collectively for recognizing the vowel identities. The standard Mel-Frequency Cepstral Coefficients (MFCCs) and the Hidden Markov Model (HMM) paradigm have been used for this purpose. The results show that when the system is trained on vowels in general, the recognition of nasalized vowels is 17% below that of oral vowels. -
The Phonemic − Syllabic Comparisons of Standard Malay and Palembang Malay Using a Historical Linguistic Perspective
Passage2013, 1(2), 35-44 The Phonemic − Syllabic Comparisons of Standard Malay and Palembang Malay Using a Historical Linguistic Perspective By: Novita Arsillah English Language and Literature Program (E-mail: [email protected] / mobile: +6281312203723) ABSTRACT This study is a historical linguistic investigation entitled The Phonemic − Syllabic Comparisons of Standard Malay and Palembang Malay Using a Historical Linguistic Perspective which aims to explore the types of sound changes found in Palembang Malay. The investigation uses a historical linguistic comparative method to compare the phonemic and syllabic changes between an ancestral language Standard Malay and its decent language Palembang Malay. Standard Malay refers to the Wilkinson dictionary in 1904. The participants of this study are seven native speakers of Palembang Malay whose ages range from 20 to 40 years old. The data were collected from the voices of the participants that were recorded along group conversations and interviews. This study applies the theoretical framework of sound changes which proposed by Terry Crowley in 1997 and Lily Campbell in 1999. The findings show that there are nine types of sound changes that were found as the results, namely assimilation (42.35%), lenition (20%), sound addition (3.53%), metathesis (1.18%), dissimilation (1.76%), abnormal sound changes (3.53%), split (13.53%), vowel rising (10.59%), and monophthongisation (3.53%). Keywords: Historical linguistics, standard Malay, Palembang Malay, comparative method, sound change, phoneme, syllable. 35 Novita Arsillah The Phonemic − Syllabic Comparisons of Standard Malay and Palembang Malay Using a Historical Linguistic Perspective INTRODUCTION spelling system in this study since it is considered to be the first Malay spelling This study classifies into the system that is used widely in Malaya, field of historical linguistics that Singapore, and Brunei (Omar, 1989). -
UC Berkeley Dissertations, Department of Linguistics
UC Berkeley Dissertations, Department of Linguistics Title The Aeroacoustics of Nasalized Fricatives Permalink https://escholarship.org/uc/item/00h9g9gg Author Shosted, Ryan K Publication Date 2006 eScholarship.org Powered by the California Digital Library University of California The Aeroacoustics of Nasalized Fricatives by Ryan Keith Shosted B.A. (Brigham Young University) 2000 M.A. (University of California, Berkeley) 2003 A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Linguistics in the GRADUATE DIVISION of the UNIVERSITY OF CALIFORNIA, BERKELEY Committee in charge: John J. Ohala, Chair Keith Johnson Milton M. Azevedo Fall 2006 The dissertation of Ryan Keith Shosted is approved: Chair Date Date Date University of California, Berkeley Fall 2006 The Aeroacoustics of Nasalized Fricatives Copyright 2006 by Ryan Keith Shosted 1 Abstract The Aeroacoustics of Nasalized Fricatives by Ryan Keith Shosted Doctor of Philosophy in Linguistics University of California, Berkeley John J. Ohala, Chair Understanding the relationship of aerodynamic laws to the unique geometry of the hu- man vocal tract allows us to make phonological and typological predictions about speech sounds typified by particular aerodynamic regimes. For example, some have argued that the realization of nasalized fricatives is improbable because fricatives and nasals have an- tagonistic aerodynamic specifications. Fricatives require high pressure behind the suprala- ryngeal constriction as a precondition for high particle velocity. Nasalization, on the other hand, vents back pressure by allowing air to escape through the velopharyngeal orifice. This implies that an open velopharyngeal port will reduce oral particle velocity, thereby potentially extinguishing frication. By using a mechanical model of the vocal tract and spoken fricatives that have undergone coarticulatory nasalization, it is shown that nasal- ization must alter the spectral characteristics of fricatives, e.g. -
Part 1: Introduction to The
PREVIEW OF THE IPA HANDBOOK Handbook of the International Phonetic Association: A guide to the use of the International Phonetic Alphabet PARTI Introduction to the IPA 1. What is the International Phonetic Alphabet? The aim of the International Phonetic Association is to promote the scientific study of phonetics and the various practical applications of that science. For both these it is necessary to have a consistent way of representing the sounds of language in written form. From its foundation in 1886 the Association has been concerned to develop a system of notation which would be convenient to use, but comprehensive enough to cope with the wide variety of sounds found in the languages of the world; and to encourage the use of thjs notation as widely as possible among those concerned with language. The system is generally known as the International Phonetic Alphabet. Both the Association and its Alphabet are widely referred to by the abbreviation IPA, but here 'IPA' will be used only for the Alphabet. The IPA is based on the Roman alphabet, which has the advantage of being widely familiar, but also includes letters and additional symbols from a variety of other sources. These additions are necessary because the variety of sounds in languages is much greater than the number of letters in the Roman alphabet. The use of sequences of phonetic symbols to represent speech is known as transcription. The IPA can be used for many different purposes. For instance, it can be used as a way to show pronunciation in a dictionary, to record a language in linguistic fieldwork, to form the basis of a writing system for a language, or to annotate acoustic and other displays in the analysis of speech. -
Issues in the Distribution of the Glottal Stop
Theoretical Issues in the Representation of the Glottal Stop in Blackfoot* Tyler Peterson University of British Columbia 1.0 Introduction This working paper examines the phonemic and phonetic status and distribution of the glottal stop in Blackfoot.1 Observe the phonemic distribution (1), four phonological operations (2), and three surface realizations (3) of the glottal stop: (1) a. otsi ‘to swim’ c. otsi/ ‘to splash with water’ b. o/tsi ‘to take’ (2) a. Metathesis: V/VC ® VV/C nitáó/mai/takiwa nit-á/-omai/takiwa 1-INCHOAT-believe ‘now I believe’ b. Metathesis ® Deletion: V/VùC ® VVù/C ® VVùC kátaookaawaatsi káta/-ookaa-waatsi INTERROG-sponsor.sundance-3s.NONAFFIRM ‘Did she sponsor a sundance?’ (Frantz, 1997: 154) c. Metathesis ® Degemination: V/V/C ® VV//C ® VV/C áó/tooyiniki á/-o/too-yiniki INCHOAT-arrive(AI)-1s/2s ‘when I/you arrive’ d. Metathesis ® Deletion ® V-lengthening: V/VC ® VV/C ® VVùC kátaoottakiwaatsi káta/-ottaki-waatsi INTEROG-bartender-3s.NONAFFIRM ‘Is he a bartender?’ (Frantz, 1997) (3) Surface realizations: [V/C] ~ [VV0C] ~ [VùC] aikai/ni ‘He dies’ a. [aIkaI/ni] b. [aIkaII0ni] c. [aIkajni] The glottal stop appears to have a unique status within the Blackfoot consonant inventory. The examples in (1) (and §2.1 below) suggest that it appears as a fully contrastive phoneme in the language. However, the glottal stop is put through variety of phonological processes (metathesis, syncope and degemination) that no other consonant in the language is subject to. Also, it has a variety of surfaces realizations in the form of glottalization on an adjacent vowel (cf. -
Chapter 9 Consonant Substitution in Child Language (Ikwere) Roseline I
Chapter 9 Consonant substitution in child language (Ikwere) Roseline I. C. Alerechi University of Port Harcourt The Ikwere language is spoken in four out of the twenty-three Local Government Areas (LGAs) of Rivers State of Nigeria, namely, Port Harcourt, Obio/Akpor, Emohua and Ikwerre LGAs. Like Kana, Kalabari and Ekpeye, it is one of the major languages of Rivers State of Nigeria used in broadcasting in the electronic media. The Ikwere language is classified asan Igboid language of the West Benue-Congo family of the Niger-Congo phylum of languages (Williamson 1988: 67, 71, Williamson & Blench 2000: 31). This paper treats consonant substi- tution in the speech of the Ikwere child. It demonstrates that children use of a language can contribute to the divergent nature of that language as they always strive for simplification of the target language. Using simple descriptive method of data analysis, the paper identifies the various substitutions of consonant sounds, which characterize the Ikwere children’s ut- terances. It stresses that the substitutions are regular and rule governed and hence implies the operation of some phonological processes. Some of the processes are strengthening and weakening of consonants, loss of suction of labial implosives causing them to become labial plosives, devoicing of voiced consonants, etc. While some of these processes are identical with the adult language, others are peculiar to children, demonstrating the relationships between the phonological processes in both forms of speech. It is worthy of note that high- lighting the relationships and differences will make for effective communication between children and adults. 1 Introduction The Ikwere language is spoken in four out of the twenty-three Local Government Areas (LGAs) of Rivers State of Nigeria, namely, Port Harcourt, Obio/Akpor, Emohua and Ik- werre LGAs. -
Towards Zero-Shot Learning for Automatic Phonemic Transcription
The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20) Towards Zero-Shot Learning for Automatic Phonemic Transcription Xinjian Li, Siddharth Dalmia, David R. Mortensen, Juncheng Li, Alan W Black, Florian Metze Language Technologies Institute, School of Computer Science Carnegie Mellon University {xinjianl, sdalmia, dmortens, junchenl, awb, fmetze}@cs.cmu.edu Abstract Hermann and Goldwater 2018), which typically uses an un- supervised technique to learn representations which can be Automatic phonemic transcription tools are useful for low- resource language documentation. However, due to the lack used towards speech processing tasks. of training sets, only a tiny fraction of languages have phone- However, those unsupervised approaches could not gener- mic transcription tools. Fortunately, multilingual acoustic ate phonemes directly and there has been few works study- modeling provides a solution given limited audio training ing zero-shot learning for unseen phonemes transcription, data. A more challenging problem is to build phonemic tran- which consist of learning an acoustic model without any au- scribers for languages with zero training data. The difficulty dio data or text data for a given target language and unseen of this task is that phoneme inventories often differ between phonemes. In this work, we aim to solve this problem to the training languages and the target language, making it in- transcribe unseen phonemes for unseen languages without feasible to recognize unseen phonemes. In this work, we ad- dress this problem by adopting the idea of zero-shot learning. considering any target data, audio or text. Our model is able to recognize unseen phonemes in the tar- The prediction of unseen objects has been studied for a get language without any training data. -
Local Identity and Sound Change in Glasgow a Pilot Study1
Local identity and sound change in Glasgow A pilot study1 Natalie Braber2 Zoe Butterfint3 Abstract This paper outlines a pilot study investigation into the potential link between local identity and language change in Glasgow. Results are presented from part of the pilot study, specifically the variation noted in two phonological variables - the realisation of the alveolar lateral approximant /l/, and the occurrence of so-called T-glottalling - and are discussed in the light of local identity. Glasgow is historically a heavily stigmatised, often stereotyped city and home to an equally stigmatised linguistic variety: Glaswegian. Recent investigations have highlighted processes of linguistic change occurring in this linguistic variety (most notable Stuart-Smith, 1999a, 2003; Stuart-Smith et al., 2006, 2007), and this study sets out to investigate the potential link between these changes, speaker attitudes to Glasgow and their sense of Glaswegian identity. The data elicitation method employed is an extended version of that used by Stuart- Smith & Tweedie (2000): semi-structured interviews supplemented by a read word list. Methodological issues and considerations for future investigation are discussed on the basis of the findings of this pilot study. 1. Introduction A wealth of literature exists examining the relationship between identity and linguistic change, from Labov’s pioneering investigation in Martha’s Vineyard (1963), to present-day research in Middlesborough (Llamas, 1999, 2007) and Berwick-upon- Tweed (Pichler & Watt. 2004; Pichler, Watt and Llamas, 2004). This paper discusses findings from a small-scale pilot study carried out in 2005/6, which aimed to extend that body of work, further investigating the complex ‘interdependence of language and place identity’ (Llamas, 2007:579) in one very specific locale: Glasgow. -
Velar Segments in Old English and Old Irish
In: Jacek Fisiak (ed.) Proceedings of the Sixth International Conference on Historical Linguistics. Amsterdam: John Benjamins, 1985, 267-79. Velar segments in Old English and Old Irish Raymond Hickey University of Bonn The purpose of this paper is to look at a section of the phoneme inventories of the oldest attested stage of English and Irish, velar segments, to see how they are manifested phonetically and to consider how they relate to each other on the phonological level. The reason I have chosen to look at two languages is that it is precisely when one compares two language systems that one notices that structural differences between languages on one level will be correlated by differences on other levels demonstrating their interrelatedness. Furthermore it is necessary to view segments on a given level in relation to other segments. The group under consideration here is just one of several groups. Velar segments viewed within the phonological system of both Old English and Old Irish cor relate with three other major groups, defined by place of articulation: palatals, dentals, and labials. The relationship between these groups is not the same in each language for reasons which are morphological: in Old Irish changes in grammatical category are frequently indicated by palatalizing a final non-palatal segment (labial, dental, or velar). The same function in Old English is fulfilled by suffixes and /or prefixes. This has meant that for Old English the phonetically natural and lower-level alternation of velar elements with palatal elements in a palatal environ ment was to be found whereas in Old Irish this alternation had been denaturalized and had lost its automatic character. -
Package 'Phontools'
Package ‘phonTools’ July 31, 2015 Title Tools for Phonetic and Acoustic Analyses Version 0.2-2.1 Date 2015-07-30 Author Santiago Barreda Maintainer Santiago Barreda <[email protected]> Depends R (>= 2.15) Description Contains tools for the organization, display, and analysis of the sorts of data fre- quently encountered in phonetics research and experimentation, including the easy cre- ation of IPA vowel plots, and the creation and manipulation of WAVE audio files. License BSD_2_clause + file LICENSE URL http://www.santiagobarreda.com/rscripts.html NeedsCompilation no Repository CRAN Date/Publication 2015-07-31 01:00:47 R topics documented: a96..............................................3 b95 .............................................4 combocalc . .4 createtemplate . .5 errorbars . .6 f73..............................................7 f99..............................................8 fastacf . .9 Ffilter . 10 findformants . 11 FIRfilter . 13 formanttrack . 14 freqresponse . 16 h95 ............................................. 17 hotelling.test . 18 1 2 R topics documented: imputeformants . 20 interpolate . 21 ldboundary . 22 ldclassify . 23 loadsound . 25 loadtable . 26 lpc.............................................. 26 makeFIR . 27 makesound . 29 multiplot . 30 normalize . 31 normalize.compare . 33 ntypes . 34 p73 ............................................. 35 pb52............................................. 36 peakfind . 37 phasor . 37 pickIPA . 38 pitchtrack . 40 playsound . 41 polezero . 42 powertrack . 43 preemphasis . 44 -
Dynamic Modeling for Chinese Shaanxi Xi'an Dialect Visual
UNIVERSITY OF MISKOLC FACULTY OF MECHANICAL ENGINEERING AND INFORMATICS Dynamic Modeling for Chinese Shaanxi Xi’an Dialect Visual Speech Summary of PhD dissertation Lu Zhao Information Science ‘JÓZSEF HATVANY’ DOCTORAL SCHOOL OF INFORMATION SCIENCE, ENGINEERING AND TECHNOLOGY ACADEMIC SUPERVISOR Dr. László Czap Miskolc 2020 Content Introduction ............................................................................................................. 1 Chapter 1 Phonetic Aspects of the Chinese Shaanxi Xi’an Dialect ......................... 3 1.1. Design of the Chinese Shaanxi Xi’an Dialect X-SAMPA ..................... 3 1.2. Theoretical method of transcription from the Chinese character to X-SAMPA .................................................................................................... 5 Chapter 2 Visemes of the Chinese Shaanxi Xi’an Dialect....................................... 6 2.1. Quantitative description of lip static visemes ........................................ 6 2.2. Analysis of static tongue visemes of the Chinese Shaanxi Xi’an Dialect ...................................................................................................................... 7 Chapter 3 Dynamic Modeling of the Chinese Shaanxi Xi’an Dialect Speech ....... 11 3.1. The interaction between phonemes and the corresponding lip shape ... 11 3.2. Dynamic tongue viseme classification ................................................. 12 3.3. Results of face animation on the Chinese Shaanxi Xi’an Dialect talking head ........................................................................................................... -
Reverse Engineering: Emphatic Consonants and the Adaptation of Vowels in French Loanwords Into Moroccan Arabic *
Brill’s Annual of Afroasiatic Languages and Linguistics 1 (2009) 41–74 brill.nl/baall Reverse Engineering: Emphatic Consonants and the Adaptation of Vowels in French Loanwords into Moroccan Arabic * Michael Kenstowicz and Nabila Louriz [email protected] [email protected] Abstract On the basis of two large corpora of French (and Spanish) loanwords into Moroccan Arabic, the paper documents and analyzes the phenomenon noted by Heath (1989) in which a pharyngeal- ized consonant is introduced in the adaptation of words with mid and low vowels such as moquette > [MokeT] = /MukiT/ ‘carpet’. It is found that French back vowels are readily adapted with pharyngealized emphatics while the front vowels tend to resist this correspondence. Th e implications of the phenomenon for general models of loanword adaptation are considered. It is concluded that auditory similarity and salience are critical alternative dimensions of faithfulness that may override correspondences based on phonologically contrastive features. Keywords pharyngealization , enhancement , auditory salience , weighted constraints , harmony 1. Introduction One of the most interesting questions in the theory of loanword adaptation concerns the role of redundant features. It is well known that phonological contrasts on consonants are often correlated with the realization of the same or a related (enhancing) feature on adjacent vowels. Such redundant proper- ties are known to play a role in speech perception and frequently share or take * A preliminary version of this paper was written while the second author was a Fulbright scholar at MIT (spring-summer 2008) and was presented at the Linguistics Colloquium, Paris 8 (November 2008). Th e paper was completed while the fi rst author was Visiting Professor at the Research Institute for Languages and Cultures of Asia and Africa, Tokyo University of Foreign Studies in the winter of 2008-9.