Using Phonetic Knowledge in Tools and Resources for Natural Language Processing and Pronunciation Evaluation
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
A Contrastive Study of the Ibibio and Igbo Sound Systems
A CONTRASTIVE STUDY OF THE IBIBIO AND IGBO SOUND SYSTEMS GOD’SPOWER ETIM Department Of Languages And Communication Abia State Polytechnic P.M.B. 7166, Aba, Abia State, Nigeria. [email protected] ABSTRACT This research strives to contrast the consonant phonemes, vowel phonemes and tones of Ibibio and Igbo in order to describe their similarities and differences. The researcher adopted the descriptive method, and relevant data on the phonology of the two languages were gathered and analyzed within the framework of CA before making predictions and conclusions. Ibibio consists of ten vowels and fourteen consonant phonemes, while Igbo is made up of eight vowels and twenty-eight consonants. The results of contrastive analysis of the two languages showed that there are similarities as well as differences in the sound systems of the languages. There are some sounds in Ibibio which are not present in Igbo. Also many sounds are in Igbo which do not exist in Ibibio. Both languages share the phonemes /e, a, i, o, ɔ, u, p, b, t, d, k, kp, m, n, ɲ, j, ŋ, f, s, j, w/. All the phonemes in Ibibio are present in Igbo except /ɨ/, /ʉ/, and /ʌ/. Igbo has two vowel segments /ɪ/ and /ʊ/ and also fourteen consonant phonemes /g, gb, kw, gw, ŋw, v, z, ʃ, h, ɣ, ʧ, ʤ, l, r/ which Ibibio lacks. Both languages have high, low and downstepped tones but Ibibio further has contour or gliding tones which are not tone types in Igbo. Also, the downstepped tone in Ibibio is conventionally marked with exclamation point, while in Igbo, it is conventionally marked with a raised macron over the segments bearing it. -
Rule-Based Standard Arabic Phonetization at Phoneme, Allophone, and Syllable Level
Fadi Sindran, Firas Mualla, Tino Haderlein, Khaled Daqrouq & Elmar Nöth Rule-Based Standard Arabic Phonetization at Phoneme, Allophone, and Syllable Level Fadi Sindran [email protected] Faculty of Engineering Department of Computer Science /Pattern Recognition Lab Friedrich-Alexander-Universität Erlangen-Nürnberg Erlangen, 91058, Germany Firas Mualla [email protected] Faculty of Engineering Department of Computer Science /Pattern Recognition Lab Friedrich-Alexander-Universität Erlangen-Nürnberg Erlangen, 91058, Germany Tino Haderlein [email protected] Faculty of Engineering Department of Computer Science/Pattern Recognition Lab Friedrich-Alexander-Universität Erlangen-Nürnberg Erlangen, 91058, Germany Khaled Daqrouq [email protected] Department of Electrical and Computer Engineering King Abdulaziz University, Jeddah, 22254, Saudi Arabia Elmar Nöth [email protected] Faculty of Engineering Department of Computer Science /Pattern Recognition Lab Friedrich-Alexander-Universität Erlangen-Nürnberg Erlangen, 91058, Germany Abstract Phonetization is the transcription from written text into sounds. It is used in many natural language processing tasks, such as speech processing, speech synthesis, and computer-aided pronunciation assessment. A common phonetization approach is the use of letter-to-sound rules developed by linguists for the transcription from grapheme to sound. In this paper, we address the problem of rule-based phonetization of standard Arabic. 1The paper contributions can be summarized as follows: 1) Discussion of the transcription rules of standard Arabic which were used in literature on the phonemic and phonetic level. 2) Improvements of existing rules are suggested and new rules are introduced. Moreover, a comprehensive algorithm covering the phenomenon of pharyngealization in standard Arabic is proposed. -
A Brief Description of Consonants in Modern Standard Arabic
Linguistics and Literature Studies 2(7): 185-189, 2014 http://www.hrpub.org DOI: 10.13189/lls.2014.020702 A Brief Description of Consonants in Modern Standard Arabic Iram Sabir*, Nora Alsaeed Al-Jouf University, Sakaka, KSA *Corresponding Author: [email protected] Copyright © 2014 Horizon Research Publishing All rights reserved. Abstract The present study deals with “A brief Modern Standard Arabic. This study starts from an description of consonants in Modern Standard Arabic”. This elucidation of the phonetic bases of sounds classification. At study tries to give some information about the production of this point shows the first limit of the study that is basically Arabic sounds, the classification and description of phonetic rather than phonological description of sounds. consonants in Standard Arabic, then the definition of the This attempt of classification is followed by lists of the word consonant. In the present study we also investigate the consonant sounds in Standard Arabic with a key word for place of articulation in Arabic consonants we describe each consonant. The criteria of description are place and sounds according to: bilabial, labio-dental, alveolar, palatal, manner of articulation and voicing. The attempt of velar, uvular, and glottal. Then the manner of articulation, description has been made to lead to the drawing of some the characteristics such as phonation, nasal, curved, and trill. fundamental conclusion at the end of the paper. The aim of this study is to investigate consonant in MSA taking into consideration that all 28 consonants of Arabic alphabets. As a language Arabic is one of the most 2. -
Lexical Tone Perception in Chinese Mandarin Anna Björklund
Lexical Tone Perception in Chinese Mandarin Anna Björklund University of Florida 1 ABSTRACT Perceptual asymmetry, where Stimulus A is more often confused for B than B is for A or vice versa, has been observed in multiple lexical contexts, such as vowels (Polka & Bohn, 2003) and consonants (Dar, Mariam & Keren-Portnoy, 2018). Because historically perceptual space was assumed to be Euclidean, perception of stimuli was in turn assumed to be symmetrical, and observations of asymmetry were explained as simply response bias (Polka & Bohn, 2003). However, further examination of such biases has suggested that they are a much more fundamental occurrence. This study examines the effects of memory load and native language on perceptual bias in lexical tone perception of Mandarin Chinese for native speakers of Mandarin and English. Native speakers of both languages were given an AX categorical discrimination task with each combination of the four lexical Mandarin tones. To examine the effects of training on this bias, they were then trained on one of two tones (1 and 4) and given the same sequence of discrimination tasks again. Each pre- and post-training discrimination task featured both 250 ms and 1000 ms ISI intervals. 2 0. INTRODUCTION TO PERCEPTUAL ASYMMETRY It is often assumed that perception is symmetrical: that is, when presented with two stimuli, A and B, it is equally as easy to discriminate going from stimuli A to B as going from B to A. Although there had long been data that conflicted with such a model, the aberrant data was explained as response bias (Polka and Bohn, 2003), and subsequently ignored. -
LT3212 Phonetics Assignment 4 Mavis, Wong Chak Yin
LT3212 Phonetics Assignment 4 Mavis, Wong Chak Yin Essay Title: The sound system of Japanese This essay aims to introduce the sound system of Japanese, including the inventories of consonants, vowels, and diphthongs. The phonological variations of the sound segments in different phonetic environments are also included. For the illustration, word examples are given and they are presented in the following format: [IPA] (Romaji: “meaning”). Consonants In Japanese, there are 14 core consonants, and some of them have a lot of allophonic variations. The various types of consonants classified with respect to their manner of articulation are presented as follows. Stop Japanese has six oral stops or plosives, /p b t d k g/, which are classified into three place categories, bilabial, alveolar, and velar, as listed below. In each place category, there is a pair of plosives with the contrast in voicing. /p/ = a voiceless bilabial plosive [p]: [ippai] (ippai: “A cup of”) /b/ = a voiced bilabial plosive [b]: [baɴ] (ban: “Night”) /t/ = a voiceless alveolar plosive [t]: [oto̞ ːto̞ ] (ototo: “Brother”) /d/ = a voiced alveolar plosive [d]: [to̞ mo̞ datɕi] (tomodachi: “Friend”) /k/ = a voiceless velar plosive [k]: [kaiɰa] (kaiwa: “Conversation”) /g/ = a voiced velar plosive [g]: [ɡakɯβsai] (gakusai: “Student”) Phonetically, Japanese also has a glottal stop [ʔ] which is commonly produced to separate the neighboring vowels occurring in different syllables. This phonological phenomenon is known as ‘glottal stop insertion’. The glottal stop may be realized as a pause, which is used to indicate the beginning or the end of an utterance. For instance, the word “Japanese money” is actually pronounced as [ʔe̞ ɴ], instead of [je̞ ɴ], and the pronunciation of “¥15” is [dʑɯβːɡo̞ ʔe̞ ɴ]. -
Ling 230/503: Articulatory Phonetics and Transcription English Vowels
Ling 230/503: Articulatory Phonetics and Transcription Broad vs. narrow transcription. A narrow transcription is one in which the transcriber records much phonetic detail without attention to the way in which the sounds of the language form a system. A broad transcription omits those details of a narrow transcription which the transcriber feels are not worth recording. Normally these details will be aspects of the speech event which are: (1) predictable or (2) would not differentiate two token utterances of the same type in the judgment of speakers or (3) are presumed not to figure in the systematic phonology of the language. IPA vs. American transcription There are two commonly used systems of phonetic transcription, the International Phonetics Association or IPA system and the American system. In many cases these systems overlap, but in certain cases there are important distinctions. Students need to learn both systems and have to be flexible about the use of symbols. English Vowels Short vowels /ɪ ɛ æ ʊ ʌ ɝ/ ‘pit’ pɪt ‘put’ pʊt ‘pet’ pɛt ‘putt’ pʌt ‘pat’ pæt ‘pert’ pɝt (or pr̩t) Long vowels /i(ː), u(ː), ɑ(ː), ɔ(ː)/ ‘beat’ biːt (or bit) ‘boot’ buːt (or but) ‘(ro)bot’ bɑːt (or bɑt) ‘bought’ bɔːt (or bɔt) Diphthongs /eɪ, aɪ, aʊ, oʊ, ɔɪ, ju(ː)/ ‘bait’ beɪt ‘boat’ boʊt ‘bite’ bɑɪt (or baɪt) ‘bout’ bɑʊt (or baʊt) ‘Boyd’ bɔɪd (or boɪd) ‘cute’ kjuːt (or kjut) The property of length, denoted by [ː], can be predicted based on the quality of the vowel. For this reason it is quite common to omit the length mark [ː]. -
Part 1: Introduction to The
PREVIEW OF THE IPA HANDBOOK Handbook of the International Phonetic Association: A guide to the use of the International Phonetic Alphabet PARTI Introduction to the IPA 1. What is the International Phonetic Alphabet? The aim of the International Phonetic Association is to promote the scientific study of phonetics and the various practical applications of that science. For both these it is necessary to have a consistent way of representing the sounds of language in written form. From its foundation in 1886 the Association has been concerned to develop a system of notation which would be convenient to use, but comprehensive enough to cope with the wide variety of sounds found in the languages of the world; and to encourage the use of thjs notation as widely as possible among those concerned with language. The system is generally known as the International Phonetic Alphabet. Both the Association and its Alphabet are widely referred to by the abbreviation IPA, but here 'IPA' will be used only for the Alphabet. The IPA is based on the Roman alphabet, which has the advantage of being widely familiar, but also includes letters and additional symbols from a variety of other sources. These additions are necessary because the variety of sounds in languages is much greater than the number of letters in the Roman alphabet. The use of sequences of phonetic symbols to represent speech is known as transcription. The IPA can be used for many different purposes. For instance, it can be used as a way to show pronunciation in a dictionary, to record a language in linguistic fieldwork, to form the basis of a writing system for a language, or to annotate acoustic and other displays in the analysis of speech. -
Phonetic Matching Toolkit with State-Of-The-Art Meta-Soundex
PHONETIC MATCHING TOOLKIT WITH STATE-OF-THE-ART META-SOUNDEX ALGORITHM (ENGLISH AND SPANISH) _____________ A Thesis Presented to The Faculty of the Department of Computer Science Sam Houston State University _____________ In Partial Fulfillment of the Requirements for the Degree of Master of Science _____________ by Keerthi Koneru December, 2016 PHONETIC MATCHING TOOLKIT WITH STATE-OF-THE-ART META-SOUNDEX ALGORITHM (ENGLISH AND SPANISH) by Keerthi Koneru ______________ APPROVED: Cihan Varol, PhD Thesis Director Narasimha Shashidhar, PhD Committee Member Bing Zhou, PhD Committee Member John B. Pascarella, PhD Dean, College of Science and Engineering Technology DEDICATION I dedicate this dissertation to all my family members who had always motivated and supported me to successfully achieve this. A special dedication to my father, Koneru Paramesh Babu, whom I regard as the most influential person in my life, who always showed me the right direction and responsible for grooming me into a better individual. Last but not the least, I would like to dedicate to my brother Sridhar Reddy and my friend Subash Kumar, who constantly motivated me to work comprehensively and effectively, which helped me a lot in successfully achieving this milestone. iii ABSTRACT Koneru, Keerthi, Phonetic matching toolkit with state-of-the-art Meta-Soundex algorithm (English and Spanish). Master of Science (Computing and Information Science), December, 2016, Sam Houston State University, Huntsville, Texas. Researchers confront major problems while searching for various kinds of data in large imprecise databases, as they are not spelled correctly or in the way they were expected to be spelled. As a result, they cannot find the word they sought. -
Lecture # 07 (Phonetics)
Phonetics Phonetics (Introduction), Syllables, - text-to-speech recognition, - Onset, nucleus - Phones, - Coda, Rime, Speech sounds and Phonetic - Consonants and vowels, transcription, Phonological Categories and - International Phonetic Alphabet Pronunciation variation, (IPA), - ARPAbet, - Phonetic features, Articulatory Phonetics, - Predicting Phonetic variation, - The Vocal organs, - Factors Influencing Phonetic - Consonants: Place of Articulation, variation, - Consonants: Manner of Articulation, - Vowels, - Syllables, @Copyrights: Natural Language Processing (NLP) Organized by Dr. Ahmad Jalal (http://portals.au.edu.pk/imc/) 1. Phonetics (Introduction) “ What is the proper pronunciation of word “Special”” Text-to-speech conversation (converting strings of text words into acoustic waveforms). “Phonics” methods of teaching to children; - is first glance like a purely modern educational debate. Phonetics is the study of linguistic sounds, - how they are produced by the articulators of the human vocal tract, - how they are realized acoustically, - and how this acoustic realization can be digitized and processed. A key element of both speech recognition and text-to-speech systems: - how words are pronounced in terms of individual speech units called phones. @Copyrights: Natural Language Processing (NLP) Organized by Dr. Ahmad Jalal (http://portals.au.edu.pk/imc/) 2. Speech Sounds and Phonetic Transcription The study of the pronunciation of words is part of the field of phonetics, We model the pronunciation of a word as a string of symbols which represent phones or segments. - A phone is a speech sound; - phones are represented with phonetic symbols that bear some resemblance to a letter. We use 2 different alphabets for describing phones as; (1) The International Phonetic Alphabet (IPA) - is an evolving standard originally developed by the International Phonetic Association in 1888 with the goal of transcribing the sounds of all human languages. -
Dominance in Coronal Nasal Place Assimilation: the Case of Classical Arabic
http://elr.sciedupress.com English Linguistics Research Vol. 9, No. 3; 2020 Dominance in Coronal Nasal Place Assimilation: The Case of Classical Arabic Zainab Sa’aida Correspondence: Zainab Sa’aida, Department of English, Tafila Technical University, Tafila 66110, Jordan. ORCID: https://orcid.org/0000-0001-6645-6957, E-mail: [email protected] Received: August 16, 2020 Accepted: Sep. 15, 2020 Online Published: Sep. 21, 2020 doi:10.5430/elr.v9n3p25 URL: https://doi.org/10.5430/elr.v9n3p25 Abstract The aim of this study is to investigate place assimilation processes of coronal nasal in classical Arabic. I hypothesise that coronal nasal behaves differently in different assimilatory situations in classical Arabic. Data of the study were collected from the Holy Quran. It was referred to Quran.com for the pronunciations and translations of the data. Data of the study were analysed from the perspective of Mohanan’s dominance in assimilation model. Findings of the study have revealed that coronal nasal shows different assimilatory behaviours when it occurs in different syllable positions. Coronal nasal onset seems to fail to assimilate a whole or a portion of the matrix of a preceding obstruent or sonorant coda within a phonological word. However, coronal nasal in the coda position shows different phonological behaviours. Keywords: assimilation, dominance, coronal nasal, onset, coda, classical Arabic 1. Introduction An assimilatory situation in natural languages has two elements in which one element dominates the other. Nasal place assimilation occurs when a nasal phoneme takes on place features of an adjacent consonant. This study aims at investigating place assimilation processes of coronal nasal in classical Arabic (CA, henceforth). -
Building a Universal Phonetic Model for Zero-Resource Languages
Building a Universal Phonetic Model for Zero-Resource Languages Paul Moore MInf Project (Part 2) Interim Report Master of Informatics School of Informatics University of Edinburgh 2020 3 Abstract Being able to predict phones from speech is a challenge in and of itself, but what about unseen phones from different languages? In this project, work was done towards building precisely this kind of universal phonetic model. Using the GlobalPhone language corpus, phones’ articulatory features, a recurrent neu- ral network, open-source libraries, and an innovative prediction system, a model was created to predict phones based on their features alone. The results show promise, especially for using these models on languages within the same family. 4 Acknowledgements Once again, a huge thank you to Steve Renals, my supervisor, for all his assistance. I greatly appreciated his practical advice and reasoning when I got stuck, or things seemed overwhelming, and I’m very thankful that he endorsed this project. I’m immensely grateful for the support my family and friends have provided in the good times and bad throughout my studies at university. A big shout-out to my flatmates Hamish, Mark, Stephen and Iain for the fun and laugh- ter they contributed this year. I’m especially grateful to Hamish for being around dur- ing the isolation from Coronavirus and for helping me out in so many practical ways when I needed time to work on this project. Lastly, I wish to thank Jesus Christ, my Saviour and my Lord, who keeps all these things in their proper perspective, and gives me strength each day. -
UC Berkeley Dissertations, Department of Linguistics
UC Berkeley Dissertations, Department of Linguistics Title Relationship between Perceptual Accuracy and Information Measures: A cross-linguistic Study Permalink https://escholarship.org/uc/item/7tx5t8bt Author Kang, Shinae Publication Date 2015 eScholarship.org Powered by the California Digital Library University of California Relationship between perceptual accuracy and information measures: A cross-linguistic study by Shinae Kang A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Linguistics in the Graduate Division of the University of California, Berkeley Committee in charge: Professor Keith A. Johnson, Chair Professor Sharon Inkelas Professor Susan S. Lin Professor Robert T. Knight Fall 2015 Relationship between perceptual accuracy and information measures: A cross-linguistic study Copyright 2015 by Shinae Kang 1 Abstract Relationship between perceptual accuracy and information measures: A cross-linguistic study by Shinae Kang Doctor of Philosophy in Linguistics University of California, Berkeley Professor Keith A. Johnson, Chair The current dissertation studies how the information conveyed by different speech el- ements of English, Japanese and Korean correlates with perceptual accuracy. Two well- established information measures are used: weighted negative contextual predictability (in- formativity) of a speech element; and the contribution of a speech element to syllable differ- entiation, or functional load. This dissertation finds that the correlation between information and perceptual accuracy differs depending on both the type of information measure and the language of the listener. To compute the information measures, Chapter 2 introduces a new corpus consisting of all the possible syllables for each of the three languages. The chapter shows that the two information measures are inversely correlated.