Speech Sounds of American English

Total Page:16

File Type:pdf, Size:1020Kb

Speech Sounds of American English Speech Sounds of American English • There are over 40 speech sounds in American English which can be organized by their basic manner of production Manner Class Number Vowels 18 Fricatives 8 Stops 6 Nasals 3 Semivowels 4 Affricates 2 Aspirant 1 • Vowels, glides, and consonants differ in degree of constriction • Sonorant consonants have no pressure build up at constriction • Nasal consonants lower the velum allowing airflow in nasal cavity • Continuant consonants do not block airflow in oral cavity 6.345 Automatic Speech Recognition Speech Sounds 1 Vowel Production • No significant constriction in the vocal tract • Usually produced with periodic excitation • Acoustic characteristics depend on the position of the jaw, tongue, and lips [i][@][a][u] 6.345 Automatic Speech Recognition Speech Sounds 2 Vowels of American English • There are approximately 18 vowels in American English made up of monothongs, diphthongs, and reduced vowels (schwa’s) /i¤ / iy beat /O/ ao bought /a¤ / ay bite /I/ ih bit /^/ahbut /O¤ / oy Boyd /e¤ /eybait /o⁄ / ow boat /a⁄ /awbout /E/ehbet /U/ uh book [{] ax about /@/aebat /u/ uw boot [|]ixroses /a/ aa Bob /5/ er Bert [}] axr butter • They are often described by the articulatory features: High/Low, Front/Back, Retroflexed, Rounded,andTense/Lax 6.345 Automatic Speech Recognition Speech Sounds 3 Spectrograms of the Cardinal Vowels Time (seconds) Time (seconds) Time (seconds) Time (seconds) 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 16 16 16 16 16 16 16 16 Zero Crossing Rate Zero Crossing Rate Zero Crossing Rate Zero Crossing Rate kHz 8 8 kHzkHz 8 8 kHzkHz 8 8 kHzkHz 8 8 kHz 0 0 0 0 0 0 0 0 Total Energy Total Energy Total Energy Total Energy dB dB dB dB dB dB dB dB Energy -- 125 Hz to 750 Hz Energy -- 125 Hz to 750 Hz Energy -- 125 Hz to 750 Hz Energy -- 125 Hz to 750 Hz dB dB dB dB dB dB dB dB 8 8 8 8 8 8 8 8 Wide Band Spectrogram Wide Band Spectrogram Wide Band Spectrogram Wide Band Spectrogram 7 7 7 7 7 7 7 7 6 6 6 6 6 6 6 6 5 5 5 5 5 5 5 5 kHz 4 4 kHzkHz 4 4 kHzkHz 4 4 kHzkHz 4 4 kHz 3 3 3 3 3 3 3 3 2 2 2 2 2 2 2 2 1 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 Waveform Waveform Waveform Waveform 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 beet bat bott boot /bi¤ t//b@t//bat//but/ 6.345 Automatic Speech Recognition Speech Sounds 4 Vowel Formant Averages • Vowels are often characterized by the lower three formants • High/Low is correlated with the first formant, F1 • Front/Back is correlated with the second formant, F2 • Retroflexion is marked by a low third formant, F3 Female Speakers Male Speakers 3500 3500 F3 F2 F1 F3 F2 F1 3000 3000 2500 2500 2000 2000 1500 1500 1000 1000 Average Frequency (Hz) Average Frequency (Hz) 500 500 0 0 i¤ I e¤ E @ a O ^ o⁄ U u 5 { | i¤ I e¤ E @ a O ^ o⁄ U u 5 { | Vowel Vowel 6.345 Automatic Speech Recognition Speech Sounds 5 Vowel Durations • Each vowel has a different intrinsic duration • Schwa’s have distinctly shorter durations (50ms) • /I, E, ^, U/ are the shortest monothongs • Context can greatly influence vowel duration Female Speakers Male Speakers 250 250 200 200 150 150 100 100 Average Duration (ms) Average Duration (ms) 50 50 0 0 i¤ I e¤ E @ a O ^ o⁄ U u 5 { | a⁄ o¤ a¤ ¤u i¤ I e¤ E @ a O ^ o⁄ U u 5 { | a⁄ o¤ a¤ ¤u Vowel Vowel 6.345 Automatic Speech Recognition Speech Sounds 6 Rob's Happy Little Vowel Chart "So inaccurate, yet so useful." is m F3 ight ink y l o! Th ow to g FRONT ? ay i Yo e w ur pal 5 is th I e E @ uÚ ^,{ TENSE = Towards Edges tends to be longer LAX = Towards Center Increases O tends to be shorter 2 U a F o BACK u SCHWAS: Plain [{] About [{ba⁄t] Front [|] Roses [ro⁄z|z] HIGH MID LOW Retroflex [}] Forever [f}Ev}] F1 Increases 6.345 Automatic Speech Recognition Speech Sounds 7 Fricative Production • Turbulence produced at narrow constriction • Constriction position determines acoustic characteristics • Can be produced with periodic excitation [f][T][s][S] 6.345 Automatic Speech Recognition Speech Sounds 8 Fricatives of American English • There are 8 fricatives in American English • Four places of articulation: Labio-Dental (Labial), Interdental (Dental), Alveolar,andPalato-Alveolar (Palatal) • They are often described by the features Voiced/Unvoiced,or Strident/Non-Strident (constriction behind alveolar ridge) Type Unvoiced Voiced Labial /f/ffee /v/vv Dental /T/ththief/D/dhthee Alveolar /s/ssee /z/zz Palatal /S/ sh she /Z/zhGigi 6.345 Automatic Speech Recognition Speech Sounds 9 Spectrograms of Unvoiced Fricatives Time (seconds) Time (seconds) Time (seconds) Time (seconds) 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 16 16 16 16 16 16 16 16 Zero Crossing Rate Zero Crossing Rate Zero Crossing Rate Zero Crossing Rate kHz 8 8 kHzkHz 8 8 kHzkHz 8 8 kHzkHz 8 8 kHz 0 0 0 0 0 0 0 0 Total Energy Total Energy Total Energy Total Energy dB dB dB dB dB dB dB dB Energy -- 125 Hz to 750 Hz Energy -- 125 Hz to 750 Hz Energy -- 125 Hz to 750 Hz Energy -- 125 Hz to 750 Hz dB dB dB dB dB dB dB dB 8 8 8 8 8 8 8 8 Wide Band Spectrogram Wide Band Spectrogram Wide Band Spectrogram Wide Band Spectrogram 7 7 7 7 7 7 7 7 6 6 6 6 6 6 6 6 5 5 5 5 5 5 5 5 kHz 4 4 kHzkHz 4 4 kHzkHz 4 4 kHzkHz 4 4 kHz 3 3 3 3 3 3 3 3 2 2 2 2 2 2 2 2 1 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 Waveform Waveform Waveform Waveform 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 fee thief see she /fi¤ //Ti¤ f//si¤ //Si¤ / 6.345 Automatic Speech Recognition Speech Sounds 10 Fricative Energy NON-STRIDENT STRIDENT Probability Density unadjusted for frequency 0.0 0.02 0.04 0.06 -100 -90 -80 -70 -60 -50 -40 Average Total Energy Strident fricatives tend to be stronger than non-strident fricatives. 6.345 Automatic Speech Recognition Speech Sounds 11 Fricative Durations UNVOICED VOICED Probability Density unadjusted for frequency 02468101214 0.0 0.05 0.10 0.15 0.20 0.25 0.30 Duration Voiced fricatives tend to be shorter than unvoiced fricatives. 6.345 Automatic Speech Recognition Speech Sounds 12 Examples of Fricative Voicing Contrast Time (seconds) Time (seconds) Time (seconds) Time (seconds) 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 16 16 16 16 16 16 16 16 Zero Crossing Rate Zero Crossing Rate Zero Crossing Rate Zero Crossing Rate kHz 8 8 kHzkHz 8 8 kHzkHz 8 8 kHzkHz 8 8 kHz 0 0 0 0 0 0 0 0 Total Energy Total Energy Total Energy Total Energy dB dB dB dB dB dB dB dB Energy -- 125 Hz to 750 Hz Energy -- 125 Hz to 750 Hz Energy -- 125 Hz to 750 Hz Energy -- 125 Hz to 750 Hz dB dB dB dB dB dB dB dB 8 8 8 8 8 8 8 8 Wide Band Spectrogram Wide Band Spectrogram Wide Band Spectrogram Wide Band Spectrogram 7 7 7 7 7 7 7 7 6 6 6 6 6 6 6 6 5 5 5 5 5 5 5 5 kHz 4 4 kHzkHz 4 4 kHzkHz 4 4 kHzkHz 4 4 kHz 3 3 3 3 3 3 3 3 2 2 2 2 2 2 2 2 1 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 Waveform Waveform Waveform Waveform 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 sue zoo face phase /su//zu//fe¤ s//fe¤ z/ 6.345 Automatic Speech Recognition Speech Sounds 13 Rob's Friendly Little Consonant Chart "Somewhat more accurate, yet somewhat less useful." The Semi-vowels: Place of Articulation yiis like an extreme Labial Dental Alveolar Palatal Velar wuis like an extreme lois like an extreme p b t d k g r5is like an extreme The Odds and Ends: f v T D s z S Z h (unvoiced h) H (voiced h) FricativeWeak Stop (Non-strident) Strong (Strident) F (flap) FÊ (nasalized flap) m n4 ? (glottal stop) Manner of Articulation Nasal Voicing: Unvoiced Voiced The Affricates: Ctis like +S Jdis like +Z 6.345 Automatic Speech Recognition Speech Sounds 14 What is this word? Time (seconds) 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 1.1 16 16 Zero Crossing Rate kHz 8 8 kHz 0 0 Total Energy dB dB Energy -- 125 Hz to 750 Hz dB dB 8 8 Wide Band Spectrogram 7 7 6 6 5 5 kHz 4 4 kHz 3 3 2 2 1 1 0 0 Waveform 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 1.1 6.345 Automatic Speech Recognition Speech Sounds 15 Stop Production • Complete closure in the vocal tract, pressure build up • Sudden release of the constriction, turbulence noise • Can have periodic excitation during closure [b][d][g] 6.345 Automatic Speech Recognition Speech Sounds 16 Stops of American English • There are 6 stop consonants in American English • Three places of articulation: Labial, Alveolar,andVelar • Each place of articulation has a voiced and unvoiced stop Type Voiced Unvoiced Labial /b/ b bought /p/ p pot Alveolar /d/ d dot /t/ t tot Velar /g/ggot /k/kcot •Unvoiced stops are typically aspirated • Voiced stops usually exhibit a “voice-bar’’ during closure • Information about formant transitions and release useful for classification 6.345 Automatic Speech Recognition Speech Sounds 17 Spectrograms of Unvoiced Stops Time (seconds) Time (seconds) Time (seconds) 0.0 0.1 0.2 0.3 0.4 0.0 0.1 0.2 0.3 0.4 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 16 16 16 16 16 16 Zero Crossing Rate Zero Crossing Rate Zero Crossing Rate kHz88 kHz kHz88 kHz kHz 8 8 kHz 00000 0 Total Energy Total Energy Total Energy dB dB dB dB dB dB Energy -- 125 Hz to 750 Hz Energy -- 125 Hz to 750 Hz Energy -- 125 Hz to 750 Hz dB dB dB dB dB dB 8 8 8 8 8 8 Wide Band Spectrogram Wide Band Spectrogram Wide Band Spectrogram 7 7 7 7 7 7 6 6 6 6 6 6 5 5 5 5 5 5 kHz4 4 kHz kHz4 4 kHz kHz 4 4
Recommended publications
  • Study on Unique Pharyngeal and Uvular Consonants in Foreign Accented Arabic
    Study on Unique Pharyngeal and Uvular Consonants in Foreign Accented Arabic Yousef A. Alotaibi, Khondaker Abdullah-Al-Mamun, and Ghulam Muhammad Department of Computer Engineering, College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia {yalotaibi, mamun, ghulam}@ccis.ksu.edu.sa common to all Arabic-speaking countries. It is the language Abstract used in the media (television, radio, press, etc.), in official This paper investigates the unique pharyngeal and uvular speeches, in universities and schools, and, generally speaking, consonants of Arabic from the automatic speech recognition in any kind of formal communication situation [4]. (ASR) point of view. Comparisons of the recognition error Arabic texts are almost never fully diacritized and are thus rates for these phonemes are analyzed in five experiments that potentially unsuitable for automatic speech recognition and involve different combinations of native and non-native synthesis [3]. Table 1 shows all Arabic alphabet letters and Arabic speakers. The most three confusing consonants for their correspondences to consonant and semivowel phonemes. every investigated consonant are uncovered and discussed. This table also shows the phonetic description of each Results confirm that these Arabic distinct consonants are a phoneme including the place of articulation. In this paper we major source of difficulty for ASR. While the recognition rate use Language Data Consortium (LDC) WestPoint Modern for certain of these unique consonants such as /H/ can drop MSA database phoneme set rather than International Phonetic below 35% when uttered by non-native speakers, there are Alphabet (IPA). This was decided because we are using advantages to including non-native speakers in ASR.
    [Show full text]
  • A Brief Description of Consonants in Modern Standard Arabic
    Linguistics and Literature Studies 2(7): 185-189, 2014 http://www.hrpub.org DOI: 10.13189/lls.2014.020702 A Brief Description of Consonants in Modern Standard Arabic Iram Sabir*, Nora Alsaeed Al-Jouf University, Sakaka, KSA *Corresponding Author: [email protected] Copyright © 2014 Horizon Research Publishing All rights reserved. Abstract The present study deals with “A brief Modern Standard Arabic. This study starts from an description of consonants in Modern Standard Arabic”. This elucidation of the phonetic bases of sounds classification. At study tries to give some information about the production of this point shows the first limit of the study that is basically Arabic sounds, the classification and description of phonetic rather than phonological description of sounds. consonants in Standard Arabic, then the definition of the This attempt of classification is followed by lists of the word consonant. In the present study we also investigate the consonant sounds in Standard Arabic with a key word for place of articulation in Arabic consonants we describe each consonant. The criteria of description are place and sounds according to: bilabial, labio-dental, alveolar, palatal, manner of articulation and voicing. The attempt of velar, uvular, and glottal. Then the manner of articulation, description has been made to lead to the drawing of some the characteristics such as phonation, nasal, curved, and trill. fundamental conclusion at the end of the paper. The aim of this study is to investigate consonant in MSA taking into consideration that all 28 consonants of Arabic alphabets. As a language Arabic is one of the most 2.
    [Show full text]
  • The Festvox Indic Frontend for Grapheme-To-Phoneme Conversion
    The Festvox Indic Frontend for Grapheme-to-Phoneme Conversion Alok Parlikar, Sunayana Sitaram, Andrew Wilkinson and Alan W Black Carnegie Mellon University Pittsburgh, USA aup, ssitaram, aewilkin, [email protected] Abstract Text-to-Speech (TTS) systems convert text into phonetic pronunciations which are then processed by Acoustic Models. TTS frontends typically include text processing, lexical lookup and Grapheme-to-Phoneme (g2p) conversion stages. This paper describes the design and implementation of the Indic frontend, which provides explicit support for many major Indian languages, along with a unified framework with easy extensibility for other Indian languages. The Indic frontend handles many phenomena common to Indian languages such as schwa deletion, contextual nasalization, and voicing. It also handles multi-script synthesis between various Indian-language scripts and English. We describe experiments comparing the quality of TTS systems built using the Indic frontend to grapheme-based systems. While this frontend was designed keeping TTS in mind, it can also be used as a general g2p system for Automatic Speech Recognition. Keywords: speech synthesis, Indian language resources, pronunciation 1. Introduction in models of the spectrum and the prosody. Another prob- lem with this approach is that since each grapheme maps Intelligible and natural-sounding Text-to-Speech to a single “phoneme” in all contexts, this technique does (TTS) systems exist for a number of languages of the world not work well in the case of languages that have pronun- today. However, for low-resource, high-population lan- ciation ambiguities. We refer to this technique as “Raw guages, such as languages of the Indian subcontinent, there Graphemes.” are very few high-quality TTS systems available.
    [Show full text]
  • LINGUISTICS 221 LECTURE #3 the BASIC SOUNDS of ENGLISH 1. STOPS a Stop Consonant Is Produced with a Complete Closure of Airflow
    LINGUISTICS 221 LECTURE #3 Introduction to Phonetics and Phonology THE BASIC SOUNDS OF ENGLISH 1. STOPS A stop consonant is produced with a complete closure of airflow in the vocal tract; the air pressure has built up behind the closure; the air rushes out with an explosive sound when released. The term plosive is also used for oral stops. ORAL STOPS: e.g., [b] [t] (= plosives) NASAL STOPS: e.g., [m] [n] (= nasals) There are three phases of stop articulation: i. CLOSING PHASE (approach or shutting phase) The articulators are moving from an open state to a closed state; ii. CLOSURE PHASE (= occlusion) Blockage of the airflow in the oral tract; iii. RELEASE PHASE Sudden reopening; it may be accompanied by a burst of air. ORAL STOPS IN ENGLISH a. BILABIAL STOPS: The blockage is made with the two lips. spot [p] voiceless baby [b] voiced 1 b. ALVEOLAR STOPS: The blade (or the tip) of the tongue makes a closure with the alveolar ridge; the sides of the tongue are along the upper teeth. lamino-alveolar stops or Check your apico-alveolar stops pronunciation! stake [t] voiceless deep [d] voiced c. VELAR STOPS: The closure is between the back of the tongue (= dorsum) and the velum. dorso-velar stops scar [k] voiceless goose [g] voiced 2. NASALS (= nasal stops) The air is stopped in the oral tract, but the velum is lowered so that the airflow can go through the nasal tract. All nasals are voiced. NASALS IN ENGLISH a. BILABIAL NASAL: made [m] b. ALVEOLAR NASAL: need [n] c.
    [Show full text]
  • Phonological Processes
    Phonological Processes Phonological processes are patterns of articulation that are developmentally appropriate in children learning to speak up until the ages listed below. PHONOLOGICAL PROCESS DESCRIPTION AGE ACQUIRED Initial Consonant Deletion Omitting first consonant (hat → at) Consonant Cluster Deletion Omitting both consonants of a consonant cluster (stop → op) 2 yrs. Reduplication Repeating syllables (water → wawa) Final Consonant Deletion Omitting a singleton consonant at the end of a word (nose → no) Unstressed Syllable Deletion Omitting a weak syllable (banana → nana) 3 yrs. Affrication Substituting an affricate for a nonaffricate (sheep → cheep) Stopping /f/ Substituting a stop for /f/ (fish → tish) Assimilation Changing a phoneme so it takes on a characteristic of another sound (bed → beb, yellow → lellow) 3 - 4 yrs. Velar Fronting Substituting a front sound for a back sound (cat → tat, gum → dum) Backing Substituting a back sound for a front sound (tap → cap) 4 - 5 yrs. Deaffrication Substituting an affricate with a continuant or stop (chip → sip) 4 yrs. Consonant Cluster Reduction (without /s/) Omitting one or more consonants in a sequence of consonants (grape → gape) Depalatalization of Final Singles Substituting a nonpalatal for a palatal sound at the end of a word (dish → dit) 4 - 6 yrs. Stopping of /s/ Substituting a stop sound for /s/ (sap → tap) 3 ½ - 5 yrs. Depalatalization of Initial Singles Substituting a nonpalatal for a palatal sound at the beginning of a word (shy → ty) Consonant Cluster Reduction (with /s/) Omitting one or more consonants in a sequence of consonants (step → tep) Alveolarization Substituting an alveolar for a nonalveolar sound (chew → too) 5 yrs.
    [Show full text]
  • The Violability of Backness in Retroflex Consonants
    The violability of backness in retroflex consonants Paul Boersma University of Amsterdam Silke Hamann ZAS Berlin February 11, 2005 Abstract This paper addresses remarks made by Flemming (2003) to the effect that his analysis of the interaction between retroflexion and vowel backness is superior to that of Hamann (2003b). While Hamann maintained that retroflex articulations are always back, Flemming adduces phonological as well as phonetic evidence to prove that retroflex consonants can be non-back and even front (i.e. palatalised). The present paper, however, shows that the phonetic evidence fails under closer scrutiny. A closer consideration of the phonological evidence shows, by making a principled distinction between articulatory and perceptual drives, that a reanalysis of Flemming’s data in terms of unviolated retroflex backness is not only possible but also simpler with respect to the number of language-specific stipulations. 1 Introduction This paper is a reply to Flemming’s article “The relationship between coronal place and vowel backness” in Phonology 20.3 (2003). In a footnote (p. 342), Flemming states that “a key difference from the present proposal is that Hamann (2003b) employs inviolable articulatory constraints, whereas it is a central thesis of this paper that the constraints relating coronal place to tongue-body backness are violable”. The only such constraint that is violable for Flemming but inviolable for Hamann is the constraint that requires retroflex coronals to be articulated with a back tongue body. Flemming expresses this as the violable constraint RETRO!BACK, or RETRO!BACKCLO if it only requires that the closing phase of a retroflex consonant be articulated with a back tongue body.
    [Show full text]
  • Phonetics in Phonology” in This SICOL).1
    Emergent Stops John J. Ohala University of California, Berkeley Two of the most fundamental distinctions between classes of speech sounds is that between sonorants and obstruents and between continuants and non-continuants. Sonorants are characterized as sounds which have no constriction small enough to impede the flow of air to the point of creating any audible turbulence; obstruents, as sounds which have a constriction which does impede the flow of air to the point of creating turbulence a stop burst. Continuants are sounds which could be extended indefinitely whereas non-continuants involve a momentary and abrupt attenuation of the speech signal amplitude. This being the case, it is a rather remarkable phonological event when a stop, which is an non-continuant obstruent, appears as it were, “out of nowhere” surrounded by speech sounds which are either sonorants and/or continuants. Typically these are referred to in the phonological literature as ‘epenthetic’ or ‘intrusive’ stops, terms which reflect the belief that they were introduced by some external cause. Some examples are given in (1) (for references, see Ohala 1995, in press). (1) Engl. youngster [»j√Nkst‘] < j√N + st‘ Engl. warmth [wç”mpT] < warm + T Engl. Thompson < Thom + son (proper name) Engl. dempster ‘judge’ < deem + ster Sotho vontSa ‘to show’ < *voniSa (causative of ‘to see’) Cl. Greek andros < ane# ros ‘man’ French chambre < Latin kame(ra 'room' Spanish alhambra < Arabic al hamra ‘the red’ Latin templum < *tem - lo ‘a section’ A common explanation for these stops is to characterize them as ways to make the transition easier between the flanking sounds or to “repair” ill-formed phonotactics (Piggot and Singh 1985).
    [Show full text]
  • Phonetics and Phonology in Russian Unstressed Vowel Reduction: a Study in Hyperarticulation
    Phonetics and Phonology in Russian Unstressed Vowel Reduction: A Study in Hyperarticulation Jonathan Barnes Boston University (Short title: Hyperarticulating Russian Unstressed Vowels) Jonathan Barnes Department of Modern Foreign Languages and Literatures Boston University 621 Commonwealth Avenue, Room 119 Boston, MA 02215 Tel: (617) 353-6222 Fax: (617) 353-4641 [email protected] Abstract: Unstressed vowel reduction figures centrally in recent literature on the phonetics-phonology interface, in part owing to the possibility of a causal relationship between a phonetic process, duration-dependent undershoot, and the phonological neutralizations observed in systems of unstressed vocalism. Of particular interest in this light has been Russian, traditionally described as exhibiting two distinct phonological reduction patterns, differing both in degree and distribution. This study uses hyperarticulation to investigate the relationship between phonetic duration and reduction in Russian, concluding that these two reduction patterns differ not in degree, but in the level of representation at which they apply. These results are shown to have important consequences not just for theories of vowel reduction, but for other problems in the phonetics-phonology interface as well, incomplete neutralization in particular. Introduction Unstressed vowel reduction has been a subject of intense interest in recent debate concerning the nature of the phonetics-phonology interface. This is the case at least in part due to the existence of two seemingly analogous processes bearing this name, one typically called phonetic, and the other phonological. Phonological unstressed vowel reduction is a phenomenon whereby a given language's full vowel inventory can be realized only in lexically stressed syllables, while in unstressed syllables some number of neutralizations of contrast take place, with the result that only a subset of the inventory is realized on the surface.
    [Show full text]
  • English Phonetic Vowel Shortening and Lengthening As Perceptually Active for the Poles
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE Title: English phonetic vowel shortening and lengthening as perceptually active for the Poles Author: Arkadiusz Rojczyk Citation style: Rojczyk Arkadiusz. (2007). English phonetic vowel shortening and lengthening as perceptually active for the Poles. W: J. Arabski (red.), "On foreign language acquisition and effective learning" (S. 237-249). Katowice : Wydawnictwo Uniwersytetu Śląskiego Phonological subsystem English phonetic vowel shortening and lengthening as perceptually active for the Poles Arkadiusz Rojczyk University of Silesia, Katowice 1. Introduction Auditory perception is one of the most thriving and active domains of psy­ cholinguistics. Recent years have witnessed manifold attempts to investigate and understand how humans perceive linguistic sounds and what principles govern the process. The major assumption underlying the studies on auditory perception is a simple fact that linguistic sounds are means of conveying meaning, encoded by a speaker, to a hearer. The speaker, via articulatory gestures, transmits a message encoded in acoustic waves. This is the hearer, on the other hand, who, using his perceptual apparatus, reads acoustic signals and decodes them into the meaning. Therefore, in phonetic terms, the whole communicative act can be divided into three stages. Articulatory - the speaker encodes the message by sound production. Acoustic - the message is transmitted to the hearer as acous­ tic waves. Auditory - the hearer perceives sounds and decodes them into the meaning. Each of the three stages is indispensable for effective communication. Of the three aforementioned stages of communication, the auditory percep­ tion is the least amenable to precise depiction and understanding.
    [Show full text]
  • Lecture 5 Sound Change
    An articulatory theory of sound change An articulatory theory of sound change Hypothesis: Most common initial motivation for sound change is the automation of production. Tokens reduced online, are perceived as reduced and represented in the exemplar cluster as reduced. Therefore we expect sound changes to reflect a decrease in gestural magnitude and an increase in gestural overlap. What are some ways to test the articulatory model? The theory makes predictions about what is a possible sound change. These predictions could be tested on a cross-linguistic database. Sound changes that take place in the languages of the world are very similar (Blevins 2004, Bateman 2000, Hajek 1997, Greenberg et al. 1978). We should consider both common and rare changes and try to explain both. Common and rare changes might have different characteristics. Among the properties we could look for are types of phonetic motivation, types of lexical diffusion, gradualness, conditioning environment and resulting segments. Common vs. rare sound change? We need a database that allows us to test hypotheses concerning what types of changes are common and what types are not. A database of sound changes? Most sound changes have occurred in undocumented periods so that we have no record of them. Even in cases with written records, the phonetic interpretation may be unclear. Only a small number of languages have historic records. So any sample of known sound changes would be biased towards those languages. A database of sound changes? Sound changes are known only for some languages of the world: Languages with written histories. Sound changes can be reconstructed by comparing related languages.
    [Show full text]
  • Part 1: Introduction to The
    PREVIEW OF THE IPA HANDBOOK Handbook of the International Phonetic Association: A guide to the use of the International Phonetic Alphabet PARTI Introduction to the IPA 1. What is the International Phonetic Alphabet? The aim of the International Phonetic Association is to promote the scientific study of phonetics and the various practical applications of that science. For both these it is necessary to have a consistent way of representing the sounds of language in written form. From its foundation in 1886 the Association has been concerned to develop a system of notation which would be convenient to use, but comprehensive enough to cope with the wide variety of sounds found in the languages of the world; and to encourage the use of thjs notation as widely as possible among those concerned with language. The system is generally known as the International Phonetic Alphabet. Both the Association and its Alphabet are widely referred to by the abbreviation IPA, but here 'IPA' will be used only for the Alphabet. The IPA is based on the Roman alphabet, which has the advantage of being widely familiar, but also includes letters and additional symbols from a variety of other sources. These additions are necessary because the variety of sounds in languages is much greater than the number of letters in the Roman alphabet. The use of sequences of phonetic symbols to represent speech is known as transcription. The IPA can be used for many different purposes. For instance, it can be used as a way to show pronunciation in a dictionary, to record a language in linguistic fieldwork, to form the basis of a writing system for a language, or to annotate acoustic and other displays in the analysis of speech.
    [Show full text]
  • Roger Lass the Idea: What Is Schwa?
    Stellenbosch Papers in Linguistics, Vol. 15, 1986, 01-30 doi: 10.5774/15-0-95 SPIL 15 (]986) 1 - 30 ON SCHW.A. * Roger Lass The idea: what is schwa? Everybody knows what schwa is or do they? This vene- rable Hebraic equivocation, with its later avatars like "neutral vowel", MUT'melvokaZ, etc. seems to be solidly es­ tablished in our conceptual and transcriptional armories. Whether it should be is another matter. In its use as a transcriptional symbol, I suggest, it represents a somewhat unsavoury and dispensable collection of theoretical and empirical sloppinesses and ill-advised reifications. It also embodies a major category confusion. That is, [8J is the only "phonetic symbol" that in accepted usage has only "phonological" or functional reference, not (precise) phone­ tic content. As we will see, there is a good deal to be said against raJ as a symbol for unstressed vowels, though there is often at least a weak excuse for invoking it. But "stressed schwa", prominent in discussions of Afrikaans and English (among other languages) is probably just about inex­ ·cusable. v Schwa (shwa, shva, se wa , etc.) began life as a device in Hebrew orthography. In "pointed" or "vocalized" script (i.e. where vowels rather than just consonantal skeletons are represented) the symbol ":" under a consonant graph appa­ rently represented some kind of "overshort" and/or "inde­ terminate" vowel: something perhaps of the order of a Slavic jeT', but nonhigh and generally neither front nor back though see below. In (Weingreen 1959:9, note b) it is described as,"a quick vowel-like sound", which is "pronounced like 'e' in 'because'''.
    [Show full text]