The Phonation Factor in the Categorical Perception of Mandarin Tones

Total Page:16

File Type:pdf, Size:1020Kb

The Phonation Factor in the Categorical Perception of Mandarin Tones ICPhS XVII Regular Session Hong Kong, 17-21 August 2011 THE PHONATION FACTOR IN THE CATEGORICAL PERCEPTION OF MANDARIN TONES Ruo-Xiao Yang The Chinese University of Hong Kong, Hong Kong [email protected] ABSTRACT Tone is a critical element for tone languages.. Generally, tone is produced by the vibration of This study presents a new way of observing the vocal folds, which is often measured as categorical nature of Mandarin tone perception by fundamental frequency (F0) acoustically and considering the phonation factors in the perceived as pitch. When tone is described or experiment design. In order to demonstrate the perceived, the F0, sometimes as well as intensity reliability of experiment materials, a pitch and duration, is taken as its most critical physical synchronous overlap-add (PSOLA) method is used correlates. However, this standpoint is only based to produce the speech stimuli, and an inverse- on the physical or acoustic representations of tone filtering method is applied to assess the but overlook its articulator, the vocal folds. invariability of sound source of the stimuli. Results Vocal folds are the source of each sound, which provide evidence for the prediction that phonation connect the dynamics from lung beneath and the factors are adopted in perceiving tones by resonance from vocal tract above. The vibration of Mandarin speakers. Especially for the tone which vocal folds for speech production can be described has a special voice characteristic, such as the third not only as its vibrating velocity per second, i.e. tone in Mandarin, the perception difference among the fundamental frequency (F0), but also its continua with different voice qualities is much vibrating mode which is often referred as more obvious. Results also suggest that there is phonation types (or loosely, voice qualities). Since categorical perception for each pair of two tones in a speech sound can be divided into segmental Mandarin. These results lead to a discussion about (consonants and vowels) and super-segmental parts the necessity of a wider and more integrated theory under the backgrounds of modern phonology and model for describing tones in tone languages, in phonetics, the phonation part should be sorted to which the phonation factors should be considered. be a super-segmental feature. Keywords: phonation perception, voice quality, Related to surveys on the CP nature of tones, categorical perception, Mandarin tones the usual pervious hypothesis has suggested that F0 is the main or even the most crucial correlate 1. INTRODUCTION for perceiving tones. Perception experiments are Categorical perception (CP) is an important thus designed based on this viewpoint. However, if phenomenon in human cognition because people tone is observed from the “phonation” perspective, sort out surrounding things incessantly every day. F0 might not be the only correlate for Roughly, the differences among things can be distinguishing tones as different phonemic units. perceived with two modes: the “continuous Furthermore, more and more evidence show that perception” and the “categorical perception (CP)”. some tone languages (such as Hani language and CP have been formally thought to be peculiar to Jingpo language) indeed use different phonation speech and color of human perceivers, but it turns types as distinctive features rather than F0/pitch out to be much more general by later research on (Kong, [6]). Thus, it is important and necessary to human infants and animals [4, 8, 9]. CP has been discuss the CP nature of tones within a phonation investigated in different levels of language. In perspective. human speech, the general conclusion about Several previous studies have explored the vowels as continuous perception and consonants as phonation modes of Mandarin tones from a linguistic categorical perception has been widely accepted, viewpoint. Kong used EGG (Electroglottographic) whereas the categorical nature of tone in tone signal [6] and high-speed digital imaging [7] to languages is still controversial. investigate the voice of the four basic tones of Mandarin/standard Chinese in single syllables in 2204 ICPhS XVII Regular Session Hong Kong, 17-21 August 2011 isolated monosyllabic words. These research controlled and thus only [ta] syllable was used. suggested that F0 and phonation features all Previous studies suggest that [ta] is the best contribute to the perception of Mandarin tones. syllable structure for keeping the formant Keating and Esposito [5] also took some preliminary information when manipulating F0. measurements on F0 and other parameters related to For assessing the phonation information during phonation types of the four basic tones of Mandarin. perceiving Mandarin tones, 12 10-step continua They also pointed out some special phonation were synthesized as stimuli by manipulating F0 features in the low falling tone and the end of the through the “pitch synchronous overlap-add” falling tone of Mandarin, i.e., the creaky voice can be method (i.e., PSOLA) in Praat [1]. heard on the low falling tone and visible at the end of Table 1: Syllable with four Mandarin tones and F0 the falling tone. Moreover, they suggested a more parameters of four tones for synthesis. general conclusion that all “tones” (in all languages) Four Tones OT F0 (Hz) TTP F0 (Hz) TT F0 (Hz) may have some correlated variation in phonation, only most of them being subtle within the modal Tone1 (T1) 160 160 160 Tone2 (T2) 100 123 160 ranges and some owning phonological status, but it Tone3 (T3) 90 80 130 was probable that listeners in many tone languages Tone4 (T4) 160 130 80 use the phonation information in recognizing their tones especially out of context. Even though the Figure 1 (A-F): Diagrams of manipulating F0 in 12 relationships of phonation and tones have attracted continua: (A) T1T2/T2T1, (B) T1T3/TT1, (C) T1T4/T4T1, (D) T2T3/T3T2, (E) academic attention, no studies have been done to T2T4/T4T2, (F) T3T4/T4T3. discuss how the phonation information related to tones can be perceived. Moreover, no previous T1-->T2 & T2-->T1 T1-->T3 & T3-->T1 T1-->T4 & T4-->T1 170 170 170 160 160 160 research on CP of tones and perceiving phonation 150 150 150 140 140 140 130 130 130 120 120 120 F0(Hz) F0(Hz) 110 information has attempted to consider the phonation F0(Hz) 110 110 100 100 100 90 90 90 information contributing to perception of tones. We 80 80 80 OT TTP TT OT TTP TT OT TTP TT argue that it is important to look at the phonation Time Points Time Points Time Points factor and CP of tones concurrently in one (A) (B) (C) experiment design in order to achieve a better T2-->T3 & T3-->T2 T2-->T4 & T4-->T2 T3-->T4 & T4-->T3 understanding of CP and how to define the 170 170 170 160 160 160 150 150 150 140 140 140 phonological status of tones in tone languages. 130 130 130 120 120 120 F0(Hz) F0(Hz) F0(Hz) 110 110 110 100 100 100 Therefore, in this experiment, the phonation factor 90 90 90 80 80 80 has been considered concurrently with CP testing. OT TTP TT OT TTP TT OT TTP TT Time Points Time Points Time Points This design permits us to examine whether CP exists in arbitrary two tones of the four basic tones in (D) (E) (F) Mandarin by making continua as stimuli between Taylor [11] explained that TD-PSOLA is each pairs of two tones and to determine whether attempting to mimic the process of safely changing phonation information is used in differentiating tone the pitch without changing filter by separating the categories by synthesized stimuli. effects of each pulse. Thus, under the most ideal 2. EXPERIMENT MATERIALS AND condition, TD-PSOLA will keep the spectral METHODS envelope characteristics and therefore keep the original phonation/voice information from the 2.1. Materials sound source. Furthermore, Lemmetty [10] and The stimuli were manipulated based on four Upperman [12] also proposed that the speech signal Mandarin syllables with four basic tones out of resulted from PSOLA has the same spectrum as the context. To make the syllable more real and keep original signal but with a different F0. Esposito [3] better phonation information (which may be lost in also suggested that PSOLA changes the F0 of a context), the four original syllables are produced signal without changing other properties of the separately by a male speaker whose native voice and applied it to normalize F0 of stimuli for language is Beijing Mandarin. Moreover, in order testing the perception of phonation types. to keep as much sufficient information of original 2.2. Subjects syllables as possible and avoid losing information during manipulation, the syllable structure was 19 participants from Peking University in China 2205 ICPhS XVII Regular Session Hong Kong, 17-21 August 2011 (11 female and 8 male, aged 19-25 years) the tone with this voice quality; therefore, it could participated for a small amount of money in the be roughly concluded here that voice quality or experiment: an identification test followed by a phonation information indeed influenced the discrimination test. All of them were native fluent categorical perception of Mandarin tones. speakers of Mandarin / Putonghua with different Figure 2 (A-F): Identification curves for responses of Chinese dialect backgrounds. 19 subjects in 12 continua to show the differences between pairs of continua with different voice 2.3. Experiment procedures qualities: (A) T1T2 based on the voice of T1 (solid line) and T2 (dashed line); (B) T1T3 based on the The whole experiment was consisted of an voice of T1 (solid line) and T3 (dashed line); (C) identification task and a AX discrimination task. In T1T4 based on the voice of T1 (solid line) and T4 the identification task, subjects listened to stimuli (dashed line); (D) T2T3 based on the voice of T2 presented in isolation.
Recommended publications
  • Phonation Types of Korean Fricatives and Affricates
    ISSN 2005-8063 2017. 12. 31. 말소리와 음성과학 Vol.9 No.4 pp. 51-57 http://dx.doi.org/10.13064/KSSS.2017.9.4.051 Phonation types of Korean fricatives and affricates Goun Lee* Abstract The current study compared the acoustic features of the two phonation types for Korean fricatives (plain: /s/, fortis : /s’/) and the three types for affricates (aspirated : /tsʰ/, lenis : /ts/, and fortis : /ts’/) in order to determine the phonetic status of the plain fricative /s/. Considering the different manners of articulation between fricatives and affricates, we examined four acoustic parameters (rise time, intensity, fundamental frequency, and Cepstral Peak Prominence (CPP) values) of the 20 Korean native speakers’ productions. The results showed that unlike Korean affricates, F0 cannot distinguish two fricatives, and voice quality (CPP values) only distinguishes phonation types of Korean fricatives and affricates by grouping non-fortis sibilants together. Therefore, based on the similarity found in /tsʰ/ and /ts/ and the idiosyncratic pattern found in /s/, this research concludes that non-fortis fricative /s/ cannot be categorized as belonging to either phonation type. Keywords: Korean fricatives, Korean affricates, phonation type, acoustic characteristics, breathy voice 1. Introduction aspirated-like characteristic by remaining voiceless in intervocalic position. Whereas the lenis stops become voiced in intervocalic Korean has very well established three-way contrasts (e.g., position (e.g., /pata/ → / [pada] ‘sea’), the aspirated stops do not aspirated, lenis, and fortis) both in stops and affricates in three undergo intervocalic voicing (e.g., /patʰaŋ/ → [patʰaŋ] places of articulation (bilabial, alveolar, velar). However, this ‘foundation’). Since the plain fricative /s/, like the aspirated stops, distinction does not occur in fricatives, leaving only a two-way does not undergo intervocalic voicing, categorizing the plain contrast between fortis fricative /s’/ and non-fortis (hereafter, plain) fricative as aspirated is also possible.
    [Show full text]
  • Phonological Use of the Larynx: a Tutorial Jacqueline Vaissière
    Phonological use of the larynx: a tutorial Jacqueline Vaissière To cite this version: Jacqueline Vaissière. Phonological use of the larynx: a tutorial. Larynx 97, 1994, Marseille, France. pp.115-126. halshs-00703584 HAL Id: halshs-00703584 https://halshs.archives-ouvertes.fr/halshs-00703584 Submitted on 3 Jun 2012 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Vaissière, J., (1997), "Phonological use of the larynx: a tutorial", Larynx 97, Marseille, 115-126. PHONOLOGICAL USE OF THE LARYNX J. Vaissière UPRESA-CNRS 1027, Institut de Phonétique, Paris, France larynx used as a carrier of paralinguistic information . RÉSUMÉ THE PRIMARY FUNCTION OF THE LARYNX Cette communication concerne le rôle du IS PROTECTIVE larynx dans l'acte de communication. Toutes As stated by Sapir, 1923, les langues du monde utilisent des physiologically, "speech is an overlaid configurations caractéristiques du larynx, aux function, or to be more precise, a group of niveaux segmental, lexical, et supralexical. Nous présentons d'abord l'utilisation des différents types de phonation pour distinguer entre les consonnes et les voyelles dans les overlaid functions. It gets what service it can langues du monde, et également du larynx out of organs and functions, nervous and comme lieu d'articulation des glottales, et la muscular, that come into being and are production des éjectives et des implosives.
    [Show full text]
  • Part 1: Introduction to The
    PREVIEW OF THE IPA HANDBOOK Handbook of the International Phonetic Association: A guide to the use of the International Phonetic Alphabet PARTI Introduction to the IPA 1. What is the International Phonetic Alphabet? The aim of the International Phonetic Association is to promote the scientific study of phonetics and the various practical applications of that science. For both these it is necessary to have a consistent way of representing the sounds of language in written form. From its foundation in 1886 the Association has been concerned to develop a system of notation which would be convenient to use, but comprehensive enough to cope with the wide variety of sounds found in the languages of the world; and to encourage the use of thjs notation as widely as possible among those concerned with language. The system is generally known as the International Phonetic Alphabet. Both the Association and its Alphabet are widely referred to by the abbreviation IPA, but here 'IPA' will be used only for the Alphabet. The IPA is based on the Roman alphabet, which has the advantage of being widely familiar, but also includes letters and additional symbols from a variety of other sources. These additions are necessary because the variety of sounds in languages is much greater than the number of letters in the Roman alphabet. The use of sequences of phonetic symbols to represent speech is known as transcription. The IPA can be used for many different purposes. For instance, it can be used as a way to show pronunciation in a dictionary, to record a language in linguistic fieldwork, to form the basis of a writing system for a language, or to annotate acoustic and other displays in the analysis of speech.
    [Show full text]
  • The Acoustic Consequences of Phonation and Tone Interactions in Jalapa Mazatec
    The acoustic consequences of phonation and tone interactions in Jalapa Mazatec Marc Garellek & Patricia Keating Phonetics Laboratory, Department of Linguistics, UCLA [email protected] San Felipe Jalapa de Dıaz⁄ (Jalapa) Mazatec is unusual in possessing a three-way phonation contrast and three-way level tone contrast independent of phonation. This study investigates the acoustics of how phonation and tone interact in this language, and how such interactions are maintained across variables like speaker sex, vowel timecourse, and presence of aspiration in the onset. Using a large number of words from the recordings of Mazatec made by Paul Kirk and Peter Ladefoged in the 1980s and 1990s, the results of our acoustic and statistical analysis support the claim that spectral measures like H1-H2 and mid- range spectral measures like H1-A2 best distinguish each phonation type, though other measures like Cepstral Peak Prominence are important as well. This is true regardless of tone and speaker sex. The phonation type contrasts are strongest in the first third of the vowel and then weaken towards the end. Although the tone categories remain distinct from one another in terms of F0 throughout the vowel, for laryngealized phonation the tone contrast in F0 is partially lost in the initial third. Consistent with phonological work on languages that cross-classify tone and phonation type (i.e. ‘laryngeally complex’ languages, Silverman 1997), this study shows that the complex orthogonal three-way phonation and tone contrasts do remain acoustically distinct according to the measures studied, despite partial neutralizations in any given measure. 1 Introduction Mazatec is an Otomanguean language of the Popolocan branch.
    [Show full text]
  • My Dissertation Will Be Composed of Roughly 7 Chapters
    Copyright by Hansang Park 2002 Temporal and Spectral Characteristics of Korean Phonation Types by Hansang Park, B.A., M.A. Dissertation Presented to the Faculty of the Graduate School of The University of Texas at Austin in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy The University of Texas at Austin August, 2002 Dedication To my parents, Kapjo Park and Chanam Nam, and to my family, Yookyung Bae, Chanho Park, Dongho Park Acknowledgements I owe this work to so many people. This work would have been impossible without their help. First of all, I am so much indebted to Professor Robert T. Harms. He was my supervisor, professor, and graduate advisor, while I was studying in the Ph.D. program in Linguistics at the University of Texas at Austin. His comments on my work always kept me from deviating from the path I should be following. He encouraged me with wit and humor when I was frustrated. I was nourished to the fullest from discussions with him. I must confess that his broad, profound, and brewed knowledge about phonetics and linguistics is distilled into my work. When I was his teaching assistant, I was impressed at his professionalism in two ways. He was creative in preparing teaching materials and most kind to students during his classes. He taught me how to avoid mechanical grading and how to cherish students’ potentials and insights. He has two things I would like to inherit from him: the artificial larynx buzzer which is very useful to demonstrate the role of the source and filter in speech production and a peace of pipe organ which is helpful to teach the acoustic characteristics of the tube.
    [Show full text]
  • Effects of Consonant Manner and Vowel Height on Intraoral Pressure and Articulatory Contact at Voicing Offset and Onset for Voiceless Obstruentsa)
    Effects of consonant manner and vowel height on intraoral pressure and articulatory contact at voicing offset and onset for voiceless obstruentsa) Laura L. Koenigb) Haskins Laboratories, 300 George Street, New Haven, Connecticut 06511 and Long Island University, One University Plaza, Brooklyn, New York 11201 Susanne Fuchs Center for General Linguistics (ZAS), Schuetzenstrasse 18, Berlin 10117, Germany Jorge C. Lucero Department of Mathematics, University of Brasilia, Brasilia DF 70910-900, Brazil (Received 13 September 2010; revised 3 February 2011; accepted 12 February 2011) In obstruent consonants, a major constriction in the upper vocal tract yields an increase in intraoral pressure (Pio). Phonation requires that subglottal pressure (Psub) exceed Pio by a threshold value, so as the transglottal pressure reaches the threshold, phonation will cease. This work investigates how Pio levels at phonation offset and onset vary before and after different German voiceless obstruents (stop, fricative, affricates, clusters), and with following high vs low vowels. Articulatory contacts, measured using electropalatography, were recorded simultaneously with Pio to clarify how supra- glottal constrictions affect Pio. Effects of consonant type on phonation thresholds could be explained mainly in terms of the magnitude and timing of vocal-fold abduction. Phonation offset occurred at lower values of Pio before fricative-initial sequences than stop-initial sequences, and onset occurred at higher levels of Pio following the unaspirated stops of clusters compared to frica- tives, affricates, and aspirated stops. The vowel effects were somewhat surprising: High vowels had an inhibitory effect at voicing offset (phonation ceasing at lower values of Pio) in short-duration consonant sequences, but a facilitating effect on phonation onset that was consistent across conso- nantal contexts.
    [Show full text]
  • Articulatory Phonetics
    Articulatory Phonetics Lecturer: Dr Anna Sfakianaki HY578 Digital Speech Signal Processing Spring Term 2016-17 CSD University of Crete What is Phonetics? n Phonetics is a branch of Linguistics that systematically studies the sounds of human speech. 1. How speech sounds are produced Production (Articulation) 2. How speech sounds are transmitted Acoustics 3. How speech sounds are received Perception It is an interdisciplinary subject, theoretical as much as experimental. Why do speech engineers need phonetics? n An engineer working on speech signal processing usually ignores the linguistic background of the speech he/she analyzes. (Olaszy, 2005) ¡ How was the utterance planned in the speaker’s brain? ¡ How was it produced by the speaker’s articulation organs? ¡ What sort of contextual influences did it receive? ¡ How will the listener decode the message? Phonetics in Speech Engineering Combined knowledge of articulatory gestures and acoustic properties of speech sounds Categorization of speech sounds Segmentation Speech Database Annotation Algorithms Speech Recognition Speech Synthesis Phonetics in Speech Engineering Speech • diagnosis Disorders • treatment Pronunciation • L2 Teaching Tools • Foreign languages Speech • Hearing aids Intelligibility Enhancement • Other tools A week with a phonetician… n Tuesday n Thursday Articulatory Phonetics Acoustic Phonetics ¡ Speech production ¡ Formants ¡ Sound waves ¡ Fundamental Frequency ¡ Places and manners of articulation ¡ Acoustics of Vowels n Consonants & Vowels n Articulatory vs Acoustic charts ¡ Waveforms of consonants - VOT ¡ Acoustics of Consonants n Formant Transitions ¡ Suprasegmentals n Friday More Acoustic Phonetics… ¡ Interpreting spectrograms ¡ The guessing game… ¡ Individual Differences Peter Ladefoged Home Page: n Professor UCLA (1962-1991) http://www.linguistics.ucla.edu/people/ladefoge/ n Travelled in Europe, Africa, India, China, Australia, etc.
    [Show full text]
  • Acoustic Discriminability of the Complex Phonation System in !Xó˜O
    Acoustic discriminability of the complex phonation system in !Xo´o˜ Marc Garellek Department of Linguistics University of California, San Diego (September 2018 version, to appear in Phonetica) 9500 Gilman Drive #0108, La Jolla, CA 92093-0108 USA Phone: +1 858 534 2412 Fax: +1 858 534 4789 Email: [email protected] Keywords: voice quality, phonation types, harsh voice Page 1 of 42 Abstract Phonation types, or contrastive voice qualities, are minimally produced using complex movements of the vocal folds, but may additionally involve constriction in the supraglottal and pharyngeal cav- ities. These complex articulations in turn produce a multidimensional acoustic output that can be modeled in various ways. In this study, I investigate whether the psychoacoustic model of voice by Kreiman et al. (2014) succeeds at distinguishing six phonation types of !Xo´o.˜ Linear discriminant analysis is performed using parameters from the model averaged over the entire vowel as well as for the first and final halves of the vowel. The results indicate very high classification accuracy for all phonation types. Measures averaged over the vowel’s entire duration are closely correlated with the discriminant functions, suggesting that they are sufficient for distinguishing even dynamic phonation types. Measures from all classes of parameters are correlated with the linear discrimi- nant functions; in particular, the ‘strident’ vowels, which are harsh in quality, are characterized by their noise, changes in spectral tilt, decrease in voicing amplitude and frequency, and raising of the first formant. Despite the large number of contrasts and the time-varying characteristics of many of the phonation types, the phonation contrasts in !Xo´o˜ remain well differentiated acoustically.
    [Show full text]
  • Ingressive Phonation in Contemporary Vocal Music, Works by Helmut Lachenmann, Georges Aperghis, Michael Baldwin, and Nicholas
    © 2012 Amanda DeBoer Bartlett All Rights Reserved iii ABSTRACT Jane Schoonmaker Rodgers, Advisor The use of ingressive phonation (inward singing) in contemporary vocal music is becoming more frequent, yet there is limited research on the physiological demands, risks, and pedagogical requirements of the various ingressive phonation techniques. This paper will discuss ingressive phonation as it is used in contemporary vocal music. The research investigates the ways in which ingressive phonation differs acoustically, physiologically, and aesthetically from typical (egressive) phonation, and explores why and how composers and performers use the various ingressive vocal techniques. Using non-invasive methods, such as electroglottograph waveforms, aerodynamic (pressure, flow, flow resistance) measures, and acoustic analyses of recorded singing, specific data about ingressive phonation were obtained, and various categories of vocal techniques were distinguished. Results are presented for basic vocal exercises and tasks, as well as for specific excerpts from the repertoire, including temA by Helmut Lachenmann and Ursularia by Nicholas DeMaison. The findings of this study were applied to a discussion surrounding pedagogical and aesthetic applications of ingressive phonation in contemporary art music intended for concert performance. Topics of this discussion include physical differences in the production and performance of ingressive phonation, descriptive information regarding the various techniques, as well as notational and practical recommendations for composers. iv This document is dedicated to: my husband, Tom Bartlett my parents, John and Gail DeBoer and my siblings, Mike, Matt, and Leslie DeBoer Thank you for helping me laugh through the process – at times ingressively – and for supporting me endlessly. v ACKNOWLEDGEMENTS I have endless gratitude for my advisor and committee chair, Dr.
    [Show full text]
  • Dynamics of Phonatory Posturing at Phonation Onset
    The Laryngoscope VC 2015 The American Laryngological, Rhinological and Otological Society, Inc. Dynamics of Phonatory Posturing at Phonation Onset Travis L. Shiba, MD; Dinesh K. Chhetri, MD Introduction: In speech and singing, the intrinsic laryngeal muscles set the prephonatory posture prior to the onset of phonation. The timing and shape of the prephonatory glottal posture can directly affect the resulting phonation type. We investigated the dynamics of human laryngeal phonatory posturing. Methods: Onset of vocal fold adduction to phonation was observed in 27 normal subjects using high-speed video recording. Subjects were asked to utter a variety of phonation types (modal, breathy, pressed, /i/ following sniff). Digital vid- eokymography with concurrent acoustic signal was analyzed to assess the timing of the following: onset of adduction to final phonatory posture (FPT), phonation onset time (POT), and phonatory posture time (PPT). Final phonatory posture time was determined as the moment at which the laryngeal configuration used in phonation was first achieved. Results: Thirty-three audiovisual recordings met inclusion criteria. Average FPT, PPT, and POT were as follows: 303, 106, and 409 ms for modal; 430, 104, and 534 ms for breathy; 483, 213, and 696 ms for pressed; and 278, 98, and 376 ms for sniff-/i/. The following posturing features were observed: 1) pressed phonation: increased speed of closure just prior to final posture, complete glottal closure, and increased supraglottic hyperactivity; and 2) breathy phonation: decreased speed of closure prior to final posture, increased posterior glottal gap, and increased midmembranous gap. Conclusions: Phonation onset latency was shortest for modal and longest for pressed voice.
    [Show full text]
  • The Mechanics of Breathing and Speaking
    2 “ Speech is the voice of the heart.” —Anna Quindlen The Need to Breathe: The Mechanics of Breathing and Speaking Frontal Sinus Sphenoid Sinus Nasal Cavity Oral Cavity Pharynx Epiglottis Larynx Trachea Bronchus Superior Lobe Alveoli Heart Bronchioles Middle Lobe Inferior Lobe Diaphragm ©stockshoppe/Shutterstock.com ©stockshoppe/Shutterstock.com Th e parts of our anatomy that are used to speak are also used for other purposes. Speaking is a secondary biological function as all of the basic anatomical parts used in speech are also used for other purposes. Much of the anatomy that is used in speech is also used in ordinary breathing and as such, speech and breathing and breath control in speaking are intimately linked. Th e parts of our body that are used to speak can be broken down in to three areas: (1) Phonation, which includes the larynx, vocal folds, glottis, and epiglottis, (2) Resonation, which includes the use of the pharynx, oral 11 Clark_Dillard_The_Speakers_Voice01E_Ch02_Printer.indd 11 14/07/16 1:14 pm 12 The Speaker’s Voice cavity, and the nasal cavity, and (3) Articulation, which includes the lips, tongue, hard and soft palate, lower jaw, and the gums. Phonation is the process of vibrating the vocal chords for initial pro- duction of speech. Resonation is the amplification and modification of the sound originated in the larynx. Articulation is the production of the indi- vidual sounds or phonemes of a given language. When we breathe we take air into our lungs through the process of respi- ration or inhalation and exhalation. Contracting and relaxing the diaphragm controls this process of respiration.
    [Show full text]
  • The Use of Inhalation Phonation in the Voice Studio: a New Approach to Addressing MTD in Singers
    The Use of Inhalation Phonation in the Voice Studio: A New Approach to Addressing MTD in Singers Kate Emerich Gordon and Ona Reed [Editor’s Note: A tutorial video illustrating procedures outlined in this article may be accessed on the official NATS YouTube channel, https://youtu.be/ uMgaW4ckuws.] INTRODUCTION inging is both an art and an athletic endeavor. It requires a perfect coordination of the respiratory, phonatory, and resonance systems for an optimized vocal production. However, just as athletes can experi- ence decreased performance because of a physical imbalance (e.g., Kate Emerich Gordon Smuscular injury or weakness), a singer can be challenged with a loss of range, power, vocal quality, and decreased ease of phonation with an imbalanced speaking or singing voice. Muscle tension dysphonia (MTD) is a voice disor- der related to muscular imbalance in and around the vocal mechanism.1 The muscle tension patterns in MTD can be significant in nature and even prevent normal phonation from taking place, which is why aphonia (the inability to produce sound) is often grouped with MTD as the letter A (MTD/A).2 Most singers experience a mild to moderate form of MTD, but it is possible for sing- ers to become aphonic because of extreme muscle tension patterns. Singers with MTD present with symptoms that include a high laryngeal carriage, Ona Reed hyperactive contraction of the ventricular folds, and an anterior-posterior squeeze from the arytenoid cartilages forward to the epiglottis.3 Symptoms of MTD include difficulty singing in a particular range of the singing voice, effortful voice production, fatigue and/or hoarseness after singing for a short period of time, and discomfort with voice use.
    [Show full text]