Background: the Production of Sound for Speech

Total Page:16

File Type:pdf, Size:1020Kb

Background: the Production of Sound for Speech Background: the production of sound for speech Adam Albright ([email protected]) LSA 2017 Phonology University of Kentucky How do humans use sound to communicate? I Speech production I Intended meaning, calculate syntactic form, determine phonological form, translate into physical instructions I Speech perception I Sound waves enter ear, auditory analysis, parse for linguistically significant units, determine phonological and syntactic form, reconstruct intended meaning The nature of speech The production of voicing Vowels Consonants 1/54 Properties of speech sounds What makes a speech sound speech? I Speech is noise created by air flowing from the lungs, modified as it exits through the mouth/nose I Speech is just one of many modes of linguistic communication I Hand signs, writing, semaphore, morse code, … I But all except sign language are derivative from spoken speech I Speech is continuous I A fluid stream of sounds, without necessarily any pauses between individual sounds, words, or even sentences I Speech is segmentable I Composed of smaller units: words, morphemes, syllables, and individual sounds The nature of speech The production of voicing Vowels Consonants 2/54 Although it’s not obvious from the waveform, the speech stream is composed of smaller units, which we can refer to discretely I Phrases, words, phones (= individual speech sounds) I explain = e k s p l ai n Speech: continuous, yet segmentable . “If you can’t see that, then I don’t know if I can explain it to you” The nature of speech The production of voicing Vowels Consonants 3/54 I explain = e k s p l ai n Although it’s not obvious from the waveform, the speech stream is composed of smaller units, which we can refer to discretely I Phrases, words, phones (= individual speech sounds) Speech: continuous, yet segmentable . “If you can’t see that, then I don’t know if I can explain it to you” The nature of speech The production of voicing Vowels Consonants 3/54 Although it’s not obvious from the waveform, the speech stream is composed of smaller units, which we can refer to discretely I Phrases, words, phones (= individual speech sounds) Speech: continuous, yet segmentable . “If you can’t see that, then I don’t know if I can explain it to you” I explain = e k s p l ai n The nature of speech The production of voicing Vowels Consonants 3/54 Speech: continuous, yet segmentable . “If you can’t see that, then I don’t know if I can explain it to you” I explain = e k s p l ai n Although it’s not obvious from the waveform, the speech stream is composed of smaller units, which we can refer to discretely I Phrases, words, phones (= individual speech sounds) The nature of speech The production of voicing Vowels Consonants 3/54 Evidence for discrete phones I Phonemic awareness tasks (typically used for testing development in children) I What is the first sound in cat? I Which of these words start with the same sound: cat, pat, coat I Say cat without the [k]; say stand without the [t] I Say cat. What word do you get if you change the [k] to [m]? I Language games I pig latin ! ig-pay atin-lay I Speech errors I “cad you rean the small print?” I (More common: ‘spoonerisms’ of word-initial consonants) I Alphabetic writing The nature of speech The production of voicing Vowels Consonants 4/54 Producing speech Two components: I Aerodynamic component I Air is forced out of lungs, though throat, mouth, and nose I Articulatory component I Air stream is modified in various ways on its way out The nature of speech The production of voicing Vowels Consonants 5/54 The vocal tractArticulation of English sounds: Consonants (6) A bit of anatomy nasal passages palate alveolar ridge brain tongue dorsum lips velum teeth tongue uvula tongue blade larynx (glottis) to the lungs (7) The airstream: (mylungs vocal tract, for illustrativelarynx purposes)vocal tract outside world The nature(8) of speechStep 1:The The production larynx of, voicing or vocalVowels cordsConsonants 6/54 • closed: vibrate, make buzzing noise → voicing • open: no vibration, just let air pass through → no voicing ☞ What would speech sound like if there was no voicing? (9) Step 2: The vocal tract • air flowing through the mouth = oral sounds • air flowing through the nose = nasal sounds (10) Articulation: doing something to block or obstruct the airstream (11) Place of articulation = where is the obstruction? lips = labial articulation alveolar ridge = alveolar articulation palate = palatal articulation velum = velar articulation uvula = uvular articulation glottis = glottal articulation And one other useful term: air escaping around side(s) of the tongue = lateral articulation ☞ how would a labiodental articulation be made? an interdental articulation? an alveopalatal articulation? 2 + The larynx (= the glottis) From http://www.oto-hns.northwestern.edu/Voice/education.html (broken link) The vocal tract The first linguistically significant airstream obstruction: The nature of speech The production of voicing Vowels Consonants 7/54 The vocal tract The first linguistically significant airstream obstruction: + The larynx (= the glottis) From http://www.oto-hns.northwestern.edu/Voice/education.html (broken link) The nature of speech The production of voicing Vowels Consonants 7/54 Manipulations of the vocal folds Vocal folds are held at varying degrees of stiffness/closure I Completely open: for respiration (= breathing) I Close enough to obstruct airflow: whispering, sighing, [h] (aspiration) I Close together, intermediate stiffness: normal (modal) voice I Tightly closed: glottal stop (as in [ʔ]uh-[ʔ]oh) The nature of speech The production of voicing Vowels Consonants 8/54 Voicing is the first major source of noise for speech I Creates a complex periodic wave, which resonates in the mouth and nose, and is otherwise modified on its way into the atmosphere Voicing I When the vocal folds are held somewhat close together, and with the right amount of stiffness, air rushing through causes them to “vibrate” I Vocal folds are forced apart and rapidly close again, repeatedly I Periodic vibrations = voicing I Can feel externally as a “buzzing” on your throat; try [sssszzzzsssszzz] The nature of speech The production of voicing Vowels Consonants 9/54 Voicing I When the vocal folds are held somewhat close together, and with the right amount of stiffness, air rushing through causes them to “vibrate” I Vocal folds are forced apart and rapidly close again, repeatedly I Periodic vibrations = voicing I Can feel externally as a “buzzing” on your throat; try [sssszzzzsssszzz] Voicing is the first major source of noise for speech I Creates a complex periodic wave, which resonates in the mouth and nose, and is otherwise modified on its way into the atmosphere The nature of speech The production of voicing Vowels Consonants 9/54 Source properties: voice quality How would you characterize the difference between these pairs of sounds? I Mpi (Tibeto-Burman, spoken by ≈ 900 people in Thailand) Tone Word Gloss Word Gloss Low rising . si ‘to be putrid’ . si ‘to be dried up’ Low level . si ‘blood’ . si ‘seven’ Mid rising . si ‘to roll rope’ . si ‘to smoke’ Mid level . si (a color) . si (classifier) High falling . si ‘to die’ . si (name) High level . si ‘four’ . si (name) I Regular vs. ‘tense’ voice Hear them here. The nature of speech The production of voicing Vowels Consonants 10/54 Source properties: voice quality !Xóõ (Khoisan, Botswana/Namibia) Plain (voiced) Pharyngealized Strident Breathy . kǁáa . qáʕa . k!ào . k!a̤o ‘camelthorn tree’ ‘long ago’ ‘base’ ‘slope’ The nature of speech The production of voicing Vowels Consonants 11/54 I Emotive use: frustration, surprise, etc. I Individual style: Louis Armstrong, Julie Kavner, etc. description Source properties: voice quality Voice quality contrasts I Bai (Tibeto-Burman, China) (Esling 2002) . tɕi31 ‘field’ . tɕi21 ‘flag’ (breathy) (harsh) I English: what determines the voice quality that a word is spoken with? The nature of speech The production of voicing Vowels Consonants 12/54 Source properties: voice quality Voice quality contrasts I Bai (Tibeto-Burman, China) (Esling 2002) . tɕi31 ‘field’ . tɕi21 ‘flag’ (breathy) (harsh) I English: what determines the voice quality that a word is spoken with? I Emotive use: frustration, surprise, etc. I Individual style: Louis Armstrong, Julie Kavner, etc. description The nature of speech The production of voicing Vowels Consonants 12/54 The vocal tractArticulation as filter of English sounds: Consonants (6) A bit of anatomy nasal passages palate alveolar ridge brain tongue dorsum lips velum teeth tongue uvula tongue blade larynx (glottis) to the lungs (7) The airstream: lungs larynx vocal tract outside world The nature(8) of speechStep 1:The The production larynx of, voicing or vocalVowels cordsConsonants 13/54 • closed: vibrate, make buzzing noise → voicing • open: no vibration, just let air pass through → no voicing ☞ What would speech sound like if there was no voicing? (9) Step 2: The vocal tract • air flowing through the mouth = oral sounds • air flowing through the nose = nasal sounds (10) Articulation: doing something to block or obstruct the airstream (11) Place of articulation = where is the obstruction? lips = labial articulation alveolar ridge = alveolar articulation palate = palatal articulation velum = velar articulation uvula = uvular articulation glottis = glottal articulation And one other useful term: air escaping around side(s) of the tongue = lateral articulation ☞ how would a labiodental articulation be made? an interdental articulation? an alveopalatal articulation?
Recommended publications
  • Pg. 5 CHAPTER-2: MECHANISM of SPEECH PRODUCTION AND
    Acoustic Analysis for Comparison and Identification of Normal and Disguised Speech of Individuals CHAPTER-2: MECHANISM OF SPEECH PRODUCTION AND LITERATURE REVIEW 2.1. Anatomy of speech production Speech is the vocal aspect of communication made by human beings. Human beings are able to express their ideas, feelings and emotions to the listener through speech generated using vocal articulators (NIDCD, June 2010). Development of speech is a constant process and requires a lot of practice. Communication is a string of events which allows the speaker to express their feelings and emotions and the listener to understand them. Speech communication is considered as a thought that is transformed into language for expression. Phil Rose & James R Robertson (2002) mentioned that voices of individuals are complex in nature. They aid in providing vital information related to sex, emotional state or age of the speaker. Although evidence from DNA always remains headlines, DNA can‟t talk. It can‟t be recorded planning, carrying out or confessing to a crime. It can‟t be so apparently directly incriminating. Perhaps, it is these features that contribute to interest and importance of forensic speaker identification. Speech signal is a multidimensional acoustic wave (as seen in figure 2.1.) which provides information about the words or message being spoken, speaker identity, language spoken, physical and mental health, race, age, sex, education level, religious orientation and background of an individual (Schetelig T. & Rabenstein R., May 1998). pg. 5 Acoustic Analysis for Comparison and Identification of Normal and Disguised Speech of Individuals Figure 2.1.: Representation of properties of sound wave (Courtesy "Waves".
    [Show full text]
  • 6. Laryngeal Features
    6. Laryngeal features Before you study this chapter, check whether you are familiar with the following terms: allomorph, allophone, aspiration, devoicing, frequency, glottalization, glottal stop, glottis, hiatus, larynx, organs of speech, phoneme, pulmonic egressive airstream, root, sibilant, suffix, syllabic consonant, utterance, vocal cords/folds, voice assimilation, voiced, voiceless, voicing In this chapter we take a look at the articulatory role of the glottis, the vocal cords/folds and all the different phenomena that are related to the operation of the larynx. This includes voicing and voicelessness, (a comparison of English and Hungarian) voice assimilation, devoicing, aspiration and glottalization, and the effect of voicelessness on preceding vowels, Pre-fortis Clipping. Recall from Chapter 1 that the basic mechanism that is used to produce speech sounds in English and Hungarian is a pulmonic egressive airstream mechanism. Having left the lungs, the air continues upward in the windpipe up to the larynx – the front, shield-like part of which is called Adam's apple – then into the pharynx and the supraglottal cavities, the oral and nasal cavities. In the larynx it has to pass between the vocal cords/folds, two bundles of muscle, which may produce numerous different effects. Before discussing these, let us take a look at the structure and parts of the larynx and their different configurations. As it can be seen in the diagram, the larynx consists of some cartilages – one fixed in the front and two movable ones in the back, plus one more on the top not shown in the diagram, the epiglottis, which can close the windpipe, and a circular one serving as the frame for the larynx –, the Laryngeal features vocal cords connecting the cartilages, and the opening between them, the glottis.
    [Show full text]
  • Phonological Use of the Larynx: a Tutorial Jacqueline Vaissière
    Phonological use of the larynx: a tutorial Jacqueline Vaissière To cite this version: Jacqueline Vaissière. Phonological use of the larynx: a tutorial. Larynx 97, 1994, Marseille, France. pp.115-126. halshs-00703584 HAL Id: halshs-00703584 https://halshs.archives-ouvertes.fr/halshs-00703584 Submitted on 3 Jun 2012 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Vaissière, J., (1997), "Phonological use of the larynx: a tutorial", Larynx 97, Marseille, 115-126. PHONOLOGICAL USE OF THE LARYNX J. Vaissière UPRESA-CNRS 1027, Institut de Phonétique, Paris, France larynx used as a carrier of paralinguistic information . RÉSUMÉ THE PRIMARY FUNCTION OF THE LARYNX Cette communication concerne le rôle du IS PROTECTIVE larynx dans l'acte de communication. Toutes As stated by Sapir, 1923, les langues du monde utilisent des physiologically, "speech is an overlaid configurations caractéristiques du larynx, aux function, or to be more precise, a group of niveaux segmental, lexical, et supralexical. Nous présentons d'abord l'utilisation des différents types de phonation pour distinguer entre les consonnes et les voyelles dans les overlaid functions. It gets what service it can langues du monde, et également du larynx out of organs and functions, nervous and comme lieu d'articulation des glottales, et la muscular, that come into being and are production des éjectives et des implosives.
    [Show full text]
  • Learning [Voice]
    University of Pennsylvania ScholarlyCommons Publicly Accessible Penn Dissertations Fall 2010 Learning [Voice] Joshua Ian Tauberer University of Pennsylvania, [email protected] Follow this and additional works at: https://repository.upenn.edu/edissertations Part of the First and Second Language Acquisition Commons Recommended Citation Tauberer, Joshua Ian, "Learning [Voice]" (2010). Publicly Accessible Penn Dissertations. 288. https://repository.upenn.edu/edissertations/288 Please see my home page, http://razor.occams.info, for the data files and scripts that make this reproducible research. This paper is posted at ScholarlyCommons. https://repository.upenn.edu/edissertations/288 For more information, please contact [email protected]. Learning [Voice] Abstract The [voice] distinction between homorganic stops and fricatives is made by a number of acoustic correlates including voicing, segment duration, and preceding vowel duration. The present work looks at [voice] from a number of multidimensional perspectives. This dissertation's focus is a corpus study of the phonetic realization of [voice] in two English-learning infants aged 1;1--3;5. While preceding vowel duration has been studied before in infants, the other correlates of post-vocalic voicing investigated here --- preceding F1, consonant duration, and closure voicing intensity --- had not been measured before in infant speech. The study makes empirical contributions regarding the development of the production of [voice] in infants, not just from a surface- level perspective but also with implications for the phonetics-phonology interface in the adult and developing linguistic systems. Additionally, several methodological contributions will be made in the use of large sized corpora and data modeling techniques. The study revealed that even in infants, F1 at the midpoint of a vowel preceding a voiced consonant was lower by roughly 50 Hz compared to a vowel before a voiceless consonant, which is in line with the effect found in adults.
    [Show full text]
  • Relations Between Speech Production and Speech Perception: Some Behavioral and Neurological Observations
    14 Relations between Speech Production and Speech Perception: Some Behavioral and Neurological Observations Willem J. M. Levelt One Agent, Two Modalities There is a famous book that never appeared: Bever and Weksel (shelved). It contained chapters by several young Turks in the budding new psycho- linguistics community of the mid-1960s. Jacques Mehler’s chapter (coau- thored with Harris Savin) was entitled “Language Users.” A normal language user “is capable of producing and understanding an infinite number of sentences that he has never heard before. The central problem for the psychologist studying language is to explain this fact—to describe the abilities that underlie this infinity of possible performances and state precisely how these abilities, together with the various details . of a given situation, determine any particular performance.” There is no hesi­ tation here about the psycholinguist’s core business: it is to explain our abilities to produce and to understand language. Indeed, the chapter’s purpose was to review the available research findings on these abilities and it contains, correspondingly, a section on the listener and another section on the speaker. This balance was quickly lost in the further history of psycholinguistics. With the happy and important exceptions of speech error and speech pausing research, the study of language use was factually reduced to studying language understanding. For example, Philip Johnson-Laird opened his review of experimental psycholinguistics in the 1974 Annual Review of Psychology with the statement: “The fundamental problem of psycholinguistics is simple to formulate: what happens if we understand sentences?” And he added, “Most of the other problems would be half­ way solved if only we had the answer to this question.” One major other 242 W.
    [Show full text]
  • Aerodynamic and Durational Cues of Phonological Voicing in Whisper Yohann Meynadier, Yulia Gaydina
    Aerodynamic and durational cues of phonological voicing in whisper Yohann Meynadier, Yulia Gaydina To cite this version: Yohann Meynadier, Yulia Gaydina. Aerodynamic and durational cues of phonological voicing in whisper. Interspeech, Aug 2013, Lyon, France. pp.335-339. hal-01211117 HAL Id: hal-01211117 https://hal.archives-ouvertes.fr/hal-01211117 Submitted on 5 Oct 2015 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Aerodynamic and durational cues of phonological voicing in whisper Yohann Meynadier, Yulia Gaydina Aix-Marseille Université, CNRS UMR 7309, LPL, Aix-en-Provence, France [email protected], [email protected] consonants and pre-consonant vowels has long been observed: Abstract (i) vowels are longer before voiced than before voiceless This paper presents analyses on the phonological voicing consonants and, (ii) voiceless obstruents are longer than contrast in whispered speech, which is characterized by the voiced ones [2], see [35, 36] for a review. absence of vocal fold vibrations. In modal speech, besides While the voiced-voiceless difference in duration of pre- glottal vibration, the contrast between voiced and unvoiced consonantal vowels is not still well understood [2, 37, 38], the consonants is realized by other phonetic correlates: e.g.
    [Show full text]
  • Theories of Speech Perception
    Theories of Speech Perception • Motor Theory (Liberman) • Auditory Theory – Close link between perception – Derives from general and production of speech properties of the auditory • Use motor information to system compensate for lack of – Speech perception is not invariants in speech signal species-specific • Determine which articulatory gesture was made, infer phoneme – Human speech perception is an innate, species-specific skill • Because only humans can produce speech, only humans can perceive it as a sequence of phonemes • Speech is special Wilson & friends, 2004 • Perception • Production • /pa/ • /pa/ •/gi/ •/gi/ •Bell • Tap alternate thumbs • Burst of white noise Wilson et al., 2004 • Black areas are premotor and primary motor cortex activated when subjects produced the syllables • White arrows indicate central sulcus • Orange represents areas activated by listening to speech • Extensive activation in superior temporal gyrus • Activation in motor areas involved in speech production (!) Wilson and colleagues, 2004 Is categorical perception innate? Manipulate VOT, Monitor Sucking 4-month-old infants: Eimas et al. (1971) 20 ms 20 ms 0 ms (Different Sides) (Same Side) (Control) Is categorical perception species specific? • Chinchillas exhibit categorical perception as well Chinchilla experiment (Kuhl & Miller experiment) “ba…ba…ba…ba…”“pa…pa…pa…pa…” • Train on end-point “ba” (good), “pa” (bad) • Test on intermediate stimuli • Results: – Chinchillas switched over from staying to running at about the same location as the English b/p
    [Show full text]
  • Laryngeal Physiology and Terminology in CCM Singing
    Faculty of Education Ingvild Vestfall Master’s thesis Laryngeal physiology and terminology in CCM singing A thesis investigating research on the underlying laryngeal physiology of CCM singing techniques, and experiences of teaching CCM genres to adolescents Stemmefysiologi og terminologi i CCM/rytmisk sang En studie av forskning på stemmefysiologi knyttet til sangteknikker i CCM/rytmiske sjangere, og erfaringer med å undervise ungdommer i CCM/rytmisk sang Master in Culture and Language 2018 Consent to lending by University College Library YES ☒ NO ☐ Consent to accessibility in digital archive Brage YES ☒ NO ☐ ii TABLE OF CONTENTS TABLE OF CONTENTS.................................................................................................................. III LIST OF TABLES ........................................................................................................................... V LIST OF FIGURES ........................................................................................................................ VI ABSTRACT ................................................................................................................................. VII SAMMENDRAG (IN NORWEGIAN) .............................................................................................. VIII PREFACE ..................................................................................................................................... IX 1 INTRODUCTION ........................................................................................................................
    [Show full text]
  • Appendix B: Muscles of the Speech Production Mechanism
    Appendix B: Muscles of the Speech Production Mechanism I. MUSCLES OF RESPIRATION A. MUSCLES OF INHALATION (muscles that enlarge the thoracic cavity) 1. Diaphragm Attachments: The diaphragm originates in a number of places: the lower tip of the sternum; the first 3 or 4 lumbar vertebrae and the lower borders and inner surfaces of the cartilages of ribs 7 - 12. All fibers insert into a central tendon (aponeurosis of the diaphragm). Function: Contraction of the diaphragm draws the central tendon down and forward, which enlarges the thoracic cavity vertically. It can also elevate to some extent the lower ribs. The diaphragm separates the thoracic and the abdominal cavities. 2. External Intercostals Attachments: The external intercostals run from the lip on the lower border of each rib inferiorly and medially to the upper border of the rib immediately below. Function: These muscles may have several functions. They serve to strengthen the thoracic wall so that it doesn't bulge between the ribs. They provide a checking action to counteract relaxation pressure. Because of the direction of attachment of their fibers, the external intercostals can raise the thoracic cage for inhalation. 3. Pectoralis Major Attachments: This muscle attaches on the anterior surface of the medial half of the clavicle, the sternum and costal cartilages 1-6 or 7. All fibers come together and insert at the greater tubercle of the humerus. Function: Pectoralis major is primarily an abductor of the arm. It can, however, serve as a supplemental (or compensatory) muscle of inhalation, raising the rib cage and sternum. (In other words, breathing by raising and lowering the arms!) It is mentioned here chiefly because it is encountered in the dissection.
    [Show full text]
  • Illustrating the Production of the International Phonetic Alphabet
    INTERSPEECH 2016 September 8–12, 2016, San Francisco, USA Illustrating the Production of the International Phonetic Alphabet Sounds using Fast Real-Time Magnetic Resonance Imaging Asterios Toutios1, Sajan Goud Lingala1, Colin Vaz1, Jangwon Kim1, John Esling2, Patricia Keating3, Matthew Gordon4, Dani Byrd1, Louis Goldstein1, Krishna Nayak1, Shrikanth Narayanan1 1University of Southern California 2University of Victoria 3University of California, Los Angeles 4University of California, Santa Barbara ftoutios,[email protected] Abstract earlier rtMRI data, such as those in the publicly released USC- TIMIT [5] and USC-EMO-MRI [6] databases. Recent advances in real-time magnetic resonance imaging This paper presents a new rtMRI resource that showcases (rtMRI) of the upper airway for acquiring speech production these technological advances by illustrating the production of a data provide unparalleled views of the dynamics of a speaker’s comprehensive set of speech sounds present across the world’s vocal tract at very high frame rates (83 frames per second and languages, i.e. not restricted to English, encoded as conso- even higher). This paper introduces an effort to collect and nant and vowel symbols in the International Phonetic Alphabet make available on-line rtMRI data corresponding to a large sub- (IPA), which was devised by the International Phonetic Asso- set of the sounds of the world’s languages as encoded in the ciation as a standardized representation of the sounds of spo- International Phonetic Alphabet, with supplementary English ken language [7]. These symbols are meant to represent unique words and phonetically-balanced texts, produced by four promi- speech sounds, and do not correspond to the orthography of any nent phoneticians, using the latest rtMRI technology.
    [Show full text]
  • Articulatory Phonetics
    Articulatory Phonetics Lecturer: Dr Anna Sfakianaki HY578 Digital Speech Signal Processing Spring Term 2016-17 CSD University of Crete What is Phonetics? n Phonetics is a branch of Linguistics that systematically studies the sounds of human speech. 1. How speech sounds are produced Production (Articulation) 2. How speech sounds are transmitted Acoustics 3. How speech sounds are received Perception It is an interdisciplinary subject, theoretical as much as experimental. Why do speech engineers need phonetics? n An engineer working on speech signal processing usually ignores the linguistic background of the speech he/she analyzes. (Olaszy, 2005) ¡ How was the utterance planned in the speaker’s brain? ¡ How was it produced by the speaker’s articulation organs? ¡ What sort of contextual influences did it receive? ¡ How will the listener decode the message? Phonetics in Speech Engineering Combined knowledge of articulatory gestures and acoustic properties of speech sounds Categorization of speech sounds Segmentation Speech Database Annotation Algorithms Speech Recognition Speech Synthesis Phonetics in Speech Engineering Speech • diagnosis Disorders • treatment Pronunciation • L2 Teaching Tools • Foreign languages Speech • Hearing aids Intelligibility Enhancement • Other tools A week with a phonetician… n Tuesday n Thursday Articulatory Phonetics Acoustic Phonetics ¡ Speech production ¡ Formants ¡ Sound waves ¡ Fundamental Frequency ¡ Places and manners of articulation ¡ Acoustics of Vowels n Consonants & Vowels n Articulatory vs Acoustic charts ¡ Waveforms of consonants - VOT ¡ Acoustics of Consonants n Formant Transitions ¡ Suprasegmentals n Friday More Acoustic Phonetics… ¡ Interpreting spectrograms ¡ The guessing game… ¡ Individual Differences Peter Ladefoged Home Page: n Professor UCLA (1962-1991) http://www.linguistics.ucla.edu/people/ladefoge/ n Travelled in Europe, Africa, India, China, Australia, etc.
    [Show full text]
  • Ingressive Phonation in Contemporary Vocal Music, Works by Helmut Lachenmann, Georges Aperghis, Michael Baldwin, and Nicholas
    © 2012 Amanda DeBoer Bartlett All Rights Reserved iii ABSTRACT Jane Schoonmaker Rodgers, Advisor The use of ingressive phonation (inward singing) in contemporary vocal music is becoming more frequent, yet there is limited research on the physiological demands, risks, and pedagogical requirements of the various ingressive phonation techniques. This paper will discuss ingressive phonation as it is used in contemporary vocal music. The research investigates the ways in which ingressive phonation differs acoustically, physiologically, and aesthetically from typical (egressive) phonation, and explores why and how composers and performers use the various ingressive vocal techniques. Using non-invasive methods, such as electroglottograph waveforms, aerodynamic (pressure, flow, flow resistance) measures, and acoustic analyses of recorded singing, specific data about ingressive phonation were obtained, and various categories of vocal techniques were distinguished. Results are presented for basic vocal exercises and tasks, as well as for specific excerpts from the repertoire, including temA by Helmut Lachenmann and Ursularia by Nicholas DeMaison. The findings of this study were applied to a discussion surrounding pedagogical and aesthetic applications of ingressive phonation in contemporary art music intended for concert performance. Topics of this discussion include physical differences in the production and performance of ingressive phonation, descriptive information regarding the various techniques, as well as notational and practical recommendations for composers. iv This document is dedicated to: my husband, Tom Bartlett my parents, John and Gail DeBoer and my siblings, Mike, Matt, and Leslie DeBoer Thank you for helping me laugh through the process – at times ingressively – and for supporting me endlessly. v ACKNOWLEDGEMENTS I have endless gratitude for my advisor and committee chair, Dr.
    [Show full text]