DOCUMENT RESUME

ED 318 056 CS 507 127

AUTHOR Pisoni, David B.; And Others
TITLE Research on Speech Perception. Progress Report No. 12.
INSTITUTION Indiana Univ., Bloomington. Dept. of Psychology.
SPONS AGENCY Air Force Armstrong Aerospace Medical Research Lab, Wright-Patterson AFB, OH.; National Institutes of Health (DHHS), Bethesda, Md.; National Science Foundation, Washington, D.C.
PUB DATE 86
CONTRACT AF-F-33615-83-K-0501
GRANT BNS-83-05387; NS-07134-08; NS-12179-10
NOTE 457p.; For other reports in this series, see CS 507 123-129.
PUB TYPE Reports - Research/Technical (143) -- Collected Works - General (020) -- Information Analyses (070)
EDRS PRICE MF01/PC19 Plus Postage.
DESCRIPTORS *Acoustic Phonetics; Auditory Discrimination; *Auditory Perception; Communication Research; Computer Software Development; Infants; *Language Processing; Language Research; Linguistics; Speech; *Speech Synthesizers
IDENTIFIERS Indiana University Bloomington; *Speech Perception; Speech Research; Theory Development

ABSTRACT
Summarizing research activities in 1986, this is the twelfth annual report of research on speech perception, analysis, synthesis, and recognition conducted in the Speech Research Laboratory of the Department of Psychology at Indiana University. The report contains the following 23 articles: "Comprehension of Digitally Encoded Natural Speech Using a Sentence Verification Task (SVT): A First Report" (D. B. Pisoni and M. J. Dedina); "Comprehension of Natural and Synthetic Speech: II. Effects of Predictability on Verification of Sentences Controlled for Intelligibility" (D. B. Pisoni and others); "Perceptual Learning of Synthetic Speech Produced by Rule" (S. L. Greenspan and others); "Trading Relations, Acoustic Cue Integration, and Context Effects in Speech Perception" (D. B. Pisoni and P. A. Luce); "Using Template Pattern Structure Information to Improve Speech Recognition Performance" (M. Yuchtman and H. C.
Nusbaum); "On Word-Initial Voicing: Converging Sources of Evidence in Phonologically Disordered Speech" (J. A. Gierut and D. A. Dinnsen); "On the Assessment of Productive Phonological Knowledge" (J. A. Gierut); "Generative Phonology and Error Pattern Analyses: Empirical Claims and Differences" (J. A. Gierut); "Effects of Talker Uncertainty on Auditory Word Recognition: A First Report" (J. W. Mullennix and D. B. Pisoni); "Effects of Stress and Final-Consonant Voicing on Vowel Production: Articulatory and Acoustic Analyses" (V. Summers); "Preference Judgments Comparing Different Synthetic Voices" (J. S. Logan and D. B. Pisoni); "Auditory Perception of Complex Sounds: Some Comparisons of Speech vs. Nonspeech Signals" (D. B. Pisoni); "Perceptual Attention in Monitoring Natural and Synthetic Speech" (H. C. Nusbaum and others); "Intelligibility of Phoneme Specific Sentences Using Three Text-to-Speech Systems and a Natural Speech Control" (J. S. Logan and D. B. Pisoni); "PRONOUNCE: A Program for Pronunciation by Analogy" (M. J. Dedina and H. C. Nusbaum); "The Role of the Lexicon in Speech Perception" (D. B. Pisoni and others); "The Role of Structural Constraints in Auditory Word Recognition" (H. C. Nusbaum and D. B. Pisoni); "A Brief Overview of Speech Synthesis and Recognition Technologies" (D. B. Pisoni); "Developing Methods for Assessing the Performance of Speech Synthesis and Recognition Systems" (D. B. Pisoni and H. C. Nusbaum); "Recognition Performance of Six Isolated Utterance Speech Recognition Systems" (H. C. Nusbaum and others); "Human Factors Issues for the Next Generation of Speech Recognition Systems" (H. C. Nusbaum and D. B. Pisoni); "Using Speech as an Index of Alcohol Intoxication" (C. S. Martin and M. Yuchtman); "Effects of Wholistic versus Dimensional Training on Learning to Identify Spectrographic Displays of Speech" (B. G. Greene); and "Testing the Performance of Isolated Utterance Speech Recognition Devices" (H. C. Nusbaum and others).
(SR)

Reproductions supplied by EDRS are the best that can be made from the original document.

RESEARCH ON SPEECH PERCEPTION
Progress Report No. 12 (1986)

David B. Pisoni, Ph.D.
Principal Investigator

Speech Research Laboratory
Department of Psychology
Indiana University
Bloomington, Indiana 47405

Research Supported by:

Department of Health and Human Services
U.S. Public Health Service
National Institutes of Health Research Grant No. NS-12179-10
National Institutes of Health Training Grant No. NS-07134-08
National Science Foundation Research Grant No. BNS 83-05387
and
U.S. Air Force Armstrong Aerospace Medical Research Laboratory (AFSC) Contract No. AF-F-33615-83-K-0501

Table of Contents

Introduction

I. Extended Manuscripts
  Comprehension of digitally encoded natural speech using a sentence verification task (SVT): A first report; David B. Pisoni and Michael J. Dedina
  Comprehension of natural and synthetic speech: II. Effects of predictability on verification of sentences controlled for intelligibility; David B. Pisoni, Laura M. Manous, and Michael J. Dedina
  Perceptual learning of synthetic speech produced by rule; Steven L. Greenspan, Howard C. Nusbaum, and David B. Pisoni
  Trading relations, acoustic cue integration, and context effects in speech perception; David B. Pisoni and Paul A. Luce
  Using template pattern structure information to improve speech recognition performance; Moshe Yuchtman and Howard C. Nusbaum
  On word-initial voicing: Converging sources of evidence in phonologically disordered speech; Judith A. Gierut and Daniel A. Dinnsen
  On the assessment of productive phonological knowledge; Judith A. Gierut
  Generative phonology and error pattern analyses: Empirical claims and differences; Judith A. Gierut
  Effects of talker uncertainty on auditory word recognition: A first report; John W. Mullennix and David B. Pisoni
  Effects of stress and final-consonant voicing on vowel production: Articulatory and acoustic analyses; Van Summers
  Preference judgements comparing different synthetic voices; John S. Logan and David B. Pisoni

II. Short Reports and Work in Progress
  Auditory perception of complex sounds: Some comparisons of speech vs. nonspeech signals; David B. Pisoni
  Perceptual attention in monitoring natural and synthetic speech; Howard C. Nusbaum, Steven L. Greenspan, and David B. Pisoni
  Intelligibility of phoneme specific sentences using three text-to-speech systems and a natural speech control; John S. Logan and David B. Pisoni
  PRONOUNCE: A program for pronunciation by analogy; Michael J. Dedina and Howard C. Nusbaum
  The role of the lexicon in speech perception; David B. Pisoni, Paul A. Luce, and Howard C. Nusbaum
  The role of structural constraints in auditory word recognition; Howard C. Nusbaum and David B. Pisoni
  A brief overview of speech synthesis and recognition technologies; David B. Pisoni
  Developing methods for assessing the performance of speech synthesis and recognition systems; David B. Pisoni and Howard C. Nusbaum
  Recognition performance of six isolated utterance speech recognition systems; Howard C. Nusbaum, C. Noah Davis, David B. Pisoni and Ella Davis
  Human factors issues for the next generation of speech recognition systems; Howard C. Nusbaum and David B. Pisoni
  Using speech as an index of alcohol intoxication; Christopher S. Martin and Moshe Yuchtman
  Effects of wholistic versus dimensional training on learning to identify spectrographic displays of speech; Beth G. Greene

III. Instrumentation and Software Development
  Testing the performance of isolated utterance speech recognition devices; Howard C. Nusbaum, Christopher K. Davis, David B. Pisoni, and Ella K. Davis

IV. Publications

V. SRL Laboratory Staff and Personnel

INTRODUCTION

This is the twelfth annual report summarizing the research activities on speech perception, analysis, synthesis, and recognition carried out in the Speech Research Laboratory, Department of Psychology, Indiana University in Bloomington. As with previous reports, our main goal has been to summarize various research activities over the past year and make them readily available to granting agencies, sponsors and interested colleagues in the field. Some of the papers contained in this report are extended manuscripts that have been prepared for formal publication as journal articles or book chapters. Other papers are simply short reports of research presented