The Motor Theory of Speech Perception Revised* Haskins Laboratories
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Of the Linguistic Society of Ameri
1856c ARTHUR S. ABRAMSON Arthur S. Abramson, former Secretary, Vice President, and President (1983) of the Linguistic Society of America, died on December 15, 2017, at the age of ninety-two.1 He was a central figure in the linguistic study of tone, voicing, voice quality, and duration, primarily in their phonetic realization. His seminal work with Leigh Lisker (Lisker & Abramson 1964) introduced the metric VOICE ONSET TIME (VOT), which has become per- haps the most widely used measure in phonetics. He contributed to the field in numerous other ways; in addition to his work for the LSA, he became the first chair of the Linguis- tics Department of the University of Connecticut, served as editor of Language and Speech, and was a long-term researcher and board member of Haskins Laboratories. LIFE. Arthur was born January 26, 1925, in Jersey City, New Jersey. He learned Bib- lical Hebrew in his childhood, and then followed up with post-Biblical Hebrew later on. Yiddish was also part of his background. His parents were born in the US, but his grandparents were not, and they primarily spoke Yiddish. He learned some from his mother, and it came in handy when he served in the US Army in Europe during World War II. (Many years later, he taught it at the University of Connecticut in the Center for Judaic Studies, which he helped establish.) He became fluent in French, starting from classes in high school, where he was lucky enough to have a native speaker as a teacher. (The teacher was Belgian, and Arthur was sometimes said, by French colleagues, to have a Belgian accent.) Arthur had an early introduction to synthetic speech when he at- tended the New York World’s Fair in 1939. -
La Voz Humana
UNIVERSIDAD DE CHILE FACULTAD DE ARTES MAGISTER EN ARTES MEDIALES SHOUT! Tesis para optar al Grado de Magíster en Artes Medialess ALUMNO: Christian Oyarzún Roa PROFESOR GUÍA: Néstor Olhagaray Llanos Santiago, Chile 2012 A los que ya no tienen voz, a los viejos, a los antiguos, a aquellos que el silencio se lleva en la noche. ii AGRADECIMIENTOS Antes que nada quiero agradecer a Néstor Olhagaray, por el tesón en haberse mantenido como profesor guía de este proyecto y por la confianza manifestada más allá de mis infinitos loops y procastinación. También quisiera manifestar mis agradecimientos a Álvaro Sylleros quién, como profesor del Diplomado en Lutería Electrónica en la PUC, estimuló y ayudó a dar forma a parte sustancial de lo aquí expuesto. A Valentina Montero por la ayuda, referencia y asistencia curatorial permanente, A Pablo Retamal, Pía Sommer, Mónica Bate, Marco Aviléz, Francisco Flores, Pablo Castillo, Willy MC, Álvaro Ceppi, Valentina Serrati, Yto Aranda, Leonardo Beltrán, Daniel Tirado y a todos aquellos que olvido y que participaron como informantes clave, usuarios de prueba ó simplemente apoyando entusiastamente el proyecto. Finalmente, quiero agradecer a Claudia González, quien ha sido una figura clave dentro de todo este proceso. Gracias Clau por tu sentido crítico, por tu presencia inspiradora y la ejemplar dedicación y amor por lo que haces. iii Tabla de Contenidos RESÚMEN v INTRODUCCIÓN 1 A Hard Place 1 Descripción 4 Objetivos 6 LA VOZ HUMANA 7 La voz y el habla 7 Balbuceos, risas y gritos 11 Yo no canto por cantar 13 -
Status Report on Speech Research
DOCumENT RESI.TME ED 278 066 CS 505 474 AUTHOR O'Brien, Nancy, Ed. TITLE Status Report on Speech Research: AReport on the Status and Progress of Studieson the Nature of Speech, Instrumentation for Its Investigation,and Practical Applications, April 1-September30, 1986. INSTITUTION Haskins Labs., New Haven, Conn. SPONS AGENCY National Institutes of Health (DHHS),Bethesda, Md.; National Science Foundation, Washington,D.C.; Office of Naval_Research, Washington,D.C. REPORT NO SR-86/87(1986) PUB DATE 86 CONTRACT NICHHD-N01-HD-5-2910; ONR-N00014-83-K-0083 GRANT NICHHD-HD-01994; NIH-BRS-RR-05596;NINCDS-NS-13617; NINCDS-N5-13870; NINCDS-NS-18010;NSF-DNS-8111470; NSF-BNS-8520709 NOTE 319p.; For the previous report,see ED 274 022. AVAILABLE FROMU.S. Department of Commerce, NationalTechnical Information Service, 5285 Port RoyalRd., Springfield, VA 22151. PUB TYPE Reports - ReseimrtM/Technical (143) information Analyses (070)-- Collected Works - General (020) EDRS PRICE MF01/PC13 Plus Postage_. DESCRIPTORS *Communication Research*Morphology (Languages); *Research Methodology; Research Utilization; *Speech Communication ABSTRACT Focusing on the status,progress, instrumentation, and applications of_studieson the nature of speech, this report contains the following research_studies:"The Role of Psychophysics in Understanding Speech Perception" (B.H. Repp); "Specialized Perceiving Systems for Speech and OtherBiologically Significant Sounds" (I. G. Mattingly; A. M. Liberman);"'Voicing' in EnglishA Catalog of Acoustic Features Signaling /b/_versus/p/ in Trochees (L. Lisker); "Categorical Tendenciesin Imitating Self-Produced Isolated Vowels" (B. H. Repp; D. R. Williams);"An Acoustic Analysis of V-to-C and V-to-V: Coarticulatory Effectsin Catalan and Spanish VCV Sequences" (D. -
Aac 2, 3, 4, 5, 6, 50, 51, 52, 53, 55, 56, 57, 58, 59, 60, 61, 62
315 Index A augmentative and alternative communication (AAC) 130, 145, 148 AAC 2, 3, 4, 5, 6, 50, 51, 52, 53, 55, 56, 57, Augmentative and Alternative Communication 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, (AAC) 50 69, 130, 131, 138, 140, 141, 142, 143, augmented speech 116, 126 144, 145, 147, 148, 149, 150, 153, 156, Augmented Speech Communication 117 158, 159, 160, 161, 162, 163, 164, 165, Augmented speech communication (ASC) 116 172, 173, 174, 175, 192, 194, 198, 205, automatic categorization 220 206, 207, 208, 213, 214, 215, 216, 257, automatic speech recognition 100 258, 259, 260, 263, 265 Automatic Speech Recognition (ASR) 189 able-bodied people 220 accessibility 12, 14, 15, 21 B acoustic energy 31 adaptive differential pulse code modulation background noise 56, 67, 130, 132, 134, 135, (ADPCM) 32 136, 137, 141, 142, 144 AIBO 17 BCI 17 aided communication techniques 205 bit rate 31, 32, 33, 41, 45, 47 aided communicator 234, 235, 241, 243, 245, bits per second (bps) 31 248, 250, 252 Blissymbols 235, 238, 239, 240, 241, 242, 243, aided language development 235 244, 246, 247, 250, 256 allophones 34, 35 brain-computer interfaces (BCI) 17 alternate reader 17 Brain Computer Interfaces (BCI’s) 6 alternative and augmentative communication brain-machine interface (BMI) 17 (AAC) 2, 93 C alternative communication 1 amyotrophic lateral sclerosis (ALS) 1 CALL 189, 190, 191, 192, 193, 196, 198, 199, analog-to-digital converter 30, 31 201, 202, 203 anti-aliasing (low-pass) filters 3 1 CASLT 188, 189, 190, 191, 193, 198, 199 aphasia 148, 149, -
The Conductor Model of Online Speech Modulation
The Conductor Model of Online Speech Modulation Fred Cummins Department of Computer Science, University College Dublin [email protected] Abstract. Two observations about prosodic modulation are made. Firstly, many prosodic parameters co-vary when speaking style is altered, and a similar set of variables are affected in particular dysarthrias. Second, there is a need to span the gap between the phenomenologically sim- ple space of intentional speech control and the much higher dimensional space of manifest effects. A novel model of speech production, the Con- ductor, is proposed which posits a functional subsystem in speech pro- duction responsible for the sequencing and modulation of relatively in- variant elements. The ways in which the Conductor can modulate these elements are limited, as its domain of variation is hypothesized to be a relatively low-dimensional space. Some known functions of the cortico- striatal circuits are reviewed and are found to be likely candidates for implementation of the Conductor, suggesting that the model may be well grounded in plausible neurophysiology. Other speech production models which consider the role of the basal ganglia are considered, leading to some observations about the inextricable linkage of linguistic and motor elements in speech as actually produced. 1 The Co-Modulation of Some Prosodic Variables It is a remarkable fact that speakers appear to be able to change many aspects of their speech collectively, and others not at all. I focus here on those prosodic variables which are collectively affected by intentional changes to speaking style, and argue that they are governed by a single modulatory process, the ‘Con- ductor’. -
The Routledge Handbook of Embodied Cognition
THE ROUTLEDGE HANDBOOK OF EMBODIED COGNITION Embodied cognition is one of the foremost areas of study and research in philosophy of mind, philosophy of psychology and cognitive science. The Routledge Handbook of Embodied Cognition is an outstanding guide and reference source to the key philosophers, topics and debates in this exciting subject and essential reading for any student and scholar of philosophy of mind and cognitive science. Comprising over thirty chapters by a team of international contributors, the Handbook is divided into six parts: Historical underpinnings Perspectives on embodied cognition Applied embodied cognition: perception, language, and reasoning Applied embodied cognition: social and moral cognition and emotion Applied embodied cognition: memory, attention, and group cognition Meta-topics. The early chapters of the Handbook cover empirical and philosophical foundations of embodied cognition, focusing on Gibsonian and phenomenological approaches. Subsequent chapters cover additional, important themes common to work in embodied cognition, including embedded, extended and enactive cognition as well as chapters on embodied cognition and empirical research in perception, language, reasoning, social and moral cognition, emotion, consciousness, memory, and learning and development. Lawrence Shapiro is a professor in the Department of Philosophy, University of Wisconsin – Madison, USA. He has authored many articles spanning the range of philosophy of psychology. His most recent book, Embodied Cognition (Routledge, 2011), won the American Philosophical Association’s Joseph B. Gittler award in 2013. Routledge Handbooks in Philosophy Routledge Handbooks in Philosophy are state-of-the-art surveys of emerging, newly refreshed, and important fields in philosophy, providing accessible yet thorough assessments of key problems, themes, thinkers, and recent developments in research. -
WIKI on ACCESSIBILITY Completion Report March 2010
WIKI ON ACCESSIBILITY Completion report March 2010 By Nirmita Narasimhan Programme Manager Centre for Internet and Society Project Title: Wiki on “Accessibility, Disability and the Internet in India” Page | 1 REPORT Accessibility wiki: accessibility.cis-india.org The wiki project was envisaged and funded by the National Internet Exchange of India (www.nixi.in) and has been executed by the Centre for Internet and Society (www.cis-india.org), Bangalore. Project Start date: May 2009 End date: February 2010. Background India has a large percentage of disabled persons in its population— estimated to be over seven per cent as per the Census of 2001. Even this figure is believed to be a gross under representation of the total number of disabled persons residing in this large and diverse country. Taken in figures, this amounts to roughly 70-100 million persons with disabilities in the territory of India. Out of this number, a mere two per cent residing in urban areas have access to information and assistive technologies which enable them to function in society and enhance their performance. There are several reasons for this, one of them being that there is a deplorable lack of awareness which exists on the kinds of disabilities and about ways in which one can provide information and services to disabled persons. Parents, teachers, government authorities and society at large are all equally unaware about the options which exist in technology today to enable persons with disabilities to carry on independent and productive lives. Barring a few exceptions, India is still trapped in an era where a white cane and a Braille slate symbolises the future for blind people, while the world has progressed to newer forms of enabling technology such as screen readers, daisy players, the Kindle and so on. -
Chapter 10 Speech Synthesis Ch T 11 a T Ti S H R Iti Chapter 11 Automatic
Chapter 10 Speech Synthesis Chapt er 11 Aut omati c Speech Recogniti on 1 An Enggpineer’s Perspective Speech production Speech analysis ShSpeech Speech Speech quality coding recognition assessment Speech Speech Speaker synthesis enhancement recognition 2 3 4 History Long before modern electronic signal processing was invented, speech researchers tried to build machines to create human speech. Early examples of 'speaking heads' were made by Gerbert of Aurillac (d. 1003), Albertus Magnus (1198-1280), and Roger Bacon (1214-1294). In 1779, the Danish scientist Christian Kratzenstein, working at the time at the Russian Academy of Sciences, built models of the human vocal tract that could produce the five long vowel sounds (a, e, i, o and u). 5 Kratzenstein's resonators 6 Engineering the vocal tract: Riesz 1937 7 Homer Dudley 1939 VODER Synthesizing speech by electrical means 1939 World’s Fair •Manually controlled through complex keyboard •Operator training was a problem 8 Cooper’s Pattern Playback Haskins Labs for investigating speech perception Works like an inverse of a spectrograph Light from a lamp goes through a rotating disk then througgph spectrog ram into photovoltaic cells Thus amount of light that gets transmitted at eachfh frequency b and correspond s to amount of acoustic energy at that band 9 Cooper’s Pattern Playback 10 Modern TTS systems 1960’s first full TTS: Umeda et al (1968) 1970’s Joe Olive 1977 concatenation of linear-prediction diphones Speak and Spell 1980’s 1979 MIT MITalk (Allen, Hunnicut, Klatt) 1990’s-present Diphone synthesis Unit selection synthesis 11 Types of Modern Synthesis Articulatory Synthesis: Model movements of articulators and acoustics o f voca l trac t Formant Synthesis: Start with acoustics,,/ create rules/filters to create each formant Concatenative Synthesis: Use databases of stored speech to assemble new utterances. -
Radical Embodied Cognitive Science Anthony Chemero
Radical Embodied Cognitive Science Anthony Chemero Radical Embodied Cognitive Science Radical Embodied Cognitive Science Anthony Chemero A Bradford Book The MIT Press Cambridge, Massachusetts London, England ( 2009 Massachusetts Institute of Technology All rights reserved. No part of this book may be reproduced in any form by any elec- tronic or mechanical means (including photocopying, recording, or information storage and retrieval) without permission in writing from the publisher. MIT Press books may be purchased at special quantity discounts for business or sales promotional use. For information, please email [email protected] or write to Special Sales Department, The MIT Press, 55 Hayward Street, Cambridge, MA 02142. This book was set in Stone Serif and Stone Sans on 3B2 by Asco Typesetters, Hong Kong, and was printed and bound in the United States of America. Library of Congress Cataloging-in-Publication Data Chemero, Anthony, 1969–. Radical embodied cognitive science / Anthony Chemero. p. cm.—(A Bradford Book) Includes bibliographical references and index. ISBN 978-0-262-01322-2 (hardcover : alk. paper) 1. Perception—Research. 2. Cognitive science. I. Title. BF311.B514 2009 153—dc22 2009001697 10987654321 For the crowd at Sweet William’s Pub Contents Preface: In Praise of Dr. Fodor ix Acknowledgments xiii I Stage Setting 1 1 Hegel, Behe, Chomsky, Fodor 3 2 Embodied Cognition and Radical Embodied Cognition 17 II Representation and Dynamics 45 3 Theories of Representation 47 4 The Dynamical Stance 67 5 Guides to Discovery 85 III Ecological Psychology 103 6 Information and Direct Perception 105 7 Affordances, etc. 135 IV Philosophical Consequences 163 8 Neurophilosophy Meets Radical Embodied Cognitive Science 165 9 The Metaphysics of Radical Embodiment 183 10 Coda 207 Notes 209 References 221 Index 245 Preface: In Praise of Dr. -
Memory Interference As a Determinant of Language Comprehension Julie A
Language and Linguistics Compass 6/4 (2012): 193–211, 10.1002/lnc3.330 Memory Interference as a Determinant of Language Comprehension Julie A. Van Dyke* and Clinton L. Johns Haskins Laboratories Abstract The parameters of the human memory system constrain the operation of language comprehension processes. In the memory literature, both decay and interference have been proposed as causes of forgetting; however, while there is a long history of research establishing the nature of interference effects in memory, the effects of decay are much more poorly supported. Nevertheless, research investigating the limitations of the human sentence processing mechanism typically focus on decay-based explanations, emphasizing the role of capacity, while the role of interference has received comparatively little attention. This paper reviews both accounts of difficulty in language comprehension by drawing direct connections to research in the memory domain. Capacity-based accounts are found to be untenable, diverging substantially from what is known about the opera- tion of the human memory system. In contrast, recent research investigating comprehension difficulty using a retrieval-interference paradigm is shown to be wholly consistent with both behavioral and neuropsychological memory phenomena. The implications of adopting a retrieval- interference approach to investigating individual variation in language comprehension are discussed. 1. Introduction One of the most fascinating properties of natural language is the fact that related pieces of information need not be adjacent. For instance, the subject of a sentence (athlete) may be separated from its verb (ran) by just a few words (1), or by many words, phrases, or even clauses (2–4): (1) The athlete in the training program ran every day. -
Haskins Newsletter
THE SCIENCE OF THE SPOKEN AND WRITTEN WORD CommuniquéWinter 2019 an evidence-based curriculum. Under Dr. Nicole Landi’s direction, we have now collected extensive From the President's brain and cognitive data on a large group of children Desk at each school and we are gaining important insights into individual differences in core learning profiles in It is a pleasure to introduce our this complex population. Look for updates on this inaugural edition of the Haskins initiative in future editions of this newsletter. Communiqué newsletter! It is our This newsletter also highlights the uniqueness of hope that the Communiqué will Haskins as an international community; a defining become a vehicle for sharing the attribute of a lab that cares about the biology of exciting discoveries and tremendous growth that I human language in all its variations. Two important am privileged to witness daily in my role as President. partnerships are highlighted here: the announcement This newsletter is a component of the development of the Haskins Global Literacy Hub (page 2), and the portfolio taken on by Dr. Julie Van Dyke, who moved founding of a joint developmental neuroscience lab into the new role of Vice President for Research and with the National Taiwan Normal University (page 3). Strategic Initiatives earlier this year. Having served Synergistic relationships across such a wide range as a Senior Scientist at Haskins for 18 years, and with of disciplines and among researchers from over 40 extesive experience in project management, science countries is what makes Haskins Laboratories so communication, and editorial service, she is well unique and special. -
Arthur S. Abramson
LAS0010.1177/0023830918780146Language and SpeechWhalen and Koenig 780146research-article2018 1856 Language Obituary and Speech Language and Speech 2018, Vol. 61(2) 334 –336 Arthur S. Abramson © The Author(s) 2018 Reprints and permissions: sagepub.co.uk/journalsPermissions.nav https://doi.org/10.1177/0023830918780146DOI: 10.1177/0023830918780146 journals.sagepub.com/home/las D. H. Whalen City University of New York, Haskins Laboratories, Yale University, USA Laura L. Koenig Adelphi University, Haskins Laboratories, USA Arthur S. Abramson, editor of Language and Speech from 1975 to 1987, died on 15 December 2017, at the age of 92. He was a central figure in the study of the phonetic realizations of linguistic distinctions of tone, voicing, voice quality, and duration. His seminal work with Leigh Lisker on the metric, voice onset time (VOT) (Lisker & Abramson, 1964), has become perhaps the most extensively used measure in phonetics. He contributed to his profession in many ways beyond his research, serving as the first chair of the Linguistics Department of the University of Connecticut, president (and other roles) of the Linguistic Society of America, and a researcher and Board mem- ber of Haskins Laboratories. In the early 1950s, while still a graduate student in linguistics at Columbia University, New York, Arthur mentioned to colleagues and people at the Department of State that he was very inter- ested in Southeast Asian languages and culture. He was recommended to apply for a Fulbright teaching grant, which he received. Arthur spent 1953–1955 teaching English in Thailand, begin- ning in the south (Songkhla), and later moving to Bangkok. This experience started a decades-long relationship with the language and people of Thailand.