Overview of Spoken Language Communication Technologies

Total Page:16

File Type:pdf, Size:1020Kb

Overview of Spoken Language Communication Technologies 3 Spoken Language Communication Technology 3-1 Overview of Spoken Language Communication Technologies KASHIOKA Hideki The goal of Spoken Language Communication Laboratory, Universal Communication Research Institute, NICT, is to realize multi language communication technologies with spoken language regardless of who, where, when, how and in which language users speak. Toward this goal, we will intensively develop ICT for a human-machine interface, such as multilingual speech recognition, multilingual speech synthesis, and spoken dialogue technology. In this pa- per, we indicate these technologies overview. Keywords Speech recognition, Speech synthesis, Dialogue processing 1 Introduction log system technologies, which are the major technologies that constitute spoken language With the development of information and communication technology. communication technologies, it is desired to achieve communication between various peo- 2 Spoken language ple in various environments and situations, for communication technology example, communication between people in distant locations or who use different languag- Spoken language communication technol- es. To realize such communications, multilin- ogy is the technology of not only recording gual spoken language communications need to and utilizing the information of various be studied for people’s smooth interactions in speech-based daily communications but also any languages, at any time and in any place overcoming communication barriers. The ma- with any expressions. The research and devel- jor technologies for spoken language commu- opment of spoken language communication nication are speech recognition technology for technology is an important issue since spoken transforming speech dialog to text, speech language communication is one of the most synthesis technology for providing the speech natural human communications. In our daily output of text information, and dialog system life, along with the rapid spread of smart technology for supporting speech-based inter- phones, voice services to access various kinds actions. of information are now available and used by many people. 2.1 Speech recognition technology In this paper we give an overview of We often receive a large amount of speech speech recognition, speech synthesis, and dia- information not only in conversations with KASHIOKA Hideki 11 JM-3-1-下版-20121126-KASHIOKA.indd 11 13/01/11 16:24 others but also from various announcements, ken dialog system used in devices like smart TV, radio, and videos on the Internet. Speech phones. Applications in call center systems are recognition technology transforms such speech also expected. The utilization of speech recog- into text. nition for captioning news and other TV pro- For the transform, the technology learns grams or internet videos has been requested from many corpora the models of extracting for a variety of reasons (e.g. for supporting speech features and using them for the trans- people with disabilities or for recording mo- form of speech. It then learns the models of tion pictures). To use speech recognition prac- verbalizing the obtained character strings as tically in these applications, we need to work words, phrases and sentences, and checks the on the issues of noise processing, long sen- input speech against the learned models. The tence processing, and precision improvement models of extracting speech features and using etc. Multilingual support is also an important them for the transform of speech are called issue. acoustic models and the models of verbalizing the obtained character strings as words, phras- 2.2 Speech synthesis technology es and sentences are called language models. Speech information is often necessary in Spoken Language Communication Laboratory various situations. In particular speech synthe- has developed a high-speed highly accurate sis is actually used in public transportation and speech recognition system[1] using Weighted disaster-prevention announcements. When in- Finite State Transducer (WFST) as a search formation that people want to receive in a spo- model, as well as acoustic and language mod- ken form is given as text, speech synthesis els, to browse the results of checking speech technology is used to create speech informa- against these models. The speech recognition tion from that text. system is used in a spoken dialog system and a To synthesize speech from text, the speech speech translation system. We have a Japanese synthesis technology is comprised of a text dictionary of 650,000 words and can process a analysis unit for parsing the text with syntactic six-word speech within 1 RTF (Real Time and semantic information, and a synthesis en- Factor). gine for determining how intonation and Speech that we receive in various environ- rhythm are used to produce sounds from the ments may contain non-speech sounds. To words and phrases in the text. The text analy- perform speech recognition, we therefore need sis unit extracts the word and phrase informa- not only the above-mentioned speech-to-text tion and the synthesis engine uses the acoustic transform technology but also technology to model for speech synthesis. At present, Spoken recognize the non-speech sounds as noise and Language Communication Laboratory uses an a voice activity detection technology to extract HMM speech synthesis method to implement the speech section that includes the target speech synthesis supporting not only Japanese speech. In addition, microphones that receive but also English, Chinese, Korean, Indonesian, speeches are not always ideal for speech rec- Vietnamese, and Malay[2] . Since speech syn- ognition. A familiar example is applications thesis acoustic models cover different lan- on mobile terminals such as smart phones. To guages and have different voice qualities and process various kinds of noises and speech, speech styles, the speech style and voice quali- such applications use noise reduction process- ty can be changed by switching the models. ing and build a speech recognition model (in With a focus on this fact, a voice selector was particular an acoustic model) of higher noise developed. It produces speech in a voice simi- resistance from noise-contained sound data lar to the original speaker’s by selecting a before making speech recognition. model that has similar voice characteristic to A familiar application of speech recogni- the speaker’s in speech translation and other tion is the speech translation system and spo- processes. The naturalness of synthesized 12 Journal of the National Institute of Information and Communications Technology Vol. 59 Nos. 3/4 2012 JJM-3-1-下版-20121126-KASHIOKA.inddM-3-1-下版-20121126-KASHIOKA.indd 1122 113/01/113/01/11 116:246:24 speech is also enhanced by improving the filter can be acknowledged using speech expres- used for the development of the model. sions, intonations and other speech informa- The speech synthesis system is necessary tion. Spoken Language Communication to build a speech translation system and a Laboratory built a tourist information speech speech dialog system. Since it is used in con- dialog corpus and developed a Dialog versations with people, synthesized speech Management System [3] with WFST to with enriched naturalness is required. Speech achieve rapid and accurate speech language synthesis acoustic models are built from a understanding. speech corpus of the same person. However, The dialog system technology is used di- research suggests that the model’s naturalness rectly for building a spoken dialog system. is higher when the corpus is made from con- Unlike QA-systems, users can keep a dialogue versations than when it is made from reading with this system and obtain appropriate infor- text. Since building the speech synthesis mation. It can also be applied to not only the acoustic models costs a lot, the automatic dialog system but also a mechanism that un- building of models is another important issue. derstands and predicts the context. This is an important technology to realize appropriate 2.3 Dialog system technologies context-based processing in various speech To maintain a spoken dialog and make ap- communication technologies. propriate questions and answers, it is neces- sary to perceive the situation and environment 3 Conclusions of the dialog and understand the dialog itself. In speech communications, not only the dia- We have given an overview of the speech log’s content but also the speech information recognition, speech synthesis, and dialog sys- may provide information related to the situa- tem technologies, which all constitute spoken tion and environment. Information can also be language communication technology. The obtained by considering a series of speeches. Spoken Language Communication Laboratory, Dialog system technology is a technology for Universal Communication Research Institute, managing dialogs comprehensively, under- NICT, has been promoting research and devel- standing them, and predicting and producing opment with a focus on the problems and fu- the next speech in various occasions. ture of these elemental technologies. It not Dialog system technologies can be only performs research and development into classified into speech understanding technolo- the individual elemental technologies but also gy for the understanding of speech, context combines them with those developed by other processing technology for producing a re- laboratories to realize an integrated system sponse according to the underlying dialog act that can be used in actual society. Specifically,
Recommended publications
  • Language Development Language Development
    Language Development rom their very first cries, human beings communicate with the world around them. Infants communicate through sounds (crying and cooing) and through body lan- guage (pointing and other gestures). However, sometime between 8 and 18 months Fof age, a major developmental milestone occurs when infants begin to use words to speak. Words are symbolic representations; that is, when a child says “table,” we understand that the word represents the object. Language can be defined as a system of symbols that is used to communicate. Although language is used to communicate with others, we may also talk to ourselves and use words in our thinking. The words we use can influence the way we think about and understand our experiences. After defining some basic aspects of language that we use throughout the chapter, we describe some of the theories that are used to explain the amazing process by which we Language9 A system of understand and produce language. We then look at the brain’s role in processing and pro- symbols that is used to ducing language. After a description of the stages of language development—from a baby’s communicate with others or first cries through the slang used by teenagers—we look at the topic of bilingualism. We in our thinking. examine how learning to speak more than one language affects a child’s language develop- ment and how our educational system is trying to accommodate the increasing number of bilingual children in the classroom. Finally, we end the chapter with information about disorders that can interfere with children’s language development.
    [Show full text]
  • Theories of Language Acquisitionq Susan Goldin-Meadow, University of Chicago, Departments of Psychology and Comparative Human Development, Chicago, IL, USA
    Theories of Language Acquisitionq Susan Goldin-Meadow, University of Chicago, Departments of Psychology and Comparative Human Development, Chicago, IL, USA © 2019 Elsevier Inc. All rights reserved. Theoretical Accounts of Language-Learning 1 Behaviorist Accounts 1 Nativist Accounts 2 Social/Cognitive Accounts 3 Connectionist Accounts 3 Constrained Learning 4 Constrained Invention 5 Is Language Innate? 7 Innateness Defined as Genetic Encoding 7 Innateness Defined as Developmental Resilience 7 Language Is Not a Unitary Phenomenon 8 References 8 The simplest technique to study the process of language-learning is to do nothing more than watch and listen as children talk. In the earliest studies, researcher parents made diaries of their own child’s utterances (e.g., Stern and Stern, 1907; Leopold, 1939–1949). The diarist’s goal was to write down all of the new utterances that the child produced. Diary studies were later replaced by audio and video samples of talk from a number of children, usually over a period of years. The most famous of these modern studies is Roger Brown’s (1973) longitudinal recordings of Adam, Eve, and Sarah. Because transcribing and analyzing child talk is so labor-intensive, each individual language acquisition study typically focuses on a small number of children, often interacting with their primary caregiver at home. However, advances in computer technology have made it possible for researchers to share their transcripts of child talk via the computerized Child Language Data Exchange System (CHILDES, https://childes.talkbank.org). Because this system makes available many transcripts collected by different researchers, a single researcher can now call upon data collected from spontaneous interactions in naturally occurring situations across a wide range of languages, and thus test the robustness of descriptions based on a small sample.
    [Show full text]
  • What It Is and How to Teach It. Center for Applied Linguistics, W
    DOCUMENT RESUME ED 433 722 FL 025 979 AUTHOR Burkart, Grace Stovall TITLE Spoken Language: What It Is and How To Teach It. INSTITUTION Center for Applied Linguistics, Washington, DC. SPONS AGENCY Center for International Education (ED), Washington, DC. PUB DATE 1998-12-00 NOTE 33p.; In its: Out of Class Learning Experiences and Students' Perceptions of Their Impact on English Conversation Skills; see FL 025 966. CONTRACT PO17A50050 -95A PUB TYPE Guides Classroom Teacher (052) EDRS PRICE MF01/PCO2 Plus Postage. DESCRIPTORS Class Activities; Classroom Techniques; *College Instruction; Communicative Competence (Languages); Curriculum Design; Discourse Analysis; Graduate Students; Higher Education; *Interpersonal Communication; *Language Patterns; Language Teachers; *Oral Language; *Second Language Instruction; Skill Development; *Speech Skills; Teacher Education; Teaching Assistants IDENTIFIERS Turn Taking ABSTRACT An instructional module designed to help prepare college-level teaching assistants (TAs) for their duties in second language instruction is presented. The module focuses on the teaching of oral language skills in a beginning or intermediate language course. The first section examines features of spoken language, including the range of uses of spoken language, the concepts of communicative competence and communicative efficiency, distinctions between transactional and interactional talk, formulas in spoken language, and typical spoken language episodes. The second section describes specific approaches and classroom techniques for building language learners' speaking skills, including teaching spoken language for communication, finding a balance between control and spontaneity, outlining a curriculum, using class activities to bridge the gap between language knowledge and language use, and activities teaching both. Questions for classroom discussion are dispersed throughout the text. Contains 30 references.
    [Show full text]
  • Characteristics of Spoken and Written Communication in the Opening and Closing Sections of Instant Messaging
    Portland State University PDXScholar Dissertations and Theses Dissertations and Theses Fall 1-23-2014 Characteristics of Spoken and Written Communication in the Opening and Closing Sections of Instant Messaging Kenta Nishimaki Portland State University Follow this and additional works at: https://pdxscholar.library.pdx.edu/open_access_etds Part of the Interpersonal and Small Group Communication Commons, and the Other Communication Commons Let us know how access to this document benefits ou.y Recommended Citation Nishimaki, Kenta, "Characteristics of Spoken and Written Communication in the Opening and Closing Sections of Instant Messaging" (2014). Dissertations and Theses. Paper 1548. https://doi.org/10.15760/etd.1547 This Thesis is brought to you for free and open access. It has been accepted for inclusion in Dissertations and Theses by an authorized administrator of PDXScholar. Please contact us if we can make this document more accessible: [email protected]. Characteristics of Spoken and Written Communication in the Opening and Closing Sections of Instant Messaging by Kenta Nishimaki A thesis submitted in partial fulfillment of the Requirements for the degree of Master of Arts in Japanese Thesis Committee: Suwako Watanabe, Chair Patricia J. Wetzel Emiko Konomi Portland State University 2013 ABSTRACT This study examines opening and closing segments in instant messaging (IM) and demonstrates how openings and closings differ between oral conversation and instant messaging as well as the factors that account for the difference. Many researchers have discussed the differences and similarities between spoken and written languages. Tannen (1980) claims that spoken and written languages are not distinct categories and there is a continuum between them.
    [Show full text]
  • How Infant Speech Perception Contributes to Language Acquisition Judit Gervain* and Janet F
    Language and Linguistics Compass 2/6 (2008): 1149–1170, 10.1111/j.1749-818x.2008.00089.x How Infant Speech Perception Contributes to Language Acquisition Judit Gervain* and Janet F. Werker University of British Columbia Abstract Perceiving the acoustic signal as a sequence of meaningful linguistic representations is a challenging task, which infants seem to accomplish effortlessly, despite the fact that they do not have a fully developed knowledge of language. The present article takes an integrative approach to infant speech perception, emphasizing how young learners’ perception of speech helps them acquire abstract structural properties of language. We introduce what is known about infants’ perception of language at birth. Then, we will discuss how perception develops during the first 2 years of life and describe some general perceptual mechanisms whose importance for speech perception and language acquisition has recently been established. To conclude, we discuss the implications of these empirical findings for language acquisition. 1. Introduction As part of our everyday life, we routinely interact with children and adults, women and men, as well as speakers using a dialect different from our own. We might talk to them face-to-face or on the phone, in a quiet room or on a busy street. Although the speech signal we receive in these situations can be physically very different (e.g., men have a lower-pitched voice than women or children), we usually have little difficulty understanding what our interlocutors say. Yet, this is no easy task, because the mapping from the acoustic signal to the sounds or words of a language is not straightforward.
    [Show full text]
  • 1 the Origins of Language
    Cambridge University Press 978-1-108-49945-3 — The Study of Language George Yule Excerpt More Information 1 The Origins of Language The first person to set foot on the continent of Australia was a woman named Warramurrungunji. She emerged from the sea onto an island off northern Australia, and then headed inland, creating children and putting each one in a specific place. As she moved across the landscape, Warramurrungunji told each child, “I am putting you here. This is the language you should talk! This is your language!” Erard (2016) This origin story from the Iwaidja people of Australia, illustrated in the painting above, offers an explanation of not only where language came from, but also why there are so many different languages. Among the English-speaking people, there have been multiple attempts to provide a comparable explanation, but not much proof to support any of them. Instead of a belief in a single mythical earth mother, we have a variety of possible beliefs, all fairly speculative. We simply don’t have a definitive answer to the question of how language originated. We do know that the ability to produce sound and simple vocal patterning (a hum versus a grunt, for example) appears to be in an ancient part of the brain that we share with all vertebrates, including fish, frogs, birds and other mammals. But that isn’t human language. We suspect that some type of spoken language must have developed between 100,000 and 50,000 years ago, well before written language (about 5,000 years ago).
    [Show full text]
  • American Sign Language
    U.S. DEPARTMENT OF HEALTH AND HUMAN SERVICES ∙ National Institutes of Health NIDCD Fact Sheet | Hearing and Balance American Sign Language What is American Sign Language? American Sign Language (ASL) is a complete, natural language that has the same linguistic properties as spoken languages, with grammar that differs from English. ASL is expressed by movements of the hands and face. It is the primary language of many North Americans who are deaf and hard of hearing, and is used by many hearing people as well. Is sign language the same in other countries? There is no universal sign language. Different sign languages are used in different countries or regions. For example, British Sign Language (BSL) is a different A young boy signs “I love you.” language from ASL, and Americans who know ASL may not understand BSL. Some countries adopt features of ASL in their sign languages. LSF are distinct languages. While they still contain some Where did ASL originate? similar signs, they can no longer be understood by each other’s users. No person or committee invented ASL. The exact beginnings of ASL are not clear, but some suggest that it How does ASL compare with spoken arose more than 200 years ago from the intermixing of language? local sign languages and French Sign Language (LSF, or Langue des Signes Française). Today’s ASL includes some ASL is a language completely separate and distinct elements of LSF plus the original local sign languages; from English. It contains all the fundamental features over time, these have melded and changed into a rich, of language, with its own rules for pronunciation, word complex, and mature language.
    [Show full text]
  • Practical Introduction to Listening and Spoken Language (Lsl) Helix Conference November 2019 Session #1: 1:00-3:00 Pm Michael Boston, Ma, Eipa Cert
    PRACTICAL INTRODUCTION TO LISTENING AND SPOKEN LANGUAGE (LSL) HELIX CONFERENCE NOVEMBER 2019 SESSION #1: 1:00-3:00 PM MICHAEL BOSTON, MA, EIPA CERT CONSTANCE MCGROGAN SLP/L LSLS CERT. AVED ROLE CALL • TEACHERS OF THE DEAF AND HARD-OF-HEARING • SPEECH -LANGUAGE PATHOLOGISTS • AUDIOLOGISTS • REGULAR TEACHERS • SPECIAL EDUCATION TEACHERS • PARENTS • SUPERVISORS, PSYCHOLOGISTS, ADMINISTRATION OF SPECIAL EDUCATION • OTHERS HOW TO SURVIVE OUR SESSION CREATE NOTECARDS WITH QUESTIONS WHAT DO YOU RECALL? CONNECTIONS ARE DEVELOPED IN A CHILD’S BRAIN THROUGH EACH AND EVERY INTERACTION 100 billion neurons 1 trillion Web pages 300 trillion links 100 trillion links Quad trillion links developed quad billion connections DISCLAIMER ORAL METHOD -- TODAY’S FOCUS MANUAL METHOD BLENDED APPROACH **PARENTS CHOICE DRIVES THIS AGENDA. PART 1 MORNING • Introduction to Listening and Spoken language What Why Who • Principles for Listening and Spoken Language • Strategies • The Auditory Learning Guide (ALG) • Certification process for AVT and AVEd AGENDA PART 2 AFTERNOON • Identify and label strategies with roleplay demonstration and sabotaging to teach skills. • Match description to auditory strategy • Develop an age appropriate activity with and appropriate strategy for a given Case Study • Evaluate application of strategies • Troubleshoot and discuss atypical scenarios Get creative! INTRODUCTION TO LSL LSLS – THE IMPACT • (Video) THE AUDITORY – VERBAL APPROACH • In Auditory –verbal therapy the parent/caretaker is the client and is present at all sessions. • Audition is integrated into the child’s lifestyle. • A Developmental rather than a remedial approach is emphasized. • Therapy is diagnostic. DEVELOPMENTAL APPROACH IN VOCABULARY • Example: Normal Expressive Vocabulary development • 1 yr. – first word • 18 mths – 75 words • 2 yrs.
    [Show full text]
  • The Neat Summary of Linguistics
    The Neat Summary of Linguistics Table of Contents Page I Language in perspective 3 1 Introduction 3 2 On the origins of language 4 3 Characterising language 4 4 Structural notions in linguistics 4 4.1 Talking about language and linguistic data 6 5 The grammatical core 6 6 Linguistic levels 6 7 Areas of linguistics 7 II The levels of linguistics 8 1 Phonetics and phonology 8 1.1 Syllable structure 10 1.2 American phonetic transcription 10 1.3 Alphabets and sound systems 12 2 Morphology 13 3 Lexicology 13 4 Syntax 14 4.1 Phrase structure grammar 15 4.2 Deep and surface structure 15 4.3 Transformations 16 4.4 The standard theory 16 5 Semantics 17 6 Pragmatics 18 III Areas and applications 20 1 Sociolinguistics 20 2 Variety studies 20 3 Corpus linguistics 21 4 Language and gender 21 Raymond Hickey The Neat Summary of Linguistics Page 2 of 40 5 Language acquisition 22 6 Language and the brain 23 7 Contrastive linguistics 23 8 Anthropological linguistics 24 IV Language change 25 1 Linguistic schools and language change 26 2 Language contact and language change 26 3 Language typology 27 V Linguistic theory 28 VI Review of linguistics 28 1 Basic distinctions and definitions 28 2 Linguistic levels 29 3 Areas of linguistics 31 VII A brief chronology of English 33 1 External history 33 1.1 The Germanic languages 33 1.2 The settlement of Britain 34 1.3 Chronological summary 36 2 Internal history 37 2.1 Periods in the development of English 37 2.2 Old English 37 2.3 Middle English 38 2.4 Early Modern English 40 Raymond Hickey The Neat Summary of Linguistics Page 3 of 40 I Language in perspective 1 Introduction The goal of linguistics is to provide valid analyses of language structure.
    [Show full text]
  • Linguistic Science and Its Classroom Reflections
    LINGUISTIC SCIENCE AND ITS CLASSROOM REFLECTIONS Mary Jane M. Norris University of Michigan This paper is an attempt to bring together the various as- sumptions, statements, findings, or other contributions gener - ally included within the scope of linguistic science that have a bearing on language teaching, and to show to some extent how they are reflected in the language classroom. These elements of linguistic science can be divided into the following five points: 1. The realization of the nature of language. a. Language is vocal. b. Language symbols are arbitrary. c. Language has system. d. Language is for communication. e. Language is made up of habits. f. There is a relation between a language and the cul- ture in which it is used. g. Languages change. h. No two languages have the same set of patterns of pronunciation, words, and syntax.' 2. The realization (assumption) that patterns of one's native language interfere with learning later the patterns of an- other language. 3. Methods of analyzing and describing languages. 4. Descriptions of some languages. 5. Techniques for comparison of two languages. These points with their reflections, or potential reflections, in the classroom follow. 1Redundancy might be added as a feature of language, a concept which has come to the fore through the work of Claude L. Shannon and W. Weaver, The hdatlieniatical Theory of Co))rtuunicatioii,Urb;lna: Uni - versity of Illinois Press, 1949. Waldo E. Sweet in "The Carrot on the Stick," editorial, Lai2,Suage Leamiig, 9. 1-2, 1959, mentions a use of redundancy, as fill-the-space exercises.
    [Show full text]
  • The Origin of Speech”
    Review of Peter MacNeilage’s “The Origin of Speech” “The Origin of Speech” is a book that reflects Peter MacNeilage’s ideas about the nature and the origin of speech and language. It is at times an exposition of MacNeilage’s theory of the origin of speech, at times an overview of behavioral and neurological organization underlying speech and at times a rather strong critique of mainstream phonology. It presents an unorthodox view of speech, but one that in my opinion meshes better with biology than most other existing theories of speech. MacNeilage’s attempt to work out a detailed biological perspective on speech make this book worthwhile, even though both the book and MacNeilage’s theory have their share of weaknesses. The book’s theme is the biological evolution of speech. Its aim is to present a plausible scenario of the evolution of the first complex utterances. It focuses on Peter MacNeilage’s frame/content theory of the origin of speech, but it also contains a large number of other observations that the author finds relevant for the evolution of speech. Among these are the presentation of the postural origin theory of hemispherical specialization, a comparison of the structure of signed language and spoken language, and a thorough deconstruction of generative phonology. MacNeilage’s (and Davis', 2000) frame/content theory proposes that speech is based on two types of movement. One type is the cyclical motion of the jaw. This results in repeated opening and closing of the mouth. When combined with vocal fold vibration, this produces syllable-like utterances, where the closed part of the cycle sounds like a consonant and the open part sounds like a vowel.
    [Show full text]
  • An Integrated Theory of Language Production and Comprehension
    BEHAVIORAL AND BRAIN SCIENCES (2013) 36, 329–392 doi:10.1017/S0140525X12001495 An integrated theory of language production and comprehension Martin J. Pickering Department of Psychology, University of Edinburgh, Edinburgh EH8 9JZ, United Kingdom [email protected] http://www.ppls.ed.ac.uk/people/martin-pickering Simon Garrod University of Glasgow, Institute of Neuroscience and Psychology, Glasgow G12 8QT, United Kingdom [email protected] http://staff.psy.gla.ac.uk/∼simon/ Abstract: Currently, production and comprehension are regarded as quite distinct in accounts of language processing. In rejecting this dichotomy, we instead assert that producing and understanding are interwoven, and that this interweaving is what enables people to predict themselves and each other. We start by noting that production and comprehension are forms of action and action perception. We then consider the evidence for interweaving in action, action perception, and joint action, and explain such evidence in terms of prediction. Specifically, we assume that actors construct forward models of their actions before they execute those actions, and that perceivers of others’ actions covertly imitate those actions, then construct forward models of those actions. We use these accounts of action, action perception, and joint action to develop accounts of production, comprehension, and interactive language. Importantly, they incorporate well-defined levels of linguistic representation (such as semantics, syntax, and phonology). We show (a) how speakers and comprehenders use covert imitation and forward modeling to make predictions at these levels of representation, (b) how they interweave production and comprehension processes, and (c) how they use these predictions to monitor the upcoming utterances.
    [Show full text]