Human Sentence Processing: Recurrence Or Attention?
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
A Distributed Computational Cognitive Model for Object Recognition
SCIENCE CHINA xx xxxx Vol. xx No. x: 1{15 doi: A Distributed Computational Cognitive Model for Object Recognition Yong-Jin Liu1∗, Qiu-Fang Fu2, Ye Liu2 & Xiaolan Fu2 1Tsinghua National Laboratory for Information Science and Technology, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China, 2State Key Lab of Brain and Cognitive Science, Institute of Psychology, Chinese Academy of Sciences, Beijing 100101, China Received Month Day, Year; accepted Month Day, Year Abstract Based on cognitive functionalities in human vision processing, we propose a computational cognitive model for object recognition with detailed algorithmic descriptions. The contribution of this paper is of two folds. Firstly, we present a systematic review on psychological and neurophysiological studies, which provide collective evidence for a distributed representation of 3D objects in the human brain. Secondly, we present a computational model which simulates the distributed mechanism of object vision pathway. Experimental results show that the presented computational cognitive model outperforms five representative 3D object recognition algorithms in computer science research. Keywords distributed cognition, computational model, object recognition, human vision system 1 Introduction Object recognition is one of fundamental tasks in computer vision. Many recognition algorithms have been proposed in computer science area (e.g., [1, 2, 3, 4, 5]). While the CPU processing speed can now be reached at 109 Hz, the human brain has a limited speed of 100 Hz at which the neurons process their input. However, compared to state-of-the-art computational algorithms for object recognition, human brain has a distinct advantage in object recognition, i.e., human being can accurately recognize one from unlimited variety of objects within a fraction of a second, even if the object is partially occluded or contaminated by noises. -
The ERP Technique in Children's Sentence Processing: a Review a Técnica De ERP No Processamento De Sentenças De Crianças: U
Revista de Estudos da Linguagem, Belo Horizonte, v.25, n.3, p.1537-1566, 2017 The ERP technique in children’s sentence processing: a review A técnica de ERP no processamento de sentenças de crianças: uma revisão Marília Uchôa Cavalcanti Lott de Moraes Costa Universidade Federal do Rio de Janeiro, Rio de Janeiro, Rio de Janeiro / Brasil [email protected] Resumo: A medição da ativação cerebral por meio da técnica de potenciais relacionados a eventos (ERP) tem sido valiosa para lançar luz sobre diversas cognições humanas. A linguagem é uma das cognições que têm sido estudadas com essa técnica de grande resolução temporal entre o estímulo apresentado e a ativação observada decorrente desse estímulo. A área de aquisição da linguagem tem se beneficiado especialmente dessa técnica, dado que é possível investigar relações entre dados linguísticos e a ativação cerebral sem a necessidade de uma resposta explícita, como apertar um botão ou apontar para uma imagem. O objetivo deste artigo é apresentar o estado da arte sobre o processamento de frases em crianças utilizando a técnica de ERP. Palavras-chave: ERP/EEG; processamento de frases; aquisição da linguagem; N400; P600. Abstract: The measurement of brain activation through the technique of event related potentials (ERP) has been valuable in shedding light on various human cognitions. Language is one of those cognitions that has been studied using this technique, which allows for a more accurate temporal resolution between the stimulus presented and the time in which we observe an activation resulting from this stimulus. The area of eISSN: 2237-2083 DOI: 10.17851/2237-2083.25.3.1537-1566 1538 Revista de Estudos da Linguagem, Belo Horizonte, v.25, n.3, p.1537-1566, 2017 language acquisition has especially benefited from this technique since it is possible to investigate relationships between linguistic data and brain activation without the need for an explicit response. -
The Place of Modeling in Cognitive Science
The place of modeling in cognitive science James L. McClelland Department of Psychology and Center for Mind, Brain, and Computation Stanford University James L. McClelland Department of Psychology Stanford University Stanford, CA 94305 650-736-4278 (v) / 650-725-5699 (f) [email protected] Running Head: Modeling in cognitive science Keywords: Modeling frameworks, computer simulation, connectionist models, Bayesian approaches, dynamical systems, symbolic models of cognition, hybrid models, cognitive architectures Abstract I consider the role of cognitive modeling in cognitive science. Modeling, and the computers that enable it, are central to the field, but the role of modeling is often misunderstood. Models are not intended to capture fully the processes they attempt to elucidate. Rather, they are explorations of ideas about the nature of cognitive processes. As explorations, simplification is essential – it is only through simplification that we can fully understand the implications of the ideas. This is not to say that simplification has no downsides; it does, and these are discussed. I then consider several contemporary frameworks for cognitive modeling, stressing the idea that each framework is useful in its own particular ways. Increases in computer power (by a factor of about 4 million) since 1958 have enabled new modeling paradigms to emerge, but these also depend on new ways of thinking. Will new paradigms emerge again with the next 1,000-fold increase? 1. Introduction With the inauguration of a new journal for cognitive science, thirty years after the first meeting of the Cognitive Science Society, it seems essential to consider the role of computational modeling in our discipline. -
A Bayesian Computational Cognitive Model ∗
A Bayesian Computational Cognitive Model ∗ Zhiwei Shi Hong Hu Zhongzhi Shi Key Laboratory of Intelligent Information Processing Institute of Computing Technology, CAS, China Kexueyuan Nanlu #6, Beijing, China 100080 {shizw, huhong, shizz}@ics.ict.ac.cn Abstract problem solving capability and provide meaningful insights into mechanisms of human intelligence. Computational cognitive modeling has recently Besides the two categories of models, recently, there emerged as one of the hottest issues in the AI area. emerges a novel approach, hybrid neural-symbolic system, Both symbolic approaches and connectionist ap- which concerns the use of problem-specific symbolic proaches present their merits and demerits. Al- knowledge within the neurocomputing paradigm [d'Avila though Bayesian method is suggested to incorporate Garcez et al., 2002]. This new approach shows that both advantages of the two kinds of approaches above, modal logics and temporal logics can be effectively repre- there is no feasible Bayesian computational model sented in artificial neural networks [d'Avila Garcez and concerning the entire cognitive process by now. In Lamb, 2004]. this paper, we propose a variation of traditional Different from the hybrid approach above, we adopt Bayesian network, namely Globally Connected and Bayesian network [Pearl, 1988] to integrate the merits of Locally Autonomic Bayesian Network (GCLABN), rule-based systems and neural networks, due to its virtues as to formally describe a plausible cognitive model. follows. On the one hand, by encoding concepts -
Connectionist Models of Language Processing
Cognitive Studies Preprint 2003, Vol. 10, No. 1, 10–28 Connectionist Models of Language Processing Douglas L. T. Rohde David C. Plaut Massachusetts Institute of Technology Carnegie Mellon University Traditional approaches to language processing have been based on explicit, discrete represen- tations which are difficult to learn from a reasonable linguistic environment—hence, it has come to be accepted that much of our linguistic representations and knowledge is innate. With its focus on learning based upon graded, malleable, distributed representations, connectionist modeling has reopened the question of what could be learned from the environment in the ab- sence of detailed innate knowledge. This paper provides an overview of connectionist models of language processing, at both the lexical and sentence levels. Although connectionist models have been applied to the full and processes of actual language users: Language is as lan- range of perceptual, cognitive, and motor domains (see Mc- guage does. In this regard, errors in performance (e.g., “slips Clelland, Rumelhart, & PDP Research Group, 1986; Quin- of the tongue”; Dell, Schwartz, Martin, Saffran, & Gagnon, lan, 1991; McLeod, Plunkett, & Rolls, 1998), it is in their 1997) are no less valid than skilled language use as a measure application to language that they have evoked the most in- of the underlying nature of language processing. The goal is terest and controversy (e.g., Pinker & Mehler, 1988). This not to abstract away from performance but to articulate com- is perhaps not surprising in light of the special role that lan- putational principles that account for it. guage plays in human cognition and culture. -
The Potentials for Basic Sentence Processing: Differentiating
The Potentials for Basic Sentence 20 Processing: Differentiating Integrative Processes Marta Kutas and Jonathan W. King ABSTRACT We show that analyzing voltage fluctuations known as "event-related brain potentials," or ERPs, recorded from the human scalp can be an effective way of tracking integrative processes in language on-line. This is essential if we are to choose among alternative psychological accounts of language comprehension. We briefly review the data implicating the N400 as an index of semantic integration and describe its use in psycholinguistic research. We then introduce a cognitive neuroscience approach to normal sentence processing, which capitalizes on the ERP's fine temporal resolution as well as its potential linkage to both psychological constructs and activated brain areas. We conclude by describing several reliable ERP effects with different temporal courses, spatial extents, and hypothesized relations to comprehension skill during the reading of simple transitive sentences; these include (1) occipital potentials related to fairly low-level, early visual processing, (2) very slow frontal positive shifts related to high-level integration during construction of a mental model, and (3) various frontotemporal potentials associated with thematic role assignment, clause endings, and manipulating items that are in working memories. In it comes out it goes and in between nobody knows how flashes of vision and snippets of sound get bound to meaning. From percepts to concepts seemingly effortless integration of highly segregated -
Cognitive Psychology for Deep Neural Networks: a Shape Bias Case Study
Cognitive Psychology for Deep Neural Networks: A Shape Bias Case Study Samuel Ritter * 1 David G.T. Barrett * 1 Adam Santoro 1 Matt M. Botvinick 1 Abstract to think of these models as black boxes, and to question whether they can be understood at all (Bornstein, 2016; Deep neural networks (DNNs) have advanced Lipton, 2016). This opacity obstructs both basic research performance on a wide range of complex tasks, seeking to improve these models, and applications of these rapidly outpacing our understanding of the na- models to real world problems (Caruana et al., 2015). ture of their solutions. While past work sought to advance our understanding of these models, Recent pushes have aimed to better understand DNNs: none has made use of the rich history of problem tailor-made loss functions and architectures produce more descriptions, theories, and experimental methods interpretable features (Higgins et al., 2016; Raposo et al., developed by cognitive psychologists to study 2017) while output-behavior analyses unveil previously the human mind. To explore the potential value opaque operations of these networks (Karpathy et al., of these tools, we chose a well-established analy- 2015). Parallel to this work, neuroscience-inspired meth- sis from developmental psychology that explains ods such as activation visualization (Li et al., 2015), abla- how children learn word labels for objects, and tion analysis (Zeiler & Fergus, 2014) and activation maxi- applied that analysis to DNNs. Using datasets mization (Yosinski et al., 2015) have also been applied. of stimuli inspired by the original cognitive psy- Altogether, this line of research developed a set of promis- chology experiments, we find that state-of-the-art ing tools for understanding DNNs, each paper producing one shot learning models trained on ImageNet a glimmer of insight. -
Psych150/ Ling155 Spring 2015 Review Questions: Speech Perception, Word Recognition, Sentence Processing, and Speech Production
Psych150/ Ling155 Spring 2015 Review Questions: Speech perception, word recognition, sentence processing, and speech production Terms/concepts to know: mondegreen, lack of invariance problem, Motor Theory of Speech Perception, McGurk effect, Ganong effect, priming, lexical decision, facilitation, inhibition, neighborhood density, homophone, homograph, Cohort model, TRACE model, Dual Route model, graphemes, developmental dyslexia, acquired dyslexia, parsing, incrementality, garden path sentence, global ambiguity, garden path theory, constraint-based approach, thematic relation, intransitive, transitive, ditransitive, sentential complement, slip of the tongue, tip of the tongue state, serial model, cascaded model, bidirectional cascaded model 1. The lack of invariance problem led to the development of the Motor Theory of Speech Perception (MToSP). What is the lack of invariance problem? How did the MToSP propose to solve that problem? 2. How does the McGurk effect differ from the Ganong effect? 3. Can priming occur without conscious awareness of the prime? What paradigm do researchers use to assess this possibility? 4. How does a lexical decision task work? Can lexical decision be used for both visual (written) and auditory (spoken) experiments? What is the dependent measure in lexical decision tasks? 5. Priming can cause facilitation or inhibition. How would these processes be reflected in reaction times? 6. How do homophones and homographs differ? 7. According to the Cohort model of auditory word recognition, what happens as the beginning of a word is detected? 8. What does the TRACE model predict about possibility of “speaker” and “beaker” being activated at the same time during word recognition? Why? 9. Writing systems differ in terms of the primary level of linguistic representations that they tap into, i.e., that symbols refer to (phonemes, syllables, morphemes). -
Toward Collaboration Between the Fields of Signal and Sentence Processing
Cortical Tracking of Speech: Toward Collaboration between the Fields of Signal and Sentence Processing Eleonora J. Beier1, Suphasiree Chantavarin1,2, Gwendolyn Rehrig1, Fernanda Ferreira1, and Lee M. Miller1 Abstract ■ In recent years, a growing number of studies have used corti- between the two fields could be mutually beneficial. Specifically, cal tracking methods to investigate auditory language processing. we argue that the use of cortical tracking methods may help re- Although most studies that employ cortical tracking stem from solve long-standing debates in the field of sentence processing that the field of auditory signal processing, this approach should also commonly used behavioral and neural measures (e.g., ERPs) have be of interest to psycholinguistics—particularly the subfield of failed to adjudicate. Similarly, signal processing researchers who sentence processing—given its potential to provide insight into usecorticaltrackingmaybeabletoreducenoiseintheneuraldata dynamic language comprehension processes. However, there and broaden the impact of their results by controlling for linguis- has been limited collaboration between these fields, which we tic features of their stimuli and by using simple comprehension suggest is partly because of differences in theoretical background tasks. Overall, we argue that a balance between the methodolog- and methodological constraints, some mutually exclusive. In this ical constraints of the two fields will lead to an overall improved paper, we first review the theories and methodological con- understanding of language processing as well as greater clarity on straints that have historically been prioritized in each field and what mechanisms cortical tracking of speech reflects. Increased provide concrete examples of how some of these constraints collaboration will help resolve debates in both fields and will lead may be reconciled. -
A Neurolinguistic Model of Grammatical Construction Processing
A Neurolinguistic Model of Grammatical Construction Processing Peter Ford Dominey1, Michel Hoen1, and Toshio Inui2 Downloaded from http://mitprc.silverchair.com/jocn/article-pdf/18/12/2088/1756017/jocn.2006.18.12.2088.pdf by guest on 18 May 2021 Abstract & One of the functions of everyday human language is to com- construction processing based on principles of psycholinguis- municate meaning. Thus, when one hears or reads the sen- tics, (2) develop a model of how these functions can be imple- tence, ‘‘John gave a book to Mary,’’ some aspect of an event mented in human neurophysiology, and then (3) demonstrate concerning the transfer of possession of a book from John to the feasibility of the resulting model in processing languages of Mary is (hopefully) transmitted. One theoretical approach to typologically diverse natures, that is, English, French, and Japa- language referred to as construction grammar emphasizes this nese. In this context, particular interest will be directed toward link between sentence structure and meaning in the form of the processing of novel compositional structure of relative grammatical constructions. The objective of the current re- phrases. The simulation results are discussed in the context of search is to (1) outline a functional description of grammatical recent neurophysiological studies of language processing. & INTRODUCTION ditransitive (e.g., Mary gave my mom a new recipe) One of the long-term quests of cognitive neuroscience constructions (Goldberg, 1995). has been to link functional aspects of language process- In this context, the ‘‘usage-based’’ perspective holds ing to its underlying neurophysiology, that is, to under- that the infant begins language acquisition by learning stand how neural mechanisms allow the mapping of the very simple constructions in a progressive development surface structure of a sentence onto a conceptual repre- of processing complexity, with a substantial amount of sentation of its meaning. -
Cognitive Modeling, Symbolic
Cognitive modeling, symbolic Lewis, R.L. (1999). Cognitive modeling, symbolic. In Wilson, R. and Keil, F. (eds.), The MIT Encyclopedia of the Cognitive Sciences. Cambridge, MA: MIT Press. Symbolic cognitive models are theories of human cognition that take the form of working computer programs. A cognitive model is intended to be an explanation of how some aspect of cognition is accomplished by a set of primitive computational processes. A model performs a specific cognitive task or class of tasks and produces behavior that constitutes a set of predictions that can be compared to data from human performance. Task domains that have received considerable attention include problem solving, language comprehension, memory tasks, and human-device interaction. The scientific questions cognitive modeling seeks to answer belong to cognitive psychology, and the computational techniques are often drawn from artificial intelligence. Cognitive modeling differs from other forms of theorizing in psychology in its focus on functionality and computational completeness. Cognitive modeling produces both a theory of human behavior on a task and a computational artifact that performs the task. The theoretical foundation of cognitive modeling is the idea that cognition is a kind of COMPUTATION (see also COMPUTATIONAL THEORY OF MIND). The claim is that what the mind does, in part, is perform cognitive tasks by computing. (The claim is not that the computer is a metaphor for the mind, or that the architectures of modern digital computers can give us insights into human mental architecture.) If this is the case, then it must be possible to explain cognition as a dynamic unfolding of computational processes. -
EEG Theta and Gamma Responses to Semantic Violations in Online Sentence Processing
Brain and Language 96 (2006) 90–105 www.elsevier.com/locate/b&l EEG theta and gamma responses to semantic violations in online sentence processing Lea A. Hald a,c, Marcel C.M. Bastiaansen a,b,¤, Peter Hagoort a,b a Max Planck Institute for Psycholinguistics, P.O. Box 310, 6500 AH Nijmegen, The Netherlands b F. C. Donders Centre for Cognitive Neuroimaging, Radbout Universiteit Nijmegen, P.O. Box 9101, 6500 HB Nijmegen, The Netherlands c Center for Research in Language, University of California, San Diego, 9500 Gilman Dr., Dept. 0526, La Jolla, CA 92093-0526, USA Accepted 18 June 2005 Available online 3 August 2005 Abstract We explore the nature of the oscillatory dynamics in the EEG of subjects reading sentences that contain a semantic violation. More speciWcally, we examine whether increases in theta (t3–7 Hz) and gamma (around 40 Hz) band power occur in response to sen- tences that were either semantically correct or contained a semantically incongruent word (semantic violation). ERP results indicated a classical N400 eVect. A wavelet-based time-frequency analysis revealed a theta band power increase during an interval of 300– 800 ms after critical word onset, at temporal electrodes bilaterally for both sentence conditions, and over midfrontal areas for the semantic violations only. In the gamma frequency band, a predominantly frontal power increase was observed during the processing of correct sentences. This eVect was absent following semantic violations. These results provide a characterization of the oscillatory brain dynamics, and notably of both theta and gamma oscillations, that occur during language comprehension. 2005 Elsevier Inc.