
The AAAI 2017 Spring Symposium on Computational Construction Grammar and Natural Language Understanding: Technical Report SS-17-02

Cognitive Modeling Approaches to Language Comprehension Using Construction Grammar

Peter Lindes and John E. Laird
University of Michigan
{plindes, laird}@umich.edu

Abstract

This paper examines the relationship between modeling human sentence comprehension using cognitive architectures and approaches to linguistic knowledge representation using construction grammars. We review multiple computational models of language understanding that vary in their use of construction grammar and cognitive architectures. We present a case study, Lucia, that uses Embodied Construction Grammar (ECG) within the Soar cognitive architecture to comprehend language used to instruct an embodied agent. We also examine the tradeoffs between alternative approaches to representing and accessing linguistic knowledge within a cognitive architecture and suggest future research.

Introduction

How do human beings comprehend natural language? Many disciplines within cognitive science have contributed knowledge to help answer this question. Psychologists and psycholinguists have gathered empirical data on human eye movements, reaction times, reading times, priming effects, and so on related to language comprehension. Cognitive linguists have developed theories of image schemas, metaphor, lexical semantics, and construction grammar to gain insight into the structure of language. Research in cognitive architecture has provided insight on the computational building blocks underlying language processing.

We examine language comprehension using construction grammars embedded within cognitive models – models that attempt to model the human comprehension process, and more specifically, models implemented within a cognitive architecture. Cognitive architectures define the fixed structures and functions underlying cognition, including the representation, storage, and access of knowledge in both short-term and long-term memories, decision making, and interfaces to perception and action. Cognitive architectures embody the hypothesis articulated by Allen Newell (1990) that the same computational structures are used across all human mental activity, including language processing. We consider models of language comprehension in both the Soar and ACT-R cognitive architectures.

Within the context of cognitive architectures, the question of how humans comprehend natural language leads us to the following questions: How is knowledge about language represented? How is that knowledge encoded in and retrieved from the different memories of a cognitive architecture? How is knowledge used to dynamically create actionable meaning representations from input sentences rife with ambiguities and ungrammaticalities?

In this paper, we explore seven different sentence processing systems. These systems vary along many dimensions: how they structure their knowledge; whether they attempt to model human behavior; whether they focus only on syntactic processing or also include semantic processing that creates structures that are used for producing action; which memories they use to represent the different types of knowledge used in language processing; and how they process that knowledge, especially to deal with ambiguity.

Models of Linguistic Knowledge

Fundamental to models of sentence processing is the representation and structure of linguistic knowledge. Construction grammars (Hoffman and Trousdale 2013) are a set of theories as to how linguistic knowledge is structured and represented. These theories posit the idea of a construction, which is a unit of lexical or grammatical knowledge that pairs a form with a corresponding meaning. Many variations of construction grammar formalisms have been developed (Goldberg 2013). The two frameworks that are most relevant here are Embodied Construction Grammar (ECG; Bergen and Chang 2013) and Fluid Construction Grammar (FCG; Steels 2013).
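To make this form-meaning pairing concrete, here is a minimal illustrative sketch of our own (in Python; it is not drawn from ECG, FCG, or any other formalism discussed here, and all names are invented):

    # Illustrative only: a construction pairs a form, here an ordered list
    # of constituents, with a meaning schema whose roles are bound to them.

    from dataclasses import dataclass, field

    @dataclass
    class Construction:
        name: str
        form: list                    # constituent types, in order
        meaning: str                  # schema evoked by this construction
        bindings: dict = field(default_factory=dict)   # role -> constituent

    # A lexical construction pairs specific word forms with a schema ...
    pick_up = Construction(name="PickUp", form=["PICK", "UP"],
                           meaning="PickUpAction")

    # ... while a grammatical construction pairs a pattern of constituents
    # with a schema and binds the schema's roles to those constituents.
    transitive = Construction(name="TransitiveCommand",
                              form=["Verb", "RefExpr"], meaning="Command",
                              bindings={"action": "Verb", "theme": "RefExpr"})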


Embodied Construction Grammar (ECG)

ECG (Eppe et al. 2016; Feldman, Dodge, and Bryant 2009) is rooted in semantic representations based on theories of image schemas. It relates linguistic forms to these schemas, which can represent perception and mental models, and also to action schemas, which describe physical actions that can be performed by an agent, such that the parameters of an action can be filled in by information from the language input. Thus the grammatical representation is closely tied to the perception and action of an embodied agent.

Bryant (2008) created a parser using ECG as its grammar definition language and a probabilistic, left-corner parsing algorithm that searches for a best-fit representation of a sentence using the given grammar. This parser has been used in a number of applications of ECG (Chang 2008; Eppe et al. 2016). Each sentence produces a semantic representation called a semspec. Recent work uses this parser to provide a sentence comprehension capability to robots.

Fluid Construction Grammar (FCG)

FCG (Steels and Hild 2012) is a construction grammar formalism that has been used with robots and for experiments in learning linguistic knowledge by an agent or group of agents. It has the feature that the same constructions can be used for both comprehension and language production.

The comprehension processor for FCG “supports standard heuristic best-first search” (Steels, De Beule, and Wellens 2012). One of the major thrusts of FCG is that the parser has a number of facilities to enhance its robustness for missing or incomplete items (Steels and van Trijp 2011). It can coerce items to change their properties, and learning processes can deal with unknown words and meanings.

Analysis of Construction Grammar

A construction grammar provides a formalization for representing the connections between syntactic and semantic knowledge. Both lexical and grammatical constructions can include semantic information, which is a key point in construction grammar models of language (Goldberg 1995). As Jurafsky (1996) points out, construction grammars have an advantage over traditional context-free grammars in that an instance of a construction is an actual data structure, not just a label, and can thus hold semantic and grammatical feature data. Thus, constructions provide a declarative map from form to meaning. The cognitive models we discuss below provide a dynamic process to construct meanings from input utterances one word at a time.

Modeling Human Processing

Both ECG and FCG provide theories of how linguistic knowledge is represented and how input sentences are parsed to create semantic representations. Both have been applied to embodied robotic agents. However, they are not embodied in a cognitive architecture, nor do they attempt to model human comprehension processes.

In contrast, there have been multiple parsers developed in cognitive architectures, specifically within ACT-R (Anderson et al. 2004) and Soar (Laird 2012; Newell 1990). Although ACT-R and Soar differ in many details, at an abstract level of analysis they share the same overall structure and processing cycle. They both have a basic processing cycle that is controlled by knowledge retrieved from procedural memory. This knowledge is encoded as rules that test the current situation in order to decide what action, either internal or external, should be performed. The current situation is encoded as symbolic structures in working memory (which are buffers in ACT-R). The contents of working memory come from perception and retrievals from long-term memories, which include the procedural memory as well as long-term declarative memories. The internal actions add, remove, or modify structures in working memory, or retrieve information from declarative memories. The external actions initiate motor actions. All of these processes are modulated by metadata that the architecture maintains about its memories and processes. For example, the retrieval of information from declarative memory can be biased using the frequency and recency of prior accesses of long-term memory structures (base-level activation), as well as associations between memory structures (spreading activation).
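As a rough illustration of this shared cycle, consider the following minimal sketch of our own (not code from either architecture; the rule and chunk structures are invented):

    # Procedural rules test working memory and propose actions; one
    # proposal is selected and applied; retrievals from declarative
    # memory are biased by base-level and spreading activation.

    def activation(chunk, wm):
        base_level = chunk.get("base_level", 0.0)   # recency and frequency
        spread = sum(1.0 for item in wm if item in chunk.get("assoc", []))
        return base_level + spread                  # spreading activation

    def retrieve(cue, declarative_memory, wm):
        matches = [c for c in declarative_memory if cue(c)]
        return max(matches, key=lambda c: activation(c, wm), default=None)

    def decision_cycle(wm, rules, declarative_memory):
        proposals = [r for r in rules if r["test"](wm)]
        chosen = proposals[0]   # stand-in for real conflict resolution
        chosen["action"](wm, declarative_memory)

    # One cycle: a rule retrieves a lexical item for the word "up"; the
    # particle sense wins because "pick" in working memory spreads to it.
    dm = [{"word": "up", "sense": "particle", "base_level": 0.5, "assoc": ["pick"]},
          {"word": "up", "sense": "direction", "base_level": 0.5, "assoc": []}]
    wm = ["pick"]
    rules = [{"test": lambda wm: "pick" in wm,
              "action": lambda wm, dm: wm.append(
                  retrieve(lambda c: c["word"] == "up", dm, wm))}]
    decision_cycle(wm, rules, dm)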
Because of the context-dependent retrieval of knowledge from procedural and declarative long-term memories, ACT-R and Soar do not have the same predetermined sequential execution structure that is ubiquitous in standard programming languages. Instead, each decision is determined by the current situation – the active goals, perception, and internal reasoning state of the system.

Soar and ACT-R have similar procedural learning mechanisms that incrementally create new productions based on the co-occurrence of production firings. Soar’s mechanism, called chunking, creates productions that summarize the processing in subgoals, while ACT-R’s mechanism, called production composition, composes two productions that fire in sequence into a single production.

In Soar, the processing consists of a series of decision cycles where a single operator is selected and applied. Any complex behavior, such as sentence processing, balancing a checkbook, or writing a symposium paper, is composed of sequences of primitive operators. When mapping to human behavior, Newell (1990) estimates that each decision cycle takes ~50 ms. Thus, since adult humans can read at a rate of around 240 words/minute, with speech processing being somewhat slower, language processing should average ~4-5 decision cycles per word: 240 words/minute is 250 ms per word, or about five 50 ms cycles. This provides a tight constraint on models of sentence processing. We call timing measurements based on the number of decision cycles, assumed to take ~50 ms in humans, simulated real time measurements.

The need to keep up with the continual stream of new words leads the sentence processing systems we describe below to perform as much processing as possible on each word, including lexical access and syntactic and semantic processing. Although there may be additional processing at clause and sentence boundaries, there is insufficient time for extensive processing of the complete sentence, so it is critical that as much syntactic and semantic processing as possible is done as individual words arrive. This leads to incremental, left-to-right, word-at-a-time processing, where a data structure representing both the semantic and the syntactic structure of the input sentence is built up. In order to produce actionable meanings, and to assist in resolving ambiguities, semantic structures are grounded to the agent’s perception and action capabilities as soon as possible (Just and Carpenter 1987). When mistakes are made, as they inevitably are in ambiguous sentences, the system attempts to make a local repair (Lewis 1993), or if that fails, it just keeps processing and extracts whatever meaning it can. It is possible to attempt a complete reparse during reading, by starting over at the beginning of a clause or sentence.

Although the cognitive systems we discuss have many similarities (many later systems were inspired by earlier systems), they vary in how linguistic knowledge is structured, whether semantic processing is done, how linguistic knowledge is distributed over long-term memories, whether there are explicit predictions of future structures, and how ambiguities are handled.

NL-Soar

NL-Soar (Lehman, Lewis, and Newell 1991; Newell 1990) was the first attempt to apply a cognitive architecture to sentence processing. NL-Soar did integrated, incremental processing based on comprehension operators, each of which maps form to meaning for a single input word. Using the Soar features of impasses, subgoals, and chunking, these operators compiled different types of knowledge that initially required deliberative processing into reactive processing that modeled adult performance. In NL-Soar, all temporary parsing data was stored in Soar’s working memory.

The dynamic process of parsing in NL-Soar follows a single path rather than maintaining several alternative options. Expectations can be generated at each word so that processing for later words can combine both bottom-up and top-down knowledge. When an earlier choice later turns out to be incorrect, limited local repair can correct it. Lewis (1993) showed that this model can run in simulated real time and exhibits limitations similar to humans in dealing with garden path sentences and difficult center embeddings.

Grammatical knowledge in NL-Soar was stored in hand-written production rules in procedural memory. Early versions used a dependency grammar representation (Lehman, Lewis, and Newell 1991), while Lewis (1993) changed this to use X-bar theory and Government and Binding. During processing, both a syntactic and a semantic model of an utterance were built.

Rubinoff and Lehman (1994) report on a version of NL-Soar that included language production as well as comprehension, and that interleaved this production with other task processing. They describe two systems built where NL-Soar was used to provide language comprehension and production: NTD-Soar simulated a NASA test director, and TacAir-Soar simulated the pilot of a military aircraft.

Lonsdale (2011) created a variation of NL-Soar, called XNL-Soar, that was designed to use the Minimalist Program theory of language. This system built semantic representations, but without any grounding in an external environment. A version of XNL-Soar processed Japanese (instead of English) with only a few changes to the grammar other than replacing the lexical items. The XNL-Soar system used an order of magnitude fewer production rules than NL-Soar because it stored much of its lexical and semantic knowledge in Soar’s declarative semantic memory.
Lewis’s ACT-R Model

Lewis and Vasishth (2005) developed a sentence processing model using the ACT-R cognitive architecture. This model uses many concepts for word-by-word processing derived from NL-Soar, but its internal processing differs in how it accesses intermediate linguistic structures as they are constructed. Lewis, Vasishth, and Van Dyke (2006) discuss the architectural principles underlying this model.

Central to the model is that ACT-R has a very limited working memory, with only a few items being available in buffers. Therefore, declarative memory is used in this model to store both lexical knowledge and the intermediate results of parsing. Knowledge to match grammatical constructions is encoded in production rules that are built by hand. The division of knowledge between declarative and procedural memories in this model is based in part on cognitive neuroscience models (Ullman 2004). Lexical knowledge is in declarative memory, while all grammatical knowledge is stored procedurally.

The parsing process in this model is based on cue-based parsing theory (Van Dyke and Lewis 2003). As words are processed, the feature bundles representing the words and expectations for larger grammatical structures are stored in declarative memory. Later words can then encode cues that are used to search this memory for the previous expectations. For example, in the sentence Melissa knew that the toy from her uncle in Bogotá arrived today, when the verb arrived is processed, a cue is created that searches declarative memory for the expected sentence structure built with the toy as the subject but not yet having a verb. When this structure is found, the verb is merged in. This model focuses on syntax and does not produce a semantic representation. Its linguistic knowledge is, like later versions of NL-Soar, based on X-bar theory and Government and Binding.

This model takes advantage of the activation mechanisms in ACT-R to give precise predictions of processing times that are then compared to empirical measurements of human processing times. Lewis and Vasishth (2005) explain in detail several experiments with their ACT-R model and compare them to human data.
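The following minimal sketch (ours, not the actual model) illustrates the flavor of cue-based retrieval using this example sentence:

    # Partial structures are stored in declarative memory as feature
    # bundles, and later words build cues that retrieve them for merging.

    declarative_memory = []

    def store(bundle):
        declarative_memory.append(bundle)

    def retrieve(cue):
        # Return a bundle matching every feature in the cue. In the real
        # model, decay, similarity, and interference make this fallible.
        for bundle in declarative_memory:
            if all(bundle.get(k) == v for k, v in cue.items()):
                return bundle
        return None

    # While reading "Melissa knew that the toy ... arrived today", an
    # expectation is stored for a clause with "the toy" as its subject
    # and no verb yet ...
    store({"cat": "S", "subject": "the toy", "verb": None})
    # ... and when "arrived" is processed, a cue retrieves and completes it.
    clause = retrieve({"cat": "S", "verb": None})
    clause["verb"] = "arrived"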

Ball’s ACT-R Model

Ball et al. (2010), as part of the Synthetic Teammate Project, developed a language processing model implemented in ACT-R that attempts “adherence to well established cognitive constraints.” It is based on Ball’s “Double R Theory of Language Comprehension” (Ball 2004). This theory incorporates models of grammar and the comprehension process.

The grammar model represents both “relational and referential meaning” (hence “Double R”) in a form that encodes both structure and meaning, as do construction grammars. Although the grammatical notation allows for representing syntax and semantics together, it is loosely based on X-bar theory and does not provide the strong connection between non-lexical items and their meanings that construction grammars typically do.

This model avoids the constraints on short-term memory in ACT-R by adding additional buffers to store intermediate results of parsing. Thus, it avoids using cue-based retrieval as in Lewis’s ACT-R model, simplifying the processing, but also straying from modeling human parsing. It uses a bottom-up parsing strategy. Grammar knowledge is stored in declarative memory, and at each step, a production rule creates a cue to retrieve relevant knowledge from that memory. Spreading activation is used to select the item to be retrieved when several items match the cue.

The Rosie Parser

Laird developed a parser in Soar that is embedded within the Rosie robotics system (Kirk, Mininger, and Laird 2016). This semantic parser is based on ideas from NL-Soar and construction grammar. It produces actionable meaning representations that allow the agent to be instructed and directed for a variety of tasks. Lexical and grammatical knowledge are stored in Soar’s long-term declarative (semantic) memory. The incremental processing retrieves lexical items from semantic memory, as well as grammatical constructions based on previously retrieved lexical items and derived structures. Grammatical constructions include both semantic mappings from syntactic structures and expectations of future syntactic structures. The word-by-word processing is done with language-independent procedural knowledge. This includes merging expectations with new structures, building up semantic structures based on constructions, and grounding referents in language to the agent’s perceptual world model and previously mentioned referents. Soar’s chunking mechanism dynamically converts the language processing into rules that move semantic knowledge into procedural memory, leading to better simulated real time performance.
Lucia

Lucia (Lindes and Laird 2016) is a language comprehension system also built within Rosie. In addition to being built on the Soar cognitive architecture, it directly uses ECG to define its linguistic knowledge. An ECG to Soar translator converts a grammar for the Rosie domain, written in the ECG formalism defined by Bryant (2008), into Soar production rules. Lucia then uses these rules to produce the same meanings needed by the Rosie agent as produced by the Rosie parser.

This model goes to the extreme of having all its linguistic knowledge, lexical, grammatical, and semantic, stored in procedural memory. Even though, as in all these systems, this knowledge is hand-written instead of learned, the task of engineering the grammar is easier since it can be written in the ECG language. Additional hand-written production rules to implement grounding, repair, etc. are included, but these are independent of the grammatical knowledge. Almost two thirds of the production rules are generated automatically by the ECG to Soar translator.

Lucia correctly parses all 98 sentences selected from the Rosie corpus attempted so far, as well as a Spanish translation of 50 of these sentences. Parsing proceeds mostly bottom-up, driven by the ECG constructions. Top-down processing is possible for more complex grammars, and is implemented by creating expectations, similar to what is done in the Rosie parser, NL-Soar, and Lewis’s ACT-R parser. In the case of ambiguity, Lucia considers and selects among multiple alternatives, and it does destructive local repair in a way similar to the other incremental models.

The current version of Lucia processes its 98 sentences, using 184 ECG constructions, at a simulated average rate of 132 words/minute. This is within a factor of two of human processing speed, and likely can be improved when we utilize chunking.
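The following sketch (ours, with invented names; it is not the actual translator) illustrates the idea of compiling a declarative construction definition into a procedural matching rule:

    # The generated rule matches the construction's constituents on top
    # of the parse stack and replaces them with a new instance.

    def compile_construction(name, constituents, meaning):
        def rule(stack):
            n = len(constituents)
            if [node["type"] for node in stack[-n:]] == constituents:
                children = stack[-n:]
                del stack[-n:]
                stack.append({"type": name, "meaning": meaning,
                              "children": children})
                return True
            return False
        return rule

    # The translated grammar is then a set of such generated rules.
    pick_up = compile_construction("PickUp", ["PICK", "UP"], "PickUpAction")
    stack = [{"type": "PICK"}, {"type": "UP"}]
    pick_up(stack)   # stack now holds a single PickUp instance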

A Case Study: Lucia

Given this convergence of both cognitive modeling and construction grammars in the Lucia model of comprehension, we present details of it below. Additional details are available in Lindes and Laird (2016).

A Simple Example

Consider a simplified description of the processing of the simple sentence in (1).

(1) Pick up the green sphere.

The dynamic comprehension process processes one word at a time using a Soar operator we call comprehend-word. This operator enters a substate in which a lexical-access operator always fires first. This operator puts one or more lexical items on the current state, one for each sense of the current word. For (1), after the word up we have an UP item on the top of the stack and a PICK underneath it (upper case names are lexical items). Next a match-construction operator is selected, removing these two items from the stack and replacing them with an instance of the composite PickUp construction.

The next three words, THE, GREEN, and SPHERE, are pushed onto the stack, and then two different match-construction operators fire like this:

    After        Match             Match
    sphere.      SpecPropNoun      TransitiveCommand
    SPHERE       SpecPropNoun[     TransitiveCommand[
    GREEN          THE               PickUp[..]
    THE            GREEN             SpecPropNoun[..]
    PickUp[        SPHERE ]        ]
      PICK       PickUp[..]
      UP ]

The first matches SpecPropNoun, which is a subcase of RefExpr (short for referring expression), and the second combines the verb and its object into a TransitiveCommand, which can be interpreted to tell the agent to perform this action. As each construction is instantiated, its corresponding meaning structures are built. These trigger grounding operators that look up the schema for a PickUp action and search the world model to find the object which matches the green sphere. All this information can be used to help resolve ambiguities.
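A minimal sketch of this comprehend-word cycle (our own simplification; the toy grammar matches specific lexical items, where real constructions match constituent types and build meaning structures as they go):

    # Lexical access pushes a lexical item for the new word, then
    # match-construction steps fire repeatedly, replacing items on top
    # of the stack with composite construction instances.

    GRAMMAR = [("PickUp", ["PICK", "UP"]),
               ("SpecPropNoun", ["THE", "GREEN", "SPHERE"]),
               ("TransitiveCommand", ["PickUp", "SpecPropNoun"])]
    LEXICON = {"pick": "PICK", "up": "UP", "the": "THE",
               "green": "GREEN", "sphere": "SPHERE"}

    def try_match(stack):
        for name, pattern in GRAMMAR:
            n = len(pattern)
            if [node["type"] for node in stack[-n:]] == pattern:
                children = stack[-n:]
                del stack[-n:]
                stack.append({"type": name, "children": children})
                return True
        return False

    def comprehend_word(word, stack):
        stack.append({"type": LEXICON[word], "children": []})  # lexical access
        while try_match(stack):                # match-construction operators
            pass

    stack = []   # top of stack is the end of the list
    for word in "pick up the green sphere".split():
        comprehend_word(word, stack)
    # stack now holds a single TransitiveCommand, as in the trace above.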

An Example of Attachment and Repair

Now consider a more complicated case where prepositional phrases come into the picture, as in (2).

(2) a. Put the green block on the stove.
    b. Pick up the green block on the stove.

In (2a) the phrase on the stove should modify the verb put, giving its target location. In (2b), however, the phrase should modify the green block to specify which block to pick up. The key to making this choice is knowledge about whether the verb needs a target location or not.

When Lucia processes one of these sentences, a TransitiveCommand construction is built up for everything up through the word block. After on the stove a PrepPhrase construction is built. At this point an attach-prep-phrase operator examines the context to decide where the phrase should be attached. For (2a) this gives:

    After                    Match
    stove.                   ImperativeWithLocation
    PrepPhrase[..]           ImperativeWithLocation[
    TransitiveCommand[..]      TransitiveCommand[..]
                               PrepPhrase[..] ]

For (2b) the verb does not allow an attached target. The attach operator analyzes this case and performs a snip operation, which deletes the TransitiveCommand from the stack and replaces it with its constituents, like this:

    After                    After
    stove.                   attach-prep-phrase
    PrepPhrase[..]           PrepPhrase[..]
    TransitiveCommand[       SpecPropNoun[..]
      PickUp[..]             PickUp[..]
      SpecPropNoun[..] ]

This allows a RefExprPrepPhrase construction, another subcase of RefExpr, to be matched, so that after the snip we match two more constructions to complete the sentence:

    Match                    Match
    RefExprPrepPhrase        TransitiveCommand
    RefExprPrepPhrase[       TransitiveCommand[
      PrepPhrase[..]           PickUp[..]
      SpecPropNoun[..] ]       RefExprPrepPhrase[..] ]
    PickUp[..]

Once the prepositional phrase is attached to the referring expression, the grounding of that expression is done over again, taking into account the new information.
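A minimal sketch of the snip operation just described (ours, with simplified structures and invented names):

    # When a phrase cannot attach to the construction below it, the
    # offending composite is deleted from the stack and replaced by its
    # constituents so that an alternative construction can match.

    def snip(stack, index):
        node = stack.pop(index)
        for child in reversed(node["children"]):
            stack.insert(index, child)

    # After "Pick up the green block on the stove." the stack holds a
    # PrepPhrase above a TransitiveCommand (top of stack is the end):
    stack = [{"type": "TransitiveCommand",
              "children": [{"type": "PickUp", "children": []},
                           {"type": "SpecPropNoun", "children": []}]},
             {"type": "PrepPhrase", "children": []}]
    # "pick up" takes no target location, so attachment fails and the
    # TransitiveCommand is snipped back into its constituents:
    snip(stack, 0)
    # stack is now PickUp, SpecPropNoun, PrepPhrase, so a
    # RefExprPrepPhrase can match SpecPropNoun plus PrepPhrase.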

Tanenhaus et al. (1995) performed a human experiment using the sentences in (3).

(3) a. Put the apple on the towel in the box.
    b. Put the apple that is on the towel in the box.

The visual scene has two towels, one with an apple and one without, an open box, and a distractor. With the ambiguity at towel in (3a), human subjects focus on the second towel, then correct this at box. For (3b) this incorrect fixation is not made. When Lucia parses sentences like these, it makes an incorrect attachment of the first prepositional phrase in (3a), then corrects this with a repair. For a sentence like (3b), the incorrect attachment is never made and no repair is needed. This suggests that the local repair process used by Lucia, NL-Soar, and the Rosie parser corresponds to something similar in human processing.

Essential Processes

The Lucia model accomplishes comprehension by a unique algorithm, rooted in the Soar architecture, which dynamically interleaves the several processes needed to produce its results in simulated real time. These processes can be divided into two broad groups: those directly required by the grammatical knowledge, and those needed to integrate with various levels of context.

The grammar processes begin with recognition, which finds a match between the current input state and what’s on the stack and then constructs an instance of the matched construction. Then evocation instantiates the meaning pole of that construction. Generalization elaborates these structures with information inherited from constructions and/or schemas in the ECG grammar’s subcase hierarchy. Unification of the meaning structures connects different instances to each other according to constraints given in the grammar.

Context effects begin with selection among multiple alternatives that are suggested by recognition. Grounding attaches meaning structures to objects in the agent’s world model or ontology of properties, relations, and actions. A prepositional phrase causes attachment to choose where to attach that phrase. A repair process detects incorrect structures, deletes them, and allows new correct ones to be built.

A key element of the Lucia comprehension theory is that these processes are not sequential phases but rather are interleaved dynamically as needed in small atomic increments. For example, when a construction is recognized, its meaning is immediately evoked and then grounded, so the resulting knowledge is available for subsequent selection or attachment decisions. Thus the order in which operations are performed is not fixed but determined dynamically using the properties of the Soar architecture.

Dealing with Ambiguity

Many situations in natural language are ambiguous in a variety of different ways. Typical parsers try to resolve ambiguities by searching a space of possible parses to find the best fit (e.g. Bryant 2008). This approach does not work in an incremental, real-time comprehension system. Lucia uses its selection, attachment, and repair processes to deal with many ambiguous situations. This approach works for many cases and seems promising within the context of a cognitively plausible, embodied comprehension algorithm.

We have seen in (2) how attachment and repair can work for prepositional phrase attachment. A kind of lexical selection can be seen in the sentences in (4).

(4) a. Turn left.
    b. Turn on the stove.

In (4a) we simply have the verb turn, which can be combined with a direction argument to define an action to be performed. Sentence (4b), however, has a completely different sense of turn, possibly motivated by turning a knob on a stove to change its state. Here we treat turn on as a single grammaticalized lexical item, using a lexical construction in ECG whose orthography is set to the whole phrase turn on.

In (5) two very different senses of the word that are used: as a deictic pronoun and as a relative pronoun.

(5) a. Put that in the pantry.
    b. Pick up the green block that is small.

In (5a) we have a simple example of lexical selection to choose the deictic meaning of that, which will be grounded to whatever object is currently being pointed to in the scene. But in (5b) we use all of selection, attachment, and repair for relative that, similar to what we saw with (2b). A sketch of this kind of sense selection is given at the end of this section.

Sentence (6) shows a more complicated example of lexical ambiguity, with two senses of the word square.

(6) Put the square in the square box.

This example can be processed using all three processes of selection, attachment, and repair.
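A minimal sketch of lexical selection among word senses (ours; the selection heuristic is invented for illustration):

    # Lexical access proposes every sense of a word, and local context
    # on the stack selects one.

    LEXICON = {"that": ["DEICTIC-PRONOUN", "RELATIVE-PRONOUN"]}

    def select_sense(word, stack):
        senses = LEXICON[word]
        # After a referring expression ("the green block that ..."),
        # prefer the relative pronoun; otherwise ("Put that ...") the
        # deictic reading is selected and later grounded by pointing.
        if stack and stack[-1] == "RefExpr":
            return senses[1]
        return senses[0]

    select_sense("that", [])            # -> "DEICTIC-PRONOUN", as in (5a)
    select_sense("that", ["RefExpr"])   # -> "RELATIVE-PRONOUN", as in (5b)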
Exploring the Architectural Space

Table 1 summarizes some of the main characteristics of the seven systems we have examined.

Table 1: Summary of model characteristics

    System    Cognitive      Construction   Incremental   Linguistic
              architecture   grammar        processing    knowledge
    ECG       -              ECG            -             -
    FCG       -              FCG            -             -
    NL-Soar   Soar           -              Yes           Procedural
    Lewis     ACT-R          -              Yes           Mixed
    Ball      ACT-R          -              Yes           Declarative
    Rosie     Soar           Ad hoc         Yes           Declarative
    Lucia     Soar           ECG            Yes           Procedural

To build a cognitive model of language comprehension, many different approaches toward using a cognitive architecture are possible. One key dimension relates to linguistic knowledge: how it is represented, where it is stored, and how it is retrieved. Lucia provides an initial example of using ECG as a representation for linguistic knowledge within a cognitive architecture and as a potential model for human processing. Within the cognitive architecture systems, there is a split between representing linguistic knowledge in either procedural memory (rules) or declarative memory.

If the grammar is represented in declarative rather than procedural memory, how does this affect an incremental parser? Initially a declarative representation is more flexible, because there is not a single method for accessing the knowledge (matching rule conditions), but it is less efficient, because every construction has to be deliberately retrieved from declarative memory and then instantiated before it can be used. However, if comprehension operators are chunked as in NL-Soar, the agent will, with experience, convert this declarative knowledge into efficient procedural knowledge. Nevertheless, having constructions in declarative memory may make it much easier for future work on acquiring linguistic knowledge from experience. Lewis’s ACT-R model stores all lexical knowledge in declarative memory, but still uses procedural memory for all grammatical knowledge.

In Lucia, we directly compile the grammar from a declarative ECG representation into production rules, similar to the full procedural knowledge built by chunking in NL-Soar. This has helped us experiment with whether ECG is a good representation for grammar in a cognitive model. We are furthering this research by also experimenting with translating ECG into a declarative representation in Soar’s semantic memory instead of into production rules. Our hypothesis is that spreading activation will allow construction and lexical retrieval to be dependent on the context as defined by the contents of working memory. Ball’s ACT-R parser uses a single-depth spread, but in Soar we are able to use much deeper spreads, potentially using more indirect associations to influence retrieval in ambiguous cases. Initially these retrievals will slow the comprehension process, but chunking has the potential to convert the declarative knowledge into procedural knowledge and restore real-time comprehension.
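A minimal sketch of such a deeper spread (ours; the associations and decay scheme are invented for illustration):

    # Activation spreads from the contents of working memory through
    # several links, so indirect associations also bias which
    # construction is retrieved from semantic memory.

    def spread_activation(sources, assoc, depth=3, decay=0.5):
        activation = {node: 1.0 for node in sources}
        frontier = dict(activation)
        for _ in range(depth):   # deeper than a single-link spread
            nxt = {}
            for node, a in frontier.items():
                for neighbor in assoc.get(node, []):
                    nxt[neighbor] = nxt.get(neighbor, 0.0) + a * decay
            for node, a in nxt.items():
                activation[node] = activation.get(node, 0.0) + a
            frontier = nxt
        return activation

    assoc = {"PICK": ["PickUp"], "UP": ["PickUp"],
             "PickUp": ["TransitiveCommand"]}
    act = spread_activation(["PICK", "UP"], assoc)
    # PickUp accumulates activation from both words, and the indirectly
    # associated TransitiveCommand receives some as well, two links away.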
Another architectural issue is connecting new information into the developing intermediate result of parsing. NL-Soar, the Rosie parser, and Lucia all use explicit data structures in working memory and production rules to match items that need to be connected. Lewis’s ACT-R model uses a different method called cue-based retrieval. When each word is processed, expectations are created and stored in declarative memory, which acts as both a short-term and long-term memory. Later words (and derived structures) are used as cues to retrieve a previous expectation that is then combined with the new structure. Lewis and Vasishth (2005) show that this approach produces behavior that matches human timing data fairly well. In Soar, however, short-term results are held in working memory, and working memory has no cue-based retrieval mechanism. Given the success of cue-based retrieval, it is worth considering whether some modifications to Soar are needed to accommodate it.

Both Bryant (2008) and Jurafsky (1996) annotate constructions in their grammars with probabilities as a way of resolving ambiguities. These probabilities are derived from off-line statistical processing of a corpus. Although statistics certainly play a role in human language processing, the cognitive models we have discussed do not use probabilistic grammars. Instead they use local context, both situational and linguistic, to resolve ambiguities. When declarative memories are used, this is done by biasing retrievals from long-term memory. Embedded within that retrieval process is the use of activation that incorporates recency and frequency (base-level activation), and historic co-occurrence of both lexical and grammatical constructions (spreading activation). We are exploring how to structure the parsing processes and long-term memory to best take advantage of both base-level and spreading activation.

None of these models deals with the problem of acquiring new linguistic knowledge. However, there is much psychological research in this area (e.g. Tomasello 2003; Krashen 2003). Research on acquisition has also been done in both the ECG (Chang 2008; Mok 2008) and FCG (Garcia-Casademont and Steels 2015, 2016) communities. Starting with constructions represented in declarative memory will probably be important in extending this research into cognitive models of comprehension. New constructions can be added and existing ones modified in this memory, with chunking making it efficient in real time.

Conclusions

In this paper we have briefly reviewed research showing that models based on the general capabilities of a modern cognitive architecture can be used to comprehend natural language, that this comprehension can take place in simulated real time, and that it can be grounded in the perception and action capabilities of an embodied agent. We have also explored using Embodied Construction Grammar (ECG) for representing the linguistic knowledge in a cognitive model of comprehension, demonstrating that useful comprehension can be achieved, including resolving some kinds of ambiguity. This system uses ECG in an incremental processing model that does not use holistic optimization over a whole utterance or statistical knowledge gathered off line from a large corpus.

Considering the current state of the art, many questions for future research emerge. What is the best way to apply the different capabilities of a cognitive architecture to the language comprehension problem? What does human data tell us about an architecture and how to apply it? Are there enhancements to an architecture that will allow it to better model human comprehension? Are there extensions to construction grammar formalisms and their implementation in cognitive models that will help expand the coverage of these models to a wider range of natural language with all its ambiguities and ungrammaticalities? How well can we achieve effective communication between humans and embodied agents using these models? Finally, how can these models acquire their linguistic knowledge from their experience? Many challenges lie ahead.

Acknowledgments

The work described here was supported by the Air Force Office of Scientific Research under Grant Number FA9550-15-1-0157. The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the AFOSR or the U.S. Government.

References

Anderson, J. R.; Byrne, M. D.; Douglass, S.; Lebiere, C.; and Qin, Y. 2004. An Integrated Theory of the Mind. Psychological Review 111: 1036-1060.

Ball, J. T. 2004. The Double R Theory of Language Comprehension. United States Air Force Research Laboratory report AFRL-HE-AZ-TR-2003-0110.

Ball, J.; Freiman, M.; Rodgers, S.; and Myers, C. 2010. Toward a Functional Model of Human Language Processing. Presented as a poster at the 32nd Annual Conference of the Cognitive Science Society, Portland, OR.

Bergen, B. and Chang, N. 2013. Embodied Construction Grammar. In Thomas Hoffman and Graeme Trousdale, eds., The Oxford Handbook of Construction Grammar, 168-190. New York: Oxford University Press.

Bryant, J. E. 2008. Best-Fit Constructional Analysis. Ph.D. diss., Computer Science, University of California, Berkeley, CA.

Chang, N. C. 2008. Constructing Grammar: A Computational Model of the Emergence of Early Constructions. Ph.D. diss., Computer Science, University of California, Berkeley, CA.

Eppe, M.; Trott, S.; Raghuram, V.; Feldman, J.; and Janin, A. 2016. Application-Independent and Integration-Friendly Natural Language Understanding. In GCAI 2016, 2nd Global Conference on Artificial Intelligence. EasyChair Publications.

Feldman, J.; Dodge, E.; and Bryant, J. 2009. Embodied Construction Grammar. In Bernd Heine and Heiko Narrog, eds., The Oxford Handbook of Linguistic Analysis. Oxford Handbooks Online.

Garcia-Casademont, E. and Steels, L. 2015. Usage-based Grammar Learning as Insight Problem Solving. CEUR Workshop Proceedings 1419: 258-263.

Garcia-Casademont, E. and Steels, L. 2016. Insight Grammar Learning. Journal of Cognitive Science 17(1): 27-62.

Goldberg, A. E. 1995. Constructions: A Construction Grammar Approach to Argument Structure. Chicago: The University of Chicago Press.

Goldberg, A. E. 2013. Constructionist Approaches. In Thomas Hoffman and Graeme Trousdale, eds., The Oxford Handbook of Construction Grammar, 15-31. New York: Oxford University Press.

Hoffman, T. and Trousdale, G., eds. 2013. The Oxford Handbook of Construction Grammar. New York: Oxford University Press.

Jurafsky, D. 1996. A Probabilistic Model of Lexical and Syntactic Access and Disambiguation. Cognitive Science 20: 137-194.

Just, M. A. and Carpenter, P. A. 1987. The Psychology of Reading and Language Comprehension. Boston, MA: Allyn and Bacon.

Kirk, J.; Mininger, A.; and Laird, J. 2016. Learning Task Goals Interactively with Visual Demonstrations. In 2016 Annual International Conference on Biologically Inspired Cognitive Architectures, New York, NY.

Krashen, S. D. 2003. Explorations in Language Acquisition and Use: The Taipei Lectures. Portsmouth, NH: Heinemann.

Laird, J. E. 2012. The Soar Cognitive Architecture. Cambridge, MA: The MIT Press.

Lehman, J. F.; Lewis, R. L.; and Newell, A. 1991. Integrating Knowledge Sources in Language Comprehension. In The Soar Papers, Vol. II. Cambridge, MA: MIT Press.

Lewis, R. L. 1993. An Architecturally-based Theory of Human Sentence Comprehension. Ph.D. diss., Computer Science, Carnegie Mellon University, Pittsburgh, PA.

Lewis, R. L. and Vasishth, S. 2005. An Activation-Based Model of Sentence Processing as Skilled Memory Retrieval. Cognitive Science 29: 375-419.

Lewis, R. L.; Vasishth, S.; and Van Dyke, J. A. 2006. Computational Principles of Working Memory in Sentence Comprehension. Trends in Cognitive Sciences 10(10): 447-454.

Lindes, P. and Laird, J. E. 2016. Toward Integrating Cognitive Linguistics and Cognitive Language Processing. In Proceedings of the 14th International Conference on Cognitive Modeling (ICCM 2016). University Park, PA: Penn State.
Lonsdale, D.; McGhee, J.; Glenn, N.; and Anderson, T. 2011. The XNL-Soar Sandbox. Presentation at the 31st Soar Workshop, June 13-17, 2011, University of Michigan.

Mok, E. H. 2008. Contextual Bootstrapping for Grammar Learning. Ph.D. diss., Computer Science, University of California, Berkeley, CA.

Newell, A. 1990. Unified Theories of Cognition. Cambridge, MA: Harvard University Press.

Rubinoff, R. and Lehman, J. F. 1994. Real-time Natural Language Generation in NL-Soar. In 7th International Generation Workshop, Kennebunkport, ME.

Steels, L. 2013. Fluid Construction Grammar. In Thomas Hoffman and Graeme Trousdale, eds., The Oxford Handbook of Construction Grammar, 153-167. New York: Oxford University Press.

Steels, L.; De Beule, J.; and Wellens, P. 2012. Fluid Construction Grammar in Real Robots. In Steels, L. and Hild, M., eds., Language Grounding in Robots. Springer.

Steels, L. and Hild, M., eds. 2012. Language Grounding in Robots. Springer.


Steels, L. and van Trijp, R. 2011. How to Make Construction Grammars Fluid and Robust. In Luc Steels, ed., Design Patterns in Fluid Construction Grammar, 301-330. Amsterdam: John Benjamins.

Tanenhaus, M. K.; Spivey-Knowlton, M. J.; Eberhard, K. M.; and Sedivy, J. C. 1995. Integration of Visual and Linguistic Information in Spoken Language Comprehension. Science 268: 1632-1634.

Tomasello, M. 2003. Constructing a Language: A Usage-Based Theory of Language Acquisition. Cambridge, MA: Harvard University Press.

Ullman, M. T. 2004. Contributions of Memory Circuits to Language: The Declarative/Procedural Model. Cognition 92: 231-270.

Van Dyke, J. A. and Lewis, R. L. 2003. Distinguishing Effects of Structure and Decay on Attachment and Repair: A Cue-based Parsing Account of Recovery from Misanalyzed Ambiguities. Journal of Memory and Language 49: 285-316.
