Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence

Semantic Representation

Lenhart Schubert University of Rochester Rochester, NY 14627-0226

Copyright © 2015, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.

Abstract

In recent years, there has been renewed interest in the NLP community in genuine understanding and dialogue. Thus the long-standing issue of how the semantic content of language should be represented is reentering the communal discussion. This paper provides a brief "opinionated survey" of broad-coverage semantic representation (SR). It suggests multiple desiderata for such representations, and then outlines more than a dozen approaches to SR – some long-standing, and some more recent – providing quick characterizations, pros, cons, and some comments on implementations.

Introduction

There are signs of a resurgence of interest in genuine language understanding within NLP and AI. These signs include various challenges that can be viewed as scaled-back versions of the Turing Test, in particular RTE (Recognizing Textual Entailment) (Dagan, Glicksman, and Magnini 2006), COPA (Choice of Plausible Alternatives) (Roemmele, Bejan, and Gordon 2011), WSC (Winograd Schema Challenge) (Levesque, Davis, and Morgenstern 2012), and the Aristo Challenge (Clark 2015), as well as a recent ACL workshop on semantic parsing (Artzi, Kwiatkowski, and Berant 2014), and applications such as natural language (NL) dialogue with robots (Eliasson 2007; Howard, Tellex, and Roy 2013) and semantics-guided MT (Jones et al. 2012). Thus the long-standing issue of how the semantic content of language should be represented is reentering the communal discussion. The focus here will be on broad-coverage semantic representation.

Desiderata

In a sense, for a representation to be meaningful to a computer, it need only have the right computational properties, given the tasks the computer is performing. But that is not enough. As builders of potentially very large, complex systems dependent on symbolic representations, we also have to be concerned with the naturalness of the representations from our perspective. It should be easy to see intuitively whether a putative semantic interpretation of a given sentence (in context) really does capture its semantic content. The following desiderata seem to me like natural ones, falling into both categories (and all of them have been considered explicitly or tacitly by various researchers at various times).

• Language-like expressivity of the semantic representation (SR): All NLs allow for the devices familiar from first-order logic: predicates + connectives + quantifiers + equality. But they allow more, including: generalized quantifiers (most men who smoke), intensional predicates (believe, intend, resemble), predicate and sentence modification (very, gracefully, nearly, possibly), predicate and sentence reification (Birds evolved from dinosaurs, Beauty is subjective, That exoplanets exist is now certain), and reference to events or situations (Many children had not been vaccinated against measles; this situation caused sporadic outbreaks of the disease).
High expressivity is sometimes claimed to be undesirable, on the grounds that it is an obstacle to efficient inference. That is a reasonable stance for designers of specialized systems that simply don't require high expressivity. But it is unreasonable on two counts for a general semantic representation (which is our focus here): One is that efficient inference methods for expressively limited subsets of an expressively rich representation are easily embedded within the richer representation (as specialized reasoning subsystems); the other is that the richer representation will enable inferences that are not possible at all, or at best much more cumbersome, in a restricted representation. An analogous, equally unreasonable restriction in programming language design would be to prohibit recursion, runtime computation of iteration bounds, and backward branching, as a way of assuring efficient execution. Much power would be lost in doing so.

• Ease of transduction between surface structure and SR, irrespective of the domain (i.e., modular, easily understood, easily edited representations of the transductions); Richard Montague's work (Dowty 1979) revealed a beautiful, direct correspondence between phrase structure and meaning structure, one that accounts for entailments (or their absence) even in intensional contexts. It would be astounding if this correspondence, which is the basis of

compositional semantics and interpretation, were a mere cultural accident.

• Availability of referents for anaphoric expressions; pronouns and definite noun phrases can refer to a wide variety of entities that were previously introduced explicitly or implicitly (including propositions, events, situations, actions, types of actions, kinds of entities and actions, and more); an adequate SR should make these referents available.

• Accord with semantic intuitions; we have many quite compelling intuitions about the meanings of the words we use (else lexicography would not be possible), about alternative interpretations of given phrases or sentences, and about the entailments of sentences or smaller linguistic units. An SR should do justice to these intuitions.

• Ease of use for inference; humans appear to perform numerous inferences both during and as a result of the language understanding process, and can answer questions inferentially about the proffered input, in concert with background knowledge; SRs should enable similar inferencing.

• (Related) Ease of integration with specialized methods for taxonomies, times, partonomies, spatial/imagistic representations, collections, symbolic entities, math, procedures, etc.; this is needed if we wish to match human proficiency in these pervasive domains.

• Formal interpretability; model theory formally specifies what kinds of entities in a domain of discourse are allowed as semantic values (denotations) of our SR vocabulary, and how we can derive the semantic values of more complex expressions from those of the basic vocabulary. As such it provides a tool to the SR designer for ensuring semantic coherence and consistent use of the SR; in addition, it provides a way of determining whether proposed inference methods will yield true conclusions from true premises, or at least yield plausible conclusions (depending on the kinds of inferences we are after).
That said, it should be noted that historically, the development of full formal semantics for Boolean logic, first-order logic, modal logics, and other more esoteric logics has lagged (often by decades) behind their syntactic development (including methods of proof), yet those syntactic developments rarely ran astray; careful intuitive consideration of the intended meaning of new constructs, guided by analogy with their linguistic counterparts, generally led to sensible results. Much the same can be said about the evolution of nonmonotonic and uncertain inference in AI. So we should be cognizant of the benefits of model theory, without depriving ourselves of essential expressive devices for lack of a model theory that would satisfy the editors of the Notre Dame Journal of Formal Logic. Indeed it is entirely possible that the "right" semantic representation for mechanizing human language understanding and use necessarily retains a certain NL-like looseness that thwarts full formalization.

• Trainability, ease of semantic rule learning; we wish to be able to improve the performance of systems that map between language and the SR using machine learning techniques, and indeed to learn the forms of the transduction rules; the latter may be easy or hard, depending on the directness of the correspondence between language and the SR – see the second bullet above.

Approaches

We now briefly characterize a wide variety of approaches to SR, pointing out pros and cons and occasionally mentioning extant implementations. Since the characterizations often group together multiple variants of a general approach, not everything said here applies to all members of a group, nor are all major variants cited. But hopefully the comments do capture enough of the distinctive "flavor" of each type of approach to render the cited pros and cons comprehensible.

Structured English; with polarity (and perhaps other) annotations; a very simple, direct "semantic representation"!
Pros: Traditionally, reliance on ordinary language has been considered antithetical to trustworthy reasoning; but a surprising recent development has been the successful implementation of Natural Logic (NLog) for obtaining obvious lexical entailments, such as, Jimmy Dean refuses to move without his blue jeans |= James Dean won't dance without pants (MacCartney and Manning 2009; Angeli and Manning 2014). The three main principles used in NLog are the following: (i) certain complement-taking lexical items (e.g., refuse, without) flip the polarity of their complement; (ii) we can replace a lexical item by a more general (more specific) one in a positive-polarity (negative-polarity) environment; and (iii) certain implicative verbs (e.g., manage, refuse) entail or at least strongly suggest the truth (or falsity) of their complement. In the example, Jimmy Dean can be replaced by the synonym James Dean; move can be replaced by the more specific term dance in the negative environment formed by refuse; blue jeans can be replaced by the more general term pants in the doubly negative, hence positive environment created by refuse and without; and refuse entails the negation of its complement, i.e., from refuse to V we can infer won't V.
A related approach to inference has come out of RTE research, making use of (soft) entailment relations between syntactic subtrees (Dagan et al. 2008; Berant, Dagan, and Goldberger 2011; Melamud et al. 2013). In fact a new area of statistical NL semantics has sprung up based on the idea that words are semantically similar if they tend to co-occur with the same neighboring words. Thus statistical semantics endeavors to reconstrue logical algebra as vector algebra, where words are high-dimensional vectors whose components are frequencies of collocation with other words or groups of words, and longer constituents are represented as vector compositions (Widdows 2004; Clarke 2012; Clark 2013).
If nothing else, the developments in the use of NLog and in statistical compositional semantics for computing entailments suggest that (as Montague's work also indicates) it pays to take surface structure seriously!
Cons: Language is rife with indexicals such as I, you, here,

now, tomorrow, this, remote, foreigner, etc., whose meaning depends on who is addressing whom, and when and where this is taking place, among other things. In addition, many words such as bank, letter, have, cool, settlement, etc., are ambiguous even as fixed parts of speech. This can easily lead to faulty inferences, such as, Mary had a little lamb |= Mary ate a bit of meat. Further, coreference is problematic in such approaches; e.g., That caused an uproar probably refers to something reported in the preceding sentence (which could be almost anything), and the participants in the uproar also need to be inferred from prior discourse or general knowledge. There are (currently) also severe logical inference gaps, such as inability to infer the second disjunct of a binary disjunction, given the negation of the first (the classical disjunctive syllogism).

Conceptual meaning representations (e.g., Schank & Abelson (1977), Jackendoff (1990), Baker et al. (1998)); these are canonical representations of meanings in terms of "conceptual primitives" such as CAUSE, GO, PTRANS (physically transport) or MOVE, based on systematic study of the apparent commonalities among lexical meanings; typical sentential representations involve assigning roles such as causal agent, theme, patient, or recipient to arguments of verbs, and decomposing verb meanings to reflect underlying primitives; for instance, Bill went into the house might be represented as something like
((BILL PTRANS BILL) TO-LOC (INSIDE-OF HOUSE))
or perhaps
[EVENT GO ([AGENT BILL], [PATH TO ([PLACE IN ([THING HOUSE])])])],
depending on the particular choices of primitives, and conventions in marking roles and entity types. Importantly, some conceptual-semantics theories also posit larger-scale conceptual structures, variously called scripts, schemas, or frames, intended to characterize stereotyped configurations of actions and events, such as commercial transactions in which a seller transfers goods to a buyer in exchange for money, or a restaurant patron going through the usual steps involved in restaurant dining (with participation by restaurant personnel, and use of a table, chair, etc.).
Pros: Assuming that ordinary language can be systematically mapped to primitive-based representations, paraphrastic variety may be reduced, simplifying inference. As well, to the extent that inference rules are tied to a small set of primitives, the number of rules needed for broad-coverage inference may be greatly reduced. Most importantly, perhaps, the larger-scale schemas can provide a means for rapidly recognizing what is going on in a story or other discourse, helping both with sentential interpretation and with inference of unmentioned events and participating entities.
Cons: The decompositional approach tends to lose subtle aspects of meaning; for example, Schank's representation of walking as PTRANSing oneself by MOVEing one's legs could equally well apply to running, hopping, skating, etc. As well, decomposition, coupled with schema/script instantiation, can lead to an explosion of primitive conceptualizations for simple sentences such as John dined at a restaurant, which would then greatly encumber answering a question such as Did John dine at a restaurant?, especially if the primitive conceptualizations from the original input were incorporated into a full-scale memory harboring millions of similar conceptualizations. Another issue is that formal interpretability is typically treated as irrelevant; but if we cannot say, for example, what in the world AGENT or PATH TO are intended to correspond to, then we cannot be sure that the larger semantic expressions we are constructing are meaningful, and that ultimately the conceptual forms assigned to sentences are true or false in the world whenever the NL sentences they were derived from were true or false in the world; and if that is the case, we have no basis for judging whether extended inferencing, beyond immediate consequences of the primitive conceptualizations derived from language, will lead to sensible conclusions or nonsense.

Thematic role representations (related to preceding item). These can be thought of as less radical departures from surface form than conceptual representations, retaining role notions such as agent, theme, recipient, location, etc., in representing verbal (and sometimes other) relations detected in parsed text, but not attempting decomposition into primitives or fitting the detected relations into stereotyped schemas.
Pros: Thematic role labeling lends itself well to ML techniques, perhaps synergistically with parse disambiguation and word sense disambiguation (Palmer, Gildea, and Xue 2010), and may prove to be of significant value in machine translation (since intuitively the roles involved in a given sentence should be fairly invariant across languages).
Cons: However, as a meaning representation, parsed sentences with role labels fall short in various ways, for instance making no real attempt to adequately capture the meaning of modification, quantification, or reification, and thus do not support a theory of entailment and inference in any strong sense.

FOL: predication + connectives + quantifiers + equality; this probably needs no further explanation. (Standard computational texts such as (Allen 1995) or (Jurafsky and Martin 2009) provide introductions to FOL-based NL interpretation.)
Pros: FOL is sufficiently expressive to capture the meaning of many English sentences, especially if these are concerned with objective, cut-and-dried physical matters such as employee rosters, product inventories, flight or hotel booking, purchases, and the like, rather than "human concerns" such as beliefs, desires, or plans, or imprecise or uncertain knowledge. FOL is well-understood, and inference machinery for it is obtainable off-the-shelf.
Cons: However, we have already noted expressive weaknesses in relation to NL. There are some ways of coercing modal talk, and even talk about propositions, into FOL, for example by expressing modal notions such as necessity or belief in terms of quantification over possible worlds; or by "functionalizing" all formulas, and using a 1-place Holds

or True predicate to assert that a functionalized formula is true. Thus we might say Holds(loves(Romeo,Juliet)) instead of Loves(Romeo,Juliet), where loves, in contrast with Loves, is a function that forms a propositional entity. However, such approaches become awkward when extended to full linguistic coverage: Modals don't cover all forms of intensionality, and the Holds device runs into trouble for embedded quantifiers, as in the sentence
Kim believes that every galaxy harbors life.
The problem is that quantifiers can't be functionalized, because functions don't bind variables. McCarthy (1990) had a tentative proposal concerning this problem, but it did not address generalized quantifiers or event reference, and his overall scheme involved considerable complications (concept functions, denotation functions, substitution functions, and other devices). The overarching question about such approaches is whether the representational complications, which render putative interpretations of English sentences quite hard to comprehend, are really warranted just to be able to deploy FOL technology (which is probably not optimized for functionalized FOL in any case). As argued earlier, nothing will be gained in terms of tractability. Ultimately, the tool should be adapted to the task, rather than the other way around.

DRT (Discourse Representation Theory): nested discourse structures with free variables, interpreted dynamically; this approach was developed by Kamp (1981) and Heim (1982) largely to address two issues concerning anaphoric pronouns: what principles govern binding of such pronouns (i.e., what prior noun phrases they can and cannot refer to); and how "donkey pronouns" could be interpreted in an intuitively satisfactory way:
If John owns a donkey, he will ride it in the parade.
If we interpret "a donkey" as an existentially quantified term (as we surely would if John owns a donkey stood alone) then, if we retain narrow scope for the quantifier, we won't be able to bind the pronoun it to the donkey-variable, as it lies outside the scope of the quantifier. If instead we scope the existential quantifier over the entire sentence, we obtain an unwanted reading, viz., there is some donkey such that if John owns it, he will ride it. But this will be true as long as there is some donkey that he doesn't own!
DRT represents indefinites as free variables, but uses a state-change semantics (where states are assignments of values to variables) that is capable of carrying earlier variable bindings forward to later occurrences of those variables. The details need not concern us, because DR structures derived from sentences can be converted to FOL. This is the method used by Johan Bos' Boxer system for semantic parsing (Bos 2008).
Pros: Before conversion to FOL, DR structures allow for systematic determination of "accessible" referents for anaphoric pronouns. After conversion to FOL, standard FOL inference technology can be applied.
Cons: The same expressivity issues as for FOL apply.

Semantic networks: These are graphical representations of predicate-argument structure or operator-operand structure, avoiding duplication of identical terms and other subexpressions. Every distinct term or subexpression corresponds to a distinct node, and there are labeled pointers from each such node to the nodes corresponding to its parts. (See the articles in Sowa, 1991.) For example, in the representation of Nemesis and Liriope both believe that Narcissus loves himself, the node for the love-proposition would have both a subject pointer and an object pointer to the node for Narcissus, and the two believe-propositions would both have object pointers to the love-proposition. There are various versions of semantic networks, many of which slightly exceed FOL in expressivity.
Pros: The main advantage of semantic networks is that graphical representations that don't duplicate subexpressions may suggest both ways of indexing knowledge (e.g., making propositions about an entity accessible from the node representing it) and ways of making inferences one might not otherwise think of (such as "marker-passing" schemes that exploit the transitivity of "isa" relations and other entailment-like relations).
Cons: As more and more propositions are added to a semantic network diagram, it tends to become an unreadable tangle of crossing and swerving lines. Of course, that is no impediment in terms of internal representations in a computer, and in fact many nominally non-network representations adopt a "structure-sharing" strategy that names all subexpressions and never duplicates the same structure. So the real limitations of semantic networks just lie in whatever expressive limitations they may have, or (less often) in lacking any semblance of a model theory. Thus they might lack disjunction, or generalized quantification, etc.; or they might consist of triples such as (Man own Dog) or (Dog (best friend-of) Man), where it is up to the reader to decide whether these are intended as claims about particular individuals or generic claims.

Description logics (DLs), such as OWL-DL: These generally consist of a terminological language + assertion language, where both are really aimed at supporting knowledge engineering in restricted domains, rather than NL interpretation (e.g., Cimiano et al. (2014)). As a semantic representation for linguistic content, the OWL-DL assertion language is entirely inadequate, allowing little more than ground predications; and even the terminological language is weak, if broad coverage is sought.

Hobbs' LF (Hobbs 2006): This is a "flat" representation in which all words are treated as predicates, typically containing ordinary individuals as well as "eventuality" arguments (events or propositions) they are considered to refer to.
Pros: Hobbs allows everything into the domain of individuals that can be referred to by a noun phrase; that gives it breadth of coverage and provides referents for referring expressions. In principle it also allows application of FOL inference machinery.
Cons: The reduction of all functional types – including quantifiers, connectives, and modifiers – to predicates, the approach to reification, and the amalgamation of events and propositions, seem deeply problematic:

• Conflation of propositions with events; this is illicit, witness The neighbors were woken up by John('s) firing a gun last night, vs. #The neighbors were woken up by the proposition that John fired a gun last night.

• Quantification is represented using "typical elements" of sets that are not members of those sets but have all the properties shared by the set elements, and no others; this can be shown to lead to contradiction, for instance by considering the set S = {0,1}, whose members share the property of being in S; thus the typical element is also in S, yet by definition, is not in S. Also this approach to quantification doesn't allow statements of type All members x of set S are R-related to all other members y of set S, as this will relate the typical element to itself.

• Failure to distinguish sentential and predicate modifiers; e.g., in Skeptically, John scrutinized the proof of Fermat's last theorem, vs. Undoubtedly, John scrutinized the proof of Fermat's last theorem.

As well, as an SR, Hobbs' LF does not do well on the criterion of intuitive understandability, as a result of neglecting all type distinctions and deriving "flat" rather than nested LFs.

Abstract Meaning Representation (AMR) (Banarescu et al. 2013): This has some similarity to Hobbs, blended with ideas from conceptual and frame representations; for example, the destruction of the room by the boy receives the same SR as the boy destroyed the room:
(d / destroy-01 :arg0 (b / boy) :arg1 (r / room)).
Similarly to fear and to be afraid of are reduced to the same fear-01 semantic frame. The free variables are perhaps implicit wide-scope existentials, though this is left open. Notably, there is no reliance on a syntactic parse; the idea is that such LFs should be learned from an SR-annotated corpus of otherwise unanalyzed sentences.
Pros: To the extent that the reduction of deverbal nouns and adjectives to their verbal basis can actually be learned, a kind of automated normalization might be achieved; an initial result is reported in (Braune, Bauer, and Knight 2014); application to semantics-based MT has also been initiated (Jones et al. 2012).
Cons: As semantics, this is very rough; e.g., The girl adjusted the machine and The girl made adjustments to the machine are assigned the same representation. So how would The girl made several major adjustments and many minor adjustments to the machine be represented? The AMR authors concede that quantification is not adequately handled, and hypothetical or imagined events are not distinguished; in other words, intensionality is ignored for now; there is no formal semantics, and so it is left to informal intuition exactly what is meant by plugging entities into slots such as :time, :domain, :mod, or :manner. It seems unlikely that this SR will lend itself to effective inferencing, if it is feasible at all.

Intensional logic (Montagovian, or Montague-inspired) (Dowty 1979); Montague treats language as an indexical intensional logic, with a strictly compositional semantics, i.e., the semantic denotation of any phrase is obtained by function composition of the denotations of its immediate (top-level) constituents; there are infinitely many types; NP meanings are uniformly construed as 2nd-order monadic predicates, applicable to 1st-order monadic predicates.
Pros: Montague develops a beautifully uniform correspondence between syntactic types and semantic types; intuitions about entailments of intensional locutions are accurately captured.
Cons: The base semantics is counterintuitive, e.g., names as referring to properties of properties; Montague introduced separate names for the entities themselves. The handling of quantifier scope ambiguity is awkward, based on generating alternative parses that front different NPs; this approach doesn't extend to coordinator scope ambiguity (as in Every boy loves Peggy or Sue). Inference is feasible but not particularly elegant, involving much adding and dropping of intension (∧) and extension (∨) operators¹ and λ-manipulation.

Montague-like compositional approaches, minus intensionality (typically retaining use of λ-calculus) (McAllester and Givan 1992; Artzi and Zettlemoyer 2013; Kwiatkowski et al. 2013; Howard, Tellex, and Roy 2013);
Pros: Elegant compositional semantics, as long as we do not venture into intensional locutions; ease of integration with feature-based, trainable disambiguation; learnability of the transduction.
Cons: Expressive limitations – no events or times, etc.

Situation semantics (SS) (Barwise and Perry 1983); SS was intended as a "realistic" semantics that represents the information contents of situations in terms of concrete relations – tuples of properties/relations and their participants, with positive or negative polarity; possible worlds (central to Montague semantics) are avoided in favor of situation types that are themselves part of the ontology.
Pros: Events or situations can be described by arbitrarily complex sentences; the semantics broadens the notion of context to allow (in effect) for the mental state of a perceiver in perception statements. Cooper (2005) imports some key ideas from SS into a DRT-like framework (with feature-structured "records" instead of DRSs), and shows examples of compositional LF computation for a few sentences, including "donkey sentences".
Cons: The metaphysics is very abstruse and complex (seemingly as the result of avoiding possibilia), the status of attitudes and other forms of intensionality remains somewhat unsettled, and the relationship of SS to the semantic representation of linguistic content has not been very extensively worked out. It is unclear how inference is to be performed in an SS-based SR, given the dynamic, context-dependent semantics in SS.
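The generalized-quantifier view of NP meanings that the Montague-like compositional approaches share can be made concrete with a minimal sketch (an illustration of mine, with invented names; not code from any of the cited systems): a determiner maps a restrictor predicate to a second-order predicate that applies to the VP denotation.

```python
# Sketch of a non-intensional, Montague-like compositional semantics
# (illustrative names only). NP meanings are generalized quantifiers:
# functions from a 1-place (VP) predicate to a truth value.

DOMAIN = {"al", "bo", "cy"}
boy = lambda x: x in {"al", "bo"}   # 1st-order monadic predicate
sleeps = lambda x: x in {"al"}

def every(restr):
    # "every N" denotes the set of properties true of all N's.
    return lambda pred: all(pred(x) for x in DOMAIN if restr(x))

def some(restr):
    # "some N" denotes the set of properties true of at least one N.
    return lambda pred: any(pred(x) for x in DOMAIN if restr(x))

# Composition mirrors phrase structure: [S [NP every boy] [VP sleeps]]
print(every(boy)(sleeps))  # False ("bo" is a boy who doesn't sleep)
print(some(boy)(sleeps))   # True  ("al" is a boy who sleeps)
```

In this style, quantifier scope ambiguity corresponds to the order in which such NP functions are applied, which is one reason the parse-based scoping mechanism criticized above becomes awkward.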
¹ For those acquainted with Montague Grammar: This complication can be avoided by making possible worlds the last, rather than the first, argument of interpretation functions; this is the strategy in episodic logic (also considered here).
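The world-as-last-argument strategy mentioned in the footnote can be sketched as follows (my own illustration with invented names, not actual system code): if a predicate takes its individual arguments first and the possible world last, then a fully applied predication already denotes a function from worlds to truth values, so no explicit intension/extension operators are needed.

```python
# Sketch of world-as-last-argument interpretation (illustrative only).
# A predication applied to individuals denotes a proposition:
# a function from possible worlds to truth values.

def loves(x, y):
    # World-last: applying to individuals yields a world -> bool function.
    return lambda w: (x, y) in w["loves"]

# Two possible worlds, modeled as simple fact databases.
w1 = {"loves": {("Romeo", "Juliet")}}
w2 = {"loves": set()}

proposition = loves("Romeo", "Juliet")
print(proposition(w1))  # True  (the fact holds in w1)
print(proposition(w2))  # False (it does not hold in w2)
```

The sketch is only meant to show why the bookkeeping with ∧ and ∨ operators disappears when the world argument is curried last.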

Episodic logic (EL): This is a Montague-inspired, language-like SR (and general knowledge representation) (Schubert and Hwang 2000; Schubert 2000; Hwang and Schubert 1994; Schubert 2014); it is first-order, intensional, and strictly compositional, apart from initially unscoped quantifiers, logical connectives and tense; this is reflected in very simple LF formation rules.
Pros: EL accounts very simply for quantifier and connective ambiguity. Instead of using arbitrarily high-order types, EL uses type-shifting operators that "lower" predicates and sentences to individuals; in this way it handles all forms of intensionality, as well as generalized quantifiers, etc. It treats events/situations (episodes) realistically, and they can therefore be characterized in arbitrarily many, specific or abstract ways. Integration with specialists is straightforward. Two well-developed inference engines exist, the latest being EPILOG 2 (Morbini and Schubert 2009; 2011); the former publication shows EPILOG 2 to be competitive in the FOL theorem-proving domain, despite its handling of a much enriched representation. Inference in EPILOG is fundamentally similar to NLog (but more general, see Schubert (2014b)), which is made possible by the close correspondence between EL's expressive devices and those of NL.
Cons: Inference remains brittle, and probabilistic inference is not implemented in a theoretically and practically satisfactory way.

Conclusions

In this brief survey of broad-coverage SRs, it has not been possible to evaluate each of the representational styles with respect to each desideratum – this would have required some 120 capsule commentaries. However, we can make the following general observations (where phrases referring to the initial desiderata are italicized).

NL-like expressivity is attained in few SRs that support inferences (beyond similarity judgements or type subsumption). Structured English, Montague-inspired approaches, EL, and Hobbs' LF come closest, though the first of these must ultimately address indexicality issues and inference limitations. Adequate referent availability, including reference to events, facts, etc., seems to be limited to a few "ontologically promiscuous" representations. Formal interpretability is an issue for structured English, conceptual SRs, thematic role representations (if taken beyond simple predications), some types of semantic networks, and AMR. Weakness in this area also tends to correlate with lack of inference mechanisms (though not entirely in the case of structured English or certain conceptual representations). Judging whether putative semantic representations of phrases and sentences accord with intuitions about their meaning and entailments is difficult or uncertain for many SRs; structured English and representations whose surface form is close to NL (e.g., Montagovian, or EL) fare best in this respect.

Ease of mapping from NL to the SR is also simplest and most transparent for NL-like SRs – often semantic rules are just a matter of applying the semantic functor corresponding to the head of a phrase to the semantic interpretations of the complements. Trainability so far is easiest for the least ambitious SRs, in terms of the semantic aspects of language being captured, or the domain addressed. Simple thematic role SRs, strictly limited non-intensional Montague-like SRs, and AMRs fare best so far. However, machine learning of the mapping from NL to SR is hardest, requiring myriad training examples, when the structural gap between the NL source and SR target is greatest. Given that even structured English readily lends itself to inference, it seems very worthwhile to attempt learning of semantic parsers for comprehensive, NL-like, but more formal SRs.

In short, broad, language-like, inference-enabling SRs in the spirit of Montague are feasible, while avoiding Montague's unbounded type theory and ubiquitous intension/extension operators. Such a representation, exemplified by EL, also reflects Hobbs' advocacy of ontological promiscuity, without trying to avoid natural type distinctions, such as the distinction between quantifiers and predicates – which surely is needed in a general world knowledge base supporting language understanding in any case.

References

Allen, J. F. 1995. Natural Language Understanding. Redwood City, CA: Benjamin/Cummings.
Angeli, G., and Manning, C. 2014. NaturalLI: Natural Logic inference for common sense reasoning. In Empirical Methods in Natural Language Processing (EMNLP).
Artzi, Y., and Zettlemoyer, L. 2013. Weakly supervised learning of semantic parsers for mapping instructions to actions. Trans. of the Assoc. for Computational Linguistics (TACL).
Artzi, Y.; Kwiatkowski, T.; and Berant, J., eds. 2014. ACL 2014 Workshop on Semantic Parsing (SP'14).
Baker, C.; Fillmore, C.; and Lowe, J. 1998. The Berkeley FrameNet project. In Ann. Meet. of the Assoc. for Computational Linguistics and the Int. Conf. on Computational Linguistics (ACL-COLING '98), 86–90.
Banarescu, L.; Bonial, C.; Cai, S.; Georgescu, M.; Griffitt, K.; Hermjakob, U.; Knight, K.; Koehn, P.; Palmer, M.; and Schneider, N. 2013. Abstract meaning representation for sembanking.
Barwise, J., and Perry, J. 1983. Situations and Attitudes. Cambridge, MA: MIT Press.
Berant, J.; Dagan, I.; and Goldberger, J. 2011. Global learning of typed entailment rules. In Ann. Meet. of the Assoc. for Computational Linguistics: Human Language Technologies, Vol. 1 (HLT'11).
Bos, J. 2008. Wide-coverage semantic analysis with Boxer. In Bos, J., and Delmonte, R., eds., Semantics in Text Processing (STEP 2008), 277–286. College Publications.
Braune, F.; Bauer, D.; and Knight, K. 2014. Mapping between English strings and reentrant semantic graphs. In Int. Conf. on Language Resources and Evaluation (LREC).
Cimiano, P.; Unger, C.; and McCrae, J. 2014. Ontology-Based Interpretation of Natural Language. Morgan & Claypool Publishers.

Clark, S. 2013. Vector space models of lexical meaning. In Lappin, S., and Fox, C., eds., Handbook of Contemporary Semantics, 2nd ed. Malden, MA: Blackwell.

Clark, P. 2015. Elementary school science and math tests as a driver for AI: Take the Aristo Challenge! To appear.

Clarke, D. 2012. A context-theoretic framework for compositionality in distributional semantics. Computational Linguistics 38(1):41–71.

Cooper, R. 2005. Records and record types in semantic theory. Journal of Logic and Computation 15(2):99–112.

Dagan, I.; Bar-Haim, R.; Szpektor, I.; Greental, I.; and Shnarch, E. 2008. Natural language as the basis for meaning representation and inference. In Gelbukh, A., ed., Computational Linguistics and Intelligent Text Processing, Lecture Notes in Computer Science 4919. Berlin: Springer. 151–170.

Dagan, I.; Glicksman, O.; and Magnini, B. 2006. The PASCAL recognising textual entailment challenge. In Quiñonero-Candela, J., et al., eds., MLCW 2005, LNAI 3944. Heidelberg: Springer.

Dowty, D. R. 1979. Word Meaning and Montague Grammar. Boston, MA: D. Reidel Publ. Co.

Eliasson, K. 2007. Case-based techniques used for dialogue understanding and planning in a human-robot dialogue system. In Proc. of the 20th Int. Joint Conf. on Artificial Intelligence (IJCAI’07), 1600–1605.

Heim, I. 1982. The Semantics of Definite and Indefinite Noun Phrases. Ph.D. Dissertation, University of Massachusetts, Amherst.

Hobbs, J. R. 2006. Discourse and inference. In progress at http://www.isi.edu/~hobbs/disinf-tc.html.

Howard, T.; Tellex, S.; and Roy, N. 2013. A natural language planner interface for mobile manipulators. In 30th Int. Conf. on Machine Learning, JMLR: W&CP volume 28.

Hwang, C., and Schubert, L. 1994. Interpreting tense, aspect, and time adverbials: a compositional, unified approach. In Gabbay, D., and Ohlbach, H., eds., Proc. of the 1st Int. Conf. on Temporal Logic. Bonn, Germany: Springer. 238–264.

Jackendoff, R. 1990. Semantic Structures. Cambridge, MA: MIT Press.

Jones, B.; Andreas, J.; Bauer, D.; Hermann, K.-M.; and Knight, K. 2012. Semantics-based machine translation with hyperedge replacement grammars. In Proc. of the Int. Conf. on Computational Linguistics (COLING’12).

Jurafsky, D., and Martin, J. H. 2009. Speech and Language Processing (2nd Edition). Upper Saddle River, NJ: Prentice-Hall.

Kamp, H. 1981. A theory of truth and semantic representation. In Groenendijk, J.; Janssen, T.; and Stokhof, M., eds., Formal Methods in the Study of Language. Mathematical Centre Tracts, U. Amsterdam. 277–320.

Kwiatkowski, T.; Choi, E.; Artzi, Y.; and Zettlemoyer, L. 2013. Scaling semantic parsers with on-the-fly ontology matching. In Empirical Methods in Natural Language Processing (EMNLP).

Levesque, H.; Davis, E.; and Morgenstern, L. 2012. The Winograd Schema Challenge. In Int. Conf. on the Principles of Knowledge Representation and Reasoning (KR’12).

MacCartney, B., and Manning, C. 2009. An extended model of natural logic. In 8th Int. Conf. on Computational Semantics (IWCS-8), Tilburg University, Netherlands.

McAllester, D., and Givan, R. 1992. Natural language syntax and first order inference. Artificial Intelligence 56:1–20.

McCarthy, J. 1990. First order theories of individual concepts and propositions. In McCarthy, J., and Lifschitz, V., eds., Formalizing Common Sense: Papers by John McCarthy.

Melamud, O.; Berant, J.; Dagan, I.; Goldberger, J.; and Szpektor, I. 2013. A two level model for context sensitive inference rules. In Ann. Meet. of the Assoc. for Computational Linguistics (ACL’13).

Morbini, F., and Schubert, L. K. 2009. Evaluation of EPILOG: a reasoner for Episodic Logic. In Commonsense 09.

Morbini, F., and Schubert, L. 2011. Metareasoning as an integral part of commonsense and autocognitive reasoning. In Cox, M., and Raja, A., eds., Metareasoning: Thinking about Thinking, Ch. 17. MIT Press. 267–282.

Palmer, M.; Gildea, D.; and Xue, N. 2010. Semantic Role Labeling. Morgan & Claypool.

Roemmele, M.; Bejan, C.; and Gordon, A. 2011. Choice of plausible alternatives: An evaluation of commonsense causal reasoning. In Int. Symp. on Logical Formalizations of Commonsense Reasoning.

Schank, R., and Abelson, R. 1977. Scripts, Plans, Goals, and Understanding. Hillsdale, NJ: Lawrence Erlbaum.

Schubert, L. K., and Hwang, C. H. 2000. Episodic Logic meets Little Red Riding Hood: A comprehensive, natural representation for language understanding. In Iwanska, L., and Shapiro, S., eds., Natural Language Processing and Knowledge Representation: Language for Knowledge and Knowledge for Language, 111–174. Menlo Park, CA, and Cambridge, MA: MIT/AAAI Press.

Schubert, L. K. 2000. The situations we talk about. In Minker, J., ed., Logic-Based Artificial Intelligence. Dordrecht: Kluwer. 407–439.

Schubert, L. 2014. From Treebank parses to Episodic Logic and commonsense inference. In Workshop on Semantic Parsing (SP’14).

Schubert, L. 2014b. NLog-like inference and commonsense reasoning. In Zaenen, A.; de Paiva, V.; and Condoravdi, C., eds., Perspectives on Semantic Representations for Textual Inference, special issue of Linguistic Issues in Language Technology – LiLT 9(9). Stanford, CA: CSLI Publications.

Sowa, J., ed. 1991. Principles of Semantic Networks: Explorations in the Representation of Knowledge. San Mateo, CA: Morgan Kaufmann.

Widdows, D. 2004. Geometry and Meaning. Stanford, CA: CSLI Publications.
