Graph Algebraic Combinatory Categorial Grammar

Total Page:16

File Type:pdf, Size:1020Kb

Graph Algebraic Combinatory Categorial Grammar Graph Algebraic Combinatory Categorial Grammar Sebastian Beschke Wolfgang Menzel Department of Informatics University of Hamburg Vogt-Kölln-Straße 30, 22527 Hamburg, Germany {beschke,menzel}@informatik.uni-hamburg.de Abstract large amounts of computational resources during parsing. One central contribution of this paper is This paper describes CCG/AMR, a novel a lexicon induction algorithm that produces a rel- grammar for semantic parsing of Abstract Meaning Representations. CCG/AMR equips atively compact lexicon. Combinatory Categorial Grammar derivations Combinatory Categorial Grammar (CCG) uses with graph semantics by assigning each CCG a transparent syntax-semantic interface that con- combinator an interpretation in terms of a structs meaning representations using λ-calculus. graph algebra. This makes lexicon induction challenging, as in- We provide an algorithm that induces a ducing λ-calculus terms essentially requires solv- CCG/AMR from a corpus and show that it cre- ing higher-order unification (Kwiatkowski et al., ates a compact lexicon with low ambiguity and 2010). In practice, heuristics are employed to achieves a robust coverage of 78% of the ex- manage the search space, but it would be prefer- amined sentences under ideal conditions. able to make use of a less powerful mechanism We also identify several phenomena that affect which better fits the problem domain. Graph alge- any approach relying either on CCG or graph bras such as the HR algebra (Courcelle and Engel- algebraic approaches for AMR parsing. This friet, 2012) constitute such a constrained mecha- includes differences of representation between CCG and AMR, as well as non-compositional nism. By combining CCG with the HR algebra, constructions that are not expressible through we are able to leverage the availability of syntac- a monotonic construction process. To our tic parsers for CCG while making use of tailored knowledge, this paper provides the first anal- semantic construction operations. ysis of these corpus issues. 1.1 Related Work 1 Introduction There is an extensive body of work on seman- With the release of the Abstract Meaning Rep- tic parsing of AMRs, using a range of techniques resentation (AMR) corpus (Knight et al., 2014), including maximum spanning tree-based parsing graph representations of meaning have taken cen- (Flanigan et al., 2014), transition-based parsing tre stage in research on semantic parsing. Se- (Wang et al., 2015; Ballesteros and Al-Onaizan, mantic parsing systems have to address the prob- 2017; Peng et al., 2018), machine translation (van lem of lexicon induction: extracting reusable lexi- Noord and Bos, 2017), graph grammars (Peng cal items from the sentential meaning representa- et al., 2015; Groschwitz et al., 2017), and CCG tions annotated in the AMR corpus. Since the cor- parsing (Artzi et al., 2015; Misra and Artzi, 2016). pus contains sentential meaning representations, The system of Artzi et al.(2015) is most sim- but no indication of their compositional structure, ilar to the present work. They induce a semantic derivations of the meaning representation cannot CCG using a translation of AMR into λ-calculus. be observed. Common approaches enumerate A key difference is that their training algorithm all conceivable lexical items that may have con- combines lexicon induction and parameter train- tributed to the derivation of the meaning represen- ing into a single phase. New lexical items are tation at hand (Artzi et al., 2015; Groschwitz et al., generated during each training step and then fil- 2017). This may produce very large lexicons with tered based upon the model’s current parameters. high degrees of ambiguity, and therefore require In contrast, in this work we focus on lexicon in- 54 Proceedings of the 7th Joint Conference on Lexical and Computational Semantics (*SEM), pages 54–64 New Orleans, June 5-6, 2018. c 2018 Association for Computational Linguistics duction, with parameter training to be performed 2.1 Combinatory Categorial Grammar in a subsequent step. CCG is a grammar formalism centered around a Another related system is presented in Lewis transparent syntax-semantics interface (Steedman, et al.(2015), where a CCG parser is adapted to 2000). A CCG consists of a small set of combina- produce shallow semantic dependency graphs. In tory rules, along with a lexicon of entries defining contrast, the meaning representations employed each word’s syntactic and semantic interpretation. here are abstract and do not directly refer to the A CCG lexical item, as used in this paper, con- sentence under analysis. tains one or several tokens, a syntactic category, Word-to-node alignments on the AMR corpus and a semantic category. The syntactic category is play an important role in this work. We exper- a functional type defining the types of arguments iment with JAMR’s rule-based aligner (Flanigan expected by the words and whether they are ex- et al., 2014) and the statistical ISI aligner (Pour- pected to the left or right. E.g., NP/N expects a damghani et al., 2014). noun (N) to the right (because of the rightward- Graph algebras have recently been applied to facing slash) and returns an NP – it is the type of AMR parsing (Koller, 2015; Groschwitz et al., determiners. (S NP) / NP is the category of tran- \ 2017), but not in combination with CCG. In con- sitive verbs, consuming first an NP from the right trast, we use syntactic CCG derivations to con- and then from the left, and returning a sentence. strain the space of possible derivations. However, See Figure 1a for an example. the idea of using a constrained version of the HR CCG derivations are created by recursively ap- algebra, introduced by Groschwitz et al.(2017), plying combinators to the lexical syntactic cate- is also used here, and our Application operator ef- gories, thus combining them into constituents. Be- fectively subsumes their Apply and Modify opera- sides Application, implementations of CCG also tions. use other combinators such as Composition, as well as specialized combinators for conjunctions 1.2 Tools and punctuation. Syntax parser We use EasyCCG (Lewis and Semantic categories are represented as λ- Steedman, 2014) to obtain syntax derivations. For calculus terms. A combinator is always applied robustness, we extract the ten best derivations to two constituents’ syntactic and semantic cate- produced by EasyCCG based on the CCGBank- gories at the same time, allowing semantic con- rebanked model (Honnibal et al., 2010). struction to be fully syntax-driven. Word-to-node alignments During lexicon induction, we make use of alignments between to- 2.2 Abstract Meaning Representation kens in the sentence and nodes in the meaning rep- The Abstract Meaning Representation (AMR) is resentation. We experiment with JAMR’s aligner a semantic meaning representation language that (Flanigan et al., 2014) and the ISI aligner (Pour- is purposefully syntax-agnostic (Banarescu et al., damghani et al., 2014). 2013). The data set used in this paper, the AMR We use Stanford CoreNLP (Man- Other tools 1.0 release (Knight et al., 2014), consists of En- ning et al., 2014) for tokenisation. glish sentences which have been directly anno- tated with meaning representations by human an- 2 Background notators. The task of semantic parsing is concerned with AMR represents meaning as labeled, directed building formal meaning representations for natu- graphs. Nodes are labeled with concepts, while ral language text. While meaning representations edges represent roles. Predicates and core roles can be elements of any formal language, in this pa- are drawn from PropBank (Kingsbury and Palmer, per we are concerned with Abstract Meaning Rep- 2002). In addition, a set of non-core roles has been resentations (AMRs). We use Combinatory Cat- defined, such as mod, poss, time, etc. egorial Grammar (CCG) as an underlying frame- Whether it was wise to define AMR indepen- work to explain how AMRs may be derived from dently of any derivational process has been de- the surface form words. To do so, we equip CCG bated (Bender et al., 2015), and in Section5 we with graph construction operators drawn from the will show some of the issues that arise when at- HR algebra. These concepts are introduced below. tempting to construct derivations for AMRs. 55 met and married (S NP)/NP conj (S NP)/NP The boy wants to sleep \ root root \ root meeth i andh i marryh i NP (S NP)/(S NP) S NP ARG0 ARG1 op1 op2 ARG0 ARG1 \ root\ \ root h i h i 0 1 0 1 0 1 want sleep h i h i h i h i h i h i ARG0 ARG1 ARG0 conj root boyh i 0 1 0 ((S NP)/NP) ((S NP)/NP) h i h i h i \ root \ \ > op1 h i S NP and op2 \ 2 marry root h i wanth i ARG0 ARG1 0 1 ARG0 ARG1 h i h i ARG0 0 sleep < h i (S NP)/NP) < \ S root op1 andh i op2 root wanth i meet ARG1 marry ARG0 ARG1 ARG0 ARG1 ARG0 boy sleep 0 ARG0 1 h i h i (a) Example for a derivation making use of two (b) Example for a derivation containing one Conjunction Application operations. and one Application operation. Step 1: Application. Insert sleep node into Step 1: Conjunction. Insert marry node into placeholder placeholder 1 ; merge 0 placeholders. 1 of and; shift placeholder 0 of and to 2 . Step 2: Application.h i Inserth i boy into placeholder Steph i 2: Application of the phraseh i and marriedh i to met; the 0 . operands’ same-numbered placeholders are merged. h i Figure 1: Examples for the semantic operations Application and Conjunction. 2.3 The HR Algebra 3 Graph Semantics for CCG During semantic parsing, a complete meaning rep- In CCG, semantic construction is syntax-driven in resentation needs to be built out of smaller compo- that the construction operations that are applied nents.
Recommended publications
  • A Type-Theoretical Analysis of Complex Verb Generation
    A Type-theoretical Analysis of Complex Verb Generation Satoshi Tojo Mitsubishi Research Institute, Inc. 2-22 Harumi 3, Chuo-ku Tokyo 104, Japan e-mail: [email protected], [email protected] Abstract hardly find a verb element in the target language, which corresponds with the original word exactly in Tense and aspect, together with mood and modality, meaning. However we also need to admit that the usually form the entangled structure of a complex verb. problem of knowledge representation of tense and as- They are often hard to translate by machines, because pect (namely time), or that of mood and modality is of both syntactic and semantic differences between lan- not only the problem of verb translation, but that of guages. This problem seriously affects upon the gen- the whole natural language understanding. In this pa- eration process because those verb components in in- per, in order to realize a type calculus in syntax, we terlingua are hardly rearranged correctly in the target depart from a rather mundane subset of tense, aspects, language. We propose here a method in which each and modMities, which will be argued in section 2. verb element is defined as a mathematical function ac- cording to its type of type theory. This formalism gives 1.2 The difference in the structure each element its legal position in the complex verb in the target language and certifies so-called partial trans- In the Japanese translation of (1): lation. In addition, the generation algorithm is totally oyoi-de-i-ta-rashii free from the stepwise calculation, and is available on parallel architecture.
    [Show full text]
  • Development of a General-Purpose Categorial Grammar Treebank
    Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), pages 5195–5201 Marseille, 11–16 May 2020 c European Language Resources Association (ELRA), licensed under CC-BY-NC Development of a General-Purpose Categorial Grammar Treebank Yusuke Kubota, Koji Mineshima, Noritsugu Hayashi, Shinya Okano NINJAL, Ochanomizu University, The University of Tokyo, idem. f10-2 Midori-cho, Tachikawa; 2-2-1 Otsuka, Bunkyo; 3-8-1 Komaba, Megurog, Tokyo, Japan [email protected], [email protected], fhayashi, [email protected] Abstract This paper introduces ABC Treebank, a general-purpose categorial grammar (CG) treebank for Japanese. It is ‘general-purpose’ in the sense that it is not tailored to a specific variant of CG, but rather aims to offer a theory-neutral linguistic resource (as much as possible) which can be converted to different versions of CG (specifically, CCG and Type-Logical Grammar) relatively easily. In terms of linguistic analysis, it improves over the existing Japanese CG treebank (Japanese CCGBank) on the treatment of certain linguistic phenomena (passives, causatives, and control/raising predicates) for which the lexical specification of the syntactic information reflecting local dependencies turns out to be crucial. In this paper, we describe the underlying ‘theory’ dubbed ABC Grammar that is taken as a basis for our treebank, outline the general construction of the corpus, and report on some preliminary results applying the treebank in a semantic parsing system for generating logical representations of sentences. Keywords: treebank, categorial grammar, Japanese, annotation 1. Introduction Table 1: Comparison with related work original corpus output This paper reports on the progress of the construction of a H&S Penn Treebank English CCG general-purpose categorial grammar treebank in Japanese Uematsu et al.
    [Show full text]
  • Direct Compositionality and Variable-Free Semantics: the Case of “Principle B”
    Direct Compositionality and Variable-Free Semantics: The Case of “Principle B” Drafty outline/handout of material to be presented at the Workshop on Direct Compositionality Pauline Jacobson Brown University 0. The big picture goals: (A) To argue that direct compositionality is a much simpler hypothesis about the organization of the grammar than the competing views It is, therefore, the view we should hope is correct. Apparent challenges to this view should perhaps lead us to rethink some of the underlying assumptions which give rise to these challenges. (B) To point out: a consequence of at least a reasonably strong version of direct compositionality is that structured representations (e.g., trees) play no actual role in the grammar: they are not something that grammatical rules/principles can “see”. They are just convenient representation for the lnguist (representing the syntactic proof system and the compositional semantics) (C) To point out the consequences of this with respect to principles regarding the distribution of pronouns (reflexives, etc.). Standard assumption: these require the grammar to include principles which are stated on non strictly-local chucks of representation Hence: the hypothesis of direct compositionality should inspire us to rethink such statements (D) To discuss a second way in which binding principles have traditionally been taken to challenge direct compositionality (at least in its strong form): • there are many cases where there are binding type effects (e.g., "binding" of pronouns, "binding" of reflexives to satisfy Principle A, Principle B violations, etc.) but where on the surface, there is no big enough chunk of representation Typical example : connectivity in specificational sentences (1) What John is is proud of himself.
    [Show full text]
  • Chapter 1 Negation in a Cross-Linguistic Perspective
    Chapter 1 Negation in a cross-linguistic perspective 0. Chapter summary This chapter introduces the empirical scope of our study on the expression and interpretation of negation in natural language. We start with some background notions on negation in logic and language, and continue with a discussion of more linguistic issues concerning negation at the syntax-semantics interface. We zoom in on cross- linguistic variation, both in a synchronic perspective (typology) and in a diachronic perspective (language change). Besides expressions of propositional negation, this book analyzes the form and interpretation of indefinites in the scope of negation. This raises the issue of negative polarity and its relation to negative concord. We present the main facts, criteria, and proposals developed in the literature on this topic. The chapter closes with an overview of the book. We use Optimality Theory to account for the syntax and semantics of negation in a cross-linguistic perspective. This theoretical framework is introduced in Chapter 2. 1 Negation in logic and language The main aim of this book is to provide an account of the patterns of negation we find in natural language. The expression and interpretation of negation in natural language has long fascinated philosophers, logicians, and linguists. Horn’s (1989) Natural history of negation opens with the following statement: “All human systems of communication contain a representation of negation. No animal communication system includes negative utterances, and consequently, none possesses a means for assigning truth value, for lying, for irony, or for coping with false or contradictory statements.” A bit further on the first page, Horn states: “Despite the simplicity of the one-place connective of propositional logic ( ¬p is true if and only if p is not true) and of the laws of inference in which it participate (e.g.
    [Show full text]
  • 1 English Subjectless Tagged Sentences Paul Kay Department Of
    1 English subjectless tagged sentences Paul Kay Department of Linguistics University of California Berkeley, CA 94720 [email protected] 2 Abstract A colloquial English sentence like Fooled us, didn't they? contains a finite main verb but no expressed subject. The identity of the missing subject of fooled is recovered from the tag subject they: compare Fooled us, didn't she?, Fooled us, didn't you? This paper argues (1) that such subjectless tagged sentences (STSs) pose a problem for grammatical approaches based on movement and empty categories and (2) that STSs receive a revealing analysis as part of a finely articulated family of tagged sentence constructions when viewed within a non-derivational, constructional, multiple-inheritance-based approach.* *I would like to thank Peter Culicover, Liliane Haegeman, Charles Fillmore Andreas Kathol and Richard Oehrle for comments on previous versions of this paper, as well as an anonymous reviewer for Language. They have doubtless offered more good advice than I have accepted. 3 0. Introduction. It has been argued from several points of view that whatever can be done with empty categories (ecs) can be done without them (Ades and Steedman 1982, Gazdar et al. 1984, Kaplan and Zaenen 1989, Pollard and Sag 1994 chapter 9, Sag and Fodor 1994, Kay and Fillmore 1999, Sag 1999). It has also been argued that, because there is no hard evidence for their existence, linguistic theory would be better off dispensing with these unobservable entities (Pickering and Barry 1991, Sag and Fodor 1994, Sag 1999).1 The present paper purports to take the argument one step further by showing that there are things that can be done without empty categories that cannot be done with them, at least not with any of the ecs currently available.
    [Show full text]
  • Categorial Grammar and Lexical-Functional Grammar∗
    Categorial Grammar and Lexical-Functional Grammar¤ Reinhard Muskens Department of Linguistics Tilburg University The Netherlands Proceedings of the LFG01 Conference University of Hong Kong, Hong Kong Miriam Butt and Tracy Holloway King (Editors) 2001 CSLI Publications http://csli-publications.stanford.edu/ ¤I wish to thank the participants of LFG2001, especially Mary Dalrymple and Ron Kaplan, for valuable feedback. All comments are very welcome. Abstract This paper introduces ¸-grammar, a form of categorial grammar that has much in common with LFG. Like other forms of categorial grammar, ¸-grammars are multi-dimensional and their components are combined in a strictly parallel fashion. Grammatical representa- tions are combined with the help of linear combinators, closed pure ¸-terms in which each abstractor binds exactly one variable. Mathe- matically this is equivalent to employing linear logic, in use in LFG for semantic composition, but the method seems more practicable. While ¸-grammars could be used to formalize many approaches to grammatical theory, they are certainly natural as a basis for the formalization of LFG. This leads to a theory I would like to call ¸- LFG. In this paper it will be shown how the standard components of LFG can be set up in the framework. We will have descriptions of c-structure, descriptions of f-structure, and semantics. The di®erence between de¯ning and constraining information will be explained in terms of entailment, and requirements on long-distance paths in f- structure will be explained in terms of entailment in the presence of a simple set of axioms. 1 Introduction In this paper I want to discuss a version of Lexical-Functional Grammar (LFG) that is also a version of Categorial Grammar (CG).
    [Show full text]
  • The Categorial Fine-Structure of Natural Language
    1 The categorial fine-structure of natural language Johan van Benthem, Amsterdam & Stanford, March 2003 Abstract Categorial grammar analyzes linguistic syntax and semantics in terms of type theory and lambda calculus. A major attraction of this approach is its unifying power, as its basic function/ argument structures occur across the foundations of mathematics, language and computation. This paper considers, in a light example-based manner, where this elegant logical paradigm stands when confronted with the wear and tear of reality. Starting from a brief history of the Lambek tradition since the 1980s, we discuss three main issues: (a) the fit of the lambda calculus engine to characte- ristic semantic structures in natural language, (b) coexistence of the original type-theoretic and more recent modal interpretations of categorial logics, and (c) the place of categorial grammars in the total architecture of natural language, which involves mixtures of interpretation and inference. 1 From Montague Grammar to Categorial Grammar Logic and linguistics have had lively connections from Antiquity right until today (GAMUT 1991). A recurrent theme in this history is the categorial structure of language and ontology, from Aristotle's grammatical categories to Russell's theory of types in the foundations of mathematics. Further bridges were thrown as logic and linguistics developed, with Lambek 1958 as a major advance in sophistication, many years ahead of its time. Large-scale logical analysis of natural language really took off around 1970, thanks to Montague's work (Montague 1975, Partee 1997). This system connects linguistic structure and logical semantics in the following architecture. The pivotal deep structure are syntax trees formed by grammatical construction rules.
    [Show full text]
  • Chapter 3 Language Analysis and Understanding
    Chapter 3 Language Analysis and Understanding 3.1 Overview a b Annie Zaenen & Hans Uszkoreit a Rank Xerox ResearchCentre, Grenoble, France b Deutsches Forschungszentrum fur Kunstlic he Intelligenz and Universitat des Saarlandes, Saarbruc ken, Germany We understand larger textual units bycombining our understanding of smaller ones. The main aim of linguistic theory is to showhow these larger units of meaning arise out of the combination of the smaller ones. This is mo deled by means of a grammar. Computational linguistics then tries to implement this pro cess in an ecientway.Itis traditional to sub divide the task into syntax and semantics, where syntax describ es how the di erent formal elements of a textual unit, most often the sentence, can b e combined and semantics describ es how the interpretation is calculated. In most language technology applications the enco ded linguistic knowledge, i.e., the grammar, is separated from the pro cessing comp onents. The grammar consists of a lexicon, and rules that syntactically and semantically combine words and phrases into larger phrases and sentences. A variety of representation languages have b een develop ed for the enco ding of linguistic knowledge. Some of these languages are more geared towards conformity with formal linguistic theories, others are designed to facilitate certain pro cessing mo dels or sp ecialized applications. Several language technology pro ducts that are on the market to day employ annotated 109 110 Chapter 3: Language Analysis and Understanding phrase-structure grammars, grammars with several hundreds or thousands of rules describing di erent phrase typ es. Each of these rules is annotated by features and sometimes also by expressions in a programming language.
    [Show full text]
  • The Logic of Categorial Grammars: Lecture Notes Christian Retoré
    The Logic of Categorial Grammars: Lecture Notes Christian Retoré To cite this version: Christian Retoré. The Logic of Categorial Grammars: Lecture Notes. RR-5703, INRIA. 2005, pp.105. inria-00070313 HAL Id: inria-00070313 https://hal.inria.fr/inria-00070313 Submitted on 19 May 2006 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. INSTITUT NATIONAL DE RECHERCHE EN INFORMATIQUE ET EN AUTOMATIQUE The Logic of Categorial Grammars Lecture Notes Christian Retoré N° 5703 Septembre 2005 Thème SYM apport de recherche ISRN INRIA/RR--5703--FR+ENG ISSN 0249-6399 The Logic of Categorial Grammars Lecture Notes Christian Retoré ∗ Thème SYM — Systèmes symboliques Projet Signes Rapport de recherche n° 5703 — Septembre 2005 —105 pages Abstract: These lecture notes present categorial grammars as deductive systems, and include detailed proofs of their main properties. The first chapter deals with Ajdukiewicz and Bar-Hillel categorial grammars (AB grammars), their relation to context-free grammars and their learning algorithms. The second chapter is devoted to the Lambek calculus as a deductive system; the weak equivalence with context free grammars is proved; we also define the mapping from a syntactic analysis to a higher-order logical formula, which describes the seman- tics of the parsed sentence.
    [Show full text]
  • An Experimental Approach to Linguistic Representation
    BEHAVIORAL AND BRAIN SCIENCES (2017), Page 1 of 61 doi:10.1017/S0140525X16002028, e282 An experimental approach to linguistic representation Holly P. Branigan Department of Psychology, University of Edinburgh, Edinburgh, EH8 9JZ, United Kingdom [email protected] http://www.ed.ac.uk/profile/holly-branigan Martin J. Pickering Department of Psychology, University of Edinburgh, Edinburgh, EH8 9JZ, United Kingdom [email protected] http://www.ed.ac.uk/profile/martin-pickering Abstract: Within the cognitive sciences, most researchers assume that it is the job of linguists to investigate how language is represented, and that they do so largely by building theories based on explicit judgments about patterns of acceptability – whereas it is the task of psychologists to determine how language is processed, and that in doing so, they do not typically question the linguists’ representational assumptions. We challenge this division of labor by arguing that structural priming provides an implicit method of investigating linguistic representations that should end the current reliance on acceptability judgments. Moreover, structural priming has now reached sufficient methodological maturity to provide substantial evidence about such representations. We argue that evidence from speakers’ tendency to repeat their own and others’ structural choices supports a linguistic architecture involving a single shallow level of syntax connected to a semantic level containing information about quantification, thematic relations, and information structure, as well as to a phonological level. Many of the linguistic distinctions often used to support complex (or multilevel) syntactic structure are instead captured by semantics; however, the syntactic level includes some specification of “missing” elements that are not realized at the phonological level.
    [Show full text]
  • Categorial Grammars
    PART 8: CATEGORIAL GRAMMARS Categorial grammar. Adjukiewicz 1935, Bar-Hillel 1953 Bidirectional Categorial grammar: CATB is the smallest set such that: 1. CAT0 is a finite subset of CAT 2. If X, Y CAT then X/Y CAT and X\Y CAT Interpretation: if α X and β X\Y then αβ Y left-concatenation if α X/Y and β Y then αβ X right-concatenation Uni-directional Categorial grammar: CAT is the smallest set such that: 1. CAT0 is a finite subset of CAT 2. If X, Y CAT then X/Y CAT Interpretation: if α X/Y and β Y then αβ X right-concatenation Examples: Bi-directional: Assume basic categories: DP, N, S D = DP/N IV = DP\S TV = IV/DP ADJ = N/N P = (IV/IV)/DP P = (N/N)/DP Examples: IV = DP\S IV forms an S with an NP on its left TV = IV/DP TV forms an IV with an NP on its right Hence: If Ronya DP, and purrs DP\S then Ronya purrs S. Ronya + purrs Ronya purrs DP DP\S S strokes + Ronya strokes Ronya (DP\S)/DP DP (DP\S) Pat + strokes Ronya Pat strokes Ronya DP (DP\S) S 216 Uni-directional: Assume basic categories IV, N, S DP = S/IV D = DP/ N TV = IV/DP ADJ = N/N P = (IV/IV)/DP P = (N/N)/DP Hence: If Ronya S/IV, and purrs IV then Ronya purrs S. Ronya + purrs Ronya purrs S/IV IV S strokes + Ronya strokes Ronya IV/(S/IV) S/IV IV Pat + strokes Ronya Pat strokes Ronya S/IV IV S Fact: (Bar-Hillel et.
    [Show full text]
  • Introduction to Categorial Grammar
    ACS Syntax and Semantics of Natural Language Lecture 1: Introduction to Categorial Grammar Stephen Clark Natural Language and Information Processing (NLIP) Group [email protected] Categorial Grammar (CG) 2 • Main responsibility for defining syntactic form is in the lexicon • Hence CG is a lexicalized theory of grammar – along with other theories of grammar such as HPSG, TAG, LFG, ... • Attractive linguistically because all language-dependent properties re- side in the lexicon – small number of combination rules are language-invariant • Also attractive computationally; e.g. supertagging for Categorial Gram- mar leads to highly efficient parsing (Clark and Curran, 2007) Connection with Semantics 3 • Categorial Grammar has a strong commitment to Frege’s Principle of Compositionality (along with Montague from the 70s): • The meaning of a phrase is a function of the meanings of the parts of the phrase and how those parts are put together – mathematically often described as a homomorphism (structure-preserving mapping) between syntactic and semantic representations (more on this later ...) Contrast with Phrase-Structure Rules 4 • Early Chomskian approach and much work in Generative Grammar uses rewrite rules or productions (as in a Context Free Grammar): S → NP VP VP → TV NP TV → {likes, sees,... } • Categorial Grammar captures the same information by assigning a func- tional type, or category, to grammatical entities • Has roots in early work by Polish mathematician Ajdukiewicz (1935) and even earlier in Russel’s theory of types Lexical Categories 5 • An elementary syntactic structure – a lexical category – is assigned to each word in a sentence, eg: walked: S\NP ‘give me an NP to my left and I return a sentence’ • Think of the lexical category for a verb as a function: NP is the argu- ment, S the result, and the slash indicates the direction of the argument Lexical Categories 6 • Atomic categories: S, N , NP, PP, ..
    [Show full text]