Efficient Processing of Flexible Categorial Grammar

Total Page:16

File Type:pdf, Size:1020Kb

Efficient Processing of Flexible Categorial Grammar EFFICIENT PROCESSING OF FLEXIBLE CATEGORIAL GRAMMAR Gosse Bouma Research Institute for Knowledge Systems Postbus 463, 6200 AL Maastricht The Netherlands e-mail(earn): exriksgb@hmarl5 ABSTRACT* (i) application : A/B B ==> A B B\A ==> A From a processing point of view, however, composition: A/B B/C ==> NC flexible categorial systems are problematic, C\B B~, ==> C~ since they introduce spurious ambiguity. In raising : A ==> (B/A)\B this paper, we present a flexible categorial A ==> B/(A\B) grammar which makes extensive use of the product-operator, first introduced by With this grammar many alternative con- Lambek (1958). The grammar has the prop- stituent structures for a sentence can be erty that for every reading of a sentence, a generated, even where this does not corre- strictly left-branching derivation can be spond to semantic ambiguities. From a lin- given. This leads to the definition of a subset guistic point of view, this has many advan- of the grammar, for which the spurious ambi- tages. Various kind of arguments for giving guity problem does not arise and efficient up traditional conceptions of constituent processing is possible. structure can be given, but the most con- vincing and well-documented case in favour of flexible constituent structure is coordi- 1. Flexibility vs. Ambiguity nation (see Steedman (1985), Dowty (1988), and Zwarts (1986)). Categorial Grammars owe much of their popularity to the fact that they allow for The standard assumption in generative various degrees of flexibility with respect to grammar is that coordination always takes constituent structure. From a processing place between between constituents. Right- point of view, however, flexible categorial node raising constructions and other in- systems are problematic, since they intro- stances of non-constituent conjunction are duce spurious ambiguity. problematic, because it is not clear what the status of the coordinated elements in these The best known example of a flexible constructions is. Flexible categorial gram- categorial grammar is a grammar containing mar presents an elegant solution for such the reduction rules application and compo- cases, since, next to canonical constituent sition, and the category changing rule rais- structures, it also admits various other con- ing 1 • stituent structures. Therefore, the sentences in (2) can be considered to be ordinary in- stances of coordination (of two categories slap and (vp/ap)\vp, respectively). * I would like to thank Esther K0nig, Erik-Jan van der Linden, Michael Moortgat, (2) a. John sold and Mary bought a book Adriaan van Paassen and the participants of s/vp vp/np s/vp vp/np np the Edinburgh Categorial Grammar Weekend, who made useful comments to earlier s/np s/np presentations of this material. All remaining errors and misconceptions are of course my own. 1 Throughout this paper we will be us- ing the notation of Lambek (1958), in which left-directional functor respectively, looking A/B and B\A are a right-directional and a for an argument of category B. - 19- tory rules in the grammar. Some careful rewriting of the original grammar has to take b. J. loves Mary madly and Sue wildly place, before things work as desired. vp/np np vp\vp np vp\vp Pareschi & Steedman (1987) propose an Mary madly efficient chart-parsing algorithm for catego- np vp\vp rial grammars with spurious ambiguity. In- stead of the usual strategy, in which all pos- (vplnp)\vp sible subconstituents are added to the chart, Pareschi & Steedman restrict themselves to (vp/np)Wp adding only those constituents that may lead to a difference in semantics. Thus, in (3) A somewhat different type of argument only the underlined constituents are in the for flexible phrase structure is based on the chart. The "---" constituent is not. way humans process natural language. In Ades & Steedman (1982) it is pointed out (3) John loves Mary madly that humans process natural language in a s/vp vp/np np vp\vp left-to-right, incremental, manner. This pro- cessing aspect is accounted for in a flexible categorial system, where constituents can be built for any part of a sentence. Since syn- tactic rules operate in parallel with semantic Combining 'madly' with the rest would be interpretation rules, building a syntactic impossible or lead to backtracking in the structure for an initial part of a sentence, normal case. Here, the Pareschi & Steedman implies that a corresponding semantic struc- algorithm starts looking for a constituent ture can also be constructed. left adjacent of madly, which contains an el- ement X/vp as a leftmost category. If such a These and other arguments suggest that constituent can be found, it can be concluded there is no such thing as a fixed constituent that the rest of that constituent must structure, but that the order in which ele- (implicitly) be a v p, and thus the validity of ments combine with eachother is rather free. combining vp\vp with this constituent has been established. Therefore, Pareschi & From a parsing point of view, however, Steedman are able to work with only a mini- flexibility appears to be a disadvantage. mal amount of items in the chart. Flexible categorial grammars produce large numbers of, often semantically equivalent, Both Wittenburg and Pareschi & Steed- derivations for a given phrase. This spurious man work with categorial grammars, which ambiguity problem (Wittenburg (1986)) contain restricted versions of composition makes efficient processing of flexible catego- and raising. Although they can be processed rial grammar problematic, since quite often efficiently, there is linguistic evidence that there is an exponential growth of the number they are not fully adequate for analysis of of possible derivations, relative to the length such phenomena as coordination. Since of the string to be parsed. atomic categories can in general not be raised in these grammars, sentence (2b) (in There have been two proposals for which the category n p has to be raised) eliminating spurious ambiguity from the cannot be derived. Furthermore, since grammar. The first is Wittenburg (1987). In composition is not generalized, as in Ades & this paper, a categorial grammar with compo- Steedman (1982), a sentence such as John sition and heavily restricted versions of sold but Mary donated a book to the library raising (for subject n p's only) is considered. would not be derivable. The possibilities for Wittenburg proposes to eliminate spurious left-to-right, incremental, processing are ambiguity by redefining composition. His also limited. Therefore, there is reason to predictive composition rules apply only in look for a more flexible system, for which those cases where they are really needed to efficient parsing is still possible. make a derivation possible. A disadvantage of this method, noticed by Wittenburg, is that one may have to add special predictive composition rules for all general combina- - 20 - 2. Structural Completeness (5) Strong Structural Complete- ness In the next section we present a gram- If a sequence of categories XI .. Xn matical calculus, which is more flexible than reduces to Y, with semantics Y', the systems considered by Wittenburg there is a reduction to Y, with se- (1987) and Pareschi & Steedman (1987), and mantics Y', for any bracketing of therefore is attractive for linguistic pur- XI..Xn into constituents. poses. At the same time, it offers a solution to the spurious ambiguity problem. Grammars with this property, can poten- Spurious ambiguity causes problems for tially circumvent the spurious ambiguity parsing in the systems mentioned above, be- problem, since for these grammars, we only cause there is no systematic relationship have to inspect all left-branching syntax between syntactic structures and semantic trees, to find all possible readings. This representations. That is, there is no way to method will only fail if the set of left- identify in advance, for a given sentence S, a branching trees itself would contain spuri- proper subset of the set of all possible syn- ous ambiguous derivations. In section 4 we tactic structures and associated semantic will show that these can be eliminated from representations, for which it holds that it the calculus presented below. will contain all possible semantic represen- tations of S. 3. The P-calculus Consider now a grammar for which the The P(roduct)-calculus is a categorial following property holds: grammar, based on Lambek (1958), which has the property of strong structural com- (4) Structural Completeness pleteness. If a sequence of categories X1 .. X n reduces to Y, there is a reduction to In Lambek (1958), the foundations of Y for any bracketing of X1..Xn into flexible categorial grammar are formulated constituents. (Moortgat, 1987:5) 2 in the form of a calculus for syntactic cate- gories. Well-known categorial rules, such as Structural complete grammars are interest- application, composition and category-rais- ing linguistically, since they are able to ing, are theorems of this calculus. A largely handle, for instance, all kinds of non-con- neglected aspect of this calculus, is the use stituent conjunction, and also because they of the product-operator. allow for strict left-to-right processing (see Moortgat, 1988). The calculus we present below, was developed as an alternative for Moortgat's The latter observation has consequences (1988) M-system. The M-system is a subset for parsing as well,, since, if we can parse of the Lambek-calculus, which uses, next to every sentence in a strict left-to-right man- application, only a very general form of ner (that is, we produce only strictly left- composition. Since it has no raising, it seems branching syntax trees), the parsing algo- to be an attractive candidate for investigat- rithm can be greatly simplified. Notice, ing the possibilities of left-associative however, that such a parsing strategy is only parsing for categorial grammar.
Recommended publications
  • A Type-Theoretical Analysis of Complex Verb Generation
    A Type-theoretical Analysis of Complex Verb Generation Satoshi Tojo Mitsubishi Research Institute, Inc. 2-22 Harumi 3, Chuo-ku Tokyo 104, Japan e-mail: [email protected], [email protected] Abstract hardly find a verb element in the target language, which corresponds with the original word exactly in Tense and aspect, together with mood and modality, meaning. However we also need to admit that the usually form the entangled structure of a complex verb. problem of knowledge representation of tense and as- They are often hard to translate by machines, because pect (namely time), or that of mood and modality is of both syntactic and semantic differences between lan- not only the problem of verb translation, but that of guages. This problem seriously affects upon the gen- the whole natural language understanding. In this pa- eration process because those verb components in in- per, in order to realize a type calculus in syntax, we terlingua are hardly rearranged correctly in the target depart from a rather mundane subset of tense, aspects, language. We propose here a method in which each and modMities, which will be argued in section 2. verb element is defined as a mathematical function ac- cording to its type of type theory. This formalism gives 1.2 The difference in the structure each element its legal position in the complex verb in the target language and certifies so-called partial trans- In the Japanese translation of (1): lation. In addition, the generation algorithm is totally oyoi-de-i-ta-rashii free from the stepwise calculation, and is available on parallel architecture.
    [Show full text]
  • Development of a General-Purpose Categorial Grammar Treebank
    Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), pages 5195–5201 Marseille, 11–16 May 2020 c European Language Resources Association (ELRA), licensed under CC-BY-NC Development of a General-Purpose Categorial Grammar Treebank Yusuke Kubota, Koji Mineshima, Noritsugu Hayashi, Shinya Okano NINJAL, Ochanomizu University, The University of Tokyo, idem. f10-2 Midori-cho, Tachikawa; 2-2-1 Otsuka, Bunkyo; 3-8-1 Komaba, Megurog, Tokyo, Japan [email protected], [email protected], fhayashi, [email protected] Abstract This paper introduces ABC Treebank, a general-purpose categorial grammar (CG) treebank for Japanese. It is ‘general-purpose’ in the sense that it is not tailored to a specific variant of CG, but rather aims to offer a theory-neutral linguistic resource (as much as possible) which can be converted to different versions of CG (specifically, CCG and Type-Logical Grammar) relatively easily. In terms of linguistic analysis, it improves over the existing Japanese CG treebank (Japanese CCGBank) on the treatment of certain linguistic phenomena (passives, causatives, and control/raising predicates) for which the lexical specification of the syntactic information reflecting local dependencies turns out to be crucial. In this paper, we describe the underlying ‘theory’ dubbed ABC Grammar that is taken as a basis for our treebank, outline the general construction of the corpus, and report on some preliminary results applying the treebank in a semantic parsing system for generating logical representations of sentences. Keywords: treebank, categorial grammar, Japanese, annotation 1. Introduction Table 1: Comparison with related work original corpus output This paper reports on the progress of the construction of a H&S Penn Treebank English CCG general-purpose categorial grammar treebank in Japanese Uematsu et al.
    [Show full text]
  • Direct Compositionality and Variable-Free Semantics: the Case of “Principle B”
    Direct Compositionality and Variable-Free Semantics: The Case of “Principle B” Drafty outline/handout of material to be presented at the Workshop on Direct Compositionality Pauline Jacobson Brown University 0. The big picture goals: (A) To argue that direct compositionality is a much simpler hypothesis about the organization of the grammar than the competing views It is, therefore, the view we should hope is correct. Apparent challenges to this view should perhaps lead us to rethink some of the underlying assumptions which give rise to these challenges. (B) To point out: a consequence of at least a reasonably strong version of direct compositionality is that structured representations (e.g., trees) play no actual role in the grammar: they are not something that grammatical rules/principles can “see”. They are just convenient representation for the lnguist (representing the syntactic proof system and the compositional semantics) (C) To point out the consequences of this with respect to principles regarding the distribution of pronouns (reflexives, etc.). Standard assumption: these require the grammar to include principles which are stated on non strictly-local chucks of representation Hence: the hypothesis of direct compositionality should inspire us to rethink such statements (D) To discuss a second way in which binding principles have traditionally been taken to challenge direct compositionality (at least in its strong form): • there are many cases where there are binding type effects (e.g., "binding" of pronouns, "binding" of reflexives to satisfy Principle A, Principle B violations, etc.) but where on the surface, there is no big enough chunk of representation Typical example : connectivity in specificational sentences (1) What John is is proud of himself.
    [Show full text]
  • Chapter 1 Negation in a Cross-Linguistic Perspective
    Chapter 1 Negation in a cross-linguistic perspective 0. Chapter summary This chapter introduces the empirical scope of our study on the expression and interpretation of negation in natural language. We start with some background notions on negation in logic and language, and continue with a discussion of more linguistic issues concerning negation at the syntax-semantics interface. We zoom in on cross- linguistic variation, both in a synchronic perspective (typology) and in a diachronic perspective (language change). Besides expressions of propositional negation, this book analyzes the form and interpretation of indefinites in the scope of negation. This raises the issue of negative polarity and its relation to negative concord. We present the main facts, criteria, and proposals developed in the literature on this topic. The chapter closes with an overview of the book. We use Optimality Theory to account for the syntax and semantics of negation in a cross-linguistic perspective. This theoretical framework is introduced in Chapter 2. 1 Negation in logic and language The main aim of this book is to provide an account of the patterns of negation we find in natural language. The expression and interpretation of negation in natural language has long fascinated philosophers, logicians, and linguists. Horn’s (1989) Natural history of negation opens with the following statement: “All human systems of communication contain a representation of negation. No animal communication system includes negative utterances, and consequently, none possesses a means for assigning truth value, for lying, for irony, or for coping with false or contradictory statements.” A bit further on the first page, Horn states: “Despite the simplicity of the one-place connective of propositional logic ( ¬p is true if and only if p is not true) and of the laws of inference in which it participate (e.g.
    [Show full text]
  • 1 English Subjectless Tagged Sentences Paul Kay Department Of
    1 English subjectless tagged sentences Paul Kay Department of Linguistics University of California Berkeley, CA 94720 [email protected] 2 Abstract A colloquial English sentence like Fooled us, didn't they? contains a finite main verb but no expressed subject. The identity of the missing subject of fooled is recovered from the tag subject they: compare Fooled us, didn't she?, Fooled us, didn't you? This paper argues (1) that such subjectless tagged sentences (STSs) pose a problem for grammatical approaches based on movement and empty categories and (2) that STSs receive a revealing analysis as part of a finely articulated family of tagged sentence constructions when viewed within a non-derivational, constructional, multiple-inheritance-based approach.* *I would like to thank Peter Culicover, Liliane Haegeman, Charles Fillmore Andreas Kathol and Richard Oehrle for comments on previous versions of this paper, as well as an anonymous reviewer for Language. They have doubtless offered more good advice than I have accepted. 3 0. Introduction. It has been argued from several points of view that whatever can be done with empty categories (ecs) can be done without them (Ades and Steedman 1982, Gazdar et al. 1984, Kaplan and Zaenen 1989, Pollard and Sag 1994 chapter 9, Sag and Fodor 1994, Kay and Fillmore 1999, Sag 1999). It has also been argued that, because there is no hard evidence for their existence, linguistic theory would be better off dispensing with these unobservable entities (Pickering and Barry 1991, Sag and Fodor 1994, Sag 1999).1 The present paper purports to take the argument one step further by showing that there are things that can be done without empty categories that cannot be done with them, at least not with any of the ecs currently available.
    [Show full text]
  • Categorial Grammar and Lexical-Functional Grammar∗
    Categorial Grammar and Lexical-Functional Grammar¤ Reinhard Muskens Department of Linguistics Tilburg University The Netherlands Proceedings of the LFG01 Conference University of Hong Kong, Hong Kong Miriam Butt and Tracy Holloway King (Editors) 2001 CSLI Publications http://csli-publications.stanford.edu/ ¤I wish to thank the participants of LFG2001, especially Mary Dalrymple and Ron Kaplan, for valuable feedback. All comments are very welcome. Abstract This paper introduces ¸-grammar, a form of categorial grammar that has much in common with LFG. Like other forms of categorial grammar, ¸-grammars are multi-dimensional and their components are combined in a strictly parallel fashion. Grammatical representa- tions are combined with the help of linear combinators, closed pure ¸-terms in which each abstractor binds exactly one variable. Mathe- matically this is equivalent to employing linear logic, in use in LFG for semantic composition, but the method seems more practicable. While ¸-grammars could be used to formalize many approaches to grammatical theory, they are certainly natural as a basis for the formalization of LFG. This leads to a theory I would like to call ¸- LFG. In this paper it will be shown how the standard components of LFG can be set up in the framework. We will have descriptions of c-structure, descriptions of f-structure, and semantics. The di®erence between de¯ning and constraining information will be explained in terms of entailment, and requirements on long-distance paths in f- structure will be explained in terms of entailment in the presence of a simple set of axioms. 1 Introduction In this paper I want to discuss a version of Lexical-Functional Grammar (LFG) that is also a version of Categorial Grammar (CG).
    [Show full text]
  • The Categorial Fine-Structure of Natural Language
    1 The categorial fine-structure of natural language Johan van Benthem, Amsterdam & Stanford, March 2003 Abstract Categorial grammar analyzes linguistic syntax and semantics in terms of type theory and lambda calculus. A major attraction of this approach is its unifying power, as its basic function/ argument structures occur across the foundations of mathematics, language and computation. This paper considers, in a light example-based manner, where this elegant logical paradigm stands when confronted with the wear and tear of reality. Starting from a brief history of the Lambek tradition since the 1980s, we discuss three main issues: (a) the fit of the lambda calculus engine to characte- ristic semantic structures in natural language, (b) coexistence of the original type-theoretic and more recent modal interpretations of categorial logics, and (c) the place of categorial grammars in the total architecture of natural language, which involves mixtures of interpretation and inference. 1 From Montague Grammar to Categorial Grammar Logic and linguistics have had lively connections from Antiquity right until today (GAMUT 1991). A recurrent theme in this history is the categorial structure of language and ontology, from Aristotle's grammatical categories to Russell's theory of types in the foundations of mathematics. Further bridges were thrown as logic and linguistics developed, with Lambek 1958 as a major advance in sophistication, many years ahead of its time. Large-scale logical analysis of natural language really took off around 1970, thanks to Montague's work (Montague 1975, Partee 1997). This system connects linguistic structure and logical semantics in the following architecture. The pivotal deep structure are syntax trees formed by grammatical construction rules.
    [Show full text]
  • Chapter 3 Language Analysis and Understanding
    Chapter 3 Language Analysis and Understanding 3.1 Overview a b Annie Zaenen & Hans Uszkoreit a Rank Xerox ResearchCentre, Grenoble, France b Deutsches Forschungszentrum fur Kunstlic he Intelligenz and Universitat des Saarlandes, Saarbruc ken, Germany We understand larger textual units bycombining our understanding of smaller ones. The main aim of linguistic theory is to showhow these larger units of meaning arise out of the combination of the smaller ones. This is mo deled by means of a grammar. Computational linguistics then tries to implement this pro cess in an ecientway.Itis traditional to sub divide the task into syntax and semantics, where syntax describ es how the di erent formal elements of a textual unit, most often the sentence, can b e combined and semantics describ es how the interpretation is calculated. In most language technology applications the enco ded linguistic knowledge, i.e., the grammar, is separated from the pro cessing comp onents. The grammar consists of a lexicon, and rules that syntactically and semantically combine words and phrases into larger phrases and sentences. A variety of representation languages have b een develop ed for the enco ding of linguistic knowledge. Some of these languages are more geared towards conformity with formal linguistic theories, others are designed to facilitate certain pro cessing mo dels or sp ecialized applications. Several language technology pro ducts that are on the market to day employ annotated 109 110 Chapter 3: Language Analysis and Understanding phrase-structure grammars, grammars with several hundreds or thousands of rules describing di erent phrase typ es. Each of these rules is annotated by features and sometimes also by expressions in a programming language.
    [Show full text]
  • The Logic of Categorial Grammars: Lecture Notes Christian Retoré
    The Logic of Categorial Grammars: Lecture Notes Christian Retoré To cite this version: Christian Retoré. The Logic of Categorial Grammars: Lecture Notes. RR-5703, INRIA. 2005, pp.105. inria-00070313 HAL Id: inria-00070313 https://hal.inria.fr/inria-00070313 Submitted on 19 May 2006 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. INSTITUT NATIONAL DE RECHERCHE EN INFORMATIQUE ET EN AUTOMATIQUE The Logic of Categorial Grammars Lecture Notes Christian Retoré N° 5703 Septembre 2005 Thème SYM apport de recherche ISRN INRIA/RR--5703--FR+ENG ISSN 0249-6399 The Logic of Categorial Grammars Lecture Notes Christian Retoré ∗ Thème SYM — Systèmes symboliques Projet Signes Rapport de recherche n° 5703 — Septembre 2005 —105 pages Abstract: These lecture notes present categorial grammars as deductive systems, and include detailed proofs of their main properties. The first chapter deals with Ajdukiewicz and Bar-Hillel categorial grammars (AB grammars), their relation to context-free grammars and their learning algorithms. The second chapter is devoted to the Lambek calculus as a deductive system; the weak equivalence with context free grammars is proved; we also define the mapping from a syntactic analysis to a higher-order logical formula, which describes the seman- tics of the parsed sentence.
    [Show full text]
  • An Experimental Approach to Linguistic Representation
    BEHAVIORAL AND BRAIN SCIENCES (2017), Page 1 of 61 doi:10.1017/S0140525X16002028, e282 An experimental approach to linguistic representation Holly P. Branigan Department of Psychology, University of Edinburgh, Edinburgh, EH8 9JZ, United Kingdom [email protected] http://www.ed.ac.uk/profile/holly-branigan Martin J. Pickering Department of Psychology, University of Edinburgh, Edinburgh, EH8 9JZ, United Kingdom [email protected] http://www.ed.ac.uk/profile/martin-pickering Abstract: Within the cognitive sciences, most researchers assume that it is the job of linguists to investigate how language is represented, and that they do so largely by building theories based on explicit judgments about patterns of acceptability – whereas it is the task of psychologists to determine how language is processed, and that in doing so, they do not typically question the linguists’ representational assumptions. We challenge this division of labor by arguing that structural priming provides an implicit method of investigating linguistic representations that should end the current reliance on acceptability judgments. Moreover, structural priming has now reached sufficient methodological maturity to provide substantial evidence about such representations. We argue that evidence from speakers’ tendency to repeat their own and others’ structural choices supports a linguistic architecture involving a single shallow level of syntax connected to a semantic level containing information about quantification, thematic relations, and information structure, as well as to a phonological level. Many of the linguistic distinctions often used to support complex (or multilevel) syntactic structure are instead captured by semantics; however, the syntactic level includes some specification of “missing” elements that are not realized at the phonological level.
    [Show full text]
  • Categorial Grammars
    PART 8: CATEGORIAL GRAMMARS Categorial grammar. Adjukiewicz 1935, Bar-Hillel 1953 Bidirectional Categorial grammar: CATB is the smallest set such that: 1. CAT0 is a finite subset of CAT 2. If X, Y CAT then X/Y CAT and X\Y CAT Interpretation: if α X and β X\Y then αβ Y left-concatenation if α X/Y and β Y then αβ X right-concatenation Uni-directional Categorial grammar: CAT is the smallest set such that: 1. CAT0 is a finite subset of CAT 2. If X, Y CAT then X/Y CAT Interpretation: if α X/Y and β Y then αβ X right-concatenation Examples: Bi-directional: Assume basic categories: DP, N, S D = DP/N IV = DP\S TV = IV/DP ADJ = N/N P = (IV/IV)/DP P = (N/N)/DP Examples: IV = DP\S IV forms an S with an NP on its left TV = IV/DP TV forms an IV with an NP on its right Hence: If Ronya DP, and purrs DP\S then Ronya purrs S. Ronya + purrs Ronya purrs DP DP\S S strokes + Ronya strokes Ronya (DP\S)/DP DP (DP\S) Pat + strokes Ronya Pat strokes Ronya DP (DP\S) S 216 Uni-directional: Assume basic categories IV, N, S DP = S/IV D = DP/ N TV = IV/DP ADJ = N/N P = (IV/IV)/DP P = (N/N)/DP Hence: If Ronya S/IV, and purrs IV then Ronya purrs S. Ronya + purrs Ronya purrs S/IV IV S strokes + Ronya strokes Ronya IV/(S/IV) S/IV IV Pat + strokes Ronya Pat strokes Ronya S/IV IV S Fact: (Bar-Hillel et.
    [Show full text]
  • Introduction to Categorial Grammar
    ACS Syntax and Semantics of Natural Language Lecture 1: Introduction to Categorial Grammar Stephen Clark Natural Language and Information Processing (NLIP) Group [email protected] Categorial Grammar (CG) 2 • Main responsibility for defining syntactic form is in the lexicon • Hence CG is a lexicalized theory of grammar – along with other theories of grammar such as HPSG, TAG, LFG, ... • Attractive linguistically because all language-dependent properties re- side in the lexicon – small number of combination rules are language-invariant • Also attractive computationally; e.g. supertagging for Categorial Gram- mar leads to highly efficient parsing (Clark and Curran, 2007) Connection with Semantics 3 • Categorial Grammar has a strong commitment to Frege’s Principle of Compositionality (along with Montague from the 70s): • The meaning of a phrase is a function of the meanings of the parts of the phrase and how those parts are put together – mathematically often described as a homomorphism (structure-preserving mapping) between syntactic and semantic representations (more on this later ...) Contrast with Phrase-Structure Rules 4 • Early Chomskian approach and much work in Generative Grammar uses rewrite rules or productions (as in a Context Free Grammar): S → NP VP VP → TV NP TV → {likes, sees,... } • Categorial Grammar captures the same information by assigning a func- tional type, or category, to grammatical entities • Has roots in early work by Polish mathematician Ajdukiewicz (1935) and even earlier in Russel’s theory of types Lexical Categories 5 • An elementary syntactic structure – a lexical category – is assigned to each word in a sentence, eg: walked: S\NP ‘give me an NP to my left and I return a sentence’ • Think of the lexical category for a verb as a function: NP is the argu- ment, S the result, and the slash indicates the direction of the argument Lexical Categories 6 • Atomic categories: S, N , NP, PP, ..
    [Show full text]