Universal Dependencies

Total Page:16

File Type:pdf, Size:1020Kb

Universal Dependencies The <richer> the people, the <bigger> the crates the more <literal> the <better> the less <elaborate> you can be the <better> The <slower> the <better> Linguistic Institute 2017 the <higher> the temperature the <darker> the malt Corpus Linguistics The <older> a database is, the Treebanks & Dependencies <richer> it becomes The <higher> the clay content, the more <sticky> the Nathan Schneider soil The <fresher> the [email protected] soot, the <longer> it needed Amir Zeldes the <higher> the number, the <greater> [email protected] the protection The <older> you are the <greater> the chance The <smaller> the mesh the more <expensive> the net 1 Taking Stock Thus far in this course: • Why Corpora? ‣ Issues in assembling corpora • Markup, tokenization, metadata • Searching, frequency, collocations • Annotation: POS 2 Coming Attractions • Syntactic Treebanks ‣ Annotation: Universal Dependencies • Computational Lexical Semantics ‣ Annotation: Entity Types ‣ WordNet, Frame Semantics, Distributional Vectors 3 POS Tags Aren’t Everything • A POS tag helps narrow down what grammatical constructions a word may participate in. ‣ The main challenge in: “Will Trey Gowdy Benghazi Trump with inquiries?” ‣ But tag doesn’t exactly specify how the token relates to other tokens in the sentence. • We want to be able to search corpora for words in certain syntactic contexts. Helps us answer questions like: ‣ When does adverbial home tend to precede vs. follow the object? (Fillmore 1992, p. 48: take home the leftovers vs. take the leftovers home) ‣ What kinds of nouns prefer to be subjects vs. objects? ‣ Which verbs can take infinitival complements? 4 Ambiguity beyond POS • Lots of constructions = lots of ambiguity! Sometimes humans even notice it: ‣ PP attachment: “Illinois Sends Bill Allowing Gay Marriage to Governor” ‣ Adjective attachment: “Police Shoot Dead Suspect Inside L.A. Emergency Room” ‣ Verb argument attachment: “Attorneys for Afghan family detained by immigration officials in Los Angeles obtain restraining order” (Who was detained?) ‣ Verb argument function (depends on VVD vs. VVN): “Top U.N. Climate Official Denied Meeting with U.S. Secretary of State” • We also want to be able to resolve ambiguity automatically for natural language understanding. ‣ Foreshadowing future lectures: We’ll need additional representations for sense ambiguity, e.g. in “Campaign manager for Donald Trump is charged with battery” Credit: Dirk Hovy and Jonathan May for pointing out some of these headlines 5 Treebanks • A treebank is a corpus of sentences with syntactic trees, for ‣ Corpus-based studies involving syntax ‣ Evaluating syntactic parsers ‣ Training statistical syntactic parsers • Gold standard trees: human-annotated from scratch or manually corrected parser output ‣ Silver trees: uncorrected parser output, will contain many errors 6 Types of Syntax Trees • Phrase structure or constituency trees ‣ Nested bracketing of the sentence, usually with constituent labels like S (sentence), VP (verb phrase), etc., down to POS tags for individual tokens • Dependency trees ‣ Edges (often labeled) connect words directly • Other formalisms: CCG, LFG, HPSG, construction grammar, etc. have other kinds of structure such as nested feature structures 7 Constuency Treebanks • English ‣ Penn Treebank (PTB; Marcus et al., 1993): English; primarily, trees for 1M words of Wall Street Journal news articles in 1989. ‣ OntoNotes (Hovy et al., 2006; Pradhan et al., 2013): extends PTB with more genres (broadcast news, web, …), two additional languages (Arabic, Chinese), and semantic annotations (word senses, named entities, coreference, PropBank predicate-argument structures). OntoNotes 5.0 has 3M words—50% English, 40% Chinese, 10% Arabic. ‣ English Web Treebank (Bies et al., 2012): PTB-style trees, 5 genres—blogs, email, newsgroups, reviews, questions-answers. 250K words. • French Treebank, TIGER (German), … • Other formalisms: CCGBank (converted from PTB); Redwoods (HPSG), … 8 Dependency Treebanks • Prague School dependency syntax ‣ Czech–English Dependency Treebank: WSJ sentences and their Czech translations, in a rich multilayer formalism • Universal Dependencies (UD) corpora in several dozen languages and various genres ‣ including a manually-corrected conversion of the English Web Treebank • Twitter: e.g., Tweebank (Kong et al. 2014) 9 Cost & Licensing • Creating a high-quality treebank is expensive ‣ Need to hire linguists (often linguistics grad students) ‣ Training, guidelines development ‣ Speed of annotation ‣ Quality control • PTB and many other corpora licensed through the Linguistic Data Consortium (LDC) and similar entities 10 Constuency Trees 11 F331 Encourage the creation of local marketing cooperatives and <tree banks> for small woodland growers. sion of the corpus into a standardised knowledge-representation language, or <tree bank>, held on computer A standard generative tree for German Though since 0 days 0 thick clouds t t the sky t cover, tells the weather-report t t t t t fore, that towards 0 evening the sun t t tshines 20/33 12 F331 Encourage the creation of local marketing cooperatives and <tree banks> for small woodland growers. sion of the corpus into a standardised knowledge-representation language, or <tree bank>, held on computer A computational parse equivalent (PTB style) Penn Treebank Though since days thick clouds the sky cover, tells the current weather-report fore, that towards evening the sun shines 21/33 13 Pierre Vinken , 61 years old , will join the board as a nonexecutive director Nov. 29 . P (S (NP-SBJ (NP (NNP Pierre) (NNP Vinken)) (, ,) (ADJP (NP (CD 61) (NNS years)) (JJ old)) (, ,)) (VP (MD will) The first sentence of the Penn Treebank (VP (Wall Street Journal) (VB join) (NP (DT the) (NN board)) (PP-CLR (IN as) (NP (DT a) (JJ nonexecutive) (NN director))) (NP-TMP (NNP Nov.) (CD 29)))) (. .)) 14 Pierre Vinken , 61 years old , will join the board as a nonexecutive director Nov. 29 . P (S (NP-SBJ (NP (NNP Pierre) (NNP Vinken)) (, ,) (ADJP (NP (CD 61) (NNS years)) (JJ old)) (, ,)) (VP (MD will) POS tags (preterminals) / tokens (VP (VB join) (NP (DT the) (NN board)) (PP-CLR (IN as) (NP (DT a) (JJ nonexecutive) (NN director))) (NP-TMP (NNP Nov.) (CD 29)))) (. .)) 15 Pierre Vinken , 61 years old , will join the board as a nonexecutive director Nov. 29 . P (S (NP-SBJ (NP (NNP Pierre) (NNP Vinken)) (, ,) (ADJP (NP (CD 61) (NNS years)) (JJ old)) (, ,)) (VP (MD will) constituents (nonterminals) (VP POS tags (preterminals) / tokens (VB join) (NP (DT the) (NN board)) (PP-CLR (IN as) (NP (DT a) (JJ nonexecutive) (NN director))) (NP-TMP (NNP Nov.) (CD 29)))) (. .)) 16 Pierre Vinken , 61 years old , will join the board as a nonexecutive director Nov. 29 . P (S (NP-SBJ (NP (NNP Pierre) (NNP Vinken)) (, ,) (ADJP (NP (CD 61) (NNS years)) (JJ old)) (, ,)) (VP (MD will) nonbinary branching (VP (VB join) (NP (DT the) (NN board)) (PP-CLR (IN as) (NP (DT a) (JJ nonexecutive) (NN director))) (NP-TMP (NNP Nov.) (CD 29)))) (. .)) 17 Pierre Vinken , 61 years old , will join the board as a nonexecutive director Nov. 29 . P (S (NP-SBJ (NP (NNP Pierre) (NNP Vinken)) (, ,) (ADJP (NP (CD 61) (NNS years)) (JJ old)) (, ,)) (VP (MD will) some constituents have function tags (VP (subject, temporal, (VB join) PP-CLR = “closely related” PP) (NP (DT the) (NN board)) (PP-CLR (IN as) (NP (DT a) (JJ nonexecutive) (NN director))) (NP-TMP (NNP Nov.) (CD 29)))) (. .)) 18 kids saw birds with fish Lexicalized Constituency ParseLexicalized Constituency Parse S-sawS S-saw ¨H ¨¨HH ¨¨ HH ¨ H ¨ H ¨¨ HH ¨¨ HH ¨ H ¨ H ¨¨ HH ¨¨ HH ¨ H ¨ H ¨ H NP-kidsNP VP-sawVP NP-kids VP-saw ¨H ¨¨HH ¨ H ¨ H ¨¨ HH ¨¨ HH ¨ H ¨ H ¨¨ HH ¨¨ HH ¨ H kids ¨ H kids ¨ H V-sawV NP-birdsNP V-saw NP-birds ¨H ¨¨HH ¨ H ¨ H ¨¨ HH ¨¨ HH ¨ H ¨ H ¨¨ HH saw ¨¨ HH saw ¨ H NP-birdsNP PP-fishPP NP-birds PP-fish ¨H ¨H ¨ H ¨¨ HH ¨¨ HH ¨ H ¨ H ¨ H birds P-withP NP-fishNP birds P-with NP-fish with fish with fish It is sometimes useful to create a lexicalized NathanSchneider ENLPLecture18NathanSchneiderconstituency 5 ENLPLecture18parse, where each nonterminal 5 label includes the phrasal head. (How would you determine this?) 19 Head Rules • A set 20of headCHAPTER rules11 systematizesFORMAL theGRAMMARS selection OF ofE NGLISHa lexical head for each constituent so it can be done automatically. • Parent Direction Priority List ADJP Left NNS QP NN $ ADVP JJ VBN VBG ADJP JJR NP JJS DT FW RBR RBS SBAR RB ADVP Right RB RBR RBS FW ADVP TO CD JJR JJ IN NP JJS NN PRN Left PRT Right RP QP Left $ IN NNS NN JJ RB DT CD NCD QP JJR JJS S Left TO IN VP S SBAR ADJP UCP NP SBAR Left WHNP WHPP WHADVP WHADJP IN DT S SQ SINV SBAR FRAG VP Left TO VBD VBN MD VBZ VB VBG VBP VP ADJP NN NNS NP Figure 11.12 Selected head rules from Collins (1999). The set of head rules is often called a head percola- tion table. Jurafsky & Martin: SLP3 online draft, ch. 11 ‣ Traverse11.5 the Grammartree bottom-up. The Equivalence last row says how and to choose Normal a head for Form a VP constituent: ✴ First scan its daughters from left to right until a TO node is encountered; if it is, copy its head (the word it is the tag for). A formal language is defined as a (possibly infinite) set of strings of words. This ✴ Otherwise, scansuggests its daughters that we could from ask left if to two right grammars until a VBD are equivalent node is encountered; by asking if theyif it is, gener- copy its head. ate the same set of strings. In fact, it is possible to have two distinct context-free grammars generate the same language. ✴ … We usually distinguish two kinds of grammar equivalence: weak equivalence and strong equivalence. Two grammars are strongly equivalent if they generate the • Different headednesssame conventions set of strings (e.g.,and if for they PPs—the assign the P same or the phrase N?) structurerequire todifferent each sentence head rules.
Recommended publications
  • Chapter 14: Dependency Parsing
    Speech and Language Processing. Daniel Jurafsky & James H. Martin. Copyright © 2021. All rights reserved. Draft of September 21, 2021. CHAPTER 14 Dependency Parsing The focus of the two previous chapters has been on context-free grammars and constituent-based representations. Here we present another important family of dependency grammars grammar formalisms called dependency grammars. In dependency formalisms, phrasal constituents and phrase-structure rules do not play a direct role. Instead, the syntactic structure of a sentence is described solely in terms of directed binary grammatical relations between the words, as in the following dependency parse: root dobj det nmod (14.1) nsubj nmod case I prefer the morning flight through Denver Relations among the words are illustrated above the sentence with directed, labeled typed dependency arcs from heads to dependents. We call this a typed dependency structure because the labels are drawn from a fixed inventory of grammatical relations. A root node explicitly marks the root of the tree, the head of the entire structure. Figure 14.1 shows the same dependency analysis as a tree alongside its corre- sponding phrase-structure analysis of the kind given in Chapter 12. Note the ab- sence of nodes corresponding to phrasal constituents or lexical categories in the dependency parse; the internal structure of the dependency parse consists solely of directed relations between lexical items in the sentence. These head-dependent re- lationships directly encode important information that is often buried in the more complex phrase-structure parses. For example, the arguments to the verb prefer are directly linked to it in the dependency structure, while their connection to the main verb is more distant in the phrase-structure tree.
    [Show full text]
  • Dependency Grammars and Parsers
    Dependency Grammars and Parsers Deep Processing for NLP Ling571 January 30, 2017 Roadmap Dependency Grammars Definition Motivation: Limitations of Context-Free Grammars Dependency Parsing By conversion to CFG By Graph-based models By transition-based parsing Dependency Grammar CFGs: Phrase-structure grammars Focus on modeling constituent structure Dependency grammars: Syntactic structure described in terms of Words Syntactic/Semantic relations between words Dependency Parse A dependency parse is a tree, where Nodes correspond to words in utterance Edges between nodes represent dependency relations Relations may be labeled (or not) Dependency Relations Speech and Language Processing - 1/29/17 Jurafsky and Martin 5 Dependency Parse Example They hid the letter on the shelf Why Dependency Grammar? More natural representation for many tasks Clear encapsulation of predicate-argument structure Phrase structure may obscure, e.g. wh-movement Good match for question-answering, relation extraction Who did what to whom Build on parallelism of relations between question/relation specifications and answer sentences Why Dependency Grammar? Easier handling of flexible or free word order How does CFG handle variations in word order? Adds extra phrases structure rules for alternatives Minor issue in English, explosive in other langs What about dependency grammar? No difference: link represents relation Abstracts away from surface word order Why Dependency Grammar? Natural efficiencies: CFG: Must derive full trees of many
    [Show full text]
  • Dependency Grammar and the Parsing of Chinese Sentences*
    Dependency Grammar and the Parsing of Chinese Sentences* LAI Bong Yeung Tom Department of Chinese, Translation and Linguistics City University of Hong Kong Department of Computer Science and Technology Tsinghua University, Beijing E-mail: [email protected] HUANG Changning Department of Computer Science and Technology Tsinghua University, Beijing Abstract Dependency Grammar has been used by linguists as the basis of the syntactic components of their grammar formalisms. It has also been used in natural langauge parsing. In China, attempts have been made to use this grammar formalism to parse Chinese sentences using corpus-based techniques. This paper reviews the properties of Dependency Grammar as embodied in four axioms for the well-formedness conditions for dependency structures. It is shown that allowing multiple governors as done by some followers of this formalism is unnecessary. The practice of augmenting Dependency Grammar with functional labels is discussed in the light of building functional structures when the sentence is parsed. This will also facilitate semantic interpretion. 1 Introduction Dependency Grammar (DG) is a grammatical theory proposed by the French linguist Tesniere.[1] Its formal properties were then studied by Gaifman [2] and his results were brought to the attention of the linguistic community by Hayes.[3] Robinson [4] considered the possiblity of using this grammar within a transformation-generative framework and formulated four axioms for the well-formedness of dependency structures. Hudson [5] adopted Dependency Grammar as the basis of the syntactic component of his Word Grammar, though he allowed dependency relations outlawed by Robinson's axioms. Dependency Grammar is concerned directly with individual words.
    [Show full text]
  • Dependency Grammar
    Dependency Grammar Introduc)on Christopher Manning Dependency Grammar and Dependency Structure Dependency syntax postulates that syntac)c structure consists of relaons between lexical items, normally binary asymmetric relaons (“arrows”) called dependencies submitted Bills were by on Brownback ports Senator Republican and immigration of Kansas Christopher Manning Dependency Grammar and Dependency Structure Dependency syntax postulates that syntac)c structure consists of relaons between lexical items, normally binary asymmetric relaons (“arrows”) called dependencies submitted nsubjpass auxpass prep The arrows are Bills were by commonly typed prep pobj with the name of on Brownback pobj nn appos grammacal ports Senator Republican relaons (subject, cc conj prep preposi)onal object, and immigration of apposi)on, etc.) pobj Kansas Christopher Manning Dependency Grammar and Dependency Structure Dependency syntax postulates that syntac)c structure consists of relaons between lexical items, normally binary asymmetric relaons (“arrows”) called dependencies submitted The arrow connects a nsubjpass auxpass prep head (governor, Bills were by superior, regent) with a prep pobj dependent (modifier, on inferior, subordinate) Brownback pobj nn appos ports Senator Republican Usually, dependencies cc conj prep form a tree (connected, and immigration of acyclic, single-head) pobj Kansas Christopher Manning Dependency Grammar and Dependency Structure ROOT Discussion of the outstanding issues was completed . • Some people draw the arrows one way; some the other way! • Tesnière had them point from head to dependent… • Usually add a fake ROOT so every word is a dependent of precisely 1 other node Christopher Manning Dependency Grammar/Parsing History • The idea of dependency structure goes back a long way • To Pāṇini’s grammar (c.
    [Show full text]
  • Dependency Grammar Induction Via Bitext Projection Constraints
    Dependency Grammar Induction via Bitext Projection Constraints Kuzman Ganchev and Jennifer Gillenwater and Ben Taskar Department of Computer and Information Science University of Pennsylvania, Philadelphia PA, USA {kuzman,jengi,taskar}@seas.upenn.edu Abstract ing recent interest in transferring linguistic re- sources from one language to another via parallel Broad-coverage annotated treebanks nec- text. For example, several early works (Yarowsky essary to train parsers do not exist for and Ngai, 2001; Yarowsky et al., 2001; Merlo many resource-poor languages. The wide et al., 2002) demonstrate transfer of shallow pro- availability of parallel text and accurate cessing tools such as part-of-speech taggers and parsers in English has opened up the pos- noun-phrase chunkers by using word-level align- sibility of grammar induction through par- ment models (Brown et al., 1994; Och and Ney, tial transfer across bitext. We consider 2000). generative and discriminative models for Alshawi et al. (2000) and Hwa et al. (2005) dependency grammar induction that use explore transfer of deeper syntactic structure: word-level alignments and a source lan- dependency grammars. Dependency and con- guage parser (English) to constrain the stituency grammar formalisms have long coex- space of possible target trees. Unlike isted and competed in linguistics, especially be- previous approaches, our framework does yond English (Mel’cuk,ˇ 1988). Recently, depen- not require full projected parses, allowing dency parsing has gained popularity as a simpler, partial, approximate transfer through lin- computationally more efficient alternative to con- ear expectation constraints on the space stituency parsing and has spurred several super- of distributions over trees.
    [Show full text]
  • Three Dependency-And-Boundary Models for Grammar Induction
    Three Dependency-and-Boundary Models for Grammar Induction Valentin I. Spitkovsky Hiyan Alshawi Stanford University and Google Inc. Google Inc., Mountain View, CA, 94043 [email protected] [email protected] Daniel Jurafsky Stanford University, Stanford, CA, 94305 [email protected] Abstract DT NN VBZ IN DT NN [The check] is in [the mail]. We present a new family of models for unsu- | {z } | {z } pervised parsing, Dependency and Boundary Subject Object models, that use cues at constituent bound- Figure 1: A partial analysis of our running example. aries to inform head-outward dependency tree generation. We build on three intuitions that Consider the example in Figure 1. Because the are explicit in phrase-structure grammars but determiner (DT) appears at the left edge of the sen- only implicit in standard dependency formu- tence, it should be possible to learn that determiners lations: (i) Distributions of words that oc- may generally be present at left edges of phrases. cur at sentence boundaries — such as English This information could then be used to correctly determiners — resemble constituent edges. parse the sentence-internal determiner in the mail. (ii) Punctuation at sentence boundaries fur- ther helps distinguish full sentences from Similarly, the fact that the noun head (NN) of the ob- fragments like headlines and titles, allow- ject the mail appears at the right edge of the sentence ing us to model grammatical differences be- could help identify the noun check as the right edge tween complete and incomplete sentences. of the subject NP. As with jigsaw puzzles, working (iii) Sentence-internal punctuation boundaries inwards from boundaries helps determine sentence- help with longer-distance dependencies, since internal structures of both noun phrases, neither of punctuation correlates with constituent edges.
    [Show full text]
  • Towards an Implelnentable Dependency Grammar
    Towards an implementable dependency grammar Timo JKrvinen and Pasi Tapanainen Research Unit for Multilingual Language Technology P.O. Box 4, FIN-00014 University of Helsinki, Finland Abstract such as a context-free phrase structure gram- mar. This leads us to a situation where the Syntactic models should be descriptively ade- parsing problem is extremely hard in the gen- I quate and parsable. A syntactic description is eral, theoretical case, but fortunately parsable autonomous in the sense that it has certain ex- in practise. Our result shows that, while in gen- plicit formal properties. Such a description re- eral we have an NP-hard parsing problem, there i lates to the semantic interpretation of the sen- is a specific solution for the given grammar that tences, and to the surface text. As the formal- can be run quickly. Currently, the speed of the ism is implemented in a broad-coverage syntac- parsing system I is several hundred words per tic parser, we concentrate on issues that must I second. be resolved by any practical system that uses In short, the grammar should be empirically such models. The correspondence between the motivated. We have all the reason to believe structure and linear order is discussed. ! that if a linguistic analysis rests on a solid de- scriptive basis, analysis tools based on the the- 1 Introduction ory would be more useful for practical purposes. i The aim of this paper is to define a dependency We are studying the possibilities of using com- grammar framework which is both linguistically putational implementation as a developing and motivated and computationally parsable.
    [Show full text]
  • Dependency Parsing
    Dependency Parsing Sandra Kubler¨ LSA Summer Institute 2017 Dependency Parsing 1(16) Dependency Syntax ◮ The basic idea: ◮ Syntactic structure consists of lexical items, linked by binary asymmetric relations called dependencies. ◮ In the words of Lucien Tesni`ere: ◮ La phrase est un ensemble organis´e dont les ´el´ements constituants sont les mots. [1.2] Tout mot qui fait partie d’une phrase cesse par lui-mˆeme d’ˆetre isol´ecomme dans le dictionnaire. Entre lui et ses voisins, l’esprit aper¸coit des connexions, dont l’ensemble forme la charpente de la phrase. [1.3] Les connexions structurales ´etablissent entre les mots des rapports de d´ependance. Chaque connexion unit en principe un terme sup´erieur `aun terme inf´erieur. [2.1] Le terme sup´erieur re¸coit le nom de r´egissant. Le terme inf´erieur re¸coit le nom de subordonn´e. Ainsi dans la phrase Alfred parle [. ], parle est le r´egissant et Alfred le subordonn´e. [2.2] Dependency Parsing 2(16) Dependency Syntax ◮ The basic idea: ◮ Syntactic structure consists of lexical items, linked by binary asymmetric relations called dependencies. ◮ In the words of Lucien Tesni`ere: ◮ The sentence is an organized whole, the constituent elements of which are words. [1.2] Every word that belongs to a sentence ceases by itself to be isolated as in the dictionary. Between the word and its neighbors, the mind perceives connections, the totality of which forms the structure of the sentence. [1.3] The structural connections establish dependency relations between the words. Each connection in principle unites a superior term and an inferior term.
    [Show full text]
  • 1 Dependency Grammar
    Christopher Manning Dependency Grammar and Dependency Structure Dependency Dependency syntax postulates that syntac)c structure consists of Grammar relaons between lexical items, normally binary asymmetric relaons (“arrows”) called dependencies submitted Bills were by Introduc)on on Brownback ports Senator Republican and immigration of Kansas Christopher Manning Christopher Manning Dependency Grammar and Dependency Grammar and Dependency Structure Dependency Structure Dependency syntax postulates that syntac)c structure consists of Dependency syntax postulates that syntac)c structure consists of relaons between lexical items, normally binary asymmetric relaons between lexical items, normally binary asymmetric relaons (“arrows”) called dependencies relaons (“arrows”) called dependencies submitted submitted nsubjpass auxpass prep The arrow connects a nsubjpass auxpass prep head (governor, The arrows are Bills were by Bills were by prep pobj superior, regent) with a prep pobj commonly typed dependent (modifier, on on with the name of Brownback inferior, subordinate) Brownback pobj nn appos pobj nn appos grammacal ports Senator Republican ports Senator Republican relaons (subject, cc conj prep Usually, dependencies cc conj prep preposi)onal object, and immigration of form a tree (connected, and immigration of apposi)on, etc.) pobj acyclic, single-head) pobj Kansas Kansas Christopher Manning Christopher Manning Dependency Grammar and Dependency Structure Dependency Grammar/Parsing History • The idea of dependency structure goes back a long way • To Pāṇini’s grammar (c. 5th century BCE) • Basic approach of 1st millennium Arabic grammarians ROOT Discussion of the outstanding issues was completed . • Cons)tuency is a new-fangled inven)on • 20th century inven)on (R.S. Wells, 1947) • Modern dependency work o^en linked to work of L.
    [Show full text]
  • Dependency Grammars
    Introduction Phrase Structures Dependency Grammars Dependency Grammars Syntactic Theory Winter Semester 2009/2010 Antske Fokkens Department of Computational Linguistics Saarland University 27 October 2009 Antske Fokkens Syntax — Dependency Grammar 1/42 Introduction Phrase Structures Dependency Grammars Outline 1 Introduction 2 Phrase Structures 3 Dependency Grammars Introduction Dependency relations Properties of Dependencies Antske Fokkens Syntax — Dependency Grammar 2/42 Introduction Phrase Structures Dependency Grammars Outline 1 Introduction 2 Phrase Structures 3 Dependency Grammars Introduction Dependency relations Properties of Dependencies Antske Fokkens Syntax — Dependency Grammar 3/42 Introduction Phrase Structures Dependency Grammars Overview of this lecture In the next three lectures, we will discuss Dependency Grammars: Dependencies and Phrase Structures: basic objectives of syntactic analysis properties of phrase structure grammars Basic definitions of Dependencies What are dependencies? Example analyses Differences and Relations between Dependencies and Phrase Structures Syntactic Theory/CL and Dependencies Meaning to Text Theory Prague Dependency Treebank Antske Fokkens Syntax — Dependency Grammar 4/42 Introduction Phrase Structures Dependency Grammars Syntactic Analysis Syntax investigates the structure of expressions Some reasons for performing syntactic analysis: To understand something about how language works To analyze language: how can we relate speech/written text to meaning? Antske Fokkens Syntax — Dependency Grammar
    [Show full text]
  • Enhancing Freeling Rule-Based Dependency Grammars with Subcategorization Frames
    Enhancing FreeLing Rule-Based Dependency Grammars with Subcategorization Frames Marina Lloberes Irene Castellon´ Llu´ıs Padro´ U. de Barcelona U. de Barcelona U. Politecnica` de Catalunya Barcelona, Spain Barcelona, Spain Barcelona, Spain [email protected] [email protected] [email protected] Abstract explicitly relate the improvements in parser per- formance to the addition of subcategorization. Despite the recent advances in parsing, This paper analyses in detail how subcatego- significant efforts are needed to improve rization frames acquired from an annotated cor- the current parsers performance, such as pus and distributed among fine-grained classes in- the enhancement of the argument/adjunct crease accuracy in rule-based dependency gram- recognition. There is evidence that verb mars. subcategorization frames can contribute to The framework used is that of the FreeLing parser accuracy, but a number of issues re- Dependency Grammars (FDGs) for Spanish and main open. The main aim of this paper is Catalan, using enriched lexical-syntactic informa- to show how subcategorization frames ac- tion about the argument structure of the verb. quired from a syntactically annotated cor- FreeLing (Padro´ and Stanilovsky, 2012) is an pus and organized into fine-grained classes open-source library of multilingual Natural Lan- can improve the performance of two rule- guage Processing (NLP) tools that provide linguis- based dependency grammars. tic analysis for written texts. The FDGs are the core of the FreeLing dependency parser, the Txala 1 Introduction Parser (Atserias et al., 2005). Statistical parsers and rule-based parsers have ad- The remainder of this paper is organized as fol- vanced over recent years.
    [Show full text]
  • Introduction to Dependency Grammar Dependency Grammar Overview
    Introduction Dependency Grammar Introduction to Dependency Grammar ◮ Not a coherent grammatical framework: wide range of different kinds of DG ◮ just as there are wide ranges of ”generative syntax” HS Current Approaches to Dependency Parsing ◮ Different core ideas than phrase structure grammar SS 2010 ◮ We will base a lot of our discussion on Mel’ˇcuk (1988) Dependency grammar is important for those interested in CL: ◮ Increasing interest in dependency-based approaches to Thanks to Markus Dickinson, Joakim Nivre and Sandra Kubler.¨ syntactic parsing in recent years (e.g., CoNLL-X shared task, 2006) Introduction to Dependency Grammar 1(29) Introduction to Dependency Grammar 2(29) Introduction Introduction Overview: constituency Overview: dependency Syntactic structure consists of lexical items, linked by binary asymmetric relations called dependencies. (1) Small birds sing loud songs obj What you might be more used to seeing: nmod sbj nmod S Small birds sing loud songs NP VP sing Small birds sing NP sbj obj birds songs loud songs nmod nmod small loud Introduction to Dependency Grammar 3(29) Introduction to Dependency Grammar 4(29) Introduction Introduction Constituency vs. Relations Simple relation example ◮ DG is based on relationships between words, i.e., dependency relations For the sentence John loves Mary, we have the relations: ◮ A B means A governs B or B depends on A ... → ◮ Dependency relations can refer to syntactic properties, ◮ loves subj John semantic properties, or a combination of the two → ◮ loves Mary Some variants of DG separate syntactic and semantic relations obj → → by representing different layers of dependencies Both John and Mary depend on loves, which makes loves the head, ◮ These relations are generally things like subject, or root, of the sentence (i.e., there is no word that governs loves) object/complement, (pre-/post-)adjunct, etc.
    [Show full text]