The Framenet Tagset for Frame-Semantic and Syntactic Coding of Predicate-Argument Structure
Total Page:16
File Type:pdf, Size:1020Kb
The FrameNet tagset for frame-semantic and syntactic coding of predicate-argument structure Christopher JOHNSON Charles J. FILLMORE FrameNet Project, FrameNet Project, International Computer Science Institute International Computer Science Institute currently: Soliloquy, Inc. 1947 Center St., Suite 600 255 Park Ave S Berkeley, CA, USA 94704 New York NY 10010 [email protected] [email protected] correspond to the FEs of the frame or frames Abstract associated with that word. This paper presents the syntactic and semantic Frames are organized in a structure that can tags used to annotate predicate-argument be modeled by an inheritance lattice. They range structure in the Berkeley FrameNet Project. It from being very general, like case frames briefly explains the theory of frame semantics (Fillmore 1968) or other simple event schemas on which semantic annotation is based, underlying thematic roles, to being lexically discusses possible applications of FrameNet specific. The most interesting frames are those at annotation, and compares FrameNet to other an intermediate level of specificity which prominent iexical resources. encapsulate generalizations about the semantic and syntactic properties of word classes that are Introduction overlooked by thematic role analyses. This paper presents the tagset used to annotate 1.2 An example the predicate-argument structures of English One example is the commercial transaction verbs, adjectives, and nouns in the Berkeley frame, which includes the following FEs: Buyer, FrameNet Project (NSF IR]-9618838, "Tools for Seller, Goods, and Money. The following Lexicon Building"), a corpus-based sentence schemas show how these FEs are computational lexicography project based on the expressed differently by different Commercial theory of frame semantics (see Fillmore 1982). Transaction words: It briefly explains the theoretical background and Buyer bought Goods from Seller for Money shows how frame-semantic annotation creates lexicographic generalizations that are not Buyer paid Seller Money for Goods possible with more traditional linguistic Buyer paid Money to Seller for Goods approaches to argument structure based on thematic roles. Seller sold goods to Buyer for Money Seller sold Buyer Goods for Money 1 The FrameNet Project Buyer spent Money on Goods 1.1 What is frame semantics? (Seller not expressed) Frame semantics characterizes the semantic and Goods cost Buyer Money syntactic properties of predicating words by (Seller not expressed) relating them to semantic frames. These are schematic representations of situations involving Different words assign the Commercial various participants, props, and other conceptual Transaction FEs to different Phrase Types (PTs) roles, each of which is a frame element (FE). and Grammatical Functions (GFs). For example, The semantic arguments of a predicating word buy treats the Buyer as an NP subject (i.e. 56 External Argument) and the Seller as a PP annotations consist of XML markup created complement headed by from, while sell treats the using the Alembic Workbench software from the Buyer as a direct object or a PP complement Mitre Corporation). PT and GF information is headed by to and the Seller as a subject. The added by an automatic phrase labeler developed purpose of FrameNet annotation is to gather by the technical team. information like this about the grammatical realization of FEs for various frames. 2 The FrameNet tagset 1.2 Project goals 2.1 Phrase Types The goals of the project are to create a database Below are the FrameNet Phrase Types. This is of information about English words and the intended to be a comprehensive list of the types frames they inherit, provide annotated corpus of syntactic constituent that can express FEs of examples that illustrate how information about major predicating words of English (nouns, FEs is expressed by complements and modifiers verbs and adjectives). These constituents occur of these words in attested sentences, and either as arguments or as lexicographically contribute to a suite of software tools to support relevant modifiers. In constructing this list, we annotation, database building, and database made extensive reference to the Comlex syntax interface. (Meyers et al. 1995). An important part of FrameNet work is the annotation of corpus sentences with frame- 2.1.1 Noun phrase types semantic information. We use the British NP Noun phrase (the witness) National Corpus (BNC), because no equally N Non-maximal nominal (personal chat) comprehensive corpus exists for American Poss Possessive NP (the child's decision) English (though efforts are underway to create a There Expletive there (there was a fight) comparable American National Corpus--see It Expletive it (it's nice that you came) Fillmore et al. 1998). Each annotated example 2.1.2 Prepositional phrase types sentence shows argument-structure properties of one target verb, adjective or noun. The main PP Prepositional phrase (look at me) task of annotation is to tag the arguments (and Ping PP with gerundive object (keep from occasionally modifiers) of the target with the laughing) names of the FEs that they express. A secondary Part Particle(look it up) task is to mark other lexicographically relevant 2.1.3 Verb phrase types elements, such as support verbs of target nouns, VPfin Finite verb phrase (we atefish) and certain non-meaningful elements that VPbrst Bare stem VP (let us eatfish) indicate lexicographically relevant grammatical VPto To-marked infinitive VP (we want to eat constructions (e.g. extraposition and the sh) existential construction). (See Fillmore & Atkins VPwh WH-VP (we know how to win) 1998 on lexicographic relevance.) VPing Gerundive VP (we like winning) Here is an example sentence from the BNC showing the annotation properties of the 2.1.4 Complement clause types complements of the target verb tell: Sfin Finite clause (it's nice thatyou came) (1) [Maltravers (Speaker, NP, Ext)] decided not Swh WH-clause (ask who won) to tell Sif lflwhether clause (ask if we won) [Stephen (Addressee, NP, Obj)] Sing Gerundive clause (we saw them running) [about the inscription in the Le Carr6 book Sto To-marked clause (we want them to win) (Topic, PPabout, Comp)]. Sforto For-to-marked clause (we would like for them to win) The annotated constituents appear in brackets. Sbrst Bare stem clause (we insist that they Following each constituent is a set of win) parentheses containing the FE, PT and GF associated with that constituent (actual 57 In certain cases FrameNet marks as two 2.3 Nonexpression of FEs constituents what are treated as "small clauses" Besides being expressed by the PTs and GFs in some analyses. For example, in the sentence I listed above, FEs may remain unexpressed under consider Pat a genius, Pat and a genius would the different conditions discussed below (see be tagged separately. Fillmore 1986). 2.1.50therphrase types 2.3.1 Indefinite Null lnstantiation AjP Adjective phrase (an interesting idea) In sentence (1), the FEs Message, Medium and AdvP Adverb phrase (you put that nicely) Code are not expressed. It can however be Quo Quote ("Indeed, "she said) inferred that there must be a Message, Medium 2.2 Grammatical Functions and Code in a communicative event of the type described by this sentence. These FEs, while Below is a list of the FrameNet GFs. This list is conceptually present in this sentence, are intended to characterize all the grammatical optionally expressed, and there are no particular contexts relative to English verbs, adjectives and restrictions on their nonexpression. This is called nouns that are regularly occupied by FE- Indefinite Null Instantiation (INI). expressing constituents. In this list, we do not make the traditional distinction between obliques 2.3.2 Definite Null Instantiation and arguments/complements (the former are With some words, an FE may be unexpressed in simply PP complements). a sentence only if it assumed that the person to 2.2.1 FrameNet grammatical functions whom the sentence is addressed has specific information about the frame element in question. Ext External Argument This kind of non-expression is called Definite (Argument outside phrase headed by target verb, Null Instantiation (DNI). For example, the adjective or noun) words tell, inform and notify allow a Message to Comp Complement be omitted when it is clear what the Message is, (Argument inside phrase headed by target verb, e.g. How did I know you won? Because Pat adjective or noun) already tom me. Mod Modifier 2.3.3 Constructionally licensed null (Non-argument expressing FE of target verb, instantiation adjective or noun) Certain grammatical constructions, such as Passive and Imperative, allow an External Xtrap Extraposed Argument FE to be unexpressed, e.g. Harsh (Verbal or clausal complement extraposed to end things were said, Tell me about yourself. This is of VP) Constructionally Licensed Null Instantiation Obj Object (CNI). (Post-verbal argument; passivizable or does not FrameNet annotation marks prominent alternate with PP) unexpressed FEs as well as FEs that are expressed overtly. In order to achieve this, we Pred Predicate place the symbols INI, DNI and CNI (Secondary predicate complement of target verb immediately after the target word in every or adjective) annotated sentence, and place the appropriate FE Head Head tag on the appropriate symbol. For example, the (Head nominal in attributive use of target case of DNI discussed above would be annotated adjective) as follows: Gen Genitive Determiner (2) [Pat (Speaker)]