Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI-16)

Fine-Grained Semantic Conceptualization of FrameNet

Jin-woo Park (POSTECH, Republic of Korea, [email protected]), Seung-won Hwang (Yonsei University, Republic of Korea, [email protected]), Haixun Wang (Facebook Inc., USA, [email protected])

Abstract

Understanding verbs is essential for many natural language tasks. To this end, large-scale lexical resources such as FrameNet have been manually constructed to annotate the semantics of verbs (frames) and their arguments (frame elements or FEs) in example sentences. Our goal is to "semantically conceptualize" example sentences by connecting FEs to knowledge base (KB) concepts. For example, connecting the Employer FE to the company concept in the KB enables the understanding that any (unseen) company can also be an FE example. However, a naive adoption of existing KB conceptualization techniques, focusing on scenarios of conceptualizing a few terms, cannot 1) scale to many FE instances (an average of 29.7 instances for all FEs) and 2) leverage the interdependence between instances and concepts. We thus propose a scalable k-truss clustering and a Markov Random Field (MRF) model leveraging the interdependence between concept-instance, concept-concept, and instance-instance pairs. Our extensive analysis with real-life data validates that our approach improves not only the quality of the identified concepts for FrameNet, but also that of applications such as selectional preference.

1 Introduction

Overview. Understanding verbs is critical to understanding text. For example, the verb "employ" is associated with two different semantic classes described as the USING and the EMPLOYING frames:

• USING: [The few states]_Agent [employed]_LU [chemical weapons]_Instrument.
• EMPLOYING: [Hong Kong manufacturers]_Employer [employ]_LU [2m workers]_Employee [in China]_Place.

FrameNet (Fillmore, Johnson, and Petruck 2003) is a lexical resource that describes semantic frames, by annotating verbs(1) (i.e., lexical units or LUs) and their arguments (i.e., frame elements or FEs) that evoke frames. Table 1 shows all of the FE instances for the EMPLOYING frame in FrameNet. This resource is critical for human understanding of verbs and for training applications such as semantic parsers (Das, Martins, and Smith 2012) or question answering systems (Pizzato and Mollá 2008). Yet, as shown in Table 1, the instances in FrameNet, being just small samples for the FEs, are often insufficient for supporting machine understanding.

We thus focus on "semantically conceptualizing" FE instances. Take the Employer FE for example. Our goal is to infer that any instance of the company concept is equally plausible as the sample companies mentioned in FrameNet. A closely related, well-known task is "selectional preference", computing "plausibility" scores for arbitrary instances (Resnik 1996). That is, our task is to find selectional preferences based on the concepts of a knowledge base (KB). With such conceptualization, we can infer that "Shell hires 100 workers" is plausible, as Shell is an instance of the company concept in the KB.

Related work. To enable this inference on unobserved sentences, existing work uses WordNet concepts (Tonelli et al. 2012) or topic modeling (Ritter, Mausam, and Etzioni 2010) to relate the unobserved instances to the observed instances. For example, WordNet concepts identified for FEs (Tonelli et al. 2012) can infer the plausibility of unobserved instances if they belong to the same concept as the observed instances. However, WordNet concepts, being restricted to about 2,000 concepts linked with Wikipedia pages (Bryl et al. 2012), are often too coarse for our purpose. For example, the WordNet concept container, enumerating generic containers including envelope or wastebasket, is too general for the Container FE, as it inaccurately predicts wastebasket to be as plausible as a cooking utensil. More desirably, we may consider using a knowledge base that materializes concepts of finer granularity, e.g., cooking container for the COOKING CREATION frame. In addition, a desirable KB should cover sufficient instances for each concept.

To argue that WordNet is limited in terms of coverage, we analyzed English Gigaword Fifth Edition (LDC2011T07), consisting of 180 million sentences from English news. In this corpus, WordNet covers only 33% of instances, as shown in Table 2. In contrast, automatically harvested KBs such as Probase cover many instances including named entities (e.g., Microsoft and British Airways), which are not covered by WordNet.

Copyright (c) 2016, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
(1) Although FrameNet annotates verbs, nouns, adjectives, and adverbs as lexical units, this paper focuses on verbs.

Table 1: Examples of FE instances in the EMPLOYING frame from FrameNet examples.

Frame      FE        Instances
EMPLOYING  Employer  british airways, factory, plant, police, housekeeper, bank, firm, executive, company, industry, airline, institute, manufacturer, defendant, woman, paul, mason, man, sister, he, we, i, she, you
           Employee  person, worker, man, staff, specialist, you, gardener, someone, woman, consultant, actor, winning, contractor, stave, architect, graduate, builder, wife, artist, detective, tradesman, westwood, plaintiff, clark, labourers, outside agency, diver, officer, mark thatcher, vet, her, team
           Position  cleaner, guide, assistant, manager, agent, gardener, aircrew

Proposed method. To overcome the concept and instance sparsity of manually-built KBs, we utilize Probase (Wu et al. 2012)(2), which contains millions of fine-grained concepts automatically harvested from billions of web documents. This gives us the following two benefits observed in prior research:

• Fine-grained conceptualization covers 3 million concepts observed on the Web.
• More than 75% of the instances of a raw corpus (Table 2) were already observed in Probase, and its instance coverage can grow indefinitely by automatic harvesting from a larger corpus.

(2) Dataset publicly available at http://probase.msra.cn/dataset.aspx

A naive baseline would be adopting existing conceptualization techniques on FE instances (Song et al. 2011). Bayes builds on a naive Bayes model for finding the concepts with the highest posterior for the instances. CL+Bayes extends this model to cluster before conceptualization to address: 1) heterogeneity of concept, e.g., Employer can be both company and person, and 2) ambiguity of instance, e.g., apple. However, CL+Bayes targets the scenario of conceptualizing a few words, e.g., a multi-word query, and cannot 1) scale for FE instances (an average of 29.7 example instances for all FEs in FrameNet) and 2) leverage the interdependence between concept-instance, concept-concept, and instance-instance pairs.

In contrast, we propose a new model addressing these challenges. First, while existing clustering builds on clique finding, which is NP-hard and sensitive to missing observations, we propose an efficient clustering that is robust to some missing observations. Second, we extend a Bayesian model with a one-way directed graph between concept and instance, P(e|c), to model the interdependence between concept-concept, instance-instance, and concept-instance pairs. Our experimental evaluations show the accuracy of both conceptualization and selectional preference using real-life datasets.

2 Preliminary

In this section, we introduce two lexical knowledge bases, FrameNet and Probase (Wu et al. 2012).

2.1 FrameNet

FrameNet (Fillmore, Johnson, and Petruck 2003) is a database of frame semantics (Fillmore 1982). It defines lexical units (LUs) as the pairing of words with their meanings (frames). In our running examples, employ belongs to two frames: USING and EMPLOYING. In each frame, the verb employ has its frame elements (FEs), which can be direct syntactic dependents. The USING frame, for example, contains two FEs, Agent and Instrument, whose relationships are described in example sentences. FrameNet contains more than 190,000 annotated sentences covering about 12,000 LUs and 10,000 FEs.(3)

(3) We used the FrameNet 1.5 dataset.

Table 2: The coverage of FrameNet, WordNet, and Probase. Nouns of several dependents such as nsubj, dobj, pobj, and iobj of the sentences from the Gigaword corpus were used for this coverage test.

           FrameNet  WordNet  Probase
Coverage   25%       33%      75%

2.2 Probase

Probase (Wu et al. 2012) is a probabilistic lexical knowledge base of isA relationships that are extracted from billions of web documents using syntactic patterns such as Hearst patterns (Hearst 1992). For example, from "... mobile company such as a microsoft ...", we derive the isA relationship microsoft isA mobile company. Probase contains isA relationships among 3 million concepts and 40 million instances. Furthermore, each isA relationship is associated with a variety of probabilities, weights, and scores, including the typicality scores:

• P(c|e) denotes how typical concept c is for instance e (i.e., the instance typicality score). It is computed as:

  P(c|e) = n(e,c) / Σ_{c'} n(e,c')    (1)

  where n(e,c) is the frequency observed from e isA c in the corpus. Typically, P(c|e) gives higher scores to general concepts than to specific concepts. For example, we have P(food|pasta) = 0.177 > P(italian food|pasta) = 0.014 in Probase.

• P(e|c) denotes how typical instance e is for concept c (i.e., the concept typicality score). It is computed as:

  P(e|c) = n(e,c) / Σ_{e'} n(e',c)    (2)

  P(e|c) typically gives higher scores to specific concepts than to general concepts. For example, we have P(pasta|italian food) = 0.184 > P(pasta|food) = 0.013 in Probase.
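The two typicality scores (Eqs. 1-2) can be computed directly from isA frequency counts n(e,c). A minimal sketch with toy counts (illustrative only, not actual Probase frequencies):

```python
# Toy isA frequency counts n(e, c); actual Probase counts differ.
n = {
    ("pasta", "food"): 880,
    ("pasta", "italian food"): 70,
    ("pizza", "food"): 900,
    ("pizza", "italian food"): 310,
    ("bread", "food"): 5000,
    ("rice", "food"): 5000,
}

def p_c_given_e(c, e):
    """Instance typicality P(c|e) = n(e,c) / sum_c' n(e,c')  (Eq. 1)."""
    denom = sum(v for (ei, _), v in n.items() if ei == e)
    return n.get((e, c), 0) / denom

def p_e_given_c(e, c):
    """Concept typicality P(e|c) = n(e,c) / sum_e' n(e',c)  (Eq. 2)."""
    denom = sum(v for (_, ci), v in n.items() if ci == c)
    return n.get((e, c), 0) / denom

# P(c|e) favors the general concept; P(e|c) favors the specific one.
assert p_c_given_e("food", "pasta") > p_c_given_e("italian food", "pasta")
assert p_e_given_c("pasta", "italian food") > p_e_given_c("pasta", "food")
```

Note how the asymmetry arises purely from the denominators: a general concept like food has a large instance total, which depresses P(e|c) even when n(e,c) itself is large.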

3 Existing Conceptualizations

In this section, we introduce existing methods of conceptualizing terms into Probase concepts and then discuss their limitations.

3.1 Problem Statement

Conceptualization is to identify representative concepts associated with the given instances. More formally, let E = {e1, e2, ..., en} be the given instances, and let C = {c1, c2, ..., cm} be the concepts from a knowledge base. The goal of conceptualization is to estimate a representative score of each c ∈ C for E.

[Figure 1: Overview of conceptualization baselines. (a) A bipartite graph G containing instances (white), concepts (gray), and edges between them; (b) G clustered by CL+Bayes.]

3.2 Conceptualization Baselines

Probase, with its isA relationships and probabilities, enables the conceptualization of words and phrases. In other words, Probase maps a set of words and phrases to their concepts. We review several conceptualization mechanisms below.

Baseline I: Bayes. Consider the following instances for the Employer FE in the EMPLOYING frame: E_empr = {british airways, factory, plant, police, housekeeper}. Good representative concepts in Probase would be company, organization, manufacturer, worker, and occupation. The Bayes method (Song et al. 2011) performs conceptualization by finding the concept that has the largest posterior probability under a naive Bayes model for the given instances. Formally, Bayes finds the representative concepts as follows:

  P(c|E) = P(E|c)P(c) / P(E) ∝ P(c) Π_{e∈E} P(e|c)    (3)

where P(c) is proportional to the observed frequency of concept c in the corpus.

Baseline II: CL+Bayes. CL+Bayes extends Bayes to cluster before conceptualization, to address: 1) heterogeneity of concept, e.g., Employer conceptualizes to company and person, where Bayes identifies too general a concept covering both, e.g., object; and 2) ambiguity of instance, such as "plant", for which Bayes conceptualizes to both company and organism (Fig. 1a).

To address the above problems, CL+Bayes (Song et al. 2011) first abstracts this problem as a bipartite graph where nodes represent instances and concepts, and edges represent isA relationships. More specifically, G = ((U, V), A) is a weighted bipartite graph where U is a set of instances, V is a set of concepts, and A is a set of edges. An edge a(e ∈ U, c ∈ V) ∈ A indicates that c is a concept of an instance e, and its weight is set to the concept typicality score, P(e|c). Fig. 1(a) shows the constructed graph for our running example E_empr.

Given the graph, CL+Bayes then clusters semantically-close instances before conceptualization. This process is shown in Fig. 1(b). Given G, CL+Bayes finds the densest k-disjoint cliques (Ames 2012) that maximize edge weights. The instances connected in a clique are treated as a cluster.

Each cluster is then conceptualized by Eq. 3. In our running example, CL+Bayes first divides E_empr into three clusters of instances: {british airways, factory}, {plant}, and {police, housekeeper}. Since worker and occupation are supported by instances such as police and housekeeper in Fig. 1(b), these two concepts form a dense cluster supported by multiple common instances. In contrast, band is pruned out, i.e., cannot be a member of a clique, as it is supported by only a single instance. By clustering, CL+Bayes can solve the two limitations of Bayes discussed above: 1) heterogeneity: we cluster semantically-close instances such as {british airways, factory}, {plant}, and {police, housekeeper}; 2) ambiguity: we prune out unrelated concepts such as band, whose probability becomes 0 after clustering. After clustering, we apply Bayes on each cluster (Eq. 3).

3.3 Limitations

We observe the following limitations for applying this baseline to our target problem:

• L1: Strict clustering. Finding the densest k-disjoint cliques is provably NP-hard (Ames 2012) and also sensitive to missing observations. Having synonymous concepts (manufacturer and producer) often leads observations of isA relationships of instances to divide between the two (Fig. 1b). Meanwhile, a single missing observation between a concept and an instance breaks a clique and affects the clustering.

• L2: Independence Relationships. CL+Bayes uses a one-way directed graph from concept to instance, P(e|c). Meanwhile, other neighborhood relationships, such as concept-concept, instance-instance, and instance-to-concept pairs, are not considered.

We thus address these limitations by a clustering method and a probability estimation model in Sec. 4.
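The Bayes baseline of Eq. 3 can be sketched as a log-space naive Bayes scorer. The priors, typicality values, and smoothing constant below are toy assumptions, not Probase values or the authors' implementation:

```python
import math

# Assumed toy parameters: concept priors P(c) and concept typicality P(e|c).
prior = {"company": 0.03, "organism": 0.01}
typicality = {
    ("british airways", "company"): 0.02,
    ("factory", "company"): 0.01,
    ("plant", "company"): 0.005,
    ("plant", "organism"): 0.03,
}
EPS = 1e-6  # small smoothing score for unobserved (e, c) pairs

def bayes_score(c, instances):
    """log P(c|E) up to a constant: log P(c) + sum_e log P(e|c)  (Eq. 3)."""
    return math.log(prior[c]) + sum(
        math.log(typicality.get((e, c), EPS)) for e in instances)

E = ["british airways", "factory", "plant"]
best = max(prior, key=lambda c: bayes_score(c, E))
# company covers all three instances, so it wins despite plant's ambiguity.
assert best == "company"
```

Working in log space avoids underflow when E contains many instances; the smoothing term EPS plays the role of the Laplace correction discussed in Sec. 4.1.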

4 FrameNet Conceptualization

This section proposes a new conceptualization, Truss+MRF, to overcome the two limitations of the existing work. For L1, Sec. 4.1 first proposes a concept synonym smoothing to reduce missing observations and then proposes an efficient and effective clustering method. For L2, Sec. 4.2 constructs an undirected probabilistic graph and proposes a new concept probability estimation model to consider the relationships between features on the undirected probabilistic graph.

4.1 Clustering: Truss

Concept synonym smoothing. To overcome L1, Bayes implements a classic Laplace smoothing (Lidstone 1920), adding a small score to missing edges:

  P(e|c) = (P(e|c) + 1) / (|U| + |V|)    (4)

where |U| is the number of instances and |V| is the number of concepts in Probase.

Unlike this concept-agnostic smoothing, we propose concept synonym smoothing to reduce missing observations. We obtain concept synsets from (Li et al. 2013), grouping synonymous Probase concepts into a cluster by a k-Medoids clustering algorithm based on the shared instances (instance membership). An example of such a concept cluster would be {job, occupation, professional, worker}, sharing many common instances. We aggregate observations for all concepts with the same semantics using this concept cluster. However, although this concept synonym smoothing reduces missing observations, the graph may still have missing observations.

Relaxation of cluster. We then propose an efficient approximation of cliques for clustering, with polynomial complexity of O(n^3), which is also more robust to missing observations. Our algorithm, Truss, finds a k-truss, a relaxation of a clique, motivated by a natural observation of social cohesion (Cohen 2008) that two nodes connected by an edge should also share many common neighbors. In a k-truss, an edge is legitimate only if supported by at least k-2 common neighbors of its two end vertices. Formally, a k-truss is a one-component subgraph such that each edge is supported by at least k-2 common neighbors. For example, the 3-trusses of a graph include all edges contained in at least one triangle (Fig. 2). With this property of the k-truss, incomplete subgraphs including several missing edges can be identified.

Since G is a bipartite graph between instances and concepts (Fig. 1a), we cannot apply k-trusses on G immediately. We thus take a two-phase approach. First, we enrich G into G* by creating links between concepts based on the k-truss intuition. Second, we identify heterogeneous concept clusters by running a k-truss algorithm on G*.

For the first phase, we connect two concepts ci and cj if they have many common instance neighbors. We denote the instance neighbors of concept c as N^I(c), such that common instance neighbors are denoted as N^I(ci) ∩ N^I(cj). Formally, an edge a(ci,cj) is inserted when |N^I(ci) ∩ N^I(cj)| >= kI - 2, where N^I(c) is the set of instance neighbors linked to the concept. For example, in Fig. 3(a), when kI = 4, {company} and {organization} are linked to each other because they have two common instance neighbors (i.e., |N^I(c1) ∩ N^I(c2)| >= 2). In addition, concept vertices c whose |N^I(c)| is less than kI - 2 are removed, to eliminate ambiguous concepts (e.g., organism and band) supported by few instances.

For the second phase, Alg. 1 describes how to extract k-trusses for the concept nodes of G*: an edge a(ci,cj) is removed when |N^C(ci) ∩ N^C(cj)| < kC - 2, where N^C(c) is the set of concept neighbors. After this process, we identify the connected components of concept nodes. Each such concept component (and its connected instances) forms a cluster, which will be conceptualized into KB concepts.

Algorithm 1: Truss(G*)
  input: A graph G* consisting of a set of instances U = {e1, ..., en}, a set of concepts V = {c1, ..., cm}, and a set of edges A = {a1, ..., al}
  1 while isChanged() do
  2   for each edge a(ci,cj) ∈ A do
  3     if |N^C(ci) ∩ N^C(cj)| < kC - 2 then
  4       remove a(ci,cj) from A

From our running example, we identify two clusters (i.e., trusses), as shown in Fig. 3(b): {british airways, factory, plant, company, organization, manufacturer, producer} and {police, housekeeper, worker, occupation}. Each concept in the clusters is supported by at least two instance neighbors (when kI = kC = 4). Otherwise, the remaining concepts, which are not supported by at least two instance neighbors, are removed (e.g., organism and band).

[Figure 3: Overview of our conceptualization graph. (a) G after clustering concept synonyms; (b) G clustered by Truss+MRF.]
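The two-phase construction above can be sketched as follows. This is a simplified re-implementation, not the authors' code: the function name `truss_clusters`, the toy bipartite graph, and the treatment of surviving singleton concepts are assumptions for illustration.

```python
from itertools import combinations

def truss_clusters(inst_of, k_i=4, k_c=4):
    """Sketch of the two-phase Truss clustering.

    inst_of: concept -> set of instance neighbors N^I(c) in the bipartite graph G.
    Phase 1: drop concepts supported by fewer than k_i - 2 instances, then
             link concepts sharing at least k_i - 2 instances (building G*).
    Phase 2: iteratively remove concept-concept edges with fewer than
             k_c - 2 common concept neighbors (k-truss pruning); the
             remaining connected components are the clusters.
    """
    # Phase 1: keep well-supported concepts and build the concept graph G*.
    concepts = {c for c, es in inst_of.items() if len(es) >= k_i - 2}
    edges = {frozenset(p) for p in combinations(concepts, 2)
             if len(inst_of[p[0]] & inst_of[p[1]]) >= k_i - 2}
    # Phase 2: k-truss pruning on G*.
    while True:
        nbrs = {c: {d for e in edges if c in e for d in e if d != c}
                for c in concepts}
        weak = {e for e in edges if len(nbrs[min(e)] & nbrs[max(e)]) < k_c - 2}
        if not weak:
            break
        edges -= weak
    # Clusters = connected components of the pruned concept graph.
    clusters, seen = [], set()
    for c in concepts:
        if c in seen:
            continue
        comp, stack = set(), [c]
        while stack:
            u = stack.pop()
            if u in comp:
                continue
            comp.add(u)
            stack.extend(d for e in edges if u in e for d in e if d != u)
        seen |= comp
        clusters.append(comp)
    return clusters

# Toy graph: four synonymous employer concepts and a weakly supported one.
inst_of = {
    "company":      {"british airways", "factory", "plant", "industry"},
    "organization": {"british airways", "factory", "plant", "industry"},
    "manufacturer": {"british airways", "factory", "plant", "industry"},
    "producer":     {"british airways", "factory", "plant", "industry"},
    "band":         {"plant"},  # single supporting instance -> removed in phase 1
}
assert truss_clusters(inst_of) == [{"company", "organization", "manufacturer", "producer"}]
```

Unlike exact densest-clique search, each pruning pass only inspects edges and their common neighborhoods, which keeps the whole procedure polynomial and tolerant of a few missing edges.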

4.2 Posterior Probability Estimation using MRF

Markov Random Field. To overcome L2, we use an MRF model (undirected graphical model) to model neighborhood relationships such as concept-instance, concept-concept, and instance-instance pairs. For this purpose, an undirected probabilistic graph is constructed for each cluster. We first propose a new edge weighting on each graph to consider the two-way direction between instance and concept, and then connect the variables to each other for modeling full dependencies. Finally, we propose a new probability estimation model for concepts from this graph.

Constructing MRFs. Inspired by (Wang, Wang, and Hu 2014), we first overcome L2 by weighting an edge between instance e ∈ U and concept c ∈ V as the combination of P(c|e) and P(e|c) (i.e., the two directions between concept and instance). Specifically, we multiply P(c|e) and P(e|c) to represent a roundtrip random walk probability between e and c. Formally, for an edge a(e,c), the edge weight w(a(e,c)) is computed by multiplying the concept and instance typicality:

  w(a(e,c)) = 1                  if c = e
            = P(c|e) × P(e|c)    if c ≠ e    (5)

c = e represents the case when the example already indicates the most appropriate concept. For example, in Table 1, "company" is both an appropriate concept and an observed instance for the Employer FE. In this case, we avoid conceptualizing it to business with high P(c|e) × P(e|c). Otherwise (i.e., c ≠ e), with this edge score, we can give low scores to concepts that are either too general or too specific. Table 3 contrasts examples of the concepts ranked by P(c|e), P(e|c), and P(c|e) × P(e|c), respectively. In particular, for the instance airplane, P(c|e) conceptualizes to technology and place, which are too general, and P(e|c) conceptualizes to bulk-carrying vehicle and real time system, which are too specific. On the other hand, P(c|e) × P(e|c) leads to more appropriate concepts such as vehicle, aircraft, transportation vehicle, and mobile source.

Table 3: Examples of top-4 concepts ranked by P(c|e), P(e|c), and ours (i.e., w(a(e,c)) in Eq. 5), respectively.

Instance   Score    Top-4 concepts
airplane   P(c|e)   vehicle, technology, place, aircraft
           P(e|c)   transportation vehicle, bulk-carrying vehicle, mobile source, real time system
           ours     vehicle, aircraft, transportation vehicle, mobile source
coca-cola  P(c|e)   company, brand, corporation, client
           P(e|c)   soft drink, global brand, marketer, corporate giant
           ours     corporation, soft drink, company, global brand

After weighting edges, we add missing edges between concept-instance, instance-instance, and concept-concept pairs to consider full dependence. More specifically, three edge types are defined: concept-instance a(e,c), instance-instance a(ei,ej), and concept-concept a(ci,cj). The weight of a missing concept-instance edge, w(a(e,c)), is computed as the average, over the instance neighbors of e, of the instance neighbor similarity and the edge weights between c and those neighbors (Bliss et al. 2014):

  w(a(e,c)) = Σ_{ei ∈ n^I(e)} ( w(a(e,ei)) × w(a(ei,c)) ) / |n^I(e)|    (6)

where |n^I(e)| is the number of instance neighbors of e in the same cluster. The weights of a missing instance-instance edge, w(a(ei,ej)), and of a concept-concept edge, w(a(ci,cj)), are computed as the cosine similarity of their concept or instance vectors (Adamic and Adar 2001):

  w(a(ei,ej)) = Σ_m ( w(a(ei,cm)) × w(a(ej,cm)) ) / ( sqrt(Σ_m w(a(ei,cm))^2) × sqrt(Σ_m w(a(ej,cm))^2) )    (7)

  w(a(ci,cj)) = Σ_m ( w(a(em,ci)) × w(a(em,cj)) ) / ( sqrt(Σ_m w(a(em,ci))^2) × sqrt(Σ_m w(a(em,cj))^2) )    (8)

For probability estimation, the edge weights are normalized to sum to 1. Thus, the edge weights between two nodes are used as the probability between them (e.g., P(e,c) = w(a(e,c)), P(ei,ej) = w(a(ei,ej)), and P(ci,cj) = w(a(ci,cj))).

Posterior Probability Estimation. From each graph for the cluster t, with instances e' ∈ U' and concepts c' ∈ V', we estimate the posterior probability of a concept variable c considering the neighborhoods of the concept. Formally, the posterior probability P(c,t) is computed as:

  P(c,t) ∝ Π_{ci ∈ V'} Ψ1(c,ci) × Π_{ei,ej ∈ U'} Ψ2(c,ei,ej)    (9)

where Ψ1(·) and Ψ2(·) are potential functions. Ψ1(c,ci) is defined as P(c,ci) on pairs of c and its concept neighbors ci, which considers concept-concept relationships. For example, job and occupation are synonymous, and observations of an instance to these concepts can be aggregated, to be more robust to missing observations. Ψ2(c,ei,ej) is defined from 3-cliques having two instances and one concept. A 3-clique of multiple instances has been considered an effective approximation of a disambiguated unit cluster (Bordag 2006), which we simulate as a 3-clique of multiple instances sharing the common concept c, i.e., Ψ2(c,ei,ej).

For the posterior probability of c, while Bayes only uses the relationship from concept to instance, P(e|c), our model uses the neighborhood relationships between variables.

Approximation of P(c,t). We approximate P(c,t) (Eq. 9) to reduce the computations for Ψ1(c,ci) and Ψ2(c,ei,ej).

Approximating Ψ1(c,ci): Since there are many concepts, computing P(ci,cj) for every pair is expensive. We have already clustered concept synonyms in Section 4.1. This concept clustering reduces the number of concepts (49.5% of concepts are reduced) and obtains a more accurate prior probability by merging the observations of concept synonyms, e.g., n(police, {job, occupation, professional, worker}). After this clustering, synonyms are aggregated (Fig. 3a), after which Ψ1(c,ci) can be dropped.

Approximating Ψ2(c,ei,ej): We approximate the potential function Ψ2(c,ei,ej) as:

  Ψ2(c,ei,ej) = P(ei,c) × P(ej,c) × P(ei,ej)    (10)

This approximation aggregates the probabilities P(ei,c), P(ej,c), and P(ei,ej) of the edge weights of 3-cliques to consider instance-instance and concept-instance pairs. With this approximation, we finally define the concept score for the whole given instance set E by a summation of probabilities (Lioma 2008):

  S(c,E) = Σ_{(ei,ej,c) ∈ t} Ψ2(c,ei,ej)    (11)

We rank concepts in descending order of this score.
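With the approximations above, scoring a concept reduces to summing the 3-clique products of Eq. 10 over instance pairs (Eq. 11). A minimal sketch with assumed, already-normalized edge weights (toy numbers, not values from the paper):

```python
from itertools import combinations

def concept_score(c, instances, p):
    """S(c, E) = sum over instance pairs (ei, ej) of
    Psi_2(c, ei, ej) = P(ei, c) * P(ej, c) * P(ei, ej)   (Eqs. 10-11).
    p maps unordered node pairs to normalized edge weights (= probabilities);
    missing pairs default to 0, mirroring pruned or unobserved edges."""
    def w(x, y):
        return p.get(frozenset((x, y)), 0.0)
    return sum(w(ei, c) * w(ej, c) * w(ei, ej)
               for ei, ej in combinations(instances, 2))

# Toy cluster: both instances are strongly tied to "company",
# only one is (weakly) tied to "band".
p = {
    frozenset(("british airways", "company")): 0.30,
    frozenset(("factory", "company")): 0.25,
    frozenset(("british airways", "factory")): 0.20,
    frozenset(("british airways", "band")): 0.05,
}
E = ["british airways", "factory"]
assert concept_score("company", E, p) > concept_score("band", E, p)
```

Because every term requires all three edges of a 3-clique to carry weight, a concept supported by only one of the two instances (band above) contributes nothing, which is exactly the disambiguation effect the potential is designed for.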

5 Experiments

This section validates our conceptualization and selectional preference approaches using extensive experiments. All experiments were carried out on a machine with an Intel Core i3 CPU at 3.07GHz and 4GB of DDR3 memory.

5.1 Conceptualization Evaluation

To validate the proposed conceptualization approach, this section measures whether it can identify appropriate concepts for an instance set of FEs. We report our experimental setup and results.

Setup. As conceptualization algorithms, we implement the following approaches: Bayes, CL+Bayes (Sec. 3.2), a WordNet-based algorithm (Bryl et al. 2012), and Truss+MRF (Sec. 4). To run these conceptualization approaches on FrameNet examples, we extract head words from instances in the form of phrases by adopting a standard head word technique (Kawahara et al. 2014). For example, we extract "states" from [The few states]_Employer.

To evaluate these conceptualization results, we use two human annotators to choose the ten (frame, FE) pairs in Table 4 and to verify the top-N concepts ranked by each algorithm, because obtaining all the ground-truth concepts from over 2 million concepts for FEs is a non-trivial task. We let the human annotators label the top-30 concepts ranked by each algorithm into four categories: very typical, typical, related, and unrelated, which have the scores 1, 2/3, 1/3, and 0, respectively. Table 5 shows examples of the concepts categorized by the annotators.

Table 4: The (Frame, FE) pairs used for experiments.

Frame            FE
ACTIVITY FINISH  Time
APPLY HEAT       Container
APPLY HEAT       Heating instruments
ARREST           Authorities
ATTENDING        Agent
BRINGING         Area
EMPLOYING        Employee
EXPORTING        Importing area
HUNTING          Hunter
RIDE VEHICLE     Vehicle

Table 5: Examples of annotator categories.

Category      Frame         FE         Concept
Very typical  ATTENDING     Agent      professional
Very typical  APPLY HEAT    Container  cooking utensil
Typical       APPLY HEAT    Container  kitchen utensil
Typical       RIDE VEHICLE  Vehicle    mobile source
Related       BRINGING      Area       geographical feature
Related       RIDE VEHICLE  Vehicle    machine
Unrelated     HUNTING       Hunter     book
Unrelated     EMPLOYING     Employee   movie

With these human-annotated scores as ground-truth, we use the precision and recall of the top-N concepts (Lee et al. 2013): P@N = (1/N) Σ_{i=1}^{N} score_i, and R@N = (# very typical concepts in the top-N) / (# very typical concepts in the top-50 of all the algorithms), where score_i is the score of the ith ranked concept.

Results. Table 6 compares the precision and recall of Truss+MRF with those of the baselines. Truss+MRF achieves the highest precision and recall for all N, consistently achieving precision near 0.8. In human annotation, this score corresponds to finding mostly "typical" or "very typical" concepts. For example, for the Vehicle FE, Truss+MRF extracts concepts related to vehicle such as vehicle, mobile source, airborne vehicle, and transportation vehicle.

In contrast, the precision of CL+Bayes is lower than that of Bayes, which is not consistent with (Song et al. 2011). We find the reason is that ambiguous instances, such as apple, often form a singleton cluster. In our problem of conceptualization with many terms, such ambiguous instances are more likely to appear. Meanwhile, the recall of WordNet is extremely low, due to the sparseness of WordNet concepts (Tonelli et al. 2012), and stops growing for N > 50.

In conclusion, Truss+MRF significantly outperforms CL+Bayes in both efficiency and effectiveness: Truss+MRF achieves a 608.33 times speed-up (4.2 days vs. 9.9 minutes) and improvements of 0.479 in precision and 0.184 in recall.

However, the quantitative accuracy analysis was limited to 10 frames, as the labeling process is labor-intensive; similarly, (Lee et al. 2013) and (Wang et al. 2015) conducted evaluations restricted to 12 concepts and six terms, respectively. We thus release our results for all roles for qualitative evaluation.(4)

5.2 Pseudo-disambiguation Evaluation

This section evaluates our conceptualization in the end task of computing selectional preference.

Setup. As selectional preference algorithms, we use the following two algorithms: a topic model-based algorithm (Ritter, Mausam, and Etzioni 2010)(5) and our concept-based algorithm. To evaluate selectional preference, a widely adopted task has been the pseudo-disambiguation task used in (Erk 2007; Ritter, Mausam, and Etzioni 2010), which automatically generates positive ground-truth using frequently co-occurring instances from a corpus, and then negative ground-truth by pairing with random instances. This evaluation is conducted for two relation-argument pairs, (relation, arg_sub) and (relation, arg_obj), where arg_sub is the subject argument and arg_obj is the direct object argument of a relation (i.e., verb). For example, for the given sentence "the few states employed chemical weapons", arg_sub is [the few states] and arg_obj is [chemical weapons] of the relation employed. As the corpus, we use the SemEval 2010 (task 10) dataset (Ruppenhofer et al. 2010) due to its high-quality sentences used for a gold standard dataset.

(4) The entire results are released at http://karok.postech.ac.kr/FEconceptualization.zip.
(5) This algorithm is downloaded from https://github.com/aritter/LDA-SP
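The P@N and R@N metrics of Sec. 5.1 can be computed directly from the annotator labels. A sketch with a hypothetical ranking (toy labels, not the actual annotations):

```python
SCORES = {"very typical": 1.0, "typical": 2/3, "related": 1/3, "unrelated": 0.0}

def p_at_n(labels, n):
    """P@N: mean annotator score over the top-N ranked concepts."""
    return sum(SCORES[l] for l in labels[:n]) / n

def r_at_n(labels, n, total_very_typical):
    """R@N: fraction of all 'very typical' concepts found in the top N.
    total_very_typical is the pooled count over all algorithms' top-50."""
    found = sum(1 for l in labels[:n] if l == "very typical")
    return found / total_very_typical

# Hypothetical ranking of four concepts, best first.
labels = ["very typical", "typical", "unrelated", "very typical"]
assert p_at_n(labels, 2) == (1.0 + 2/3) / 2
assert r_at_n(labels, 4, total_very_typical=4) == 0.5
```

Note that R@N is pooled: its denominator counts very typical concepts found by any algorithm, so an algorithm's recall can plateau (as WordNet's does) once its own ranking stops surfacing new very typical concepts.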

Table 6: The average P@N and R@N over the FEs in Table 4.

Approach    P@10   R@10   P@20   R@20   P@30   R@30   P@40   R@40   P@50   R@50
Bayes       0.640  0.091  0.558  0.146  0.516  0.200  0.492  0.275  0.455  0.317
CL+Bayes    0.393  0.074  0.342  0.104  0.341  0.146  0.307  0.172  0.303  0.210
WordNet     0.687  0.033  0.644  0.045  0.631  0.038  0.631  0.038  0.631  0.038
Truss+MRF   0.852  0.121  0.832  0.257  0.810  0.325  0.794  0.361  0.782  0.394

We report precision and recall: Precision = (# positive instances labeled) / (# positive instances labeled + # negative instances labeled), and Recall = (# positive instances labeled) / (# total positive instances). In particular, for each verb, e.g., employ, we first identify the corresponding frames (e.g., EMPLOYING and USING). For computing the plausibility score of arg_obj = weapon for each FE, we compare it with the FE sets in the same role (using Eq. 11). We identify the frame with a higher score as a potential match, and label positive (if the score is over a threshold) and negative (otherwise).

Results. Table 7 reports results for varying thresholds. Note that the accuracy of the topic model is significantly lower than the published results in (Ritter, Mausam, and Etzioni 2010), due to its limited coverage on the SemEval task with many named entities, as also observed in Table 2. In contrast, our algorithm achieves up to 0.18 higher precision at the same recall.

Table 7: Results of the pseudo-disambiguation evaluation when recall is 0.2, 0.3, and 0.6.

             Recall = 0.2     Recall = 0.3     Recall = 0.6
Approach     Prec.   Recall   Prec.   Recall   Prec.   Recall
Topic model  0.66    0.21     0.56    0.29     0.54    0.60
Ours         0.82    0.21     0.74    0.30     0.60    0.61

Acknowledgement

This research was supported by the MSIP (The Ministry of Science, ICT and Future Planning), Korea and Microsoft Research, under the ICT/SW Creative research program supervised by the IITP (Institute for Information & Communications Technology Promotion) (IITP-2015-R2215-15-1009). S. Hwang is a corresponding author.

References

Adamic, L., and Adar, E. 2001. Friends and neighbors on the web. Social Networks.
Ames, B. P. W. 2012. Guaranteed clustering and biclustering via semidefinite programming. CoRR.
Bliss, C. A.; Frank, M. R.; Danforth, C. M.; and Dodds, P. S. 2014. An evolutionary algorithm approach to link prediction in dynamic social networks. Journal of Computational Science.
Bordag, S. 2006. Word sense induction: Triplet-based clustering and automatic evaluation. In EACL.
Bryl, V.; Tonelli, S.; Giuliano, C.; and Serafini, L. 2012. A novel framenet-based resource for the semantic web. In SAC.
Cohen, J. 2008. Trusses: Cohesive subgraphs for social network analysis. Technical report, National Security Agency.
Das, D.; Martins, A. F. T.; and Smith, N. A. 2012. An exact dual decomposition algorithm for shallow semantic parsing with constraints. In SemEval.
Erk, K. 2007. A simple, similarity-based model for selectional preferences. In ACL.
Fillmore, C. J.; Johnson, C. R.; and Petruck, M. R. 2003. Background to FrameNet. International Journal of Lexicography.
Fillmore, C. J. 1982. Frame semantics. Linguistics in the Morning Calm.
Hearst, M. A. 1992. Automatic acquisition of hyponyms from large text corpora. In COLING.
Kawahara, D.; Peterson, D. W.; Popescu, O.; and Palmer, M. 2014. Inducing example-based semantic frames from a massive amount of verb uses. In ACL.
Lee, T.; Wang, Z.; Wang, H.; and Hwang, S.-w. 2013. Attribute extraction and scoring: A probabilistic approach. In ICDE.
Li, P.; Wang, H.; Zhu, K. Q.; Wang, Z.; and Wu, X. 2013. Computing term similarity by large probabilistic isa knowledge. In CIKM.
Lidstone, G. J. 1920. Note on the general case of the bayes-laplace formula for inductive or a posteriori probabilities. Transactions of the Faculty of Actuaries.
Lioma, C. A. 2008. Part of speech n-grams for information retrieval. Ph.D. thesis, University of Glasgow, Scotland, UK.
Pizzato, L. A., and Mollá, D. 2008. Indexing on semantic roles for question answering. In IRQA.
Resnik, P. 1996. Selectional constraints: An information-theoretic model and its computational realization. Cognition.
Ritter, A.; Mausam; and Etzioni, O. 2010. A latent dirichlet allocation method for selectional preferences. In ACL.
Ruppenhofer, J.; Sporleder, C.; Morante, R.; Baker, C.; and Palmer, M. 2010. Semeval-2010 task 10: Linking events and their participants in discourse. In Proceedings of the 5th International Workshop on Semantic Evaluation.
Song, Y.; Wang, H.; Wang, Z.; Li, H.; and Chen, W. 2011. Short text conceptualization using a probabilistic knowledgebase. In IJCAI.
Tonelli, S.; Bryl, V.; Giuliano, C.; and Serafini, L. 2012. Investigating the semantics of frame elements. In EKAW.
Wang, Z.; Zhao, K.; Wang, H.; Meng, X.; and Wen, J.-R. 2015. Query understanding through knowledge-based conceptualization. In IJCAI.
Wang, Z.; Wang, H.; and Hu, Z. 2014. Head, modifier, and constraint detection in short texts. In ICDE.
Wu, W.; Li, H.; Wang, H.; and Zhu, K. Q. 2012. Probase: a probabilistic taxonomy for text understanding. In SIGMOD.
