Verb Sense Disambiguation
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Deborah Schiffrin Editor
Meaning, Form, and Use in Context: Linguistic Applications Deborah Schiffrin editor Meaning, Form, and Use in Context: Linguistic Applications Deborah Schiffrin editor Georgetown University Press, Washington, D.C. 20057 BIBLIOGRAPHIC NOTICE Since this series has been variously and confusingly cited as: George- town University Monographic Series on Languages and Linguistics, Monograph Series on Languages and Linguistics, Reports of the Annual Round Table Meetings on Linguistics and Language Study, etc., beginning with the 1973 volume, the title of the series was changed. The new title of the series includes the year of a Round Table and omits both the monograph number and the meeting number, thus: Georgetown University Round Table on Languages and Linguistics 1984, with the regular abbreviation GURT '84. Full bibliographic references should show the form: Kempson, Ruth M. 1984. Pragmatics, anaphora, and logical form. In: Georgetown University Round Table on Languages and Linguistics 1984. Edited by Deborah Schiffrin. Washington, D.C.: Georgetown University Press. 1-10. Copyright (§) 1984 by Georgetown University Press All rights reserved Printed in the United States of America Library of Congress Catalog Number: 58-31607 ISBN 0-87840-119-9 ISSN 0196-7207 CONTENTS Welcoming Remarks James E. Alatis Dean, School of Languages and Linguistics vii Introduction Deborah Schiffrin Chair, Georgetown University Round Table on Languages and Linguistics 1984 ix Meaning and Use Ruth M. Kempson Pragmatics, anaphora, and logical form 1 Laurence R. Horn Toward a new taxonomy for pragmatic inference: Q-based and R-based implicature 11 William Labov Intensity 43 Michael L. Geis On semantic and pragmatic competence 71 Form and Function Sandra A. -
The Case for Word Sense Induction and Disambiguation
Unsupervised Does Not Mean Uninterpretable: The Case for Word Sense Induction and Disambiguation Alexander Panchenko‡, Eugen Ruppert‡, Stefano Faralli†, Simone Paolo Ponzetto† and Chris Biemann‡ ‡Language Technology Group, Computer Science Dept., University of Hamburg, Germany †Web and Data Science Group, Computer Science Dept., University of Mannheim, Germany panchenko,ruppert,biemann @informatik.uni-hamburg.de { faralli,simone @informatik.uni-mannheim.de} { } Abstract Word sense induction from domain-specific cor- pora is a supposed to solve this problem. How- The current trend in NLP is the use of ever, most approaches to word sense induction and highly opaque models, e.g. neural net- disambiguation, e.g. (Schutze,¨ 1998; Li and Juraf- works and word embeddings. While sky, 2015; Bartunov et al., 2016), rely on cluster- these models yield state-of-the-art results ing methods and dense vector representations that on a range of tasks, their drawback is make a WSD model uninterpretable as compared poor interpretability. On the example to knowledge-based WSD methods. of word sense induction and disambigua- Interpretability of a statistical model is impor- tion (WSID), we show that it is possi- tant as it lets us understand the reasons behind its ble to develop an interpretable model that predictions (Vellido et al., 2011; Freitas, 2014; Li matches the state-of-the-art models in ac- et al., 2016). Interpretability of WSD models (1) curacy. Namely, we present an unsuper- lets a user understand why in the given context one vised, knowledge-free WSID approach, observed a given sense (e.g., for educational appli- which is interpretable at three levels: word cations); (2) performs a comprehensive analysis of sense inventory, sense feature representa- correct and erroneous predictions, giving rise to tions, and disambiguation procedure. -
Lexical Sense Labeling and Sentiment Potential Analysis Using Corpus-Based Dependency Graph
mathematics Article Lexical Sense Labeling and Sentiment Potential Analysis Using Corpus-Based Dependency Graph Tajana Ban Kirigin 1,* , Sanda Bujaˇci´cBabi´c 1 and Benedikt Perak 2 1 Department of Mathematics, University of Rijeka, R. Matejˇci´c2, 51000 Rijeka, Croatia; [email protected] 2 Faculty of Humanities and Social Sciences, University of Rijeka, SveuˇcilišnaAvenija 4, 51000 Rijeka, Croatia; [email protected] * Correspondence: [email protected] Abstract: This paper describes a graph method for labeling word senses and identifying lexical sentiment potential by integrating the corpus-based syntactic-semantic dependency graph layer, lexical semantic and sentiment dictionaries. The method, implemented as ConGraCNet application on different languages and corpora, projects a semantic function onto a particular syntactical de- pendency layer and constructs a seed lexeme graph with collocates of high conceptual similarity. The seed lexeme graph is clustered into subgraphs that reveal the polysemous semantic nature of a lexeme in a corpus. The construction of the WordNet hypernym graph provides a set of synset labels that generalize the senses for each lexical cluster. By integrating sentiment dictionaries, we introduce graph propagation methods for sentiment analysis. Original dictionary sentiment values are integrated into ConGraCNet lexical graph to compute sentiment values of node lexemes and lexical clusters, and identify the sentiment potential of lexemes with respect to a corpus. The method can be used to resolve sparseness of sentiment dictionaries and enrich the sentiment evaluation of Citation: Ban Kirigin, T.; lexical structures in sentiment dictionaries by revealing the relative sentiment potential of polysemous Bujaˇci´cBabi´c,S.; Perak, B. Lexical Sense Labeling and Sentiment lexemes with respect to a specific corpus. -
Performative Sentences and the Morphosyntax-Semantics Interface in Archaic Vedic
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Journal of South Asian Linguistics JSAL volume 1, issue 1 October 2008 Performative Sentences and the Morphosyntax-Semantics Interface in Archaic Vedic Eystein Dahl, University of Oslo Received November 1, 2007; Revised October 15, 2008 Abstract Performative sentences represent a particularly intriguing type of self-referring assertive clauses, as they constitute an area of linguistics where the relationship between the semantic-grammatical and the pragmatic-contextual dimension of language is especially transparent. This paper examines how the notion of performativity interacts with different tense, aspect and mood categories in Vedic. The claim is that one may distinguish three slightly different constraints on performative sentences, a modal constraint demanding that the proposition is represented as being in full accordance with the Common Ground, an aspectual constraint demanding that there is a coextension relation between event time and reference time and a temporal constraint demanding that the reference time is coextensive with speech time. It is shown that the Archaic Vedic present indicative, aorist indicative and aorist injunctive are quite compatible with these constraints, that the basic modal specifications of present and aorist subjunctive and optative violate the modal constraint on performative sentences, but give rise to speaker-oriented readings which in turn are compatible with that constraint. However, the imperfect, the present injunctive, the perfect indicative and the various modal categories of the perfect stem are argued to be incompatible with the constraints on performative sentences. 1 Introduction Performative sentences represent a particularly intriguing type of self-referring assertive clauses, as they constitute an area of linguistics where the relationship between the semantic-grammatical and the pragmatic-contextual dimension of language is especially transparent. -
The Role of Syntactic and Semantic Level in Middle-Irish Verbal Noun: This Paper Aims to Provide Some Insights Into the Interact
The role of syntactic and semantic level in middle-Irish verbal noun: This paper aims to provide some insights into the interaction between the syntactic level and the semantic one, with special regards on the middle-Irish verbal noun The research concentrates on the syntactic and semantic functions of Verbal Noun in Irish. One of the more striking features of the Celtic languages is their lack of a category of infinitive comparable to that found in most other Indo-European languages. Instead, a nominalization is used, which is usually termed “Verbal Noun”. This Verbal Noun is a true noun: it is declined like other nominals and takes its argument in genitive case. One of the recurring issues in the literature on the Irish Verbal Noun has been whether or not the Verbal Noun is an infinitive. The answer to this question depends on which criteria are used. From a syntactic point of view, a Verbal Noun behaves like a verbal argument and it could assume a nominal function or a verbal function (in this sense it could be the verb of a subordinate clause): a. When a Verbal Noun functions as a noun, it could be syntactically subject, direct object, indirect object, governed by a preposition, noun of a nominal predicate. From a semantic point of view, the verbal noun can code two meanings: 1. an abstract meaning “action of Vx”, i.e. an entity with the traits [-animate], [-human]. For example: From Táin Bó Cúalnge (1967), rigo 90-92: Ra iarfacht Dare do Mac Roth cid ask:IND.PERF.3.SG Dare:NOM.SG. -
All-Words Word Sense Disambiguation Using Concept Embeddings
All-words Word Sense Disambiguation Using Concept Embeddings Rui Suzuki, Kanako Komiya, Masayuki Asahara, Minoru Sasaki, Hiroyuki Shinnou Ibaraki University, 4-12-1 Nakanarusawa, Hitachi, Ibaraki JAPAN, National Institute for Japanese Language and Linguistics, 10-2 Midoricho, Tachikawa, Tokyo, JAPAN, [email protected], kanako.komiya.nlp, minoru.sasaki.01, hiroyuki.shinnou.0828g @vc.ibaraki.ac.jp, [email protected] Abstract All-words word sense disambiguation (all-words WSD) is the task of identifying the senses of all words in a document. Since the sense of a word depends on the context, such as the surrounding words, similar words are believed to have similar sets of surrounding words. We therefore predict the target word senses by calculating the distances between the surrounding word vectors of the target words and their synonyms using word embeddings. In addition, we introduce the new idea of concept embeddings, constructed from concept tag sequences created from the results of previous prediction steps. We predict the target word senses using the distances between surrounding word vectors constructed from word and concept embeddings, via a bootstrapped iterative process. Experimental results show that these concept embeddings were able to improve the performance of Japanese all-words WSD. Keywords: word sense disambiguation, all-words, unsupervised 1. Introduction many words have the same article numbers. Several words Word sense disambiguation (WSD) involves identifying the can have the same precise article number, even when the senses of words in documents. In particular, the WSD task semantic breaks are considered. where the senses of all the words in a document are disam- biguated is referred to as all-words WSD. -
Contrastive Study and Review of Word Sense Disambiguation Techniques S.S
et International Journal on Emerging Technologies 11 (4): 96-103(2020) ISSN No. (Print): 0975-8364 ISSN No. (Online): 2249-3255 Contrastive Study and Review of Word Sense Disambiguation Techniques S.S. Patil 1, R.P. Bhavsar 2 and B.V. Pawar 3 1Research Scholar, Department of Computer Engineering, SSBT’s COET Jalgaon (Maharashtra), India. 2Associate Professor, School of Computer Sciences, KBC North Maharashtra University Jalgaon (Maharashtra), India. 3Professor, School of Computer Sciences, KBC North Maharashtra University Jalgaon (Maharashtra), India. (Corresponding author: S.S. Patil) (Received 25 February 2020, Revised 10 June 2020, Accepted 12 June 2020) (Published by Research Trend, Website: www.researchtrend.net) ABSTRACT: Ambiguous word, phrase or sentence has more than one meaning, ambiguity in the word sense is a fundamental characteristic of any natural language; it makes all the natural language processing (NLP) tasks vary tough, so there is need to resolve it. To resolve word sense ambiguities, human mind can use cognition and world knowledge, but today, machine translation systems are in demand and to resolve such ambiguities, machine can’t use cognition and world knowledge, so it make mistakes in semantics and generates wrong interpretations. In natural language processing and understanding, handling sense ambiguity is one of the central challenges, such words leads to erroneous machine translation and retrieval of unrelated response in information retrieval, word sense disambiguation (WSD) is used to resolve such sense ambiguities. Depending on the use of corpus, WSD techniques are classified into two categories knowledge-based and machine learning, this paper summarizes the contrastive study and review of various WSD techniques. -
Transformative Poetry. a General Introduction and a Case Study of Psalm 2
Perichoresis Volume 14. Issue 2 (2016): 3-20 DOI: 10.1515/perc-2016-0007 TRANSFORMATIVE POETRY. A GENERAL INTRODUCTION AND A CASE STUDY OF PSALM 2 ARCHIBALD L. H. M. VAN WIERINGEN * Tilburg University ABSTRACT. The structure of Psalm 2 is based on direct speeches in the text. These direct speeches characterise the communication that takes place in the text. The text-immanent author addresses the text-immanent reader with a question in the first line and with a makarismos in the last line. The direct speeches of the characters enable the text-immanent reader to undergo a development in his striving towards the beatitude. In this development, the ‘now’ of the birth of the Anointed One / Son is re-enacted in the reading moment of the text-immanent reader. The reception of Psalm 2 makes clear that Hebrews and the Christmas liturgy re-use the text, while retaining this re-enactment character. KEY WORDS: Psalm 2, synchrony, Epistle to the Hebrews , introit midnight Mass Christmas, direct speech Introductory Remarks: the Characteristics of Transformative Poetry Transformations occur in every religion, most strikingly in religious initiation. This is also the case for Judaism and Christianity. In Judaism, the initiatory transformation is marked by circumcision. The ritual includes the new believer in the promises that God made to Abraham. Through circumcision, Abraham experienced in his body the difficult reali- sation of the promise of a son, and circumcision thus became the expression of his trust, his faith, in the Lord. Circumcision is therefore a sign of faith, both for Abraham and also for the generations to come as in Genesis 17:9-14 (Jacob 1934: 431-434). -
Word Senses and Wordnet Lady Bracknell
Speech and Language Processing. Daniel Jurafsky & James H. Martin. Copyright © 2021. All rights reserved. Draft of September 21, 2021. CHAPTER 18 Word Senses and WordNet Lady Bracknell. Are your parents living? Jack. I have lost both my parents. Lady Bracknell. To lose one parent, Mr. Worthing, may be regarded as a misfortune; to lose both looks like carelessness. Oscar Wilde, The Importance of Being Earnest ambiguous Words are ambiguous: the same word can be used to mean different things. In Chapter 6 we saw that the word “mouse” has (at least) two meanings: (1) a small rodent, or (2) a hand-operated device to control a cursor. The word “bank” can mean: (1) a financial institution or (2) a sloping mound. In the quote above from his play The Importance of Being Earnest, Oscar Wilde plays with two meanings of “lose” (to misplace an object, and to suffer the death of a close person). We say that the words ‘mouse’ or ‘bank’ are polysemous (from Greek ‘having word sense many senses’, poly- ‘many’ + sema, ‘sign, mark’).1 A sense (or word sense) is a discrete representation of one aspect of the meaning of a word. In this chapter WordNet we discuss word senses in more detail and introduce WordNet, a large online the- saurus —a database that represents word senses—with versions in many languages. WordNet also represents relations between senses. For example, there is an IS-A relation between dog and mammal (a dog is a kind of mammal) and a part-whole relation between engine and car (an engine is a part of a car). -
How Much Does Word Sense Disambiguation Help in Sentiment Analysis of Micropost Data?
How much does word sense disambiguation help in sentiment analysis of micropost data? Chiraag Sumanth Diana Inkpen PES Institute of Technology University of Ottawa India Canada 6th Workshop on Computational Approaches to Subjectivity, Sentiment & Social Media Analysis OUTLINE • Introduction • What is Word Sense Disambiguation • Word Sense Disambiguation and Sentiment Analysis • Dataset • System Description • Results • Error Analysis • Conclusion How much does word sense disambiguation help in sentiment analysis of micropost data? INTRODUCTION • This short paper describes a sentiment analysis system for micro-post data that includes analysis of tweets from Twitter and Short Messaging Service (SMS) text messages. • Use Word Sense Disambiguation techniques in sentiment analysis at the message level, where the entire tweet or SMS text was analysed to determine its dominant sentiment. • Use of Word Sense Disambiguation alone has resulted in an improved sentiment analysis system that outperforms systems built without incorporating Word Sense Disambiguation. How much does word sense disambiguation help in sentiment analysis of micropost data? What is Word Sense Disambiguation (WSD) ? • Any natural language processing system encounters the problem of lexical ambiguity, be it syntactic or semantic. • The resolution of a word’s syntactic ambiguity has largely been by part-of-speech taggers which with high levels of accuracy. • The problem is that words often have more than one meaning, sometimes fairly similar and sometimes completely different. The meaning of a word in a particular usage can only be determined by examining its context. • Word Sense Disambiguation (WSD) is the process of identifying the sense of such words, called polysemic words. How much does word sense disambiguation help in sentiment analysis of micropost data? Word Sense Disambiguation and Sentiment Analysis • Akkaya et al. -
Word Sense Ambiguation: Clustering Related Senses
WORD SENSE AMBIGUATION: CLUSTERING RELATED SENSES William B. Dolan Microsoft Research billdol @ microsoft.corn Abstract Alter describing the algorithm which accomplishes This paper describes a heuristic approach to this task, we go on to briefly discuss its results. automatically identifying which senses of a machine- Finally, we describe the implications of this work readable dictionary (MRD) headword are has for the task of merging multiple. semantically related versus those which correspond to fundamentally different senses of the word. The The arbitrm'iness of sense divisions inclusion of this information in a lexical database The division of word meanings into distinct profoundly alters the nature of sense disambiguation: dictionary senses and entries is frequently arbitrary the appropriate "sense" of a polysemous word may (Atkins and Levin, 1988; Atkins, 1991), as a now correspond to some set of related senses. Our comparison of any two dictionaries quickly makes technique offers benefits both for on-line semantic clear. For example, consider the verb "mo(u)lt", processing and for the challenging task of mapping whose single sense in the American Heritage word senses across multiple MRDs in creating a Dictionary, Third Edition (AHD3) corresponds to merged lexical database. two senses in Longman's Dictionary of Contemporary English (LDOCE): 2 1. Introduction AHD3 The problem of word sense disambiguation is one !y:part :or a!l of a coat or... which has received increased attention in recent covering; sucli:as feathers;cuticle or skin ' work on Natural Language Processing (NLP) and hfformation Retrieval (IR). Given an occurrence of a LDOCE polysemous word in running text, the task as it is (1) "(of a biid ) to !0se o r thro TMoff (tleatberS) at the generally formulated involves examining a set of seasoii when new feathers grow' . -
Survey of Word Sense Disambiguation Approaches
Survey of Word Sense Disambiguation Approaches Xiaohua Zhou and Hyoil Han College of Information Science & Technology, Drexel University 3401 Chestnut Street, Philadelphia, PA 19104 [email protected], [email protected] Abstract can be either symbolic or empirical. Dictionaries provide the definition and partial lexical knowledge for each sense. Word Sense Disambiguation (WSD) is an important but However, dictionaries include little well-defined world challenging technique in the area of natural language knowledge (or common sense). An alternative is for a processing (NLP). Hundreds of WSD algorithms and program to automatically learn world knowledge from systems are available, but less work has been done in regard to choosing the optimal WSD algorithms. This manually sense-tagged examples, called a training corpus. paper summarizes the various knowledge sources used for WSD and classifies existing WSD algorithms The word to be sense tagged always appears in a context. according to their techniques. The rationale, tasks, Context can be represented by a vector, called a context performance, knowledge sources used, computational vector (word, features). Thus, we can disambiguate word complexity, assumptions, and suitable applications for sense by matching a sense knowledge vector and a context each class of WSD algorithms are also discussed. This vector. The conceptual model for WSD is shown in figure 1. paper will provide users with general knowledge for choosing WSD algorithms for their specific applications Lexical Knowledge or for further adaptation. (Symbolic & Empirical) Sense Knowledge Word Sense Disambiguation 1. Introduction World Knowledge (Matching ) (Machine Learning) Contextual Word Sense Disambiguation (WSD) refers to a task that Features automatically assigns a sense, selected from a set of pre- Figure 1.