Word Sense Disambiguation: a Survey
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Arxiv:1809.06223V1 [Cs.CL] 17 Sep 2018
Unsupervised Sense-Aware Hypernymy Extraction Dmitry Ustalov†, Alexander Panchenko‡, Chris Biemann‡, and Simone Paolo Ponzetto† †University of Mannheim, Germany fdmitry,[email protected] ‡University of Hamburg, Germany fpanchenko,[email protected] Abstract from text between two ambiguous words, e.g., apple fruit. However, by definition in Cruse In this paper, we show how unsupervised (1986), hypernymy is a binary relationship between sense representations can be used to im- senses, e.g., apple2 fruit1, where apple2 is the prove hypernymy extraction. We present “food” sense of the word “apple”. In turn, the word a method for extracting disambiguated hy- “apple” can be represented by multiple lexical units, pernymy relationships that propagate hy- e.g., “apple” or “pomiculture”. This sense is dis- pernyms to sets of synonyms (synsets), tinct from the “company” sense of the word “ap- constructs embeddings for these sets, and ple”, which can be denoted as apple3. Thus, more establishes sense-aware relationships be- generally, hypernymy is a relation defined on two tween matching synsets. Evaluation on two sets of disambiguated words; this modeling princi- gold standard datasets for English and Rus- ple was also implemented in WordNet (Fellbaum, sian shows that the method successfully 1998), where hypernymy relations link not words recognizes hypernymy relationships that directly, but instead synsets. This essential prop- cannot be found with standard Hearst pat- erty of hypernymy is however not used or modeled terns and Wiktionary datasets for the re- in the majority of current hypernymy extraction spective languages. approaches. In this paper, we present an approach that addresses this shortcoming. -
Roberto Navigli Curriculum Vitæ
Roberto Navigli Curriculum Vitæ Roma, 18 marzo 2021 Part I { General Information Full Name Roberto Navigli Citizenship Italian E-mail [email protected] Spoken languages Italian, English, French Sito Web http://wwwusers.di.uniroma1.it/∼navigli Part II { Education Type Year Institution Notes (Degree, Experience, etc.) Ph.D. 2007 Sapienza PhD in Computer Science, advisor: prof. Paola Velardi. Thesis: "Structural Semantic Interconnections: a Knowledge-Based WSD Algorithm, its Evaluation and Applications", Winner of the 2017 AI*IA \Marco Cadoli" national prize for the best PhD thesis in AI. University 2001 Sapienza Master degree in Computer Science, 110/110 cum laude. graduation Thesis: "An automatic algorithm for domain ontology learning", supervisor: prof. Paola Velardi. Part III { Appointments III(A) { Academic Appointments Start End Institution Position 09/2017 oggi Sapienza full professor, Dipartimento di Informatica, Universit`adi Roma \La Sapienza". 12/2010 08/2017 Sapienza associate professor, Dipartimento di Informatica, Universit`adi Roma \La Sapienza". 03/2007 12/2010 Sapienza assistant professor, Dipartimento di Informatica, Sapienza. 05/2003 03/2007 Sapienza research fellow, Dipartimento di Informatica, Sapienza. III(B) { Other Appointments Start End Institution Position 05/2000 04/2003 YH Reply software engineer S.p.A. Roma 1 III(C) { Research visits and stays Start End Institution Position 01/2015 01/2015 Center for Advanced Studies Invited Visiting Fellow (CAS), LMU, Germany 2010 2012 University of Wolverhampton Visiting -
The Case for Word Sense Induction and Disambiguation
Unsupervised Does Not Mean Uninterpretable: The Case for Word Sense Induction and Disambiguation Alexander Panchenko‡, Eugen Ruppert‡, Stefano Faralli†, Simone Paolo Ponzetto† and Chris Biemann‡ ‡Language Technology Group, Computer Science Dept., University of Hamburg, Germany †Web and Data Science Group, Computer Science Dept., University of Mannheim, Germany panchenko,ruppert,biemann @informatik.uni-hamburg.de { faralli,simone @informatik.uni-mannheim.de} { } Abstract Word sense induction from domain-specific cor- pora is a supposed to solve this problem. How- The current trend in NLP is the use of ever, most approaches to word sense induction and highly opaque models, e.g. neural net- disambiguation, e.g. (Schutze,¨ 1998; Li and Juraf- works and word embeddings. While sky, 2015; Bartunov et al., 2016), rely on cluster- these models yield state-of-the-art results ing methods and dense vector representations that on a range of tasks, their drawback is make a WSD model uninterpretable as compared poor interpretability. On the example to knowledge-based WSD methods. of word sense induction and disambigua- Interpretability of a statistical model is impor- tion (WSID), we show that it is possi- tant as it lets us understand the reasons behind its ble to develop an interpretable model that predictions (Vellido et al., 2011; Freitas, 2014; Li matches the state-of-the-art models in ac- et al., 2016). Interpretability of WSD models (1) curacy. Namely, we present an unsuper- lets a user understand why in the given context one vised, knowledge-free WSID approach, observed a given sense (e.g., for educational appli- which is interpretable at three levels: word cations); (2) performs a comprehensive analysis of sense inventory, sense feature representa- correct and erroneous predictions, giving rise to tions, and disambiguation procedure. -
Ontology Learning and Its Application to Automated Terminology Translation
Natural Language Processing Ontology Learning and Its Application to Automated Terminology Translation Roberto Navigli and Paola Velardi, Università di Roma La Sapienza Aldo Gangemi, Institute of Cognitive Sciences and Technology lthough the IT community widely acknowledges the usefulness of domain ontolo- A gies, especially in relation to the Semantic Web,1,2 we must overcome several bar- The OntoLearn system riers before they become practical and useful tools. Thus far, only a few specific research for automated ontology environments have ontologies. (The “What Is an Ontology?” sidebar on page 24 provides learning extracts a definition and some background.) Many in the and is part of a more general ontology engineering computational-linguistics research community use architecture.4,5 Here, we describe the system and an relevant domain terms WordNet,3 but large-scale IT applications based on experiment in which we used a machine-learned it require heavy customization. tourism ontology to automatically translate multi- from a corpus of text, Thus, a critical issue is ontology construction— word terms from English to Italian. The method can identifying, defining, and entering concept defini- apply to other domains without manual adaptation. relates them to tions. In large, complex application domains, this task can be lengthy, costly, and controversial, because peo- OntoLearn architecture appropriate concepts in ple can have different points of view about the same Figure 1 shows the elements of the architecture. concept. Two main approaches aid large-scale ontol- Using the Ariosto language processor,6 OntoLearn a general-purpose ogy construction. The first one facilitates manual extracts terminology from a corpus of domain text, ontology engineering by providing natural language such as specialized Web sites and warehouses or doc- ontology, and detects processing tools, including editors, consistency uments exchanged among members of a virtual com- checkers, mediators to support shared decisions, and munity. -
Lexical Sense Labeling and Sentiment Potential Analysis Using Corpus-Based Dependency Graph
mathematics Article Lexical Sense Labeling and Sentiment Potential Analysis Using Corpus-Based Dependency Graph Tajana Ban Kirigin 1,* , Sanda Bujaˇci´cBabi´c 1 and Benedikt Perak 2 1 Department of Mathematics, University of Rijeka, R. Matejˇci´c2, 51000 Rijeka, Croatia; [email protected] 2 Faculty of Humanities and Social Sciences, University of Rijeka, SveuˇcilišnaAvenija 4, 51000 Rijeka, Croatia; [email protected] * Correspondence: [email protected] Abstract: This paper describes a graph method for labeling word senses and identifying lexical sentiment potential by integrating the corpus-based syntactic-semantic dependency graph layer, lexical semantic and sentiment dictionaries. The method, implemented as ConGraCNet application on different languages and corpora, projects a semantic function onto a particular syntactical de- pendency layer and constructs a seed lexeme graph with collocates of high conceptual similarity. The seed lexeme graph is clustered into subgraphs that reveal the polysemous semantic nature of a lexeme in a corpus. The construction of the WordNet hypernym graph provides a set of synset labels that generalize the senses for each lexical cluster. By integrating sentiment dictionaries, we introduce graph propagation methods for sentiment analysis. Original dictionary sentiment values are integrated into ConGraCNet lexical graph to compute sentiment values of node lexemes and lexical clusters, and identify the sentiment potential of lexemes with respect to a corpus. The method can be used to resolve sparseness of sentiment dictionaries and enrich the sentiment evaluation of Citation: Ban Kirigin, T.; lexical structures in sentiment dictionaries by revealing the relative sentiment potential of polysemous Bujaˇci´cBabi´c,S.; Perak, B. Lexical Sense Labeling and Sentiment lexemes with respect to a specific corpus. -
All-Words Word Sense Disambiguation Using Concept Embeddings
All-words Word Sense Disambiguation Using Concept Embeddings Rui Suzuki, Kanako Komiya, Masayuki Asahara, Minoru Sasaki, Hiroyuki Shinnou Ibaraki University, 4-12-1 Nakanarusawa, Hitachi, Ibaraki JAPAN, National Institute for Japanese Language and Linguistics, 10-2 Midoricho, Tachikawa, Tokyo, JAPAN, [email protected], kanako.komiya.nlp, minoru.sasaki.01, hiroyuki.shinnou.0828g @vc.ibaraki.ac.jp, [email protected] Abstract All-words word sense disambiguation (all-words WSD) is the task of identifying the senses of all words in a document. Since the sense of a word depends on the context, such as the surrounding words, similar words are believed to have similar sets of surrounding words. We therefore predict the target word senses by calculating the distances between the surrounding word vectors of the target words and their synonyms using word embeddings. In addition, we introduce the new idea of concept embeddings, constructed from concept tag sequences created from the results of previous prediction steps. We predict the target word senses using the distances between surrounding word vectors constructed from word and concept embeddings, via a bootstrapped iterative process. Experimental results show that these concept embeddings were able to improve the performance of Japanese all-words WSD. Keywords: word sense disambiguation, all-words, unsupervised 1. Introduction many words have the same article numbers. Several words Word sense disambiguation (WSD) involves identifying the can have the same precise article number, even when the senses of words in documents. In particular, the WSD task semantic breaks are considered. where the senses of all the words in a document are disam- biguated is referred to as all-words WSD. -
Contrastive Study and Review of Word Sense Disambiguation Techniques S.S
et International Journal on Emerging Technologies 11 (4): 96-103(2020) ISSN No. (Print): 0975-8364 ISSN No. (Online): 2249-3255 Contrastive Study and Review of Word Sense Disambiguation Techniques S.S. Patil 1, R.P. Bhavsar 2 and B.V. Pawar 3 1Research Scholar, Department of Computer Engineering, SSBT’s COET Jalgaon (Maharashtra), India. 2Associate Professor, School of Computer Sciences, KBC North Maharashtra University Jalgaon (Maharashtra), India. 3Professor, School of Computer Sciences, KBC North Maharashtra University Jalgaon (Maharashtra), India. (Corresponding author: S.S. Patil) (Received 25 February 2020, Revised 10 June 2020, Accepted 12 June 2020) (Published by Research Trend, Website: www.researchtrend.net) ABSTRACT: Ambiguous word, phrase or sentence has more than one meaning, ambiguity in the word sense is a fundamental characteristic of any natural language; it makes all the natural language processing (NLP) tasks vary tough, so there is need to resolve it. To resolve word sense ambiguities, human mind can use cognition and world knowledge, but today, machine translation systems are in demand and to resolve such ambiguities, machine can’t use cognition and world knowledge, so it make mistakes in semantics and generates wrong interpretations. In natural language processing and understanding, handling sense ambiguity is one of the central challenges, such words leads to erroneous machine translation and retrieval of unrelated response in information retrieval, word sense disambiguation (WSD) is used to resolve such sense ambiguities. Depending on the use of corpus, WSD techniques are classified into two categories knowledge-based and machine learning, this paper summarizes the contrastive study and review of various WSD techniques. -
Word Senses and Wordnet Lady Bracknell
Speech and Language Processing. Daniel Jurafsky & James H. Martin. Copyright © 2021. All rights reserved. Draft of September 21, 2021. CHAPTER 18 Word Senses and WordNet Lady Bracknell. Are your parents living? Jack. I have lost both my parents. Lady Bracknell. To lose one parent, Mr. Worthing, may be regarded as a misfortune; to lose both looks like carelessness. Oscar Wilde, The Importance of Being Earnest ambiguous Words are ambiguous: the same word can be used to mean different things. In Chapter 6 we saw that the word “mouse” has (at least) two meanings: (1) a small rodent, or (2) a hand-operated device to control a cursor. The word “bank” can mean: (1) a financial institution or (2) a sloping mound. In the quote above from his play The Importance of Being Earnest, Oscar Wilde plays with two meanings of “lose” (to misplace an object, and to suffer the death of a close person). We say that the words ‘mouse’ or ‘bank’ are polysemous (from Greek ‘having word sense many senses’, poly- ‘many’ + sema, ‘sign, mark’).1 A sense (or word sense) is a discrete representation of one aspect of the meaning of a word. In this chapter WordNet we discuss word senses in more detail and introduce WordNet, a large online the- saurus —a database that represents word senses—with versions in many languages. WordNet also represents relations between senses. For example, there is an IS-A relation between dog and mammal (a dog is a kind of mammal) and a part-whole relation between engine and car (an engine is a part of a car). -
Incorporating Trustiness and Collective Synonym/Contrastive Evidence Into Taxonomy Construction
Incorporating Trustiness and Collective Synonym/Contrastive Evidence into Taxonomy Construction #1 #2 3 Luu Anh Tuan , Jung-jae Kim , Ng See Kiong ∗ #School of Computer Engineering, Nanyang Technological University, Singapore [email protected], [email protected] ∗Institute for Infocomm Research, Agency for Science, Technology and Research, Singapore [email protected] Abstract Previous works on the task of taxonomy con- struction capture information about potential tax- Taxonomy plays an important role in many onomic relations between concepts, rank the can- applications by organizing domain knowl- didate relations based on the captured information, edge into a hierarchy of is-a relations be- and integrate the highly ranked relations into a tax- tween terms. Previous works on the taxo- onomic structure. They utilize such information nomic relation identification from text cor- as hypernym patterns (e.g. A is a B, A such as pora lack in two aspects: 1) They do not B) (Kozareva et al., 2008), syntactic dependency consider the trustiness of individual source (Drumond and Girardi, 2010), definition sentences texts, which is important to filter out in- (Navigli et al., 2011), co-occurrence (Zhu et al., correct relations from unreliable sources. 2013), syntactic contextual similarity (Tuan et al., 2) They also do not consider collective 2014), and sibling relations (Bansal et al., 2014). evidence from synonyms and contrastive terms, where synonyms may provide ad- They, however, lack in the three following as- ditional supports to taxonomic relations, pects: 1) Trustiness: Not all sources are trust- while contrastive terms may contradict worthy (e.g. gossip, forum posts written by them. -
Stefano Faralli Curriculum Vitæ
Stefano Faralli Curriculum Vitæ PART 1 - General Information Full name Stefano Faralli orcid http://orcid.org/0000-0003-3684-8815 E-mail [email protected] Home page https://www.unitelmasapienza.it/it/contenuti/personale/stefano-faralli Spoken languages Italian (native language), English and French. PART 2 - Education Oct. 2014 PhD in Computer Science, Sapienza University of Rome. May. 2005 BS + MS in Computer Science, Sapienza University of Rome. PART 3 - Appointments Current • 01/11/2017 - Researcher at University of Rome Unitlema Sapienza, Italy Duties included research and teaching. Research fellowships and PhD • 01/06/2015 - 30/10/2017 Post-doctoral position under the supervision of Professor Simone Paolo Ponzetto University of Mannheim, Germany, Institut f¨urInformatik und Wirtschaftsinformatik Duties included research, undergraduate and graduate teaching, student mentoring. • 01/01/2014 - 31/12/2014 Research fellow, under the supervision of Professors: Roberto Navigli and Paola Velardi Sapienza University of Rome, Dip. Informatica • 01/11/2011 - 20/02/2015 PhD student under the supervision of Professors: Roberto Navigli and Paola Velardi Sapienza University of Rome, Dip. Informatica • 01/01/2011 -31/12/2013 Research fellow under the supervision of Professors: Roberto Navigli and Paola Velardi Sapienza University of Rome, Dip. Informatica • 01/04/2008 -31/03/2009 Research fellow under the supervision of Professor: Augusto Celentano Universit`a"Ca' Foscari" Venezia, Dip. Informatica Research collaboration activities and -
Clustering Enterprise Networks by Patent Analysis
Clustering Enterprise Networks by Patent Analysis Fulvio D´Antonio1, Simone Orsini1, Alessandro Cucchiarelli1, Paola Velardi2 1 Polytechnic University of Marche, Italy, email: cucchiarelli, dantonio, [email protected] 2 Sapienza Università di Roma, Italy, email: dantonio,[email protected] Abstract. The analysis of networks of enterprises can lead to some important insights concerning strategic aspects that can drive the decision making process of different players: business analysts, entrepreneurs, public administrators. In this paper we present the current development status of an integrated methodology to automatically extract enterprise networks from public textual data and analyzing them. We show an application to the enterprises operating in the Italian region of Marche. Keywords: Social Network Analysis, Natural Language Processing, Clustering 1 Introduction Networks of Enterprises [2] are a special kind of social networks in which the nodes represent enterprises and the links indicate some form of relationship among them. The relationships that have been traditionally represented through links are business collaborations, enterprise similarity, mutual exchange of capitals, information flows, or hierarchical relationships like the ones representing supply chains or enterprises aggregation into districts. Social Network Analysis [6] defines a number of measures and techniques that can be used for the evaluation and analysis of enterprise networks. Such measures, if examined by a business analyst, an entrepreneur or a public administrator can lead to some important insights concerning some strategic aspects of the network. We describe here few scenarios in which the analysis can be conveniently applied: Domain analysis The analyst inspects the network in order to understand which are the main productive sectors, the groups of similar enterprises, the relative strengths of such groups and their inter-relationships. -
How Much Does Word Sense Disambiguation Help in Sentiment Analysis of Micropost Data?
How much does word sense disambiguation help in sentiment analysis of micropost data? Chiraag Sumanth Diana Inkpen PES Institute of Technology University of Ottawa India Canada 6th Workshop on Computational Approaches to Subjectivity, Sentiment & Social Media Analysis OUTLINE • Introduction • What is Word Sense Disambiguation • Word Sense Disambiguation and Sentiment Analysis • Dataset • System Description • Results • Error Analysis • Conclusion How much does word sense disambiguation help in sentiment analysis of micropost data? INTRODUCTION • This short paper describes a sentiment analysis system for micro-post data that includes analysis of tweets from Twitter and Short Messaging Service (SMS) text messages. • Use Word Sense Disambiguation techniques in sentiment analysis at the message level, where the entire tweet or SMS text was analysed to determine its dominant sentiment. • Use of Word Sense Disambiguation alone has resulted in an improved sentiment analysis system that outperforms systems built without incorporating Word Sense Disambiguation. How much does word sense disambiguation help in sentiment analysis of micropost data? What is Word Sense Disambiguation (WSD) ? • Any natural language processing system encounters the problem of lexical ambiguity, be it syntactic or semantic. • The resolution of a word’s syntactic ambiguity has largely been by part-of-speech taggers which with high levels of accuracy. • The problem is that words often have more than one meaning, sometimes fairly similar and sometimes completely different. The meaning of a word in a particular usage can only be determined by examining its context. • Word Sense Disambiguation (WSD) is the process of identifying the sense of such words, called polysemic words. How much does word sense disambiguation help in sentiment analysis of micropost data? Word Sense Disambiguation and Sentiment Analysis • Akkaya et al.