A Publicly Available Lexical Resource for Opinion Mining

SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining Andrea Esuli∗ and Fabrizio Sebastiani† ∗Istituto di Scienza e Tecnologie dell’Informazione, Consiglio Nazionale delle Ricerche Via Giuseppe Moruzzi 1, 56124 Pisa, Italy E-mail: [email protected] †Dipartimento di Matematica Pura e Applicata, Universita` di Padova Via Giovan Battista Belzoni 7, 35131 Padova, Italy E-mail: [email protected] Abstract Opinion mining (OM) is a recent subdiscipline at the crossroads of information retrieval and computational linguistics which is concerned not with the topic a document is about, but with the opinion it expresses. OM has a rich set of applications, ranging from tracking users’ opinions about products or about political candidates as expressed in online forums, to customer relationship management. In order to aid the extraction of opinions from text, recent research has tried to automatically determine the “PN-polarity” of subjective terms, i.e. identify whether a term that is a marker of opinionated content has a positive or a negative connotation. Research on determining whether a term is indeed a marker of opinionated content (a subjective term) or not (an objective term) has been, instead, much more scarce. In this work we describe SENTIWORDNET, a lexical resource in which each WORDNET synset s is associated to three numerical scores Obj(s), P os(s) and Neg(s), describing how objective, positive, and negative the terms contained in the synset are. The method used to develop SENTIWORDNET is based on the quantitative analysis of the glosses associated to synsets, and on the use of the resulting vectorial term representations for semi-supervised synset classification. The three scores are derived by combining the results produced by a committee of eight ternary classifiers, all characterized by similar accuracy levels but different classification behaviour. SENTIWORDNET is freely available for research purposes, and is endowed with a Web-based graphical user interface. 1. Introduction To aid these tasks, several researchers have attempted to Opinion mining (OM – also known as “sentiment classifi- automatically determine whether a term that is a marker cation”) is a recent subdiscipline at the crossroads of infor- of opinionated content has a Positive or a Negative con- mation retrieval and computational linguistics which is connotation (Esuli and Sebastiani, 2005; Hatzivassiloglou and cerned not with the topic a text is about, but with the opin- McKeown, 1997; Kamps et al., 2004; Kim and Hovy, 2004; ion it expresses. Opinion-driven content management has Takamura et al., 2005; Turney and Littman, 2003), since it several important applications, such as determining critics’ is by considering the combined contribution of these terms opinions about a given product by classifying online prod- that one may hope to solve Tasks 1, 2 and 3. The con- uct reviews, or tracking the shifting attitudes of the general ceptually simplest approach to this latter problem is prob- public towards a political candidate by mining online fo- ably Turney’s (Turney, 2002), who has obtained interest- rums or blogs. Within OM, several subtasks can be iden- ing results on Task 2 by considering the algebraic sum of tified, all of them having to do with tagging a given text the orientations of terms as representative of the orienta- according to expressed opinion: tion of the document they belong to; but more sophisticated approaches are also possible (Hatzivassiloglou and Wiebe, 1. determining text SO-polarity, as in deciding whether a 2000; Riloff et al., 2003; Whitelaw et al., 2005; Wilson et given text has a factual nature (i.e. describes a given al., 2004). situation or event, without expressing a positive or a The task of determining whether a term is indeed a negative opinion on it) or expresses an opinion on its marker of opinionated content (i.e. is Subjective or Ob- subject matter. This amounts to performing binary text jective) has instead received much less attention (Esuli and categorization under categories Subjective and Ob- Sebastiani, 2006; Riloff et al., 2003; Vegnaduzzo, 2004). jective (Pang and Lee, 2004; Yu and Hatzivassiloglou, Note that in these works no distinction between different 2003); senses of a word is attempted, so that the term, and not its senses, are classified (although some such works (Hatzivas- 2. determining text PN-polarity, as in deciding if a given siloglou and McKeown, 1997; Kamps et al., 2004) distin- Subjective text expresses a Positive or a Negative guish between different POSs of a word). opinion on its subject matter (Pang and Lee, 2004; Turney, 2002); In this paper we describe SENTIWORDNET (version 1.0), a lexical resource in which each synset of WORD- 3. determining the strength of text PN-polarity, as in de- NET (version 2.0) is associated to three numerical scores ciding e.g. whether the Positive opinion expressed by Obj(s), P os(s) and Neg(s), describing how Objective, a text on its subject matter is Weakly Positive, Mildly Positive, and Negative the terms contained in the synset Positive, or Strongly Positive (Pang and Lee, 2005; are. The assumption that underlies our switch from terms Wilson et al., 2004). to synsets is that different senses of the same term may 417 have different opinion-related properties. Each of the three semi-supervised synset classification. The three scores are scores ranges from 0.0 to 1.0, and their sum is 1.0 for derived by combining the results produced by a committee each synset. This means that a synset may have nonzero of eight ternary classifiers, each of which has demonstrated, scores for all the three categories, which would indicate in our previous tests, similar accuracy but different charac- that the corresponding terms have, in the sense indicated teristics in terms of classification behaviour. by the synset, each of the three opinion-related proper- SENTIWORDNET is freely available for research pur- ties only to a certain degree1. For example, the synset poses, and is endowed with a Web-based graphical user in- [estimable(3)]2, corresponding to the sense “may be terface. computed or estimated” of the adjective estimable, has an Obj score of 1.0 (and P os and Neg scores of 0.0), while 2. Building SENTIWORDNET the synset [estimable(1)] corresponding to the sense The method we have used to develop SENTIWORDNET is “deserving of respect or high regard” has a P os score of an adaptation to synset classification of our method for de- 0.75, a Neg score of 0.0, and an Obj score of 0.25. ciding the PN-polarity (Esuli and Sebastiani, 2005) and SO- A similar intuition had previously been presented polarity (Esuli and Sebastiani, 2006) of terms. The method in (Kim and Hovy, 2004), whereby a term could have relies on training a set of ternary classifiers3, each of them both a Positive and a Negative PN-polarity, each to a cer- capable of deciding whether a synset is Positive, or Nega- tain degree. A similar point has also recently been made tive, or Objective. Each ternary classifier differs from the in (Andreevskaia and Bergler, 2006), in which terms that other in the training set used to train it and in the learn- possess a given opinion-related property to a higher de- ing device used to train it, thus producing different classi- gree are claimed to be also the ones on which human an- fication results of the WORDNET synsets. Opinion-related notators asked to assign this property agree more. Non- scores for a synset are determined by the (normalized) pro- binary scores are attached to opinion-related properties also portion of ternary classifiers that have assigned the corre- in (Turney and Littman, 2003), but the interpretation here is sponding label to it. If all the ternary classifiers agree in related to the confidence in the correctness of the labelling, assigning the same label to a synset, that label will have the rather than in how strong the term is deemed to possess the maximum score for that synset, otherwise each label will property. have a score proportional to the number of classifiers that We believe that a graded (as opposed to “hard”) eval- have assigned it. uation of opinion-related properties of terms can be help- 2.1. Training a classifier ful in the development of opinion mining applications. A hard classification method will probably label as Objective Each ternary classifier is generated using the semi- any term that has no strong SO-polarity, e.g. terms such as supervised method described in (Esuli and Sebastiani, short or alone. If a sentence contains many such terms, 2006). A semi-supervised method is a learning process a resource based on a hard classification will probably miss whereby only a small subset L ⊂ T r of the training data T r its subtly subjective character, while a graded lexical re- have been manually labelled. In origin the training data in U = T r − L source like SENTIWORDNET may provide enough infor- were instead unlabelled; it is the process itself mation to capture such nuances. that has labelled them, automatically, by using L (with the The method we have used to develop SENTIWORDNET possible addition of other publicly available resources) as is based on our previous work on determining the opinion- input. Our method defines L as the union of three seed (i.e. terms training) sets Lp, Ln and Lo of known Positive, Negative related properties of (Esuli and Sebastiani, 2005; 4 Esuli and Sebastiani, 2006). The method relies on the quan- and Objective synsets, respectively . glosses Lp and Ln are two small sets, which we have defined by titative analysis of the associated to synsets, and on 5 the use of the resulting vectorial term representations for manually selecting the intended synsets for 14 “paradig- matic” Positive and Negative terms (e.g.

Load more