Supervised Learning of a Probabilistic Lexicon of Verb Semantic Classes

Yusuke Miyao
University of Tokyo
Hongo 7-3-1, Bunkyo-ku, Tokyo, Japan

Jun’ichi Tsujii
University of Tokyo
University of Manchester
National Center for Text Mining
Hongo 7-3-1, Bunkyo-ku, Tokyo, Japan
Abstract

The work presented in this paper explores a supervised method for learning a probabilistic model of a lexicon of VerbNet classes. We intend for the probabilistic model to provide a probability distribution of verb-class associations, over known and unknown verbs, including polysemous words. In our approach, training instances are obtained from an existing lexicon and/or from an annotated corpus, while the features, which represent syntactic frames, semantic similarity, and selectional preferences, are extracted from unannotated corpora.

al., 2005; Li and Brew, 2007; Abend et al., 2008). In other words, those methods are heavily dependent on the availability of a semantic lexicon. Therefore, recent research efforts have invested in developing semantic resources, such as WordNet (Fellbaum, 1998), FrameNet (Baker et al., 1998), and VerbNet (Kipper et al., 2000; Kipper-Schuler, 2005), which greatly advanced research in semantic processing. However, the construction of such resources is expensive, and it is unrealistic to presuppose the availability of full-coverage lexicons; this is the case because unknown words always appear in real texts, and word-semantics associations may vary (Abend et al., 2008).

This paper explores a method for the supervised