Exploring and Visualizing Wordnet Data with GermaNet Rover

Marie Hinrichs, Richard Lawrence, Erhard Hinrichs University of Tübingen, Germany

www.clarin-d.net GERMANET AND ROVER

1

www.clarin-d.net GermaNet: a Wordnet for German

A groups synonyms into synsets and represents relations between synsets.

The hypernym relation forms a hierarchical graph structure.

2

www.clarin-d.net Rover: a web application for GermaNet

Rover displays the data in GermaNet in an interactive interface designed for researchers. It offers:

• advanced searching for synsets • visualizing the hypernym graph • calculating synsets’ semantic relatedness via graph-based measures

3

www.clarin-d.net SYNSET SEARCH

4

www.clarin-d.net Synset Search Overview

Try it: https://weblicht.sfs.uni-tuebingen.de/rover/search

5

www.clarin-d.net Search Options

Advanced searches for • search by regular synsets: expression and edit distance • restrict results by grammatical category, semantic class, and orthographic variant

6

www.clarin-d.net Results List

A summary of each synset in the search results includes:

• words in the synset • its semantic class • associated Wiktionary definitions • summary of conceptual relations to other synsets

7

www.clarin-d.net Conceptual Relations

Selecting a synset displays details about its conceptual relations: • a network diagram of the synset’s hypernyms • related synsets, displayed as navigation buttons

8

www.clarin-d.net Lexical Units and Relations

More details about the words in the selected synset:

• lexically-related words, displayed as navigation buttons • Interlingual index records (pointers to Princeton WordNet) • examples and associated frame types • decomposition of compound nouns

9

www.clarin-d.net SEMANTIC RELATEDNESS

10

www.clarin-d.net Semantic Relatedness Overview

Try it: https://weblicht.sfs.uni-tuebingen.de/rover/semrel

11

www.clarin-d.net Semantic Relatedness Measures

Rover supports six graph-based measures of semantic relatedness between pairs of synsets:

1. Simple Path calculates the length of the path between two synsets via the hypernym relation, relative to the length of the longest such path in GermaNet 2. Wu and Palmer (1994) 3. Leacock and Chodorow (1998) 4. Resnik (1999) 5. Lin (1998) 6. Jiang and Conrath (1997)

12

www.clarin-d.net Visualize Relatedness

Interactively select a pair of synsets and see the results of relatedness measures: • search for synsets using the same options and interface as Synset Search • adjust measures, normalization maximum, and precision

13

www.clarin-d.net Relatedness Results

Once a pair of synsets is selected, Visualize Relatedness displays: • table of calculation results • network diagram showing the path(s) between the two synsets via their least common subsumers

14

www.clarin-d.net Batch Processing

Batch Processing performs relatedness calculations on larger datasets:

• upload a file of up to 200 word pairs • similar options for adjusting measures, normalization maximum, and precision • download results in a delimited format for further processing

15

www.clarin-d.net Thanks!

Try Rover at: https://weblicht.sfs.uni-tuebingen.de/rover/

Send questions, comments, and suggestions for improvement to: [email protected]

(When you are asked to log in, use your academic institution account, or your CLARIN account)

16

www.clarin-d.net References

Jiang and Conrath (1997). Based on Corpus Statistics and Lexical . In Proceedings of the 10th Research on Computational Linguistics International Conference, 19–33. Taipei, Taiwan: The Association for Computational Linguistics and Chinese Language Processing

Leacock and Chodorow (1998). ‘Combining Local Context and WordNet Similarity for Word Sense Identification’. In WordNet: An Electronic Lexical Database. MIT Press.

Lin (1998). ‘An Information-Theoretic Definition of Similarity’. In Proceedings of the Fifteenth International Conference on Machine Learning, 296–304. San Francisco, CA, USA: Morgan Kaufmann Publishers.

Resnik (1999). Semantic Similarity in a Taxonomy: An Information-Based Measure and Its Application to Problems of Ambiguity in Natural Language. Journal of Artificial Intelligence Research 11 (July): 95–130.

Wu and Palmer (1994). Verbs and lexical selection. In Proceedings of the 32nd Annual Meeting on Association for Computational Linguistics, 133–138. ACL ’94. Las Cruces, New Mexico: Association for Computational Linguistics.

17

www.clarin-d.net