LDL-2014 3Rd Workshop on Linked Data in Linguistics

3rd Workshop on Linked Data in Linguistics: Multilingual Knowledge Resources and Natural Language Processing Workshop Programme 08:30 - 09:00 – Opening and Introduction by Workshop Chair(s) 09:00 – 10:00 – Invited Talk Piek Vossen, The Collaborative Inter-Lingual-Index for harmonizing wordnets 10:00 – 10:30 – Session 1: Modeling Lexical-Semantic Resources with lemon Andon Tchechmedjiev, Gilles Sérasset, Jérôme Goulian and Didier Schwab, Attaching Translations to Proper Lexical Senses in DBnary 10:30 – 11:00 Coffee break 11:00-11:20– Session 1: Modeling Lexical-Semantic Resources with lemon John Philip McCrae, Christiane Fellbaum and Philipp Cimiano, Publishing and Linking WordNet using lemon and RDF 11:20-11:40– Session 1: Modeling Lexical-Semantic Resources with lemon Andrey Kutuzov and Maxim Ionov, Releasing genre keywords of Russian movie descriptions as Linguistic Linked Open Data: an experience report 11:40-12:00– Session 2: Metadata Matej Durco and Menzo Windhouwer, From CLARIN Component Metadata to Linked Open Data 12:00-12:20– Session 2: Metadata Gary Lefman, David Lewis and Felix Sasaki, A Brief Survey of Multimedia Annotation Localisation on the Web of Linked Data 12:20-12:50– Session 2: Metadata Daniel Jettka, Karim Kuropka, Cristina Vertan and Heike Zinsmeister, Towards a Linked Open Data Representation of a Grammar Terms Index 12:50-13:00 – Poster slam – Data Challenge 13:00 – 14:00 Lunch break 14:00 – 15:00 – Invited Talk Gerard de Mello, From Linked Data to Tightly Integrated Data 15:00 – 15:30 – Section 3: Crosslinguistic Studies Christian Chiarcos and Maria Sukhareva, Linking Etymological Databases. A case study in Germanic i 15:30 – 16:00 – Section 3: Crosslinguistic Studies Fahad Khan, Federico Boschetti and Francesca Frontini, Using lemon to Model Lexical Semantic Shift in Diachronic Lexical Resources 16:00 – 16:30 Coffee break 16:30 – 17:00 – Section 3: Crosslinguistic Studies Steven Moran and Michael Cysouw, Typology with graphs and matrices 17:00 – 17:30 – Section 3: Crosslinguistic Studies Robert Forkel, The Cross-Linguistic Linked Data project 17:30 – 18:30 – Poster Session – Data Challenge Gilles Sérasset and Andon Tchechmedjiev, Dbnary: Wiktionary as Linked Data for 12 Language Editions with Enhanced Translation Relations Maud Ehrmann, Francesco Cecconi, Daniele Vannella, John Philip McCrae, Philipp Cimiano and Roberto Navigli, A Multilingual Semantic Network as Linked Data: lemon-BabelNet Gabriela Vulcu, Raul Lario Monje, Mario Munoz, Paul Buitelaar and Carlos A. Iglesias, Linked- Data based Domain-Specific Sentiment Lexicons Tomáš Kliegr, Vaclav Zeman and Milan Dojchinovski, Linked Hypernyms Dataset - Generation framework and Use Cases Ismail El Maarouf, Jane Bradbury and Patrick Hanks, PDEV-lemon: a Linked Data implementation of the Pattern Dictionary of English Verbs based on the Lemon model 18:30 – 19:00 – Discussions and Closing ii Editors Goethe-University Frankfurt am Main, Christian Chiarcos Germany John Philip McCrae University of Bielefeld, Germany Petya Osenova Bulgarian Academy of Sciences, Sofia, Bulgaria Cristina Vertan University of Hamburg, Germany Workshop Organizers/Organizing Committee Christian Chiarcos Goethe-University Frankfurt am Main, Germany John Philip McCrae University of Bielefeld, Germany Kiril Simov Bulgarian Academy of Sciences, Sofia, Bulgaria Antonio Branco University of Lisbon, Portugal Nicoletta Calzolari ILC-CNR, Italy Petya Osenova University of Sofia, Bulgaria Milena Slavcheva JRC-Brussels, Belgium Cristina Vertan University of Hamburg, Germany Workshop Programme Committee Eneko Agirre University of the Basque Country, Spain Guadalupe Aguado Universidad Politécnica de Madrid, Spain Interdisciplinary Centre for Social and Peter Bouda Language Documentation, Portugal Steve Cassidy Macquarie University, Australia Damir Cavar Eastern Michigan University, USA Walter Daelemans University of Antwerp, Belgium University of Applied Sciences Potsdam, Ernesto William De Luca Germany Gerard de Melo University of California at Berkeley, USA Institute of Information Sciences, Academia Dongpo Deng Sinica, Taiwan Alexis Dimitriadis Universiteit Utrecht, The Netherlands Jeff Good University at Buffalo, USA Asunción Gómez Pérez Universidad Politécnica de Madrid, Spain Jorge Gracia Universidad Politécnica de Madrid, Spain Walther v. Hahn University of Hamburg, Germany Eva Hajicova Charles University Prague, Czech Republic Radboud Universiteit Nijmegen, The Harald Hammarström Netherlands Yoshihiko Hayashi Osaka University, Japan Sebastian Hellmann Universität Leipzig, Germany Dominic Jones Trinity College Dublin, Ireland Lutz Maicher Universität Leipzig, Germany iii Open Knowledge Foundation Deutschland, Pablo Mendes Germany Universität Zürich, Switzerland/Ludwig Steven Moran Maximilian University, Germany Max Planck Institute for Evolutionary Sebastian Nordhoff Anthropology, Leipzig, Germany Maciej Piasecki Wroclaw University of Technology, Poland Adam Przepiorkowski IPAN, Polish Academy of Sciences, Poland Laurent Romary INRIA, France Deutsches Forschungszentrum für Künstliche Felix Sasaki Intelligenz, Germany iv Table of contents Christian Chiarcos, John McCrae, Petya Osenova, Cristina Vertan, Linked Data in Linguistics vii 2014. Introduction and Overview Piek Vossen, The Collaborative Inter-Lingual-Index for harmonizing wordnets 2 Gerard de Mello, From Linked Data to Tightly Integrated Data 3 Andon Tchechmedjiev, Gilles Sérasset, Jérôme Goulian and Didier Schwab, Attaching 5 Translations to Proper Lexical Senses in DBnary John Philip McCrae, Christiane Fellbaum and Philipp Cimiano, Publishing and Linking 13 WordNet using lemon and RDF Andrey Kutuzov and Maxim Ionov, Releasing genre keywords of Russian movie descriptions 17 as Linguistic Linked Open Data: an experience report Matej Durco, Menzo Windhouwer, From CLARIN Component Metadata to Linked Open Data 23 Gary Lefman, David Lewis and Felix Sasaki, A Brief Survey of Multimedia Annotation 28 Localisation on the Web of Linked Data Daniel Jettka, Karim Kuropka, Cristina Vertan and Heike Zinsmeister, Towards a Linked 33 Open Data Representation of a Grammar Terms Index Christian Chiarcos and Maria Sukhareva, Linking Etymological Databases. A case study in 40 Germanic Fahad Khan, Federico Boschetti and Francesca Frontini, Using lemon to Model Lexical 49 Semantic Shift in Diachronic Lexical Resources Steven Moran and Michael Cysouw, Typology with graphs and matrices 54 Robert Forkel, The Cross-Linguistic Linked Data project 60 Gilles Sérasset and Andon Tchechmedjiev, Dbnary: Wiktionary as Linked Data for 12 67 Language Editions with Enhanced Translation Relations Maud Ehrmann, Francesco Cecconi, Daniele Vannella, John Philip McCrae, Philipp Cimiano 71 and Roberto Navigli, A Multilingual Semantic Network as Linked Data: lemon-BabelNet Gabriela Vulcu, Raul Lario Monje, Mario Munoz, Paul Buitelaar and Carlos A. Iglesias, 76 Linked-Data based Domain-Specific Sentiment Lexicons Tomáš Kliegr, Vaclav Zeman and Milan Dojchinovski, Linked Hypernyms Dataset - 81 Generation framework and Use Cases Ismail El Maarouf, Jane Bradbury and Patrick Hanks, PDEV-lemon: a Linked Data 87 implementation of the Pattern Dictionary of English Verbs based on the Lemon model v Author Index Boschetti, Federico. 49 Bradbury, Jane. .87 Buitelaar, Paul. .76 Cecconi, Francesco. 71 Chiarcos, Christian . .vii, 40 Cimiano, Philipp . .13, 71 McCrae, John . vii, 13, 71 Cysouw, Michael. .54 Dojchinovski, Milan. 81 Durco, Matej. 23 Ehrmann, Maud. .71 Fellbaum, Christiane. .13 Forkel, Robert. .60 Frontini, Francesca. .49 Goulian, Jérôme. .5 Hanks, Patrick. 87 Iglesias, Carlos A. .76 Ionov, Maxim. 17 Jettka, Daniel. .33 Khan, Fahad. .49 Kliegr, Tomáš. 81 Kuropka, Karim. .33 Kutuzov, Andrey. 17 Lario Monje, Raul. 76 Lefman, Gary. 28 Lewis, David. .28 El Maarouf, . ..87 de Mello, Gerard. 3 Moran, Steven. .54 Munoz, Mario. .76 Navigli, Roberto. 71 Osenova, Petya. .vii Sasaki, Felix. .28 Schwab, Didier. .5 Sérasset, Gilles . 5, 67 Sukhareva, Maria. ..40 Tchechmedjiev, Andon . .5, 67 Vannella, Daniele. .71 Vertan, Cristina . vii, 33 Vulcu, Gabriela. .76 Vossen, Piek. .2 Windhouwer, Menzo. .23 Zeman, Vaclav. .81 Zinsmeister, Heike. .33 vi Linked Data in Linguistics 2014. Introduction and Overview Christian Chiarcos1, John McCrae2, Petya Osenova3, Cristina Vertan4 1 Goethe-Universitat¨ Frankfurt am Main, Germany, [email protected] 2 Universitat¨ Bielefeld, Germany, [email protected] 3 University of Soﬁa, Bulgaria, [email protected] 4 Universitat¨ Hamburg, Germany,[email protected] Abstract The Linked Data in Linguistics (LDL) workshop series brings together researchers from various ﬁelds of linguistics, natural language processing, and information technology to present and discuss principles, case studies, and best practices for representing, publishing and linking linguistic data collections. A major outcome of our work is the Linguistic Linked Open Data (LLOD) cloud, an LOD (sub-)cloud of linguistic resources, which covers various linguistic data bases, lexicons, corpora, terminology and metadata repositories. As a general introduction into the topic, we describe the concept of Linked Data, its application in linguistics and the development of the Linguistic Linked Open Data (LLOD) cloud since LDL-2013. We present the contributions of LDL-2014, the associated data challenge and its results and present the newly compiled LLOD cloud diagram. The third instantiation of this series, collocated with the 9th Language Resources and Evaluation

LDL-2014 3Rd Workshop on Linked Data in Linguistics

Hittite Etymologies and Notes*

What the Neurocognitive Study of Inner Language Reveals About Our Inner Space Hélène Loevenbruck

CNS 2014 Program

An Amerind Etymological Dictionary

Talking to Computers in Natural Language

Revealing the Language of Thought Brent Silby 1

3 Corpus Tools for Lexicographers

Natural Language Processing (NLP) for Requirements Engineering: a Systematic Mapping Study

Detecting Politeness in Natural Language by Michael Yeomans, Alejandro Kantor, Dustin Tingley

The Art of Lexicography - Niladri Sekhar Dash

The Importance of Morphology, Etymology, and Phonology

Natural Language Processing