Language Technologies for a Multilingual Europe


Language technologies for a multilingual Europe: TC3 III
Edited by Georg Rehm, Felix Sasaki, Daniel Stein & Andreas Witt
Language Science Press

Translation and Multilingual Natural Language Processing 5

Editors: Oliver Czulo (Universität Leipzig), Silvia Hansen-Schirra (Johannes Gutenberg-Universität Mainz), Reinhard Rapp (Johannes Gutenberg-Universität Mainz)

In this series:
1. Fantinuoli, Claudio & Federico Zanettin (eds.). New directions in corpus-based translation studies.
2. Hansen-Schirra, Silvia & Sambor Grucza (eds.). Eyetracking and Applied Linguistics.
3. Neumann, Stella, Oliver Čulo & Silvia Hansen-Schirra (eds.). Annotation, exploitation and evaluation of parallel corpora: TC3 I.
4. Czulo, Oliver & Silvia Hansen-Schirra (eds.). Crossroads between Contrastive Linguistics, Translation Studies and Machine Translation: TC3 II.
5. Rehm, Georg, Felix Sasaki, Daniel Stein & Andreas Witt (eds.). Language technologies for a multilingual Europe: TC3 III.
6. Menzel, Katrin, Ekaterina Lapshinova-Koltunski & Kerstin Anna Kunz (eds.). New perspectives on cohesion and coherence: Implications for translation.
7. Hansen-Schirra, Silvia, Oliver Czulo & Sascha Hofmann (eds). Empirical modelling of translation and interpreting.
8. Svoboda, Tomáš, Łucja Biel & Krzysztof Łoboda (eds.). Quality aspects in institutional translation.
9. Fox, Wendy. Can integrated titles improve the viewing experience? Investigating the impact of subtitling on the reception and enjoyment of film using eye tracking and questionnaire data.
10. Moran, Steven & Michael Cysouw. The Unicode cookbook for linguists: Managing writing systems using orthography profiles.

ISSN: 2364-8899

Georg Rehm, Felix Sasaki, Daniel Stein & Andreas Witt (eds.). 2018. Language technologies for a multilingual Europe: TC3 III (Translation and Multilingual Natural Language Processing 5). Berlin: Language Science Press.

This title can be downloaded at: http://langsci-press.org/catalog/book/106
© 2018, the authors
Published under the Creative Commons Attribution 4.0 Licence (CC BY 4.0): http://creativecommons.org/licenses/by/4.0/
ISBN: 978-3-946234-73-9 (Digital), 978-3-946234-77-7 (Hardcover)
DOI:10.5281/zenodo.1291947
Source code available from www.github.com/langsci/106
Collaborative reading: paperhive.org/documents/remote?type=langsci&id=106

Cover and concept of design: Ulrike Harbort
Typesetting: Felix Kopecky, Florian Stuhlmann, Iana Stefanova, Sebastian Nordhoff, Stefanie Hegele
Proofreading: Ahmet Bilal Özdemir, Alessia Battisti, Alexis Michaud, Amr Zawawy, Anne Kilgus, Brett Reynolds, Benedikt Singpiel, David Lukeš, Eleni Koutso, Eran Asoulin, Ikmi Nur Oktavianti, Jeroen van de Weijer, Matthew Weber, Lea Schäfer, Rosetta Berger, Stathis Selimis
Fonts: Linux Libertine, Libertinus Math, Arimo, DejaVu Sans Mono
Typesetting software: XƎLaTeX

Language Science Press
Unter den Linden 6
10099 Berlin, Germany
langsci-press.org
Storage and cataloguing done by FU Berlin

Contents

Preface
1 Editorial
  Georg Rehm, Felix Sasaki, Daniel Stein & Andreas Witt
2 Machine translation: Past, present and future
  Daniel Stein
3 The META-NET strategic research agenda for language technology in Europe: An extended summary
  Georg Rehm
4 Metadata for the multilingual web
  Felix Sasaki
5 State of the art in Translation Memory Technology
  Uwe Reinke
6 Authoring support for controlled language and machine translation: A report from practice
  Melanie Siegel
7 Integration of machine translation in on-line multilingual applications: Domain adaptation
  Mirela-Ştefania Duma & Cristina Vertan
8 Disambiguate yourself: Supporting users in searching documents with query disambiguation suggestions
  Ernesto William De Luca & Christian Scheel
9 Multilingual knowledge in aligned Wiktionary and OmegaWiki for translation applications
  Michael Matuschek, Christian M. Meyer & Iryna Gurevych
10 The BerbaTek project for Basque: Promoting a less-resourced language via language technology for translation, content management and learning
  Igor Leturia, Kepa Sarasola, Xabier Arregi, Arantza Diaz de Ilarraza, Eva Navas, Iñaki Sainz, Arantza del Pozo, David Baranda & Urtza Iturraspe
Index

Preface

This book, Language Technologies for a Multilingual Europe, is a reissue of the Special Issue on Technologies for a Multilingual Europe, which was originally published as Vol. 3, No. 1 of the Open Access online journal Translation: Computation, Corpora, Cognition (TC3). After the editors of TC3 had decided to transition the journal into a different format – into the Open Access book series Translation and Multilingual Natural Language Processing – they invited us to prepare a reissue of our compilation, originally published in 2013. While several smaller typos in the original manuscripts have been fixed, the papers in this collection have not been substantially modified with regard to the original publication, which is still, for archival reasons, available at http://www.blogs.uni-mainz.de/fb06-tc3/vol-3-no-1-2013/.

Since the original publication, Multilingual Europe has made several important steps forward. A new set of EU projects on multilingual technologies and machine translation was funded in 2015, e.g., QT21, HimL and CRACKER. The Cracking the Language Barrier federation (http://www.cracking-the-language-barrier.eu) was established as an umbrella organisation for all projects and organisations working on technologies for a multilingual Europe. At the time of writing, META-NET is organising the next META-FORUM conference, which is to take place in Brussels on 13/14 November 2017.
One of the key topics of this conference is the Human Language Project – a large, coordinated funding programme spanning from education to research to innovation, which aims at bringing about the much needed boost in research and a paradigm shift in processing language automatically. First steps towards the Human Language Project were discussed at a workshop in the European Parliament in early 2017. Moreover, the Common Language Resources and Technology Infrastructure (CLARIN), founded in 2012 by ten European countries, has since doubled its number of members.

The editors of this special issue would like to thank the series editors, especially Oliver Culo, for the opportunity to publish a reissue of our original compilation.

Georg Rehm, Felix Sasaki, Daniel Stein, Andreas Witt
August 28, 2017

Chapter 1
Editorial
Georg Rehm, Felix Sasaki, Daniel Stein, Andreas Witt

The roots of this special issue of Translation: Computation, Corpora, Cognition go all the way back to 2011. At the end of September of that year, the guest editors organised a workshop at the Conference of the German Society for Computational Linguistics and Language Technology (GSCL), which took place in Hamburg. The topic of the GSCL 2011 conference – “Multilingual Resources and Multilingual Applications” – had already set the stage for our pre-conference workshop on September 27, 2011, which put special emphasis on “Language Technology for a Multilingual Europe”.

Our intention behind this workshop was to bring together various groups concerned with the umbrella topics of multilingualism and language technology, especially multilingual technologies. This encompassed, on the one hand, representatives from research and development in the field of language technologies, and on the other hand users from diverse areas such as, among others, industry, administration and funding agencies.
Two examples of language technologies that we mentioned in the call for contributions were Machine Translation and processing of texts from the humanities with methods drawn from language technology, such as automatic topic indexing and text mining, as well as integrating numerous texts and additional information across languages.

What these kinds of application areas and research and development in language technology have in common is that they either rely – critically – on language resources (lexicons, corpora, grammars, language models etc.) or produce these resources. A multilingual Europe supported by language technology is only possible if an adequate and interoperable infrastructure of resources (including the related tooling) is available for all European and other important languages. It is necessary that the aforementioned groups and other communities of developers and users of language technology stand as a single homogeneous community. Only if all members of our (quite heterogeneous and hitherto mostly fragmented) community stand together and speak with one voice will it be possible to assure the long-term political acceptance of the “Language Technology” topic in Europe.

Georg Rehm, Felix Sasaki, Daniel Stein & Andreas Witt. Editorial. In Georg Rehm, Felix Sasaki, Daniel Stein & Andreas Witt (eds.), Language technologies for a multilingual Europe: TC3 III, 1–4. Berlin: Language Science Press. DOI:10.5281/zenodo.1291922

The Workshop “Language Technology for a Multilingual Europe” was co-organised by two GSCL working groups (Text Technology and Machine Translation) and META-NET (http://www.meta-net.eu). META-NET, an EU-funded Network of Excellence, is dedicated to building the technological foundations of a multilingual European information society. To this end, META-NET