Language Technologies for a Multilingual Europe
Recommended publications
A Collaborative Platform for Multilingual Ontology
PhD Dissertation. International Doctorate School in Information and Communication Technologies, DIT - University of Trento. A COLLABORATIVE PLATFORM FOR MULTILINGUAL ONTOLOGY DEVELOPMENT. Ahmed Maher Ahmed Tawfik Moustafa. Advisor: Professor Fausto Giunchiglia, Università degli Studi di Trento.
Abstract: The world is extremely diverse, and its diversity is obvious in the cultural differences and the large number of languages spoken all over the world. In this sense, we need to collect and organize a huge amount of knowledge obtained from multiple resources that differ from one another in many aspects. A possible approach is to design effective tools for the construction and maintenance of linguistic resources and localized domain ontologies, based on well-defined knowledge representation methodologies capable of dealing with diversity and the continuous evolution of human knowledge. In this thesis, we present a collaborative platform which allows for knowledge organization in a language-independent manner and provides the appropriate mapping from a language-independent concept to one specific lexicalization per language. This representation ensures a smooth multilingual enrichment process for linguistic resources and a robust construction of ontologies using language-independent concepts. The collaborative platform is designed following a workflow-based development methodology that models linguistic resources as a set of collaborative objects and assigns a customizable workflow to build and maintain each collaborative object in a community-driven manner, with extensive support for modern Web 2.0 social and collaborative features.
Keywords: Knowledge Representation, Multilingual Resources, Ontology Development, Computer Supported Collaborative Work
Acknowledgments: I am particularly grateful to my supervisor, Professor Fausto Giunchiglia, for the guidance and advice he has provided throughout my time as a PhD student.
Latent Semantic Network Induction in the Context of Linked Example Senses
Latent semantic network induction in the context of linked example senses. Hunter Scott Heidenreich (Department of Computer Science, College of Computing and Informatics) and Jake Ryland Williams (Department of Information Science, College of Computing and Informatics).
Abstract: The Princeton WordNet is a powerful tool for studying language and developing natural language processing algorithms. With significant work developing it further, one line considers its extension through aligning its expert-annotated structure with other lexical resources. In contrast, this work explores a completely data-driven approach to network construction, forming a wordnet using the entirety of the open-source, noisy, user-annotated dictionary, Wiktionary. Comparing baselines to WordNet, we find compelling evidence that our network induction process constructs a network with useful semantic structure. With thousands of semantically-linked examples that demonstrate sense usage from basic lemmas to multiword expressions (MWEs), we believe this work motivates future research.
1 Introduction: […] of a network, much like the Princeton WordNet (Miller, 1995; Fellbaum, 1998), that is constructed solely from the semi-structured data of Wiktionary. This relies on the noisy annotations of the editors of Wiktionary to naturally induce a network over the entirety of the English portion of Wiktionary. In doing so, the development of this work produces:
• an induced network over Wiktionary, enriched with semantically linked examples, forming a directed acyclic graph (DAG);
• an exploration of the task of relationship disambiguation as a means to induce network construction; and
• an outline for directions of expansion, including increasing precision in disambiguation, cross-linking example usages, and aligning English Wiktionary with other languages.
Visualization Design for a Web Interface to the Large-Scale Linked Lexical Resource UBY
Visualization Design for a Web Interface to the Large-Scale Linked Lexical Resource UBY. We present the results of a collaboration of visualization experts and computational linguists which aimed at the re-design of the visualization component in the Web user interface (Web UI) to the large-scale linked lexical resource UBY. UBY combines a wide range of information from expert-constructed (e.g., WordNet, FrameNet, VerbNet) and collaboratively constructed (e.g., Wiktionary, Wikipedia) resources for English and German, see https://www.ukp.tu-darmstadt.de/uby. All resources contained in UBY distinguish not only different words but also their senses. A distinguishing feature of UBY is that the different resources are aligned to each other at the word sense level, i.e. there are links connecting equivalent word senses from different resources in UBY. For senses that are linked, information from the aligned resources can be accessed and the resulting enriched sense representations can be used to enhance the performance of Natural Language Processing tasks. Targeted user groups of the UBY Web UI are researchers in the field of Natural Language Processing and in the Digital Humanities (e.g., lexicographers, linguists). In the context of exploring the usually large number of senses for an arbitrary search word, the UBY Web UI should support these user groups in assessing the added value of sense links for particular applications. It is important to emphasize that this is an open research question for most applications. We will present the results of our detailed requirements analysis that revealed a number of central requirements a visualization of all the senses for a given search word and the links between them must meet in order to be useful for this purpose.
A Large, Interlinked, Syntactically-Rich Lexical Resource for Ontologies
Semantic Web 0 (0) 1, IOS Press. lemonUby - a large, interlinked, syntactically-rich lexical resource for ontologies. Judith Eckle-Kohler (a,∗), John Philip McCrae (b) and Christian Chiarcos (c). (a) Ubiquitous Knowledge Processing (UKP) Lab, Department of Computer Science, Technische Universität Darmstadt and Information Center for Education, German Institute for International Educational Research, Germany, http://www.ukp.tu-darmstadt.de; (b) Cognitive Interaction Technology (CITEC), Semantic Computing Group, Universität Bielefeld, Germany, http://www.sc.cit-ec.uni-bielefeld.de; (c) Applied Computational Linguistics (ACoLi), Department of Computer Science and Mathematics, Goethe-University Frankfurt am Main, Germany, http://acoli.cs.uni-frankfurt.de
Abstract. We introduce lemonUby, a new lexical resource integrated in the Semantic Web which is the result of converting data extracted from the existing large-scale linked lexical resource UBY to the lemon lexicon model. The following data from UBY were converted: WordNet, FrameNet, VerbNet, English and German Wiktionary, the English and German entries of OmegaWiki, as well as links between pairs of these lexicons at the word sense level (links between VerbNet and FrameNet, VerbNet and WordNet, WordNet and FrameNet, WordNet and Wiktionary, WordNet and German OmegaWiki). We linked lemonUby to other lexical resources and linguistic terminology repositories in the Linguistic Linked Open Data cloud and outline possible applications of this new dataset.
Keywords: Lexicon model, lemon, UBY-LMF, UBY, OLiA, ISOcat, WordNet, VerbNet, FrameNet, Wiktionary, OmegaWiki
1. Introduction. Recently, the language resource community has begun to explore the opportunities offered by the Semantic […] numerous mappings and linkings of lexica, as well as standards for representing lexical resources, such as the ISO 24613:2008 Lexical Markup Framework (LMF) [13].
ZONE DE POLICE DE GAUME Chiny, Etalle, Florenville, Meix-Devant-Virton, Rouvroy, Tintigny, Virton
Confidential. Respect, Solidarity, Spirit of Service. ZONE DE POLICE DE GAUME: Chiny, Etalle, Florenville, Meix-devant-Virton, Rouvroy, Tintigny, Virton. REQUEST FORM FOR HOME SURVEILLANCE DURING AN ABSENCE. Home surveillance is a completely free service offered by the Zone de Police de Gaume. Every citizen residing in this territory has the right to request it. Although any information about your absence is useful to us for prevention purposes, we do not guarantee surveillance for an absence shorter than 5 days or longer than one month.
Surname: First name: Date of birth: Address: Postal code: Town: Telephone / mobile: E-mail address: Type of dwelling: Other places to monitor (shop, warehouse, workshop, garden shed…): Date and time of departure: around … o'clock. Date and time of return: around … o'clock. Contact details (address of your destination and/or telephone number):
Contact person (surname and first name): Address: Postal code: Town: Telephone: Has the house keys: YES / NO. Has the alarm code: YES / NO.
Vehicle(s) in the garage or in front of the house: Description(s): Alarm system installed: YES / NO. Model: Installer: Timer inside the dwelling (switching on lighting): YES / NO, set for: Security or deterrent lighting outside: YES / NO. Guard dog or other animals: YES / NO. Characteristics: I would like the police zone's burglary prevention adviser to visit my home: YES / NO. Other prevention or security measures (private guards, registered valuables, safes emptied beforehand, additional lock on the exterior door, a person (neighbour) to mow the lawn, collect the mail or look after the animals):
APPLICANT'S DECLARATION. By this form, I request police surveillance of my home during the period indicated.
Mining Translations from the Web of Open Linked Data
Mining translations from the web of open linked data. John Philip McCrae (University of Bielefeld) and Philipp Cimiano (University of Bielefeld).
Abstract: In this paper we consider the prospect of extracting translations for words from the web of linked data. By searching for entities that have labels in both English and German we extract 665,000 translations. We then also consider a linguistic linked data resource, lemonUby, from which we extract a further 115,000 translations. We combine these translations with the Moses statistical machine translation, and we show that the translations extracted from the linked data can be used to improve the translation of unknown words.
1 Introduction: In recent years there has been a massive explosion in the amount and quality of data available as linked data on the web. […] OmegaWiki on the web of data should ameliorate the process of harvesting translations from these resources. We consider two sources for translations from linked data: firstly, we consider mining labels for concepts from the data contained in the 2010 Billion Triple Challenge (BTC) data set, as well as DBpedia (Auer et al., 2007) and FreeBase (Bollacker et al., 2008). Secondly, we mine translations from lemonUby (Eckle-Kohler et al., 2013), a resource that integrates a number of distinct dictionary language resources in the lemon (Lexicon Model for Ontologies) format (McCrae et al., 2012), which is a model for representing rich lexical information including forms, sense, morphology and syntax of ontology labels. We then consider the process of including these extra translations into an existing translation system, namely
Vous Désirez En Savoir Plus Sur L'épi ? Liste Des Prestataires
WOULD YOU LIKE TO KNOW MORE ABOUT THE ÉPI? www.enepisdubonsens.org
THERE YOU WILL FIND:
✔ The option of becoming a member of the Épi Lorrain online.
✔ The Épi project explained, and its news.
✔ The directory of Épi providers on an interactive map, sorted by category, with the option of downloading this regularly updated list.
✔ Relevant, alternative background information on currencies, the economy, finance, money, ...
✔ The Épi calendar.
L'ÉPI LORRAIN ASBL, 27, RUE DE VIRTON, 6769 MEIX-DEVANT-VIRTON. Tel.: 0473/ 422 330. Website: www.enepisdubonsens.org
LIST OF PROVIDERS & EXCHANGE COUNTERS: AROUND ARLON and AUBANGE
Exchange counters:
ARLON: DU TIERS ET DU QUART, 153 rue de Neufchâteau
ARLON: EPICES & TOUT, 103 rue des faubourgs
ARLON (Toernich): COMPTOIR DE LA CIGOGNE, 30 rue haute
Providers:
Les Bacchanales (cultural café), 37 Grand place, Arlon
BENTZ Bougies & Pompes Funèbres, 13 rue du Marché au Beurre, Arlon
Ferme les Bleuets (eggs), 101 rue Welschen, Longeau
BIO-Lorraine (organic vegetables), 17 Fahrengrund, Viville
Brasserie d'Arlon (craft brewery), 57 rue Godefroid Kurth, Arlon
Le Comptoir de la Cigogne (delicatessen), 30 rue Haute, Toernich
Créativlabels (window and wall stickers), 5 rue de la scierie, Frassem
Dijest Finger food truck, mobile around Arlon and the surrounding area
Envie d'apprendre (school coach), 39 Rue du Maitrank, Bonnert
Ecobati (ecological building materials), 119 rue de la Semois, Arlon
Épices & tout (grocery
Parc Naturel De Gaume »
18 December 2014. Order of the Walloon Government establishing the « Parc naturel de Gaume ». The Walloon Government, Having regard to the decree of 16 July 1985 on nature parks, Articles 2, 3, 4, 5 and 6; Having regard to Articles D.29-1 to D.29-24, D. 49 to 61 and R. 46, 47 and 49 of Book I of the Environmental Code; Having regard to the constitution, on 1 August 2012, of a project association within the meaning of Article L 1512-2 of the Code of Local Democracy and Decentralisation for the creation of the « Parc naturel de Gaume », bringing together the municipalities of Aubange, Etalle, Florenville, Meix-devant-Virton, Musson, Rouvroy, Saint-Léger, Tintigny and Virton; whereas this project association constitutes the organising authority of the « Parc naturel de Gaume »; Whereas a study committee was set up on 1 August 2012 by the organising authority; whereas this study committee drew up a report on the creation of the nature park in accordance with Article 3, paragraph 2 of the decree of 16 July 1985 on nature parks; whereas this report was presented to the organising authority on 20 December 2012; Whereas, on this basis, the organising authority drew up a proposal for the creation of a nature park covering the name, the boundaries, the management plan of the nature park and the inclusion of all or part of the territory of the nature park within a perimeter where the General Regulation on Buildings in Rural Areas (Règlement général sur les bâtisses en site rural) applies; Having regard to the favourable opinions of the municipal councils of the municipalities of Aubange, Etalle, Florenville, Meix-devant-Virton,
The Gelao Languages: Preliminary Classification and State of the Art Andrew Hsiu Center for Research in Computational Linguistics (CRCL), Bangkok, Thailand
The Gelao languages: Preliminary classification and state of the art. Andrew Hsiu, Center for Research in Computational Linguistics (CRCL), Bangkok, Thailand. SEALS XXIII, May 2013, Chulalongkorn University, Bangkok, Thailand.
Gelao’s position in Kra-Dai. Kra-Dai (Tai-Kadai) primary branches: Tai, Hlai, Ong Be, Kam-Sui, Kra. Source: Ostapirat (2000).
Goals of this talk:
• Give a demographic overview of a little-known, but very important, language group
• Put together linguistics, anthropology, and history (mostly in Chinese) to get the “big picture”
• Propose a new preliminary internal classification of Gelao that takes recent data (post-2000) into account
• Explore some of Gelao’s interactions with neighboring languages
• Explain why Gelao is in urgent need of more scholarly attention
Why Gelao? Why Gelao deserves much more attention in Kra-Dai studies:
1. Phylogenetic position: Divergent within KD
2. Internal diversity: Many distinct varieties, most likely around 10 languages (mutually unintelligible lects)
3. Historical importance: Major ethnic group in Chinese historical sources; known as Geliao 仡僚, etc.
4. Large present-day ethnic presence: Ethnic population of 500,000; speaker population: < 6,000.
5. Language endangerment: All Gelao varieties (except Central Gelao and Judu Gelao) are moribund with < 50 speakers left; fieldwork is extremely urgent.
Internal diversity. Gelao is by far the most internally diverse group of Kra languages. Remaining 3 Kra groups: relatively little internal variation – Buyang cluster (6 langs.): Paha, Ecun, Langjia,
Navigating Sense-Aligned Lexical-Semantic Resources: The Web Interface to UBY (Proceedings of KONVENS 2012, Main Track: Poster Presentations, Vienna, September 19, 2012)
Navigating Sense-Aligned Lexical-Semantic Resources: The Web Interface to UBY. Iryna Gurevych (1,2), Michael Matuschek (1), Tri-Duc Nghiem (1), Judith Eckle-Kohler (1), Silvana Hartmann (1), Christian M. Meyer (1). (1) Ubiquitous Knowledge Processing Lab (UKP-TUDA), Department of Computer Science, Technische Universität Darmstadt; (2) Ubiquitous Knowledge Processing Lab (UKP-DIPF), German Institute for Educational Research and Educational Information. http://www.ukp.tu-darmstadt.de
Abstract: In this paper, we present the Web interface to UBY, a large-scale lexical resource based on the Lexical Markup Framework (LMF). UBY contains interoperable versions of nine resources in two languages. The interface allows users to conveniently examine and navigate the encoded information in UBY across resource boundaries. Its main contributions are twofold: 1) The visual view allows users to examine the sense clusters for a lemma induced by alignments between different resources at the level of word senses. 2) The textual view uniformly presents senses from different resources in detail and offers the possibility to directly compare them in a parallel view.
[…] to the large sense-aligned LSR UBY (Gurevych et al., 2012). UBY is represented in compliance with the ISO standard LMF (Francopoulo et al., 2006) and currently contains interoperable versions of nine heterogeneous LSRs in two languages, as well as pairwise sense alignments for a subset of them: English WordNet (WN), Wiktionary (WKT-en), Wikipedia (WP-en), FrameNet (FN), and VerbNet (VN); German Wiktionary (WKT-de), Wikipedia (WP-de), and GermaNet (GN), and the English and German entries of OmegaWiki (OW-en/de). The novel aspects of our interface can be summarized as 1) A graph-based visualization of sense alignments between the LSRs integrated in
Country Pasture/Forage Resource Profiles BELGIUM
Country Pasture/Forage Resource Profiles: BELGIUM, by Alain Peeters.
The designations employed and the presentation of material in this information product do not imply the expression of any opinion whatsoever on the part of the Food and Agriculture Organization of the United Nations (FAO) concerning the legal or development status of any country, territory, city or area or of its authorities, or concerning the delimitation of its frontiers or boundaries. The mention of specific companies or products of manufacturers, whether or not these have been patented, does not imply that these have been endorsed or recommended by FAO in preference to others of a similar nature that are not mentioned. The views expressed in this information product are those of the author(s) and do not necessarily reflect the views of FAO.
All rights reserved. FAO encourages the reproduction and dissemination of material in this information product. Non-commercial uses will be authorized free of charge, upon request. Reproduction for resale or other commercial purposes, including educational purposes, may incur fees. Applications for permission to reproduce or disseminate FAO copyright materials, and all queries concerning rights and licences, should be addressed by e-mail to [email protected] or to the Chief, Publishing Policy and Support Branch, Office of Knowledge Exchange, Research and Extension, FAO, Viale delle Terme di Caracalla, 00153 Rome, Italy. © FAO 2010
CONTENTS
1. INTRODUCTION: Country location
2. SOILS AND TOPOGRAPHY: Topography and geology; Soil types
3. CLIMATE AND AGRO-ECOLOGICAL ZONES: Climate; Geographical regions; Agro-ecological regions; Forests
4.
Het Land Van De Gaume
Walk + 1 bike ride. The Land of the Gaume: charming ruins. Photos: Samyn – David © WBT.
The Gaume stands in sharp contrast to its neighbour, the Ardennes. The Ardennes massif is strongly outlined, whereas the relief of Belgian Lorraine rolls gently and usually basks in a mild sun. Both the sensuousness of the landscape and the attractive, charming villages echo the gentle climate.
[Map, 55 km: Montauban, Orval, Chiny, Habay, Florenville, Tintigny, Etalle, Gérouville, N840, Ethe, Rouvroy, Virton, Avioth, Montquintin, Montmédy, Torgny]
2. Chiny. In Chiny the Semois brings life to the landscape. In summer the river between Chiny and Lacuisine is plied by flat-bottomed boats. The ‘Passeurs réunis’ take you along an eight-kilometre route on an unspoilt Semois, where only the lapping of the water breaks the silence. The municipality has a rich history. In the Middle Ages the counts of Chiny were very powerful and ruled over the whole region. From the castle overlooking the Semois, the grooms descended via underground passages to the river to water the horses. Photo: © WBT - P. Pauquay.
1. The Montauban site. On a rocky spur and protected by the forest, the cyclopean walls blend into the setting. The Montauban site has known three successive periods of occupation. […] were discovered on site. Educational panels are spread across the site. In the valley below, the calm of the magnificent beech avenues contrasts with the buzzing activity of