Kernerman Kdictionaries.Com/Kdn DICTIONARY News Adam Kilgarriff Prize, 2017
Total Page:16
File Type:pdf, Size:1020Kb
Number 25 ● July 2017 Kernerman kdictionaries.com/kdn DICTIONARY News Adam Kilgarriff Prize, 2017 The Adam Kilgarriff Prize was set up in 2016 Language, including an innovative corpus and a as a memorial to our multi-talented friend, who corpus-based dictionary. Dr. Rutkowski explains contributed so much to the fields of lexicography, his project's goals on the last page of this issue. corpus linguistics, and NLP. Along with the other He has accepted an invitation from the organizers Trustees, I’m delighted that the Prize has got off of eLex 2017 in Leiden to give one of the keynote to such a terrific start. Not only did we receive talks at the conference, where he will receive the eight excellent applications, but the projects they Prize in person from Adam’s spouse, Gill Lamden. were based on reflected interesting and original We congratulate all the applicants for their work across the whole spectrum of fields in which impressive submissions, and for making the first Adam himself was engaged. There were submissions in areas iteration of Adam’s prize such a success. Finally, I take this as diverse as corpus building, software tools for language opportunity to thank my fellow Trustees for the great care and research, named entity resources, translation systems, machine thoroughness which they devoted to the task of evaluating all learning, and dictionary-creation. the submissions. We are all confident that this year’s Adam With so many high-quality applications, selecting a winner Kilgarriff Prize has a worthy winner, and we look forward to was a challenging process for the six Trustees, involving inviting applications for the 2019 Prize in due course. several lengthy Skype meetings. But our eventual decision was Michael Rundell unanimous, and we’re all thrilled that the first Adam Kilgarriff Prize is going to Dr. Paweł Rutkowski. Paweł and his team The Trustees of the Adam Kilgarriff Prize are Miloš Jakubíček, at the University of Warsaw have been working for over five Ilan Kernerman, Iztok Kosem, Michael Rundell (Chair), Pavel years on the development of resources for users of Polish Sign Rychlý, and Carole Tiberius. 1 Adam Kilgarriff Prize, 2017 | Michael Rundell 2 Introducing LDL4HELTA: Linked data lexicography for high-end language technology application | Martin Kaltenböck and Ilan Kernerman th 4 Triplifying a dictionary: Some learnings | Timea Turdean and Shrikant Joshi 6 TIAD shared task 2017 – Translation Inference Across Dictionaries | Noam Ordan 25 7 Towards a module for lexicography in OntoLex | Julia Bosque-Gil, Jorge Gracia and jubilee issue Elena Montiel-Ponsoda with a focus on 10 OntoLex 2017 – 1st workshop on the OntoLex model | Philipp Cimiano lexicography and 12 KD API | Morris Alper computational linguistics 13 1st Conference on Language, Data and Knowledge – Galway 2017 | John P. McCrae, Paul Buitelaar, Christian Chiarcos and Sebastian Hellmann 13 Global WordNet Conference – Singapore 2018 | Francis Bond 14 The advent of post-editing lexicography | Miloš Jakubíček 15 Sketch Engine and Lexonomy | Ondřej Matuška 16 Phonetic transcription of dotted Hebrew | Alon Itai 18 LOTKS 2017 – Workshop on Language, Ontology, Terminology and Knowledge Structures | Fahad Khan 19 The saga of Norsk Ordbok: A scholarly dictionary for the Norwegian vernacular and the Nynorsk language | Oddrun Grønvik 24 NORDIX bilingual dictionaries for Nordic and major European languages © 2017 All rights reserved. 25 The First Century of English Monolingual Lexicography. Kusujiro Miyoshi | Michael Adams K DICTIONARIES LTD 26 eLex 2017 – Lexicography from Scratch | Carole Tiberius 8 Nahum Hanavi Street 28 Met zoveel woorden. Gids voor trefzeker taal gebruik. Rik Schutz en Ludo Permentier Tel Aviv 6350310 Israel | Anne Dykstra +972-3-5468102 29 GLOBALEX mid-2017 | Ilan Kernerman [email protected] 30 A brief account of ASIALEX 2017 | Hai Xu http://kdictionaries.com 31 Diccionarios electrónicos: perspectivas para el siglo XXI | Beatriz Sánches Cárdenas and Amelia Sanz 32 The Corpus of Polish Sign Language and the Corpus-based Dictionary of Polish Sign Language Paweł Rutkowski Editor | Ilan Kernerman | ISSN 1565-4745 2 Introducing LDL4HELTA: Linked data lexicography for high-end language technology application Martin Kaltenböck and Ilan Kernerman Background text processing (e.g. MS Word / Office 365), Business is becoming increasingly text mining, content analysis, localization, globalized, and enterprises as well as etc. New major players consist of a range public organisations are increasingly of world leading software and hardware acting in multiple areas, facing challenges corporations (Google, Apple, Microsoft, of cross-lingual and inter-cultural barriers. Facebook, SAP, IBM, Intel, Amazon, etc.) English has been crowned a global language, as well as newcomers offering innovative yet regional identities flourish as well, and solutions. this trend correlates with a rise in human and machine solutions to facilitate and enhance Semantics and lexicography communication across languages and To develop and provide satisfactory LT cultures. Looking at the European market, mechanisms and tools we need suitable for example, we see 24 working languages software as well as high quality data Martin Kaltenböck is in EU28, which make cross-border services and information in the form of corpora, Co-Founder, CFO and considerably complicated. This calls for dictionaries, word form lists and other powerful language technology (LT) and fine lexical resources. Lexicography is Managing Partner of Semantic intense efforts to enable and materialize a vital component in this ecosystem. It Web Company. He leads the the vision of a multilingual digital single follows mainly a qualitative approach LDL4HELTA project, and market (defined as a priority area of the that tends to be both time-consuming and leads and works in several European Commission, see http://ec.europa. cost-intensive. Interoperating lexicography national and international eu/priorities/digital-single-market/). with emerging Semantic Web methods research, industry and public As a result, we see a continuously and technologies is already underway, administration projects – growing LT market, primarily in Europe mainly as part of academic research, but mainly as regards project but also worldwide. The growth leads mainstream lexicographic applications management, requirements LT entrepreneurs to suggest solutions for still make little use of state-of-the-art engineering, and community data- and information-driven organisations linked data (LD) resources. LT, on the and communication activities. to work internationally, to efficiently other hand, follows mainly a quantitative He studied communication, store, access, integrate and disseminate approach along statistical and machine their data, and to allow for both inter- and learning technologies. Bridging the gap psychology and marketing at intra-organisation communication across between these qualitative and quantitative the University of Vienna. borders and language barriers by utilizing approaches is a huge challenge, and https://linkedin.com/in/ software tools. new solutions that combine these fields martinkaltenboeck/ Although the emerging LT industry is fairly successfully stand good chances to be useful young, it has rapidly changed the rules of the for the market. game and excluded major old-time players. In this context, the emerging Semantic It ranges as widely as machine translation Web, with new LD and semantic (e.g. Google Translate, Translation Memory technologies, offers innovative means for tools), speech technologies (e.g. Apple’s Siri information and data retrieval, knowledge digital assistant), education (e.g. e-learning), management systems, and other applications for lexical data exchange and integration. However, while existing methodologies are becoming mature, they still lack sufficient Semantic Web Company is a leading provider of graph-based metadata, refinements for data quality mechanisms, search and analytic solutions, based in Vienna. Its expert team provides provenance methods and security issues. consulting and integration services for semantic data and information There are new RandD initiatives that aim management, and supports customers mainly in North America, Europe to bridge the gap between these disciplines, 2017 and Australia, including global 500 companies. It has recently been but only a few commercial applications. July named on KMWorld’s 2017 list of the 100 Companies That Matter in Groundbreaking projects include LIDER Knowledge Management (after being listed also in 2015 and 2016). (http://lider-project.eu/) and LOD2 – https://semantic-web.com Creating Knowledge out of Interlinked Data, which included the development of PoolParty Semantic Suite is a world-class semantic technology tool that 45+ LOD software components (http:// offers sharply focused solutions for knowledge organization and content lod2.eu). Several freely available sources business. As a semantic middleware, PoolParty enriches information with (and semi-open lexicographic resources) valuable metadata and links business and content assets automatically. have also been developed, such as the http://poolparty.biz Linguistic Linked Open Data Cloud (http:// Kernerman Dictionary News, linguistic-lod.org/llod-cloud), WordNet 3 (https://wordnet.princeton.edu/), BabelNet In order to provide the above-mentioned (http://babelnet.org/), or DBpedia (http:// solutions, an integrated multilingual dbpedia.org). Although these sources metadata