Towards an Open-Source Universal-Dependency Treebank for Erzya
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Open-Source Morphology for Endangered Mordvinic Languages
Open-Source Morphology for Endangered Mordvinic Languages Jack Rueter Mika Hämäläinen Niko Partanen Dept. of Digital Humanities Dept. of Digital Humanities Dept. of Finnish, University of Helsinki University of Helsinki Finno-Ugrian [email protected] and Rootroo Ltd and Scandinavian Studies [email protected] University of Helsinki [email protected] Abstract languages. 2002 saw the publication of the first monolingual dictionary of Erzya (Abramov, 2002), This document describes shared development and the manuscript was proclaimed open by the of finite-state description of two closely re- author for future development. The Mordvin lan- lated but endangered minority languages, Erzya and Moksha. It touches upon mor- guages have continued to receive a fair share of pholexical unity and diversity of the two lan- linguistic research interest in the recent years (Luu- guages and how this provides a motivation for tonen, 2014; Hamari and Aasmäe, 2015; Kashkin shared open-source FST development. We de- and Nikiforova, 2015; Grünthal, 2016). scribe how we have designed the transducers After the release first finite-state transducer for so that they can benefit from existing open- the closely related Komi-Zyrian (Rueter, 2000), source infrastructures and are as reusable as possible. it was only obvious that similar work should be done for Erzya Mordvin. Fortunately, over the 1 Introduction past decade there has been an increasing number of publications on Erzya, relating to its morphol- There are over 5000 languages spoken world wide, ogy (Rueter, 2010), its OCR tools (Silfverberg and and a vast majority of them are endangered (see Rueter, 2015) and universal dependencies (Rueter Moseley 2010). -
Финно-Угроведение, 2020 (№ 61) Научный Журнал Основан В Феврале 1994 Г
ФИННО-УГРОВЕДЕНИЕ, 2020 (№ 61) Научный журнал Основан в феврале 1994 г. ISSN 2312-0312 (Print), ISSN 2713-2250 (Online) Учредитель – Государственное бюджетное научное учреждение при Правительстве Республики Марий Эл «Марийский научно-исследовательский институт языка, литературы и истории им. В.М. Васильева» Редакционный совет: Кузьмин Евгений Петрович, председатель ред. совета, канд. ист. н., директор МарНИИЯЛИ (г. Йошкар-Ола) Пенькова Мария Викентьевна, секретарь ред. совета, канд. филол. н., зам. директора-ученый секретарь МарНИИЯЛИ (г. Йошкар-Ола) Агранат Татьяна Борисовна, д-р филол. н., ведущий научный сотрудник Института языкознания РАН (г. Москва) Воронцов Владимир Степанович, канд. ист. н., старший научный сотрудник Удмуртского института истории, языка и литературы УрО РАН (г. Ижевск) Зайцева Нина Григорьевна, д-р филол. н., зав. сектором языкознания Института языка, литературы и истории КарНЦ РАН (г. Петрозаводск) Косинцева Елена Викторовна, д-р филол. н., доцент, главный научный сотрудник Обско-угорского института прикладных исследований и разработок (г. Ханты-Мансийск) Крыласова Наталья Борисовна, д-р ист. н., главный научный сотрудник Пермского федерального исследовательского центра УрО РАН (г. Пермь) Куршева Галина Александровна, д-р ист. н., директор Научно-исследовательского института гуманитарных наук при Правительстве Республики Мордовия (г. Саранск) Лехтинен Илдико, д-р философии, доцент Хельсинкского университета (г. Хельсинки, Финляндия) Ошаев Алексей Григорьевич, канд. ист. н., декан историко-филологического факультета Марийского государственного университета (г. Йошкар-Ола) Пустаи Янош, профессор, д-р философии, председатель Ассоциации финно-угорских литератур, директор Института Collegium Fenno-Ugricum (г. Будапешт, Венгрия) Ракин Анатолий Николаевич, д-р филол. н., главный научный сотрудник Института языка, литературы и истории Коми НЦ УрО РАН (г. Сыктывкар) Сергеев Олег Арсентьевич, канд. филол. н., старший научный сотрудник МарНИИЯЛИ (г. -
Multilingual Facilitation
Multilingual Facilitation Honoring the career of Jack Rueter Mika Hämäläinen, Niko Partanen and Khalid Alnajjar (eds.) Multilingual Facilitation This book has been authored for Jack Rueter in honor of his 60th birthday. Mika Hämäläinen, Niko Partanen and Khalid Alnajjar (eds.) All papers accepted to appear in this book have undergone a rigorous peer review to ensure high scientific quality. The call for papers has been open to anyone interested. We have accepted submissions in any language that Jack Rueter speaks. Hämäläinen, M., Partanen N., & Alnajjar K. (eds.) (2021) Multilingual Facilitation. University of Helsinki Library. ISBN (print) 979-871-33-6227-0 (Independently published) ISBN (electronic) 978-951-51-5025-7 (University of Helsinki Library) DOI: https://doi.org/10.31885/9789515150257 The contents of this book have been published under the CC BY 4.0 license1. 1 https://creativecommons.org/licenses/by/4.0/ Tabula Gratulatoria Jack Rueter has been in an important figure in our academic lives and we would like to congratulate him on his 60th birthday. Mika Hämäläinen, University of Helsinki Niko Partanen, University of Helsinki Khalid Alnajjar, University of Helsinki Alexandra Kellner, Valtioneuvoston kanslia Anssi Yli-Jyrä, University of Helsinki Cornelius Hasselblatt Elena Skribnik, LMU München Eric & Joel Rueter Heidi Jauhiainen, University of Helsinki Helene Sterr Henry Ivan Rueter Irma Reijonen, Kansalliskirjasto Janne Saarikivi, Helsingin yliopisto Jeremy Bradley, University of Vienna Jörg Tiedemann, University of Helsinki Joshua Wilbur, Tartu Ülikool Juha Kuokkala, Helsingin yliopisto Jukka Mettovaara, Oulun yliopisto Jussi-Pekka Hakkarainen, Kansalliskirjasto Jussi Ylikoski, University of Oulu Kaisla Kaheinen, Helsingin yliopisto Karina Lukin, University of Helsinki Larry Rueter LI Līvõd institūt Lotta Jalava, Kotimaisten kielten keskus Mans Hulden, University of Colorado Marcus & Jackie James Mari Siiroinen, Helsingin yliopisto Marja Lappalainen, M. -
FENNO SUECANA FENNO-UGRICA SUECANA Nova Series 15 UGRICA
FENNO-UGRICA SUECANA Nova series Journal of Fenno -Ugric R esearch in Scandinavia 15 Institutionen för slaviska och baltiska språk, finska, nederländska och tyska Stockholm 2016 FENNO-UGRICA SUECANA Nova series Journal of Fenno-Ugric Research in Scandinavia 15 Editor-in-chief: Jarmo Lainio Issue editors: Peter S. Piispanen & Merlijn de Smit Editorial board: Jarmo Lainio, Stockholm Peter S. Piispanen, Stockholm Merlijn de Smit, Stockholm/Turku Stockholm 2016 © 2016 The authors Institutionen för slaviska och baltiska språk, finska, nederländska och tyska ISSN 0348-3045 ISBN 978-91-981559-0-7 FENNO-UGRICA SUECANA – Nova series 15 ARTICLES • Peter Piispanen : Statistical Dating of Finno-Mordvinic Languages through Comparative Linguistics and Sound Laws, p. 1 – 58 • Ante Aikio & Jussi Ylikoski: The origin of the Finnic l-cases, p. 59 – 158 • Håkan Rydving: Sydsamisk eller umesamisk? ”Södra Tärna” i det samiska språklandskapet, p. 159 – 174 • Riitta-Liisa Valijärvi : Ruotsinsuomalaisten opiskelijoiden kirjallisen tuotoksen morfosyntaksin ja sanaston virheanalyysia, p. 175 – 200 REVIEW No reviews in this issue. REPORTS • Lasse Vuorsola : Atmosfärförändring inom klimatdebatten, p. 201 – 207 REVIEW ARTICLE No review articles in this issue. FUS 15, 2016, ISSN 0348-3045, ISBN 978-91-981559-0-7 (online) Fenno-Ugrica Suecana – Nova Series This is the second issue of the revitalized Fenno-Ugrica Suecana series (now additionally termed ‘Nova Series‘), which continues to focus on issues concerning Fenno-Ugristics and Fennistics, in a wide sense. We invite researchers, teachers and other scientifically interested persons to send us contributions for publication. The planned annual deadlines are October 15th in the fall and February 15th in the spring. -
Contact-Driven Multilingual Practices (Helsinki, 1‒2 June 2017) Book of Abstracts
Contact-Driven Multilingual Practices (Helsinki, 1‒2 June 2017) Book of abstracts Evgeniya Aleshinskaya ..................................................................................................................... 2 Jenni Alisaari, Leena Maria Heikkola, Emmanuel Opoku Acquah ..................................................... 3 Djegdjiga Amazouz, Martine Adda-Decker ........................................................................................ 4 Timofey Arkhangelskiy, Maria Usacheva .......................................................................................... 6 Hiwa Asadpour ................................................................................................................................. 7 Victor Bayda ..................................................................................................................................... 8 Ingeborg Birnie ................................................................................................................................. 8 Urjani Chakravarty ............................................................................................................................ 9 Svetlana Edygarova ........................................................................................................................ 10 Martin Ehala ................................................................................................................................... 11 M. M. Jocelyne Fernandez-Vest .................................................................................................... -
Περì O̓ρθότητος Ἐ Τύμων Uusiutuva Uralilainen Etymologia
ΠΕΡI OΡΘΌΤΗΤΌΣ EΤΎΜΩΝ UUSIUTUVA URALILAINEN ETYMOLOGIA Uralica Helsingiensia11 Περì ̓ ρθότητοςo ἐ τύμων Uusiutuva uralilainen etymologia TOIMITTANEET / EDITED BY SAMPSA HOLOPAINEN & JANNE SAARIKIVI HELSINKI 2018 Sampsa Holopainen & Janne Saarikivi (eds): Περı` o̓ ρθότητος ἐ τύμων. Uusiutuva uralilainen etymologia. Uralica Helsingiensia 11. Cover Rigina Ajanki Layout Mari Saraheimo English proofreading Alexandra Kellner ISBN 978-952-7262-04-7 (print) ISBN 978-952-7262-05-4 (online) ISSN 1797-3945 Orders • Tilaukset Printon Tiedekirja <www.tiedekirja.fi> Tallinn 2018 Snellmaninkatu 13 <[email protected]> FI-00170 Helsinki Uralica Helsingiensia Uralica Helsingiensia is a series published by the Finno-Ugrian Society. It features mono- graphs and thematic collections of articles with a research focus on Uralic languages, and it also covers the linguistic and cultural aspects of Estonian, Hungarian and Saami studies at the University of Helsinki. The series has a peer review system, i.e. the manuscripts of all articles and monographs submitted for publication will be refereed by two anonymous reviewers before binding decisions are made concerning the publication of the material. Uralica Helsingiensia on Suomalais-Ugrilaisen Seuran julkaisema sarja, jossa ilmestyy mono- grafioita ja temaattisia artikkelikokoelmia. Niiden aihepiiri liittyy uralilaisten kielten tutkimuk- seen ja kattaa myös Helsingin yliopistossa tehtävän Viron kielen ja kulttuurin, Unkarin kielen ja kulttuurin sekä saamentutkimuksen. Sarjassa noudatetaan refereekäytäntöä, eli -
Book of Abstracts
Congressus Duodecimus Internationalis Fenno-Ugristarum, Oulu 2015 Book of Abstracts Edited by Harri Mantila Jari Sivonen Sisko Brunni Kaisa Leinonen Santeri Palviainen University of Oulu, 2015 Oulun yliopisto, 2015 Photographs: © Oulun kaupunki ja Oulun yliopisto ISBN: 978-952-62-0851-0 Juvenes Print This book of abstracts contains all the abstracts of CIFU XII presentations that were accepted. Chapter 1 includes the abstracts of the plenary presentations, chapter 2 the abstracts of the general session papers and chapter 3 the abstracts of the papers submitted to the symposia. The abstracts are presented in alphabetical order by authors' last names except the plenary abstracts, which are in the order of their presentation in the Congress. The abstracts are in English. Titles in the language of presentation are given in brackets. We have retained the transliteration of the names from Cyrillic to Latin script as it was in the original papers. Table of Contents 1 Plenary presentations 7 2 Section presentations 19 3 Symposia 197 Symp. 1. Change of Finnic languages in a multilinguistic environment .......................................................................... 199 Symp. 2. Multilingual practices and code-switching in Finno-Ugric communities .......................................................................... 213 Symp. 3. From spoken Baltic-Finnic vernaculars to their national standardizations and new literary languages – cancelled ...... 231 Symp. 4. The syntax of Samoyedic and Ob-Ugric languages ...... 231 Symp. 5. The development -
Kone Foundation Te Language Programme 2012–2016
Kone Foundation !e Language Programme – Kone Foundation !e Language Programme .. Patrik Söderlund Mynäprint, Mynämäki "e Kone Foundation Language Programme – !"#$%#$& Background Kone Foundation Places Emphasis on the Minority Languages of the Baltic Region and on Finnish Supporting Multilingualism with the Help of the Language Programme "e Current Support for Finno-Ugrian Minorities and Languages Mission Vision "e Mission: Documentation of Small Finno-Ugrian Languages, Finnish, and Minority Languages in Finland Enhancing the Availability, Accessibility and Usability of Corpora Taking into Consideration All Kinds of Language Users Advancing a Culture of Openness and Interaction Advancing Multidisciplinarity and Crossing New Boundaries How Are the Objectives Achieved? Concrete Measures – Communication and Cooperation between Projects Projects Funded "rough Annual Grant Application Rounds, Specially Allocated Grant Calls, and Projects Initiated by the Foundation An Example of the Use and Development of Language Technology Applications Monitoring the Programme Koneen Säätiön ovat perustaneet vuorineuvos Heikki H. Herlin ja eko- nomi Pekka Herlin vuonna . Perustajat toimivat samanaikaisesti sekä säätiön että Kone Osakeyhtiön johdossa, joten kahden organisaation yh- teys oli läheinen. Nykyisin Koneen Säätiö on itsenäinen ja riippumaton yleishyödyllinen organisaatio, joka edistää tieteellistä tutkimustyötä, eri- tyisesti humanistista, yhteiskuntatieteellistä ja ympäristöntutkimusta, sekä taidetta ja kulttuuria Suomessa. -
Erzya E-Learning Course, Grammatical Description
ERZYA E-LEARNING COURSE, GRAMMATICAL DESCRIPTION Sirkka Saarinen The European Commission support for the production of this publication does not constitute an endorsement of the contents which reflects the views only of the authors, and the Commission cannot be held responsible for any use which may be made of the information contained therein. Sirkka Saarinen E-learning course: Erzya THE MORDVINS Moksha Mordvin girls Flag of the Mordvin Republic Erzya Mordvin women Mordvins = Erzyas and Mokshas The Mordvins are divided into two ethnic groups, Erzyas and Mokshas. The Mordvins reside in their own republic which is part of the Russian Federation (25,266 km2), in the territory between the Volga tributaries, the Oka and the Sura. Only 27% of the Mordvins dwell in the titular republic; the majority live in scattered enclaves over a very extensive area to the north and south of the borders of the republic as well as widely in the zone reaching to the Ural Mountains. According to the last Soviet census (1989) there were 1,154,000 Mordvins, although of these only 67% spoke their mother tongue Erzya or Moksha. The second Russian census (2010) estimated the number of Mordvins to have decreased by over 30 %, that is, to 806,000. About two-thirds of the Mordvins speak Erzya and one-third Moksha, which to some extent are mutually understandable. Literary languages for both groups were developed in the 1920s. The Mordvin languages belong to the Volgaic branch of the Finno-Ugric languages. The Russians began converting the Mordvins to the Orthodox faith as early as the 15th century. -
The Grown-Up Siblings: History and Functions of Western Uralic *Kse
Rigina Ajanki University of Helsinki The grown-up siblings: history and functions of Western Uralic *kse In this paper, it is claimed that the case suffix *kse, known as translative, dates back to the Finnic-Mordvin proto language, where it functioned as a functive� It is illustrated using synchronic data from Finnic-Mordvin languages that the functions of *kse do not display an inherent feature of directionality ‘into’, or in other terms, lative� It is even possible that the suffix was neutral with respect to time stability, as it is in contemporary Erzya� Further, it is assumed that since the Northern Finnic languages have acquired a new stative case, the functive labelled essive *nA, formerly applied as an intralocal case, the functions of *kse have changed in these languages: *kse has become mainly the marker of a transformative, with an inherent feature of dynamicity� 1� Introduction 5�1� Translative with 2� Typological background: stative copula in Finnish: *kse as a functive similatives and functives 3� Translatives in the case systems 5�2� Finnish ditransitive of Finnic-Mordvin languages constructions displaying 3�1� The Finnish case translative-essive variation system revisited 5�3� Erzya and Finnish expressions 3�2� Functives as of order in translative non-verbal predicates 6� The developmental path 4� The emergence of *kse of translative *kse as a case suffix 7� The emergence of essive 5� The functions of *kse and its consequences for in contemporary the functions of *kse Finnic-Mordvin languages 1. Introduction The translative -
Baltic Loanwords in Mordvin
Riho Grünthal Department of Finnish, Finno-Ugrian and Scandinavian Studies University of Helsinki Baltic loanwords in Mordvin Linguists have been aware of the existence of Baltic loanwords in the Mord- vinic languages Erzya (E) and Moksha (M) since the 19th century. However, the analysis and interpretation of individual etymologies and the contacts between these two language groups have been ambiguous, as the assumptions on the place and age of the contacts have changed. The assertions on the prehistoric development and early language contacts between the Finno-Ugric (Uralic) and Indo-European languages have changed as well. The main evidence concerning early Baltic loanwords in the Finno-Ugric languages is drawn from the Finnic languages, which are located geographically further west relative to Mordvinic. The high number of early Baltic loanwords in the Finnic languages suggests that the most intensive contacts took place between the early varieties of the Finnic and Baltic languages and did not infl uence other Finno-Ugric languages to the same extent. In principle, the continuity of these contacts extends until the modern era and very recent contacts between Estonian, Livonian, and Latvian that are geographical neighbors and documented languages with a concrete geo- graphical distribution, historical and cultural context. The Baltic infl uence on the Saamic and Mordvinic languages was much less intensive, as evidenced by the considerably lesser number of loanwords. Moreover, the majority of Baltic loanwords in Saamic are attested in the Finnic languages as well, whereas the early Baltic infl uence on the Mordvinic lan- guages diverges from that on the Finnic languages. -
Book of Abstracts
Congressus Duodecimus Internationalis Fenno-Ugristarum, Oulu 2015 Book of Abstracts Edited by Harri Mantila Jari Sivonen Sisko Brunni Kaisa Leinonen Santeri Palviainen University of Oulu, 2015 Oulun yliopisto, 2015 Photographs: © Oulun kaupunki ja Oulun yliopisto ISBN: 978-952-62-0851-0 Juvenes Print This book of abstracts contains all the abstracts of CIFU XII presentations that were accepted. Chapter 1 includes the abstracts of the plenary presentations, chapter 2 the abstracts of the general session papers and chapter 3 the abstracts of the papers submitted to the symposia. The abstracts are presented in alphabetical order by authors' last names except the plenary abstracts, which are in the order of their presentation in the Congress. The abstracts are in English. Titles in the language of presentation are given in brackets. We have retained the transliteration of the names from Cyrillic to Latin script as it was in the original papers. Table of Contents 1 Plenary presentations 7 2 Section presentations 19 3 Symposia 199 Symp. 1. Change of Finnic languages in a multilinguistic environment .......................................................................... 201 Symp. 2. Multilingual practices and code-switching in Finno-Ugric communities .......................................................................... 215 Symp. 3. From spoken Baltic-Finnic vernaculars to their national standardizations and new literary languages – cancelled ...... 233 Symp. 4. The syntax of Samoyedic and Ob-Ugric languages ...... 233 Symp. 5. The development