Types of Moksha-Russian Code-Switching1 1. What This Paper Is
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Open-Source Morphology for Endangered Mordvinic Languages
Open-Source Morphology for Endangered Mordvinic Languages Jack Rueter Mika Hämäläinen Niko Partanen Dept. of Digital Humanities Dept. of Digital Humanities Dept. of Finnish, University of Helsinki University of Helsinki Finno-Ugrian [email protected] and Rootroo Ltd and Scandinavian Studies [email protected] University of Helsinki [email protected] Abstract languages. 2002 saw the publication of the first monolingual dictionary of Erzya (Abramov, 2002), This document describes shared development and the manuscript was proclaimed open by the of finite-state description of two closely re- author for future development. The Mordvin lan- lated but endangered minority languages, Erzya and Moksha. It touches upon mor- guages have continued to receive a fair share of pholexical unity and diversity of the two lan- linguistic research interest in the recent years (Luu- guages and how this provides a motivation for tonen, 2014; Hamari and Aasmäe, 2015; Kashkin shared open-source FST development. We de- and Nikiforova, 2015; Grünthal, 2016). scribe how we have designed the transducers After the release first finite-state transducer for so that they can benefit from existing open- the closely related Komi-Zyrian (Rueter, 2000), source infrastructures and are as reusable as possible. it was only obvious that similar work should be done for Erzya Mordvin. Fortunately, over the 1 Introduction past decade there has been an increasing number of publications on Erzya, relating to its morphol- There are over 5000 languages spoken world wide, ogy (Rueter, 2010), its OCR tools (Silfverberg and and a vast majority of them are endangered (see Rueter, 2015) and universal dependencies (Rueter Moseley 2010). -
Финно-Угроведение, 2020 (№ 61) Научный Журнал Основан В Феврале 1994 Г
ФИННО-УГРОВЕДЕНИЕ, 2020 (№ 61) Научный журнал Основан в феврале 1994 г. ISSN 2312-0312 (Print), ISSN 2713-2250 (Online) Учредитель – Государственное бюджетное научное учреждение при Правительстве Республики Марий Эл «Марийский научно-исследовательский институт языка, литературы и истории им. В.М. Васильева» Редакционный совет: Кузьмин Евгений Петрович, председатель ред. совета, канд. ист. н., директор МарНИИЯЛИ (г. Йошкар-Ола) Пенькова Мария Викентьевна, секретарь ред. совета, канд. филол. н., зам. директора-ученый секретарь МарНИИЯЛИ (г. Йошкар-Ола) Агранат Татьяна Борисовна, д-р филол. н., ведущий научный сотрудник Института языкознания РАН (г. Москва) Воронцов Владимир Степанович, канд. ист. н., старший научный сотрудник Удмуртского института истории, языка и литературы УрО РАН (г. Ижевск) Зайцева Нина Григорьевна, д-р филол. н., зав. сектором языкознания Института языка, литературы и истории КарНЦ РАН (г. Петрозаводск) Косинцева Елена Викторовна, д-р филол. н., доцент, главный научный сотрудник Обско-угорского института прикладных исследований и разработок (г. Ханты-Мансийск) Крыласова Наталья Борисовна, д-р ист. н., главный научный сотрудник Пермского федерального исследовательского центра УрО РАН (г. Пермь) Куршева Галина Александровна, д-р ист. н., директор Научно-исследовательского института гуманитарных наук при Правительстве Республики Мордовия (г. Саранск) Лехтинен Илдико, д-р философии, доцент Хельсинкского университета (г. Хельсинки, Финляндия) Ошаев Алексей Григорьевич, канд. ист. н., декан историко-филологического факультета Марийского государственного университета (г. Йошкар-Ола) Пустаи Янош, профессор, д-р философии, председатель Ассоциации финно-угорских литератур, директор Института Collegium Fenno-Ugricum (г. Будапешт, Венгрия) Ракин Анатолий Николаевич, д-р филол. н., главный научный сотрудник Института языка, литературы и истории Коми НЦ УрО РАН (г. Сыктывкар) Сергеев Олег Арсентьевич, канд. филол. н., старший научный сотрудник МарНИИЯЛИ (г. -
Book of Abstracts
Congressus Duodecimus Internationalis Fenno-Ugristarum, Oulu 2015 Book of Abstracts Edited by Harri Mantila Jari Sivonen Sisko Brunni Kaisa Leinonen Santeri Palviainen University of Oulu, 2015 Oulun yliopisto, 2015 Photographs: © Oulun kaupunki ja Oulun yliopisto ISBN: 978-952-62-0851-0 Juvenes Print This book of abstracts contains all the abstracts of CIFU XII presentations that were accepted. Chapter 1 includes the abstracts of the plenary presentations, chapter 2 the abstracts of the general session papers and chapter 3 the abstracts of the papers submitted to the symposia. The abstracts are presented in alphabetical order by authors' last names except the plenary abstracts, which are in the order of their presentation in the Congress. The abstracts are in English. Titles in the language of presentation are given in brackets. We have retained the transliteration of the names from Cyrillic to Latin script as it was in the original papers. Table of Contents 1 Plenary presentations 7 2 Section presentations 19 3 Symposia 197 Symp. 1. Change of Finnic languages in a multilinguistic environment .......................................................................... 199 Symp. 2. Multilingual practices and code-switching in Finno-Ugric communities .......................................................................... 213 Symp. 3. From spoken Baltic-Finnic vernaculars to their national standardizations and new literary languages – cancelled ...... 231 Symp. 4. The syntax of Samoyedic and Ob-Ugric languages ...... 231 Symp. 5. The development -
Book of Abstracts
Congressus Duodecimus Internationalis Fenno-Ugristarum, Oulu 2015 Book of Abstracts Edited by Harri Mantila Jari Sivonen Sisko Brunni Kaisa Leinonen Santeri Palviainen University of Oulu, 2015 Oulun yliopisto, 2015 Photographs: © Oulun kaupunki ja Oulun yliopisto ISBN: 978-952-62-0851-0 Juvenes Print This book of abstracts contains all the abstracts of CIFU XII presentations that were accepted. Chapter 1 includes the abstracts of the plenary presentations, chapter 2 the abstracts of the general session papers and chapter 3 the abstracts of the papers submitted to the symposia. The abstracts are presented in alphabetical order by authors' last names except the plenary abstracts, which are in the order of their presentation in the Congress. The abstracts are in English. Titles in the language of presentation are given in brackets. We have retained the transliteration of the names from Cyrillic to Latin script as it was in the original papers. Table of Contents 1 Plenary presentations 7 2 Section presentations 19 3 Symposia 199 Symp. 1. Change of Finnic languages in a multilinguistic environment .......................................................................... 201 Symp. 2. Multilingual practices and code-switching in Finno-Ugric communities .......................................................................... 215 Symp. 3. From spoken Baltic-Finnic vernaculars to their national standardizations and new literary languages – cancelled ...... 233 Symp. 4. The syntax of Samoyedic and Ob-Ugric languages ...... 233 Symp. 5. The development -
Finno-Ugric Languages in Time and Space Tartu, Estonia, February 1-3, 2015
Ariste Conference Finno-Ugric Languages in Time and Space Tartu, Estonia, February 1-3, 2015 On the Morphological Alignment of Erzya and Moksha Verbal Paradigms Feb. 3, 2015 Jack Rueter, PhD. University of Helsinki Finno-Ugric Languages in Time and Space => • Финнэнь-угрань кельтне: • Шкастонзо • Тарказост аравтозь => • Finno-Ugric Languages • In time • In their place Feb. 3, 2015 Jack Rueter, PhD. University of Helsinki Not “Why” but “What for” • Open morphological analyzers • Giellatekno.uit.no • Robust • Requires more analyses to disambiguate itself • Open syntactical disambiguation • Constraint Grammar course in the works for Autumn 2015 • Open text to speech, Open translation Feb. 3, 2015 Jack Rueter, PhD. University of Helsinki • Language affinity statistics • Morphology and Function • Moods and tenses • First Preterit • Non-ambiguous forms Feb. 3, 2015 Jack Rueter, PhD. University of Helsinki Statistics we have seen or heard on Erzya and Moksha language affinity • Vocabulary statistics for Erzya and Moksha: • 80% affinity (Keresztes 2011: 118). • Actual figure 80.80% (Zaicz 1990: 7-13). • 116 words: 92.42% • 902 Erzya words: 685 Moksha cognates • 75.94% • 1039 Moksha words: 890 Erzya cognates • 85.66% Feb. 3, 2015 Jack Rueter, PhD. University of Helsinki Statistics few have counted on Mordvin affinity • Paasonen’s “Mordwinisches Wörterbuch” 1990-1996, 2703 pages • 6952 root entries containing cognate information • Erzya 5302 = 76% • Moksha 4808 = 69% • Intersection 3158 = 45% • Stem entries: 21,754 • Erzya 14,524 = 67% • Moksha 12,945 = 59% • Intersection 5,785 = 27% Feb. 3, 2015 Jack Rueter, PhD. University of Helsinki Mutual Mordvin verb inventory based on Paasonen’s dictionary (1990-1996) Macro root articles (6,952) Erzya Moksha Intersection Union 1210 1198 734 1674 72.2% 71.5% 44% 100% Stem articles (21,754) Erzya Moksha Intersection Union 5,127 5,371 2,183 8,315 61.6% 64.6% 26% 100% Feb. -
Shallow Transfer
УДК 811.511.152’322=111(045) Jack Rueter PhD, researcher, Finland, Helsinki, University of Helsinki Mika Hämäläinen PhD, researcher, Finland, Helsinki, University of Helsinki PREREQUISITES FOR SHALLOW-TRANSFER MACHINE TRANSLATION OF MORDVIN LANGUAGES: LANGUAGE DOCUMENTATION WITH A PURPOSE В данной статье представлены текущие лексические, морфологические, синтаксические и основанные на правилах машинные переводы для эрзянского и мокшанского языков, которые можно и нужно использовать при разработке дорожной карты мордовских лингвистических исследований. Мы стремимся проиллюстрировать и обрисовать начальные типы проблем, с которыми придется столкнуться при построении системы поверхностного машинного перевода на основе Apertium для форм мордовских языков. Мы указываем ориентиры в мордовском языкознании и других разделах уралистики в качестве отправной точки для определения лингвистических исследований со средствами измерения собственного прогресса и разработки дорожной карты для дальнейших исследований. Ключевые слова: эрзянский, мокшанский, уральский, поверхностный машинный перевод, измеримые языковые исследования, измеримое языковое расстояние, конечная морфология, универсальные зависимости. Introduction In this paper, we describe the current state of the on-going eforts in building a rule-based machine translation system for the Erzya-Moksha language pair10. In addition, we seek to identify possible ways to extend this work in such a way that the system can be built fast and with as much reuse as possible from existing tools and solutions. There is a need to establish means to measure and develop a roadmap for Mordvin linguistic research. This research can and should include literary languages as well as dialect and historical language materials. By addressing normative and non-normative language forms with reference to temporal and spatial dimensions we can begin to fll the gaps in our knowledge of Mordvin language development and hypothetical proto forms. -
Mokša Mordvin Verbal Negation Described and Compared with Other Uralo-Altaic Languages
Mokša Mordvin verbal negation described and compared with other Uralo-Altaic languages Arnaud Fournet* La Garenne Colombes (France) Abstract Some languages use a single unit to express verbal negation, like not/n't in English. Some others have several units according to the moods and tenses of the verb. This is the case for the Uralic Mokša-Mordvin language. This feature can be even used for descriptive purposes to determine the moods and tenses of this language and this approach makes it apparent that the traditional descriptions of Mokša cannot be accepted without some reservations. The first part focuses on the description of verbal negative forms in Mokša. Once properly described, Mokša-Mordvin is compared with other languages of northern Eurasia, mostly of the Uralo- Altaic type, in the second part. Keywords : Verbal Negation, Descriptive Linguistics, Comparative Linguistics, Mokša, Mordvin, Uralic, Yukaghir, Mongolian, Tungusic, Japanese 1 Introduction In the languages spoken in Europe, verbal negation is generally expressed with a particular unit, which has a fairly stable and unique segmental expression: English not, German nicht, Italian non, Basque az, etc. Basque, even though it is not an Indo-European language, is no exception to this areal tendency. In French, the situation is slightly more complex as the negative load of ne, inherited from Latin, has been gradually transferred to other words: pas, point, mie, goutte among others, which were originally nouns. But in modern spoken French, the practical situation is that the negative load is entirely conveyed by pas. The other items are either obsolete, archaic or marked as dialectal and ne is most often disposed of in informal speech. -
About the Difficulty of Determining the Lexical Classes of the Moksha Language
FOURNET, Arnaud. About the difficulty of determining the lexical classes of the Moksha language. ReVEL , v. 9, n. 17, 2011. [www.revel.inf.br/eng]. ABOUT THE DIFFICULTY OF DETERMINING THE LEXICAL CLASSES OF THE MOKSHA LANGUAGE Arnaud Fournet 1 [email protected] ABSTRACT : The paper deals with the Moksha language belonging to the Uralic family. It begins with a presentation of Moksha and its oldest attestations. It then examines approaches of the parts of speech and lexical classes in that language as developed by a number of authors, mainly Ahlquist (1861) and Aliamkin (2000). The paper illustrates the difficulties and issues faced when applying the framework inherited from Greek and Latin traditions to languages displaying other typological features than the Indo-European mold. KEYWORDS : Moksha; Uralic; Part of speech, Lexicography; Typology. 1. INTRODUCTION The paper deals with the Moksha language belonging to the Uralic family. After a presentation of present-day Moksha and of its historical attestations the paper examines approaches of the parts of speech and lexical classes in this language as developed by a number of authors, especially Ahlquist (1861) and Aliamkin (2000), who provide the most detailed theories. 2. PRESENTATION OF THE MOKSHA LANGUAGE Moksha is a Uralic language, spoken in the Russian Federation. Together with Erzya it constitutes the two main branches of Mordvin. The word <moksha> is a direct transliteration of Cyrillic мокша . It is used to describe this language in contemporary texts. But other words have been used in the past to designate Mordvins. Many Mokshan speakers live in Mordovia, a province located 500 km to the south-east of Moscow. -
Études Finno-Ougriennes, 45 | 2013 2
Études finno-ougriennes 45 | 2013 Les langues finno-ougriennes aujourd’hui II Édition électronique URL : https://journals.openedition.org/efo/1622 DOI : 10.4000/efo.1622 ISSN : 2275-1947 Éditeur INALCO Édition imprimée Date de publication : 1 décembre 2013 ISBN : 978-2-343-04446-0 ISSN : 0071-2051 Référence électronique Études finno-ougriennes, 45 | 2013, « Les langues finno-ougriennes aujourd’hui II » [En ligne], mis en ligne le 01 février 2015, consulté le 08 juillet 2021. URL : https://journals.openedition.org/efo/1622 ; DOI : https://doi.org/10.4000/efo.1622 Ce document a été généré automatiquement le 8 juillet 2021. Études finno-ougriennes est mis à disposition selon les termes de la Licence Creative Commons Attribution - Pas d’Utilisation Commerciale 4.0 International. 1 Ce numéro poursuit l'entreprise lancée avec le no 44, c'est-à-dire qu'il passe en revue les langues finno-ougriennes de Russie centrale et du Nord dans leur position socio-linguistique. Les auteurs de ces articles sont des chercheurs linguistes, anthropologues, mais aussi des locuteurs et utilisateurs des langues finno-ougriennes ou des écrivains. Trois langues ne sont pas représentées dans ce recueil d'études: le mokcha, le permiak et le mansi. Mais la situation de ces langues s'insère dans les problématiques ici abordées: deux langues littéraires pour une dénomination (erza/mokcha, komi/permiak, mais la polémique sur le mari des plaines et le mari des montagnes est fortement développée par deux études illustrant bien les problèmes rencontrés : vieillissement des locuteurs, zones de transition multilingues. Le recueil est introduit par un essai de l'écrivain et chercheur oudmourte Aleksej Arzamazov. -
Language Vitality of the Peoples of Russia
Proceedings of SOCIOINT 2019- 6th International Conference on Education, Social Sciences and Humanities 24-26 June 2019- Istanbul, Turkey LANGUAGE VITALITY OF THE PEOPLES OF RUSSIA Vida Yu. Mikhalchenko1, Elena A. Kondrashkina2, Svetlana V. Kirilenko3* 1Prof. Dr., Head Scientific Researcher, Institute of Linguistics, Russian Academy of Sciences, RUSSIA, [email protected] 2Senior Scientific Researcher, Candidate of Sciences (Philology), Institute of Linguistics, Russian Academy of Sciences, RUSSIA, [email protected] 3Scientific Researcher, Candidate of Sciences (Philology), Institute of Linguistics, Russian Academy of Sciences; Assoc. Prof., MIIGAiK State University, RUSSIA, [email protected] *Corresponding author Abstract The article considers the problem of vitality of the languages of the peoples living on the territory of the Russian Federation. This subject of research is timely as substantial changes have been introduced in the school educational program in the last few years. The study of the title language became compulsory for all schoolchildren, regardless of their nationality or the wishes of their parents in a number of republics of the Russian Federation. At the same time, the hours for studying the Russian language were reduced, which caused conflict situations, especially in the Republic of Tatarstan. This situation has led to an enormous controversy in the discussions on national languages, their functional development and the level of their vitality. The level of vitality of a language is determined by a number of factors. The most important ones are: 1) the number of native speakers; 2) the level of the functional development of the language; 3) the existence of a written tradition; 4) the status of the language and the language policy aimed at its development and support; and a number of other factors. -
Multilingual Practices in Finno-Ugric Communities
MULTILINGUAL PRACTICES IN FINNO-UGRIC COMMUNITIES Uralica Helsingiensia13 Multilingual Practices in Finno-Ugric Communities EDITED BY OUTI TÁNCZOS, MAGDOLNA KOVÁCS & ULRIIKKA PUURA HELSINKI 2018 Outi Tánczos, Magdolna Kovács & Ulriikka Puura (eds.): Multilingual Practices in Finno-Ugric Communities. Orders • Tilaukset Uralica Helsingiensia 13. Tiedekirja Cover Outi Tánczos Snellmaninkatu 13 Layout Anna Kurvinen FI-00170 Helsinki <www.tiedekirja.fi> ISBN 978-952-7262-02-3 (print) <[email protected]> ISBN 978-952-7262-03-0 (online) ISSN 1797-3945 Printon Tallinn 2018 Uralica Helsingiensia The series Uralica Helsingiensia is founded in 2008. It features thematic volumes and mono- graphs on the history, typology and sociology of Uralic languages. Studies focusing on Es- tonian language and culture, Hungarian language and culture, as well as Saamic studies are equally welcome in the series. The series also seeks to support university studies on its focus areas. Uralica Helsingiensia is rated at level 1 by the Publication Forum of the Federation of Finnish Learned Societies. All submitted manuscripts are peer-reviewed. Uralica Helsingiensia on perustettu 2008. Sarjassa julkaistaan temaattisia artikkeli kokoelmia tai monografi oita erityisesti uralilaisten kielten historian, typologian ja kielisosiologian alalta. Aihepiiriin kuuluvat myös Unkarin kielen ja kulttuurin, Viron kielen ja kulttuurin ja saamentutkimuksen yliopisto-opintoja tukevat julkaisut. Sarja kuuluu TSV:n julkaisufoorumin luokituksessa kategoriaan 1, ja siinä noudatetaan vertaisarviointikäytäntöä. Publisher • Julkaisija Finno-Ugrian Society • Suomalais-Ugrilainen Seura Editors • Päätoimittajat Ulla-Maija Forsberg, Riho Grünthal Editorial board • Toimitusneuvosto Márta Csepregi, Cornelius Hasselblatt, Magdolna Kovács, Johanna Laakso, Helle Mets- lang, Matti Miestamo, Irma Mullonen, Karl Paju salu, Janne Saari kivi, Anneli Sarhimaa, Elena Skribnik www.sgr.fi/uh The publications are indexed in ARTO data base with the index Urbis. -
Boglárka Janurik Erzya–Russian Bilingual Discourse: a Structural
Uralic Studies PhD Program Graduate School in Linguistics University of Szeged Boglárka Janurik Erzya–Russian bilingual discourse: A structural analysis of intrasentential code-switching patterns PhD dissertation Supervisor: Magdolna Kovács, PhD, Adjunct Professor Szeged, 2017 To my grandparents Table of Contents Acknowledgements ...................................................................................................................................i List of abbreviations ......................................................................................................................... ii Chapter 1. Introduction .................................................................................................................... 1 Chapter 2. Sociohistorical background of the Erzya–Russian contact situation .............................. 6 2.1. Mordvin or Erzya and Moksha? ..................................................................................................... 6 2.2. Present situation of the Erzya ........................................................................................................ 9 2.3. History of the Erzya–Russian contacts...........................................................................................14 Chapter 3. Typological overview of the Erzya and the Russian language ...................................... 30 3.1. Phonology ....................................................................................................................................31 3.2. Morphosyntax