Ariste Conference: Finno-Ugric Languages in Time and Space. Tartu, February 1-3, 2015

On the Morphological Alignment of Erzya and Moksha Verbal Paradigms

Feb. 3, 2015 Jack Rueter, PhD. University of Helsinki

Finno-Ugric Languages in Time and Space

• Финнэнь-угрань кельтне:
  • Шкастонзо
  • Тарказост аравтозь
=>
• Finno-Ugric Languages
  • In time
  • In their place

Not “Why” but “What for”

• Open morphological analyzers
  • Giellatekno.uit.no
  • Robust
  • Requires more analyses to disambiguate itself
• Open syntactical disambiguation
  • Constraint Grammar course in the works for Autumn 2015
• Open text to speech, open translation

• Language affinity statistics
• Morphology and Function
• Moods and tenses
• First Preterit
• Non-ambiguous forms

Statistics we have seen or heard on Erzya and Moksha language affinity

• Vocabulary statistics for Erzya and Moksha:
  • 80% affinity (Keresztes 2011: 118).
  • Actual figure 80.80% (Zaicz 1990: 7-13).
    • 116 words: 92.42%
    • 902 Erzya words: 685 Moksha cognates = 75.94%
    • 1039 Moksha words: 890 Erzya cognates = 85.66%
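The cognate percentages on this slide can be re-derived from the word counts. The Python sketch below does so; the function and variable names are illustrative. It also checks one assumption that the slide does not state explicitly: that the 80.80% figure is the mean of the two directional cognate shares.

```python
def share(cognates: int, words: int) -> float:
    """Percentage of words that have a cognate in the other language."""
    return 100 * cognates / words


# Counts as given on the slide (attributed there to Zaicz 1990).
erzya_to_moksha = share(685, 902)    # Erzya words with Moksha cognates
moksha_to_erzya = share(890, 1039)   # Moksha words with Erzya cognates

# Assumption: the overall figure is the plain mean of the two shares.
mean_affinity = (erzya_to_moksha + moksha_to_erzya) / 2

print(f"Erzya -> Moksha: {erzya_to_moksha:.2f}%")
print(f"Moksha -> Erzya: {moksha_to_erzya:.2f}%")
print(f"Mean of both:    {mean_affinity:.2f}%")
```

Under that assumption the two directional shares and the overall figure agree with the numbers quoted on the slide to two decimal places.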

Statistics few have counted on Mordvin affinity

• Paasonen’s “Mordwinisches Wörterbuch” 1990-1996, 2,703 pages
• 6,952 root entries containing cognate information
  • Erzya 5,302 = 76%
  • Moksha 4,808 = 69%
  • Intersection 3,158 = 45%
• Stem entries: 21,754
  • Erzya 14,524 = 67%
  • Moksha 12,945 = 59%
  • Intersection 5,785 = 27%

Mutual Mordvin verb inventory based on Paasonen’s dictionary (1990-1996)

Macro root articles (6,952)

Erzya    Moksha   Intersection   Union
1,210    1,198    734            1,674
72.2%    71.5%    44%            100%

Stem articles (21,754)

Erzya    Moksha   Intersection   Union
5,127    5,371    2,183          8,315
61.6%    64.6%    26%            100%
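The union column and the percentages in the two verb tables follow from inclusion-exclusion, with each share taken relative to the union of verb articles (not relative to the 6,952 or 21,754 dictionary totals). The following Python sketch recomputes them; `overlap_summary` is a hypothetical helper name, and the input counts are copied from the tables.

```python
def overlap_summary(erzya: int, moksha: int, shared: int) -> dict:
    """Union size and percentage shares for two overlapping inventories."""
    union = erzya + moksha - shared  # inclusion-exclusion
    return {
        "union": union,
        "erzya_pct": 100 * erzya / union,
        "moksha_pct": 100 * moksha / union,
        "shared_pct": 100 * shared / union,
    }


macro_roots = overlap_summary(erzya=1210, moksha=1198, shared=734)
stems = overlap_summary(erzya=5127, moksha=5371, shared=2183)

for label, s in (("macro roots", macro_roots), ("stems", stems)):
    print(
        f"{label}: union={s['union']}, "
        f"Erzya={s['erzya_pct']:.2f}%, "
        f"Moksha={s['moksha_pct']:.2f}%, "
        f"shared={s['shared_pct']:.2f}%"
    )
```

The recomputed unions match the tables exactly. Some slide percentages look truncated rather than rounded (for example 1,210 / 1,674 is about 72.28%, shown as 72.2%).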

Moods and tenses

Figures in parentheses give the number of forms as (Erzya|Moksha).

• Indicative
  • Non-past (21|22), First Preterit (18|19), Second Preterit (18|19)
  • (Prt2 and Conjunctional are not distinguished in Moksha)
• Conjunctional (18|19)
• Desiderative (18|19)
• Imperative: Second person subjects only (7|6)
• Optative: (20?|6)
• Derivations: Conditional, , Refl-Telic…

Ind.Prt1 in Erzya (18 forms)

кундынь       Sg1; Sg1/Pl3
кундык        Sg2/Sg3; Imp Sg2/Sg3
кундыть       Sg2; Sg2/Pl3; (Imp Sg2/Pl3)
кундымим      Sg3/Sg1
кундась       Sg3
кундынзеть    Sg3/Sg2
кундынек      Pl1; Pl1/Sg3; Pl1/Pl3
кундызе       Sg3/Sg3
кундыде       Pl2
кундынзе      Sg3/Pl3
кундасть      Pl3
кундынк       Pl2/Sg3; Pl2/Pl3; (Imp =)
кундытинь     Sg1/Sg2
кундызь       Pl3/Sg3; Pl3/Pl3
кундыя        Sg1/Sg3
кундымизь     Pl3/Sg1; Pl3/Pl1; Sg3/Pl1; Pl2/Sg1; Pl2/Pl1; Sg2/Pl1
кундымик      Sg2/Sg1
кундыдизь     Pl3/Sg2; Pl3/Pl2; Sg3/Pl2; Pl1/Pl2; Pl1/Sg2; Sg1/Pl2

Non-ambiguity in Erzya Indicative First Preterit = 10/34

Rows: subject; columns: object (1-3 = Sg1-Sg3, 4-6 = Pl1-Pl3, 0 = no object); + = non-ambiguous form.

     1  4  2  5  3  6  0   Non-ambiguous
1          +  -  +  -  -   Sg1/Sg2; Sg1/Sg3
4          -  -  -  -  -
2    +  -        -  -  -   Sg2/Sg1
5    -  -        -  -  +   Pl2
3    +  -  +  -  +  +  +   Sg3/Sg1; Sg3/Sg2; Sg3/Sg3; Sg3/Pl3; Sg3
6    -  -  -  -  -  -  +   Pl3
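The 10/34 figure can be reproduced by counting, in the Erzya Ind.Prt1 paradigm, the forms that carry exactly one reading. The Python sketch below encodes the paradigm as a mapping from form to readings. The data structure and variable names are illustrative, and the "(Imp =)" remark on кундынк is deliberately left out, since that form is ambiguous either way.

```python
# Erzya Ind.Prt1 forms and their readings, as listed on the paradigm slide.
ERZYA_PRT1 = {
    "кундынь": ["Sg1", "Sg1/Pl3"],
    "кундык": ["Sg2/Sg3", "Imp Sg2/Sg3"],
    "кундыть": ["Sg2", "Sg2/Pl3", "Imp Sg2/Pl3"],
    "кундымим": ["Sg3/Sg1"],
    "кундась": ["Sg3"],
    "кундынзеть": ["Sg3/Sg2"],
    "кундынек": ["Pl1", "Pl1/Sg3", "Pl1/Pl3"],
    "кундызе": ["Sg3/Sg3"],
    "кундыде": ["Pl2"],
    "кундынзе": ["Sg3/Pl3"],
    "кундасть": ["Pl3"],
    "кундынк": ["Pl2/Sg3", "Pl2/Pl3"],
    "кундытинь": ["Sg1/Sg2"],
    "кундызь": ["Pl3/Sg3", "Pl3/Pl3"],
    "кундыя": ["Sg1/Sg3"],
    "кундымизь": ["Pl3/Sg1", "Pl3/Pl1", "Sg3/Pl1",
                  "Pl2/Sg1", "Pl2/Pl1", "Sg2/Pl1"],
    "кундымик": ["Sg2/Sg1"],
    "кундыдизь": ["Pl3/Sg2", "Pl3/Pl2", "Sg3/Pl2",
                  "Pl1/Pl2", "Pl1/Sg2", "Sg1/Pl2"],
}

# All Ind.Prt1 functions (imperative readings are not part of the 34).
functions = [
    reading
    for readings in ERZYA_PRT1.values()
    for reading in readings
    if not reading.startswith("Imp")
]

# A function is non-ambiguous when its form has no other reading at all,
# including imperative ones.
unambiguous = sorted(
    readings[0] for readings in ERZYA_PRT1.values() if len(readings) == 1
)

print(f"{len(ERZYA_PRT1)} forms, {len(functions)} functions")
print(f"non-ambiguous: {len(unambiguous)}/{len(functions)}")
print(unambiguous)
```

Counting the imperative homonyms as competing readings is what excludes Sg2/Sg3 (кундык) from the non-ambiguous set, in line with the grid above.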

Ind.Prt1 in Moksha (19 forms)

кундань       Sg1
кундамань     Sg3/Sg1
кундать       Sg2
кунданзе      Sg3/Sg2
кундась       Sg3
кундазе       Sg3/Sg3
кундаме       Pl1
кундызень     Sg3/Pl3
кундаде       Pl2
кундаськ      Pl1/Sg3; Pl1/Pl3
кундасть      Pl3; Pl2/Sg3; Pl2/Pl3
кундазь       Pl3/Sg3; Pl3/Pl3
кундайтень    Sg1/Sg2
кундамасть    Pl2/Sg1; Pl2/Pl1; Sg2/Pl1
кундайне      Sg1/Sg3; Sg1/Pl3
кундамазь     Pl3/Sg1; Pl3/Pl1; Sg3/Pl1
кундамайть    Sg2/Sg1
кундадязь     Pl3/Sg2; Pl3/Pl2; Sg3/Pl2; Pl1/Pl2; Pl1/Sg2; Sg1/Pl2
кундайть      Sg2/Sg3; Sg2/Pl3

Non-ambiguity in Moksha Indicative First Preterit = 11/34

Rows: subject; columns: object (1-3 = Sg1-Sg3, 4-6 = Pl1-Pl3, 0 = no object); + = non-ambiguous form.

     1  4  2  5  3  6  0   Non-ambiguous
1          +  -  -  -  +   Sg1/Sg2; Sg1
4          -  -  -  -  +   Pl1
2    +  -        -  -  +   Sg2/Sg1; Sg2
5    -  -        -  -  +   Pl2
3    +  -  +  -  +  +  +   Sg3/Sg1; Sg3/Sg2; Sg3/Sg3; Sg3/Pl3; Sg3
6    -  -  -  -  -  -  -

Non-ambiguity Ind. Prt1 Erzya = v ; Moksha = ✪

Rows: subject; columns: object (1-3 = Sg1-Sg3, 4-6 = Pl1-Pl3, 0 = no object).

      1    4    2    5    3    6    0    Non-ambiguous in both
1               v✪        v         ✪    Sg1/Sg2
4                                   ✪
2     v✪                            ✪    Sg2/Sg1
5                                   v✪   Pl2
3     v✪        v✪        v✪   v✪   v✪   Sg3/Sg1; Sg3/Sg2; Sg3/Sg3; Sg3/Pl3; Sg3
6                                   v

8 non-ambiguous cells

Erzya    Moksha   Intersection   Union
10       11       8              13
76%      84%      61.5%          100%
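The intersection and union counts can be checked with plain set operations over the non-ambiguous cells of each language. This is a minimal Python sketch; the set names are illustrative and the members are copied from the two non-ambiguity grids.

```python
# Non-ambiguous Ind.Prt1 cells per language, as listed on the grid slides.
ERZYA_CLEAR = {
    "Sg1/Sg2", "Sg1/Sg3", "Sg2/Sg1", "Pl2",
    "Sg3/Sg1", "Sg3/Sg2", "Sg3/Sg3", "Sg3/Pl3", "Sg3", "Pl3",
}
MOKSHA_CLEAR = {
    "Sg1/Sg2", "Sg1", "Pl1", "Sg2/Sg1", "Sg2", "Pl2",
    "Sg3/Sg1", "Sg3/Sg2", "Sg3/Sg3", "Sg3/Pl3", "Sg3",
}

shared = ERZYA_CLEAR & MOKSHA_CLEAR   # non-ambiguous in both languages
either = ERZYA_CLEAR | MOKSHA_CLEAR   # non-ambiguous in at least one

for label, cells in (("Erzya", ERZYA_CLEAR), ("Moksha", MOKSHA_CLEAR),
                     ("Intersection", shared), ("Union", either)):
    print(f"{label}: {len(cells)} cells, "
          f"{100 * len(cells) / len(either):.1f}% of union")

print("Erzya only: ", sorted(ERZYA_CLEAR - MOKSHA_CLEAR))
print("Moksha only:", sorted(MOKSHA_CLEAR - ERZYA_CLEAR))
```

The Erzya and Moksha percentages on the slide (76%, 84%) appear to be truncated values of 10/13 and 11/13.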

Non-ambiguous object conjugation forms

           Erzya Oral   Erzya Lit.   Moksha Lit.
Sg2/Sg1    180          10           197
Sg3/Sg1    118          13           138
Sg1/Sg2    189          13           169
Sg3/Sg2    254          3            89
Sg3/Sg3    1940         275          2877
Sg3/Pl3    625          107          826
Pl2        ?            ?            ?
Sg3        ?            ?            ?
           55%          15.5%        59.7%

(cf. Trosterud 2006: 291, 294)

Only 2 identically ambiguous

• Identically ambiguous in Erzya and Moksha (=):
  • Pl3/Sg3; Pl3/Pl3
  • Pl3/Pl2, Pl3/Sg2; Pl1/Pl2, Pl1/Sg2; Sg3/Pl2; Sg1/Pl2
• Erzya: Pl2/Pl1, Pl2/Sg1; Sg2/Pl1; Pl3/Pl1, Pl3/Sg1; Sg3/Pl1
  Moksha: Pl2/Pl1, Pl2/Sg1; Sg2/Pl1 and Pl3/Pl1, Pl3/Sg1; Sg3/Pl1
  Moksha distinguishes subject person (also IMP)
• Erzya: Pl1, Pl1/Pl3, Pl1/Sg3
  Moksha: Pl1/Pl3, Pl1/Sg3
  Moksha 1 less ambiguous
• Erzya: Sg2/Sg3; Imp Sg2/Sg3
  Moksha: Sg2/Sg3, Sg2/Pl3
  Moksha lacks distinct Sg3 object
• Erzya: Sg2, Sg2/Pl3 and Sg1, Sg1/Pl3
  Moksha: Sg1/Sg3, Sg1/Pl3
  Erzya lacks distinct Sg1, Sg2; Moksha lacks distinct Sg3 object
• Moksha: Pl3; Pl2/Pl3, Pl2/Sg3
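One way to find the identically ambiguous forms is to treat the readings of each ambiguous form as a set (an ambiguity class) and intersect the classes of the two languages. The Python sketch below assumes the readings listed on the two Ind.Prt1 paradigm slides. The names `ERZYA_AMBIGUOUS`, `MOKSHA_AMBIGUOUS` and `as_classes` are illustrative, and the "(Imp =)" remark on Erzya кундынк is not encoded.

```python
# The six-way class for objects Sg2/Pl2, shared verbatim by both paradigms.
SECOND_PERSON_OBJECT = ["Pl3/Sg2", "Pl3/Pl2", "Sg3/Pl2",
                        "Pl1/Pl2", "Pl1/Sg2", "Sg1/Pl2"]

# Readings of every ambiguous form, one list per form.
ERZYA_AMBIGUOUS = [
    ["Sg1", "Sg1/Pl3"],
    ["Sg2/Sg3", "Imp Sg2/Sg3"],
    ["Sg2", "Sg2/Pl3", "Imp Sg2/Pl3"],
    ["Pl1", "Pl1/Sg3", "Pl1/Pl3"],
    ["Pl2/Sg3", "Pl2/Pl3"],
    ["Pl3/Sg3", "Pl3/Pl3"],
    ["Pl3/Sg1", "Pl3/Pl1", "Sg3/Pl1", "Pl2/Sg1", "Pl2/Pl1", "Sg2/Pl1"],
    SECOND_PERSON_OBJECT,
]
MOKSHA_AMBIGUOUS = [
    ["Pl1/Sg3", "Pl1/Pl3"],
    ["Pl3", "Pl2/Sg3", "Pl2/Pl3"],
    ["Pl3/Sg3", "Pl3/Pl3"],
    ["Pl2/Sg1", "Pl2/Pl1", "Sg2/Pl1"],
    ["Sg1/Sg3", "Sg1/Pl3"],
    ["Pl3/Sg1", "Pl3/Pl1", "Sg3/Pl1"],
    SECOND_PERSON_OBJECT,
    ["Sg2/Sg3", "Sg2/Pl3"],
]


def as_classes(groups):
    """Turn lists of readings into a set of order-independent classes."""
    return {frozenset(group) for group in groups}


identical = as_classes(ERZYA_AMBIGUOUS) & as_classes(MOKSHA_AMBIGUOUS)
functions_covered = sum(len(cls) for cls in identical)

print(f"identically ambiguous forms: {len(identical)}")
print(f"functions they cover: {functions_covered}")
for cls in sorted(identical, key=len):
    print("  ", "; ".join(sorted(cls)))
```

Comparing whole classes rather than single readings is a design choice: a form only counts as identically ambiguous when both languages merge exactly the same set of functions.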

Requirements for machine translation

• Non-ambiguous forms Ind.Prt1:
  • 6 object conjugation functions/forms (over 55%)
  • 2 subject conjugation functions/forms
• Equally ambiguous forms:
  • 8 object conjugation functions/2 object conjugation forms
• Syntax required for disambiguation: 34 - 16 = 18 functions (16 = 6 + 2 + 8)


Aitäh! (Thank you!)

Keresztes, László 2011: Bevezetés a mordvin nyelvészetbe. Debreceni egyetemi kiadó.
Trosterud, Trond 2006: Homonymy in the Uralic Two-Argument Agreement Paradigms. SUST 151.
Zaicz, Gábor 1990: Beitrag zur Typologie und Statistik des Erza- und Mokscha-mordwinischen (eine vergleichende Untersuchung). In Congressus septimus internationalis fenno-ugristarum. Debrecen 27. VIII. - 2. IX. 1990, 3C. Sessiones sectionum, dissertationes linguistica. Redigunt László Jakab, László Keresztes, Antal Kiss, Sándor Maticsák, S. 7-13. Debrecen.

Open Corpora: http://gtweb.uit.no/u_korp
Open Technology: Giellatekno (Tromsø) http://giellatekno.uit.no
Generous Funding: Kone Säätiö (Helsinki) http://www.konesaatio.fi
