Arxiv:2004.04803V1 [Cs.CL] 9 Apr 2020

Total Page:16

File Type:pdf, Size:1020Kb

Arxiv:2004.04803V1 [Cs.CL] 9 Apr 2020 FST Morphology for the Endangered Skolt Sami Language Jack Rueter, Mika Hämäläinen Department of Digital Humanities University of Helsinki {jack.rueter, mika.hamalainen}@helsinki.fi Abstract We present advances in the development of a FST-based morphological analyzer and generator for Skolt Sami. Like other minority Uralic languages, Skolt Sami exhibits a rich morphology, on the one hand, and there is little golden standard material for it, on the other. This makes NLP approaches for its study difficult without a solid morphological analysis. The language is severely endangered and the work presented in this paper forms a part of a greater whole in its revitalization efforts. Furthermore, we intersperse our description with facilitation and description practices not well documented in the infrastructure. Currently, the analyzer covers over 30,000 Skolt Sami words in 148 inflectional paradigms and over 12 derivational forms. Keywords: Skolt Sami, endangered languages, morphology 1. Introduction members access to language materials directly. The trick is Skolt Sami is a minority language belonging to Sami to find new uses and reuses for data sets and technologies branch of the Uralic language family. With its native speak- as well as to bring development closer to the language com- ers at only around 300, it is considered a severely endan- munity. If development follows the North Sámi lead, any gered language (Moseley, 2010), which, despite its pluri- project can reap from the work already done. centric potential, is decidedly focusing on one mutual lan- Extensive work has already been done on data and tool gauge (Rueter and Hämäläinen, 2019). In this paper, we development in the GiellaLT infrastructure (Moshagen et present our open-source FST morphology for the language, al., 2013) and (Moshagen et al., 2014), and previous work 3 which is a part of the wider context of its on-going revital- also exists for Skolt Sami (Sammallahti and Mosnikoff, ization efforts. 1991; Sammallahti, 2015; Feist, 2015). There are online 4 The intricacies of Skolt Sami morphology include qual- and click-in-text dictionaries (Rueter, 2017), spell check- 5 ity and quantity variation in the word stem as well as ers (Morottaja et al., 2018), , these are implemented in suprasegmental palatalization before subsequent affixes. OpenOffice, but some of the more prominent languages Like Northern Sami and Estonian, Skolt Sami has conso- are supported in MS Word, as well as rule-based language nant quantity and quality variation that surpasses that of learning (Antonsen et al., 2013; Uibo et al., 2015). For Finnish, i.e. Skolt Sami has as many as three lengths in languages with extensive description and documentation, the vowel and consonant quantities in a given word. there are syntax checkers (Wiechetek et al., 2019), machine The finite-state description of Skolt Sami involves develop- translation (Antonsen et al., 2017) and speech synthesis and ing strategies for reusability of open-source documentation recognition (Hjortnaes et al., 2020), just to mention the tip in other minority languages. In other words, the FST de- of the iceberg (Rueter, 2014). From a language learner scription is designed in such a fashion that it can be ap- and research point of departure, the development and ap- plied to other languages as well with minimal modifica- plication of these tools points to well-organized morpho- tions. Skolt Sami, like many other minority Uralic lan- syntactic and lexical descriptions of the language in focus. guages, attests to a fair degree of regular morphology, i.e., By well-organized descriptions, we mean approaching its nouns are marked for the categories of number, pos- tasks at hand with applied reusability. Reusability is illus- arXiv:2004.04803v1 [cs.CL] 9 Apr 2020 session and numerous case forms with regular diminutive trated in the construction of a morphological analyzer for derivation, and its verbs are conjugated for tense, mood linguists, which, due to the fact that it is able to recognize and person in addition to undergoing several regular deriva- and analyze regular morphological forms, can also serve as tions. Morphological descriptions have been developed in a morphological spell checker. In fact, this same analyzer the GiellaLT (Sami Language technology) infrastructure at can be reversed and used as a generator, which is useful the Norwegian Arctic University in Tromso, using Helsinki in providing language learners with fixed, analogous and Finite-State Technology (HFST) (Lindén et al., 2013). random tasks in morphology. The same morphological an- Working in the GiellaLT infrastructure, it is possible to ap- 3 ply ready-made solutions to multiple language learning, fa- http://oahpa.no/sms/useoahpa/background. eng.html/ cilitation and empowerment tasks. Leading into the digital , read further in this article for subsequent develop- 1 ments in http://oahpa.no/nuorti/ age, there are ongoing implementations, such as keyboards 4 2 The forerunner https://sanit.oahpa.no/read/, an for various platforms, and corpora , being expanded to online dictionary here, and on analogous pages of other dic- provide developers, researchers and language community tionaries, (e.g., https://saan.oahpa.no/read/), can be dragged to the tool bar of Firefox and Google Chrome 1http://divvun.no/keyboards/index.html/ 5http://divvun.no/korrektur/korrektur. 2http://gtweb.uit.no/korp/ html/ alyzer, when augmented by glosses, can immediately begin The scarce quantity of textual data is one limitation, but it to provide online dictionary and click-in-text analyses. is even a greater one given that the language is still being The development of an optimal morphological analyzer and standardized and the users provide a variety of forms and glossing for a language like Skolt Sami requires concise vocabulary when expressing themselves in their native lan- morphological and lexical work, on the one hand, and ac- guage. This means an even greater variety in morphology cess to corpora including language learning materials, on that the statistical model should be able capture from a lim- the other. Corpora provide access to language in use, and ited dataset. language learning materials help to establish a received un- In the absence of a reasonably sized descriptive corpus of derstanding of the language. To this end, the morphologi- the language, annotated or not, the most accurate way to cal analyzer for Skolt Sami has been constructed to analyze model the morphology is by using a rule-based methodol- and generate a pedagogically enhanced orthography, for in- ogy. dication of short and long diphthongs preceding geminates FSTs (Finite-State Transducers) have been shown in the as well as mid low front vowels, as might be rendered in a past to be an effective way to model the morphology even pronouncing dictionary. One such example might be seen for languages with an abundance of morphological features in the word kue0tt ‘hut’ as opposed to the literal norm kue0tt, (cf. (Beesley and Karttunen, 2003)). Perhaps one of the where the dot˙ below the e not only indicates a slightly low- largest-scale FSTs to model the morphology of a language ered pronunciation of the vowel but also assists in identi- is the one developed for Finnish (Pirinen et al., 2017). This fying the paradigm type, kue0tt : kue0¡id ‘hut+N+Pl+Acc’ tool, Omorfi, serves as the state-of-the-art morphological versus kue0ll : kuõ0lid ‘fish+N+Pl+Acc˙ ’. analyzer for Finnish. By focusing on the construction of a pedagogical enhanced analyzer-generator, teaching resources can be developed 3. The FST Model Development Pipeline that target randomly generated morphological tasks for the 6 Developing a morphological description of a language pre- language learner as in the North Sami learning tool Davvi . supposes a language-learning and documentary approach. In any given language reader, there are texts with words in Other people have learned the language and become profi- various forms and an accompanying vocabulary. While vo- cient in it before you, so extract paradigms from grammars, cabulary translation can readily be utilized as a fixed task in readers and research to build the language model. If you language learning, inflectional tasks, especially in morpho- are the first researcher to describe the language, take hints logically rich languages, can be developed as random exer- from the language learners, if there are any, they may be cises. Although the contextual word forms in the reader are still developing their own understanding of the language quite limited, it is possible to construct randomized mor- morpho-syntax, and, at times, they may provide you with phological exercises where the student is expected to in- informative interpretations of the language. flect nouns, adjectives and verbs alike in forms that have Idiosyncrasies of a language can, sometimes, be captured been taught but not explicitly given for the random words through comparison to those of another. When a descrip- provided in the reader vocabulary, e.g. in nouns the student tion of Skolt Sami, Finnish, Estonian, etc. introduces alien may select vocabulary from reader A chapters 1–5 with a phenomena, such as word-stem quality and quantity vari- randomized task for nouns, plural, comitative, third person ation as well as suprasegmental palatalization, it is a good singular possessive suffix: +N+Pl+Com+PxSg3. Essen- idea to try describing them both separately and in tandem. tially all nouns in the selected vocabulary available for this Word-stem quality variation affects both consonants and reading are inadvertently presented to the learner. vowel. In consonants, an analogous English example might be illustrated with the f :v variation found in the English 2. Related Work words life, lives and loaf, loaves. From a historical perspec- In the past, multiple methods have been proposed for auto- tive, the verb to live will serve as an instance where long matically learning morphology for a given language.
Recommended publications
  • The Reclamation of Sami Identity and the Traces of Swedish Colonialism
    THE RECLAMATION OF SAMI IDENTITY AND THE TRACES OF SWEDISH COLONIALISM A qualitative study about the formation of Saminess and Sami identity Master’s Programme in Social Work and Human Rights Degree report 30 higher education credit Spring 2020 Author : Frida Olofsson Supervisor : Adrián Groglopo Abstract Title: The Reclamation of Sami identity and the traces of Swedish colonialism : A qualitative study about the formation of Saminess and Sami identity Author: Frida Olofsson Key words (ENG): Sami identity, Saminess, Sami people, Indigenous People, identity Nyckelord (SWE): Samisk identitet, Samiskhet, Samer, Urfolk, Identitet The purpose of this study was to study identity formation among Sami people. The aim was therefore to investigate how Saminess and Sami identity is formed and specifically the way the Sami community transfers the identity. Semi structured interviews were conducted and the material was analyzed by the use of a thematic analysis. In the analysis of the material, four main themes were : Transfer of Sami heritage over generations, Sami identity, Expressions about being Sami and Sami attributes. The theoretical framework consisted of Postcolonial theory and theoretical concepts of identity. The main findings showed that the traces of colonialism is still present in the identity-formation of the Sami people and that there is a strong silence-culture related to the experiences of colonial events which consequently also have affected the intergenerational transfer of Saminess and Sami identity. Furthermore, the will to reclaim the Sami identity, heritage and the importance of a sense of belonging is strongly expressed by the participants. This can in turn be seen as a crucial step for the decolonization process of the Sami population as a whole.
    [Show full text]
  • Connections Between Sámi and Basque Peoples
    Connections between Sámi and Basque Peoples Kent Randell 2012 Siidastallan Outside of Minneapolis, Minneapolis Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota “D----- it Jim, I’m a librarian and an armchair anthropologist??” Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota Connections between Sámi and Basque Peoples Hard evidence: - mtDNA - Uniqueness of language Other things may be surprising…. or not. It is fun to imagine other connections, understanding it is not scientific Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota Documentary: Suddenly Sámi by Norway’s Ellen-Astri Lundby She receives her mtDNA test, and express surprise when her results state that she is connected to Spain. This also surprised me, and spurned my interest….. Then I ended up living in Boise, Idaho, the city with the largest concentration of Basque outside of Basque Country Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota What is mtDNA genealogy? The DNA of the Mitochondria in your cells. Cell energy, cell growth, cell signaling, etc. mtDNA – At Conception • The Egg cell Mitochondria’s DNA remains the same after conception. • Male does not contribute to the mtDNA • Therefore Mitochondrial mtDNA is the same as one’s mother. Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota Four generation mtDNA line Sisters – Mother – Maternal Grandmother – Great-grandmother Jennie Mary Karjalainen b. Kent21 Randell March (c) 2012 1886, --- 2012 Siidastallan,parents from Kuusamo, Finland Linwood Township, Minnesota Isaac Abramson and Jennie Karjalainen wedding picture Isaac is from Northern Norway, Kvaen father and Saami mother from Haetta Kent Randell (c) 2012 --- 2012 Siidastallan, village.
    [Show full text]
  • Consonant Gradation in Estonian and Sámi: Two-Level Solution TROND TROSTERUD and HELI UIBO
    14 Consonant Gradation in Estonian and Sámi: Two-Level Solution TROND TROSTERUD AND HELI UIBO 14.1 Introduction Koskenniemi’s two-level morphology was the first practical general model in the history of computational linguistics for the analysis of morphologi- cally complex languages. In this article we will reconsider one of the key innovations in Koskenniemi (1983), namely the treatment of consonant gra- dation in finite state transducers. We will look not at Finnish, but at two lan- guages with a more extensive consonant gradation system, namely Estonian and Sámi. The goal of the paper is to demonstrate two different ways of mod- eling consonant gradation in a finite state morphological system - lexical and morphophonological. We will also compare the resulting systems by their computational complexity and human-readability. Consonant gradation is rare among the languages of the world, but stem al- ternation in itself is not, and the treatment of consonant gradation can readily be transferred to other stem alternation phenomena. Koskenniemi’s original idea was to see stem alternation as an agglutinative phenomenon. Consider the example (14.1), showing a two-level representation of stem alternation. ehT e$:ehe (14.1) Here the $ sign is a quasi-suffix, introduced to trigger consonant grada- tion in the stem. Two-level rules decide the correspondence of T to surface phonemes t or 0 (empty symbol), based on the context, specifically, according to the presence or absence of the symbol $ in the right context. Another type of rules for handling stem alternations that can be compiled Inquiries into Words, Constraints and Contexts.
    [Show full text]
  • Enets Language in the Studies of Domestic and Foreign Scientists
    Journal of Siberian Federal University. Humanities & Social Sciences 4 (2018 11) 546-560 ~ ~ ~ УДК 304.2; 811.511.24 Enets Language in the Studies of Domestic and Foreign Scientists Maria A. Kolesnik, Natalia M. Libakova and Ekaterina A. Sertakova* Siberian Federal University 79 Svobodny, Krasnoyarsk, 660041, Russia Received 06.03.2018, received in revised form 05.04.2018, accepted 09.04.2018 The article presents a historiographical review of foreign and domestic research literature devoted to the Enets language. The analysis of publications made it possible to identify general tendencies in the study of the Enets language, the indigenous minorities of the North, and also the vector of modern research. Modern cultural processes in the global space form and make an inexhaustible interest in the traditional cultures and, especially, in the cultures of small-numbered peoples urgent. The study of the languages of these ethnic groups appears as one of the tasks, the solution of which will allow revitalizing native languages, as well as the values of traditional culture as a whole. Keywords: The Enets, indigenous minorities of the North, traditional culture, language studies. The reported study was funded by Krasnoyarsk Regional Fund of Science according to the research project: «Development of scientific and methodological support for the preservation of a unique cultural heritage for the Ket and Enets ethno-cultural groups of indigenous small peoples of the North, Siberia and the Far East living on the territory of the Krasnoyarsk Territory. DOI: 10.17516/1997-1370-0248. Research area: culturology. Introduction practices of speech behaviors are constantly Language determines the thinking of a appearing.
    [Show full text]
  • Multilingual Literacy Among Young Learners of North Sámi: Contexts, Complexity and Writing in Sápmi
    Multilingual literacy among young learners of North Sámi: Contexts, complexity and writing in Sápmi Hanna Outakoski Umeå Studies in Language and Literature 27 Department of Language Studies Umeå University 2015 Department of Language Studies Umeå University SE-901 87 Umeå http://www.sprak.umu.se This work is protected by the Swedish Copyright Legislation (Act 1960:729) Copyright © 2015 Hanna Outakoski ISBN: 978-91-7601-284-0 Front cover illustration: Hanna Outakoski Electronic version accessible via http://umu.diva-portal.org/ Umeå Studies in Language and Literature 27 Series editors: Heidi Hansson, Per Ambrosiani Printed by: Print & media, Umeå University Distributed by: eddy.se ab, Visby Umeå, Sweden 2015 Ándaras Ovllái ja Ivvár Ásllahii, ráhkisvuođain Table of Contents Table of Contents v Abstract vii List of Papers ix Tables, Figures and other illustrations xi Acknowledgements xiii 1 Introduction 1 1.1 Aims and research questions 2 1.2 Outline 3 2 Context and background 5 2.1 Speakers of North Sámi in present day Sápmi 6 2.2 Weakened ties to Sámi cultural heritage 9 2.3 Three countries - three paths to Sámi education 11 2.3.1 Curricula and teacher training 12 2.3.2 School programs for Sámi learners 14 2.4 Earlier studies in Sápmi 17 3 Conceptual framework 21 3.1 Bilingualism 21 3.2 Continua of Biliteracy 24 3.3 Writing 28 4 Materials and methods 31 4.1 Methodology and ethics 31 4.1.1 Summary of Paper I 32 4.2 Participants 34 4.3 Data collection and methods 36 4.4 Challenges of the study 40 4.5 The author's contribution to research
    [Show full text]
  • 000 Euralex 2010 03 Plenary
    > State of the Art of the Lexicography of European Lesser Used or Non- State Languages anne tjerk popkema ‘The people who chronicle the life of our language (…) are called lexicographers’ (Martin Hardee, blogger in Cyberspace, 2006) 0 Introductory remarks 1 Language codification and language elaboration (‘Ausbau’) are key ingredients for raising a lesser used language to a level that is adequate for modern use.2 In dictionaries (as well as in grammars) a language’s written standard may be laid down, ‘codified’. 3 At the same time dictionaries make clear what lexical gaps remain or arise in a language. The filling of such gaps – part of language elaboration – will only gain wide acceptance when, in turn, it is codified in a dictionary itself. Thus, both prime categories of language development – codification and elaboration – are hats worn by the same head: the lexicographer’s. Bo Svensén begins the opening chapter of his recent handbook on lexicography by stating that ‘dictionaries are a cultural phenomenon. It is a commonplace to say that a dictionary is a product of the culture in which it has come into being; it is less so to say that it plays an important part in the development of that culture.’ 4 In the case of lesser used languages, language development may lead to (increased) use in domains that were formerly out of reach because of the dominance – for any number of reasons – of another language. In such instances, language development equals language emancipation. An emancipating language takes on new functions, enters new domains of society and is therefore in need of new terminology.
    [Show full text]
  • Sixth Periodical Report Presented to the Secretary General of the Council of Europe in Accordance with Article 15 of the Charter
    Strasbourg, 1 July 2014 MIN-LANG (2014) PR7 EUROPEAN CHARTER FOR REGIONAL OR MINORITY LANGUAGES Sixth periodical report presented to the Secretary General of the Council of Europe in accordance with Article 15 of the Charter NORWAY THE EUROPEAN CHARTER FOR REGIONAL OR MINORITY LANGUAGES SIXTH PERIODICAL REPORT NORWAY Norwegian Ministry of Local Government and Modernisation 2014 1 Contents Part I ........................................................................................................................................... 3 Foreword ................................................................................................................................ 3 Users of regional or minority languages ................................................................................ 5 Policy, legislation and practice – changes .............................................................................. 6 Recommendations of the Committee of Ministers – measures for following up the recommendations ................................................................................................................... 9 Part II ........................................................................................................................................ 14 Part II of the Charter – Overview of measures taken to apply Article 7 of the Charter to the regional or minority languages recognised by the State ...................................................... 14 Article 7 –Information on each language and measures to implement
    [Show full text]
  • Fifth Periodical Report Presented to the Secretary General of the Council of Europe in Accordance with Article 15 of the Charter
    Strasbourg, 17 November 2017 MIN-LANG (2017) PR 7 EUROPEAN CHARTER FOR REGIONAL OR MINORITY LANGUAGES Fifth periodical report presented to the Secretary General of the Council of Europe in accordance with Article 15 of the Charter FINLAND THE FIFTH PERIODIC REPORT BY THE GOVERNMENT OF FINLAND ON THE IMPLEMENTATION OF THE EUROPEAN CHARTER FOR REGIONAL OR MINORITY LANGUAGES NOVEMBER 2017 2 CONTENTS INTRODUCTION...................................................................................................................................................6 PART I .................................................................................................................................................................7 1. BASIC INFORMATION ON FINNISH POPULATION AND LANGUAGES....................................................................7 1.1. Finnish population according to mother tongue..........................................................................................7 1.2. Administration of population data ..............................................................................................................9 2. SPECIAL STATUS OF THE ÅLAND ISLANDS.............................................................................................................9 3. NUMBER OF SPEAKERS OF REGIONAL OR MINORITY LANGUAGES IN FINLAND.................................................10 3.1. The numbers of persons speaking a regional or minority language..........................................................10 3.2. Swedish ......................................................................................................................................................10
    [Show full text]
  • Beliefs and Emotions in South Sami Revitalisation Leena Huss and Sigrid Stångberg
    14 The Yoke and the Candy Bowl: Beliefs and Emotions in South Sami Revitalisation Leena Huss and Sigrid Stångberg Background While Sweden has a rather long history of promoting language maintenance among immigrant populations, it was not until the year 2000 that a national minority and minority language policy was launched to protect minority languages and cultures that have a long historical presence in Sweden. The new policy was a result of the Swedish ratification the same year of two Council of Europe conventions: the European Charter for Regional or Minority Languages (ECRML) and the Framework Convention for the Protection of National Minorities (FCNM). According to this policy, the five national minority languages of Sweden—Meänkieli (formerly called Tornedalian Finnish), Romani, Sami, Sweden Finnish and Yiddish—were to be protected and promoted on societal as well as individual levels, and three of them—Meänkieli, Sami (including North, Lule and South Sami) and Sweden Finnish—had their own administrative areas designated, consisting of seven municipalities in the northernmost parts of Sweden. In these areas, their speakers had the right to communicate with municipal authorities in their own languages. 129 INDIGENOUS EFFLORESCENCE They also had the right to childcare and care of the elderly ‘wholly or partly’ in the minority language (Swedish Parliament SFS 1999:1175; SFS 1999:1176). During the following years, evaluations and criticism on the part of the Council of Europe, minority organisations, and various Swedish authorities showed that the implementation of the national minority policy was seriously lacking, and in 2010, a reformed minority policy was launched (Proposition 2008/2009).
    [Show full text]
  • Sami Sieidis in a Nordic Context?
    ELSE MUNDAL Sami Sieidis in a Nordic Context? ABSTRACT In the present article, the author discusses two Old Norse texts that may indicate that the Sami cult of sieidis had spread to the neighbouring Old Norse culture in the period before the Chris- tianization of Norway. One of these texts is found in the Older Eiðsifaþing law, the law of the inland part of Eastern Norway. According to this law, it was prohibited to believe in (the power of) the finnar (Sami), and among their powerful objects, rót (the root of a tree) is mentioned. This root is in all likeli- hood a Sami sieidi that was sought out by Norwegians for help, probably for medical reasons. The other text is a notice in the Icelandic Landnámabók in which it is mentioned that a settler from Northern Norway worshipped some stones in the outfields on the border of his settlement, calledGunnsteinar . There are closer parallels to this outfields cult in Sami culture than in Old Norse culture. KEYWORDS Sami sieidis, Gunnsteinar, Eiðsifaþing law, Landnámabók, Sami cult, Old Norse cult ANDERS HULTGÅRD Personal Religion among the Ancient Scandinavians and the Fulltrúi-Concept ABSTRACT The question of personal religion among the ancient Scandinavians has centered around the concept of having a deity as one’s fulltrúi ‘trustworthy friend,’ ástvinr ‘close friend,’ or vinr ‘friend.’ Most scholars of the twentieth century regarded the concept as a true expression of pre-Christian Germanic belief. By contrast, modern scholarship strongly tends to see it as a construct of medieval authors who took the saints’ cult as a model to describe the personal piety of their ninth and tenth cen- tury compatriots.
    [Show full text]
  • Researching Less-Resourced Languages – the Digisami Corpus
    Researching Less-Resourced Languages – the DigiSami Corpus Kristiina Jokinen University of Helsinki, Finland and AIRC, AIST Tokyo Waterfront, Japan [email protected] Abstract Increased use of digital devices and data repositories has enabled a digital revolution in data collection and language research, and has also led to important activities supporting speech and language technology research for less-resourced languages. This paper describes the DigiSami project and its research results, focussing on spoken corpus collection and speech technology for the Fenno-Ugric language North Sami. The paper also discusses multifaceted questions on ethics and privacy related to data collection for less-resourced languages and indigenous communities. Keywords: corpus collection, under-resourced languages, North Sami with new technology applications. The main motivation 1. Introduction was to improve digital visibility and viability of the target languages, and to explore different choices for encouraging Several projects and events have increased research and maintaining the use of less-resourced languages in the activities for under-resourced languages during the past digitalized world. The goals of the DigiSami project are years. For instance, the DLDP-project (Digital Language discussed in Jokinen (2014) and Jokinen et al. (2017). Diversity Project) is to advance the sustainability of Europe’s regional and minority languages, while the Flare- The DigiSami project deals with the North Sami language net network and the LRE Map (Calzolari et al. 2012) have (Davvisámegiela) which belongs to the Fenno-Ugric had a big impact on sharing language resources and making language family and is one of the nine Sami languages speech corpora freely available.
    [Show full text]
  • Samiskan Under Fyra Sekel I Svenska Kyrkans Arbete
    Samiskan under fyra sekler i Svenska kyrkans arbete olavi korhonen Samiskan under fyra sekel i Svenska kyrkans arbete Abstract De första samiska böckerna publicerades 1619 av Nicolaus Andreæ, kyrkoherde i Piteå. Hans mässbok och ABC-bok hade mycket bristfällig samiska. Ett annat 1600-talsverk var Manuale Lapponicum (1648), en kyrkohandbok för Torne lappmark av Johannes Tornæus, kyrkoherde i Torneå. År 1669 utgavs en sydligare motsvarig- het för Ume och Pite lappmark av kyrkoherde Olaus Stephani Graan i Lycksele. Lycksele blev under 1700-talet centrum för arbetet med kyrklig litteratur på samiska. Där skapade prosten Pehr Fjellström ett skriftspråk på umesamisk grund. Han utgav en kyrkohandbok, Luthers lilla katekes och en psalmbok (1744) förutom en grammatik och en umesamisk ordsamling (1738). Viktigast var översättningen av Nya testamentet (1755). Genom att skola andra präster i samiska, bl.a. Pehr Hög- ström, Gellivare församlings förste kyrkoherde, spred Fjellström denna samiska till nordliga församlingar. Högström lade mer vikt vid lulesamiska i översättningen av en postilla (1737) och en katekes (1748), vilket gav ordförrådet en nordligare prägel. Ett viktigt verk var Johan Öhrlings och Eric Lindahls Lexicon Lapponicum (1780). Redan före år 1800 började arbetet att översätta Gamla testamentet till samiska. Kyrkoherde Samuel Öhrling i Jokkmokk hade huvudansvaret. I arbetet deltog flera sa- miskkunniga präster. Tat Ailes Tjalog, Den Heliga Skrift, trycktes 1811. Under 1800-talet utkom flera psalmböcker, av vilka en utgåva från 1895 länge brukades i Lule lappmark. När Lars Levi Læstadius lät trycka några småskrifter på lulesamiska, inleddes en över- gång till en samiska med fastare grund i ett områdes muntliga språk.
    [Show full text]