Uralic Language Identification (ULI) 2020 Shared Task Dataset and the Wanca 2017 Corpus
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Connections Between Sámi and Basque Peoples
Connections between Sámi and Basque Peoples Kent Randell 2012 Siidastallan Outside of Minneapolis, Minneapolis Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota “D----- it Jim, I’m a librarian and an armchair anthropologist??” Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota Connections between Sámi and Basque Peoples Hard evidence: - mtDNA - Uniqueness of language Other things may be surprising…. or not. It is fun to imagine other connections, understanding it is not scientific Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota Documentary: Suddenly Sámi by Norway’s Ellen-Astri Lundby She receives her mtDNA test, and express surprise when her results state that she is connected to Spain. This also surprised me, and spurned my interest….. Then I ended up living in Boise, Idaho, the city with the largest concentration of Basque outside of Basque Country Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota What is mtDNA genealogy? The DNA of the Mitochondria in your cells. Cell energy, cell growth, cell signaling, etc. mtDNA – At Conception • The Egg cell Mitochondria’s DNA remains the same after conception. • Male does not contribute to the mtDNA • Therefore Mitochondrial mtDNA is the same as one’s mother. Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota Four generation mtDNA line Sisters – Mother – Maternal Grandmother – Great-grandmother Jennie Mary Karjalainen b. Kent21 Randell March (c) 2012 1886, --- 2012 Siidastallan,parents from Kuusamo, Finland Linwood Township, Minnesota Isaac Abramson and Jennie Karjalainen wedding picture Isaac is from Northern Norway, Kvaen father and Saami mother from Haetta Kent Randell (c) 2012 --- 2012 Siidastallan, village. -
Consonant Gradation in Estonian and Sámi: Two-Level Solution TROND TROSTERUD and HELI UIBO
14 Consonant Gradation in Estonian and Sámi: Two-Level Solution TROND TROSTERUD AND HELI UIBO 14.1 Introduction Koskenniemi’s two-level morphology was the first practical general model in the history of computational linguistics for the analysis of morphologi- cally complex languages. In this article we will reconsider one of the key innovations in Koskenniemi (1983), namely the treatment of consonant gra- dation in finite state transducers. We will look not at Finnish, but at two lan- guages with a more extensive consonant gradation system, namely Estonian and Sámi. The goal of the paper is to demonstrate two different ways of mod- eling consonant gradation in a finite state morphological system - lexical and morphophonological. We will also compare the resulting systems by their computational complexity and human-readability. Consonant gradation is rare among the languages of the world, but stem al- ternation in itself is not, and the treatment of consonant gradation can readily be transferred to other stem alternation phenomena. Koskenniemi’s original idea was to see stem alternation as an agglutinative phenomenon. Consider the example (14.1), showing a two-level representation of stem alternation. ehT e$:ehe (14.1) Here the $ sign is a quasi-suffix, introduced to trigger consonant grada- tion in the stem. Two-level rules decide the correspondence of T to surface phonemes t or 0 (empty symbol), based on the context, specifically, according to the presence or absence of the symbol $ in the right context. Another type of rules for handling stem alternations that can be compiled Inquiries into Words, Constraints and Contexts. -
Arxiv:2004.04803V1 [Cs.CL] 9 Apr 2020
FST Morphology for the Endangered Skolt Sami Language Jack Rueter, Mika Hämäläinen Department of Digital Humanities University of Helsinki {jack.rueter, mika.hamalainen}@helsinki.fi Abstract We present advances in the development of a FST-based morphological analyzer and generator for Skolt Sami. Like other minority Uralic languages, Skolt Sami exhibits a rich morphology, on the one hand, and there is little golden standard material for it, on the other. This makes NLP approaches for its study difficult without a solid morphological analysis. The language is severely endangered and the work presented in this paper forms a part of a greater whole in its revitalization efforts. Furthermore, we intersperse our description with facilitation and description practices not well documented in the infrastructure. Currently, the analyzer covers over 30,000 Skolt Sami words in 148 inflectional paradigms and over 12 derivational forms. Keywords: Skolt Sami, endangered languages, morphology 1. Introduction members access to language materials directly. The trick is Skolt Sami is a minority language belonging to Sami to find new uses and reuses for data sets and technologies branch of the Uralic language family. With its native speak- as well as to bring development closer to the language com- ers at only around 300, it is considered a severely endan- munity. If development follows the North Sámi lead, any gered language (Moseley, 2010), which, despite its pluri- project can reap from the work already done. centric potential, is decidedly focusing on one mutual lan- Extensive work has already been done on data and tool gauge (Rueter and Hämäläinen, 2019). In this paper, we development in the GiellaLT infrastructure (Moshagen et present our open-source FST morphology for the language, al., 2013) and (Moshagen et al., 2014), and previous work 3 which is a part of the wider context of its on-going revital- also exists for Skolt Sami (Sammallahti and Mosnikoff, ization efforts. -
000 Euralex 2010 03 Plenary
> State of the Art of the Lexicography of European Lesser Used or Non- State Languages anne tjerk popkema ‘The people who chronicle the life of our language (…) are called lexicographers’ (Martin Hardee, blogger in Cyberspace, 2006) 0 Introductory remarks 1 Language codification and language elaboration (‘Ausbau’) are key ingredients for raising a lesser used language to a level that is adequate for modern use.2 In dictionaries (as well as in grammars) a language’s written standard may be laid down, ‘codified’. 3 At the same time dictionaries make clear what lexical gaps remain or arise in a language. The filling of such gaps – part of language elaboration – will only gain wide acceptance when, in turn, it is codified in a dictionary itself. Thus, both prime categories of language development – codification and elaboration – are hats worn by the same head: the lexicographer’s. Bo Svensén begins the opening chapter of his recent handbook on lexicography by stating that ‘dictionaries are a cultural phenomenon. It is a commonplace to say that a dictionary is a product of the culture in which it has come into being; it is less so to say that it plays an important part in the development of that culture.’ 4 In the case of lesser used languages, language development may lead to (increased) use in domains that were formerly out of reach because of the dominance – for any number of reasons – of another language. In such instances, language development equals language emancipation. An emancipating language takes on new functions, enters new domains of society and is therefore in need of new terminology. -
Sixth Periodical Report Presented to the Secretary General of the Council of Europe in Accordance with Article 15 of the Charter
Strasbourg, 1 July 2014 MIN-LANG (2014) PR7 EUROPEAN CHARTER FOR REGIONAL OR MINORITY LANGUAGES Sixth periodical report presented to the Secretary General of the Council of Europe in accordance with Article 15 of the Charter NORWAY THE EUROPEAN CHARTER FOR REGIONAL OR MINORITY LANGUAGES SIXTH PERIODICAL REPORT NORWAY Norwegian Ministry of Local Government and Modernisation 2014 1 Contents Part I ........................................................................................................................................... 3 Foreword ................................................................................................................................ 3 Users of regional or minority languages ................................................................................ 5 Policy, legislation and practice – changes .............................................................................. 6 Recommendations of the Committee of Ministers – measures for following up the recommendations ................................................................................................................... 9 Part II ........................................................................................................................................ 14 Part II of the Charter – Overview of measures taken to apply Article 7 of the Charter to the regional or minority languages recognised by the State ...................................................... 14 Article 7 –Information on each language and measures to implement -
Fifth Periodical Report Presented to the Secretary General of the Council of Europe in Accordance with Article 15 of the Charter
Strasbourg, 17 November 2017 MIN-LANG (2017) PR 7 EUROPEAN CHARTER FOR REGIONAL OR MINORITY LANGUAGES Fifth periodical report presented to the Secretary General of the Council of Europe in accordance with Article 15 of the Charter FINLAND THE FIFTH PERIODIC REPORT BY THE GOVERNMENT OF FINLAND ON THE IMPLEMENTATION OF THE EUROPEAN CHARTER FOR REGIONAL OR MINORITY LANGUAGES NOVEMBER 2017 2 CONTENTS INTRODUCTION...................................................................................................................................................6 PART I .................................................................................................................................................................7 1. BASIC INFORMATION ON FINNISH POPULATION AND LANGUAGES....................................................................7 1.1. Finnish population according to mother tongue..........................................................................................7 1.2. Administration of population data ..............................................................................................................9 2. SPECIAL STATUS OF THE ÅLAND ISLANDS.............................................................................................................9 3. NUMBER OF SPEAKERS OF REGIONAL OR MINORITY LANGUAGES IN FINLAND.................................................10 3.1. The numbers of persons speaking a regional or minority language..........................................................10 3.2. Swedish ......................................................................................................................................................10 -
Sami Sieidis in a Nordic Context?
ELSE MUNDAL Sami Sieidis in a Nordic Context? ABSTRACT In the present article, the author discusses two Old Norse texts that may indicate that the Sami cult of sieidis had spread to the neighbouring Old Norse culture in the period before the Chris- tianization of Norway. One of these texts is found in the Older Eiðsifaþing law, the law of the inland part of Eastern Norway. According to this law, it was prohibited to believe in (the power of) the finnar (Sami), and among their powerful objects, rót (the root of a tree) is mentioned. This root is in all likeli- hood a Sami sieidi that was sought out by Norwegians for help, probably for medical reasons. The other text is a notice in the Icelandic Landnámabók in which it is mentioned that a settler from Northern Norway worshipped some stones in the outfields on the border of his settlement, calledGunnsteinar . There are closer parallels to this outfields cult in Sami culture than in Old Norse culture. KEYWORDS Sami sieidis, Gunnsteinar, Eiðsifaþing law, Landnámabók, Sami cult, Old Norse cult ANDERS HULTGÅRD Personal Religion among the Ancient Scandinavians and the Fulltrúi-Concept ABSTRACT The question of personal religion among the ancient Scandinavians has centered around the concept of having a deity as one’s fulltrúi ‘trustworthy friend,’ ástvinr ‘close friend,’ or vinr ‘friend.’ Most scholars of the twentieth century regarded the concept as a true expression of pre-Christian Germanic belief. By contrast, modern scholarship strongly tends to see it as a construct of medieval authors who took the saints’ cult as a model to describe the personal piety of their ninth and tenth cen- tury compatriots. -
Samiskan Under Fyra Sekel I Svenska Kyrkans Arbete
Samiskan under fyra sekler i Svenska kyrkans arbete olavi korhonen Samiskan under fyra sekel i Svenska kyrkans arbete Abstract De första samiska böckerna publicerades 1619 av Nicolaus Andreæ, kyrkoherde i Piteå. Hans mässbok och ABC-bok hade mycket bristfällig samiska. Ett annat 1600-talsverk var Manuale Lapponicum (1648), en kyrkohandbok för Torne lappmark av Johannes Tornæus, kyrkoherde i Torneå. År 1669 utgavs en sydligare motsvarig- het för Ume och Pite lappmark av kyrkoherde Olaus Stephani Graan i Lycksele. Lycksele blev under 1700-talet centrum för arbetet med kyrklig litteratur på samiska. Där skapade prosten Pehr Fjellström ett skriftspråk på umesamisk grund. Han utgav en kyrkohandbok, Luthers lilla katekes och en psalmbok (1744) förutom en grammatik och en umesamisk ordsamling (1738). Viktigast var översättningen av Nya testamentet (1755). Genom att skola andra präster i samiska, bl.a. Pehr Hög- ström, Gellivare församlings förste kyrkoherde, spred Fjellström denna samiska till nordliga församlingar. Högström lade mer vikt vid lulesamiska i översättningen av en postilla (1737) och en katekes (1748), vilket gav ordförrådet en nordligare prägel. Ett viktigt verk var Johan Öhrlings och Eric Lindahls Lexicon Lapponicum (1780). Redan före år 1800 började arbetet att översätta Gamla testamentet till samiska. Kyrkoherde Samuel Öhrling i Jokkmokk hade huvudansvaret. I arbetet deltog flera sa- miskkunniga präster. Tat Ailes Tjalog, Den Heliga Skrift, trycktes 1811. Under 1800-talet utkom flera psalmböcker, av vilka en utgåva från 1895 länge brukades i Lule lappmark. När Lars Levi Læstadius lät trycka några småskrifter på lulesamiska, inleddes en över- gång till en samiska med fastare grund i ett områdes muntliga språk. -
Kusmenko: Einführung
BERLINER BEITRÄGE ZUR SKANDINAVISTIK Titel/ Der samische Einfluss auf die skandinavischen Sprachen. title: Ein Beitrag zur skandinavischen Sprachgeschichte Autor(in)/ Jurij K. Kusmenko author: Kapitel/ »Einführung« chapter: In: Kusmenko, Jurij K.: Der samische Einfluss auf die skandinavischen Sprachen. Berlin: Nordeuropa-Institut, 2. durchgesehene Ausg., 2011 ISBN: 3-932406-25-7 978-3-932406-25-6 Reihe/ Berliner Beiträge zur Skandinavistik Bd. 10 series: ISSN: 0933-4009 Seiten/ 13–31 pages: Feste URL: [http://edoc.hu-berlin.de/....] © Copyright: Nordeuropa-Institut Berlin sowie der Autor. © Copyright: Department for Northern European Studies Berlin and the author. Einführung Nach dem Zerfall des Gemeingermanischen haben die daraus hervorge- gangenen Sprachen eine Reihe gemeinsamer phonologischer und gram- matischer Merkmale herausgebildet. In der Phonologie handelt es sich dabei um eine gleichartige prosodische und phonematische Hervorhe- bung des Wurzelmorphems, in Morphologie und Syntax um die Heraus- bildung einer festen Wortfolge, die Reduzierung des Kasus- und Genus- systems sowie die Entwicklung der neuen grammatischen Kategorien Bestimmtheit, Perfekt und Passiv. Die gesamte Geschichte der germani- schen Sprachen ist durch eine Entwicklung hin zum Analytismus geprägt. Wie die anderen germanischen Sprachen weisen auch die skandinavi- schen dafür mehrere Beispiele auf. Jene analytischen Merkmale, deren Verbreitungsrichtung wir verfolgen können – so z. B. die Hervorhebung des Wurzelmorphems (Quantitätsverschiebung) oder die Vereinfachung -
SELECTED ACTIVITIES RELATED to CLIMATE CHANGE, TRADITIONAL KNOWLEDGE and EDUCATION in REINDEER HUSBANDRY 2016 - 2018
For the Barents Working Group of Indigenous Peoples (WGIP): SELECTED ACTIVITIES RELATED TO CLIMATE CHANGE, TRADITIONAL KNOWLEDGE and EDUCATION IN REINDEER HUSBANDRY 2016 - 2018. Reindeer herders traditionalfrom knowledge - transforming indigenous economy in the circumpolar north – side event UniversitySeptember of the 11 Arcticth UArctic Institute Conference for Circumpolar St Petersburg Reindeer 2016. Husbandry Unprecedented climate challenges and threats to reindeer husbandry will effects low sustainability of reindeer(UArctic herding EALÁT economy Institute) in the(UEI) circumpolar north, effecting local communities and environment. Future sustainable governance of reindeer husbandry might face major challenges related to these rapid changes. Reindeer husbandry represents a livelihood and way of life based on practices and knowledge developed through long-term COORDINATEDexperiences AT in livingINTERNATIONAL under harsh and CENTREhighly variable FOR conditions. REINDEER Therefore, HUSBANDRY reindeer herders’ traditional knowledge, culture, and language(ICR) provide a central foundation for rebuilding social ecological resilience locally, to be used to navigate through future shocks and disturbances. This workshop will explore potential ways to full and effective participation of indigenous reindeer herder towards ecologically, socially and economically sustainable societies. Governance of Arctic resources have to reconnect to our planet, and to the original indigenous knowledge base. Our vision is to develop adaptive skills to sustaining -
A World Heritage Site As Arena for Sami Ethno-Politics in Sweden
Managing Laponia ACTA UNIVERSITATIS UPSALIENSIS Uppsala Studies in Cultural Anthropology no 47 Carina Green Managing Laponia A World Heritage as arena for Sami ethno-politics in Sweden Dissertation presented at Uppsala University to be publicly examined in Geijersalen, Thun- bergsväg 3H, Uppsala, Friday, December 18, 2009 at 10:00 for the degree of Doctor of Philosophy. The examination will be conducted in English. Abstract Green, C. 2009. Manging Laponia. A World Heritage Site as Arena for Sami Ethno-Politics in Sweden. Acta Universitatis Upsaliensis. Uppsala Studies in Cultural Anthropology 47. 221 pp. Uppsala. ISBN 978-91-554-7656-4. This study deals with the implications of implementing the World Heritage site of Laponia in northern Sweden. Laponia, consisting of previously well-known national parks such as Stora Sjöfallet and Sarek, obtained its World Heritage status in 1996. Both the biological and geological significance of the area and the local Sami reindeer herding culture are in- cluded in the justification for World Heritage status. This thesis explores how Laponia became an arena for the long-standing Sami ethno-political struggle for increased self- governance and autonomy. In many other parts of the world, various joint management schemes between indigenous groups and national environmental protection agencies are more and more common, but in Sweden no such agreements between the Swedish Envi- ronmental Protection Agency and the Sami community have been tested. The local Sami demanded to have a significant influence, not to say control, over the future management of Laponia. These were demands that were not initially acknowledged by the local and national authorities, and the negotiations about the management of Laponia continued over a period of ten years. -
DOMINANT ETHNIC GROUPS in EUROPE, 1850-1940 · Ethnic Groups and Language Rights Volume III
COMPARATIVE STUDIES ON GOVERNMENTS AND NON 1/ .f:( - DOMINANT ETHNIC GROUPS IN EUROPE, 1850-1940 · Ethnic Groups and Language Rights Volume III ~., i . 1 . Edited by { SERGIJ VILFAN in collaboration with 1 GUDMUND SANDVIK and LODE WILS 1. f' 1 ~... ,' Non-existent Sami Language Rights in Norway, 1850--1940 GUDMUND SANDVIK European Science Foundation NEW YORK UNIVERSITY PRESS DARTMOUTH 128 .., 13 Non-existent Sami Language Rights in Norway, 1850-1940 GUDMUND SANDVIK Background The Samis are an ethnic minority in the Nordic countries and in Russia. According to more or less reliable censuses, they number today about 40 000 in Norway (1 per cent of the total population), 20000 in Sweden (0.25 per cent), 4500 in Finland (0.1 per cent) and 2000 on the Kola peninsula in Russia. Only Finland has had a regular ethnic census. The Samis call themselves sapmi or sabmi. It is only recently (after 1950) that this name has been generally accepted in the Nordic countries (singular same, plural samer). Tacitus wrote about fenni;1 Old English had finnas;2 Historia NorvegitE (History of Norway) written about P80 had finni,3 and the Norse word was {imzar. 4 In medieval Icelandic and Norwegian literature, Finnmork was the region in northern Scandinavia where the Samis lived. The northern most Norwegian fylke (county) of today, Finnmark, takes jts name from the huge medieval Finnmork. But Samis of today still use the name S4pnzi for the entire region where they live (See Map 13.1, ;].. which "shows S4pmiwith state frontiers and some Sami centres). 'Finner' is accordingly an authentic Norwegian name.