000 Euralex 2010 03 Plenary
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Connections Between Sámi and Basque Peoples
Connections between Sámi and Basque Peoples Kent Randell 2012 Siidastallan Outside of Minneapolis, Minneapolis Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota “D----- it Jim, I’m a librarian and an armchair anthropologist??” Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota Connections between Sámi and Basque Peoples Hard evidence: - mtDNA - Uniqueness of language Other things may be surprising…. or not. It is fun to imagine other connections, understanding it is not scientific Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota Documentary: Suddenly Sámi by Norway’s Ellen-Astri Lundby She receives her mtDNA test, and express surprise when her results state that she is connected to Spain. This also surprised me, and spurned my interest….. Then I ended up living in Boise, Idaho, the city with the largest concentration of Basque outside of Basque Country Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota What is mtDNA genealogy? The DNA of the Mitochondria in your cells. Cell energy, cell growth, cell signaling, etc. mtDNA – At Conception • The Egg cell Mitochondria’s DNA remains the same after conception. • Male does not contribute to the mtDNA • Therefore Mitochondrial mtDNA is the same as one’s mother. Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota Kent Randell (c) 2012 --- 2012 Siidastallan, Linwood Township, Minnesota Four generation mtDNA line Sisters – Mother – Maternal Grandmother – Great-grandmother Jennie Mary Karjalainen b. Kent21 Randell March (c) 2012 1886, --- 2012 Siidastallan,parents from Kuusamo, Finland Linwood Township, Minnesota Isaac Abramson and Jennie Karjalainen wedding picture Isaac is from Northern Norway, Kvaen father and Saami mother from Haetta Kent Randell (c) 2012 --- 2012 Siidastallan, village. -
On Etymology of Finnic Term for 'Sky'
Archaeoastronomy and Ancient Technologies 2019, 7(2), 5–10; http://aaatec.org/art/a_jg1 www.aaatec.org ISSN 2310-2144 On Etymology of Finnic Term for 'Sky' Jingyi Gao Beijing International Studies University, China; E-mail: [email protected] Institute of the Estonian Language, Tallinn, Estonia; E-mail: [email protected] University of Tartu, Estonia; E-mail: [email protected] Abstract Using etymological methods, the present study has identified five Sinitic and Uralic shared etymologies. These five etymologies form a rhyme correspondence. This regular sound change validates the etymological connection between Sinitic and Uralic. The Finnic term for 'sky' is among these five etymologies. It is demonstrated that this word root should be aboriginal in Sino-Uralic languages. Keywords: Rhyme correspondence, Sinitic, Uralic, Sino-Uralic, Baltic, Germanic, Celtic, Italic, Indo- Iranian. Introduction The Finnic term for 'sky' (Estonian taevas; Finnish taivas; Livonian tōvaz; Veps taivaz; Votic taivas) has no cognate in other Uralic languages, therefore it has been previously considered a loanword to Finnic from Indo-Iranian (Schott, 1849, p. 126), from Baltic (Thomsen, 1869, p. 34, 73), or from Germanic (Koivulehto, 1972). The present study finds that this Finnic word has cognates in Sinitic languages supported by a deep rhyme correspondence consisting of five etymologies; therefore this word root must be aboriginal in Sino-Uralic languages. Gao (e.g. 2005, 2014b, 2019; Gāo, 2008) detected and identified Sinitic and Uralic shared etymologies, and has solely researched Sinitic and Uralic shared etymologies for more than a decade. We could infer a general skepticism about this approach. -
Sami in Finland and Sweden
A baseline study of socio-economic effects of Northland Resources ore establishment in northern Sweden and Finland Indigenous peoples and rights Stefan Ekenberg Luleå University of Technology Department of Human Work Sciences 2008 Universitetstryckeriet, Luleå A baseline study of socio-economic effects of Northland Resources ore establishment in northern Sweden and Finland Indigenous peoples and rights Stefan Ekenberg Department of Human Work Sciences Luleå University of Technology 1 Summary The Sami is considered to be one people with a common homeland, Sápmi, but divided into four national states, Finland, Norway, Russia and Sweden. The indigenous rights therefore differ in each country. Finlands Sami policy may be described as accommodative. The accommodative Sami policy has had two consequences. Firstly, it has made Sami collective issues non-political and has thus change focus from previously political mobilization to present substate administration. Secondly, the depoliticization of the Finnish Sami probably can explain the absent of overt territorial conflicts. However, this has slightly changes due the discussions on implementation of the ILO Convention No 169. Swedish Sami politics can be described by quarrel and distrust. Recently the implementation of ILO Convention No 169 has changed this description slightly and now there is a clear legal demand to consult the Sami in land use issues that may affect the Sami. The Reindeer herding is an important indigenous symbol and business for the Sami especially for the Swedish Sami. Here is the reindeer herding organized in a so called Sameby, which is an economic organisations responsible for the reindeer herding. Only Sami that have parents or grandparents who was a member of a Sameby may become members. -
Arxiv:2004.04803V1 [Cs.CL] 9 Apr 2020
FST Morphology for the Endangered Skolt Sami Language Jack Rueter, Mika Hämäläinen Department of Digital Humanities University of Helsinki {jack.rueter, mika.hamalainen}@helsinki.fi Abstract We present advances in the development of a FST-based morphological analyzer and generator for Skolt Sami. Like other minority Uralic languages, Skolt Sami exhibits a rich morphology, on the one hand, and there is little golden standard material for it, on the other. This makes NLP approaches for its study difficult without a solid morphological analysis. The language is severely endangered and the work presented in this paper forms a part of a greater whole in its revitalization efforts. Furthermore, we intersperse our description with facilitation and description practices not well documented in the infrastructure. Currently, the analyzer covers over 30,000 Skolt Sami words in 148 inflectional paradigms and over 12 derivational forms. Keywords: Skolt Sami, endangered languages, morphology 1. Introduction members access to language materials directly. The trick is Skolt Sami is a minority language belonging to Sami to find new uses and reuses for data sets and technologies branch of the Uralic language family. With its native speak- as well as to bring development closer to the language com- ers at only around 300, it is considered a severely endan- munity. If development follows the North Sámi lead, any gered language (Moseley, 2010), which, despite its pluri- project can reap from the work already done. centric potential, is decidedly focusing on one mutual lan- Extensive work has already been done on data and tool gauge (Rueter and Hämäläinen, 2019). In this paper, we development in the GiellaLT infrastructure (Moshagen et present our open-source FST morphology for the language, al., 2013) and (Moshagen et al., 2014), and previous work 3 which is a part of the wider context of its on-going revital- also exists for Skolt Sami (Sammallahti and Mosnikoff, ization efforts. -
Sixth Periodical Report Presented to the Secretary General of the Council of Europe in Accordance with Article 15 of the Charter
Strasbourg, 1 July 2014 MIN-LANG (2014) PR7 EUROPEAN CHARTER FOR REGIONAL OR MINORITY LANGUAGES Sixth periodical report presented to the Secretary General of the Council of Europe in accordance with Article 15 of the Charter NORWAY THE EUROPEAN CHARTER FOR REGIONAL OR MINORITY LANGUAGES SIXTH PERIODICAL REPORT NORWAY Norwegian Ministry of Local Government and Modernisation 2014 1 Contents Part I ........................................................................................................................................... 3 Foreword ................................................................................................................................ 3 Users of regional or minority languages ................................................................................ 5 Policy, legislation and practice – changes .............................................................................. 6 Recommendations of the Committee of Ministers – measures for following up the recommendations ................................................................................................................... 9 Part II ........................................................................................................................................ 14 Part II of the Charter – Overview of measures taken to apply Article 7 of the Charter to the regional or minority languages recognised by the State ...................................................... 14 Article 7 –Information on each language and measures to implement -
Morphological Analyzers and Other Digital Tools for Uralic Languages
Morphological analyzers and other digital tools for Uralic languages Jack Rueter University of Helsinki, Digital Humanities Some Terminology Word associations Working out a methodology Morphological analyzers under development An outline Where it started Setting up a rule-based description Tools Important players 11/10/19 Jack Rueter, Ph.D. University of Helsinki, Digital Humanities 2 Open-source Language form Rule-based Finite-state morphology Some Disambiguation terminology Multiple reusability Linguistics and Language technology Neural networks Transfer learning Active learning 11/10/19 Jack Rueter, Ph.D. University of Helsinki, Digital Humanities 3 (1) Keyboards (2) Spellers Word (3) Dictionaries associations (4) ICALL for multi- (5) Translation reusability (6) Text-to-speech (7) …. 11/10/19 Jack Rueter, Ph.D. University of Helsinki, Digital Humanities 4 (1) Extract paradigms from grammars, readers and research to build an analyzer. (2) Extract words, part-of-speech information and definitions from existing dictionaries and research. Build on what has already been done (Dutch, French, German, Russian,…) (3) Test analysis coverage on written texts. Are the forms Searching for a unrecognized proper words? (4) Disambiguate morphological analyses based on grammars and methodology research. Point out gaps in descriptions for linguistics (5) Test syntactic disambiguation on example sentences cited in grammatical descriptions of the language. And then retest on text corpora. (6) Make disambiguated sentences -
Sami Sieidis in a Nordic Context?
ELSE MUNDAL Sami Sieidis in a Nordic Context? ABSTRACT In the present article, the author discusses two Old Norse texts that may indicate that the Sami cult of sieidis had spread to the neighbouring Old Norse culture in the period before the Chris- tianization of Norway. One of these texts is found in the Older Eiðsifaþing law, the law of the inland part of Eastern Norway. According to this law, it was prohibited to believe in (the power of) the finnar (Sami), and among their powerful objects, rót (the root of a tree) is mentioned. This root is in all likeli- hood a Sami sieidi that was sought out by Norwegians for help, probably for medical reasons. The other text is a notice in the Icelandic Landnámabók in which it is mentioned that a settler from Northern Norway worshipped some stones in the outfields on the border of his settlement, calledGunnsteinar . There are closer parallels to this outfields cult in Sami culture than in Old Norse culture. KEYWORDS Sami sieidis, Gunnsteinar, Eiðsifaþing law, Landnámabók, Sami cult, Old Norse cult ANDERS HULTGÅRD Personal Religion among the Ancient Scandinavians and the Fulltrúi-Concept ABSTRACT The question of personal religion among the ancient Scandinavians has centered around the concept of having a deity as one’s fulltrúi ‘trustworthy friend,’ ástvinr ‘close friend,’ or vinr ‘friend.’ Most scholars of the twentieth century regarded the concept as a true expression of pre-Christian Germanic belief. By contrast, modern scholarship strongly tends to see it as a construct of medieval authors who took the saints’ cult as a model to describe the personal piety of their ninth and tenth cen- tury compatriots. -
Reconstructing Language Ancestry by Performing Word Prediction with Neural Networks
MSc Artificial Intelligence Track: Natural Language Processing Master Thesis Reconstructing language ancestry by performing word prediction with neural networks by Peter Dekker 10820973 Defense date: 25th January 2018 42 ECTS January 2017 – January 2018 Supervisors: Assessor: dr. Jelle Zuidema dr. Raquel Fernández prof. dr. Gerhard Jäger Institute for Logic, Language and Computation – University of Amsterdam Seminar für Sprachwissenschaft – University of Tübingen 2 Contents 1 Introduction 5 1.1 Historical linguistics . 5 1.1.1 Historical linguistics: the comparative method and beyond . 5 1.1.2 Sound changes . 6 1.1.3 Computational methods in historical linguistics . 8 1.2 Developments in natural language processing . 10 1.2.1 Natural language processing . 10 1.2.2 Machine learning and language . 10 1.2.3 Deep neural networks . 10 1.3 Word prediction . 11 1.3.1 Word prediction . 11 1.3.2 Model desiderata . 12 1.4 Summary . 12 2 Method 13 2.1 Pairwise word prediction . 13 2.1.1 Task . 13 2.1.2 Models . 13 2.1.3 Data . 17 2.1.4 Experiments . 19 2.2 Applications . 22 2.2.1 Phylogenetic tree reconstruction . 22 2.2.2 Sound correspondence identification . 22 2.2.3 Cognate detection . 22 2.3 Summary . 23 3 Results 25 3.1 Word prediction . 25 3.2 Phylogenetic tree reconstruction . 25 3.3 Identification of sound correspondences . 27 3.4 Cognate detection . 29 3.5 Summary . 31 4 Context vector analysis 33 4.1 Extraction of context vectors and input/target words . 33 4.2 PCA visualization . 33 4.3 Cluster analysis . -
Herders' Congress
Herders' Congress The oldest industry with the widest tracks We are now joining together in Inari for the 2ndWorld Another equally important part ofthis congress is the p- Reindeer Herders' Congress in the period of June 18-23, sibility of meeting each other. People living in a \zst area 200 1. Reindeer husbandry is one of the industries, which and working in a scattered industry need to learn to bow have survived longest and has made the widest tracks in each other. There is a possibility for us now to meet each the North. other and get acquainted, and in the congress pqyam rime The reindeer peoples are now gathering for the third time has been reserved for that purpose. We have been rrying and it is the second time that we are having a worldwide to arrange the program so that there will be a possibility reindeer congress. The first meeting was in 1993 in Trom- for each one of us to get acquainted with the 6eM of rese- so, Norway, with participants from 15 different reindeer arch in different reindeer regions as weU as aih the aut- peoples. It was then that the decision was made for estab- horities working with matters that concern reindeer hw- lishing the Association of World Reindeer Herders (WFU-I). bandry. The reindeer husbandry does not exis alone in its The second meeting took place in Nadym, in the central area, and it is necessary to cooperate with the adminima- reindeer herding area of Nenets in Yamal and it was there tion. that WRH officially was established in 1997. -
Researching Less-Resourced Languages – the Digisami Corpus
Researching Less-Resourced Languages – the DigiSami Corpus Kristiina Jokinen University of Helsinki, Finland and AIRC, AIST Tokyo Waterfront, Japan [email protected] Abstract Increased use of digital devices and data repositories has enabled a digital revolution in data collection and language research, and has also led to important activities supporting speech and language technology research for less-resourced languages. This paper describes the DigiSami project and its research results, focussing on spoken corpus collection and speech technology for the Fenno-Ugric language North Sami. The paper also discusses multifaceted questions on ethics and privacy related to data collection for less-resourced languages and indigenous communities. Keywords: corpus collection, under-resourced languages, North Sami with new technology applications. The main motivation 1. Introduction was to improve digital visibility and viability of the target languages, and to explore different choices for encouraging Several projects and events have increased research and maintaining the use of less-resourced languages in the activities for under-resourced languages during the past digitalized world. The goals of the DigiSami project are years. For instance, the DLDP-project (Digital Language discussed in Jokinen (2014) and Jokinen et al. (2017). Diversity Project) is to advance the sustainability of Europe’s regional and minority languages, while the Flare- The DigiSami project deals with the North Sami language net network and the LRE Map (Calzolari et al. 2012) have (Davvisámegiela) which belongs to the Fenno-Ugric had a big impact on sharing language resources and making language family and is one of the nine Sami languages speech corpora freely available. -
ISO / TC 37 / SC 2 / WG 1 Table 1
ISO / TC 37 / SC 2 / WG 1 ISO / TC 37 / SC 2 / WG 1 N 65 TC 37 – Terminology (principles and coordination) Acting convener: Håvard Hjulstad SC 2 – Layout of vocabularies WG 1 – Coding systems Date: 2000-07-24 Current status of ISO 639-1 tables The following three tables are: 1. The finalized items in ISO 639-1 in alphabetical order by language identifier, i.e. the current version of table 3 in the DIS document. 2. The items that are still in “Annex C” in alphabetical order by the English name, i.e. the current version of table C.1 in the DIS document. Note: This annex will not be included in the next version of the document. 3. All changes from ISO 639:1988 to the current version of 639-1, in alphabetical order by the (current) language identifiers. We may decide to include this information in a new annex to 639-1. Table 1 – Current version of table 3 Id English name French name Indigenous name Status aa Afar afar afar a - Finalized 1988 ab Abkhazian; Abkhaz abkhaze; abkhazien a%sua bys%wa [1E55 p- a - Finalized 1988 acute; 017E z-caron] ae Avestan avestique ? b - Finalized 2000-02 af Afrikaans afrikaans afrikaans a - Finalized 1988 ak Akan akan akana a - Finalized before 2000-02 am Amharic; Abyssinian amharique amarinja a - Finalized 1988 ar Arabic arabe 'arabiy a - Finalized 1988 as Assamese assamais asam% [012B i-macron] a - Finalized 1988 av Avar; Avarish avar avar mac% [2021 double a - Finalized before dagger] 2000-02 ay Aymara aymara aymara a - Finalized 1988 az Azerbaijani azéri; azerbaïdjanais az%rbaycan dil [0259 a - Finalized -
Kuávsui Já Peeivičuovâ. 'Sarastus Ja Päivänvalo'. Inarinsaamen
(Nid.) 82-0 (Verkkojulkaisu) KUÁVSUI JÁ PEEIVIČUOVÂ. ’SARASTUS JA PÄIVÄNVALO’. INARINSAAMEN KIELEN REVITALISAATIO UUH9_VALMIS.inddH9_VALMIS.indd 1 117.5.20157.5.2015 221:44:231:44:23 Annika Pasanen: Kuávsui já peeivičuovâ. ’Sarastus ja päivänvalo’ . Inarinsaamen kielen revitalisaatio Uralica Helsingiensia 9. Layout, cover Alexandra Kellner, Niko Partanen Cover photograph Paadar Images Woman standing at the door of the church at Pielpajärvi ISBN 978-952-5667-70-7 Orders • Tilaukset ISSN 1797-3945 Tiedekirja <www.tiedekirja.fi> Unigrafi a Oy Snellmaninkatu 13 <[email protected]> Helsinki 2015 FI-00170 Helsinki Uralica Helsingiensia Uralica Helsingiensia is a series published jointly by the University of Helsinki Finno-Ugrian Studies Programme and the Finno-Ugrian Society. It features monographs and thematic col- lections of articles with a research focus on Uralic languages, and it also covers the linguistic and cultural aspects of Estonian, Hungarian and Saami studies at the University of Helsinki. The series has a peer review system, i.e. the manuscripts of all articles and monographs submitted for publication will be refereed by two anonymous reviewers before binding decisions are made concerning the publication of the material. Uralica Helsingiensia on Helsingin yliopiston suomalais-ugrilaiset kielet ja kulttuurit -oppi- aineryhmän yhdessä Suomalais-Ugrilaisen Seuran kanssa julkaisema sarja, jossa ilmestyy monografi oita ja temaattisia artikkelikokoelmia. Niiden aihepiiri liittyy uralilaisten kielten tut- kimukseen ja kattaa myös Helsingin