Addressing Challenges of Machine Translation of Inuit Languages by Jeffrey C Micher
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Argument Marking in Harakmbut
Argument marking in Harakmbut: Looking for referential transparency An Van linden ([email protected]) University of Leuven & Research Foundation Flanders (FWO) SLW6, Pavia, 9/09/2014 1. Introduction − Harakmbut is a language from the Peruvian Amazon, spoken in ‘native communities’ in the departments of Madre de Dios and Cusco − Genetic affiliation: − Formerly classified as an Arawakan or Maipuran language by McQuown (1955) (see Hart 1963: 6) and Matteson (1972); but this has found little acceptance (Adelaar 2007: 39). − Wise (1999: 307) states that Harakmbut is commonly accepted to be a (single language) isolate (cf. WALS; Fonseca 2002; Vergara 2007) − Adelaar (2000, 2007) proposes that it is genetically related to the Brazilian Katukina family (included in Guaporé-Mamoré linguistic area), which may be further linked to Macro-Ge − Some grammatical features are shared with Ese Eja (Tacanan family) (Pozzi-Escot 1998: 93), which is proposed to belong to the Guaporé-Mamoré linguistic area in southwest Brazil and eastern Bolivia, close to the border with Peru (Crevels & van der Voort 2008) − Previous linguistic work: focus on Amarakaeri dialect (Hart 1963; Helberg 1984, 1990; Tripp 1976ab, 1995) − Own work: two fieldwork stays in Puerto Luz, San José and Shintuya (all Amarakaeri informants): Jul-Aug 2010, Aug-Sept 2011 − Orthographic conventions: <’>: glottal stop; <¨>: nasal vowel; underlined sounds carry word stress − Agglutinating language − Synthetic verbal morphology, especially with respect to mood and argument marking 2. Mood marking − Argument marking interacts with mood marking: Harakmbut distinguishes between three mood types: indicative, dubitative and imperative mood, each of which has a distinct set of argument markers (cf. -
Nunavut Utilities Technical Guide, Version 2.1, June 2, 2005 1 CONTENTS
Nunavut Utilities Technical Guide Prepared by Multilingual E-Data Solutions For the Nunavut Department of Culture, Language, Elders and Youth Nunavut Utilities Technical Guide, Version 2.1, June 2, 2005 1 CONTENTS 1. Introduction................................................................................................................... 4 2. Syllabic Font Conversions............................................................................................ 4 2.1 Using Unicode as a Pivot Font.................................................................................. 5 2.2 Conversions to Unicode............................................................................................ 6 2.3 Conversions back to “Legacy” fonts......................................................................... 6 2.4 Special Processing Routines ................................................................................. 6 2.4.1 Placement of Long vowel markers .................................................................... 6 2.4.2 Extra Long vowel markers................................................................................. 7 2.4.3 Typing variations and collapsing characters...................................................... 7 3. Roman/Syllabic Transliteration Conversions ............................................................ 8 3.1 Introduction............................................................................................................... 8 3.2 Inuit Cultural Institute (ICI) Writing System........................................................... -
Agentive and Patientive Verb Bases in North Alaskan Inupiaq
AGENTTVE AND PATIENTIVE VERB BASES IN NORTH ALASKAN INUPIAQ A DISSERTATION Presented to the Faculty of the University of Alaska Fairbanks in Partial Fulfillment of the Requirements for the Degree of DOCTOR OF PHILOSOPHY By TadatakaNagai, B.Litt, M.Litt. Fairbanks, Alaska May 2006 © 2006 Tadataka Nagai Reproduced with permission of the copyright owner. Further reproduction prohibited without permission. UMI Number: 3229741 INFORMATION TO USERS The quality of this reproduction is dependent upon the quality of the copy submitted. Broken or indistinct print, colored or poor quality illustrations and photographs, print bleed-through, substandard margins, and improper alignment can adversely affect reproduction. In the unlikely event that the author did not send a complete manuscript and there are missing pages, these will be noted. Also, if unauthorized copyright material had to be removed, a note will indicate the deletion. ® UMI UMI Microform 3229741 Copyright 2006 by ProQuest Information and Learning Company. All rights reserved. This microform edition is protected against unauthorized copying under Title 17, United States Code. ProQuest Information and Learning Company 300 North Zeeb Road P.O. Box 1346 Ann Arbor, Ml 48106-1346 Reproduced with permission of the copyright owner. Further reproduction prohibited without permission. AGENTIVE AND PATIENTIYE VERB BASES IN NORTH ALASKAN INUPIAQ By TadatakaNagai ^ /Z / / RECOMMENDED: -4-/—/£ £ ■ / A l y f l A £ y f 1- -A ;cy/TrlHX ,-v /| /> ?AL C l *- Advisory Committee Chair Chair, Linguistics Program APPROVED: A a r// '7, 7-ooG Date Reproduced with permission of the copyright owner. Further reproduction prohibited without permission. iii Abstract This dissertation is concerned with North Alaskan Inupiaq Eskimo. -
Inuit & Cancer: Fact Sheets
Inuit & Cancer: Fact Sheets Inuit Tapiriit Kanatami February, 2009 INTRODUCTION Inuit Tapiriit Kanatami developed a series of fact sheets to raise awareness of Inuit and cancer with the intent of informing decision makers, advisors and non- government organizations about Inuit’s unique cancer concerns and realities. The following are the themes of each fact sheet: u About Inuit u Inuit Health Status u Health Care Delivery in Inuit Regions u Cancer Burden u Prevention u Screening, Diagnosis and Treatment u Cancer Care u Human Resources u Research & Surveillance This project received financial support from the Canadian Cancer Action Network. FACT SHEET: ABOUT INUIT Did you Know? Who are Inuit? u One person of Inuit descent is an Inuit are the indigenous people that Inuk, which is singular for Inuit. inhabit the Arctic regions of Canada, u Inuvialuit is the correct term for Russia, Alaska and Greenland. In Inuit from the Inuvialuit Settlement Canada, there are approximately 50,500 Region in the NWT. Inuit living primarily in four regions: u Inuit are “Aboriginal” or “First Nunavik (Northern Quebec), Peoples”, but are not “First Nunatsiavut (Labrador), the Inuvialuit Nations”. Also, Inuit are not Innu. Innu are a First Nations group Settlement Region (Northwest located in northeastern Quebec and Territories), and the new territory parts of Labrador. of Nunavut. There are Inuit living in u Inuit do not live “on” or “off” every political jurisdiction in Canada, reserve, this applies only to First with growing populations in Ottawa, Nations. Inuit live in Inuit Montreal, Yellowknife, Winnipeg, communities, hamlets or villages. Edmonton and other cities. -
A FEW THOUGHTS on the LANGUAGE of the Inult ¹
A FEW THOUGHTS ON THE LANGUAGE OF THE INUlT ¹ LOUIS-JACQUES DORAIS, Département D'Anthropologie, Universite Laval, Cité Universitaire, Québec, Quebec, Canada, G1K 7P4 ABSTRAC/RESUME The author outlines the present linguistic situation of Inuit, and discusses both political-economic reasons for it. and possible future developments. He then proposes an explanation for certain changes occurring in Inuktitut over the past several decades. L'auteur décrit la situation linguistique actuelle chez les Inuit et aborde une discussion portant sur les causes politico-économiques de cette situation, de même que sur son évolution éventuelle. Par la suite, il émet une hypothèse expliquant un certain nombre de changements qui se sont produits dans l'inuk- titut au coups des dernières décennies. 304 LOUIS-JACQUES DORAIS In spite of the fact that over the past few years the study of Inuktitut, both theoretical and applied, has attracted a great number of researchers, the position of this language, at least in Canada, does not yet seem to be as secure as it should be: Englishis gamingmore and more over it. The situation is somewhat discouraging, but we only have to look at what happens in Greenland to realize that Inuktitut is not necessarily bound to disappear. In that country, Greenlandic Eskimo is the customary language of private conversa- tions, schools, books, radio and administrative activity. Among Canadian Inuit, recent progress in native education and culture (like the development of local school boards and new curricula) has not reversed this trend towards linguistic assimilation. At best, it has given a few positive results in wen-defined, but rather narrow, spheres, like the standardization of Inuktitut syllabics and Roman orthography, which was achieved in 1976, and the release of a few texts and audio-visual productions on traditional life. -
Idawati Garim
Modality in Tae’ language: a grammatical-lexical view Kisman Salija – Jusmianti Garing – Idawati Garim DOI: 10.18355/XL.2017.10.04.18 Abstract There are many ways to find modality in languages. Modality of language can be expressed through grammatical or lexical feature. However, modality in Tae’ language specifically can be described through both these features. This research aims to express the modality in Tae’ language based on grammatical-lexical point of view. This is a descriptive qualitative research using library research methods. Library research aims to get of completed and detail data. Further, this research analyzes eighteen discourses of Tae’ language consisting of folklore, fable, messages, and Tae’ specific food. The result shows that there are some features marking modality in Tae’ language that expressed through grammatical and lexical feature. Grammatical modality of Tae’ established through affixation process, i.e. prefix la-, and suffixes – ri, –ra functioning as verbs and particles. Further, modality in Tae’ is also established by lexical feature using the words melo, parallu, musti, bela, wa’ding, bang, omi, sia, kade, le’, dau, tae, tannia, saba, iake, and kumua functioning as auxiliary verbs, particles, negations, and conjunctions. Semantically, these forms mark epistemic and deontic modality in Tae’ language. Epistemic and deontic modality in Tae’ describe as declarative, subjunctive, dubitative, imperative, desiderative and volition, interrogative, and possibility form. Key words: modality, epistemic, deontic, Tae’s language, grammatical-lexical view Introduction Modality can be defined as a philosophical concept, as a subject of the study of logic, or as a grammatical category. There are many definitions and classifications of modal phenomena. -
Aligning and Using an English-Inuktitut Parallel Corpus
Aligning and Using an English-Inuktitut Parallel Corpus Joel Martin, Howard Johnson, Benoit Farley and Anna Maclachlan Institute for Information Technology National Research Council Canada [email protected] Abstract as most approaches do, but also multiple substring corre- spondences resulting in far greater coverage. A parallel corpus of texts in English and in Inuktitut, an Inuit language, is presented. 2 An English-Inuktitut Corpus These texts are from the Nunavut Hansards. 2.1 The Parallel Texts The parallel texts are processed in two phases, the sentence alignment phase and the word cor- The corpus of parallel texts we present consists of respondence phase. Our sentence alignment 3,432,212 words of English and 1,586,423 words of Inuk- technique achieves a precision of 91.4% and titut from the Nunavut Hansards. These Hansards are a recall of 92.3%. Our word correspondence available to the public in electronic form in both English technique is aimed at providing the broadest and Inuktitut (www.assembly.nu.ca). The Legislative As- coverage collection of reliable pairs of Inuktitut sembly of the newly created territory of Nunavut began and English morphemes for dictionary expan- sitting on April 1, 1999. Our corpus represents 155 days sion. For an agglutinative language like Inuk- of transcribed proceedings of the Nunavut Legislative As- titut, this entails considering substrings, not sembly from that first session through to November 1, simply whole words. We employ a Pointwise 2002, which was part way through the sixth session of Mutual Information method (PMI) and attain a the assembly. -
5.3 Epistemic Modality and Evidentials in Luchuan
This thesis has been submitted in fulfilment of the requirements for a postgraduate degree (e.g. PhD, MPhil, DClinPsychol) at the University of Edinburgh. Please note the following terms and conditions of use: This work is protected by copyright and other intellectual property rights, which are retained by the thesis author, unless otherwise stated. A copy can be downloaded for personal non-commercial research or study, without prior permission or charge. This thesis cannot be reproduced or quoted extensively from without first obtaining permission in writing from the author. The content must not be changed in any way or sold commercially in any format or medium without the formal permission of the author. When referring to this work, full bibliographic details including the author, title, awarding institution and date of the thesis must be given. Evidentials in the Shuri Dialect of Luchuan (Ryukyuan) Tomoko Arakaki A thesis submitted in fulfillment of requirements for the degree of Doctor of Philosophy School of Philosophy, Psychology and Language Sciences The University of Edinburgh 2010 ⒸCopyright 2010 by Tomoko Arakaki Declaration I hereby declare that this thesis is my own composition, and that it contains no material previously submitted for the award of other degree or qualification. The work reported in this thesis has been executed by myself, except where due acknowledgement is made in the text. Tomoko Arakaki Abstract This dissertation attempts to demonstrate that Luchuan is a language which has a grammatical category of evidentiality. Luchuan is the only sister language of Japanese; however, since they are considered to have diverged as early as the beginning of the 8th century, their vocabularies and grammars have developed in somewhat different directions. -
Inuktitut – the Language of the Inuit of Nunavut
3 Inuktitut – The Language of the Inuit of Nunavut Content Areas Social Studies, Language Arts, Technology, Art Lesson Plans on Inuit Culture Objective In Part 1 of the lesson, to be completed in the classroom, students will be introduced to the language of Inuktitut and will practice pronouncing and writing selected words using Inuktitut syllabics. Then, using the Internet, small groups of students will create an Inuktitut/ English ABC book, complete with illustrations. (Optional: If there are younger children in the same building, pair up two classes and allow older students to share their projects with the younger, teaching them several Inuktitut terms.) Materials · Inuktitut Syllabics, Words and Definitions – 1 per student · Research Sheet – Inuktitut/English ABC Book – 1 per student · Inuktitut Syllabics – 1 per student · Internet Access: 1. http://www.pageweb.com/kleekai/eskimo2.htm – for researching Inuktitut terms for dictionary 2. http://www.ainc-inac.gc.ca/ks/5020_e.html – for hearing Inuktitut pronunciations · Download and install Nunacom font (optional): http://www.nwmb.com/english/download.php or http://www.nunavut.com · Location of Inuktitut Syllabics on a Standard Keyboard – Reference for teacher if using Nunacom font Part 1 – Classroom 1 Optional: Prior to the lesson, teacher may choose to download and install Nunacom font onto the classroom computer or student computers. The Nunacom font will enable teacher/students to type using Inuktitut syllabics. Use Location of Inuktitut Syllabics on a Standard Keyboard to locate each Inuktitut syllabic. 123 Exploring Inuit Culture Curriculum Teacher Resource Guide 3 Inuktitut – The Language of the Inuit of Nunavut 2 Ask students if they know any traditional fairy tales by heart. -
Humanity Fluent Software Language
Pyash: Humanity Fluent Software Language Logan Streondj February 13, 2019 Contents 1 Introduction 4 1.1 Problem ................................... 4 1.1.1 Disglossia ............................... 4 1.2 Paradigm ................................... 5 1.2.1 Easy to write bad code ........................ 5 1.2.2 Obsolete Non-Parallel Paradigms .................... 5 1.3 Inspiration ................................. 5 1.4 Answer .................................... 5 1.4.1 Vocabulary ............................... 5 1.4.2 Grammar ................................ 5 1.4.3 Paradigm ................................ 6 I Core Language 7 2 Phonology 8 2.1 Notes .................................... 8 2.2 Contribution ................................. 8 3 Grammar 10 3.1 Composition ................................. 10 3.2 Grammar Tree ................................. 10 3.3 Noun Classes ................................. 10 3.3.1 grammatical number .......................... 12 3.3.2 noun classes for relative adjustment ................. 12 3.3.3 noun classes by animacy ........................ 13 3.3.4 noun classes regarding reproductive attributes ............ 13 3.4 Tense .................................... 13 3.5 Aspects ................................... 13 3.6 Grammatical Mood ............................... 14 3.7 participles ................................. 16 4 Dictionary 18 4.1 Prosody ................................... 18 4.2 Trochaic Rhythm ............................... 18 4.3 Espeak .................................... 18 4.4 -
Nunavut Language Policy Conference: Report and Recommendations
NUNAVUT LANGUAGE POLICY CONFERENCE: REPORT AND RECOMMENDATIONS IQALUIT, NUNAVUT MARCH 24-26,1998 Nunavut Implementation Commission ISBN 1-896548-28-8 L<LL5cn?YC Nunavut Hivumukpalianikhaagut Kat~mayit Nunavut Implementation Commission Commission d'etablissement du Nunavut Letter to the Reader June 30, 1998 Nunavut is fortunate to be one of a few jurisdictions in Canada in which the day-to-day use of an Aboriginal language remains widespread throughout the general population. Because so many of our people still speak Inuktitut, it provides the Nunavut Government with a rare opportunity, in a world increasingly dominated by the use of English, to promote an Aboriginal language as a functional and official language of government and society. Doing so, though, requires making sensible policy choices; ones that successfully marry the linguistic needs of the population with the practical realities of communication and the fiscal limitations of the modern world. It was with this problem in mind that the Nunavut Implementation Commission (NIC) recommended in Footprints 2, that the three parties the Nunavut Political Accord "... jointly convene ... a special Developing a Language Policy Conference, as a necessary step in pulling together an adequate societal consensus on the place of language in the future of Nunavut, with particular attention to the preservation and promotion of the lnuit language". It was hoped through a conference of this nature, enough sound policy advice would be provided, to enable decision makers to institute a balanced language policy. Nunavut at its outset, will be home to three Official languages; Inuktitut, English and French. English and French under the federal Official Languages Act are the official languages of Canada, with the language rights of all citizens protected under the Canadian Charter of Rights and Freedoms. -
The Role of Translation in Linguistic Standardisation Across Inuit Nunangat (Le Rôle De La Traduction Dans La Standardisation Linguistique En Inuit Nunangat)
The Role of Translation in Linguistic Standardisation across Inuit Nunangat (Le rôle de la traduction dans la standardisation linguistique en Inuit Nunangat) Noelle Palmer Mémoire présenté au Département d‘Études françaises comme exigence partielle au grade de maîtrise ès Arts (Traductologie) Université Concordia Montréal, Québec, Canada Avril 2016 © Noelle Palmer, 2016 CONCORDIA UNIVERSITY School of Graduate Studies This is to certify that the thesis prepared By: Noelle Palmer Entitled: The Role of Translation in Linguistic Standardisation across Inuit Nunangat and submitted in partial fulfillment of the requirements for the degree of Maîtrise ès Arts (Traductologie) complies with the regulations of the University and meets the accepted standards with respect to originality and quality. Signed by the final examining committee: Philippe Caignon_______________________ Chair René Lemieux_________________________ Examiner Sherry Simon__________________________ Examiner Debbie Folaron_________________________ Supervisor Approved by ________________________________________________ Chair of Department or Graduate Program Director ________________________________________________ Dean of Faculty Date April 11, 2016 ABSTRACT The Role of Translation in Linguistic Standardisation across Inuit Nunangat Noelle Palmer The history of translation and the history of standardisation in Inuit Nunangat, the Inuit homeland in Canada, are closely intertwined. As the Inuit language varieties continually adjust to changing circumstances, translation has triggered