Unicode As a Standard Framework for Syllables and Other Special Characters

Total Page:16

File Type:pdf, Size:1020Kb

Unicode As a Standard Framework for Syllables and Other Special Characters Unicode as a Standard Framework for Syllables and Other Special Characters CHRIS HARVEY University of Manitoba Although the computer has promised greater flexibility for those wishing to design, type and publish multilingual documents, many who have tried to do so have encountered such problems as: - how do I get character x to appear on my screen? - where can I find a font that contains all of the characters required? - how do I access those characters? - how can I stop my old files from showing up as garbage on someone else's computer (or my own new one)? Problems of this sort can be magnified exponentially for those who work full-time in minority languages that use uncommon scripts or characters, and need to develop entire corpora of educational materials, dictionaries, newsletters, books and web pages. The discussion that follows looks into some answers to these questions a propos of North American Native lan­ guages and will show that it is possible to take advantage of the com­ puter's vaunted flexibility to produce documents in such languages from the humblest of emails to professionally designed works of typographic art. THE COMPUTER AS TYPEWRITER Much as early printers emulated handwritten calligraphic manuscripts in their type design and layout, the computer word-processor has been typi­ cally understood and used as an electronic typewriter. The goal was a printed copy, on standard-size paper. As time went on, laser and ink-jet printers introduced more and better-quality font options, but the idea remained the same: the print-out of the document is the final copy to be read. As font technology progressed, it went in an entirely Western Euro­ pean direction. For linguists and native speakers working in non-Western orthographies, it often proved impossible to find the necessary letters in the ASCII or ANSI character sets.1 Following the typewriter mentality, writ- Papers of the 35th Algonquian Conference, ed. H.C. Wolfart (Winnipeg: University of Manitoba, 2004), pp. 125-136. 126 CHRIS HARVEY ers could only adapt to the unchanging hardware, for example, by amend­ ing or abandoning previous orthographies in favour of a system that was more Ascii-friendly.2 Hand-written diacritics over printed letters, or char­ acters like schwa and eth, pencilled in by hand, presumably document cases where getting the computer to print or type the character correctly was more trouble than it is worth. Another fallback strategy - using word processor formatting techniques3 - produces characters that cannot be sorted, and whose formatting is lost when copying text from one machine to another. As an alternative, each small community of linguists or speakers - or in many cases, each individual - could modify a pre-existing font to suit their immediate needs.4 In many cases, these modifications were done in graphics programs which are not built to design fonts, again limiting their transferability. Several additional problems arise from this process. First, every per­ sonalised font has its own mapping, so that if a language requires the let­ ter (5), font x maps it on 9 while font y uses it to replace a. Consequently, a word-processor document typed using font x will display the wrong characters on a computer using either font y or no special font at all. This incompatibility also plagues those who, in opening a file they themselves typed a few years back, discover that they have not installed their old fonts on their new computer, and are left with an illegible mess (I speak from personal experience). Second, new characters often replace letters which appear on standard keyboard keys (for simpler access), making it necessary to switch fonts every time one switches languages. This is a formatting nightmare since reformatting and organising documents in 1. ASCII and ANSI code pages are the standard lists of available characters, numbered up to 256 (including a substantial number of command codes, such as delete, tab, etc.). The most recent operating systems today employ Unicode, which provides for approximately 65,000 characters. 2. One example of this adaptation is provided in Ancient Greek dictionary websites, which use an ASCII alphabet to reproduce the Greek. 3. Such as the Kwak'wala underline accent, as in a, k, g, etc., designed specifically for the North American typewriter (Galois 1994). How does one, for emphasis, underline an underline? 4. Modifications for personal use are permitted, wider distribution is not: since elec­ tronic fonts are intellectual property, copyright law prohibits widespread modification and distribution. UNICODE AS A FRAMEWORK FOR SYLLABICS 127 multiple fonts is extremely time-consuming and error-prone. Third, per­ sonalized fonts are typically language- or dialect-specific, so that an East­ ern Cree syllabics font, for example, may or may not include characters particular to Western Cree syllabics (fig. 1). Orthographic standards in some languages change (like a dialect chain) slightly from community to community, meaning the font may only be useful to a small group of peo­ ple. Figure 1. Some finals in Cree and Ojibwe syllabics Final Consonant P t k C m n s § y Eastern < c b t L Q. <> C/l 0 Western • / \ - C 3 n U + These are some of the ways in which users of non-Western orthogra­ phies were hampered by the language attitudes of software designers in the not so distant past. With no consensus on ASCII font mapping, the result of exchanging documents electronically was predictably fraught, and a printout was essentially the only reliable way or sharing documents. Thus the computer became an expensive typewriter, just slightly more versatile than the old IBM Selecrric. TOWARDS A DIGITAL MENTALITY The current world of email, web pages, dictionary databases, interlinear software and electronic archives has pushed the demands of the computer far beyond that of a text-printing machine. Of all the font solutions listed above, only one - modifying orthography of the language to fit ASCII - seems compatible with the needs of the modem computer user. This solu­ tion has not been at all acceptable for languages with long literary histo­ ries such as Greek, Russian, Korean, Bengali, etc.; it should similarly be ruled out for those Native languages of North America that use uncom­ mon orthography. This is especially true for the Algonquian languages that use syllabics, where a wholesale switch to a roman orthography would create a literary schism between older speakers and the newly edu­ cated, along with disposing of a part of cultural identity (Poser 2000:2). However, until a few years ago, there was no other practical choice. 128 CHRIS HARVEY Many web pages and databases require special fonts. In the past - and this is still a factor today - these were typically either modified ASCII or ANSI fonts, or non-roman orthographies squeezed into space left over at the bottom range of ASCII. Limitations include non-transferability, obso­ lescence and platform specificity (Mac or Windows). If - as they say - the average web user has the attention span of a five-year old, in much less time than it takes to download and install a special font, that user will be surfing elsewhere. One response to this font problem is the .pdf file, which allows the browser to view a page, fully formatted and in any font, without down­ loading anything - except of course the browser plug-in. In the end, the .pdf file is little more than a scan of a print copy and harkens back to the computer as a typewriter. These files are not interactive, nor can they be transmitted to other users for modification. Web sites using ASCII or ANSI modified fonts can look great - many Inuktitut sites use such fonts - but web-patience will probably not tolerate a page full of AcAuE, which is what faces those who have not downloaded and installed the correct font. Current versions of Windows and MacOS are Unicode-compliant. Unicode is not a new font or font technology. Rather, it is a mapping sys­ tem (or numbered table) in which each distinct character (approximately 65,000 characters, at present) is given a number, unique to itself. This number is permanent and unchanging, regardless of hardware or software. Unicode numbers are presented as hexadecimal (base sixteen)5 so that the numbers look somewhat awkward, but most people will never have to see the encoding anyway. For example, the schwa character (a) has the cod­ ing 0259, and syllabics L- (mwa) is 14B7. To make this vast character set manageable, Unicode is divided into sections (ranges), one for each script (a representative selection of characters is included in the Appendix). Often purported to cover all of the world's languages (though not by Unicode developers), some bias towards the world's majority languages nevertheless remains. All characters used in French and Hungarian (for example) have been uniquely encoded, whereas some characters for Guarani and Navajo (to name but two) need to be composed of two or more characters. A significant number of languages use glyphs unavail- Hexadecimal numbers use the digits 0-9 and the letters A-F (representing the num- UNICODE AS A FRAMEWORK FOR SYLLABICS 129 able in Unicode, and others still - such as Javanese, Balinese, Batak and Mayan hieroglyphics - use scripts that are entirely absent. Through the introduction of standard character mapping, Unicode is intended to pro­ vide the same set of rules for all computer users, so that any time, any­ where, someone can look at a web page or read an email in any language without special downloads or fonts.6 At the time of this writing, most peo­ ple are still using software which does not take full advantage of Unicode, so the legibility of transferred documents containing Unicode characters is still limited to those using compatible software.
Recommended publications
  • Ka И @И Ka M Л @Л Ga Н @Н Ga M М @М Nga О @О Ca П
    ISO/IEC JTC1/SC2/WG2 N3319R L2/07-295R 2007-09-11 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation Международная организация по стандартизации Doc Type: Working Group Document Title: Proposal for encoding the Javanese script in the UCS Source: Michael Everson, SEI (Universal Scripts Project) Status: Individual Contribution Action: For consideration by JTC1/SC2/WG2 and UTC Replaces: N3292 Date: 2007-09-11 1. Introduction. The Javanese script, or aksara Jawa, is used for writing the Javanese language, the native language of one of the peoples of Java, known locally as basa Jawa. It is a descendent of the ancient Brahmi script of India, and so has many similarities with modern scripts of South Asia and Southeast Asia which are also members of that family. The Javanese script is also used for writing Sanskrit, Jawa Kuna (a kind of Sanskritized Javanese), and Kawi, as well as the Sundanese language, also spoken on the island of Java, and the Sasak language, spoken on the island of Lombok. Javanese script was in current use in Java until about 1945; in 1928 Bahasa Indonesia was made the national language of Indonesia and its influence eclipsed that of other languages and their scripts. Traditional Javanese texts are written on palm leaves; books of these bound together are called lontar, a word which derives from ron ‘leaf’ and tal ‘palm’. 2.1. Consonant letters. Consonants have an inherent -a vowel sound. Consonants combine with following consonants in the usual Brahmic fashion: the inherent vowel is “killed” by the PANGKON, and the follow- ing consonant is subjoined or postfixed, often with a change in shape: §£ ndha = § NA + @¿ PANGKON + £ DA-MAHAPRANA; üù n.
    [Show full text]
  • Suspicious Identity of U+A9B5 JAVANESE VOWEL SIGN TOLONG
    L2/19-003 Suspicious identity of U+A9B5 JAVANESE VOWEL SIGN TOLONG Liang Hai / 梁海 <[email protected]> Aditya Bayu Perdana / <[email protected]> ꦄꦢꦶꦠꦾ ꦧꦪꦸꦥꦢꦤ 4 January 2019 1 Acknowledgements The authors would like to thank Ilham Nurwansah and the Script Ad Hoc group for their feedback. Ilham Nurwansah also kindly provided the Sundanese samples (Figure 2, 3, 4, and 5). 2 Background In the original Unicode Javanese proposal L2/08-015R Proposal for encoding the Javanese script in the UCS, the character tolong (U+A9B5 JAVANESE VOWEL SIGN TOLONG) was described as a vowel sign that is used exclusively in the Sundanese writing system with three major use cases: 1. Used alone as the vowel sign o 2. As a part of the vowel sign eu: <vowel sign ĕ, tolong> 3. As a part of the letters and conjoined forms of reu/leu: <letter / conjoined form rĕ/lĕ, tolong> Table 1. Sundanese tolong usage according to the original proposal Written form ◌ ◌ꦵ ◌ꦼ ◌ꦼꦵ ◌� ◌�ꦵ A9C0 PANGKON A9BC PEPET Encoding (A9B5 TOLONG) A989 PA CEREK (A9B5 TOLONG) (A9B5 TOLONG) Transcription a o ĕ eu rĕ reu Pronunciation [a] [o] [ə] [ɤ] [rə] [rɤ] See also the note under Table 2. However, tolong appears to be merely a stylistic variant of tarung (U+A9B4 JAVANESE VOWEL SIGN TARUNG), therefore the disunification of tolong from tarung is likely a mistake. 3 Proposal The Unicode Standard needs to recommend how the inappropriately disunified character U+A9B5 JAVANESE VOWEL SIGN TOLONG should be handled. 1 In particular, clarification in the names list and the Core Specification is necessary for explaining the background of the mis-disunification and recommending how both the tarung and tolong forms for both the Javanese and Sundanese languages should be implemented.
    [Show full text]
  • Jtc1/Sc2/Wg2 N3427 L2/08-132
    JTC1/SC2/WG2 N3427 L2/08-132 2008-04-08 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation Международная организация по стандартизации Doc Type: Working Group Document Title: Proposal to encode 39 Unified Canadian Aboriginal Syllabics in the UCS Source: Michael Everson and Chris Harvey Status: Individual Contribution Action: For consideration by JTC1/SC2/WG2 and UTC Date: 2008-04-08 1. Summary. This document requests 39 additional characters to be added to the UCS and contains the proposal summary form. 1. Syllabics hyphen (U+1400). Many Aboriginal Canadian languages use the character U+1428 CANADIAN SYLLABICS FINAL SHORT HORIZONTAL STROKE, which looks like the Latin script hyphen. Algonquian languages like western dialects of Cree, Oji-Cree, western and northern dialects of Ojibway employ this character to represent /tʃ/, /c/, or /j/, as in Plains Cree ᐊᓄᐦᐨ /anohc/ ‘today’. In Athabaskan languages, like Chipewyan, the sound is /d/ or an alveolar onset, as in Sayisi Dene ᐨᕦᐣᐨᕤ /t’ąt’ú/ ‘how’. To avoid ambiguity between this character and a line-breaking hyphen, a SYLLABICS HYPHEN was developed which resembles an equals sign. Depending on the typeface, the width of the syllabics hyphen can range from a short ᐀ to a much longer ᐀. This hyphen is line-breaking punctuation, and should not be confused with the Blackfoot syllable internal-w final proposed for U+167F. See Figures 1 and 2. 2. DHW- additions for Woods Cree (U+1677..U+167D). ᙷᙸᙹᙺᙻᙼᙽ/ðwē/ /ðwi/ /ðwī/ /ðwo/ /ðwō/ /ðwa/ /ðwā/. The basic syllable structure in Cree is (C)(w)V(C)(C).
    [Show full text]
  • Typotheque North American Syllabics Proposed Revisions to The
    Typotheque Prepared by Kevin King Typotheque [email protected] www.typotheque.com 04/06/21 North American Syllabics Proposed revisions to the representative characters of the Unified Canadian Aboriginal Syllabics code charts Typotheque Proposed representative character revisions of the Unified Canadian Aboriginal Syllabics 2 CONTENTS 1 Summary of proposed character revisions 3 2 Revisions for Carrier 9 3 Revisions for Sayisi 36 4 Revisions for Ojibway 46 Bibliography 52 Acknowledgements 54 Typotheque Proposed representative character revisions of the Unified Canadian Aboriginal Syllabics 3 1 Summary of proposed character revisions The following proposal requests 120 revisions to the representative char- acters in the official code charts of Unified Canadian Aboriginal Syllabics main and extended blocks. The proposed characters for revision have been summarized below with representative glyphs and corresponding character names with annotations where applicable. Additionally, revised code charts for UCAS main and extended has been provided in the following section with the proposed revised representative characters marked in pink, imple- mented into their corresponding code point locations. The author has prepared a style-matched font for the purpose of imple- menting into the code chart: 144B ᑋ CANADIAN SYLLABICS carrier H 160D ᘍ CANADIAN SYLLABICS carrier ma 14D1 ᓑ CANADIAN SYLLABICS carrier NG 160E ᘎ CANADIAN SYLLABICS carrier yu 1506 ᔆ CANADIAN SYLLABICS athapascan s 160F ᘏ CANADIAN SYLLABICS carrier yO 15C0 ᗀ CANADIAN SYLLABICS Sayisi
    [Show full text]
  • Proposal to Encode Balinese Archaic Jnya 1. Introduction
    Proposal to encode Balinese Archaic Jnya Ben Yang 楊�� Aditya Bayu Perdana ꦄꦢꦶꦠꦾꦧꦪꦸꦥ�ꦢꦤ Director of Technology Typographer PanLex — The Long Now Foundation [email protected] [email protected] July 10, 02019 (most recent revision) 1. Introduction Balinese is an Austronesian language spoken on the island of Bali in Indonesia. It is currently written in two scripts, Latin and Balinese. The island of Bali has a long literary history, with extensive traditional literature in Sanskrit, Kawi (Old Javanese), and Balinese all written in the Balinese script. The Balinese script is currently well-supported in Unicode, and nearly all traditional literature can be accurately encoded. However, in the process of researching some older documents, we have discovered one character that is not currently covered by the existing encoding model. The character, from now on referred to as "ARCHAIC JNYA", is not found in modern Balinese documents, but is found in older documents. It represents the sound /dʒɲa/, which in modern Balinese is depicted using the sequence BALINESE LETTER JA+BALINESE ADEG (virama) ADEG +BALINESE LETTER NYA, forming the stacking conjunct ⟨ᬚ �⟩. The output of this sequence is an as-expected Balinese conjunct form, with the second consonant subjoined on to the first consonant. On the other hand, the ARCHAIC JNYA is not visually decomposable into separate JA and NYA glyphs. In some documents both forms of JNYA are found. It does not appear that they represent a specific semantic distinction, but the ability to encode both is necessary for the accurate transcription of older Balinese documents. Interestingly, the Javanese cognate grapheme to ARCHAIC JNYA is already encoded, as JAVANESE LETTER NYA MURDA .
    [Show full text]
  • Curriculum and Resources for First Nations Language Programs in BC First Nations Schools
    Curriculum and Resources for First Nations Language Programs in BC First Nations Schools Resource Directory Curriculum and Resources for First Nations Language Programs in BC First Nations Schools Resource Directory: Table of Contents and Section Descriptions 1. Linguistic Resources Academic linguistics articles, reference materials, and online language resources for each BC First Nations language. 2. Language-Specific Resources Practical teaching resources and curriculum identified for each BC First Nations language. 3. Adaptable Resources General curriculum and teaching resources which can be adapted for teaching BC First Nations languages: books, curriculum documents, online and multimedia resources. Includes copies of many documents in PDF format. 4. Language Revitalization Resources This section includes general resources on language revitalization, as well as resources on awakening languages, teaching methods for language revitalization, materials and activities for language teaching, assessing the state of a language, envisioning and planning a language program, teacher training, curriculum design, language acquisition, and the role of technology in language revitalization. 5. Language Teaching Journals A list of journals relevant to teachers of BC First Nations languages. 6. Further Education This section highlights opportunities for further education, training, certification, and professional development. It includes a list of conferences and workshops relevant to BC First Nations language teachers, and a spreadsheet of post‐ secondary programs relevant to Aboriginal Education and Teacher Training - in BC, across Canada, in the USA, and around the world. 7. Funding This section includes a list of funding sources for Indigenous language revitalization programs, as well as a list of scholarships and bursaries available for Aboriginal students and students in the field of Education, in BC, across Canada, and at specific institutions.
    [Show full text]
  • Quantifier in Javanese Reduplication
    Advances in Social Science, Education and Humanities Research (ASSEHR), volume 108 Social Sciences, Humanities and Economics Conference (SoSHEC 2017) The Unique Vowel-Play for “Negative” Quantifier in Javanese Reduplication Slamet Setiawan Ayunita Leliana English Language and Literature Department English Language and Literature Department Universitas Negeri Surabaya Universitas Negeri Surabaya Surabaya, Indonesia Surabaya, Indonesia [email protected] [email protected] types of reduplication, there is a phenomenon that needs to Abstract—Native speakers of a given language, including be revealed. The fact that there is somewhat melody for this Javanese, may not be aware of their detail language system. This reduplication is worth investigating. Study the following is due to the fact that they are able to use their first language examples as in (1a, 1b, and 1c). Note to be made that all Javanese through natural process of language transmission from care givers words have been consulted to Javanese dictionary written by [5]. and members of community. Furthermore, it seems that they are The base is mlaku ‘walk’. The possible reduplications are: given a talent to produce “strange” expressions automatically, for instances, mloka-mlaku vs. mlaku-mlaku, ngguya-ngguyu vs. (1a) mlaku-mlaku ‘go sightseeing' ngguyu-ngguyu. This paper is intended to reveal how the (1b) mlaka-mlaku* - reduplication system for “negative” quantifier works in Javanese language when phonology is taken into account. Descriptive (1c) mloka-mlaku ‘walk repeatedly within short period qualitative is applied with documentation and interview. The of time’ findings suggest that the occurrence of reduplication system for “negative” quantifier is predictable. Firstly, the words are Reduplication form in (1) shows that one word of mlaku can reflected regressively from the source of the word.
    [Show full text]
  • Introduction to Old Javanese Language and Literature: a Kawi Prose Anthology
    THE UNIVERSITY OF MICHIGAN CENTER FOR SOUTH AND SOUTHEAST ASIAN STUDIES THE MICHIGAN SERIES IN SOUTH AND SOUTHEAST ASIAN LANGUAGES AND LINGUISTICS Editorial Board Alton L. Becker John K. Musgrave George B. Simmons Thomas R. Trautmann, chm. Ann Arbor, Michigan INTRODUCTION TO OLD JAVANESE LANGUAGE AND LITERATURE: A KAWI PROSE ANTHOLOGY Mary S. Zurbuchen Ann Arbor Center for South and Southeast Asian Studies The University of Michigan 1976 The Michigan Series in South and Southeast Asian Languages and Linguistics, 3 Open access edition funded by the National Endowment for the Humanities/ Andrew W. Mellon Foundation Humanities Open Book Program. Library of Congress Catalog Card Number: 76-16235 International Standard Book Number: 0-89148-053-6 Copyright 1976 by Center for South and Southeast Asian Studies The University of Michigan Printed in the United States of America ISBN 978-0-89148-053-2 (paper) ISBN 978-0-472-12818-1 (ebook) ISBN 978-0-472-90218-7 (open access) The text of this book is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License: https://creativecommons.org/licenses/by-nc-nd/4.0/ I made my song a coat Covered with embroideries Out of old mythologies.... "A Coat" W. B. Yeats Languages are more to us than systems of thought transference. They are invisible garments that drape themselves about our spirit and give a predetermined form to all its symbolic expression. When the expression is of unusual significance, we call it literature. "Language and Literature" Edward Sapir Contents Preface IX Pronounciation Guide X Vowel Sandhi xi Illustration of Scripts xii Kawi--an Introduction Language ancf History 1 Language and Its Forms 3 Language and Systems of Meaning 6 The Texts 10 Short Readings 13 Sentences 14 Paragraphs..
    [Show full text]
  • Annual Meeting Handbook
    MEETING HANDBOOK LINGUISTIC SOCIETY OF AMERICA AMERICAN DIALECT SOCIETY AMERICAN NAME SOCIETY NORTH AMERICAN ASSOCIATION FOR THE HISTORY OF THE LANGUAGE SCIENCES SOCIETY FOR PIDGIN AND CREOLE LINGUISTICS SOCIETY FOR THE STUDY OF THE INDIGENOUS LANGUAGES OF THE AMERICAS SHERATON BOSTON HOTEL BOSTON, MA 8-11 JANUARY 2004 Introductory Note The LSA Secretariat has prepared this Meeting Handbook to serve as the official program for the 78th Annual Meeting of the Linguistic Society of America (LSA). In addition, this handbook is the official program for the Annual Meetings of the American Dialect Society (ADS), the American Name Society (ANS), the North American Association for the History of the Language Sciences (NAAHoLS), the Society for Pidgin and Creole Linguistics (SPCL), and the Society for the Study of the Indigenous Languages of the Americas (SSILA). We gratefully acknowledge the assistance provided by the LSA Program Committee: (William Idsardi, Chair; Diane Brentari; Peter Culicover; Toshiyuki Ogihara; Margaret Speas; Rosalind Thornton; Lindsay Whaley; and Draga Zec) and the help of the members who served as consultants to the Program Committee. We are also grateful to Marlyse Baptista (SPCL), David Boe (NAAHoLS), Edwin Lawson (ANS), Allan Metcalf (ADS), and Victor Golla (SSILA) for their cooperation. We appreciate the help given by the Boston Local Arrangements Committee chaired by Carol Neidle. We hope this Meeting Handbook is a useful guide for those attending, as well as a permanent record of, the 2004 Annual Meeting in Boston,
    [Show full text]
  • Unicode Cree Syllabics for Windows and Macintosh
    Unicode Cree Syllabics for Windows and Macintosh 37th Algonquian Conference, Ottawa 2005 Bill Jancewicz SIL International and Naskapi Development Corporation ABSTRACT Submitted as an update to a presentation made at the 34th Algonquian Conference (Kingston). The ongoing development of the operating systems has included increased support for cross-platform use of Unicode syllabic script. Key improvements that were included in Macintosh's OS X.3 (Panther) operating system now allow direct keyboarding of Unicode characters by means of a user-defined input method. A summary and comparison of the available tools for handling Unicode on both Windows and Macintosh will be discussed. INTRODUCTION Since 1988 the author has been working in the Naskapi language at Kawawachikamach with the primary purpose of linguistic analysis and Bible translation, sponsored by Wycliffe Bible Translators and SIL International. Related work includes mother tongue translator training, Naskapi literature and curriculum development. Along with the language work the author also developed methods of production for Naskapi language materials in syllabic script by means of the computer. With the advent of high quality publishing capabilities in newer computers, procedures were updated to keep pace with the improving technology. With resources available from SIL International computer services department, a very satisfactory system of keyboarding syllabic texts in Naskapi was developed. At the urging of colleagues working in related languages, the system was expanded to include the wider inventory of standard Eastern and Western Cree syllabics. While this has pushed the limit of what is possible with the current technology, Unicode makes this practical. Note that the system originally developed for keyboarding Naskapi syllabics is similar but not identical to the Cree system, because of the unique local orthography in use at Kawawachikamach.
    [Show full text]
  • Introduction the Hymns
    Father Morice©s Rendering of Latin in Carrier Syllabics* William J. Poser Introduction Father Adrien-Gabriel Morice, O.M.I., missionary in Fort Saint James, British Columbia from 1885 to 1904, fluent in the Carrier language and an early Athabascanist of distinction, published three editions of a Carrier Prayer Book, consisting mostly of prayers, hymns, and catechism lessons which he had translated into Carrier. All three editions also contain a number of Latin hymns. In the third edition, whose Carrier text is written in the somewhat idiosyncratic Roman-based phonetic notation that Father Morice used for his scholarly work (e.g. Morice 1932), the Latin hymns are written in the Roman alphabet in the usual spelling, but in the first two editions both the Carrier text and the Latin hymns are written in the Déné syllabics, the derivative of the Cree syllabics that he created. The Hymns We begin by presenting the text of the hymns. For each hymn, the first column gives the syllabic text, the second column a phonemic IPA transcription of the syllabic text, and the third column the usual Latin orthography. Images of the Prayerbook text are available separately. O Salutaris Hostia ᐃ ᙓᘦᑕrᐉᔆ ᐱᔆᑔᘓ o salitaɹis hostija O Salutaris Hostia ᐟᗓ ᙑᘦ ᗭᐣᑔᔆ ᐃᔆᑔᘏᒼ kwe seli pantis ostijom Quae coeli pandis ostium, ᗫᘧ ᗮrᐈᘉᐪ ᐱᔆᑔᘦᘓ. pela pɹemot hostilija. Bella praemunt hostilia! ᑎ rᐃᗬr, ɥᐈr ᐃᐟᙒᘦᘏᒼ to ɹopir, ver oksilijom Da robur, fer auxilium. ᐁᘆ ᐪrᐉᘃᐟᗓ ᑎᘌᘃ uni tɹinokwe tomino Uni trinoque Domino ᙒᐪ ᙑᒼᗬᑓrᘇ ᐟᘣrᐉᘓ, sit sempiterna kloɹija, Sit sempiterna gloria! ᐟᗔ ɥᐉᑕᒼ ᙒᘅ ᑓrᘌᘃ kwi vitam sine termino Qui vitam sine termino ᘃᗬᔆ ᑎᘅᐪ ᐉᐣ ᗭᐪrᐉᘓ.
    [Show full text]
  • Kamloops Chinúk Wawa, Chinuk Pipa, and the Vitality of Pidgins
    Kamloops Chinúk Wawa, Chinuk pipa, and the vitality of pidgins by David Douglas Robertson B.A., Columbia University, 1988 A Dissertation Submitted in Partial Fulfillment of the Requirements for the Degree of DOCTOR OF PHILOSOPHY in the Department of Linguistics © David Douglas Robertson, 2011 University of Victoria All rights reserved. This dissertation may not be reproduced in whole or in part, by photocopying or other means, without the permission of the author. ii Kamloops Chinúk Wawa, Chinuk pipa, and the vitality of pidgins by David Douglas Robertson B.A., Columbia University, 1988 Supervisory Committee Dr. Ewa Czaykowska-Higgins, Supervisor (Department of Linguistics, University of Victoria) Dr. Sarah Grey Thomason, Departmental Member (Department of Linguistics, University of Victoria; University of Michigan) Dr. Wendy Wickwire, Outside Member (Department of History, University of Victoria) iii Supervisory Committee Dr. Ewa Czaykowska-Higgins, Supervisor (Department of Linguistics, University of Victoria) Dr. Sarah Grey Thomason, Departmental Member (Department of Linguistics, University of Victoria; University of Michigan) Dr. Wendy Wickwire, Outside Member (Department of History, University of Victoria) Abstract This dissertation presents the first full grammatical description of unprompted (spontaneous) speech in pidgin Chinook Jargon [synonyms Chinúk Wawa, Chinook]. The data come from a dialect I term ‘Kamloops Chinúk Wawa’, used in southern interior British Columbia circa 1900. I also present the first historical study and structural analysis of the shorthand-based ‘Chinuk pipa ’ alphabet in which Kamloops Chinúk Wawa was written, primarily by Salish people. This study is made possible by the discovery of several hundred such texts, which I have transliterated and analyzed. The Basic Linguistic Theory-inspired (cf.
    [Show full text]