Unicode As a Standard Framework for Syllables and Other Special Characters
Total pages: 16
File type: PDF, Size: 1020Kb

Recommended publications
-
Proposal for encoding the Javanese script in the UCS
ISO/IEC JTC1/SC2/WG2 N3319R L2/07-295R 2007-09-11 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation Международная организация по стандартизации Doc Type: Working Group Document Title: Proposal for encoding the Javanese script in the UCS Source: Michael Everson, SEI (Universal Scripts Project) Status: Individual Contribution Action: For consideration by JTC1/SC2/WG2 and UTC Replaces: N3292 Date: 2007-09-11 1. Introduction. The Javanese script, or aksara Jawa, is used for writing the Javanese language, the native language of one of the peoples of Java, known locally as basa Jawa. It is a descendant of the ancient Brahmi script of India, and so has many similarities with modern scripts of South Asia and Southeast Asia which are also members of that family. The Javanese script is also used for writing Sanskrit, Jawa Kuna (a kind of Sanskritized Javanese), and Kawi, as well as the Sundanese language, also spoken on the island of Java, and the Sasak language, spoken on the island of Lombok. Javanese script was in current use in Java until about 1945; in 1928 Bahasa Indonesia was made the national language of Indonesia and its influence eclipsed that of other languages and their scripts. Traditional Javanese texts are written on palm leaves; books of these bound together are called lontar, a word which derives from ron ‘leaf’ and tal ‘palm’. 2.1. Consonant letters. Consonants have an inherent -a vowel sound. Consonants combine with following consonants in the usual Brahmic fashion: the inherent vowel is “killed” by the PANGKON, and the following consonant is subjoined or postfixed, often with a change in shape: §£ ndha = § NA + @¿ PANGKON + £ DA-MAHAPRANA; üù n. -
Suspicious Identity of U+A9B5 JAVANESE VOWEL SIGN TOLONG
L2/19-003 Suspicious identity of U+A9B5 JAVANESE VOWEL SIGN TOLONG Liang Hai / 梁海, Aditya Bayu Perdana / ꦄꦢꦶꦠꦾ ꦧꦪꦸꦥꦢꦤ 4 January 2019 1 Acknowledgements The authors would like to thank Ilham Nurwansah and the Script Ad Hoc group for their feedback. Ilham Nurwansah also kindly provided the Sundanese samples (Figure 2, 3, 4, and 5). 2 Background In the original Unicode Javanese proposal L2/08-015R Proposal for encoding the Javanese script in the UCS, the character tolong (U+A9B5 JAVANESE VOWEL SIGN TOLONG) was described as a vowel sign that is used exclusively in the Sundanese writing system with three major use cases: 1. Used alone as the vowel sign o 2. As a part of the vowel sign eu: <vowel sign ĕ, tolong> 3. As a part of the letters and conjoined forms of reu/leu: <letter / conjoined form rĕ/lĕ, tolong> Table 1. Sundanese tolong usage according to the original proposal Written form ◌ ◌ꦵ ◌ꦼ ◌ꦼꦵ ◌� ◌�ꦵ A9C0 PANGKON A9BC PEPET Encoding (A9B5 TOLONG) A989 PA CEREK (A9B5 TOLONG) (A9B5 TOLONG) Transcription a o ĕ eu rĕ reu Pronunciation [a] [o] [ə] [ɤ] [rə] [rɤ] See also the note under Table 2. However, tolong appears to be merely a stylistic variant of tarung (U+A9B4 JAVANESE VOWEL SIGN TARUNG), therefore the disunification of tolong from tarung is likely a mistake. 3 Proposal The Unicode Standard needs to recommend how the inappropriately disunified character U+A9B5 JAVANESE VOWEL SIGN TOLONG should be handled. In particular, clarification in the names list and the Core Specification is necessary for explaining the background of the mis-disunification and recommending how both the tarung and tolong forms for both the Javanese and Sundanese languages should be implemented. -
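The code points named in this excerpt can be looked up in the Unicode Character Database. The following is a minimal sketch, assuming Python 3 and its standard `unicodedata` module; the names `CODE_POINTS`, `EU_SEQUENCE`, and `describe` are illustrative and do not come from the proposal.

```python
import unicodedata

# Code points mentioned in the L2/19-003 excerpt.
CODE_POINTS = [0xA989, 0xA9B4, 0xA9B5, 0xA9BC, 0xA9C0]

# The excerpt's "eu" vowel: <vowel sign pepet, tolong>.
EU_SEQUENCE = "\uA9BC\uA9B5"


def describe(cp: int) -> str:
    """Return 'U+XXXX NAME' for one code point."""
    return f"U+{cp:04X} {unicodedata.name(chr(cp))}"


if __name__ == "__main__":
    for cp in CODE_POINTS:
        print(describe(cp))
    # List the code points that make up the two-character sequence.
    print([f"U+{ord(ch):04X}" for ch in EU_SEQUENCE])
```

Because tarung (U+A9B4) and tolong (U+A9B5) are separate code points, any unification of the two forms would have to happen in fonts or in application-level handling rather than in the character names themselves.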
JTC1/SC2/WG2 N3427 L2/08-132
JTC1/SC2/WG2 N3427 L2/08-132 2008-04-08 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation Международная организация по стандартизации Doc Type: Working Group Document Title: Proposal to encode 39 Unified Canadian Aboriginal Syllabics in the UCS Source: Michael Everson and Chris Harvey Status: Individual Contribution Action: For consideration by JTC1/SC2/WG2 and UTC Date: 2008-04-08 1. Summary. This document requests 39 additional characters to be added to the UCS and contains the proposal summary form. 1. Syllabics hyphen (U+1400). Many Aboriginal Canadian languages use the character U+1428 CANADIAN SYLLABICS FINAL SHORT HORIZONTAL STROKE, which looks like the Latin script hyphen. Algonquian languages like western dialects of Cree, Oji-Cree, western and northern dialects of Ojibway employ this character to represent /tʃ/, /c/, or /j/, as in Plains Cree ᐊᓄᐦᐨ /anohc/ ‘today’. In Athabaskan languages, like Chipewyan, the sound is /d/ or an alveolar onset, as in Sayisi Dene ᐨᕦᐣᐨᕤ /t’ąt’ú/ ‘how’. To avoid ambiguity between this character and a line-breaking hyphen, a SYLLABICS HYPHEN was developed which resembles an equals sign. Depending on the typeface, the width of the syllabics hyphen can range from a short ᐀ to a much longer ᐀. This hyphen is line-breaking punctuation, and should not be confused with the Blackfoot syllable internal-w final proposed for U+167F. See Figures 1 and 2. 2. DHW- additions for Woods Cree (U+1677..U+167D). ᙷᙸᙹᙺᙻᙼᙽ/ðwē/ /ðwi/ /ðwī/ /ðwo/ /ðwō/ /ðwa/ /ðwā/. The basic syllable structure in Cree is (C)(w)V(C)(C). -
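The distinction drawn in this excerpt, between a hyphen-shaped letter and a true hyphen, is visible in the characters' general categories. A minimal sketch, assuming Python 3 with the standard `unicodedata` module; the constant and function names are illustrative only.

```python
import unicodedata

# U+1428 looks like a Latin hyphen but is a syllabics final (a letter).
FINAL_STROKE = "\u1428"
# U+1400 is the line-breaking syllabics hyphen described in N3427.
SYLLABICS_HYPHEN = "\u1400"


def is_dash_punctuation(ch: str) -> bool:
    """True if the character's general category is Pd (dash punctuation)."""
    return unicodedata.category(ch) == "Pd"


if __name__ == "__main__":
    for ch in (FINAL_STROKE, SYLLABICS_HYPHEN):
        print(f"U+{ord(ch):04X}", unicodedata.name(ch), unicodedata.category(ch))
```

Software that hyphenates or tokenizes syllabics text can use a check of this kind to avoid treating the final stroke as punctuation.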
Typotheque North American Syllabics Proposed Revisions to The
Typotheque Prepared by Kevin King Typotheque www.typotheque.com 04/06/21 North American Syllabics Proposed revisions to the representative characters of the Unified Canadian Aboriginal Syllabics code charts Typotheque Proposed representative character revisions of the Unified Canadian Aboriginal Syllabics CONTENTS 1 Summary of proposed character revisions 2 Revisions for Carrier 3 Revisions for Sayisi 4 Revisions for Ojibway Bibliography Acknowledgements 1 Summary of proposed character revisions The following proposal requests 120 revisions to the representative characters in the official code charts of Unified Canadian Aboriginal Syllabics main and extended blocks. The proposed characters for revision have been summarized below with representative glyphs and corresponding character names with annotations where applicable. Additionally, revised code charts for UCAS main and extended have been provided in the following section with the proposed revised representative characters marked in pink, implemented into their corresponding code point locations. The author has prepared a style-matched font for the purpose of implementing into the code chart: 144B ᑋ CANADIAN SYLLABICS carrier H 160D ᘍ CANADIAN SYLLABICS carrier ma 14D1 ᓑ CANADIAN SYLLABICS carrier NG 160E ᘎ CANADIAN SYLLABICS carrier yu 1506 ᔆ CANADIAN SYLLABICS athapascan s 160F ᘏ CANADIAN SYLLABICS carrier yO 15C0 ᗀ CANADIAN SYLLABICS Sayisi -
Proposal to Encode Balinese Archaic Jnya 1. Introduction
Proposal to encode Balinese Archaic Jnya Ben Yang 楊�� (Director of Technology, PanLex, The Long Now Foundation) and Aditya Bayu Perdana ꦄꦢꦶꦠꦾꦧꦪꦸꦥ�ꦢꦤ (Typographer) July 10, 02019 (most recent revision) 1. Introduction Balinese is an Austronesian language spoken on the island of Bali in Indonesia. It is currently written in two scripts, Latin and Balinese. The island of Bali has a long literary history, with extensive traditional literature in Sanskrit, Kawi (Old Javanese), and Balinese all written in the Balinese script. The Balinese script is currently well-supported in Unicode, and nearly all traditional literature can be accurately encoded. However, in the process of researching some older documents, we have discovered one character that is not currently covered by the existing encoding model. The character, from now on referred to as "ARCHAIC JNYA", is not found in modern Balinese documents, but is found in older documents. It represents the sound /dʒɲa/, which in modern Balinese is depicted using the sequence BALINESE LETTER JA + BALINESE ADEG ADEG (virama) + BALINESE LETTER NYA, forming the stacking conjunct ⟨ᬚ �⟩. The output of this sequence is an as-expected Balinese conjunct form, with the second consonant subjoined on to the first consonant. On the other hand, the ARCHAIC JNYA is not visually decomposable into separate JA and NYA glyphs. In some documents both forms of JNYA are found. It does not appear that they represent a specific semantic distinction, but the ability to encode both is necessary for the accurate transcription of older Balinese documents. Interestingly, the Javanese cognate grapheme to ARCHAIC JNYA is already encoded, as JAVANESE LETTER NYA MURDA. -
Curriculum and Resources for First Nations Language Programs in BC First Nations Schools
Curriculum and Resources for First Nations Language Programs in BC First Nations Schools Resource Directory: Table of Contents and Section Descriptions 1. Linguistic Resources Academic linguistics articles, reference materials, and online language resources for each BC First Nations language. 2. Language-Specific Resources Practical teaching resources and curriculum identified for each BC First Nations language. 3. Adaptable Resources General curriculum and teaching resources which can be adapted for teaching BC First Nations languages: books, curriculum documents, online and multimedia resources. Includes copies of many documents in PDF format. 4. Language Revitalization Resources This section includes general resources on language revitalization, as well as resources on awakening languages, teaching methods for language revitalization, materials and activities for language teaching, assessing the state of a language, envisioning and planning a language program, teacher training, curriculum design, language acquisition, and the role of technology in language revitalization. 5. Language Teaching Journals A list of journals relevant to teachers of BC First Nations languages. 6. Further Education This section highlights opportunities for further education, training, certification, and professional development. It includes a list of conferences and workshops relevant to BC First Nations language teachers, and a spreadsheet of post-secondary programs relevant to Aboriginal Education and Teacher Training - in BC, across Canada, in the USA, and around the world. 7. Funding This section includes a list of funding sources for Indigenous language revitalization programs, as well as a list of scholarships and bursaries available for Aboriginal students and students in the field of Education, in BC, across Canada, and at specific institutions. -
Quantifier in Javanese Reduplication
Advances in Social Science, Education and Humanities Research (ASSEHR), volume 108 Social Sciences, Humanities and Economics Conference (SoSHEC 2017) The Unique Vowel-Play for “Negative” Quantifier in Javanese Reduplication Slamet Setiawan and Ayunita Leliana, English Language and Literature Department, Universitas Negeri Surabaya, Surabaya, Indonesia Abstract: Native speakers of a given language, including Javanese, may not be aware of their detail language system. This is due to the fact that they are able to use their first language through natural process of language transmission from care givers and members of community. Furthermore, it seems that they are given a talent to produce “strange” expressions automatically, for instances, mloka-mlaku vs. mlaku-mlaku, ngguya-ngguyu vs. ngguyu-ngguyu. This paper is intended to reveal how the reduplication system for “negative” quantifier works in Javanese language when phonology is taken into account. Descriptive qualitative is applied with documentation and interview. The findings suggest that the occurrence of reduplication system for “negative” quantifier is predictable. Firstly, the words are reflected regressively from the source of the word. types of reduplication, there is a phenomenon that needs to be revealed. The fact that there is somewhat melody for this reduplication is worth investigating. Study the following examples as in (1a, 1b, and 1c). Note to be made that all Javanese words have been consulted to Javanese dictionary written by [5]. The base is mlaku ‘walk’. The possible reduplications are: (1a) mlaku-mlaku ‘go sightseeing’ (1b) mlaka-mlaku* - (1c) mloka-mlaku ‘walk repeatedly within short period of time’ Reduplication form in (1) shows that one word of mlaku can -
Introduction to Old Javanese Language and Literature: a Kawi Prose Anthology
THE UNIVERSITY OF MICHIGAN CENTER FOR SOUTH AND SOUTHEAST ASIAN STUDIES THE MICHIGAN SERIES IN SOUTH AND SOUTHEAST ASIAN LANGUAGES AND LINGUISTICS Editorial Board Alton L. Becker John K. Musgrave George B. Simmons Thomas R. Trautmann, chm. Ann Arbor, Michigan INTRODUCTION TO OLD JAVANESE LANGUAGE AND LITERATURE: A KAWI PROSE ANTHOLOGY Mary S. Zurbuchen Ann Arbor Center for South and Southeast Asian Studies The University of Michigan 1976 The Michigan Series in South and Southeast Asian Languages and Linguistics, 3 Open access edition funded by the National Endowment for the Humanities/ Andrew W. Mellon Foundation Humanities Open Book Program. Library of Congress Catalog Card Number: 76-16235 International Standard Book Number: 0-89148-053-6 Copyright 1976 by Center for South and Southeast Asian Studies The University of Michigan Printed in the United States of America ISBN 978-0-89148-053-2 (paper) ISBN 978-0-472-12818-1 (ebook) ISBN 978-0-472-90218-7 (open access) The text of this book is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License: https://creativecommons.org/licenses/by-nc-nd/4.0/ I made my song a coat Covered with embroideries Out of old mythologies.... "A Coat" W. B. Yeats Languages are more to us than systems of thought transference. They are invisible garments that drape themselves about our spirit and give a predetermined form to all its symbolic expression. When the expression is of unusual significance, we call it literature. "Language and Literature" Edward Sapir Contents Preface IX Pronunciation Guide X Vowel Sandhi xi Illustration of Scripts xii Kawi--an Introduction Language and History 1 Language and Its Forms 3 Language and Systems of Meaning 6 The Texts 10 Short Readings 13 Sentences 14 Paragraphs.. -
Annual Meeting Handbook
MEETING HANDBOOK LINGUISTIC SOCIETY OF AMERICA AMERICAN DIALECT SOCIETY AMERICAN NAME SOCIETY NORTH AMERICAN ASSOCIATION FOR THE HISTORY OF THE LANGUAGE SCIENCES SOCIETY FOR PIDGIN AND CREOLE LINGUISTICS SOCIETY FOR THE STUDY OF THE INDIGENOUS LANGUAGES OF THE AMERICAS SHERATON BOSTON HOTEL BOSTON, MA 8-11 JANUARY 2004 Introductory Note The LSA Secretariat has prepared this Meeting Handbook to serve as the official program for the 78th Annual Meeting of the Linguistic Society of America (LSA). In addition, this handbook is the official program for the Annual Meetings of the American Dialect Society (ADS), the American Name Society (ANS), the North American Association for the History of the Language Sciences (NAAHoLS), the Society for Pidgin and Creole Linguistics (SPCL), and the Society for the Study of the Indigenous Languages of the Americas (SSILA). We gratefully acknowledge the assistance provided by the LSA Program Committee: (William Idsardi, Chair; Diane Brentari; Peter Culicover; Toshiyuki Ogihara; Margaret Speas; Rosalind Thornton; Lindsay Whaley; and Draga Zec) and the help of the members who served as consultants to the Program Committee. We are also grateful to Marlyse Baptista (SPCL), David Boe (NAAHoLS), Edwin Lawson (ANS), Allan Metcalf (ADS), and Victor Golla (SSILA) for their cooperation. We appreciate the help given by the Boston Local Arrangements Committee chaired by Carol Neidle. We hope this Meeting Handbook is a useful guide for those attending, as well as a permanent record of, the 2004 Annual Meeting in Boston, -
Unicode Cree Syllabics for Windows and Macintosh
Unicode Cree Syllabics for Windows and Macintosh 37th Algonquian Conference, Ottawa 2005 Bill Jancewicz SIL International and Naskapi Development Corporation ABSTRACT Submitted as an update to a presentation made at the 34th Algonquian Conference (Kingston). The ongoing development of the operating systems has included increased support for cross-platform use of Unicode syllabic script. Key improvements that were included in Macintosh's OS X.3 (Panther) operating system now allow direct keyboarding of Unicode characters by means of a user-defined input method. A summary and comparison of the available tools for handling Unicode on both Windows and Macintosh will be discussed. INTRODUCTION Since 1988 the author has been working in the Naskapi language at Kawawachikamach with the primary purpose of linguistic analysis and Bible translation, sponsored by Wycliffe Bible Translators and SIL International. Related work includes mother tongue translator training, Naskapi literature and curriculum development. Along with the language work the author also developed methods of production for Naskapi language materials in syllabic script by means of the computer. With the advent of high quality publishing capabilities in newer computers, procedures were updated to keep pace with the improving technology. With resources available from SIL International computer services department, a very satisfactory system of keyboarding syllabic texts in Naskapi was developed. At the urging of colleagues working in related languages, the system was expanded to include the wider inventory of standard Eastern and Western Cree syllabics. While this has pushed the limit of what is possible with the current technology, Unicode makes this practical. Note that the system originally developed for keyboarding Naskapi syllabics is similar but not identical to the Cree system, because of the unique local orthography in use at Kawawachikamach. -
Introduction the Hymns
Father Morice's Rendering of Latin in Carrier Syllabics* William J. Poser Introduction Father Adrien-Gabriel Morice, O.M.I., missionary in Fort Saint James, British Columbia from 1885 to 1904, fluent in the Carrier language and an early Athabascanist of distinction, published three editions of a Carrier Prayer Book, consisting mostly of prayers, hymns, and catechism lessons which he had translated into Carrier. All three editions also contain a number of Latin hymns. In the third edition, whose Carrier text is written in the somewhat idiosyncratic Roman-based phonetic notation that Father Morice used for his scholarly work (e.g. Morice 1932), the Latin hymns are written in the Roman alphabet in the usual spelling, but in the first two editions both the Carrier text and the Latin hymns are written in the Déné syllabics, the derivative of the Cree syllabics that he created. The Hymns We begin by presenting the text of the hymns. For each hymn, the first column gives the syllabic text, the second column a phonemic IPA transcription of the syllabic text, and the third column the usual Latin orthography. Images of the Prayerbook text are available separately.
O Salutaris Hostia
ᐃ ᙓᘦᑕrᐉᔆ ᐱᔆᑔᘓ | o salitaɹis hostija | O Salutaris Hostia
ᐟᗓ ᙑᘦ ᗭᐣᑔᔆ ᐃᔆᑔᘏᒼ | kwe seli pantis ostijom | Quae coeli pandis ostium,
ᗫᘧ ᗮrᐈᘉᐪ ᐱᔆᑔᘦᘓ. | pela pɹemot hostilija. | Bella praemunt hostilia!
ᑎ rᐃᗬr, ɥᐈr ᐃᐟᙒᘦᘏᒼ | to ɹopir, ver oksilijom | Da robur, fer auxilium.
ᐁᘆ ᐪrᐉᘃᐟᗓ ᑎᘌᘃ | uni tɹinokwe tomino | Uni trinoque Domino
ᙒᐪ ᙑᒼᗬᑓrᘇ ᐟᘣrᐉᘓ, | sit sempiterna kloɹija, | Sit sempiterna gloria!
ᐟᗔ ɥᐉᑕᒼ ᙒᘅ ᑓrᘌᘃ | kwi vitam sine termino | Qui vitam sine termino
ᘃᗬᔆ ᑎᘅᐪ ᐉᐣ ᗭᐪrᐉᘓ. -
Kamloops Chinúk Wawa, Chinuk Pipa, and the Vitality of Pidgins
Kamloops Chinúk Wawa, Chinuk pipa, and the vitality of pidgins by David Douglas Robertson B.A., Columbia University, 1988 A Dissertation Submitted in Partial Fulfillment of the Requirements for the Degree of DOCTOR OF PHILOSOPHY in the Department of Linguistics © David Douglas Robertson, 2011 University of Victoria All rights reserved. This dissertation may not be reproduced in whole or in part, by photocopying or other means, without the permission of the author. Supervisory Committee Dr. Ewa Czaykowska-Higgins, Supervisor (Department of Linguistics, University of Victoria) Dr. Sarah Grey Thomason, Departmental Member (Department of Linguistics, University of Victoria; University of Michigan) Dr. Wendy Wickwire, Outside Member (Department of History, University of Victoria) Abstract This dissertation presents the first full grammatical description of unprompted (spontaneous) speech in pidgin Chinook Jargon [synonyms Chinúk Wawa, Chinook]. The data come from a dialect I term ‘Kamloops Chinúk Wawa’, used in southern interior British Columbia circa 1900. I also present the first historical study and structural analysis of the shorthand-based ‘Chinuk pipa’ alphabet in which Kamloops Chinúk Wawa was written, primarily by Salish people. This study is made possible by the discovery of several hundred such texts, which I have transliterated and analyzed. The Basic Linguistic Theory-inspired (cf.