ISO/IEC JTC1/SC2/WG2 N2856 L2/04-357 Structure
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
A Method to Accommodate Backward Compatibility on the Learning Application-Based Transliteration to the Balinese Script
(IJACSA) International Journal of Advanced Computer Science and Applications, Vol. 12, No. 6, 2021 A Method to Accommodate Backward Compatibility on the Learning Application-based Transliteration to the Balinese Script 1 3 4 Gede Indrawan , I Gede Nurhayata , Sariyasa I Ketut Paramarta2 Department of Computer Science Department of Balinese Language Education Universitas Pendidikan Ganesha (Undiksha) Universitas Pendidikan Ganesha (Undiksha) Singaraja, Indonesia Singaraja, Indonesia Abstract—This research proposed a method to accommodate transliteration rules (for short, the older rules) from The backward compatibility on the learning application-based Balinese Alphabet document 1 . It exposes the backward transliteration to the Balinese Script. The objective is to compatibility method to accommodate the standard accommodate the standard transliteration rules from the transliteration rules (for short, the standard rules) from the Balinese Language, Script, and Literature Advisory Agency. It is Balinese Language, Script, and Literature Advisory Agency considered as the main contribution since there has not been a [7]. This Bali Province government agency [4] carries out workaround in this research area. This multi-discipline guidance and formulates programs for the maintenance, study, collaboration work is one of the efforts to preserve digitally the development, and preservation of the Balinese Language, endangered Balinese local language knowledge in Indonesia. The Script, and Literature. proposed method covered two aspects, i.e. (1) Its backward compatibility allows for interoperability at a certain level with This study was conducted on the developed web-based the older transliteration rules; and (2) Breaking backward transliteration learning application, BaliScript, for further compatibility at a certain level is unavoidable since, for the same ubiquitous Balinese Language learning since the proposed aspect, there is a contradictory treatment between the standard method reusable for the mobile application [8], [9]. -
Iso/Iec Jtc1/Sc22/Wg20 N809r
ISO/IEC JTC1/SC22/WG20 N809R 2001-01-09 Internationalization International Organization for Standardization Organisation internationale de normalisation еждународнаяорганизацияпостандартизации Doc Type: Working Group Document Title: Ordering the Runic script Source: Michael Everson Status: Expert Contribution Date: 2001-01-09 On 2000-12-24 Olle Järnefors published on behalf of the ISORUNES Project in Sweden a proposal for ordering the Runes in the Common Tailorable Template (CTT) of ISO/IEC 14651. In my view this ordering is unsuitable for the CTT for a number of reasons. Runic ordering in ISO/IEC 10646. The Runes are encoded at U+16A0–U+16FF, in a unified set of characters encompassing the four major traditions of Runic use: Germanic, Anglo-Frisian, Danish, and Swedish-Norwegian, and Medieval. The Runes are arranged in the code table agreed by ISO/IEC JTC1/SC2/WG2 in an order based on the the traditional positions of the Runes in abecedaries, namely, the fuþark order. This order is known from hundreds of primary sources which list the Runes in sequence, often with no other text. Most scholarly texts refer to the fuþark in one way or another. Nearly all secondary texts, whether popular introductions to the Runes or New-Age esoterica, give primacy to the traditional fuþark sequence. Runic names in ISO/IEC 10646. The names given to the Runes in the UCS may be a bit clumsy, but they are intended to serve the needs of scholars and amateurs alike; not everyone is familiar with Runic transliteration practices, and not everyone is conversant with the traditional names in Germanic, English, and Scandinavian usage. -
Ka И @И Ka M Л @Л Ga Н @Н Ga M М @М Nga О @О Ca П
ISO/IEC JTC1/SC2/WG2 N3319R L2/07-295R 2007-09-11 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation Международная организация по стандартизации Doc Type: Working Group Document Title: Proposal for encoding the Javanese script in the UCS Source: Michael Everson, SEI (Universal Scripts Project) Status: Individual Contribution Action: For consideration by JTC1/SC2/WG2 and UTC Replaces: N3292 Date: 2007-09-11 1. Introduction. The Javanese script, or aksara Jawa, is used for writing the Javanese language, the native language of one of the peoples of Java, known locally as basa Jawa. It is a descendent of the ancient Brahmi script of India, and so has many similarities with modern scripts of South Asia and Southeast Asia which are also members of that family. The Javanese script is also used for writing Sanskrit, Jawa Kuna (a kind of Sanskritized Javanese), and Kawi, as well as the Sundanese language, also spoken on the island of Java, and the Sasak language, spoken on the island of Lombok. Javanese script was in current use in Java until about 1945; in 1928 Bahasa Indonesia was made the national language of Indonesia and its influence eclipsed that of other languages and their scripts. Traditional Javanese texts are written on palm leaves; books of these bound together are called lontar, a word which derives from ron ‘leaf’ and tal ‘palm’. 2.1. Consonant letters. Consonants have an inherent -a vowel sound. Consonants combine with following consonants in the usual Brahmic fashion: the inherent vowel is “killed” by the PANGKON, and the follow- ing consonant is subjoined or postfixed, often with a change in shape: §£ ndha = § NA + @¿ PANGKON + £ DA-MAHAPRANA; üù n. -
Suspicious Identity of U+A9B5 JAVANESE VOWEL SIGN TOLONG
L2/19-003 Suspicious identity of U+A9B5 JAVANESE VOWEL SIGN TOLONG Liang Hai / 梁海 <[email protected]> Aditya Bayu Perdana / <[email protected]> ꦄꦢꦶꦠꦾ ꦧꦪꦸꦥꦢꦤ 4 January 2019 1 Acknowledgements The authors would like to thank Ilham Nurwansah and the Script Ad Hoc group for their feedback. Ilham Nurwansah also kindly provided the Sundanese samples (Figure 2, 3, 4, and 5). 2 Background In the original Unicode Javanese proposal L2/08-015R Proposal for encoding the Javanese script in the UCS, the character tolong (U+A9B5 JAVANESE VOWEL SIGN TOLONG) was described as a vowel sign that is used exclusively in the Sundanese writing system with three major use cases: 1. Used alone as the vowel sign o 2. As a part of the vowel sign eu: <vowel sign ĕ, tolong> 3. As a part of the letters and conjoined forms of reu/leu: <letter / conjoined form rĕ/lĕ, tolong> Table 1. Sundanese tolong usage according to the original proposal Written form ◌ ◌ꦵ ◌ꦼ ◌ꦼꦵ ◌� ◌�ꦵ A9C0 PANGKON A9BC PEPET Encoding (A9B5 TOLONG) A989 PA CEREK (A9B5 TOLONG) (A9B5 TOLONG) Transcription a o ĕ eu rĕ reu Pronunciation [a] [o] [ə] [ɤ] [rə] [rɤ] See also the note under Table 2. However, tolong appears to be merely a stylistic variant of tarung (U+A9B4 JAVANESE VOWEL SIGN TARUNG), therefore the disunification of tolong from tarung is likely a mistake. 3 Proposal The Unicode Standard needs to recommend how the inappropriately disunified character U+A9B5 JAVANESE VOWEL SIGN TOLONG should be handled. 1 In particular, clarification in the names list and the Core Specification is necessary for explaining the background of the mis-disunification and recommending how both the tarung and tolong forms for both the Javanese and Sundanese languages should be implemented. -
Ahom Range: 11700–1174F
Ahom Range: 11700–1174F This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 14.0 This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard. See https://www.unicode.org/errata/ for an up-to-date list of errata. See https://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See https://www.unicode.org/charts/PDF/Unicode-14.0/ for charts showing only the characters added in Unicode 14.0. See https://www.unicode.org/Public/14.0.0/charts/ for a complete archived file of character code charts for Unicode 14.0. Disclaimer These charts are provided as the online reference to the character contents of the Unicode Standard, Version 14.0 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 14.0, online at https://www.unicode.org/versions/Unicode14.0.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, #44, #45, and #50, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online. See https://www.unicode.org/ucd/ and https://www.unicode.org/reports/ A thorough understanding of the information contained in these additional sources is required for a successful implementation. -
1 PASANGAN DAN SANDHANGAN DALAM AKSARA JAWA Oleh
PASANGAN DAN SANDHANGAN DALAM AKSARA JAWA1 oleh: Sri Hertanti Wulan [email protected] Jurusan Pendidikan Bahasa Daerah FBS UNY Aksara nglegena yang digunakan dalam ejaan bahasa Jawa pada dasarnya terdiri atas dua puluh aksara yang bersifat silabik Darusuprapta (2002: 12-13). Masing-masing aksara mempunyai aksara pasangan, yakni aksara yang berfungsi untuk menghubungkan suku kata mati/tertutup dengan suku kata berikutnya, kecuali suku kata yang tertutup dengan wignyan (.. ), layar (....), dan cecak (....). Pasangan – pasangan tersebut antara lain: a) Aksara pasangan wutuh, ditulis di bawah aksara yang diberi pasangan dan tidak disambung, yaitu antara lain: Tabel 1 Pasangan Wutuh pasangan Wujud Contoh Ra … dalan ramé= Ya … tumbas yuyu = Ga … dalan gĕdhé = .… dolan ngomah = nga b) Aksara pasangan tugelan ditulis di belakang aksara yang diberi pasangan dan tidak disambungkan dengan aksara yang diberi pasangan, yaitu antara lain: 1Disampaikan dalam PPM PelatihanAksaraJawadan PendirianHanacaraka Centre sebagaiRevitalisasiFungsiAksaraJawa kerjasama FBS UNY dan Dinas Dikpora DIY. Dilaksanakan di Dikpora DIY, Senin 28 Oktober 2013. 1 Tabel 2.1 Pasangan Tugelan pasangan Wujud Contoh ha … adhĕm hawané = pa … bakul pĕlĕm = sa … dalan sĕpi = c) Aksara pasangan tugelan ditulis di bawah aksara yang diberi pasangan dan tidak disambungkan dengan aksara yang diberi pasangan, yaitu antara lain: Tabel 2.2 Pasangan Tugelan pasangan Wujud Contoh kilèn kalen= ka … wis takon = ta … tas larang = la … Pasangan – pasangan tersebut, bila mendapatkan sandhangan -
ISO/IEC JTC1/SC2/WG2 N2664 L2/03-393 A. Administrative B. Technical -- General
ISO/IEC JTC1/SC2/WG2 N2664 L2/03-393 2003-11-02 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation internationale de normalisation еждународная организация по стандартизации Doc Type: Working Group Document Title: Preliminary proposal to encode the Cuneiform script in the SMP of the UCS Source: Michael Everson, Karljürgen Feuerherm, Steve Tinney Status: Individual Contribution Date: 2003-11-02 A. Administrative 1. Title Preliminary proposal to encode the Cuneiform script in the SMP of the UCS. 2. Requester’s name Michael Everson, Karljürgen Feuerherm, Steve Tinney 3. Requester type (Member body/Liaison/Individual contribution) Individual contribution. 4. Submission date 2003-11-02 5. Requester’s reference (if applicable) 6. Choose one of the following: 6a. This is a complete proposal No. This is a preliminary proposal 6b. More information will be provided later Yes. B. Technical -- General 1. Choose one of the following: 1a. This proposal is for a new script (set of characters) Yes. Proposed name of script Cuneiform and Cuneiform Numbers. 1b. The proposal is for addition of character(s) to an existing block No. 1b. Name of the existing block 2. Number of characters in proposal 952. 3. Proposed category (see section II, Character Categories) Category B. 4a. Proposed Level of Implementation (1, 2 or 3) (see clause 14, ISO/IEC 10646-1: 2000) Level 1. 4b. Is a rationale provided for the choice? Yes. 4c. If YES, reference Characters are ordinary spacing characters. 5a. Is a repertoire including character names provided? Yes. 5b. If YES, are the names in accordance with the character naming guidelines in Annex L of ISO/IEC 10646-1: 2000? Yes. -
Source Readings in Javanese Gamelan and Vocal Music, Volume 2
THE UNIVERSITY OF MICHIGAN CENTER FOR SOUTH AND SOUTHEAST ASIAN STUDIES MICHIGAN PAPERS ON SOUTH AND SOUTHEAST ASIA Editorial Board Alton L. Becker Karl L. Hutterer John K. Musgrave Peter E. Hook, Chairman Ann Arbor, Michigan USA KARAWITAN SOURCE READINGS IN JAVANESE GAMELAN AND VOCAL MUSIC Judith Becker editor Alan H. Feinstein assistant editor Hardja Susilo Sumarsam A. L. Becker consultants Volume 2 MICHIGAN PAPERS ON SOUTH AND SOUTHEAST ASIA; Center for South and Southeast Asian Studies The University of Michigan Number 30 Open access edition funded by the National Endowment for the Humanities/ Andrew W. Mellon Foundation Humanities Open Book Program. Library of Congress Catalog Card Number: 82-72445 ISBN 0-89148-034-X Copyright ©' 1987 by Center for South and Southeast Asian Studies The University of Michigan Publication of this book was assisted in part by a grant from the Publications Program of the National Endowment for the Humanities. Additional funding or assistance was provided by the National Endowment for the Humanities (Transla- tions); the Southeast Asia Regional Council, Association for Asian Studies; The Rackham School of Graduate Studies, The University of Michigan; and the School of Music, The University of Michigan. Printed in the United States of America ISBN 978-0-89-148034-1 (hardcover) ISBN 978-0-47-203819-0 (paper) ISBN 978-0-47-212769-6 (ebook) ISBN 978-0-47-290165-4 (open access) The text of this book is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License: https://creativecommons.org/licenses/by-nc-nd/4.0/ CONTENTS PREFACE: TRANSLATING THE ART OF MUSIC A. -
Jtc1/Sc2/Wg2 N3427 L2/08-132
JTC1/SC2/WG2 N3427 L2/08-132 2008-04-08 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation Международная организация по стандартизации Doc Type: Working Group Document Title: Proposal to encode 39 Unified Canadian Aboriginal Syllabics in the UCS Source: Michael Everson and Chris Harvey Status: Individual Contribution Action: For consideration by JTC1/SC2/WG2 and UTC Date: 2008-04-08 1. Summary. This document requests 39 additional characters to be added to the UCS and contains the proposal summary form. 1. Syllabics hyphen (U+1400). Many Aboriginal Canadian languages use the character U+1428 CANADIAN SYLLABICS FINAL SHORT HORIZONTAL STROKE, which looks like the Latin script hyphen. Algonquian languages like western dialects of Cree, Oji-Cree, western and northern dialects of Ojibway employ this character to represent /tʃ/, /c/, or /j/, as in Plains Cree ᐊᓄᐦᐨ /anohc/ ‘today’. In Athabaskan languages, like Chipewyan, the sound is /d/ or an alveolar onset, as in Sayisi Dene ᐨᕦᐣᐨᕤ /t’ąt’ú/ ‘how’. To avoid ambiguity between this character and a line-breaking hyphen, a SYLLABICS HYPHEN was developed which resembles an equals sign. Depending on the typeface, the width of the syllabics hyphen can range from a short ᐀ to a much longer ᐀. This hyphen is line-breaking punctuation, and should not be confused with the Blackfoot syllable internal-w final proposed for U+167F. See Figures 1 and 2. 2. DHW- additions for Woods Cree (U+1677..U+167D). ᙷᙸᙹᙺᙻᙼᙽ/ðwē/ /ðwi/ /ðwī/ /ðwo/ /ðwō/ /ðwa/ /ðwā/. The basic syllable structure in Cree is (C)(w)V(C)(C). -
Proposal to Encode Balinese Archaic Jnya 1. Introduction
Proposal to encode Balinese Archaic Jnya Ben Yang 楊�� Aditya Bayu Perdana ꦄꦢꦶꦠꦾꦧꦪꦸꦥ�ꦢꦤ Director of Technology Typographer PanLex — The Long Now Foundation [email protected] [email protected] July 10, 02019 (most recent revision) 1. Introduction Balinese is an Austronesian language spoken on the island of Bali in Indonesia. It is currently written in two scripts, Latin and Balinese. The island of Bali has a long literary history, with extensive traditional literature in Sanskrit, Kawi (Old Javanese), and Balinese all written in the Balinese script. The Balinese script is currently well-supported in Unicode, and nearly all traditional literature can be accurately encoded. However, in the process of researching some older documents, we have discovered one character that is not currently covered by the existing encoding model. The character, from now on referred to as "ARCHAIC JNYA", is not found in modern Balinese documents, but is found in older documents. It represents the sound /dʒɲa/, which in modern Balinese is depicted using the sequence BALINESE LETTER JA+BALINESE ADEG (virama) ADEG +BALINESE LETTER NYA, forming the stacking conjunct ⟨ᬚ �⟩. The output of this sequence is an as-expected Balinese conjunct form, with the second consonant subjoined on to the first consonant. On the other hand, the ARCHAIC JNYA is not visually decomposable into separate JA and NYA glyphs. In some documents both forms of JNYA are found. It does not appear that they represent a specific semantic distinction, but the ability to encode both is necessary for the accurate transcription of older Balinese documents. Interestingly, the Javanese cognate grapheme to ARCHAIC JNYA is already encoded, as JAVANESE LETTER NYA MURDA . -
Comments Received for ISO 639-3 Change Request 2015-046 Outcome
Comments received for ISO 639-3 Change Request 2015-046 Outcome: Accepted after appeal Effective date: May 27, 2016 SIL International ISO 639-3 Registration Authority 7500 W. Camp Wisdom Rd., Dallas, TX 75236 PHONE: (972) 708-7400 FAX: (972) 708-7380 (GMT-6) E-MAIL: [email protected] INTERNET: http://www.sil.org/iso639-3/ Registration Authority decision on Change Request no. 2015-046: to create the code element [ovd] Ӧvdalian . The request to create the code [ovd] Ӧvdalian has been reevaluated, based on additional information from the original requesters and extensive discussion from outside parties on the IETF list. The additional information has strengthened the case and changed the decision of the Registration Authority to accept the code request. In particular, the long bibliography submitted shows that Ӧvdalian has undergone significant language development, and now has close to 50 publications. In addition, it has been studied extensively, and the academic works should have a distinct code to distinguish them from publications on Swedish. One revision being added by the Registration Authority is the added English name “Elfdalian” which was used in most of the extensive discussion on the IETF list. Michael Everson [email protected] May 4, 2016 This is an appeal by the group responsible for the IETF language subtags to the ISO 639 RA to reconsider and revert their earlier decision and to assign an ISO 639-3 language code to Elfdalian. The undersigned members of the group responsible for the IETF language subtag are concerned about the rejection of the Elfdalian language. There is no doubt that its linguistic features are unique in the continuum of North Germanic languages. -
ISO/IEC JTC1/SC2/WG2 N 2005 Date: 1999-05-29
ISO INTERNATIONAL ORGANIZATION FOR STANDARDIZATION ORGANISATION INTERNATIONALE DE NORMALISATION --------------------------------------------------------------------------------------- ISO/IEC JTC1/SC2/WG2 Universal Multiple-Octet Coded Character Set (UCS) -------------------------------------------------------------------------------- ISO/IEC JTC1/SC2/WG2 N 2005 Date: 1999-05-29 TITLE: ISO/IEC 10646-1 Second Edition text, Draft 2 SOURCE: Bruce Paterson, project editor STATUS: Working paper of JTC1/SC2/WG2 ACTION: For review and comment by WG2 DISTRIBUTION: Members of JTC1/SC2/WG2 1. Scope This paper provides a second draft of the text sections of the Second Edition of ISO/IEC 10646-1. It replaces the previous paper WG2 N 1796 (1998-06-01). This draft text includes: - Clauses 1 to 27 (replacing the previous clauses 1 to 26), - Annexes A to R (replacing the previous Annexes A to T), and is attached here as “Draft 2 for ISO/IEC 10646-1 : 1999” (pages ii & 1 to 77). Published and Draft Amendments up to Amd.31 (Tibetan extended), Technical Corrigenda nos. 1, 2, and 3, and editorial corrigenda approved by WG2 up to 1999-03-15, have been applied to the text. The draft does not include: - character glyph tables and name tables (these will be provided in a separate WG2 document from AFII), - the alphabetically sorted list of character names in Annex E (now Annex G), - markings to show the differences from the previous draft. A separate WG2 paper will give the editorial corrigenda applied to this text since N 1796. The editorial corrigenda are as agreed at WG2 meetings #34 to #36. Editorial corrigenda applicable to the character glyph tables and name tables, as listed in N1796 pages 2 to 5, have already been applied to the draft character tables prepared by AFII.