The Unicode Standard, Version 6.2

Total Page:16

File Type:pdf, Size:1020Kb

The Unicode Standard, Version 6.2 Javanese Range: A980–A9DF This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 6.2 This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard. See http://www.unicode.org/errata/ for an up-to-date list of errata. See http://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See http://www.unicode.org/charts/PDF/Unicode-6.2/ for charts showing only the characters added in Unicode 6.2. See http://www.unicode.org/Public/6.2.0/charts/ for a complete archived file of character code charts for Unicode 6.2. Disclaimer These charts are provided as the online reference to the character contents of the Unicode Standard, Version 6.2 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 6.2, online at http://www.unicode.org/versions/Unicode6.2.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, #44, and #45, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online. See http://www.unicode.org/ucd/ and http://www.unicode.org/reports/ A thorough understanding of the information contained in these additional sources is required for a successful implementation. Fonts The shapes of the reference glyphs used in these code charts are not prescriptive. Considerable variation is to be expected in actual fonts. The particular fonts used in these charts were provided to the Unicode Consortium by a number of different font designers, who own the rights to the fonts. See http://www.unicode.org/charts/fonts.html for a list. Terms of Use You may freely use these code charts for personal or internal business uses only. You may not incorporate them either wholly or in part into any product or publication, or otherwise distribute them without express written permission from the Unicode Consortium. However, you may provide links to these charts. The fonts and font data used in production of these code charts may NOT be extracted, or used in any other way in any product or publication, without permission or license granted by the typeface owner(s). The Unicode Consortium is not liable for errors or omissions in this file or the standard itself. Information on characters added to the Unicode Standard since the publication of the most recent version of the Unicode Standard, as well as on characters currently being considered for addition to the Unicode Standard can be found on the Unicode web site. See http://www.unicode.org/pending/pending.html and http://www.unicode.org/alloc/Pipeline.html. Copyright © 1991-2012 Unicode, Inc. All rights reserved. A980 Javanese A9DF A98 A99 A9A A9B A9C A9D 0 $ꦀ ꦐ ꦠ ꦰ $꧀ ꧐ A980 A990 A9A0 A9B0 A9C0 A9D0 1 $ꦁ ꦑ ꦡ ꦱ ꧁ ꧑ A981 A991 A9A1 A9B1 A9C1 A9D1 2 $ꦂ ꦒ ꦢ ꦲ ꧂ ꧒ A982 A992 A9A2 A9B2 A9C2 A9D2 3 $ꦃ ꦓ ꦣ $꦳ ꧃ ꧓ A983 A993 A9A3 A9B3 A9C3 A9D3 4 ꦄ ꦔ ꦤ $ꦴ ꧄ ꧔ A984 A994 A9A4 A9B4 A9C4 A9D4 5 ꦅ ꦕ ꦥ $ꦵ ꧅ ꧕ A985 A995 A9A5 A9B5 A9C5 A9D5 6 ꦆ ꦖ ꦦ $ꦶ ꧆ ꧖ A986 A996 A9A6 A9B6 A9C6 A9D6 7 ꦇ ꦗ ꦧ $ꦷ ꧇ ꧗ A987 A997 A9A7 A9B7 A9C7 A9D7 8 ꦈ ꦘ ꦨ $ꦸ ꧈ ꧘ A988 A998 A9A8 A9B8 A9C8 A9D8 9 ꦉ ꦙ ꦩ $ꦹ ꧉ ꧙ A989 A999 A9A9 A9B9 A9C9 A9D9 A ꦊ ꦚ ꦪ $ꦺ ꧊ A98A A99A A9AA A9BA A9CA B ꦋ ꦛ ꦫ $ꦻ ꧋ A98B A99B A9AB A9BB A9CB C ꦌ ꦜ ꦬ $ꦼ ꧌ A98C A99C A9AC A9BC A9CC D ꦍ ꦝ ꦭ $ꦽ ꧍ A98D A99D A9AD A9BD A9CD E ꦎ ꦞ ꦮ $ꦾ ꧞ A98E A99E A9AE A9BE A9DE F ꦏ ꦟ ꦯ $ꦿ ꧏ ꧟ A98F A99F A9AF A9BF A9CF A9DF The Unicode Standard 6.2, Copyright © 1991-2012 Unicode, Inc. All rights reserved. A980 Javanese A9CD Various signs A9A9 ꦩ JAVANESE LETTER MA A980 $ꦀ JAVANESE SIGN PANYANGGA A9AA ꦪ JAVANESE LETTER YA = ardhacandra A9AB ꦫ JAVANESE LETTER RA A981 $ꦁ JAVANESE SIGN CECAK A9AC ꦬ JAVANESE LETTER RA AGUNG = anusvara A9AD ꦭ JAVANESE LETTER LA A982 $ꦂ JAVANESE SIGN LAYAR A9AE ꦮ JAVANESE LETTER WA = repha A9AF ꦯ JAVANESE LETTER SA MURDA A983 $ꦃ JAVANESE SIGN WIGNYAN = sha = visarga A9B0 ꦰ JAVANESE LETTER SA MAHAPRANA Letters = ssa A9B1 ꦱ JAVANESE LETTER SA A984 ꦄ JAVANESE LETTER A A9B2 ꦲ JAVANESE LETTER HA A985 ꦅ JAVANESE LETTER I KAWI A986 ꦆ JAVANESE LETTER I Sign A987 ꦇ JAVANESE LETTER II A9B3 $꦳ JAVANESE SIGN CECAK TELU A988 ꦈ JAVANESE LETTER U = nukta A989 ꦉ JAVANESE LETTER PA CEREK Dependent vowel signs = vocalic r JAVANESE VOWEL SIGN TARUNG JAVANESE LETTER NGA LELET A9B4 $ꦴ A98A ꦊ = aa = vocalic l JAVANESE VOWEL SIGN TOLONG JAVANESE LETTER NGA LELET RASWADI A9B5 $ꦵ A98B ꦋ = o = vocalic ll JAVANESE VOWEL SIGN WULU JAVANESE LETTER E A9B6 $ꦶ A98C ꦌ = i JAVANESE LETTER AI A98D ꦍ A9B7 JAVANESE VOWEL SIGN WULU MELIK JAVANESE LETTER O $ꦷ A98E ꦎ = ii JAVANESE LETTER KA A98F ꦏ A9B8 $ꦸ JAVANESE VOWEL SIGN SUKU A990 ꦐ JAVANESE LETTER KA SASAK = u = qa A9B9 $ꦹ JAVANESE VOWEL SIGN SUKU MENDUT A991 ꦑ JAVANESE LETTER KA MURDA = uu = kha A9BA $ꦺ JAVANESE VOWEL SIGN TALING A992 ꦒ JAVANESE LETTER GA = e A993 ꦓ JAVANESE LETTER GA MURDA A9BB $ꦻ JAVANESE VOWEL SIGN DIRGA MURE = gha = ai A994 ꦔ JAVANESE LETTER NGA A9BC $ꦼ JAVANESE VOWEL SIGN PEPET A995 ꦕ JAVANESE LETTER CA = ae JAVANESE LETTER CA MURDA A996 ꦖ Dependent consonant signs = cha JAVANESE CONSONANT SIGN KERET A997 ꦗ JAVANESE LETTER JA A9BD $ꦽ JAVANESE LETTER NYA MURDA = vocalic r A998 ꦘ JAVANESE CONSONANT SIGN PENGKAL = jnya A9BE $ꦾ JAVANESE LETTER JA MAHAPRANA = medial ya A999 ꦙ JAVANESE CONSONANT SIGN CAKRA = jha A9BF $ꦿ = medial ra A99A ꦚ JAVANESE LETTER NYA A99B ꦛ JAVANESE LETTER TTA Sign A99C ꦜ JAVANESE LETTER TTA MAHAPRANA A9C0 $꧀ JAVANESE PANGKON = ttha = virama JAVANESE LETTER DDA A99D ꦝ Punctuation JAVANESE LETTER DDA MAHAPRANA A99E ꦞ JAVANESE LEFT RERENGGAN = ddha A9C1 ꧁ JAVANESE LETTER NA MURDA A9C2 ꧂ JAVANESE RIGHT RERENGGAN A99F ꦟ JAVANESE PADA ANDAP = nna A9C3 ꧃ JAVANESE PADA MADYA A9A0 ꦠ JAVANESE LETTER TA A9C4 ꧄ JAVANESE PADA LUHUR A9A1 ꦡ JAVANESE LETTER TA MURDA A9C5 ꧅ = tha A9C6 ꧆ JAVANESE PADA WINDU A9A2 ꦢ JAVANESE LETTER DA A9C7 ꧇ JAVANESE PADA PANGKAT A9A3 ꦣ JAVANESE LETTER DA MAHAPRANA A9C8 ꧈ JAVANESE PADA LINGSA = dha = danda A9A4 ꦤ JAVANESE LETTER NA A9C9 ꧉ JAVANESE PADA LUNGSI A9A5 ꦥ JAVANESE LETTER PA = double danda JAVANESE PADA ADEG A9A6 ꦦ JAVANESE LETTER PA MURDA A9CA ꧊ = pha A9CB ꧋ JAVANESE PADA ADEG ADEG A9A7 ꦧ JAVANESE LETTER BA A9CC ꧌ JAVANESE PADA PISELEH A9A8 ꦨ JAVANESE LETTER BA MURDA A9CD ꧍ JAVANESE TURNED PADA PISELEH = bha The Unicode Standard 6.2, Copyright © 1991-2012 Unicode, Inc. All rights reserved. A9CF Javanese A9DF Syllable reduplicator A9CF ꧏ JAVANESE PANGRANGKEP → 0662 arabic-indic digit two Digits A9D0 ꧐ JAVANESE DIGIT ZERO A9D1 ꧑ JAVANESE DIGIT ONE A9D2 ꧒ JAVANESE DIGIT TWO A9D3 ꧓ JAVANESE DIGIT THREE A9D4 ꧔ JAVANESE DIGIT FOUR A9D5 ꧕ JAVANESE DIGIT FIVE A9D6 ꧖ JAVANESE DIGIT SIX A9D7 ꧗ JAVANESE DIGIT SEVEN A9D8 ꧘ JAVANESE DIGIT EIGHT A9D9 ꧙ JAVANESE DIGIT NINE Ellipsis marks A9DE ꧞ JAVANESE PADA TIRTA TUMETES A9DF ꧟ JAVANESE PADA ISEN-ISEN The Unicode Standard 6.2, Copyright © 1991-2012 Unicode, Inc. All rights reserved..
Recommended publications
  • Ka И @И Ka M Л @Л Ga Н @Н Ga M М @М Nga О @О Ca П
    ISO/IEC JTC1/SC2/WG2 N3319R L2/07-295R 2007-09-11 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation Международная организация по стандартизации Doc Type: Working Group Document Title: Proposal for encoding the Javanese script in the UCS Source: Michael Everson, SEI (Universal Scripts Project) Status: Individual Contribution Action: For consideration by JTC1/SC2/WG2 and UTC Replaces: N3292 Date: 2007-09-11 1. Introduction. The Javanese script, or aksara Jawa, is used for writing the Javanese language, the native language of one of the peoples of Java, known locally as basa Jawa. It is a descendent of the ancient Brahmi script of India, and so has many similarities with modern scripts of South Asia and Southeast Asia which are also members of that family. The Javanese script is also used for writing Sanskrit, Jawa Kuna (a kind of Sanskritized Javanese), and Kawi, as well as the Sundanese language, also spoken on the island of Java, and the Sasak language, spoken on the island of Lombok. Javanese script was in current use in Java until about 1945; in 1928 Bahasa Indonesia was made the national language of Indonesia and its influence eclipsed that of other languages and their scripts. Traditional Javanese texts are written on palm leaves; books of these bound together are called lontar, a word which derives from ron ‘leaf’ and tal ‘palm’. 2.1. Consonant letters. Consonants have an inherent -a vowel sound. Consonants combine with following consonants in the usual Brahmic fashion: the inherent vowel is “killed” by the PANGKON, and the follow- ing consonant is subjoined or postfixed, often with a change in shape: §£ ndha = § NA + @¿ PANGKON + £ DA-MAHAPRANA; üù n.
    [Show full text]
  • Suspicious Identity of U+A9B5 JAVANESE VOWEL SIGN TOLONG
    L2/19-003 Suspicious identity of U+A9B5 JAVANESE VOWEL SIGN TOLONG Liang Hai / 梁海 <[email protected]> Aditya Bayu Perdana / <[email protected]> ꦄꦢꦶꦠꦾ ꦧꦪꦸꦥꦢꦤ 4 January 2019 1 Acknowledgements The authors would like to thank Ilham Nurwansah and the Script Ad Hoc group for their feedback. Ilham Nurwansah also kindly provided the Sundanese samples (Figure 2, 3, 4, and 5). 2 Background In the original Unicode Javanese proposal L2/08-015R Proposal for encoding the Javanese script in the UCS, the character tolong (U+A9B5 JAVANESE VOWEL SIGN TOLONG) was described as a vowel sign that is used exclusively in the Sundanese writing system with three major use cases: 1. Used alone as the vowel sign o 2. As a part of the vowel sign eu: <vowel sign ĕ, tolong> 3. As a part of the letters and conjoined forms of reu/leu: <letter / conjoined form rĕ/lĕ, tolong> Table 1. Sundanese tolong usage according to the original proposal Written form ◌ ◌ꦵ ◌ꦼ ◌ꦼꦵ ◌� ◌�ꦵ A9C0 PANGKON A9BC PEPET Encoding (A9B5 TOLONG) A989 PA CEREK (A9B5 TOLONG) (A9B5 TOLONG) Transcription a o ĕ eu rĕ reu Pronunciation [a] [o] [ə] [ɤ] [rə] [rɤ] See also the note under Table 2. However, tolong appears to be merely a stylistic variant of tarung (U+A9B4 JAVANESE VOWEL SIGN TARUNG), therefore the disunification of tolong from tarung is likely a mistake. 3 Proposal The Unicode Standard needs to recommend how the inappropriately disunified character U+A9B5 JAVANESE VOWEL SIGN TOLONG should be handled. 1 In particular, clarification in the names list and the Core Specification is necessary for explaining the background of the mis-disunification and recommending how both the tarung and tolong forms for both the Javanese and Sundanese languages should be implemented.
    [Show full text]
  • "9-41516)9? "9787:)4 ;7 -6+7,- )=1 16 ;0- & $
    L2/20-256 "9-41516)9?"9787:)4;7-6+7,-)=116;0-&$ ᭛᭜᭛ <;079 ,1;?))?<"-9,)6)215-14,7;3755/5)14+75 40)5 <9=)6:)0140)56<9=)6:)0/5)14+75 );- ;0$-8;-5*-9 6;97,<+;176 ,=:#6L>H8G>EI>H6=>HIDG>86AG6=B>76H:9H8G>EI;DJC9>CK6G>DJH>CH8G>EI>DCH6C96GI:;68IHEGD9J8:97:IL::CI=: I=6C9I=: I=8:CIJGN>C>CHJA6G+DJI=:6HIH>6A6G<:EDGI>DCD;>IH8DGEJH>H;DJC9>C"6K67JI#6L>B6I:G>6AH =6K:6AHD7::C;DJC9>C+JB6IG6%6A6N(:C>CHJA66A>6C9I=:(=>A>EE>C:H,=:H8G>EI>H;G:FJ:CIAN6HHD8>6I:9L>I= I=:'A9"6K6C:H:A6C<J6<:7JIB6I:G>6AHLG>II:C>C+6CH@G>I'A9%6A6N'A96A>C:H:6C9'A9+JC96C:H:A6C<J6<: =6H6AHD7::C;DJC9>CI=:#6L>H8G>EIGDBI=:B>9I=8:CIJGNH>BEA:;JC8I>DC6A#6L>L6HL>9:ANJH:9IDG:8DG9 A6C9 <G6CIH GDN6A :9>8IH 6C9 H>B>A6G 8=6C8:GN 9D8JB:CIH ,DL6G9H I=: :C9 D; I=: ;>GHI B>AA:CC>JB I=: H8G>EI 7:86B:>C8G:6H>C<AN9:8DG6I>K:6C986AA><G6E=>89J:ID>IHJH:6HI=:B6>CK:=>8A:D;'A9"6K6C:H:A>I:G6GNA6C<J6<: L>I=ADC<A6HI>C<A:<68N>CI=:A>I:G6GNIG69>I>DCD;I=:BD9:GC"6K6C:H:6C96A>C:H:A6C<J6<:H$6I:G#6L>H=DLH B6CNK6G>6I>DCHDK:G6L>9:<:D<G6E=>89>HIG>7JI>DC'K:GI>B:I=:H:K6G>6CIH=6K::KDAK:9>G:8IANDG>C9>G:8IAN >CIDI=:B6CNBD9:GCG6=B>8H8G>EIHD;>CHJA6G+H>6HJ8=6H6A>C:H:6I6@"6K6C:H:$DCI6G6:I8 /=>A:I=:68I>K:JH:D;#6L>H8G>EI=6H7::CG:EA68:97NDI=:GH8G>EIHH>C8:I=: I=8:CIJGNI=:G:6G:6CJB7:GD; BD9:GC96N:CI=JH>6HIH6C98DBBJC>I>:HL=DJH:I=:H8G>EIID96N;DGDI=:GEJGEDH:HI=6C6C8>:CIG:EGD9J8I>DC ;DG:M6BEA:ID8=6I>CHD8>6A6EEA>86I>DC6C98G:6I:>B6<:EDHIH!CI=>HG:K>K6AINE:D;JH:I=:#6L>H8G>EIB6N7: JH:9IDLG>I:A6C<J6<:HI=6I6G:CDI;DJC9>C‘6JI=:CI>8’#6L>8DGEJHHJ8=6HI=:BD9:GC"6K6C:H:A6C<J6<:DG I=: !C9DC:H>6C A6C<J6<: H#6L>=6H CDI 7::C :C8D9:9>C I=: -C>8D9: N:I I=:
    [Show full text]
  • Design of Javanese Text to Speech Application
    Design of Javanese Text to Speech Application Yulia, Liliana, Rudy Adipranata, Gregorius Satia Budhi Informatics Department, Industrial Technology Faculty, Petra Christian University Surabaya, Indonesia [email protected] Abstract—Javanese is one of the many regional languages used in Indonesia. Javanese language is used by most of the population in Java. But now along with the development of the era, the use of regional languages including Javanese language is to be re- duced especially among the younger generation. One way to help conserve the use of Javanese language is to utilize information technologies, one of them is by developing a text to speech appli- cation that can be used to find out how the pronunciation of Ja- vanese language. In this paper, we discussed the design for Java- nese text to speech applications uses finite state automata. The design result will be used as rules to separate syllables when im- plementing text to speech application. Index Terms—Javanese language; Finite state automata; Text to speech. Figure 1: Basic Javanese characters I. INTRODUCTION In addition to the basic characters, the Javanese character Javanese language is a language widely spoken by the peo- has supplementary characters, consist of symbols for express- ple of Java. It is one of the regional languages of many region- ing vowels as well as a combination of two specific conso- al languages spoken in Indonesia. As one of the assets of na- nants. This supplementary characters is called sandhangan tional culture, Javanese language needs to be preserved. The and can be seen in Figure 2 [5]. younger generation is now more interested in learning a for- Symbol Example Read eign language, rather than the native Indonesian local lan- guage.
    [Show full text]
  • Ahom Range: 11700–1174F
    Ahom Range: 11700–1174F This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 14.0 This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard. See https://www.unicode.org/errata/ for an up-to-date list of errata. See https://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See https://www.unicode.org/charts/PDF/Unicode-14.0/ for charts showing only the characters added in Unicode 14.0. See https://www.unicode.org/Public/14.0.0/charts/ for a complete archived file of character code charts for Unicode 14.0. Disclaimer These charts are provided as the online reference to the character contents of the Unicode Standard, Version 14.0 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 14.0, online at https://www.unicode.org/versions/Unicode14.0.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, #44, #45, and #50, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online. See https://www.unicode.org/ucd/ and https://www.unicode.org/reports/ A thorough understanding of the information contained in these additional sources is required for a successful implementation.
    [Show full text]
  • 1 PASANGAN DAN SANDHANGAN DALAM AKSARA JAWA Oleh
    PASANGAN DAN SANDHANGAN DALAM AKSARA JAWA1 oleh: Sri Hertanti Wulan [email protected] Jurusan Pendidikan Bahasa Daerah FBS UNY Aksara nglegena yang digunakan dalam ejaan bahasa Jawa pada dasarnya terdiri atas dua puluh aksara yang bersifat silabik Darusuprapta (2002: 12-13). Masing-masing aksara mempunyai aksara pasangan, yakni aksara yang berfungsi untuk menghubungkan suku kata mati/tertutup dengan suku kata berikutnya, kecuali suku kata yang tertutup dengan wignyan (.. ), layar (....), dan cecak (....). Pasangan – pasangan tersebut antara lain: a) Aksara pasangan wutuh, ditulis di bawah aksara yang diberi pasangan dan tidak disambung, yaitu antara lain: Tabel 1 Pasangan Wutuh pasangan Wujud Contoh Ra … dalan ramé= Ya … tumbas yuyu = Ga … dalan gĕdhé = .… dolan ngomah = nga b) Aksara pasangan tugelan ditulis di belakang aksara yang diberi pasangan dan tidak disambungkan dengan aksara yang diberi pasangan, yaitu antara lain: 1Disampaikan dalam PPM PelatihanAksaraJawadan PendirianHanacaraka Centre sebagaiRevitalisasiFungsiAksaraJawa kerjasama FBS UNY dan Dinas Dikpora DIY. Dilaksanakan di Dikpora DIY, Senin 28 Oktober 2013. 1 Tabel 2.1 Pasangan Tugelan pasangan Wujud Contoh ha … adhĕm hawané = pa … bakul pĕlĕm = sa … dalan sĕpi = c) Aksara pasangan tugelan ditulis di bawah aksara yang diberi pasangan dan tidak disambungkan dengan aksara yang diberi pasangan, yaitu antara lain: Tabel 2.2 Pasangan Tugelan pasangan Wujud Contoh kilèn kalen= ka … wis takon = ta … tas larang = la … Pasangan – pasangan tersebut, bila mendapatkan sandhangan
    [Show full text]
  • Source Readings in Javanese Gamelan and Vocal Music, Volume 2
    THE UNIVERSITY OF MICHIGAN CENTER FOR SOUTH AND SOUTHEAST ASIAN STUDIES MICHIGAN PAPERS ON SOUTH AND SOUTHEAST ASIA Editorial Board Alton L. Becker Karl L. Hutterer John K. Musgrave Peter E. Hook, Chairman Ann Arbor, Michigan USA KARAWITAN SOURCE READINGS IN JAVANESE GAMELAN AND VOCAL MUSIC Judith Becker editor Alan H. Feinstein assistant editor Hardja Susilo Sumarsam A. L. Becker consultants Volume 2 MICHIGAN PAPERS ON SOUTH AND SOUTHEAST ASIA; Center for South and Southeast Asian Studies The University of Michigan Number 30 Open access edition funded by the National Endowment for the Humanities/ Andrew W. Mellon Foundation Humanities Open Book Program. Library of Congress Catalog Card Number: 82-72445 ISBN 0-89148-034-X Copyright ©' 1987 by Center for South and Southeast Asian Studies The University of Michigan Publication of this book was assisted in part by a grant from the Publications Program of the National Endowment for the Humanities. Additional funding or assistance was provided by the National Endowment for the Humanities (Transla- tions); the Southeast Asia Regional Council, Association for Asian Studies; The Rackham School of Graduate Studies, The University of Michigan; and the School of Music, The University of Michigan. Printed in the United States of America ISBN 978-0-89-148034-1 (hardcover) ISBN 978-0-47-203819-0 (paper) ISBN 978-0-47-212769-6 (ebook) ISBN 978-0-47-290165-4 (open access) The text of this book is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License: https://creativecommons.org/licenses/by-nc-nd/4.0/ CONTENTS PREFACE: TRANSLATING THE ART OF MUSIC A.
    [Show full text]
  • Proposal to Encode Balinese Archaic Jnya 1. Introduction
    Proposal to encode Balinese Archaic Jnya Ben Yang 楊�� Aditya Bayu Perdana ꦄꦢꦶꦠꦾꦧꦪꦸꦥ�ꦢꦤ Director of Technology Typographer PanLex — The Long Now Foundation [email protected] [email protected] July 10, 02019 (most recent revision) 1. Introduction Balinese is an Austronesian language spoken on the island of Bali in Indonesia. It is currently written in two scripts, Latin and Balinese. The island of Bali has a long literary history, with extensive traditional literature in Sanskrit, Kawi (Old Javanese), and Balinese all written in the Balinese script. The Balinese script is currently well-supported in Unicode, and nearly all traditional literature can be accurately encoded. However, in the process of researching some older documents, we have discovered one character that is not currently covered by the existing encoding model. The character, from now on referred to as "ARCHAIC JNYA", is not found in modern Balinese documents, but is found in older documents. It represents the sound /dʒɲa/, which in modern Balinese is depicted using the sequence BALINESE LETTER JA+BALINESE ADEG (virama) ADEG +BALINESE LETTER NYA, forming the stacking conjunct ⟨ᬚ �⟩. The output of this sequence is an as-expected Balinese conjunct form, with the second consonant subjoined on to the first consonant. On the other hand, the ARCHAIC JNYA is not visually decomposable into separate JA and NYA glyphs. In some documents both forms of JNYA are found. It does not appear that they represent a specific semantic distinction, but the ability to encode both is necessary for the accurate transcription of older Balinese documents. Interestingly, the Javanese cognate grapheme to ARCHAIC JNYA is already encoded, as JAVANESE LETTER NYA MURDA .
    [Show full text]
  • ISO/IEC JTC1/SC2/WG2 N 2005 Date: 1999-05-29
    ISO INTERNATIONAL ORGANIZATION FOR STANDARDIZATION ORGANISATION INTERNATIONALE DE NORMALISATION --------------------------------------------------------------------------------------- ISO/IEC JTC1/SC2/WG2 Universal Multiple-Octet Coded Character Set (UCS) -------------------------------------------------------------------------------- ISO/IEC JTC1/SC2/WG2 N 2005 Date: 1999-05-29 TITLE: ISO/IEC 10646-1 Second Edition text, Draft 2 SOURCE: Bruce Paterson, project editor STATUS: Working paper of JTC1/SC2/WG2 ACTION: For review and comment by WG2 DISTRIBUTION: Members of JTC1/SC2/WG2 1. Scope This paper provides a second draft of the text sections of the Second Edition of ISO/IEC 10646-1. It replaces the previous paper WG2 N 1796 (1998-06-01). This draft text includes: - Clauses 1 to 27 (replacing the previous clauses 1 to 26), - Annexes A to R (replacing the previous Annexes A to T), and is attached here as “Draft 2 for ISO/IEC 10646-1 : 1999” (pages ii & 1 to 77). Published and Draft Amendments up to Amd.31 (Tibetan extended), Technical Corrigenda nos. 1, 2, and 3, and editorial corrigenda approved by WG2 up to 1999-03-15, have been applied to the text. The draft does not include: - character glyph tables and name tables (these will be provided in a separate WG2 document from AFII), - the alphabetically sorted list of character names in Annex E (now Annex G), - markings to show the differences from the previous draft. A separate WG2 paper will give the editorial corrigenda applied to this text since N 1796. The editorial corrigenda are as agreed at WG2 meetings #34 to #36. Editorial corrigenda applicable to the character glyph tables and name tables, as listed in N1796 pages 2 to 5, have already been applied to the draft character tables prepared by AFII.
    [Show full text]
  • Malayalam Range: 0D00–0D7F
    Malayalam Range: 0D00–0D7F This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 14.0 This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard. See https://www.unicode.org/errata/ for an up-to-date list of errata. See https://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See https://www.unicode.org/charts/PDF/Unicode-14.0/ for charts showing only the characters added in Unicode 14.0. See https://www.unicode.org/Public/14.0.0/charts/ for a complete archived file of character code charts for Unicode 14.0. Disclaimer These charts are provided as the online reference to the character contents of the Unicode Standard, Version 14.0 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 14.0, online at https://www.unicode.org/versions/Unicode14.0.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, #44, #45, and #50, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online. See https://www.unicode.org/ucd/ and https://www.unicode.org/reports/ A thorough understanding of the information contained in these additional sources is required for a successful implementation.
    [Show full text]
  • An Introduction to Indic Scripts
    An Introduction to Indic Scripts Richard Ishida W3C [email protected] HTML version: http://www.w3.org/2002/Talks/09-ri-indic/indic-paper.html PDF version: http://www.w3.org/2002/Talks/09-ri-indic/indic-paper.pdf Introduction This paper provides an introduction to the major Indic scripts used on the Indian mainland. Those addressed in this paper include specifically Bengali, Devanagari, Gujarati, Gurmukhi, Kannada, Malayalam, Oriya, Tamil, and Telugu. I have used XHTML encoded in UTF-8 for the base version of this paper. Most of the XHTML file can be viewed if you are running Windows XP with all associated Indic font and rendering support, and the Arial Unicode MS font. For examples that require complex rendering in scripts not yet supported by this configuration, such as Bengali, Oriya, and Malayalam, I have used non- Unicode fonts supplied with Gamma's Unitype. To view all fonts as intended without the above you can view the PDF file whose URL is given above. Although the Indic scripts are often described as similar, there is a large amount of variation at the detailed implementation level. To provide a detailed account of how each Indic script implements particular features on a letter by letter basis would require too much time and space for the task at hand. Nevertheless, despite the detail variations, the basic mechanisms are to a large extent the same, and at the general level there is a great deal of similarity between these scripts. It is certainly possible to structure a discussion of the relevant features along the same lines for each of the scripts in the set.
    [Show full text]
  • Specifying Optional Malayalam Conjuncts
    Specifying Optional Malayalam Conjuncts Cibu Johny <[email protected]> Roozbeh Poornader <[email protected]> 2013­Jan­28 Current status Indic conjunct formation scheme currently favors the full conjunct for a given set of characters. Example: क् + ष → is prefered as opposed to क् ​ष. (KAd + SSAl → K.SSAn ) क् ​ष can be obtained by क् + ZWJ + ष which is KAd + ZWJ + SSAl → KAh + SSAn The Need In Malayalam there are two prevailing orthographies ­ traditional and reformed ­ both written with same Malayalam character set. The difference between them is typically manifested only by the font. Traditional orthography fonts accomodate lot more full conjuncts, while reformed orthography fonts would use visibile virama (Chandrakkala) separated sequences for many of those full conjuncts. For the vowel signs of U, UU, and Vocalic vowels and also for the RA­sign, reformed orthography font would use visually separate conjoining form. However, there is a definite need for the ability in a reformed orthography font to display the traditional full conjuncts on demand. As of now there is no mechanism specified in the standard to suggest a full conjunct of a cluster. The reverse case is also needed ­ a traditional orthography font might want to display reformed othrography grapheme clusters optionally. Following proposal uses ZWJ and ZWNJ insertions to achieve this need. However, potentially Chillu forming sequence <Consonant + Virama + ZWJ> is not used for any of the cases listed below. Proposal Case 1 1 The sequence <Consonant + ZWJ + Conjoining Vowel Sign> has following fallback order for display: 1. Full Conjunct 2. Consonant + non­conjoining vowel sign Example with reformed orthography font (in a reformed orthography Malayalam font that can allow optional traditional orthography) SA + Vowel Sign U → SA + ZWJ + Vowel Sign U → Case 2 <Consonant1 + ZWJ + Virama + Consonant2> has following display fallback order: 1.
    [Show full text]