The Traditional Arabic Typecase Extended to the Unicode Set of Glyphs Yannis Haralambous

Total Page:16

File Type:pdf, Size:1020Kb

The Traditional Arabic Typecase Extended to the Unicode Set of Glyphs Yannis Haralambous The Traditional Arabic Typecase Extended to the Unicode Set of Glyphs Yannis Haralambous To cite this version: Yannis Haralambous. The Traditional Arabic Typecase Extended to the Unicode Set of Glyphs. Electronic Publishing, John Wiley & Sons Ltd., 1994, 8 (2/3), pp.125-138. hal-02100480 HAL Id: hal-02100480 https://hal.archives-ouvertes.fr/hal-02100480 Submitted on 25 Apr 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. ELECTRONIC PUBLISHING, VOL. ?(?), 1 (??? 19??) The Traditional Arabic Typecase Extended to the Unicode Set of Glyphs YANNIS HARALAMBOUS Atelier Fluxus Virus 187, rue Nationale, 59800 Lille, France Fax (33) 20.40.28.64, Internet [email protected] SUMMARY Unicode, and other ªuniversalº encodings have shown the need for adapting resources until now available only for major scripts, to the ªethnicº extensions of these scripts. In this paper we de- scribe such an example: the adaptation of the traditional Arabic typecase to the needs of other languages using the Arabic script. We present an implementation of this extension : Al-Amal, AFONT based on TEX, MET , and Flex/Bison ®lters. 1 Introduction ²ws Ð *iP The ®rst Arabic book, a 5 11 cm volume titled (Book of the prayer of hours), was printed in 1514 by GreÂgoire de GreÂgoire in Venice and Fano, under the protec- tion of pope Leo the 10th ([1, p. 18±19]). It took about two centuries for Arabic book print- ing to move to the East: in 1727 the Ottoman printing agency is founded in Constantinople and starts printing using Dutch types and technology ([8, p. 156]). A similar institution was founded in Cairo in 1821. Undoubtedly a script like the Arabic one, having deep roots in calligraphy, was rather dif®cult to adapt to typography, a technique where strict standardization and repetition of forms is necessary. When Aldus Manutius created the ®rst italic font in 1501, out of manu- script calligraphic forms, he did a certain number of choicesÐand these choices became a standard of occidental typography. Similar choices had to be made for Arabic: calligraphy had to be ªtamedº, so that the result be homogeneous, reproducible, and ¯exible enough to be pleasant to the eye. 8 8 Ï âATô This standardization took place in 1906, in Cairo, when the ±A¢ ('Almat.ab`Å al'amÅarya) typecase is de®ned. This typecase (see ®g. 1 and 2), divided in four parts (as opposed to ªupperº and ªlowerº case of the Occident), uses a total of 470 characters. As- tonishing as it may seem, this typesetting system has been kept in use until today: books typeset in a traditional way, all around the Arabic world, are still using the same set of char- acters, and the same conventions and rules.1 In ®g. 1 and 2, the reader can see the four parts of this typecase. The reader knowing the technical limitations of computerized typesetting can already imagine the effect of computers on the Arabic script: not being able to cope with the com- 1 In [4, p. 102±103], a book published in 1880 (!) the reader can ®nd 30 rules for typesetting Arabic, which are still strictly applied today by traditional typographers. 0894±3982/??/??0001±1$?.?? c 19?? by John Wiley & Sons, Ltd. 2 YANNIS HARALAMBOUS casse1cis.eps Figure 1. The Cairo typecaseÐcases 1 and 2 29/6/1996 23:04 PAGE PROOFS ep96 THE ARABIC TYPECASE AND UNICODE 3 casse2cis.eps Figure 2. The Cairo typecaseÐcases 3 and 4 29/6/1996 23:04 PAGE PROOFS ep96 4 YANNIS HARALAMBOUS arabexamples.eps Figure 3. Samples of printed Arabic: Beirut 1963, Leipzig 1981, London 1992. plexity of the Cairo typecase, the computer industry has tried (and was ®nally able) to im- pose new standards of simpli®ed typesetting,2 coveringmost of the time only the fundamen- tal properties of Arabic script, without any typographical enhancement. Was it the comput- ers, which have simpli®ed Arabic printed script, or was it a deeper change in Arabic society and mentality? This is hard to say; nevertheless, even today, commercial computer typeset- ting systems areÐa few isolated exceptions apartÐunable to reach the typographic quality of 'Almat.ab`Å al'amÅarya. In ®g. 3 one can see different samples of printed Arabic material, showing the evolutionand simpli®cation of Arabic script; these examplesare extreme cases: the ®rst one is taken from a scholarly book printed in Lebanon (it contains almost all lig- atures of the 'Almat.ab`Å al'amÅarya typecase), the second from a technical book printed in East Germany (a fewer number of ligatures), and the third from a daily newspaper printed in the U.K. (almost no ligatures). This paper describes the author's solution to this problem: ¢Ï(Al-Amal), a typesetting system based on TEX (actually TEX--XET), emulating the 'Almat.ab`Å al'amÅarya typecase. 2 For more information on the Arabic script and the computer see also [3] and [10]. 29/6/1996 23:04 PAGE PROOFS ep96 THE ARABIC TYPECASE AND UNICODE 5 8 X ­d @ @b ,1928 ¡A j @ @y¥A ¤A@ @"¬ r £@ @g`¢ @ @`b w@ @h @ @n¢ * ® w $ý@@¬ A@ @hã w@ @Yn ¦¢ @ @sêtq¢ ,At @ @y¥A®rs @ @Gl Ð@ @D .At @ @ @ @®AG wSíô w@ @YnA @ @l ª A¢ j A@ @¢A ¤w@ @k® 8 d@ @Y @ @YtÊ wâÏ @ @yi j @ @¬ A@ @`õà @ @t`æ £@ @g`ô @ @A ¤ ¦ AÉ j ü@ @hâ @ @`¢ ª@ @tAY¬ @ @Ð @ @` ¤ A ,A@ @hAqt¬ A@ @" .)Ð@@¢Ï( @@*i[ @@b¬ @@®rHt 8 w@ @fq® ¤ A@ @yk® Ï @ @¢Ðt ¬ ¦@ @îÔdtb"ý w@ @sîô ¦¢ @ @î ª@ @å Ð@ @TÏ AÏ $ý d@ @!`¢¬ @ @ynîÐA w@ @tk¢ ,£¯ @ @nY¢ w@ @n $ý $ý A@ @¥d A¢ @ @h .r d@ @Y¢ ±d@ @Ï ª@ @bîr j A@ @¥Ay d@ @nts¢¬ @ @y ,ª@ @y @ @A A¢ ª@ @y ¬d ¤ 0$@ @l¢ ,Xdô w@ @n ¦¢ ,r@ @YtÉ £@ @g`¢ @ @¬ .A@@Ttsô d @@¥A¢Ï A@@õà ¢ Figure 4. Samples of text typeset with Al-Amal, with Cairo typecase ligatures This system (already presented in [6] and [7]), has been recently extended to the complete set of Unicode Arabic alphabet characters; problems and open questions arising from this extension are discussed at the end of the paper. 2 The Cairo typecase Arabic letters have contextual forms, depending on surrounding letters in the same word: a typical three letter word will start with a letter in initial form, followed by a letter in medial form and, ®nally, by a letter in ®nal form (the hypothetical word consisting of three times the letter `ghayn' is written G). A fourth form is used for isolated letters (this is also the form used in crosswords or Scrabble-like games, where letters have to placed in boxes, indepe- dently of their context). Some letters appear only in isolated and ®nal form (and sometimes even only in isolated form), so that the letters immediately following them must be written in initial (or isolated) form, although they are located inside the word. These are the basic contextual rules of the Arabic script: they are independent of style and medium, and are applied in all cases, without exception; they are as basic as the dot on the Latin lowercase `i', or the horizontal bar of the `t'. But besides these contextual forms, 'Almat.ab`Å al'amÅarya also combines letters into lig- ! atures, not unlike the `f' + `i' `®' phenomenon in Latin alphabet typesetting. The rules are described in [4, p. 102±103] and have been con®rmed by careful examination of various printed texts of different origin. In ®g. 4 and 5 the reader can compare the same text (actu- ally the text of the ®rst example of ®g. 3) typeset via the Al-Amal system, with and without 'Almat.ab`Å al'amÅarya ligatures. In the following we start by giving the mandatory ligatures (those that are part of every font), then we give the second level ligatures (those that are characteristic of the Cairo type- case), and ®nally we give the variant characters (form changes applied to single characters). By ª`foo-likeº, where ªfooº is some common Arabic letter, we mean all characters having 29/6/1996 23:04 PAGE PROOFS ep96 6 YANNIS HARALAMBOUS 8 ²@@ ­d @@b ,1928 ¡A ² @@y¥A ¤A@@ml¬ r £@@g`¢ @@`b w@@h @@n¢ w@ @H¥ °@ @l¬ A@ @h w@ @Yn ¦¢ @ @sbtq¢ ,Atm¢ @ @m @ @y¥A®rs @ @Gl .@ @®At @ @ @ @®AG wSnm w@ @YnA @ @l ª A¢ ² A@ @¢A ¤w@ @k® A@ @k® 8 wÏ @ @ym ² @ @g¬ A@ @`r¢ r@ @bt`® £@ @g`m @ @A ¤ ¦ ÐD r@ @t`¢ ª@ @tAY¬ @ @Ð @ @` ¤ A@ @m ,A@ @hAqt¬ A@ @mlk d@ @Ym @ @Ytm .)ТÏ( @@Atk @@b¬ @@®rHt ²@@Ag¢ ² A@@mh 8 w@ @fq® ¤ A@ @yk® r@ @yl A@ @ @ @¢Ðt ¬ ¦@ @yÔdtbml w@ @sym ¦¢ @ @y ª@ @¥ r@ @y ÐTÏ AÏ °@ @l d@ @mt`¢¬ @ @ynyÐA w@ @tk¢ ,£@ @ @ @nY¢ w@ @n °@ @l A@ @¥d A¢ @ @h .²@ @f d@ @Y¢ °@ @ ±d@ @gÏ ª@ @byr ² A@ @¥Ay d@ @nts¢¬ @ @y £¨ A¢ ª@ @y ¬dg® ¤ ¦@ @yl¢ ,²@ @dm w@ @n ¦¢ ,r@ @Yt¢ £@ @g`¢ @ @¬ °@ @l .A@@Ttsm d @@¥A¢Ï A@@r¢ ¢ ,ª@@y @@AH Figure 5. Samples of text typeset with Al-Amal, without Cairo typecase ligatures the same base form as letter ªfooº but different dots and other diacritics (for example, the Ä ba-like family is the set of characters , , , ,etc.).
Recommended publications
  • From Arabic Style Toward Javanese Style: Comparison Between Accents of Javanese Recitation and Arabic Recitation
    From Arabic Style toward Javanese Style: Comparison between Accents of Javanese Recitation and Arabic Recitation Nur Faizin1 Abstract Moslem scholars have acceptedmaqamat in reciting the Quran otherwise they have not accepted macapat as Javanese style in reciting the Quran such as recitationin the State Palace in commemoration of Isra` Miraj 2015. The paper uses a phonological approach to accents in Arabic and Javanese style in recitingthe first verse of Surah Al-Isra`. Themethod used here is analysis of suprasegmental sound (accent) by usingSpeech Analyzer programand the comparison of these accents is analyzed by descriptive method. By doing so, the author found that:first, there is not any ideological reason to reject Javanese style because both of Arabic and Javanese style have some aspects suitable and unsuitable with Ilm Tajweed; second, the suitability of Arabic style was muchthan Javanese style; third, it is not right to reject recitingthe Quran with Javanese style only based on assumption that it evokedmistakes and errors; fourth, the acceptance of Arabic style as the art in reciting the Quran should risedacceptanceof the Javanese stylealso. So, rejection of reciting the Quranwith Javanese style wasnot due to any reason and it couldnot be proofed by any logical argument. Keywords: Recitation, Arabic Style, Javanese Style, Quran. Introduction There was a controversial event in commemoration of Isra‘ Mi‘raj at the State Palacein Jakarta May 15, 2015 ago. The recitation of the Quran in the commemoration was recitedwithJavanese style (langgam).That was not common performance in relation to such as official event. Muhammad 58 Nur Faizin, From Arabic Style toward Javanese Style Yasser Arafat, a lecture of Sunan Kalijaga State Islamic University Yogyakarta has been reciting first verse of Al-Isra` by Javanese style in the front of state officials and delegationsof many countries.
    [Show full text]
  • A Practical Sanskrit Introductory
    A Practical Sanskrit Intro ductory This print le is available from ftpftpnacaczawiknersktintropsjan Preface This course of fteen lessons is intended to lift the Englishsp eaking studentwho knows nothing of Sanskrit to the level where he can intelligently apply Monier DhatuPat ha Williams dictionary and the to the study of the scriptures The rst ve lessons cover the pronunciation of the basic Sanskrit alphab et Devanagar together with its written form in b oth and transliterated Roman ash cards are included as an aid The notes on pronunciation are largely descriptive based on mouth p osition and eort with similar English Received Pronunciation sounds oered where p ossible The next four lessons describ e vowel emb ellishments to the consonants the principles of conjunct consonants Devanagar and additions to and variations in the alphab et Lessons ten and sandhi eleven present in grid form and explain their principles in sound The next three lessons p enetrate MonierWilliams dictionary through its four levels of alphab etical order and suggest strategies for nding dicult words The artha DhatuPat ha last lesson shows the extraction of the from the and the application of this and the dictionary to the study of the scriptures In addition to the primary course the rst eleven lessons include a B section whichintro duces the student to the principles of sentence structure in this fully inected language Six declension paradigms and class conjugation in the present tense are used with a minimal vo cabulary of nineteen words In the B part of
    [Show full text]
  • "9-41516)9? "9787:)4 ;7 -6+7,- )=1 16 ;0- & $
    L2/20-256 "9-41516)9?"9787:)4;7-6+7,-)=116;0-&$ ᭛᭜᭛ <;079 ,1;?))?<"-9,)6)215-14,7;3755/5)14+75 40)5 <9=)6:)0140)56<9=)6:)0/5)14+75 );- ;0$-8;-5*-9 6;97,<+;176 ,=:#6L>H8G>EI>H6=>HIDG>86AG6=B>76H:9H8G>EI;DJC9>CK6G>DJH>CH8G>EI>DCH6C96GI:;68IHEGD9J8:97:IL::CI=: I=6C9I=: I=8:CIJGN>C>CHJA6G+DJI=:6HIH>6A6G<:EDGI>DCD;>IH8DGEJH>H;DJC9>C"6K67JI#6L>B6I:G>6AH =6K:6AHD7::C;DJC9>C+JB6IG6%6A6N(:C>CHJA66A>6C9I=:(=>A>EE>C:H,=:H8G>EI>H;G:FJ:CIAN6HHD8>6I:9L>I= I=:'A9"6K6C:H:A6C<J6<:7JIB6I:G>6AHLG>II:C>C+6CH@G>I'A9%6A6N'A96A>C:H:6C9'A9+JC96C:H:A6C<J6<: =6H6AHD7::C;DJC9>CI=:#6L>H8G>EIGDBI=:B>9I=8:CIJGNH>BEA:;JC8I>DC6A#6L>L6HL>9:ANJH:9IDG:8DG9 A6C9 <G6CIH GDN6A :9>8IH 6C9 H>B>A6G 8=6C8:GN 9D8JB:CIH ,DL6G9H I=: :C9 D; I=: ;>GHI B>AA:CC>JB I=: H8G>EI 7:86B:>C8G:6H>C<AN9:8DG6I>K:6C986AA><G6E=>89J:ID>IHJH:6HI=:B6>CK:=>8A:D;'A9"6K6C:H:A>I:G6GNA6C<J6<: L>I=ADC<A6HI>C<A:<68N>CI=:A>I:G6GNIG69>I>DCD;I=:BD9:GC"6K6C:H:6C96A>C:H:A6C<J6<:H$6I:G#6L>H=DLH B6CNK6G>6I>DCHDK:G6L>9:<:D<G6E=>89>HIG>7JI>DC'K:GI>B:I=:H:K6G>6CIH=6K::KDAK:9>G:8IANDG>C9>G:8IAN >CIDI=:B6CNBD9:GCG6=B>8H8G>EIHD;>CHJA6G+H>6HJ8=6H6A>C:H:6I6@"6K6C:H:$DCI6G6:I8 /=>A:I=:68I>K:JH:D;#6L>H8G>EI=6H7::CG:EA68:97NDI=:GH8G>EIHH>C8:I=: I=8:CIJGNI=:G:6G:6CJB7:GD; BD9:GC96N:CI=JH>6HIH6C98DBBJC>I>:HL=DJH:I=:H8G>EIID96N;DGDI=:GEJGEDH:HI=6C6C8>:CIG:EGD9J8I>DC ;DG:M6BEA:ID8=6I>CHD8>6A6EEA>86I>DC6C98G:6I:>B6<:EDHIH!CI=>HG:K>K6AINE:D;JH:I=:#6L>H8G>EIB6N7: JH:9IDLG>I:A6C<J6<:HI=6I6G:CDI;DJC9>C‘6JI=:CI>8’#6L>8DGEJHHJ8=6HI=:BD9:GC"6K6C:H:A6C<J6<:DG I=: !C9DC:H>6C A6C<J6<: H#6L>=6H CDI 7::C :C8D9:9>C I=: -C>8D9: N:I I=:
    [Show full text]
  • Text and Translation
    TEXT AND TRANSLATION ~ _5) _5) ,,, .,.,.., / 9 6/J 11 //1 ? 4-?? ~ 'E!l)< ./VVJ ß/ q #-#J::z__ ~.)/ ~r~~1:· "(% ~~~ /»? ~~i~~~~?!j' /1/Y} ~·~ -Y,) A:-1 "? d\4 t'LO ~' 'gL_ c.;. /VI rM YJ?J N;!{ f4.7f!/r»? M ~ -n \ /VYJ" ~--" ~ ,q.-N(__ cp \ . c #: .\ ~ IM"'? ß-1-1 ~ rq "} ;<(~· ~' /W? 7 7~/»1;:'7~ t;l t"l ' / nlf-.... / lvYJ -r/'.<7 ~ l:lj 1 %l\:__ ~7 -?~ ~~ 1~''t,~ ~~tm44$' ~ q #"() "7 ~ nJ1' ~. Z()? :J...;z_ ~ 7 .. h'J.r. n U L.'J ~ -::}~ hJ1 ~ 1 ~-- >l:lj IV'YJ -M 4n? ~ ~ #\1, Y/ N ~l"l­ W' ~--:-/ ~ "7' J~k'r. ' :>'1 dJ) 4. ?/ "'Y1 ~ ~ /D} "'YJ ~ d~1 , /~ I '-i ~ · ,.,." am ~"""·~ ~' 7""'' // (.;,_ ~ ' #-1 1{.-;jCJ /D) 7J ;YJ1 10J -n~- ß'-<77 ~&tm'JI'~ c.ljlff/.t?'Y~'1! ~. ~<l ~ -"! t /1-(J /!:"4 ~ ~ 17\ /VJ1 -rflj r.7 //WJ ~ 1 ~ ~ ~- ~ ~-:'( 4n'J tVYJ . '--·· G ~/ 17 ~ 4 ft? q-/4v? /1?'J·-YJ, LI~ M,$j~~-.,,~~e3 ( ~ Facsimile of stanzas 1-2 of Canto I from Ms. A ( [R C'l I -~~QM~.,~~ltlll~ir..l~ ID~1ftlllla-. t.M\5\_\,IfliiPII~;,ItQII, i~_-1J:1~~,,~~~~ ~~u~.,~t~1~"\~öl~ ,"SlliQI\ 1 Vlli-'IKI~-IKI.-I~":J"JCVI~ ~~WII" Kl ~ta,~ lllll<ll-i\·, PI., ~j\~nin44leuO,~l~ Ul'Yl D ~·~~Er U~JA~6Ul ~ .,~ ( 't.& U ~ l ~~~riU~®ftO<ln;1 lltlll\ttQ .-..G~~~1(} 11>1J~la:a&""'''' ~~n~ a "' .R) D Q _ Qlb. ~~tM~\~ ~ ~ ~, ~~t:llö­ ltO--lt&DIUI-1 ~-. n "ln 11-11....1\ • ~ r.u ~ t;j ~ ·-aclct~~.Ll\,11~--'\'=:otO :>< ,..
    [Show full text]
  • Introduction to Old Javanese Language and Literature: a Kawi Prose Anthology
    THE UNIVERSITY OF MICHIGAN CENTER FOR SOUTH AND SOUTHEAST ASIAN STUDIES THE MICHIGAN SERIES IN SOUTH AND SOUTHEAST ASIAN LANGUAGES AND LINGUISTICS Editorial Board Alton L. Becker John K. Musgrave George B. Simmons Thomas R. Trautmann, chm. Ann Arbor, Michigan INTRODUCTION TO OLD JAVANESE LANGUAGE AND LITERATURE: A KAWI PROSE ANTHOLOGY Mary S. Zurbuchen Ann Arbor Center for South and Southeast Asian Studies The University of Michigan 1976 The Michigan Series in South and Southeast Asian Languages and Linguistics, 3 Open access edition funded by the National Endowment for the Humanities/ Andrew W. Mellon Foundation Humanities Open Book Program. Library of Congress Catalog Card Number: 76-16235 International Standard Book Number: 0-89148-053-6 Copyright 1976 by Center for South and Southeast Asian Studies The University of Michigan Printed in the United States of America ISBN 978-0-89148-053-2 (paper) ISBN 978-0-472-12818-1 (ebook) ISBN 978-0-472-90218-7 (open access) The text of this book is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License: https://creativecommons.org/licenses/by-nc-nd/4.0/ I made my song a coat Covered with embroideries Out of old mythologies.... "A Coat" W. B. Yeats Languages are more to us than systems of thought transference. They are invisible garments that drape themselves about our spirit and give a predetermined form to all its symbolic expression. When the expression is of unusual significance, we call it literature. "Language and Literature" Edward Sapir Contents Preface IX Pronounciation Guide X Vowel Sandhi xi Illustration of Scripts xii Kawi--an Introduction Language ancf History 1 Language and Its Forms 3 Language and Systems of Meaning 6 The Texts 10 Short Readings 13 Sentences 14 Paragraphs..
    [Show full text]
  • Balinese Romanization Table
    Balinese Principal consonants1 (h)a2 ᬳ ᭄ᬳ na ᬦ ᭄ᬦ ca ᬘ ᭄ᬘ᭄ᬘ ra ᬭ ᭄ᬭ ka ᬓ ᭄ᬓ da ᬤ ᭄ᬤ ta ᬢ ᭄ᬢ sa ᬲ ᭄ᬲ wa ᬯ la ᬮ ᭄ᬙ pa ᬧ ᭄ᬧ ḍa ᬟ ᭄ᬠ dha ᬥ ᭄ᬟ ja ᬚ ᭄ᬚ ya ᬬ ᭄ᬬ ña ᬜ ᭄ᬜ ma ᬫ ᭄ᬫ ga ᬕ ᭄ᬕ ba ᬩ ᭄ᬩ ṭa ᬝ ᭄ᬝ nga ᬗ ᭄ᬗ Other consonant forms3 na (ṇa) ᬡ ᭄ᬡ ca (cha) ᭄ᬙ᭄ᬙ ta (tha) ᬣ ᭄ᬣ sa (śa) ᬱ ᭄ᬱ sa (ṣa) ᬰ ᭄ᬰ pa (pha) ᬨ ᭄ᬛ ga (gha) ᬖ ᭄ᬖ ba (bha) ᬪ ᭄ᬪ ‘a ᬗ᬴ ha ᬳ᬴ kha ᬓ᬴ fa ᬧ᬴ za ᬚ᬴ gha ᬕ᬴ Vowels and other agglutinating signs4 5 a ᬅ 6 ā ᬵ ᬆ e ᬾ ᬏ ai ᬿ ᬐ ĕ ᭂ ö ᭃ i ᬶ ᬇ ī ᬷ ᬈ o ᭀ ᬑ au ᭁ ᬒ u7 ᬸ ᬉ ū ᬹ ᬊ ya, ia8 ᭄ᬬ r9 ᬃ ra ᭄ᬭ rĕ ᬋ rö ᬌ ᬻ lĕ ᬍ ᬼ lö ᬎ ᬽ h ᬄ ng ᬂ ng ᬁ Numerals 1 2 3 4 5 ᭑ ᭒ ᭓ ᭔ ᭕ 6 7 8 9 0 ᭖ ᭗ ᭘ ᭙ ᭐ 1 Each consonant has two forms, the regular and the appended, shown on the left and right respectively in the romanization table. The vowel a is implicit after all consonants and consonant clusters and should be supplied in transliteration, unless: (a) another vowel is indicated by the appropriate sign; or (b) the absence of any vowel is indicated by the use of an adeg-adeg sign ( ). (Also known as the tengenen sign; ᭄ paten in Javanese.) 2 This character often serves as a neutral seat for a vowel, in which case the h is not transcribed.
    [Show full text]
  • The Traditional Arabic Typecase, Unicode, Tex and METAFONT Yannis Haralambous
    The Traditional Arabic Typecase, Unicode, TeX and METAFONT Yannis Haralambous To cite this version: Yannis Haralambous. The Traditional Arabic Typecase, Unicode, TeX and METAFONT. Tugboat, TeX Users Group, 1997, 18 (1), pp.17-29. hal-02101612 HAL Id: hal-02101612 https://hal.archives-ouvertes.fr/hal-02101612 Submitted on 18 Apr 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. TUGboat, Volume 18 (1997), No. 1 17 finally able) to impose new standards of simplified Philology typesetting,2 most of the time covering only the fun- damental properties of Arabic script, without any typographical enhancement. Was it the computers, The Traditional Arabic Typecase, Unicode, which have simplified Arabic printed script, or was TEXand METAFONT it a deeper change in Arabic society and mental- Yannis Haralambous∗ ity? This is hard to say; nevertheless, even today, commercial computer typesetting systems are — a 1 Introduction few isolated exceptions apart — unable to reach the The first Arabic book, a 5 × 11 cm volume titled typographic quality of ’Almat¯˙ab‘ al’am¯arya.In fig. 2, one can see different samples of printed Arabic TswZ Pi (Book of the prayer of hours), was printed in 1514 by Grégoire de Grégoire in material, showing the evolution and simplification Venice and Fano, under the protection of Pope Leo of Arabic script; these examples are extreme cases: the 10th [1, p.
    [Show full text]
  • Translation As Commentary in the Sanskrit-Old Javanese Didactic and Religious Literature from Java and Bali Andrea Acri and Thomas M
    Translation as Commentary in the Sanskrit-Old Javanese Didactic and Religious Literature from Java and Bali Andrea Acri and Thomas M. Hunter* This article discusses the dynamics of translation and exegesis documented in the body of Sanskrit-Old Javanese Śaiva and Buddhist technical literature of the tutur/tattva genre, composed in Java and Bali in the period from c. the ninth to the sixteenth century. The texts belonging to this genre, mainly preserved on palm-leaf manuscripts from Bali, are concerned with the reconfiguration of Indic metaphysics, philosophy, and soteriology along localized lines. Here we focus on the texts that are built in the form of Sanskrit verses provided with Old Javanese prose exegesis – each unit forming a »translation dyad«. The Old Javanese prose parts document cases of linguistic and cultural »localization« that could be regarded as broadly corresponding to the Western categories of translation, paraphrase, and commen- tary, but which often do not fit neatly into any one category. Having introduced the »vyākhyā-style« form of commentary through examples drawn from the early inscriptional and didactic literature in Old Javanese, we present key instances of »cultural translations« as attested in texts composed at different times and in different geo- graphical and religio-cultural milieus, and describe their formal features. Our aim is to do- cument how local agents (re-)interpreted, fractured, and restated the messages conveyed by the Sanskrit verses in the light of their contingent contexts, agendas, and prevalent exegeti- cal practices. Our hypothesis is that local milieus of textual production underwent a pro- gressive »drift« from the Indic-derived scholastic traditions that inspired – and entered into a conversation with – the earliest sources, composed in Central Java in the early medieval period, and progressively shifted towards a more embedded mode of production in East Java and Bali from the eleventh to the sixteenth century and beyond.
    [Show full text]
  • JTC1/SC2/WG2 N2827 Proposal of 4 Myanmar Semivowels - Page: 1/13 Version: 22-Jun-04 8:23 AM
    JTC1/SC2/WG2 N2827 Proposal of 4 Myanmar semivowels - page: 1/13 Version: 22-Jun-04 8:23 AM Title: Proposal of 4 Myanmar Semivowels Source: Myanmar Unicode and NLP Research Center Status: National Contribution Action: For consideration by WG2 Date: 21-June-2004 This document requests additional characters to be added to the UCS within Myanmar block. It also requests to pull 4 semivowel signs out of virama model. This contains the proposal summary form. A. Administrative 1. Title Proposal of 4 Myanmar semivowels 2. Requester’s name Myanmar Unicode & NLP Research Center 3. Requester type (Member body/Liaison/Individual contribution) National contribution. 4. Submission date 2004-06-21 5. Requester’s reference (if applicable) Thein OO, President, MCF <[email protected]> Thein HTUT, Secretary, MCSA Tun TINT, Member, Myanmar Language Commission Zaw HTUT, Program Manager, Myanmar UNLP Research Center <[email protected]> Ngwe TUN, Program Manager, Myanmar UNLP Research Center <[email protected]> c/o: MYANMAR UNICODE AND NLP RESEARCH CENTER Myanmar Computer Federation B1R1, Myanmar ICT Park, Universities' Hlaing Campus, 11052, Yangon, Myanmar Tel: +95-1-652307 eFax: +1-707-988-0300 Email: [email protected] , [email protected] Internet: http://www.mcf.org.mm , http://myanmars.net/unicode/ 6. Choose one of the following: 6a. This is a complete proposal Yes. 6b. More information will be provided later No. B. Technical – General 1. Choose one of the following: 1a. This proposal is for a new script (set of characters) No. Proposed name of script 1b. The proposal is for addition of character(s) to an existing block Yes.
    [Show full text]
  • Unicode Formation & Rules & Problems with Unicode Rules
    Unicode Formation & Rules & Problems with Unicode Rules 1. Following problem is not well understood by Unicode. "A consonant following (preceding) a consonant is represented by a modifier called consonant modifier if the shape of other character to which it is attached, É remains unaltered. Among consonant modifier (ya-phala) is the most frequent k one. This character is a presentation form of the character (Ya, U+09AF). It takes this shape depending upon the context. It could be applied to nominal form of any consonant or conjunct or even to the vowel. Application of this character may either bring in repetitive sound or 'ya'-ness of the character it is applied to. Though it is a presentation form of an existing character, it should be treated like a separate character because of the two reasons. This character may appear along with a vowel where the conjunct formation rules cannot be applied. As for example: A + É + ¡ = AÉ¡ l k Apart from that, (Ra, U+09B0) and (Ya, U+09AF) combination has two different conjunction forms: l + k = kÑ l + k = lÉ Using the same conjunct rule both of them cannot be formed. If it is treated as a combination character, rather than a original form of Ya, both of these issues could be solved." "In The following, technical issues related to few characters (including some special characters) are the separately elaborated: v Khanda-ta ( ) It is a combining character, which is widely used in Bangla script. For some qvLÇfe, transliterated words it might appear at the beginning. Examples: jqv, v¢Sm¡XÑ e¡vp£ and " 2.
    [Show full text]
  • Information, Characters, Unicode
    Information, Characters, Unicode Unicode © 24 August 2021 1 / 107 Hidden Moral Small mistakes can be catastrophic! Style Care about every character of your program. Tip: printf Care about every character in the program’s output. (Be reasonably tolerant and defensive about the input. “Fail early” and clearly.) Unicode © 24 August 2021 2 / 107 Imperative Thou shalt care about every Ěaracter in your program. Unicode © 24 August 2021 3 / 107 Imperative Thou shalt know every Ěaracter in the input. Thou shalt care about every Ěaracter in your output. Unicode © 24 August 2021 4 / 107 Information – Characters In modern computing, natural-language text is very important information. (“number-crunching” is less important.) Characters of text are represented in several different ways and a known character encoding is necessary to exchange text information. For many years an important encoding standard for characters has been US ASCII–a 7-bit encoding. Since 7 does not divide 32, the ubiquitous word size of computers, 8-bit encodings are more common. Very common is ISO 8859-1 aka “Latin-1,” and other 8-bit encodings of characters sets for languages other than English. Currently, a very large multi-lingual character repertoire known as Unicode is gaining importance. Unicode © 24 August 2021 5 / 107 US ASCII (7-bit), or bottom half of Latin1 NUL SOH STX ETX EOT ENQ ACK BEL BS HT LF VT FF CR SS SI DLE DC1 DC2 DC3 DC4 NAK SYN ETP CAN EM SUB ESC FS GS RS US !"#$%&’()*+,-./ 0123456789:;<=>? @ABCDEFGHIJKLMNO PQRSTUVWXYZ[\]^_ `abcdefghijklmno pqrstuvwxyz{|}~ DEL Unicode Character Sets © 24 August 2021 6 / 107 Control Characters Notice that the first twos rows are filled with so-called control characters.
    [Show full text]
  • The Unicode Standard, Version 3.0, Issued by the Unicode Consor- Tium and Published by Addison-Wesley
    The Unicode Standard Version 3.0 The Unicode Consortium ADDISON–WESLEY An Imprint of Addison Wesley Longman, Inc. Reading, Massachusetts · Harlow, England · Menlo Park, California Berkeley, California · Don Mills, Ontario · Sydney Bonn · Amsterdam · Tokyo · Mexico City Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and Addison-Wesley was aware of a trademark claim, the designations have been printed in initial capital letters. However, not all words in initial capital letters are trademark designations. The authors and publisher have taken care in preparation of this book, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode®, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. If these files have been purchased on computer-readable media, the sole remedy for any claim will be exchange of defective media within ninety days of receipt. Dai Kan-Wa Jiten used as the source of reference Kanji codes was written by Tetsuji Morohashi and published by Taishukan Shoten. ISBN 0-201-61633-5 Copyright © 1991-2000 by Unicode, Inc. All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording or other- wise, without the prior written permission of the publisher or Unicode, Inc.
    [Show full text]