The Traditional Arabic Typecase Extended to the Unicode Set of Glyphs Yannis Haralambous
Total Page:16
File Type:pdf, Size:1020Kb
The Traditional Arabic Typecase Extended to the Unicode Set of Glyphs Yannis Haralambous To cite this version: Yannis Haralambous. The Traditional Arabic Typecase Extended to the Unicode Set of Glyphs. Electronic Publishing, John Wiley & Sons Ltd., 1994, 8 (2/3), pp.125-138. hal-02100480 HAL Id: hal-02100480 https://hal.archives-ouvertes.fr/hal-02100480 Submitted on 25 Apr 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. ELECTRONIC PUBLISHING, VOL. ?(?), 1 (??? 19??) The Traditional Arabic Typecase Extended to the Unicode Set of Glyphs YANNIS HARALAMBOUS Atelier Fluxus Virus 187, rue Nationale, 59800 Lille, France Fax (33) 20.40.28.64, Internet [email protected] SUMMARY Unicode, and other ªuniversalº encodings have shown the need for adapting resources until now available only for major scripts, to the ªethnicº extensions of these scripts. In this paper we de- scribe such an example: the adaptation of the traditional Arabic typecase to the needs of other languages using the Arabic script. We present an implementation of this extension : Al-Amal, AFONT based on TEX, MET , and Flex/Bison ®lters. 1 Introduction ²ws Ð *iP The ®rst Arabic book, a 5 11 cm volume titled (Book of the prayer of hours), was printed in 1514 by GreÂgoire de GreÂgoire in Venice and Fano, under the protec- tion of pope Leo the 10th ([1, p. 18±19]). It took about two centuries for Arabic book print- ing to move to the East: in 1727 the Ottoman printing agency is founded in Constantinople and starts printing using Dutch types and technology ([8, p. 156]). A similar institution was founded in Cairo in 1821. Undoubtedly a script like the Arabic one, having deep roots in calligraphy, was rather dif®cult to adapt to typography, a technique where strict standardization and repetition of forms is necessary. When Aldus Manutius created the ®rst italic font in 1501, out of manu- script calligraphic forms, he did a certain number of choicesÐand these choices became a standard of occidental typography. Similar choices had to be made for Arabic: calligraphy had to be ªtamedº, so that the result be homogeneous, reproducible, and ¯exible enough to be pleasant to the eye. 8 8 Ï âATô This standardization took place in 1906, in Cairo, when the ±A¢ ('Almat.ab`Å al'amÅarya) typecase is de®ned. This typecase (see ®g. 1 and 2), divided in four parts (as opposed to ªupperº and ªlowerº case of the Occident), uses a total of 470 characters. As- tonishing as it may seem, this typesetting system has been kept in use until today: books typeset in a traditional way, all around the Arabic world, are still using the same set of char- acters, and the same conventions and rules.1 In ®g. 1 and 2, the reader can see the four parts of this typecase. The reader knowing the technical limitations of computerized typesetting can already imagine the effect of computers on the Arabic script: not being able to cope with the com- 1 In [4, p. 102±103], a book published in 1880 (!) the reader can ®nd 30 rules for typesetting Arabic, which are still strictly applied today by traditional typographers. 0894±3982/??/??0001±1$?.?? c 19?? by John Wiley & Sons, Ltd. 2 YANNIS HARALAMBOUS casse1cis.eps Figure 1. The Cairo typecaseÐcases 1 and 2 29/6/1996 23:04 PAGE PROOFS ep96 THE ARABIC TYPECASE AND UNICODE 3 casse2cis.eps Figure 2. The Cairo typecaseÐcases 3 and 4 29/6/1996 23:04 PAGE PROOFS ep96 4 YANNIS HARALAMBOUS arabexamples.eps Figure 3. Samples of printed Arabic: Beirut 1963, Leipzig 1981, London 1992. plexity of the Cairo typecase, the computer industry has tried (and was ®nally able) to im- pose new standards of simpli®ed typesetting,2 coveringmost of the time only the fundamen- tal properties of Arabic script, without any typographical enhancement. Was it the comput- ers, which have simpli®ed Arabic printed script, or was it a deeper change in Arabic society and mentality? This is hard to say; nevertheless, even today, commercial computer typeset- ting systems areÐa few isolated exceptions apartÐunable to reach the typographic quality of 'Almat.ab`Å al'amÅarya. In ®g. 3 one can see different samples of printed Arabic material, showing the evolutionand simpli®cation of Arabic script; these examplesare extreme cases: the ®rst one is taken from a scholarly book printed in Lebanon (it contains almost all lig- atures of the 'Almat.ab`Å al'amÅarya typecase), the second from a technical book printed in East Germany (a fewer number of ligatures), and the third from a daily newspaper printed in the U.K. (almost no ligatures). This paper describes the author's solution to this problem: ¢Ï(Al-Amal), a typesetting system based on TEX (actually TEX--XET), emulating the 'Almat.ab`Å al'amÅarya typecase. 2 For more information on the Arabic script and the computer see also [3] and [10]. 29/6/1996 23:04 PAGE PROOFS ep96 THE ARABIC TYPECASE AND UNICODE 5 8 X d @ @b ,1928 ¡A j @ @y¥A ¤A@ @"¬ r £@ @g`¢ @ @`b w@ @h @ @n¢ * ® w $ý@@¬ A@ @hã w@ @Yn ¦¢ @ @sêtq¢ ,At @ @y¥A®rs @ @Gl Ð@ @D .At @ @ @ @®AG wSíô w@ @YnA @ @l ª A¢ j A@ @¢A ¤w@ @k® 8 d@ @Y @ @YtÊ wâÏ @ @yi j @ @¬ A@ @`õà @ @t`æ £@ @g`ô @ @A ¤ ¦ AÉ j ü@ @hâ @ @`¢ ª@ @tAY¬ @ @Ð @ @` ¤ A ,A@ @hAqt¬ A@ @" .)Ð@@¢Ï( @@*i[ @@b¬ @@®rHt 8 w@ @fq® ¤ A@ @yk® Ï @ @¢Ðt ¬ ¦@ @îÔdtb"ý w@ @sîô ¦¢ @ @î ª@ @å Ð@ @TÏ AÏ $ý d@ @!`¢¬ @ @ynîÐA w@ @tk¢ ,£¯ @ @nY¢ w@ @n $ý $ý A@ @¥d A¢ @ @h .r d@ @Y¢ ±d@ @Ï ª@ @bîr j A@ @¥Ay d@ @nts¢¬ @ @y ,ª@ @y @ @A A¢ ª@ @y ¬d ¤ 0$@ @l¢ ,Xdô w@ @n ¦¢ ,r@ @YtÉ £@ @g`¢ @ @¬ .A@@Ttsô d @@¥A¢Ï A@@õà ¢ Figure 4. Samples of text typeset with Al-Amal, with Cairo typecase ligatures This system (already presented in [6] and [7]), has been recently extended to the complete set of Unicode Arabic alphabet characters; problems and open questions arising from this extension are discussed at the end of the paper. 2 The Cairo typecase Arabic letters have contextual forms, depending on surrounding letters in the same word: a typical three letter word will start with a letter in initial form, followed by a letter in medial form and, ®nally, by a letter in ®nal form (the hypothetical word consisting of three times the letter `ghayn' is written G). A fourth form is used for isolated letters (this is also the form used in crosswords or Scrabble-like games, where letters have to placed in boxes, indepe- dently of their context). Some letters appear only in isolated and ®nal form (and sometimes even only in isolated form), so that the letters immediately following them must be written in initial (or isolated) form, although they are located inside the word. These are the basic contextual rules of the Arabic script: they are independent of style and medium, and are applied in all cases, without exception; they are as basic as the dot on the Latin lowercase `i', or the horizontal bar of the `t'. But besides these contextual forms, 'Almat.ab`Å al'amÅarya also combines letters into lig- ! atures, not unlike the `f' + `i' `®' phenomenon in Latin alphabet typesetting. The rules are described in [4, p. 102±103] and have been con®rmed by careful examination of various printed texts of different origin. In ®g. 4 and 5 the reader can compare the same text (actu- ally the text of the ®rst example of ®g. 3) typeset via the Al-Amal system, with and without 'Almat.ab`Å al'amÅarya ligatures. In the following we start by giving the mandatory ligatures (those that are part of every font), then we give the second level ligatures (those that are characteristic of the Cairo type- case), and ®nally we give the variant characters (form changes applied to single characters). By ª`foo-likeº, where ªfooº is some common Arabic letter, we mean all characters having 29/6/1996 23:04 PAGE PROOFS ep96 6 YANNIS HARALAMBOUS 8 ²@@ d @@b ,1928 ¡A ² @@y¥A ¤A@@ml¬ r £@@g`¢ @@`b w@@h @@n¢ w@ @H¥ °@ @l¬ A@ @h w@ @Yn ¦¢ @ @sbtq¢ ,Atm¢ @ @m @ @y¥A®rs @ @Gl .@ @®At @ @ @ @®AG wSnm w@ @YnA @ @l ª A¢ ² A@ @¢A ¤w@ @k® A@ @k® 8 wÏ @ @ym ² @ @g¬ A@ @`r¢ r@ @bt`® £@ @g`m @ @A ¤ ¦ ÐD r@ @t`¢ ª@ @tAY¬ @ @Ð @ @` ¤ A@ @m ,A@ @hAqt¬ A@ @mlk d@ @Ym @ @Ytm .)ТÏ( @@Atk @@b¬ @@®rHt ²@@Ag¢ ² A@@mh 8 w@ @fq® ¤ A@ @yk® r@ @yl A@ @ @ @¢Ðt ¬ ¦@ @yÔdtbml w@ @sym ¦¢ @ @y ª@ @¥ r@ @y ÐTÏ AÏ °@ @l d@ @mt`¢¬ @ @ynyÐA w@ @tk¢ ,£@ @ @ @nY¢ w@ @n °@ @l A@ @¥d A¢ @ @h .²@ @f d@ @Y¢ °@ @ ±d@ @gÏ ª@ @byr ² A@ @¥Ay d@ @nts¢¬ @ @y £¨ A¢ ª@ @y ¬dg® ¤ ¦@ @yl¢ ,²@ @dm w@ @n ¦¢ ,r@ @Yt¢ £@ @g`¢ @ @¬ °@ @l .A@@Ttsm d @@¥A¢Ï A@@r¢ ¢ ,ª@@y @@AH Figure 5. Samples of text typeset with Al-Amal, without Cairo typecase ligatures the same base form as letter ªfooº but different dots and other diacritics (for example, the Ä ba-like family is the set of characters , , , ,etc.).