Dadtex, a Full Arabic Interface

Total Page:16

File Type:pdf, Size:1020Kb

Dadtex, a Full Arabic Interface Outlines Arabic writing Arabic mathematical expressions Arabic scientific document composition DadTEX motivations DadTEX system Open problems and prospects DadTEX, a full Arabic interface M. Eddahibi, A. Lazrek, K. Sami Department of Computer Science Cadi Ayyad University Marrakesh November 9 - 11, 2006 TUG2006 http://www.ucam.ac.ma/fssm/rydarab DadTEX, a full Arabic interface Outlines 1 Arabic writing 2 Arabic mathematical expressions 3 Arabic scientific document composition 4 DadTEX motivations 5 DadTEX system 6 Open problems and prospects Arabic Writing Arabic letters represent only consonants or long vowels Optional diacritical marks for short vowels can be used to annotate text or spell it out in full when desired Arabic script is written from right to left Arabic script is cursive Arabic Writing Arabic letters represent only consonants or long vowels Optional diacritical marks for short vowels can be used to annotate text or spell it out in full when desired Arabic script is written from right to left Arabic script is cursive Arabic Writing Arabic letters represent only consonants or long vowels Optional diacritical marks for short vowels can be used to annotate text or spell it out in full when desired Arabic script is written from right to left Arabic script is cursive Arabic Writing Arabic letters represent only consonants or long vowels Optional diacritical marks for short vowels can be used to annotate text or spell it out in full when desired Arabic script is written from right to left Arabic script is cursive Arabic mathematical expressions There are two mathematical notations according to the regions: ? Latin notation with Arabic text ? genuine Arabic notation Arabic notation Spreads from right to left Uses Arabic alphabet based symbols Uses Arabic abbreviations for usual functions Uses some Latin mirrored symbols Uses curved kashida for variable sized symbols Arabic mathematical expressions There are two mathematical notations according to the regions: ? Latin notation with Arabic text ? genuine Arabic notation Arabic notation Spreads from right to left Uses Arabic alphabet based symbols Uses Arabic abbreviations for usual functions Uses some Latin mirrored symbols Uses curved kashida for variable sized symbols Arabic mathematical expressions There are two mathematical notations according to the regions: ? Latin notation with Arabic text ? genuine Arabic notation Arabic notation Spreads from right to left Uses Arabic alphabet based symbols Uses Arabic abbreviations for usual functions Uses some Latin mirrored symbols Uses curved kashida for variable sized symbols Arabic mathematical expressions There are two mathematical notations according to the regions: ? Latin notation with Arabic text ? genuine Arabic notation Arabic notation Spreads from right to left Uses Arabic alphabet based symbols Uses Arabic abbreviations for usual functions Uses some Latin mirrored symbols Uses curved kashida for variable sized symbols Arabic mathematical expressions There are two mathematical notations according to the regions: ? Latin notation with Arabic text ? genuine Arabic notation Arabic notation Spreads from right to left Uses Arabic alphabet based symbols Uses Arabic abbreviations for usual functions Uses some Latin mirrored symbols Uses curved kashida for variable sized symbols Arabic mathematical expressions Arabic alphabet symbols are dotted or dotless Uses several styles to extend the amount of symbols Arabic scientific document composition Image based method Pure painting Painting and equation editor Painting and text layout Handwritten equation digitalization TEX based method RyDArab and CurExt Arabic scientific document composition Image based method Pure painting Painting and equation editor Painting and text layout Handwritten equation digitalization TEX based method RyDArab and CurExt Pure painting Uses drawing tool to paint both text and non textual symbols The result is cut and pasted to document Painting and text layout Use of letters for alphabet based symbols and drawings for other symbols Painting and equation editor Uses an equation editor for Arabic-Latin common notation and drawing for mirrored symbols Handwritten equation digitalization Use of scanner to transform both text and mathematical expressions to image Image based Methods Advantages are WYSIWYG do not present any linguistic difficulty when used in Arabic versions Drawbacks Bad typographical quality Difficult (need some drawing skills and Image handling) Not uniform (expressions have not the same metrics) No semantic content RyDArab and CurExt Advantages TEX based systems High quality technical documents Documents can be edited using simple text editor Several choices for notations and symbols styles Variable sized symbols dimensions are automatically and transparently set Expressions can be converted into several formats (image, MathML,...) Expression can be edited easily Drawbacks No WYSIWYG interface for TEX in Arabic till now Need some English and TEX learning DadTEX Motivations Information may be reached, used and communicated in the languages of the transmitter and that of the receiver without considerations of the technical support Several trends help to go from monolingualism to multilingualism from ASCII to Unicode TEX Arabic support: Omega, ArabTEX, Aleph, Rydarab ... Multilingual fonts: STIX project XML I18n ... Several studies show that one learns better in his mother tongue DadTEX system DadTEX goals Interface for composition of LATEX documents using only Arabic Build Arabic version of TEX commands lexicon Make it easy to learn and understand TEX vocabulary for Arabic users Avoid bidirectionality problems due to the mixture of English commands and Arabic text DadTEX System Bidirectionality problems Text selection Character position Semantic ambiguity Due to the graphical swapping DadTEX System Bidirectionality problems Text selection Character position Semantic ambiguity Due to the graphical swapping DadTEX System Bidirectionality problems Text selection Character position Semantic ambiguity Due to the graphical swapping DadTEX System A conceivable solution is to use an application to convert Arabic document into its transliterated equivalent. This mechanism is similar to the one used in FarsiTEX ( ftx2tex translates Persian text to transliterated text) This solution have several deficiencies: it is not direct; it generates a supplementary file to be processed instead of the original source file the compilation time is increased additional memory and free space are required DadTEX System DadTEX is an interface that allows the creation of LATEX documents in Arabic. The whole of the document can be composed using only Arabic text with some control characters like backslash, dollar, ... It is based on the primitive \def for the translation of every commands. In some cases commands translation is not sufficient : RydArab uses transliteration for individual letter-based alphabetical symbols it is necessary to redefine the correspondences using Arabic characters. DadTEX System DadTEX is an interface that allows the creation of LATEX documents in Arabic. The whole of the document can be composed using only Arabic text with some control characters like backslash, dollar, ... It is based on the primitive \def for the translation of every commands. In some cases commands translation is not sufficient : RydArab uses transliteration for individual letter-based alphabetical symbols it is necessary to redefine the correspondences using Arabic characters. DadTEX System DadTEX is an interface that allows the creation of LATEX documents in Arabic. The whole of the document can be composed using only Arabic text with some control characters like backslash, dollar, ... It is based on the primitive \def for the translation of every commands. In some cases commands translation is not sufficient : RydArab uses transliteration for individual letter-based alphabetical symbols it is necessary to redefine the correspondences using Arabic characters. DadTEX System In TEX, besides commands for mathematical objects, there are many commands for document segmentation and content nature specification. The translation of such commands, will make it very easy for Arabic users to understand why commands like \chapter should not be used in a standard article. DadTEX structure DadTEX structure dadpreamb.tex: a set of commands used in document preambles. The encoding system in use is defined here. dadalpha.tex: here, ISO-8859-6 Arabic characters are declared as letters using the primitive \catcode dadbody.tex: a set of commands used in documents bodies dadadapt.tex: DadTEX users can add their own commands. It is like a dictionary of commands. It allows for flexible and customizable translation. Encoding DadTEX can be used with any encoding system able of representing the Arabic alphabet, as long as the encoding is supported by the packages in use. In this version, we used ISO-8859-6 instead of UTF-8 because the ArabTEX system still has some problems when it is used with UTF-8 In ISO-8859-6, Arabic characters are encoded in one byte. It is thus easy to set their category code into 11. Encoding UTF-8 Using \catcode with UTF-8 will lead to errors, because it is intended to be used with one-byte characters. Arabic characters should be divided into there two visible bytes using an ASCII text editor. and . In the case of the Omega system, the hexadecimal code can be used directly: . Sample DadTEX advantages DadTEX is based completely on TEX crossplatform supported compatible with several other
Recommended publications
  • The Arabi System — TEX Writes in Arabic and Farsi
    The Arabi system | ] ¨r` [ A\ TEX writes in Arabic and Farsi Youssef Jabri Ecole´ Nationale des Sciences Appliqu´ees, Oujda, Morocco yjabri (at) ensa dot univ-oujda dot ac dot ma Abstract In this paper, we will present a newly arrived package on CTAN that provides Arabic script support for TEX without the need for an external pre-processor. The Arabi package adds one of the last major multilingual typesetting capabilities to Babel by adding support for the Arabic ¨r and Farsi ¨FCA languages. Other languages using the Arabic script should also be more or less easily imple- mentable. Arabi comes with many good quality free fonts, Arabic and Farsi, and may also use commercial fonts. It supports many 8-bit input encodings (namely, CP-1256, ISO-8859-6 and Unicode UTF-8) and can typeset classical Arabic poetry. The package is distributed under the LATEX Project Public License (LPPL), and has the LPPL maintenance status \author-maintained". It can be used freely (including commercially) to produce beautiful texts that mix Arabic, Farsi and Latin (or other) characters. Pl Y ¾Abn Tn ®¤ Tr` ¤r Am`tF TAk t§ A\ ¨r` TEC .¤r fOt TEX > < A\ Am`tFA d¤ dnts ¨ n A\ (¨FCA ¤ ¨r)tl Am`tF TAk S ¨r` TEC , T¤rm rb Cdq tmt§¤ ¯m ¢k zymt§ A\n @h , T§db @n¤ Y At§ ¯ ¢ Y TAR . AARn £EA A \` Am`tF® A ¢± ¾AO ¾AA ¨r` dq§ . Tmlk ¨ ¤r AkJ d§dt ¨CA A` © ¨ ¨t ªW d Am`tF ¢nkm§ Am Am`tF¯ ­r ªW Tmm ¯¤ ¨A ¨r` , A\n TbsnA A w¡ Am .
    [Show full text]
  • TUGBOAT Volume 32, Number 1 / 2011
    TUGBOAT Volume 32, Number 1 / 2011 General Delivery 3 From the president / Karl Berry 4 Editorial comments / Barbara Beeton Opus 100; BBVA award for Don Knuth; Short takes; Mimi 6 Mimi Burbank / Jackie Damrau 7 Missing Mimi / Christina Thiele 9 16 years of ConTEXt / Hans Hagen 17 TUGboat’s 100 issues — Basic statistics and random gleanings / David Walden and Karl Berry 23 TUGboat online / Karl Berry and David Walden 27 TEX consulting for fun and profit / Boris Veytsman Resources 30 Which way to the forum? / Jim Hefferon Electronic Documents 32 LATEX at Distributed Proofreaders and the electronic preservation of mathematical literature at Project Gutenberg / Andrew Hwang Fonts 39 Introducing the PT Sans and PT Serif typefaces / Pavel Far´aˇr 43 Handling math: A retrospective / Hans Hagen Typography 47 The rules for long s / Andrew West Software & Tools 56 Installing TEX Live 2010 on Ubuntu / Enrico Gregorio 62 tlcontrib.metatex.org: A complement to TEX Live / Taco Hoekwater 68 LuaTEX: What it takes to make a paragraph / Paul Isambert 77 Luna — my side of the moon / Paweł Jackowski A L TEX 83 Reflections on the history of the LATEX Project Public License (LPPL)— A software license for LATEX and more / Frank Mittelbach 95 siunitx: A comprehensive (SI) units package / Joseph Wright 99 Glisterings: Framing, new frames / Peter Wilson 104 Some misunderstood or unknown LATEX2ε tricks III / Luca Merciadri A L TEX 3 108 LATEX3 news, issue 5 / LATEX Project Team Book Reviews 109 Book review: Typesetting tables with LATEX / Boris Veytsman Hints & Tricks
    [Show full text]
  • Farsitex and the Iranian TEX Community
    FarsiTEX and the Iranian TEX Community Behdad Esfahbod Computing Center Sharif University of Technology Azadi Avenue Tehran, Iran [email protected] http://behdad.org/ Roozbeh Pournader Computing Center Sharif University of Technology Azadi Avenue Tehran, Iran [email protected] http://sina.sharif.edu/~roozbeh/ Abstract FarsiTEX, a localized version of LATEX, is a bilingual Persian/English typesetting package, meeting the minimum requirements of Persian mathematical and tech- nical typography. This paper will describe FarsiTEX, together with its history, future and technicalities, its user community, and the reasons behind its success in Iran, amid its various usage and interoperability problems. It will also draw a general picture of the TEX community in Iran, and tries to describe why the community is still far from achieving its basic typographical needs. Introduction version 3.2 (Unicode Editorial Committee, 2002), lists a total of 139 letters in the script, which are The Persian language, in its contemporary form, is derivatives of about 28 basic Arabic letters. a language spoken natively in Iran, Afghanistan, The Persian typography, influenced by major and Tajikistan. The local forms are known as calligraphic practices of the pre-printing era, is ac- Farsi, Dari, and Tajiki respectively. They all use tually based on the famous Naskh style, which more the same basic vocabulary and grammar, but there than 99% of contemporary texts published in it. The are differences in both pronunciations and modern alternate style, Nastaliq, a little harder to read but vocabulary. In this paper, we will focus on the form considered very beautiful by the general public, and used in Iran, which is the official language of the widely known as the hardest commonly used script country.
    [Show full text]
  • TUGBOAT Volume 34, Number 1 / 2013
    TUGBOAT Volume 34, Number 1 / 2013 General Delivery 3 Ab epistulis / Steve Peter 4 Editorial comments / Barbara Beeton This is the year for TEX bug reports; Don Knuth in the news (again); A new TEX calendar; Compulsive Bodoni / the Parmigiano Typographic System; Printing technology, old and new; Interactive and collaborative on-line LATEX; Mapping math and scientific symbols to their meanings Resources 6 CTAN: Relaunch of the Web portal / Gerd Neugebauer Fonts 10 Fonts! Fonts! Fonts! / Bob Tennent Typography 14 Typographers’ Inn / Peter Flynn Graphics 17 Entry-level MetaPost: On the grid / Mari Voipio 21 Recreating historical patterns with MetaPost / Mari Voipio 26 The xpicture package / Robert Fuster A L TEX 34 Side-by-side figures in LATEX / Thomas Thurnherr 37 Glisterings: Repetition; Verbatims; Small pages; Prefixing section heads / Peter Wilson 40 The esami package for examinations / Grazia Messineo and Salvatore Vassallo Dreamboat 47 E-TEX: Guidelines for future TEX extensions — revisited / Frank Mittelbach Software & Tools 64 LuaJITTEX / Luigi Scarso ConTEXt 72 ConTEXt: Just-in-time LuaTEX / Hans Hagen 79 ConTEXt basics for users: Images / Aditya Mahajan Macros 83 New CSplain of 2012 / Petr Olˇs´ak 88 OPmac: Macros for plain TEX / Petr Olˇs´ak Hints & Tricks 96 The treasure chest / Karl Berry 97 Production notes / Karl Berry Book Reviews 98 Book review: The Computer Science of TEX and LATEX / Boris Veytsman Abstracts 99 Die TEXnische Kom¨odie: Contents of issues 4/2012–1/2013 100 Eutypon: Contents of issue 28–29 (October 2012) News 101 Calendar 102 TUG 2013 announcement Advertisements 103 TEX consulting and production services TUG Business 2 TUGboat editorial information 2 TUG institutional members 105 TUG membership form 106 TUG financial statements for 2012 / Karl Berry 107 TUG 2013 election Fiction 108 Colophon / Daniel Quinn TEX Users Group Board of Directors TUGboat (ISSN 0896-3207) is published by the TEX Donald Knuth, Grand Wizard of TEX-arcana † Users Group.
    [Show full text]
  • TUGBOAT Volume 40, Number 2 / 2019 TUG 2019 Conference
    TUGBOAT Volume 40, Number 2 / 2019 TUG 2019 Conference Proceedings TUG 2019 98 Conference sponsors, participants, program, photos 101 Henri Menke / Back to the roots: TUG 2019 in Palo Alto 104 Jennifer Claudio / TEX Users Group 2019 Annual General Meeting notes General Delivery 106 Jim Hefferon / What do today’s newcomers want? Software & Tools 108 Tomas Rokicki / Type 3 fonts and PDF search in dvips 112 Arthur Reutenauer / The state of X TE EX 113 Arthur Reutenauer / Hyphenation patterns: Licensing and stability 115 Richard Koch / MacTEX-2019, notification, and hardened runtimes 126 Uwe Ziegenhagen / Combining LATEX with Python 119 Didier Verna / Quickref: Common Lisp reference documentation as a stress test for Texinfo 129 Henri Menke / Parsing complex data formats in LuaTEX with LPEG Methods 136 William Adams / Design into 3D: A system for customizable project designs Electronic Documents 143 Martin Ruckert / The design of the HINT file format 147 Rishikesan Nair T., Rajagopal C.V., Radhakrishnan C.V. / TEXFolio — a framework to typeset XML documents using TEX 150 Aravind Rajendran, Rishikesan Nair T., Rajagopal C.V. / Neptune — a proofing framework for LATEX authors LATEX 153 Frank Mittelbach / The LATEX release workflow and the LATEX dev formats 157 Chris Rowley, Ulrike Fischer, Frank Mittelbach / Accessibility in the LATEX kernel — experiments in Tagged PDF 159 Boris Veytsman / Creating commented editions with LATEX—the commedit package 163 Uwe Ziegenhagen / Creating and automating exams with LATEX & friends Bibliographies 167 Sree
    [Show full text]
  • TUGBOAT Volume 32, Number 3 / 2011 TUG 2011 Conference
    TUGBOAT Volume 32, Number 3 / 2011 TUG 2011 Conference Proceedings TUG 2011 242 Conference program, delegates, and sponsors 245 Barbara Beeton / TUG 2011 in India Resources 248 Stefan Kottwitz / TEX online communities — discussion and content Education 251 Kannan Moudgalya / LATEX training through spoken tutorials Electronic 257 Alan Wetmore / e-Readers and LATEX Documents 261 Boris Veytsman and Michael Ware / Ebooks and paper sizes: Output routines made easier 266 Rishi T. / LATEX to ePub 269 Manjusha Joshi / A dream of computing and LATEXing together: A reality with SageTEX 272 Axel Kielhorn / Multi-target publishing 278 S.K. Venkatesan / On the use of TEX as an authoring language for HTML5 281 S. Sankar, S. Mahalakshmi and L. Ganesh / An XML model of CSS3 as an XLATEX-TEXML-HTML5 stylesheet language Typography 285 Boris Veytsman and Leyla Akhmadeeva / Towards evidence-based typography: Literature review and experiment design Bibliographies 289 Jean-Michel Hufflen / A comparative study of methods for bibliographies LATEX 302 Brian Housley / The hletter class and style for producing flexible letters and page headings 309 Didier Verna / Towards LATEX coding standards Abstracts 329 TUG 2011 abstracts (Bazargan, Crossland, Radhakrishnan, Doumont, Mittelbach, Moore, Rishi, Skoup´y, Sojka, Wujastyk) A L TEX 331 LATEX Project Team / LATEX news, issue 20 333 Brian Beitzel / The meetingmins LATEX class: Hierarchically organized meeting agendas and minutes 335 Igor Ruiz-Agundez / Collaborative LATEX writing with Google Docs 339 Peter Wilson / Glisterings:
    [Show full text]
  • Farsitex and the Iranian TEX Community
    FarsiTEX and the Iranian TEX Community Behdad Esfahbod [email protected] Roozbeh Pournader [email protected] The 23rd Conference and Annual Meeting of the TEX Users group, Trivandrum, Kerala, India September 4, 2002 What is Persian? Contemporary Persian Dari Farsi Tajiki Afghanistan Iran Tajikistan 1 The Modern Persian Script • Based on the Arabic Script • Extra letters: Peh (p), Tcheh (c), Jeh (j), and Gaf (g) • Modified letters: – Kaf (K) → Keheh (k) – Yeh (x) → Farsi Yeh (y) 2 The History of the Script • The switch from Pahlavi to Arabic happened in the 7th century CE • The adaption propagated to Pakistan, Afghanistan, India, China, Malaysia, and Java where the alphabet was extended even more: 29 basic Arabic letters → 139 letters in modern use (from Kurdish to Jawi) 3 The Persian Typography • Based on calligraphic practices – Originally Naskh (as opposed to Kufi), the Meccan style of writing Arabic – Nastaliq was invented in 15th century CE and the calligraphy switched • With lead typography it switched back to Naskh • With late 1990s proprietary digital typography tools, Nastaliq become public again, but the popularity dropped because of unreadablity 4 Persian Scientific Typography • Blossoming in 1950s by Mosahab works (who also invented Iranic) • Manual typesetting using “match stick methods” • LinoType machines in 1970s, modern publishers raised, resulting in a leap in math books 5 Localized TEXs • TEX-e-Parsi and LATEX-e-Farsi appearing in 1992 • TEX-e-Parsi, won the competition because of better quality 6 TEX-e-Parsi • Developed
    [Show full text]
  • Constitution of 8Chan Written by Senate (Draft) Formal Name Of
    Constitution of 8chan written by Senate (draft) Formal name of: Magnum Charter of 8chan Shilsl cannot control us, let us dream for our future utopian society, no matter how complex. Everything here can be changed by anyone, if you suggest, we will help. Section 0: introductions of this nation What is the State of 8chan? 1. The country of 8chan is in the IP address of 8ch.net where it stands online, 2. The nation of 8chan are created by the user and board volunteer­owner community. 3. The state of 8chan is controlled by the Forever­King (spiritual leader) and Head Army General, HotWheels, and he gives the power to the government. What unites our people 1. It is anonymity­friendly (supports Tor) and archive­friendly (8archive.moe). 2. It has wide file upload variety (from jpg/png to webm/gif/mp4 to swf & pdf ). 3. It has large upload capacity (maximum 5 files at a time 5MB total). 4. It has video embed option, oekaki (drawing) and wide variety of text options. 5. It has custom flags and banners for unique, tags and categories for sorting boards. 6. It has HTTPS and CATPCHA security and CSS customizer. 7. It had donation allowance and Booru connection services. 8. It has no login, no Adverts, no PermaBan and no Banlists. 9. It will not be effected by a corrupt up­down­vote system. Official 1. The official boards of The 8chan Government are: a. /sen/ (8Chan Senate) which uses the #/sen/ IRC channel, b. /rep/ (8Chan House of Rrepresentatives) which uses the #/rep/ IRC channel, c.
    [Show full text]
  • A Complete Tugboat Bibliography
    A Complete TUGboat Bibliography Nelson H. F. Beebe University of Utah Department of Mathematics, 110 LCB 155 S 1400 E RM 233 Salt Lake City, UT 84112-0090 USA Tel: +1 801 581 5254 FAX: +1 801 585 1640, +1 801 581 4148 E-mail: [email protected], [email protected], [email protected] (Internet) WWW URL: http://www.math.utah.edu/˜beebe/ 21 August 2021 Version 4.05 Abstract This bibliography summarizes the contents of the TEX User Group journal, TUGboat. It is produced semiautomatically by an awk program from the tables- of-contents files (tb*.cnt) released periodically by the journal editors, or since 2017 (volume 38), from journal HTML pages. Hand editing then supplies final tweaks. Title word cross-reference #10 [1422]. #12 [1459]. #13 [1534]. #15 [1588]. #7 [1335]. #8 [1335]. #9 [1422]. 3 [4501, 2831, 3022]. 6 [3219]. @ [2862]. ,! [1421]. Λ [2732]. Ω [1757, 1879, 1933, 1939]. Ω2 [3070, 3071]. " [1758, 2016]. *.bib [4140]. -.02emCAD [3304]. -.03em [3361]. -.03emenbergg [3778, 3997]. -.15em [470]. -.1em [4176, 2800]. -.1emAMS [2704]. -.1emVDU [583]. -1pt [3848, 2658]. -Babel [1879]. -based [2826]. -TEX [1758, 2016]. 1 2 .02emCAD [3304]. .03em [3361]. .03emenbergg [3778, 3997]. .05em [2704]. .05em- [2704]. .05em`ı [2959]. .06em [4215]. .06em-foundry [4215]. .15em [470]. .1em [4176, 2800, 2782]. .1em-TEX [2782]. .1emAMS [2704]. .1emVDU [583]. .dtx [3297]. .dvi [941, 1962]. .ins [3297]. 0.4 [3601]. 0.60 [3520]. 0.8 — Prologue [4279]. 0.90 [4135, 4138]. 0.999 [1876]. 000 [1000]. 1 [4260, 4476, 944, 3108, 3182, 3307, 3310, 3565, 3605, 3644, 3683, 3719, 3780, 3812, 3928, 4149, 1851, 3167, 3239, 1811, 4621, 2824, 2696, 3167].
    [Show full text]
  • Polyglossia.Pdf
    Πολυγλωσσια Polyglossia: Modern multilingual typesetting with X LE ATEX and LuaLATEX FRanÇois ChaRette ARthuR ReutenaueR∗ Bastien RoucaRiÈs JÜRgen SpitzmÜlleR 2021/04/12 v1.53 (pdf file generated on 12 April 2021) Contents 1 Introduction 4 2 Setting up multilingual documents 5 2.1 Activating languages ........................ 5 2.2 Supported languages ........................ 5 2.3 Relation to and use of Babel language names ........... 6 2.4 Using IETF language tags ...................... 8 2.5 Global options ............................ 12 3 Language-switching commands 13 3.1 Recommended commands ..................... 13 3.2 Babel commands ........................... 14 3.3 Other commands .......................... 15 3.4 Setting up alias commands ..................... 15 4 Font setup 16 ∗Current maintainer 1 5 Adapting hyphenation 16 5.1 Hyphenation exceptions ...................... 16 5.2 Hyphenation thresholds ...................... 17 5.3 Hyphenation disabling ....................... 17 6 Language-specific options and commands 18 6.1 afrikaans ............................... 18 6.2 arabic ................................. 18 6.3 armenian ............................... 19 6.4 belarusian .............................. 19 6.5 bengali ................................ 21 6.6 catalan ................................ 21 6.7 croatian ................................ 21 6.8 czech ................................. 23 6.9 dutch ................................. 24 6.10 english ................................ 24 6.11 esperanto ..............................
    [Show full text]
  • Computers and Challenges of W Riting in Persian: Explorations at The
    . 1+2 . 1+2 . Visible Language 54 Visible Language 54 ComputersChallenges andWriting of in Persian: Explorationsthe Intersection ofat CultureTechnology and Behrooz Parhami The Persian script has presented some difficulties, ever since printing presses were intro- duced in Iran in the 1600s. The appearance of typewriters more than a century ago created additional problems and the introduction of digital computers in the late 1960s added to the design challenges. These difficulties persisted, until high-resolution dot-matrix printers and display devices offered greater flexibility to font designers and the expansion of the computer market in the Middle East attracted investments on improving the Persian script for comput- ers. Nevertheless, certain peculiarities of the Persian script have led to legibility and aesthetic quality issues that persist in many cases. In this paper, I enumerate some of the features of the Persian script that made it a poor match to implementation on modern technologies and review the challenges presented by, and some of the solutions proposed for, each new generation of printing and display devices. Before delving into the technical challenges and solutions, however, I discuss the sociocultural significance of the interplay between centuries- old Persian culture/language/script and modern technology, along with associated research problems. Interestingly, the same features that make legible and aesthetically pleasing Persian printing/displaying difficult also lead to challenges in automatic text recognition. I Parhami conclude with an overview of current state of the art and areas that still need further work. Keywords – Aesthetics; Arabic script; Computers Computer display; and Computer printing; Challenges of Dot-matrix font; Writing Font design; in Persian: Keyboard; Legibility; Line-printer; Movable type; Persian script; Printing; Technology transfer; Typewriter 186 187 .
    [Show full text]
  • راﻫﻨﻤﺎی ﻧﺼﺐ و اﺳﺘﻔﺎده از LATEX Persian ﻞﻣﺎﺷ E
    راﻫﻨﻤﺎی ﻧﺼﺐ و اﺳﺘﻔﺎده از LATEX * ﺷﺎﻣﻞ X E Persian و FarsiTEX ﻣﺤﻤﺪ ﻓﺮﺷﯽ ٢ ﻣﻬﺮ ١٣٩٣ ١ ﻣﻘﺪﻣﻪ ﻧﺮم اﻓﺰار ﺣﺮوﻓﭽﯿﻨﯽ TEXﯾﮑﯽ ازﻧﺮم اﻓﺰارﻫﺎی ﻣﻌﺮوف ﺣﺮوﻓﭽﯿﻨﯽ ﻣﺘﻮن ﻋﻠﻤﯽ اﺳﺖ ﮐﻪ در ﺳﻄﺢ وﺳﯿﻌﯽ ﺟﻬﺖ ﺣﺮوﻓﭽﯿﻨﯽ ﻣﺠﻼت و ﮐﺘﺐ اﺳﺘﻔﺎدهﻣﯽ ﺷﻮد. در اﯾﻦ ﻣﺘﻦ ﻣﺨﺘﺼﺮ ﺑﺮ آﻧﯿﻢ ﮐﻪ راﻫﻨﻤﺎی ﺳﺮﯾﻌﯽ ﺑﺮای ﻧﺼﺐ و اﺳﺘﻔﺎده از آن ﺑﯿﺎن ﮐﻨﯿﻢ ﺑﺎ اﯾﻦ اﻣﯿﺪ ﮐﻪ ﮐﺎرﺑﺮان ﺑﺎ ﭘﯿﮕﯿﺮی آن ﺑﻪ راﺣﺘﯽ ﺑﺘﻮاﻧﻨﺪ آن را ﻧﺼﺐ و اﺳﺘﻔﺎده ﻧﻤﺎﯾﻨﺪ. ﻗﺒﻞ از اﯾﻦ ﻻزم اﺳﺖ ﺟﻬﺖ واﺿﺢ ﺷﺪن ﺷﮑﻞ ﻋﻤﻠﮑﺮد اﯾﻦ ﻧﺮم اﻓﺰار، اﻃﻼﻋﺎﺗﯽ در ﻣﻮرد آن داﺷﺘﻪ ﺑﺎﺷﯿﻢ ﮐﻪ در اداﻣﻪ ﺑﻪ آن ﭘﺮداﺧﺘﻪﻣﯽ ﺷﻮد. ﻧﺮم اﻓﺰار ﺣﺮوﻓﭽﯿﻨﯽ TEXﯾﮏ ﻧﺮم اﻓﺰار ﻣﺠﺎﻧﯽ اﺳﺖ ﮐﻪ ﺑﻪ ﺻﻮرت ﺧﻂ ﻓﺮﻣﺎﻧﯽ ﮐﺎرﻣﯽ ﮐﻨﺪ، ﺑﻪ اﯾﻦ ﻣﻌﻨﯽ ﮐﻪ ﻣﺘﻦ ﻣﻮرد ﻧﻈﺮ در ﯾﮏ ﻓﺎﯾﻞ ﻧﻮﺷﺘﻪ ﺷﺪه و ﺳﭙﺲ اﯾﻦ ﻓﺎﯾﻞ از ﻃﺮﯾﻖ دﺳﺘﻮرات ﺧﻂ ﻓﺮﻣﺎن ﺑﻪ ﻧﺮم اﻓﺰار ﺣﺮوﻓﭽﯿﻦ TEXداده ﻣﯽ ﺷﻮد. اﯾﻦ ﻧﺮم اﻓﺰار ﻓﺎﯾﻞ داده ﺷﺪه را ﺧﻮاﻧﺪه و ﺑﺮ ﻣﺒﻨﺎی آن ﻣﺘﻦ ﺣﺮوﻓﭽﯿﻨﯽ ﺷﺪه را ﺑﻪ ﺻﻮرت ﯾﮏ ﻓﺎﯾﻞ (ﻣﺜﻼ (PDF اراﺋﻪﻣﯽ ﮐﻨﺪ. دﺳﺘﻮرات ﺧﻂ ﻓﺮﻣﺎن ﻣﺘﻌﺪدی ﺑﺮای اﺳﺘﻔﺎده از اﯾﻦ ﻧﺮم اﻓﺰار ﺣﺮوﻓﭽﯿﻦ وﺟﻮد دارد ﮐﻪ از ﻣﻬﻤﺘﺮﯾﻦ آﻧﻬﺎﻣﯽ ﺗﻮان ﺑﻪ ،pdflatex latex و xelatex اﺷﺎره ﮐﺮد. ﻣﻌﻤﻮﻻ ﻣﺎ اﯾﻦ ﺑﺨﺶ از ﻧﺮم اﻓﺰار ﺣﺮوﻓﭽﯿﻦ را ﻣﻮﺗﻮر TEXﻣﯽ ﻧﺎﻣﯿﻢ. اﯾﻦ ﺧﺎﺻﯿﺖ، اوﻟﯿﻦ ﻣﺘﻤﺎﯾﺰ ﮐﻨﻨﺪۀ اﯾﻦﻧﺮم اﻓﺰار از ﺳﺎﯾﺮﻧﺮم اﻓﺰارﻫﺎ ﻧﻈﯿﺮ Office اﺳﺖ زﯾﺮا در Office ﺷﻤﺎ ﻧﺘﯿﺠﻪ ﻧﻬﺎﯾﯽ را ﻫﻤﺰﻣﺎن ﺑﺎ ﺗﺎﯾﭗﻣﯽ ﺑﯿﻨﯿﺪ وﻟﯽ در اﯾﻦﻧﺮم اﻓﺰار ﺑﺎﯾﺪ ﻓﺎﯾﻞ را ﺑﻪ ﺣﺮوﻓﭽﯿﻦ ﺑﺪﻫﯿﺪ ﺗﺎ ﺧﻮدش ﺷﮑﻞ ﺧﺮوﺟﯽ را آﻣﺎده ﮐﻨﺪ. ﻋﻤﻼ ﺑﻪ ﻫﻤﯿﻦ دﻟﯿﻞ ﻧﯿﺰ آن راﻧﺮم اﻓﺰار ﺣﺮوﻓﭽﯿﻦﻣﯽ ﻧﺎﻣﻨﺪ، ﻣﺸﺎﺑﻪ اﯾﻦ ﮐﻪ ﺷﻤﺎ ﻣﺘﻦ ﺧﺎم ﺧﻮد را ﺑﻪ ﯾﮏ ﻓﺮد ﺣﺮوﻓﭽﯿﻦﻣﯽ دﻫﯿﺪ ﺗﺎ ﺑﺎ ﺷﮑﻞ دﻫﯽ آن در ﻗﺎﻟﺐ ﺻﻔﺤﺎت، آن را ﺑﺮای ﭼﺎپ آﻣﺎده ﮐﻨﺪ.
    [Show full text]