The Arabi System — TEX Writes in Arabic and Farsi

Total Page:16

File Type:pdf, Size:1020Kb

The Arabi System — TEX Writes in Arabic and Farsi The Arabi system | ] ¨r` [ A\ TEX writes in Arabic and Farsi Youssef Jabri Ecole´ Nationale des Sciences Appliqu´ees, Oujda, Morocco yjabri (at) ensa dot univ-oujda dot ac dot ma Abstract In this paper, we will present a newly arrived package on CTAN that provides Arabic script support for TEX without the need for an external pre-processor. The Arabi package adds one of the last major multilingual typesetting capabilities to Babel by adding support for the Arabic ¨r and Farsi ¨FCA languages. Other languages using the Arabic script should also be more or less easily imple- mentable. Arabi comes with many good quality free fonts, Arabic and Farsi, and may also use commercial fonts. It supports many 8-bit input encodings (namely, CP-1256, ISO-8859-6 and Unicode UTF-8) and can typeset classical Arabic poetry. The package is distributed under the LATEX Project Public License (LPPL), and has the LPPL maintenance status \author-maintained". It can be used freely (including commercially) to produce beautiful texts that mix Arabic, Farsi and Latin (or other) characters. Pl Y ¾Abn Tn ®¤ Tr` ¤r Am`tF TAk t§ A\ ¨r` TEC .¤r fOt TEX > < A\ Am`tFA d¤ dnts ¨ n A\ (¨FCA ¤ ¨r)tl Am`tF TAk S ¨r` TEC , T¤rm rb Cdq tmt§¤ ¯m ¢k zymt§ A\n @h , T§db @n¤ Y At§ ¯ ¢ Y TAR . AARn £EA A \` Am`tF® A ¢± ¾AO ¾AA ¨r` dq§ . Tmlk ¨ ¤r AkJ d§dt ¨CA A` © ¨ ¨t ªW d Am`tF ¢nkm§ Am Am`tF¯ ­r ªW Tmm ¯¤ ¨A ¨r` , A\n TbsnA A w¡ Am . ¾® E¤dn§¤ A\ . Am`tF¯ ºAn ¯ ¢lim`atsu lk§ 1 Introduction 4. Free (as in freedom), meaning a license like the The development of Arabi1 was a response to the GNU GPL or LPPL. absence of a package that manipulates the Arabic Arabi comes with an extensive user manual; this script and fulfills the following requirements: article gives a general overview of the system. 1. LATEX 2" and Babel compliant, this combina- 2 Typesetting Arabic with T X: the tion format/package being the most widely used E existing possibilities in our opinion when mixing different languages. 2. The possibility of using 8-bit input text includ- TEX and the Arabic script have a long history. ing already existing Arabic texts, on different One might imagine that enabling TEX to write systems. in both directions Right-to-Left (R2L) and Left-to- 3. Able to use existing, commercial and free, beau- Right (L2R) with an Arabic font suffices to typeset tiful Arabic fonts. Arabic with TEX. 1 Unfortunately, although such an extended TEX The name of the package should not be misunderstood. may perhaps be used to typeset a R2L language like It is not designed to support only the Arabic language, but all languages that use the Arabic script. Technically speaking, Hebrew, this is far from sufficient for a complex for Babel, they will all be considered as dialects of Arabic. script like Arabic, where the shapes of the glyphs TUGboat, Volume 27 (2006), No. 2 | Proceedings of the 2006 Annual Meeting 147 Youssef Jabri depend on the context, and may take many forms 3 Arabic script specifics (at least four forms for the majority of Arabic char- 2 The Arabic script is one of the most widely used acters even in the simplest cases). scripts on earth. It dominates in Arabic countries, Many early attempts have been made; they all of course, but has a special place for all Muslims relied on a preprocessor that does the contextual because it's the script used to write the Koran, the analysis (also known as the shaping algorithm). holy book of Muslims. One attempt, not widely known, due to Terry The Arabic script, like all other Semitic lan- Regier from the University of California, Berkeley, dating from December 1990, relied on the famous guages, is written from Right-to-Left. macros of D. Knuth and P. MacKay: Another important aspect of the Arabic script is that no hyphenation is needed, or allowed at all. %The lines below are from Knuth and MacKay % TUGboat vol.8, #1, page 14. So, no hyphenation patterns are needed for any lan- \font\revrm=xbmc10 \hyphenchar\revrm=-1 guages that uses the Arabic script. In very old Ara- \catcode`\|=\active bic documents, words could be split after a non- \def|#1|{{\revrm \reflect#1\empty\tcelfer}} connecting character, while characters that connect \def\reflect#1#2\tcelfer{\ifx#1\empty\else% were never split. In modern Arabic, hyphenation is \reflect#2\tcelfer#1\fi} forbidden completely. This makes it more difficult to do the reflection, after a preprocessor has done a to get justification when long words occur at the rough contextual analysis. end of a line, but Arabic is also cursive and has (in The pioneering work by Knuth and MacKay modern fonts mimicking the handwritten forms) a [11], who implemented the TEX bidirectional algo- special character called kashida or tatweel (keshideh rithm (which is unrelated to the Unicode Bidirec- is a Farsi word that means stretch) that may be used tional Algorithm; the latter implicitly chooses the between adjoining characters to make the word be- directions of the text) and added to TEX the four come longer. An example is the following word: primitives (\beginL, \endL, \beginR and \endR) A that may be written to occupy longer Aþ made things much better! and longer Aþþ and much more longer space Some early attempts were also carried out by Aþþþþ. Y. Haralambous, who used the new extended en- 3 3.1 The Arabic alphabet gine TEX--XET. This includes the non-free Al-Amal 4 (1992, [6]), and the free ArabiTEX (April 1995). The Arabic alphabet is caseless, but most letters The most widely used system at present is prob- have either two or four forms. The different forms ably K. Lagally's ArabTEX [13]. It is a package for are used according to the letter's position in the writing Arabic in several languages using the Ara- word (initial, medial, final and isolated). The al- bic script. It consists of a TEX macro package and phabet is constituted in its basic form by one Arabic Naskhi-like font. ArabTEX will run with • 28 consonants (29 if we count the hamza). But Plain TEX and LATEX; and work with any TEX en- the number of 28 characters can exceed easily gine, because it uses its own bidirectional algorithm. 1000 glyphs per font if all ligatures are present! So, no preprocessor is needed! This makes it a little slow but with today's computer power, this is not Isolated Initial Medial Final really a problem. Its real drawback lies in the fact that the macros apparently depend heavily on the b glyphs of the font it uses, making it quite impossi- ble to use any other fonts that may be available to ` the user. For courageous users, there also exist two more £ ¡ h ¢ powerful systems Table 1: Some characters' contextual forms • Ω by Y. Haralambous and J. Plaice, and • X TE EX by J. Kew, if you have the right system • Seven diacritical marks specifying the vowels. and the right fonts. They are not used in typical Arabic texts but 2 Through typographical simplifications. Some aspects of appear in poetry, textbooks for people learning traditional Arabic typography are described in [5]. 3 the Arabic language, and some religious texts. We did not review it, as it was not available to the public They can be typed and then at the moment of as far as we know. 4 The source and a DOS executable of the preprocessor compiling the document, can be either included were available through the French TUG. or omitted according to the author's wish! The 148 TUGboat, Volume 27 (2006), No. 2 | Proceedings of the 2006 Annual Meeting The Arabi system | T X writes in Arabic and Farsi ] ¨r` [ A\ E three basic ones are called fatha, damma and fonts, using the (quite limited) ligature possi- bilities of METAFONT. kasra: ; the sukun is used þiþþþþþuþþþþaþ þ"þ This second point is the whole secret of Arabi's com- patibility with most available packages. We tried to for the absence of vowels; and there are three shorten T X coding to deal with the specifics of the tanwin forms written by doubling the three ba- E Arabic script as much as we could, to avoid eventual conflicts and clashes with other code. sic ones: . þÀþþþ¿þþþþ¾þþ The system is also compatible with all other for- mats, such as plain or ConT Xt. This too is because The vowel marks are written somewhat like E the whole contextual analysis is done in the fonts! accents in the Latin script. Above, the drawn line represents the baseline, with the vowels that 4.1 Input and font encodings appear above the line being typeset above let- ters, while those below the line are typeset be- Typesetting Arabic and Farsi texts with TEX implies low letters. the use of special input and output encodings, so we need to use the standard packages inputenc and 3.2 Arabic typography fontenc. We use two special font encodings. For Arabic, This aspect of Arabic merits much investigation and we use LAE for Local Arabic Encoding, while for Farsi so much can be said about it. But in order not to we use LFE that stands for Local Farsi Encoding. be too lengthy, we will just cite three points. These two encoding are not final. Some character In the classical Arabic literature, there are no positions may change, and some empty slots will be typographical styles like bold, italic, etc.
Recommended publications
  • X E TEX Live
    X TE EX Live Jonathan Kew SIL International Horsleys Green High Wycombe Bucks HP14 3XL, England jonathan_kew (at) sil dot org 1 X TE EX in TEX Live in the preamble are sufficient to set the typefaces through- out the document. ese fonts were installed by simply e release of TEX Live 2007 marked a milestone for the dropping the .otf or .ttf files in the computer’s Fonts X TE EX project, as the first major TEX distribution to in- folder; no .tfm, .fd, .sty, .map, or other TEX-related files clude X TE EX (version 0.996) as an integral part. Prior to had to be created or installed. this, X TE EX was a tool that could be added to a TEX setup, Release 0.996 of X T X also provides some enhance- but version and configuration differences meant that it was E E ments over earlier, pre-T X Live versions. In particular, difficult to ensure smooth integration in all cases, and it was E there are new primitives for low-level access to glyph infor- only available for users who specifically chose to seek it out mation (useful during font development and testing); some and install it. (One exception to this is the MacTEX pack- preliminary support for the use of OpenType math fonts age, which has included X TE EX for the past year or so, but (such as the Cambria Math font shipped with MS Office this was just one distribution on one platform.) Integration 2007); and a variety of bug fixes.
    [Show full text]
  • DE-Tex-FAQ (Vers. 72
    Fragen und Antworten (FAQ) über das Textsatzsystem TEX und DANTE, Deutschsprachige Anwendervereinigung TEX e.V. Bernd Raichle, Rolf Niepraschk und Thomas Hafner Version 72 vom September 2003 Dieser Text enthält häufig gestellte Fragen und passende Antworten zum Textsatzsy- stem TEX und zu DANTE e.V. Er kann über beliebige Medien frei verteilt werden, solange er unverändert bleibt (in- klusive dieses Hinweises). Die Autoren bitten bei Verteilung über gedruckte Medien, über Datenträger wie CD-ROM u. ä. um Zusendung von mindestens drei Belegexem- plaren. Anregungen, Ergänzungen, Kommentare und Bemerkungen zur FAQ senden Sie bit- te per E-Mail an [email protected] 1 Inhalt Inhalt 1 Allgemeines 5 1.1 Über diese FAQ . 5 1.2 CTAN, das ‚Comprehensive TEX Archive Network‘ . 8 1.3 Newsgroups und Diskussionslisten . 10 2 Anwendervereinigungen, Tagungen, Literatur 17 2.1 DANTE e.V. 17 2.2 Anwendervereinigungen . 19 2.3 Tagungen »geändert« .................................... 21 2.4 Literatur »geändert« .................................... 22 3 Textsatzsystem TEX – Übersicht 32 3.1 Grundlegendes . 32 3.2 Welche TEX-Formate gibt es? Was ist LATEX? . 38 3.3 Welche TEX-Weiterentwicklungen gibt es? . 41 4 Textsatzsystem TEX – Bezugsquellen 45 4.1 Wie bekomme ich ein TEX-System? . 45 4.2 TEX-Implementierungen »geändert« ........................... 48 4.3 Editoren, Frontend-/GUI-Programme »geändert« .................... 54 5 TEX, LATEX, Makros etc. (I) 62 5.1 LATEX – Grundlegendes . 62 5.2 LATEX – Probleme beim Umstieg von LATEX 2.09 . 67 5.3 (Silben-)Trennung, Absatz-, Seitenumbruch . 68 5.4 Seitenlayout, Layout allgemein, Kopf- und Fußzeilen »geändert« . 72 6 TEX, LATEX, Makros etc. (II) 79 6.1 Abbildungen und Tafeln .
    [Show full text]
  • DRAFT Arabtex a System for Typesetting Arabic User Manual Version 4.00
    DRAFT ArabTEX a System for Typesetting Arabic User Manual Version 4.00 12 Klaus Lagally May 25, 1999 1Report Nr. 1998/09, Universit¨at Stuttgart, Fakult¨at Informatik, Breitwiesenstraße 20–22, 70565 Stuttgart, Germany 2This Report supersedes Reports Nr. 1992/06 and 1993/11 Overview ArabTEX is a package extending the capabilities of TEX/LATEX to generate the Perso-Arabic writing from an ASCII transliteration for texts in several languages using the Arabic script. It consists of a TEX macro package and an Arabic font in several sizes, presently only available in the Naskhi style. ArabTEX will run with Plain TEXandalsowithLATEX2e. It is compatible with Babel, CJK, the EDMAC package, and PicTEX (with some restrictions); other additions to TEX have not been tried. ArabTEX is primarily intended for generating the Arabic writing, but the stan- dard scientific transliteration can also be easily produced. For languages other than Arabic that are customarily written in extensions of the Perso-Arabic script some limited support is available. ArabTEX defines its own input notation which is both machine, and human, readable, and suited for electronic transmission and E-Mail communication. However, texts in many of the Arabic standard encodings can also be processed. Starting with Version 3.02, ArabTEX also provides support for fully vowelized Hebrew, both in its private ASCII input notation and in several other popular encodings. ArabTEX is copyrighted, but free use for scientific, experimental and other strictly private, noncommercial purposes is granted. Offprints of scientific publi- cations using ArabTEX are welcome. Using ArabTEX otherwise requires a license agreement. There is no warranty of any kind, either expressed or implied.
    [Show full text]
  • D Igital Text
    Digital text The joys of character sets Contents Storing text ¡ General problems ¡ Legacy character encodings ¡ Unicode ¡ Markup languages Using text ¡ Processing and display ¡ Programming languages A little bit about writing systems Overview Latin Cyrillic Devanagari − − − − − − Tibetan \ / / Gujarati | \ / − Armenian / Bengali SOGDIAN − Mongolian \ / / Gurumukhi SCRIPT Greek − Georgian / Oriya Chinese | / / | / Telugu / PHOENICIAN BRAHMI − − Kannada SINITIC − Japanese SCRIPT \ SCRIPT Malayalam SCRIPT \ / | \ \ Tamil \ Hebrew | Arabic \ Korean | \ \ − − Sinhala | \ \ | \ \ _ _ Burmese | \ \ Khmer | \ \ Ethiopic Thaana \ _ _ Thai Lao The easy ones Latin is the alphabet and writing system used in the West and some other places Greek and Cyrillic (Russian) are very similar, they just use different characters Armenian and Georgian are also relatively similar More difficult Hebrew is written from right−to−left, but numbers go left−to−right... Arabic has the same rules, but also requires variant selection depending on context and ligature forming The far east Chinese uses two ’alphabets’: hanzi ideographs and zhuyin syllables Japanese mixes four alphabets: kanji ideographs, katakana and hiragana syllables and romaji (latin) letters and numbers Korean uses hangul ideographs, combined from jamo components Vietnamese uses latin letters... The Indic languages Based on syllabic alphabets Require complex ligature forming Letters are not written in logical order, but require a strange ’circular’ ordering In addition, a single line consists of separate
    [Show full text]
  • United States RBDS Standard Specification of the Radio Broadcast Data System (RBDS) April, 2005
    NRSC STANDARD NATIONAL RADIO SYSTEMS COMMITTEE NRSC-4-A United States RBDS Standard Specification of the radio broadcast data system (RBDS) April, 2005 Part II - Annexes NAB: 1771 N Street, N.W. CEA: 1919 South Eads Street Washington, DC 20036 Arlington, VA 22202 Tel: (202) 429-5356 Fax: (202) 775-4981 Tel: (703) 907-7660 Fax: (703) 907-8113 Co-sponsored by the Consumer Electronics Association and the National Association of Broadcasters http://www.nrscstandards.org NOTICE NRSC Standards, Bulletins and other technical publications are designed to serve the public interest through eliminating misunderstandings between manufacturers and purchasers, facilitating interchangeability and improvement of products, and assisting the purchaser in selecting and obtaining with minimum delay the proper product for his particular need. Existence of such Standards, Bulletins and other technical publications shall not in any respect preclude any member or nonmember of the Consumer Electronics Association (CEA) or the National Association of Broadcasters (NAB) from manufacturing or selling products not conforming to such Standards, Bulletins or other technical publications, nor shall the existence of such Standards, Bulletins and other technical publications preclude their voluntary use by those other than CEA or NAB members, whether the standard is to be used either domestically or internationally. Standards, Bulletins and other technical publications are adopted by the NRSC in accordance with the NRSC patent policy. By such action, CEA and NAB do not assume any liability to any patent owner, nor do they assume any obligation whatever to parties adopting the Standard, Bulletin or other technical publication. Note: The user's attention is called to the possibility that compliance with this standard may require use of an invention covered by patent rights.
    [Show full text]
  • Proto-Elamite
    L2/20­192 2020­09­21 Preliminary proposal to encode Proto­Elamite in Unicode Anshuman Pandey [email protected] pandey.github.io/unicode September 21, 2020 Contents 1 Introduction 2 2 Overview of the Sign Repertoire 3 2.1 Sign names . 4 2.2 Numeric signs . 4 2.3 Numeric signs with extended representations . 5 2.4 Complex capacity signs . 6 2.5 Complex graphemes . 7 2.6 Signs in compounds without independent attestation . 10 2.7 Alternate or variant forms . 11 2.8 Scribal designs . 11 3 Proposed Encoding Model 12 4 Proposed Characters 13 4.1 Numeric signs . 13 4.2 General ideographic signs . 17 5 Characters Not Suitable for Encoding 110 6 References 110 7 Acknowledgments 111 1 Preliminary proposal to encode Proto­Elamite in Unicode Anshuman Pandey 1 Introduction The term ‘Proto­Elamite’ refers to a writing system that was used at the beginning of the 3rd millenium BCE in the region to the east and southeast of Mesopotamia, known as Elam, which corresponds to the eastern portion of present­day Iran. The name was assigned by the French epigraphist Jean­Vincent Scheil in the early 20th century, who believed it to be the predecessor of a ‘proper’ Elamite script, which would have been used for recording the Elamite language, simply on account of the location of the tablets at Susa, which was the capital city of Elam. While no ‘proper’ descendent of the script has been identified, scholars continue to use the name ‘Proto­Elamite’ as a matter of convention (Dahl 2012: 2). Proto­Elamite is believed to have been developed from an accounting system used in Mesopotamia, in a manner similar to the development of ‘Proto­Cuneiform’.
    [Show full text]
  • TUGBOAT Volume 32, Number 1 / 2011
    TUGBOAT Volume 32, Number 1 / 2011 General Delivery 3 From the president / Karl Berry 4 Editorial comments / Barbara Beeton Opus 100; BBVA award for Don Knuth; Short takes; Mimi 6 Mimi Burbank / Jackie Damrau 7 Missing Mimi / Christina Thiele 9 16 years of ConTEXt / Hans Hagen 17 TUGboat’s 100 issues — Basic statistics and random gleanings / David Walden and Karl Berry 23 TUGboat online / Karl Berry and David Walden 27 TEX consulting for fun and profit / Boris Veytsman Resources 30 Which way to the forum? / Jim Hefferon Electronic Documents 32 LATEX at Distributed Proofreaders and the electronic preservation of mathematical literature at Project Gutenberg / Andrew Hwang Fonts 39 Introducing the PT Sans and PT Serif typefaces / Pavel Far´aˇr 43 Handling math: A retrospective / Hans Hagen Typography 47 The rules for long s / Andrew West Software & Tools 56 Installing TEX Live 2010 on Ubuntu / Enrico Gregorio 62 tlcontrib.metatex.org: A complement to TEX Live / Taco Hoekwater 68 LuaTEX: What it takes to make a paragraph / Paul Isambert 77 Luna — my side of the moon / Paweł Jackowski A L TEX 83 Reflections on the history of the LATEX Project Public License (LPPL)— A software license for LATEX and more / Frank Mittelbach 95 siunitx: A comprehensive (SI) units package / Joseph Wright 99 Glisterings: Framing, new frames / Peter Wilson 104 Some misunderstood or unknown LATEX2ε tricks III / Luca Merciadri A L TEX 3 108 LATEX3 news, issue 5 / LATEX Project Team Book Reviews 109 Book review: Typesetting tables with LATEX / Boris Veytsman Hints & Tricks
    [Show full text]
  • Farsitex and the Iranian TEX Community
    FarsiTEX and the Iranian TEX Community Behdad Esfahbod Computing Center Sharif University of Technology Azadi Avenue Tehran, Iran [email protected] http://behdad.org/ Roozbeh Pournader Computing Center Sharif University of Technology Azadi Avenue Tehran, Iran [email protected] http://sina.sharif.edu/~roozbeh/ Abstract FarsiTEX, a localized version of LATEX, is a bilingual Persian/English typesetting package, meeting the minimum requirements of Persian mathematical and tech- nical typography. This paper will describe FarsiTEX, together with its history, future and technicalities, its user community, and the reasons behind its success in Iran, amid its various usage and interoperability problems. It will also draw a general picture of the TEX community in Iran, and tries to describe why the community is still far from achieving its basic typographical needs. Introduction version 3.2 (Unicode Editorial Committee, 2002), lists a total of 139 letters in the script, which are The Persian language, in its contemporary form, is derivatives of about 28 basic Arabic letters. a language spoken natively in Iran, Afghanistan, The Persian typography, influenced by major and Tajikistan. The local forms are known as calligraphic practices of the pre-printing era, is ac- Farsi, Dari, and Tajiki respectively. They all use tually based on the famous Naskh style, which more the same basic vocabulary and grammar, but there than 99% of contemporary texts published in it. The are differences in both pronunciations and modern alternate style, Nastaliq, a little harder to read but vocabulary. In this paper, we will focus on the form considered very beautiful by the general public, and used in Iran, which is the official language of the widely known as the hardest commonly used script country.
    [Show full text]
  • Saa5x9x Family Economy Teletext and TV Microcontrollers
    INTEGRATED CIRCUITS DATA SHEET SAA5x9x family Economy teletext and TV microcontrollers Preliminary specification 1998 Dec 14 Supersedes data of 1997 Jul 07 File under Integrated Circuits, IC02 Philips Semiconductors Preliminary specification Economy teletext and TV microcontrollers SAA5x9x family CONTENTS 9.3 East/West selection 9.4 National option characters 1 FEATURES 9.5 The twist attribute 1.1 General 9.6 Language group identification 1.2 Microcontroller 9.7 525-line operation 1.3 Teletext acquisition 9.8 On-Screen Display characters 1.4 Teletext Display 9.9 Control characters 1.5 Additional features of SAA529xA devices 9.10 Quadruple width display (SAA549x) 1.6 Additional features of SAA549x devices 9.11 Page attributes 9.12 Display modes 2 GENERAL DESCRIPTION 9.13 On-Screen Display boxes 2.1 Device masking history 9.14 Screen colour 3 ORDERING INFORMATION 9.15 Redefinable colours (SAA549x) 4 QUICK REFERENCE DATA 9.16 Cursor 9.17 Other display features 5 BLOCK DIAGRAM 9.18 Display timing 6 PINNING INFORMATION 9.19 Horizontal timing 6.1 Pinning 9.20 Vertical timing 6.2 Pin description 9.21 Display position 9.22 Clock generator 7 FUNCTIONAL DESCRIPTION 9.23 Reset signal 7.1 Microcontroller 10 CHARACTER SETS 7.2 80C51 features not supported 7.3 Additional features 10.1 Pan-European 7.4 Microcontroller interfacing 10.2 Cyrillic 10.3 Greek/Turkish 8 TELETEXT DECODER 10.4 Arabic/English/French 8.1 Data slicer 10.5 Thai 8.2 Acquisition timing 10.6 Arabic/Hebrew 8.3 Teletext acquisition 10.7 Iranian 8.4 Rolling headers and time 11 LIMITING
    [Show full text]
  • Technical Advisory Group on Machine Readable Travel Documents (Tag/Mrtd) Twentieth Meeting
    TAG/MRTD/20-WP/17 6/09/11 English Only International Civil Aviation Organization WORKING PAPER TECHNICAL ADVISORY GROUP ON MACHINE READABLE TRAVEL DOCUMENTS (TAG/MRTD) TWENTIETH MEETING Montréal, 7 to 9 September 2011 Agenda Item 2: Activities of the NTWG Agenda Item 2.9: Transliteration Rules (Arabic) TRANSLITERATION RULES (ARABIC) (Presented by the NTWG) 1. INTRODUCTION 1.1 The purpose of the working paper is to inform the TAG of the progress made on the recommended transliteration of Arabic national characters for Doc Doc 9303, Machine Readable Travel Documents , Part 1 — Machine Readable Passports , Volume1, Section IV, Appendix 9. These recommended transliterations apply to the MRZ only. 2. BACKGROUND 2.1 Transliteration of Arabic characters applies mainly to the name of the holder of the MRTD, and is critical for determining the true identity of the holder. At present, Arabic names are transcribed (phonetically) to Latin characters on an ad-hoc basis and this compromises identity management. 2.2 The only true and reliable source of identification is the original form of the name in Arabic characters. Thus, any transliteration scheme must preserve the original form of the name. 2.3 It would be advantageous to countries using the Arabic script if encoding the name in the MRZ led to machine reading producing the original form of the name in Arabic. This would allow these countries to gain the benefit of machine reading without having to deal with an intermediate form of the name as an inaccurate Latin transcription. 2.4 There is no known existing transliteration scheme for Arabic script that is suitable for the MRZ.
    [Show full text]
  • Bibliography and Index
    TLC2, ch-end.tex,v: 1.39, 2004/03/19 p.963 Bibliography [1] Adobe Systems Incorporated. Adobe Type 1 Font Format. Addison-Wes- ley, Reading, MA, USA, 1990. ISBN 0-201-57044-0. The “black book” contains the specifications for Adobe’s Type 1 font format and describes how to create a Type 1 font program. The book explains the specifics of the Type 1 syntax (a subset of PostScript), including information on the structure of font programs, ways to specify computer outlines, and the contents of the various font dictionaries. It also covers encryption, subroutines, and hints. http://partners.adobe.com/asn/developer/pdfs/tn/T1Format.pdf [2] Adobe Systems Incorporated. “PostScript document structuring conven- tions specification (version 3.0)”. Technical Note 5001, 1992. This technical note defines a standard set of document structuring conventions (DSC), which will help ensure that a PostScript document is device independent. DSC allows PostScript language programs to communicate their document structure and printing requirements to document managers in a way that does not affect the PostScript language page description. http://partners.adobe.com/asn/developer/pdfs/tn/5001.DSC_Spec.pdf [3] Adobe Systems Incorporated. “Encapsulated PostScript file format specifi- cation (version 3.0)”. Technical Note 5002, 1992. This technical note details the Encapsulated PostScript file (epsf) format, a standard format for importing and exporting PostScript language files among applications in a variety of heteroge- neous environments. The epsf format is based on and conforms to the document structuring conventions (DSC) [2]. http://partners.adobe.com/asn/developer/pdfs/tn/5002.EPSF_Spec.pdf [4] Adobe Systems Incorporated.
    [Show full text]
  • The Treasure Chest for Compatibility with Texpower and Seminar
    TUGboat, Volume 22 (2001), No. 1/2 67 the concept of pdfslide, but completely rewritten The Treasure Chest for compatibility with texpower and seminar. ifsym: in fonts Fonts with symbols for alpinistic, electronic, mete- orological, geometric, etc., usage. A LATEX2ε pack- age simplifies usage. Packages posted to CTAN jas99_m.bst: in biblio/bibtex/contrib “What’s in a name?” I did not realize that Jan Update of jas99.bst,modifiedforbetterconfor- Tschichold’s typographic standards lived on in the mity to the American Meteorological Society. koma-script package often mentioned on usenet (in LaTeX WIDE: in nonfree/systems/win32/LaTeX_WIDE comp.text.tex) until I happened upon the listing A demonstration version of an integrated editor for it in a previous edition of “The Treasure Chest”. and shell for TEX— free for noncommercial use, but without registration, customization is disabled. This column is an attempt to give TEX users an on- : LAT X2ε macro package of simple, “little helpers” going glimpse of the trove which is CTAN. lhelp E converted into dtx format. Includes common units This is a chronological list of packages posted with preceding thinspaces, framed boxes, start new to CTAN between June and December 2000 with odd or even pages, draft markers, notes, condi- descriptive text pulled from the announcement and tional includes (including EPS files), and versions edited for brevity — however, all errors are mine. of enumerate and itemize which allow spacing to Packages are in alphabetic order and are listed only be changed. in the last month they were updated. Individual files makecmds Provides commands to make commands, envi- / partial uploads are listed under their own name if ronments, counters and lengths.
    [Show full text]