LUCIDAH Ligative and Unligative Characters in a Dataset for Arabic Handwriting

Total Page:16

File Type:pdf, Size:1020Kb

LUCIDAH Ligative and Unligative Characters in a Dataset for Arabic Handwriting (IJACSA) International Journal of Advanced Computer Science and Applications, Vol. 10, No. 8, 2019 LUCIDAH Ligative and Unligative Characters in a Dataset for Arabic Handwriting 1 Yousef Elarian Abdelmalek Zidouri3 Teaching and Learning Coaching Electrical Engineering Department ® Interserve King Fahd University of Petroleum & Minerals Madīnah, Saudi Arabia Dhahran 31261, Saudi Arabia 2 Irfan Ahmad Wasfi G. Al-Khatib4 Information and Computer Science Department Information and Computer Science Department King Fahd University of Petroleum & Minerals King Fahd University of Petroleum & Minerals Dhahran 31261, Saudi Arabia Dhahran 31261, Saudi Arabia Abstract—Arabic script is inherently cursive, even when either written in the normal horizontal connection form or as a machine-printed. When connected to other characters, some ligature, depending on the writer’s choice. Arabic characters may be optionally written in compact aesthetic forms known as ligatures. It is useful to distinguish ligatures Many character sequences are not ligative (or “unligative”, from ordinary characters for several applications, especially as we refer to them); i.e., they do not have optional ligature ,family ”ـ٬“ or ”٫“) automatic text recognition. Datasets that do not annotate these forms. One particular family of ligatures ,and Madda (“~”) combinations (”ء“) ligatures may confuse the recognition system training. Some with different Hamza is considered (”ـ٪“ ,”٩“ ,”ـ٦“ ,”٥“ ,”ـ٨“ ,”٧“ ,popular datasets manually annotate ligatures, but no dataset namely (prior to this work) took ligatures into consideration from the obligatory, i.e., its constituent LAM and ALEF characters design phase. In this paper, a detailed study of Arabic ligatures cannot be connected in the common horizontal form. Table I and a design for a dataset that considers the representation of shows different connections of a ligative, an unligative, and an ligative and unligative characters are presented. Then, pilot data obligatorily ligative examples. collection and recognition experiments are conducted on the presented dataset and on another popular dataset of handwritten Ligatures are often used in handwriting for their esthetic Arabic words. These experiments show the benefit of annotating effects and their compactness. Their presence, however, can ligatures in datasets by reducing error-rates in character add complexities to tasks like Arabic character recognition and recognition tasks. synthesis. Researchers in these and similar fields have attempted to handle some ligatures [6] [7] [8] [9], but a Keywords—Arabic ligatures; automatic text recognition; comprehensive and systematic list of ligatives was only handwriting datasets; Hidden Markov Models available after the initial work presented by the authors in [4]. I. INTRODUCTION In addition to handwriting, Arabic ligatures are increasingly Arabic script is widely used, not only for the Arabic being incorporated in modern computer fonts to help make language, but also for other languages like Urdu, Jawi, and texts more compact, beautiful, and as an alternative method for Persian [1]. Despite such importance of the Arabic script, paragraph justification [10] [11]. research in technologies that serve it still remains The "Arabic Presentation Forms-A" of the Unicode underdeveloped [2]. This is mainly attributed to the scarcity of standard encodes more than 300 Arabic and Extended-Arabic specialized resources and algorithms that take the peculiarities ligatures [12]. We notice, however, that their set suffers from ”خئ“ of the Arabic script into consideration [2] [3]. incompleteness and inconsistencies. For example, the is absent in Unicode despite the presence (”داخئ“ Among the peculiarities of the Arabic script is that it is ligature (as in .([cf. [4) ”خن“ and ,”حن“ ,”جن“ ,”حئ“ ,”جئ“ inherently (and diversely) cursive. Twenty-two out of the 28 of similar ligatures such as Arabic letters must be connected to the letter that comes next to them in a word. The general form of connection is via short To the best of our knowledge, Arabic ligatures were not horizontal extensions, shown in the second row of Table I. systematically analyzed and studied before. In this paper, we Some characters, however, also accept to be connected in present a detailed study of Arabic ligatures that lead to the special compact forms known as ligatures [4] (or typographic design of LUCIDAH, a Ligative and Unligative Dataset for ligatures [5]). Examples of Arabic ligatures are shown in the Arabic Handwriting. LUCIDAH is especially designed to be last row of Table I. representative of both: ligatures and characters in their non- ligatured forms. It is also designed to be practically concise and Most ligatures are optional, i.e., when certain sequences of natural; based on the guidelines in [4]. characters (that we refer to as “ligatives”) occur, they can be 406 | P a g e www.ijacsa.thesai.org (IJACSA) International Journal of Advanced Computer Science and Applications, Vol. 10, No. 8, 2019 TABLE. I. EXAMPLES OF (A) A LIGATIVE, (B) AN UNLIGATIVE AND (C) AN connecting. Finally, as was mentioned earlier, there is one ,that does not accept connections from either side ”ء“ ,OBLIGATORILY-LIGATIVE CHARACTERS character Mode of connection Ligative Unligative Obligatorily and hence; it only has the Alone (A) character-shape. B. Arabic Typographic Models مػ ـا مػ ـس مػ ـمػ ػح Unconnected Character-shapes NA There are several Arabic typographic models. The مس مـمح Non-Ligature Connections traditional model introduces up to 4 character-shapes per letter Ligature Connections NA or connecting letters having four shapes, non-connecting letters) ﻻ has one). The traditional model ”ء“ having two, and The rest of this paper is organized as follows. In Section II, encompasses more than a hundred character-shapes, which can background on Arabic characters in several typographic be a large number for some applications [17]. Hence, several models is presented. Section III is devoted to the discussion of other models were introduced to represent Arabic script with ligatures and their categorization. In Section IV, the design of less numbers of shapes. LUCIDAH is described. In Section V, implementation issues such as the data collection forms, the demographic distribution One of these reduction models is the 2-Shapes model [18]. of writers, and some scanning parameters are discussed. In The 2-Shapes model clusters the (B) and (M) shapes together Section VI, text recognition experiments are presented to and the (A) and (E) shapes together, whenever the differences highlight the impact of annotating ligatures on recognition are merely the connecting extension found preceding the (M) error-rates. Finally, conclusions are presented in Section VII. and (E) shape, as seen in Fig. 3 (left and middle parts). II. BACKGROUND ON THE ARABIC SCRIPT The 1-Shape model introduces root shapes, or basic glyph parts that are present in all shapes of a character [17]. In The Arabic script consists of 28 alphabetic letters that are addition to the common root shapes, many (A) and (E) mapped to the 36 computer characters (shown and explained in character-shapes also have tails. For example, the character details in the subsection of the Arabic Typographic Models). “ ” in Fig. 3 (right) has the “ ” root in all of its shapes, and ظػ ص Most Arabic characters must connect to subsequent characters .tail in its (A) and (E) shapes ”ں“ to compose words. A few Arabic characters, however, do not the and “ٍ” in ”ح“ connect to subsequent characters (e.g., Letters الرحيم Arabic Word Fig. 1). If any of the non-connecting characters occurs before ا لر حيم another letter in a word, they cause the word to split into Three PAWs disconnected Pieces of Arabic Words (PAWs) [13] [14] [15] or PAWs Ordered from right to left 3rd 2nd 1st subwords [16]. (In addition to that, there is one non-connecting Number of characters in PAW Three Two One .that does not allow previous characters to Fig. 1. An Arabic Word Divided into its PAWs ,”ء“ ,character connect to it, even when the previous character is a connecting one; hence, its occurrence also causes words to be split into TABLE. II. DIFFERENT CHARACTER-SHAPES OF THE ARABIC CHARACTER PAWs). "ّ" BASED ON ITS CONNECTIONS TO PREVIOUS AND NEXT CHARACTERS Example Connected Connected Example Size of A. Arabic Character-Shapes Shape Character- to previous to next Word PAWa Each Arabic character takes a “character-shape” based on Shape character character No No 1 درس س (its position in a PAW, as detailed in the following subsection. (A No Yes 2+ أسف سـ (Arabic traditionally introduces four character-shapes, based on (B Yes Yes 3+ تسامح ـسـ (characters' context (i.e. connection with previous and next (M characters in a PAW). If a PAW consists of only one character, Yes No 2+ خس ـس (that character takes the Alone (A) character-shape. PAWs with (E more than one letter must start with a Beginning (B) character- a The "+" sign after a number X in this column denotes "X or more characters". shape and end with an Ending (E) character-shape. If the PAW ا لر حيم PAWs ـ consists of three or more letters, there would be characters ا لـ ـر حـ يـ ـم between the Beginning and the Ending character-shapes. These Character-Shapes must be Middle (M) character-shapes. Table II shows examples Character Shape Labels E M B E B A ."ّ" of (A), (B), (M), and (E) character-shapes of the letter Fig. 2. An Arabic word divided into PAWs with character-shapes annotated. Fig. 2 shows an example word consisting of three PAWs and annotates its character-shapes (as Alone (A), Beginning 4-Shapes Model 2-Shape Model 1-Shape Model (B), Middle (M), and Ending (E)). Note that Arabic text is (A) (E) (M) (B) (A)and (E) (M)and (B) (A), (E), (M), and (B) ٓـ ـٔـ ـْ ّ ٓـ ـٔـ ـْ ّ ٓـ ـٔـ ـْ ّ written from right to left; hence, the Beginning character-shape of an Arabic multi-lettered PAW appears at the right end of the ٗـ ـ٘ـ ـٖ ٕ ٗـ ـ٘ـ ـٖ ٕ ٗـ ـ٘ـ ـٖ ٕ PAW, whereas the Ending character-shape comes at the ٛـ ـٜـ ٛـ ـٜـ ـٙ ٚ ـٛ ٙ ٚـ ـٜـ ـleftmost end of it.
Recommended publications
  • Research Brief March 2017 Publication #2017-16
    Research Brief March 2017 Publication #2017-16 Flourishing From the Start: What Is It and How Can It Be Measured? Kristin Anderson Moore, PhD, Child Trends Christina D. Bethell, PhD, The Child and Adolescent Health Measurement Introduction Initiative, Johns Hopkins Bloomberg School of Every parent wants their child to flourish, and every community wants its Public Health children to thrive. It is not sufficient for children to avoid negative outcomes. Rather, from their earliest years, we should foster positive outcomes for David Murphey, PhD, children. Substantial evidence indicates that early investments to foster positive child development can reap large and lasting gains.1 But in order to Child Trends implement and sustain policies and programs that help children flourish, we need to accurately define, measure, and then monitor, “flourishing.”a Miranda Carver Martin, BA, Child Trends By comparing the available child development research literature with the data currently being collected by health researchers and other practitioners, Martha Beltz, BA, we have identified important gaps in our definition of flourishing.2 In formerly of Child Trends particular, the field lacks a set of brief, robust, and culturally sensitive measures of “thriving” constructs critical for young children.3 This is also true for measures of the promotive and protective factors that contribute to thriving. Even when measures do exist, there are serious concerns regarding their validity and utility. We instead recommend these high-priority measures of flourishing
    [Show full text]
  • Arabic Handwriting Synthesis
    © Yousef S. I. Elarian 2014 iii Dedication dهل لوالدي ثم لكل محب To Allah Then, to my parents, and to all who helped, cared, or loved. iv ACKNOWLEDGMENTS Thanks again to my Lord, and to my Parents. Thanks to King Abdul Aziz City for Science and Technology (KACST) for granting and supporting this work (Project # GSP-18-112). Thanks to Dr. Sabri Mahmoud and to Dr. Muhammad Al-Mulhem. Deep Personal Thanks to Dr. Zidouri, Dr. Al-Khatib, and the committee members. Thanks to my colleagues: Sameh, Tanvir, Misbahuddin, Irfan, Anas, and to all who helped that this dissertation is completed Thanks from the heart. v TABLE OF CONTENTS ACKNOWLEDGMENTS ............................................................................................... V TABLE OF CONTENTS ............................................................................................... VI LIST OF TABLES .......................................................................................................... IX LIST OF FIGURES ........................................................................................................ XI LIST OF ABBREVIATIONS ..................................................................................... XIV ABSTRACT ................................................................................................................ XVII XIX .................................................................................................................... ملخص الرسالة CHAPTER 1 INTRODUCTION ..................................................................................
    [Show full text]
  • The Origin of the Peculiarities of the Vietnamese Alphabet André-Georges Haudricourt
    The origin of the peculiarities of the Vietnamese alphabet André-Georges Haudricourt To cite this version: André-Georges Haudricourt. The origin of the peculiarities of the Vietnamese alphabet. Mon-Khmer Studies, 2010, 39, pp.89-104. halshs-00918824v2 HAL Id: halshs-00918824 https://halshs.archives-ouvertes.fr/halshs-00918824v2 Submitted on 17 Dec 2013 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Published in Mon-Khmer Studies 39. 89–104 (2010). The origin of the peculiarities of the Vietnamese alphabet by André-Georges Haudricourt Translated by Alexis Michaud, LACITO-CNRS, France Originally published as: L’origine des particularités de l’alphabet vietnamien, Dân Việt Nam 3:61-68, 1949. Translator’s foreword André-Georges Haudricourt’s contribution to Southeast Asian studies is internationally acknowledged, witness the Haudricourt Festschrift (Suriya, Thomas and Suwilai 1985). However, many of Haudricourt’s works are not yet available to the English-reading public. A volume of the most important papers by André-Georges Haudricourt, translated by an international team of specialists, is currently in preparation. Its aim is to share with the English- speaking academic community Haudricourt’s seminal publications, many of which address issues in Southeast Asian languages, linguistics and social anthropology.
    [Show full text]
  • Applications the Formula Y = Mx + B Sometimes Appears with Different
    PRIMARY CONTENT MODULE Algebra I -Linear Equations & Inequalities T-71 Applications The formula y = mx + b sometimes appears with different symbols. For example, instead of x, we could use the letter C. Instead of y, we could use the letter F. Then the equation becomes F = mC + b. All temperature scales are related by linear equations. For example, the temperature in degrees Fahrenheit is a linear function of degrees Celsius. © 1999, CISC: Curriculum and Instruction Steering Committee The WINNING EQUATION PRIMARY CONTENT MODULE Algebra I -Linear Equations & Inequalities T-72 Basic Temperature Facts Water freezes at: 0°C, 32°F Water Boils at: 100°C, 212°F F • (100, 212) • (0, 32) C Can you solve for m and b in F = mC + b? © 1999, CISC: Curriculum and Instruction Steering Committee The WINNING EQUATION PRIMARY CONTENT MODULE Algebra I -Linear Equations & Inequalities T-73 To find the equation relating Fahrenheit to Celsius we need m and b F – F m = 2 1 C2 – C1 212 – 32 m = 100 – 0 180 9 m = = 100 5 9 Therefore F = C + b 5 To find b, substitute the coordinates of either point. 9 32 = (0) + b 5 Therefore b = 32 Therefore the equation is 9 F = C + 32 5 Can you solve for C in terms of F? © 1999, CISC: Curriculum and Instruction Steering Committee The WINNING EQUATION PRIMARY CONTENT MODULE Algebra I -Linear Equations & Inequalities T-74 Celsius in terms of Fahrenheit 9 F = C + 32 5 5 5 9 F = æ C +32ö 9 9è 5 ø 5 5 F = C + (32) 9 9 5 5 F – (32) = C 9 9 5 5 C = F – (32) 9 9 5 C = (F – 32) 9 Example: How many degrees Celsius is77°F? 5 C = (77 – 32) 9 5 C = (45) 9 C = 25° So 72°F = 25°C © 1999, CISC: Curriculum and Instruction Steering Committee The WINNING EQUATION PRIMARY CONTENT MODULE Algebra I -Linear Equations & Inequalities T-75 Standard 8 Algebra I, Grade 8 Standards Students understand the concept of parallel and perpendicular lines and how their slopes are related.
    [Show full text]
  • Form Y-203-I, If You Had More Than One Job, Or If You Had a Job for Only Part of the Year
    Department of Taxation and Finance Yonkers Nonresident Earnings Tax Return Y-203 For the full year January 1, 2020, through December 31, 2020, or fiscal year beginning and ending Name as shown on Form IT-201 or IT-203 Social Security number A Were you a Yonkers resident for any part of the taxable year? (mark an X in the appropriate box) Yes No (see instructions) (See the instructions for Form IT-201 or IT-203 for the definition of a resident.) If Yes: 1. Give period of Yonkers residence. From (mmddyyyy) to (mmddyyyy) 2. Are you reporting Yonkers resident income tax surcharge on your New York State return? ..................................................................................... Yes No (submit explanation) 3. You must complete and submit Form IT-360.1 (see instructions). B Did you or your spouse maintain an apartment or other living quarters in Yonkers during any part of the year?...................................................................................... Yes No If Yes, give address below and enter the number of days spent in Yonkers during 2020: days Address: C Are you reporting income from self-employment (on line 2 below)?......... Yes No If Yes, complete the following: Business name Business address Employer identification number Principal business activity Form of business: Sole proprietorship Partnership Other (explain) Calculation of nonresident earnings tax 1 Gross wages and other employee compensation (see instructions; if claiming an allocation, include amount from line 22) ............................................... 1 .00 2 Net earnings from self-employment (see instructions; if claiming an allocation, include amount from line 32; if a loss, write loss on line 2) ............................................................................................... 2 .00 3 Add lines 1 and 2 (if line 2 is a loss, enter amount from line 1) ...........................................................
    [Show full text]
  • Ligature Modeling for Recognition of Characters Written in 3D Space Dae Hwan Kim, Jin Hyung Kim
    Ligature Modeling for Recognition of Characters Written in 3D Space Dae Hwan Kim, Jin Hyung Kim To cite this version: Dae Hwan Kim, Jin Hyung Kim. Ligature Modeling for Recognition of Characters Written in 3D Space. Tenth International Workshop on Frontiers in Handwriting Recognition, Université de Rennes 1, Oct 2006, La Baule (France). inria-00105116 HAL Id: inria-00105116 https://hal.inria.fr/inria-00105116 Submitted on 10 Oct 2006 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Ligature Modeling for Recognition of Characters Written in 3D Space Dae Hwan Kim Jin Hyung Kim Artificial Intelligence and Artificial Intelligence and Pattern Recognition Lab. Pattern Recognition Lab. KAIST, Daejeon, KAIST, Daejeon, South Korea South Korea [email protected] [email protected] Abstract defined shape of character while it showed high recognition performance. Moreover when a user writes In this work, we propose a 3D space handwriting multiple stroke character such as ‘4’, the user has to recognition system by combining 2D space handwriting write a new shape which is predefined in a uni-stroke models and 3D space ligature models based on that the and which he/she has never seen.
    [Show full text]
  • ISO Basic Latin Alphabet
    ISO basic Latin alphabet The ISO basic Latin alphabet is a Latin-script alphabet and consists of two sets of 26 letters, codified in[1] various national and international standards and used widely in international communication. The two sets contain the following 26 letters each:[1][2] ISO basic Latin alphabet Uppercase Latin A B C D E F G H I J K L M N O P Q R S T U V W X Y Z alphabet Lowercase Latin a b c d e f g h i j k l m n o p q r s t u v w x y z alphabet Contents History Terminology Name for Unicode block that contains all letters Names for the two subsets Names for the letters Timeline for encoding standards Timeline for widely used computer codes supporting the alphabet Representation Usage Alphabets containing the same set of letters Column numbering See also References History By the 1960s it became apparent to thecomputer and telecommunications industries in the First World that a non-proprietary method of encoding characters was needed. The International Organization for Standardization (ISO) encapsulated the Latin script in their (ISO/IEC 646) 7-bit character-encoding standard. To achieve widespread acceptance, this encapsulation was based on popular usage. The standard was based on the already published American Standard Code for Information Interchange, better known as ASCII, which included in the character set the 26 × 2 letters of the English alphabet. Later standards issued by the ISO, for example ISO/IEC 8859 (8-bit character encoding) and ISO/IEC 10646 (Unicode Latin), have continued to define the 26 × 2 letters of the English alphabet as the basic Latin script with extensions to handle other letters in other languages.[1] Terminology Name for Unicode block that contains all letters The Unicode block that contains the alphabet is called "C0 Controls and Basic Latin".
    [Show full text]
  • Form W-4, Employee's Withholding Certificate
    Employee’s Withholding Certificate OMB No. 1545-0074 Form W-4 ▶ (Rev. December 2020) Complete Form W-4 so that your employer can withhold the correct federal income tax from your pay. ▶ Department of the Treasury Give Form W-4 to your employer. 2021 Internal Revenue Service ▶ Your withholding is subject to review by the IRS. Step 1: (a) First name and middle initial Last name (b) Social security number Enter Address ▶ Does your name match the Personal name on your social security card? If not, to ensure you get Information City or town, state, and ZIP code credit for your earnings, contact SSA at 800-772-1213 or go to www.ssa.gov. (c) Single or Married filing separately Married filing jointly or Qualifying widow(er) Head of household (Check only if you’re unmarried and pay more than half the costs of keeping up a home for yourself and a qualifying individual.) Complete Steps 2–4 ONLY if they apply to you; otherwise, skip to Step 5. See page 2 for more information on each step, who can claim exemption from withholding, when to use the estimator at www.irs.gov/W4App, and privacy. Step 2: Complete this step if you (1) hold more than one job at a time, or (2) are married filing jointly and your spouse Multiple Jobs also works. The correct amount of withholding depends on income earned from all of these jobs. or Spouse Do only one of the following. Works (a) Use the estimator at www.irs.gov/W4App for most accurate withholding for this step (and Steps 3–4); or (b) Use the Multiple Jobs Worksheet on page 3 and enter the result in Step 4(c) below for roughly accurate withholding; or (c) If there are only two jobs total, you may check this box.
    [Show full text]
  • Ffontiau Cymraeg
    This publication is available in other languages and formats on request. Mae'r cyhoeddiad hwn ar gael mewn ieithoedd a fformatau eraill ar gais. [email protected] www.caerphilly.gov.uk/equalities How to type Accented Characters This guidance document has been produced to provide practical help when typing letters or circulars, or when designing posters or flyers so that getting accents on various letters when typing is made easier. The guide should be used alongside the Council’s Guidance on Equalities in Designing and Printing. Please note this is for PCs only and will not work on Macs. Firstly, on your keyboard make sure the Num Lock is switched on, or the codes shown in this document won’t work (this button is found above the numeric keypad on the right of your keyboard). By pressing the ALT key (to the left of the space bar), holding it down and then entering a certain sequence of numbers on the numeric keypad, it's very easy to get almost any accented character you want. For example, to get the letter “ô”, press and hold the ALT key, type in the code 0 2 4 4, then release the ALT key. The number sequences shown from page 3 onwards work in most fonts in order to get an accent over “a, e, i, o, u”, the vowels in the English alphabet. In other languages, for example in French, the letter "c" can be accented and in Spanish, "n" can be accented too. Many other languages have accents on consonants as well as vowels.
    [Show full text]
  • A Collection of Mildly Interesting Facts About the Little Symbols We Communicate With
    Ty p o g raph i c Factettes A collection of mildly interesting facts about the little symbols we communicate with. Helvetica The horizontal bars of a letter are almost always thinner than the vertical bars. Minion The font size is approximately the measurement from the lowest appearance of any letter to the highest. Most of the time. Seventy-two points equals one inch. Fridge256 point Cochin most of 50the point Zaphino time Letters with rounded bottoms don’t sit on the baseline, but slightly below it. Visually, they would appear too high if they rested on the same base as the squared letters. liceAdobe Caslon Bold UNITED KINGDOM UNITED STATES LOLITA LOLITA In Ancient Rome, scribes would abbreviate et (the latin word for and) into one letter. We still use that abbreviation, called the ampersand. The et is still very visible in some italic ampersands. The word ampersand comes from and-per-se-and. Strange. Adobe Garamond Regular Adobe Garamond Italic Trump Mediaval Italic Helvetica Light hat two letters ss w it cam gue e f can rom u . I Yo t h d. as n b ha e rt en ho a s ro n u e n t d it r fo w r s h a u n w ) d r e e m d a s n o r f e y t e t a e r b s , a b s u d t e d e e n m t i a ( n l d o b s o m a y r S e - d t w A i e t h h t t , h d e n a a s d r v e e p n t m a o f e e h m t e a k i i l .
    [Show full text]
  • The Selnolig Package: Selective Suppression of Typographic Ligatures*
    The selnolig package: Selective suppression of typographic ligatures* Mico Loretan† 2015/10/26 Abstract The selnolig package suppresses typographic ligatures selectively, i.e., based on predefined search patterns. The search patterns focus on ligatures deemed inappropriate because they span morpheme boundaries. For example, the word shelfful, which is mentioned in the TEXbook as a word for which the ff ligature might be inappropriate, is automatically typeset as shelfful rather than as shelfful. For English and German language documents, the selnolig package provides extensive rules for the selective suppression of so-called “common” ligatures. These comprise the ff, fi, fl, ffi, and ffl ligatures as well as the ft and fft ligatures. Other f-ligatures, such as fb, fh, fj and fk, are suppressed globally, while making exceptions for names and words of non-English/German origin, such as Kafka and fjord. For English language documents, the package further provides ligature suppression rules for a number of so-called “discretionary” or “rare” ligatures, such as ct, st, and sp. The selnolig package requires use of the LuaLATEX format provided by a recent TEX distribution, e.g., TEXLive 2013 and MiKTEX 2.9. Contents 1 Introduction ........................................... 1 2 I’m in a hurry! How do I start using this package? . 3 2.1 How do I load the selnolig package? . 3 2.2 Any hints on how to get started with LuaLATEX?...................... 4 2.3 Anything else I need to do or know? . 5 3 The selnolig package’s approach to breaking up ligatures . 6 3.1 Free, derivational, and inflectional morphemes .
    [Show full text]
  • (B) Dz[F(Z)+G(Z)]=Dzf(Z)+Dzg(Z), (C) Dz\F(Z)G(Z) ] = [Dzf(Z)}G(Z) +F(Z)Dzg(Z). II
    EXTENSION OF THE DERIVATIVE CONCEPT FOR FUNCTIONS OF MATRICES R. F. RINEHART 1. Introduction. Let Mr and Mq denote the set of all square matri- ces of order n over the real and complex fields, respectively. By a function f(Z) of a matrix Z of Mr (or M(j) is meant a mapping of a subset of Mr(Mc) into Mr(Mc). The question with which this paper is concerned is the establishment of suitable concepts of differentiabil- ity and derivative for such functions. A meaningful and useful definition of these concepts, should of course bear some noticeable resemblance to the analogous concepts for scalar functions. In addition the derivative should preserve some of the elementary properties of the derivative for scalar functions. A modest set of such desirable properties is: I (a) lif(Z) is a constant, then Dzf(Z) =0, (b) Dz[f(Z)+g(Z)]=Dzf(Z)+Dzg(Z), (c) Dz\f(Z)g(Z)] = [Dzf(Z)}g(Z)+f(Z)Dzg(Z). An additional important desired attribute, perhaps more strin- gent, is II The definitions of differentiability and derivative shall be applicable and meaningful when applied to the special functions on Mq arising from scalar functions of a complex variable [2]. For example, it would be desirable that the function ez turn out to be differentiable, accord- ing to the general definition of differentiability of functions on Mq. In the fairly extensive literature on functions defined on Mr or Mc, or more generally on linear algebras with unit element over R or C, no definition of derivative has been given which satisfactorily fulfills requirements I and II.
    [Show full text]