Issue Editor 1 R

Total Page:16

File Type:pdf, Size:1020Kb

Issue Editor 1 R DECEMBER 1989VOL. 12 NO. 4 a quarterlybulletin of the IEEE ComputerSociety technicalcommittee on Data Engineering CONTENTS Letterfrom the Issue Editor 1 R. King Problems and Peculiarities of ArabicDatabases 2 A.S. Elmaghraby, S. El—Shihaby, and S. El—Kassas Bayan: A Text DatabaseManagementSystemWhichSupports a Full Representation of the ArabicLanguage 12 A. Morfeq, and R. King Responsa: A Full—TextRetrievalSystem with LinguisticProcessing for a 65—MillionWordCorpus of JewishHeritage in Hebrew 22 V. Choueka IconicCommunication: ImageRealism and Meaning 32 F-S Liu, and J-W Ta! Overview of Japanese Kanji and KoreanHangulSupport on the ADABAS/NATURALSystem 42 V. ishi!, and H. Nishimoto Call for Papers 52 SPECIALISSUE ON NON-ENGLISHINTERFACES TO DATABASES ~ IEEECOMPUTERSOCIETY 4 -~ IEEE Editor-in—Chief, Data Engineering Chairperson, TC Dr. Won Kim Prof. LarryKerschberg MCC Dept. of informationSystems and SystemsEngineering 3500West BaiconesCenterDrive GeorgeMasonUniversity Austin, TX 78759 4400UniversityDrive (512) 338—3439 Fairfax, VA 22030 (703) 323—4354 AssociateEditors Vice Chairperson, TC Prof. Dma Bitton Prof. Stefano Coil Dept. of EiectricaiEngineering Dipartlmento di Matematica and ComputerScience Universita’ di Modena University of iiiinois Via Campi 213 Chicago, iL 60680 41100Modena, Italy (312) 413—2296 Prof. MichaeiCarey Secretary, TC ComputerSciencesDepartment Prof. Don Potter University of Wisconsin Dept. of ComputerScience Madison, WI 53706 University of Georgia (608) 262—2252 Athens, GA 30602 (404) 542—0361 Prof. Roger King Past Chairperson, TC Department of ComputerScience Prof. Sushli,Jajodia campus box 430 Dept. of informationSystems and SystemsEngineering University of Colorado GeorgeMasonUniversity Boulder, CO 80309 4400University Drive (303) 492—7398 Fairfax, VA 22030 (703) 764—6192 Prof. Z. MoralOzsoyogiu Department of ComputerEngineering and Science CaseWesternReserveUniversity Cieveiand, Ohio 44106 (216) 368—2818 Dr. Sunil Sarin Distribution XeroxAdvancedInformationTechnology Ms. Lori Rottenberg 4 CambridgeCenter IEEE ComputerSociety Cambridge, MA 02142 1730 Massachusetts Ave. (617) 492—8860 Washington, D.C. 20036—1903 (202) 371—1012 DataEngineeringBuiletin is a quarteriypubiicatlon of Membership in the Data EngineeringTechnicalCommittee the IEEE ComputerSocietyTechnicalCommittee on Data is open to individuals who demonstratewillingness to its of interestincludes: data structures in Engineering . scope activelyparticipate the variousactivities of the TC. A and models,accessstrategies,accesscontroltechniques, member of the IEEE ComputerSociety may Join the TC as a databasearchitecture,databasemachines,inteiligentfront full member. A non—member of the ComputerSociety may ends, massstorage for very largedatabases,distributed Join as a participatingmember, with approvalfrom at least databasesystems and techniques,databasesoftwaredesign one officer of the TC. Both full members and participating and Implementation,databaseutilities databasesecurity members of the TC are entitled to receive the quarterly and relatedareas. bulietin of the TC free of charge, until furthernotice. Contribution to the Bulletin is herebysolicited. News items, letters,technicalpapers, book reviews,meetingpreviews, summaries, casestudies, etc., should be sent to the Editor. All letters to the Editor wiii be considered for publication unlessaccompanied by a request to the contrary. Technical papers are unrefereed. OpinIonsexpressed in contributions are those of the mdi viduaiauthorrather than the officlaiposition of the TC on Data Engineering, the IEEE ComputerSociety, or orga nizations with which the author may be affiiiated. Letterfrom the Editor Severalmonths ago, Won Kimasked me to put together the December 1989issue. Ini tially, we agreed on graphicalinterfaces to databases as the topic. As I beganlooking into the literature and trying to figurewhatpapers to invite, I noticed that there wasvery little out therethat had to do withnon-Englishinterfaces to databases. So I set out to find a broadrange of papers, all dealingwithinterfaceproblemsthatAmerican andEuropean researcherswouldfindunusual. Theindividualswhocontributed to this issuewereasked to put in a lot of effort in a very shortamount of time. A few of them haddifficultywriting in English. Therewere also communicationproblems for the authorslocatedoutside the USA. All of the authors deserve a verywarmthanks. There are two papersdealing withArabic. The firstpaper, by Elmaghraby,El-Shihaby, and El-Kassas, gives an overview of why Arabicpresentsunusualproblems for the developer of interfaces, and whyEnglish-basedsoftware and hardwaremay not be easily adapted to handleArabic. The secondpaperrepresents the Ph.D.thesiswork of Au Mor feq and describes an effort to develop text databasemanagementtechniquesspecifically suited for Arabic. The thirdpaper also deals with textual data. It is by YaacovChoueka and describes a very aggressiveeffort at developing a database of Hebrew texts. This effort has pro ducedresearchcontributionsthat are of use to developers of textretrievalsystems in gen eral, notjust to developers of Hebrew-basedsystems. Thefourthpaper is by FangShengLiu and Ju Wei Tai, andconcerns the development of graphicalinterfaces. It suggests that a useful way of constructing and categorizing graphicaltokens can be based on Chinesecharactertheory. Oneinterestingaspect of this paper is that it is not dedicated to Chinese-basedinterfaces. The fifth paper (by Yoshioki Ishii and HidekiNishimoto)describes the support of Japanese and Koreandatabaseinterfaces. It gives an overview of why the Japanese and Koreanlanguagespresentnovelproblems, and then describes the approach they have taken in developingtheirsystem. I hopethat the readers of thisissuefindthesepapers to be unusual andinteresting. RogerKing December, 1989 1. Problems and Peculiarities of ArabicDatabases A. S. Eliuaghraby1, S. E1-Shihaby2, and S. E1—Kassas3 ABSTRACT This paper presents a brief overview of Arabiclanguagedatabases, their problems, solutions, and directions for the future. Experience is based on implementation of an Arabizationscheme and severaldatabaseapplicationsusingstandardDBMSpackages and programminglanguages.Most of theseproblems are due to the fact that Arabization is not integrated in the design of the hardware and systemprograms. 1. INTRODUCTION The need for ArabicDatabases has beenrecognizedsince the introduction of computers in the Middle East. Modernelectroniccomputers were first introduced to Egypt in the 1950’s starting by the IBM model 1620. Currently all types of computers exist, and are Arabized, with the exception of some high end models. Arabizationschemes for computers are key to understandingproblems of Arabic Databases. It is alsobeneficial to briefly get acquaintedwithsomepeculiarities of the Arabic language. The Arabiclanguage is written from right to left using the Arabiccharacter set with connectedscript.Twentyeightcharacterscompose the Arabic set, howevereach character has multipleshapes.Displaying an Arabicword requiresknowledge of shapeselection of each letter based on the letterspositions in the word. The lack of endogenouscomputerdesigns in the Arabworld have forcedArabization to follow a modification and patchworkapproach.PeripheralArabization has been the most popularapproach in the earlyyears and is used for mainframe and minicomputerswhile microcomputerArabization has been approached with a variety of schemes. Thispaperpresentssome of the problemsfacingDatabasedesign and Databaseinterfaces for Arabicapplications.Some of the necessarybackgroundrelated to Arabiclanguage and Arabizationschemes is also included. One major issue in Arabiclanguage that is briefly presentedwithoutdetail in this paper is diacritics(al-tashkeel) - which has been dropped in mostmodernArabicpublications -- diacritics are mainlyused to improvepronunciation. 1 EMA~SDept.,University of Louisville,Kentucky, USA40292. 2 University of Alexandria,Alexandria,EGYPT. EB Dept.,EindhovenUniversity of Technology, the Netherlands. 2. 2. PROBLEMS OF ARABICDATABASES 2.1. Peculiarities of Arabic and Arabization Differences amongnaturallanguages have caused performancedegradation in many computerenvironments,includingdatabases.However,softwaredesigners,most of the time, includeconsiderations for the differences in Latin-basedlanguages. They do constructlayers to allow the update of the character set -- IBM’sMVS OS allows thatwithin the whole OS interface. This is convenient when applications do not use “clever” techniques and communicatedirectlywithsystemperipherals,such as in the case of Focus on IBMsystems. When the linguisticdifferencesinclude more than minorvariations in the character set, such as in Arabization, serious problems occur. Operating system designers did not anticipate,whendesigningtheirsupervisorcallshandlingterminals, thatentering a character may alter the shape of an alreadydisplayedcharacter. In order to face these problems designers will assist in manoeuveringaround the systemcapabilities to reachtheseneeds. In an attempt to build an Arabicdatabase, the designer is facedwith the obviousproblems of using a non-Englishcharacter set in addition to somepeculiarities that relate to Arabic language and Arabizationschemes. 2.1.1. Orientation The Arabiclanguage is written in a right to left orientation.When the first IBMArabic terminals(XCOM,XCOM1, and XCOM2)wereintroducedtheyallowedwritingcharacters in both orientations;left-to-right (R-L), and right-to-left (L-R). However,because of technicalproblems,
Recommended publications
  • DIGITAL TYPOGRAPHY USING LATEX Springer New York Berlin Heidelberg Hong Kong London Milan Paris Tokyo Apostolos Syropoulos Antonis Tsolomitis Nick Sofroniou
    DIGITAL TYPOGRAPHY USING LATEX Springer New York Berlin Heidelberg Hong Kong London Milan Paris Tokyo Apostolos Syropoulos Antonis Tsolomitis Nick Sofroniou DIGITAL TYPOGRAPHY USING LATEX With 68 Illustrations Apostolos Syropoulos Antonis Tsolomitis 366, 28th October St. Dept. of Mathematics GR-671 00 Xanthi University of the Aegean GREECE GR-832 00 Karlobasi, Samos [email protected] GREECE [email protected] Nick Sofroniou Educational Research Centre St. Patrick’s College Drumcondra, Dublin 9 IRELAND [email protected] Library of Congress Cataloging-in-Publication Data Syropoulos, Apostolos. Digital typography using LaTeX / Apostolos Syropoulos, Antonis Tsolomitis, Nick Sofroniou. p. cm. Includes bibliographical references and indexes. ISBN 0-387-95217-9 (acid-free paper) 1. LaTeX (Computer file) 2. Computerized typesetting. I. Tsolomitis, Antonis. II. Sofroniou, Nick. III. Title. Z253.4.L38 S97 2002 686.2´2544—dc21 2002070557 ACM Computing Classification (1998): H.5.2, I.7.2, I.7.4, K.8.1 ISBN 0-387-95217-9 (alk. paper) Printed on acid-free paper. Printed on acid-free paper. © 2003 Springer-Verlag New York, Inc. All rights reserved. This work may not be translated or copied in whole or in part without the written permission of the publisher (Springer-Verlag New York, Inc., 175 Fifth Avenue, New York, NY 10010, USA), except for brief excerpts in connection with reviews or scholarly analysis. Use in connection with any form of information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed is forbidden. The use in this publication of trade names, trademarks, service marks, and similar terms, even if they are not identified as such, is not to be taken as an expression of opinion as to whether they are subject to proprietary rights.
    [Show full text]
  • Arabic Alphabet - Wikipedia, the Free Encyclopedia Arabic Alphabet from Wikipedia, the Free Encyclopedia
    2/14/13 Arabic alphabet - Wikipedia, the free encyclopedia Arabic alphabet From Wikipedia, the free encyclopedia َأﺑْ َﺠ ِﺪﯾﱠﺔ َﻋ َﺮﺑِﯿﱠﺔ :The Arabic alphabet (Arabic ’abjadiyyah ‘arabiyyah) or Arabic abjad is Arabic abjad the Arabic script as it is codified for writing the Arabic language. It is written from right to left, in a cursive style, and includes 28 letters. Because letters usually[1] stand for consonants, it is classified as an abjad. Type Abjad Languages Arabic Time 400 to the present period Parent Proto-Sinaitic systems Phoenician Aramaic Syriac Nabataean Arabic abjad Child N'Ko alphabet systems ISO 15924 Arab, 160 Direction Right-to-left Unicode Arabic alias Unicode U+0600 to U+06FF range (http://www.unicode.org/charts/PDF/U0600.pdf) U+0750 to U+077F (http://www.unicode.org/charts/PDF/U0750.pdf) U+08A0 to U+08FF (http://www.unicode.org/charts/PDF/U08A0.pdf) U+FB50 to U+FDFF (http://www.unicode.org/charts/PDF/UFB50.pdf) U+FE70 to U+FEFF (http://www.unicode.org/charts/PDF/UFE70.pdf) U+1EE00 to U+1EEFF (http://www.unicode.org/charts/PDF/U1EE00.pdf) Note: This page may contain IPA phonetic symbols. Arabic alphabet ا ب ت ث ج ح خ د ذ ر ز س ش ص ض ط ظ ع en.wikipedia.org/wiki/Arabic_alphabet 1/20 2/14/13 Arabic alphabet - Wikipedia, the free encyclopedia غ ف ق ك ل م ن ه و ي History · Transliteration ء Diacritics · Hamza Numerals · Numeration V · T · E (//en.wikipedia.org/w/index.php?title=Template:Arabic_alphabet&action=edit) Contents 1 Consonants 1.1 Alphabetical order 1.2 Letter forms 1.2.1 Table of basic letters 1.2.2 Further notes
    [Show full text]
  • A Sociolinguistic Analysis of the Use of Arabizi in Social Media Among Saudi Arabians
    International Journal of English Linguistics; Vol. 9, No. 6; 2019 ISSN 1923-869X E-ISSN 1923-8703 Published by Canadian Center of Science and Education A Sociolinguistic Analysis of the Use of Arabizi in Social Media Among Saudi Arabians Ashwaq Alsulami1 1 School of Languages, Literatures, and Linguistics, Bangor University, Wales, United Kingdom Correspondence: Ashwaq Alsulami, School of Languages, Literatures, and Linguistics, Bangor University, Wales, United Kingdom. E-mail: [email protected] Received: August 27, 2019 Accepted: September 20, 2019 Online Published: October 28, 2019 doi:10.5539/ijel.v9n6p257 URL: https://doi.org/10.5539/ijel.v9n6p257 Abstract The aim of this sociolinguistically-oriented study is to explore the Arabizi phenomenon which is characterized by spelling Arabic words using the Latin script. It is prevalent in the text-based computer-mediated communications among Saudi Arabians. The study focuses on why Arabizi is used, how, particularly in respect to with whom and in which topics, it is used, the attitudes of its users toward its use and the perceived advantages and disadvantages of its use. Using an online survey, data were collected from 241 participants, 72 of which were users of Arabizi. The findings revealed that the primary reasons for using Arabizi were its being a communication code among youths and a compensation for the lack of Arabic keyboard from technological devices as well as being more expressive than Arabic language. It was also found that Arabizi was primarily used to communicate with friends and individuals of the same age, but not with parents and older people or in formal relationships.
    [Show full text]
  • An Improved Arabic Keyboard Layout
    Sci.Int.(Lahore),33(1),5-15,2021 ISSN 1013-5316; CODEN: SINTE 8 5 AN IMPROVED ARABIC KEYBOARD LAYOUT 1Amjad Qtaish, 2Jalawi Alshudukhi, 3Badiea Alshaibani, 4Yosef Saleh, 5Salam Bazrawi College of Computer Science and Engineering, University of Ha'il, Ha'il, Saudi Arabia. [email protected], [email protected], [email protected], [email protected], [email protected] ABSTRACT: One of the most important human–machine interaction (HMI) systems is the computer keyboard. The keyboard layout (KL) dictates how a person interacts with a physical keyboard through the way in which the letters, numbers, punctuation marks, and symbols are mapped and arranged on the keyboard. Mapping letters onto the keys of a keyboard is complex because many issues need to be taken into considerations, such as the nature of the language, finger fatigue, hand balance, typing speed, and distance traveled by fingers during typing and finger movements. There are two main kinds of KL: English and Arabic. Although numerous research studies have proposed different layouts for the English keyboard, there is a lack of research studies that focus on the Arabic KL. To address this lack, this study analyzed and clarified the limitations of the standard legacy Arabic KL. Then an efficient Arabic KL was proposed to overcome the limitations of the current KL. The frequency of Arabic letters and bi-gram probabilities were measured on a large Arabic corpus in order to assess the current KL and to design the improved Arabic KL. The improved Arabic KL was then evaluated and compared against the current KL in terms of letter frequency, finger-travel distance, hand and finger balance, bi-gram frequency, row distribution, and most frequent words.
    [Show full text]
  • Arabizi – Help Or Harm?
    ARABIZI – HELP OR HARM? ANALYSIS OF THE IMPACTS OF ARABIZI -THREAT OR BENEFIT TO THE WRITTEN ARABIC LANGUAGE? Thesis Submitted to The College of Arts and Sciences of the UNIVERSITY OF DAYTON In Partial Fulfillment of the Requirements for The Degree of Master of Arts in English By Abdurazag Ahmed Saide M.S. Dayton, Ohio December 2019 ARABIZI – HELP OR HARM? AN ANALYSIS OF THE IMPACTS OF ARABIZI - THREAT OR BENEFIT TO THE WRITTEN ARABIC LANGUAGE? Name: Saide, Abdurazag Ahmed APPROVED BY: ____________________________________ Bryan Bardine, Ph.D. Faculty Advisor. ____________________________________ Thomas Wendorf, Ph.D. Faculty Advisor ____________________________________ Kirsten Mendoza, Ph.D. Faculty Advisor ii © Copyright by Abdurazag Saide All rights reserved 2019 iii ABSTRACT ARABIZI – HELP OR HARM? AN ANALYSIS OF THE IMPACTS OF ARABIZI - THREAT OR BENEFIT TO THE WRITTEN ARABIC LANGUAGE? Name: Saide, Abdurazag Ahmed University of Dayton Advisor: Dr. Bryan Bardine In the 21st century, the Arab world has undergone significant change, which has had profound impacts in the lives of Arab people throughout many occupations, fields, and locations. New technological advancements, communication innovations, and globalization efforts have been the core drivers to open the Arab world to Western civilization. The Arab world is characterized by unique cultural traditions, moral principles, and religious beliefs that differ greatly from those of the West. However, the globalization movement has had multiple impacts on Arab culture and the Arabic language. When comparing the Arabic language with the languages of Latin origin, it becomes clear that there exists multiple distinctions, especially in their writings and language. These distinctions did not play such a serious role several centuries ago, but with the increased cooperation between Arab nations and the Western world, these language dissimilarities have become well-recognized.
    [Show full text]
  • Arabic Alphabet 1 Arabic Alphabet
    Arabic alphabet 1 Arabic alphabet Arabic abjad Type Abjad Languages Arabic Time period 400 to the present Parent systems Proto-Sinaitic • Phoenician • Aramaic • Syriac • Nabataean • Arabic abjad Child systems N'Ko alphabet ISO 15924 Arab, 160 Direction Right-to-left Unicode alias Arabic Unicode range [1] U+0600 to U+06FF [2] U+0750 to U+077F [3] U+08A0 to U+08FF [4] U+FB50 to U+FDFF [5] U+FE70 to U+FEFF [6] U+1EE00 to U+1EEFF the Arabic alphabet of the Arabic script ﻍ ﻉ ﻅ ﻁ ﺽ ﺹ ﺵ ﺱ ﺯ ﺭ ﺫ ﺩ ﺥ ﺡ ﺝ ﺙ ﺕ ﺏ ﺍ ﻱ ﻭ ﻩ ﻥ ﻡ ﻝ ﻙ ﻕ ﻑ • history • diacritics • hamza • numerals • numeration abjadiyyah ‘arabiyyah) or Arabic abjad is the Arabic script as it is’ ﺃَﺑْﺠَﺪِﻳَّﺔ ﻋَﺮَﺑِﻴَّﺔ :The Arabic alphabet (Arabic codified for writing the Arabic language. It is written from right to left, in a cursive style, and includes 28 letters. Because letters usually[7] stand for consonants, it is classified as an abjad. Arabic alphabet 2 Consonants The basic Arabic alphabet contains 28 letters. Adaptations of the Arabic script for other languages added and removed some letters, such as Persian, Ottoman, Sindhi, Urdu, Malay, Pashto, and Arabi Malayalam have additional letters, shown below. There are no distinct upper and lower case letter forms. Many letters look similar but are distinguished from one another by dots (’i‘jām) above or below their central part, called rasm. These dots are an integral part of a letter, since they distinguish between letters that represent different sounds.
    [Show full text]
  • The Unicode Standard, Version 3.0, Issued by the Unicode Consor- Tium and Published by Addison-Wesley
    The Unicode Standard Version 3.0 The Unicode Consortium ADDISON–WESLEY An Imprint of Addison Wesley Longman, Inc. Reading, Massachusetts · Harlow, England · Menlo Park, California Berkeley, California · Don Mills, Ontario · Sydney Bonn · Amsterdam · Tokyo · Mexico City Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and Addison-Wesley was aware of a trademark claim, the designations have been printed in initial capital letters. However, not all words in initial capital letters are trademark designations. The authors and publisher have taken care in preparation of this book, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode®, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. If these files have been purchased on computer-readable media, the sole remedy for any claim will be exchange of defective media within ninety days of receipt. Dai Kan-Wa Jiten used as the source of reference Kanji codes was written by Tetsuji Morohashi and published by Taishukan Shoten. ISBN 0-201-61633-5 Copyright © 1991-2000 by Unicode, Inc. All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording or other- wise, without the prior written permission of the publisher or Unicode, Inc.
    [Show full text]
  • Arabic Diacritics 1 Arabic Diacritics
    Arabic diacritics 1 Arabic diacritics Arabic alphabet ﻱ ﻭ ﻩ ﻥ ﻡ ﻝ ﻙ ﻕ ﻑ ﻍ ﻉ ﻅ ﻁ ﺽ ﺹ ﺵ ﺱ ﺯ ﺭ ﺫ ﺩ ﺥ ﺡ ﺝ ﺙ ﺕ ﺏ ﺍ Arabic script • History • Transliteration • Diacritics • Hamza • Numerals • Numeration • v • t [1] • e 〈ﺗَﺸْﻜِﻴﻞ〉 i‘jām, consonant pointing), and tashkil) 〈ﺇِﻋْﺠَﺎﻡ〉 The Arabic script has numerous diacritics, including i'jam .(〈ﺣَﺮَﻛَﺔ〉 vowel marks; singular: ḥarakah) 〈ﺣَﺮَﻛَﺎﺕ〉 tashkīl, supplementary diacritics). The latter include the ḥarakāt) The Arabic script is an impure abjad, where short consonants and long vowels are represented by letters but short vowels and consonant length are not generally indicated in writing. Tashkīl is optional to represent missing vowels and consonant length. Modern Arabic is nearly always written with consonant pointing, but occasionally unpointed texts are still seen. Early texts such as the Qur'an were initially written without pointing, and pointing was added later to determine the expected readings and interpretations. Tashkil (marks used as phonetic guides) The literal meaning of tashkīl is 'forming'. As the normal Arabic text does not provide enough information about the correct pronunciation, the main purpose of tashkīl (and ḥarakāt) is to provide a phonetic guide or a phonetic aid; i.e. show the correct pronunciation. It serves the same purpose as furigana (also called "ruby") in Japanese or pinyin or zhuyin in Mandarin Chinese for children who are learning to read or foreign learners. The bulk of Arabic script is written without ḥarakāt (or short vowels). However, they are commonly used in some al-Qur’ān). It is not) 〈ﺍﻟْﻘُﺮْﺁﻥ〉 religious texts that demand strict adherence to pronunciation rules such as Qur'an al-ḥadīth; plural: aḥādīth) as well.
    [Show full text]
  • (ISO)'S Arabic Transliteration Scheme an Improvement Over Library Of
    Is the Organization for Standardization (ISO)’s Arabic Transliteration Scheme an Improvement over Library of Congress’? Blair Kuntz University of Toronto he use of library transliteration of foreign languages has always T generated controversy and debate. For those opposed to translit- eration, especially in an age where computerization has introduced Uni- code in which native scripts can be displayed, entered, and searched in library catalogues, the practice is wholly unsatisfactory, serves no-one, and should probably be abolished. The critics point out that, especially for native users of languages with non-Roman scripts, searching for data in transliterated script is time-consuming and frustrating. Bilingual and multi-lingual catalogues, they note, have rendered transliteration unnecessary and obsolete. Why would a native speaker even bother searching for an item using transliteration, when searching using the original script is so much more reliable and efficient? For opponents of transliteration, transliteration is unreliable and serves neither librari- ans, bibliographers nor users of bibliographic systems. The time spent transliterating text in a record, when it could simply be entered in its native script, is wasteful and unproductive. Those who support library transliteration, even with the adoption of Unicode, however, argue that there are many reasons to continue the practice, inefficient as it is. Most of the arguments in favor of transliter- ation assert that while it is useless for native speakers to use translitera- tion, many other people need to search for records in any particular lan- guage employing non-Roman script. For example, transliteration is the only realistic way for Western-speaking librarians to maintain control over non-Roman materials.
    [Show full text]
  • Jan Just Witkam & A.G.P. Janson, Multi-Lingual
    Multi-Lingual Scholar'^ and Font Scholàr'^, a word-processorand a tool for font design for the student of Orientallanguages A testfu JanJast Vitkam dy A.G.P. Janson* l. Murrr-LTNGUALScHOLAR''. rne Wono-PnocessoR up to l5 printer fonts (5 Roman, 4 Hebrew, 2 Greek.2 Cyrillic and 2 Arabic fonts); thus with these five When comparedto other word-processors,what extras alphabetsone can write, edit and print in dozens of doesthis program offer its usersand what is neededto different languages.The program works in a purely operateit? The most conspicuousextra feature of this graphicalway. so that it offersnumerous other features program is that it enabiesthe userto work on the same which are seldomavailable in other word-processing documentconcurrently with up to five different alpha- software.One manifestexample of suchan extra featu- bets of one'sown choice- provided.of course.that re is the option ol uorking eitheruith largeon-screen they areinstalled. These alphabets can be mixedfreely'. characters(in ,10columns) or nith smallercharacters irrespectiveof the direction of u'riting. These Ín'e (in the usual80 columns).Workin-s *'ith largecharac- alphabetsinclude all sortsof accentsand vow'elsigns. tershas the adranta-qethat all the rouels. accentsand for both Europeanand non-Europeanlanguages. and the iike can be typed and seen on the screen.Using eachhas multiple font sizesand characterstyles. theo- eitherof thesescreen fonts has no consequencesfor the retically up to 30 fonts per document and in practice, way the document will be printed. since the program with the fonts that are suppliedas standardprocedure, recalculatesevery page for printing according to the user'sspecifications. * Multi-Lingual Scholar'' reviewedby Jan Just Witkam; To operateMLS, one needsto have at leastan reN,ÍPC Font Scholar''reviewed by' A.G.P.
    [Show full text]
  • Inhaltsverzeichnis Use of the Arabic Language and Script in Labdoo Project, Setting up Ubuntu Laptop
    How to install and how use „non latin letters“ language and fonts in the Labdoo project (Arabic, Urdu, Farsi, Hindi, Russian, Tamil, Thai etc.) Inhaltsverzeichnis Use of the Arabic language and script in Labdoo project, setting up ubuntu laptop...........1 Enabling Arabic support..............................................................................................................2 Keyboard layout switching.........................................................................................................5 Changing the text direction.......................................................................................................6 Linux programs with Arabic language support.......................................................................6 Arabic language as a system wide set...........................................................................................8 Keyboard layout Arabic (Algeria, Morocco, Tunisia):..............................................................9 Keyboard layout Arabic (Bahrain, Egypt, Jordan, Kuwait, Lebanon, Oman, Qatar, Saudi Arabia, Syrai, U.A.E, Yemen):.......................................................................................................9 Keyboard layout Urdu (Pakistan):............................................................................................10 Keyboard layout Farsi (Iran):.....................................................................................................10 Keyboard layout Hindi (India):..................................................................................................11
    [Show full text]
  • The Digital Orientalist Keyboard Layouts by L.W.C
    The Digital Orientalist Keyboard Layouts By L.W.C. van Lit *VERSION MARCH 9, 2014* ATTENTION: discrepancies exist between the Mac OSX and Microsoft Windows Arabic keyboard layouts. They are negligible and should become obvious upon usage. This document has been made using the Mac OSX layout. The Latin keyboard layout is only available for Mac OSX. Introduction This document explains how to install and use two keyboard lay-outs useful to all students, scholars and others interested in the Arabic and Persian language, whose primary language is a Western- European one. One will !nd that once installed, these layouts increase productivity tremendously. The Latin layout takes care of characters necessary for proper transliteration of Arabic words and accommodates all major transliteration systems. The Arabic layout allows the user to type Arabic in near-transliterary style, which is more intuitive for Orientalists. Pleaseًًًٛ note that Mac OS X Lion-users do not necessarily need to install the Latin keyboard-layout but only need to modify the regular English keyboard. However, the Latin keyboard-layout proofs to be faster and is therefore recommended. Installation for Windows • Run the Installer !le • On the taskbar, click on the language icon and select ‘add keyboard’ • Look up and select the keyboard • Delete other active Arabic keyboards Installation for Mac OSX Modifying the regular English keyboard Put the Keyboard-en.plist !le in the following folder: /System/Library/Input Methods/PressAndHold.app/Contents/Resources The computer will ask for permission and will ask what you want to do since a !le with the same name already exists.
    [Show full text]