A Framework for Intensive R Developing Cross-Br E Arabic Web Applicati Rowser Data Ions

Total Page:16

File Type:pdf, Size:1020Kb

A Framework for Intensive R Developing Cross-Br E Arabic Web Applicati Rowser Data Ions See discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/261111986 A framework for developing cross-browser data intensive Arabic Web applications Conference Paper · October 2012 DOI: 10.1109/ICCTA.2012.6523547 CITATION READS 1 17 3 authors: Mahmoud Youssef Nourhan Hamdi George Washington University Arab Academy for Science, Technology & Maritime Transport 12 PUBLICATIONS 72 CITATIONS 3 PUBLICATIONS 4 CITATIONS SEE PROFILE SEE PROFILE Salma Rayan University of Strathclyde 2 PUBLICATIONS 2 CITATIONS SEE PROFILE Some of the authors of this publication are also working on these related projects: Enhancing location privacy View project All content following this page was uploaded by Salma Rayan on 20 June 2020. The user has requested enhancement of the downloaded file. A Framework for Developing Cross-Browser Data Intensive Arabic Web Applications Mahmoud Youssef, Nourhan Hamdi, and Salma Rayan Business Information Systems Department Arab Academy for Science, Technology, and Maritime Transport Alexandria, Egypt [email protected], [email protected], and [email protected] Abstract— the frequent encounter of incorrectly functioning The impact of properly functioning Arabic data-intensive Arabic Websites, especially those of large businesses and Websites can be seen on different fronts. From a societal governments calls for a clear and applied framework to help perspective, it enables the spread of e-commerce and e- practitioners develop properly functioning Websites. The issue of government applications in the Arab world with their Website internationalization has been addressed in a plethora of associated benefits. Moreover, it enhances the trust in the standards, guideline, good practices, and tutorials. However, these organizations that own these Websites [19]. And from a standards with their formal language may not be directly comprehensible to the practitioner. In addition, guidelines and technical perspective, it provides the ability to integrate tutorials are usually designed to address one issue requiring the correctly within Web 2.0 mashups, the ability to handle future practitioner to integrate knowledge from different sources. application needs such as linked data, and the ability to Moreover, these standards, guidelines and tutorials are limited to integrate with other components within distributed the Web technologies themselves excluding other parts of Web architectures such as the Service-Oriented Architecture (SOA). application architecture. In this paper, we propose a comprehensive step-by-step framework addressed to the The needs to provide information in languages other than Latin practitioner. The framework integrates knowledge from different and to provide more than one language at the same time are standards and technologies and calls attention to issues that could addressed in computer applications and specifically in Web be overlooked in designing and implementing data-intensive applications under the titles globalization, internationalization Arabic Web applications. (i18n), and localization (l10n) which we explain herein forth. Keywords-component; Internationalization; Arabic; Data- Globalization refers to the ability of an e-commerce intensive; Web applications Website to address global community taking into consideration the diversified needs of this community. I. INTRODUCTION While the discussion of Web applications Internationalization refers to the design of a Website so that internationalization and localization might sound as an it can be adapted to different countries taking into antiquated topic, the frequent encounter of incorrectly consideration their various languages, scripts, and cultures functioning Arabic Websites, especially those of large [21]. businesses and governments (See Figure 1), calls for a clear Localization refers to the adaption of a Website to a and applied framework to help practitioners develop properly specific locale such as Arabic/Egypt. This may impact, among functioning Arabic Websites. other factors, numeric date and time format, currency format, collate and sort order, and calendar system. A basic requirement of a Website is to perform properly across different Web browsers. While browsers are expected to render content in a consistent way, experience has shown that the behavior of Websites may be different with different browsers. Current statistics show that Google Chrome, Mozilla Firefox (FF), Microsoft Internet Explorer (IE), and Apple Safari are the dominant browsers in the market [22]. In this research we examine the proposed framework against these browsers. A data-intensive Website is characterized by its dynamicity, which typically involves architecture of several layers that interact together. Providing Arabic textual information Figure 1. Improperly functioning Website of a major bank correctly on the user interface requires proper interchange of data among these layers as well as proper representation of the Interestingly, numbers in Arabic, whether using Indic or Arabic data at each layer. numerals, are displayed left-to-right adding to the complexity. To address the needs for internationalization and 1) Arabic Character Sets History localization, the research community, through its In order for computers to process and interchange textual standardization organizations, has developed a plethora of data correctly, characters encoding must be standardized. For standards (e.g.,[7], [8], [9]). However, these standards with English language, the American Standard Code for Information their formal language may not be directly comprehensible to Interchange (ASCII) character set has been the standard the practitioner. To make such knowledge accessible, many representation since the early time of computers. ASCII used 7 guidelines and tutorials (e.g., [10], [16], [20]) were developed, bits per character, which allowed 128 characters only. These yet they are usually designed to address one issue requiring the characters included the upper and lower case English alphabet, practitioner to internalize knowledge from different sources. the digits from 0 to 9, and punctuation symbols. As such, Moreover, these standards, guidelines and tutorials are ASCII allowed representation of English text only. frequently limited to the Web technologies themselves excluding other parts of the architecture. With the existence of large amounts of information in ASCII and the need to represent other languages, 8-bit In this paper, we propose a comprehensive step-by-step character sets that extend ASCII were developed and framework addressed to the practitioner that integrates the standardized. They were referred to as Extended ASCII. As 8- knowledge from different standards and technologies and calls bit representations allowed 256 characters (code points) and as attention to issues that could be overlooked in designing and the first 128 of them were always occupied by ASCII, the other implementing data-intensive Arabic Web applications. 128 code points were used to represent one or more languages, Throughout the discussion, we strive to provide solutions that or character graphics, and they were referred to as the extended are applicable to different development environments; set. As an example, ISO-8859-1 character set extends ASCII however, we conducted our experiments, mostly, using open with other Western European languages. source applications and tools. Operating systems utilized Extended ASCII character sets We limit our work to addressing issues related enforcing to provide multilingual support and referred to them as “code proper Arabic text representation, transport, and processing. As pages”. Each user was able to add other locales by selecting such, we exclude discussion on other issues such as calendar their code pages. However, since only one code page can be systems, time zones, date and time, and currency formats. active at time in an application, it was impossible to have The rest of the paper is organized as follows. Section 2 languages from different code pages concurrently. The problem provides background, Section 3 presents related work, Section was not solved until the development of UNICODE [18] 4 introduces the proposed solution, and Section 5 concludes the presented in the next Section. paper. Arabic character set encoding went through several developments. In 1981, the CUDAR-U encoding appeared as II. PRELIMINARIES the first Arabic character set, which used 7 bits per character. Most of the Web standards emphasize the use of the In 1982, the Arab Standards and Metrology Organization globally accepted Unicode character encoding and maintaining (ASMO) produced its first character set standard, AMSO-449, that encoding throughout the different layers of the which used 7 bits per character as well. ASMO-449 then architecture. In this section, we describe the characteristics of became the basis for all subsequent standard sets, playing a role Arabic, the historical development of Arabic character sets and similar to ASCII for Latin characters; however, similar to the current status of the top Arabic Websites. ASCII, it allowed the representation of one language; Arabic in that case. In 1986, ASMO-708 standard appeared. It uses 8 bits A. Arabic Language Representation per character. Later, it became the international standard ISO- When displaying characters, it is important to distinguish 8859-6 and became the code page for the Arabic Macintosh between the characters themselves and their visual
Recommended publications
  • IGP® / VGL Emulation Code V™ Graphics Language Programmer's Reference Manual Line Matrix Series Printers
    IGP® / VGL Emulation Code V™ Graphics Language Programmer’s Reference Manual Line Matrix Series Printers Trademark Acknowledgements IBM and IBM PC are registered trademarks of the International Business Machines Corp. HP and PCL are registered trademarks of Hewlett-Packard Company. IGP, LinePrinter Plus, PSA, and Printronix are registered trademarks of Printronix, LLC. QMS is a registered trademark and Code V is a trademark of Quality Micro Systems, Inc. CSA is a registered certification mark of the Canadian Standards Association. TUV is a registered certification mark of TUV Rheinland of North America, Inc. UL is a registered certification mark of Underwriters Laboratories, Inc. This product uses Intellifont Scalable typefaces and Intellifont technology. Intellifont is a registered trademark of Agfa Division, Miles Incorporated (Agfa). CG Triumvirate are trademarks of Agfa Division, Miles Incorporated (Agfa). CG Times, based on Times New Roman under license from The Monotype Corporation Plc is a product of Agfa. Printronix, LLC. makes no representations or warranties of any kind regarding this material, including, but not limited to, implied warranties of merchantability and fitness for a particular purpose. Printronix, LLC. shall not be held responsible for errors contained herein or any omissions from this material or for any damages, whether direct, indirect, incidental or consequential, in connection with the furnishing, distribution, performance or use of this material. The information in this manual is subject to change without notice. This document contains proprietary information protected by copyright. No part of this document may be reproduced, copied, translated or incorporated in any other material in any form or by any means, whether manual, graphic, electronic, mechanical or otherwise, without the prior written consent of Printronix, LLC.
    [Show full text]
  • IS0 9036 : 1987 (E) This Is a Preview - Click Here to Buy the Full Publication
    This is a preview - click here to buy the full publication IS0 INTERNATIONAL STANDARD 9036 First edition 1987-04-H INTERNATIONAL ORGANIZATION FOR STANDARDIZATION ORGANISATION INTERNATIONALE DE NORMALISATION MEXflYHAPOflHAFl OPI-AHM3AuMR l-l0 CTAHJJAPTM3A~MM Information processing - Arabic 7-bit coded character set for information interchange Jeu de caractkres arabes cod& A 7 Mments pour Ochange d/information Reference number IS0 9036 : 1987 (E) This is a preview - click here to buy the full publication Foreword IS0 (the International Organization for Standardization) is a worldwide federation of national standards bodies (IS0 member bodies). The work of preparing International Standards is normally carried out through IS0 technical committees. Each member body interested in a subject for which a technical committee has been established has the right to be represented on that committee. International organizations, govern- mental and non-governmental, in liaison with ISO, also take part in the work. Draft International Standards adopted by the technical committees are circulated to the member bodies for approval before their acceptance as International Standards by the IS0 Council. They are approved in accordance with IS0 procedures requiring at least 75 % approval by the member bodies voting. International Standard IS0 9636 was prepared by Technical Committee ISO/TC 97, Information processing systems. Users should note that all International Standards undergo revision from time to time and that any reference made herein to any other International Standard implies its latest edition, unless otherwise stated. 0 International Organization for Standardization, 1987 Printed in Switzerland This is a preview - click here to buy the full publication IS0 9036: 1987 (El Contents Page 1 Scope and field of application ........................................
    [Show full text]
  • Le Codage Informatique De L'écriture Arabe : D'asmo 449 À Unicode Et
    Le codage informatique de l’écriture arabe : d’ASMO 449 à Unicode et ISO/CEI 10646 Rachid Zghibi Département documentation Université Paris 8 Maison des Sciences de l’Homme Paris Nord 4, rue de la Croix Faron, Plaine Saint-Denis F-93210 Saint-Denis [email protected] RÉSUMÉ. Le traitement de l’écriture arabe par le standard Unicode et la norme ISO/CEI 10646 constitue l’objet de cet article. Nous présentons d’abord les principaux problèmes liés au codage informatique de l’écriture arabe, puis nous montrons comment Unicode résout ces problèmes. ABSTRACT. The present paper deals with the treatment of arabic script by the Unicode and ISO/CEI 10646. First, we shall outline the principal problems related to the encoding of arabic script. Then we shall show how Unicode resolves this problems. MOTS-CLÉS : écriture arabe, Unicode, ISO/CEI 10646, codage. KEYWORDS: arabic script, Unicode, ISO/CEI 10646, encoding. Document numérique. Volume 6 – n° 3-4/2002, pages 155 à 182 Cet article des Editions Lavoisier est disponible en acces libre et gratuit sur dn.revuesonline.com 156 DN – 6/2002. Unicode, écriture du monde ? 1. Écriture arabe 1.1.Origine L’arabe fait partie des langues chamito-sémitique et plus précisément, à l’intérieur de cet ensemble, du groupe des langues sémitiques. Ce groupe se divise en sémitique oriental (avec l’akkadien), sémitique occidental ou du nord (cananéen, phénicien, hébreu, araméen) et sémitique méridional. L’arabe relève de ce dernier groupe avec l’éthiopien et le sud-arabique. À la différence d’autres nations, comme les anciens égyptiens, les babyloniens et les chinois dont les systèmes d’écriture remontent à des milliers d’années, l’écriture arabe n’est apparue qu’au VIe siècle.
    [Show full text]
  • Toth 2017.Pdf
    Terminologie & Ontologie : Théories et Applications Actes de la conférence TOTh 2017 Chambéry – 8 & 9 juin 2017 Publications précédentes TOTh 2007 Actes de la première conférence TOTh - Annecy - er1 juin 2007 TOTh 2008 Actes de la deuxième conférence TOTh - Annecy – 5 et 6 juin 2008 TOTh 2009 Actes de la troisième conférence TOTh - Annecy – 4 et 5 juin 2009 TOTh 2010 Actes de la quatrième conférence TOTh - Annecy – 3 et 4 juin 2010 TOTh 2011 Actes de la cinquième conférence TOTh - Annecy – 24 et 25 mai 2011 TOTh 2012 Actes de la première conférence TOTh - Chambéry – 7 et 8 juin 20012 TOTh 2013 Actes de la deuxième conférence TOTh - Chambéry – 6 et 7 juin 2013 TOTh 2014 Actes de la troisième conférence TOTh - Chambéry – 12 et 13 juin 2014 TOTh 2015 Actes de la quatrième conférence TOTh - Chambéry – 2 et 3 juin 2015 TOTh 2016 Actes de la cinquième conférence TOTh - Chambéry – 9 et 10 juin 2016 Pour les ouvrages publiés ci-dessus, commandes à adresser à : [email protected] Éditeur : Université Savoie Mont Blanc 27 rue Marcoz BP 1104 73011 CHAMBÉRY CEDEX www.univ-smb.fr Réalisation : S. Carvalho & C. Brun Collection « Terminologica » ISBN : 978-2-919732-80-7 ISSN : en cours Dépôt légal : janvier 2018 Terminologie & Ontologie : Théories et Applications Actes de la conférence TOTh 2017 Chambéry – 8 & 9 juin 2017 http://toth.condillac.org avec le soutien de : Université Savoie Mont Blanc Association Européenne de Terminologie École d’ingénieurs Polytech Annecy Chambéry Université Savoie Mont Blanc Collection « Terminologica » Comité scientifique Président du Comité scientifique : Christophe Roche Comité de pilotage Rute Costa Universidade Nova de Lisboa Humbley John Université Paris 7 Kockaert Hendrik University of Leuven Christophe Roche Université Savoie Mont Blanc Comité de programme 2017 Le comité de programme est constitué chaque année à partir du comité scientifique de TOTh en fonction des soumissions reçues.
    [Show full text]
  • Technical Advisory Group on Machine Readable Travel Documents (Tag/Mrtd) Twentieth Meeting
    TAG/MRTD/20-WP/17 6/09/11 English Only International Civil Aviation Organization WORKING PAPER TECHNICAL ADVISORY GROUP ON MACHINE READABLE TRAVEL DOCUMENTS (TAG/MRTD) TWENTIETH MEETING Montréal, 7 to 9 September 2011 Agenda Item 2: Activities of the NTWG Agenda Item 2.9: Transliteration Rules (Arabic) TRANSLITERATION RULES (ARABIC) (Presented by the NTWG) 1. INTRODUCTION 1.1 The purpose of the working paper is to inform the TAG of the progress made on the recommended transliteration of Arabic national characters for Doc Doc 9303, Machine Readable Travel Documents , Part 1 — Machine Readable Passports , Volume1, Section IV, Appendix 9. These recommended transliterations apply to the MRZ only. 2. BACKGROUND 2.1 Transliteration of Arabic characters applies mainly to the name of the holder of the MRTD, and is critical for determining the true identity of the holder. At present, Arabic names are transcribed (phonetically) to Latin characters on an ad-hoc basis and this compromises identity management. 2.2 The only true and reliable source of identification is the original form of the name in Arabic characters. Thus, any transliteration scheme must preserve the original form of the name. 2.3 It would be advantageous to countries using the Arabic script if encoding the name in the MRZ led to machine reading producing the original form of the name in Arabic. This would allow these countries to gain the benefit of machine reading without having to deal with an intermediate form of the name as an inaccurate Latin transcription. 2.4 There is no known existing transliteration scheme for Arabic script that is suitable for the MRZ.
    [Show full text]
  • ALM : Arabic Language Module
    ALM : Arabic Language Module Product Presentation by Franck Portaneri [email protected] ALM What is ALM ? IRIX Arabic Interface support for : Open for Third Party Applications : − TTY − Immediat for TTY − GUI X11/Motif − Easy for Pure X11/Motif − Printing − Complex for GIS or DTP Technical Aspects IRIX Arabic supplement for : Future and Evolution : − Indigo Magic Desktop − IRIX 6.2 − MediaMail − O2 and IRIX 6.3 / 6.4 − Web − Internet ALM TTY : Character based Interface For Old Server applications Cheaper than Arabic Terminals Run under xterm and winterm Full Transparency ALM GUI: X Window Interface X11R5 / X11R6 Motif 1.2.4 − RTL Text Widget − Cut’n paste − Drag’n drop Basic X font set Transparency with Resource configuration ALM Printing support Basic Line printer set: − HP laserjet − HP deskjet − IBM proprinter − Epson LQ − Fujitsu DL Postscript : − aa2ps − amp ALM Indigo Magic Desktop Localization of IRIX tools − Toolchest − File Manager − Window Manager − MoviePlayer − AudioPlayer − Ieditor − Bitmap − xfontsel − mouse − ... Done by Resource file customization : /usr/lib/X11/ar/app−defaults/* ALM Media Mail : Z−Mail Full Localization of menus Arabic Mail exchange with other ALM users (in ISO 8859−6) Mime compliant "quoted printable" encoding Printing support ALM WWW support AraMosaic Web browser: − X Mosaic enhancement − ISO 8859−6 HTML (same as MAC encoding) − Postscript Printing − Cut’n Paste with ALM − Internet and Intranet Limitations: − No java − No Frames −... ALM WWW support Netscape 2.x and up − ISO 8859−6 HTML Page browsing − RTL with HTML markup − Java and Javascript − Frames − Animated Gifs Limitations − No Cut’n Paste − Pseudo RTL aligment − No Printing support ALM Third Party Applications Immediat for TTY Applications (Like with Arabic Terminals) Easy for X11/Motif Applications with resource file configuration −−> Text editing job Exemple: − MediaMail − Oracle Forms and Report − Oracle Forms Designer..
    [Show full text]
  • City Research Online
    City Research Online City, University of London Institutional Repository Citation: Ali, S.A.H. (1995). The impact of computer technology on accounting and auditing in the Middle East with special emphasis on Arabisation, transfer of technology and training. (Unpublished Doctoral thesis, City University London) This is the accepted version of the paper. This version of the publication may differ from the final published version. Permanent repository link: https://openaccess.city.ac.uk/id/eprint/7737/ Link to published version: Copyright: City Research Online aims to make research outputs of City, University of London available to a wider audience. Copyright and Moral Rights remain with the author(s) and/or copyright holders. URLs from City Research Online may be freely distributed and linked to. Reuse: Copies of full items can be used for personal research or study, educational, or not-for-profit purposes without prior permission or charge. Provided that the authors, title and full bibliographic details are credited, a hyperlink and/or URL is given for the original metadata page and the content is not changed in any way. City Research Online: http://openaccess.city.ac.uk/ [email protected] CITY University BUSINESS SCHOOL The Impact of Computer Technology on Accounting and Auditing in the Middle East with Special Emphasis on Arabisation, Transfer of Technology and Training Sami Abbas Hussain Ali lq016 A Thesis Submitted for the Degree of Doctor of Philosophy VOLUME I CONTENTS Page List of Figures 5 List of Tables 6-11 Acknowledgments
    [Show full text]
  • International Register of Coded Character Sets to Be Used with Escape Sequences for Information Interchange in Data Processing
    INTERNATIONAL REGISTER OF CODED CHARACTER SETS TO BE USED WITH ESCAPE SEQUENCES 1 Introduction 1.1 General This document is the ISO International Register of Coded Character Sets To Be Used With Escape Sequences for information interchange in data processing. It is compiled in accordance with the provisions of ISO/IEC 2022, "Code Extension Technique" and of ISO 2375 "Procedure for Registration of Escape Sequences". This International Register contains coded character sets which have been registered in accordance with procedures given in ISO 2375. Its purpose is to identify widely used coded character sets and associate with each a unique escape sequence by means of which it can be designated according to ISO/IEC 2022 and ISO/IEC 4873. The publication of this International Register should promote compatibility in international information interchange and avoid duplication of effort in developing application-oriented coded character sets. Registration provides an identification for a coded character set but implies nothing about its status; it may or may not be part of a standard of an international, national or a corporate body. However, if such a standard is published subsequently to the registration, it would be appropriate for the escape sequence identifying the character set to be specified in the standard. If it is desired to register a set, application should be made to the Registration Authority through an appropriate Sponsoring Authority as specified in ISO 2375. Any character set can be a candidate for registration if it meets the requirements of ISO 2375. The Registration Authority ascertains that the proposals received are formally in accordance with this International Standard, technically in accordance with ISO/IEC 2022, and, where applicable, with ISO/IEC 646 and ISO/IEC 4873, and meet the presentation practice of the Registration Authority.
    [Show full text]
  • Character Sets Reference Manual for Line Matrix Printers
    R Character Sets Reference Manual for Line Matrix Printers Character Sets Reference Manual for Line Matrix Printers R P/N 164308–001, Rev B Printronix, Inc. makes no representations or warranties of any kind regarding this material, including, but not limited to, implied warranties of merchantability and fitness for a particular purpose. Printronix, Inc. shall not be held responsible for errors contained herein or any omissions from this material or for any damages, whether direct, indirect, incidental or consequential, in connection with the furnishing, distribution, performance or use of this material. The information in this manual is subject to change without notice. This document contains proprietary information protected by copyright. No part of this document may be reproduced, copied, translated or incorporated in any other material in any form or by any means, whether manual, graphic, electronic, mechanical or otherwise, without the prior written consent of Printronix, Inc. All rights reserved. TRADEMARK ACKNOWLEDGMENTS Printronix, LinePrinter Plus, PGL and IGP are registered trademarks of Printronix, Inc. DEC is a registered trademark of Digital Equipment Corporation. Epson is a registered trademark of Seiko Epson. IBM is a registered trademark of Internation Business Machines Corporation. Proprinter is a registered trademark of IBM. Scalable type outlines are licensed from Agfa Corporation. Agfa is a registered trademark of Agfa Division, Miles Incorporated (Agfa). CG, Garth Graphic, Intellifont, and Type Director are registered trademarks of Agfa Corporation, and Shannon and CG Triumvirate are trademarks of Agfa Corporation. CG Bodoni, CG Century Schoolbook, CG Goudy Old Style, CG Melliza, Microstyle, CG Omega, and CG Palacio are products of Agfa Corporation.
    [Show full text]
  • Middle East-I 9 Modern and Liturgical Scripts
    The Unicode® Standard Version 13.0 – Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and the publisher was aware of a trade- mark claim, the designations have been printed with initial capital letters or in all capitals. Unicode and the Unicode Logo are registered trademarks of Unicode, Inc., in the United States and other countries. The authors and publisher have taken care in the preparation of this specification, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. © 2020 Unicode, Inc. All rights reserved. This publication is protected by copyright, and permission must be obtained from the publisher prior to any prohibited reproduction. For information regarding permissions, inquire at http://www.unicode.org/reporting.html. For information about the Unicode terms of use, please see http://www.unicode.org/copyright.html. The Unicode Standard / the Unicode Consortium; edited by the Unicode Consortium. — Version 13.0. Includes index. ISBN 978-1-936213-26-9 (http://www.unicode.org/versions/Unicode13.0.0/) 1.
    [Show full text]
  • Bibliographic Access to Non-Roman Scripts in Library Opacs: a Study of Selected Arl Academic Libraries in the United States
    BIBLIOGRAPHIC ACCESS TO NON-ROMAN SCRIPTS IN LIBRARY OPACS: A STUDY OF SELECTED ARL ACADEMIC LIBRARIES IN THE UNITED STATES By Ali Kamal Shaker BA—Cairo University-Egypt, 1995 MLIS—University of Wisconsin-Milwaukee, 2000 Submitted to the Graduate Faculty of School of Information Sciences in partial fulfillment of the requirements for the degree of Doctor of Philosophy University of Pittsburgh 2002 ii BIBLIOGRAPHIC ACCESS TO NON-ROMAN SCRIPTS IN LIBRARY OPACS A STUDY OF SELECTED ARL ACADEMIC LIBRARIES IN THE UNITED STATES Ali K. Shaker, PhD University of Pittsburgh, 2002 With the increasing availability of non-Roman script materials in academic/research libraries, bibliographic access to vernacular characters in library OPACs becomes one of the primary means for users of these materials to access and use them efficiently. Though Romanization, as a bibliographic control tool, has been studied extensively during the past five decades, investigations of using original (vernacular) scripts remain inadequate. The purpose of this study was to trace the transition from Romanization-based to vernacular-based bibliographic access to non-Roman script materials. Two major developments contributing to this transition were the availability of records with script characters from bibliographic utilities, and the development of a universal character set, the Unicode standard. The main data collection instrument was a self-administered mail questionnaire sent to a purposive sample of academic library members in the Association of Research Libraries with sizeable non-Roman script collections. Another data collection technique utilized was document/Web site analysis of bibliographic utilities and library automation vendors. Forty five questionnaires were obtained, which represented 65% of an actual population of 69 libraries.
    [Show full text]
  • Printronix ACA Emulation Programmer's Reference Manual
    ACA Emulation Programmer’s Reference Manual Trademark Acknowledgements Advanced Function Printing is a trademark of International Business Machines Corporation. AIX, AS/400, OS/2, and SAA are registered trademarks of International Business Machines Corporation. ANSI is a registered trademark of American National Standards Institute, Inc. Bar Code Object Content Architecture and BCOCA are trademarks of International Business Machines Corporation. IBM is a registered trademark of International Business Machines Corporation. Intelligent Printer Data Stream and IPDS are trademarks of International Business Machines Corporation. LinePrinter Plus is a registered trademark of Printronix, Inc. Printronix is a registered trademark of Printronix, Inc. PSA is a registered trademark of Printronix, Inc. Printronix, Inc. makes no representations or warranties of any kind regarding this material, including, but not limited to, implied warranties of merchantability and fitness for a particular purpose. Printronix, Inc. shall not be held responsible for errors contained herein or any omissions from this material or for any damages, whether direct, indirect, incidental or consequential, in connection with the furnishing, distribution, performance or use of this material. The information in this manual is subject to change without notice. This document contains proprietary information protected by copyright. No part of this document may be reproduced, copied, translated or incorporated in any other material in any form or by any means, whether manual, graphic,
    [Show full text]