A Sociolinguistic Analysis of the Use of Arabizi in Social Media Among Saudi Arabians
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
DIGITAL TYPOGRAPHY USING LATEX Springer New York Berlin Heidelberg Hong Kong London Milan Paris Tokyo Apostolos Syropoulos Antonis Tsolomitis Nick Sofroniou
DIGITAL TYPOGRAPHY USING LATEX Springer New York Berlin Heidelberg Hong Kong London Milan Paris Tokyo Apostolos Syropoulos Antonis Tsolomitis Nick Sofroniou DIGITAL TYPOGRAPHY USING LATEX With 68 Illustrations Apostolos Syropoulos Antonis Tsolomitis 366, 28th October St. Dept. of Mathematics GR-671 00 Xanthi University of the Aegean GREECE GR-832 00 Karlobasi, Samos [email protected] GREECE [email protected] Nick Sofroniou Educational Research Centre St. Patrick’s College Drumcondra, Dublin 9 IRELAND [email protected] Library of Congress Cataloging-in-Publication Data Syropoulos, Apostolos. Digital typography using LaTeX / Apostolos Syropoulos, Antonis Tsolomitis, Nick Sofroniou. p. cm. Includes bibliographical references and indexes. ISBN 0-387-95217-9 (acid-free paper) 1. LaTeX (Computer file) 2. Computerized typesetting. I. Tsolomitis, Antonis. II. Sofroniou, Nick. III. Title. Z253.4.L38 S97 2002 686.2´2544—dc21 2002070557 ACM Computing Classification (1998): H.5.2, I.7.2, I.7.4, K.8.1 ISBN 0-387-95217-9 (alk. paper) Printed on acid-free paper. Printed on acid-free paper. © 2003 Springer-Verlag New York, Inc. All rights reserved. This work may not be translated or copied in whole or in part without the written permission of the publisher (Springer-Verlag New York, Inc., 175 Fifth Avenue, New York, NY 10010, USA), except for brief excerpts in connection with reviews or scholarly analysis. Use in connection with any form of information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed is forbidden. The use in this publication of trade names, trademarks, service marks, and similar terms, even if they are not identified as such, is not to be taken as an expression of opinion as to whether they are subject to proprietary rights. -
Technical Reference Manual for the Standardization of Geographical Names United Nations Group of Experts on Geographical Names
ST/ESA/STAT/SER.M/87 Department of Economic and Social Affairs Statistics Division Technical reference manual for the standardization of geographical names United Nations Group of Experts on Geographical Names United Nations New York, 2007 The Department of Economic and Social Affairs of the United Nations Secretariat is a vital interface between global policies in the economic, social and environmental spheres and national action. The Department works in three main interlinked areas: (i) it compiles, generates and analyses a wide range of economic, social and environmental data and information on which Member States of the United Nations draw to review common problems and to take stock of policy options; (ii) it facilitates the negotiations of Member States in many intergovernmental bodies on joint courses of action to address ongoing or emerging global challenges; and (iii) it advises interested Governments on the ways and means of translating policy frameworks developed in United Nations conferences and summits into programmes at the country level and, through technical assistance, helps build national capacities. NOTE The designations employed and the presentation of material in the present publication do not imply the expression of any opinion whatsoever on the part of the Secretariat of the United Nations concerning the legal status of any country, territory, city or area or of its authorities, or concerning the delimitation of its frontiers or boundaries. The term “country” as used in the text of this publication also refers, as appropriate, to territories or areas. Symbols of United Nations documents are composed of capital letters combined with figures. ST/ESA/STAT/SER.M/87 UNITED NATIONS PUBLICATION Sales No. -
Arabic Alphabet - Wikipedia, the Free Encyclopedia Arabic Alphabet from Wikipedia, the Free Encyclopedia
2/14/13 Arabic alphabet - Wikipedia, the free encyclopedia Arabic alphabet From Wikipedia, the free encyclopedia َأﺑْ َﺠ ِﺪﯾﱠﺔ َﻋ َﺮﺑِﯿﱠﺔ :The Arabic alphabet (Arabic ’abjadiyyah ‘arabiyyah) or Arabic abjad is Arabic abjad the Arabic script as it is codified for writing the Arabic language. It is written from right to left, in a cursive style, and includes 28 letters. Because letters usually[1] stand for consonants, it is classified as an abjad. Type Abjad Languages Arabic Time 400 to the present period Parent Proto-Sinaitic systems Phoenician Aramaic Syriac Nabataean Arabic abjad Child N'Ko alphabet systems ISO 15924 Arab, 160 Direction Right-to-left Unicode Arabic alias Unicode U+0600 to U+06FF range (http://www.unicode.org/charts/PDF/U0600.pdf) U+0750 to U+077F (http://www.unicode.org/charts/PDF/U0750.pdf) U+08A0 to U+08FF (http://www.unicode.org/charts/PDF/U08A0.pdf) U+FB50 to U+FDFF (http://www.unicode.org/charts/PDF/UFB50.pdf) U+FE70 to U+FEFF (http://www.unicode.org/charts/PDF/UFE70.pdf) U+1EE00 to U+1EEFF (http://www.unicode.org/charts/PDF/U1EE00.pdf) Note: This page may contain IPA phonetic symbols. Arabic alphabet ا ب ت ث ج ح خ د ذ ر ز س ش ص ض ط ظ ع en.wikipedia.org/wiki/Arabic_alphabet 1/20 2/14/13 Arabic alphabet - Wikipedia, the free encyclopedia غ ف ق ك ل م ن ه و ي History · Transliteration ء Diacritics · Hamza Numerals · Numeration V · T · E (//en.wikipedia.org/w/index.php?title=Template:Arabic_alphabet&action=edit) Contents 1 Consonants 1.1 Alphabetical order 1.2 Letter forms 1.2.1 Table of basic letters 1.2.2 Further notes -
Arabic in Romanization
Transliteration of Arabic 1/6 ARABIC Arabic script* DIN 31635 ISO 233 ISO/R 233 UN ALA-LC EI 1982(1.0) 1984(2.0) 1961(3.0) 1972(4.0) 1997(5.0) 1960(6.0) iso ini med !n Consonants! " 01 # $% &% ! " — (3.1)(3.2) — (4.1) — — 02 ' ( ) , * ! " #, $ (2.1) —, ’ (3.3) %, — (4.2) —, ’ (5.1) " 03 + , - . b b b b b b 04 / 0 1 2 t t t t t t 05 3 4 5 6 & & & th th th 06 7 8 9 : ' ' ' j j dj 07 ; < = > ( ( ( ) ( ( 08 ? @ * + + kh kh kh 09 A B d d d d d d 10 C D , , , dh dh dh 11 E F r r r r r r 12 G H I J z z z z z z 13 K L M N s s s s s s 14 O P Q R - - - sh sh sh 15 S T U V . / . 16 W X Y Z 0 0 0 d 1 0 0 17 [ \ ] ^ 2 2 2 3 2 2 18 _ ` a b 4 4 4 z 1 4 4 19 c d e f 5 5 5 6 6 5 20 g h i j 7 7 8 gh gh gh 21 k l m n f f f f f f 22 o p q r q q q q q 9 23 s t u v k k k k k k 24 w x y z l l l l l l 25 { | } ~ m m m m m m 26 • € • n n n n n n 27 h h h h h h 28 … " h, t (1.1) : ;, <(3.4) h, t (4.3) h, t (5.2) a, at (6.1) 29 w w w w w w 30 y y y y y y 31 ! = — y y ! • 32 s! l! la" l! l! l! l! 33 # al- (1.2) "#al (2.2) al- (3.5) al- (4.4) al- (5.3) al-, %l- (6.2) Thomas T. -
Writing Arabizi: Orthographic Variation in Romanized
WRITING ARABIZI: ORTHOGRAPHIC VARIATION IN ROMANIZED LEBANESE ARABIC ON TWITTER ! ! ! ! Natalie!Sullivan! ! ! ! TC!660H!! Plan!II!Honors!Program! The!University!of!Texas!at!Austin! ! ! ! ! May!4,!2017! ! ! ! ! ! ! ! _______________________________________________________! Barbara!Bullock,!Ph.D.! Department!of!French!&!Italian! Supervising!Professor! ! ! ! ! _______________________________________________________! John!Huehnergard,!Ph.D.! Department!of!Middle!Eastern!Studies! Second!Reader!! ii ABSTRACT Author: Natalie Sullivan Title: Writing Arabizi: Orthographic Variation in Romanized Lebanese Arabic on Twitter Supervising Professors: Dr. Barbara Bullock, Dr. John Huehnergard How does technology influence the script in which a language is written? Over the past few decades, a new form of writing has emerged across the Arab world. Known as Arabizi, it is a type of Romanized Arabic that uses Latin characters instead of Arabic script. It is mainly used by youth in technology-related contexts such as social media and texting, and has made many older Arabic speakers fear that more standard forms of Arabic may be in danger because of its use. Prior work on Arabizi suggests that although it is used frequently on social media, its orthography is not yet standardized (Palfreyman and Khalil, 2003; Abdel-Ghaffar et al., 2011). Therefore, this thesis aimed to examine orthographic variation in Romanized Lebanese Arabic, which has rarely been studied as a Romanized dialect. It was interested in how often Arabizi is used on Twitter in Lebanon and the extent of its orthographic variation. Using Twitter data collected from Beirut, tweets were analyzed to discover the most common orthographic variants in Arabizi for each Arabic letter, as well as the overall rate of Arabizi use. Results show that Arabizi was not used as frequently as hypothesized on Twitter, probably because of its low prestige and increased globalization. -
An Improved Arabic Keyboard Layout
Sci.Int.(Lahore),33(1),5-15,2021 ISSN 1013-5316; CODEN: SINTE 8 5 AN IMPROVED ARABIC KEYBOARD LAYOUT 1Amjad Qtaish, 2Jalawi Alshudukhi, 3Badiea Alshaibani, 4Yosef Saleh, 5Salam Bazrawi College of Computer Science and Engineering, University of Ha'il, Ha'il, Saudi Arabia. [email protected], [email protected], [email protected], [email protected], [email protected] ABSTRACT: One of the most important human–machine interaction (HMI) systems is the computer keyboard. The keyboard layout (KL) dictates how a person interacts with a physical keyboard through the way in which the letters, numbers, punctuation marks, and symbols are mapped and arranged on the keyboard. Mapping letters onto the keys of a keyboard is complex because many issues need to be taken into considerations, such as the nature of the language, finger fatigue, hand balance, typing speed, and distance traveled by fingers during typing and finger movements. There are two main kinds of KL: English and Arabic. Although numerous research studies have proposed different layouts for the English keyboard, there is a lack of research studies that focus on the Arabic KL. To address this lack, this study analyzed and clarified the limitations of the standard legacy Arabic KL. Then an efficient Arabic KL was proposed to overcome the limitations of the current KL. The frequency of Arabic letters and bi-gram probabilities were measured on a large Arabic corpus in order to assess the current KL and to design the improved Arabic KL. The improved Arabic KL was then evaluated and compared against the current KL in terms of letter frequency, finger-travel distance, hand and finger balance, bi-gram frequency, row distribution, and most frequent words. -
Arabizi – Help Or Harm?
ARABIZI – HELP OR HARM? ANALYSIS OF THE IMPACTS OF ARABIZI -THREAT OR BENEFIT TO THE WRITTEN ARABIC LANGUAGE? Thesis Submitted to The College of Arts and Sciences of the UNIVERSITY OF DAYTON In Partial Fulfillment of the Requirements for The Degree of Master of Arts in English By Abdurazag Ahmed Saide M.S. Dayton, Ohio December 2019 ARABIZI – HELP OR HARM? AN ANALYSIS OF THE IMPACTS OF ARABIZI - THREAT OR BENEFIT TO THE WRITTEN ARABIC LANGUAGE? Name: Saide, Abdurazag Ahmed APPROVED BY: ____________________________________ Bryan Bardine, Ph.D. Faculty Advisor. ____________________________________ Thomas Wendorf, Ph.D. Faculty Advisor ____________________________________ Kirsten Mendoza, Ph.D. Faculty Advisor ii © Copyright by Abdurazag Saide All rights reserved 2019 iii ABSTRACT ARABIZI – HELP OR HARM? AN ANALYSIS OF THE IMPACTS OF ARABIZI - THREAT OR BENEFIT TO THE WRITTEN ARABIC LANGUAGE? Name: Saide, Abdurazag Ahmed University of Dayton Advisor: Dr. Bryan Bardine In the 21st century, the Arab world has undergone significant change, which has had profound impacts in the lives of Arab people throughout many occupations, fields, and locations. New technological advancements, communication innovations, and globalization efforts have been the core drivers to open the Arab world to Western civilization. The Arab world is characterized by unique cultural traditions, moral principles, and religious beliefs that differ greatly from those of the West. However, the globalization movement has had multiple impacts on Arab culture and the Arabic language. When comparing the Arabic language with the languages of Latin origin, it becomes clear that there exists multiple distinctions, especially in their writings and language. These distinctions did not play such a serious role several centuries ago, but with the increased cooperation between Arab nations and the Western world, these language dissimilarities have become well-recognized. -
Romanization of Arabic 1 Romanization of Arabic
Romanization of Arabic 1 Romanization of Arabic Arabic alphabet ﺍ ﺏ ﺕ ﺙ ﺝ ﺡ ﺥ ﺩ ﺫ ﺭ ﺯ ﺱ ﺵ ﺹ ﺽ ﻁ ﻅ ﻉ ﻍ ﻑ ﻕ ﻙ ﻝ ﻡ ﻥ ﻩ ﻭ ﻱ • History • Transliteration • Diacritics (ء) Hamza • • Numerals • Numeration Different approaches and methods for the romanization of Arabic exist. They vary in the way that they address the inherent problems of rendering written and spoken Arabic in the Latin script. Examples of such problems are the symbols for Arabic phonemes that do not exist in English or other European languages; the means of representing the Arabic definite article, which is always spelled the same way in written Arabic but has numerous pronunciations in the spoken language depending on context; and the representation of short vowels (usually i u or e o, accounting for variations such as Muslim / Moslem or Mohammed / Muhammad / Mohamed ). Method Romanization is often termed "transliteration", but this is not technically correct. Transliteration is the direct representation of foreign letters using Latin symbols, while most systems for romanizing Arabic are actually transcription systems, which represent the sound of the language. As an example, the above rendering is a transcription, indicating the pronunciation; an ﺍﻟﻌﺮﺑﻴﺔ ﺍﻟﺤﺮﻭﻑ ﻣﻨﺎﻇﺮﺓ :munāẓarat al-ḥurūf al-ʻarabīyah of the Arabic example transliteration would be mnaẓrḧ alḥrwf alʻrbyḧ. Romanization standards and systems This list is sorted chronologically. Bold face indicates column headlines as they appear in the table below. • IPA: International Phonetic Alphabet (1886) • Deutsche Morgenländische Gesellschaft (1936): Adopted by the International Convention of Orientalist Scholars in Rome. It is the basis for the very influential Hans Wehr dictionary (ISBN 0-87950-003-4). -
The Challenges and Pitfalls of Arabic Romanization and Arabization
The Challenges and Pitfalls of Arabic Romanization and Arabization Jack Halpern (春遍雀來) The CJK Dictionary Institute, Inc. (日中韓辭典研究所) 34-14, 2-chome, Tohoku, Niiza-shi, Saitama 352-0001, Japan [email protected] their native script directly into Arabic, something Abstract probably never attempted. These systems are part of our ongoing efforts to develop Arabic re- The high level of ambiguity of the Ara- sources for automatic transcription, machine bic script poses special challenges to translation and named entity extraction. developers of NLP tools in areas such as morphological analysis, named entity The following typographic conventions are used extraction and machine translation. in this paper: These difficulties are exacerbated by the lack of comprehensive lexical resources, 1. Phonemic transcriptions are indicated by . (/qaabuus/ < ﻗــــﺎﺑﻮس) such as proper noun databases, and the slashes multiplicity of ambiguous transcription 2. Phonetic transcriptions are indicated by . ([qɑːbuːs] < ﻗــــﺎﺑﻮس ) schemes. This paper focuses on some of square brackets the linguistic issues encountered in two subdisciplines that play an increasingly 3. Graphemic transliterations are indicated by . (\qAbws\ < ﻗــــﺎﺑﻮس ) important role in Arabic information back slashes processing: the romanization of Arabic 4. Popular transcriptions are indicated by italics .(Qaboos < ﻗــــﺎﺑﻮس ) -names and the arabization of non Arabic names. The basic premise is that linguistic knowledge in the form of lin- 2 Motivation and Previous Work guistic rules is essential for achieving Arabic transcription technology is playing an high accuracy. increasingly important role in a variety of practical applications such as named entity 1 Introduction recognition, machine translation, cross-language information retrieval and various security The process of automatically transcribing Arabic applications such as anti-money laundering and to a Roman script representation, called romani- terrorist watch lists. -
Arabic Alphabet 1 Arabic Alphabet
Arabic alphabet 1 Arabic alphabet Arabic abjad Type Abjad Languages Arabic Time period 400 to the present Parent systems Proto-Sinaitic • Phoenician • Aramaic • Syriac • Nabataean • Arabic abjad Child systems N'Ko alphabet ISO 15924 Arab, 160 Direction Right-to-left Unicode alias Arabic Unicode range [1] U+0600 to U+06FF [2] U+0750 to U+077F [3] U+08A0 to U+08FF [4] U+FB50 to U+FDFF [5] U+FE70 to U+FEFF [6] U+1EE00 to U+1EEFF the Arabic alphabet of the Arabic script ﻍ ﻉ ﻅ ﻁ ﺽ ﺹ ﺵ ﺱ ﺯ ﺭ ﺫ ﺩ ﺥ ﺡ ﺝ ﺙ ﺕ ﺏ ﺍ ﻱ ﻭ ﻩ ﻥ ﻡ ﻝ ﻙ ﻕ ﻑ • history • diacritics • hamza • numerals • numeration abjadiyyah ‘arabiyyah) or Arabic abjad is the Arabic script as it is’ ﺃَﺑْﺠَﺪِﻳَّﺔ ﻋَﺮَﺑِﻴَّﺔ :The Arabic alphabet (Arabic codified for writing the Arabic language. It is written from right to left, in a cursive style, and includes 28 letters. Because letters usually[7] stand for consonants, it is classified as an abjad. Arabic alphabet 2 Consonants The basic Arabic alphabet contains 28 letters. Adaptations of the Arabic script for other languages added and removed some letters, such as Persian, Ottoman, Sindhi, Urdu, Malay, Pashto, and Arabi Malayalam have additional letters, shown below. There are no distinct upper and lower case letter forms. Many letters look similar but are distinguished from one another by dots (’i‘jām) above or below their central part, called rasm. These dots are an integral part of a letter, since they distinguish between letters that represent different sounds. -
The Unicode Standard, Version 3.0, Issued by the Unicode Consor- Tium and Published by Addison-Wesley
The Unicode Standard Version 3.0 The Unicode Consortium ADDISON–WESLEY An Imprint of Addison Wesley Longman, Inc. Reading, Massachusetts · Harlow, England · Menlo Park, California Berkeley, California · Don Mills, Ontario · Sydney Bonn · Amsterdam · Tokyo · Mexico City Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and Addison-Wesley was aware of a trademark claim, the designations have been printed in initial capital letters. However, not all words in initial capital letters are trademark designations. The authors and publisher have taken care in preparation of this book, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode®, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. If these files have been purchased on computer-readable media, the sole remedy for any claim will be exchange of defective media within ninety days of receipt. Dai Kan-Wa Jiten used as the source of reference Kanji codes was written by Tetsuji Morohashi and published by Taishukan Shoten. ISBN 0-201-61633-5 Copyright © 1991-2000 by Unicode, Inc. All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording or other- wise, without the prior written permission of the publisher or Unicode, Inc. -
Language Choice and Romanization Online by Lebanese Arabic Speakers
Treball de fi de màster Màster: Edició: Directors: Any de defensa: Col⋅lecció: Treballs de fi de màster Programa oficial de postgrau "Comunicació lingüística i mediació multilingüe" Departament de Traducció i Ciències del Llenguatge 1 TABLE OF CONTENTS 1. Introduction…………………………………………………………………..…4 2. Theoretical background………………………………………………………....6 2.1.Languages in Lebanon…………………………………………………...….6 2.2.Language choice online………………………………………………..……8 2.2.1. English…………………………………………………………..…..8 2.2.2. MSA…………………………………………………………..…….9 2.3.Romanization……………..………………………………………………..10 2.3.1. Romanization as an orthography…………………………………..10 2.3.2. Reasons for Romanization…………………………………………11 3. Informants……………………………………………………...………………14 4. Method…………………………………………………………………...…….15 5. Limitations……………………………………………………..........................17 6. Findings…………………………………………………………………..…....18 6.1.Language choice……………………………………………………..…….18 6.1.1. Emails and formal inquiries……………………………………..….18 6.1.2. Initiating contributions…………………………………………..….18 6.1.3. Responding contributions……………………………………….….19 6.1.4. Whatsapp……………………………………………………….…...20 6.2.Reasons for Romanization…………………………………………….…..22 6.3.The practice of Romanization……………………………………………..22 6.3.1. Phonology 6.3.1.1.Consonants………………………………………………….,...22 6.3.1.2. Vowels…………………………………………………….….23 6.3.2. Morphophonology………………………………………………....24 2 6.3.3. Morphology……………………………………………………….25 7. Conclusion…………………………………………………………………….26 8. References…………………………………………………………………….28 3 ABSTRACT This paper investigates language