Modern Standard Arabic
Recommended publications
-
Automatic Identification of Arabic Language Varieties and Dialects in Social Media
Automatic Identification of Arabic Language Varieties and Dialects in Social Media. Fatiha Sadat (University of Quebec in Montreal, 201 President Kennedy, Montreal, QC, Canada), Farnazeh Kazemi (NLP Technologies Inc., 52 Le Royer Street W., Montreal, QC, Canada), Atefeh Farzindar (NLP Technologies Inc., 52 Le Royer Street W., Montreal, QC, Canada). Abstract: Modern Standard Arabic (MSA) is the formal language in most Arabic countries. Arabic Dialects (AD) or daily language differs from MSA especially in social media communication. However, most Arabic social media texts have mixed forms and many variations especially between MSA and AD. This paper aims to bridge the gap between MSA and AD by providing a framework for AD classification using probabilistic models across social media datasets. We present a set of experiments using the character n-gram Markov language model and Naive Bayes classifiers with detailed examination of what models perform best under different conditions in social media context. Experimental results show that Naive Bayes classifier based on character bi-gram model can identify the 18 different Arabic dialects with a considerable overall accuracy of 98%. 1 Introduction: Arabic is a morphologically rich and complex language, which presents significant challenges for natural language processing and its applications. It is the official language in 22 countries spoken by more than 350 million people around the world. Moreover, the Arabic language exists in a state of diglossia where the standard form of the language, Modern Standard Arabic (MSA) and the regional dialects (AD) live side-by-side and are closely related (Elfardy and Diab, 2013). -
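The setup described in the abstract above (character n-grams fed to a Naive Bayes classifier) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the class name `CharNgramNaiveBayes`, the add-one smoothing, the space padding at the edges, and the six toy training phrases with their three labels are all assumptions made for this example. The paper's datasets and its 18 dialect labels are not reproduced here.

```python
import math
from collections import Counter, defaultdict


def char_ngrams(text, n=2):
    """Return overlapping character n-grams, padding the text with one space per edge."""
    padded = f" {text} "
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


class CharNgramNaiveBayes:
    """Multinomial Naive Bayes over character n-grams (hypothetical helper class)."""

    def __init__(self, n=2):
        self.n = n
        self.ngram_counts = defaultdict(Counter)  # label -> n-gram frequencies
        self.doc_counts = Counter()               # label -> number of training texts
        self.vocab = set()                        # every n-gram seen in training

    def fit(self, texts, labels):
        for text, label in zip(texts, labels):
            grams = char_ngrams(text, self.n)
            self.ngram_counts[label].update(grams)
            self.doc_counts[label] += 1
            self.vocab.update(grams)
        return self

    def log_scores(self, text):
        """Log prior plus summed log likelihoods, with add-one smoothing (an assumption)."""
        total_docs = sum(self.doc_counts.values())
        vocab_size = len(self.vocab)
        scores = {}
        for label, counts in self.ngram_counts.items():
            total = sum(counts.values())
            score = math.log(self.doc_counts[label] / total_docs)
            for gram in char_ngrams(text, self.n):
                score += math.log((counts[gram] + 1) / (total + vocab_size))
            scores[label] = score
        return scores

    def predict(self, text):
        scores = self.log_scores(text)
        return max(scores, key=scores.get)


# Toy training data, chosen only for illustration (not from the paper).
train_texts = [
    "ماذا تريد", "كيف حالك اليوم",   # Modern Standard Arabic
    "عايز ايه", "ازيك عامل ايه",     # Egyptian
    "شو بدك", "كيفك شو اخبارك",      # Levantine
]
train_labels = ["MSA", "MSA", "EGY", "EGY", "LEV", "LEV"]

model = CharNgramNaiveBayes(n=2).fit(train_texts, train_labels)
print(model.predict("عايز ايه"))
```

With this little data the sketch can only separate phrases that share bigrams with its training examples. Working in log space avoids numeric underflow on longer texts, and the smoothing keeps an unseen bigram from zeroing out a class.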
Arabs at the Crossroads: Political Identity and Nationalism
Book Review of Hilal Khashan's Arabs at the Crossroads: Political Identity and Nationalism (Gainesville: University Press of Florida, 2000), The Arab World Geographer/Le Géographe du monde arabe 3(2):141-147, 2000. In the book Arabs at the Crossroads, Hilal Khashan, an associate professor of political science at the American University of Beirut, provides a vivid description of Arab political performance since the decline of the Ottoman Empire in the nineteenth century and the subsequent formal abrogation of the Islamic Caliphate in 1924 amidst rising European colonialism and Zionism. The book is small (149 pages of text) but concise and well written. Despite raising some very controversial issues, Khashan has combined the coolness of scholarship with intellectual rigor and political concern. His dispassionate perspective is refreshing and makes the book unapologetically independent and provocative. The book consists of nine chapters mostly focused on Arab political experience during the twentieth century. Chapter 1 examines the roots of the identity crisis in the Arab world, especially the ways in which “nineteenth-century reformers disturbed the Arab mind by sowing distrust in the Ottoman empire, without securing a tenable ideological alternative to that religious state and to Islam which it embodied” (page 1). Chapter 2 discusses the birth and the universalization of the European nation-state model of secular nationalism and its many incomplete versions in non-Western cultures, particularly in Muslim countries where “the clash between ethnocentrism and religion seems to resolve itself to the detriment of the former, without the latter emerging as a clear victor” (page 24). While this is a brilliant description of how modern Arab identity seems to swing constantly from Arab nationalism to Islamic identity and back, it stops short of an in-depth theoretical analysis of what constitutes the essence of identity in the Arab world. -
Arabic Language and Linguistics Certificate Revised: 05/2018
Arabic Language and Linguistics Certificate. www.Linguistics.Pitt.edu Revised: 05/2018
Overview: This certificate offers undergraduates a way to become highly proficient in the Arabic language and linguistic structure and to develop an understanding of important issues in Arabic life and culture. The certificate will conveniently accompany any undergraduate major in the Dietrich School and in other schools at the University of Pittsburgh. The program draws on the academic strengths and resources of the Department of Linguistics and on faculty from the School of Education. Enrollment in this program is limited to 20 students per academic year. Therefore, students must complete an evaluation process and be accepted into the program before declaring this certificate. For information, please contact Amani Attia, the Arabic Coordinator.
Requirements: This certificate requires completion of 22-23 credits, detailed as follows. Credit requirements do not include prerequisite courses.
Prerequisite courses: Choose one dialect pair. ARABIC 0101 MSA Egyptian 1 and ARABIC 0102 MSA Egyptian 2; ARABIC 0121 MSA Levantine 1 and ARABIC 0122 MSA Levantine 2.
Category 1: Language instruction. Choose the same dialect pair as the prerequisite language courses.
Category 3: Culture of the Arabic-Speaking World. Choose one of the following courses. ARABIC 1615 Arabic Life and Thought; ARABIC 1635 Introduction to Modern Arabic Literature.
Category 4: Electives. Choose one of the following courses. Dialect courses must follow previous coursework. ARABIC 0105 MSA Egyptian 5; ARABIC 0106 MSA Egyptian 6; ARABIC 0125 MSA Levantine 5; ARABIC 0126 MSA Levantine 6; ARABIC 0211 Iraqi Arabic 1; ARABIC 1615 Arabic Life and Thought. -
Kiraz 2019 A Functional Approach to Garshunography
Intellectual History of the Islamicate World 7 (2019) 264–277, brill.com/ihiw. A Functional Approach to Garshunography: A Case Study of Syro-X and X-Syriac Writing Systems. George A. Kiraz, Institute for Advanced Study, Princeton and Beth Mardutho: The Syriac Institute, Piscataway. Abstract: It is argued here that functionalism lies at the heart of garshunographic writing systems (where one language is written in a script that is sociolinguistically associated with another language). Giving historical accounts of such systems that began as early as the eighth century, it will be demonstrated that garshunographic systems grew organically because of necessity and that they offered a certain degree of simplicity rather than complexity. While the paper discusses mostly Syriac-based systems, its arguments can probably be expanded to other garshunographic systems. Keywords: Garshuni – garshunography – allography – writing systems. It has long been suggested that cultural identity may have been the cause for the emergence of Garshuni systems. (In the strictest sense of the term, ‘Garshuni’ refers to Arabic texts written in the Syriac script but the term’s semantics were drastically extended to other systems, sometimes ones that have little to do with Syriac; for which see below.) This paper argues for an alternative origin, one that is rooted in functional theory. At its most fundamental level, Garshuni, as a system, is nothing but a tool and as such it ought to be understood with respect to the function it performs. To achieve this, one must take into consideration the social contexts (plural, as there are many) under which each Garshuni system appeared. -
ARAB - Arabic (ARAB)
ARAB 101 Beginning Arabic I. Credits 4. 4 Lecture Hours. (ARAB 1411) Beginning Arabic I. Introduction to Modern Standard Arabic in its written and spoken forms; emphasis on conversation, rudimentary vocabulary, simple grammar, and reading.
ARAB 102 Beginning Arabic II. Credits 4. 4 Lecture Hours. (ARAB 1412) Beginning Arabic II. Introduction of more complex grammatical constructions; vocabulary building; emphasis on putting acquired vocabulary and grammar to conversational use. Prerequisite: ARAB 101 or equivalent.
ARAB 104 Intensive Beginning Arabic. Credits 8. 8 Lecture Hours. Accelerated elementary language study, with oral, listening, reading and writing practice. Equivalent to ARAB 101 and ARAB 102.
ARAB 201 Intermediate Arabic I. Credits 3. 3 Lecture Hours. (ARAB 2311) Intermediate Arabic I.
ARAB 301 Reading and Composition. Credits 3. 3 Lecture Hours. Advanced Arabic grammar and readings of average difficulty and of different genres, including literary and journalistic texts and other culturally-enriched materials in order to develop awareness of cultural products, perspectives, and practices found in the Arab world. Prerequisites: ARAB 202 or ARAB 204, or equivalent; junior or senior classification or approval of instructor.
ARAB 302 Reading and Composition II. Credits 3. 3 Lecture Hours. Readings of average difficulty and of different genres, including literary and journalistic texts and other culturally-enriched materials; development of writing skills with emphasis on grammatical constructions; expansion of vocabulary and oral expression. Prerequisites: ARAB 301; junior or senior classification or approval of instructor.
ARAB 321 Business Arabic. Credits 3. 3 Lecture Hours. Business and financial terminologies useful in the Arab World; cultural etiquette for effective communication in Arabic business settings; -
Saudi Dialects: Are They Endangered?
Academic Research Publishing Group. English Literature and Language Review, ISSN(e): 2412-1703, ISSN(p): 2413-8827, Vol. 2, No. 12, pp: 131-141, 2016. URL: http://arpgweb.com/?ic=journal&journal=9&info=aims Saudi Dialects: Are They Endangered? Salih Alzahrani, Taif University, Saudi Arabia. Abstract: Krauss, among others, claims that languages will face death in the coming centuries (Krauss, 1992). Austin (2010a) lists 7,000 languages as existing and spoken in the world today. Krauss estimates that this figure could come down to 600. That is, most of the world's languages are endangered. An endangered language, therefore, is a language that loses its speakers within a few generations. According to Dorian (1981), there is what is called a “tip” in language endangerment: a language's decline can start slowly but then suddenly accelerate towards extinction. Thus, languages must be protected at a much earlier stage. Arabic dialects such as Zahrani Spoken Arabic (ZSA) and Faifi Spoken Arabic (henceforth, FSA), which are spoken in the southern region of Saudi Arabia, have not yet been studied. Few people speak these dialects, among many other dialects in the same region. However, the problem is that most of these dialects' native speakers are moving to other regions in Saudi Arabia where they use other dialects. Therefore, are these dialects endangered? What other factors may cause their endangerment? Have they been documented before? What shall we do? This paper discusses three main points regarding this issue: language and endangerment, language documentation and description, and the Arabic language and its family, giving a brief history of Saudi dialects and comparing their situation with the existing dialects as a whole. -
A Comprehensive NLP System for Modern Standard Arabic and Modern Hebrew
A Comprehensive NLP System for Modern Standard Arabic and Modern Hebrew: Morphological analysis, lemmatization, vocalization, disambiguation and text-to-speech. Dror Kamir, Naama Soreq, Yoni Neeman (Melingo Ltd., 16 Totseret Haaretz st., Tel-Aviv, Israel).
Abstract: This paper presents a comprehensive NLP system by Melingo that has been recently developed for Arabic, based on Morfix™ – an operational formerly developed highly successful comprehensive Hebrew NLP system. The system discussed includes modules for morphological analysis, context sensitive lemmatization, vocalization, text-to-phoneme conversion, and syntactic-analysis-based prosody (intonation) model. It is employed in applications such as full text search, information retrieval, text categorization, textual data mining, online contextual dictionaries, filtering, and text-to-speech applications in the fields of telephony and accessibility and could serve as a handy accessory for non-fluent
1 Introduction. 1.1 The common Semitic basis from an NLP standpoint. Modern Standard Arabic (MSA) and Modern Hebrew (MH) share the basic Semitic traits: rich morphology, based on consonantal roots (Jiðr / Šoreš), which depends on vowel changes and in some cases consonantal insertions and deletions to create inflections and derivations. For example, in MSA: the consonantal root /ktb/ combined with the vocalic pattern CaCaCa derives the verb kataba ‘to write’. This derivation is further inflected into forms that indicate semantic features, such as number, gender, tense etc.: katab-tu ‘I wrote’, katab-ta ‘you (sing. masc.) wrote’, katab-ti ‘you (sing. fem.) wrote’, ?a-ktubu ‘I write/will write’, etc. -
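The root-and-pattern derivation in the excerpt above (root /ktb/ plus pattern CaCaCa giving kataba) can be illustrated with a small sketch. It is not how Melingo's system works internally. The function names, the stem patterns `CaCaC` and `CCuCu`, and the feature labels are assumptions introduced here, chosen only so that the output matches the transliterated forms quoted in the excerpt.

```python
def apply_pattern(root, pattern):
    """Fill each 'C' slot of the pattern with the next consonant of the root."""
    if pattern.count("C") != len(root):
        raise ValueError("pattern needs one 'C' slot per root consonant")
    consonants = iter(root)
    return "".join(next(consonants) if symbol == "C" else symbol
                   for symbol in pattern)


# Suffixes for the perfect forms quoted in the excerpt (labels are hypothetical).
PERFECT_SUFFIXES = {
    "1sg": "tu",       # katab-tu  'I wrote'
    "2sg.masc": "ta",  # katab-ta  'you (sing. masc.) wrote'
    "2sg.fem": "ti",   # katab-ti  'you (sing. fem.) wrote'
}


def inflect_perfect(root, stem_pattern="CaCaC"):
    """Build hyphen-segmented perfect forms from a root (assumed stem pattern)."""
    stem = apply_pattern(root, stem_pattern)
    return {feature: f"{stem}-{suffix}"
            for feature, suffix in PERFECT_SUFFIXES.items()}


def inflect_imperfect_1sg(root, stem_pattern="CCuCu", prefix="?a"):
    """Build the first-person singular imperfect, e.g. ?a-ktubu (assumed segmentation)."""
    return f"{prefix}-{apply_pattern(root, stem_pattern)}"


print(apply_pattern("ktb", "CaCaCa"))
print(inflect_perfect("ktb"))
print(inflect_imperfect_1sg("ktb"))
```

Keeping the root and the pattern as separate strings mirrors the description in the text: the consonants carry the lexical core while the pattern supplies the vowels. A real analyzer would also need to handle weak roots and consonantal insertions, which this sketch ignores.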
Classic Poetry of Arab and Persian
European Journal of Scientific Research, ISSN 1450-216X / 1450-202X, Vol. 139 No 3, May 2016, pp. 257-262. http://www.europeanjournalofscientificresearch.com Comparative Study on Bahariye in Neo-Classic Poetry of Arab and Persian. Mohammad Shaygan Mehr, Ali Asghar Mansouri, Nabialehrajani, Hassan Ghamari (all: Department of Arabic Language and Literature, Kashmar Branch, Islamic Azad University, Kashmar, Iran). Abstract: Because the subject has not been studied in previous work, this study tries to provide an ordered collection of the scattered material and so address that gap. The aim of the paper is to review and correct the lexical definitions of Bahariyeh in both Arabic and Persian, and to study the similarities and differences of Bahariyeh in Persian and Arabic neo-classical poetry. Bahariyeh is one of the common themes in Persian literature. In Arabic literature, too, poems on the theme of spring have been composed as Robyyat. In the contemporary period of both literatures, because of the poets' familiarity with European literature on the one hand and with social issues, philanthropy and patriotism on the other, the themes and contents of Bahariyeh differ significantly from those of previous periods. In this study the similarities and differences of Bahariye in these two languages will be examined in terms of structure and content. -
Common Misconceptions and Stereotypes About the Middle East
Common Misconceptions and Stereotypes about the Middle East
Misconception: The Middle East is a clearly defined place. Reality: The Middle East does not have clear-cut boundaries. There are cross-cultural connections that stretch from North Africa through Western Asia into Central Asia. The most basic map of the Middle East includes Bahrain, Cyprus, Egypt, Iran, Iraq, Israel, Jordan, Kuwait, Lebanon, Oman, the Palestinian Territories, Qatar, Saudi Arabia, Syria, Turkey, the United Arab Emirates, and Yemen.
Misconception: People in the Middle East live as nomads. Reality: Very few people in the Middle East live as nomads. The Middle East is quite urbanized and has some of the oldest cities in the world. 60% of the region’s population lives in major cities such as Damascus, Istanbul and Cairo. Ancient Middle Eastern kingdoms such as Sumer, Babylon and Egypt were all cradles for western civilization.
Misconception: There is nothing but desert and oil in the Middle East. Reality: There is more than desert and oil in the Middle East. The geography of the Middle East is diverse and includes everything from fertile river deltas and forests, to mountain ranges and arid plateaus. Some countries in the Middle East are oil rich, while others have little or no oil reserves.
Misconception: The Middle East and the Islamic World are the same. Reality: The Middle East and the Islamic World are not the same. While Islam first developed in the Arabian Peninsula and Arabic is its liturgical language, the majority of the world’s Muslims are not Arab and live outside the Middle East.
Further misconceptions listed: Everyone in the Middle East speaks Arabic. Everyone in the Middle East is Muslim. All Arabs are Muslim. Violence in the Middle East is inevitable. Everyone in the Middle East hates the -
Classical and Modern Standard Arabic Marijn Van Putten University of Leiden
Chapter 3: Classical and Modern Standard Arabic. Marijn van Putten, University of Leiden. The highly archaic Classical Arabic language and its modern iteration Modern Standard Arabic must to a large extent be seen as highly artificial archaizing registers that are the High variety of a diglossic situation. The contact phenomena found in Classical Arabic and Modern Standard Arabic are therefore often the result of imposition. Cases of borrowing are significantly rarer, and mainly found in the lexical sphere of the language. 1 Current state and historical development. Classical Arabic (CA) is the highly archaic variety of Arabic that, after its codification by the Arab Grammarians around the beginning of the ninth century, becomes the most dominant written register of Arabic. While forms of Middle Arabic, a style somewhat intermediate between CA and spoken dialects, gain some traction in the Middle Ages, CA remains the most important written register for official, religious and scientific purposes. From the moment of CA’s rise to dominance as a written language, the whole of the Arabic-speaking world can be thought of as having transitioned into a state of diglossia (Ferguson 1959; 1996), where CA takes up the High register and the spoken dialects the Low register. Representation in writing of these spoken dialects is (almost) completely absent in the written record for much of the Middle Ages. Eventually, CA came to be largely replaced for administrative purposes by Ottoman Turkish, and at the beginning of the nineteenth century, it was functionally limited to religious domains (Glaß 2011: 836). -
“The Sorrows of Egypt,” Revisited
A Hoover Institution essay on a US strategic vision in a changing world. Islamism and the International Order. “The Sorrows of Egypt,” Revisited. Samuel Tadros. The sorrow of Egypt is made of entirely different material: the steady decline of its public life, the inability of an autocratic regime and of the middle class from which this regime issues to rid the country of its dependence on foreign handouts, to transmit to the vast underclass the skills needed for the economic competition of nations; to take the country beyond its endless alternations between glory and self-pity. (Fouad Ajami, “The Sorrows of Egypt”) In his authoritative 1995 essay “The Sorrows of Egypt,” Fouad Ajami, with the knowledge and experience of someone who had known Egypt intimately, and the spirit and pen of a poet who had come to love the place, attempted to delve deeply into what ailed the ancient land. The essay moved masterfully from the political to the social and from the religious to the economic, weaving an exquisite tapestry of a land of sorrows. This was not the first time that Ajami had approached Egypt. The country his generation had grown up knowing was the Egypt of promise and excitement, where Gamal Abdel Nasser’s towering presence and deep voice had captivated millions of Arabic speakers. Ajami had been one of those young men. He had made the pilgrimage to Damascus, watching and cheering as Nasser made his triumphant entry into the city in 1958, crowned as the idol of the Arabs by adoring masses. -
Arabic and Contact-Induced Change Christopher Lucas, Stefano Manfredi
Arabic and Contact-Induced Change. Christopher Lucas, Stefano Manfredi. To cite this version: Christopher Lucas, Stefano Manfredi. Arabic and Contact-Induced Change. 2020. halshs-03094950. HAL Id: halshs-03094950, https://halshs.archives-ouvertes.fr/halshs-03094950. Submitted on 15 Jan 2021. Arabic and contact-induced change, edited by Christopher Lucas and Stefano Manfredi. Contact and Multilingualism 1. Language Science Press. Contact and Multilingualism. Editors: Isabelle Léglise (CNRS SeDyL), Stefano Manfredi (CNRS SeDyL). In this series: 1. Lucas, Christopher & Stefano Manfredi (eds.). Arabic and contact-induced change. Lucas, Christopher & Stefano Manfredi (eds.). 2020. Arabic and contact-induced change (Contact and Multilingualism 1). Berlin: Language Science Press. This title can be downloaded at: http://langsci-press.org/catalog/book/235 © 2020, the authors. Published under the Creative Commons Attribution