Morphological Analysis and Disambiguation for Gulf Arabic: the Interplay Between Resources and Methods

Total Page:16

File Type:pdf, Size:1020Kb

Morphological Analysis and Disambiguation for Gulf Arabic: the Interplay Between Resources and Methods Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), pages 3895–3904 Marseille, 11–16 May 2020 c European Language Resources Association (ELRA), licensed under CC-BY-NC Morphological Analysis and Disambiguation for Gulf Arabic: The Interplay between Resources and Methods Salam Khalifa, Nasser Zalmout, Nizar Habash Computational Approaches to Modeling Language (CAMeL) Lab New York University Abu Dhabi {salamkhalifa, nasser.zalmout, nizar.habash}@nyu.edu Abstract In this paper we present the first full morphological analysis and disambiguation system for Gulf Arabic. We use an existing state-of-the- art morphological disambiguation system to investigate the effects of different data sizes and different combinations of morphological analyzers for Modern Standard Arabic, Egyptian Arabic, and Gulf Arabic. We find that in very low settings, morphological analyzers help boost the performance of the full morphological disambiguation task. However, as the size of resources increase, the value of the morphological analyzers decreases. Keywords: Gulf Arabic, Morphology, Disambiguation 1. Introduction ing the most probable morphological analysis in context for a given word. This task is achieved by either ranking Despite the many advances in the field in Natural Language the output of a morphological analyzer or through an end- Processing (NLP) for Arabic, many dialectal Arabic vari- to-end system that generates a single answer. eties are lagging behind. Modern Standard Arabic (MSA), In this paper, we focus on Gulf Arabic (GLF), a morpho- the official language of the Arab world, is well studied in logically rich and resource poor Arabic dialect. We aim to NLP and has an abundance of resources including corpora benchmark full morphological analysis and disambiguation and tools. On the other hand, most Arabic dialects are for GLF using state-of-the-art approaches for the first time. considered under-resourced, with the exception of Egyp- As part of this work, we investigate the relationship tian Arabic (EGY). between the size of the training data and the different One of the main challenges for Arabic NLP is data spar- available analyzers with respect to the disambiguation sity due to morphological richness and the lack of orthog- model. raphy standards. To address such challenges, recent NLP models use deep learning architectures that have access to The rest of the paper is structured as follows. We present character and subword-level information. Such models are related work in Section 2, and briefly discuss the linguis- well equipped to model some aspects of morphology im- tic background of GLF and its challenges in Section 3. We plicitly as part of an end-to-end system without requiring present the morphological analyzer creation process in Sec- explicit feature engineering. However, these models are tion 4. In Sections 5 and 6, we present our experimental very data-intensive, and do not scale down well in the case setup and evaluation, respectively. We conclude and out- of low-resource languages. Moreover, some morphological line future work in Section 7. behaviors can be complicated and irregular at times. This makes morphology more challenging to capture implicitly 2. Related Work while modeling other tasks. This further highlights the im- portance of explicit morphological modeling for languages In the past two decades, there have been a lot of efforts on that are morphologically-rich and low-resource in particu- morphological modeling for Arabic, as it proved to be use- lar. ful in a number downstream NLP tasks (Sadat and Habash, Morphological analyzers are dictionary-like resources 2006; El Kholy and Habash, 2012; Halabi, 2016; Baly et that provide all the potential analyses for a given word al., 2017). In this section we review early efforts on mor- out-of-context. Ideal morphological analyzers are expected phological modeling of MSA and dialectal Arabic. We then to return all possible analyses of a given word (modeling present the latest neural-based contributions with special ambiguity), along with covering all the different inflected interest in Arabic. forms of a word lemma (modeling richness). The quality of Modern Standard Arabic Morphological Modeling the morphological analyzers varies drastically based on the One of the earliest morphological tagging systems for Ara- method used to create them. Higher quality analyzers are bic was presented by Khoja (2001); it was based on a cor- carefully built using existing linguistic dictionaries, and are pus of 50,000 words. Later, the LDC released the Penn manually checked. On the other end of the spectrum, mor- Arabic Treebank (PATB) (Maamouri et al., 2004), which phological analyzers created automatically (e.g., extracted was substantially larger, and supported many further efforts from annotated text or seed dictionaries) can have a lower on Arabic morphological modeling. The PATB relied on quality depending on the quantity and quality of data used the existence of the Buckwalter Morphological Analyzer or the methods employed in creating them. (BAMA) (Buckwalter, 2004) and its later version the Stan- Morphological disambiguation is the process of provid- dard Arabic Morphological Analyzer (SAMA) (Maamouri 3895 et al., 2010). Among such efforts, are MADAMIRA (Pasha So far, the mentioned efforts regarding disambiguation suf- et al., 2014) and it predecessors MADA (Habash and Ram- fer from two main issues. First, they require explicit fea- bow, 2005; Roth et al., 2008; Habash et al., 2009; Habash et ture engineering which can lead to over fitting and not be- al., 2013) and AMIRA (Diab et al., 2004; Diab et al., 2007). ing able to generalize to new dialects. Second, those sys- MADAMIRA uses a morphological analyzer and SVMs- tems rely heavily on pre-existing morphological analyzers based taggers for different features, along with n-gram lan- to generate the final answers. This reliance limits the sys- guage models for lemmatization and diacritization. More tem’s performance to the quality of the analyzer, especially recent efforts include Farasa (Abdelali et al., 2016; Dar- when generating open class features such as lemmas. In this wish and Mubarak, 2016) and YAMAMA (Khalifa et al., work we use state-of-the-art neural architectures that have 2016b), which are out-of-context taggers/segmenters that the ability to model morphologically rich and complex lan- use different techniques to achieve reasonable performance guages such as Arabic and its different verities. and fast running time. Neural-based contributions for Arabic Morphology Adaptation of MSA Tools and Resources for Dialectal Neural-based contributions for Arabic NLP are relatively Arabic A number of approaches attempt to exploit lin- scarce and specific to individual tasks rather than full guistic similarity between MSA and Arabic dialects to build morphological disambiguation. Among the contributions dialect tools using existing MSA resources (Duh and Kirch- that utilize morphological structures to enhance the hoff, 2005; Habash and Rambow, 2006; Zribi et al., 2013; neural models in different NLP tasks, we note Guzmán Salloum and Habash, 2014; Hamdi et al., 2015; Albogamy et al. (2016) for machine translation, and Abandah et and Ramsay, 2015; Eskander et al., 2016). MAGEAD is al. (2015) for diacritization. Shen et al. (2016) applied a morphological analyzer that models Arabic dialects to- their Bi-LSTM morphological disambiguation model gether with MSA using a common multi-tape finite-state- on MSA, but did not present any improvements over machine framework (Habash and Rambow, 2006). Zribi the state-of-the-art. Heigold et al. (2016) developed et al. (2013) adapt an MSA analyzer, Al-Khalil (Boudlal character-based neural models for morphological tagging et al., 2010), to Tunisian Arabic, where they modify the for 14 different languages, including Arabic, using the derivation patterns and add Tunisian-specific roots and pat- UD treebank. Samih et al. (2017a) used a Bi-LSTM-CRF terns. Eskander et al. (2016), on the other hand, presented architecture and pre-trained character embeddings for the a paradigm completion approach to generate morphological segmentation of EGY tweets. They then build up on this analyzers for low-resource dialects using morphologically approach using a similar architecture for segmentation in annotated corpora. They then make use of available MSA multiple dialects, through combining the training datasets and EGY analyzers as backoff. for the different dialects, and train a unified segmentation model. They report the results using both an SVM-Rank Dialect-Specific Contributions Al-Sabbagh and Girju and Bi-LSTM-CRF models (Samih et al., 2017b). Darwish (2012) presented a POS annotated data set and tagger for et al. (2017) use Bi-LSTM models to train a POS tagger, EGY. Habash et al. (2012) presented CALIMA, a mor- and compared it against SVM-based model. The SVM phological analyzer for EGY, which was built by extend- model in their system outperformed the neural model, ing the Egyptian Colloquial Arabic Lexicon (ECAL) (Ki- even with incorporating pre-trained embeddings. Other lany et al., 2002). The LDC has also released the EGY notable contributions include the work of Inoue et al. treebank (ARZATB) (Maamouri et al., 2012). The tree- (2017), who used multi-task learning to model fine-grained bank is morphologically annotated in a similar style to the POS tags, using the individual morphosyntactic features. PATB. The aforementioned MADAMIRA and YAMAMA More recently, Zalmout and Habash (2017) presented systems were extended to EGY using CALIMA (Habash the first neural based full morphological disambigua- et al., 2012) and ARZATB (Maamouri et al., 2012). Jarrar tion system for Arabic. Alharbi et al. (2018) also use et al. (2014) released the Curras Corpus of Palestinian Ara- a Bi-LSTM model for GLF POS tagging, with good results. bic, also annotated in the PATB morphology style. Darwish et al. (2018) used a CRF model for a multi-dialect POS In this work, we introduce the first morphological analy- tagging, using a small annotated Twitter corpus for several sis and disambiguation system for GLF.
Recommended publications
  • Different Dialects of Arabic Language
    e-ISSN : 2347 - 9671, p- ISSN : 2349 - 0187 EPRA International Journal of Economic and Business Review Vol - 3, Issue- 9, September 2015 Inno Space (SJIF) Impact Factor : 4.618(Morocco) ISI Impact Factor : 1.259 (Dubai, UAE) DIFFERENT DIALECTS OF ARABIC LANGUAGE ABSTRACT ifferent dialects of Arabic language have been an Dattraction of students of linguistics. Many studies have 1 Ali Akbar.P been done in this regard. Arabic language is one of the fastest growing languages in the world. It is the mother tongue of 420 million in people 1 Research scholar, across the world. And it is the official language of 23 countries spread Department of Arabic, over Asia and Africa. Arabic has gained the status of world languages Farook College, recognized by the UN. The economic significance of the region where Calicut, Kerala, Arabic is being spoken makes the language more acceptable in the India world political and economical arena. The geopolitical significance of the region and its language cannot be ignored by the economic super powers and political stakeholders. KEY WORDS: Arabic, Dialect, Moroccan, Egyptian, Gulf, Kabael, world economy, super powers INTRODUCTION DISCUSSION The importance of Arabic language has been Within the non-Gulf Arabic varieties, the largest multiplied with the emergence of globalization process in difference is between the non-Egyptian North African the nineties of the last century thank to the oil reservoirs dialects and the others. Moroccan Arabic in particular is in the region, because petrol plays an important role in nearly incomprehensible to Arabic speakers east of Algeria. propelling world economy and politics.
    [Show full text]
  • Arabic Sociolinguistics: Topics in Diglossia, Gender, Identity, And
    Arabic Sociolinguistics Arabic Sociolinguistics Reem Bassiouney Edinburgh University Press © Reem Bassiouney, 2009 Edinburgh University Press Ltd 22 George Square, Edinburgh Typeset in ll/13pt Ehrhardt by Servis Filmsetting Ltd, Stockport, Cheshire, and printed and bound in Great Britain by CPI Antony Rowe, Chippenham and East bourne A CIP record for this book is available from the British Library ISBN 978 0 7486 2373 0 (hardback) ISBN 978 0 7486 2374 7 (paperback) The right ofReem Bassiouney to be identified as author of this work has been asserted in accordance with the Copyright, Designs and Patents Act 1988. Contents Acknowledgements viii List of charts, maps and tables x List of abbreviations xii Conventions used in this book xiv Introduction 1 1. Diglossia and dialect groups in the Arab world 9 1.1 Diglossia 10 1.1.1 Anoverviewofthestudyofdiglossia 10 1.1.2 Theories that explain diglossia in terms oflevels 14 1.1.3 The idea ofEducated Spoken Arabic 16 1.2 Dialects/varieties in the Arab world 18 1.2. 1 The concept ofprestige as different from that ofstandard 18 1.2.2 Groups ofdialects in the Arab world 19 1.3 Conclusion 26 2. Code-switching 28 2.1 Introduction 29 2.2 Problem of terminology: code-switching and code-mixing 30 2.3 Code-switching and diglossia 31 2.4 The study of constraints on code-switching in relation to the Arab world 31 2.4. 1 Structural constraints on classic code-switching 31 2.4.2 Structural constraints on diglossic switching 42 2.5 Motivations for code-switching 59 2.
    [Show full text]
  • Arabic and Contact-Induced Change Christopher Lucas, Stefano Manfredi
    Arabic and Contact-Induced Change Christopher Lucas, Stefano Manfredi To cite this version: Christopher Lucas, Stefano Manfredi. Arabic and Contact-Induced Change. 2020. halshs-03094950 HAL Id: halshs-03094950 https://halshs.archives-ouvertes.fr/halshs-03094950 Submitted on 15 Jan 2021 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Arabic and contact-induced change Edited by Christopher Lucas Stefano Manfredi language Contact and Multilingualism 1 science press Contact and Multilingualism Editors: Isabelle Léglise (CNRS SeDyL), Stefano Manfredi (CNRS SeDyL) In this series: 1. Lucas, Christopher & Stefano Manfredi (eds.). Arabic and contact-induced change. Arabic and contact-induced change Edited by Christopher Lucas Stefano Manfredi language science press Lucas, Christopher & Stefano Manfredi (eds.). 2020. Arabic and contact-induced change (Contact and Multilingualism 1). Berlin: Language Science Press. This title can be downloaded at: http://langsci-press.org/catalog/book/235 © 2020, the authors Published under the Creative Commons Attribution
    [Show full text]
  • On the Syntax of Sentential Negation in Yemeni Arabic
    International Journal of English Linguistics; Vol. 10, No. 2; 2020 ISSN 1923-869X E-ISSN 1923-8703 Published by Canadian Center of Science and Education On the Syntax of Sentential Negation in Yemeni Arabic Abdulrahman Alqurashi1 & Mukarram Abduljalil1 1 Department of European Languages & Literature, King Abdelaziz University, Jeddah, Saudi Arabia Correspondence: Abdulrahman Alqurashi, P.O. BOX 80200, Jeddah 21589, Saudi Arabia. E-mail: [email protected] Received: December 26, 2019 Accepted: January 31, 2020 Online Published: February 23, 2020 doi:10.5539/ijel.v10n2p331 URL: https://doi.org/10.5539/ijel.v10n2p331 Abstract In this paper we explore the system of negation in modern Arabic dialects with a particular focus on Yemeni Arabic (Raymi dialect). The data observed in this dialect incorporate important and novel facts related to the syntax of sentential negation in Arabic. This includes the distribution of negation patterns and the interaction between negation and negative polarity items, which challenges the two widely adopted analyses for sentential negation in Arabic: The Spec-NegP analysis and the discontinuous Neg analysis. In this paper we argue that neither analysis can provide an adequate account of Raymi Arabic facts. Instead, a more recent analysis, the Spilt-Neg analysis, can accommodate them. In addition, in the study we provide empirical evidence in support of the Higher-Neg analysis, wherein Neg is projected higher than T in the derivation. Keywords: Arabic dialects, discontinuous negation, negative polarity items, non-discontinuous negation, Raymi dialect, sentential negation, Yemeni Arabic 1. Introduction The syntax of negation in Arabic is as extremely diverse as the varieties of the language themselves.
    [Show full text]
  • Croft's Cycle in Arabic: the Negative Existential Cycle in a Single Language
    Linguistics 2020; aop David Wilmsen* Croft’s cycle in Arabic: The negative existential cycle in a single language https://doi.org/10.1515/ling-2020-0021 Abstract: Thenegativeexistentialcyclehasbeenshowntobeoperativein several language families. Here it is shown that it also operates within a single language. It happens that the existential fī that has been adduced as an example of a type A in the Arabic of Damascus, Syria, negated with the standard spoken Arabic verbal negator mā, does not participate in a negative cycle, but another Arabic existential particle does. Reflexes of the existential particle šay(y)/šē/šī/ši of southern peninsular Arabic dialects enter into a type A > B configuration as a univerbation between mā and the existential particle ši in reflexes of maši. It also enters that configuration in others as a uni- verbation between mā, the 3rd-person pronouns hū or hī, and the existential particle šī in reflexes of mahūš/mahīš.Atthatpoint,theexistentialparticlešī loses its identity as such to be reanalyzed as a negator, with reflexes of mahūš/mahīš negating all manner of non-verbal predications except existen- tials. As such, negators formed of reflexes of šī skip a stage B, but they re- enter the cycle at stage B > C, when reflexes of mahūš/mahīš begin negating some verbs. The consecutive C stage is encountered only in northern Egyptian and southern Yemeni dialects. An inchoate stage C > A appears only in dialects of Lower Egypt. Keywords: Arabic dialects, grammaticalization in Arabic, linguistic cycles, standard negation, negative existential cycle, southern Arabian peninsular dialects 1 Introduction The negative existential cycle as outlined by Croft (1991) is a six-stage cycle whereby the negators of existential predications – those positing the existence of something with assertions analogous to the English ‘there is/are’–overtake the role of verbal negators, eventually replacing them, if the cycle continues to *Corresponding author: David Wilmsen, Department of Arabic and Translation Studies, American University of Sharjah P.O.
    [Show full text]
  • Rhode Island College
    Rhode Island College M.Ed. In TESL Program Language Group Specific Informational Reports Produced by Graduate Students in the M.Ed. In TESL Program In the Feinstein School of Education and Human Development Language Group: Arabic Author: Joy Thomas Program Contact Person: Nancy Cloud ([email protected]) http://cache.daylife.com/imagese http://cedarlounge.files.wordpress.com rve/0fxtg1u3mF0zB/340x.jpg /2007/12/3e55be108b958-64-1.jpg Joy http://photos.igougo.com/image Thomas s/p183870-Egypt-Souk.jpg http://www.famous-people.info/pictures/muhammad.jpg Arabic http://blog.ivanj.com/wp- http://en.wikipedia.org/wiki/File:Learning_Arabi content/uploads/2008/04/dubai-people.jpg http://www.mrdowling.com/images/607arab.jpg c_calligraphy.jpg History • Arabic is either an official language or is spoken by a major portion of the population in the following countries: Algeria, Bahrain, Chad, Comoros, Djibouti, Egypt, Eritrea, Ethiopia, Iraq, Jordan, Kuwait, Libya, Lebanon, Mauritania, Morocco, Oman, Qatar, Saudi Arabia, Somalia, Sudan, Syria, Tunisia, United Arab Emirates, Yemen • Arabic is a Semitic language, it has been around since the 4th century AD • Three different kinds of Arabic: Classical or “Qur’anical” Arabic, Formal or Modern Standard Arabic and Spoken or Colloquial Arabic • Classical Arabic only found in the Qur’an, it is not used in communication, but all Muslims are familiar to some extent with it, regardless of nationality • Arabic is diglosic in nature, as there is a major difference between Modern Standard and Colloquial Arabic • Modern
    [Show full text]
  • A N G L a I S Is a Semitic Language, in the Same Family As Hebrew and Aramaic
    Université Cheikh Anta Diop de Dakar 1/3 01-19 G 35 A-20 Durée :2 heures OFFICE DU BACCALAUREAT Séries: L-AR – Coef. 2 Téléfax (221) 864 67 39 - Tél. : 824 95 92 - 824 65 81 Épreuve du 1er groupe A N G L A I S is a Semitic language, in the same family as Hebrew and Aramaic. Around 260 (العربية) Arabic million people use it as their first language. Many more people can also understand it, but not as a first language. It is written with the Arabic alphabet, which is written from right to left, like Hebrew. 4 Since it is so widely spoken throughout the world, it is one of the six official languages of the UN, alongside English, Spanish, French, Russian, and Chinese. Many countries speak Arabic as an official language, but not all of them speak it the same way. There are many dialects, or varieties of the language, like Modern Standard Arabic, Egyptian Arabic, 8 Gulf Arabic, Maghreb Arabic, Levantine Arabic, and many others. Some of these dialects are very different from each other; as a result speakers find it hard to understand the other. Most of the countries that use Arabic as their official language are in the Middle East. They are part of the Arab World. This is because the largest religion in the Middle East is Islam. The language 12 is very important in Islam because Muslims believe that Allah (God) used it to talk to Muhammad through the Archangel Jibreel (Gabriel), giving him the Quran in Arabic. Many Arabic speakers are Muslims, but not all are.
    [Show full text]
  • The Routledge Handbook of Arabic Linguistics
    Review Copy - Not for Distribution Youssef A Haddad - University of Florida - 02/01/2018 THE ROUTLEDGE HANDBOOK OF ARABIC LINGUISTICS The Routledge Handbook of Arabic Linguistics introduces readers to the major facets of research on Arabic and of the linguistic situation in the Arabic-speaking world. The edited collection includes chapters from prominent experts on various fields of Arabic linguistics. The contributors provide overviews of the state of the art in their field and specifically focus on ideas and issues. Not simply an overview of the field, this handbook explores subjects in great depth and from multiple perspectives. In addition to the traditional areas of Arabic linguistics, the handbook covers computational approaches to Arabic, Arabic in the diaspora, neurolinguistic approaches to Arabic, and Arabic as a global language. The Routledge Handbook of Arabic Linguistics is a much-needed resource for researchers on Arabic and comparative linguistics, syntax, morphology, computational linguistics, psycholinguistics, sociolinguistics, and applied linguistics, and also for undergraduate and graduate students studying Arabic or linguistics. Elabbas Benmamoun is Professor of Asian and Middle Eastern Studies and Linguistics at Duke University, USA. Reem Bassiouney is Professor in the Applied Linguistics Department at the American University in Cairo, Egypt. Review Copy - Not for Distribution Youssef A Haddad - University of Florida - 02/01/2018 Review Copy - Not for Distribution Youssef A Haddad - University of Florida - 02/01/2018
    [Show full text]
  • A Large Scale Corpus of Gulf Arabic Salam Khalifa, Nizar Habash, Dana Abdulrahim, Sara Hassan
    A Large Scale Corpus of Gulf Arabic Salam Khalifa, Nizar Habash, Dana Abdulrahim, Sara Hassan To cite this version: Salam Khalifa, Nizar Habash, Dana Abdulrahim, Sara Hassan. A Large Scale Corpus of Gulf Arabic. Language Resources and Evaluation Conference, 2016, Portoroz, Slovenia. hal-01349204 HAL Id: hal-01349204 https://hal.archives-ouvertes.fr/hal-01349204 Submitted on 3 Aug 2016 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. A Large Scale Corpus of Gulf Arabic Salam Khalifa, Nizar Habash, Dana Abdulrahimy, Sara Hassan Computational Approaches to Modeling Language Lab, New York University Abu Dhabi, UAE yUniversity of Bahrain, Bahrain {salamkhalifa,nizar.habash,sah650}@nyu.edu,[email protected] Abstract Most Arabic natural language processing tools and resources are developed to serve Modern Standard Arabic (MSA), which is the official written language in the Arab World. Some Dialectal Arabic varieties, notably Egyptian Arabic, have received some attention lately and have a growing collection of resources that include annotated corpora and morphological analyzers and taggers. Gulf Arabic, however, lags behind in that respect. In this paper, we present the Gumar Corpus, a large-scale corpus of Gulf Arabic consisting of 110 million words from 1,200 forum novels.
    [Show full text]
  • The Vocabulary of Eastern Arabia
    xv THE VOCABULARY OF EASTERN ARABIA Introduction Two geographical features are shared by all the Gulf States: desert hinterlands stretching away into central Arabia, from which they are separated by no natural border, and long, shallow-shelved coastlines which afford easy marine access. It would be hard to underestimate the formative significance of these two factors on the culture and language of the Gulf littoral. For millennia, the absence of natural barriers has facilitated easy movement into and out of eastern Arabia, and the cultural, linguistic and political history of the city-states of the Gulf littoral is the synergistic result of population and trade movements along these two vectors. On the one hand, the land vector has for centuries (most recently in the 18th) carried new infusions of tribal blood from the centre of Arabia to the periphery, providing the bedrock of Gulf Arabic vocabulary, which gradually evolved to reflect the different material conditions and life-style of the sedentary life of the coast. On the other hand, the sea has brought a succession of short- lived and long-term foreign cultural and linguistic influences, beginning with the Sumerians five millennia ago, and continuing virtually unbroken with the Babylonians, Persians, Indians, Portuguese, up to the arrival of the British in the 19th century. A glance at the list of languages which have contributed to the present-day vocabulary of the Bahrain dialects (see ABBREVIATIONS AND CONVENTIONS) illustrates both the impressive time-depth and geographical diversity of these influences. Even if it were feasible, it would be beyond the scope of this introductory essay (and the abilities of the present writer) to attempt an exact periodisation of the development of Arabic dialects of Bahrain, let alone the region as a whole.
    [Show full text]
  • A Field Guide for Continued Study of the Arabic Language in Yemen and Oman
    DOCUMENT RESUME ED 289 374 FL 017 088 AUTHOR Critchfield, David Lawrence TITLE A Field Guide for Continued Study of the Arabic Language in Yemen and Oman. INSTITUTION Peace Corps, Washington, D.C. PUB DATE 79 CONTRACT 79-487-4136 NOTE 81p. PUB TYPE Guides - Classroom Use Materials (For Learner) (051) EDRS PRICE MF01/PC04 Plus Postage. DESCRIPTORS *Arabic; Developing Nations; *Dialects; Dictionaries; Foreign Countries; Grammar; Homework; *Independent Study; Language Styles; *Learning Strategies; Notetaking; Oral Language; Reading Skills; Second Language Learning; Writing Skills IDENTIFIERS Oman; Peace Corps; Yemen ABSTRACT A set of materials for independent study of Arabic is designed for Peace Corps volunteers working in Oman and Yemen who have had Arabic language training but need additional skills. It establishes guidelines for independent study and working with a tutor, helps check language performance, and provides grammatical information for reference. The materials begin with a brief history of Arabic and a discussion of the language's different forms and dialects. Subsequent chapters address issues: (1) obtaining appropriate learning materials; (2) getting speaking and conversational practice; (3) taking notes and doing homework; (4) continuing study in reading and writing; (5) finding and working with a tutor; (6) the structural, phonological, and geographic differences in Arabic dialects; and (7) basic grammatical forms and structures. The text is in English with some Arabic examples. (MSE) **********************************************k************************
    [Show full text]
  • Arabic Cross-Dialectal Conversations with Implications for the Teaching of Arabic As a Second Language
    Arabic Cross-dialectal Conversations with Implications for the Teaching of Arabic as a Second Language Rasha Kadry Abdelatti Mohamed Soliman Submitted in accordance with the requirements for the degree of PhD in Arabic Linguistics The University of Leeds Linguistics and Phonetics School of Modern Languages and Cultures July, 2014 ii The candidate confirms that the work submitted is her own and that appropriate credit has been given where reference has been made to the work of others. This copy has been supplied on the understanding that it is copyright material and that no quotation from the thesis may be published without proper acknowledgement. The right of Rasha Kadry Abdelatti Mohamed Soliman to be identified as Author of this work has been asserted by her in accordance with the Copyright, Designs and Patents Act 1988. © 2014 The University of Leeds and Rasha Kadry Abdelatti Mohamed Soliman iii Dedication To my parents Hend El-Naggar and Kadry Soliman iv Acknowledgements "In the name of Allah, the most gracious, the most merciful. All praise to Allah by whose grace good deeds are achieved" I wish to express sincere gratitude and deep thanks to my supervisor, Dr. Melinda Whong for her advice, patience and encouragement. I owe her much for her intellectual guidance and moral support throughout every stage of my study and for all the invaluable skills she taught me which I know I will continue to grow and utilise. From the School of Modern Languages and Cultures at the University of Leeds, I would like to thank Ms. Karen Priestley for her excellent and prompt administrative support and Dr.
    [Show full text]