DIWAN: a Dialectal Word Annotation Tool for Arabic

Total Page:16

File Type:pdf, Size:1020Kb

DIWAN: a Dialectal Word Annotation Tool for Arabic DIWAN: A Dialectal Word Annotation Tool for Arabic Faisal Al-Shargi Owen Rambow Universität Leipzig CCLS, Columbia University Leipzig New York, NY Germany USA [email protected] [email protected] leipzig.de Abstract prehensible to an Arabic speaker from the Levant or the Arabian Peninsula.1 This paper presents DIWAN, an anno- tation interface for Arabic dialectal Within these broad regions, further and consider- texts. While the Arabic dialects differ able geographic distinctions exist – within coun- in many respects from each other and tries, across country borders, and between cities from Modern Standard Arabic, they al- and villages. Some examples include Gulf Ara- so have much in common. To facilitate bic, Bahraini Arabic, Najdi Arabic, Hijazi Ara- annotation and to make it as efficient as bic, Yemeni Arabic, Yemeni Hadhrami Arabic, possible, it is therefore not advisable to Yemeni Sanaani Arabic, Yemeni Ta'izzi-Adeni treat each Arabic dialect as a separate Arabic, Dhofari Arabic, Omani Arabic, Shihhi language, unrelated to the other vari- Arabic, and the Peninsular Arabic dialects. ants of Arabic. Instead, we make anal- yses from other variants available to Despite this diversity, all Arabic dialects share the annotator, who then can choose to certain properties: much of their phonology, use them or not. templatic morphology augmented by affixes and a large set of clitics, large parts of their syntax, 1. Introduction and important (though unpredictable) parts of the lexicon. Arabic is a Central Semitic language, closely related to Aramaic, Hebrew, Ugaritic and Phoe- Current natural language processing (NLP) tools nician. It is spoken by 420 million speakers (na- work well with MSA because they were de- tive and non-native) in the Arab World. Arabic signed specifically for the processing of MSA, also is a liturgical language of 1.6 billion Mus- and because of the abundance of MSA resources. lims around the world. Applying the NLP tools designed for MSA di- rectly to DA yields significantly lower perfor- Modern Standard Arabic (MSA) is the official mance (Chiang et al., 2006; Habash and Ram- Arabic language. It is the educational language bow, 2006; Benajiba et al., 2010; Habash et al., and official language used in news and official 2012). This makes it imperative to direct re- communication across the Arabic-speaking search to building resources and tools for DA world. When Arabs communicate spontaneously processing. in informal settings, they use dialectal Arabic (DA). There are divisions of many dialects of the 1 When Arabic speakers of different dialects meet, they tend Arabic language that occur between the spoken to navigate towards a middle Arabic that encapsulates the languages of different regions. Some varieties of shared aspects they are aware of in order to maximize communication. A better and harder test of comprehension Arabic in North Africa, for example, are incom- is to eavesdrop on a conversation in another dialect. 49 Proceedings of the Second Workshop on Arabic Natural Language Processing, pages 49–58, Beijing, China, July 26-31, 2015. c 2014 Association for Computational Linguistics Figure 1: An example of MSA and DA code switching Arabic dialects lack large amounts of consistent This paper explains the design decisions we have data due to two main factors: the lack of ortho- made in order to meet these goals. DIWAN is graphic standards for the dialects, and the lack of fully implemented for use on Microsoft Win- overall Arabic content on the web (Benajiba et dows and is currently in use for the annotation of al., 2010). While the rise of the internet has in- Palestinian, Yemeni, and Moroccan Arabic. creased the amount of DA being written, some- times Arabic dialects come mixed with the MSA This paper is structured as follows. In Section 2, in various forms of text (see Figure 1, which we review the NLP components we use in DI- shows the code switching in our DIWAN tool). WAN. In Section 3, we describe the workflow Furthermore, language used in social media pos- when using DIWAN. In Section 4, we describe es a challenge for NLP tools in general in any the specific annotation tasks the annotator per- language due to the difference in genre. There- forms. Section 5 gives some technical detail fore, in order to create tools for dialectal Arabic, about the implementation. Section 6 discusses annotated DA corpora are needed in a variety of related work. We conclude in Section 7 with a dialects. discussion of future work. The goal of our Dialectal Word Annotation tool (DIWAN) is to address these gaps on the re- 2. NLP Resources used in DIWAN source creation level. In designing DIWAN, we have determined several important design goals: In order to make the annotation task easier, DI- WAN uses three main existing NLP resources: 1. We want to exploit the similarity between the MSA morphological analyzer SAMA, the dialects as much as possible to facilitate an- Egyptian morphological analyzer CALIMA- notation, which in general is costly and EGY, and the morphological tagger MADAMI- slow. RA which works for both MSA and Egyptian. 2. We want to use a convention for orthogra- We describe them in turn. phy (which the input text does not neces- sarily follow). The first resource is the Standard Arabic Mor- 3. We want to create data which can be used phological Analyzer, SAMA 3.1 (Graff et al. both for creating morphological analyzers 2009), which is based on the BAMA analyzer (which produce all morphological analyses (Buckwalter 2004). This system uses lexical for a word outside of any context) and mor- databases, divided into prefixes, stems, and suf- phological taggers (which determine the fixes, to assign words all possible MSA analyses. correct morphological analysis -- including A sample output is shown in Figure 2 (in Buck- ﻣﺎﺷﻲ the POS tag -- for a word in context). walter transliteration), for the input word mA$i, which is ambiguous between various in- flected forms of a verb meaning `walk’. 50 mA$y ﻲﺷﺎﻣ Figure 2: SAMA result for search on word mA$y ﻲﺷﺎﻣ Figure 3: CALIMA-Egyptian result for search on word The second resource is the Columbia Arabic 3. DIWAN Workflow Language and dialect Morphological Analyzer for Egyptian (CALIMA-EGY) (Habash et al. We designed and built DIWAN as a desktop ap- 2012b). It is an analyzer for Egyptian. A sample plication which can work locally (offline) or output is shown in Figure 3 for the input word online. As an annotation tool, we have designed mA$y. CALIMA returns the MSA readings DIWAN with two types of users: administrators ﻣﺎﺷﻲ shown in Figure 2, and in addition has Egyptian and annotators. The administrator’s responsibil- readings, in particular the interjection `OK’. ity is to create the DIWAN database, specify its settings, and track the annotator’s work. The third resource is MADAMIRA (Pasha et al. 2014). MADAMIRA is a system for morpholog- The administrator has several roles: ical analysis and disambiguation of Arabic that combines some of the best aspects of two previ- 1. She can create, edit and delete tables in ously commonly used systems for Arabic pro- the database. cessing, MADA (Habash and Rambow, 2005; 2. She can create, edit and delete annotator Habash et al., 2009; Habash et al., 2013) and accounts. AMIRA (Diab et al., 2007). MADAMIRA im- 3. She can check the status of the annota- proves upon the two systems with a more stream- tion tasks for each annotator. lined Java implementation that is more robust, 4. She can trace the annotator progress, portable, extensible, and is faster than its ances- work time, errors, etc. tors by more than an order of magnitude. Con- 5. She can generate reports and statistics on trary to SAMA and CALIMA-EGY, which pro- the underlying database (created by the vide all morphological analyses for a word re- annotators). gardless of context, MADAMIRA chooses a sin- 6. She can of course also annotate the data. gle analysis given the context of the word in a -The annotators can only annotate data. The ad ﻣﺎﺷﻲ ﻛﺪه ؟ sentence. For example, in the sentence mA$i kdh? `Is that OK?’, the interjection mean- ministrator assigns tasks to each annotator, and ing will be chosen. the annotations are added to the DIWAN data- base. As the annotator creates annotations, he 51 can reuse the resulting lexical entries in the DI- WAN tool as a new resource for himself or for other annotators. To work with DIWAN, the administrator first prepares the data. We assume that the data is DA written in Arabic script. There are two ways of preparing the data: 1. The administrator can either simply use DI- WAN itself to identify sentences and words in the corpus. DIWAN extracts sentences and words from the prepared file and builds a DIWAN database. 2. Or the administrator can send the corpus to MADAMIRA. MADAMIRA not only iden- tifies sentences and words, it also performs morphological analysis (using MSA and Egyptian resources) and tagging, making a single analysis available for each input word in context. After getting the resulting data Figure 4: DIWAN setup workflow from MADAMIRA, DIWAN will present the analysis for each word to the annotator as a default annotation option. As in the previ- ous case, DIWAN extracts sentences and words from the prepared file and builds a When online, several annotators can work at DIWAN database (which now includes the once, sharing their work immediately through the MADAMIRA analysis). centralized database. These two options are shown in Figure 4. 4.2. Choice of Word to Annotate and The annotator makes the dialect annotations by Using the Resources using the DIWAN GUI.
Recommended publications
  • Christians and Jews in Muslim Societies
    Arabic and its Alternatives Christians and Jews in Muslim Societies Editorial Board Phillip Ackerman-Lieberman (Vanderbilt University, Nashville, USA) Bernard Heyberger (EHESS, Paris, France) VOLUME 5 The titles published in this series are listed at brill.com/cjms Arabic and its Alternatives Religious Minorities and Their Languages in the Emerging Nation States of the Middle East (1920–1950) Edited by Heleen Murre-van den Berg Karène Sanchez Summerer Tijmen C. Baarda LEIDEN | BOSTON Cover illustration: Assyrian School of Mosul, 1920s–1930s; courtesy Dr. Robin Beth Shamuel, Iraq. This is an open access title distributed under the terms of the CC BY-NC 4.0 license, which permits any non-commercial use, distribution, and reproduction in any medium, provided no alterations are made and the original author(s) and source are credited. Further information and the complete license text can be found at https://creativecommons.org/licenses/by-nc/4.0/ The terms of the CC license apply only to the original material. The use of material from other sources (indicated by a reference) such as diagrams, illustrations, photos and text samples may require further permission from the respective copyright holder. Library of Congress Cataloging-in-Publication Data Names: Murre-van den Berg, H. L. (Hendrika Lena), 1964– illustrator. | Sanchez-Summerer, Karene, editor. | Baarda, Tijmen C., editor. Title: Arabic and its alternatives : religious minorities and their languages in the emerging nation states of the Middle East (1920–1950) / edited by Heleen Murre-van den Berg, Karène Sanchez, Tijmen C. Baarda. Description: Leiden ; Boston : Brill, 2020. | Series: Christians and Jews in Muslim societies, 2212–5523 ; vol.
    [Show full text]
  • New Perspectives in Biblical and Rabbinic Hebrew Rabbinic Hebrew Aaron D
    Cambridge Semitic Languages and Cultures Hornkohl and Hornkohl and New Perspectives in Biblical and Khan (eds) New Perspectives in Biblical and Rabbinic Hebrew Rabbinic Hebrew Aaron D. Hornkohl and Geoffrey Khan (eds) EDITED BY AARON D. HORNKOHL AND GEOFFREY KHAN Most of the papers in this volume originated as presenta� ons at the conference Biblical Hebrew and Rabbinic Hebrew: New Perspecti ves in Philology and Linguisti ,cs which was held at the University of Cambridge, 8–10th July, 2019. The aim of the conference was to build bridges between various strands of research in the fi eld of Hebrew language studies that rarely meet, namely philologists working on Biblical Hebrew, philologists working on New Perspectives in Biblical Rabbinic Hebrew and theore� cal linguists. The volume is the published outcome of this ini� a� ve. It contains peer-reviewed papers and Rabbinic Hebrew in the fi elds of Biblical and Rabbinic Hebrew that advance the fi eld by the philological inves� ga� on of primary sources and the applica� on of cu� ng-edge linguis� c theory. These include contribu� ons by established scholars and by students and early career researchers. This is the author-approved edi� on of this Open Access � tle. As with all Open Book publica� ons, this en� re book is available to read for free on the publisher’s website. Printed and digital edi� ons, together with supplementary digital material, can also be found here: www.openbookpublishers.com Cover image: Genizah fragment of the Hebrew Bible with Babylonian vocalisati on (Num. 18.27-28, Cambridge University Library T-S A38.12; courtesy of the Syndics of Cambridge University Library).
    [Show full text]
  • A Note on the Genitive Particle Ħaqq in Yemeni Arabic Free Genitives Mohammed Ali Qarabesh, University of Albayda Mohammed Q
    A note on ħaqq in Yemeni Arabic … Qarabesh & Shormani A Note on the Genitive particle ħaqq in Yemeni Arabic Free Genitives Mohammed Ali Qarabesh, University of Albayda Mohammed Q. Shormani, University of Ibb الملخص: تتىاوه هذي اىىرقح ميمح "حق" فٍ اىيهجح اىُمىُح ورتثتها اىىحىَح فٍ تزمُة إضافح اىمينُح اىتحيُيُح، وتقذً ىها تحيُو وحىٌ وصفٍ، حُث َفتزض اىثاحثان أن هىاك وىػُه مه هذي اىنيمح فٍ اىيهجح اىُمىُح: ا( تيل اىتٍ ﻻ تظهز ػيُها ػﻻماخ اىتطاتق، مثو "اىسُاراخ حق ػيٍ"، حُث وزي أن ميمح "اىسُاراخ" ىها اىسماخ )جمغ، مؤوث، غائة( وىنه ميمح "حق" ﻻ تتطاتق مؼها فٍ أٌ مه هذي اىصفاخ، و ب( تيل اىتٍ تظهز ػيُها ػﻻماخ اىتطاتق مثو "اىسُاراخ حقاخ ػيٍ" حُث تتطاتق اىنيمتان "اىسُاراخ" و"حقاخ" فٍ مو اىسماخ. وؼَزض اىثاحثان أن اىىىع اﻷوه َ ستخذً فٍ مىاطق مثو صىؼاء، ػذن، إب... اىخ، واىثاوٍ فٍ شثىج وحضزمىخ ... اىخ. وَخيص اىثاحثان إىً أن هىاك دىُو ػميٍ ىُس فقظ ػيً وجىد اىىحى اىنيٍ فٍ "اىمينح اىيغىَح" تو أَضا ػيً "تَ ْى َس َطح" هذا اىىحى، ىُس فقظ تُه اىيغاخ تو وتُه ىهجاخ اىيغح اىىاحذج. الكلمات المفتاحية: اىيغاخ اىسامُح، اىؼزتُح اىُمىُح، اىؼثزَح، اى م ْينُح، "حق" Abstract This paper provides a descriptive syntactic analysis of ħaqq in Yemeni Arabic (YA). ħaqq is a Semitic Free Genitive (FG) particle, much like the English of. A FG minimally consists of a head N, genitive particle and genitive DP complement. It (in a FG) expresses or conveys the meaning of possessiveness, something like of in English. There are two types of ħaqq in Yemeni Arabic: one not exhibiting agreement with the head N, and another exhibiting it.
    [Show full text]
  • Arabic Sociolinguistics: Topics in Diglossia, Gender, Identity, And
    Arabic Sociolinguistics Arabic Sociolinguistics Reem Bassiouney Edinburgh University Press © Reem Bassiouney, 2009 Edinburgh University Press Ltd 22 George Square, Edinburgh Typeset in ll/13pt Ehrhardt by Servis Filmsetting Ltd, Stockport, Cheshire, and printed and bound in Great Britain by CPI Antony Rowe, Chippenham and East bourne A CIP record for this book is available from the British Library ISBN 978 0 7486 2373 0 (hardback) ISBN 978 0 7486 2374 7 (paperback) The right ofReem Bassiouney to be identified as author of this work has been asserted in accordance with the Copyright, Designs and Patents Act 1988. Contents Acknowledgements viii List of charts, maps and tables x List of abbreviations xii Conventions used in this book xiv Introduction 1 1. Diglossia and dialect groups in the Arab world 9 1.1 Diglossia 10 1.1.1 Anoverviewofthestudyofdiglossia 10 1.1.2 Theories that explain diglossia in terms oflevels 14 1.1.3 The idea ofEducated Spoken Arabic 16 1.2 Dialects/varieties in the Arab world 18 1.2. 1 The concept ofprestige as different from that ofstandard 18 1.2.2 Groups ofdialects in the Arab world 19 1.3 Conclusion 26 2. Code-switching 28 2.1 Introduction 29 2.2 Problem of terminology: code-switching and code-mixing 30 2.3 Code-switching and diglossia 31 2.4 The study of constraints on code-switching in relation to the Arab world 31 2.4. 1 Structural constraints on classic code-switching 31 2.4.2 Structural constraints on diglossic switching 42 2.5 Motivations for code-switching 59 2.
    [Show full text]
  • Emerged from Antiquity As an All-Jewish Possession, Together with Is Interesting
    7+ Yiddish in the Framework of OtherJewish Languages Yiddish in the Framework of OtherJewish Languages there discoverable threads extending lrom these three linguistic groups "Arabic" as a native tongue amongJews (in z.rr.I it will become to the ancient Parsic? These questions have not yet been touched by cle ar why it is more appropriate to spe ak of a separate Jewish language scholarship. with Arabic stock, which.may be called Yahudic) is current among a 2.ro The sunset of Targumic as the spoken language of a major much larger group. On the eve of World War II the number of Yahudic Jewish community came with the rise of the Arabs (z.r.r). A survey of speakers was estimated at about seven hundred thousand. Of course, we the linguistic condition of the Jews up to the Arab period is therefore in have no statistics on the Gaonic period, but by no means can the current place. figure give us any idea of the proportion and the dynamics of Yahudic The frontal attack of Hellenism on Jewish culture failed; but at least in former years. By virtue of the Arab conquests, Yahudic was firmly it was historical drama on a large scaie, and visible signs olJaphet's established in Yemen, Babylonia, Palestine, and all of North Africa, beauty remained in the tents of Shem, to use a stock phrase so popular from Egypt to the Atlantic; even Sicily and southern Italy, which as a in the Haskalah period. Nor will we leave Persian out of consideration rule should be included in the Yavanic culture area (z.I 2 ), were at times in the overall picture ofJewish subcultures, although the phenomenon considerably influenced by North Africa.
    [Show full text]
  • Semantic Innovation and Change in Kuwaiti Arabic: a Study of the Polysemy of Verbs
    ` Semantic Innovation and Change in Kuwaiti Arabic: A Study of the Polysemy of Verbs Yousuf B. AlBader Thesis submitted to the University of Sheffield in fulfilment of the requirements for the degree of Doctor of Philosophy in the School of English Literature, Language and Linguistics April 2015 ABSTRACT This thesis is a socio-historical study of semantic innovation and change of a contemporary dialect spoken in north-eastern Arabia known as Kuwaiti Arabic. I analyse the structure of polysemy of verbs and their uses by native speakers in Kuwait City. I particularly report on qualitative and ethnographic analyses of four motion verbs: dašš ‘enter’, xalla ‘leave’, miša ‘walk’, and i a ‘run’, with the aim of establishing whether and to what extent linguistic and social factors condition and constrain the emergence and development of new senses. The overarching research question is: How do we account for the patterns of polysemy of verbs in Kuwaiti Arabic? Local social gatherings generate more evidence of semantic innovation and change with respect to the key verbs than other kinds of contexts. The results of the semantic analysis indicate that meaning is both contextually and collocationally bound and that a verb’s meaning is activated in different contexts. In order to uncover the more local social meanings of this change, I also report that the use of innovative or well-attested senses relates to the community of practice of the speakers. The qualitative and ethnographic analyses demonstrate a number of differences between friendship communities of practice and familial communities of practice. The groups of people in these communities of practice can be distinguished in terms of their habits of speech, which are conditioned by the situation of use.
    [Show full text]
  • Arabic and Contact-Induced Change Christopher Lucas, Stefano Manfredi
    Arabic and Contact-Induced Change Christopher Lucas, Stefano Manfredi To cite this version: Christopher Lucas, Stefano Manfredi. Arabic and Contact-Induced Change. 2020. halshs-03094950 HAL Id: halshs-03094950 https://halshs.archives-ouvertes.fr/halshs-03094950 Submitted on 15 Jan 2021 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Arabic and contact-induced change Edited by Christopher Lucas Stefano Manfredi language Contact and Multilingualism 1 science press Contact and Multilingualism Editors: Isabelle Léglise (CNRS SeDyL), Stefano Manfredi (CNRS SeDyL) In this series: 1. Lucas, Christopher & Stefano Manfredi (eds.). Arabic and contact-induced change. Arabic and contact-induced change Edited by Christopher Lucas Stefano Manfredi language science press Lucas, Christopher & Stefano Manfredi (eds.). 2020. Arabic and contact-induced change (Contact and Multilingualism 1). Berlin: Language Science Press. This title can be downloaded at: http://langsci-press.org/catalog/book/235 © 2020, the authors Published under the Creative Commons Attribution
    [Show full text]
  • Cognate Words in Mehri and Hadhrami Arabic
    Cognate Words in Mehri and Hadhrami Arabic Hassan Obeid Alfadly* Khaled Awadh Bin Mukhashin** Received: 18/3/2019 Accepted: 2/5/2019 Abstract The lexicon is one important source of information to establish genealogical relations between languages. This paper is an attempt to describe the lexical similarities between Mehri and Hadhrami Arabic and to show the extent of relatedness between them, a very little explored and described topic. The researchers are native speakers of Hadhrami Arabic and they paid many field visits to the area where Mehri is spoken. They used the Swadesh list to elicit their data from more than 20 Mehri informants and from Johnston's (1987) dictionary "The Mehri Lexicon and English- Mehri Word-list". The researchers employed lexicostatistical techniques to analyse their data and they found out that Mehri and Hadhrmi Arabic have so many cognate words. This finding confirms Watson (2011) claims that Arabic may not have replaced all the ancient languages in the South-Western Arabian Peninsula and that dialects of Arabic in this area including Hadhrami Arabic are tinged, to a greater or lesser degree, with substrate features of the Pre- Islamic Ancient and Modern South Arabian languages. Introduction: three branches including Central Semitic, Historically speaking, the Semitic language Ethiopian and Modern south Arabian languages family from which both of Arabic and Mehri (henceforth MSAL). Though Arabic and Mehri descend belong to a larger family of languages belong to the West Semitic, Arabic descends called Afro-Asiatic or Hamito-Semitic that from the Central Semitic and Mehri from includes Semitic, Egyptian, Cushitic, Omotic, (MSAL) which consists of two branches; the Berber and Chadic (Rubin, 2010).
    [Show full text]
  • Modern Standard Arabic ﺝ
    International Journal of Linguistics, Literature and Culture (Linqua- IJLLC) December 2014 edition Vol.1 No.3 /Ʒ/ AND /ʤ/ :ﺝ MODERN STANDARD ARABIC Hisham Monassar, PhD Assistant Professor of Arabic and Linguistics, Department of Arabic and Foreign Languages, Cameron University, Lawton, OK, USA Abstract This paper explores the phonemic inventory of Modern Standard ﺝ Arabic (MSA) with respect to the phoneme represented orthographically as in the Arabic alphabet. This phoneme has two realizations, i.e., variants, /ʤ/, /ӡ /. It seems that there is a regional variation across the Arabic-speaking peoples, a preference for either phoneme. It is observed that in Arabia /ʤ/ is dominant while in the Levant region /ӡ/ is. Each group has one variant to the exclusion of the other. However, there is an overlap regarding the two variants as far as the geographical distribution is concerned, i.e., there is no clear cut geographical or dialectal boundaries. The phone [ʤ] is an affricate, a combination of two phones: a left-face stop, [d], and a right-face fricative, [ӡ]. To produce this sound, the tip of the tongue starts at the alveolar ridge for the left-face stop [d] and retracts to the palate for the right-face fricative [ӡ]. The phone [ӡ] is a voiced palato- alveolar fricative sound produced in the palatal region bordering the alveolar ridge. This paper investigates the dichotomy, or variation, in light of the grammatical (morphological/phonological and syntactic) processes of MSA; phonologies of most Arabic dialects’ for the purpose of synchronic evidence; the history of the phoneme for diachronic evidence and internal sound change; as well as the possibility of external influence.
    [Show full text]
  • Lenition in the Mozarabic Dialects: a Reappraisal
    Al-Qantara, vol. 18, nº 1 (1997) LENITION IN THE MOZARABIC DIALECTS: A REAPPRAISAL DAVID HANLON Birkbeck College, University of London Description of the varieties of Romance formerly spoken in al-An- dalus is complicated by the documentation of the greater part of the ex­ tant textual evidence in an alphabet which had been contrived to repre­ sent the sound system and syllabic structure of an unrelated language, and by the lack of any appreciable attempt on the part of the alphabet's users to adapt it to this new task. Lenition provides a measure of the difficulties this poses: whereas it has been possible to speculate on why the intervocalic plosives of Latin voiced in other varieties of Hispano- Romance, students of the Mozarabic dialects have sought to establish whether such lenition took place at all. Opinion remains divided bet­ ween those, like Meyer-Lübke i, who believe that it did not occur, and those, like Menéndez Pidal 2, who believe that it did. Meyer-Lübke affirms that the voicing of the intervocalic plosives in the Peninsula did not take place until after the Germanic invasions ^. He concludes from an acritical examination of the material that conti­ nuations of Latin /-t-/ and /-k-/ remained voiceless in the Mozarabic dialects at the time of the Islamic invasions '^, a view shared by Hall ^. Griffin observes that Latin /-t-/ and /-k-/ are transcribed as (voiceless) ;â'and çâ/respectively in the Vocabulista in arábico. He argues that the former was identified with Romance /-t-/, a dental plosive that had lost some its articulatory force but remained voiceless 6, and that the latter 1 Meyer-Lübke, W., «La sonorización de las sordas intervocálicas latinas en espa­ ñol», Revista de Filología Española 11 (1924), 1-32.
    [Show full text]
  • The Moraic Nature of Geminate Consonants in Moroccan Arabic: Evidence from Word-Minimality, Syllable Structure and Word Formation Ayoub Noamane
    The Moraic Nature of Geminate Consonants in Moroccan Arabic: Evidence from Word-minimality, Syllable Structure and Word Formation Ayoub Noamane To cite this version: Ayoub Noamane. The Moraic Nature of Geminate Consonants in Moroccan Arabic: Evidence from Word-minimality, Syllable Structure and Word Formation. The International Journal of Arabic Lin- guistics, Institut Marocain de l’Information Scientifique et Technique, 2019, 5 (2), pp.100-129. halshs- 02925497 HAL Id: halshs-02925497 https://halshs.archives-ouvertes.fr/halshs-02925497 Submitted on 27 Sep 2020 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. The Moraic Nature of Geminate Consonants in M oroccan A rabic : Evidence from Word - minimality, Syllable S tructure and Word F ormation 1 Ayoub Noamane Mohammed V University, Rabat, Morocco [email protected] ﻣﻠﺨﺺ ﯾﮭﺪف ھﺬا اﻟﺒﺤﺚ إﻟﻰ اﺳﺘﺤﻀﺎر أدﻟﺔ ﺗﺪﻋﻢ اﻟﺘﻤﺜﯿﻞ اﻟﻌﺮوﺿﻲ اﻟﻤﺒﻨﻲ ﻋﻠﻰ اﻟﻮﻗﻊ ' mora ' ﻟﻠﺼﻮاﻣﺖ اﻟﻤﻀﻌﻔﺔ ﻓﻲ اﻟﻌﺮﺑﯿﺔ اﻟﻤﻐﺮﺑﯿﺔ . وﻣﻦ أﺟﻞ اﺳﺘﺤﻀﺎر أﻣﺜﻠﺔ ﻣﻦ ھﺬا اﻟﻘﺒﯿﻞ، ﻧﻌﺘﻤﺪ ﻋﻠﻰ ﺛﻼث ظﻮاھﺮ ﻣﻮرﻓﻮ - ﻓﻨﻮﻟﻮﺟﯿﺔ، وھﻲ : اﻟﺤﺪ اﻷدﻧﻰ ﻟﻠﻜﻠﻤﺔ واﻟﺒﻨﯿﺔ اﻟﻤﻘﻄ ﻌﯿﺔ ﻟﻸﺻﻮات واﻟﺘﻀﻌﯿﻒ اﻻﺷﺘﻘﺎﻗﻲ ﻟﻠﻜﻠﻤﺔ . ﺑﺤﯿﺚ ﻧﺒﯿﻦ أن اﻟﻌﺮﺑﯿﺔ اﻟﻤﻐﺮﺑﯿﺔ ﻻ ﺗﺴﻤﺢ ﺑﻮﺟﻮد ﻛﻠﻤﺎت ﻣﻜﻮﻧﺔ ﻣﻦ ﻣﻘﻄﻊ ﺻﻮﺗﻲ ﻗﺼﯿﺮ ﻋﻠﻰ ﺷﻜﻞ ﺻﺎﻣﺖ - ﺻﺎﺋﺖ ( CV ) ﻧﻈﺮا إﻟﻰ أﻧﮫ أﺣﺎدي اﻟﻮﻗﻊ وﺑﺬﻟﻚ ﻻ ﯾﺤﺘﺮم اﻟﺤﺪ اﻷدﻧﻰ اﻟﻤﺘﻤﺜﻞ ﻓﻲ وﻗﻌﯿﻦ، ﺑﺎﺳﺘﺜﻨﺎء إذا ﻛﺎن اﻟﺼﺎﻣﺖ ﻣﻀﻌﻔﺎ ( GV ) ، ﻣﻤﺎ ﯾﺸﯿﺮ إﻟﻰ ا ﺗﺼﺎﻟﮫ ﺑ ﻮﻗﻊ ﻋﻠﻰ ﻏﺮار اﻟﺼﺎﺋﺖ اﻟﺬي ﯾﻠﯿﮫ .
    [Show full text]
  • The Purpose of This Report Is to Survey and Evaluate the Various Facets of the Teaching of Arabic in the United States and Give an Appraisal of the State of the Art
    DOCUMENT RESUME ED 024 051 AL 001 627 By- Abboud, Peter F. The Teaching of Arabic in the United States: The State of the Art. Center for Applied Linguistics, Washington, D.C. ERIC Clearinghouse for Linguistics. Pub Date Dec 68 Note- 47p. EDRS Price MF-$0.25 HC-$2.45 Descriptors- Annotated Bibliographies, *Arabic, Diglossia, *Instructional Materials, *Language Instruction, Sociolinguistics, Standard Spoken Usage, Surveys, *Teaching, *Teaching Methods, Textbook Evaluation The purpose of this report is to survey and evaluate the various facetsof teaching of Arabic in the United States, and to give an appraisal of the state of the art. The objectives o f Arabic language teaching, the special problems thatArabic presents to Americans, and the various components common to all foreign language teaching programs, namely methods, manpower, materials, and university resources are considered, and recommendations presented.Appended are (1) a selected list of materials used in Arabic instruction with brief descriptive and evaluative annotations, and (2) background information on the socio-linguistic profile of the area where Arabic is used today. (Author/AMM) a a EDUCATIONAL RESOURCES INFORMATION CENTER CLEARINGHOUSE FOR LINGUISTICS CENTER FOR APPLIED LINGUISTICS, 1717 MASSACHUSETTS AVENUE, N. W., WASHINGTON, D, C. 20036 U.S. DEPARTMENT OF HEALTH, EDUCATION & WELFARE 4-) OFFICE OF EDUCATION THIS DOCUMENT HAS BEEN REPRODUCED EXACTLY AS RECEIVED FROM THE PERSON OR ORGANIZATION ORIGINATING IT.POINTS OF VIEW OR OPINIONS STATED DO NOT NECESSARILY REPRESENT OFFICIAL OFFICE OF EDUCATION POSITION OR POLICY. it THE TEACHING OF ARABIC IN THE UNITED STATES: THE STATE OF THE ART by PETER F. ABBOUD L 001627 MAIRTNIMM.M.,11,11.4 Foreword This state-of-the-art paper has been commissioned by the ERIC Clearing- house for Linguistics in collaboration with the Foreign Language Program of the Center for Applied Linguistics.
    [Show full text]