Saudi Dialects: Are They Endangered?
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Automatic Identification of Arabic Language Varieties and Dialects in Social Media
Automatic Identification of Arabic Language Varieties and Dialects in Social Media Fatiha Sadat Farnazeh Kazemi Atefeh Farzindar University of Quebec in NLP Technologies Inc. NLP Technologies Inc. Montreal, 201 President Ken- 52 Le Royer Street W., 52 Le Royer Street W., nedy, Montreal, QC, Canada Montreal, QC, Canada Montreal, QC, Canada [email protected] [email protected] [email protected] Abstract Modern Standard Arabic (MSA) is the formal language in most Arabic countries. Arabic Dia- lects (AD) or daily language differs from MSA especially in social media communication. However, most Arabic social media texts have mixed forms and many variations especially be- tween MSA and AD. This paper aims to bridge the gap between MSA and AD by providing a framework for AD classification using probabilistic models across social media datasets. We present a set of experiments using the character n-gram Markov language model and Naive Bayes classifiers with detailed examination of what models perform best under different condi- tions in social media context. Experimental results show that Naive Bayes classifier based on character bi-gram model can identify the 18 different Arabic dialects with a considerable over- all accuracy of 98%. 1 Introduction Arabic is a morphologically rich and complex language, which presents significant challenges for nat- ural language processing and its applications. It is the official language in 22 countries spoken by more than 350 million people around the world1. Moreover, the Arabic language exists in a state of diglossia where the standard form of the language, Modern Standard Arabic (MSA) and the regional dialects (AD) live side-by-side and are closely related (Elfardy and Diab, 2013). -
Christians and Jews in Muslim Societies
Arabic and its Alternatives Christians and Jews in Muslim Societies Editorial Board Phillip Ackerman-Lieberman (Vanderbilt University, Nashville, USA) Bernard Heyberger (EHESS, Paris, France) VOLUME 5 The titles published in this series are listed at brill.com/cjms Arabic and its Alternatives Religious Minorities and Their Languages in the Emerging Nation States of the Middle East (1920–1950) Edited by Heleen Murre-van den Berg Karène Sanchez Summerer Tijmen C. Baarda LEIDEN | BOSTON Cover illustration: Assyrian School of Mosul, 1920s–1930s; courtesy Dr. Robin Beth Shamuel, Iraq. This is an open access title distributed under the terms of the CC BY-NC 4.0 license, which permits any non-commercial use, distribution, and reproduction in any medium, provided no alterations are made and the original author(s) and source are credited. Further information and the complete license text can be found at https://creativecommons.org/licenses/by-nc/4.0/ The terms of the CC license apply only to the original material. The use of material from other sources (indicated by a reference) such as diagrams, illustrations, photos and text samples may require further permission from the respective copyright holder. Library of Congress Cataloging-in-Publication Data Names: Murre-van den Berg, H. L. (Hendrika Lena), 1964– illustrator. | Sanchez-Summerer, Karene, editor. | Baarda, Tijmen C., editor. Title: Arabic and its alternatives : religious minorities and their languages in the emerging nation states of the Middle East (1920–1950) / edited by Heleen Murre-van den Berg, Karène Sanchez, Tijmen C. Baarda. Description: Leiden ; Boston : Brill, 2020. | Series: Christians and Jews in Muslim societies, 2212–5523 ; vol. -
After Serbo-Croatian: the Narcissism of Small Difference1
polish 3()’ 171 10 sociological review ISSN 1231 – 1413 After Serbo-Croatian: The Narcissism of Small Difference1 Snježana Kordić. Jezik i nacjonalizam [Language and Nationalism] (Rotulus / Universitas Series). Zagreb, Croatia: Durieux. 2010. ISBN 978-953-188-311-5. Keywords: Croatian, kroatistik, language politics, nationalism, Serbo-Croatian Kordić’s book Jezik i nacjonalizam [Language and Nationalism] is a study of lan- guage politics or political sociolinguistics. Language being such a burning political issue in Yugoslavia after the adoption in 1974 of a truly federal constitution. In her extensive monograph, written in Croatian (or Latin script-based Croato-Serbian?), Kordić usefully summarizes today’s state of the linguistic and popular discourse on language and nationalism as it obtains in Croatia, amplified with some comparative examples drawn from Bosnia, Montenegro and Serbia. These four out of the seven post-Yugoslav states (the other three being Kosovo, Macedonia and Slovenia) parti- tioned among themselves Yugoslavia’s main official language, Serbo-Croatian (or, in the intra-Yugoslav parlance, ‘Serbo-Croatian or Croato-Serbian’), thus reinventing it anew as the four separate national languages of Bosnian, Croatian, Montenegrin and Serbian. The first two are written in the Latin alphabet; Montenegrin is written both in this alphabet and in Cyrillic; Serbian is officially in Cyrillic, but is in practice also written in Latin characters. The monograph is divided into three parts. The first and shortest one, Linguis- tic Purism (Jezični purizam), sets out the theoretical (and also ideological) position adopted by Kordić. Building on this theoretical framework, she conducts her anal- ysis and discussion in the two further sections, The Pluricentric Standard Language (Policentrični standardni jezik) and the final more far-ranging one, Nation, Identity, Culture and History (Nacija, identitet, kultura, povijest). -
Language of the Old Testament: Biblical Hebrew “The Holy Tongue”
E-ISSN 2281-4612 Academic Journal of Interdisciplinary Studies Vol 4 No 1 ISSN 2281-3993 MCSER Publishing, Rome-Italy March 2015 Language of the Old Testament: Biblical Hebrew “The Holy Tongue” Associate Professor Luke Emeka Ugwueye Department of Religion & Human Relations, Faculty of Arts, Nnamdi Azikiwe University, PMB 5025, Awka- Anambra State, Nigeria Email: [email protected] phone - 08067674763 Doi:10.5901/ajis.2015.v4n1p129 Abstract Some kind of familiarity with the structure and thought pattern of biblical Hebrew language enhances translation and improved ways of working with the language needed by students of Old Testament. That what the authors of the Scripture say also has meaning for us today is not in doubt but they did not express themselves primarily for us or in our language, and so it requires training on our part to understand them in their own language. The features of biblical Hebrew as combined in the language’s use of imagery and picturesque description of things are of huge assistance in this training exercise for a better operational knowledge of the language and meaning of Hebrew Scripture. Keywords: Language, Old Testament, Biblical Hebrew, Holy Tongue 1. Introduction Hebrew language is the language of the culture, religion and civilization of the Jewish people since ancient times. It belongs to the northwest ancient Semitic family of languages. The word Semitic, according to Kitchen (1992) is formed from the name Shem, Noah’s eldest son (Genesis 5:32). It is an adjective derived from ‘Shem’ meaning a member of any of the group of people speaking Akkadian, Phoenician, Punic, Aramaic, and especially Hebrew, Modern Hebrew and Arabic language. -
A Comprehensive NLP System for Modern Standard Arabic and Modern Hebrew
A Comprehensive NLP System for Modern Standard Arabic and Modern Hebrew Morphological analysis, lemmatization, vocalization, disambiguation and text-to-speech Dror Kamir Naama Soreq Yoni Neeman Melingo Ltd. Melingo Ltd. Melingo Ltd. 16 Totseret Haaretz st. 16 Totseret Haaretz st. 16 Totseret Haaretz st. Tel-Aviv, Israel Tel-Aviv, Israel Tel-Aviv, Israel [email protected] [email protected] [email protected] Abstract 1 Introduction This paper presents a comprehensive NLP sys- 1.1 The common Semitic basis from an NLP tem by Melingo that has been recently developed standpoint for Arabic, based on MorfixTM – an operational formerly developed highly successful comprehen- Modern Standard Arabic (MSA) and Modern Hebrew (MH) share the basic Semitic traits: rich sive Hebrew NLP system. morphology, based on consonantal roots (Jiðr / The system discussed includes modules for Šoreš)1, which depends on vowel changes and in morphological analysis, context sensitive lemmati- some cases consonantal insertions and deletions to zation, vocalization, text-to-phoneme conversion, create inflections and derivations.2 and syntactic-analysis-based prosody (intonation) For example, in MSA: the consonantal root model. It is employed in applications such as full /ktb/ combined with the vocalic pattern CaCaCa text search, information retrieval, text categoriza- derives the verb kataba ‘to write’. This derivation tion, textual data mining, online contextual dic- is further inflected into forms that indicate seman- tionaries, filtering, and text-to-speech applications tic features, such as number, gender, tense etc.: katab-tu ‘I wrote’, katab-ta ‘you (sing. masc.) in the fields of telephony and accessibility and wrote’, katab-ti ‘you (sing. fem.) wrote, ?a-ktubu could serve as a handy accessory for non-fluent ‘I write/will write’, etc. -
The Hebrew-Jewish Disconnection
Bridgewater State University Virtual Commons - Bridgewater State University Master’s Theses and Projects College of Graduate Studies 5-2016 The eH brew-Jewish Disconnection Jacey Peers Follow this and additional works at: http://vc.bridgew.edu/theses Part of the Reading and Language Commons Recommended Citation Peers, Jacey. (2016). The eH brew-Jewish Disconnection. In BSU Master’s Theses and Projects. Item 32. Available at http://vc.bridgew.edu/theses/32 Copyright © 2016 Jacey Peers This item is available as part of Virtual Commons, the open-access institutional repository of Bridgewater State University, Bridgewater, Massachusetts. THE HEBREW-JEWISH DISCONNECTION Submitted by Jacey Peers Department of Graduate Studies In partial fulfillment of the requirements For the Degree of Master of Arts in Teaching English to Speakers of Other Languages Bridgewater State University Spring 2016 Content and Style Approved By: ___________________________________________ _______________ Dr. Joyce Rain Anderson, Chair of Thesis Committee Date ___________________________________________ _______________ Dr. Anne Doyle, Committee Member Date ___________________________________________ _______________ Dr. Julia (Yulia) Stakhnevich, Committee Member Date 1 Acknowledgements I would like to thank my mom for her support throughout all of my academic endeavors; even when she was only half listening, she was always there for me. I truly could not have done any of this without you. To my dad, who converted to Judaism at 56, thank you for showing me that being Jewish is more than having a certain blood that runs through your veins, and that there is hope for me to feel like I belong in the community I was born into, but have always felt next to. -
The Production of Lexical Tone in Croatian
The production of lexical tone in Croatian Inauguraldissertation zur Erlangung des Grades eines Doktors der Philosophie im Fachbereich Sprach- und Kulturwissenschaften der Johann Wolfgang Goethe-Universität zu Frankfurt am Main vorgelegt von Jevgenij Zintchenko Jurlina aus Kiew 2018 (Einreichungsjahr) 2019 (Erscheinungsjahr) 1. Gutacher: Prof. Dr. Henning Reetz 2. Gutachter: Prof. Dr. Sven Grawunder Tag der mündlichen Prüfung: 01.11.2018 ABSTRACT Jevgenij Zintchenko Jurlina: The production of lexical tone in Croatian (Under the direction of Prof. Dr. Henning Reetz and Prof. Dr. Sven Grawunder) This dissertation is an investigation of pitch accent, or lexical tone, in standard Croatian. The first chapter presents an in-depth overview of the history of the Croatian language, its relationship to Serbo-Croatian, its dialect groups and pronunciation variants, and general phonology. The second chapter explains the difference between various types of prosodic prominence and describes systems of pitch accent in various languages from different parts of the world: Yucatec Maya, Lithuanian and Limburgian. Following is a detailed account of the history of tone in Serbo-Croatian and Croatian, the specifics of its tonal system, intonational phonology and finally, a review of the most prominent phonetic investigations of tone in that language. The focal point of this dissertation is a production experiment, in which ten native speakers of Croatian from the region of Slavonia were recorded. The material recorded included a diverse selection of monosyllabic, bisyllabic, trisyllabic and quadrisyllabic words, containing all four accents of standard Croatian: short falling, long falling, short rising and long rising. Each target word was spoken in initial, medial and final positions of natural Croatian sentences. -
An Introduction to the Relevance of and a Methodology for a Study of the Proper Names of the Book of Mormon
An Introduction to the Relevance of and a Methodology for a Study of the Proper Names of the Book of Mormon Paul Y. Hoskisson Since the appearance of the Book of Mormon in 1830, its proper names have been discussed in diverse articles and books.1 Most of the statements proffer etymologies, while a few suggest the signicance of various names. Because of the uneven quality of these statements this paper proposes an apposite methodology. First, though, a few words need to be said about the relevance of name studies to our understanding of the Book of Mormon. Relevance With the exception of a few modern proper names coined for their composite sounds,2 all names have meanings in their language of origin. People are often not aware of these meanings because the name has a private interpretation, or the name has been borrowed into a language in which the original meaning is no longer evident, or the name is very old and the meaning has not been transmitted. For example, the English personal name Wayne is an old form of the more modern English word wain, meaning a “wagon” or “cart,” hence the surname Wainwright, “builder/repairer of “3 However, to our contemporary ears Wayne no longer has a meaning; it is simply a personal name. With training and experience, it is often possible to dene the language of origin, the meaning, and, when applicable, the grammatical form of a name. Names like Karen, Tony, and Sasha (also written Sacha from the French spelling) have been borrowed into English from Danish,4 Italian,5 and Russian6 respectively. -
Classical and Modern Standard Arabic Marijn Van Putten University of Leiden
Chapter 3 Classical and Modern Standard Arabic Marijn van Putten University of Leiden The highly archaic Classical Arabic language and its modern iteration Modern Standard Arabic must to a large extent be seen as highly artificial archaizing reg- isters that are the High variety of a diglossic situation. The contact phenomena found in Classical Arabic and Modern Standard Arabic are therefore often the re- sult of imposition. Cases of borrowing are significantly rarer, and mainly found in the lexical sphere of the language. 1 Current state and historical development Classical Arabic (CA) is the highly archaic variety of Arabic that, after its cod- ification by the Arab Grammarians around the beginning of the ninth century, becomes the most dominant written register of Arabic. While forms of Middle Arabic, a style somewhat intermediate between CA and spoken dialects, gain some traction in the Middle Ages, CA remains the most important written regis- ter for official, religious and scientific purposes. From the moment of CA’s rise to dominance as a written language, the whole of the Arabic-speaking world can be thought of as having transitioned into a state of diglossia (Ferguson 1959; 1996), where CA takes up the High register and the spoken dialects the Low register.1 Representation in writing of these spoken dia- lects is (almost) completely absent in the written record for much of the Middle Ages. Eventually, CA came to be largely replaced for administrative purposes by Ottoman Turkish, and at the beginning of the nineteenth century, it was function- ally limited to religious domains (Glaß 2011: 836). -
Arabic and Contact-Induced Change Christopher Lucas, Stefano Manfredi
Arabic and Contact-Induced Change Christopher Lucas, Stefano Manfredi To cite this version: Christopher Lucas, Stefano Manfredi. Arabic and Contact-Induced Change. 2020. halshs-03094950 HAL Id: halshs-03094950 https://halshs.archives-ouvertes.fr/halshs-03094950 Submitted on 15 Jan 2021 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Arabic and contact-induced change Edited by Christopher Lucas Stefano Manfredi language Contact and Multilingualism 1 science press Contact and Multilingualism Editors: Isabelle Léglise (CNRS SeDyL), Stefano Manfredi (CNRS SeDyL) In this series: 1. Lucas, Christopher & Stefano Manfredi (eds.). Arabic and contact-induced change. Arabic and contact-induced change Edited by Christopher Lucas Stefano Manfredi language science press Lucas, Christopher & Stefano Manfredi (eds.). 2020. Arabic and contact-induced change (Contact and Multilingualism 1). Berlin: Language Science Press. This title can be downloaded at: http://langsci-press.org/catalog/book/235 © 2020, the authors Published under the Creative Commons Attribution -
Language, Ideology and Politics in Croatia
Language, Ideology and Politics in Croatia M at e k a p o v i ć University of Zagreb, Department of Linguistics, Faculty of Humanities and Social Sciences, Ivana Lučića 3, HR – 10 000 Zagreb, [email protected] SCN IV/2 [2011], 45–56 Izhajajoč deloma iz osnovnih tez svoje pred kratkim izšle knjige Čiji je jezik (Čigav je jezik?) avtor podaja pregled zapletenega odnosa med jezikom, ideologijo in politiko na Hrvaškem v preteklih dveh desetletjih, vključno z novimi primeri in razčlembami. Razprava se osredotoča na vprašanja, povezana s Hrvaško, ki so lahko zanimiva za tuje slaviste in jezikoslovce, medtem ko se knjiga (v hrvaščini) ukvarja s problemi jezika, politike, ideologije in družbenega jeziko- slovja na splošno. Based in part on his recent book Čiji je jezik? (Who does Language Belong to?), the author reviews the intricate relation of language, ideology, and politics in Croatia in the last 20 years, including new examples and analyses. The article emphasizes problems related to Croatia specifically, which might be of interest to foreign Slavists and linguists, while the monograph (in Croatian) deals with the prob- lems of language, society, politics, ideology, and sociolinguistics in general. Ključne besede: jezikovna politika, jezikovno načrtovanje, purizem, hrvaški jezik, jezik v nekdanji Jugoslaviji Key words: language politics, language planning, purism, Croatian language, language in former Yugoslavia Introduction1 The aim of this article is to provide a general and brief overview of some problems concerning the intricate relation of language, ideology, and politics in Croatia in the last 20 years. The bulk of the article consists of some of the 1 I would like to thank Marko Kapović for reading the first draft of the article carefully. -
Reading Biblical Aramaic
TITLE: BS400 READING BIBLICAL ARAMAIC WORKLOAD: Duration: 4 Terms Teaching (1 hr p.w.): 26 hrs Language exercises and set reading: 30 hrs Text preparation: 40 hrs Examination preparation and writing: 24 hrs Total commitment: 120 hrs STATUS: Elective CO/PREREQUISITE(S): At least 60% in BS300 Hebrew 3, or at least 70% in BS200 Hebrew 2 if Hebrew 3 is taken concurrently. GENERAL AIM: This unit aims to build on students’ knowledge of Biblcal Hebrew by imparting a working knowledge of biblical Aramaic, enabling them to read all the passages in the Old Testament where Aramaic is employed. The unit focuses on building a comprehensive vocabulary of biblical Aramaic, understanding its grammar and syntax, and applying this knowledge to the translation of the relevant texts in Daniel and Ezra. The purpose of this unit is not only to enable students to work from the original language in all parts of the Old Testament, but also to lay a linguistic foundation for any subsequent study involving Aramaic texts. LEARNING OUTCOMES: At the end of this course students should be able to: • Recognise and distinguish biblical Aramaic from biblical Hebrew; • Translate all biblical Aramaic texts into good, accurate English; • Parse and reproduce regular forms of the biblical Aramaic verbal system; • Analyse and explain most grammatical and phrase-level syntactical constructions. CONTENT: Part 1: Biblical Aramaic Grammar • Introduction to biblical Aramaic: occurrences; name; relationship to Hebrew; vocalization and phonology • Aramaic Nouns, pronouns, and adjectives • Prepositions and conjunctions • The Aramaic verbal system & conjugations • Strong verbs • Weak verbs • Syntax • Numerals Part 2: Reading of set texts • Reading of Daniel 2 – 7 • Reading of Ezra 4 – 7 TEACHING AND Classroom teaching and tutorials; LEARNING METHODS: Private preparation of homework assignments, reviewed in class; Class tests and oral participation; Strong emphasis on peer learning and collaborative group work.