Revisiting Kurdish Dialect Geography: Preliminary Findings from the Manchester Database

Revisiting Kurdish dialect geography: Preliminary findings from the Manchester Database June 2017 Yaron Matras University of Manchester Introduction: Database method and Persian). Bilingual speakers were asked to translate the phrases into their local Kurdish dialect. Sessions were and scope recorded and transcribed into templates in which each phrase was pre-tagged for anticipated structures. The data My aim in this paper is to describe preliminary findings was imported into an open-source database (utilising from work carried out between 2011-2017 as part of a MySQL and PHP web interface software), which was made collaborative project on ‘Structural and typological accessible online. It allows the user to filter transcribed variation in the dialects of Kurdish’, based at the phrases by content (Kurdish forms), English elicitation University of Manchester. The project’s objectives were to phrase, tags, and speaker’s place of origin. create a reference database covering the main areas in A pilot questionnaire was tested in 2011-2012. It which dialects of Kurdish are spoken, to assess typological contained around 200 items, of which around half were variation (with particular consideration to possible individual lexemes and function words. The items had been contact influences), and to investigate the role of verb selected based on an assessment of structural variation in semantics in the volatility of the ergative construction in samples of connected speech from around 50 recorded Kurmanji/Bahdini. This paper presents initial findings interviews of up to 40 minutes each with speakers from pertaining to the distribution of structural features, dialect various locations in Turkey, Iraq, and Iran, and based on geography, and dialect classification. variation documented in existing literature, especially The project’s data elicitation method was inspired by that Mackenzie (1961a) but also descriptions of individual used between 2001-2006 to create the Romani Morpho- Kurdish varieties. Elicitation for the pilot was carried out Syntax (RMS) database (Matras & Elšík 2008, Matras, Elšík in a number of locations in the Kurdish speaking regions in & White 2009). A questionnaire was prepared in order to southeastern Turkey and northern Iraq and with recent capture salient variables in lexicon, phonology and émigrés in Western Europe. The questionnaire was then morpho-syntax. Items were translated into second extended in 2014. The new questionnaire has 300 items languages that are common in the region (Turkish, Arabic, and gives special consideration to possible semantic correlates of ergativity, capturing a scale of predicates and be taken to represent the free license to incorporate participant roles. The approach was inspired by findings on lexical items from the contact language in everyday speech correlates between ergativity, topicality and agentivity in in Kurdish. The same can be said for the occasional Kurmanji, as presented in Matras (1997) (see also Haig repetition, seemingly, of lexical verb forms from the 2008: 215ff.) and in theoretical perspective by Beaver’s elicitation source phrase, as in yaşamîş dibim ‘I (2011) semantic analysis of diagnostics for participant live’ (Turkish yaşamış) in several locations in Turkey affectedness. In addition to the questionnaires, speakers (among them İmranlı, Pertek, Karlıova, Suruç), ʕeyş dibim in were asked to provide a free speech sample, for which Kobane, Syria and maʕîşe dikim in Khanaqin, Iraq (both several standardised guideline questions were designed Arabic ʕīş), or zindigî ekem in Sahneh, Iran (Persian eliciting descriptions of village life, marriage customs, zendegī). The fact that the majority of participants – in the migration, or traditional tales. Free speech samples were case of this particular item, over 90% – opted for generally of 20-40 minutes in duration. translation equivalents that were not direct replications of the item used in the source, but of Iranian-Kurdish In order to facilitate data collection, project collaborators etymology – such as dimînim, dijîm, jiyan dekem etc. – trained fieldworkers in the region; these were recruited suggests that the responses containing a lexical loan among native speakers who are students of Kurdish reflect actual usage rather than the effect of convergence language and linguistics at universities in southeastern to the source language. In a small number of cases, some Turkey and northern Iraq. A protocol was applied by effect of the source language can be detected in the which fieldworkers contacted the project manager based organisation of complex clauses, though the questionnaire in Manchester with meta-data of proposed speaker is designed to control for such interference by including consultants and were then given authorisation to carry several sample sentences for each target construction. out recordings, which were archived. The recordings were then forwarded to specially trained native speaker transcription assistants. All questionnaire transcriptions underwent a systematic in-house control and correction Dialect geography and diffusion procedure by the project team. Sections of 5-7 minutes were selected from each free speech sample for centres transcription and translation; these transcriptions underwent two consecutive control processes. Until recently, Mackenzie’s (1961a) study of the Kurdish dialects of northern Iraq remained the principal reference Over 200 speaker consultants were recorded, in over 150 work on Kurdish dialect geography. Mackenzie’s survey locations. The sample shows a bias toward young, was limited in its geographical scope, covering only around educated males, reflecting in part the profile of the a dozen locations. Nevertheless, the spread of those fieldworkers and their access to speaker consultants. locations, on either side of the Zabb river, offered a more However, this bias has the advantage of limiting extra- or less equal level of attention to each of the two dialect linguistic variability to geographical location. Influence of groups which Mackenzie named ‘Central’ (Group 1) and the Standard language (either Kurmanji or Sorani) has ‘Northern’ (Group 2). As the most significant phonological been minimised thanks to the spontaneous elicitation difference between the two groups, Mackenzie (1961a: using a second language as source, but cannot be entirely 220-225) notes the shift of Old Iranian inter- and post- ruled out; however, the emerging geographical patterns of vocalic p and m to v in the Northern and w in the Central structural features offer evidence of the non-randomness group. The principal morphological isoglosses include the of speakers’ responses. The database, transcribed free use of enclitic pronominal forms, the presence of a speech samples with audio and translation, information on synthetic passive construction, the use of a definite article, tags/glosses and transliteration symbols, and information the presence of a general plural form, and the loss of case on speaker statistics can be found on the project website. distinction in pronouns in the Central but not in the On the whole, spontaneous, oral phrase translation has Northern group; and the presence of nominal case proven to be a reliable method of data elicitation, and marking, gender and number distinction in nominal convergence to the elicitation (source) language was not attributive endings (ezafe), and a future tense marker in found to be an interfering factor. The odd lexical loan from the Northern but not in the Central group. Mackenzie the contemporary contact language (for example, coz for also identifies isoglosses within the Central group. They ‘walnut’, from Arabic, in Sabahiya in Syria, rather than the include the replacement of ł by r and the retention of expected gwîz as recorded in neighbouring locations) can grammatical gender in Arbil, Xošnaw, and Rowanduz; the !2 use of the aspectual marker of the progressive-indicative Hakkari/Duhok (SEK) are a case in point, showing on the e- rather than d and of the demonstrative em in one hand more conservative features than the Kurmanji Suleimaniya and Warmawa; and some phonetic and dialects to the northwest, while on the other hand phonological specifics. In conclusion, Mackenzie (1961a: showing some influences from Sorani. 224) proposes a general division between Northern and Central dialects, and a sub-division of the latter between My approach in this paper is complementary to that dialects of the Soran-Arbil region to the north, said to be adopted by Öpengin & Haig (2014): I draw on data from more archaic, and those of the Suleimaniya-Halabja region the Manchester Database survey to reconstruct to the south. specifically layers of structural innovation and the extent of their diffusion in geographical space, returning then to This division of Kurdish into, essentially, three groups – the question of dialect classification by identifying zones Northern (Kurmanji/Bahdini), Central (Sorani), and that are the epicentres of such innovations. This approach Southern (the latter in the Kermanshah region of Iran), is based on the assumption that it is innovation that with a sub-division of the Central group – has since been creates differences among related varieties, and that followed broadly in Kurdish linguistics (cf. McCarus 2009). individual innovations differ in the extent of their Terminology remains, however, somewhat inconsistent, geographical spread, and so there are no pre-determined with the term ‘southern Kurdish’ sometimes used as dialect boundaries. Rather, the

Revisiting Kurdish Dialect Geography: Preliminary Findings from the Manchester Database

Language Specific Peculiarities Document for Kurmanji Kurdish As

Christians and Jews in Muslim Societies

Historical Linguistics and Cognitive Science

Contextualizing Historical Lexicology the State of the Art of Etymological Research Within Linguistics

Current Issues in Kurdish Linguistics Current Issues in Kurdish Linguistics 1 Bamberg Studies in Kurdish Linguistics Bamberg Studies in Kurdish Linguistics

Perceptual Dialectology and GIS in Kurdish 1

Lectures on English Lexicology

Kurdish Spoken Dialect Recogni

Cultural and Language Incentive Program – Bonus (CLIP-B) Program Information and Instructions

Map by Steve Huffman; Data from World Language Mapping System

The Importance of Morphology, Etymology, and Phonology

Variation in the Ergative Pattern of Kurmanji Songül Gündoğdu, Muş