Computational Analysis of Morphosyntactic Categories in Georgian

Total Page:16

File Type:pdf, Size:1020Kb

Computational Analysis of Morphosyntactic Categories in Georgian Computational Analysis of Morphosyntactic Categories in Georgian Sophiko Daraselia Submitted in accordance with the requirements for the degree of Doctor of Philosophy University of Leeds School of Languages, Cultures and Societies April 2019 The candidate confirms that the work submitted is her own and that appropriate credit has been given where reference has been made to the work of others. This copy has been supplied on the understanding that it is copyright material and that no quotation from the thesis may be published without proper acknowledgement. The right of Sophiko Daraselia to be identified as Author of this work has been asserted by her in accordance with the Copyright, Designs and Patents Act 1988. ©2019 The University of Leeds and Sophiko Daraselia i Acknowledgements I would like to thank the International Educational Center (IEC) for giving me the opportunity of a lifetime by fully funding my PhD program at the University of Leeds. Without this financial support, this would not have been possible. Of course, none of these would have been possible without my supervisory team, which has included Dr Serge Sharoff, Dr Diane Nelson and Dr Andrew Hardie. I would like to thank Dr Serge Sharoff, my first supervisor. I am very grateful to both him and my second supervisor, Dr Diane Nelson, for giving me their guidance throughout the years on my PhD project. Most importantly, they have always been supportive in my academic development beyond my PhD research. They have helped and supported me to lead the Kartvelian and Caucasian Studies research satellite at Language@Leeds. They have helped me in organizing the international annual workshop on Computational approaches to Morphologically Rich Languages (CAMRL2017 and CAMRL2018) at the University of Leeds. I cannot thank enough and express how grateful I am to Dr Andrew Hardie, my external supervisor from Lancaster University. He has been so helpful, and I feel I have learnt a great deal from him. With Dr Hardie’s supervision, I have managed to design a tagset for Georgian (KATAG), which is one of the main contributions in my PhD project. I am very lucky to have my husband Thomas James Blackwell for supporting during my PhD journey. My loving thanks go also to each of the members of Thomas’s family, who always made me feel very special. ii Many thanks go to my family in Georgia, especially to my mother Nona Berishvili, who has always been there for me, showing her love and support every single day throughout the whole PhD project. Alongside my PhD project, I was involved in a number of interesting projects with my Georgian friends and colleagues. I would like to thank Professor Tinatin Margalitadze, Professor Marine Beridze, Professor Nino Sharashenidze, Eto Churadze and Dr Rusudan Gersamia for their friendship, kindness and cooperation. I also would like to thank the University of Leeds for accepting my PhD project and providing an incredibly professional and supportive research environment throughout my PhD study. I would like to thank my friends in Leeds with whom I shared the PhD experience and their love and support have given me strength during hard times, especially Faye Bennet, Polly Gallis, Aishah Mubaraki and Shifa Al Askari. iii Abstract This thesis describes the development of part-of-speech tagging resources for the Georgian language, consisting of i.) a new morphosyntactic language model for part- of-speech (POS) tagging purposes; ii.) tagging guidelines for tagging and post-editing; iii.) the KATAG tagset and iv.) the trained parameter files the probabilistic TreeTagger program needs to work on Georgian texts. A new morphosyntactic model of Georgian for part-of-speech tagging purposes is described in the thesis. The thesis also describes a tagset (KATAG) defined in accordance with a new morphosyntactic model of the language and a set of design principles and tagging guidelines. A stochastic methodology is used here to perform tagging in Georgian. Namely, the Treetagger - a probabilistic part-of-speech tagging program has been trained on Georgian texts. The justification for this choice is discussed. I use two tokenisation approaches in part-of-speech tagging. An accuracy of 92.41% using an enclitic tokenisation approach and accuracy of 87.13% was achieved using a non-enclitic tokenisation approach, corroborating my hypothesis that treating enclitic elements separately from the host words results in better tagging performance. To make the tagger program easily adaptable for a range of inputs (type, variety or genre of text), the performance of the probabilistic TreeTagger program was evaluated according to the obtained test set consisting of five different genres such as academic, informal, legal, fiction and news. iv Table of Contents Acknowledgements .................................................................................................... ii Abstract ..................................................................................................................... iv List of Tables ............................................................................................................ ix List of Figures ......................................................................................................... xiii Abbreviation ........................................................................................................... xiv Chapter 1 ................................................................................................................... 1 Introduction ............................................................................................................... 1 1.1 Motivation .................................................................................................... 1 1.2 Aims and objectives ..................................................................................... 3 1.3 Research questions ....................................................................................... 3 1.4 Thesis outline ............................................................................................... 7 Chapter 2 Background issues to the tagging of Georgian ..................................... 9 2.1 The Georgian Language ............................................................................... 9 2.1.1 A brief overview of the structure of Georgian ................................ 11 2.2 Previous work on corpus annotation in Georgian ...................................... 23 2.2.1 The KaWaC Corpus......................................................................... 23 2.2.2 A parser for Georgian ...................................................................... 25 2.2.3 Georgian morphological analyser .................................................... 27 2.2.4 Morphological Analyzer and Generator for Georgian .................... 28 2.3 Concluding Remarks .................................................................................. 29 Chapter 3 Design principles of the Georgian Tagset ........................................... 31 3.1 What is part-of-speech tagging? ................................................................ 31 3.2 Previous work on English part-of-speech tagsets ...................................... 32 3.3 Design principles for a Georgian tagset ..................................................... 35 3.3.1 Information to include ..................................................................... 36 3.3.2 Hierarchy and decomposability ....................................................... 37 3.3.3 Tokenisation .................................................................................... 39 3.3.4 Disambiguation ................................................................................ 40 3.3.5 The tagset’s appearance ................................................................... 42 Chapter 4 Specification of the Tagset for Georgian ............................................ 44 4.1 Noun (arsebiti saxeli) ................................................................................ 44 4.1.1 Number ............................................................................................ 45 4.1.2 Case System in Modern Georgian ................................................... 47 v 4.1.3 Tags for Nouns ................................................................................ 58 4.2 Adjectives (zedsartavi saxeli) .................................................................... 59 4.2.1 Tags for Adjectives .......................................................................... 63 4.3 Pronouns (nacvalsaxeli) ............................................................................. 65 4.3.1 Tags for Pronouns ............................................................................ 77 4.4 Numerals (ricxviti saxeli) ........................................................................... 83 4.4.1 Tags for numerals ............................................................................ 85 4.5 Adverbs (zmnizeda or zmnisarti) ............................................................... 87 4.5.1 Tags for Adverbs ............................................................................. 93 4.6 Conjunctions (k'avširi) ............................................................................... 94 4.6.1 Coordinating Conjunctions .............................................................. 94 4.6.2 Subordinating Conjunctions ............................................................ 96 4.6.3 Tags for Conjunctions
Recommended publications
  • Khevsur and Tush and the Status of Unusual Phenomena in Corpora Author(S): Thomas R
    Khevsur and Tush and the status of unusual phenomena in corpora Author(s): Thomas R. Wier Proceedings of the 37th Annual Meeting of the Berkeley Linguistics Society: Special Session on Languages of the Caucasus (2013), pp. 96-110 Editors: Chundra Cathcart, Shinae Kang, and Clare S. Sandy Please contact BLS regarding any further use of this work. BLS retains copyright for both print and screen forms of the publication. BLS may be contacted via http://linguistics.berkeley.edu/bls/. The Annual Proceedings of the Berkeley Linguistics Society is published online via eLanguage, the Linguistic Society of America's digital publishing platform. Khevsur and Tush and the Status of Unusual Phenomena in Corpora THOMAS R. WIER University of Chicago Introduction Recent years have seen an increasing realization of the threat posed by language loss where, according to some estimates, upwards of ninety percent of all lan- guages may go extinct within the next century (Nettle & Romaine 2002). What is less often realized, much less discussed, is the extent to which linguistic diversity that falls within the threshold of mutual intelligibility is also diminishing. This is especially true of regions where one particular language variety is both widely spoken and holds especially high prestige across many different social classes and communities. In this paper, we will examine two such dialects of Georgian: Khevsur and Tush, and investigate what corpora-based dialectology can tell us about phylogenetic and typological rarities found in such language varieties. 1 Ethnolinguistic Background Spoken high in the eastern Caucasus mountains along the border with Chechnya and Ingushetia inside the Russian Federation, for many centuries, Khevsur and Tush have been highly divergent dialects of Georgian, perhaps separate lan- guages, bearing a relationship to literary Georgian not unlike that of Swiss German and Hochdeutsch (see map, from Hewitt 1995:vi).
    [Show full text]
  • Lexicographic Potential of the Georgian Dialect Corpus
    LexicographicLexicographic Potential Potential of ofthe the Georgian Georgian Dialect Dialect Corpus Corpus MarinaMarina Beridze, Beridze, David David Nadaraia Nadaraia Ivane Javakhishvili Tbilisi State University, Arn. Chikobava Institute of Linguistics e-mail: [email protected] Abstract The project Linguistic Portrait of Georgia envisages various aspects of documentation of Georgian linguistic reality by means of corpus methodologies. This title is an umbrella for three large-scale projects within the framework of which The Georgian Dialect Corpus – GDC (http://corpora.co) was developed. Presently, the architecture and text base of the corpus have been designed, being permanently developed and updated. Besides, the lexicographic base of the corpus is organized, agglomerating data from printed dialect dictionaries. The lexical stock of the corpus is presented based on text, lexicographic and encyclopaedic data. The total quantity of tokens in the corpus is estimated to be up to 2 000 000, while the lexicographic base has 60 000 items (lemmas with entries) by now; this quantity is considerably increased owing to phonetic and grammatical variations, frequently associated with a single lexical item. Keywords: Georgian Dialect Corpus; lexicographic base; encyclopaedic data 1 Georgian Dialect Corpus: General Characteristics The Georgian Dialect Corpus (GDC) was developed as a means for documentation and study of Georgian linguistic diversity (Beridze, Nadaraia 2009: 25; Beridze, Lordkipanidze, Nadaraia: 2015-1, 323). Consisting of several components, this is a reference resource, which, with respect to its interdisciplinary objectives, considerably differs from other similar corpora. The principal difference is that GDC has corpus and library modes of text access; it integrates the lexicographic base, a prospective universal lexicographic base of the Georgian language.
    [Show full text]
  • Shota Rustaveli Theatre and Film Georgia State University Faculty of Art Sciences, Media and Management Khatuna Damchidze Tbilis
    Shota Rustaveli Theatre and Film Georgia State University Faculty of Art Sciences, Media and Management Khatuna Damchidze Tbilisi 0108 Georgia Dance Dialects of West Georgia (Abkhazian, Acharian, Laz-Shavshetian, Megrelian, Rachan) and Main Ethnocoreological Aspects of Their Interrelation Abstract of the thesis work for the Academic degree Dr. of Arts (Phd) Scientific supervisor: Dr. of Arts Ana Samsonadze Tbilisi 2019 1 GENERAL DESCRIPTION OF THE WORK Actuality of the themes: Today, when the world is overwhelmed by the irreversible process of globalization, when the difference between the nations is being eliminated, the problem of maintaining originality is fairly acute. This problem is more obvious on the example of little countries such as Georgia. Had it not been the cultural heritage (material and intangible) from ancient times to this day, Georgia would not have occupied the place it holds now, in the world culture. As an object of cultural heritage, Georgian national choreography holds a special place and is of particular importance. Originality of Georgian folk dance is manifested in its ethnic variety which, on the one hand, exists as absolutely different dance tradition and on the other hand, as part of common Georgian folklore. Although certain number of dialects, from the standpoint of dance lexicon has disappeared (Imeretian, Lechkhumian); diversity of dance dialects is observed on a geographically small territory of West Georgia West Georgian circle of dialects comprises Svan, Rachan, Abkhazian, Megrelian, Gurian and Laz folk choreography1. One of the most important issues of choreology to be researched today is ascertainment and classification of separate dance dialects and elucidation of their interrelations.
    [Show full text]
  • |Ohn Benjamins Publishing Company
    |ohn Benjamins Publishing Company This is a contribution {rom h{ood in the Languages of Eurt:pe. Edited by Björn Rothstein and Rolf Thieroff O 2010. )ohn Benjamins Publishing Cornpany This electronic hle may not be altered in any rval'. The author(s) of this article islare permitted to use this PDF file to generate printed copies to be used by rvay of ofprints, for their personal use only. Permission is granted bv the publishers to post this file on a closed server rvhich is accessible to members (str.rdents aird staff) orrly of the author's/s'institute, it is not permitted to post this PDF on the open internet. For any other use of this material prior ivritten pernrission s}rould be obtained fr:om the publishers or througlt the Copy.right Cleararrce Center (Ibr USA: wnrt-.copyT ight.corn). Please contact [email protected] or consuit our rvebsite: wwir'.beniamins.com Tables of Contents, ahstracts and guidelirles are available at rmv'.benjamins.com Mood 1n Modern Georgian* Winfried Boeder Oldenburg r. Introduction Ceor:gian is one of the Karl-r,elian (South Caucasian) langtiages which ar:e spoken in a relatively compact area to tl're south of the Caucasus ridge rvhere (as tär as lve knou') the,v have alwa,ys been in conlact lvith each other: Georgian in the easl, Megrelian (or Min- greiian) in the rvest and Svan in the north-western mountains of Georgia; Laz is mainly spoken in an area ad;'acent to the Black Sea between'liabzon and Baturni. Only Georgian has a long-standing 1500 year old rvritten tradition.
    [Show full text]
  • Batumi Linguocultural Digital Archive (Contemporary Technological Achievements for the Database Arrangement of the Folklore Resources)
    M.Tandaschwili, R. Khalvashi, Kh. Beridze, M. Khakhutaishvili, N. Tsetskhladze, Batumi # 10. 2010 Linguocultural Digital Archive pp. 52-68 Manana Tandaschwili Frankfurt Goethe University R. Khalvashi, Kh. Beridze, M. Khakhutaishvili, N. Tsetskhladze Batumi State University Batumi Linguocultural Digital Archive (Contemporary Technological Achievements for the Database Arrangement of the Folklore Resources) Abstract The evolution of the new forms of the scientific communication and development of the web technologies and global networking gave the scholars an excellent opportunity to rapidly and effectively use academic digital resources, the number of which is constantly increasing. Establishment of the OR (Open-Resource) and introduction of the RE (Resource Exchange) supported development of the infrastructure for the digital archives. That, in its own right, became a fast and efficient instrument for the use of scholarly resources. It has essentially changed the research procedures in the 21st century. The researchers now are able to make use of the ‘open and merge’ approach to their resources. Creation of the global library has become a new opportunity of the international scholarly communication. The joint scientific project Batumi Linguocultural Digital Archieve (BaLDAR) implemented jointly by Batumi Shota Rustaveli State University and Goethe Frankfurt University is sponsored by the Shota Rustaveli National Science Foundation. The project is a result of the international cooperation and aims to introduce new forms of the scientific communication, which will support multidisciplinary research development. The paper studies the significance of the establishment digital archives in Georgia. It outlines the themes of the resources that have been developed within the project framework. Key words: web technologies; Linguoculturalizm; Digital Archieve.
    [Show full text]
  • Deponent Verbs in Georgian Kevin Tuite (Université De Montréal)
    Deponent verbs in Georgian Kevin Tuite (Université de Montréal) 0. Introduction. In the summer of 2000, while on a research trip to Georgia, I came across the following cartoon in a Tbilisi newspaper. Here is the text with a translation: Waiter: supši rat’om ipurtxebit? (Why do you keep spitting in the soup?) Customer: vsinǰav, cxelia tu ara. (I’m testing if it’s hot or not.) Waiter: ??? Customer: čemi coli q’oveltvis egre amoc’mebs utos. (My wife always checks the iron like this.) Leaving aside the political incorrectness — on several levels — of the content of the cartoon, let us make use of it as a source of linguistic data. The verb in the first line, i-purtx-eb-i-t “you [pl/polite] spit”, is formally in the passive voice; its 3rd-person subject form would be i- purtx-eb-a. Its morphology contrasts with that of the transitive a-purtx- eb-s in exactly the same way as, say, the passive (k’ari) i-ɣ-eb-a “(the door) is opened” is opposed to the active (k’ars) a-ɣ-eb-s “s/he opens (the door)”. If the meaning of a-purtx-eb-s is “s/he spits”, one would expect the first line of the above dialogue to mean something along the lines of “Why are you being spit into the soup?”, which is manifestly not the case. The Explanatory Dictionary of the Georgian Language (KEGL) glosses ipurtxeba “spits continually, sprays spit all the time (from the mouth)” (erttavad apurtxebs, c’ara-mara purtxs isvris (p’iridan)); according to Tschenkéli’s dictionary, it means “(dauernd) spucken”.
    [Show full text]
  • Disentangling Word Stress and Phrasal Prosody: Evidence from Georgian
    Disentangling word stress and phrasal prosody: evidence from Georgian Lena Borise Abstract This paper investigates the interaction of word stress and phrasal prosody in Georgian (Kartvelian). Based on novel experimental evidence, I show that the two prosodic phenomena, word stress and phrasal prosodic targets, differ both in their location and acoustic means that their expression relies on. By establishing this, I illustrate the workings of the two separate prosodic modules, which have been lumped together in the literature, leading to contradictory descriptions of Georgian prosody. The results provide evidence in favor of fixed initial stress, cued by greater duration of the stressed syllable, as compared to subsequent ones. They also attest to the presence of a phrasal intonational F0 target on the penultimate syllable. Furthermore, I show that hexasyllabic words, in contrast to shorter ones, exhibit durational ‘rhythmicity’, with longer and shorter syllables alternating; this independent process, accordingly, obscures the prominence of word stress in such words. These results help account for the facts related to word stress, phrasal intonation, and their interplay in Georgian, the object of numerous debates in the literature. They also demonstrate that the effects of word-level and phrase-level prosody can be successfully teased apart, even in a language in which their interaction is rather complex. Key words: Georgian, word stress, phrase accent, F0 targets, phrasal prosody *** Introduction There are numerous languages in which word stress facts, and particularly the interaction of word stress with phrasal intonation and/or information structure, have not been settled and raise a number of questions. Does a language have word stress? What acoustic parameter 1 (syllable/vowel duration, F0, intensity) does its realization chiefly rely upon? How does its realization interact with the expression of phrasal intonation? Such questions abound in languages understudied from the point of view of their phonological and prosodic properties.
    [Show full text]
  • Split-Intransitive Languages and the Unaccusative Hypothesis
    31 DEFINING TRANSITIVITY AND INTRANSITIVITY: SPLIT-INTRANSITIVE LANGUAGES AND THE UNACCUSATIVE HYPOTHESIS ASIER ALCÁZAR University of Southern California, Los Angeles 0. Proposals 0.1 Basque is not an ergative language; it is split-intransitive. * Against the standard typological claim (contra Comrie 1981, Dixon 1994, Primus 1999) 0.2 Unergative verbs are not always intransitive (they can be transitive: e.g. Basque). * Against the claim in the Unaccusative Hypothesis (contra Perlmutter 1978, Burzio 1986) 0.3 Split-intransitivity signals that unergatives are transitive. * Novel claim: split-intransitivity has been otherwise considered an anomalous pattern. 1. INTRODUCTION: INTRANSITIVE VERBS WITH TRANSITIVE LOOKS (I.E. 1A). Basque is often presented as a splendid example of ergativity (Comrie 1981, Dixon 1994, Primus 1999 among many). Tradi- tionally, ergative languages differ from accusative languages in the verbal argument they mark. Erga- tive languages mark the subject of transitives (compare the girl in 2 with 1b) and accusative lan- guages mark the object of transitives (e.g. Latin/Romance; see the feminine singular pronoun in 4). However, the axioms in the typological classification of languages would by definition declare Basque to be of type split-intransitive (e.g. like Guarani, Gregorez and Suarez 1967 cfr. Primus 1999; or Slave, Rice 1991). For Basque has a class of intransitive verbs (1a) whose argument is marked exactly as the subject of transitives (2). Basque intransitives: (1) a. Neskatil-ak dei-tu du b. Neskatil-a ailega-tu da girl-Erg.Sg call-Per have.3Sg_3Sg girl-Abs.Sg arrive-Per be.3Sg ‘The girl called’ ‘The girl arrived’ Basque transitives: (2) Neskatil-ak izozki-a amai-tu du girl-Erg.Sg ice-cream-Abs.Sg finish-Per have.3Sg_3Sg ‘The girl finished her ice-cream’ Spanish intransitives: (3) a.
    [Show full text]
  • UC Berkeley Dissertations, Department of Linguistics
    UC Berkeley Dissertations, Department of Linguistics Title Constructional Morphology: The Georgian Version Permalink https://escholarship.org/uc/item/1b93p0xs Author Gurevich, Olga Publication Date 2006 eScholarship.org Powered by the California Digital Library University of California Constructional Morphology: The Georgian Version by Olga I Gurevich B.A. (University of Virginia) 2000 M.A. (University of California, Berkeley) 2002 A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Linguistics in the GRADUATE DIVISION of the UNIVERSITY OF CALIFORNIA, BERKELEY Committee in charge: Professor Eve E. Sweetser, Co-Chair Professor James P. Blevins, Co-Chair Professor Sharon Inkelas Professor Johanna Nichols Spring 2006 The dissertation of Olga I Gurevich is approved: Co-Chair Date Co-Chair Date Date Date University of California, Berkeley Spring 2006 Constructional Morphology: The Georgian Version Copyright 2006 by Olga I Gurevich 1 Abstract Constructional Morphology: The Georgian Version by Olga I Gurevich Doctor of Philosophy in Linguistics University of California, Berkeley Professor Eve E. Sweetser, Co-Chair, Professor James P. Blevins, Co-Chair Linguistic theories can be distinguished based on how they represent the construc- tion of linguistic structures. In \bottom-up" models, meaning is carried by small linguistic units, from which the meaning of larger structures is derived. By contrast, in \top-down" models the smallest units of form need not be individually meaningful; larger structures may determine their overall meaning and the selection of their parts. Many recent developments in psycholinguistics provide empirical support for the latter view. This study combines intuitions from Construction Grammar and Word-and-Para- digm morphology to develop the framework of Constructional Morphology.
    [Show full text]
  • Zerohack Zer0pwn Youranonnews Yevgeniy Anikin Yes Men
    Zerohack Zer0Pwn YourAnonNews Yevgeniy Anikin Yes Men YamaTough Xtreme x-Leader xenu xen0nymous www.oem.com.mx www.nytimes.com/pages/world/asia/index.html www.informador.com.mx www.futuregov.asia www.cronica.com.mx www.asiapacificsecuritymagazine.com Worm Wolfy Withdrawal* WillyFoReal Wikileaks IRC 88.80.16.13/9999 IRC Channel WikiLeaks WiiSpellWhy whitekidney Wells Fargo weed WallRoad w0rmware Vulnerability Vladislav Khorokhorin Visa Inc. Virus Virgin Islands "Viewpointe Archive Services, LLC" Versability Verizon Venezuela Vegas Vatican City USB US Trust US Bankcorp Uruguay Uran0n unusedcrayon United Kingdom UnicormCr3w unfittoprint unelected.org UndisclosedAnon Ukraine UGNazi ua_musti_1905 U.S. Bankcorp TYLER Turkey trosec113 Trojan Horse Trojan Trivette TriCk Tribalzer0 Transnistria transaction Traitor traffic court Tradecraft Trade Secrets "Total System Services, Inc." Topiary Top Secret Tom Stracener TibitXimer Thumb Drive Thomson Reuters TheWikiBoat thepeoplescause the_infecti0n The Unknowns The UnderTaker The Syrian electronic army The Jokerhack Thailand ThaCosmo th3j35t3r testeux1 TEST Telecomix TehWongZ Teddy Bigglesworth TeaMp0isoN TeamHav0k Team Ghost Shell Team Digi7al tdl4 taxes TARP tango down Tampa Tammy Shapiro Taiwan Tabu T0x1c t0wN T.A.R.P. Syrian Electronic Army syndiv Symantec Corporation Switzerland Swingers Club SWIFT Sweden Swan SwaggSec Swagg Security "SunGard Data Systems, Inc." Stuxnet Stringer Streamroller Stole* Sterlok SteelAnne st0rm SQLi Spyware Spying Spydevilz Spy Camera Sposed Spook Spoofing Splendide
    [Show full text]
  • Viacheslav A. Chirikba: Abkhaz
    Abkhaz is one of the three languages comprising the Abkhazo­ Adyghean, or West Caucasian branch of North Caucasian linguistic bkhaz family (the other branch being Nakh-Daghestanian, or East Caucasian). Abkhaz is spoken by approximately 100,000 people in the former Soviet Union (mainly in the Republic of Abkhazia, Caucasus), and by at least the same number of speakers in Turkey and some Middle east countries (small Abkhaz colonies can be found also In Western Europe and the USA). Abkhaz is notorious for ist huge consonantal inventory (up to 67 consonants in the Bzyp dialect) and by its minmal vocalic system: only 2 vowels. Though Abkhaz was studied by a number of scholars (e.g. P. Uslar in XIX century, or K. Lomtatidze Viacheslav A. Chirikba in Georgia and G Hewitt in Great Britain), many aspects of Abkhaz grammar (especially its syntax) still have to be adequately described. Abkhaz is the only West Caucasian language to possess the category of grammatical classes, manifested in personal pronouns, verb conjugation, numerals and in the category of number. Abkhaz is an ergative language, the ergative construction being represented not by case endings, as in related Circasslan and Ubykh (Abkhaz does not have a case system), but by the order of actant markers. The verbal root consists usually of one consonant, preceded by a string of prefixes (class-personal, directional, temporal, negational, causatival, etc.) and followed by few suffixes. Verbs can be stative or dynamic, finite or non-finite. The grammatical sketch of Abkhaz includes Information about its phonological system, morphology, and syntax. A short text Is provided with grammatical comments.
    [Show full text]
  • Expression of the Perfect Semantics in Georgian Verbs1
    Journal of Literature and Art Studies, December 2019, Vol. 9, No. 12, 1266-1268 doi: 10.17265/2159-5836/2019.12.006 D DAVID PUBLISHING Expression of the Perfect Semantics in Georgian Verbs1 Nino Bagration-Davitashvili Ivane Javakhishvili Tbilisi State University, Tbilisi, Georgia In Georgian the verbal form comprises the names expressing the subjective-objective references of certain quantity, which shows the representing signs to express the category of the subjective and objective person. However, sometimes the subjective person is expressed by the sign of the objective person or vice versa, which is called inversion in Georgian language. There are many options about the reason of inversion. Bipersonal static verbs are considered as the oldest verbs as believed by many scientists. These are the verbs of feeling and owning; they are inverse, bipersonal and hold the dative construction. We think that inversion cannot be only explained with the static character. There are the verbs, which are not static but are inverse. We believe that it must be one and the same aspect in every case: perfect semantics. In Indo-European languages, the verbs of the perfect semantics have been converted from the class of the lexical words into the grammar perfect. When the Georgian language encountered the need to express the perfect forms respective to Indo-European languages, thus, the third series has been generated and it used the form as the model, belonging to the verbs of bipersonal non-transitional “perfect semantics” in present. This form has been supplemented by preverb and has become dynamic and gradually acquired various semantic nuances characterizing the perfect tense: resultativity, will, evidentiality, interrogative and negative, etc.
    [Show full text]