GF Modern Greek Resource Grammar
Total Page:16
File Type:pdf, Size:1020Kb
GF Modern Greek Resource Grammar Ioanna Papadopoulou University of Gothenburg [email protected] Abstract whilst each of the syntactic parts of the sentence (subject, object, predicate) is a carrier of a certain The paper describes the Modern Greek (MG) case, a fact that allows various word order Grammar, implemented in Grammatical structures. In addition, the language presents a Framework (GF) as part of the Grammatical dynamic syllable stress, whereas its position Framework Resource Grammar Library depends and alternates according to the (RGL). GF is a special-purpose language for morphological variations. Moreover, MG is one multilingual grammar applications. The RGL 1 is a reusable library for dealing with the of the two Indo-European languages that retain a morphology and syntax of a growing number productive synthetic passive formation. In order of natural languages. It is based on the use of to realize passivization, verbs use a second set of an abstract syntax, which is common for all morphological features for each tense. languages, and different concrete syntaxes implemented in GF. Both GF itself and the 2 Grammatical Framework RGL are open-source. RGL currently covers more than 30 languages. MG is the 35th GF (Ranta, 2011) is a special purpose language that is available in the RGL. For the programming language for developing purpose of the implementation, a morphology- multilingual applications. It can be used for driven approach was used, meaning a bottom- building translation systems, multilingual web up method, starting from the formation of gadgets, natural language interfaces, dialogue words before moving to larger units systems and natural language resources. GF is (sentences). We discuss briefly the main capable of parsing and generating texts, while characteristics and grammatical features of working from a language-independent MG, and present some of the major difficulties we encountered during the process of representation of meaning. The GF Grammar is implementation and how these are handled in based on two different modules. An abstract the MG grammar. module provides category and function declarations, thus it constitutes a representation 1 Introduction of a set of possible trees that reflect the semantically relevant structure of a language, Greek is a member of the Indo-European family and one or more concrete modules that contain of languages and constitutes by itself a separate linearization type definitions and rules, therefore branch of that family. Modern Greek (MG) can managing to relate the tree structures with linear be easily traced back to Ancient Greek in the tree representations. The RGL contains the set of form of letters, word roots and structures, despite grammars of natural languages that are the fact that the language has undergone a series implemented in GF. The parallelism of the of transformations through the ages and has been grammars is inevitable, given that their a subject of considerable simplification. MG development is based on the same rules and th makes use of the Greek alphabet since the 8 functions that are defined in a common abstract century B.C. Today the language is spoken by syntax. At the moment RGL covers 34 approximately 13.1 million people worldwide. languages2 that originate not only from the Some of the general characteristics of MG refer European continent, but from all over the world. to the diversity of the morphology and the use of The common API defines around 60 hierarchical an extremely large number of morphological features in order to express grammatical 1 The other one being Albanian notations. Words are in their majority declinable, 2 http://www.grammaticalframework.org/lib/doc/status.html 126 Proceedings of the Student Research Workshop associated with RANLP 2013, pages 126–133, Hissar, Bulgaria, 9-11 September 2013. grammatical categories, and a large number of all the specific features which may affect their syntactic functions. MG constitutes the newest formation. The creation of nouns includes 17 addition to the RGL and its implementation distinct functions that are categorized depending consists of 28 concrete modules. on the noun ending, the stress movement, whether the noun is parisyllabic or 3 Morphology imparisyllabic, or whether the noun augments its syllables when inflected. These functions also Morphology constitutes the most important handle specific phenomena of the MG language, aspect of the Greek Language. The words are in such as the change of gender of a noun in the their majority declinable, produced via a plural form, or nouns that originate from Ancient combination of meta-linguistic elements, such as Greek, and are still used nowadays, retaining a stem and an ending. The endings are assigned intact their form and endings. Similarly 6 proportionally with the part of speech and the functions create adjectives, where we also type, and act as carriers of grammatical introduce the degree parameter that creates notations, indicating the gender, the number, the additional forms for all three adjective genders. case or the person, or in the case of verbs the The formation of the pronouns is of special tense, the mood, the voice and the aspect as well. interest, as MG makes use of two distinct types, Appendix A presents the parameter types and the emphatic and the weak. The weak form3 operations that are defined in the grammar. The occurs more often, whilst the use is always in implementation of the GF MG morphology close connection with verbs, nouns or adverbs. started from scratch. All declinable words Our grammar introduces both forms of the needed to undergo a first simplistic pronoun, but it also alternates between them categorization in order to create basic declension when the syntactic structure requires the use of a tables, before moving to sub-categorizations that particular form. Greek proper nouns follow all allowed us to treat the various irregularities that the declension patterns and irregularities of govern the morphological structure of MG. One common nouns morphology, meaning that they of the main aspects of MG is the presence of a are primarily inflected for gender, case and dynamic syllable stress, a phenomenon that number. Moreover, they present a major created additional difficulties in the differentiation comparing to other languages, implementation of the morphology. A stress can which refers to the introduction of the proper move from a stem to an ending but in many cases noun with a definite article that takes its form the movement is realized inside the stem. Such according to the grammatical features of the issues are handled in GF with the introduction of modified proper noun. The morphology of the pattern matching functions and pattern macros. verb in MG consists of a complex inflection The MG grammar includes 25 pattern matching system, as shown in Appendix B. Whilst in many functions and macros that indentify stressed languages, the grammatical notations are vowels, while at the same time they perform over expressed with the use primarily of syntax, MG a string, checking the matches, transforming the uses the combination of a stem and an stressed vowels into their unstressed form, and inflectional ending to express grammatical assigning the stress to the correct character. They categories such as person, number, tense, voice, also serve to assigning the appropriate case aspect and mood. The fact that MG retains a ending or handle irregularities, such as the productive synthetic passive formation increases addition of extra consonants and reduplication drastically the number of possible forms of the cases. verb, as most verbs have a second set of 3.1 Declinable Parts of Speech morphological forms for each tense in order to express passivization. Whilst Greek verbs are All nouns, proper nouns, adjectives, determiners, divided in two main categories, the second one quantifiers, pronouns, participles, articles and subdivided into two smaller ones, (Holton et al verbs in MG are declinable and each category ,1999; Iordanidou, 1999), the creation of the verb presents its own characteristics and irregularities. morphology in our grammar imposed the The implementation of the above categories consideration of a number of specific parameters, follows a similar pattern: we first divide them among them the stress movement, the number of into the main conjugations that grammars syllables which affects on the creation of the propose and then we make an exhaustive list of all the rules that specify their creation, as well as 3 Clitic pronoun 127 imperative forms, the active stem forms upon oper which we create the passive stems, the formation Noun : Type = {s : Number => Case => Str ; g : of the passive perfective participle, reduplication Gender} ; patterns, internal augmentation phenomena. In addition to the above, we needed to handle mkNoun_agori : (s: Str) -> Gender -> Noun = \agOri, g -> various irregularities, which referred mainly to let the formation of the imperative or dependent agori = mkStemNouns agOri; forms, the passivization or not of the verb, the in { occurrence of a participle, the formation of the s = table { Sg => table { active or passive simple past with the use of Nom | Acc | Vocative|CPrep P_se |CPrep PNul => ending forms borrowed from Ancient Greek. All agOri ; the above parameters resulted in the creation of Gen |CPrep P_Dat=> mkGenSg agori} ; 26 main functions that handle the changes in the Pl => table { inflected endings of the verbs, and 39 smaller Nom | Acc | Vocative|CPrep P_se |CPrep PNul => functions that are connected to the main mkNomPl agOri; Gen |CPrep P_Dat=> mkGen agOri}} ; g = g } ; functions and help us handle the modifications that the stem is subjected to, when conjugated. mkStemNouns : Str -> Str = \s -> case s of { Moreover, we must emphasize on the necessity c + v@(#stressedVowel) + x@(_ + _) =>c + unstress to create a series of pattern matching functions v + x } ; that form and alter stems, for the production of the passive perfective according to the active mkGenSg : Str -> Str = \s -> perfective or imperfective, the passive imperative case s of and the participles.