Compositional Morphology Through Deep Learning

Total Page:16

File Type:pdf, Size:1020Kb

Compositional Morphology Through Deep Learning Compositional Morphology Through Deep Learning A thesis presented by Ekaterina Vylomova ORCID: 0000-0002-4058-5459 to School of Computing and Information Systems in total fulfillment of the requirements for the degree of Doctor of Philosophy The University of Melbourne Melbourne, Australia November, 2018 Supervised by: Tim Baldwin and Trevor Cohn Declaration This is to certify that (i) the thesis comprises only my original work towards the PhD except where indicated in the Preface, (ii) due acknowledgement has been made in the text to all other material used, (iii) the thesis is less than 100, 000 words in length, exclusive of tables, maps, bibliographies and appendicies. Ekaterina Vylomova November, 2018 Abstract Most human languages have sophisticated morphological systems. In order to build success- ful models of language processing, we need to focus on morphology, the internal structure of words. In this thesis, we study two morphological processes: inflection (word change rules, e.g. run!runs) and derivation (word formation rules, e.g. run!runner). We first evaluate the ability of contemporary models that are trained using the distribu- tional hypothesis, which states that a word’s meaning can be expressed by the context in which it appears, to capture these types of morphology. Our study reveals that inflections are predicted at high accuracy whereas derivations are more challenging due to irregularity of meaning change. We then demonstrate that supplying the model with character-level information improves predictions and makes usage of language resources more efficient, especially in morphologically rich languages. We then address the question of to what extent and which information about word proper- ties (such as gender, case, number) can be predicted entirely from a word’s sentential content. To this end, we introduce a novel task of contextual inflection prediction. Our experiments on prediction of morphological features and a corresponding word form from sentential context show that the task is challenging, and as morphological complexity increases, performance significantly drops. We found that some morphological categories (e.g., verbal tense) are inherent and typically cannot be predicted from context while others (e.g., adjective num- ber and gender) are contextual and inferred from agreement. Compared to morphological inflection tasks, where morphological features are explicitly provided, and the system hasto predict only the form, accuracy on this task is much lower. vi Finally, we turn to word formation, derivation. Experiments with derivations show that they are less regular and systematic. We study how much a sentential context is indicative of a meaning change type. Our results suggest that even though inflections are more productive and regular than derivations, the latter also present cases of high regularity of meaning and form change, but often require extra information such as etymology, word frequency and more fine-grained annotation in order to be predicted at high accuracy. To my loving parents, Galina and Aleksei. Acknowledgements First and foremost, my highest gratitude goes to my great supervisors, Tim Baldwin and Trevor Cohn, my “Cyril and Methodius”, who introduced me to a whole new world of academic life and always supported me in all my endeavours. During my PhD course, I was fortunate to meet and collaborate with many great researchers. A large part of my research was done together with my good friends, Ryan Cotterell and Christo Kirov, who always inspired me to learn and explore more. I am also grateful to people with whom I collaborated during the JSALT’15 workshop, Reza Haffari, Kaisheng Yao, Chris Dyer and Kevin Duh, from whom I learned a lot about neural machine translation. I am also extremely thankful to Jason Eisner for all his support and brainstorming. Many thanks go to Laura Rimell with whom we worked on my first paper, which was eventually presented at ACL 2016. Certainly, this work would not be complete without David Yarowsky and his passion for derivational paradigms, and I hope we will have a chance to continue research in this direction. I was also very pleased to visit Hinrich Schütze and Alex Fraser’s labs in Munich and Chris Biemann’s group in Darmstadt in 2016, both of which great experiences, and I am very thankful to them for making this happen. Although my MSc and BSc are both in Computer Science, I have always been fascinated by linguistics. Here, I would like to thank all linguists with whom I had a chance to meet and be involved in fruitful discussions during the CoEDL Summer School 2017: Nick Evans, Mark Ellison, Martin Haspelmath, Sabine Stoll, Balthasar Bickel, and John Mansfield. During the last three years, I was also involved in teaching various university subjects, and would like to thank people with whom we tried our best to do “knowledge transfer” to x new generations of students: Jeremy Nicholson, David Eccles, Greg Wadley, Ben Rubinstein, Sue Wright, Rao Kotagiri, and Karin Verspoor. I would like to thank our great NLP lab and people with whom I shared a significant part of my PhD path: Bahar Salehi, Long Duong, Afshin Rahimi (and his wife Ava), Oliver Adams, Daniel Beck, Matthias Petri, Fei Liu, Doris Hoogeveen, Nitika Mathur, Mohammad Oloomi, Xuanli He, Brian Hur (and his wife Lena), Yitong Li, Shiva Subramanian, Ned Letcher, Moe Fang, Vu Hoang, Yuan Li, Malcolm Karutz, Anirudh Joshi, Julian Brooke, Philip Schulz, Alex Vedernikov, Michał Lukasik, Simon De Deyne, Luis Soto, Sasha Panchenko, and many-many others. And, certainly, this thesis would not have been be possible without the support of my family, especially Andrei and Mark, who always believed in my abilities.1 Finally, I would like to thank my awesome committee chair, Alistair Moffat, and (of course) Google Australia for supporting my thesis research. 1and even sometimes overestimated them (: xi Publications Part of the material presented in the thesis has been published before. In particular, Chapter 3 is based on publications [1,2], and most of the material for Chapter 5 was derived from [3]. [1] Ekaterina Vylomova, Laura Rimell, Trevor Cohn, and Timothy Baldwin. Take and took, gaggle and goose, book and read: Evaluating the utility of vector differences for lexical relation learning. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1671–1682, 2016. [2] Ekaterina Vylomova, Trevor Cohn, Xuanli He, and Gholamreza Haffari. Word representation models for morphologically rich languages in neural machine translation. In Proceedings of the First Workshop on Subword and Character Level Models in NLP, pages 103–108, 2017a. [3] Ekaterina Vylomova, Ryan Cotterell, Timothy Baldwin, and Trevor Cohn. Context- aware prediction of derivational word-forms. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: (Volume 2 : Short Papers), pages 118–124, 2017b. Contents List of Figures xix List of Tables xxi 1 Introduction1 1.1 Context of the Study . .1 1.2 Aim and Scope . .6 1.3 Thesis Structure . .9 2 Background 13 2.1 Language . 13 2.2 Morphology . 16 2.2.1 Linguistic approaches . 16 2.2.2 Morphological Typology . 18 2.2.3 Morphology-Syntax Interface: an example of grammatical case . 21 2.2.4 Inflections . 23 2.2.5 Derivations . 30 2.2.6 Resources . 40 2.2.7 Tasks . 41 2.3 Modelling . 44 2.3.1 Finite-State Machines . 44 2.3.2 Inflection as String-to-String Transduction and Automatic Learning of Edit Distances . 50 xvi Contents 2.3.3 Inflectional Paradigm Modelling . 52 2.3.4 Derivational Models . 59 2.3.5 Distributed Representations and Distributional Semantics . 61 2.3.6 Learning of Compositionality . 71 2.4 Conclusion . 74 3 Evaluation of Word Embeddings and Distributional Semantics Approach 75 3.1 Introduction . 75 3.2 Language Modelling for English . 77 3.2.1 Relation Learning . 78 3.2.2 General Approach and Resources . 81 3.2.3 Clustering . 85 3.2.4 Classification . 89 3.3 Language Modelling for Russian . 96 3.3.1 Closed-World Classification . 97 3.3.2 Open-World Classification . 99 3.3.3 Split Volcabulary for Morphology . 100 3.4 Machine Translation . 101 3.4.1 Models . 102 3.4.2 Experiments . 105 3.5 Conclusion . 108 4 Inflectional Morphology Models 111 4.1 Introduction . 111 4.2 Predicting Inflectional Morphology . 114 4.2.1 Task Notation . 114 4.2.2 Morphological Attributes . 115 4.3 An Encoder–Decoder Model . 117 4.4 A Structured Neural Model . 118 4.4.1 A Neural Conditional Random Field . 119 Contents xvii 4.4.2 The Morphological Inflector . 121 4.4.3 Parameter Estimation and Decoding . 121 4.5 Experiments . 123 4.5.1 Dataset . 123 4.5.2 Evaluation . 124 4.5.3 Hyperparameters and Other Minutiae . 125 4.6 Results, Error Analysis, and Discussion . 125 4.7 SIGMORPHON 2018 – SubTask 2 . 130 4.8 Conclusion . 132 5 Derivational Morphology Models 135 5.1 Introduction . 135 5.2 Context-Aware Prediction . 137 5.2.1 Dataset . 139 5.2.2 Experiments . 140 5.2.3 Discussion . 144 5.3 Conclusion . 145 6 Conclusions and Future Work 147 6.1 Future work . 150 6.1.1 Joint Modelling of Etymology and Derivation for English . 150 6.1.2 Low-Resource Language Modelling . 151 6.1.3 Morphological models for Machine Translation . 152 Bibliography 155 List of Figures 2.1 An example of the NOMLEX data entry for promotion.......... 40 2.2 Finite-State Machines . 46 2.3 An FSA for English adjective morphology. 46 2.4 An improved FSA for English adjective morphology. 47 2.5 An FSA for English derivational morphology. 47 2.6 An FST describing regular English verbal inflection cases. 48 2.7 A cascade of rules mapped into a single FST.
Recommended publications
  • Lara Mantovan Nominal Modification in Italian Sign Language Sign Languages and Deaf Communities
    Lara Mantovan Nominal Modification in Italian Sign Language Sign Languages and Deaf Communities Editors Annika Herrmann, Markus Steinbach, Ulrike Zeshan Editorial board Carlo Geraci, Rachel McKee, Victoria Nyst, Sibaji Panda, Marianne Rossi Stumpf, Felix Sze, Sandra Wood Volume 8 Lara Mantovan Nominal Modification in Italian Sign Language ISHARA PRESS ISBN 978-1-5015-1343-5 e-ISBN (PDF) 978-1-5015-0485-3 e-ISBN (EPUB) 978-1-5015-0481-5 ISSN 2192-516X e-ISSN 2192-5178 Library of Congress Cataloging-in-Publication Data A CIP catalog record for this book has been applied for at the Library of Congress. Bibliographic information published by the Deutsche Nationalbibliothek The Deutsche Nationalbibliothek lists this publication in the Deutsche Nationalbibliografie; detailed bibliographic data are available on the Internet at http://dnb.dnb.de. © 2017 Walter de Gruyter Inc., Boston/Berlin and Ishara Press, Preston, UK Printing and binding: CPI books GmbH, Leck ♾ Printed on acid-free paper Printed in Germany www.degruyter.com E Pluribus Unum (uncertain origin, attributed to Virgilio, Moretum, v. 103) Acknowledgements This book is a revised version of my 2015 dissertation which was approved for the PhD degree in Linguistics at Ca’ Foscari University of Venice. When I first plunged into the world of academic research, almost five years ago, I would never have imagined it was possible to achieve such an important milestone. Being so close to finalizing this book, I would like to look back briefly and remember and thank all the people who showed me the way, supported me, and encouraged me to grow both academically and personally.
    [Show full text]
  • Tact in Translation Negotiating Trust by the Russian Interpreter, at Home
    Tact in Translation Negotiating trust by the Russian interpreter, at home and abroad Eline Helmer University College London Anthropology of Russia and Interpreting Prof Anne White Dr Seth Graham Declaration I, Eline Helmer, confirm that the work presented in this thesis is my own. Where information has been derived from other sources, I confirm that this has been indicated in the thesis. Eline Helmer 2 Abstract Being the only conversational participant with the ability to follow both sides of a cross- linguistic dialogue gives the interpreter the power to obscure or clarify. Because of heightened mutual dependency, all interpreters need trust to perform their roles. They actively build trust, both between self and client and between clients. In academic linguistic contexts, trust is often regarded as based on impartiality: the more objective and invisible the interpreter, the better and more professional he or she will be. In practice, this approach is not always possible, or desirable. The trust relationship between client and interpreter can also be based on closeness and personal interdependence. Interpreting po-chelovecheski (lit. ‘approaching someone in a humane way’) is a colloquial way for Russian interpreters to describe this approach. This thesis explores the negotiation of trust by Russian interpreters. The Russian translation market’s unregulated character, and historical framing of ‘the foreigner’ as someone to be protected and mistrusted, make for an interesting case to study face-to-face interpreting at all levels of the international dialogue. Based on ethnographic fieldwork with interpreters from St Petersburg, Moscow and Pskov, I argue that becoming ‘someone’s voice’ presents a specific caring relationship.
    [Show full text]
  • Calculus of Possibilities As a Technique in Linguistic Typology
    Calculus of possibilities as a technique in linguistic typology Igor Mel’uk 1. The problem stated: A unified conceptual system in linguistics A central problem in the relationship between typology and the writing of individual grammars is that of developing a cross-linguistically viable con- ceptual system and a corresponding terminological framework. I will deal with this problem in three consecutive steps: First, I state the problem and sketch a conceptual system that I have put forward for typological explora- tions in morphology (Sections 1 and 2). Second, I propose a detailed illus- tration of this system: a calculus of grammatical voices in natural languages (Section 3). And third, I apply this calculus (that is, the corresponding con- cepts) in two particular case studies: an inflectional category known as an- tipassive and the grammatical voice in French (Sections 4 and 5). In the latter case, the investigation shows that even for a language as well de- scribed as French a rigorously standardized typological framework can force us to answer questions that previous descriptions have failed to re- solve. I start with the following three assumptions: 1) One of the most pressing tasks of today’s linguistics is description of particular languages, the essential core of this work being the writing of grammars and lexicons. A linguist sets out to describe a language as pre- cisely and exhaustively as possible; this includes its semantics, syntax, morphology and phonology plus (within the limits of time and funds avail- able) its lexicon. 2) Such a description is necessarily carried out in terms of some prede- fined concepts – such as lexical unit, semantic actant, syntactic role, voice, case, phoneme, etc.
    [Show full text]
  • Proceedings of the LFG13 Conference
    Proceedings of LFG13 Miriam Butt and Tracy Holloway King (Editors) 2013 CSLI Publications http://csli-publications.stanford.edu/ Contents 1 Dedication 4 2 Editors’ Note 5 3 Yasir Alotaibi, Muhammad Alzaidi, Maris Camilleri, Shaimaa ElSadek and Louisa Sadler: Psychological Predicates and Verbal Complemen- tation in Arabic 6 4 I Wayan Arka: Nominal Aspect in Marori 27 5 Doug Arnold and Louisa Sadler: Displaced Dependent Constructions 48 6 Daniele Artoni and Marco Magnani: LFG Contributions in Second Language Acquisition Research: The Development of Case in Russian L2 69 7 Oleg Belyaev: Optimal Agreement at M-structure: Person in Dargwa 90 8 Ansu Berg, Rigardt Pretorius and Laurette Pretorius: The Represen- tation of Setswana Double Objects in LFG 111 9 Tina Bögel: A Prosodic Resolution of German Case Ambiguities 131 10 Kersti Börjars and John Payne: Dimensions of Variation in the Ex- pression of Functional Features: Modelling Definiteness in LFG 152 11 George Aaron Broadwell: An Emphatic Auxiliary Construction for Emotions in Copala Triqui 171 12 Özlem Çetinoglu,˘ Sina Zarrieß and Jonas Kuhn: Dependency-based Sentence Simplification for Increasing Deep LFG Parsing Coverage 191 13 Elizabeth Christie: Result XPs and the Argument-Adjunct Distinction212 14 Cheikh Bamba Dione: Valency Change and Complex Predicates in Wolof: An LFG Account 232 15 Lachlan Duncan: Non-verbal Predicates in K’ichee’ Mayan: An LFG Approach 253 16 Dag Haug: Partial Control and the Semantics of Anaphoric Control in LFG 274 17 Annette Hautli-Janisz: Moving Right Along:
    [Show full text]
  • The Changing EFL Teacher-Textbook Relationship in Ukraine, 1917 – 2010: a Non-Native English-Speaking Teacher’S Perspective
    University of Calgary PRISM: University of Calgary's Digital Repository Graduate Studies The Vault: Electronic Theses and Dissertations 2019-01-25 The Changing EFL Teacher-Textbook Relationship in Ukraine, 1917 – 2010: A Non-Native English-Speaking Teacher’s Perspective. An Autoethnography Chebotaryov, Oleksandr Chebotaryov, O. (2019). The Changing EFL Teacher-Textbook Relationship in Ukraine, 1917 – 2010: A Non-Native English-Speaking Teacher’s Perspective. An Autoethnography (Unpublished master's thesis). University of Calgary, Calgary, AB. http://hdl.handle.net/1880/109860 master thesis University of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission. Downloaded from PRISM: https://prism.ucalgary.ca UNIVERSITY OF CALGARY The Changing EFL Teacher-Textbook Relationship in Ukraine, 1917 – 2010: A Non-Native English-Speaking Teacher’s Perspective An Autoethnography by Oleksandr Chebotaryov A THESIS SUBMITTED TO THE FACULTY OF GRADUATE STUDIES IN PARTIAL FULFILMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF ARTS GRADUATE PROGRAM IN EDUCATIONAL RESEARCH CALGARY, ALBERTA JANUARY, 2019 © Oleksandr Chebotaryov 2019 Abstract The goal of this autoethnographic study is to understand the relationship between a non-native English-speaking teacher of English as a foreign language and their textbooks at different stages of their professional development, in different socio-cultural and political – Soviet and post- Soviet – contexts with the growing tendency of opposing or rejecting textbooks as educational tools.
    [Show full text]
  • Mighty Morpheme Tagging Rangers
    in Proceedings of NAACL-HLT 2019 Background ❖ It's hard to make crosslinguistic comparisons of RNN syntactic performance (e.g., on subject-verb agreement prediction) ➢ Languages differ in multiple typological properties ➢ Cannot hold training data constant across languages Proposal: generate synthetic data to devise a controlled experimental paradigm for studying the interaction of the inductive bias of a neural architecture with particular typological properties. Setup ❖ Data: English Penn Treebank sentences converted to Universal Dependencies scheme Example of a dependency parse tree ONE in Proceedings of NAACL-HLT 2019 Setup ❖ Identify all verb arguments with nsubj, nsubjpass, dobj and record plurality (HOW? manually?) Example of a dependency parse tree Setup ❖ Generate synthetic data by appending novel morphemes to the verb arguments identified to inflect them for argument role and number Setup ❖ Generate synthetic data by appending novel morphemes to the verb arguments identified to inflect them for argument role and number No explanation or motivation given for how the novel morphemes were developed, nor an explicit mention that they're novel! Might length matter? Typological properties ❖ Does jointly predicting object and subject plurality improve overall performance? ➢ Generate data with polypersonal agreement ❖ Do RNNs have inductive biases favoring certain word orders over others? ➢ Generate data with different word orders ❖ Does overt case marking influence agreement prediction? ➢ Generate data with different case marking systems ■ unambiguous, syncretic, argument marking Examples of synthetic data Task ❖ Predict a verb's subject and object plurality features. Input: synthetically-inflected sentence Output: one category prediction each for subject & object subject: [singular, plural] object: [singular, plural, none] (if no object) (It's NOT CLEAR in the paper WHAT the actual prediction task is / what the actual output space is.
    [Show full text]
  • Multi-Lingual Dependency Parsing: Word Representation and Joint
    Multi-Lingual Dependency Parsing : Word Representation and Joint Training for Syntactic Analysis Mathieu Dehouck To cite this version: Mathieu Dehouck. Multi-Lingual Dependency Parsing : Word Representation and Joint Training for Syntactic Analysis. Computer Science [cs]. Université de lille, 2019. English. tel-02197615 HAL Id: tel-02197615 https://tel.archives-ouvertes.fr/tel-02197615 Submitted on 30 Jul 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Centre de Recherche en Informatique, Signal et Automatique de Lille École Doctorale Sciences Pour L’Ingénieur Thèse de Doctorat Spécialité : Informatique et Applications préparée au sein de l’équipe Magnet, du laboratoire Cristal et du centre de recherche Inria Lille - Nord Europe financée par l’Université de Lille Mathieu Dehouck Multi-Lingual Dependency Parsing : Word Representation and Joint Training for Syntactic Analysis Parsing en Dépendances Multilingue : Représentation de Mots et Apprentissage Joint pour l’Analyse Syntaxique sous la direction de Dr. Marc TOMMASI et l’encadrement de Dr. Pascal DENIS Soutenue publiquement à Villeneuve d’Ascq, le 20 mai 2019 devant le jury composé de: Mme Sandra KÜBLER Indiana University Bloomington Rapportrice M. Alexis NASR Université d’Aix Marseille Rapporteur Mme Hélène TOUZET CNRS Présidente du jury M.
    [Show full text]
  • An Introduction to English Phonetics for the Estonian Learner
    TARTU STATE UNIVERSITY AN INTRODUCTION TO ENGLISH PHONETICS FOR THE ESTONIAN LEARNER O.MUTT TARTU SSm 19 7 1 TARTU STATE UNIVERSITY Chair of English AN INTRODUCTION TO ENGLISH PHONETICS FO R M ESTONIAN LEARNER O.MUTT'S preface The present survey of English pronunciation is primar­ ily intended to serve as a handbook for students of English in the Estonian S.S.R. There has been no shortage of good surveys of the pho­ netic system of British (and more recently of American) Eng­ lish either abroad or in the Soviet Union. What has been lacking, however, is a more-or-less complete account of the pronunciation of English written from the point of view of the Estonian learner. We do now have a competently written and thorough survey of English intonation and accentuation in comparison with that of Estonian (П. К. Ваараск, Тониче­ ские средства речи, ч. I—II, Таллин 1964 ), but there is as yet no comprehensive account of the English vowels and con­ sonants, of various assimilatory phenomena in English, etc. written with the Estonian learner in mind. The present hand­ book constitutes a modest attempt to fill this gap. The aim of this publication is to provide advanced Estonian learners of English with the essential theoretical and practical material which would enable them to master English pronunciation themselves and to learn how to teach it to others. Chapters 1-2, and the first four sections of Chapter 3» contain material from the course on theoretical » phonetics provided for students of English at universities and institutes in the Soviet Union.
    [Show full text]
  • Journal of South Asian Languages and Linguistics 2(2)
    JSALL 2021; aop Netra P. Paudyal and John Peterson* How one language became four: the impact of different contact-scenarios between “Sadani” and the tribal languages of Jharkhand https://doi.org/10.1515/jsall-2021-2028 Published online May 4, 2021 Abstract: Four Indo-Aryan linguistic varieties are spoken in the state of Jharkhand in eastern central India, Sadri/Nagpuri, Khortha, Kurmali and Panchparganiya, which are considered by most linguists to be dialects of other, larger languages of the region, such as Bhojpuri, Magahi and Maithili, although their speakers consider them to be four distinct but closely related languages, collectively referred to as “Sadani”. In the present paper, we first make use of the program COG by the Summer Institute of Linguistics (SIL) to show that these four varieties do indeed form a distinct, compact genealogical group within the Magadhan language group of Indo- Aryan. We then go on to argue that the traditional classification of these languages as dialects of other languages appears to be based on morphosyntactic differences between these four languages and similarities with their larger neighbors such as Bhojpuri and Magahi, differences which have arisen due to the different contact situations in which they are found. Keywords: Khortha; Kudmali; language contact; Sadani; Sadri 1 Introduction While the first official language of the state of Jharkhand in eastern central India is Hindi, over 96% of the state population speaks a local tribal or regional language as their first (L1) or second language (L2) on a daily basis, and only 3.7% of the people speak Hindi as their first language (JTWRI 2013:4–5).
    [Show full text]
  • English for Practical Purposes 9
    ENGLISH FOR PRACTICAL PURPOSES 9 CONTENTS Chapter 1: Introduction of English Grammar Chapter 2: Sentence Chapter 3: Noun Chapter 4: Verb Chapter 5: Pronoun Chapter 6: Adjective Chapter 7: Adverb Chapter 8: Preposition Chapter 9: Conjunction Chapter 10: Punctuation Chapter 11: Tenses Chapter 12: Voice Chapter 1 Introduction to English grammar English grammar is the body of rules that describe the structure of expressions in the English language. This includes the structure of words, phrases, clauses and sentences. There are historical, social, and regional variations of English. Divergences from the grammardescribed here occur in some dialects of English. This article describes a generalized present-dayStandard English, the form of speech found in types of public discourse including broadcasting,education, entertainment, government, and news reporting, including both formal and informal speech. There are certain differences in grammar between the standard forms of British English, American English and Australian English, although these are inconspicuous compared with the lexical andpronunciation differences. Word classes and phrases There are eight word classes, or parts of speech, that are distinguished in English: nouns, determiners, pronouns, verbs, adjectives,adverbs, prepositions, and conjunctions. (Determiners, traditionally classified along with adjectives, have not always been regarded as a separate part of speech.) Interjections are another word class, but these are not described here as they do not form part of theclause and sentence structure of the language. Nouns, verbs, adjectives, and adverbs form open classes – word classes that readily accept new members, such as the nouncelebutante (a celebrity who frequents the fashion circles), similar relatively new words. The others are regarded as closed classes.
    [Show full text]
  • A Grammar of Kunbarlang
    A Grammar of Kunbarlang Ivan Kapitonov ORCID: 0000-0002-1603-6265 Submitted in total fulfilment of the requirements ofthe degree of Doctor of Philosophy School of Languages and Linguistics The University of Melbourne July 2019 Copyright 2019 Ivan Kapitonov This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License. To view a copy of this license, visit http://creativecommons. org/licenses/by-nc-nd/3.0/ or send a letter to Creative Commons, PO Box 1866, Mountain View, CA 94042, USA. Abstract This thesis is a comprehensive description of Kunbarlang, an Aboriginal language from northern Australia. The description and analysis are based on my original field work, as well as build on the preceding body of work by other scholars. Between 2015 and 2018 I have done field work in Warruwi (South Goulburn Island), Maningrida, and Darwin. The data elicited in those trips and the recordings of narratives andsemi- spontaneous conversation constitute the foundation of the present grammar. However, I was fortunate in that I was not working from scratch. Carolin Coleman did foundational work on Kunbarlang in central-western Arnhem Land from 1981, which resulted in the first grammar of the language (Coleman 1982). In her subsequent work in the area in the 1990’s, she carried on with lexicographic research in Kunbarlang, Mawng and Maningrida languages. More recently, Dr. Aung Si (Universität zu Köln), Dr. Isabel O’Keeffe (University of Sydney), and Dr. Ruth Singer (University of Melbourne / Australian National University) made a number of recordings of Kunbarlang speakers at Maningrida, Warruwi, Minjilang and Darwin.
    [Show full text]
  • Grammatical Markers and Grammatical Relations in the Simple Clause in Old French
    Grammatical markers and grammatical relations in the simple clause in Old French Nicolas Mazziotta Universität Stuttgart Institut für Linguistik/Romanistik Germany [email protected] Abstract described in a dependency framework (henceforth “DF”). As an introduction, I will first give a quick The focus of this paper is the descrip- overview of OF (1.1), and define the focus of this tion of the surface syntax relations in the study (1.2). simple clause in Old French and the way they can be described in a dependency 1.1 Old French: an overview grammar. The declension system of Old The term Old French roughly corresponds to a French is not reliable enough to cope with continuum of romance varieties that were spoken the identification of the dependents of the in the northern half of France, in Wallonia and main verb, but it remains true that related in England during the Middle Ages (9th-13th C.). grammatical markers are still observable To carry on a description of OF, one has to sys- and obey rules that forbid them to appear tematize the common ground that all these idioms in specific syntactic positions. share as well as the major differences that distin- This study relies on three previous ac- guish the varieties. The paper will focus on that counts; Igor Mel’cuk’sˇ “criteria B”, the common ground, which can be seen as the direct criteria that are used to determine which ancestor of modern French. is the syntactic governor in a syntactic de- From a grammatical point of view, OF is much pendency relation, Thomas Groß’s intra- more analytic than Latin is: many relations are in- word analysis, which grants morphs node troduced by prepositions.
    [Show full text]