Using Machine Learning Techniques to Identify the Native Language of an English User

Total Page:16

File Type:pdf, Size:1020Kb

Using Machine Learning Techniques to Identify the Native Language of an English User Using Machine Learning Techniques to Identify the Native Language of an English User Jake Browning 4th Year Project Report Computer Science and Mathematics School of Informatics University of Edinburgh April 2017 Acknowledgements I thank my supervisor, Dr. Shay Cohen, for his invaluable insights and advice during the undertaking of this project. 2 Contents 0.1 Abstract . 6 1 Introduction 7 1.1 Previous Work . 8 1.1.1 Areas Preceding Native Language Identification . 8 1.1.2 Native Language Identification . 8 1.2 Motivation . 9 1.3 Outline and Goals . 9 1.3.1 Outline . 9 1.3.2 Overview of Goals . 10 1.4 A Note on Topic Bias . 10 2 Description of the Data 12 2.1 Corpus of Non-Native English . 12 2.2 Corpus of Native English . 12 2.3 The Lang-8 Corpus vs other English Learner Corpora . 12 2.4 Training and Testing Sets . 13 2.4.1 Native vs Non-Native . 13 2.4.2 Multi-Class Non-Native . 13 2.5 General Information on Each Class . 13 3 Differing Notions of Grammatical Structure, and their Uses in this Task 14 3.1 Constituency Grammar . 14 3.2 Dependency Grammar . 14 3.3 POS Tagset and Tagging Software . 16 3.3.1 Tagset . 16 4 Classification Methods 17 4.1 Gaussian Naive Bayes . 17 4.2 Logistic Regression . 18 4.3 Decision Trees and Random Forests . 19 4.3.1 Entropy . 19 4.3.2 Information Gain . 19 3 4.3.3 Using Information Gain to Rank Features . 20 4.3.4 Random Forests . 20 4.4 Feature Normalisation . 21 5 Using Language Models as Classification Features 22 5.1 Description of a Language Model, and its Application to this Task . 22 5.2 Description of Language Models Tested . 22 5.2.1 The SLOR language model . 23 5.2.2 Potential issues with the SLOR model . 23 5.2.3 The POS language model . 23 5.3 Using Language Models to Classify Native vs Non-Native . 24 5.3.1 Results . 24 5.3.2 Summary of Results . 24 5.4 Using Language Models to Classify among Non-Natives . 25 6 A Heuristic Approach to Classifying Native vs Non-Native 27 6.1 Adverb Placement . 27 6.2 Left and Right Branching . 27 6.2.1 What is "branching"? . 28 6.2.2 Capturing Differences in Branching . 28 6.2.3 Passive Voice . 29 6.2.4 Sentence Length . 29 6.2.5 Results . 30 6.3 Combining the "Language Model" and "Heuristic" Features . 30 7 Feature "Families" Used in the Multi-Class Problem 31 8 Using Techniques Influenced by Previous L1 Identification Research to Improve the Multi- Class Classification Problem 32 8.1 Capturing and Categorising Spelling Errors . 32 8.1.1 Levenshtein Distance . 33 8.1.2 Using Levenshtein Distance to Capture Error Types . 34 8.1.3 Results and Feature Importance . 35 8.2 Using an Error Correction Library to Detect Errors . 36 8.2.1 Results . 37 8.3 Using Stopwords as Features . 38 8.3.1 Stopwords . 38 8.3.2 Method . 38 8.3.3 Results . 38 9 Investigating New Feature Types in the Multi-Class Problem 39 9.1 Capturing the Usage of Prepositions . 39 9.1.1 The "Caseframe" Method . 39 9.1.2 Justifications for Not Explicitly Counting Errors . 40 4 9.1.3 Using an Ensemble Method to Capture Differences in Preposition Usage . 40 9.1.4 Results . 42 9.2 Capturing the Usage of Verbs in the Global Context . 43 9.2.1 Simplifications to the Model Proposed by Tajiri et al . 43 9.2.2 Method . 43 9.2.3 Results . 45 9.3 Placement of Prepositional Phrases . 46 9.3.1 Method . 46 9.3.2 Results . 47 9.4 Correlative Conjunctions . 48 9.4.1 Linking Adjectives and Phrases with Conjunctions . 48 9.4.2 Correlative Conjunctions Spanning Multiple Clauses . 49 9.4.3 Frequencies . 50 10 Further Methods to Increase the Performance of the Multi-Class Classifier, and Analysis of the Accuracy 51 10.1 Incorporating Features Used in the Native vs Non-Native Problem . 51 10.2 Arguments against Dimensionality Reduction . 51 10.3 Confusion Matrix . 52 11 Feature Analysis of the Multi-Class Classifier 53 11.1 Methods used during Feature Analysis . 53 11.1.1 Information Gain . 53 11.1.2 Continuous Mutual Information . 53 11.2 Feature Analysis . 54 11.2.1 Using Information Gain to Rank all Features . 54 11.2.2 Using Information Gain to Rank Feature Families . 55 11.3 Feature Correlation . 56 12 Unused and Unsuccessful Methods 57 12.1 Using Simple Methods to Count Preposition Errors . 57 12.2 Overuse of "to me" . 57 12.3 Incorrect Tense used with Modal Verb . 58 12.4 "One of" Followed by Singular Noun . 58 13 Conclusion 59 13.0.1 Insights Gained . 59 13.1 Motivation for Future Research . 60 A Tagsets 62 A.1 Table of Phrase-Level POS Tags . 62 A.2 Table of Word-Level POS Tags . 62 Bibliography 64 5 Abstract 0.1 Abstract In this project, we perform native language identification on English text written by native speakers of English, Korean, Chinese and Japanese using machine learning techniques. Statistical language models, error type analysis, and investigations into the differing styles and grammatical structure of text written by authors in each class are utilised as features in the classification task. We build a binary classifier which discriminates natives from non-natives with accuracies ranging from 97:17% to 99:19%, and a multi-class classifier which discriminates among Korean, Chinese and Japanese speakers with 68:8% accuracy. We also emphasise and report on the differences in structure and style found in texts across the four classes. 6 Chapter 1 Introduction L1 Interference is a highly researched area of linguistics which is concerned with studying the various ways in which a non-native user of English is influenced by their native language in their style of speech, as well as the frequency and types of mistakes that they make. Within the past sixteen years, the area of "native language detection" has become well-established in computational linguistics, and involves using machine learning techniques to determine the native language of an English user. In this task, machine learning techniques were used to classify the native languages of native English, Korean, Chi- nese and Japanese speakers according to their use of English. The challenges of detecting a writer as native or not compared to detecting the native languages among non-native writers were found to be very different, and in.
Recommended publications
  • Measuring the Comprehension of Negation in 2- to 4-Year-Old Children Ann E
    Measuring the comprehension of negation in 2- to 4-year-old children Ann E. Nordmeyer Michael C. Frank [email protected] [email protected] Department of Psychology Department of Psychology Stanford University Stanford University Abstract and inferential negation (i.e. negation of inferred beliefs of others). Regardless of taxonomy, negation is used in a variety Negation is one of the most important concepts in human lan- guage, and yet little is known about children’s ability to com- of contexts to express a range of different thoughts. prehend negative sentences. In this experiment, we explore The relationship between different types of negation is un- how children’s comprehension of negative sentences changes between 2- to 4-year-old children, as well as how comprehen- known. One possibility is that distinct categories of negation sion is influenced by how negative sentences are used. Chil- belong to a single cohesive concept. Even pre-linguistically, dren between the ages of 2 and 4 years watched a video in nonexistence, rejection, and denial could all fall under a su- which they heard positive and negative sentences. Negative sentences, such as “look at the boy with no apples”, referred perordinate conceptual category of negation. It is also pos- either to an absence of a characteristic or an alternative char- sible, however, that these types of negation represent fun- acteristic. Older children showed significant improvements in damentally different concepts. For example, the situation in speed and accuracy of looks to target. Children showed more rejection difficulty when the negative sentence referred to nothing, com- which a child expresses a dislike for going outside ( ) pared to when it referred to an alternative.
    [Show full text]
  • Annotating Tense, Mood and Voice for English, French and German
    Annotating tense, mood and voice for English, French and German Anita Ramm1;4 Sharid Loaiciga´ 2;3 Annemarie Friedrich4 Alexander Fraser4 1Institut fur¨ Maschinelle Sprachverarbeitung, Universitat¨ Stuttgart 2Departement´ de Linguistique, Universite´ de Geneve` 3Department of Linguistics and Philology, Uppsala University 4Centrum fur¨ Informations- und Sprachverarbeitung, Ludwig-Maximilians-Universitat¨ Munchen¨ [email protected] [email protected] fanne,[email protected] Abstract features. They may, for instance, be used to clas- sify texts with respect to the epoch or region in We present the first open-source tool for which they have been produced, or for assigning annotating morphosyntactic tense, mood texts to a specific author. Moreover, in cross- and voice for English, French and Ger- lingual research, tense, mood, and voice have been man verbal complexes. The annotation is used to model the translation of tense between based on a set of language-specific rules, different language pairs (Santos, 2004; Loaiciga´ which are applied on dependency trees et al., 2014; Ramm and Fraser, 2016)). Identi- and leverage information about lemmas, fying the morphosyntactic tense is also a neces- morphological properties and POS-tags of sary prerequisite for identifying the semantic tense the verbs. Our tool has an average accu- in synthetic languages such as English, French racy of about 76%. The tense, mood and or German (Reichart and Rappoport, 2010). The voice features are useful both as features extracted tense-mood-voice (TMV) features may in computational modeling and for corpus- also be useful for training models in computational linguistic research. linguistics, e.g., for modeling of temporal relations (Costa and Branco, 2012; UzZaman et al., 2013).
    [Show full text]
  • Serial Verb Constructions Revisited: a Case Study from Koro
    Serial Verb Constructions Revisited: A Case Study from Koro By Jessica Cleary-Kemp A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Linguistics in the Graduate Division of the University of California, Berkeley Committee in charge: Associate Professor Lev D. Michael, Chair Assistant Professor Peter S. Jenks Professor William F. Hanks Summer 2015 © Copyright by Jessica Cleary-Kemp All Rights Reserved Abstract Serial Verb Constructions Revisited: A Case Study from Koro by Jessica Cleary-Kemp Doctor of Philosophy in Linguistics University of California, Berkeley Associate Professor Lev D. Michael, Chair In this dissertation a methodology for identifying and analyzing serial verb constructions (SVCs) is developed, and its application is exemplified through an analysis of SVCs in Koro, an Oceanic language of Papua New Guinea. SVCs involve two main verbs that form a single predicate and share at least one of their arguments. In addition, they have shared values for tense, aspect, and mood, and they denote a single event. The unique syntactic and semantic properties of SVCs present a number of theoretical challenges, and thus they have invited great interest from syntacticians and typologists alike. But characterizing the nature of SVCs and making generalizations about the typology of serializing languages has proven difficult. There is still debate about both the surface properties of SVCs and their underlying syntactic structure. The current work addresses some of these issues by approaching serialization from two angles: the typological and the language-specific. On the typological front, it refines the definition of ‘SVC’ and develops a principled set of cross-linguistically applicable diagnostics.
    [Show full text]
  • Tenses and Conjugation (Pdf)
    Created by the Evergreen Writing Center Library 3407 867-6420 Tenses and Conjugation Using correct verb forms is crucial to communicating coherently. Understanding how to apply different tenses and properly conjugate verbs will give you the tools with which to craft clear, effective sentences. Conjugations A conjugation is a list of verb forms. It catalogues the person, number, tense, voice, and mood of a verb. Knowing how to conjugate verbs correctly will help you match verbs with their subjects, and give you a firmer grasp on how verbs function in different sentences. Here is a sample conjugation table: Present Tense, Active Voice, Indicative Mood: Jump Person Singular Plural 1st Person I jump we jump 2nd Person you jump you jump 3rd Person he/she/it jumps they jump Person: Person is divided into three categories (first, second, and third person), and tells the reader whether the subject is speaking, is spoken to, or is spoken about. Each person is expressed using different subjects: first person uses I or we; second person uses you; and third person uses he/she/it or they. Keep in mind that these words are not the only indicators of person; for example in the sentence “Shakespeare uses images of the divine in his sonnets to represent his own delusions of grandeur”, the verb uses is in the third person because Shakespeare could be replaced by he, an indicator of the third person. Number: Number refers to whether the verb is singular or plural. Tense: Tense tells the reader when the action of a verb takes place.
    [Show full text]
  • Double Classifiers in Navajo Verbs *
    Double Classifiers in Navajo Verbs * Lauren Pronger Class of 2018 1 Introduction Navajo verbs contain a morpheme known as a “classifier”. The exact functions of these morphemes are not fully understood, although there are some hypotheses, listed in Sec- tions 2.1-2.3 and 4. While the l and ł-classifiers are thought to have an effect on averb’s transitivity, there does not seem to be a comparable function of the ; and d-classifiers. However, some sources do suggest that the d-classifier is associated with the middle voice (see Section 4). In addition to any hypothesized semantic functions, the d-classifier is also one of the two most common morphemes that trigger what is known as the d-effect, essentially a voicing alternation of the following consonant explained in Section 3 (the other mor- pheme is the 1st person dual plural marker ‘-iid-’). When the d-effect occurs, the ‘d’ of the involved morpheme is often realized as null in the surface verb. This means that the only evidence of most d-classifiers in a surface verb is the voicing alternation of the following stem-initial consonant from the d-effect. There is also a rare phenomenon where two *I would like to thank Jonathan Washington and Emily Gasser for providing helpful feedback on earlier versions of this thesis, and Jeremy Fahringer for his assistance in using the Swarthmore online Navajo dictionary. I would also like to thank Ted Fernald, Ellavina Perkins, and Irene Silentman for introducing me to the Navajo language. 1 classifiers occur in a single verb, something that shouldn’t be possible with position class morphology.
    [Show full text]
  • Number Systems in Grammar Position Paper
    1 Language and Culture Research Centre: 2018 Workshop Number systems in grammar - position paper Alexandra Y. Aikhenvald I Introduction I 2 The meanings of nominal number 2 3 Special number distinctions in personal pronouns 8 4 Number on verbs 9 5 The realisation of number 12 5.1 The forms 12 5.2 The loci: where number is shown 12 5.3 Optional and obligatory number marking 14 5.4 The limits of number 15 5.4.1 Number and the meanings of nouns 15 5.4.2 'Minor' numbers 16 5.4.3 The limits of number: nouns with defective number values 16 6 Number and noun categorisation 17 7 Markedness 18 8 Split, or mixed, number systems 19 9 Number and social deixis 19 10 Expressing number through other means 20 11 Number systems in language history 20 12 Summary 21 Further readings 22 Abbreviations 23 References 23 1 Introduction Every language has some means of distinguishing reference to one individual from reference to more than one. Number reference can be coded through lexical modifiers (including quantifiers of various sorts or number words etc.), or through a grammatical system. Number is a referential property of an argument of the predicate. A grammatical system of number can be shown either • Overtly, on a noun, a pronoun, a verb, etc., directly referring to how many people or things are involved; or • Covertly, through agreement or other means. Number may be marked: • within an NP • on the head of an NP • by agreement process on a modifier (adjective, article, demonstrative, etc.) • through agreement on verbs, or special suppletive or semi-suppletive verb forms which may code the number of one or more verbal arguments, or additional marker on the verb.
    [Show full text]
  • Clusivity in Presidential Discourse: a Rhetorical Discourse Analysis of State-Of-The-Nation Addresses in Ghana and the United States
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Michigan Technological University Michigan Technological University Digital Commons @ Michigan Tech Dissertations, Master's Theses and Master's Reports 2016 Clusivity in Presidential Discourse: A Rhetorical Discourse Analysis of State-of-the-Nation Addresses in Ghana and the United States Stephen K. Dadugblor Michigan Technological University, [email protected] Copyright 2016 Stephen K. Dadugblor Recommended Citation Dadugblor, Stephen K., "Clusivity in Presidential Discourse: A Rhetorical Discourse Analysis of State-of- the-Nation Addresses in Ghana and the United States", Open Access Master's Thesis, Michigan Technological University, 2016. https://digitalcommons.mtu.edu/etdr/123 Follow this and additional works at: https://digitalcommons.mtu.edu/etdr Part of the African Languages and Societies Commons, English Language and Literature Commons, Other Rhetoric and Composition Commons, and the Rhetoric Commons CLUSIVITY IN PRESIDENTIAL DISCOURSE: A RHETORICAL DISCOURSE ANALYSIS OF STATE-OF-THE-NATION ADDRESSES IN GHANA AND THE UNITED STATES By Stephen K. Dadugblor A THESIS Submitted in partial fulfillment of the requirements for the degree of MASTER OF SCIENCE In Rhetoric, Theory and Culture MICHIGAN TECHNOLOGICAL UNIVERSITY 2016 © 2016 Stephen K. Dadugblor This thesis has been approved in partial fulfillment of the requirements for the Degree of MASTER OF SCIENCE in Rhetoric, Theory and Culture. Department of Humanities Thesis Advisor: Victoria L. Bergvall Committee Member: Abraham Romney Committee Member: Craig Waddell Committee Member: Kari B. Henquinet Department Chair: Ronald Strickland To my brother, Michael K. Dadugblor, who gave up much, that I might have so much more TABLE OF CONTENTS ACKNOWLEDGEMENTS…………………………………………………………6 ABSTRACT………………………………………………………………………..…7 CHAPTER ONE: THE ROLE OF LANGUAGE IN POLITICS………..……….9 1.1.
    [Show full text]
  • Evidentiality and Mood: Grammatical Expressions of Epistemic Modality in Bulgarian
    Evidentiality and mood: Grammatical expressions of epistemic modality in Bulgarian DISSERTATION Presented in Partial Fulfillment of the Requirements o the Degree Doctor of Philosophy in the Graduate School of The Ohio State University By Anastasia Smirnova, M.A. Graduate Program in Linguistics The Ohio State University 2011 Dissertation Committee: Brian Joseph, co-advisor Judith Tonhauser, co-advisor Craige Roberts Copyright by Anastasia Smirnova 2011 ABSTRACT This dissertation is a case study of two grammatical categories, evidentiality and mood. I argue that evidentiality and mood are grammatical expressions of epistemic modality and have an epistemic modal component as part of their meanings. While the empirical foundation for this work is data from Bulgarian, my analysis has a number of empirical and theoretical consequences for the previous work on evidentiality and mood in the formal semantics literature. Evidentiality is traditionally analyzed as a grammatical category that encodes information sources (Aikhenvald 2004). I show that the Bulgarian evidential has richer meaning: not only does it express information source, but also it has a temporal and a modal component. With respect to the information source, the Bulgarian evidential is compatible with a variety of evidential meanings, i.e. direct, inferential, and reportative, as long as the speaker has concrete perceivable evidence (as opposed to evidence based on a mental activity). With respect to epistemic commitment, the construction has different felicity conditions depending on the context: the speaker must be committed to the truth of the proposition in the scope of the evidential in a direct/inferential evidential context, but not in a reportative context.
    [Show full text]
  • Investigation of Spoken-Language Detection and Classification
    information Article Investigation of Spoken-Language Detection and Classification in Broadcasted Audio Content Rigas Kotsakis, Maria Matsiola , George Kalliris and Charalampos Dimoulas * School of Journalism and Mass Communications, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece; [email protected] (R.K.); [email protected] (M.M.); [email protected] (G.K.) * Correspondence: [email protected]; Tel.: +30-2310-9942454 Received: 1 March 2020; Accepted: 14 April 2020; Published: 15 April 2020 Abstract: The current paper focuses on the investigation of spoken-language classification in audio broadcasting content. The approach reflects a real-word scenario, encountered in modern media/monitoring organizations, where semi-automated indexing/documentation is deployed, which could be facilitated by the proposed language detection preprocessing. Multilingual audio recordings of specific radio streams are formed into a small dataset, which is used for the adaptive classification experiments, without seeking—at this step—for a generic language recognition model. Specifically, hierarchical discrimination schemes are followed to separate voice signals before classifying the spoken languages. Supervised and unsupervised machine learning is utilized at various windowing configurations to test the validity of our hypothesis. Besides the analysis of the achieved recognition scores (partial and overall), late integration models are proposed for semi-automatically annotation of new audio recordings. Hence, data augmentation mechanisms are
    [Show full text]
  • Does Grammatical Gender Influence Perception? 388
    Address for correspondence: Izabella Haertlé, Faculty of Psychology, SWPS University of Social Sciences and Humanities, Chodakowska 19/31, 03-815 Warszawa, Poland. E-mail: [email protected] 387 I. HAERTLÉ interpretation”1. The suggestion that language significantly influences thought was initially widely criticised, but has recently come back into favour, especially in its weaker forms, and has yielded interesting results - including evidence from neuroimaging studies (e.g. Boutonnet, Athanasopoulos, & Thierry, 2012; for review see: Thierry, 2016). Researchers have demonstrated the role of language in the perception of time (Boroditsky, 2000, 2001, 2010), colour (Thierry et al., 2009; Winawer et al., 2007), spatial cognition (Bowerman, Choi, McDonough, & Mandler, 1999; Gentner & Loftus, 1979; Landau & Jackendoff, 1993; Levinson, Kita, Haun, & Rasch, 2002; McDonough, Choi, & Mandler, 2003), the perception of motion (Athanasopoulos et al., 2015; Czechowska & Ewert, 2011), attention and information processing styles (Rhode, Voyer, & Gleibs, 2016), the perception and memory of events (Boroditsky, Ham, & Ramscar, 2002; Fausey & Boroditsky, 2010; Loftus & Palmer, 1974; Scholl & Nakayama, 2002), and even constructing agency, including attending to and remembering the agents of events (Choi, 2009; Fausey & Boroditsky, 2010a; Fausey & Boroditsky, 2011; Fausey, Long, Inamori, & Boroditsky, 2010). One specific aspect of language that may influence perceptions of reality is grammatical gender. Various cultures attribute grammatical gender to words based on various criteria, often seemingly impossible to explain with logic. Does the grammatical gender or class ascribed to a given noun influence the way we perceive it? Is there a supra-language conceptual gender categorisation of objects that is culturally universal, and not as arbitrary as grammatical gender? One of the first researchers to address these questions was Ervin (1962).
    [Show full text]
  • A Contrastive Study of Identity Perception in Voice
    Voice and Identity: A contrastive study of identity perception in voice Begoña Payá Herrero VoicVoicee and Identity: A contrastive study of identity perception in voice Inaugural-Dissertation zur Erlangung des Doktorgrades der Philosophie an der Ludwig-Maximilians-Universität München Im Cotutelle Verfahren mit der Universität Universidad de Valencia Vorgelegt von Begoña PAYÁ HERRERO 2009 Betreuer an der Ludwigs-Maximilians-Universität: Prof. Dr. Wolfgang Schulze Betreuer an der Universidad de Valencia: Prof. Dr. Barry Pennock-Speck Gutachter der Ludwigs-Maximilians-Universität: Prof. Dr. Wolfgang Schulze Gutachter der Universidad de Valencia: Prof. Dr. Barry Pennock-Speck Tag der mündlichen Prüfung: 11.09.2009 (in Valencia) Acknowledgements Every PhD student writing his/her doctoral thesis is confronted with an extremely challenging task, which requires a considerable amount of self-discipline. Throughout the years this may last, there are inevitably moments of uncertainty, of confusion, of emptiness, or even moments where we lose track of where we are going and are drowned by the avalanche of knowledge. So was my case. But what seemed like a mountain too high to climb at the beginning became a task that could be managed step by step thanks to the support of many people. First of all, from the LIPP Programme in Munich, I woud like to thank its coordinator, Caroline Trautmann, and all the LIPP professors for what they have taught me along these three years, especially, Prof. Janney and my tutor, Prof. Wolfgang Schulze, for always believing in me and not hesitating to make suggestions when I was a bit lost. I am equally grateful to my tutor in the Universitat de València, Dr.
    [Show full text]
  • The Evolution of Case Grammar
    The evolution of case grammar Remi van Trijp language Computational Models of Language science press Evolution 4 Computational Models of Language Evolution Editors: Luc Steels, Remi van Trijp In this series: 1. Steels, Luc. The Talking Heads Experiment: Origins of words and meanings. 2. Vogt, Paul. How mobile robots can self-organize a vocabulary. 3. Bleys, Joris. Language strategies for the domain of colour. 4. van Trijp, Remi. The evolution of case grammar. 5. Spranger, Michael. The evolution of grounded spatial language. ISSN: 2364-7809 The evolution of case grammar Remi van Trijp language science press Remi van Trijp. 2017. The evolution of case grammar (Computational Models of Language Evolution 4). Berlin: Language Science Press. This title can be downloaded at: http://langsci-press.org/catalog/book/52 © 2017, Remi van Trijp Published under the Creative Commons Attribution 4.0 Licence (CC BY 4.0): http://creativecommons.org/licenses/by/4.0/ ISBN: 978-3-944675-45-9 (Digital) 978-3-944675-84-8 (Hardcover) 978-3-944675-85-5 (Softcover) ISSN: 2364-7809 DOI:10.17169/langsci.b52.182 Cover and concept of design: Ulrike Harbort Typesetting: Sebastian Nordhoff, Felix Kopecky, Remi van Trijp Proofreading: Benjamin Brosig, Marijana Janjic, Felix Kopecky Fonts: Linux Libertine, Arimo, DejaVu Sans Mono Typesetting software:Ǝ X LATEX Language Science Press Habelschwerdter Allee 45 14195 Berlin, Germany langsci-press.org Storage and cataloguing done by FU Berlin Language Science Press has no responsibility for the persistence or accuracy of URLs for external or third-party Internet websites referred to in this publication, and does not guarantee that any content on such websites is, or will remain, accurate or appropriate.
    [Show full text]