The Assignment of Grammatical Gender in German:
Testing Optimal Gender Assignment Theory
Emma Charlotte Corteen
Trinity Hall
September 2018
This dissertation is submitted for the degree of Doctor of Philosophy
The Assignment of Grammatical Gender in German: Testing Optimal Gender Assignment Theory Emma Charlotte Corteen
Abstract
The assignment of grammatical gender in German is a notoriously problematic phenomenon due to the apparent opacity of the gender assignment system (e.g. Comrie 1999: 461). Various models of German gender assignment have been proposed (e.g. Spitz 1965, Köpcke 1982, Corbett 1991, Wegener 1995), but none of these is able to account for all of the German data.
This thesis investigates a relatively under-explored, recent approach to German gender assignment in the form of Optimal Gender Assignment Theory (OGAT), proposed by Rice (2006). Using the framework of Optimality Theory, OGAT claims that the form and meaning of a noun are of equal importance with respect to its gender. This is formally represented by the crucial equal ranking of all gender assignment constraints in a block of GENDER FEATURES, which is in turn ranked above a default markedness hierarchy *NEUTER » *FEMININE » *MASCULINE, which is based on category size.
A key weakness of OGAT is that it does not specify what constitutes a valid GENDER FEATURES constraint. This means that, in theory, any constraint can be proposed ad hoc to ensure that an OGAT analysis yields the correct result. In order to prevent any constraints based on ‘postfactum rationalisations’ (Comrie 1999: 461) from being included in the investigation, the GENDER FEATURES constraints which have been proposed in the literature for German are assessed according to six criteria suggested by Enger (2009), which seek to determine whether there is independent evidence for a GENDER FEATURES constraint.
Using an independently-verified constraint set, OGAT is then tested on a sample of 592 nouns systematically selected from the Duden Rechtschreibung. The results indicate that OGAT is relatively successful in its predictions when compared to other approaches but that it cannot account fully for the sample data. Accordingly, a revised version of the theory is proposed (OGAT II), which involves the ranking of certain GENDER FEATURES constraints. It is found that OGAT II is able to account for the genders of around 95% of nouns in the sample. A number of specific aspects of OGAT II are then tested by means of an experiment in which native German speakers are required to assign genders to 26 pseudo-nouns. The results suggest that OGAT II comes the closest of the systems discussed in the literature to modelling how native speakers assign gender in German.
i
Declaration
This dissertation is the result of my own work and includes nothing which is the outcome of work done in collaboration except as declared in the Preface and specified in the text. It is not substantially the same as any that I have submitted, or, is being concurrently submitted for a degree or diploma or other qualification at the University of Cambridge or any other University or similar institution except as declared in the Preface and specified in the text. I further state that no substantial part of my dissertation has already been submitted, or, is being concurrently submitted for any such degree, diploma or other qualification at the University of Cambridge or any other University or similar institution except as declared in the Preface and specified in the text. This dissertation does not exceed the prescribed word limit for the Degree Committee of the Faculty of Modern and Medieval Languages.
ii
Acknowledgements
Firstly, I would like to thank the Arts and Humanities Research Council for funding this project. I would also like to thank the Cambridge Trust and the Pigott Scholars Programme for their offers of funding,
and the Vice Chancellor for my Honorary Vice Chancellor’s Award.
My greatest thanks go to my supervisor, Dr Sheila Watts, for her continual support and guidance throughout this project. I am extremely grateful not only for her invaluable academic insight, but also for her never-ending patience, positivity and encouragement.
I am also grateful to all of the other members of the Department of German and Dutch and the Department of Theoretical and Applied Linguistics at Cambridge University who provided me with advice and support throughout my time at Cambridge. Particular thanks go to Dr Bert Vaux, Dr Napoleon Katsos, Dr David Willis, Dr Jim Blevins, Professor Sarah Colvin and Dr Simon Pickl.
I would also like to thank the Cambridge Statistics Clinic at the Centre for Mathematical Sciences for their assistance with my statistical queries.
My thanks also go to the many native German speakers who I consulted throughout the project, and particularly the 100 who took part in my experiment.
Additionally, I would like to thank members of staff at the Schools of Modern Languages and English Literature, Language and Linguistics at Newcastle University, where my studies of Linguistics began. In particular, I would like to thank Dr Carol Fehringer and Andrea Wilczynski for nurturing my interest in German linguistics and encouraging me to apply for postgraduate study.
I would also like to thank my friends and family for their support throughout the PhD writing process. Thank you to those of you who at least sometimes pretended to be interested in German grammar.
Last but by no means least, my thanks go to Eve, without whom I would not have been able to complete this project.
iii
Contents
Chapter 1: Introduction ............................................................................................................... 1
1.1 Grammatical gender in German ......................................................................................... 1 1.2 Gender assignment in German ........................................................................................... 2 1.3 Gender assignment principles ............................................................................................ 7
1.3.1 Morphological principles .................................................................................... 7
1.3.1.1 Derivation via affixation...................................................................... 8 1.3.1.2 Implicit derivation............................................................................... 9 1.3.1.3 Conversion .......................................................................................... 9 1.3.1.4 Compounds....................................................................................... 10 1.3.1.5 Reductions ........................................................................................ 10 1.3.1.6 Inflection........................................................................................... 11 1.3.1.7 Evidence for morphological principles.............................................. 14
1.3.2 Phonological principles ..................................................................................... 15
1.3.2.1 Word ending ..................................................................................... 15 1.3.2.2 Other phonological factors ............................................................... 16 1.3.2.3 Evidence for phonological principles................................................ 17
1.3.3 Semantic principles........................................................................................... 18
1.3.3.1 The Natural Gender Principle (NGP)................................................. 18 1.3.3.2 Other semantic classes .................................................................... 19 1.3.3.3 Levels of abstraction ........................................................................ 20 1.3.3.4 Evidence for semantic principles ..................................................... 22
1.4 Interaction of gender assignment principles .................................................................... 23
1.4.1 Semantics » form .............................................................................................. 24 1.4.2 Form » semantics.............................................................................................. 26 1.4.3 Morphology » semantics » phonology ............................................................. 27 1.4.4 Hierarchy of individual principles ..................................................................... 28 1.4.5 Optimal Gender Assignment Theory (OGAT).................................................... 29
1.4.5.1 Theory outline .................................................................................. 29 1.4.5.2 Strengths of OGAT ........................................................................... 34 1.4.5.3 Weaknesses and aspects of OGAT for further investigation ............ 37 1.4.5.4 Previous testing of OGAT.................................................................. 45
1.4.5.4.1 OGAT and German ............................................................ 45 1.4.5.4.2 OGAT and other languages ............................................... 49
1.5 Summary and plan for investigation................................................................................. 52
Chapter 2: Evaluating GENDER FEATURES constraints ...................................................................... 55
2.1 Introduction ..................................................................................................................... 55 2.2 The noun sample and the constraints............................................................................... 56 2.3 Criterion 1: Coverage of candidate nouns ........................................................................ 60 2.4 Criterion 2: Typological parallels....................................................................................... 70 2.5 Criterion 3: Parallels outside of gender assignment......................................................... 78 2.6 Criterion 4: Clarity of semantic features........................................................................... 85
iv
2.7 Criterion 5: Accounting for exceptions ............................................................................ 91 2.8 Criterion 6: Productivity.................................................................................................... 95
2.8.1 Neologisms........................................................................................................ 97 2.8.2 Historical gender changes............................................................................... 100
2.9 Refinement of constraint set .......................................................................................... 103 2.10 Summary of results ....................................................................................................... 109
Chapter 3: Analysis – testing OGAT .......................................................................................... 113
3.1 Introduction and preliminary analysis ........................................................................... 113 3.2 Constraint-related issues ............................................................................................... 118
3.2.1 Constraints with multiple versions ................................................................. 118 3.2.2 Constraints whose application is problematic ................................................ 121
3.2.2.1 DEVERBAL STEM→*F,*N ...................................................................... 121 3.2.2.2 SUPERORDINATES→*M,*F .................................................................... 129 3.2.2.3 DEVERBAL -T→*M,*N ......................................................................... 135 3.2.2.4 -MA→*M,*F ...................................................................................... 137
3.3 Noun-related issues ....................................................................................................... 138
3.3.1 Loanwords........................................................................................................ 138
3.3.1.1 Donor gender................................................................................... 140 3.3.1.2 Semantic equivalence ..................................................................... 141 3.3.1.3 Homonymy/partial homonymy ...................................................... 147 3.3.1.4 Default gender ................................................................................ 149 3.3.1.5 Summary ......................................................................................... 150
3.3.2 Nouns with multiple meanings or forms ........................................................ 150
3.3.2.1 Meaning variation............................................................................ 151
3.3.2.1.1 Multiple meanings vs. one meaning ............................... 151 3.3.2.1.2 Multiple meanings vs. one meaning in the sample ........ 153 3.3.2.1.3 Polysemy vs. homonymy ................................................ 153 3.3.2.1.4 Polysemy vs. homonymy in the sample .......................... 156 3.3.2.1.5 Approaches to polysemy ................................................ 157 3.3.2.1.6 Polysemy in the sample .................................................. 160
3.3.2.2 Form variation ................................................................................. 162
3.3.3 Complex nouns and “strong rules”.................................................................. 163
3.3.3.1 Heads .............................................................................................. 164
3.3.3.1.1 Head identification ......................................................... 164 3.3.3.1.2 Head position................................................................... 165
3.3.3.2 Multi-word units and OGAT............................................................. 166
3.3.3.2.1 Compounds...................................................................... 166 3.3.3.2.2 Lexicalised phrases .......................................................... 167 3.3.3.2.3 Multi-word units and OGAT............................................. 168
3.3.3.3 Nouns derived via affixation and OGAT........................................... 169
3.3.4 Subset of 55 nouns .......................................................................................... 173
3.3.4.1 Placenames ..................................................................................... 173 3.3.4.2 Personal names ............................................................................... 178
v
3.3.4.3 Plural terms ..................................................................................... 181 3.3.4.4 Regional terms ................................................................................ 182 3.3.4.5 Archaic nouns .................................................................................. 183 3.3.4.6 Homonyms ...................................................................................... 184 3.3.4.7 Adjectival nouns .............................................................................. 185 3.3.4.8 Multiple-gender nouns .................................................................... 186
3.4 Remaining issues............................................................................................................. 191
3.4.1 New constraints ............................................................................................... 192 3.4.2 Other nouns in the sample ............................................................................. 196 3.4.3 Testing alternative default hierarchies ........................................................... 198
3.5 Effectiveness of OGAT vs. OGAT II and exceptions ........................................................ 200
3.5.1 OGAT vs. OGAT II ............................................................................................. 200 3.5.2 Exceptions ....................................................................................................... 201 3.5.3 OGAT II vs. other approaches .......................................................................... 204 3.5.4 Economising the theory .................................................................................. 205
3.6 Summary ......................................................................................................................... 207
Chapter 4: Testing the predictive power of OGAT II .................................................................. 209
4.1 Introduction ................................................................................................................... 209 4.2 Experiment design ......................................................................................................... 209 4.3 Method .......................................................................................................................... 213 4.4 Results ........................................................................................................................... 216 4.5 Discussion ...................................................................................................................... 219 4.6 Conclusions and implications of results......................................................................... 232
Chapter 5: Conclusions and directions for further work ............................................................ 241
Reference list ...................................................................................................................................... 247 Appendices.......................................................................................................................................... 273
Appendix A: The sample ........................................................................................................ 273 Appendix B: The constraints .................................................................................................. 277 Appendix C: Sources for criterion 1 ....................................................................................... 303 Appendix D: Sources for criterion 4....................................................................................... 309 Appendix E: Summary of results from Chapter 2 .................................................................. 313 Appendix F: Summary of results from Chapter 3 .................................................................. 319 Appendix G: Experiment........................................................................................................ 339
- vi
- vii
Chapter 1: Introduction
‘ A person who has not studied German can form no idea of what a perplexing language it is […]. Every noun has a gender, and there is no sense or system in the distribution; so the gender of each must be learned separately and by heart. There is no other way. To do this one has to have a memory like a memorandum-book. In German, a young lady has no sex, while a turnip has. Think what overwrought reverence that shows for the turnip, and what
callous disrespect for the girl.’
– Mark Twain, The Awful German Language (1879)
1.1 Grammatical gender in German
Gender is a grammatical feature present in around half of the world’s languages which separates
nouns into classes (Corbett 2013, Audring 2016). In German, there are three grammatical genders: masculine, feminine and neuter. Almost all German nouns belong to one of these categories. For
instance, Tisch ‘table’ is masculine, Lampe ‘lamp’ is feminine, and Fenster ‘window’ is neuter. In
German, gender is said to be ‘covert’ because the gender of a noun is not marked on the noun (the
“controller”) itself (Corbett 1991: 62). In accordance with Hockett’s (1958: 231) well-known definition of gender, ‘genders are classes of nouns reflected in the behaviour of associated words’, gender in
German is visible only on agreeing elements (known as “targets”) such as determiners, attributive adjectives, relative pronouns and personal pronouns. For example, the masculine gender of Tisch is visible on the determiner in der Tisch ‘the.MASC table’.
The gender of a German noun can, however, only be seen on agreeing targets when the noun is in the
singular. In the plural, gender is said to be ‘neutralised’, since all targets are marked identically (e.g.
Bauch 1971: 412, Weinrich 2007: 326, Kürschner & Nübling 2011: 363). Accordingly, the small subset of nouns which appear only in the plural and have no singular variant, known as pluralia tantum, are said to have no gender, e.g. Eltern ‘parents’, Ferien ‘holidays’, Leute ‘people’ (e.g. Wegener 2000: 541, Nübling 2008: 289).
For all nouns with a singular form, however, i.e. most German nouns, gender is obligatory (e.g. MacKay 1999: 73, Hickey 2000: 639). The vast majority of German nouns belong unequivocally to just one gender. There are, however, two main categories of nouns for which this is not the case: “multiplegender” nouns and “hybrid” nouns.
So-called “multiple-gender” nouns (MGNs) are nouns which can be used with more than one gender
without any effect on the noun’s meaning, such as der/das Sims ‘window sill’ (Corbett 1991: 181).
MGNs are therefore not to be confused with homonyms such as die Leiter ‘ladder’ and der Leiter ‘leader’. The gender of a MGN may vary according to factors such as region, e.g. die Butter (Standard German) vs. der Butter (Southern Germany and Austria), or register, e.g. der Virus (everyday speech)
1vs. das Virus (scientific contexts) (e.g. Hickey 2000: 625, Duden Online-Wörterbuch 2018f). Many MGNs can also be found amongst relatively recent loanwords, e.g. der/das Laptop (e.g. Hoberg 2004: 79). MGNs constitute just 1.3% of nouns listed in the Duden dictionary (the standard reference work for German) (Duden Sprachwissen 2017).
“Hybrid” nouns, are nouns which ‘neither […] take the agreements of one consistent agreement pattern nor belong to two or more genders’ (Corbett 1991: 183). That is to say, while the hybrid noun itself is said to belong to just one gender, its targets may display agreement with more than one gender. In German, hybrid nouns are usually those which refer to people but whose grammatical gender does not match the conceptual gender of the referent, e.g. Mädchen ‘girl’. In line with Corbett’s (1979, 1991, 2006) Agreement Hierarchy, all attributive targets used in conjunction with Mädchen agree with the noun’s grammatical gender (neuter), but anaphoric personal pronouns may agree with either the noun’s grammatical gender or the conceptual gender of the referent (feminine) (e.g. Batliner 1984: 850, Semplicini 2012: 141). It has been argued that the kind of agreement displayed by an anaphoric personal pronoun may be influenced by factors such as the distance between the controller noun and the personal pronoun (Köpcke, Panther & Zubin 2010) or, in the case of Mädchen specifically, the age of the referent in question (Braun & Haig 2010). Due to the presence of hybrid nouns in German, it is important to look at attributive targets only in order to reliably ascertain the grammatical gender of a noun.