Spying on Your Neighbors: Fine-Grained Probing of Contextual Embeddings for Information About Surrounding Words

Total Page:16

File Type:pdf, Size:1020Kb

Spying on Your Neighbors: Fine-Grained Probing of Contextual Embeddings for Information About Surrounding Words Spying on your neighbors: Fine-grained probing of contextual embeddings for information about surrounding words Josef Klafka Allyson Ettinger Department of Psychology Department of Linguistics Carnegie Mellon University University of Chicago [email protected] [email protected] Abstract about what information these contextual embed- dings actually encode about the words around them. Although models using contextual word em- In a sentence like “The lawyer questioned the beddings have achieved state-of-the-art results judge”, does the contextual representation for ques- on a host of NLP tasks, little is known about tioned reflect properties of the subject lawyer? Of exactly what information these embeddings en- judge code about the context words that they are un- the object ? What determines the information derstood to reflect. To address this question, that a contextual embedding absorbs about its sur- we introduce a suite of probing tasks that en- rounding words? In this paper, we address these able fine-grained testing of contextual embed- questions by designing and implementing a suite dings for encoding of information about sur- of probing tasks, to test contextual embeddings rounding words. We apply these tasks to exam- for information about syntactic and semantic fea- ine the popular BERT, ELMo and GPT contex- tures of words in their contexts. We use controlled tual encoders, and find that each of our tested sentences of fixed structure, allowing us to probe information types is indeed encoded as contex- tual information across tokens, often with near- for information associated with word categories, perfect recoverability—but the encoders vary and to avoid confounds with particular vocabulary in which features they distribute to which to- items. We then apply these tests to examine the kens, how nuanced their distributions are, and distribution of contextual information across token how robust the encoding of each feature is to representations produced by contextual encoders distance. We discuss implications of these re- BERT (Devlin et al., 2019), ELMo (Peters et al., sults for how different types of models break 2018b), and GPT (Radford et al., 2018). down and prioritize word-level context infor- mation when constructing token embeddings. The contributions of this paper are twofold. First, we introduce a suite of novel probing tasks for test- 1 Introduction ing how encoders distribute contextual information across sentence tokens. All datasets and code are The field of natural language processing has re- available for follow-up testing.1 Second, we use cently seen impressive performance gains associ- these tests to shed light on the distribution of con- ated with the use of “contextual word embeddings”: text information in state-of-the-art encoders BERT, high-dimensional vectors that have access to infor- ELMo and GPT. We find that these models en- arXiv:2005.01810v1 [cs.CL] 4 May 2020 mation from the contexts of the words they repre- code each of our tested word features richly across sent. Models that use these contextual embeddings sentence tokens, often with perfect or near-perfect achieve state-of-the-art performance on a variety of recoverability, but the details of how the models natural language processing tasks, from question- distribute this information vary across encoders. In answering to natural language inference. As of particular, bidirectional models show more nuance writing, nearly all of the models on the SuperGLUE in information selectivity, while the deeper trans- leaderboard (Wang et al., 2019) use contextual em- former models show more robustness to distance. beddings in their architectures, most notably mod- Follow-up tests suggest that the effects cannot be els building on the BERT (Devlin et al., 2019) and chalked up to proximity, and that general word fea- Transformer XL (Dai et al., 2019) models. tures are encoded more robustly than word identity. Despite the clear power afforded by incorporat- 1Probing datasets and code available at ing context into word embeddings, little is known https://github.com/jklafka/context-probes. 2 Our approach almost at ceiling on syntactic tests. Testing BERT’s outputs on a range of semantic, syntactic and prag- Our tests address the following basic question: if matic information, Ettinger(2020) finds strong sen- we probe the contextual representation of a given sitivity to syntax, but clear limitations in areas of token in a sentence, how much information can we semantics and pragmatic/commonsense reasoning. recover about the other words in that sentence? For We complement this work with a direct focus on the example, if we create a contextual embedding for contextual token representations learned by mod- the word questioned in the sentence els pre-trained on language modeling, examining The lawyer questioned the judge the syntactic and semantic information that these how well can we extract information about the sub- embeddings capture about surrounding words. ject noun (lawyer)? What if we probe the object Most directly related to the present work are noun (judge) or determiners (the)? We develop studies using probing and other methods to analyze tasks to probe representations of each word for var- information in contextual token embeddings. Some ious types of information about the other words of this research (e.g. Tenney et al., 2019a; Jawa- of the sentence, allowing us to examine with fine har et al., 2019) finds that BERT encodes more granularity how contextual encoders distribute in- local, syntactic information at lower layers and formation about surrounding words. We complete more global, semantic information at higher lay- this investigation for each word in a set of fixed- ers. Peters et al.(2018a) find that encoders differ length sentences of pre-determined form, which in encoding strength for semantic features but all allows us to characterize behaviors based on word encode these features strongly where possible. He- categories (e.g., subjects versus verbs). Using this witt and Manning(2019) provide evidence that approach, we can examine how the distribution of contextual encoders capture sentence-level hierar- context information is impacted by a) the type of chical syntactic structures in their representations. information being encoded, and b) the properties Other work (Liu et al., 2019; Tenney et al., 2019b) of the word that the embedding corresponds to. finds that contextual word encoders struggle to learn fine-grained linguistic information in a variety 3 Related work of contexts. These papers have focused primarily on studying the ability of contextual embeddings Much work has been done on analyzing the in- to capture information about the full sentence, or formation captured by sentence encoders and lan- about phrases or dependencies of which those con- guage models in general. Classification-based prob- textual embeddings are a part. We focus on map- ing tasks have been used to analyze the contents of ping the precise distribution of context information sentence embeddings (Adi et al., 2016; Conneau across token embeddings, with a systematic, fine- et al., 2018; Ettinger et al., 2016), finding that these grained investigation of the information that each embeddings encode a variety of information about token encodes about each of its surrounding tokens. sentence structure, content, length, etc., though more tightly-controlled tasks suggest weaknesses 4 Probing for contextual information in capturing basic sentence meaning (Ettinger et al., 2018). Our work uses the same classification- For each of our probing tasks, we test for a particu- based probing methodology, but focuses on probing lar information type, formulated as a query about a token-level embeddings for context information. particular target word in the sentence—for instance, Other work has analyzed linguistic capacities “What is the animacy of the subject?” or “What is of language models by examining output probabil- the tense of the verb?”. We then apply these queries ities in context, emulating methods for studying to probe the embeddings for each word of the sen- human language processing. Much of this work tence in turn—we call this the probed word. For has studied sensitivity to syntactic dependencies example, a test with a probed word of verb, a target in recurrent neural network language models (e.g. word of subject noun, and an information type of Linzen et al., 2016; Wilcox et al., 2018; Chowd- animacy would ask the question: “What does the hury and Zamparelli, 2018; Gulordava et al., 2018; embedding of the verb tell us about the animacy Marvin and Linzen, 2018; Futrell et al., 2019). Us- of the subject noun?” We implement each test as a ing similar methods to test syntactic awareness in binary classification task (e.g., “animate” vs “inan- BERT, Goldberg(2019) finds the model to perform imate”), and train and test a multi-layer perceptron Task Example Label (subject) Number The lawyer betrayed the judge. SINGULAR The lawyers betrayed the judge. PLURAL (subject) Gender The waiter betrayed the judge. MASCULINE The waitress betrayed the judge. FEMININE (subject) Animacy The car betrayed the judge. INANIMATE The turtle betrayed the judge. ANIMATE Table 1: Example items from probing tasks for each noun information type. classifier using the embeddings of one probed word to ensure that no word but the target noun indicates category at a time as input for the task. In this sec- the number information of interest. tion, we describe the details of our probing datasets For each task we generate 4000 training and and tested information types. 1000 test transitive sentences. We generate sep- arate datasets for each target word within an in- 4.1 Dataset construction formation type—for example, generating separate subject animacy and object animacy datasets. We construct our datasets using generated transitive sentences with a fixed five-word structure: “DET 4.2 Information types SUBJ-N VB DET OBJ-N”, as in “The lawyer ques- We probe for three types of linguistic information tioned the judge”.
Recommended publications
  • Animacy and Alienability: a Reconsideration of English
    Running head: ANIMACY AND ALIENABILITY 1 Animacy and Alienability A Reconsideration of English Possession Jaimee Jones A Senior Thesis submitted in partial fulfillment of the requirements for graduation in the Honors Program Liberty University Spring 2016 ANIMACY AND ALIENABILITY 2 Acceptance of Senior Honors Thesis This Senior Honors Thesis is accepted in partial fulfillment of the requirements for graduation from the Honors Program of Liberty University. ______________________________ Jaeshil Kim, Ph.D. Thesis Chair ______________________________ Paul Müller, Ph.D. Committee Member ______________________________ Jeffrey Ritchey, Ph.D. Committee Member ______________________________ Brenda Ayres, Ph.D. Honors Director ______________________________ Date ANIMACY AND ALIENABILITY 3 Abstract Current scholarship on English possessive constructions, the s-genitive and the of- construction, largely ignores the possessive relationships inherent in certain English compound nouns. Scholars agree that, in general, an animate possessor predicts the s- genitive while an inanimate possessor predicts the of-construction. However, the current literature rarely discusses noun compounds, such as the table leg, which also express possessive relationships. However, pragmatically and syntactically, a compound cannot be considered as a true possessive construction. Thus, this paper will examine why some compounds still display possessive semantics epiphenomenally. The noun compounds that imply possession seem to exhibit relationships prototypical of inalienable possession such as body part, part whole, and spatial relationships. Additionally, the juxtaposition of the possessor and possessum in the compound construction is reminiscent of inalienable possession in other languages. Therefore, this paper proposes that inalienability, a phenomenon not thought to be relevant in English, actually imbues noun compounds whose components exhibit an inalienable relationship with possessive semantics.
    [Show full text]
  • The Use of Demonstrative Pronoun and Demonstrative Determiner This in Upper-Level Student Writing: a Case Study
    English Language Teaching; Vol. 8, No. 5; 2015 ISSN 1916-4742 E-ISSN 1916-4750 Published by Canadian Center of Science and Education The Use of Demonstrative Pronoun and Demonstrative Determiner this in Upper-Level Student Writing: A Case Study Katharina Rustipa1 1 Faculty of Language and Cultural Studies, Stikubank University (UNISBANK) Semarang, Indonesia Correspondence: Katharina Rustipa, UNISBANK Semarang, Jl. Tri Lomba Juang No.1 Semarang 50241, Indonesia. Tel: 622-4831-1668. E-mail: [email protected] Received: January 20, 2015 Accepted: February 26, 2015 Online Published: April 23, 2015 doi:10.5539/elt.v8n5p158 URL: http://dx.doi.org/10.5539/elt.v8n5p158 Abstract Demonstrative this is worthy to investigate because of the role of this as a common cohesive device in academic writing. This study attempted to find out the variables underlying the realization of demonstrative this in graduate-student writing of Semarang State University, Indonesia. The data of the study were collected by asking three groups of students (first semester, second semester, third semester students) to write an essay. The collected data were analyzed by identifying, classifying, calculating, and interpreting. Interviewing to several students was also done to find out the reasons underlying the use of attended and unattended this. Comparing the research results to those of the Michigan Corpus of Upper-level Student Paper (MICUSP) as proficient graduate-student writing was done in order to know the position of graduate-student writing of Semarang State University in reference to MICUSP. The conclusion of the research results is that most occurrences of demonstrative this are attended and these occurrences are stable across levels, similar to those in MICUSP.
    [Show full text]
  • How Do Young Children Acquire Case Marking?
    INVESTIGATING FINNISH-SPEAKING CHILDREN’S NOUN MORPHOLOGY: HOW DO YOUNG CHILDREN ACQUIRE CASE MARKING? Thesis submitted to the University of Manchester for the degree of Doctor in Philosophy in the Faculty of Medical and Human Sciences 2015 HENNA PAULIINA LEMETYINEN SCHOOL OF PSYCHOLOGICAL SCIENCES 2 Table of Contents LIST OF TABLES ......................................................................................................................... 6 LIST OF FIGURES ....................................................................................................................... 7 ABSTRACT ................................................................................................................................. 8 DECLARATION .......................................................................................................................... 9 COPYRIGHT STATEMENT .......................................................................................................... 9 ACKNOWLEDGEMENTS .......................................................................................................... 10 Chapter 1: General introduction to language acquisition research ...................................... 11 1.1. Generativist approaches to child language............................................................ 11 1.2. Usage-based approaches to child language........................................................... 14 1.3. The acquisition of morphology .............................................................................
    [Show full text]
  • Morphological Causatives Are Voice Over Voice
    Morphological causatives are Voice over Voice Yining Nie New York University Abstract Causative morphology has been associated with either the introduction of an event of causation or the introduction of a causer argument. However, morphological causatives are mono-eventive, casting doubt on the notion that causatives fundamentally add a causing event. On the other hand, in some languages the causative morpheme is closer to the verb root than would be expected if the causative head is responsible for introducing the causer. Drawing on evidence primarily from Tagalog and Halkomelem, I argue that the syntactic configuration for morphological causatives involves Voice over Voice, and that languages differ in whether their ‘causative marker’ spells out the higher Voice, the lower Voice or both. Keywords: causative, Voice, argument structure, morpheme order, typology, Tagalog 1. Introduction Syntactic approaches to causatives generally fall into one of two camps. The first view builds on the discovery that causatives may semantically consist of multiple (sub)events (Jackendoff 1972, Dowty 1979, Parsons 1990, Levin & Rappaport Hovav 1994, a.o.). Consider the following English causative–anticausative pair. The anticausative in (1a) consists of an event of change of state, schematised in (1b). The causative in (2a) involves the same change of state plus an additional layer of semantics that conveys how that change of state is brought about (2b). (1) a. The stick broke. b. [ BECOME [ stick STATE(broken) ]] (2) a. Pat broke the stick. b. [ Pat CAUSE [ BECOME [ stick STATE(broken) ]]] Word Structure 13.1 (2020): 102–126 DOI: 10.3366/word.2020.0161 © Edinburgh University Press www.euppublishing.com/word MORPHOLOGICAL CAUSATIVES ARE VOICE OVER VOICE 103 Several linguists have proposed that the semantic CAUSE and BECOME components of the causative are encoded as independent lexical verbal heads in the syntax (Harley 1995, Cuervo 2003, Folli & Harley 2005, Pylkkänen 2008, a.o.).
    [Show full text]
  • Determiners in Various Resources: Grammar Books, Coursebooks And
    Masaryk University Faculty of Education Department of English Language and Literature Determiners in various resources: Grammar books, coursebooks and online sources compared Bachelor thesis Brno 2016 Supervisor: Author: doc. PhDr. Renata Povolná, Ph.D. Pavla Fryštáková Declaration: „Prohlašuji, že jsem závěrečnou bakalářskou práci vypracovala samostatně, s využitím pouze citovaných literárních pramenů, dalších informací a zdrojů v souladu s Disciplinárním řádem pro studenty Pedagogické fakulty Masarykovy univerzity a se zákonem č. 121/2000 SB., o právu autorském, o právech souvisejících s právem autorským a o měně některých zákonů (autorský zákon), ve znění pozdějších předpisů.“ V Brně dne 16. 3. 2016 ………………………………. Pavla Fryštáková Acknowledgement: I would like to thank doc. PhDr. Renata Povolná, Ph.D., for help and advice provided during the supervision of my bachelor thesis. Contents 1 Introduction ............................................................................................................. 5 2 Reference to determiners in grammar books ....................................................... 7 2.1 Grammar books referring to determiners as one topic ....................................... 7 2.1.1 Swan (1996) ................................................................................................ 7 2.1.2 Leech and Svartvik (1975) .......................................................................... 9 2.1.3 Greenbaum and Quirk (1990) ................................................................... 16 2.1.4
    [Show full text]
  • The Definite Determiner in Early Middle English
    43 The defi nite determiner in Early Middle English: What happened with þe? Cynthia L. Allen Australian National University Abstract This paper offers new data bearing on the question of when English developed a defi nite article, distinct from the distal demonstrative. It focuses primarily on one criterion that has been used in dating this development, namely the inability of þe (Modern English the, the refl ex of the demonstrative se) to be used as a pronoun. I argue that this criterion is not a satisfactory one and propose a treatment of þe as a form which could occupy either the head D of DP or the specifi er of DP. This is an approach consistent with Crisma’s (2011) position that a defi nite article emerged within the Old English (OE) period. I offer a new piece of evidence supporting Crisma’s demonstration of a difference between OE poetry and the prose of the ninth century and later. 1. Introduction: dating the defi nite article A long-standing problem in historical English syntax is dating the emergence of the defi nite article. A major diffi culty here, noted by Johanna Wood (2003) and others, is defi ning exactly what we mean by an ‘article.’ Millar (2000:304, note 11) comments that we might say that Modern English does not have a ‘true’ article, but only a ‘weakly demonstrative defi nite determiner’ on Himmelmann’s (1997) proposed path of development for defi nite articles. However, no one can doubt that English has moved along this path from a demonstrative determiner to something that has become more purely grammatical, with loss of deictic properties.
    [Show full text]
  • Serial Verb Constructions Revisited: a Case Study from Koro
    Serial Verb Constructions Revisited: A Case Study from Koro By Jessica Cleary-Kemp A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Linguistics in the Graduate Division of the University of California, Berkeley Committee in charge: Associate Professor Lev D. Michael, Chair Assistant Professor Peter S. Jenks Professor William F. Hanks Summer 2015 © Copyright by Jessica Cleary-Kemp All Rights Reserved Abstract Serial Verb Constructions Revisited: A Case Study from Koro by Jessica Cleary-Kemp Doctor of Philosophy in Linguistics University of California, Berkeley Associate Professor Lev D. Michael, Chair In this dissertation a methodology for identifying and analyzing serial verb constructions (SVCs) is developed, and its application is exemplified through an analysis of SVCs in Koro, an Oceanic language of Papua New Guinea. SVCs involve two main verbs that form a single predicate and share at least one of their arguments. In addition, they have shared values for tense, aspect, and mood, and they denote a single event. The unique syntactic and semantic properties of SVCs present a number of theoretical challenges, and thus they have invited great interest from syntacticians and typologists alike. But characterizing the nature of SVCs and making generalizations about the typology of serializing languages has proven difficult. There is still debate about both the surface properties of SVCs and their underlying syntactic structure. The current work addresses some of these issues by approaching serialization from two angles: the typological and the language-specific. On the typological front, it refines the definition of ‘SVC’ and develops a principled set of cross-linguistically applicable diagnostics.
    [Show full text]
  • 6 the Major Parts of Speech
    6 The Major Parts of Speech KEY CONCEPTS Parts of Speech Major Parts of Speech Nouns Verbs Adjectives Adverbs Appendix: prototypes INTRODUCTION In every language we find groups of words that share grammatical charac- teristics. These groups are called “parts of speech,” and we examine them in this chapter and the next. Though many writers onlanguage refer to “the eight parts of speech” (e.g., Weaver 1996: 254), the actual number of parts of speech we need to recognize in a language is determined by how fine- grained our analysis of the language is—the more fine-grained, the greater the number of parts of speech that will be distinguished. In this book we distinguish nouns, verbs, adjectives, and adverbs (the major parts of speech), and pronouns, wh-words, articles, auxiliary verbs, prepositions, intensifiers, conjunctions, and particles (the minor parts of speech). Every literate person needs at least a minimal understanding of parts of speech in order to be able to use such commonplace items as diction- aries and thesauruses, which classify words according to their parts (and sub-parts) of speech. For example, the American Heritage Dictionary (4th edition, p. xxxi) distinguishes adjectives, adverbs, conjunctions, definite ar- ticles, indefinite articles, interjections, nouns, prepositions, pronouns, and verbs. It also distinguishes transitive, intransitive, and auxiliary verbs. Writ- ers and writing teachers need to know about parts of speech in order to be able to use and teach about style manuals and school grammars. Regardless of their discipline, teachers need this information to be able to help students expand the contexts in which they can effectively communicate.
    [Show full text]
  • Positional Verbs in Nen
    CORE Metadata, citation and similar papers at core.ac.uk Provided by The Australian National University Positional Verbs in Nen Nicholas Evans AUSTRALIAN NATIONAL UNIVERSITY In this paper, I lay out the workings of the rather unusual system of positional verbs found in Nen, a language of the Morehead-Maro family in Morehead dis- trict, Western Province, Papua New Guinea. Nen is unusual in its lexicalization patterns: it has very few verbs that are intransitive, with most verbs that tend to be intransitive cross-linguistically realized as morphologically middle verbs, including ‘talk’, ‘work’, ‘descend’, and so on. Within the fifty attested morpho- logically intransitive verbs, forty-five comprise an interesting class of “positional verbs,” the subject of this paper; the others are ‘be’, its derivatives ‘come’ and ‘go’ (lit. ‘be hither’ and ‘be thither’), and ‘walk’. Positional verbs denote spatial positions and postures like ‘be sitting’, ‘be up high’, ‘be erected (of a building)’, ‘be open’, ‘be in a tree-fork’, ‘be at the end of something’. Positional verbs differ from regular verbs in lacking in¿nitives, in possessing a special “stative” aspect inÀection and an unusual system for building a four- way number system (building large plurals by combining singular and dual markers), and in participating in a productive three-way alternation between positional statives (like ‘be high’), placement transitives (like ‘put up high’), and get-into-position middles (like ‘get into a high position’). The latter two types are more like normal verbs (for example, they possess in¿nitives and participate in the normal TAM series), but they are formally derived from the positionals.
    [Show full text]
  • Coding Practices Used in the Project Optimal Typology of Determiner Phrases
    Coding practices used in the project Optimal Typology of Determiner Phrases Gregory Garretson This document is one section of the coding manual used by the project Optimal Typology of Determiner Phrases, based at Boston University. It is being made available due to popular demand. If you have questions about the rest of the manual, or the contents of this section, please contact Gregory Garretson at [email protected] or [email protected]. Please cite this document as follows: Garretson, Gregory. 2004. Coding practices used in the project Optimal Typology of Determiner Phrases. Unpublished manuscript, Boston University, Boston, MA. http://npcorpus.bu.edu/html/documentation/index.html. Contents 5.1 General practices 5.2 Construction type 5.3 Which examples to count? 5.4 Expression type 5.5 Definiteness 5.6 Animacy 5.7 Weight 5.0 Applying tags to the examples This section of the coding manual details the policies we have adopted in adding tags to the examples, once they have been bracketed. Because of the wondrous variety of language, coding a corpus is not nearly as straightforward as we might like. This section will help you to make the often difficult decisions about which tags to apply when. Section 5.1 gives general guidance for coding; Sections 5.2 and on discuss various classes of tags in detail. Starting in Section 5.2, each section will begin with a list of the tags discussed in that section, so you can easily find the discussion of a given tag by looking at these lines. They look like this: [Tags discussed in this section: INCL, EXCL, PND, PWD, CMPD, IDIOM, NAME, PART, PART2, DINS, SORT] Please remember that in addition to the discussion in this section, there is a brief description of each tag given in Section 2.
    [Show full text]
  • Classifiers Determiners Yicheng Wu Adams Bodomo
    REMARKS AND REPLIES 487 Classifiers ϶ Determiners Yicheng Wu Adams Bodomo Cheng and Sybesma (1999, 2005) argue that classifiers in Chinese are equivalent to a definite article. We argue against this position on empirical grounds, drawing attention to the fact that semantically, syntactically, and functionally, Chinese classifiers are not on the same footing as definite determiners. We also show that compared with Cheng and Sybesma’s ClP analysis of Chinese NPs (in particular, Cantonese NPs, on which their proposal crucially relies), a consistent DP analysis is not only fully justified but strongly supported. Keywords: classifiers, open class, definite determiners, closed class, Mandarin, Cantonese 1 Introduction While it is often proposed that the category DP exists not only in languages with determiners such as English but also in languages without determiners such as Chinese (see, e.g., Pan 1990, Tang 1990a,b, Li 1998, 1999, Cheng and Sybesma 1999, 2005, Simpson 2001, 2005, Simpson and Wu 2002, Wu 2004), there seems to be no consensus about which element (if any) in Chinese is the possible counterpart of a definite determiner like the in English. In their influential 1999 article with special reference to Mandarin and Cantonese, Cheng and Sybesma (hereafter C&S) declare that ‘‘both languages have the equivalent of a definite article, namely, classifiers’’ (p. 522).1 Their treatment of Chinese classifiers as the counterpart of definite determiners is based on the following arguments: (a) both can serve the individualizing/singularizing function; (b) both can serve the deictic function. These arguments and the conclusion drawn from them have been incorporated into C&S 2005, C&S’s latest work on the classifier system in Chinese.
    [Show full text]
  • AN INTRODUCTORY GRAMMAR of OLD ENGLISH Medieval and Renaissance Texts and Studies
    AN INTRODUCTORY GRAMMAR OF OLD ENGLISH MEDievaL AND Renaissance Texts anD STUDies VOLUME 463 MRTS TEXTS FOR TEACHING VOLUme 8 An Introductory Grammar of Old English with an Anthology of Readings by R. D. Fulk Tempe, Arizona 2014 © Copyright 2020 R. D. Fulk This book was originally published in 2014 by the Arizona Center for Medieval and Renaissance Studies at Arizona State University, Tempe Arizona. When the book went out of print, the press kindly allowed the copyright to revert to the author, so that this corrected reprint could be made freely available as an Open Access book. TABLE OF CONTENTS PREFACE viii ABBREVIATIONS ix WORKS CITED xi I. GRAMMAR INTRODUCTION (§§1–8) 3 CHAP. I (§§9–24) Phonology and Orthography 8 CHAP. II (§§25–31) Grammatical Gender • Case Functions • Masculine a-Stems • Anglo-Frisian Brightening and Restoration of a 16 CHAP. III (§§32–8) Neuter a-Stems • Uses of Demonstratives • Dual-Case Prepositions • Strong and Weak Verbs • First and Second Person Pronouns 21 CHAP. IV (§§39–45) ō-Stems • Third Person and Reflexive Pronouns • Verbal Rection • Subjunctive Mood 26 CHAP. V (§§46–53) Weak Nouns • Tense and Aspect • Forms of bēon 31 CHAP. VI (§§54–8) Strong and Weak Adjectives • Infinitives 35 CHAP. VII (§§59–66) Numerals • Demonstrative þēs • Breaking • Final Fricatives • Degemination • Impersonal Verbs 40 CHAP. VIII (§§67–72) West Germanic Consonant Gemination and Loss of j • wa-, wō-, ja-, and jō-Stem Nouns • Dipthongization by Initial Palatal Consonants 44 CHAP. IX (§§73–8) Proto-Germanic e before i and j • Front Mutation • hwā • Verb-Second Syntax 48 CHAP.
    [Show full text]