Representation of Inflected Nouns in the Internal Lexicon

Total Page:16

File Type:pdf, Size:1020Kb

Representation of Inflected Nouns in the Internal Lexicon Memory & Cognition 1980, Vol. 8 (5), 415423 Represeritation of inflected nouns in the internal lexicon G. LUKATELA, B. GLIGORIJEVIC, and A. KOSTIC University ofBelgrade, Belgrade, Yugoslavia and M.T.TURVEY University ofConnecticut, Storrs, Connecticut 06268 and Haskins Laboratories, New Haven, Connecticut 06510 The lexical representation of Serbo-Croatian nouns was investigated in a lexical decision task. Because Serbo-Croatian nouns are declined, a noun may appear in one of several gram­ matical cases distinguished by the inflectional morpheme affixed to the base form. The gram­ matical cases occur with different frequencies, although some are visually and phonetically identical. When the frequencies of identical forms are compounded, the ordering of frequencies is not the same for masculine and feminine genders. These two genders are distinguished further by the fact that the base form for masculine nouns is an actual grammatical case, the nominative singular, whereas the base form for feminine nouns is an abstraction in that it cannot stand alone as an independent word. Exploiting these characteristics of the Serbo­ Croatian language, we contrasted three views of how a noun is represented: (1) the independent­ entries hypothesis, which assumes an independent representation for each grammatical case, reflecting its frequency of occurrence; (2) the derivational hypothesis, which assumes that only the base morpheme is stored, with the individual cases derived from separately stored inflec­ tional morphemes and rules for combination; and (3) the satellite-entries hypothesis, which assumes that all cases are individually represented, with the nominative singular functioning as the nucleus and the embodiment of the noun's frequency and around which the other cases cluster uniformly. The evidence strongly favors the satellite-entries hypothesis. Inflection is the major grammatical device of Serbo­ see that most nouns were masculine and that the nomi­ Croatian, Yugoslavia's principal language. In general, the native singular was the most popular grammatical case. grammatical cases of nouns are formed by adding a The italicized numbers are normalized percentages and suffix to a root morpheme, where the suffix is of the can be read as follows, taking the masculine gender as an vowel, vowel-consonant, or vowel-consonant-vowel type. example. For any given masculine noun that occurs in Less frequently, inflection involves additional processes, the language with frequency f, the nominative singular such as vowel deletion and consonant palatalization. form of that noun occurs with a frequency of approxi­ The grammatical cases of Serbo-Croatian nouns mately .29f, the genitive singular form with a frequency produced by inflection are not equal in their frequency of approximately .19f, the dative singular form with a of occurrence. Table I summarizes the frequency frequency of approximately .02f, and so on. In short, analysis of D. Kostic (1965) on a corpus of approxi­ the normalized percentage for a given grammatical case mately 2 million Serbo-Croatian words appearing in of a given gender is the likelihood that, when a noun of the daily press and contemporary poetry. The non­ that gender appears, it appears in that particular case. italicized numbers are actual percentages. Thus, for all The question of interest to the present paper is how nouns in the corpus, 12.83% were masculine nouns in the inflected Serbo-Croatian nouns are represented in the nominative singular, 7.8% were feminine nouns in lexical memory. Following MacKay (1978) and Manelis the genitive singular, .13% were neuter nouns in the and Tharp (1977), we can distinguish two hypotheses instrumental plural, and so on. Reading the totals, we about the lexical representation of words with common morphological stems. According to the independent­ This research was supported in oart by NICHD Grant HD­ entries hypothesis, the individual grammatical forms of a 08495 to the University of Belgrade and in part by NICHD Serbo-Croatian noun would be represented in the lexicon Grant HD.{) 1994 to Haskins Laboratories. Reprint requests by independent representations, one internal representa­ can be sent either to G. Lukatela, Faculty of Electrical Engi­ neering, Bulevar Revolucije 73, Beograd, Yugoslavia, or to tion for each grammatical form. On the derivational M. T. Turvey, Haskins Laboratories, 270 Crown Street, New hypothesis, rather than all the forms of a given noun Haven, Connecticut 06510. being instantiated in the internal lexicon, there would be Copyright 1980 Psychonomic Society, Inc. 415 0090-502X/80/050415-Q9$O 1.15/0 416 LUKATELA, GLIGORIJEVIC, KOSTIC, and TURVEY Table I Case Frequencies in Percentages Singular Plural Masculine Feminine Neuter Total Masculine. Feminine Neuter Total Nominative 12.83 8.84 2.88 24.55 3.33 3.58 .69 7.60 28.89 22.56 20.44 7.50 9.14 4.30 Genitive 8.56 7.88 3.47 19.91 3.96 3.22 .61 7.79 19.27 20.11 24.63 8.92 8.22 4.33 Dative .87 .38 .31 1.56 .28 .16 .04 .47 1.96 .97 2.20 .63 .41 .28 Accusative 5.49 5.48 2.55 13.52 2.21 2.75 .73 5.69 12.36 13.99 18.10 4.98 7.02 5.18 Instrumental 1.90 1.94 .86 4.70 .60 .80 .13 1.46 4.28 4.95 6.10 1.35 1.86 .92 Locative 3.77 3.42 1.61 8.80 .61 .80 .21 1.62 8.48 8.73 11.43 1.37 2.04 1.48 Total 33.42 27.94 11.68 73.04 10.99 11.24 2.41 23.64 75.25 71.31 82.89 24.75 28.69 17.11 Note-This tableisadopted from D. Kostic (1965). Figures in italics represent the normalizedpercentages, asrelatedto the particular gender. Percen tages do not add to 100% because the rarely occurring vocative casehasbeenomitted. but one instantiation, probably of the noun's root successful (meaning that the letter string is a nonaffixed morpheme. There would also be in memory only a word), then a positive response can be initiated. How­ single instantiation of the set of inflectional morphemes. ever, if the search is unsuccessful, then the letter string Appropriate combinations of the root morpheme and is decomposed and the combination of root and affix inflections would be determined by separately stored is tested for its validity. Obviously, on the decomposition­ syntactic rules. second argument, lexical decision should be slower for There have been relatively few direct contrasts of affixed words. The Manelis and Tharp (1977) investi­ the two hypotheses for English lexical items, and the gation failed to find a difference between affixed and results have been largely equivocal. Manelis and Tharp nonaffixed words in either direction, a result that (1977) compared lexical decision ("Is this letter string favored the independent-entries hypothesis over either a word?") times for pairs of affixed words (words version of the decompositional hypothesis. consisting of two morphemes, a root morpheme and a However, the failure to find evidence for morpho­ suffix) with lexical decision times for pairs of nonaffixed logical decomposition with suffixed words contrasts words (words consisting of a single morpheme). Manelis with the provision of such evidence by Taft and Forster and Tharp (1977) predicted two possible outcomes from (1975) for prefixed words. These investigators reported the derivational or, as they termed it, decompositional that rejecting real roots (for example, SULTS as in hypothesis. For a given letter string, decomposition into INSULTS) as words took longer than rejecting false root and ending could be an obligatory first step, with roots (for example, NINGS as in INNINGS) as words. lexical search for the whole item a contingent later The interpretation given was that real stems. would be step; or lexical search for the whole item could be the found in the lexicon and a subsequent check would be initial obligatory step, with decomposition occurring needed to determine that these lexical entries do not later and dependent upon failure to find the whole item constitute words in the absence of an appropriate in memory. Consider the prediction that follows from prefix. the notion that decomposition occurs first. A word, A further demonstration of morphological decompo­ whether it be affixed or nonaffixed, is partitioned into sition is reported by MacKay (1978), although his exper­ root and ending. A test is then made to determine iment is distinguished from the experiment described the validity of the combination as an affixed word. If above in that it looks at the production of words rather the combination proves valid, a positive response is than at their perception. Subjects heard verbs (e.g., con­ initiated; if it proves invalid (meaning that the word is clude, decide) that they had to nominalize (conclusion, nonaffixed), a search of the lexicon is conducted for the decision) as rapidly as possible (MacKay, 1978). It was nondecomposed letter string. In brief, with everything shown that certain nominalizations took longer than else equal, the decomposition-first argument predicts others; precisely, the more complicated the derivational faster lexical decision for affixed words than for non­ process (the more steps intervening between verb form affixed words. The contrary prediction follows from the and noun form), the slower the nominalizations. decomposition-second argument. If the initial search of The source of the discrepancy between the experi­ the lexicon for the nondecomposed letter string is ments of MacKay (1978) and Manelis and Tharp (1977) INFLECTED NOUNS 417 could be relatively trivial, a matter of differences in the coding of word frequency in lexical memory can be methodology. On the other hand, the discrepancy might referred to as an interentry account, then its popular arise from a deep-seated difference between the kind of alternative can be referred to as an intraentry account, memory structure needed to recognize words and the for here the emphasis is not on an entry's position kind of memory structure needed to produce them.
Recommended publications
  • Reduplication in Distributed Morphology
    Reduplication in Distributed Morphology Item Type text; Article Authors Haugen, Jason D. Publisher University of Arizona Linguistics Circle (Tucson, Arizona) Journal Coyote Papers: Working Papers in Linguistics, Linguistic Theory at the University of Arizona Rights http://rightsstatements.org/vocab/InC/1.0/ Download date 27/09/2021 03:15:12 Item License Copyright © is held by the author(s). Link to Item http://hdl.handle.net/10150/143067 The Coyote Papers 18 (May 2011), University of Arizona Linguistics Department, Tucson, AZ, U.S.A. Reduplication in Distributed Morphology Jason D. Haugen Oberlin College [email protected] Keywords reduplication, Distributed Morphology, allomorphy, reduplicative allomorphy, base-dependence, Hiaki (Yaqui), Tawala Abstract The two extant approaches to reduplication in Distributed Morphology (DM) are: (i) the read- justment approach, where reduplication is claimed to result from a readjustment operation on some stem triggered by a (typically null) affix; and (ii) the affixation approach, where reduplica- tion is claimed to result from the insertion of a special type of Vocabulary Item (i.e. a reduplica- tive affix–“reduplicant" or \Red") which gets inserted into a syntactic node in order to discharge some morphosyntactic feature(s), but which receives its own phonological content from some other stem (i.e. its \base") in the output. This paper argues from phonologically-conditioned allomorphy pertaining to base-dependence, as in the case of durative reduplication in Tawala, that the latter approach best accounts for a necessary distinction between \reduplicants" and \bases" as different types of morphemes which display different phonological effects, including \the emergence of the unmarked" effects, in many languages.
    [Show full text]
  • University of California Santa Cruz Minimal Reduplication
    UNIVERSITY OF CALIFORNIA SANTA CRUZ MINIMAL REDUPLICATION A dissertation submitted in partial satisfaction of the requirements for the degree of DOCTOR OF PHILOSOPHY in LINGUISTICS by Jesse Saba Kirchner June 2010 The Dissertation of Jesse Saba Kirchner is approved: Professor Armin Mester, Chair Professor Jaye Padgett Professor Junko Ito Tyrus Miller Vice Provost and Dean of Graduate Studies Copyright © by Jesse Saba Kirchner 2010 Some rights reserved: see Appendix E. Contents Abstract vi Dedication viii Acknowledgments ix 1 Introduction 1 1.1 Structureofthethesis ...... ....... ....... ....... ........ 2 1.2 Overviewofthetheory...... ....... ....... ....... .. ....... 2 1.2.1 GoalsofMR ..................................... 3 1.2.2 Assumptionsandpredictions. ....... 7 1.3 MorphologicalReduplication . .......... 10 1.3.1 Fixedsize..................................... ... 11 1.3.2 Phonologicalopacity. ...... 17 1.3.3 Prominentmaterialpreferentiallycopied . ............ 22 1.3.4 Localityofreduplication. ........ 24 1.3.5 Iconicity ..................................... ... 24 1.4 Syntacticreduplication. .......... 26 2 Morphological reduplication 30 2.1 Casestudy:Kwak’wala ...... ....... ....... ....... .. ....... 31 2.2 Data............................................ ... 33 2.2.1 Phonology ..................................... .. 33 2.2.2 Morphophonology ............................... ... 40 2.2.3 -mut’ .......................................... 40 2.3 Analysis........................................ ..... 48 2.3.1 Lengtheningandreduplication.
    [Show full text]
  • Verbal Case and the Nature of Polysynthetic Inflection
    Verbal case and the nature of polysynthetic inflection Colin Phillips MIT Abstract This paper tries to resolve a conflict in the literature on the connection between ‘rich’ agreement and argument-drop. Jelinek (1984) claims that inflectional affixes in polysynthetic languages are theta-role bearing arguments; Baker (1991) argues that such affixes are agreement, bearing Case but no theta-role. Evidence from Yimas shows that both of these views can be correct, within a single language. Explanation of what kind of inflection is used where also provides us with an account of the unusual split ergative agreement system of Yimas, and suggests a novel explanation for the ban on subject incorporation, and some exceptions to the ban. 1. Two types of inflection My main aim in this paper is to demonstrate that inflectional affixes can be very different kinds of syntactic objects, even within a single language. I illustrate this point with evidence from Yimas, a Papuan language of New Guinea (Foley 1991). Understanding of the nature of the different inflectional affixes of Yimas provides an explanation for its remarkably elaborate agreement system, which follows a basic split-ergative scheme, but with a number of added complications. Anticipating my conclusions, the structure in (1c) shows what I assume the four principal case affixes on a Yimas verb to be. What I refer to as Nominative and Accusative affixes are pronominal arguments: these inflections, which are restricted to 1st and 2nd person arguments in Yimas, begin as specifiers and complements of the verb, and incorporate into the verb by S- structure. On the other hand, what I refer to as Ergative and Absolutive inflection are genuine agreement - they are the spell-out of functional heads, above VP, which agree with an argument in their specifier.
    [Show full text]
  • Systemic Polyfunctionality and Morphology-Syntax Interdependencies
    Systemic polyfunctionality and morphology-syntax interdependencies Farrell Ackerman1 Olivier Bonami2 Irina Nikolaeva3 1University of California, San Diego 2Université Paris-Sorbonne 3SOAS Defaults in Morphological Theory Lexington, May 2012 The basic problem: Systemic Polyfunctionality Cross-linguistically person/number markers (PNMs) in verbal paradigms often exhibit similarities (up to identity) with person/number markers in nominal possessive constructions (Allen 1964, Radics 1980, Siewierska 1998, 2004, among others): When a language has distinct PNM paradigms for verbal subject (S/A) and object (O) indexing, a question arises: Which paradigm does the possessive paradigm align with? (1) Retuarã (Tucanoan) S/A alignment b˜ıre yi-ha¯a-a¯ Psi yi-behoa-pi 2SG 1SG-kill-NEG.IMP 1SG-spear-INSTR ‘(Be careful), lest I kill you with my spear’ (Strom 1992:63) (2) Kilivila (Central-Eastern Malayo-Polynesian) O alignment lube-gu ku-sake-gu buva friend-1SG 2SG-give-1SG betel_nut ‘My friend, do you give me betel nuts?’ (Senft 1986:53) The basic problem: Systemic Polyfunctionality Among the 130 relevant languages in Sierwieska’s (1998) sample she observes that, We see that [...], among the languages in the sample the affinities in form between the possessor affixes and the verbal person markers of the O (41%) are just marginally more common than those with the S/A (39%). (Siewierska 1998:2) There are, by hypothesis, systemic properties of specific grammars, rather than language independent universals, that explain the alignments observed. The languages compared by Siewierska appear to have distinct markers for S/A and O, and the question asked is which paradigm appears in possessive marking.
    [Show full text]
  • Serial Verb Constructions Revisited: a Case Study from Koro
    Serial Verb Constructions Revisited: A Case Study from Koro By Jessica Cleary-Kemp A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Linguistics in the Graduate Division of the University of California, Berkeley Committee in charge: Associate Professor Lev D. Michael, Chair Assistant Professor Peter S. Jenks Professor William F. Hanks Summer 2015 © Copyright by Jessica Cleary-Kemp All Rights Reserved Abstract Serial Verb Constructions Revisited: A Case Study from Koro by Jessica Cleary-Kemp Doctor of Philosophy in Linguistics University of California, Berkeley Associate Professor Lev D. Michael, Chair In this dissertation a methodology for identifying and analyzing serial verb constructions (SVCs) is developed, and its application is exemplified through an analysis of SVCs in Koro, an Oceanic language of Papua New Guinea. SVCs involve two main verbs that form a single predicate and share at least one of their arguments. In addition, they have shared values for tense, aspect, and mood, and they denote a single event. The unique syntactic and semantic properties of SVCs present a number of theoretical challenges, and thus they have invited great interest from syntacticians and typologists alike. But characterizing the nature of SVCs and making generalizations about the typology of serializing languages has proven difficult. There is still debate about both the surface properties of SVCs and their underlying syntactic structure. The current work addresses some of these issues by approaching serialization from two angles: the typological and the language-specific. On the typological front, it refines the definition of ‘SVC’ and develops a principled set of cross-linguistically applicable diagnostics.
    [Show full text]
  • The “Person” Category in the Zamuco Languages. a Diachronic Perspective
    On rare typological features of the Zamucoan languages, in the framework of the Chaco linguistic area Pier Marco Bertinetto Luca Ciucci Scuola Normale Superiore di Pisa The Zamucoan family Ayoreo ca. 4500 speakers Old Zamuco (a.k.a. Ancient Zamuco) spoken in the XVIII century, extinct Chamacoco (Ɨbɨtoso, Tomarâho) ca. 1800 speakers The Zamucoan family The first stable contact with Zamucoan populations took place in the early 18th century in the reduction of San Ignacio de Samuco. The Jesuit Ignace Chomé wrote a grammar of Old Zamuco (Arte de la lengua zamuca). The Chamacoco established friendly relationships by the end of the 19th century. The Ayoreos surrended rather late (towards the middle of the last century); there are still a few nomadic small bands in Northern Paraguay. The Zamucoan family Main typological features -Fusional structure -Word order features: - SVO - Genitive+Noun - Noun + Adjective Zamucoan typologically rare features Nominal tripartition Radical tenselessness Nominal aspect Affix order in Chamacoco 3 plural Gender + classifiers 1 person ø-marking in Ayoreo realis Traces of conjunct / disjunct system in Old Zamuco Greater plural and clusivity Para-hypotaxis Nominal tripartition Radical tenselessness Nominal aspect Affix order in Chamacoco 3 plural Gender + classifiers 1 person ø-marking in Ayoreo realis Traces of conjunct / disjunct system in Old Zamuco Greater plural and clusivity Para-hypotaxis Nominal tripartition All Zamucoan languages present a morphological tripartition in their nominals. The base-form (BF) is typically used for predication. The singular-BF is (Ayoreo & Old Zamuco) or used to be (Cham.) the basis for any morphological operation. The full-form (FF) occurs in argumental position.
    [Show full text]
  • SPLIT-ERGATIVITY in HITTITE Petra Goedegebuure (University of Chicago)
    Published in: Zeitschrift für Assyriologie und vorderasiatische Archäologie. Volume 102, Issue 2, Pages 270–303, ISSN (Online) 1613-1150, ISSN (Print) 0084-5299, DOI: 10.1515/za- 2012-0015, January 2013 1 SPLIT-ERGATIVITY IN HITTITE Petra Goedegebuure (University of Chicago) “it is possible that all languages show ergativity on some level” (McGregor 2009, 482) 1. Introduction2 As a highly heterogeneous phenomenon ergativity remains a conundrum for linguistic theory. The ergative case has been treated as a structural case, an inherent/lexical case, or rather as a mix (Butt 2006). Split-ergativity is thought to arise as an epiphenomenon, as ‘collateral damage’ of diachronic change after reinterpretation of passive constructions with instrumentals (Dixon 1994) or through reanalysis of transitive null-subject clauses with inanimate instrumentals (Garrett 1990b). Alternatively, case assignment and therefore also split-ergativity ultimately depends on synchronic structural properties of the clause (Merchant 2006). It has been claimed that only 25% of the world’s languages shows ergativity (Van de Visser 2006), or that “all languages show ergativity on some level” (McGregor 2009, 482). Irrespective of the correct ratio, split-ergativity seems to be the norm among languages that show ergativity. When the ergative split is based on semantic features of noun phrases, it is generally assumed that animacy plays a major role. Silverstein (1976) has shown that pronouns and nouns can be hierarchically arranged based on semantic features such as person, number, or grammatical gender. The strength of this hierarchy is that if agent marking is attested for the first time at a certain point in the hierarchy, all nominals lower in the hierarchy will carry agent marking as well.
    [Show full text]
  • 4.1 Inflection
    4.1 Inflection Within a lexeme-based theory of morphology, the difference between derivation and inflection is very simple. Derivation gives you new lexemes, and inflection gives you the forms of a lexeme that are determined by syntactic environment (cf. 2.1.2). But what exactly does this mean? Is there really a need for such a distinction? This section explores the answers to these questions, and in the process, goes deeper into the relation between morphology and syntax. 4.1.1 Inflection vs. derivation The first question we can ask about the distinction between inflection and derivation is whether there is any formal basis for distinguishing the two: can we tell them apart because they do different things to words? One generalization that has been made is that derivational affixes tend to occur closer to the root or stem than inflectional affixes. For example, (1) shows that the English third person singular present inflectional suffix -s occurs outside of derivational suffixes like the deadjectival -ize, and the plural ending -s follows derivational affixes including the deverbal -al: (1) a. popular-ize-s commercial-ize-s b. upheav-al-s arriv-al-s Similarly, Japanese derivational suffixes like passive -rare or causative -sase precede inflectional suffixes marking tense and aspect:1 (2) a. tabe-ru tabe-ta eat- IMP eat- PERF INFLECTION 113 ‘eats’ ‘ate’ b. tabe-rare- ru tabe-rare- ta eat - PASS-IMP eat- PASS-PERF ‘is eaten’ ‘was eaten’ c. tabe-sase- ru tabe-sase- ta eat- CAUS-IMP eat- CAUS-PERF ‘makes eat’ ‘made eat’ It is also the case that inflectional morphology does not change the meaning or grammatical category of the word that it applies to.
    [Show full text]
  • The Syntax of Answers to Negative Yes/No-Questions in English Anders Holmberg Newcastle University
    The syntax of answers to negative yes/no-questions in English Anders Holmberg Newcastle University 1. Introduction This paper will argue that answers to polar questions or yes/no-questions (YNQs) in English are elliptical expressions with basically the structure (1), where IP is identical to the LF of the IP of the question, containing a polarity variable with two possible values, affirmative or negative, which is assigned a value by the focused polarity expression. (1) yes/no Foc [IP ...x... ] The crucial data come from answers to negative questions. English turns out to have a fairly complicated system, with variation depending on which negation is used. The meaning of the answer yes in (2) is straightforward, affirming that John is coming. (2) Q(uestion): Isn’t John coming, too? A(nswer): Yes. (‘John is coming.’) In (3) (for speakers who accept this question as well formed), 1 the meaning of yes alone is indeterminate, and it is therefore not a felicitous answer in this context. The longer version is fine, affirming that John is coming. (3) Q: Isn’t John coming, either? A: a. #Yes. b. Yes, he is. In (4), there is variation regarding the interpretation of yes. Depending on the context it can be a confirmation of the negation in the question, meaning ‘John is not coming’. In other contexts it will be an infelicitous answer, as in (3). (4) Q: Is John not coming? A: a. Yes. (‘John is not coming.’) b. #Yes. In all three cases the (bare) answer no is unambiguous, meaning that John is not coming.
    [Show full text]
  • Morphological Sources of Phonological Length
    Morphological Sources of Phonological Length by Anne Pycha B.A. (Brown University) 1993 M.A. (University of California, Berkeley) 2004 A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Linguistics in the Graduate Division of the University of California, Berkeley Committee in charge: Professor Sharon Inkelas (chair) Professor Larry Hyman Professor Keith Johnson Professor Johanna Nichols Spring 2008 Abstract Morphological Sources of Phonological Length by Anne Pycha Doctor of Philosophy in Linguistics University of California, Berkeley Professor Sharon Inkelas, Chair This study presents and defends Resizing Theory, whose claim is that the overall size of a morpheme can serve as a basic unit of analysis for phonological alternations. Morphemes can increase their size by any number of strategies -- epenthesizing new segments, for example, or devoicing an existing segment (and thereby increasing its phonetic duration) -- but it is the fact of an increase, and not the particular strategy used to implement it, which is linguistically significant. Resizing Theory has some overlap with theories of fortition and lenition, but differs in that it uses the independently- verifiable parameter of size in place of an ad-hoc concept of “strength” and thereby encompasses a much greater range of phonological alternations. The theory makes three major predictions, each of which is supported with cross-linguistic evidence. First, seemingly disparate phonological alternations can achieve identical morphological effects, but only if they trigger the same direction of change in a morpheme’s size. Second, morpheme interactions can take complete control over phonological outputs, determining surface outputs when traditional features and segments fail to do so.
    [Show full text]
  • Corpus Study of Tense, Aspect, and Modality in Diglossic Speech in Cairene Arabic
    CORPUS STUDY OF TENSE, ASPECT, AND MODALITY IN DIGLOSSIC SPEECH IN CAIRENE ARABIC BY OLA AHMED MOSHREF DISSERTATION Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Linguistics in the Graduate College of the University of Illinois at Urbana-Champaign, 2012 Urbana, Illinois Doctoral Committee: Professor Elabbas Benmamoun, Chair Professor Eyamba Bokamba Professor Rakesh M. Bhatt Assistant Professor Marina Terkourafi ABSTRACT Morpho-syntactic features of Modern Standard Arabic mix intricately with those of Egyptian Colloquial Arabic in ordinary speech. I study the lexical, phonological and syntactic features of verb phrase morphemes and constituents in different tenses, aspects, moods. A corpus of over 3000 phrases was collected from religious, political/economic and sports interviews on four Egyptian satellite TV channels. The computational analysis of the data shows that systematic and content morphemes from both varieties of Arabic combine in principled ways. Syntactic considerations play a critical role with regard to the frequency and direction of code-switching between the negative marker, subject, or complement on one hand and the verb on the other. Morph-syntactic constraints regulate different types of discourse but more formal topics may exhibit more mixing between Colloquial aspect or future markers and Standard verbs. ii To the One Arab Dream that will come true inshaa’ Allah! عربية أنا.. أميت دمها خري الدماء.. كما يقول أيب الشاعر العراقي: بدر شاكر السياب Arab I am.. My nation’s blood is the finest.. As my father says Iraqi Poet: Badr Shaker Elsayyab iii ACKNOWLEDGMENTS I’m sincerely thankful to my advisor Prof. Elabbas Benmamoun, who during the six years of my study at UIUC was always kind, caring and supportive on the personal and academic levels.
    [Show full text]
  • Higher-Order Grammatical Features in Multilingual BERT Isabel Papadimitriou Ethan A
    Deep Subjecthood: Higher-Order Grammatical Features in Multilingual BERT Isabel Papadimitriou Ethan A. Chi Stanford University Stanford University [email protected] [email protected] Richard Futrell Kyle Mahowald University of California, Irvine University of California, Santa Barbara [email protected] [email protected] Abstract We investigate how Multilingual BERT (mBERT) encodes grammar by examining how the high-order grammatical feature of morphosyntactic alignment (how different languages define what counts as a “subject”) is manifested across the embedding spaces of different languages. To understand if and how morphosyntactic alignment affects contextual embedding spaces, we train classifiers to recover the subjecthood of mBERT embeddings in transitive sentences (which do not contain overt information about morphosyntactic alignment) and then evaluate Figure 1: Top: Illustration of the difference between them zero-shot on intransitive sentences alignment systems. A (for agent) is notation used for (where subjecthood classification depends on the transitive subject, and O for the transitive ob- alignment), within and across languages. We ject:“The lawyer chased the dog.” S denotes the find that the resulting classifier distributions intransitive subject: “The lawyer laughed.” The blue reflect the morphosyntactic alignment of their circle indicates which roles are marked as “subject” in training languages. Our results demonstrate each system. that mBERT representations are influenced by Bottom: Illustration of the training and test process. high-level grammatical features that are not We train a classifier to distinguish A from O arguments manifested in any one input sentence, and that using the BERT contextual embeddings, and test the this is robust across languages. Further ex- classifier’s behavior on intransitive subjects (S).
    [Show full text]