What's in a Name? in Some Languages, Grammatical Gender

Total Page:16

File Type:pdf, Size:1020Kb

What's in a Name? in Some Languages, Grammatical Gender What's in a name? In some languages, grammatical gender Vivi Nastase Marius Popescu EML Research gGmbH Department of Mathematics and Computer Science Heidelberg, Germany University of Bucharest, Bucharest, Romania [email protected] [email protected] Abstract within a single language family: travel – mascu- line in Spanish (el viage) and Italian (il viaggio), This paper presents an investigation of the but feminine in Portuguese (a viagem). relation between words and their gender in Grammatical gender groups nouns in a lan- two gendered languages: German and Ro- guage into distinct classes. There are languages manian. Gender is an issue that has long whose nouns are grouped into more or less than preoccupied linguists and baffled language three classes. English for example has none, and learners. We verify the hypothesis that makes no distinction based on gender, although gender is dictated by the general sound Old English did have three genders and some patterns of a language, and that it goes traces remain (e.g. blonde, blond). beyond suffixes or word endings. Exper- Linguists assume several sources for gender: (i) imental results on German and Romanian a first set of nouns which have natural gender and nouns show strong support for this hypoth- which have associated matching grammatical gen- esis, as gender prediction can be done with der; (ii) nouns that resemble (somehow) the nouns high accuracy based on the form of the in the first set, and acquire their grammatical gen- words. der through this resemblance. Italian and Roma- nian, for example, have strong and reliable phono- 1 Introduction logical correlates (Vigliocco et al., 2004b, for Ital- For speakers of a language whose nouns have no ian). (Doca, 2000, for Romanian). In Romanian gender (such as modern English), making the leap the majority of feminine nouns end in a˘ or e. Some to a language that does (such as German), does rules exists for German as well (Schumann, 2006), not come easy. With no or few rules or heuris- for example nouns ending in -tat,¨ -ung, -e, -enz, tics to guide him, the language learner will try to -ur, -keit, -in tend to be feminine. Also, when draw on the “obvious” parallel between grammat- specific morphological processes apply, there are ical and natural gender, and will be immediately rules that dictate the gender of the newly formed baffled to learn that girl – Madchen¨ – is neuter in word. This process explains why Frau (woman) is German. Furthermore, one may refer to the same feminine in German, while Fraulein¨ (little woman, object using words with different gender: car can miss) is neuter – Fraulein¨ = Frau + lein. The ex- be called (das) Auto (neuter) or (der) Wagen (mas- isting rules have exceptions, and there are numer- culine). Imagine that after hard work, the speaker ous nouns in the language which are not derived, has mastered gender in German, and now wishes and such suffixes do not apply. to proceed with a Romance language, for example Words are names used to refer to concepts. The Italian or Spanish. He is now confronted with the fact that the same concept can be referred to using task of relearning to assign gender in these new names that have different gender – as is the case languages, made more complex by the fact that for car in German – indicates that at least in some gender does not match across languages: e.g. sun cases, grammatical gender is in the name and not – feminine in German (die Sonne), but masculine the concept. We test this hypothesis – that the gen- in Spanish (el sol), Italian (il sole) and French (le der of a noun is in its word form, and that this goes soleil); moon – masculine in German (der Mond), beyond word endings – using noun gender data but feminine in Spanish (la luna), Italian (la luna) for German and Romanian. Both Romanian and and French (la lune). Gender doesn’t even match German have 3 genders: masculine, feminine and 1368 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, pages 1368–1377, Singapore, 6-7 August 2009. c 2009 ACL and AFNLP neuter. The models built using machine learning larski (2007) presents a more detailed overview algorithms classify test nouns into gender classes of currents and ideas about the origin of gender. based on their form with high accuracy. These re- Unterbeck (1999) contains a collection of papers sults support the hypothesis that in gendered lan- that investigate grammatical gender in several lan- guages, the word form is a strong clue for gender. guages, aspects of gender acquisition and its rela- This supplements the situation when some con- tion with grammatical number and agreement. cepts have natural gender that matches their gram- There may be several reasons for the polemic matical gender: it allows for an explanation where between these two sides. One may come from the there is no such match, either directly perceived, categorization process, the other from the relation or induced through literary devices. between word form and its meaning. Let us take The present research has both theoretical and them each in turn, and see how they influenced practical benefits. From a theoretical point of gender. view, it contributes to research on phonology and Grammatical gender separates the nouns in a gender, in particular in going a step further in un- language into disjoint classes. As such, it is a cat- derstating the link between the two. From a practi- egorization process. The traditional – classical – cal perspective, such a connection between gender theory of categorization and concepts viewed cat- and sounds could be exploited in advertising, in egories and concepts as defined in terms of a set particular in product naming, to build names that of common properties that all its members should fit a product, and which are appealing to the de- share. Recent theories of concepts have changed, sired customers. Studies have shown that espe- and view concepts (and categories) not necessar- cially in the absence of meaning, the form of a ily as “monolithic” and defined through rules, but word can be used to generate specific associations rather as clusters of members that may resemble and stimulate the imagination of prospective cus- each other along different dimensions (Margolis tomers (Sells and Gonzales, 2003; Bedgley, 2002; and Laurence, 1999). Botton et al., 2002). In most linguistic circles, the principle of ar- bitrariness of the association between form and 2 Gender meaning, formalized by de Saussure (1916) has What is the origin of grammatical gender and how been largely taken for granted. It seems however, does it relate to natural gender? Opinions are split. that it is hard to accept such an arbitrary relation, Historically, there were two main, opposite, views: as there have always been contestants of this prin- (i) there is a semantic explanation, and natural ciple, some more categorical than others (Jakob- gender motivated the category (ii) the relationship son, 1937; Jespersen, 1922; Firth, 1951). It is pos- between natural and grammatical gender is arbi- sible that the correlation we perceive between the trary. word form and the meaning is something that has Grimm (1890) considered that grammatical arisen after the word was coined in a language, be- gender is an extension of natural gender brought ing the result of what Firth called “phonetic habit” on by imagination. Each gender is associated through “an attunement of the nervous system”, with particular adjectives or other attributes, and in and that we have come to prefer, or select, cer- some cases (such as for sun and moon) the assign- tain word forms as more appropriate to the con- ment of gender is based on personification. Brug- cept they name – “There is no denying that there mann (1889) and Bloomfield (1933) took the po- are words which we feel instinctively to be ade- sition that the mapping of nouns into genders is quate to express the ideas they stand for. ... Sound arbitrary, and other phenomena – such as deriva- symbolism, we may say, makes some words more tions, personification – are secondary to the estab- fit to survive” (Jespersen, 1922). lished agreement. Support for this second view These two principles relate to the discussion comes also from language acquisition: children on gender in the following manner: First of all, who learn a gendered language do not have a nat- the categories determined by grammatical gen- ural gender attribute that they try to match onto der need not be homogeneous, and their mem- the newly acquired words, but learn these in a sep- bers need not all respect the same member- arate process. Any match or mapping between ship criterion. This frees us from imposing a natural and grammatical gender is done after the matching between natural and grammatical gen- natural gender “feature” is acquired itself. Ki- der where no such relation is obvious or pos- 1369 sible through literary devices (personification, (2004b) test cognitive aspects of grammatical gen- metaphor, metonymy). Nouns belonging to the der of Italian nouns referring to animals. same gender category may resemble each other Cucerzan and Yarowsky (2003) present a boot- because of semantic considerations, lexical deriva- strapping process to predict gender for nouns in tions, internal structure, perceived associations context. They show that context gives accurate and so on. Second, the fact that we allow for the clues to gender (in particular through determiners, possibility that the surface form of a word may quantifiers, adjectives), but when the context is not encode certain word characteristics or attributes, useful, the algorithm can fall back successfully on allows us to hypothesize that there is a surface, the word form. Cucerzan and Yarowsky model phonological, similarity between words grouped the word form for predicting gender using suffix within the same gender category, that can supple- trie models.
Recommended publications
  • LINGUISTICS 221 Lecture #3 DISTINCTIVE FEATURES Part 1. an Utterance Is Composed of a Sequence of Discrete Segments. Is the Segm
    LINGUISTICS 221 Lecture #3 DISTINCTIVE FEATURES Part 1. An utterance is composed of a sequence of discrete segments. Is the segment indivisible? Is the segment the smallest unit of phonological analysis? If it is, segments ought to differ randomly from one another. Yet this is not true: pt k prs What is the relationship between members of the two groups? p t k - the members of this set have an internal relationship: they are all voiceles stops. p r s - no such relationship exists p b d s bilabial bilabial alveolar alveolar stop stop stop fricative voiceless voiced voiced voiceless SIMILARITIES AND DIFFERENCES! Segments may be viewed as composed of sets of properties rather than indivisible entities. We can show the relationship by listing the properties of each segment. DISTINCTIVE FEATURES • enable us to describe the segments in the world’s languages: all segments in any language can be characterized in some unique combination of features • identifies groups of segments → natural segment classes: they play a role in phonological processes and constraints • distinctive features must be referred to in terms of phonetic -- articulatory or acoustic -- characteristics. 1 Requirements on distinctive feature systems (p. 66): • they must be capable of characterizing natural segment classes • they must be capable of describing all segmental contrasts in all languages • they should be definable in phonetic terms The features fulfill three functions: a. They are capable of describing the segment: a phonetic function b. They serve to differentiate lexical items: a phonological function c. They define natural segment classes: i.e. those segments which as a group undergo similar phonological processes.
    [Show full text]
  • Grammatical Gender in the German Multiethnolect Peter Auer & Vanessa Siegel
    1 Grammatical gender in the German multiethnolect Peter Auer & Vanessa Siegel Contact: Deutsches Seminar, Universität Freiburg, D-79089 Freiburg [email protected], [email protected] Abstract While major restructurations and simplifications have been reported for gender systems of other Germanic languages in multiethnolectal speech, the article demonstrates that the three-fold gender distinction of German is relatively stable among young speakers of immigrant background. We inves- tigate gender in a German multiethnolect, based on a corpus of appr. 17 hours of spontaneous speech by 28 young speakers in Stuttgart (mainly of Turkish and Balkan backgrounds). German is not their second language, but (one of) their first language(s), which they have fully acquired from child- hood. We show that the gender system does not show signs of reduction in the direction of a two gender system, nor of wholesale loss. We also argue that the position of gender in the grammar is weakened by independent processes, such as the frequent use of bare nouns determiners in grammatical contexts where German requires it. Another phenomenon that weakens the position of gender is the simplification of adjective/noun agreement and the emergence of a generalized, gender-neutral suffix for pre-nominal adjectives (i.e. schwa). The disappearance of gender/case marking in the adjective means that the grammatical cat- egory of gender is lost in A + N phrases (without determiner). 1. Introduction Modern German differs from most other Germanic languages
    [Show full text]
  • Arxiv:2106.08037V1 [Cs.CL] 15 Jun 2021 Alternative Ways the World Could Be
    The Possible, the Plausible, and the Desirable: Event-Based Modality Detection for Language Processing Valentina Pyatkin∗ Shoval Sadde∗ Aynat Rubinstein Bar Ilan University Bar Ilan University Hebrew University of Jerusalem [email protected] [email protected] [email protected] Paul Portner Reut Tsarfaty Georgetown University Bar Ilan University [email protected] [email protected] Abstract (1) a. We presented a paper at ACL’19. Modality is the linguistic ability to describe b. We did not present a paper at ACL’20. events with added information such as how de- sirable, plausible, or feasible they are. Modal- The propositional content p =“present a paper at ity is important for many NLP downstream ACL’X” can be easily verified for sentences (1a)- tasks such as the detection of hedging, uncer- (1b) by looking up the proceedings of the confer- tainty, speculation, and more. Previous studies ence to (dis)prove the existence of the relevant pub- that address modality detection in NLP often p restrict modal expressions to a closed syntac- lication. The same proposition is still referred to tic class, and the modal sense labels are vastly in sentences (2a)–(2d), but now in each one, p is different across different studies, lacking an ac- described from a different perspective: cepted standard. Furthermore, these senses are often analyzed independently of the events that (2) a. We aim to present a paper at ACL’21. they modify. This work builds on the theoreti- b. We want to present a paper at ACL’21. cal foundations of the Georgetown Gradable Modal Expressions (GME) work by Rubin- c.
    [Show full text]
  • Serial Verb Constructions Revisited: a Case Study from Koro
    Serial Verb Constructions Revisited: A Case Study from Koro By Jessica Cleary-Kemp A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Linguistics in the Graduate Division of the University of California, Berkeley Committee in charge: Associate Professor Lev D. Michael, Chair Assistant Professor Peter S. Jenks Professor William F. Hanks Summer 2015 © Copyright by Jessica Cleary-Kemp All Rights Reserved Abstract Serial Verb Constructions Revisited: A Case Study from Koro by Jessica Cleary-Kemp Doctor of Philosophy in Linguistics University of California, Berkeley Associate Professor Lev D. Michael, Chair In this dissertation a methodology for identifying and analyzing serial verb constructions (SVCs) is developed, and its application is exemplified through an analysis of SVCs in Koro, an Oceanic language of Papua New Guinea. SVCs involve two main verbs that form a single predicate and share at least one of their arguments. In addition, they have shared values for tense, aspect, and mood, and they denote a single event. The unique syntactic and semantic properties of SVCs present a number of theoretical challenges, and thus they have invited great interest from syntacticians and typologists alike. But characterizing the nature of SVCs and making generalizations about the typology of serializing languages has proven difficult. There is still debate about both the surface properties of SVCs and their underlying syntactic structure. The current work addresses some of these issues by approaching serialization from two angles: the typological and the language-specific. On the typological front, it refines the definition of ‘SVC’ and develops a principled set of cross-linguistically applicable diagnostics.
    [Show full text]
  • Why Grammar Matters: Conjugating Verbs in Modern Legal Opinions Robert C
    Loyola University Chicago Law Journal Volume 40 Article 3 Issue 1 Fall 2008 2008 Why Grammar Matters: Conjugating Verbs in Modern Legal Opinions Robert C. Farrell Quinnipiac University School of Law Follow this and additional works at: http://lawecommons.luc.edu/luclj Part of the Law Commons Recommended Citation Robert C. Farrell, Why Grammar Matters: Conjugating Verbs in Modern Legal Opinions, 40 Loy. U. Chi. L. J. 1 (2008). Available at: http://lawecommons.luc.edu/luclj/vol40/iss1/3 This Article is brought to you for free and open access by LAW eCommons. It has been accepted for inclusion in Loyola University Chicago Law Journal by an authorized administrator of LAW eCommons. For more information, please contact [email protected]. Why Grammar Matters: Conjugating Verbs in Modern Legal Opinions Robert C. Farrell* I. INTRODUCTION Does it matter that the editors of thirty-three law journals, including those at Yale and Michigan, think that there is a "passive tense"? l Does it matter that the United States Courts of Appeals for the Sixth2 and Eleventh3 Circuits think that there is a "passive mood"? Does it matter that the editors of fourteen law reviews think that there is a "subjunctive tense"?4 Does it matter that the United States Court of Appeals for the District of Columbia Circuit thinks that there is a "subjunctive voice'"? 5 There is, in fact, no "passive tense" or "passive mood." The passive is a voice. 6 There is no "subjunctive voice" or "subjunctive tense." The subjunctive is a mood.7 The examples in the first paragraph suggest that there is widespread unfamiliarity among lawyers and law students * B.A., Trinity College; J.D., Harvard University; Professor, Quinnipiac University School of Law.
    [Show full text]
  • Spanish Verbs and Essential Grammar Review
    Spanish Verbs and Essential Grammar Review Prepared by: Professor Carmen L. Torres-Robles Department of Foreign Languages & Literatures Purdue University Calumet Revised: 1 /2003 Layout by: Nancy J. Tilka CONTENTS Spanish Verbs Introduction 4 Indicative Mood 5 ® simple & compound tenses: present, past, future, conditional Subjunctive Mood 12 ® simple & compound tenses: present, past Ser / Estar 16 Essential Grammar Pronouns 20 Possesive Adjectives and Pronouns 23 Prepositional Pronouns 25 Por versus Para 27 Comparisons / Superlatives 31 Preterite / Imperfect 34 Subjunctive Mood 37 Commands 42 Passive Voice 46 2 Spanish Verbs 3 INTRODUCTION VERBS (VERBOS) MOODS (MODOS) There are three moods or ways to express verbs (actions) in Spanish. 1. Indicative Mood (objective) 2. Subjunctive Mood (subjective) 3. Imperative Mood (commands) INFINITIVES (INFINITIVOS) A verb in the purest form (without a noun or subject pronoun to perform the action) is called an infinitive. The infinitives in English are characterized by the prefix “to” + “verb form”, the Spanish infinitives are identified by the “r” ending. Example estudiar, comer, dormir to study, to eat, to sleep CONJUGATIONS (CONJUGACIONES) Spanish verbs are grouped in three categories or conjugations. 1. Infinitives ending in –ar belong to the first conjugation. (estudiar) 2. Infinitives ending in –er belong to the second conjugation. (comer) 3. Infinitives ending in –ir belong to the third conjugation. (dormir) VERB STRUCTURE (ESTRUCTURA VERBAL) Spanish verbs are divided into three parts. (infinitive: estudiar) 1. Stem or Root (estudi-) 2. Theme Vowel (-a-) 3. "R" Ending (-r) CONJUGATED VERBS (VERBOS CONJUGADOS) To conjugate a verb, a verb must have an explicit subject noun (ex: María), a subject pronoun (yo, tú, usted, él, ella, nosotros(as), vosotros(as), ustedes, ellos, ellas), or an implicit subject, to indicate the performer of the action.
    [Show full text]
  • 30. Tense Aspect Mood 615
    30. Tense Aspect Mood 615 Richards, Ivor Armstrong 1936 The Philosophy of Rhetoric. Oxford: Oxford University Press. Rockwell, Patricia 2007 Vocal features of conversational sarcasm: A comparison of methods. Journal of Psycho- linguistic Research 36: 361−369. Rosenblum, Doron 5. March 2004 Smart he is not. http://www.haaretz.com/print-edition/opinion/smart-he-is-not- 1.115908. Searle, John 1979 Expression and Meaning. Cambridge: Cambridge University Press. Seddiq, Mirriam N. A. Why I don’t want to talk to you. http://notguiltynoway.com/2004/09/why-i-dont-want- to-talk-to-you.html. Singh, Onkar 17. December 2002 Parliament attack convicts fight in court. http://www.rediff.com/news/ 2002/dec/17parl2.htm [Accessed 24 July 2013]. Sperber, Dan and Deirdre Wilson 1986/1995 Relevance: Communication and Cognition. Oxford: Blackwell. Voegele, Jason N. A. http://www.jvoegele.com/literarysf/cyberpunk.html Voyer, Daniel and Cheryl Techentin 2010 Subjective acoustic features of sarcasm: Lower, slower, and more. Metaphor and Symbol 25: 1−16. Ward, Gregory 1983 A pragmatic analysis of epitomization. Papers in Linguistics 17: 145−161. Ward, Gregory and Betty J. Birner 2006 Information structure. In: B. Aarts and A. McMahon (eds.), Handbook of English Lin- guistics, 291−317. Oxford: Basil Blackwell. Rachel Giora, Tel Aviv, (Israel) 30. Tense Aspect Mood 1. Introduction 2. Metaphor: EVENTS ARE (PHYSICAL) OBJECTS 3. Polysemy, construal, profiling, and coercion 4. Interactions of tense, aspect, and mood 5. Conclusion 6. References 1. Introduction In the framework of cognitive linguistics we approach the grammatical categories of tense, aspect, and mood from the perspective of general cognitive strategies.
    [Show full text]
  • The Differences Between Spoken and Written Grammar in English, in Comparison with Vietnamese1
    GIST EDUCATION AND LEARNING RESEARCH JOURNAL. ISSN 1692-5777. NO. 11, (JULY - DECEMBER) 2015. pp. 138-153. The Differences between Spoken and Written Grammar in English, in Comparison 1 with Vietnamese Las Diferencias entre la Gramática Oral y Escrita del Idioma Inglés en Comparación con el Idioma Vietnamita Nguyen Cao Thanh2* Tan Trao University, Vietnam Abstract The fundamental point of this paper is to describe and evaluate some differences between spoken and written grammar in English, and compare some of the points with Vietnamese. This paper illustrates that spoken grammar is less rigid than written grammar. Moreover, it highlights the distinction between speaking and writing in terms of subordination and coordination. Further, the different frequency of adverbials and adjectivals between spoken and written language is also compared and analyzed. Keywords: spoken and written grammar, English, Vietnamese 138 1 Received: July 15, 2015 / Accepted: September 10, 2015 2 [email protected] No. 11 (July - December 2015) No. 11 (July - December 2015) CAO Resumen El principal objetivo de este artículo es describir y evaluar algunas diferencias entre la gramática oral y escrita del idioma inglés y comparar algunos aspectos gramaticales con el idioma vietnamita. Esta revisión muestra como la gramática oral es menos rígida que la gramática escrita. Por otra parte, se destaca la distinción entre el hablar y el escribir en términos de subordinación y coordinación. Además, la diferencia en el uso de adverbios y adjetivos entre la gramática oral y escrita también es comparada y analizada. Palabras clave: gramática oral y escrita, inglés, vietnamita Resumo O principal objetivo deste artigo é descrever e avaliar algumas diferenças entre a gramática oral e escrita do idioma inglês e comparar alguns aspectos gramaticais com o idioma vietnamita.
    [Show full text]
  • Adjective in Old English
    Adjective in Old English Adjective in Old English had five grammatical categories: three dependent grammatical categories, i.e forms of agreement of the adjective with the noun it modified – number, gender and case; definiteness – indefiniteness and degrees of comparison. Adjectives had three genders and two numbers. The category of case in adjectives differed from that of nouns: in addition to the four cases of nouns they had one more case, Instrumental. It was used when the adjective served as an attribute to a noun in the Dat. case expressing an instrumental meaning. Weak and Strong Declension Most adjectives in OE could be declined in two ways: according to the weak and to the strong declension. The formal differences between the declensions, as well as their origin, were similar to those of the noun declensions. The strong and weak declensions arose due to the use of several stem-forming suffixes in PG: vocalic a-, o-, u- and i- and consonantal n-. Accordingly, there developed sets of endings of the strong declension mainly coinciding with the endings of a-stems of nouns for adjectives in the Masc. and Neut. and of o-stems – in the Fem. Some endings in the strong declension of adjectives have no parallels in the noun paradigms; they are similar to the endings of pronouns: -um for Dat. sg, -ne for Acc. Sg Masc., [r] in some Fem. and pl endings. Therefore the strong declension of adjectives is sometimes called the ‘pronominal’ declension. As for the weak declension, it uses the same markers as n-stems of nouns except that in the Gen.
    [Show full text]
  • On the Relation Between Grammatical Number and Cardinal Numbers in Development
    MINI REVIEW ARTICLE published: 09 October 2014 doi: 10.3389/fpsyg.2014.01132 On the relation between grammatical number and cardinal numbers in development Barbara W. Sarnecka * Department of Cognitive Sciences, University of California at Irvine, Irvine, CA, USA Edited by: This mini-review focuses on the question of how the grammatical number system of Ann Dowker, University of a child’s language may help the child learn the meanings of cardinal number words Oxford, UK (e.g., “one” and “two”). Evidence from young children learning English, Russian, Reviewed by: Emily Mather, University of Hull, UK Japanese, Mandarin, Slovenian, or Saudi Arabic suggests that trajectories of number-word Ruth Ford, Anglia Ruskin learning differ for children learning different languages. Children learning English, University, UK which distinguishes between singular and plural, seem to learn the meaning of the *Correspondence: cardinal number “one” earlier than children learning Japanese or Mandarin, which Barbara W. Sarnecka, Department have very little singular/plural marking. Similarly, children whose languages have a of Cognitive Sciences, University of California at Irvine, 3151 Social singular/dual/plural system (Slovenian and Saudi Arabic) learn the meaning of “two” earlier Sciences Plaza A, Irvine, CA than English-speaking children. This relation between grammatical and cardinal number 92697-5100, USA may shed light on how humans acquire cardinal-number concepts. There is an ongoing e-mail: [email protected] debate about whether mental symbols for small cardinalities (concepts for “oneness,” “twoness,” etc.) are innate or learned. Although an effect of grammatical number on number-word learning does not rule out nativist accounts, it seems more consistent with constructivist accounts, which portray the number-learning process as one that requires significant conceptual change.
    [Show full text]
  • The Syntax of Answers to Negative Yes/No-Questions in English Anders Holmberg Newcastle University
    The syntax of answers to negative yes/no-questions in English Anders Holmberg Newcastle University 1. Introduction This paper will argue that answers to polar questions or yes/no-questions (YNQs) in English are elliptical expressions with basically the structure (1), where IP is identical to the LF of the IP of the question, containing a polarity variable with two possible values, affirmative or negative, which is assigned a value by the focused polarity expression. (1) yes/no Foc [IP ...x... ] The crucial data come from answers to negative questions. English turns out to have a fairly complicated system, with variation depending on which negation is used. The meaning of the answer yes in (2) is straightforward, affirming that John is coming. (2) Q(uestion): Isn’t John coming, too? A(nswer): Yes. (‘John is coming.’) In (3) (for speakers who accept this question as well formed), 1 the meaning of yes alone is indeterminate, and it is therefore not a felicitous answer in this context. The longer version is fine, affirming that John is coming. (3) Q: Isn’t John coming, either? A: a. #Yes. b. Yes, he is. In (4), there is variation regarding the interpretation of yes. Depending on the context it can be a confirmation of the negation in the question, meaning ‘John is not coming’. In other contexts it will be an infelicitous answer, as in (3). (4) Q: Is John not coming? A: a. Yes. (‘John is not coming.’) b. #Yes. In all three cases the (bare) answer no is unambiguous, meaning that John is not coming.
    [Show full text]
  • Measuring the Similarity of Grammatical Gender Systems by Comparing Partitions
    Measuring the Similarity of Grammatical Gender Systems by Comparing Partitions Arya D. McCarthy Adina Williams Shijia Liu David Yarowsky Ryan Cotterell Johns Hopkins University, Facebook AI Research, ETH Zurich [email protected], [email protected], [email protected], [email protected], [email protected] Abstract A grammatical gender system divides a lex- icon into a small number of relatively fixed grammatical categories. How similar are these gender systems across languages? To quantify the similarity, we define gender systems ex- (a) German, K = 3 (b) Spanish, K = 2 tensionally, thereby reducing the problem of comparisons between languages’ gender sys- Figure 1: Two gender systems partitioning N = 6 con- tems to cluster evaluation. We borrow a rich cepts. German (a) has three communities: Obst (fruit) inventory of statistical tools for cluster evalu- and Gras (grass) are neuter, Mond (moon) and Baum ation from the field of community detection (tree) are masculine, Blume (flower) and Sonne (sun) (Driver and Kroeber, 1932; Cattell, 1945), that are feminine. Spanish (b) has two communities: fruta enable us to craft novel information-theoretic (fruit), luna (moon), and flor are feminine, and cesped metrics for measuring similarity between gen- (grass), arbol (tree), and sol (sun) are masculine. der systems. We first validate our metrics, then use them to measure gender system similarity in 20 languages. Finally, we ask whether our exhaustively divides up the language’s nouns; that gender system similarities alone are sufficient is, the union of gender categories is the entire to reconstruct historical relationships between languages. Towards this end, we make phylo- nominal lexicon.
    [Show full text]