Arxiv:1910.09729V1 [Cs.CL] 22 Oct 2019
Total Page:16
File Type:pdf, Size:1020Kb
Grammatical Gender, Neo-Whorfianism, and Word Embeddings: A Data-Driven Approach to Linguistic Relativity Katharina Kann New York University, USA [email protected] Abstract 2012). Ultimately, Whorf went on to hypothesize that the Hopi perceive time differently as a result The relation between language and thought of their language’s grammar, kicking off a more has occupied linguists for at least a century. encompassing debate on the relation of language Neo-Whorfianism, a weak version of the con- troversial Sapir–Whorf hypothesis, holds that and thought and engendering one of the larger con- our thoughts are subtly influenced by the gram- troversies in linguistics to date (Deutscher, 2010). matical structures of our native language. One While influential, the strong version of what area of investigation in this vein focuses on has come to be known as the Sapir–Whorf2 hy- how the grammatical gender of nouns affects pothesis has been disproven. For instance, even the way we perceive the corresponding objects. though languages differ in the basic color terms For instance, does the fact that key is mascu- they employ, e.g., Korean has one word that rep- line in German (der Schlussel¨ ), but feminine in Spanish (la llave) change the speakers’ views resents both green and blue, the development of of those objects? Psycholinguistic evidence color terminology and perception is subject to uni- presented by Boroditsky et al.(2003, x4) sug- versalist constraints due to biology (Berlin and Kay, gested the answer might be yes: When asked 1969). Recent years, however, have witnessed a to produce adjectives that best described a key, resurgence of a milder strain of the hypothesis, al- German and Spanish speakers named more ternatively known as neo-Whorfianism or the weak stereotypically masculine and feminine ones, Sapir–Whorf hypothesis (Boroditsky, 2003). One respectively. However, recent attempts to repli- prominent controversial research direction inves- cate those experiments have failed (Mickan et al., 2014). In this work, we offer a com- tigates this hypothesis with respect to grammat- putational analogue of Boroditsky et al.(2003, icalized notions of gender and tense. Based on x4)’s experimental design on 9 languages, find- experimental evidence obtained from native Ger- ing evidence against neo-Whorfianism. man and Spanish speakers, Boroditsky(2003, x4.6) argued that grammatical gender affects how speak- 1 Introduction ers view objects in their native language. Similarly, During his tenure as a graduate student, 20th- differences in the usage of tense in Mandarin and century American linguist Benjamin Whorf con- English were found to affect how the respective ducted field work on Hopi, an Uto-Aztecan lan- speakers view time (Boroditsky, 2001). In this guage spoken in Southern Arizona. To his surprise, work, we focus on the influence of grammatical arXiv:1910.09729v1 [cs.CL] 22 Oct 2019 he found that Hopi does not mark the tense of a verb gender and how we can use NLP tools to shed light in the way many Western European languages do on the validity of this aspect of neo-Whorfianism. (Whorf, 1956).1 Thus, according to Whorf, a Hopi The gist of Boroditsky et al.(2003, x4)’s hy- speaker must infer whether an action takes place pothesis is that speakers of languages that mark in the past, present or future only from the senten- grammatical gender perceive nouns, even inani- tial context in which the verb occurs. This finding mate ones, differently depending on the noun’s inspired Whorf to start questioning whether lan- gender. They sought psycholinguistic evidence for guage influences thought, a position that has come this claim, arguing that speakers will more often to be known as linguistic relativity (Whorf et al., choose stereotypically masculine adjectives to de- scribe grammatically masculine inanimate nouns 1Whorf’s claim has subsequently been challenged. Later analyses of Hopi grammar suggest that the language marks 2The hypothesis is named after both Benjamin Whorf and two tenses: future and non-future (Malotki, 1983). his Ph.D. advisor Edward Sapir. POS= POS= POS= POS= POS= POS= POS= CASE= CASE= CASE= CASE= CASE= CASE= NUM= NUM= NUM= NUM= NUM= NUM= GEN= Все счастливые семьи похожи друг на друга… All happy families are similar to each other Figure 1: A Russian sentence showing the original forms and their decomposition into tag–lemma pairs. and stereotypically feminine adjectives to describe either two or three genders: the bipartite distinc- grammatically feminine inanimate nouns. To give tions masculine-feminine and the tripartite distinc- a concrete example, the German word for key, tion masculine-feminine-neuter. An important as- Schlussel¨ , happens to be masculine, so speakers sumption of our study is the tenet that the gender are more likely to use words such as heavy, jagged, assigned to inanimate objects is arbitrary. and hard. In contrast, the Spanish word for key, All languages we experiment on exhibit concord llave, happens to be feminine, so speakers are more in grammatical gender, i.e., articles and adjectives likely to use stereotypically feminine words such that modify a noun must agree with that noun in as elegant, pretty, and delicate. However, Mickan gender as well as in other features. Consider the et al.(2014) failed to replicate this experiment, German and Spanish translations of the English leaving the validity of the findings questionable. sentence: The beautiful key is on the table. In this work, we provide evidence against La llave bonita esta´ sobre la mesa. (1) Boroditsky et al.(2003, x4)’s hypothesis using NLP techniques. We contend that if this instance Der schone¨ Schlussel¨ liegt auf dem Tisch. (2) of neo-Whorfianism is true, then we should see Here, German employs the masculine article der a reflection of it in corpus co-occurrence counts. as Schlussel¨ is masculine; had Schlussel¨ been fem- We propose two experiments, based on word em- inine, German speakers would say die Schlussel¨ . beddings trained on large corpora, to investigate Spanish exhibits a similar behavior, making use of how speakers of different languages use certain la, rather than el, as llave is feminine. For a thor- nouns. Our results on 9 languages suggest that ough treatment of the subject, see Corbett(2006). the grammatical gender of inanimate nouns does A key part of our experimental design will be not influence how they are used in context, taking stripping side effects of the grammatical concord credence away from Boroditsky et al.(2003, x4)’s from articles, adjectives, and verbs, thus removing claims. However, we caution that it is important overt signals of the noun’s gender. not to overstate the findings of one study; we see our results as providing further evidence in the lin- 2.2 Morphological Tagging and guistic relativity debate against neo-Whorfianism Lemmatization from a largely orthogonal source. Our study will make use of NLP tools that automate bits of linguistic analysis: morphological tagging 2 Background and lemmatization. Both will be explained here. In languages that exhibit inflectional morphol- 2.1 Grammatical Gender ogy, we may decompose a word into a bundle In this section, we provide a brief overview of of morpho-syntactic features and a lemma, its grammatical gender systems since those play an canonical form. Formally, we denote a word important role in Boroditsky et al.(2003, x4)’s, in a natural language as w, and a sentence of and hence our, experiments. Languages range length n as w = w1 ··· wi ··· wn. Each word from encoding no grammatical gender on inan- can be factored into a lemma ` and a bundle imate nouns, like English or Mandarin Chinese, of morphological features m. For instance, we to distinguishing tens of gender-like noun classes, may think of the German word Schlussels¨ as as found in the Bantu languages of Africa (Cor- the lemma ` = Schlussel¨ and the features m = bett, 1991). In this work, we will exclusively con- [ POS=n; GEN=masc; NUM=sg; CASE=gen ]. sider gendered languages of Indo-European and Each morphological feature in m is an attribute– Afro-Asiatic (Semitic) stock; they all distinguish value pair. Attributes encompass lexical properties bg es fr he it pl ro ru sk where Xij denotes the probability that vi co-occurs all 96.6 98.5 98.4 96.5 98.2 96.5 98.1 96.5 95.0 with vj within a certain context window. For in- OOV 82.1 91.7 89.6 79.6 91.2 86.9 89.0 88.1 86.6 stance, a symmetric context window of size 5 is a common choice. This asks how often vj occurs Table 1: Token-level lemmatization accuracy obtained by within five positions to the left or right of vi. Un- LEMMING on the test splits of the UD treebanks for all lan- guages when evaluated on all tokens or just OOVs. der this interpretation, WORD2VEC is an instance of exponential-family PCA (Collins et al., 2001; such as gender, number, and case, taking values Cotterell et al., 2017). The output of WORD2VEC such as masculine, singular and genitive, respec- is a mapping of word types to a vector space: d tively. Following Booij(1996), we may divide e : V ! R . We highlight that the model only the morphological attributes into two categories: considers words categorically, i.e., it is unable to inherent and contextual. Inherent categories are look at the surface form of the word. This is rel- those that are embedded in the lemma itself: the evant as there are sub-word indicators of gender, lemma Schlussel¨ without any additional inflection e.g., feminine nouns in Spanish often end in -a. reveals POS=n and GEN=masc. However, the In practice, embeddings are taken as a proxy for sentential context dictates that NUM=sg and lexical semantic meaning—words that have similar CASE=gen.