This Thesis Has Been Submitted in Fulfilment of the Requirements for a Postgraduate Degree (E.G

Total Page:16

File Type:pdf, Size:1020Kb

This Thesis Has Been Submitted in Fulfilment of the Requirements for a Postgraduate Degree (E.G This thesis has been submitted in fulfilment of the requirements for a postgraduate degree (e.g. PhD, MPhil, DClinPsychol) at the University of Edinburgh. Please note the following terms and conditions of use: This work is protected by copyright and other intellectual property rights, which are retained by the thesis author, unless otherwise stated. A copy can be downloaded for personal non-commercial research or study, without prior permission or charge. This thesis cannot be reproduced or quoted extensively from without first obtaining permission in writing from the author. The content must not be changed in any way or sold commercially in any format or medium without the formal permission of the author. When referring to this work, full bibliographic details including the author, title, awarding institution and date of the thesis must be given. Incorporating Pronoun Function into Statistical Machine Translation Liane Guillou I V N E R U S E I T H Y T O H F G E R D I N B U Doctor of Philosophy Institute for Language, Cognition and Computation School of Informatics University of Edinburgh 2016 Abstract Pronouns are used frequently in language, and perform a range of functions. Some pronouns are used to express coreference, and others are not. Languages and genres differ in how and when they use pronouns and this poses a prob- lem for Statistical Machine Translation (SMT) systems (Le Nagard and Koehn, 2010; Hardmeier and Federico, 2010; Novák, 2011; Guillou, 2012; Weiner, 2014; Hardmeier, 2014). Attention to date has focussed on coreferential (anaphoric) pronouns with NP antecedents, which when translated from English into a lan- guage with grammatical gender, must agree with the translation of the head of the antecedent. Despite growing attention to this problem, little progress has been made, and little attention has been given to other pronouns. The central claim of this thesis is that pronouns performing different func- tions in text should be handled differently by SMT systems and when evaluating pronoun translation. This motivates the introduction of a new framework to categorise pronouns according to their function: Anaphoric/cataphoric reference, event reference, extra-textual reference, pleonastic, addressee reference, speaker reference, generic reference, or other function. Labelling pronouns according to their function also helps to resolve instances of functional ambiguity arising from the same pronoun in the source language having multiple functions, each with dif- ferent translation requirements in the target language. The categorisation frame- work is used in corpus annotation, corpus analysis, SMT system development and evaluation. I have directed the annotation and conducted analyses of a parallel corpus of English-German texts called ParCor (Guillou et al., 2014), in which pronouns are manually annotated according to their function. This provides a first step toward understanding the problems that SMT systems face when translating pro- nouns. In the thesis, I show how analysis of manual translation can prove useful in identifying and understanding systematic differences in pronoun use between two languages and can help inform the design of SMT systems. In particular, the anal- ysis revealed that the German translations in ParCor contain more anaphoric and pleonastic pronouns than their English originals, reflecting differences in pronoun use. This raises a particular problem for the evaluation of pronoun translation. Automatic evaluation methods that rely on reference translations to assess pro- noun translation, will not be able to provide an adequate evaluation when the i reference translation departs from the original source-language text. I also show how analysis of the output of state-of-the-art SMT systems can reveal how well current systems perform in translating different types of pronouns and indicate where future efforts would be best directed. The analysis revealed that biases in the training data, for example arising from the use of “it” and “es” as both anaphoric and pleonastic pronouns in both English and German, is a problem that SMT systems must overcome. SMT systems also need to disambiguate the function of those pronouns with ambiguous surface forms so that each pronoun may be translated in an appropriate way. To demonstrate the value of this work, I have developed an automated post- editing system in which automated tools are used to construct ParCor-style anno- tations over the source-language pronouns. The annotations are then used to re- solve functional ambiguity for the pronoun “it” with separate rules applied to the output of a baseline SMT system for anaphoric vs. non-anaphoric instances. The system was submitted to the DiscoMT 2015 shared task on pronoun translation for English-French. As with all other participating systems, the automatic post- editing system failed to beat a simple phrase-based baseline. A detailed analysis, including an oracle experiment in which manual annotation replaces the auto- mated tools, was conducted to discover the causes of poor system performance. The analysis revealed that the design of the rules and their strict application to the SMT output are the biggest factors in the failure of the system. The lack of automatic evaluation metrics for pronoun translation is a limiting factor in SMT system development. To alleviate this problem, Christian Hard- meier and I have developed a testing regimen called PROTEST comprising (1) a hand-selected set of pronoun tokens categorised according to the different prob- lems that SMT systems face and (2) an automated evaluation script. Pronoun translations can then be automatically compared against a reference translation, with mismatches referred for manual evaluation. The automatic evaluation was applied to the output of systems submitted to the DiscoMT 2015 shared task on pronoun translation. This again highlighted the weakness of the post-editing system, which performs poorly due to its focus on producing gendered pronoun translations, and its inability to distinguish between pleonastic and event refer- ence pronouns. ii Lay Summary Pronouns are used frequently in language, to serve different functions. These dif- fer somewhat from language to language, and this poses a problem for Statistical Machine Translation (SMT) systems. Focussing on cases where a pronoun refers to an entity introduced earlier in the text with a noun phrase (its antecedent) – where the pronoun serves an anaphoric function – when translating from En- glish into a language with grammatical gender such as French or German, the anaphoric pronoun must agree with the translation of its antecedent. Despite growing attention to this problem, little progress has been made, and little at- tention has been given to pronouns serving other functions. The central claim of this thesis is that pronouns performing different func- tions in text should be handled differently by SMT systems and when evaluating pronoun translation. This motivates the introduction of a new framework to categorise pronouns according to their function. Labelling pronouns according to their function also helps to resolve instances of functional ambiguity arising from the same pronoun in the source language having multiple functions, each with different translation requirements in the target language. The categorisation framework is used in corpus annotation, corpus analysis, SMT system develop- ment and evaluation. I have directed the annotation and conducted analyses of a collection of paral- lel English-German texts called ParCor, in which pronouns are manually anno- tated according to their function. This provides a first step toward understanding the problems that SMT systems face when translating pronouns. An analysis of manual translation revealed differences in the use of anaphoric and pleonastic (“dummy”) pronouns between English and German. An analysis of the output of state-of-the-art SMT systems highlighted (1) the need to disambiguate the func- tion of those pronouns with ambiguous surface forms so that each pronoun may be translated in an appropriate way and (2) the need to further sub-categorise anaphoric pronouns according to the translation requirements of the target lan- guage. To demonstrate the value of this work, I have developed an automated post- editing system which aims to identify pronouns that have been translated in- correctly and correct them, using rules that differ for anaphoric and for non- anaphoric pronouns. The system was submitted to the DiscoMT 2015 shared iii task on pronoun translation for English-French. As with all other participating systems, my system failed to beat a simple baseline. A detailed analysis revealed that the design of the rules and their strict application to the SMT output are the biggest factors in the failure of the system. The lack of automatic evaluation metrics for pronoun translation is a limiting factor in SMT system development. To alleviate this problem, Christian Hard- meier and I have developed a testing regimen called PROTEST comprising (1) a hand-selected set of pronoun tokens categorised according to the different prob- lems that SMT systems face and (2) an automated evaluation script. Pronoun translations can then be automatically compared against a human-authored refer- ence translation, with mismatches referred for manual evaluation. The automatic evaluation was applied to the output of systems submitted to the DiscoMT 2015 shared task on pronoun translation. This again highlighted the weakness of the post-editing system,
Recommended publications
  • Gender Differences in the Personal Pronouns Usage on the Corpus of Congressional Speeches
    jrds (print) issn 2052-417x jrds (online) issn 2052-4188 Article Gender differences in the personal pronouns usage on the corpus of congressional speeches Dragana Bozic Lenard Abstract Gender differences in language have been extensively investigated by sociolinguists since the 1960s. This paper aimed to study gender differences in the personal pro- nouns usage on the corpus of the 113th United States Congress. All uninterrupted speeches (672 by women and 3,655 by men) whose transcripts were downloaded from the official repository Thomas were analyzed with the text analysis software Linguistic Inquiry and Word Count calculating the degree to which the politicians use personal pronouns. In addition, the computational analysis results were further analyzed with the software for statistical analysis SPSS. The quantitative analysis results pointed to minor statistically significant gender differences in the personal pronouns usage. However, the qualitative analysis showed more subtle gender differ- ences pointing to linguistic changes in stereotypization. Keywords: congressional speeches; gender differences; LIWC; SPSS; personal pronouns 1. Introduction Language is one of the most important means of humans’ expression of thoughts. Guided by thoughts, choices people make in the forms of expres- sion can be paralleled to their perception of things from the real world and consequently the way they express themselves about those things. Hence, two people may be speaking about the same thing with their descriptions being Affiliation University of Osijek, Osijek, Croatia. email: [email protected] jrds vol 3.2 2016 161–188 https://doi.org/10.1558/jrds.30111 ©2017, equinox publishing 162 Gender differences in personal pronouns usage utterly unrelated.
    [Show full text]
  • The Use of Personal Pronouns in Political Speeches a Comparative Study of the Pronominal Choices of Two American Presidents
    School of Language and Literature G3, Bachelors’ Course English Linguistics Course Code: 2EN10E Supervisor: Ibolya Maricic Credits: 15 Examiner: Charlotte Hommerberg Date: May 28, 2012 The Use of Personal Pronouns in Political Speeches A comparative study of the pronominal choices of two American presidents Jessica Håkansson ! !"#$%&'$( The study investigates the pronominal choices made by George W Bush and Barack Obama in their State of the Union speeches. The main focus of the study is on determining whom the two presidents refer to when they use the pronouns I, you, we and they, and to compare the differences in pronominal usage by the two presidents. The results suggest that the pronominal choices of the presidents do not differ significantly. The results also indicate that the pronoun I is used when the speaker wants to speak as an individual rather than as a representative of a group. You is used both as generic pronoun as well as a way for the President to speak to the Congress, without speaking on their behalf. The pronoun we is used to invoke a sense of collectivity and to share responsibility, in most cases it refers to the President and the Congress. They is used to separate self from other; whom the speaker refers to while using they varied greatly between the speakers. The study also showed that the pronominal choices and whom the pronouns refer to vary greatly depending on the context of the speech. Since a great deal of studies on pronominal choices in political interviews and debates already exist, this study can be regarded as significant because it deals with prepared speeches rather than interviews and debates.
    [Show full text]
  • The Development of Dutch Connectives
    The development of Dutch connectives Change and acquisition as windows on form-function relations Published by LOT phone: +31 30 253 6006 Trans 10 fax: +31 30 253 6000 3512 JK Utrecht e-mail: [email protected] The Netherlands http://wwwlot.let.uu.nl/ Cover illustration: photograph of train rails with sidetracks as a representation of developments in language. ISBN 90-76864-77-2 NUR 632 Copyright © 2005: Jacqueline Evers-Vermeul. All rights reserved. The development of Dutch connectives Change and acquisition as windows on form-function relations De ontwikkeling van Nederlandse connectieven Verandering en verwerving als vensters op vorm-functierelaties (met een samenvatting in het Nederlands) Proefschrift ter verkrijging van de graad van doctor aan de Universiteit Utrecht op gezag van de Rector Magnificus, Prof. dr. W.H. Gispen, ingevolge het besluit van het College voor Promoties in het openbaar te verdedigen op dinsdag 24 mei 2005 des middags te 16.15 uur door Jacqueline Evers-Vermeul geboren op 9 december 1973 te Purmerend Promotoren: Prof. dr. T.J.M. Sanders Faculteit der Letteren, Universiteit Utrecht Prof. dr. F.P. Weerman Faculteit der Geesteswetenschappen, Universiteit van Amsterdam Gelukkig is de mens die wijsheid heeft gevonden, een mens die inzicht wint. Wijsheid levert meer op dan zilver, geeft meer profijt dan goud, is kostbaarder dan edelstenen. Alles wat je ooit zou kunnen wensen valt bij de wijsheid in het niet. (Spreuken 3: 13-15, Nieuwe Bijbelvertaling) Happy is the man who finds wisdom, the man who gets understanding. For her good profit is better than getting silver, and her return is better than fine gold.
    [Show full text]
  • Effects of Generic-You in Creating Resonance Between People and Ideas
    “You” speaks to me: Effects of generic-you in creating resonance between people and ideas Ariana Orvella, Ethan Krossb,c, and Susan A. Gelmanb,1 aDepartment of Psychology, Bryn Mawr College, Bryn Mawr, PA 19010; bDepartment of Psychology, University of Michigan, Ann Arbor, MI 48109; and cDepartment of Management and Organizations, Ross School of Business, University of Michigan, Ann Arbor, MI 48109 Contributed by Susan A. Gelman, October 3, 2020 (sent for review May 29, 2020; reviewed by Sandeep Prasada and Timothy D. Wilson) Creating resonance between people and ideas is a central goal of tested this hypothesis across five preregistered studies (24–28), communication. Historically, attempts to understand the factors using a combination of crowd-sourced data and online experi- that promote resonance have focused on altering the content of a mental paradigms. Data, code, and materials are publicly avail- message. Here we identify an additional route to evoking reso- able via the Open Science Framework (https://osf.io/6J2ZC/) nance that is embedded in the structure of language: the generic (29). Study 1 used publicly available data from the Amazon use of the word “you” (e.g., “You can’t understand someone until Kindle application. Studies 2–5 were approved by the University you’ve walked a mile in their shoes”). Using crowd-sourced data of Michigan Health Sciences and Behavioral Sciences institu- from the Amazon Kindle application, we demonstrate that pas- tional review board (IRB) under HUM00172473 and deemed sages that people highlighted—collectively, over a quarter of a exempt from ongoing IRB review. All participants who partici- million times—were substantially more likely to contain generic- pated in studies 2–5 provided informed consent via a checkbox you compared to yoked passages that they did not highlight.
    [Show full text]
  • Subject Ellipsis in Early Child Language
    DOI: http://dx.doi.org/10.5007/2175-8026.2020v73n3p81 SUBJECT ELLIPSIS IN EARLY CHILD LANGUAGE Karina Bertolino1* 1University of Connecticut, Storrs, Connecticut, United States Abstract This paper aims to discuss whether parametric analyses can account for missing subjects in child English. I review previous parametric analyses of missing subjects and present a new learning model built upon a more recent version of the Null Subject Parameter (Holmberg, 2010a). Parametric analyses of the phenomenon have the advantage of giving a simple answer to why children omit subjects and how they reach an adult grammar: the problem is reduced to a matter of parameter missetting and resetting. However, early parametric analyses of missing subjects (Hyams, 1986, 1991) were challenged by empirical findings (Valian, 1991; Wang et al., 1992). The pattern of subject omission in child English also cannot be explained by the new parametric learning model presented in this article. I conclude that missing subjects in English are best analyzed as subject ellipsis (i.e., ‘diary drop’), an option available for English-speaking adults. Keywords: Missing Subjects; Non-Null-Subject Languages; Parameters; Subject Ellipsis; ‘Diary Drop’ * Obtained her PhD in the Department of Linguistics at University of Connecticut. Email: karina.bertolino@ uconn.edu. ORCID: https://orcid.org/0000-0001-5089-4688. Esta obra tem licença Creative Commons 82 Karina Bertolino, Subject ellipsis in early child language Introduction In this review article, I discuss the phenomenon of missing subjects in English-speaking children, arguing that no parametric analysis can capture the pattern of subject omission in early child language. Early parametric accounts of missing subjects (Hyams, 1986, 1991) do not account for the empirical facts (Valian, 1991).
    [Show full text]
  • Leaving Obligations Behind: Epistemic Incrementation in Preschool English
    Language Learning and Development ISSN: 1547-5441 (Print) 1547-3341 (Online) Journal homepage: https://www.tandfonline.com/loi/hlld20 Leaving Obligations Behind: Epistemic Incrementation in Preschool English Ailís Cournane & Ana Teresa Pérez-Leroux To cite this article: Ailís Cournane & Ana Teresa Pérez-Leroux (2020): Leaving Obligations Behind: Epistemic Incrementation in Preschool English, Language Learning and Development, DOI: 10.1080/15475441.2020.1738233 To link to this article: https://doi.org/10.1080/15475441.2020.1738233 Published online: 27 Apr 2020. Submit your article to this journal View related articles View Crossmark data Full Terms & Conditions of access and use can be found at https://www.tandfonline.com/action/journalInformation?journalCode=hlld20 LANGUAGE LEARNING AND DEVELOPMENT https://doi.org/10.1080/15475441.2020.1738233 Leaving Obligations Behind: Epistemic Incrementation in Preschool English Ailís Cournanea and Ana Teresa Pérez-Lerouxb aDepartment of Linguistics, New York University, New York, New York, USA; bDepartments of Spanish & Portuguese and Linguistics, University of Toronto, Toronto, Ontario, Canada ABSTRACT Does language development drive language change? A common account of language change attributes the regularity of certain patterns to children’s learning biases. The present study examines these predictions for change-in -progress in the use of must in Toronto English. Historically, modal verbs like must start with root (deontic) meanings, eventually developing epistemic (probability) meanings in addition. Epistemic uses increase over successive generations, phasing out root uses (INCREMENTATION). The modal becomes unambiguously epistemic and eventually disappears from the language. Such cyclic changes are predictable and common across languages. To explore whether children contribute to incrementation and loss, we tested intuitions about must in preschoolers (n = 141) and adults (n = 29).
    [Show full text]
  • English for Practical Purposes 9
    ENGLISH FOR PRACTICAL PURPOSES 9 CONTENTS Chapter 1: Introduction of English Grammar Chapter 2: Sentence Chapter 3: Noun Chapter 4: Verb Chapter 5: Pronoun Chapter 6: Adjective Chapter 7: Adverb Chapter 8: Preposition Chapter 9: Conjunction Chapter 10: Punctuation Chapter 11: Tenses Chapter 12: Voice Chapter 1 Introduction to English grammar English grammar is the body of rules that describe the structure of expressions in the English language. This includes the structure of words, phrases, clauses and sentences. There are historical, social, and regional variations of English. Divergences from the grammardescribed here occur in some dialects of English. This article describes a generalized present-dayStandard English, the form of speech found in types of public discourse including broadcasting,education, entertainment, government, and news reporting, including both formal and informal speech. There are certain differences in grammar between the standard forms of British English, American English and Australian English, although these are inconspicuous compared with the lexical andpronunciation differences. Word classes and phrases There are eight word classes, or parts of speech, that are distinguished in English: nouns, determiners, pronouns, verbs, adjectives,adverbs, prepositions, and conjunctions. (Determiners, traditionally classified along with adjectives, have not always been regarded as a separate part of speech.) Interjections are another word class, but these are not described here as they do not form part of theclause and sentence structure of the language. Nouns, verbs, adjectives, and adverbs form open classes – word classes that readily accept new members, such as the nouncelebutante (a celebrity who frequents the fashion circles), similar relatively new words. The others are regarded as closed classes.
    [Show full text]
  • Generic Du (‘You’) As a Case of Informalization and Subjectification in Swedish Skärlund, Sanna
    Does you mean I? Generic du (‘you’) as a case of informalization and subjectification in Swedish Skärlund, Sanna Published in: Norsk Lingvistisk Tidsskrift 2017 Document Version: Publisher's PDF, also known as Version of record Link to publication Citation for published version (APA): Skärlund, S. (2017). Does you mean I? Generic du (‘you’) as a case of informalization and subjectification in Swedish. Norsk Lingvistisk Tidsskrift, 35(1), 129-146. Total number of authors: 1 General rights Unless other specific re-use rights are stated the following general rights apply: Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. • Users may download and print one copy of any publication from the public portal for the purpose of private study or research. • You may not further distribute the material or use it for any profit-making activity or commercial gain • You may freely distribute the URL identifying the publication in the public portal Read more about Creative commons licenses: https://creativecommons.org/licenses/ Take down policy If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim. LUND UNIVERSITY PO Box 117 221 00 Lund +46 46-222 00 00 NLT 2017-1 ombrukket 3.qxp_Layout 1 17.07.2017 21.30 Side 129 Norsk Lingvistisk Tidsskrift · Årgang 35 · 2017 129–146 Does you mean I? Generic du (‘you’) as a case of informalization and sub- jectification in Swedish Sanna Skärlund In Swedish today, du (‘you’) is sometimes used with generic reference.
    [Show full text]
  • Modality in New Englishes: a Corpus-Based Study of Obligation and Necessity
    Modality in New Englishes: A Corpus-Based Study of Obligation and Necessity Inaugural-Dissertation zur Erlangung der Doktorwürde der Philologischen Fakultät der Albert-Ludwigs-Universität Freiburg i. Br. vorgelegt von Gabriela Veronica Diaconu aus Brașov SS 2011 Erstgutachter: Prof. Dr. Dr. h.c. Christian Mair Zweitgutachter: Prof. Dr. Bernd Kortmann Vorsitzender des Promotionsausschusses der Gemeinsamen Kommission der Philologischen, Philosophischen und Wirtschafts- und Verhaltenswissenschaftlichen Fakultät: Prof. Dr. Hans-Helmuth Gander Datum der Fachprüfung im Promotionsfach: 24.01.2012 Contents List of tables IV List of figures V List of abbreviations VI Acknowledgements VIII 1 Introduction 1 1.1 Modal usage and change in progress in new standard Englishes: an overview 1 1.2 A corpus-based study of modal variation and change in progress 5 1.3 Aims and scope 9 2 The dynamics of the New Englishes 14 2.1 Introduction 14 2.2 Terminology and categorization 14 2.3 The New Englishes in linguistic research 17 2.4 Summary 20 3 The emergence of new standard Englishes in Ireland, Jamaica and India 21 3.1 Introduction 21 3.2 Irish English 22 3.2.1 The shift from Irish to English 22 3.2.2 Standard Irish English as a linguistic research area 23 3.3 Jamaican English 25 3.3.1 The creole-continuum model and the standard norm 25 3.3.2 English as an emerging standard in contemporary Jamaica 28 3.4 Indian English 31 3.4.1 The spread of English in India 31 3.4.2 Recent research in standard English in India 33 3.5 Summary 35 4 Theoretical and methodological
    [Show full text]
  • It's All About You in Dutch
    It’s all about you in Dutch Helen de Hoop and Sammie Tarenskeen Radboud University Nijmegen Abstract Although second person pronouns are typically thought of as referring to the addressee, we find that in almost half the cases the personal subject pronoun je ‘you’ gets a generic (impersonal) interpretation in the Spoken Dutch Corpus. A further examination of the contexts in which these two readings of you arise reveals that 66% receives a generic and 34% a deictic reading in declaratives, but only 12% of readings is generic, and 88% is deictic for questions. This frequency distribution verifies our hypothesis that deictic second person subjects are typically used in interactive discourse, whereas generic second person subjects are mostly used in descriptive language. We assume that second person pronouns get a deictic reading via self-ascription by the addressee (following Wechsler 2010), but that this reading can be overruled in favor of a generic reading when a deictic reading does not fit the context. The mechanism of self-ascription upon hearing a second person pronoun can explain why the generic use of you seems to evoke a reading of empathy often reported in the literature (e.g., Malamud 2012). Keywords: second person; generic reading; deictic reading; corpus study; self- ascription 1 1. Introduction The popular hashtag yolo, an abbreviation for “you only live once”, is often used in tweets these days. The second person pronoun you in this expression gets a generic interpretation, as it does not directly refer to the addressee, but rather denotes people in general, including the speaker, that is, the person who wrote the tweet (van der Auwera et al 2012; Gast et al, submitted).
    [Show full text]
  • Effects of Generic-You in Creating Resonance Between People and Ideas
    “You” speaks to me: Effects of generic-you in creating resonance between people and ideas Ariana Orvella, Ethan Krossb,c, and Susan A. Gelmanb,1 aDepartment of Psychology, Bryn Mawr College, Bryn Mawr, PA 19010; bDepartment of Psychology, University of Michigan, Ann Arbor, MI 48109; and cDepartment of Management and Organizations, Ross School of Business, University of Michigan, Ann Arbor, MI 48109 Contributed by Susan A. Gelman, October 3, 2020 (sent for review May 29, 2020; reviewed by Sandeep Prasada and Timothy D. Wilson) Creating resonance between people and ideas is a central goal of tested this hypothesis across five preregistered studies (24–28), communication. Historically, attempts to understand the factors using a combination of crowd-sourced data and online experi- that promote resonance have focused on altering the content of a mental paradigms. Data, code, and materials are publicly avail- message. Here we identify an additional route to evoking reso- able via the Open Science Framework (https://osf.io/6J2ZC/) nance that is embedded in the structure of language: the generic (29). Study 1 used publicly available data from the Amazon use of the word “you” (e.g., “You can’t understand someone until Kindle application. Studies 2–5 were approved by the University you’ve walked a mile in their shoes”). Using crowd-sourced data of Michigan Health Sciences and Behavioral Sciences institu- from the Amazon Kindle application, we demonstrate that pas- tional review board (IRB) under HUM00172473 and deemed sages that people highlighted—collectively, over a quarter of a exempt from ongoing IRB review. All participants who partici- million times—were substantially more likely to contain generic- pated in studies 2–5 provided informed consent via a checkbox you compared to yoked passages that they did not highlight.
    [Show full text]
  • “You” and “I” in a Foreign Land the Persuasive Force of Generic-You
    Journal of Experimental Social Psychology 85 (2019) 103869 Contents lists available at ScienceDirect Journal of Experimental Social Psychology journal homepage: www.elsevier.com/locate/jesp ☆ “You” and “I” in a foreign land: The persuasive force of generic-you T ⁎ Ariana Orvell , Ethan Kross, Susan A. Gelman Department of Psychology, University of Michigan, United States of America ARTICLE INFO ABSTRACT Keywords: Norms help people navigate their social lives, dictating what behaviors are typical, expected, or valued in a given Norms context. Here we suggest that a subtle linguistic cue—the generic usage of the word “you” (i.e., “you” that refers Persuasion to people in general rather than to one or more specific individuals) carries persuasive force, influencing how Language people discern unfamiliar norms. Across five experiments (N = 800), people endorsed unfamiliar behaviors Nudges described with generic-you (e.g., “You share them with your friends”) as more normatively correct than beha- Generics viors expressed with ‘I’ (e.g., “I share them with my family”) or third-person singular pronouns (“he” or “she”; e.g., “She shares them with her family”). These effects persisted even when participants were told that their informants were highly knowledgeable about the norms. Together, these findings indicate that generic-you functions as a linguistic nudge that carries persuasive force. Broadly, they add to a growing body of literature on how subtle linguistic shifts can meaningfully influence social cognition. 1. Introduction ambiguous, the actions of others are particularly important for sig- naling what is normative (Asch, 1951; Goldstein & Cialdini, 2007; Imagine that you just set foot in a foreign country.
    [Show full text]