Reconsidering Annotator Disagreement About Racist Language: Noise Or Signal?

Total Page:16

File Type:pdf, Size:1020Kb

Reconsidering Annotator Disagreement About Racist Language: Noise Or Signal? Reconsidering Annotator Disagreement about Racist Language: Noise or Signal? Savannah Larimore Ian Kennedy Washington University in St. Louis University of Washington Breon Haskett Alina Arseniev-Koehler University of Washington University of California - Los Angeles Abstract 2016). Indeed, social scientists have long ac- knowledged the difficulties of measuring racism, An abundance of methodological work aims even when using traditional social science meth- to detect hateful and racist language in text. However, these tools are hampered by prob- ods (e.g., interviews and surveys), due to social lems like low annotator agreement and remain desirability biases and the increasingly covert na- largely disconnected from theoretical work on ture of racism (Bonilla-Silva, 2006). race and racism in the social sciences. Using In this paper, we reconsider efforts to anno- annotations of 5188 tweets from 291 annota- tate for racism in light of sociological work on tors, we investigate how annotator perceptions race and racism. Instead of generalizing our de- of racism in tweets vary by annotator racial tection of racism, we narrow our scope to focus identity and two text features of the tweets: on anti-Black racism. In other words, we focus relevant keywords and latent topics identified through structural topic modeling. We provide on racialized language directed at or centering on 1 a descriptive summary of our data and estimate Black Americans. Using human annotations for a series of linear models to determine if anno- the racial sentiment (positive or negative) of 5188 tator racial identity and our 12 topics, alone or Tweets, we describe how ratings vary by annota- in combination, explain the way racial senti- tors’ own racial identity. Our findings suggest that ment was annotated, net of relevant annotator White raters respond differently to particular types characteristics and tweet features. Our results of racialized language on Twitter, as identified by show that White and non-White annotators ex- hibit significant differences in ratings when structural topic modeling (STM), than non-White reading tweets with high prevalence of certain raters. Failing to account for this systematic dif- racially-charged topics. We conclude by sug- ference could lead researchers to consider tweets gesting how future methodological work can which non-White annotators identify as including draw on our results and further incorporate so- negative sentiment as innocuous because more nu- cial science theory into analyses. merous White annotators rate those same tweets as positive or neutral. We conclude by suggesting 1 Introduction several ways in which future work can account for Hateful and racist language is abundant on so- the variability in annotations that comes from an- cial media platforms like Twitter and a growing notators’ own racial identities. body of work aims to develop tools to detect such 1.1 Annotating for Racism language in these spaces. Such a tool would of- fer opportunities to intervene, like providing au- Collecting annotations is a key step to develop- tomatic trigger warnings, and would provide a ing a tool to detect racial sentiment in text data. powerful barometer to measure racism. How- At the same time, the challenges of this task have ever, these efforts are hampered by low inter- been well-documented as ratings depend upon the rater agreement, low modeling performance, and ability of human annotators to consistently iden- a lack of consensus on what counts as racist lan- tify the racial sentiment of words and phrases (Zou guage (e.g., Kwok and Wang, 2013; Burnap and and Schiebinger, 2018). Empirical work often Williams, 2016; Waseem, 2016; Schmidt and Wie- finds large amounts of disagreement in these anno- gand, 2017). These efforts are also largely dis- tations, even with carefully designed annotations connected from rich understandings of race and 1We use the terms rating/annotation and rater/annotator racism in the social sciences (but, see Waseem, interchangeably throughout. schemes (e.g., Bartlett et al., 2014; Schmidt and color tended to rate these same memes as more of- Wiegand, 2017). fensive than White students. That is, while White As these efforts have shown, perceptions of students could identify that a meme was racist, racial sentiment are contextual and subjective, they rated the level of offensiveness lower than making the prevalence of racism in text sources students of color. In addition, Tynes and Markoe inherently difficult to detect. Recent social scien- (2010) find that European American college stu- tific work (Bonilla-Silva, 2006; Carter and Mur- dents were less likely than African American col- phy, 2015; Krysan and Moberg, 2016; Tynes and lege students to react negatively to racially-themed Markoe, 2010) has taken that difficulty as its sub- party images on social media. Furthermore, Euro- ject, and sought to capture and understand the pean American students reported higher levels of variation in perceptions of racism or racial senti- color-blind racial attitudes and students with lower ment, by showing how individuals’ perceptions of levels of color-blind attitudes were more likely to racism vary systematically based on the their own react as “bothered” by the images, implying that racial identity. These findings suggest that anno- both race and racial attitudes influence percep- tator disagreement is not merely noise to smooth tions of racism online. Similarly, Daniels(2009) over. Rather, annotator disagreement for racism finds that critical race literacy matters more than includes important variation that should be dis- internet literacy in identifying racially biased or aggregated and accounted for. “cloaked” websites (i.e., websites that appear to promote racial justice but are actually maintained 1.2 Varying Perceptions of Racism by White supremacist organizations). This find- Differential attitudes about and perceptions of ing suggests that students who lack a critical race racism based on an individuals’ own racial iden- consciousness may be less likely to identify racist tity are well-documented. White Americans tend materials online and that White students may be to hold more overtly racist beliefs, are less likely to particularly susceptible. believe racial discrimination is prevalent in mod- The subtlety of racism that pervades social me- ern society, and are less likely to recognize racial dia sites like Twitter may also influence percep- microaggressions than Black Americans (Bonilla- tions of racism. As Carter and Murphy(2015) Silva, 2006; Carter and Murphy, 2015; Krysan and note, Whites tend to focus only on blatant, overt Moberg, 2016; Tynes and Markoe, 2010). In ad- forms of racism (e.g., racial slurs, mentions of dition, Krysan and Moberg(2016) note that White racial violence) but are less attuned to microag- Americans increasingly disregard racial topics on gressions and other, subtler forms of racism. As questionnaires, signaling that they have “no in- such, scholars have also advocated for a method- terest” in issues of racial inequality. Likewise, ological move away from “bag of words” ap- fewer White Americans agree that racial inequal- proaches to the evaluation of racism on social ity is due to overt discrimination, arguing instead media (Watanabe et al., 2018) because these ap- that racial discrimination is a thing of the past and proaches reinforce a focus on blatant, overt forms that in contemporary society, everyone has equally of racism, and neglect more subtle, or contextually fair life chances (Bonilla-Silva, 2006). As Carter racist tweets (Chaudhry, 2015). and Murphy(2015) note, White and Black Amer- Similarly, Kwok and Wang(2013), noting the icans may differ in their views of racial inequality subtlety of racism pervading social media posts, because White Americans compare contemporary argue that to get evaluations of tweets that accu- racial inequalities to the past, referencing slavery rately assesses meaning, features of tweets other and Jim Crow and naturally concluding that condi- than the text must be included. Tweet features set tions have improved, while Black Americans com- the rules of engagement by offering markers of pare the present to an imagined future, in which an credibility, sarcasm, and persuasiveness (Sharma ideal state of racial equality has been achieved. and Brooker, 2016; Hamshaw et al., 2018). Tweet These differences also extend to how we per- features such as links, hashtags, and number of ceive racism in online platforms. Williams et al. comments have been shown to illuminate the con- (2016) find that while White students were equally text of the tweet’s message (Burnap and Williams, as likely as students of color to perceive racially- 2016). The inclusion of these features in evalua- themed internet memes as offensive, students of tion offers deeper context and more realistic eval- uation of tweets allowing for greater attention to Project (DOLLY; a repository all geotagged tweets the differential evaluations of people in racially since December 2011) (Morcos et al.). For both marginalized groups engaging with the social me- projects, tweets were restricted to those that were dia platform. sent from the contiguous United States, were writ- Here, we expand on previous research by in- ten in English, and contained at least one keyword vestigating how annotator racial identity and tweet relevant to the analysis. To limit our sample to features interact to influence perceptions of racism tweets that concerned Black Americans, we used on Twitter. Our analysis builds on previous common hate speech terms, the term “black girl research that uses racist speech as a stimulus magic”, and the same keywords to identify tweets (Leets, 2001; Cowan and Hodge, 1996; Tynes and about Black Americans as Flores(2017). Markoe, 2010), either in print or digital media, For the second project, tweets were also re- and calls for renewed attention to variations based stricted to a 10% sample of all tweets sent from on annotator racial identity and how these varia- 19 metropolitan statistical areas between January tions ultimately influence instruments to measure 2012 and December 2016. While both data collec- racial sentiment.
Recommended publications
  • POLITICAL SCIENCE 444-544-SEM B2– the POLITICS of SOCIAL JUSTICE (Winter Term 2018)
    DEPARTMENT OF POLITICAL SCIENCE https://uofa.ualberta.ca/political-science POLITICAL SCIENCE 444-544-SEM B2– THE POLITICS OF SOCIAL JUSTICE (Winter Term 2018) Instructor: Dr. Malinda S. Smith Time: Thursday, 12:30-3:20 pm Office: 10-23 Tory Building Class Location: 10-4 Tory Building Office Hours: Tues, 1:30-2:30PM, or by appointment Email: [email protected] Office: 780.492.5380 I. COURSE DESCRIPTION AND OBJECTIVES: Global Critical Race Theory explores conceptions of race, racialization and anti-racism from various perspectives in political and social theory, recognizing that such knowledge and understanding are important for making sense of the ambiguities and complexities of political life in modern settler colonial states. Although power and privilege in the polis long have been shaped by ideas about race and its intersections with, for example, gender, class, sexuality, and disabilities – the discipline of political science has not always been in the forefront of theoretical efforts to make sense of this fundamental aspect of the human condition. It is in this context that Critical Race Theory (CRT) emerged as an important body of scholarship first in law and later across the social sciences and humanities. Rather than one perspective or approach, CRT is a constellation of theoretical standpoints that self-consciously views the ‘race question’ and questions about race and intersectionality through a critical lens, posing new questions on the persistence if not magnification of race and the ‘colour line’ in the twenty-first century and an ostensibly post-racial and colour-blind era. This course guides students through anti- racism, intersectionality, settler colonial and the implications for a decolonial politics.
    [Show full text]
  • Tnpsc Group-2 Current Affairs 2018
    TNPSC GROUP-2 CURRENT AFFAIRS 2018 TAMIL NADU AISHE 2016-17: TAMIL NADU LEADS IN GROSS ENROLMENT RATIO Prakash Javadekar released All India Survey on Higher Education (AISHE) report 2016- 2017. Highlights of All India Survey on Higher Education (2016-2017): As per the findings of the survey, overall enrolment in higher education institutions across India has increased from 27.5 million in 2010-11 to 35.7 million in 2016-17. 1. Gross Enrolment Ratio (GER) has increased from 24.5% in 2015-16 to 25.2 in 2016-17. 2. Gender Parity Index (GP1) too has improved from 0.86 to 0.94. 3. Tamil Nadu topped in terms of Gross Enrolment Ratio (GER) in higher education. Tamil Nadu recorded 46.9% GER. 4. Among Union territories, Chandigarh topped with 56.1% GER. 5. Bihar posted lowest GER of 14.4%. RS PURAM POLICE STATION IN TAMIL NADU RANKS INDIA'S BEST POLICE STATION R.S. Puram Police station in Coimbatore, Tamil Nadu has been adjudged as the best police station in India in SMART Police Stations initiative of Union Ministry of Home Affairs (MHA), • R.S. Puram Police Station House Officer T. JOTHI, received the award from Union Home Minister Rajnath Singh on January 6, 2018 at the all India conference of Director General/Inspector General of Police held at BSF Academy Tekanpur in Madhya Pradesh. • Abbreviation SMART in SMART Police stands for S-Sensitive and Strict; M-Modern with mobility; A- Alert and Accountable; R- Reliable and Responsive; T- Trained and Techno-savvy. 2 | P a g e SHANMUGAM IAS ACADEMY: 46/1, Gokhale St, Opp Senthil Kumaran Theatre, Ram Nagar, Coimbatore, Tamil Nadu 641009 WEBSITE: iasipstnpsc.in TNPSC GROUP-2 CURRENT AFFAIRS 2018 GOVERNMENT STARTS PROCESS TO BUILD INDIA'S FIRST DEFENCE 'QUAD' Union Defence Ministry has started the process to set up India's first defence industrial 'Quad' (quadrilateral corridor) linking Chennai with four other cities of Tamil Nadu to promote industries manufacturing weapons and military equipments.
    [Show full text]
  • Hashtags for Change: Can Twitter Promote Social Progress in Saudi Arabia
    International Journal of Communication 8 (2014), 943–961 1932–8036/20140005 #Hashtags for Change: Can Twitter Promote Social Progress in Saudi Arabia IRFAN CHAUDHRY University of Alberta, Canada Since the Arab Spring uprisings in 2011, Twitter has proven to be a useful mobilization tool for citizens. The power of Twitter to mobilize citizens (as seen in the Arab Spring) worries some governments. In response, a number of countries have begun to censor access to Internet technology. The Saudi monarchy, for example, issued a decree banning the reporting of news that contradicts sharia (Islamic) law, undermines national security, promotes foreign interests, or slanders religious leaders. A key question requiring further examination is why the Saudi government issued this decree. Are these controls in place to manage the Kingdom of Saudi Arabia’s political image on a global level, or are they in place to regulate the morality of its citizens at the local level? Drawing upon the work of Manuel Castells and his discussion of network power, this article asks: Can Twitter usage promote social progress in Saudi Arabia? Keywords: Saudi Arabia, Twitter, communication technology, social progress, gender, #Women2Drive Since the Arab Spring uprisings in 2011, social media has proven to be a useful mobilization tool for citizens to protest perceived injustices. In the Tunisian and Egyptian revolutions, the microblogging site Twitter emerged as a key source for real-time logistical coordination, information, information sharing, and discussion (Lotan et al., 2011). Twitter helps you “helps you create and share ideas and information instantly, without barriers” (About Twitter, 2013). Twitter’s global nature and ability to connect people anywhere in the world through hashtags and retweets1 make it possible for people to share information on topics as mundane as what to cook for this evening’s dinner to something as spectacular as the fall of an authoritarian government regime.
    [Show full text]
  • By James King B.A., Samford University, 2006 M.L.I.S., University
    THE STRUGGLE CONTINUES: ARCHIVAL APPROACHES TO CIVIL RIGHTS IN NORTHERN IRELAND AND THE AMERICAN SOUTH by James King B.A., Samford University, 2006 M.L.I.S., University of Alabama, 2007 M.A., Boston College, 2009 Submitted to the Graduate Faculty of School of Computing and Information in partial fulfillment of the requirements for the degree of Doctor of Philosophy University of Pittsburgh 2018 UNIVERSITY OF PITTSBURGH SCHOOL OF COMPUTING AND INFORMATION This dissertation was presented by James King It was defended on November 16, 2017 and approved by Dr. Sheila Corrall, Professor, Library and Information Science Dr. Andrew Flinn, Reader in Archival Studies and Oral History, Information Studies, University College London Dr. Alison Langmead, Associate Professor, Library and Information Science Dissertation Advisor: Dr. Richard J. Cox, Professor, Library and Information Science ii Copyright © by James King 2018 iii THE STRUGGLE CONTINUES: ARCHIVAL APPROACHES TO CIVIL RIGHTS IN NORTHERN IRELAND AND THE AMERICAN SOUTH James King, PhD University of Pittsburgh, 2018 When police and counter-protesters broke up the first march of the Northern Ireland Civil Rights Association (NICRA) in August 1968, activists sang the African American spiritual, “We Shall Overcome” before disbanding. The spiritual, so closely associated with the earlier civil rights struggle in the United States, was indicative of the historical and material links shared by the movements in Northern Ireland and the American South. While these bonds have been well documented within history and media studies, the relationship between these regions’ archived materials and contemporary struggles remains largely unexplored. While some artifacts from the movements—along with the oral histories and other materials that came later—remained firmly ensconced within the archive, others have been digitally reformatted or otherwise repurposed for a range of educational, judicial, and social projects.
    [Show full text]
  • Demographic Word Embeddings for Racism Detection on Twitter
    Demographic Word Embeddings for Racism Detection on Twitter Mohammed Hasanuzzaman Gael¨ Dias Andy Way ADAPT Centre GREYC UMR 6072 ADAPT Centre Dublin City University Universite´ de Caen Normandie Dublin City University Ireland France Ireland Abstract depression, anxiety, and emotional stress as well as negative physical health outcomes such as high Most social media platforms grant users blood pressure and low infant birth weight. freedom of speech by allowing them to As the information contained in social media freely express their thoughts, beliefs, and often reflects the real-world experiences of their opinions. Although this represents in- users, there is an increased expectation that the credible and unique communication op- norms of society will also apply in social media portunities, it also presents important chal- settings. As such, there is an increasing demand lenges. Online racism is such an exam- for social media platforms to empower users with ple. In this study, we present a super- the tools to report offensive and hateful content vised learning strategy to detect racist lan- (Oboler and Connelly, 2014). guage on Twitter based on word embed- Hateful content can be defined as ”speech or ex- ding that incorporate demographic (Age, pression which is capable of instilling or inciting Gender, and Location) information. Our hatred of, or prejudice towards, a person or group methodology achieves reasonable classi- of people on a specified ground, including race, fication accuracy over a gold standard nationality, ethnicity, country of origin, ethno- dataset (F1=76.3%) and significantly im- religious identity, religion, sexuality, gender iden- proves over the classification performance tity or gender” (Gelber and Stone, 2007).
    [Show full text]
  • Master Bibliography for Sports in Society, 1994–2009
    1 MASTER BIBLIOGRAPHY Sports In Society: Issues & Controversies, 1994–2015 (5,089 references) Jay Coakley This bibliography is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License. CSPS Working Papers may be distributed or cited as long as the author(s) is/are appropriately credited. CSPS Working Papers may not be used for commercial purposes or modified in any way without the permission of the author(s). For more information please visit: www.creativecommons.org/licenses/by-nc-nd/3.0/. 60 Minutes; ―Life after the NFL: Happiness.‖ Television program, (2004, December); see also, www.cbsnews.com/stories/2004/12/16/60minutes/main661572.shtml. AAA. 1998. Statement on ―Race.‖ Washington, DC: American Anthropological Association. www.aaanet.org/stmts/racepp.htm (retrieved June, 2005). AAHPERD. 2013. Maximizing the benefits of youth sport. Journal of Physical Education, Recreation & Dance 84(7): 8-13. Abad-Santos, Alexander. 2013. Everything you need to know about Steubenville High's football 'rape crew'. The Atlantic Wire (January 3): http://www.theatlanticwire.com/national/2013/01/steubenville-high-football-rape- crew/60554/ (retrieved 5-22-13). Abdel-Shehid, Gamal, & Nathan Kalman-Lamb. 2011. Out of Left Field: Social Inequality and Sport. Black Point, Nova Scotia: Fernwood Publishing Abdel-Shehid, Gamal. 2002. Muhammad Ali: America‘s B-Side. Journal of Sport and Social Issues 26, 3: 319–327. Abdel-Shehid, Gamal. 2004. Who da man?: black masculinities and sporting cultures. Toronto: Canadian Scholars‘ Press. www.cspi.org. Abney, Roberta. 1999. African American women in sport. Journal of Physical Education, Recreation & Dance 70(4), 35–38.
    [Show full text]
  • Computer-Assisted Understanding of Stance in Social Media: Formalizations, Data Creation, and Prediction Models
    Computer-assisted Understanding of Stance in Social Media: Formalizations, Data Creation, and Prediction Models Von der Fakult¨at fur¨ Ingenieurwissenschaften Abteilung Informatik und Angewandte Kognitionswissenschaft der Universit¨at Duisburg-Essen zur Erlangung des akademischen Grades Doktor der Ingenieurwissenschaften genehmigte Dissertation von Michael Maximilian Wojatzki aus Bad Mergentheim 1. Gutachter: Prof. Dr.-Ing. Torsten Zesch 2. Gutachter: Prof. Dr. rer. nat. Chris Biemann Tag der mundlichen¨ Prufung:¨ 21.01.2019 This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA 4.0). cbna Danksagung Die vorliegende Dissertation ist das Resultat eines langen Weges bestehend aus Schule, Studium, Promotion und praktischer Aktivit¨aten. Auf diesem Weg haben mich eine Vielzahl von Personen begleitet ohne die ich diesen nicht h¨atte beschreiten k¨onnen und denen ich nun danken m¨ochte: Zuallererst und mit Nachdruck m¨ochte ich mich bei Torsten Zesch bedanken. Seit Torsten mich im Studium in die schwarze Magie des machine learnings eingefuhrt¨ hat, war er mir stets ein Mentor, der mich immer ermutigt hat, sich nicht mit oberfl¨achlichen Antworten zufrieden zu geben. W¨ahrend meiner Zeit als studentische Hilfskraft und als Doktorand habe ich von Torsten unglaublich viel uber¨ wissenschaftliches Arbeiten, Informatik, NLP aber auch daruber¨ gelernt, das große Ganze nicht aus den Augen zu verlieren. Ich bedanke mich bei Chris Biemann, der mir wertvolles Feedback gegeben hat und mit dem ich ein spannendes GermEval 2017 erleben durfte. Mein Dank gilt der DFG, dem Graduiertenkolleg UCSM und der Universit¨at Duisburg-Essen, die Sachmittel und ein produktives Arbeitsumfeld zur Verfugung¨ gestellt haben.
    [Show full text]
  • FINAL TFGBV Paper
    Page | 1 Copyright © 2021 Women’s Legal Education and Action Fund (LEAF) Author Nicole Biros-Bolton Published by Women’s Legal Education and Action Fund (LEAF) 180 Dundas Street West, Suite 1420 Toronto, Ontario, Canada M5G 1C7 www.leaf.ca LEAF is a national, charitable, non-profit organization, founded in 1985. LEAF works to advance the substantive equality rights of women and girls in Canada through litigation, law reform and public education using the Canadian Charter of Rights and Freedoms. This publication was created as part of LEAF's Feminist Strategic Litigation (FSL) Project. The FSL Project examines the use and impact of feminist strategic litigation to help LEAF, feminists, and gender equality advocates more effectively combat systemic discrimination and oppression. Page | 2 Table of Contents 1 | Overview _________________________________________________________________________3 Terminology ______________________________________________________________________4 Methodology _____________________________________________________________________4 2 | How Technologies are Used to Harm Women ____________________________________________6 InOmate Partner Violence and Harassment _______________________________________________6 Technology-Facilitated Stalking _________________________________________________________7 Threatening and Aggressive Acts Over the Internet TargeOng Women and PerpetraOng IntersecOonal DiscriminaOon ______________________________________________________________________8 Online Harassment ________________________________________________________________9
    [Show full text]
  • Sbi Po & Clerk Mains
    IBPS G.K. CAPSULE SBI PO & CLERK MAINS PO & Clerk MAINS Special Current Affairs Static G.K. India's trusted Educational Blog Banking Awareness www.ibtindia.com www.makemyexam.in CURRENT AFFAIRS Classical Dances of India 71 National News 1 Folk Dances in India 71 State News 7 United Nations Headquarters 72 International News 12 Important Airports in India 72 Central Government Schemes 16 Nick Names of Important Indian Places 73 State Government Schemes 19 Temples in India 74 Bilateral Relations 20 Important Revolutions in India 75 Banking News 22 Stadiums in India 75 Important Meetings, Summits and Conference Some Important Facts related to Census 76 24 Important Wildlife Sanctuaries 77 Science & technology and Health 30 12 Major Ports of India 80 Defence 32 Important Dams in India 81 Important Index 34 Important Boundaries 81 MoUS and Agreements 35 BANKING AND FINANCIAL AWARENESS Important Days 38 Banking and Finance News 82 Sports 42 Reserve Bank of India (RBI) 85 Awards and Prizes 48 Schemes Related to Banking & Finance 86 Appointments 52 Masala Bond, Green Bond & Junk Bond 88 Person in News 57 Books and Authors 58 List of Three Big Credit Rating Agencies 90 Obituaries 59 List of Important Committees 91 STATIC G.K. Monetary Policy 92 Bank Accounts 94 Countries | Capital | Currency 62 Banking Abbreviations 95 State | Capital | Chief Minister | Governor 63 Financial Instruments 97 Awards 64 Indian Railways / Metro in India 66 Differentiated Banks in India 103 Important Days 66 Cheques 104 Index/Reports 67 Basel Norms 105 Important National Parks in India 68 Important Financial Institutions of India 106 Important Power Plants in India 70 Banks with Headquarters & Tag Lines 107 G.K.
    [Show full text]
  • Empowering Uncertainty Resolution for Marginalized Populations Through Social Technologies
    Empowering Uncertainty Resolution for Marginalized Populations through Social Technologies Alexandra Akiye To Human-Computer Interaction Institute School of Computer Science Carnegie Mellon University Ph.D. Thesis CMU-HCII-20-101 September, 2020 Committee Dr. Jessica Hammer (co-chair), Human-Computer Interaction Institute/Entertainment Technology Center, Carnegie Mellon University Dr. Geoff Kaufman (co-chair), Human-Computer Interaction Institute, Carnegie Mellon University ​ Dr. Jason Hong, Human-Computer Interaction Institute, Carnegie Mellon University ​ Dr. Kevin Jarbo, Social & Decision Science/Center for Student Diversity & Inclusion, Carnegie Mellon University ​ Dr. Kody Manke, Psychology, Carnegie Mellon University ​ 1 For my sisters Robyn, Rebecca, and Caroline 2 Acknowledgements As many have said before me, a PhD is a communal effort. There are countless people who have contributed to my development as a scholar, researcher, activist, and community member and who have both directly and indirectly contributed to this PhD thesis research. Thank you to: - My family members: my grandparents Reiko and Michio Yoshizu, Rosemary and Stanley To, sisters Robyn, Rebecca, Caroline, and parents Joanne and John To. You have supported my career and my academic pursuits, cheered on my every accomplishment, and pushed me through every rejection and failure, all usually while laughing with and at me for taking myself so seriously - My advisors, Jessica Hammer and Geoff Kaufman, whose confidence in my ability, support through my struggle, mentorship, and friendship allowed me to be brave and bold - My partner Joe, for seeing me and hearing me - My incredible committee members, Kevin Jarbo, Kody Manke, Jason Hong - My lab mates in the OH! Lab, Ludolab, eHeart Lab, and Data-Driven Diversity (D3) Lab - Michael Bernstein, Daniela Retelny, and Scott Klemmer for each getting me interested and involved in research, pursuing a PhD, and human-computer interaction respectively - My colleagues, students, and collaborators Ihudiya Finda Ogbonnaya-Ogburu, Angela D.
    [Show full text]
  • An Empirical Study of Offensive Language in Online Interactions
    Rochester Institute of Technology RIT Scholar Works Theses 5-2021 An Empirical Study of Offensive Language in Online Interactions Diptanu Sarkar [email protected] Follow this and additional works at: https://scholarworks.rit.edu/theses Recommended Citation Sarkar, Diptanu, "An Empirical Study of Offensive Language in Online Interactions" (2021). Thesis. Rochester Institute of Technology. Accessed from This Thesis is brought to you for free and open access by RIT Scholar Works. It has been accepted for inclusion in Theses by an authorized administrator of RIT Scholar Works. For more information, please contact [email protected]. An Empirical Study of Offensive Language in Online Interactions by Diptanu Sarkar A thesis submitted in partial fulfillment of the requirements for the degree of Master of Science in Computer Science Department of Computer Science B. Thomas Golisano College of Computing and Information Sciences Rochester Institute of Technology Rochester, New York May 2021 An Empirical Study of Offensive Language in Online Interactions by Diptanu Sarkar APPROVED BY SUPERVISING COMMITTEE: Dr. Marcos Zampieri, Chairperson Dr. Alexander G. Ororbia, Reader Dr. Christopher Homan, Observer Date ii An Empirical Study of Offensive Language in Online Interactions by Diptanu Sarkar Abstract In the past decade, usage of social media platforms has increased significantly. People use these platforms to connect with friends and family, share infor- mation, news and opinions. Platforms such as Facebook, Twitter are often used to propagate offensive and hateful content online. The open nature and anonymity of the internet fuels aggressive and inflamed conversations. The companies and federal institutions are striving to make social media cleaner, welcoming and unbiased.
    [Show full text]
  • A Measurement Study of Hate Speech in Social Media
    A Measurement Study of Hate Speech in Social Media Mainack Mondal Leandro Araújo Silva Fabrício Benevenuto MPI-SWS Universidade Federal de Minas Gerais Universidade Federal de Minas Gerais Germany Belo Horizonte, Brazil Belo Horizonte, Brazil [email protected] [email protected] [email protected] ABSTRACT 1 INTRODUCTION Social media platforms provide an inexpensive communication Online social media sites today allow users to freely communicate at medium that allows anyone to quickly reach millions of users. nearly marginal costs. Increasingly users leverage these platforms Consequently, in these platforms anyone can publish content and not only to interact with each other, but also to share news. While anyone interested in the content can obtain it, representing a trans- the open platform provided by these systems allow users to express formative revolution in our society. However, this same potential of themselves, there is also a dark side of these systems. Particularly, social media systems brings together an important challenge—these these social media sites have become a fertile ground for inflamed systems provide space for discourses that are harmful to certain discussions, that usually polarize ‘us’ against ‘them’, resulting in groups of people. This challenge manifests itself with a number of many cases of insulting and offensive language usage. variations, including bullying, offensive content, and hate speech. Another important aspect that favors such behavior is the level Specifically, authorities of many countries today are rapidly rec- of anonymity that some social media platforms grant to users. As ognizing hate speech as a serious problem, specially because it is example, “Secret” was created, in part, to promote free and anony- hard to create barriers on the Internet to prevent the dissemination mous speech but became a mean for people to defame others while of hate across countries or minorities.
    [Show full text]