
Analyzing Political Parody in Social Media

Antonis Maronikolakis*† (Center for Information and Language Processing, LMU Munich, Germany)
Danae Sánchez Villegas* (Computer Science Department, University of Sheffield, UK)
Daniel Preoţiuc-Pietro (Bloomberg)
Nikolaos Aletras (Computer Science Department, University of Sheffield, UK)
[email protected], {dsanchezvillegas1, n.aletras}@sheffield.ac.uk, [email protected]

* Equal contribution. † Work was done while at the University of Sheffield.

Abstract

Parody is a figurative device used to imitate an entity for comedic or critical purposes, and it represents a widespread phenomenon in social media through many popular parody accounts. In this paper, we present the first computational study of parody. We introduce a new publicly available data set of tweets from real politicians and their corresponding parody accounts. We run a battery of supervised machine learning models for automatically detecting parody tweets, with an emphasis on robustness by testing on tweets from accounts unseen in training, across different genders and across countries. Our results show that political parody tweets can be predicted with an accuracy of up to 90%. Finally, we identify the markers of parody through a linguistic analysis. Beyond research in linguistics and political communication, accurately and automatically detecting parody is important for improving fact checking for journalists and for analytics such as sentiment analysis, by filtering out parodical utterances. (Data is available here: https://archive.org/details/parody_data_acl20)

1 Introduction

Parody is a figurative device which is used to imitate and ridicule a particular target (Rose, 1993) and has been studied in linguistics as a figurative trope distinct from irony and satire (Kreuz and Roberts, 1993; Rossen-Knill and Henry, 1997). Traditional forms of parody include editorial cartoons, sketches or articles pretending to have been authored by the parodied person. (The 'Kapou Opa' column by K. Maniatis, parodying Greek popular persons, was a source of inspiration for this work: https://www.oneman.gr/originals/to-imerologio-karantinas-tou-dimitri-koutsoumpa/) A new form of parody recently emerged in social media, and on Twitter in particular, through accounts that impersonate public figures. Highfield (2016) defines parody accounts as acting as 'a known, real person, for obviously comedic purposes': there should be no risk of mistaking their tweets for their subject's actual views, as these accounts play with stereotypes of these figures or juxtapose their public image with a very different, behind-closed-doors persona.

A very popular type of parody is political parody, which plays an important role in public speech by offering irreverent interpretations of political personas (Hariman, 2008). Table 1 shows examples of very popular (over 50k followers) and active (thousands of tweets sent) political parody accounts on Twitter. The sample tweets show how the style and topic of parody tweets are similar to those from the real accounts, which may pose issues to automatic classification.

While closely related figurative devices such as irony and sarcasm have been extensively studied in computational linguistics (Wallace, 2015; Joshi et al., 2017), parody is yet to be explored using computational methods. In this paper, we aim to bridge this gap and conduct, for the first time, a systematic study of political parody as a figurative device in social media. To this end, we make the following contributions:

1. A classification task where we seek to automatically classify real and parody tweets. For this task, we create a new large-scale publicly available data set containing a total of 131,666 English tweets from 184 parody accounts and corresponding real accounts of politicians from the US, UK and other countries (Section 3);

2. Experiments with feature- and neural-based machine learning models for parody detection, which achieve high predictive accuracy of up to 89.7% F1. These are focused on the robustness of classification, with test data from: a) users; b) genders; c) locations; unseen in training (Section 5);

3. Linguistic analysis of the markers of parody tweets and of the model errors (Section 6).

We argue that understanding the expression and use of parody in natural language and automatically identifying it are important to applications in computational social science and beyond. Parody tweets can often be misinterpreted as facts, even though Twitter only allows parody accounts if they are explicitly marked as parody (both the profile description and account name need to mention this: https://help.twitter.com/en/rules-and-policies/parody-account-policy) and the poster does not have the intention to mislead. For example, the Speaker of the US House of Representatives, Nancy Pelosi, falsely cited a Michael Flynn parody tweet (https://tinyurl.com/ybbrh74g), and many users were fooled by a Donald Trump parody tweet about 'Dow Joans' (https://tinyurl.com/s34dwgm). Thus, accurate parody classification methods can be useful in downstream NLP applications such as automatic fact checking (Vlachos and Riedel, 2014) and rumour verification (Karmakharm et al., 2019), sentiment analysis (Pang et al., 2008) or nowcasting voting intention (Tumasjan et al., 2010; Lampos et al., 2013; Tsakalidis et al., 2018).

Beyond NLP, parody detection can be used in: (i) political communication, to study and understand the effects of political parody on public speech at a large scale (Hariman, 2008; Highfield, 2016); (ii) linguistics, to identify characteristics of figurative language (Rose, 1993; Kreuz and Roberts, 1993; Rossen-Knill and Henry, 1997); (iii) network science, to identify the adoption and diffusion mechanisms of parody (Vosoughi et al., 2018).

2 Related Work

Parody in Linguistics: Parody is an artistic form and literary genre that dates back to Aristophanes in ancient Greece, who parodied argumentation styles in Frogs. Verbal parody was studied in linguistics as a figurative trope distinct from irony and satire (Kreuz and Roberts, 1993; Rossen-Knill and Henry, 1997), and researchers have long debated its definition and its theoretic distinctions from other types of humor (Grice et al., 1975; Sperber, 1984; Wilson, 2006; Dynel, 2014). In general, verbal parody involves a highly situated, intentional, and conventional speech act (Rossen-Knill and Henry, 1997) composed of both a negative evaluation and a form of pretense or echoic mention (Sperber, 1984; Wilson, 2006; Dynel, 2014), through which an entity is mimicked or imitated with the goal of criticizing it to a comedic effect. Thus, imitative composition for amusing purposes is an inherent characteristic of parody (Franke, 1971). The parodist intentionally re-presents the object of the parody and flaunts this re-presentation (Rossen-Knill and Henry, 1997).

Parody on Social Media: Parody is considered an integral part of Twitter (Vis, 2013), and previous studies of parody in social media focused on analysing how these accounts contribute to topical discussions (Highfield, 2016) and on the relationship between identity, impersonation and authenticity (Page, 2014). Public relations studies showed that parody accounts impact organisations during crises and can become a threat to their reputation (Wan et al., 2015).

Satire: Most related to parody, satire has been tangentially studied in NLP as one of several prediction targets in the context of identifying disinformation (McHardy et al., 2019; de Morais et al., 2019). Rashkin et al. (2017) compare the language of real news with that of satire, hoaxes, and propaganda to identify linguistic features of unreliable text. They demonstrate how stylistic characteristics can help to decide the text's veracity. The study of parody is therefore relevant to this topic, as satire and parodies are classified by some as a type of disinformation with 'no intention to cause harm but has potential to fool' (Wardle and Derakhshan, 2018).

Irony and Sarcasm: There is a rich body of work in NLP on identifying irony and sarcasm as a classification task (Wallace, 2015; Joshi et al., 2017). Van Hee et al. (2018) organized two open shared tasks: the first aims to automatically classify tweets as ironic or not, and the second is on identifying the type of irony expressed in tweets. However, the definition of irony is usually 'a trope whose actual meaning differs from what is literally enunciated' (Van Hee et al., 2018), following the Gricean belief that the hallmark of irony is to communicate the opposite of the literal meaning (Wilson, 2006), violating the first maxim of Quality (Grice et al., 1975). In this sense, irony is treated in NLP in a similar way as sarcasm (González-Ibáñez et al., 2011; Khattri et al., 2015; Joshi et al., 2017). In addition to the words in the utterance, using the user and pragmatic context is known to be informative for irony or sarcasm detection (Bamman and Smith, 2015; Wallace, 2015). For instance, Oprea and Magdy (2019) make use of user embeddings for textual sarcasm detection. In the design of our data splits, we aim to limit the contribution of these aspects to the results.

Account type | Twitter handle    | Sample tweet
Real         | @realDonaldTrump  | "The Republican Party, and me, had a GREAT day yesterday with respect to the phony Impeachment Hoax, & yet, when I got home to the White House & checked out the news coverage on much of television, you would have no idea they were reporting on the same event. FAKE & CORRUPT NEWS!"
Parody       | @realDonaldTrFan  | "Lies! Kampala Harris says my crimes are committed in plane site! Shes lying! My crimes are ALWAYS hidden! ALWAYS!!"
Real         | @BorisJohnson     | "Our NHS will never be on the table for any trade negotiations. We're investing more than ever before - and when we leave the EU, we will introduce an Australian style, points-based immigration system so the NHS can plan for the future."
Parody       | @BorisJohnson_MP  | "People seem to be ignoring the many advantages of selling off the NHS, like the fact that hospitals will be far more spacious once poor people can't afford to use them."

Table 1: Two examples of Twitter accounts of politicians and their corresponding parody account, with a sample tweet from each.

Relation to other NLP Tasks: The pretense aspect of parody relates our task to a few other NLP tasks. In authorship attribution, the goal is to predict the author of a given text (Stamatatos, 2009; Juola et al., 2008; Koppel et al., 2009). However, there is no intent for the authors to imitate the style of others, and most differences between authors are in the topics they write about, which we aim to limit by focusing on political parody. Further, in our setups, no tweets from an author are in both training and testing, to limit the impact of terms specific to a particular person. Pastiche detection (Dinu et al., 2012) aims to distinguish between an original text and a text written by someone aiming to imitate the style of the original author with the goal of impersonating them. Most similar in experimental setup to our task, Preoţiuc-Pietro and Devlin Marier (2019) aim to distinguish between tweets published from the same account by different types of users: politicians or their staff. While both politicians and staff writers aim to present similar content with similar style to the original authors, the texts lack the humorous component specific to parodies.

A large body of related NLP work has explored the inference of user characteristics. Past research studied predicting the type of a Twitter account, most frequently between individual or organizational, using linguistic features (De Choudhury et al., 2012; McCorriston et al., 2015; Mac Kim et al., 2017). A broad strand of research has been devoted to predicting personal traits from language use on Twitter, such as gender (Burger et al., 2011), age (Nguyen et al., 2011), geolocation (Cheng et al., 2010), political preference (Volkova et al., 2014; Preoţiuc-Pietro et al., 2017), income (Preoţiuc-Pietro et al., 2015; Aletras and Chamberlain, 2018), impact (Lampos et al., 2014), socio-economic status (Lampos et al., 2016), race (Preoţiuc-Pietro and Ungar, 2018) or personality (Schwartz et al., 2013; Preoţiuc-Pietro et al., 2016).

3 Task & Data

We define parody detection in social media as a binary classification task performed at the social media post level. Given a post T, defined as a sequence of tokens T = {t1, ..., tn}, the aim is to label T either as parody or genuine. Note that one could use social network information, but this is out of the paper's scope, as we only focus on parody as a linguistic device.

We create a new publicly available data set to study this task, as no other data set is available. We perform our analysis on a set of users from the same domain (politics) to limit variations caused by topic. We first identify real and parody accounts of politicians on Twitter posting in English from the United States of America (US) and the United Kingdom (UK), as well as other accounts posting in English from the rest of the world.
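To make the setup concrete, the following is a minimal sketch of how one labeled example in such a data set could be represented; the field names are illustrative assumptions, not the schema of the released data:

```python
from dataclasses import dataclass

@dataclass
class TweetExample:
    """One post in the parody-detection task (illustrative schema)."""
    tokens: list[str]      # the token sequence T = {t1, ..., tn}
    label: str             # "parody" or "genuine"
    account_pair_id: int   # real-parody pair; kept within a single split
    gender: str            # used only to build the gender splits
    country: str           # "US", "UK" or "RoW"; used for location splits

def is_parody(example: TweetExample) -> bool:
    # The classifier sees only the text; pair id, gender and country are
    # used to construct data splits, never as model input.
    return example.label == "parody"
```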

We opted to use Twitter because it is arguably the most popular platform for politicians to interact with the public or with other politicians (Parmelee and Bichard, 2011). For example, 67% of prospective parliamentary candidates for the 2019 UK general election had an active Twitter account (https://www.mpsontwitter.co.uk/). Twitter also allows users to maintain parody accounts, subject to adding explicit markers in both the user bio and handle, such as parody or fake (https://help.twitter.com/en/rules-and-policies/parody-account-policy). Finally, we label the tweets as parody or real, depending on the type of account they were posted from. We highlight that we are not using the user description or handle names in prediction, as this would make the task trivial.

3.1 Collecting Real and Parody Politician Accounts

We first query the public Twitter API using the following terms: {parody, #parody, parody account, fake, #fake, fake account, not real} to retrieve candidate parody accounts according to Twitter's policy. From that set, we exclude any accounts matching fan or commentary in their bio or account name, since these are likely not to be posting parodical content. We also exclude private and deactivated accounts, as well as accounts with a majority of non-English tweets.

After collecting this initial set of parody candidates, the authors of the paper manually inspected up to the first ten original tweets from each candidate to identify whether an account is a parody or not, following the definition of a public figure parody account from Highfield (2016) (see Section 1), further filtering out non-parody accounts. We keep a single parody account in case of multiple parody accounts about the same person. Finally, for each remaining account, the authors manually identified the corresponding real politician account to collect pairs of real and parody accounts. Following the process above, we were able to identify parody accounts of 103 unique people, with 81 having a corresponding real account. The authors also identified the binary gender and location (country) of the accounts using publicly available records. This resulted in 21.6% female accounts (women parliamentarian percentages as of 2017: 19% US, 30% UK, 28.8% OECD average; https://data.oecd.org/inequality/women-in-politics.htm). The majority of the politicians are located in the US (44.5%), followed by the UK (26.7%), while 28.8% are from the rest of the world (e.g. Germany, Canada, India, Russia).

3.2 Collecting Real and Parody Tweets

We collect all of the available original tweets, excluding retweets and quoted tweets, from all the parody and real politician accounts (up to a maximum of 3,200 tweets per account, according to Twitter API restrictions). We further balance the number of tweets in a real-parody account pair in order for our experiments and linguistic analysis not to be driven by a few prolific users or by imbalances in the tweet ratio for a specific pair. We keep a ratio of at most ±20% between the real and parody tweets per pair by keeping all tweets from the less prolific account and randomly down-sampling from the more prolific one. Subsequently, for the parody accounts with no corresponding real account, we sample a number of tweets equal to the median number of tweets for the real accounts. Finally, we label tweets as parody or real, depending on the type of account they come from. In total, the data set contains 131,666 tweets, with 65,710 real and 65,956 parody.
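A sketch of the per-pair balancing step described above; the ±20% bound and the keep-all/down-sample rule follow the text, while the data structures are assumed for illustration:

```python
import random

def balance_pair(real: list[str], parody: list[str], max_ratio: float = 1.2):
    """Keep all tweets of the less prolific account and randomly
    down-sample the other so the pair stays within a +/-20% ratio."""
    smaller, larger = sorted([real, parody], key=len)
    cap = int(len(smaller) * max_ratio)
    if len(larger) > cap:
        larger = random.sample(larger, cap)
    # Return in the original (real, parody) order.
    return (smaller, larger) if len(real) <= len(parody) else (larger, smaller)
```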
3.3 Data Splits

To test that automatically predicting political parody is robust and generalizes to held-out situations not included in the training data, we create the following three data splits for running experiments:

Person Split: We first split the data by adding all tweets from each real-parody account pair to a single split, either train, development or test. To obtain a fairly balanced data set, without pairs of accounts with a large number of tweets dominating any split, we compute the mean between real and parody tweets for each account and stratify the pairs, distributing these means proportionally across the train, development and test sets (see Table 2).

         Train     Dev      Test     Total     Avg. tokens (Train)
Real     51,460    6,164    8,086    65,710    23.33
Parody   51,706    6,164    8,086    65,956    20.15
All      103,166   12,328   16,172   131,666   22.55

Table 2: Data set statistics with the person split.
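The paper does not publish its splitting code; one plausible implementation of a leakage-free person split, greedily stratifying whole pairs by their mean tweet count, is sketched below:

```python
def person_split(pairs, ratios=(0.8, 0.1, 0.1)):
    """Assign whole real-parody pairs to train/dev/test so no account
    appears in two splits; `pairs` maps pair_id -> (n_real, n_parody)."""
    # Place large pairs first so they spread evenly across the splits.
    order = sorted(pairs, key=lambda p: sum(pairs[p]) / 2, reverse=True)
    totals = [0.0, 0.0, 0.0]
    splits = {0: [], 1: [], 2: []}
    for pid in order:
        # Put the pair where a split is furthest below its target share.
        grand = max(sum(totals), 1)
        deficits = [ratios[i] - totals[i] / grand for i in range(3)]
        i = max(range(3), key=lambda k: deficits[k])
        splits[i].append(pid)
        totals[i] += sum(pairs[pid]) / 2
    return splits  # {0: train pair ids, 1: dev, 2: test}
```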

Gender Split: We also split the data by the gender of the politicians into training, development and test, obtaining two versions of the data: (i) one with female accounts in train/dev and male accounts in test; and (ii) one with male accounts in train/dev and female accounts in test (see Table 3).

Trained on                Real     Parody   Total
Female   Train            10,081   11,036   21,117
         Dev              302      230      532
         Test (Male)      55,327   54,690   110,017
Male     Train            51,048   50,184   101,232
         Dev              4,279    4,506    8,785
         Test (Female)    10,383   11,266   21,649

Table 3: Data set statistics with the gender split (Male, Female).

Location Split: Finally, we split the data based on the location of the politicians. We group the accounts into three groups of locations: US, UK and the rest of the world (RoW). We obtain three different splits, where each group makes up the test set and the other two groups make up the train and development sets (see Table 4).

Trained on              Real     Parody   Total
US & RoW   Train        47,018   45,005   92,023
           Dev          1,030    2,190    3,220
           Test (UK)    17,662   18,761   36,423
UK & RoW   Train        33,687   35,371   69,058
           Dev          1,030    1,274    2,304
           Test (US)    30,993   29,311   60,304
US & UK    Train        43,211   42,597   85,808
           Dev          5,444    5,475    10,919
           Test (RoW)   17,055   17,884   34,939

Table 4: Data set statistics with the location split (US, UK, Rest of the World - RoW).

3.4 Text Preprocessing

We preprocess the text by lower-casing, replacing all URLs and anonymizing all mentions of usernames with placeholder tokens. We preserve emoticons and punctuation marks, and we replace tokens that appear in fewer than five tweets with a special 'unknown' token. We tokenize text using DLATK (Schwartz et al., 2017), a Twitter-aware tokenizer.
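A minimal sketch of this preprocessing pipeline; simple regular expressions stand in for the DLATK tokenizer the paper relies on, and the placeholder strings are assumptions:

```python
import re
from collections import Counter

URL_RE = re.compile(r"https?://\S+")
USER_RE = re.compile(r"@\w+")

def normalize(tweet: str) -> list[str]:
    """Lower-case and mask URLs/user mentions with placeholder tokens."""
    tweet = URL_RE.sub("<url>", tweet.lower())
    tweet = USER_RE.sub("<user>", tweet)
    return tweet.split()  # stand-in for the Twitter-aware DLATK tokenizer

def replace_rare(tweets: list[list[str]], min_tweets: int = 5):
    """Map tokens that appear in fewer than five tweets to an unknown token."""
    doc_freq = Counter(tok for toks in tweets for tok in set(toks))
    return [[t if doc_freq[t] >= min_tweets else "<unk>" for t in toks]
            for toks in tweets]
```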
4 Predictive Models

We experiment with a series of approaches to the classification of parody tweets, ranging from linear models to neural network architectures and pre-trained contextual embedding models. Hyperparameter selection is included in the Appendix.

4.1 Linear Baselines

LR-BOW: As a first baseline, we use logistic regression with a standard bag-of-words (LR-BOW) representation of the tweets.

LR-BOW+POS: We extend LR-BOW using syntactic information from Part-Of-Speech (POS) tags. We first tag all tweets in our data using the NLTK tagger, and then extract bag-of-words features where each unigram consists of a token with its associated POS tag.

4.2 BiLSTM-Att

The first neural model is a bidirectional Long Short-Term Memory (LSTM) network (Hochreiter and Schmidhuber, 1997) with a self-attention mechanism (BiLSTM-Att; Zhou et al., 2016). Tokens ti in a given tweet T = {t1, ..., tn} are mapped to embeddings and passed through a bidirectional LSTM. A single tweet representation h is computed as the attention-weighted sum of the resulting contextualized vector representations, h = Σi ai hi, where ai is the self-attention score at timestep i. The tweet representation h is subsequently passed to the output layer, using a sigmoid activation function.

4.3 ULMFit

Universal Language Model Fine-tuning (ULMFit) is a method for efficient transfer learning (Howard and Ruder, 2018). The key intuition is to train a text encoder on a language modelling task (i.e. predicting the next token in a sequence), where data is abundant, and then fine-tune it on a target task where data is more limited. During fine-tuning, ULMFit uses gradual layer unfreezing to avoid catastrophic forgetting. We use AWD-LSTM (Merity et al., 2018) as the base text encoder, pre-trained on the Wikitext-103 data set, and fine-tune it on our parody classification task. For this purpose, after the AWD-LSTM layers, we add a fully-connected layer with a ReLU activation function, followed by an output layer with a sigmoid activation function. Before each of these two additional layers, we perform batch normalization.
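As an illustration of the LR-BOW baseline in Section 4.1, a minimal scikit-learn sketch (the exact feature settings are given in Section 4.6; this is not the authors' code):

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# TF-IDF-weighted unigrams and bigrams fed into an L2-regularized
# logistic regression, mirroring the LR-BOW configuration in the paper.
lr_bow = make_pipeline(
    TfidfVectorizer(ngram_range=(1, 2)),
    LogisticRegression(penalty="l2", max_iter=1000),
)
# lr_bow.fit(train_texts, train_labels); lr_bow.predict(test_texts)
```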

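The attention pooling of Section 4.2 (h = Σi ai hi) could look as follows in PyTorch; this is a sketch, with dimensions matching the hyperparameters reported in Section 4.6:

```python
import torch
import torch.nn as nn

class BiLSTMAtt(nn.Module):
    """Bidirectional LSTM with self-attention pooling (sketch)."""
    def __init__(self, vocab_size, emb_dim=200, hidden=300):
        super().__init__()
        self.emb = nn.Embedding(vocab_size, emb_dim)
        self.lstm = nn.LSTM(emb_dim, hidden, bidirectional=True,
                            batch_first=True)
        self.att = nn.Linear(2 * hidden, 1)  # one score per timestep
        self.out = nn.Linear(2 * hidden, 1)  # binary output (sigmoid below)

    def forward(self, token_ids):
        h, _ = self.lstm(self.emb(token_ids))    # (batch, seq, 2*hidden)
        a = torch.softmax(self.att(h), dim=1)    # attention scores a_i
        pooled = (a * h).sum(dim=1)              # h = sum_i a_i * h_i
        return torch.sigmoid(self.out(pooled)).squeeze(-1)
```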
4.4 BERT and RoBERTa

Bidirectional Encoder Representations from Transformers (BERT) is a language model based on transformer networks (Vaswani et al., 2017), pre-trained on large corpora (Devlin et al., 2019). The model makes use of multiple multi-head attention layers to learn bidirectional embeddings for input tokens. It is trained for masked language modelling, where a fraction of the input tokens in a given sequence are masked and the task is to predict a masked word given its context. BERT uses wordpieces, which are passed through an embedding layer and summed together with positional and segment embeddings. The former introduce positional information to the attention layers, while the latter contain information about the location of a segment. Similar to ULMFit, we fine-tune the BERT-base model for predicting parody tweets by adding an output dense layer for binary classification, fed with the 'classification' token.

We further experiment with RoBERTa (Liu et al., 2019), which is an extension of BERT trained on more data and with different hyperparameters. RoBERTa has been shown to improve performance on various benchmarks compared to the original BERT (Liu et al., 2019).

4.5 XLNet

XLNet is another pre-trained neural language model based on transformer networks (Yang et al., 2019). XLNet is similar to BERT in its structure, but is trained on a permuted (instead of masked) language modelling task. During training, sentence words are permuted and the model predicts a word given the shuffled context. We also adapt XLNet for predicting parody, similar to BERT and ULMFit.

4.6 Model Hyperparameters

We optimize all model parameters on the development set for each data split (see Section 3).

Linear models: For LR-BOW, we use n-grams with n = (1, 2), chosen from n ∈ {(1, 1), (1, 2), (1, 3)}, weighted by TF-IDF. For LR-BOW+POS, we use TF with POS n-grams where n = (1, 3). For both baselines we use L2 regularization.

BiLSTM-Att: We use 200-dimensional GloVe embeddings (Pennington et al., 2014) pre-trained on Twitter data. The maximum sequence length is set to 50, covering 95% of the tweets in the training set. The LSTM size is h = 300, with h ∈ {50, 100, 300}, and dropout d = 0.5, with d ∈ {0.2, 0.5}. We use Adam (Kingma and Ba, 2014) with the default learning rate, minimizing the binary cross-entropy with a batch size of 64 over 10 epochs with early stopping.

ULMFit: We first update only the AWD-LSTM weights for one epoch of language modelling, with a learning rate l = 2e-3, where l ∈ {1e-3, 2e-3, 4e-3}. Then, we update both the AWD-LSTM and embedding weights for one more epoch, using a learning rate of l = 2e-5, where l ∈ {1e-4, 2e-5, 5e-5}. The size of the intermediate fully-connected layer (after the AWD-LSTM and before the output) is set by default to 50. In the intermediate and output layers we use default dropouts of 0.08 and 0.1 respectively, following Howard and Ruder (2018).

BERT and RoBERTa: For BERT, we use the base model (12 layers and 110M total parameters) trained on lowercase English. We fine-tune it for 1 epoch with a learning rate l = 5e-5, where l ∈ {2e-5, 3e-5, 5e-5}, as recommended in Devlin et al. (2019), with a batch size of 128. For RoBERTa, we use the same fine-tuning parameters as for BERT.

XLNet: We use the same parameters as BERT, except for the learning rate, which we set to l = 4e-5, where l ∈ {2e-5, 4e-5, 5e-5}.

5 Results

This section contains the experimental results obtained on all three data splits proposed in Section 3. We evaluate our methods (Section 4) using several metrics, including accuracy, precision, recall, macro F1 score and area under the ROC curve (AUC). We run each experiment three times using different random seeds, and we report the average and standard deviation.

5.1 Person Split

Table 5 presents the results for the parody prediction models with the data split by person. We observe that the architectures using pre-trained text encoders (i.e. ULMFit, BERT, RoBERTa and XLNet) outperform both the neural (BiLSTM-Att) and feature-based (LR-BOW and LR-BOW+POS) models by a large margin across metrics, with the transformer architectures (BERT, RoBERTa and XLNet) performing best.
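A hedged sketch of this fine-tuning setup using the Hugging Face transformers library; the paper does not state which implementation it used, so the model name and training loop here are illustrative:

```python
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2)  # classification head over [CLS]

def fine_tune_step(texts, labels, optimizer):
    """One training step: tokenize a batch and minimize cross-entropy."""
    batch = tokenizer(texts, padding=True, truncation=True,
                      max_length=50, return_tensors="pt")
    loss = model(**batch, labels=torch.tensor(labels)).loss
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
    return loss.item()

# optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)
```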
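The evaluation protocol (three seeds, mean and standard deviation over the metrics listed above) could be reproduced along these lines; a sketch using scikit-learn, not the authors' evaluation code:

```python
import numpy as np
from sklearn.metrics import (accuracy_score, f1_score, precision_score,
                             recall_score, roc_auc_score)

def evaluate(y_true, y_pred, y_score):
    """Accuracy, P, R, macro-F1 and ROC AUC for one run."""
    return {
        "Acc": accuracy_score(y_true, y_pred),
        "P": precision_score(y_true, y_pred),
        "R": recall_score(y_true, y_pred),
        "F1": f1_score(y_true, y_pred, average="macro"),
        "AUC": roc_auc_score(y_true, y_score),
    }

def mean_std(runs):
    """Aggregate a list of per-seed metric dicts into (mean, std) pairs."""
    return {k: (np.mean([r[k] for r in runs]), np.std([r[k] for r in runs]))
            for k in runs[0]}
```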

Model        Acc           P             R             F1            AUC
LR-BOW       73.95 ±0.00   70.08 ±0.01   83.53 ±0.02   76.19 ±0.00   73.96 ±0.00
LR-BOW+POS   74.33 ±0.00   71.34 ±0.00   81.19 ±0.00   75.95 ±0.00   74.34 ±0.00
BiLSTM-Att   79.92 ±0.01   81.63 ±0.01   77.11 ±0.03   79.29 ±0.02   79.91 ±0.01
ULMFit       81.11 ±0.38   75.57 ±2.03   84.97 ±0.87   81.05 ±0.42   81.10 ±0.38
BERT         87.65 ±0.29   87.63 ±0.58   87.67 ±0.40   87.65 ±0.18   87.65 ±0.32
RoBERTa      90.01 ±0.35   90.90 ±0.55   88.45 ±0.22   89.66 ±0.33   90.05 ±0.29
XLNet        86.45 ±0.41   88.24 ±0.52   85.18 ±0.40   86.68 ±0.37   86.45 ±0.36

Table 5: Accuracy (Acc), Precision (P), Recall (R), F1-Score (F1) and ROC-AUC for parody prediction splitting by person (± std. dev.). Best results are in bold.

The highest scoring model, RoBERTa, classifies tweets (parody or real) with an accuracy of 90%, which is more than 8% higher than the best non-transformer model (ULMFit). RoBERTa also outperforms the logistic regression baselines (LR-BOW and LR-BOW+POS) by more than 16 points in accuracy and 13 points in F1 score. Furthermore, it is the only model to score higher than 90 on precision.

5.2 Gender Split

Table 6 shows the F1-scores obtained when training on the gender splits, i.e. training on male and testing on female accounts, and vice versa. We first observe that models trained on the male set are in general more accurate than models trained on the female set, with the sole exception of ULMFit. This is probably due to the fact that the data set is imbalanced towards men, as shown in Table 3 (see also Section 3). We also do not observe a dramatic performance drop compared to the mixed-gender model on the person split (see Table 5). Again, RoBERTa is the most accurate model on both splits, obtaining an F1-score of 87.11 and 84.87 for the male and female training data respectively. The transformer-based architectures are again the best performing models overall, but the difference between them and the feature-based methods is smaller than it was on the person split.

Model        M→F     F→M
LR-BOW       78.89   76.63
LR-BOW+POS   78.74   76.74
BiLSTM-Att   77.00   77.11
ULMFit       81.20   82.53
BERT         85.85   84.40
RoBERTa      87.11   84.87
XLNet        85.69   84.16

Table 6: F1-scores for parody prediction splitting by gender (Male-M, Female-F). Best results are in bold.

5.3 Location Split

Table 7 shows the F1-scores obtained when training our models on the location splits: (i) train/dev on UK and RoW, test on US; (ii) train/dev on US and RoW, test on UK; and (iii) train/dev on US and UK, test on RoW. In general, the best results are obtained by training on the US & UK split, while the results of the models trained on the RoW & US and RoW & UK splits are similar. The model with the best performance when trained on the US & UK and RoW & UK splits is RoBERTa, with F1 scores of 87.70 and 85.99 respectively. XLNet performs slightly better than RoBERTa when trained on the RoW & US data split.

Model        US & UK → RoW   RoW & US → UK   RoW & UK → US
LR-BOW       78.58           78.27           77.97
LR-BOW+POS   78.27           77.88           78.08
BiLSTM-Att   80.29           77.59           73.19
ULMFit       83.47           81.55           81.55
BERT         86.69           83.78           83.12
RoBERTa      87.70           85.10           85.99
XLNet        85.32           85.17           85.32

Table 7: F1-scores for parody prediction splitting by location (train → test). Best results are in bold.

5.4 Discussion

Through experiments over three different data splits, we show that all models predict parody tweets consistently above random, even when tested on people unseen in training.

In general, we observe that the models based on pre-trained contextual embeddings perform best, with an average of around 10 F1 better than the linear methods. Among these, we find that RoBERTa outperforms the other methods by a small but consistent margin, similar to past research (Liu et al., 2019). Further, we see that the predictions are robust to location- or gender-specific differences, as the performance on held-out locations and genders is close to that of the person split, with a maximum drop of less than 5 F1, also impacted by training on less data (e.g. female users). This highlights that our models capture information beyond topics or features specific to any person, gender or location, and can potentially identify stylistic differences between parody and real tweets.

6 Analysis

We finally perform an analysis based on our novel data set to uncover the peculiarities of political parody and to understand the limits of the predictive models.

6.1 Linguistic Feature Analysis

We first analyse the linguistic features specific to real and parody tweets. For this purpose, we use the method introduced in Schwartz et al. (2013), which has been used in several other analyses of user traits (Preoţiuc-Pietro et al., 2017) and speech acts (Preoţiuc-Pietro et al., 2019). We rank the feature sets described in Section 4 using univariate Pearson correlation (note that for the analysis we use POS tags instead of POS n-grams). Features are normalized to sum up to one for each tweet. Then, for each feature, we compute correlations independently between its distribution across posts and the label of the post (parody or not).

Real                       Parody
Feature       r            Feature        r
-- Unigrams --
our           0.140        i              0.181
in            0.131        ?              0.156
and           0.129        [?]            0.145
:             0.118        me             0.136
&             0.114        not            0.106
today         0.105        like           0.097
to            0.105        my             0.095
of            0.098        dude           0.094
the           0.091        don't          0.090
at            0.087        i'm            0.087
lhl           0.086        just           0.083
great         0.085        know           0.081
with          0.084        #feeltheburp   0.078
de            0.079        you            0.076
meeting       0.078        #callmedick    0.075
for           0.077        #imwithme      0.073
across        0.073        "              0.073
families      0.073        #visionzero    0.069
on            0.070        if             0.069
country       0.067        have           0.067
-- POS (Unigrams and Bigrams) --
NN IN         0.1600       RB             0.1749
IN            0.1507       PRP            0.1546
CC            0.1309       RB VB          0.1271
IN JJ         0.1210       VBP            0.1206
NNS IN        0.1165       VBP RB         0.1123
NN CC         0.1114       .              0.1114
IN NN         0.1048       NNP NNP        0.1094
NN TO         0.1030       NN NNP         0.1057
NNS TO        0.1013       WRB            0.0925
TO            0.1001       VBP PRP        0.0904
CC JJ         0.0972       IN PRP         0.0890
IN DT         0.0941       NN VBP         0.0863
: JJ          0.0875       RB .           0.0854
NNS           0.0855       NNP            0.0837
: NN          0.0827       JJ VBP         0.0813

Table 8: Feature correlations with parody and real tweets, sorted by Pearson correlation (r). All correlations are significant at p < .01, two-tailed t-test.

Table 8 presents the top unigram and part-of-speech features correlated with real and parody tweets. We first note that the top features related to either parody or genuine tweets are function words or related to style, as opposed to topic. This reinforces that the make-up of the data set and its categories are not driven by topic choice, and that parody detection is mostly a stylistic distinction. The only exceptions are a few hashtags related to parody accounts (e.g. #imwithme); on closer inspection, all of these are related to tweets from a single parody account and are thus not useful for prediction in any setup, as tweets containing them will only appear in either the train or the test set.
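A sketch of this univariate correlation analysis (unit-normalized feature distributions per tweet, Pearson r against the binary label), assuming a document-term count matrix as input:

```python
import numpy as np
from scipy.stats import pearsonr

def feature_correlations(counts: np.ndarray, labels: np.ndarray, vocab):
    """Correlate each feature's per-tweet relative frequency with the
    parody label (1) vs real (0); returns (feature, r, p) tuples."""
    # Normalize each tweet's feature counts to sum to one.
    freqs = counts / np.maximum(counts.sum(axis=1, keepdims=True), 1)
    results = []
    for j, feat in enumerate(vocab):
        r, p = pearsonr(freqs[:, j], labels)
        results.append((feat, r, p))
    return sorted(results, key=lambda t: t[1], reverse=True)
```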

The top features for each category are pronouns ('our' for genuine tweets, 'i' for parody tweets). In general, we observe that parody tweets are much more personal and include possessives ('me', 'my', 'i', "i'm", PRP) or second person pronouns ('you'). This indicates that parodies are more personal and direct, which is also supported by the use of more @-mentions and question marks. The real politician tweets are more impersonal, and the use of 'our' indicates a desire to include the reader in the conversation.

The real politician tweets also include more stopwords (e.g. prepositions, conjunctions, determiners), which indicates that these tweets are more well formed. Conversely, the parody tweets include more contractions ("don't", "i'm"), hinting at a less formal style ('dude'). Politicians frequently use their accounts to promote events they participate in or that are relevant to their day-to-day schedule, as hinted by several prepositions ('at', 'on') and words ('meeting', 'today') (Preoţiuc-Pietro and Devlin Marier, 2019). For example, this is a tweet of the U.S. Senator from Connecticut, Chris Murphy:

"Rudy Giuliani is in Ukraine today, meeting with Ukranian leaders on behalf of the President of the United States, representing the President's re-election campaign. [...]"

Through part-of-speech patterns, we observe that parody accounts are more likely to use verbs in the present singular (VBZ, VBP). This hints that parody tweets explicitly try to mimic direct quotes from the parodied politician, in first person and using present tense verbs, while actual politician tweets are more impersonal. Adverbs (RB) are used predominantly in parodies, and a common sequence in parody tweets is an adverb followed by a verb (RB VB), which can be used to emphasize actions or relevant events. For example, the following is a tweet of a parody account (@Queen_Europe) of Angela Merkel:

"I mean, the Brexit Express literally appears to be going backwards but OK"

6.2 Error Analysis

Finally, we perform an error analysis to examine the behavior of our best performing model (RoBERTa) and to identify potential limitations of current approaches. The first example is a tweet by the former US president Barack Obama which was classified as parody while it is in fact a real tweet:

"Summer's almost over, Senate Leaders. #doyourjob"

Similarly, the next tweet was posted by the real account of the Virginia governor, Ralph Northam:

"At this point, the list of Virginians Ed Gillespie *hasn't* sold out is shorter than the folks he has."

Both of the tweets above contain humoristic elements and come off as confrontational, aimed at someone else, which is more prevalent in parody. We hypothesize that the model picked up on this information to classify these tweets as parody. From the previous analyses, we noticed that tweets by real politicians often convey information in a more neutral or impersonal way. On the other hand, the following tweet was posted by a Mitt Romney parody account and was classified as real:

"It's up to you, America: do you want a repeat of the last four years, or four years staggeringly worse than the last four years?"

This parody tweet, even though it is more opinionated, is closer in style to a slogan or campaign speech and is therefore misclassified. Lastly, the following is a tweet from former President Obama that was misclassified as parody:

"It's the #GimmeFive challenge, presidential style."

The reason is that there are politicians, such as Barack Obama, who often write in an informal manner, and this may cause the models to misclassify this kind of tweet.

7 Conclusion

We presented the first study of parody, a linguistic phenomenon related to but distinct from irony and sarcasm, using methods from computational linguistics and machine learning. Focusing on political parody in social media, we introduced a freely available large-scale data set containing a total of 131,666 English tweets from 184 real and corresponding parody accounts. We defined parody prediction as a new binary classification task at the tweet level and evaluated a battery of feature-based and neural models, achieving high predictive accuracy of up to 89.7% F1 on tweets from people unseen in training.

In the future, we plan to study in more depth the stylistic and figurative devices used for parody, to extend the data set beyond the political case study, and to explore human behavior regarding parody, including how it is detected and diffused through social media.

Acknowledgments

We thank Bekah Hampson for providing early input and helping with the data annotation. NA is supported by ESRC grant ES/T012714/1 and an Amazon AWS Cloud Credits for Research Award.

References

Nikolaos Aletras and Benjamin Paul Chamberlain. 2018. Predicting Twitter user socioeconomic attributes with network and language information. In Proceedings of the 29th ACM Conference on Hypertext and Social Media, pages 20–24.
David Bamman and Noah A. Smith. 2015. Contextualized sarcasm detection on Twitter. In Ninth International AAAI Conference on Web and Social Media, ICWSM, pages 574–577.
John D. Burger, John Henderson, George Kim, and Guido Zarrella. 2011. Discriminating gender on Twitter. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, pages 1301–1309.
Zhiyuan Cheng, James Caverlee, and Kyumin Lee. 2010. You are where you tweet: A content-based approach to geo-locating Twitter users. In Proceedings of the 19th ACM International Conference on Information and Knowledge Management, pages 759–768.
Munmun De Choudhury, Nicholas Diakopoulos, and Mor Naaman. 2012. Unfolding the event landscape on Twitter: Classification and exploration of user categories. In Proceedings of the ACM 2012 Conference on Computer Supported Cooperative Work, CSCW, pages 241–244.
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 4171–4186.
Liviu P. Dinu, Vlad Niculae, and Maria-Octavia Sulea. 2012. Pastiche detection based on stopword rankings. Exposing impersonators of a Romanian writer. In Proceedings of the Workshop on Computational Approaches to Deception Detection, pages 72–77.
Marta Dynel. 2014. Isn't it ironic? Defining the scope of humorous irony. Humor, 27(4):619–639.
Herbert Franke. 1971. A note on parody in Chinese traditional literature. Oriens Extremus, 18(2):237–251.
Roberto González-Ibáñez, Smaranda Muresan, and Nina Wacholder. 2011. Identifying sarcasm in Twitter: A closer look. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pages 581–586.
H. Paul Grice, Peter Cole, Jerry Morgan, et al. 1975. Logic and conversation. 1975, pages 41–58.
Robert Hariman. 2008. Political parody and public culture. Quarterly Journal of Speech, 94(3):247–272.
Tim Highfield. 2016. News via Voldemort: Parody accounts in topical discussions on Twitter. New Media & Society, 18(9):2028–2045.
Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural Computation, 9(8):1735–1780.
Jeremy Howard and Sebastian Ruder. 2018. Universal language model fine-tuning for text classification. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 328–339.
Aditya Joshi, Pushpak Bhattacharyya, and Mark J. Carman. 2017. Automatic sarcasm detection: A survey. ACM Computing Surveys (CSUR), 50(5):73.
Patrick Juola et al. 2008. Authorship attribution. Foundations and Trends in Information Retrieval, 1(3):233–334.
Twin Karmakharm, Nikolaos Aletras, and Kalina Bontcheva. 2019. Journalist-in-the-loop: Continuous learning as a service for rumour analysis. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP): System Demonstrations, pages 115–120.
Anupam Khattri, Aditya Joshi, Pushpak Bhattacharyya, and Mark Carman. 2015. Your sentiment precedes you: Using an author's historical tweets to predict sarcasm. In Proceedings of the 6th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, pages 25–30.
Diederik P. Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
Moshe Koppel, Jonathan Schler, and Shlomo Argamon. 2009. Computational methods in authorship attribution. Journal of the Association for Information Science and Technology, 60(1):9–26.
Roger J. Kreuz and Richard M. Roberts. 1993. On satire and parody: The importance of being ironic. Metaphor and Symbolic Activity, 8(2):97–109.
Vasileios Lampos, Nikolaos Aletras, Jens K. Geyti, Bin Zou, and Ingemar J. Cox. 2016. Inferring the socioeconomic status of social media users based on behaviour and language. In ECIR, pages 689–695.
Vasileios Lampos, Nikolaos Aletras, Daniel Preoţiuc-Pietro, and Trevor Cohn. 2014. Predicting and characterising user impact on Twitter. In 14th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2014, pages 405–413.
Vasileios Lampos, Daniel Preoţiuc-Pietro, and Trevor Cohn. 2013. A user-centric model of voting intention from social media. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 993–1003.
Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. 2019. RoBERTa: A robustly optimized BERT pretraining approach. arXiv preprint arXiv:1907.11692.
Sunghwan Mac Kim, Qiongkai Xu, Lizhen Qu, Stephen Wan, and Cécile Paris. 2017. Demographic inference on Twitter using recursive neural networks. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 471–477.
James McCorriston, David Jurgens, and Derek Ruths. 2015. Organizations are users too: Characterizing and detecting the presence of organizations on Twitter. ICWSM, pages 650–653.
Robert McHardy, Heike Adel, and Roman Klinger. 2019. Adversarial training for satire detection: Controlling for confounding variables. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 660–665.
Stephen Merity, Nitish Shirish Keskar, and Richard Socher. 2018. Regularizing and optimizing LSTM language models. In International Conference on Learning Representations.
Janaína Ignácio de Morais, Hugo Queiroz Abonizio, Gabriel Marques Tavares, André Azevedo da Fonseca, and Sylvio Barbon Jr. 2019. Deciding among fake, satirical, objective and legitimate news: A multi-label classification system. In Proceedings of the XV Brazilian Symposium on Information Systems, page 22.
Dong Nguyen, Noah A. Smith, and Carolyn P. Rosé. 2011. Author age prediction from text using linear regression. In Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, pages 115–123.
Silviu Oprea and Walid Magdy. 2019. Exploring author context for detecting intended vs perceived sarcasm. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 2854–2859.
Ruth Page. 2014. Hoaxes, hacking and humour: Analysing impersonated identity on social network sites, pages 46–64. Palgrave Macmillan UK.
Bo Pang, Lillian Lee, et al. 2008. Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval, 2(1–2):1–135.
John H. Parmelee and Shannon L. Bichard. 2011. Politics and the Twitter Revolution: How Tweets Influence the Relationship between Political Leaders and the Public. Lexington Books.
Jeffrey Pennington, Richard Socher, and Christopher Manning. 2014. GloVe: Global vectors for word representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 1532–1543.
Daniel Preoţiuc-Pietro, Jordan Carpenter, Salvatore Giorgi, and Lyle Ungar. 2016. Studying the Dark Triad of personality through Twitter behavior. In Proceedings of the 25th ACM International Conference on Information and Knowledge Management, pages 761–770.
Daniel Preoţiuc-Pietro and Rita Devlin Marier. 2019. Analyzing linguistic differences between owner and staff attributed tweets. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 2848–2853.
Daniel Preoţiuc-Pietro, Mihaela Gaman, and Nikolaos Aletras. 2019. Automatically identifying complaints in social media. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 5008–5019.
Daniel Preoţiuc-Pietro, Ye Liu, Daniel Hopkins, and Lyle Ungar. 2017. Beyond binary labels: Political ideology prediction of Twitter users. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 729–740.
Daniel Preoţiuc-Pietro and Lyle Ungar. 2018. User-level race and ethnicity predictors from Twitter text. In Proceedings of the 27th International Conference on Computational Linguistics, pages 1534–1545.
Daniel Preoţiuc-Pietro, Svitlana Volkova, Vasileios Lampos, Yoram Bachrach, and Nikolaos Aletras. 2015. Studying user income through language, behaviour and affect in social media. PLoS ONE, 10(9).
Hannah Rashkin, Eunsol Choi, Jin Yea Jang, Svitlana Volkova, and Yejin Choi. 2017. Truth of varying shades: Analyzing language in fake news and political fact-checking. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pages 2931–2937.
Margaret A. Rose. 1993. Parody: Ancient, Modern and Post-modern. Cambridge University Press.
Deborah F. Rossen-Knill and Richard Henry. 1997. The pragmatics of verbal parody. Journal of Pragmatics, 27(6):719–752.
H. Andrew Schwartz, Johannes C. Eichstaedt, Margaret L. Kern, Lukasz Dziurzynski, Stephanie M. Ramones, Megha Agrawal, Achal Shah, Michal Kosinski, David Stillwell, Martin E. P. Seligman, et al. 2013. Personality, gender, and age in the language of social media: The open-vocabulary approach. PLoS ONE, 8(9):e73791.
H. Andrew Schwartz, Salvatore Giorgi, Maarten Sap, Patrick Crutchley, Lyle Ungar, and Johannes Eichstaedt. 2017. DLATK: Differential language analysis toolkit. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pages 55–60.
Dan Sperber. 1984. Verbal irony: Pretense or echoic mention? American Psychological Association.
Efstathios Stamatatos. 2009. A survey of modern authorship attribution methods. Journal of the Association for Information Science and Technology, 60(3):538–556.
Adam Tsakalidis, Nikolaos Aletras, Alexandra I. Cristea, and Maria Liakata. 2018. Nowcasting the stance of social media users in a sudden vote: The case of the Greek Referendum. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management, pages 367–376.
Andranik Tumasjan, Timm O. Sprenger, Philipp G. Sandner, and Isabell M. Welpe. 2010. Predicting elections with Twitter: What 140 characters reveal about political sentiment. In 4th International AAAI Conference on Weblogs and Social Media, pages 178–185.
Cynthia Van Hee, Els Lefever, and Véronique Hoste. 2018. SemEval-2018 Task 3: Irony detection in English tweets. In Proceedings of the 12th International Workshop on Semantic Evaluation, pages 39–50.
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in Neural Information Processing Systems, pages 5998–6008.
Farida Vis. 2013. Twitter as a reporting tool for breaking news: Journalists tweeting the 2011 UK riots. Digital Journalism, 1(1):27–47.
Andreas Vlachos and Sebastian Riedel. 2014. Fact checking: Task definition and dataset construction. In Proceedings of the ACL 2014 Workshop on Language Technologies and Computational Social Science, pages 18–22.
Svitlana Volkova, Glen Coppersmith, and Benjamin Van Durme. 2014. Inferring user political preferences from streaming communications. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 186–196.
Soroush Vosoughi, Deb Roy, and Sinan Aral. 2018. The spread of true and false news online. Science, 359(6380):1146–1151.
Byron C. Wallace. 2015. Computational irony: A survey and new perspectives. Artificial Intelligence Review, 43(4):467–483.
Sarah Wan, Regina Koh, Andrew Ong, and Augustine Pang. 2015. Parody social media accounts: Influence and impact on organizations during crisis. Public Relations Review, 41(3):381–385.
Claire Wardle and Hossein Derakhshan. 2018. Thinking about information disorder: Formats of misinformation, disinformation, and mal-information. Journalism, Fake News & Disinformation. Paris: UNESCO, pages 43–54.
Deirdre Wilson. 2006. The pragmatics of verbal irony: Echo or pretence? Lingua, 116(10):1722–1743.
Zhilin Yang, Zihang Dai, Yiming Yang, Jaime Carbonell, Ruslan Salakhutdinov, and Quoc V. Le. 2019. XLNet: Generalized autoregressive pretraining for language understanding. arXiv preprint arXiv:1906.08237.
Peng Zhou, Wei Shi, Jun Tian, Zhenyu Qi, Bingchen Li, Hongwei Hao, and Bo Xu. 2016. Attention-based bidirectional long short-term memory networks for relation classification. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 207–212.