“I Won the Election!”: An Empirical Analysis of Soft Moderation Interventions on Twitter* Savvas Zannettou Max Planck Institute for Informatics
[email protected] Abstract Over the past few years, there is a heated debate and serious public concerns regarding online content moderation, censor- ship, and the principle of free speech on the Web. To ease these concerns, social media platforms like Twitter and Facebook refined their content moderation systems to support soft mod- eration interventions. Soft moderation interventions refer to warning labels attached to potentially questionable or harmful content to inform other users about the content and its nature Figure 1: An example of a soft moderation intervention on Twitter. while the content remains accessible, hence alleviating con- cerns related to censorship and free speech. In this work, we perform one of the first empirical studies on soft moderation interventions on Twitter. Using a mixed- be performed in a timely manner to ensure that harmful con- methods approach, we study the users who share tweets with tent is removed fast and only a small number of users are ex- warning labels on Twitter and their political leaning, the en- posed to harmful content. This is a tough challenge given the gagement that these tweets receive, and how users interact scale of modern social media platforms like Twitter and Face- with tweets that have warning labels. Among other things, we book. Second, content moderation should be consistent and find that 72% of the tweets with warning labels are shared by fair across users. Finally, content moderation should ensure Republicans, while only 11% are shared by Democrats.