Stre: Self Attentive Edit Quality Prediction in Wikipedia

Total Page:16

File Type:pdf, Size:1020Kb

Stre: Self Attentive Edit Quality Prediction in Wikipedia StRE: Self Attentive Edit Quality Prediction in Wikipedia Soumya Sarkar∗1, Bhanu Prakash Reddy*2, Sandipan Sikdar3 Animesh Mukherjee4 IIT Kharagpur, India1,2,4,RWTH Aachen, Germany3 [email protected],[email protected] [email protected], [email protected] Abstract guidelines for them1. Problem: The inherent openness of Wikipedia has Wikipedia can easily be justified as a behe- also made it vulnerable to external agents who moth, considering the sheer volume of con- tent that is added or removed every minute to intentionally attempt to divert the unbiased, ob- its several projects. This creates an immense jective discourse to a narrative which is aligned scope, in the field of natural language pro- with the interest of the malicious actors. Our pilot cessing toward developing automated tools for study on manually annotated 100 Wikipedia pages content moderation and review. In this paper of four categories (25 pages each category) shows we propose Self Attentive Revision Encoder us that at most 30% of the edits are reverted (See (StRE) which leverages orthographic similar- Fig1). Global average of number of reverted dam- ity of lexical units toward predicting the qual- ∼ 2 ity of new edits. In contrast to existing propo- aging edits is 9% . This makes manual interven- sitions which primarily employ features like tion to detect these edits with potential inconsis- page reputation, editor activity or rule based tent content, infeasible. Wikipedia hence deploys heuristics, we utilize the textual content of machine learning based classifiers (West et al., the edits which, we believe contains superior 2010; Halfaker and Taraborelli, 2015) which pri- signatures of their quality. More specifically, marily leverage hand-crafted features from three we deploy deep encoders to generate repre- aspects of revision (i) basic text features like re- sentations of the edits from its text content, peated characters, long words, capitalized words which we then leverage to infer quality. We further contribute a novel dataset containing etc. (ii) temporal features like inter arrival time be- ∼ 21M revisions across 32K Wikipedia pages tween events of interest (iii) dictionary based fea- and demonstrate that StRE outperforms exist- tures like presence of any curse words or informal ing methods by a significant margin – at least words (e.g., ‘hello’, ‘yolo’). Other feature based 17% and at most 103%. Our pre-trained model approaches include (Daxenberger and Gurevych, achieves such result after retraining on a set as 2013; Bronner and Monz, 2012) which generally small as 20% of the edits in a wikipage. This, follow a similar archetype. to the best of our knowledge, is also the first at- tempt towards employing deep language mod- Proposed model: In most of the cases, the ed- els to the enormous domain of automated con- its are reverted because they fail to abide by the tent moderation and review in Wikipedia. edit guidelines, like usage of inflammatory word- ing, expressing opinion instead of fact among oth- 1 Introduction ers (see Fig2). These flaws are fundamentally related to the textual content rather than tempo- Wikipedia is the largest multilingual encyclopedia ral patterns or editor behavior that have been de- known to mankind with the current English ver- ployed in existing methods. Although dictionary sion consisting of more than 5M articles on highly based approaches do look into text to a small ex- diverse topics which are segregated into cate- tent (swear words, long words etc.), they account gories, constructed by a large editor base of more for only a small subset of the edit patterns. We than 32M editors (Hube and Fetahu, 2019). To en- further hypothesize that owing to the volume and courage transparency and openness, Wikipedia al- lows anyone to edit its pages albeit with certain 1en.wikipedia.org/Wikipedia:List of policies *Both∗ authors contributed equally 2stats.wikimedia.org/EN/PlotsPngEditHistoryTop.htm 3962 Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 3962–3972 Florence, Italy, July 28 - August 2, 2019. c 2019 Association for Computational Linguistics Facebook and Myspace are social Facebook is lot better than Myspace 7000 Total Edits networking sites is every aspects. Reverted Edits 6000 Facebook formerly known as thefacebook 5000 Facebook formerly known as thefacebook is is RETARTED as well as a social networking a social networking service for high school, 4000 service for high school, college, university college, university communities communities 3000 Edit Count 2000 Dr Eric Schmidt, FORMER CEO OF NOVELL, [[Eric Schmidt]], took over Google’s CEO took over Google’s CEO when co-founder 1000 when co-founder Larry Page stepped down Larry Page stepped down 0 Company Person Technology Concept Category Google is foraying into other Google is trying to become the jack of businesses, which other companies all trades have recently dominated. Figure 1: Average number of edits and average number of damaging edits, i.e., reverted edits for four different Figure 2: Examples of edits in Facebook and Google categories of pages. A fraction (at most 30%) of user Wikipedia page. The blue bubbles are the original generated edits are damaging edits. sentences. The orange bubbles indicate damaging ed- its while the green bubbles indicate ‘good faith’ edits. Good faith edits are unbiased formal English sentence variety of Wikipedia data, it is impossible to de- while damaging edits often correspond to incoherent velop a feature driven approach which can en- use of language, abusive language, imperative mood, compass the wide array of dependencies present opinionated sentences etc. in text. In fact, we show that such approaches are inefficacious in identifying most of the damag- our model to pages with lower number of edits. ing edits owing to these obvious limitations. We hence propose Self Atttentive Revision Encoder Contributions: Our primary contributions in this (StRE) which extracts rich feature representations paper are summarized below - of an edit that can be further utilized to predict (i) We propose a deep neural network based model whether the edit has damaging intent. In specific, to predict edit quality in Wikipedia which utilizes we use two stacked recurrent neural networks to language modeling techniques, to encode seman- encode the semantic information from sequence tic information in natural language. of characters and sequence of words which serve (ii) We develop a novel dataset consisting of a twofold advantage. While character embeddings ∼ 21M unique edits extracted from ∼ 32K extract information from out of vocabulary tokens, Wikipedia pages. In fact our proposed model i.e., repeated characters, misspelled words, mali- outperforms all the existing methods in detecting cious capitalized characters, unnecessary punctu- damaging edits on this dataset. ation etc., word embeddings extract meaningful (iii) We further develop a transfer learning set up features from curse words, informal words, imper- which allows us to deploy our model to newer cat- ative tone, facts without references etc. We further egories without the need for training from scratch. employ attention mechanisms (Bahdanau et al., Code and sample data related to the pa- 2014) to quantify the importance of a particular per are available at https://github.com/ character/word. Finally we leverage this learned bhanu77prakash/StRE. representation to classify an edit to be damaging 2 The Model or valid. Note that StRE is reminiscent of struc- tured self attentive model proposed in (Lin et al., In this section we give a detailed description of 2017) albeit used in a different setting. our model. We consider an edit to be a pair of Findings: To determine the effectiveness of our sentences with one representing the original (Por) model, we develop an enormous dataset consisting while the other representing the edited version of ∼ 21M edits across 32K wikipages. We observe (Ped). The input to the model is the concatenation that StRE outperforms the closest baseline by at of Por and Ped (say P = fPorjjPedg) separated by a least 17% and at most 103% in terms of AUPRC. delimiter (‘jj’). We assume P consists of wi words Since it is impossible to develop an universal and ci characters. Essentially we consider two lev- model which performs equally well for all cat- els of encoding - (i) character level to extract pat- egories, we develop a transfer learning (Howard terns like repeated characters, misspelled words, and Ruder, 2018) set up which allows us to de- unnecessary punctuation etc. and (ii) word level ploy our model to newer categories without train- to identify curse words, imperative tone, opinion- ing from scratch. This further allows us to employ ated phrases etc. In the following we present how 3963 we generate a representation of the edit and utilize an MLP. Each embedded character is then passed it to detect malicious edits. The overall architec- through a bidirectional LSTM to obtain the hidden ture of StRE is presented in Fig3. states for each character. Formally, we have 2.1 Word encoder yi = s(Wcci + bc); i 2 [0;T] (7) Given an edit P with wi;i 2 [0;L] words, we first −! −−−! h i = LSTM(yi); i 2 [0;T] (8) embed the words through a pre-trained embedding − −−− matrix We such that xi = Wewi. This sequence of h i = LSTM(yi); i 2 [T;0] (9) embedded words is then provided as an input to a bidirectional LSTM (Hochreiter and Schmidhu- We next calculate the attention scores for each ber, 1997) which provides representations of the hidden state hi as words by summarizing information from both di- z = s(W h + b ) (10) rections. i c i c exp(zT u ) a = i c (11) x = W w ; i 2 [0;L] (1) i T T i e i ∑i=0 exp(zi uc) −! −−−! v i = LSTM(xi); i 2 [0;L] (2) Rc = ∑aihi (12) − −−− i v i = LSTM(xi); i 2 [L;0] (3) Note that uc is a character context vector which We obtain the representation for each word by is learned during training.
Recommended publications
  • Community and Communication
    Community and 12 Communication A large, diverse, and thriving group of volun- teers produces encyclopedia articles and administers Wikipedia. Over time, members of the Wikipedia community have developed conventions for interacting with each other, processes for managing content, and policies for minimizing disruptions and maximizing use- ful work. In this chapter, we’ll discuss where to find other contributors and how to ask for help with any topic. We’ll also explain ways in which community members interact with each other. Though most discussion occurs on talk pages, Wikipedia has some central community forums for debate about the site’s larger policies and more specific issues. We’ll also talk about the make-up of the community. First, however, we’ll outline aspects of Wikipedia’s shared culture, from key philosophies about how contributors How Wikipedia Works (C) 2008 by Phoebe Ayers, Charles Matthews, and Ben Yates should interact with each other to some long-running points of debate to some friendly practices that have arisen over time. Although explicit site policies cover content guidelines and social norms, informal philosophies and practices help keep the Wikipedia community of contributors together. Wikipedia’s Culture Wikipedia’s community has grown spontaneously and organically—a recipe for a baffling culture rich with in-jokes and insider references. But core tenets of the wiki way, like Assume Good Faith and Please Don’t Bite the Newcomers, have been with the community since the beginning. Assumptions on Arrival Wikipedians try to treat new editors well. Assume Good Faith (AGF) is a funda- mental philosophy, as well as an official guideline (shortcut WP:AGF) on Wikipedia.
    [Show full text]
  • The Culture of Wikipedia
    Good Faith Collaboration: The Culture of Wikipedia Good Faith Collaboration The Culture of Wikipedia Joseph Michael Reagle Jr. Foreword by Lawrence Lessig The MIT Press, Cambridge, MA. Web edition, Copyright © 2011 by Joseph Michael Reagle Jr. CC-NC-SA 3.0 Purchase at Amazon.com | Barnes and Noble | IndieBound | MIT Press Wikipedia's style of collaborative production has been lauded, lambasted, and satirized. Despite unease over its implications for the character (and quality) of knowledge, Wikipedia has brought us closer than ever to a realization of the centuries-old Author Bio & Research Blog pursuit of a universal encyclopedia. Good Faith Collaboration: The Culture of Wikipedia is a rich ethnographic portrayal of Wikipedia's historical roots, collaborative culture, and much debated legacy. Foreword Preface to the Web Edition Praise for Good Faith Collaboration Preface Extended Table of Contents "Reagle offers a compelling case that Wikipedia's most fascinating and unprecedented aspect isn't the encyclopedia itself — rather, it's the collaborative culture that underpins it: brawling, self-reflexive, funny, serious, and full-tilt committed to the 1. Nazis and Norms project, even if it means setting aside personal differences. Reagle's position as a scholar and a member of the community 2. The Pursuit of the Universal makes him uniquely situated to describe this culture." —Cory Doctorow , Boing Boing Encyclopedia "Reagle provides ample data regarding the everyday practices and cultural norms of the community which collaborates to 3. Good Faith Collaboration produce Wikipedia. His rich research and nuanced appreciation of the complexities of cultural digital media research are 4. The Puzzle of Openness well presented.
    [Show full text]
  • Classifying Wikipedia Article Quality with Revision History Networks
    Classifying Wikipedia Article Quality With Revision History Networks Narun Raman∗ Nathaniel Sauerberg∗ Carleton College Carleton College [email protected] [email protected] Jonah Fisher Sneha Narayan Carleton College Carleton College [email protected] [email protected] ABSTRACT long been interested in maintaining and investigating the quality We present a novel model for classifying the quality of Wikipedia of its content [4][6][12]. articles based on structural properties of a network representation Editors and WikiProjects typically rely on assessments of article of the article’s revision history. We create revision history networks quality to focus volunteer attention on improving lower quality (an adaptation of Keegan et. al’s article trajectory networks [7]), articles. This has led to multiple efforts to create classifiers that can where nodes correspond to individual editors of an article, and edges predict the quality of a given article [3][4][18]. These classifiers can join the authors of consecutive revisions. Using descriptive statistics assist in providing assessments of article quality at scale, and help generated from these networks, along with general properties like further our understanding of the features that distinguish high and the number of edits and article size, we predict which of six quality low quality Wikipedia articles. classes (Start, Stub, C-Class, B-Class, Good, Featured) articles belong While many Wikipedia article quality classifiers have focused to, attaining a classification accuracy of 49.35% on a stratified sample on assessing quality based on the content of the latest version of of articles. These results suggest that structures of collaboration an article [1, 4, 18], prior work has suggested that high quality arti- underlying the creation of articles, and not just the content of the cles are associated with more intense collaboration among editors article, should be considered for accurate quality classification.
    [Show full text]
  • Controlled Analyses of Social Biases in Wikipedia Bios
    Controlled Analyses of Social Biases in Wikipedia Bios Anjalie Field Chan Young Park Yulia Tsvetkov [email protected] [email protected] [email protected] Carnegie Mellon University Carnegie Mellon University Carnegie Mellon University ABSTRACT as explanatory variables in a regression model, which restricts anal- Social biases on Wikipedia, a widely-read global platform, could ysis to regression models and requires explicitly enumerating all greatly influence public opinion. While prior research has examined confounds [52]. man/woman gender bias in biography articles, possible influences In contrast, we develop a matching algorithm that enables ana- of confounding variables limit conclusions. In this work, we present lyzing different demographic groups while reducing the influence a methodology for reducing the effects of confounding variables in of confounding variables. Given a corpus of Wikipedia biography analyses of Wikipedia biography pages. Given a target corpus for pages for people that contain target attributes (e.g. pages for cis- analysis (e.g. biography pages about women), we present a method gender women), our algorithm builds a matched comparison corpus for constructing a comparison corpus that matches the target cor- of biography pages for people that do not (e.g. for cisgender men). pus in as many attributes as possible, except the target attribute The comparison corpus is constructed so that it closely matches the (e.g. the gender of the subject). We evaluate our methodology by de- target corpus on all known attributes except the targeted one. Thus, veloping metrics to measure how well the comparison corpus aligns examining differences between the two corpora can reveal content with the target corpus.
    [Show full text]
  • Project Talk: Coordination Work and Group Membership in Wikiprojects
    Project talk: Coordination work and group membership in WikiProjects Jonathan T. Morgan*, Michael Gilbert*, David W. McDonald**, Mark Zachry* *Human Centered Design & Engineering **The Information School University of Washington Seattle, WA USA {jmo25, mdg, dwmc, zachry} @uw.edu ABSTRACT Groups emerge in online collaborations as individuals organize their WikiProjects have contributed to Wikipedia’s success in important productive activities around shared goals, interests, tasks and work- ways, yet the range of work that WikiProjects perform and the way spaces. These groups can provide important benefits for their mem- they coordinate that work remains largely unexplored. In this study, bers and perform valuable work for the community they belong to. we perform a content analysis of 788 work-related discussions from Lave & Wenger [16] assert that the most effective way to under- the talk pages of 138 WikiProjects in order to understand the role stand working groups like these is to examine the work activities WikiProjects play in collaborative work on Wikipedia. We find that their members engage in. But, as the scenario above illustrates, the editors use WikiProjects to coordinate a wide variety of work identifying the members of an online group and the work the group activities beyond content production and that non-members play an performs can be difficult for an outsider—whether they are a new active role in that work. Our research suggests that WikiProject user, a researcher or a system designer. collaboration is less structured and more open than that of many virtual teams and that WikiProjects may function more like FLOSS Research on the behavior of Wikipedia editors has informed our projects than traditional groups.
    [Show full text]
  • Critical Point of View: a Wikipedia Reader
    w ikipedia pedai p edia p Wiki CRITICAL POINT OF VIEW A Wikipedia Reader 2 CRITICAL POINT OF VIEW A Wikipedia Reader CRITICAL POINT OF VIEW 3 Critical Point of View: A Wikipedia Reader Editors: Geert Lovink and Nathaniel Tkacz Editorial Assistance: Ivy Roberts, Morgan Currie Copy-Editing: Cielo Lutino CRITICAL Design: Katja van Stiphout Cover Image: Ayumi Higuchi POINT OF VIEW Printer: Ten Klei Groep, Amsterdam Publisher: Institute of Network Cultures, Amsterdam 2011 A Wikipedia ISBN: 978-90-78146-13-1 Reader EDITED BY Contact GEERT LOVINK AND Institute of Network Cultures NATHANIEL TKACZ phone: +3120 5951866 INC READER #7 fax: +3120 5951840 email: [email protected] web: http://www.networkcultures.org Order a copy of this book by sending an email to: [email protected] A pdf of this publication can be downloaded freely at: http://www.networkcultures.org/publications Join the Critical Point of View mailing list at: http://www.listcultures.org Supported by: The School for Communication and Design at the Amsterdam University of Applied Sciences (Hogeschool van Amsterdam DMCI), the Centre for Internet and Society (CIS) in Bangalore and the Kusuma Trust. Thanks to Johanna Niesyto (University of Siegen), Nishant Shah and Sunil Abraham (CIS Bangalore) Sabine Niederer and Margreet Riphagen (INC Amsterdam) for their valuable input and editorial support. Thanks to Foundation Democracy and Media, Mondriaan Foundation and the Public Library Amsterdam (Openbare Bibliotheek Amsterdam) for supporting the CPOV events in Bangalore, Amsterdam and Leipzig. (http://networkcultures.org/wpmu/cpov/) Special thanks to all the authors for their contributions and to Cielo Lutino, Morgan Currie and Ivy Roberts for their careful copy-editing.
    [Show full text]
  • Detailed Table of Contents
    Contents in Detail Introduction.............................................................................................................................. xvii Inside This Book ............................................................................................................................. xviii What You Should Know Going In ...................................................................................................xix Using This Book ............................................................................................................................... xix Our Approach to Understanding Wikipedia .................................................................................xx It’s Everyone’s Encyclopedia: Be Bold! .........................................................................................xxi Wikisyntax Cheatsheet..................................................................................................................xxiii Part I: Content Chapter 1: What’s in Wikipedia?.....................................................................................3 Types of Articles..................................................................................................................................7 Article and Content Inclusion Policies............................................................................................ 11 Core Policies: V, NOR, and NPOV ............................................................................................. 12 Understanding the Policies.......................................................................................................
    [Show full text]
  • Xtools Release 3.1.45
    XTools Release 3.1.45 Mar 05, 2018 Table of Contents 1 Tools 3 1.1 Edit Counter...............................................3 1.1.1 General Statistics........................................3 1.1.2 Namespace totals........................................4 1.1.3 Timecard............................................4 1.1.4 Year counts...........................................4 1.1.5 Month counts..........................................4 1.1.6 Latest global edits........................................4 1.1.7 Automated edits.........................................4 1.2 Page History...............................................4 1.2.1 General Statistics........................................5 1.2.2 Top editors...........................................5 1.2.3 Year counts...........................................6 1.2.4 Month counts..........................................6 1.2.5 (Semi-)automated edits.....................................6 1.2.6 Assessments...........................................6 1.2.7 Bugs...............................................6 1.3 Pages Created..............................................6 1.4 Top Edits.................................................7 1.5 Admin Score...............................................7 1.5.1 Algorithm............................................7 1.6 Bash Quote................................................7 1.7 Simple Counter..............................................7 2 API 9 2.1 Project API................................................9 2.1.1 Normalize project........................................9
    [Show full text]
  • A Wikipedia Reader
    UvA-DARE (Digital Academic Repository) Critical point of view: a Wikipedia reader Lovink, G.; Tkacz, N. Publication date 2011 Document Version Final published version Link to publication Citation for published version (APA): Lovink, G., & Tkacz, N. (2011). Critical point of view: a Wikipedia reader. (INC reader; No. 7). Institute of Network Cultures. http://www.networkcultures.org/_uploads/%237reader_Wikipedia.pdf General rights It is not permitted to download or to forward/distribute the text or part of it without the consent of the author(s) and/or copyright holder(s), other than for strictly personal, individual use, unless the work is under an open content license (like Creative Commons). Disclaimer/Complaints regulations If you believe that digital publication of certain material infringes any of your rights or (privacy) interests, please let the Library know, stating your reasons. In case of a legitimate complaint, the Library will make the material inaccessible and/or remove it from the website. Please Ask the Library: https://uba.uva.nl/en/contact, or a letter to: Library of the University of Amsterdam, Secretariat, Singel 425, 1012 WP Amsterdam, The Netherlands. You will be contacted as soon as possible. UvA-DARE is a service provided by the library of the University of Amsterdam (https://dare.uva.nl) Download date:05 Oct 2021 w ikipedia pedai p edia p Wiki CRITICAL POINT OF VIEW A Wikipedia Reader 2 CRITICAL POINT OF VIEW A Wikipedia Reader CRITICAL POINT OF VIEW 3 Critical Point of View: A Wikipedia Reader Editors: Geert Lovink
    [Show full text]
  • African Americans in STEM Wikipedia Edit-A-Thon
    African Americans in STEM Wikipedia Edit-A-Thon February 21 & 28, 2021 Presented by the National Museum of African American History and Culture and blackcomputeHER.org with support from Wikimedia DC While you wait, please login to Wikipedia (en.wikipedia.org) and the Outreach Dashboard. Links in chat. Photo: C.R. Patterson & Sons Company, Patterson-Greenfield Automobile. Source: Wikimedia Commons, User: Jooojay African Americans in STEM Wikipedia Edit-A-Thon Agenda 1. Welcome 2. Wikipedia Editing Training 3. Open editing 2 Wikimedia DC Wikimedia DC is the regional outreach organization for Wikipedia and the other projects of the Wikimedia Foundation. Our mission is to promote participation in Wikimedia projects in Washington, DC, Maryland, Virginia, West Virginia, Delaware and throughout the United States. Ariel Cetrone - Institutional Partnerships Manager Wikipedia Username: @Ariel Cetrone (WMDC) WikimediaDC.org 3 Safe Space Policy The purpose of the Safe Space Policy ("Policy") is to ensure that the events and programs operated by Wikimedia District of Columbia ("Wikimedia DC") are free of harassment and other unwelcome behavior. This Policy applies to all events where Wikimedia DC has both authority and reasonable means to implement the provisions of this Policy. It further applies to all programs operated by Wikimedia DC and all spaces, whether physical or virtual, under Wikimedia DC control. Access the full policy at WikimediaDC.org or on today’s wiki event page 4 Today’s Objectives and Goals Attendees will: ● Gain an understanding
    [Show full text]
  • About Wikipedia (English)
    Wikipedia:About 1 Wikipedia:About Wikipedia (pronounced /ˌwɪkɨˈpiːdi.ə/ WIK-i-PEE-dee-ə) is a multilingual, web-based, free-content encyclopedia project based on an openly editable model. The name "Wikipedia" is a portmanteau of the words wiki (a technology for creating collaborative websites, from the Hawaiian word wiki, meaning "quick") and encyclopedia. Wikipedia's articles provide links to guide the user to related pages with additional information. Wikipedia is written collaboratively by largely anonymous Internet volunteers who write without pay. Anyone with Internet access can write and make changes to Wikipedia articles (except in certain cases where editing is restricted to prevent disruption or vandalism). Users can contribute anonymously, under a pseudonym, or with their real identity, if they choose. The fundamental principles by which Wikipedia operates are the Five pillars. The Wikipedia community has developed many policies and guidelines to improve the encyclopedia; however, it is not a formal requirement to be familiar with them before contributing. Since its creation in 2001, Wikipedia has grown rapidly into one of the largest reference websites, attracting nearly 78 million visitors [1] monthly as of January 2010. There are more than 91,000 active contributors [2] working on more than 17,000,000 articles in more than 270 languages. As of today, there are 0 articles in English. Every day, hundreds of thousands of visitors from around the world collectively make tens of thousands of edits and create thousands of new articles to augment the knowledge held by the Wikipedia encyclopedia. (See also: Wikipedia:Statistics.) People of all ages, cultures and backgrounds can add or edit article prose, references, images and other media here.
    [Show full text]
  • Thanks for Stopping By: a Study of “Thanks
    Thanks for Stopping By: A Study of “Thanks” Usage on Wikimedia Swati Goel Ashton Anderson Leila Zia [email protected] [email protected] [email protected] Henry M. Gunn High School University of Toronto Wikimedia Foundation ABSTRACT editors a better experience, can therefore increase editor activity. The Thanks feature on Wikipedia, also known as “Thanks”, is a tool A positive environment may actually be one of the most crucial with which editors can quickly and easily send one other positive elements for increasing engagement, as social factors tend to out- feedback [1]. The aim of this project is to better understand this fea- weigh even those surrounding usability with regards to positively ture: its scope, the characteristics of a typical “Thanks” interaction, affecting contribution [3]. The impact of these social factors could and the effects of receiving a thank on individual editors. Westudy be quite significant, as a community member’s internal value sys- the motivational impacts of “Thanks” because maintaining editor tems can be influenced by external rewards, thus making positive engagement is a central problem for crowdsourced repositories of feedback an extremely useful tool in building online communi- knowledge such as Wikimedia. Our main findings are that most ties [6]. The Thanks feature could therefore represent an important editors have not been exposed to the Thanks feature (meaning they resource for building a positive Wiki community. have never given nor received a thank), thanks are typically sent “Thanks” is no longer a new Wiki feature, having been imple- upwards (from less experienced to more experienced editors), and mented on English Wikipedia on May 30th, 2013 and introduced receiving a thank is correlated with having high levels of editor to all projects soon thereafter.
    [Show full text]