A Report on the Account Creation Improvement Project and the Fellowship


By Lennart Guldbrandsson

Table of contents

Introduction
Overview of the project and the Fellowship
  Timeline
    Before the Fellowship
    The Fellowship
    After the Fellowship
  Participants and method
The lay of the land
  The original account creation process
    The technical aspect
    The content aspect
  The “before” statistics
  The surveys
    Results
    Comments from people taking the survey
Testing it live
  The low-quality tests
    Samples of versions that were tested
    Results of the first tests
    The backlash with version 2
  CustomUserSignup
  The high-quality tests
    The user page creator
      Step 0 and 1
      Step 2
      Step 3
      Step 4
      The new user bar template
      Category:New Wikipedians
    The Sparked.com model
      Step 0 and 1
      Step 2
      Step 3
      Step 4
  Results
    What we measure and why
    The raw data
    Crunching the numbers
      All account creation processes
      ACP1
      ACP2
      ACP3
      Total scores
    Which account creation process is better?
    Other results and aspects
Final thoughts
  Further research
Appendices
  Survey 1
  Survey 2
    Interpreting the results
  List of MediaWiki messages involved in the account creation process
    Log In / Create Account page
    User Create Page
    Confirmation page
  Function that checks for available usernames
    The code
    Implementation