Wikipedia and Cyberculture ___/ a Platform for the Social Consensus, Or a Platform for Social Folly?
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Wikipedia and Intermediary Immunity: Supporting Sturdy Crowd Systems for Producing Reliable Information Jacob Rogers Abstract
THE YALE LAW JOURNAL FORUM O CTOBER 9 , 2017 Wikipedia and Intermediary Immunity: Supporting Sturdy Crowd Systems for Producing Reliable Information Jacob Rogers abstract. The problem of fake news impacts a massive online ecosystem of individuals and organizations creating, sharing, and disseminating content around the world. One effective ap- proach to addressing false information lies in monitoring such information through an active, engaged volunteer community. Wikipedia, as one of the largest online volunteer contributor communities, presents one example of this approach. This Essay argues that the existing legal framework protecting intermediary companies in the United States empowers the Wikipedia community to ensure that information is accurate and well-sourced. The Essay further argues that current legal efforts to weaken these protections, in response to the “fake news” problem, are likely to create perverse incentives that will harm volunteer engagement and confuse the public. Finally, the Essay offers suggestions for other intermediaries beyond Wikipedia to help monitor their content through user community engagement. introduction Wikipedia is well-known as a free online encyclopedia that covers nearly any topic, including both the popular and the incredibly obscure. It is also an encyclopedia that anyone can edit, an example of one of the largest crowd- sourced, user-generated content websites in the world. This user-generated model is supported by the Wikimedia Foundation, which relies on the robust intermediary liability immunity framework of U.S. law to allow the volunteer editor community to work independently. Volunteer engagement on Wikipedia provides an effective framework for combating fake news and false infor- mation. 358 wikipedia and intermediary immunity: supporting sturdy crowd systems for producing reliable information It is perhaps surprising that a project open to public editing could be highly reliable. -
Decentralization in Wikipedia Governance
Decentralization in Wikipedia Governance Andrea Forte1, Vanessa Larco2 and Amy Bruckman1 1GVU Center, College of Computing, Georgia Institute of Technology {aforte, asb}@cc.gatech.edu 2Microsoft [email protected] This is a preprint version of the journal article: Forte, Andrea, Vanessa Larco and Amy Bruckman. (2009) Decentralization in Wikipedia Governance. Journal of Management Information Systems. 26(1) pp 49-72. Publisher: M.E. Sharp www.mesharpe.com/journals.asp Abstract How does “self-governance” happen in Wikipedia? Through in-depth interviews with twenty individuals who have held a variety of responsibilities in the English-language Wikipedia, we obtained rich descriptions of how various forces produce and regulate social structures on the site. Our analysis describes Wikipedia as an organization with highly refined policies, norms, and a technological architecture that supports organizational ideals of consensus building and discussion. We describe how governance on the site is becoming increasingly decentralized as the community grows and how this is predicted by theories of commons-based governance developed in offline contexts. We also briefly examine local governance structures called WikiProjects through the example of WikiProject Military History, one of the oldest and most prolific projects on the site. 1. The Mechanisms of Self-Organization Should a picture of a big, hairy tarantula appear in an encyclopedia article about arachnophobia? Does it illustrate the point, or just frighten potential readers? Reasonable people might disagree on this question. In a freely editable site like Wikipedia, anyone can add the photo, and someone else can remove it. And someone can add it back, and the process continues. -
Position Description Addenda
POSITION DESCRIPTION January 2014 Wikimedia Foundation Executive Director - Addenda The Wikimedia Foundation is a radically transparent organization, and much information can be found at www.wikimediafoundation.org . That said, certain information might be particularly useful to nominators and prospective candidates, including: Announcements pertaining to the Wikimedia Foundation Executive Director Search Kicking off the search for our next Executive Director by Former Wikimedia Foundation Board Chair Kat Walsh An announcement from Wikimedia Foundation ED Sue Gardner by Wikimedia Executive Director Sue Gardner Video Interviews on the Wikimedia Community and Foundation and Its History Some of the values and experiences of the Wikimedia Community are best described directly by those who have been intimately involved in the organization’s dramatic expansion. The following interviews are available for viewing though mOppenheim.TV . • 2013 Interview with Former Wikimedia Board Chair Kat Walsh • 2013 Interview with Wikimedia Executive Director Sue Gardner • 2009 Interview with Wikimedia Executive Director Sue Gardner Guiding Principles of the Wikimedia Foundation and the Wikimedia Community The following article by Sue Gardner, the current Executive Director of the Wikimedia Foundation, has received broad distribution and summarizes some of the core cultural values shared by Wikimedia’s staff, board and community. Topics covered include: • Freedom and open source • Serving every human being • Transparency • Accountability • Stewardship • Shared power • Internationalism • Free speech • Independence More information can be found at: https://meta.wikimedia.org/wiki/User:Sue_Gardner/Wikimedia_Foundation_Guiding_Principles Wikimedia Policies The Wikimedia Foundation has an extensive list of policies and procedures available online at: http://wikimediafoundation.org/wiki/Policies Wikimedia Projects All major projects of the Wikimedia Foundation are collaboratively developed by users around the world using the MediaWiki software. -
Danish Resources
Danish resources Finn Arup˚ Nielsen November 19, 2017 Abstract A range of different Danish resources, datasets and tools, are presented. The focus is on resources for use in automated computational systems and free resources that can be redistributed and used in commercial applications. Contents 1 Corpora3 1.1 Wikipedia...................................3 1.2 Wikisource...................................3 1.3 Wikiquote...................................4 1.4 ADL......................................4 1.5 Gutenberg...................................4 1.6 Runeberg...................................5 1.7 Europarl....................................5 1.8 Leipzig Corpora Collection..........................5 1.9 Danish Dependency Treebank........................6 1.10 Retsinformation................................6 1.11 Other resources................................6 2 Lexical resources6 2.1 DanNet....................................6 2.2 Wiktionary..................................7 2.3 Wikidata....................................7 2.4 OmegaWiki..................................8 2.5 Other lexical resources............................8 2.6 Wikidata examples with medical terminology extraction.........8 3 Natural language processing tools9 3.1 NLTK.....................................9 3.2 Polyglot....................................9 3.3 spaCy.....................................9 3.4 Apache OpenNLP............................... 10 3.5 Centre for Language Technology....................... 10 3.6 Other libraries................................ -
Proceedings of the 46Th Annual Meeting of the Association for Computational Linguistics on Hu- Man Language Technologies, Pages 9–12
Coling 2010 23rd International Conference on Computational Linguistics Proceedings of the 2nd Workshop on The People’s Web Meets NLP: Collaboratively Constructed Semantic Resources Produced by Chinese Information Processing Society of China All rights reserved for Coling 2010 CD production. To order the CD of Coling 2010 and its Workshop Proceedings, please contact: Chinese Information Processing Society of China No.4, Southern Fourth Street Haidian District, Beijing, 100190 China Tel: +86-010-62562916 Fax: +86-010-62562916 [email protected] ii Introduction This volume contains papers accepted for presentation at the 2nd Workshop on Collaboratively Constructed Semantic Resources that took place on August 28, 2010, as part of the Coling 2010 conference in Beijing. Being the second workshop on this topic, we were able to build on the success of the previous workshop on this topic held as part of ACL-IJCNLP 2009. In many works, collaboratively constructed semantic resources have been used to overcome the knowledge acquisition bottleneck and coverage problems pertinent to conventional lexical semantic resources. The greatest popularity in this respect can so far certainly be attributed to Wikipedia. However, other resources, such as folksonomies or the multilingual collaboratively constructed dictionary Wiktionary, have also shown great potential. Thus, the scope of the workshop deliberately includes any collaboratively constructed resource, not only Wikipedia. Effective deployment of such resources to enhance Natural Language Processing introduces a pressing need to address a set of fundamental challenges, e.g. the interoperability with existing resources, or the quality of the extracted lexical semantic knowledge. Interoperability between resources is crucial as no single resource provides perfect coverage. -
Modeling User Reputation in Wikipedia
Modeling User Reputation in Wikis Sara Javanmardi, Cristina Lopes and Pierre Baldi Bren School of Information and Computer Sciences, University of California, Irvine, USA {sjavanma,lopes,pfbaldi}@ics.uci.edu Abstract Collaborative systems available on the Web allow millions of users to share information through a growing collection of tools and platforms such as wikis, blogs, and shared forums. By their very nature, these systems contain resources and information with different quality levels. The open na- ture of these systems, however, makes it difficult for users to determine the quality of the available information and the reputation of its providers. Here, we first parse and mine the entire English Wikipedia history pages in order to extract detailed user edit patterns and statistics. We then use these patterns and statistics to derive three computational models of a user’s reputation. Finally, we validate these models using ground–truth Wikipedia data associated with vandals and adminis- trators. When used as a classifier, the best model produces an area under the ROC curve of 0.98. Furthermore, we assess the reputation predictions generated by the models on other users, and show that all three models can be used efficiently for predicting user behavior in Wikipedia. Keywords: Wiki, Reputation, Reliability, Wiki Mining, Wikipedia, Web 2.0 1 Introduction The last few years have seen a substantial growth in user–generated Web content. Simple editing interfaces encourage users to create and maintain repositories of shared content. Online information repositories such as wikis, forums and blogs have increased the participation of the general public in the production of web content through the notion of social software [1, 2, 3]. -
A Topic-Aligned Multilingual Corpus of Wikipedia Articles for Studying Information Asymmetry in Low Resource Languages
Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), pages 2373–2380 Marseille, 11–16 May 2020 c European Language Resources Association (ELRA), licensed under CC-BY-NC A Topic-Aligned Multilingual Corpus of Wikipedia Articles for Studying Information Asymmetry in Low Resource Languages Dwaipayan Roy, Sumit Bhatia, Prateek Jain GESIS - Cologne, IBM Research - Delhi, IIIT - Delhi [email protected], [email protected], [email protected] Abstract Wikipedia is the largest web-based open encyclopedia covering more than three hundred languages. However, different language editions of Wikipedia differ significantly in terms of their information coverage. We present a systematic comparison of information coverage in English Wikipedia (most exhaustive) and Wikipedias in eight other widely spoken languages (Arabic, German, Hindi, Korean, Portuguese, Russian, Spanish and Turkish). We analyze the content present in the respective Wikipedias in terms of the coverage of topics as well as the depth of coverage of topics included in these Wikipedias. Our analysis quantifies and provides useful insights about the information gap that exists between different language editions of Wikipedia and offers a roadmap for the Information Retrieval (IR) community to bridge this gap. Keywords: Wikipedia, Knowledge base, Information gap 1. Introduction other with respect to the coverage of topics as well as Wikipedia is the largest web-based encyclopedia covering the amount of information about overlapping topics. -
Omnipedia: Bridging the Wikipedia Language
Omnipedia: Bridging the Wikipedia Language Gap Patti Bao*†, Brent Hecht†, Samuel Carton†, Mahmood Quaderi†, Michael Horn†§, Darren Gergle*† *Communication Studies, †Electrical Engineering & Computer Science, §Learning Sciences Northwestern University {patti,brent,sam.carton,quaderi}@u.northwestern.edu, {michael-horn,dgergle}@northwestern.edu ABSTRACT language edition contains its own cultural viewpoints on a We present Omnipedia, a system that allows Wikipedia large number of topics [7, 14, 15, 27]. On the other hand, readers to gain insight from up to 25 language editions of the language barrier serves to silo knowledge [2, 4, 33], Wikipedia simultaneously. Omnipedia highlights the slowing the transfer of less culturally imbued information similarities and differences that exist among Wikipedia between language editions and preventing Wikipedia’s 422 language editions, and makes salient information that is million monthly visitors [12] from accessing most of the unique to each language as well as that which is shared information on the site. more widely. We detail solutions to numerous front-end and algorithmic challenges inherent to providing users with In this paper, we present Omnipedia, a system that attempts a multilingual Wikipedia experience. These include to remedy this situation at a large scale. It reduces the silo visualizing content in a language-neutral way and aligning effect by providing users with structured access in their data in the face of diverse information organization native language to over 7.5 million concepts from up to 25 strategies. We present a study of Omnipedia that language editions of Wikipedia. At the same time, it characterizes how people interact with information using a highlights similarities and differences between each of the multilingual lens. -
Transformation of Participation in a Collaborative Online Encyclopedia Susan L
Becoming Wikipedian: Transformation of Participation in a Collaborative Online Encyclopedia Susan L. Bryant, Andrea Forte, Amy Bruckman College of Computing/GVU Center, Georgia Institute of Technology 85 5th Street, Atlanta, GA, 30332 [email protected]; {aforte, asb}@cc.gatech.edu ABSTRACT New forms of computer-supported cooperative work have sprung Traditional activities change in surprising ways when computer- from the World Wide Web faster than researchers can hope to mediated communication becomes a component of the activity document, let alone understand. In fact, the organic, emergent system. In this descriptive study, we leverage two perspectives on nature of Web-based community projects suggests that people are social activity to understand the experiences of individuals who leveraging Web technologies in ways that largely satisfy the became active collaborators in Wikipedia, a prolific, social demands of working with geographically distant cooperatively-authored online encyclopedia. Legitimate collaborators. In order to better understand this phenomenon, we peripheral participation provides a lens for understanding examine how several active collaborators became members of the participation in a community as an adaptable process that evolves extraordinarily productive and astonishingly successful over time. We use ideas from activity theory as a framework to community of Wikipedia. describe our results. Finally, we describe how activity on the In this introductory section, we describe the Wikipedia and related Wikipedia stands in striking contrast to traditional publishing and research, as well as two perspectives on social activity: activity suggests a new paradigm for collaborative systems. theory (AT) and legitimate peripheral participation (LPP). Next, we describe our study and how ideas borrowed from activity Categories and Subject Descriptors theory helped us investigate the ways that participation in the J.7 [Computer Applications]: Computers in Other Systems – Wikipedia community is transformed along multiple dimensions publishing. -
The Culture of Wikipedia
Good Faith Collaboration: The Culture of Wikipedia Good Faith Collaboration The Culture of Wikipedia Joseph Michael Reagle Jr. Foreword by Lawrence Lessig The MIT Press, Cambridge, MA. Web edition, Copyright © 2011 by Joseph Michael Reagle Jr. CC-NC-SA 3.0 Purchase at Amazon.com | Barnes and Noble | IndieBound | MIT Press Wikipedia's style of collaborative production has been lauded, lambasted, and satirized. Despite unease over its implications for the character (and quality) of knowledge, Wikipedia has brought us closer than ever to a realization of the centuries-old Author Bio & Research Blog pursuit of a universal encyclopedia. Good Faith Collaboration: The Culture of Wikipedia is a rich ethnographic portrayal of Wikipedia's historical roots, collaborative culture, and much debated legacy. Foreword Preface to the Web Edition Praise for Good Faith Collaboration Preface Extended Table of Contents "Reagle offers a compelling case that Wikipedia's most fascinating and unprecedented aspect isn't the encyclopedia itself — rather, it's the collaborative culture that underpins it: brawling, self-reflexive, funny, serious, and full-tilt committed to the 1. Nazis and Norms project, even if it means setting aside personal differences. Reagle's position as a scholar and a member of the community 2. The Pursuit of the Universal makes him uniquely situated to describe this culture." —Cory Doctorow , Boing Boing Encyclopedia "Reagle provides ample data regarding the everyday practices and cultural norms of the community which collaborates to 3. Good Faith Collaboration produce Wikipedia. His rich research and nuanced appreciation of the complexities of cultural digital media research are 4. The Puzzle of Openness well presented. -
Introduction: Connections
Wikipedia @ 20 Introduction: Connections Joseph Reagle, Jackie Koerner Published on: Aug 22, 2019 Updated on: Jun 04, 2020 License: Creative Commons Attribution 4.0 International License (CC-BY 4.0) Wikipedia @ 20 Introduction: Connections Image credit: William Warby, Vasco da Gama Bridge B&W, 2018. Introduction: Connections Joseph Reagle and Jackie Koerner Twenty years ago, Wikipedia set out on its path toward providing humanity with free access to the sum of all knowledge. Even if this is a mission that can’t be finished, Wikipedia has made remarkable progress toward the impossible. How so? Wikipedia is an encyclopedia built on a wiki. And never has an application, gathering the sum of human knowledge, been so suited to its medium, easily interconnected web pages. Encyclopedias have long been reliant on interconnections. In 1755, the Encyclopédie’s Denis Diderot wrote that the use of cross-references (or renvois) was “the most important part of our encyclopedia scheme.”1 This feature allowed the Encyclopédie’s editors to depict the connective tissue of Enlightenment knowledge and to dodge state and church authorities by way of facetious and satirical references. For example, they expressed skepticism about the “Eucharist” and “Communion” by linking to them from the article on “Cannibals.” At the onset of each new informational medium—from paper, to microfilm, to silicon—connectivity was the impetus. There are many examples, but consider the names of the following. Among the documentalists of the early twentieth-century, there was Wilhelm Ostwald’s Brücke, a bridge, and Suzanne Briet’s indice, an indicator. Such documentalists advanced indexing and classification schemes to improve interconnections between information. -
Name of the Tool Citizendium Home Page Logo URL En.Citizendium
Name of the Tool Citizendium Home Page Logo URL en.citizendium.org Subject Encyclopedias Accessibility Free Language English Publisher CZ: Media Assets Workgroup Brief History It is an English-language Wiki-based free encyclopedia project launched by Larry Sanger, who had previously co - founded Wikipedia in 2001. It had launched on 23 October 2006 (as pilot project) and on 25 March 2007 (publicly). Scope and Coverage It serves all over the world and provides the access and modifications of information related to the “parent topics” or main topics like Natural Sciences, Social Sciences, Humanities, Arts, Applied arts and sciences, Recreation. Under each main topic or parent topic, there is hyperlinked list of sub topics and other related topics. As for example, under the main topic “ Natural Sciences” there is a following list of “Subtopics” and “Other related topics”: Subtopics Physics Chemistry Biology Astronomy Earth science Mathematics Other related topics Natural philosophy Natural history Applied science Health science Geology Under the main topic “Humanities” there is a list of following topics: Subtopics Classics History Literature Philosophy Religion Theology Other related topics Art Applied arts Education Law Music Science Social science Scholarship Society Theatre The encyclopedia includes total number of 16,891 articles when last accessed. Kind of Information Every article under subtopics and other related topics is provided with “Talk”, “Related Articles”, “Biography”, “External Links”, “Citable Version”, “Video” related to that article. Brief description, history of a topic etc. are present in the articles. Coloured images on topics, charts, graphs etc. are available where applicable. Notes and references are also found after the articles.