Final Diss Manuscript

Total Page:16

File Type:pdf, Size:1020Kb

Final Diss Manuscript HACKERS, CYBORGS, AND WIKIPEDIANS: THE POLITICAL ECONOMY AND CULTURAL HISTORY OF WIKIPEDIA Andrew Famiglietti A Dissertation Submitted to the Graduate College of Bowling Green State University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY May 2011 Committee: Victoria Smith Ekstrand, Advisor Nancy C. Patterson Graduate Faculty Representative Radhika Gajjala Donald McQuarie David Parry i ABSTRACT Victoria Smith Ekstrand, Advisor This dissertation explores the political economy and cultural history of Wikipedia, the free encyclopedia. It demonstrates how Wikipedia, an influential and popular site of knowledge production and distribution, was influenced by its heritage from the hacker communities of the late twentieth century. More specifically, Wikipedia was shaped by an ideal I call, “the cyborg individual,” which held that the production of knowledge was best entrusted to a widely distributed network of individual human subjects and individually owned computers. I trace how this ideal emerged from hacker culture in response to anxieties hackers experienced due to their intimate relationships with machines. I go on to demonstrate how this ideal influenced how Wikipedia was understood both those involved in the early history of the site, and those writing about it. In particular, legal scholar Yochai Benkler seems to base his understanding of Wikipedia and its strengths on the cyborg individual ideal. Having established this, I then move on to show how the cyborg individual ideal misunderstands Wikipedia’s actual method of production. Most importantly, it overlooks the importance of how the boundaries drawn around communities and shared technological resources shape Wikipedia’s content. I then proceed to begin the process of building what I believe is a better way of understanding Wikipedia, by tracing how communities and shared resources shape the production of recent Wikipedia articles. ii For my father, who taught me that creativity is a human right, not a property right. and For my mother, who taught me that service to the community is what makes us really human. iii ACKNOWLEDGMENTS There are many people to thank for this dissertation. First, of course, I owe a great debt to all of the members of the committee for their patient and consistent work with me in improving this dissertation. In addition, I'd like to thank my many colleagues and friends in the American Culture Studies program at Bowling Green State University for their support, their reviews of early drafts, and for many inspiring and fruitful conversations that helped shape my thinking on the topic of Wikipedia, and on peer production more broadly. Though I cannot possibly list all those deserving thanks here, Colin Helb, Mark Bernard, Cristian Pralea, Jason Zeh, Marnie Pratt, Jamie Stuart, Mike Lewis, Gavin Mueller, Kelly Watson and Lisa Avila are among my many Bowling Green colleagues and friends who I would like to single out for their support during this process. I would also like to thank Roger Whitson, Melanie Kohnen, and Robin Wharton for a series of discussions that helped lead me to my current theoretical frame for this piece. Finally, I must thank all of the editors of Wikipedia for their help in describing the remarkable free encyclopedia they have built together. iv TABLE OF CONTENTS CHAPTER I. INTRODUCTION........................................................................................... 1 Defining Cultural Studies and Peer Production …..................................................... 5 Common Features of Cultural Studies and Peer Production ..................................... 8 Cultural Studies, Peer Production and the Problem of Property.................... 11 Cultural Studies, Peer Production and the Regulatory Abilities of Non-State Entities ........................................................................................................... 17 Existing Wikipedia Literature.................................................................................... 22 Wikipedia’s Uses and Coverage .................................................................... 23 Wikipedia’s Accuracy.................................................................................... 25 Wikipedia and Search .................................................................................... 30 Wikipedia Demographics............................................................................... 31 Growth and Sustainability of Wikipedia........................................................ 32 Motivation of Wikipedia Contributors........................................................... 33 Wikipedia Community Governance .............................................................. 35 The Cultural Studies Gap............................................................................... 38 Chapter Breakdown ................................................................................................... 40 CHAPTER II. THEORIES AND METHODS ..................................................................... 42 Reading Networks of Associations: Bruno Latour and ANT .................................... 42 The Embodiment of Wikipedia: N. Katherine Hayles............................................... 48 Methods ............................................................................................................ 56 v The Anatomy of a Wiki Site .......................................................................... 58 Wikipedia Mailing Lists ................................................................................ 61 Non-Wiki Sources.......................................................................................... 62 CHAPTER III. EMBODYING FOSS .................................................................................. 65 The Cyborg Individual Illustrated: OLPC ................................................................. 66 The Genesis of Cyborg Individualism: Hacker Embodiment.................................... 74 The “hands on imperative” and the evolution of the personal computer....... 75 How Information Lost Its Body: How Hackers Built New Geometries of Connections Between and Among Humans and Machines ........................... 87 Cyborg Anxieties ........................................................................................... 100 CHAPTER IV. THE EMBODIMENT AND POLITICAL ECONOMY OF WIKIPEDIA 109 “An Encyclopedia Located Everywhere:” The Cyborg Individual Ideal Present in the Early Wikipedia ........................................................................................................ 110 In Fork We Trust: Benkler’s Belief in the Cyborg Individual as Guarantee of Media Diversity ............................................................................................................ 115 The Wikipedia Utility: The Centralized Embodiment of the Contemporary Wikipedia ............................................................................................................ 128 A Web of Linkbacks: The marginal role of Wikipedia Mirrors .................... 132 “In every case I have given you what you wanted:” The Fork and Wikipedia’s Appetite for Free Labor ................................................................................. 139 “We have partisans working together on the same articles:” Labor as Force Shaping Features of Wikipedia Policy .................................................................................... 160 vi Conclusion ............................................................................................................ 164 CHAPTER V. WIKIPEDIA AND GOOGLE ...................................................................... 166 “What Google Liked:” Search Visibility and Wikipedia........................................... 168 “We make the internet not suck:” Wikipedia as Engine of Content Creation ........... 171 “If you are not on Google, you never existed:” How Search Informs Wikipedia’s Retention of Articles ................................................................................................. 177 Conclusion ............................................................................................................. 201 CHAPTER VI. A CASE STUDY OF THE CONTEMPORARY WIKIPEDIA: THE GAZA WAR ARTICLE ............................................................................................................ 203 Bounding the Community From Above: Admin Actions in the Gaza War Article... 206 Article Protection: Setting the Default Community................................................... 208 Disciplining Individual Editors: Arbitrator Action in the Gaza War Article............. 214 Negotiated Neutrality: Key Negotiations Among Members of the Editing Community that Shaped the Gaza War Article.............................................................................. 220 WP: V as Mechanism for Resolving the Difficulties of Neutrality ............... 222 Reliable Sources as Contested Terrain .......................................................... 229 A Scarce Symbol: The article title as breakdown of the neutrality process .. 241 Conclusion ............................................................................................................ 251 CHAPTER VII. CONCLUSIONS........................................................................................ 254 The Limitations and Potentials of the Ideal Body of the Cyborg Individual............. 255 Wikipedia Research Methodology: Opportunities
Recommended publications
  • 4€€€€An Encyclopedia with Breaking News
    Wikipedia @ 20 • ::Wikipedia @ 20 4 An Encyclopedia with Breaking News Brian Keegan Published on: Oct 15, 2020 Updated on: Nov 16, 2020 License: Creative Commons Attribution 4.0 International License (CC-BY 4.0) Wikipedia @ 20 • ::Wikipedia @ 20 4 An Encyclopedia with Breaking News Wikipedia’s response to the September 11 attacks profoundly shaped its rules and identity and illuminated a new strategy for growing the project by coupling the supply and demand for information about news. Wikipedia’s breaking news collaborations offer lessons for hardening other online platforms against polarization, disinformation, and other sociotechnical sludge. The web was a very different place for news in the United States between 2001 and 2006. The hanging chads from the 2000 presidential election, the spectacular calamity of 9/11, the unrepentant lies around Operation Iraqi Freedom, and the campy reality television featuring Donald Trump were all from this time. The burst of the dot-com bubble and corporate malfeasance of companies like Enron dampened entrepreneurial spirits, news publishers were optimistically sharing their stories online without paywalls, and blogging was heralded as the future of technology-mediated accountability and participatory democracy. “You” was Time Magazine’s Person of the Year in 2006 because “Web 2.0” platforms like YouTube, MySpace, and Second Life had become tools for “bringing together the small contributions of millions of people and making them matter.”1 Wikipedia was a part of this primordial soup, predating news-feed-mediated engagement, recommender-driven polarization, politicized content moderation, and geopolitical disinformation campaigns. From very early in its history, Wikipedia leveraged the supply and demand for information about breaking news and current events into strategies that continue to sustain this radical experiment in online peer production.
    [Show full text]
  • Decentralization in Wikipedia Governance
    Decentralization in Wikipedia Governance Andrea Forte1, Vanessa Larco2 and Amy Bruckman1 1GVU Center, College of Computing, Georgia Institute of Technology {aforte, asb}@cc.gatech.edu 2Microsoft [email protected] This is a preprint version of the journal article: Forte, Andrea, Vanessa Larco and Amy Bruckman. (2009) Decentralization in Wikipedia Governance. Journal of Management Information Systems. 26(1) pp 49-72. Publisher: M.E. Sharp www.mesharpe.com/journals.asp Abstract How does “self-governance” happen in Wikipedia? Through in-depth interviews with twenty individuals who have held a variety of responsibilities in the English-language Wikipedia, we obtained rich descriptions of how various forces produce and regulate social structures on the site. Our analysis describes Wikipedia as an organization with highly refined policies, norms, and a technological architecture that supports organizational ideals of consensus building and discussion. We describe how governance on the site is becoming increasingly decentralized as the community grows and how this is predicted by theories of commons-based governance developed in offline contexts. We also briefly examine local governance structures called WikiProjects through the example of WikiProject Military History, one of the oldest and most prolific projects on the site. 1. The Mechanisms of Self-Organization Should a picture of a big, hairy tarantula appear in an encyclopedia article about arachnophobia? Does it illustrate the point, or just frighten potential readers? Reasonable people might disagree on this question. In a freely editable site like Wikipedia, anyone can add the photo, and someone else can remove it. And someone can add it back, and the process continues.
    [Show full text]
  • A New Wave of Freedom
    A New Wave of Freedom Dr. V. Sasi Kumar Any action that is dictated by fear or by coercion of any kind ceases to be moral. | Mohandas Karamchand Gandhi India became a Sovereign Republic on 26 January 1950. India became free. Or, more precisely, political power was transferred from the British to Indians. That was a period when a number of countries that were living under the yoke of imperialism broke free. Politically, that is. We still do not enjoy certain freedoms that we deserve. A new wave of freedom movements, to achieve these freedoms, is now sweeping the world|a movement that is bound to change the way we think, the way we do things and the way we interact. This time it started from the United States and is aiming to free people from the clutches of monopoly corporations. And the role of Gandhiji is being played by an extraordinary person with long hair and a long beard; a man named Richard Mathew Stallman, though he vehemently rejects any comparison with Gandhiji or Nelson Mandela. \Till we are fully free, we are slaves", said Gandhiji. Developments in technology have made it possible for mankind to enjoy greater freedom in certain ways. However, vested interests, with help from legislators, are now succeeding in preventing society from enjoying this freedom. For instance, with the advent of the computer and the Internet, it has become possible for data, information and knowledge to be communicated instantaneously, provided a computer with Internet connection is available at both ends. However, some of our laws that were designed for an earlier era are preventing society from benefiting fully from this technology.
    [Show full text]
  • Modeling and Analyzing Latency in the Memcached System
    Modeling and Analyzing Latency in the Memcached system Wenxue Cheng1, Fengyuan Ren1, Wanchun Jiang2, Tong Zhang1 1Tsinghua National Laboratory for Information Science and Technology, Beijing, China 1Department of Computer Science and Technology, Tsinghua University, Beijing, China 2School of Information Science and Engineering, Central South University, Changsha, China March 27, 2017 abstract Memcached is a widely used in-memory caching solution in large-scale searching scenarios. The most pivotal performance metric in Memcached is latency, which is affected by various factors including the workload pattern, the service rate, the unbalanced load distribution and the cache miss ratio. To quantitate the impact of each factor on latency, we establish a theoretical model for the Memcached system. Specially, we formulate the unbalanced load distribution among Memcached servers by a set of probabilities, capture the burst and concurrent key arrivals at Memcached servers in form of batching blocks, and add a cache miss processing stage. Based on this model, algebraic derivations are conducted to estimate latency in Memcached. The latency estimation is validated by intensive experiments. Moreover, we obtain a quantitative understanding of how much improvement of latency performance can be achieved by optimizing each factor and provide several useful recommendations to optimal latency in Memcached. Keywords Memcached, Latency, Modeling, Quantitative Analysis 1 Introduction Memcached [1] has been adopted in many large-scale websites, including Facebook, LiveJournal, Wikipedia, Flickr, Twitter and Youtube. In Memcached, a web request will generate hundreds of Memcached keys that will be further processed in the memory of parallel Memcached servers. With this parallel in-memory processing method, Memcached can extensively speed up and scale up searching applications [2].
    [Show full text]
  • A Topic-Aligned Multilingual Corpus of Wikipedia Articles for Studying Information Asymmetry in Low Resource Languages
    Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), pages 2373–2380 Marseille, 11–16 May 2020 c European Language Resources Association (ELRA), licensed under CC-BY-NC A Topic-Aligned Multilingual Corpus of Wikipedia Articles for Studying Information Asymmetry in Low Resource Languages Dwaipayan Roy, Sumit Bhatia, Prateek Jain GESIS - Cologne, IBM Research - Delhi, IIIT - Delhi [email protected], [email protected], [email protected] Abstract Wikipedia is the largest web-based open encyclopedia covering more than three hundred languages. However, different language editions of Wikipedia differ significantly in terms of their information coverage. We present a systematic comparison of information coverage in English Wikipedia (most exhaustive) and Wikipedias in eight other widely spoken languages (Arabic, German, Hindi, Korean, Portuguese, Russian, Spanish and Turkish). We analyze the content present in the respective Wikipedias in terms of the coverage of topics as well as the depth of coverage of topics included in these Wikipedias. Our analysis quantifies and provides useful insights about the information gap that exists between different language editions of Wikipedia and offers a roadmap for the Information Retrieval (IR) community to bridge this gap. Keywords: Wikipedia, Knowledge base, Information gap 1. Introduction other with respect to the coverage of topics as well as Wikipedia is the largest web-based encyclopedia covering the amount of information about overlapping topics.
    [Show full text]
  • Building a Scalable Index and a Web Search Engine for Music on the Internet Using Open Source Software
    Department of Information Science and Technology Building a Scalable Index and a Web Search Engine for Music on the Internet using Open Source software André Parreira Ricardo Thesis submitted in partial fulfillment of the requirements for the degree of Master in Computer Science and Business Management Advisor: Professor Carlos Serrão, Assistant Professor, ISCTE-IUL September, 2010 Acknowledgments I should say that I feel grateful for doing a thesis linked to music, an art which I love and esteem so much. Therefore, I would like to take a moment to thank all the persons who made my accomplishment possible and hence this is also part of their deed too. To my family, first for having instigated in me the curiosity to read, to know, to think and go further. And secondly for allowing me to continue my studies, providing the environment and the financial means to make it possible. To my classmate André Guerreiro, I would like to thank the invaluable brainstorming, the patience and the help through our college years. To my friend Isabel Silva, who gave me a precious help in the final revision of this document. Everyone in ADETTI-IUL for the time and the attention they gave me. Especially the people over Caixa Mágica, because I truly value the expertise transmitted, which was useful to my thesis and I am sure will also help me during my professional course. To my teacher and MSc. advisor, Professor Carlos Serrão, for embracing my will to master in this area and for being always available to help me when I needed some advice.
    [Show full text]
  • Modeling Popularity and Reliability of Sources in Multilingual Wikipedia
    information Article Modeling Popularity and Reliability of Sources in Multilingual Wikipedia Włodzimierz Lewoniewski * , Krzysztof W˛ecel and Witold Abramowicz Department of Information Systems, Pozna´nUniversity of Economics and Business, 61-875 Pozna´n,Poland; [email protected] (K.W.); [email protected] (W.A.) * Correspondence: [email protected] Received: 31 March 2020; Accepted: 7 May 2020; Published: 13 May 2020 Abstract: One of the most important factors impacting quality of content in Wikipedia is presence of reliable sources. By following references, readers can verify facts or find more details about described topic. A Wikipedia article can be edited independently in any of over 300 languages, even by anonymous users, therefore information about the same topic may be inconsistent. This also applies to use of references in different language versions of a particular article, so the same statement can have different sources. In this paper we analyzed over 40 million articles from the 55 most developed language versions of Wikipedia to extract information about over 200 million references and find the most popular and reliable sources. We presented 10 models for the assessment of the popularity and reliability of the sources based on analysis of meta information about the references in Wikipedia articles, page views and authors of the articles. Using DBpedia and Wikidata we automatically identified the alignment of the sources to a specific domain. Additionally, we analyzed the changes of popularity and reliability in time and identified growth leaders in each of the considered months. The results can be used for quality improvements of the content in different languages versions of Wikipedia.
    [Show full text]
  • Strengthening and Unifying the Visual Identity of Wikimedia Projects: a Step Towards Maturity
    Strengthening and unifying the visual identity of Wikimedia projects: a step towards maturity Guillaume Paumier∗ Elisabeth Bauer [[m:User:guillom]] [[m:User:Elian]] Abstract In January 2007, the Wikimedian community celebrated the sixth birthday of Wikipedia. Six years of constant evolution have now led to Wikipedia being one of the most visited websites in the world. Other projects developing free content and supported by the Wikimedia Foundation have been expanding rapidly too. The Foundation and its projects are now facing some communication issues due to the difference of scale between the human and financial resources of the Foundation and the success of its projects. In this paper, we identify critical issues in terms of visual identity and marketing. We evaluate the situation and propose several changes, including a redesign of the default website interface. Introduction The first Wikipedia project was created in January 2001. In these days, the technical infrastructure was provided by Bomis, a dot-com company. In June 2003, Jimmy Wales, founder of Wikipedia and owner of Bomis, created the Wikimedia Foundation [1] to provide a long-term administrative and technical structure dedicated to free content. Since these days, both the projects and the Foundation have been evolving. New projects have been created. All have grown at different rates. Some have got more fame than the others. New financial, technical and communication challenges have risen. In this paper, we will first identify some of these challenges and issues in terms of global visual identity. We will then analyse logos, website layouts, projects names, trademarks so as to provide some hindsight.
    [Show full text]
  • Is Stallman Stalled? One of the Greatest Programmers Alive :Uture Where All Software Was Free
    • •••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••• I Is Stallman Stalled? One of the Greatest Programmers Alive :uture Where All Software Was Free. Then Reality Set In. by Simson L. Garfinkel fter nine years, peuple still dun't get it. ning to take the computer wurld by storm. H~nce the "The ,"vord "Free' doesn't refer to price; it project's tail-chasing name: GNU's Not Unix. /f. refers to freedom," said Richard Stallman, pres­ Working day and night for two years, Stallman created ident of the Free Software Foundation. E'>1ACS, an extensible text editor for Unix. That same 'vIost software these days is sold in shrink-wrapped year, Stallman incorporated the Free Software Foun­ cardboard boxes, often for hundreds of dollars. For dation, the world's only charitable non-profit organiza­ that, you get a floppy disk containing a program that tion with the mission of developing free software. the computer can execute, but which can't be modified. Reeause FSF sold EMACS in source-code form, people Companies keep their around the world started making additions to the pro­ source-code - the actual gram and porting it to dill"erent manufacturer's com­ language in which pro­ puters. Today, EMACS is a mammoth system that helps grammers write - a closely a person do everything from read electronic mail to guarded secret. develop software. Because of its popular'ity, many com­ Stallman's vision of puter companies, including IBM, Digital F:quipment freedom is software that Corp., and Hewlett Packard include it as standard soft­ has no secrets. It comes ware Vl'ith their Unix operating systems.
    [Show full text]
  • Omnipedia: Bridging the Wikipedia Language
    Omnipedia: Bridging the Wikipedia Language Gap Patti Bao*†, Brent Hecht†, Samuel Carton†, Mahmood Quaderi†, Michael Horn†§, Darren Gergle*† *Communication Studies, †Electrical Engineering & Computer Science, §Learning Sciences Northwestern University {patti,brent,sam.carton,quaderi}@u.northwestern.edu, {michael-horn,dgergle}@northwestern.edu ABSTRACT language edition contains its own cultural viewpoints on a We present Omnipedia, a system that allows Wikipedia large number of topics [7, 14, 15, 27]. On the other hand, readers to gain insight from up to 25 language editions of the language barrier serves to silo knowledge [2, 4, 33], Wikipedia simultaneously. Omnipedia highlights the slowing the transfer of less culturally imbued information similarities and differences that exist among Wikipedia between language editions and preventing Wikipedia’s 422 language editions, and makes salient information that is million monthly visitors [12] from accessing most of the unique to each language as well as that which is shared information on the site. more widely. We detail solutions to numerous front-end and algorithmic challenges inherent to providing users with In this paper, we present Omnipedia, a system that attempts a multilingual Wikipedia experience. These include to remedy this situation at a large scale. It reduces the silo visualizing content in a language-neutral way and aligning effect by providing users with structured access in their data in the face of diverse information organization native language to over 7.5 million concepts from up to 25 strategies. We present a study of Omnipedia that language editions of Wikipedia. At the same time, it characterizes how people interact with information using a highlights similarities and differences between each of the multilingual lens.
    [Show full text]
  • The Culture of Wikipedia
    Good Faith Collaboration: The Culture of Wikipedia Good Faith Collaboration The Culture of Wikipedia Joseph Michael Reagle Jr. Foreword by Lawrence Lessig The MIT Press, Cambridge, MA. Web edition, Copyright © 2011 by Joseph Michael Reagle Jr. CC-NC-SA 3.0 Purchase at Amazon.com | Barnes and Noble | IndieBound | MIT Press Wikipedia's style of collaborative production has been lauded, lambasted, and satirized. Despite unease over its implications for the character (and quality) of knowledge, Wikipedia has brought us closer than ever to a realization of the centuries-old Author Bio & Research Blog pursuit of a universal encyclopedia. Good Faith Collaboration: The Culture of Wikipedia is a rich ethnographic portrayal of Wikipedia's historical roots, collaborative culture, and much debated legacy. Foreword Preface to the Web Edition Praise for Good Faith Collaboration Preface Extended Table of Contents "Reagle offers a compelling case that Wikipedia's most fascinating and unprecedented aspect isn't the encyclopedia itself — rather, it's the collaborative culture that underpins it: brawling, self-reflexive, funny, serious, and full-tilt committed to the 1. Nazis and Norms project, even if it means setting aside personal differences. Reagle's position as a scholar and a member of the community 2. The Pursuit of the Universal makes him uniquely situated to describe this culture." —Cory Doctorow , Boing Boing Encyclopedia "Reagle provides ample data regarding the everyday practices and cultural norms of the community which collaborates to 3. Good Faith Collaboration produce Wikipedia. His rich research and nuanced appreciation of the complexities of cultural digital media research are 4. The Puzzle of Openness well presented.
    [Show full text]
  • SI 410 Ethics and Information Technology
    Author(s): Paul Conway, PhD, 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/ We have reviewed this material in accordance with U.S. Copyright Law and have tried to maximize your ability to use, share, and adapt it. The citation key on the following slide provides information about how you may share and adapt this material. Copyright holders of content included in this material should contact [email protected] with any questions, corrections, or clarification regarding the use of content. For more information about how to cite these materials visit http://open.umich.edu/privacy-and-terms-use. Any medical information in this material is intended to inform and educate and is not a tool for self-diagnosis or a replacement for medical evaluation, advice, diagnosis or treatment by a healthcare professional. Please speak to your physician if you have questions about your medical condition. Viewer discretion is advised: Some medical content is graphic and may not be suitable for all viewers. Citation Key for more information see: http://open.umich.edu/wiki/CitationPolicy Use + Share + Adapt { Content the copyright holder, author, or law permits you to use, share and adapt. } Public Domain – Government: Works that are produced by the U.S. Government. (17 USC § 105) Public Domain – Expired: Works that are no longer protected due to an expired copyright term. Public Domain – Self Dedicated: Works that a copyright holder has dedicated to the public domain.
    [Show full text]