1 Wikipedia: an Effective Anarchy Dariusz Jemielniak, Ph.D
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Modeling Popularity and Reliability of Sources in Multilingual Wikipedia
information Article Modeling Popularity and Reliability of Sources in Multilingual Wikipedia Włodzimierz Lewoniewski * , Krzysztof W˛ecel and Witold Abramowicz Department of Information Systems, Pozna´nUniversity of Economics and Business, 61-875 Pozna´n,Poland; [email protected] (K.W.); [email protected] (W.A.) * Correspondence: [email protected] Received: 31 March 2020; Accepted: 7 May 2020; Published: 13 May 2020 Abstract: One of the most important factors impacting quality of content in Wikipedia is presence of reliable sources. By following references, readers can verify facts or find more details about described topic. A Wikipedia article can be edited independently in any of over 300 languages, even by anonymous users, therefore information about the same topic may be inconsistent. This also applies to use of references in different language versions of a particular article, so the same statement can have different sources. In this paper we analyzed over 40 million articles from the 55 most developed language versions of Wikipedia to extract information about over 200 million references and find the most popular and reliable sources. We presented 10 models for the assessment of the popularity and reliability of the sources based on analysis of meta information about the references in Wikipedia articles, page views and authors of the articles. Using DBpedia and Wikidata we automatically identified the alignment of the sources to a specific domain. Additionally, we analyzed the changes of popularity and reliability in time and identified growth leaders in each of the considered months. The results can be used for quality improvements of the content in different languages versions of Wikipedia. -
'Anyone Can Edit', Not Everyone Does: Wikipedia and the Gender
Heather Ford and Judy Wajcman ‘Anyone can edit’, not everyone does: Wikipedia and the gender gap Article (Accepted version) (Refereed) Original citation: Ford, Heather and Wajcman, Judy (2017) ‘Anyone can edit’, not everyone does: Wikipedia and the gender gap. Social Studies of Science, 47 (4). pp. 511-527. ISSN 0306-3127 DOI: 10.1177/0306312717692172 © 2017 The Authors This version available at: http://eprints.lse.ac.uk/68675/ Available in LSE Research Online: September 2017 LSE has developed LSE Research Online so that users may access research output of the School. Copyright © and Moral Rights for the papers on this site are retained by the individual authors and/or other copyright owners. Users may download and/or print one copy of any article(s) in LSE Research Online to facilitate their private study or for non-commercial research. You may not engage in further distribution of the material or use it for any profit-making activities or any commercial gain. You may freely distribute the URL (http://eprints.lse.ac.uk) of the LSE Research Online website. This document is the author’s final accepted version of the journal article. There may be differences between this version and the published version. You are advised to consult the publisher’s version if you wish to cite from it. Anyone can edit, not everyone does: Wikipedias infrastructure and the gender gap Heather Ford School of Media and Communication, University of Leeds, UK Judy Wajcman Department of Sociology, London School of Economics, UK Abstract Feminist STS has continues to define what counts as knowledge and expertise. -
From Faculty Enemy to Faculty Enabler
Wikipedia @ 20 • ::Wikipedia @ 20 9 The First Twenty Years of Teaching with Wikipedia: From Faculty Enemy to Faculty Enabler Robert E. Cummings Published on: Oct 15, 2020 Updated on: Nov 16, 2020 License: Creative Commons Attribution 4.0 International License (CC-BY 4.0) Wikipedia @ 20 • ::Wikipedia @ 20 9 The First Twenty Years of Teaching with Wikipedia: From Faculty Enemy to Faculty Enabler Wikipedia jumbles the faculty roles of teaching, researching, and service by challenging traditional notions of faculty expertise, but a more integrated approach for these roles is also possible. I have never been able to see Wikipedia without the lens of a faculty member. On the one hand, I subconsciously carry with me perpetual concerns about accuracy and reliability. I am often trying to prove to myself that Wikipedia is legitimate work: it is, after all, the world’s largest encyclopedia. Writing to Wikipedia follows a set of complicated rules. So if it is a serious project with rules and enforcers and the world relies on its information, it ought to be universally accepted. And on the other hand? I want Wikipedia to be a lark. I also want Wikipedia to be a place where I wander freely and learn lots and lots of information. A guilty pleasure, not unlike pulling a book off a library shelf and simply reading it. I suspect that these anxieties I experience might speak to inherent conflict in being a faculty member while engaging Wikipedia. Reading Wikipedia, contributing to Wikipedia, and certainly teaching with Wikipedia jumble and reconfigure the faculty identities of teacher and researcher because they recontextualize our relationships with expertise itself. -
Wikipedia Citations: a Comprehensive Data Set of Citations with Identifiers Extracted from English Wikipedia
RESEARCH ARTICLE Wikipedia citations: A comprehensive data set of citations with identifiers extracted from English Wikipedia Harshdeep Singh1 , Robert West1 , and Giovanni Colavizza2 an open access journal 1Data Science Laboratory, EPFL 2Institute for Logic, Language and Computation, University of Amsterdam Keywords: citations, data, data set, Wikipedia Downloaded from http://direct.mit.edu/qss/article-pdf/2/1/1/1906624/qss_a_00105.pdf by guest on 01 October 2021 Citation: Singh, H., West, R., & ABSTRACT Colavizza, G. (2020). Wikipedia citations: A comprehensive data set Wikipedia’s content is based on reliable and published sources. To this date, relatively little of citations with identifiers extracted from English Wikipedia. Quantitative is known about what sources Wikipedia relies on, in part because extracting citations Science Studies, 2(1), 1–19. https:// and identifying cited sources is challenging. To close this gap, we release Wikipedia doi.org/10.1162/qss_a_00105 Citations, a comprehensive data set of citations extracted from Wikipedia. We extracted DOI: 29.3 million citations from 6.1 million English Wikipedia articles as of May 2020, and https://doi.org/10.1162/qss_a_00105 classified as being books, journal articles, or Web content. We were thus able to extract Received: 14 July 2020 4.0 million citations to scholarly publications with known identifiers—including DOI, PMC, Accepted: 23 November 2020 PMID, and ISBN—and further equip an extra 261 thousand citations with DOIs from Crossref. Corresponding Author: As a result, we find that 6.7% of Wikipedia articles cite at least one journal article with Giovanni Colavizza [email protected] an associated DOI, and that Wikipedia cites just 2% of all articles with a DOI currently indexed in the Web of Science. -
Jimmy Wales and Larry Sanger, It Is the Largest, Fastest-Growing and Most Popular General Reference Work Currently Available on the Internet
Tomasz „Polimerek” Ganicz Wikimedia Polska WikipediaWikipedia andand otherother WikimediaWikimedia projectsprojects WhatWhat isis Wikipedia?Wikipedia? „Imagine„Imagine aa worldworld inin whichwhich everyevery singlesingle humanhuman beingbeing cancan freelyfreely shareshare inin thethe sumsum ofof allall knowledge.knowledge. That'sThat's ourour commitment.”commitment.” JimmyJimmy „Jimbo”„Jimbo” Wales Wales –– founder founder ofof WikipediaWikipedia As defined by itself: Wikipedia is a free multilingual, open content encyclopedia project operated by the non-profit Wikimedia Foundation. Its name is a blend of the words wiki (a technology for creating collaborative websites) and encyclopedia. Launched in January 2001 by Jimmy Wales and Larry Sanger, it is the largest, fastest-growing and most popular general reference work currently available on the Internet. OpenOpen and and free free content content RichardRichard StallmanStallman definition definition of of free free software: software: „The„The wordword "free""free" inin ourour namename doesdoes notnot referrefer toto price;price; itit refersrefers toto freedom.freedom. First,First, thethe freedomfreedom toto copycopy aa programprogram andand redistributeredistribute itit toto youryour neighbors,neighbors, soso thatthat theythey cancan useuse itit asas wellwell asas you.you. Second,Second, thethe freedomfreedom toto changechange aa program,program, soso ththatat youyou cancan controlcontrol itit insteadinstead ofof itit controllingcontrolling you;you; forfor this,this, thethe sourcesource -
Bridging the Gap Between Wikipedia and Academia
Bridging the Gap Between Wikipedia and Academia Dariusz Jemielniak New Research on Digital Societies (NeRDS) group, Kozminski University, Jagiellonska 59, 03-301 Warszawa, Poland. E-mail: [email protected] Eduard Aibar Research Group on Open Science & Innovation, Universitat Oberta de Catalunya, Av. Tibidabo, 39-43, 08035 Barcelona, Spain. E-mail: [email protected] In this opinion piece, we would like to present a short liter- isms, and a community whose enthusiasm will inevitably ature review of perceptions and reservations towards wane: Since 2005 Eric Goldman, a professor of law at Santa Wikipedia in academia, address the common questions Clara University, keeps predicting that “Wikipedia Will Fail about overall reliability of Wikipedia entries, review the actual practices of Wikipedia usage in academia, and con- Within 5 Years” (Goldman, 2005), changing only the clude with possible scenarios for a peaceful coexistence. expected demise date (Anderson, 2009). Because Wikipedia is a regular topic of JASIST publica- Even people sympathetic to open collaboration models, tions (Lim, 2009; Meseguer-Artola, Aibar, Llados, consider Wikipedia to rely mainly on the wisdom of crowds, Minguillon, & Lerga, 2015; Mesgari, Okoli, Mehdi, Nielsen, & Lanamaki,€ 2015; Okoli, Mehdi, Mesgari, Nielsen, & not necessarily on actual expertise (Surowiecki, 2004). It is Lanamaki,€ 2014), we hope to start a useful discussion with called a “flawed knowledge community” (Roberts & Peters, the right audience. 2011, p. 36), a broken surrogate. And the common view -
Cultural Bias in Wikipedia Content on Famous Persons
ASI21577.tex 14/6/2011 16: 39 Page 1 Cultural Bias in Wikipedia Content on Famous Persons Ewa S. Callahan Department of Film, Video, and Interactive Media, School of Communications, Quinnipiac University, 275 Mt. Carmel Avenue, Hamden, CT 06518-1908. E-mail: [email protected] Susan C. Herring School of Library and Information Science, Indiana University, Bloomington, 1320 E. 10th St., Bloomington, IN 47405-3907. E-mail: [email protected] Wikipedia advocates a strict “neutral point of view” This issue comes to the fore when one considers (NPOV) policy. However, although originally a U.S-based, that Wikipedia, although originally a U.S-based, English- English-language phenomenon, the online, user-created language phenomenon, now has versions—or “editions,” as encyclopedia now has versions in many languages. This study examines the extent to which content and perspec- they are called on Wikipedia—in many languages, with con- tives vary across cultures by comparing articles about tent and perspectives that can be expected to vary across famous persons in the Polish and English editions of cultures. With regard to coverage of persons in different Wikipedia.The results of quantitative and qualitative con- language versions, Kolbitsch and Maurer (2006) claim that tent analyses reveal systematic differences related to Wikipedia “emphasises ‘local heroes”’ and thus “distorts the different cultures, histories, and values of Poland and the United States; at the same time, a U.S./English- reality and creates an imbalance” (p. 196). However, their language advantage is evident throughout. In conclusion, evidence is anecdotal; empirical research is needed to inves- the implications of these findings for the quality and tigate the question of whether—and if so, to what extent—the objectivity of Wikipedia as a global repository of knowl- cultural biases of a country are reflected in the content of edge are discussed, and recommendations are advanced Wikipedia entries written in the language of that country. -
A Complete, Longitudinal and Multi-Language Dataset of the Wikipedia Link Networks
WikiLinkGraphs: A Complete, Longitudinal and Multi-Language Dataset of the Wikipedia Link Networks Cristian Consonni David Laniado Alberto Montresor DISI, University of Trento Eurecat, Centre Tecnologic` de Catalunya DISI, University of Trento [email protected] [email protected] [email protected] Abstract and result in a huge conceptual network. According to Wiki- pedia policies2 (Wikipedia contributors 2018e), when a con- Wikipedia articles contain multiple links connecting a sub- cept is relevant within an article, the article should include a ject to other pages of the encyclopedia. In Wikipedia par- link to the page corresponding to such concept (Borra et al. lance, these links are called internal links or wikilinks. We present a complete dataset of the network of internal Wiki- 2015). Therefore, the network between articles may be seen pedia links for the 9 largest language editions. The dataset as a giant mind map, emerging from the links established by contains yearly snapshots of the network and spans 17 years, the community. Such graph is not static but is continuously from the creation of Wikipedia in 2001 to March 1st, 2018. growing and evolving, reflecting the endless collaborative While previous work has mostly focused on the complete hy- process behind it. perlink graph which includes also links automatically gener- The English Wikipedia includes over 163 million con- ated by templates, we parsed each revision of each article to nections between its articles. This huge graph has been track links appearing in the main text. In this way we obtained exploited for many purposes, from natural language pro- a cleaner network, discarding more than half of the links and cessing (Yeh et al. -
Wikipédia, Mythes Et Réalités
Wikipédia, mythes et réalités David Monniaux Wikimédia France 28 janvier 2011 David Monniaux (Wikimédia France) Wikipédia, mythes et réalités 28 janvier 2011 1 / 62 Qu’est-ce que Wikipédia ? http://www.wikipedia.org/ I Un site Web. I Présentant une collection d’articles encyclopédiques. I Éditables par tout à chacun (via connexion Internet). I Pas de comité éditorial. I Dans de multiples langues : http://fr.wikipedia.org/ pour le français, http://en.wikipedia.org/ pour l’anglais. David Monniaux (Wikimédia France) Wikipédia, mythes et réalités 28 janvier 2011 2 / 62 Aspects juridiques Aspects juridiques Aspects éditoriaux En pratique Les articles à avoir... ou pas Le danger Wikipédia La CIA et le Vatican manipulent Wikipédia Google+Wikipédia dévoie la jeunesse La culture du copier-coller Wikipédia, surtout forte en culture populaire Une vérité ? Conclusion David Monniaux (Wikimédia France) Wikipédia, mythes et réalités 28 janvier 2011 3 / 62 Aspects juridiques Hébergement Initialement, projet sur quelques machines hébergées chez Bomis, entreprise de Jimmy Wales. Wikipédia est maintenant un site important : e I Comscore décembre 2010 : 12 site aux USA e I Comscore 2010 : 5 site aux USA (après Google, Microsoft, Yahoo !, Facebook et devant AOL, eBay, Ask, Amazon...) e I Médiamétrie novembre 2010 : 6 site en France (après Google, Facebook, Microsoft, Orange, Youtube et devant Free, Yahoo !, Pages Jaunes...). De très loin le premier site non commercial, premier site culturel et éducatif. David Monniaux (Wikimédia France) Wikipédia, mythes et réalités 28 janvier 2011 4 / 62 Aspects juridiques Hébergement haut débit Jusqu’à 90000 requêtes http/s Les pannes de Wikipédia sont rapportées dans la presse ! Ceci nécessite : I Hébergement solide, matériel suffisant.. -
State of the Community.Pdf
State of the community Dariusz Jemielniak, “pundit” (the WMF Trustee) Wikimania 2016 http://tiny.cc/wikimania Wikimedia community logo attribution: Artur Jan Fijałkowski pl.wiki/Commons: WarX, [email protected] [Public domain], via Wikimedia Commons https://commons.wikimedia.org/wiki/File:Wikimedia_Community_Logo.svg State of the community? Wikimedia community logo attribution: Artur Jan Fijałkowski pl.wiki/Commons: WarX, [email protected] [Public domain], via Wikimedia Commons https://commons.wikimedia.org/wiki/File:Wikimedia_Community_Logo.svg I do not know. And I am not sure anyone does Disclaimer User digest presentations are talks which provide an overview of a topic and are meant to be useful to anyone, without any particular background knowledge Read more here: https://wikimania2016.wikimedia.org/wiki/User_digest_presentations [[Wikimedia projects]] From https://wikimediafoundation.org/wiki/Our_projects ● Thus 16 different projects* ● 11 of them are content projects (Wikipedia, Wikibooks, Wikiversity, Wikinews, Wiktionary, Wikisource, Wikiquote, Wikivoyage, Wikimedia Commons, Wikidata, Wikispecies) ● 8 of them can be in different languages... *A full listing of Wikimedia projects is available here: https://wikimediafoundation.org/wiki/Special:SiteMatrix ● Wikipedias exist in almost 300 different languages... ○ But only around 100 of them are active and mature enough to be useful as encyclopedias *A list of Wikipedias is available here: https://meta.wikimedia.org/wiki/List_of_Wikipedias Structure ● A core element and basis is Volunteers -
Genre Analysis of Online Encyclopedias. the Case of Wikipedia
Genre analysis online encycloped The case of Wikipedia AnnaTereszkiewicz Genre analysis of online encyclopedias The case of Wikipedia Wydawnictwo Uniwersytetu Jagiellońskiego Publikacja dofi nansowana przez Wydział Filologiczny Uniwersytetu Jagiellońskiego ze środków wydziałowej rezerwy badań własnych oraz Instytutu Filologii Angielskiej PROJEKT OKŁADKI Bartłomiej Drosdziok Zdjęcie na okładce: Łukasz Stawarski © Copyright by Anna Tereszkiewicz & Wydawnictwo Uniwersytetu Jagiellońskiego Wydanie I, Kraków 2010 All rights reserved Książka, ani żaden jej fragment nie może być przedrukowywana bez pisemnej zgody Wydawcy. W sprawie zezwoleń na przedruk należy zwracać się do Wydawnictwa Uniwersytetu Jagiellońskiego. ISBN 978-83-233-2813-1 www.wuj.pl Wydawnictwo Uniwersytetu Jagiellońskiego Redakcja: ul. Michałowskiego 9/2, 31-126 Kraków tel. 12-631-18-81, 12-631-18-82, fax 12-631-18-83 Dystrybucja: tel. 12-631-01-97, tel./fax 12-631-01-98 tel. kom. 0506-006-674, e-mail: [email protected] Konto: PEKAO SA, nr 80 1240 4722 1111 0000 4856 3325 Table of Contents Acknowledgements ........................................................................................................................ 9 Introduction .................................................................................................................................... 11 Materials and Methods .................................................................................................................. 14 1. Genology as a study .................................................................................................................. -
Citations Needed: Build Your Wikipedia Skills While Building the World’S Encyclopedia
A companion guide to deepen your learning during the WebJunction webinar on January 10, 2018, at 3:00 pm EST Citations Needed: Build Your Wikipedia Skills While Building the World’s Encyclopedia A glimpse into the inner workings of English Wikipedia for information professionals The Five Pillars of Wikipedia What are the ways in which the five pillars of Wikipedia align 1. Wikipedia is an encyclopedia with the mission of libraries? 2. It is written from a Neutral Point of View (NPOV) 3. It’s free content that anyone can use, edit, and distribute 4. Editors should treat each other with respect and civility 5. Wikipedia has no firm rules Learn about what U.S. public library staff are doing with Wikipedia in the WebJunction series Librarians Who Wikipedia List two (or more) insights you’ve gained about how Wikipedia editing works, such as the color-coded peer assessments that are shown in the chart below. 1. 2. How does learning about Wikipedia’s inner workings help you evaluate the quality of articles? Wikipedia’s articles are in a constant state of development, learn more about quality assessments made by other editors 1 | P a g e OCLC Wikipedia + Libraries: Better Together About the #1lib1ref campaign (and how you and your library can participate) What is the #1lib1ref campaign? How can you participate? How can your library participate? #1lib1ref The Wikipedia Library’s annual It’s easy! Follow the steps on Plan a #1lib1ref event for your #1lib1ref (“One Librarian, One pages three and four to insert a library, Wikipedia is better with Reference”) global campaign reference as a footnote citation.