Welcome to Wikipedia Wikipedia Is the Largest Encyclopedia in the World
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Position Description Addenda
POSITION DESCRIPTION January 2014 Wikimedia Foundation Executive Director - Addenda The Wikimedia Foundation is a radically transparent organization, and much information can be found at www.wikimediafoundation.org . That said, certain information might be particularly useful to nominators and prospective candidates, including: Announcements pertaining to the Wikimedia Foundation Executive Director Search Kicking off the search for our next Executive Director by Former Wikimedia Foundation Board Chair Kat Walsh An announcement from Wikimedia Foundation ED Sue Gardner by Wikimedia Executive Director Sue Gardner Video Interviews on the Wikimedia Community and Foundation and Its History Some of the values and experiences of the Wikimedia Community are best described directly by those who have been intimately involved in the organization’s dramatic expansion. The following interviews are available for viewing though mOppenheim.TV . • 2013 Interview with Former Wikimedia Board Chair Kat Walsh • 2013 Interview with Wikimedia Executive Director Sue Gardner • 2009 Interview with Wikimedia Executive Director Sue Gardner Guiding Principles of the Wikimedia Foundation and the Wikimedia Community The following article by Sue Gardner, the current Executive Director of the Wikimedia Foundation, has received broad distribution and summarizes some of the core cultural values shared by Wikimedia’s staff, board and community. Topics covered include: • Freedom and open source • Serving every human being • Transparency • Accountability • Stewardship • Shared power • Internationalism • Free speech • Independence More information can be found at: https://meta.wikimedia.org/wiki/User:Sue_Gardner/Wikimedia_Foundation_Guiding_Principles Wikimedia Policies The Wikimedia Foundation has an extensive list of policies and procedures available online at: http://wikimediafoundation.org/wiki/Policies Wikimedia Projects All major projects of the Wikimedia Foundation are collaboratively developed by users around the world using the MediaWiki software. -
Danish Resources
Danish resources Finn Arup˚ Nielsen November 19, 2017 Abstract A range of different Danish resources, datasets and tools, are presented. The focus is on resources for use in automated computational systems and free resources that can be redistributed and used in commercial applications. Contents 1 Corpora3 1.1 Wikipedia...................................3 1.2 Wikisource...................................3 1.3 Wikiquote...................................4 1.4 ADL......................................4 1.5 Gutenberg...................................4 1.6 Runeberg...................................5 1.7 Europarl....................................5 1.8 Leipzig Corpora Collection..........................5 1.9 Danish Dependency Treebank........................6 1.10 Retsinformation................................6 1.11 Other resources................................6 2 Lexical resources6 2.1 DanNet....................................6 2.2 Wiktionary..................................7 2.3 Wikidata....................................7 2.4 OmegaWiki..................................8 2.5 Other lexical resources............................8 2.6 Wikidata examples with medical terminology extraction.........8 3 Natural language processing tools9 3.1 NLTK.....................................9 3.2 Polyglot....................................9 3.3 spaCy.....................................9 3.4 Apache OpenNLP............................... 10 3.5 Centre for Language Technology....................... 10 3.6 Other libraries................................ -
A Topic-Aligned Multilingual Corpus of Wikipedia Articles for Studying Information Asymmetry in Low Resource Languages
Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), pages 2373–2380 Marseille, 11–16 May 2020 c European Language Resources Association (ELRA), licensed under CC-BY-NC A Topic-Aligned Multilingual Corpus of Wikipedia Articles for Studying Information Asymmetry in Low Resource Languages Dwaipayan Roy, Sumit Bhatia, Prateek Jain GESIS - Cologne, IBM Research - Delhi, IIIT - Delhi [email protected], [email protected], [email protected] Abstract Wikipedia is the largest web-based open encyclopedia covering more than three hundred languages. However, different language editions of Wikipedia differ significantly in terms of their information coverage. We present a systematic comparison of information coverage in English Wikipedia (most exhaustive) and Wikipedias in eight other widely spoken languages (Arabic, German, Hindi, Korean, Portuguese, Russian, Spanish and Turkish). We analyze the content present in the respective Wikipedias in terms of the coverage of topics as well as the depth of coverage of topics included in these Wikipedias. Our analysis quantifies and provides useful insights about the information gap that exists between different language editions of Wikipedia and offers a roadmap for the Information Retrieval (IR) community to bridge this gap. Keywords: Wikipedia, Knowledge base, Information gap 1. Introduction other with respect to the coverage of topics as well as Wikipedia is the largest web-based encyclopedia covering the amount of information about overlapping topics. -
On the Evolution of Wikipedia
On the Evolution of Wikipedia Rodrigo B. Almeida Barzan Mozafari Junghoo Cho UCLA Computer Science UCLA Computer Science UCLA Computer Science Department Department Department Los Angeles - USA Los Angeles - USA Los Angeles - USA [email protected] [email protected] [email protected] Abstract time. So far, several studies have focused on understanding A recent phenomenon on the Web is the emergence and pro- and characterizing the evolution of this huge repository of liferation of new social media systems allowing social inter- data [5, 11]. action between people. One of the most popular of these Recently, a new phenomenon, called social systems, has systems is Wikipedia that allows users to create content in a emerged from the Web. Generally speaking, such systems collaborative way. Despite its current popularity, not much allow people not only to create content, but also to easily is known about how users interact with Wikipedia and how interact and collaborate with each other. Examples of such it has evolved over time. systems are: (1) Social network systems such as MySpace In this paper we aim to provide a first, extensive study of or Orkut that allow users to participate in a social network the user behavior on Wikipedia and its evolution. Compared by creating their profiles and indicating their acquaintances; to prior studies, our work differs in several ways. First, previ- (2) Collaborative bookmarking systems such as Del.icio.us or ous studies on the analysis of the user workloads (for systems Yahoo’s MyWeb in which users are allowed to share their such as peer-to-peer systems [10] and Web servers [2]) have bookmarks; and (3) Wiki systems that allow collaborative mainly focused on understanding the users who are accessing management of Web sites. -
News Release
NEWS RELEASE For immediate release Sue Gardner to deliver 16th annual LaFontaine-Baldwin Lecture Former head of CBC.ca and Wikimedia Foundation to open 6 Degrees Toronto TORONTO, August 13, 2018—6 Degrees announces that the 2018 LaFontaine-Baldwin Lecture will be delivered by leading digital pioneer Sue Gardner. The lecture will be given on September 24 as part of 6 Degrees Toronto, a project of the Institute for Canadian Citizenship. As senior director of CBC.ca, Gardner reinvented the Canadian Broadcasting Corporation’s place in the world of digital news. Later, as executive director of the Wikimedia Foundation, she played a crucial role in the explosive growth of Wikipedia. The San Francisco–based Gardner continues to be a sought-after global thought-leader: she currently advises media and technology companies, and serves on the boards of Privacy International and the Organized Crime and Corruption Reporting Project. “Sue Gardner is on the forefront of ideas on technology, democracy, and women’s roles in our society,” said ICC Co-founder and Co-chair John Ralston Saul. “As we witness the alarming erosion of democratic institutions, I can’t think of a more relevant voice to address the challenges ahead. I can’t wait to welcome Sue home to Toronto to deliver this year’s LaFontaine-Baldwin Lecture.” Titled Dark Times Ahead: Taking Back Truth, Freedom, and Technology, the interactive event will include Gardner in conversation with John Ralston Saul. Gardner joins an illustrious list of past LaFontaine-Baldwin lecturers, including His Highness the Aga Khan, Naomi Klein, Shawn A-in-chut Atleo, Michael Sandel, and Naheed Nenshi. -
Omnipedia: Bridging the Wikipedia Language
Omnipedia: Bridging the Wikipedia Language Gap Patti Bao*†, Brent Hecht†, Samuel Carton†, Mahmood Quaderi†, Michael Horn†§, Darren Gergle*† *Communication Studies, †Electrical Engineering & Computer Science, §Learning Sciences Northwestern University {patti,brent,sam.carton,quaderi}@u.northwestern.edu, {michael-horn,dgergle}@northwestern.edu ABSTRACT language edition contains its own cultural viewpoints on a We present Omnipedia, a system that allows Wikipedia large number of topics [7, 14, 15, 27]. On the other hand, readers to gain insight from up to 25 language editions of the language barrier serves to silo knowledge [2, 4, 33], Wikipedia simultaneously. Omnipedia highlights the slowing the transfer of less culturally imbued information similarities and differences that exist among Wikipedia between language editions and preventing Wikipedia’s 422 language editions, and makes salient information that is million monthly visitors [12] from accessing most of the unique to each language as well as that which is shared information on the site. more widely. We detail solutions to numerous front-end and algorithmic challenges inherent to providing users with In this paper, we present Omnipedia, a system that attempts a multilingual Wikipedia experience. These include to remedy this situation at a large scale. It reduces the silo visualizing content in a language-neutral way and aligning effect by providing users with structured access in their data in the face of diverse information organization native language to over 7.5 million concepts from up to 25 strategies. We present a study of Omnipedia that language editions of Wikipedia. At the same time, it characterizes how people interact with information using a highlights similarities and differences between each of the multilingual lens. -
The Culture of Wikipedia
Good Faith Collaboration: The Culture of Wikipedia Good Faith Collaboration The Culture of Wikipedia Joseph Michael Reagle Jr. Foreword by Lawrence Lessig The MIT Press, Cambridge, MA. Web edition, Copyright © 2011 by Joseph Michael Reagle Jr. CC-NC-SA 3.0 Purchase at Amazon.com | Barnes and Noble | IndieBound | MIT Press Wikipedia's style of collaborative production has been lauded, lambasted, and satirized. Despite unease over its implications for the character (and quality) of knowledge, Wikipedia has brought us closer than ever to a realization of the centuries-old Author Bio & Research Blog pursuit of a universal encyclopedia. Good Faith Collaboration: The Culture of Wikipedia is a rich ethnographic portrayal of Wikipedia's historical roots, collaborative culture, and much debated legacy. Foreword Preface to the Web Edition Praise for Good Faith Collaboration Preface Extended Table of Contents "Reagle offers a compelling case that Wikipedia's most fascinating and unprecedented aspect isn't the encyclopedia itself — rather, it's the collaborative culture that underpins it: brawling, self-reflexive, funny, serious, and full-tilt committed to the 1. Nazis and Norms project, even if it means setting aside personal differences. Reagle's position as a scholar and a member of the community 2. The Pursuit of the Universal makes him uniquely situated to describe this culture." —Cory Doctorow , Boing Boing Encyclopedia "Reagle provides ample data regarding the everyday practices and cultural norms of the community which collaborates to 3. Good Faith Collaboration produce Wikipedia. His rich research and nuanced appreciation of the complexities of cultural digital media research are 4. The Puzzle of Openness well presented. -
Supporting Organizational Effectiveness Within Movements
While organizational effectiveness is a common concern with nonprofit organizations and networks, especially ones that are growing, the de-centralized culture of Wikimedia presented some unique challenges and opportunities. In a movement that so strongly valued the autonomy of indi- vidual organizations, what did it mean for the Foundation to Supporting Organizational attempt to increase their effectiveness – and how could it do this in a way that was not top-down? Furthermore, the Effectiveness Within Foundation was concerned that its financial support was fueling momentum toward larger-budget, “traditional” or Movements staffed nonprofit structures which might not be consistent with its history and mission as a global free-knowledge move- A Wikimedia Foundation Case Study ment created and driven largely by individual volunteers. Funders supporting movements face a unique set of chal- The Foundation was grappling with two fundamental lenges when trying to support organizations working questions: for the same cause. In their efforts to increase organi- zational effectiveness, foundations invariably face ques- How could the Wikimedia Foundation help Wikime- dia groups make a clearer connection between their tions of credibility and control, and sometimes encoun- strategies and their desired impacts without being di- ter diverging understandings of success. This case study 1 rective about those strategies and desired impacts? profiles how TCC Group – with its partner, the Wikimedia Foundation – developed an innovative, participatory ap- Anecdotally WMF knew that some chapters were proach to these challenges, resulting in increased orga- “more effective” than others, but without clear cri- nizational effectiveness in the Wikimedia movement. teria for success it was difficult to determine how or why some groups were thriving. -
An Analysis of Contributions to Wikipedia from Tor
Are anonymity-seekers just like everybody else? An analysis of contributions to Wikipedia from Tor Chau Tran Kaylea Champion Andrea Forte Department of Computer Science & Engineering Department of Communication College of Computing & Informatics New York University University of Washington Drexel University New York, USA Seatle, USA Philadelphia, USA [email protected] [email protected] [email protected] Benjamin Mako Hill Rachel Greenstadt Department of Communication Department of Computer Science & Engineering University of Washington New York University Seatle, USA New York, USA [email protected] [email protected] Abstract—User-generated content sites routinely block contri- butions from users of privacy-enhancing proxies like Tor because of a perception that proxies are a source of vandalism, spam, and abuse. Although these blocks might be effective, collateral damage in the form of unrealized valuable contributions from anonymity seekers is invisible. One of the largest and most important user-generated content sites, Wikipedia, has attempted to block contributions from Tor users since as early as 2005. We demonstrate that these blocks have been imperfect and that thousands of attempts to edit on Wikipedia through Tor have been successful. We draw upon several data sources and analytical techniques to measure and describe the history of Tor editing on Wikipedia over time and to compare contributions from Tor users to those from other groups of Wikipedia users. Fig. 1. Screenshot of the page a user is shown when they attempt to edit the Our analysis suggests that although Tor users who slip through Wikipedia article on “Privacy” while using Tor. Wikipedia’s ban contribute content that is more likely to be reverted and to revert others, their contributions are otherwise similar in quality to those from other unregistered participants and to the initial contributions of registered users. -
100+ Ways to Assess
1 2 100+ Ways to Assess: 1. Annotated Bibliography: a list of citations followed by a brief descriptive and evaluative paragraph, the purpose of which is to inform the reader of the relevance, accuracy, and quality of the sources cited. 2. Assessment Stations: students rotate between different stations which have different questions to answer at them; e.g. a science practical assessment where questions relate to particular artefacts at each station. 3. Book Review: gives the reader a concise summary of the content including a relevant description of the topic as well as its overall perspective, argument, or purpose. Second, and more importantly, a review offers a critical assessment of the content. 4. Business Plan: a formal statement of business goals, reasons they are attainable, and plans for reaching them. It may also contain background information about the organization or team attempting to reach those goals. 5. Capstone Project: students pursue independent research on a question or problem of their choice, engage with the scholarly debates in the relevant disciplines, and produce a substantial thesis/dissertation providing a deep understanding of the topic. 6. Case Study: detailed examination of a subject of study (the case), as well as its related contextual conditions (often associated with medicine or law). 7. Clinical Observation: observation and feedback for teaching and evaluating medical student communication skills between trainee physician and patient may involve role plays or actual trainee-patient interactions. 8. Concept Map: a type of graphic organizer that can used to organize and represent knowledge of a subject. Concept maps begin with a main idea (or concept) and then branch out to show how that main idea can be broken down into specific topics. -
Mathematical Formulae in Wikimedia Projects 2020”
Preprint from https://www.gipp.com/pub/ M. Schubotz et al. \Mathematical Formulae in Wikimedia Projects 2020". In: Proc. ACM/IEEE JCDL. 2020. doi: 10.1145/3383583. 3398557 Mathematical Formulae in Wikimedia Projects 2020 Moritz Schubotz1,2, Andr´eGreiner-Petter1, Norman Meuschke1,3, Olaf Teschke2, and Bela Gipp1,3 1University of Wuppertal, Germany ([email protected], [email protected]) 2FIZ-Karlsruhe, Germany (ffirst.lastg@fiz-karlsruhe.de) 3University of Konstanz, Germany (ffi[email protected]) May 8, 2020 Abstract This poster summarizes our contributions to Wikimedia's processing pipeline for mathematical formulae. We describe how we have supported the transition from rendering formulae as course-grained PNG images in 2001 to providing modern semantically enriched language-independent MathML formulae in 2020. Additionally, we describe our plans to im- prove the accessibility and discoverability of mathematical knowledge in Wikimedia projects further. 1 Introduction Mathematical formulae are an integral part of Wikipedia and other projects of the Wikimedia foundation1. The MediaWiki software is the technical back- bone of many Wikimedia projects, including Wikipedia. Since 2003, wikitext { the markup language of MediaWiki { supports mathematical content [9]. For example, MediaWiki converts the wikitext code <math>E=mc^2</math> to the arXiv:2003.09417v2 [cs.DL] 6 May 2020 formula E = mc2. While the markup for mathematical formulae has remained stable since 2003, MediaWikis pipeline for processing wikitext to render formu- lae has changed significantly. Initially, MediaWiki used LaTeX to convert math tags in wikitext to PNG images. The rendering process was slow, the images did not integrate well into the text, were inaccessible to screen readers for visually impaired users, and scaled poorly for both small and high-resolution screens. -
Jimmy Wales and Larry Sanger, It Is the Largest, Fastest-Growing and Most Popular General Reference Work Currently Available on the Internet
Tomasz „Polimerek” Ganicz Wikimedia Polska WikipediaWikipedia andand otherother WikimediaWikimedia projectsprojects WhatWhat isis Wikipedia?Wikipedia? „Imagine„Imagine aa worldworld inin whichwhich everyevery singlesingle humanhuman beingbeing cancan freelyfreely shareshare inin thethe sumsum ofof allall knowledge.knowledge. That'sThat's ourour commitment.”commitment.” JimmyJimmy „Jimbo”„Jimbo” Wales Wales –– founder founder ofof WikipediaWikipedia As defined by itself: Wikipedia is a free multilingual, open content encyclopedia project operated by the non-profit Wikimedia Foundation. Its name is a blend of the words wiki (a technology for creating collaborative websites) and encyclopedia. Launched in January 2001 by Jimmy Wales and Larry Sanger, it is the largest, fastest-growing and most popular general reference work currently available on the Internet. OpenOpen and and free free content content RichardRichard StallmanStallman definition definition of of free free software: software: „The„The wordword "free""free" inin ourour namename doesdoes notnot referrefer toto price;price; itit refersrefers toto freedom.freedom. First,First, thethe freedomfreedom toto copycopy aa programprogram andand redistributeredistribute itit toto youryour neighbors,neighbors, soso thatthat theythey cancan useuse itit asas wellwell asas you.you. Second,Second, thethe freedomfreedom toto changechange aa program,program, soso ththatat youyou cancan controlcontrol itit insteadinstead ofof itit controllingcontrolling you;you; forfor this,this, thethe sourcesource