UNIVERSITY of CALIFORNIA Santa Barbara Utilizing Wikipedia As A
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Modeling Popularity and Reliability of Sources in Multilingual Wikipedia
information Article Modeling Popularity and Reliability of Sources in Multilingual Wikipedia Włodzimierz Lewoniewski * , Krzysztof W˛ecel and Witold Abramowicz Department of Information Systems, Pozna´nUniversity of Economics and Business, 61-875 Pozna´n,Poland; [email protected] (K.W.); [email protected] (W.A.) * Correspondence: [email protected] Received: 31 March 2020; Accepted: 7 May 2020; Published: 13 May 2020 Abstract: One of the most important factors impacting quality of content in Wikipedia is presence of reliable sources. By following references, readers can verify facts or find more details about described topic. A Wikipedia article can be edited independently in any of over 300 languages, even by anonymous users, therefore information about the same topic may be inconsistent. This also applies to use of references in different language versions of a particular article, so the same statement can have different sources. In this paper we analyzed over 40 million articles from the 55 most developed language versions of Wikipedia to extract information about over 200 million references and find the most popular and reliable sources. We presented 10 models for the assessment of the popularity and reliability of the sources based on analysis of meta information about the references in Wikipedia articles, page views and authors of the articles. Using DBpedia and Wikidata we automatically identified the alignment of the sources to a specific domain. Additionally, we analyzed the changes of popularity and reliability in time and identified growth leaders in each of the considered months. The results can be used for quality improvements of the content in different languages versions of Wikipedia. -
The Culture of Wikipedia
Good Faith Collaboration: The Culture of Wikipedia Good Faith Collaboration The Culture of Wikipedia Joseph Michael Reagle Jr. Foreword by Lawrence Lessig The MIT Press, Cambridge, MA. Web edition, Copyright © 2011 by Joseph Michael Reagle Jr. CC-NC-SA 3.0 Purchase at Amazon.com | Barnes and Noble | IndieBound | MIT Press Wikipedia's style of collaborative production has been lauded, lambasted, and satirized. Despite unease over its implications for the character (and quality) of knowledge, Wikipedia has brought us closer than ever to a realization of the centuries-old Author Bio & Research Blog pursuit of a universal encyclopedia. Good Faith Collaboration: The Culture of Wikipedia is a rich ethnographic portrayal of Wikipedia's historical roots, collaborative culture, and much debated legacy. Foreword Preface to the Web Edition Praise for Good Faith Collaboration Preface Extended Table of Contents "Reagle offers a compelling case that Wikipedia's most fascinating and unprecedented aspect isn't the encyclopedia itself — rather, it's the collaborative culture that underpins it: brawling, self-reflexive, funny, serious, and full-tilt committed to the 1. Nazis and Norms project, even if it means setting aside personal differences. Reagle's position as a scholar and a member of the community 2. The Pursuit of the Universal makes him uniquely situated to describe this culture." —Cory Doctorow , Boing Boing Encyclopedia "Reagle provides ample data regarding the everyday practices and cultural norms of the community which collaborates to 3. Good Faith Collaboration produce Wikipedia. His rich research and nuanced appreciation of the complexities of cultural digital media research are 4. The Puzzle of Openness well presented. -
Name of the Tool Citizendium Home Page Logo URL En.Citizendium
Name of the Tool Citizendium Home Page Logo URL en.citizendium.org Subject Encyclopedias Accessibility Free Language English Publisher CZ: Media Assets Workgroup Brief History It is an English-language Wiki-based free encyclopedia project launched by Larry Sanger, who had previously co - founded Wikipedia in 2001. It had launched on 23 October 2006 (as pilot project) and on 25 March 2007 (publicly). Scope and Coverage It serves all over the world and provides the access and modifications of information related to the “parent topics” or main topics like Natural Sciences, Social Sciences, Humanities, Arts, Applied arts and sciences, Recreation. Under each main topic or parent topic, there is hyperlinked list of sub topics and other related topics. As for example, under the main topic “ Natural Sciences” there is a following list of “Subtopics” and “Other related topics”: Subtopics Physics Chemistry Biology Astronomy Earth science Mathematics Other related topics Natural philosophy Natural history Applied science Health science Geology Under the main topic “Humanities” there is a list of following topics: Subtopics Classics History Literature Philosophy Religion Theology Other related topics Art Applied arts Education Law Music Science Social science Scholarship Society Theatre The encyclopedia includes total number of 16,891 articles when last accessed. Kind of Information Every article under subtopics and other related topics is provided with “Talk”, “Related Articles”, “Biography”, “External Links”, “Citable Version”, “Video” related to that article. Brief description, history of a topic etc. are present in the articles. Coloured images on topics, charts, graphs etc. are available where applicable. Notes and references are also found after the articles. -
SI 410 Ethics and Information Technology
Author(s): Paul Conway, PhD, 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/ We have reviewed this material in accordance with U.S. Copyright Law and have tried to maximize your ability to use, share, and adapt it. The citation key on the following slide provides information about how you may share and adapt this material. Copyright holders of content included in this material should contact [email protected] with any questions, corrections, or clarification regarding the use of content. For more information about how to cite these materials visit http://open.umich.edu/privacy-and-terms-use. Any medical information in this material is intended to inform and educate and is not a tool for self-diagnosis or a replacement for medical evaluation, advice, diagnosis or treatment by a healthcare professional. Please speak to your physician if you have questions about your medical condition. Viewer discretion is advised: Some medical content is graphic and may not be suitable for all viewers. Citation Key for more information see: http://open.umich.edu/wiki/CitationPolicy Use + Share + Adapt { Content the copyright holder, author, or law permits you to use, share and adapt. } Public Domain – Government: Works that are produced by the U.S. Government. (17 USC § 105) Public Domain – Expired: Works that are no longer protected due to an expired copyright term. Public Domain – Self Dedicated: Works that a copyright holder has dedicated to the public domain. -
Télécharger Le Texte Intégral En Format
ANNUAIRE FRANÇAIS DE RELATIONS INTERNATIONALES 2019 Volume XX PUBLICATION COURONNÉE PAR L’ACADÉMIE DES SCIENCES MORALES ET POLITIQUES (Prix de la Fondation Edouard Bonnefous, 2008) Université Panthéon-Assas Centre Thucydide AFRI_2019_v3_1124p.indd 3 24/04/2019 11:44 WIKIPÉDIA PAR VALÉRIE NICOLAS (*) Wikipédia (1) est un site Internet collaboratif qui se décrit lui-même comme « encyclopédie en libre accès, en lecture comme en écriture » (2). Multilingue, ce site est servi par un logiciel identique le Mediawiki. Il est adossé à une fondation à but non lucratif Wikimedia, qui assure son fonctionnement et gère d’autres projets frères. Le contenu de Wikipédia est disponible sous licence libre (3). Ainsi chacun peut le recopier, le modifier et l’utiliser. Le projet encyclopédique est alimenté par chaque utilisateur par une écriture collaborative, participative et bénévole. Crée en 2001 par deux ressortissants américains (4), Wikipédia (WP) est en 2014 le 5e site le plus fréquenté au monde (5). 500 millions de visiteurs le consultent chaque mois. Il offre aux lecteurs plus de 30 millions d’articles dans plus de 300 versions linguistiques. La version en anglais – matrice du projet –, compte plus de 5 millions de contributions. Les chiffres sont évocateurs du formidable recueil de connaissances que WP constitue. Internet est le moteur et le vecteur du succès du projet Wikipédia. Internet est un réseau de réseaux informatiques international organisé grâce à un protocole unique de communication (TCP/IP). Outil de communication, le World Wide Web (Web), un des services fournis par le réseau (6), a bouleversé les échanges entre les individus. Le réseau permet leur multiplication sans considération de frontières, ni de temps. -
Mapping the Nature of Content and Processes on the English Wikipedia
‘Creating Knowledge’: Mapping the nature of content and processes on the English Wikipedia Report submitted by Sohnee Harshey M. Phil Scholar, Advanced Centre for Women's Studies Tata Institute of Social Sciences Mumbai April 2014 Study Commissioned by the Higher Education Innovation and Research Applications (HEIRA) Programme, Centre for the Study of Culture and Society, Bangalore as part of an initiative on ‘Mapping Digital Humanities in India’, in collaboration with the Centre for Internet and Society, Bangalore. Supported by the Ford Foundation’s ‘Pathways to Higher Education Programme’ (2009-13) Introduction Run a search on Google and one of the first results to show up would be a Wikipedia entry. So much so, that from ‘googled it’, the phrase ‘wikied it’ is catching up with students across university campuses. The Wikipedia, which is a ‘collaboratively edited, multilingual, free Internet encyclopedia’1, is hugely popular simply because of the range and extent of topics covered in a format that is now familiar to most people using the internet. It is not unknown that the ‘quick ready reference’ nature of Wikipedia makes it a popular source even for those in the higher education system-for quick information and even as a starting point for academic writing. Since there is no other source which is freely available on the internet-both in terms of access and information, the content from Wikipedia is thrown up when one runs searches on Google, Yahoo or other search engines. With Wikipedia now accessible on phones, the rate of distribution of information as well as the rate of access have gone up; such use necessitates that the content on this platform must be neutral and at the same time sensitive to the concerns of caste, gender, ethnicity, race etc. -
Does Wikipedia Matter? the Effect of Wikipedia on Tourist Choices Marit Hinnosaar, Toomas Hinnosaar, Michael Kummer, and Olga Slivko Discus Si On Paper No
Dis cus si on Paper No. 15-089 Does Wikipedia Matter? The Effect of Wikipedia on Tourist Choices Marit Hinnosaar, Toomas Hinnosaar, Michael Kummer, and Olga Slivko Dis cus si on Paper No. 15-089 Does Wikipedia Matter? The Effect of Wikipedia on Tourist Choices Marit Hinnosaar, Toomas Hinnosaar, Michael Kummer, and Olga Slivko First version: December 2015 This version: September 2017 Download this ZEW Discussion Paper from our ftp server: http://ftp.zew.de/pub/zew-docs/dp/dp15089.pdf Die Dis cus si on Pape rs die nen einer mög lichst schnel len Ver brei tung von neue ren For schungs arbei ten des ZEW. Die Bei trä ge lie gen in allei ni ger Ver ant wor tung der Auto ren und stel len nicht not wen di ger wei se die Mei nung des ZEW dar. Dis cus si on Papers are inten ded to make results of ZEW research prompt ly avai la ble to other eco no mists in order to encou ra ge dis cus si on and sug gesti ons for revi si ons. The aut hors are sole ly respon si ble for the con tents which do not neces sa ri ly repre sent the opi ni on of the ZEW. Does Wikipedia Matter? The Effect of Wikipedia on Tourist Choices ∗ Marit Hinnosaar† Toomas Hinnosaar‡ Michael Kummer§ Olga Slivko¶ First version: December 2015 This version: September 2017 September 25, 2017 Abstract We document a causal influence of online user-generated information on real- world economic outcomes. In particular, we conduct a randomized field experiment to test whether additional information on Wikipedia about cities affects tourists’ choices of overnight visits. -
Flags and Banners
Flags and Banners A Wikipedia Compilation by Michael A. Linton Contents 1 Flag 1 1.1 History ................................................. 2 1.2 National flags ............................................. 4 1.2.1 Civil flags ........................................... 8 1.2.2 War flags ........................................... 8 1.2.3 International flags ....................................... 8 1.3 At sea ................................................. 8 1.4 Shapes and designs .......................................... 9 1.4.1 Vertical flags ......................................... 12 1.5 Religious flags ............................................. 13 1.6 Linguistic flags ............................................. 13 1.7 In sports ................................................ 16 1.8 Diplomatic flags ............................................ 18 1.9 In politics ............................................... 18 1.10 Vehicle flags .............................................. 18 1.11 Swimming flags ............................................ 19 1.12 Railway flags .............................................. 20 1.13 Flagpoles ............................................... 21 1.13.1 Record heights ........................................ 21 1.13.2 Design ............................................. 21 1.14 Hoisting the flag ............................................ 21 1.15 Flags and communication ....................................... 21 1.16 Flapping ................................................ 23 1.17 See also ............................................... -
Lessons from Citizendium
Lessons from Citizendium Wikimania 2009, Buenos Aires, 28 August 2009 HaeB [[de:Benutzer:HaeB]], [[en:User:HaeB]] Please don't take photos during this talk. Citizendium Timeline ● September 2006: Citizendium announced. Sole founder: Larry Sanger, known as former editor-in-chief of Nupedia, chief organizer of Wikipedia (2001-2002), and later as Wikipedia critic ● October 2006: Started non-public pilot phase ● January 2007: “Big Unfork”: All unmodified copies of Wikipedia articles deleted ● March 2007: Public launch ● December 2007: Decision to use CC-BY-3.0, after debate about commercial reuse and compatibility with Wikipedia ● Mid-2009: Sanger largely inactive on Citizendium, focuses on WatchKnow ● August 2009: Larry Sanger announces he will step down as editor-in-chief soon (as committed to in 2006) Citizendium and Wikipedia: Similarities and differences ● Encyclopedia ● Strict real names ● Free license policy ● ● Open (anyone can Special role for contribute) experts: “editors” can issue content ● Created by amateurs decisions, binding to ● MediaWiki-based non-editors collaboration ● Governance: Social ● Non-profit contract, elements of a constitutional republic Wikipedian views of Citizendium ● Competitor for readers, contributions ● Ally, common goal of creating free encyclopedic content ● “Who?” ● In this talk: A long-time experiment testing several fundamental policy changes, in a framework which is still similar enough to that of Wikipedia to generate valuable evidence as to what their effect might be on WP Active editors: Waiting to explode ● Sanger (October 2007): ”At some point, possibly very soon, the Citizendium will grow explosively-- say, quadruple the number of its active contributors, or even grow by an order of magnitude ....“ © Aleksander Stos, CC-BY 3.0 Number of users that made at least one edit in each month Article creation rate: Still muddling Sanger (October 2007): “It's still possible that the project will, from here until eternity, muddle on creating 14 articles per day. -
Reliability of Wikipedia 1 Reliability of Wikipedia
Reliability of Wikipedia 1 Reliability of Wikipedia The reliability of Wikipedia (primarily of the English language version), compared to other encyclopedias and more specialized sources, is assessed in many ways, including statistically, through comparative review, analysis of the historical patterns, and strengths and weaknesses inherent in the editing process unique to Wikipedia. [1] Because Wikipedia is open to anonymous and collaborative editing, assessments of its Vandalism of a Wikipedia article reliability usually include examinations of how quickly false or misleading information is removed. An early study conducted by IBM researchers in 2003—two years following Wikipedia's establishment—found that "vandalism is usually repaired extremely quickly — so quickly that most users will never see its effects"[2] and concluded that Wikipedia had "surprisingly effective self-healing capabilities".[3] A 2007 peer-reviewed study stated that "42% of damage is repaired almost immediately... Nonetheless, there are still hundreds of millions of damaged views."[4] Several studies have been done to assess the reliability of Wikipedia. A notable early study in the journal Nature suggested that in 2005, Wikipedia scientific articles came close to the level of accuracy in Encyclopædia Britannica and had a similar rate of "serious errors".[5] This study was disputed by Encyclopædia Britannica.[6] By 2010 reviewers in medical and scientific fields such as toxicology, cancer research and drug information reviewing Wikipedia against professional and peer-reviewed sources found that Wikipedia's depth and coverage were of a very high standard, often comparable in coverage to physician databases and considerably better than well known reputable national media outlets. -
What You Say Is Who You Are. How Open Government Data Facilitates Profiling Politicians
What you say is who you are. How open government data facilitates profiling politicians Maarten Marx and Arjan Nusselder ISLA, Informatics Institute, University of Amsterdam Science Park 107 1098XG Amsterdam, The Netherlands Abstract. A system is proposed and implemented that creates a lan- guage model for each member of the Dutch parliament, based on the official transcripts of the meetings of the Dutch Parliament. Using ex- pert finding techniques, the system allows users to retrieve a ranked list of politicians, based on queries like news messages. The high quality of the system is due to extensive data cleaning and transformation which could have been avoided when it had been available in an open machine readable format. 1 Introduction The Internet is changing from a web of documents into a web of objects. Open and interoperable (linkable) data are crucial for web applications which are build around objects. Examples of objects featuring prominently is (mashup) websites are traditional named entities like persons, products, organizations [6,4], but also events and unique items like e.g. houses. The success of several mashup sites is simply due to the fact that they provide a different grouping of already (freely) available data. Originally the data could only be grouped by documents; the mashup allows for groupings by objects which are of interest in their specific domain. Here is an example from the political domain. Suppose one wants to know more about Herman van Rompuy, the new EU “president” from Belgium. Being a former member of the Belgium parliament and several governments, an im- portant primary source of information are the parliamentary proceedings. -
Federated Ontology Search Vasco Calais Pedro CMU-LTI-09-010
Federated Ontology Search Vasco Calais Pedro CMU-LTI-09-010 Language Technologies Institute School of Computer Science Carnegie Mellon University 5000 Forbes Ave. Pittsburgh, PA 15213 www.lti.cs.cmu.edu Thesis Committee: Jaime Carbonell, Chair Eric Nyberg Robert Frederking Eduard Hovy, Information Sciences Institute Submitted in partial fulfillment of the requirements for the degree Doctor of Philosophy In Language and Information Technologies Copyright © 2009 Vasco Calais Pedro For my grandmother, Avó Helena, I am sorry I wasn’t there Abstract An Ontology can be defined as a formal representation of a set of concepts within a domain and the relationships between those concepts. The development of the semantic web initiative is rapidly increasing the number of publicly available ontologies. In such a distributed environment, complex applications often need to handle multiple ontologies in order to provide adequate domain coverage. Surprisingly, there is a lack of adequate frameworks for enabling the use of multiple ontologies transparently while abstracting the particular ontological structures used by that framework. Given that any ontology represents the views of its author or authors, using multiple ontologies requires us to deal with several significant challenges, some stemming from the nature of knowledge itself, such as cases of polysemy or homography, and some stemming from the structures that we choose to represent such knowledge with. The focus of this thesis is to explore a set of techniques that will allow us to overcome some of the challenges found when using multiple ontologies, thus making progress in the creation of a functional information access platform for structured sources.