What Is Trending on Wikipedia?
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Modeling Popularity and Reliability of Sources in Multilingual Wikipedia
information Article Modeling Popularity and Reliability of Sources in Multilingual Wikipedia Włodzimierz Lewoniewski * , Krzysztof W˛ecel and Witold Abramowicz Department of Information Systems, Pozna´nUniversity of Economics and Business, 61-875 Pozna´n,Poland; [email protected] (K.W.); [email protected] (W.A.) * Correspondence: [email protected] Received: 31 March 2020; Accepted: 7 May 2020; Published: 13 May 2020 Abstract: One of the most important factors impacting quality of content in Wikipedia is presence of reliable sources. By following references, readers can verify facts or find more details about described topic. A Wikipedia article can be edited independently in any of over 300 languages, even by anonymous users, therefore information about the same topic may be inconsistent. This also applies to use of references in different language versions of a particular article, so the same statement can have different sources. In this paper we analyzed over 40 million articles from the 55 most developed language versions of Wikipedia to extract information about over 200 million references and find the most popular and reliable sources. We presented 10 models for the assessment of the popularity and reliability of the sources based on analysis of meta information about the references in Wikipedia articles, page views and authors of the articles. Using DBpedia and Wikidata we automatically identified the alignment of the sources to a specific domain. Additionally, we analyzed the changes of popularity and reliability in time and identified growth leaders in each of the considered months. The results can be used for quality improvements of the content in different languages versions of Wikipedia. -
Omnipedia: Bridging the Wikipedia Language
Omnipedia: Bridging the Wikipedia Language Gap Patti Bao*†, Brent Hecht†, Samuel Carton†, Mahmood Quaderi†, Michael Horn†§, Darren Gergle*† *Communication Studies, †Electrical Engineering & Computer Science, §Learning Sciences Northwestern University {patti,brent,sam.carton,quaderi}@u.northwestern.edu, {michael-horn,dgergle}@northwestern.edu ABSTRACT language edition contains its own cultural viewpoints on a We present Omnipedia, a system that allows Wikipedia large number of topics [7, 14, 15, 27]. On the other hand, readers to gain insight from up to 25 language editions of the language barrier serves to silo knowledge [2, 4, 33], Wikipedia simultaneously. Omnipedia highlights the slowing the transfer of less culturally imbued information similarities and differences that exist among Wikipedia between language editions and preventing Wikipedia’s 422 language editions, and makes salient information that is million monthly visitors [12] from accessing most of the unique to each language as well as that which is shared information on the site. more widely. We detail solutions to numerous front-end and algorithmic challenges inherent to providing users with In this paper, we present Omnipedia, a system that attempts a multilingual Wikipedia experience. These include to remedy this situation at a large scale. It reduces the silo visualizing content in a language-neutral way and aligning effect by providing users with structured access in their data in the face of diverse information organization native language to over 7.5 million concepts from up to 25 strategies. We present a study of Omnipedia that language editions of Wikipedia. At the same time, it characterizes how people interact with information using a highlights similarities and differences between each of the multilingual lens. -
Title of Thesis: ABSTRACT CLASSIFYING BIAS
ABSTRACT Title of Thesis: CLASSIFYING BIAS IN LARGE MULTILINGUAL CORPORA VIA CROWDSOURCING AND TOPIC MODELING Team BIASES: Brianna Caljean, Katherine Calvert, Ashley Chang, Elliot Frank, Rosana Garay Jáuregui, Geoffrey Palo, Ryan Rinker, Gareth Weakly, Nicolette Wolfrey, William Zhang Thesis Directed By: Dr. David Zajic, Ph.D. Our project extends previous algorithmic approaches to finding bias in large text corpora. We used multilingual topic modeling to examine language-specific bias in the English, Spanish, and Russian versions of Wikipedia. In particular, we placed Spanish articles discussing the Cold War on a Russian-English viewpoint spectrum based on similarity in topic distribution. We then crowdsourced human annotations of Spanish Wikipedia articles for comparison to the topic model. Our hypothesis was that human annotators and topic modeling algorithms would provide correlated results for bias. However, that was not the case. Our annotators indicated that humans were more perceptive of sentiment in article text than topic distribution, which suggests that our classifier provides a different perspective on a text’s bias. CLASSIFYING BIAS IN LARGE MULTILINGUAL CORPORA VIA CROWDSOURCING AND TOPIC MODELING by Team BIASES: Brianna Caljean, Katherine Calvert, Ashley Chang, Elliot Frank, Rosana Garay Jáuregui, Geoffrey Palo, Ryan Rinker, Gareth Weakly, Nicolette Wolfrey, William Zhang Thesis submitted in partial fulfillment of the requirements of the Gemstone Honors Program, University of Maryland, 2018 Advisory Committee: Dr. David Zajic, Chair Dr. Brian Butler Dr. Marine Carpuat Dr. Melanie Kill Dr. Philip Resnik Mr. Ed Summers © Copyright by Team BIASES: Brianna Caljean, Katherine Calvert, Ashley Chang, Elliot Frank, Rosana Garay Jáuregui, Geoffrey Palo, Ryan Rinker, Gareth Weakly, Nicolette Wolfrey, William Zhang 2018 Acknowledgements We would like to express our sincerest gratitude to our mentor, Dr. -
Arxiv:1212.0018V2 [Cs.SI] 18 Dec 2012 Etn Ucin Ffe-Akteoois[ Economies Free-Market of Functions Setting E.[ Ref
Evidence for Non-Finite-State Computation in a Human Social System Simon DeDeo∗ Santa Fe Institute, Santa Fe, NM 87501, USA (Dated: December 19, 2012) We investigate the computational structure of a paradigmatic example of distributed so- cial interaction: that of the open-source Wikipedia community. The typical computational approach to modeling such a system is to rely on finite-state machines. However, we find strong evidence in this system for the emergence of processing powers over and above the finite-state. Thus, Wikipedia, understood as an information processing system, must have access to (at least one) effectively unbounded resource. The nature of this resource is such that one observes far longer runs of cooperative behavior than one would expect using finite- state models. We provide evidence that the emergence of this non-finite-state computation is driven by collective interaction effects. Social systems—particularly human social systems—process information. From the price- setting functions of free-market economies [1, 2] to resource management in traditional communi- ties [3], from deliberations in large-scale democracies [4, 5] to the formation of opinions and spread of reputational information in organizations [6] and social groups [7, 8], it has been recognized that such groups can perform functions analogous to (and often better than) engineered systems. Such functional roles are found in groups in addition to their contingent historical aspects and, when described mathematically, may be compared across cultures and times. The computational phenomena implicit in social systems are only now, with the advent of large, high-resolution data-sets, coming under systematic, empirical study at large scales. -
An Analysis of Contributions to Wikipedia from Tor
Are anonymity-seekers just like everybody else? An analysis of contributions to Wikipedia from Tor Chau Tran Kaylea Champion Andrea Forte Department of Computer Science & Engineering Department of Communication College of Computing & Informatics New York University University of Washington Drexel University New York, USA Seatle, USA Philadelphia, USA [email protected] [email protected] [email protected] Benjamin Mako Hill Rachel Greenstadt Department of Communication Department of Computer Science & Engineering University of Washington New York University Seatle, USA New York, USA [email protected] [email protected] Abstract—User-generated content sites routinely block contri- butions from users of privacy-enhancing proxies like Tor because of a perception that proxies are a source of vandalism, spam, and abuse. Although these blocks might be effective, collateral damage in the form of unrealized valuable contributions from anonymity seekers is invisible. One of the largest and most important user-generated content sites, Wikipedia, has attempted to block contributions from Tor users since as early as 2005. We demonstrate that these blocks have been imperfect and that thousands of attempts to edit on Wikipedia through Tor have been successful. We draw upon several data sources and analytical techniques to measure and describe the history of Tor editing on Wikipedia over time and to compare contributions from Tor users to those from other groups of Wikipedia users. Fig. 1. Screenshot of the page a user is shown when they attempt to edit the Our analysis suggests that although Tor users who slip through Wikipedia article on “Privacy” while using Tor. Wikipedia’s ban contribute content that is more likely to be reverted and to revert others, their contributions are otherwise similar in quality to those from other unregistered participants and to the initial contributions of registered users. -
Emotions and Activity Profiles of Influential Users in Product Reviews Communities
Research Collection Journal Article Emotions and activity profiles of influential users in product reviews communities Author(s): Tanase, Dorian; Garcia, David; Garas, Antonios; Schweitzer, Frank Publication Date: 2015-11 Permanent Link: https://doi.org/10.3929/ethz-b-000108082 Originally published in: Frontiers in Physics 3, http://doi.org/10.3389/fphy.2015.00087 Rights / License: Creative Commons Attribution 4.0 International This page was generated automatically upon download from the ETH Zurich Research Collection. For more information please consult the Terms of use. ETH Library ORIGINAL RESEARCH published: 17 November 2015 doi: 10.3389/fphy.2015.00087 Emotions and Activity Profiles of Influential Users in Product Reviews Communities Dorian Tanase, David Garcia *, Antonios Garas and Frank Schweitzer Chair of Systems Design, ETH Zurich, Zurich, Switzerland Viral marketing seeks to maximize the spread of a campaign through an online social network, often targeting influential nodes with high centrality. In this article, we analyze behavioral aspects of influential users in trust-based product reviews communities, quantifying emotional expression, helpfulness, and user activity level. We focus on two independent product review communities, Dooyoo and Epinions, in which users can write product reviews and define trust links to filter product recommendations. Following the patterns of social contagion processes, we measure user social influence by means of the k-shell decomposition of trust networks. For each of these users, we apply sentiment analysis to extract their extent of positive, negative, and neutral emotional expression. In addition, we quantify the level of feedback they received in their reviews, the length of their contributions, and their level of activity over their lifetime in the community. -
Brain & Cognitive Science 2016
Brain & Cognitive Science 2016 press.princeton.edu Contents 1 general interest 3 psychology New New 6 Phishing for Phools How to Clone a The Economics of Mammoth social science Manipulation and Deception The Science of De-Extinction George A. Akerlof & Beth Shapiro Robert J. Shiller 9 “[A] fascinating book. A great biology & “This fun but serious book tells popular science title, and one neuroscience how the standard story about that makes it clear that a future free markets often gets it wrong. you may have imagined is already 11 Indeed, Akerlof and Shiller suggest underway.” that we should drop the view —Library Journal, starred review philosophy of markets as generally benign “As a researcher who is shaping institutions. The argument is laid this eld, Shapiro is the perfect out with the help of fascinating 12 guide to the ongoing discussion anecdotes, the language is con- best of the backlist about de-extinction. While many versational, and the book is easy news items and conference to read.” presentations have focused on 13 —Dani Rodrik, author of The the technology required to create index | order form Globalization Paradox extinct life, Shapiro carefully “Phishing for Phools is a coherent considers every step along the and highly plausible explanation journey to de-extinction, from of why markets—although usually choosing a species to revive to bene cial—can lead to undesir- making sure they don’t become able outcomes. The book takes extinct all over again. Whether an intriguing approach and gives you’re all for de-extinction or many interesting examples.” against it, Shapiro’s sharp, witty, —Diane Coyle, author of GDP: A and impeccably-argued book is Brief but A ectionate History essential for informing those who Ever since Adam Smith, the central will decide what life will become.” teaching of economics has been —Brian Switek, NationalGeo- that free markets provide us with graphic.com’s Laelaps blog material well-being, as if by an in- “Beth Shapiro. -
The Case of Croatian Wikipedia: Encyclopaedia of Knowledge Or Encyclopaedia for the Nation?
The Case of Croatian Wikipedia: Encyclopaedia of Knowledge or Encyclopaedia for the Nation? 1 Authorial statement: This report represents the evaluation of the Croatian disinformation case by an external expert on the subject matter, who after conducting a thorough analysis of the Croatian community setting, provides three recommendations to address the ongoing challenges. The views and opinions expressed in this report are those of the author and do not necessarily reflect the official policy or position of the Wikimedia Foundation. The Wikimedia Foundation is publishing the report for transparency. Executive Summary Croatian Wikipedia (Hr.WP) has been struggling with content and conduct-related challenges, causing repeated concerns in the global volunteer community for more than a decade. With support of the Wikimedia Foundation Board of Trustees, the Foundation retained an external expert to evaluate the challenges faced by the project. The evaluation, conducted between February and May 2021, sought to assess whether there have been organized attempts to introduce disinformation into Croatian Wikipedia and whether the project has been captured by ideologically driven users who are structurally misaligned with Wikipedia’s five pillars guiding the traditional editorial project setup of the Wikipedia projects. Croatian Wikipedia represents the Croatian standard variant of the Serbo-Croatian language. Unlike other pluricentric Wikipedia language projects, such as English, French, German, and Spanish, Serbo-Croatian Wikipedia’s community was split up into Croatian, Bosnian, Serbian, and the original Serbo-Croatian wikis starting in 2003. The report concludes that this structure enabled local language communities to sort by points of view on each project, often falling along political party lines in the respective regions. -
Selling Sex: What Determines Rates and Popularity? an Analysis of 11,500 Online Profiles, Culture, Health & Sexuality
This is an authors’ copy of: Alicia Mergenthaler & Taha Yasseri (2021) Selling sex: what determines rates and popularity? An analysis of 11,500 online profiles, Culture, Health & Sexuality. DOI: 10.1080/13691058.2021.1901145 Selling sex: what determines rates and popularity? An analysis of 11,500 online profiles Alicia Mergenthalera, Taha Yasseri*abc aOxford Internet Institute, University of Oxford, Oxford, UK; bSchool of Sociology, University College Dublin, Dublin, Ireland; cGeary Institute for Public Policy, University College Dublin, Dublin, Ireland *Corresponding Author: Taha Yasseri: [email protected] Abstract Sex work, or the exchange of sexual services for money or goods, is ubiquitous across eras and cultures. However, the practice of selling sex is often hidden due to stigma and the varying legal status of sex work. Online platforms that sex workers use to advertise services have become an increasingly important means of studying a market that is largely hidden. Although prior literature has primarily shed light on sex work from a public health or policy perspective (focusing largely on female sex workers), there are few studies that empirically research patterns of service provision in online sex work. This study investigated the determinants of pricing and popularity in the market for commercial sexual services online by using data from the largest UK network of online sexual services, a platform that is the industry-standard for sex workers. While the size of these influences varies across genders, nationality, age and the services provided are shown to be primary drivers of rates and popularity in sex work. Keywords: sex work, popularity dynamics, gender, online marketplace, UK Introduction In this article, we analyse a dataset from AdultWork.com, a UK-based online platform to determine drivers behind pricing and popularity in the market for sex work. -
Does Wikipedia Matter? the Effect of Wikipedia on Tourist Choices Marit Hinnosaar, Toomas Hinnosaar, Michael Kummer, and Olga Slivko Discus Si On Paper No
Dis cus si on Paper No. 15-089 Does Wikipedia Matter? The Effect of Wikipedia on Tourist Choices Marit Hinnosaar, Toomas Hinnosaar, Michael Kummer, and Olga Slivko Dis cus si on Paper No. 15-089 Does Wikipedia Matter? The Effect of Wikipedia on Tourist Choices Marit Hinnosaar, Toomas Hinnosaar, Michael Kummer, and Olga Slivko First version: December 2015 This version: September 2017 Download this ZEW Discussion Paper from our ftp server: http://ftp.zew.de/pub/zew-docs/dp/dp15089.pdf Die Dis cus si on Pape rs die nen einer mög lichst schnel len Ver brei tung von neue ren For schungs arbei ten des ZEW. Die Bei trä ge lie gen in allei ni ger Ver ant wor tung der Auto ren und stel len nicht not wen di ger wei se die Mei nung des ZEW dar. Dis cus si on Papers are inten ded to make results of ZEW research prompt ly avai la ble to other eco no mists in order to encou ra ge dis cus si on and sug gesti ons for revi si ons. The aut hors are sole ly respon si ble for the con tents which do not neces sa ri ly repre sent the opi ni on of the ZEW. Does Wikipedia Matter? The Effect of Wikipedia on Tourist Choices ∗ Marit Hinnosaar† Toomas Hinnosaar‡ Michael Kummer§ Olga Slivko¶ First version: December 2015 This version: September 2017 September 25, 2017 Abstract We document a causal influence of online user-generated information on real- world economic outcomes. In particular, we conduct a randomized field experiment to test whether additional information on Wikipedia about cities affects tourists’ choices of overnight visits. -
Flags and Banners
Flags and Banners A Wikipedia Compilation by Michael A. Linton Contents 1 Flag 1 1.1 History ................................................. 2 1.2 National flags ............................................. 4 1.2.1 Civil flags ........................................... 8 1.2.2 War flags ........................................... 8 1.2.3 International flags ....................................... 8 1.3 At sea ................................................. 8 1.4 Shapes and designs .......................................... 9 1.4.1 Vertical flags ......................................... 12 1.5 Religious flags ............................................. 13 1.6 Linguistic flags ............................................. 13 1.7 In sports ................................................ 16 1.8 Diplomatic flags ............................................ 18 1.9 In politics ............................................... 18 1.10 Vehicle flags .............................................. 18 1.11 Swimming flags ............................................ 19 1.12 Railway flags .............................................. 20 1.13 Flagpoles ............................................... 21 1.13.1 Record heights ........................................ 21 1.13.2 Design ............................................. 21 1.14 Hoisting the flag ............................................ 21 1.15 Flags and communication ....................................... 21 1.16 Flapping ................................................ 23 1.17 See also ............................................... -
Taha Yasseri
Curriculum Vitae TAHA YASSERI Personal Address Birth: 06. Sep. 1984 (27), Tehran Department of Theoretical Physics Nationality: Iranian Budapest University of Technology and Economics Marital status: single Budafoki ´ut8. http://www.phy.bme.hu/∼yasseri H1111 Budapest, Hungary CURRENT POSITION Post-doctoral fellow in European Commission project ICTeCollective, Research Group of Prof. J´anos Kert´esz. Department of Theoretical Physics, Budapest University of Technology and Economics, Budapest, Hungary. Since September 2010 WORK EXPERIENCES • Assistant Scientist, Institute for theoretical Physics, Georg-August-Universit¨at G¨ottingen,Germany. January 2009 - July 2010 • Researcher, SFB 602, Complex Structures in Condensed Matter from Atomic to Mesoscopic Scales, Georg-August-Universit¨atG¨ottingen,Germany. January 2007 - January 2009 • Member of Scientific Committee of the First Step to Nobel Prize in Physics National Competition, Young Scholars Club, Tehran, Iran. November 2003 - March 2005 • Teacher of Physics, Farzanegan and Allame Helli High Schools under the supervision of National Or- ganization for Developing Exceptional Talents, Tehran, Iran. September 2002 - April 2004 • Teacher of Physics Laboratory, Young Scholars Club, Tehran, Iran. April 2002 - September 2005 EDUCATION PhD, Physics, Faculty of Physics, Georg-August-Universit¨at G¨ottingen,Germany. January 2007 - January 2010 TOPIC: Pattern formation at ion-sputtered surfaces. (Adviser: Prof. Dr. R. Kree, Co-adviser: Prof. Dr. A.K. Hartmann) Secondary topics: Computational Neuroscience and Physics of Polymers. Master of Science, Physics, Department of Physics, Sharif University of Technology Tehran, Iran. September 2005 - September 2006 THESIS - Localization in scale free networks. (Adviser: Prof. Dr. M. Reza Rahimi Tabar) Bachelor of Science, Physics, Department of Physics, Sharif University of Technology Tehran, Iran.