Doctoral Thesis
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Philippine Election ; PDF Copied from The
Senatorial Candidates’ Matrices Philippine Election 2010 Name: Nereus “Neric” O. Acosta Jr. Political Party: Liberal Party Agenda Public Service Professional Record Four Pillar Platform: Environment Representative, 1st District of Bukidnon – 1998-2001, 2001-2004, Livelihood 2004-2007 Justice Provincial Board Member, Bukidnon – 1995-1998 Peace Project Director, Bukidnon Integrated Network of Home Industries, Inc. (BINHI) – 1995 seek more decentralization of power and resources to local Staff Researcher, Committee on International Economic Policy of communities and governments (with corresponding performance Representative Ramon Bagatsing – 1989 audits and accountability mechanisms) Academician, Political Scientist greater fiscal discipline in the management and utilization of resources (budget reform, bureaucratic streamlining for prioritization and improved efficiencies) more effective delivery of basic services by agencies of government. Website: www.nericacosta2010.com TRACK RECORD On Asset Reform and CARPER -supports the claims of the Sumilao farmers to their right to the land under the agrarian reform program -was Project Director of BINHI, a rural development NGO, specifically its project on Grameen Banking or microcredit and livelihood assistance programs for poor women in the Bukidnon countryside called the On Social Services and Safety Barangay Unified Livelihood Investments through Grameen Banking or BULIG Nets -to date, the BULIG project has grown to serve over 7,000 women in 150 barangays or villages in Bukidnon, -
Wikipedia and Intermediary Immunity: Supporting Sturdy Crowd Systems for Producing Reliable Information Jacob Rogers Abstract
THE YALE LAW JOURNAL FORUM O CTOBER 9 , 2017 Wikipedia and Intermediary Immunity: Supporting Sturdy Crowd Systems for Producing Reliable Information Jacob Rogers abstract. The problem of fake news impacts a massive online ecosystem of individuals and organizations creating, sharing, and disseminating content around the world. One effective ap- proach to addressing false information lies in monitoring such information through an active, engaged volunteer community. Wikipedia, as one of the largest online volunteer contributor communities, presents one example of this approach. This Essay argues that the existing legal framework protecting intermediary companies in the United States empowers the Wikipedia community to ensure that information is accurate and well-sourced. The Essay further argues that current legal efforts to weaken these protections, in response to the “fake news” problem, are likely to create perverse incentives that will harm volunteer engagement and confuse the public. Finally, the Essay offers suggestions for other intermediaries beyond Wikipedia to help monitor their content through user community engagement. introduction Wikipedia is well-known as a free online encyclopedia that covers nearly any topic, including both the popular and the incredibly obscure. It is also an encyclopedia that anyone can edit, an example of one of the largest crowd- sourced, user-generated content websites in the world. This user-generated model is supported by the Wikimedia Foundation, which relies on the robust intermediary liability immunity framework of U.S. law to allow the volunteer editor community to work independently. Volunteer engagement on Wikipedia provides an effective framework for combating fake news and false infor- mation. 358 wikipedia and intermediary immunity: supporting sturdy crowd systems for producing reliable information It is perhaps surprising that a project open to public editing could be highly reliable. -
Modeling Popularity and Reliability of Sources in Multilingual Wikipedia
information Article Modeling Popularity and Reliability of Sources in Multilingual Wikipedia Włodzimierz Lewoniewski * , Krzysztof W˛ecel and Witold Abramowicz Department of Information Systems, Pozna´nUniversity of Economics and Business, 61-875 Pozna´n,Poland; [email protected] (K.W.); [email protected] (W.A.) * Correspondence: [email protected] Received: 31 March 2020; Accepted: 7 May 2020; Published: 13 May 2020 Abstract: One of the most important factors impacting quality of content in Wikipedia is presence of reliable sources. By following references, readers can verify facts or find more details about described topic. A Wikipedia article can be edited independently in any of over 300 languages, even by anonymous users, therefore information about the same topic may be inconsistent. This also applies to use of references in different language versions of a particular article, so the same statement can have different sources. In this paper we analyzed over 40 million articles from the 55 most developed language versions of Wikipedia to extract information about over 200 million references and find the most popular and reliable sources. We presented 10 models for the assessment of the popularity and reliability of the sources based on analysis of meta information about the references in Wikipedia articles, page views and authors of the articles. Using DBpedia and Wikidata we automatically identified the alignment of the sources to a specific domain. Additionally, we analyzed the changes of popularity and reliability in time and identified growth leaders in each of the considered months. The results can be used for quality improvements of the content in different languages versions of Wikipedia. -
Letter to SUBTEL Re Wikipedia Zero
Letter to SUBTEL re Wikipedia Zero To the Undersecretariat of Telecommunications: Mr. Pedro Huichalaf, We write to you to ask for clarification regarding the official circular letter No. 40 /DAP 13221 /F51, issued on April 14, 2014. In particular, we would like to clarify that this order does not apply to providing free mobile access to educational resources. In developing nations across the globe, the Wikimedia Foundation has partnered with mobile network operators interested in expanding their philanthropic operations to provide mobile access to Wikipedia without data charges through a project called Wikipedia Zero. Chile is an ideal country for Wikipedia Zero because it has a great need for free knowledge and a high mobile penetration in urban and rural areas, providing the perfect setting to deploy the program. “Imagine a world in which every single human being can freely share in the sum of all knowledge” – that is the vision statement that guides the Wikimedia Foundation, the nonprofit organization behind Wikipedia. As the largest and most popular online encyclopedia in the world, Wikipedia has more than 30 million volunteerauthored articles in over 287 languages (including Spanish), and is visited by more than 490 million people every month, making it the largest collection of shared knowledge in human history. All the content on Wikipedia is provided under a Creative Commons license to encourage anyone to freely reuse and contribute to the content. That is why Wikipedia content can now be found in multiple third party applications and websites, like the Google Knowledge Graph. Our mission is to empower a global volunteer community to collect and develop the world's knowledge and to make it available to everyone for free. -
Omnipedia: Bridging the Wikipedia Language
Omnipedia: Bridging the Wikipedia Language Gap Patti Bao*†, Brent Hecht†, Samuel Carton†, Mahmood Quaderi†, Michael Horn†§, Darren Gergle*† *Communication Studies, †Electrical Engineering & Computer Science, §Learning Sciences Northwestern University {patti,brent,sam.carton,quaderi}@u.northwestern.edu, {michael-horn,dgergle}@northwestern.edu ABSTRACT language edition contains its own cultural viewpoints on a We present Omnipedia, a system that allows Wikipedia large number of topics [7, 14, 15, 27]. On the other hand, readers to gain insight from up to 25 language editions of the language barrier serves to silo knowledge [2, 4, 33], Wikipedia simultaneously. Omnipedia highlights the slowing the transfer of less culturally imbued information similarities and differences that exist among Wikipedia between language editions and preventing Wikipedia’s 422 language editions, and makes salient information that is million monthly visitors [12] from accessing most of the unique to each language as well as that which is shared information on the site. more widely. We detail solutions to numerous front-end and algorithmic challenges inherent to providing users with In this paper, we present Omnipedia, a system that attempts a multilingual Wikipedia experience. These include to remedy this situation at a large scale. It reduces the silo visualizing content in a language-neutral way and aligning effect by providing users with structured access in their data in the face of diverse information organization native language to over 7.5 million concepts from up to 25 strategies. We present a study of Omnipedia that language editions of Wikipedia. At the same time, it characterizes how people interact with information using a highlights similarities and differences between each of the multilingual lens. -
Title of Thesis: ABSTRACT CLASSIFYING BIAS
ABSTRACT Title of Thesis: CLASSIFYING BIAS IN LARGE MULTILINGUAL CORPORA VIA CROWDSOURCING AND TOPIC MODELING Team BIASES: Brianna Caljean, Katherine Calvert, Ashley Chang, Elliot Frank, Rosana Garay Jáuregui, Geoffrey Palo, Ryan Rinker, Gareth Weakly, Nicolette Wolfrey, William Zhang Thesis Directed By: Dr. David Zajic, Ph.D. Our project extends previous algorithmic approaches to finding bias in large text corpora. We used multilingual topic modeling to examine language-specific bias in the English, Spanish, and Russian versions of Wikipedia. In particular, we placed Spanish articles discussing the Cold War on a Russian-English viewpoint spectrum based on similarity in topic distribution. We then crowdsourced human annotations of Spanish Wikipedia articles for comparison to the topic model. Our hypothesis was that human annotators and topic modeling algorithms would provide correlated results for bias. However, that was not the case. Our annotators indicated that humans were more perceptive of sentiment in article text than topic distribution, which suggests that our classifier provides a different perspective on a text’s bias. CLASSIFYING BIAS IN LARGE MULTILINGUAL CORPORA VIA CROWDSOURCING AND TOPIC MODELING by Team BIASES: Brianna Caljean, Katherine Calvert, Ashley Chang, Elliot Frank, Rosana Garay Jáuregui, Geoffrey Palo, Ryan Rinker, Gareth Weakly, Nicolette Wolfrey, William Zhang Thesis submitted in partial fulfillment of the requirements of the Gemstone Honors Program, University of Maryland, 2018 Advisory Committee: Dr. David Zajic, Chair Dr. Brian Butler Dr. Marine Carpuat Dr. Melanie Kill Dr. Philip Resnik Mr. Ed Summers © Copyright by Team BIASES: Brianna Caljean, Katherine Calvert, Ashley Chang, Elliot Frank, Rosana Garay Jáuregui, Geoffrey Palo, Ryan Rinker, Gareth Weakly, Nicolette Wolfrey, William Zhang 2018 Acknowledgements We would like to express our sincerest gratitude to our mentor, Dr. -
Librarians As Wikimedia Movement Organizers in Spain: an Interpretive Inquiry Exploring Activities and Motivations
Librarians as Wikimedia Movement Organizers in Spain: An interpretive inquiry exploring activities and motivations by Laurie Bridges and Clara Llebot Laurie Bridges Oregon State University https://orcid.org/0000-0002-2765-5440 Clara Llebot Oregon State University https://orcid.org/0000-0003-3211-7396 Citation: Bridges, L., & Llebot, C. (2021). Librarians as Wikimedia Movement Organizers in Spain: An interpretive inQuiry exploring activities and motivations. First Monday, 26(6/7). https://doi.org/10.5210/fm.v26i3.11482 Abstract How do librarians in Spain engage with Wikipedia (and Wikidata, Wikisource, and other Wikipedia sister projects) as Wikimedia Movement Organizers? And, what motivates them to do so? This article reports on findings from 14 interviews with 18 librarians. The librarians interviewed were multilingual and contributed to Wikimedia projects in Castilian (commonly referred to as Spanish), Catalan, BasQue, English, and other European languages. They reported planning and running Wikipedia events, developing partnerships with local Wikimedia chapters, motivating citizens to upload photos to Wikimedia Commons, identifying gaps in Wikipedia content and filling those gaps, transcribing historic documents and adding them to Wikisource, and contributing data to Wikidata. Most were motivated by their desire to preserve and promote regional languages and culture, and a commitment to open access and open education. Introduction This research started with an informal conversation in 2018 about the popularity of Catalan Wikipedia, Viquipèdia, between two library coworkers in the United States, the authors of this article. Our conversation began with a sense of wonder about Catalan Wikipedia, which is ranked twentieth by number of articles, out of 300 different language Wikipedias (Meta contributors, 2020). -
Collaboration: Interoperability Between People in the Creation of Language Resources for Less-Resourced Languages a SALTMIL Workshop May 27, 2008 Workshop Programme
Collaboration: interoperability between people in the creation of language resources for less-resourced languages A SALTMIL workshop May 27, 2008 Workshop Programme 14:30-14:35 Welcome 14:35-15:05 Icelandic Language Technology Ten Years Later Eiríkur Rögnvaldsson 15:05-15:35 Human Language Technology Resources for Less Commonly Taught Languages: Lessons Learned Toward Creation of Basic Language Resources Heather Simpson, Christopher Cieri, Kazuaki Maeda, Kathryn Baker, and Boyan Onyshkevych 15:35-16:05 Building resources for African languages Karel Pala, Sonja Bosch, and Christiane Fellbaum 16:05-16:45 COFFEE BREAK 16:45-17:15 Extracting bilingual word pairs from Wikipedia Francis M. Tyers and Jacques A. Pienaar 17:15-17:45 Building a Basque/Spanish bilingual database for speaker verification Iker Luengo, Eva Navas, Iñaki Sainz, Ibon Saratxaga, Jon Sanchez, Igor Odriozola, Juan J. Igarza and Inma Hernaez 17:45-18:15 Language resources for Uralic minority languages Attila Novák 18:15-18:45 Eslema. Towards a Corpus for Asturian Xulio Viejo, Roser Saurí and Angel Neira 18:45-19:00 Annual General Meeting of the SALTMIL SIG i Workshop Organisers Briony Williams Language Technologies Unit Bangor University, Wales, UK Mikel L. Forcada Departament de Llenguatges i Sistemes Informàtics Universitat d'Alacant, Spain Kepa Sarasola Lengoaia eta Sistema Informatikoak Saila Euskal Herriko Unibertsitatea / University of the Basque Country SALTMIL Speech and Language Technologies for Minority Languages A SIG of the International Speech Communication Association -
Multilingual Ranking of Wikipedia Articles with Quality and Popularity Assessment in Different Topics
computers Article Multilingual Ranking of Wikipedia Articles with Quality and Popularity Assessment in Different Topics Włodzimierz Lewoniewski * , Krzysztof W˛ecel and Witold Abramowicz Department of Information Systems, Pozna´nUniversity of Economics and Business, 61-875 Pozna´n,Poland * Correspondence: [email protected]; Tel.: +48-(61)-639-27-93 Received: 10 May 2019; Accepted: 13 August 2019; Published: 14 August 2019 Abstract: On Wikipedia, articles about various topics can be created and edited independently in each language version. Therefore, the quality of information about the same topic depends on the language. Any interested user can improve an article and that improvement may depend on the popularity of the article. The goal of this study is to show what topics are best represented in different language versions of Wikipedia using results of quality assessment for over 39 million articles in 55 languages. In this paper, we also analyze how popular selected topics are among readers and authors in various languages. We used two approaches to assign articles to various topics. First, we selected 27 main multilingual categories and analyzed all their connections with sub-categories based on information extracted from over 10 million categories in 55 language versions. To classify the articles to one of the 27 main categories, we took into account over 400 million links from articles to over 10 million categories and over 26 million links between categories. In the second approach, we used data from DBpedia and Wikidata. We also showed how the results of the study can be used to build local and global rankings of the Wikipedia content. -
Embracing Wikipedia As a Teaching and Learning Tool Benefits Health Professional Schools and the Populations They Serve
2017 Embracing Wikipedia as a teaching and learning tool benefits health professional schools and the populations they serve Author schools’ local service missions, suggesting that embracing Wikipedia as a teaching and learning Amin Azzam1* tool for tomorrow’s health professionals may be globally generalizable. A network of health Abstract professional schools and students contributing to Wikipedia would accelerate fulfillment of Wikipedia’s To paraphrase Wikipedia cofounder Jimmy Wales, audacious aspirational goal—providing every single “Imagine a world where all people have access person on the planet free access to the sum of all to high quality health information clearly written human knowledge. in their own language.” Most health professional students likely endorse that goal, as do individuals Keywords who volunteer to contribute to Wikipedia’s health- related content. Bringing these two communities medical education; medical communication; together inspired our efforts: a course for medical Wikipedia students to earn academic credit for improving Wikipedia. Here I describe the evolution of that Introduction course between 2013 – 2017, during which 80 students completed the course. Collectively they “Imagine a world in which every single person on the edited 65 pages, adding over 93,100 words and planet is given free access to the sum of all human 608 references. Impressively, these 65 Wikipedia knowledge. That’s what we’re doing.”1 pages were viewed 1,825,057 times during only the students’ active editing days. The students’ Some might consider this audacious statement a efforts were in partnership with communities naïve dreamer’s fantasy. Yet even at 16 years old, outside of academia—namely Wikiproject Medicine, Wikipedia continues to rank amongst the top 10 most 2 Translators Without Borders, and Wikipedia Zero. -
Why Medical Schools Should Embrace Wikipedia
Innovation Report Why Medical Schools Should Embrace Wikipedia: Final-Year Medical Student Contributions to Wikipedia Articles for Academic Credit at One School Amin Azzam, MD, MA, David Bresler, MD, MA, Armando Leon, MD, Lauren Maggio, PhD, Evans Whitaker, MD, MLIS, James Heilman, MD, Jake Orlowitz, Valerie Swisher, Lane Rasberry, Kingsley Otoide, Fred Trotter, Will Ross, and Jack D. McCue, MD Abstract Problem course on student participants, and improved their articles, enjoyed giving Most medical students use Wikipedia readership of students’ chosen articles. back “specifically to Wikipedia,” and as an information source, yet medical broadened their sense of physician schools do not train students to improve Outcomes responsibilities in the socially networked Wikipedia or use it critically. Forty-three enrolled students made information era. During only the “active 1,528 edits (average 36/student), editing months,” Wikipedia traffic Approach contributing 493,994 content bytes statistics indicate that the 43 articles Between November 2013 and November (average 11,488/student). They added were collectively viewed 1,116,065 2015, the authors offered fourth-year higher-quality and removed lower- times. Subsequent to students’ efforts, medical students a credit-bearing course quality sources for a net addition of these articles have been viewed nearly to edit Wikipedia. The course was 274 references (average 6/student). As 22 million times. designed, delivered, and evaluated by of July 2016, none of the contributions faculty, medical librarians, and personnel of the first 28 students (2013, 2014) Next Steps from WikiProject Medicine, Wikipedia have been reversed or vandalized. If other schools replicate and improve Education Foundation, and Translators Students discovered a tension between on this initiative, future multi-institution Without Borders. -
The Case of Croatian Wikipedia: Encyclopaedia of Knowledge Or Encyclopaedia for the Nation?
The Case of Croatian Wikipedia: Encyclopaedia of Knowledge or Encyclopaedia for the Nation? 1 Authorial statement: This report represents the evaluation of the Croatian disinformation case by an external expert on the subject matter, who after conducting a thorough analysis of the Croatian community setting, provides three recommendations to address the ongoing challenges. The views and opinions expressed in this report are those of the author and do not necessarily reflect the official policy or position of the Wikimedia Foundation. The Wikimedia Foundation is publishing the report for transparency. Executive Summary Croatian Wikipedia (Hr.WP) has been struggling with content and conduct-related challenges, causing repeated concerns in the global volunteer community for more than a decade. With support of the Wikimedia Foundation Board of Trustees, the Foundation retained an external expert to evaluate the challenges faced by the project. The evaluation, conducted between February and May 2021, sought to assess whether there have been organized attempts to introduce disinformation into Croatian Wikipedia and whether the project has been captured by ideologically driven users who are structurally misaligned with Wikipedia’s five pillars guiding the traditional editorial project setup of the Wikipedia projects. Croatian Wikipedia represents the Croatian standard variant of the Serbo-Croatian language. Unlike other pluricentric Wikipedia language projects, such as English, French, German, and Spanish, Serbo-Croatian Wikipedia’s community was split up into Croatian, Bosnian, Serbian, and the original Serbo-Croatian wikis starting in 2003. The report concludes that this structure enabled local language communities to sort by points of view on each project, often falling along political party lines in the respective regions.