[ [Conference Book
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Improving Wikimedia Projects Content Through Collaborations Waray Wikipedia Experience, 2014 - 2017
Improving Wikimedia Projects Content through Collaborations Waray Wikipedia Experience, 2014 - 2017 Harvey Fiji • Michael Glen U. Ong Jojit F. Ballesteros • Bel B. Ballesteros ESEAP Conference 2018 • 5 – 6 May 2018 • Bali, Indonesia In 8 November 2013, typhoon Haiyan devastated the Eastern Visayas region of the Philippines when it made landfall in Guiuan, Eastern Samar and Tolosa, Leyte. The typhoon affected about 16 million individuals in the Philippines and Tacloban City in Leyte was one of [1] the worst affected areas. Philippines Eastern Visayas Eastern Visayas, specifically the provinces of Biliran, Leyte, Northern Samar, Samar and Eastern Samar, is home for the Waray speakers in the Philippines. [3] [2] Outline of the Presentation I. Background of Waray Wikipedia II. Collaborations made by Sinirangan Bisaya Wikimedia Community III. Problems encountered IV. Lessons learned V. Future plans References Photo Credits I. Background of Waray Wikipedia (https://war.wikipedia.org) Proposed on or about June 23, 2005 and created on or about September 24, 2005 Deployed lsjbot from Feb 2013 to Nov 2015 creating 1,143,071 Waray articles about flora and fauna As of 24 April 2018, it has a total of 1,262,945 articles created by lsjbot (90.5%) and by humans (9.5%) As of 31 March 2018, it has 401 views per hour Sinirangan Bisaya (Eastern Visayas) Wikimedia Community is the (offline) community that continuously improves Waray Wikipedia and related Wikimedia projects I. Background of Waray Wikipedia (https://war.wikipedia.org) II. Collaborations made by Sinirangan Bisaya Wikimedia Community A. Collaborations with private* and national government** Introductory letter organizations Series of meetings and communications B. -
Christians and Jews in Muslim Societies
Arabic and its Alternatives Christians and Jews in Muslim Societies Editorial Board Phillip Ackerman-Lieberman (Vanderbilt University, Nashville, USA) Bernard Heyberger (EHESS, Paris, France) VOLUME 5 The titles published in this series are listed at brill.com/cjms Arabic and its Alternatives Religious Minorities and Their Languages in the Emerging Nation States of the Middle East (1920–1950) Edited by Heleen Murre-van den Berg Karène Sanchez Summerer Tijmen C. Baarda LEIDEN | BOSTON Cover illustration: Assyrian School of Mosul, 1920s–1930s; courtesy Dr. Robin Beth Shamuel, Iraq. This is an open access title distributed under the terms of the CC BY-NC 4.0 license, which permits any non-commercial use, distribution, and reproduction in any medium, provided no alterations are made and the original author(s) and source are credited. Further information and the complete license text can be found at https://creativecommons.org/licenses/by-nc/4.0/ The terms of the CC license apply only to the original material. The use of material from other sources (indicated by a reference) such as diagrams, illustrations, photos and text samples may require further permission from the respective copyright holder. Library of Congress Cataloging-in-Publication Data Names: Murre-van den Berg, H. L. (Hendrika Lena), 1964– illustrator. | Sanchez-Summerer, Karene, editor. | Baarda, Tijmen C., editor. Title: Arabic and its alternatives : religious minorities and their languages in the emerging nation states of the Middle East (1920–1950) / edited by Heleen Murre-van den Berg, Karène Sanchez, Tijmen C. Baarda. Description: Leiden ; Boston : Brill, 2020. | Series: Christians and Jews in Muslim societies, 2212–5523 ; vol. -
Modeling Popularity and Reliability of Sources in Multilingual Wikipedia
information Article Modeling Popularity and Reliability of Sources in Multilingual Wikipedia Włodzimierz Lewoniewski * , Krzysztof W˛ecel and Witold Abramowicz Department of Information Systems, Pozna´nUniversity of Economics and Business, 61-875 Pozna´n,Poland; [email protected] (K.W.); [email protected] (W.A.) * Correspondence: [email protected] Received: 31 March 2020; Accepted: 7 May 2020; Published: 13 May 2020 Abstract: One of the most important factors impacting quality of content in Wikipedia is presence of reliable sources. By following references, readers can verify facts or find more details about described topic. A Wikipedia article can be edited independently in any of over 300 languages, even by anonymous users, therefore information about the same topic may be inconsistent. This also applies to use of references in different language versions of a particular article, so the same statement can have different sources. In this paper we analyzed over 40 million articles from the 55 most developed language versions of Wikipedia to extract information about over 200 million references and find the most popular and reliable sources. We presented 10 models for the assessment of the popularity and reliability of the sources based on analysis of meta information about the references in Wikipedia articles, page views and authors of the articles. Using DBpedia and Wikidata we automatically identified the alignment of the sources to a specific domain. Additionally, we analyzed the changes of popularity and reliability in time and identified growth leaders in each of the considered months. The results can be used for quality improvements of the content in different languages versions of Wikipedia. -
Omnipedia: Bridging the Wikipedia Language
Omnipedia: Bridging the Wikipedia Language Gap Patti Bao*†, Brent Hecht†, Samuel Carton†, Mahmood Quaderi†, Michael Horn†§, Darren Gergle*† *Communication Studies, †Electrical Engineering & Computer Science, §Learning Sciences Northwestern University {patti,brent,sam.carton,quaderi}@u.northwestern.edu, {michael-horn,dgergle}@northwestern.edu ABSTRACT language edition contains its own cultural viewpoints on a We present Omnipedia, a system that allows Wikipedia large number of topics [7, 14, 15, 27]. On the other hand, readers to gain insight from up to 25 language editions of the language barrier serves to silo knowledge [2, 4, 33], Wikipedia simultaneously. Omnipedia highlights the slowing the transfer of less culturally imbued information similarities and differences that exist among Wikipedia between language editions and preventing Wikipedia’s 422 language editions, and makes salient information that is million monthly visitors [12] from accessing most of the unique to each language as well as that which is shared information on the site. more widely. We detail solutions to numerous front-end and algorithmic challenges inherent to providing users with In this paper, we present Omnipedia, a system that attempts a multilingual Wikipedia experience. These include to remedy this situation at a large scale. It reduces the silo visualizing content in a language-neutral way and aligning effect by providing users with structured access in their data in the face of diverse information organization native language to over 7.5 million concepts from up to 25 strategies. We present a study of Omnipedia that language editions of Wikipedia. At the same time, it characterizes how people interact with information using a highlights similarities and differences between each of the multilingual lens. -
Title of Thesis: ABSTRACT CLASSIFYING BIAS
ABSTRACT Title of Thesis: CLASSIFYING BIAS IN LARGE MULTILINGUAL CORPORA VIA CROWDSOURCING AND TOPIC MODELING Team BIASES: Brianna Caljean, Katherine Calvert, Ashley Chang, Elliot Frank, Rosana Garay Jáuregui, Geoffrey Palo, Ryan Rinker, Gareth Weakly, Nicolette Wolfrey, William Zhang Thesis Directed By: Dr. David Zajic, Ph.D. Our project extends previous algorithmic approaches to finding bias in large text corpora. We used multilingual topic modeling to examine language-specific bias in the English, Spanish, and Russian versions of Wikipedia. In particular, we placed Spanish articles discussing the Cold War on a Russian-English viewpoint spectrum based on similarity in topic distribution. We then crowdsourced human annotations of Spanish Wikipedia articles for comparison to the topic model. Our hypothesis was that human annotators and topic modeling algorithms would provide correlated results for bias. However, that was not the case. Our annotators indicated that humans were more perceptive of sentiment in article text than topic distribution, which suggests that our classifier provides a different perspective on a text’s bias. CLASSIFYING BIAS IN LARGE MULTILINGUAL CORPORA VIA CROWDSOURCING AND TOPIC MODELING by Team BIASES: Brianna Caljean, Katherine Calvert, Ashley Chang, Elliot Frank, Rosana Garay Jáuregui, Geoffrey Palo, Ryan Rinker, Gareth Weakly, Nicolette Wolfrey, William Zhang Thesis submitted in partial fulfillment of the requirements of the Gemstone Honors Program, University of Maryland, 2018 Advisory Committee: Dr. David Zajic, Chair Dr. Brian Butler Dr. Marine Carpuat Dr. Melanie Kill Dr. Philip Resnik Mr. Ed Summers © Copyright by Team BIASES: Brianna Caljean, Katherine Calvert, Ashley Chang, Elliot Frank, Rosana Garay Jáuregui, Geoffrey Palo, Ryan Rinker, Gareth Weakly, Nicolette Wolfrey, William Zhang 2018 Acknowledgements We would like to express our sincerest gratitude to our mentor, Dr. -
Feedforward Approach to Sequential Morphological Analysis in the Tagalog Language
IALP 2020, Kuala Lumpur, Dec 4-6, 2020 Feedforward Approach to Sequential Morphological Analysis in the Tagalog Language Arian N. Yambao1,2 and Charibeth K. Cheng1,2 1College of Computer Studies, De La Salle University, Manila, Philippines 2Senti Techlabs Inc., Manila, Philippines arian_yambao, charibeth.cheng @dlsu.edu.ph, arian.yambao, chari.cheng @senti.com.ph { } { } Abstract—Morphological analysis is a preprocessing task used the affix class of a given verb. This can be shown with the to normalize and get the grammatical information of a word root words starting with a consonant like ’baba’ (’down’ in which may also be referred to as lemmatization. Several NLP English), a prefix ba- added to the word makes it ’bababa’ tasks use this preprocessing technique to improve their im- plementations in different languages, however languages like (”going down” in English). And for words that start with a Tagalog, are considered to be morphosyntactically rich for having vowel, like ’asa’ (’hope’ or ’wish’ in English), an infix -um- a diverse number of morphological phenomena. This paper is added and the initial letter of the word is reduplicated in presents a Tagalog morphological analyzer implemented using a between to form the word ’umaasa’ (’hoping in English) [3] Feed Forward Neural Networks model (FFNN). We transformed [4]. our data to its character-level representation and used the model to identify its capability of learning the language’s inflections. We There have been efforts in MA that are mainly focused compared our MA to previous rule-based Tagalog MA works and on Tagalog includes the works of De Guzman [5], Nelson saw an improved accuracy of 0.93. -
An Analysis of Contributions to Wikipedia from Tor
Are anonymity-seekers just like everybody else? An analysis of contributions to Wikipedia from Tor Chau Tran Kaylea Champion Andrea Forte Department of Computer Science & Engineering Department of Communication College of Computing & Informatics New York University University of Washington Drexel University New York, USA Seatle, USA Philadelphia, USA [email protected] [email protected] [email protected] Benjamin Mako Hill Rachel Greenstadt Department of Communication Department of Computer Science & Engineering University of Washington New York University Seatle, USA New York, USA [email protected] [email protected] Abstract—User-generated content sites routinely block contri- butions from users of privacy-enhancing proxies like Tor because of a perception that proxies are a source of vandalism, spam, and abuse. Although these blocks might be effective, collateral damage in the form of unrealized valuable contributions from anonymity seekers is invisible. One of the largest and most important user-generated content sites, Wikipedia, has attempted to block contributions from Tor users since as early as 2005. We demonstrate that these blocks have been imperfect and that thousands of attempts to edit on Wikipedia through Tor have been successful. We draw upon several data sources and analytical techniques to measure and describe the history of Tor editing on Wikipedia over time and to compare contributions from Tor users to those from other groups of Wikipedia users. Fig. 1. Screenshot of the page a user is shown when they attempt to edit the Our analysis suggests that although Tor users who slip through Wikipedia article on “Privacy” while using Tor. Wikipedia’s ban contribute content that is more likely to be reverted and to revert others, their contributions are otherwise similar in quality to those from other unregistered participants and to the initial contributions of registered users. -
(Anti-)Semitic” and “(Anti-) Zionist” in Modern Middle Eastern Discourse
On the use of the terms “(anti-)Semitic” and “(anti-) Zionist” in modern Middle Eastern discourse Lutz Edzard1 University of Oslo Abstract Just as political and cultural discourse in general, modern Middle Eastern discourse is at times characterized by a great deal of hostility, not only between different states or religious denominations, but also state-internally among various ethnic, political, or religious groups. This short article focuses on the use of the attributes “Semitic” and “Zionist,” as well as their negative counterparts “anti-Semitic” and “anti-Zionist,” respectively, in examples of both Arabic and Israeli critical to hostile discourse. The focus of the discussion will lie on how the original meanings of these terms, especially in their negated forms, tend to be distorted in engaged political and cultural discourse. Keywords: Semitic, Anti-Semitic, Zionist, lā-sāmī, ṣahyūnī, ʾanṭi-šemi, ṣiyyōn The term “Semitic” Brief overview of the term “Semitic” As is well known, the term “Semitic” derives from the name of one of the sons of Noah, Shem,2 and was suggested for the language family in question by the encyclopedic German historian and polymath August Ludwig von Schlözer in 1781. 3 In linguistics context, the term “Semitic” is generally speaking non-controversial. Together with Ancient Egyptian, as well as the language families Berber, Cushitic, Chadic, and possibly Omotic, the Semitic language family is part of the larger Afroasiatic macrofamily (formerly also referred to as “Hamito-Semitic”).4 This usage of the term “Semitic” must be kept apart from the usage of the term in the compound adjective “anti-Semitic,” a term only coined in 1879 in a pamphlet by the journalist Wilhelm Marr (if not already in 1860 by the bibliographer and Orientalist Moritz Steinschneider), referring to prejudices against or hatred of Jews. -
The Waray Wikipedia Experience 族語傳遞資訊 瓦萊語維
他者から自らを省みる 移鏡借鑑 Quotable Experiences Disseminating Information Using an Indigenous Language – the Waray Wikipedia Experience 族語傳遞資訊──瓦萊語維基百科經驗談 Disseminating Information Using an Indigenous Language – the Waray Wikipedia Experience 族語傳遞資訊──瓦萊語維基百科經驗談 民族語で情報伝達―ワライ語ウィキペディア経験談 Disseminating Information Using an Indigenous Language – the Waray Wikipedia Experience 文‧圖︱Joseph F. Ballesteros, Belinda T. Bernas-Ballesteros, Michael Glen U. Ong 譯者︱陳穎柔 A forum discussing Waray Wikipedia at University of the Philippines Visayas Tacloban College at Tacloban City, Leyte, Philippines on 21 November 2014 with students and professors. [Photo by JinJian - Own work, CC BY-SA 4.0] 2014年11月21日一場與學生及教授討論瓦萊語維基百科的論壇,於菲律賓禮智省獨魯萬市的菲律賓米沙鄢大學獨魯萬學院。 of October 2019, there are 175 living 2019年10月,菲律賓尚存 時至 種原住民族語言,使 As indigenous languages in the Philippines. One 175 用人口 萬的瓦萊瓦萊語(簡稱瓦 of these is the Waray-waray or simply Waray which is 260 used in one of the Philippine-language based editions 維基百科,是菲律賓眾語版本維基百科 萊語)為其一,主要分布於菲律賓薩 spoken by 2.6 million Filipinos mostly in the Samar of Wikipedia – the Waray Wikipedia. Wikipedia is a 之一。維基百科是人人皆可使用、發布 馬島及禮智島。瓦萊語有自己版本的 and Leyte islands in the Philippines. This language is free and online encyclopedia that anyone can use, 及編輯的免費線上百科全書。瓦萊語維 distribute and edit. In Waray Wikipedia, information 基百科由自願者以瓦萊語文撰寫資訊, is written in the Waray language by volunteers, from 秉持中性觀點,資訊來源獨立、可靠亦 a neutral point of view and obtained from 可查核;正如任一語言版本的維基百 independent, reliable and verifiable sources. Just like 科,瓦萊語維基百科無須網際網路連線 any language -
Forgotten Palestinians
1 2 3 4 5 6 7 8 9 THE FORGOTTEN PALESTINIANS 10 1 2 3 4 5 6x 7 8 9 20 1 2 3 4 5 6 7 8 9 30 1 2 3 4 5 36x 1 2 3 4 5 6 7 8 9 10 1 2 3 4 5 6 7 8 9 20 1 2 3 4 5 6 7 8 9 30 1 2 3 4 5 36x 1 2 3 4 5 THE FORGOTTEN 6 PALESTINIANS 7 8 A History of the Palestinians in Israel 9 10 1 2 3 Ilan Pappé 4 5 6x 7 8 9 20 1 2 3 4 5 6 7 8 9 30 1 2 3 4 YALE UNIVERSITY PRESS 5 NEW HAVEN AND LONDON 36x 1 In memory of the thirteen Palestinian citizens who were shot dead by the 2 Israeli police in October 2000 3 4 5 6 7 8 9 10 1 2 3 4 5 Copyright © 2011 Ilan Pappé 6 The right of Ilan Pappé to be identified as author of this work has been asserted by 7 him in accordance with the Copyright, Designs and Patents Act 1988. 8 All rights reserved. This book may not be reproduced in whole or in part, in any form (beyond that copying permitted by Sections 107 and 108 of the U.S. Copyright 9 Law and except by reviewers for the public press) without written permission from 20 the publishers. 1 For information about this and other Yale University Press publications, 2 please contact: U.S. -
Flags and Banners
Flags and Banners A Wikipedia Compilation by Michael A. Linton Contents 1 Flag 1 1.1 History ................................................. 2 1.2 National flags ............................................. 4 1.2.1 Civil flags ........................................... 8 1.2.2 War flags ........................................... 8 1.2.3 International flags ....................................... 8 1.3 At sea ................................................. 8 1.4 Shapes and designs .......................................... 9 1.4.1 Vertical flags ......................................... 12 1.5 Religious flags ............................................. 13 1.6 Linguistic flags ............................................. 13 1.7 In sports ................................................ 16 1.8 Diplomatic flags ............................................ 18 1.9 In politics ............................................... 18 1.10 Vehicle flags .............................................. 18 1.11 Swimming flags ............................................ 19 1.12 Railway flags .............................................. 20 1.13 Flagpoles ............................................... 21 1.13.1 Record heights ........................................ 21 1.13.2 Design ............................................. 21 1.14 Hoisting the flag ............................................ 21 1.15 Flags and communication ....................................... 21 1.16 Flapping ................................................ 23 1.17 See also ............................................... -
DEC SHOFAR Pages 1-10
THE A Publication of the Jewish Federation of Greater SH Chattanooga OF Volume 31 NumberAR 4 December 2018 Community Candle Lighting and Federation/ The State of the 2019 Annual Campaign Hadassah Lunch in December by Mike Spector, Campaign Chair Join us at the JCC for the annual We have reached the month of December. Our Chanukah candle lighting ceremony and Thanksgiving leftovers are long gone, and now we dinner, Sunday, December 2nd at 5:30 get ready for the eight nights of Chanukah, which pm. There will be oven-fried chicken, begin December 2nd. Students are wrapping up their potato latkes, green beans, and desserts. fall semester, and many people are looking toward Bring your own chanukiah or use one of end-of-the year vacations. ours. We will supply candles. But here at the Federation, it’s crunch time! This Cost is $12 per person or $30 for is when we reach out to donors we’ve not yet heard a family of four. Children age five to from, to encourage them to help us achieve our sixteen are $6; those under four get in annual goal. free. The per-person cost increases by $4 at the door, so rsvp early to The 2019 Annual Campaign is coming along quite [email protected], or by calling 493-0270. well. As of this printing we have just surpassed the $951,000 amount--so close On Tuesday, December 4th, join us again for a joint Federation/ to breaking $1,000,000 and beyond! Four-hundred seventy community members Hadassah Chanukah lunch.