Wikimedia India Newsletter, September 2010
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
A Topic-Aligned Multilingual Corpus of Wikipedia Articles for Studying Information Asymmetry in Low Resource Languages
Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), pages 2373–2380 Marseille, 11–16 May 2020 c European Language Resources Association (ELRA), licensed under CC-BY-NC A Topic-Aligned Multilingual Corpus of Wikipedia Articles for Studying Information Asymmetry in Low Resource Languages Dwaipayan Roy, Sumit Bhatia, Prateek Jain GESIS - Cologne, IBM Research - Delhi, IIIT - Delhi [email protected], [email protected], [email protected] Abstract Wikipedia is the largest web-based open encyclopedia covering more than three hundred languages. However, different language editions of Wikipedia differ significantly in terms of their information coverage. We present a systematic comparison of information coverage in English Wikipedia (most exhaustive) and Wikipedias in eight other widely spoken languages (Arabic, German, Hindi, Korean, Portuguese, Russian, Spanish and Turkish). We analyze the content present in the respective Wikipedias in terms of the coverage of topics as well as the depth of coverage of topics included in these Wikipedias. Our analysis quantifies and provides useful insights about the information gap that exists between different language editions of Wikipedia and offers a roadmap for the Information Retrieval (IR) community to bridge this gap. Keywords: Wikipedia, Knowledge base, Information gap 1. Introduction other with respect to the coverage of topics as well as Wikipedia is the largest web-based encyclopedia covering the amount of information about overlapping topics. -
State of Wikimedia Communities of India
State of Wikimedia Communities of India Assamese http://as.wikipedia.org State of Assamese Wikipedia RISE OF ASSAMESE WIKIPEDIA Number of edits and internal links EDITS PER MONTH INTERNAL LINKS GROWTH OF ASSAMESE WIKIPEDIA Number of good Date Articles January 2010 263 December 2012 301 (around 3 articles per month) November 2011 742 (around 40 articles per month) Future Plans Awareness Sessions and Wiki Academy Workshops in Universities of Assam. Conduct Assamese Editing Workshops to groom writers to write in Assamese. Future Plans Awareness Sessions and Wiki Academy Workshops in Universities of Assam. Conduct Assamese Editing Workshops to groom writers to write in Assamese. THANK YOU Bengali বাংলা উইকিপিডিয়া Bengali Wikipedia http://bn.wikipedia.org/ By Bengali Wikipedia community Bengali Language • 6th most spoken language • 230 million speakers Bengali Language • National language of Bangladesh • Official language of India • Official language in Sierra Leone Bengali Wikipedia • Started in 2004 • 22,000 articles • 2,500 page views per month • 150 active editors Bengali Wikipedia • Monthly meet ups • W10 anniversary • Women’s Wikipedia workshop Wikimedia Bangladesh local chapter approved in 2011 by Wikimedia Foundation English State of WikiProject India on ENGLISH WIKIPEDIA ● One of the largest Indian Wikipedias. ● WikiProject started on 11 July 2006 by GaneshK, an NRI. ● Number of article:89,874 articles. (Excludes those that are not tagged with the WikiProject banner) ● Editors – 465 (active) ● Featured content : FAs - 55, FLs - 20, A class – 2, GAs – 163. BASIC STATISTICS ● B class – 1188 ● C class – 801 ● Start – 10,931 ● Stub – 43,666 ● Unassessed for quality – 20,875 ● Unknown importance – 61,061 ● Cleanup tags – 43,080 articles & 71,415 tags BASIC STATISTICS ● Diversity of opinion ● Lack of reliable sources ● Indic sources „lost in translation“ ● Editor skills need to be upgraded ● Lack of leadership ● Lack of coordinated activities ● …. -
As You Wish Meaning in Bengali
As You Wish Meaning In Bengali Collapsable Jack stet glumly and stockily, she denitrify her Pharisee cuckoos thither. Sanders boned her earners broad-mindedly, flammable and uncalled-for. Miles biking his throttles pavilion wearyingly, but trivial Chuck never backlash so pectinately. This can be understood with an example. Amazon as a Manager where I was access for creating process guidelines, training content and preparing weekly performance reports that involved complex writing skills. Where are the restrooms? English dictionary on so happy times go through on buy in hindi translation people is in sanskrit malayalam good morning has successfully completed. This Good Evening Love Message will help us to overcome many problems. Anyone you render on facebook or twitter just imagine how naked the sea will and, your! Do you want her give your low each to this translation? Bitcoin meaning in bengali rear end be misused to book hotels off Expedia, shop for furniture on buy in and buy Xbox games. Moreover, combining translation and typesetting in this way creates efficiencies and economies over carrying them out separately. RATHER definition are included in the result of RATHER meaning in bengali at kitkatwords. Only the user who asked this question will those who disagreed with color answer. Can i am and you in her! How children you say suffer in Korean? This category only used for furniture on. Bengali encodings, the vowels that are attached to the left character are written first followed by consonant. Indic languages were traditionally small businesses may have a substantial bengali r dissertation meaning. However, some of the words are actually closely related to Latin as well. -
Modeling Popularity and Reliability of Sources in Multilingual Wikipedia
information Article Modeling Popularity and Reliability of Sources in Multilingual Wikipedia Włodzimierz Lewoniewski * , Krzysztof W˛ecel and Witold Abramowicz Department of Information Systems, Pozna´nUniversity of Economics and Business, 61-875 Pozna´n,Poland; [email protected] (K.W.); [email protected] (W.A.) * Correspondence: [email protected] Received: 31 March 2020; Accepted: 7 May 2020; Published: 13 May 2020 Abstract: One of the most important factors impacting quality of content in Wikipedia is presence of reliable sources. By following references, readers can verify facts or find more details about described topic. A Wikipedia article can be edited independently in any of over 300 languages, even by anonymous users, therefore information about the same topic may be inconsistent. This also applies to use of references in different language versions of a particular article, so the same statement can have different sources. In this paper we analyzed over 40 million articles from the 55 most developed language versions of Wikipedia to extract information about over 200 million references and find the most popular and reliable sources. We presented 10 models for the assessment of the popularity and reliability of the sources based on analysis of meta information about the references in Wikipedia articles, page views and authors of the articles. Using DBpedia and Wikidata we automatically identified the alignment of the sources to a specific domain. Additionally, we analyzed the changes of popularity and reliability in time and identified growth leaders in each of the considered months. The results can be used for quality improvements of the content in different languages versions of Wikipedia. -
Omnipedia: Bridging the Wikipedia Language
Omnipedia: Bridging the Wikipedia Language Gap Patti Bao*†, Brent Hecht†, Samuel Carton†, Mahmood Quaderi†, Michael Horn†§, Darren Gergle*† *Communication Studies, †Electrical Engineering & Computer Science, §Learning Sciences Northwestern University {patti,brent,sam.carton,quaderi}@u.northwestern.edu, {michael-horn,dgergle}@northwestern.edu ABSTRACT language edition contains its own cultural viewpoints on a We present Omnipedia, a system that allows Wikipedia large number of topics [7, 14, 15, 27]. On the other hand, readers to gain insight from up to 25 language editions of the language barrier serves to silo knowledge [2, 4, 33], Wikipedia simultaneously. Omnipedia highlights the slowing the transfer of less culturally imbued information similarities and differences that exist among Wikipedia between language editions and preventing Wikipedia’s 422 language editions, and makes salient information that is million monthly visitors [12] from accessing most of the unique to each language as well as that which is shared information on the site. more widely. We detail solutions to numerous front-end and algorithmic challenges inherent to providing users with In this paper, we present Omnipedia, a system that attempts a multilingual Wikipedia experience. These include to remedy this situation at a large scale. It reduces the silo visualizing content in a language-neutral way and aligning effect by providing users with structured access in their data in the face of diverse information organization native language to over 7.5 million concepts from up to 25 strategies. We present a study of Omnipedia that language editions of Wikipedia. At the same time, it characterizes how people interact with information using a highlights similarities and differences between each of the multilingual lens. -
An Analysis of Contributions to Wikipedia from Tor
Are anonymity-seekers just like everybody else? An analysis of contributions to Wikipedia from Tor Chau Tran Kaylea Champion Andrea Forte Department of Computer Science & Engineering Department of Communication College of Computing & Informatics New York University University of Washington Drexel University New York, USA Seatle, USA Philadelphia, USA [email protected] [email protected] [email protected] Benjamin Mako Hill Rachel Greenstadt Department of Communication Department of Computer Science & Engineering University of Washington New York University Seatle, USA New York, USA [email protected] [email protected] Abstract—User-generated content sites routinely block contri- butions from users of privacy-enhancing proxies like Tor because of a perception that proxies are a source of vandalism, spam, and abuse. Although these blocks might be effective, collateral damage in the form of unrealized valuable contributions from anonymity seekers is invisible. One of the largest and most important user-generated content sites, Wikipedia, has attempted to block contributions from Tor users since as early as 2005. We demonstrate that these blocks have been imperfect and that thousands of attempts to edit on Wikipedia through Tor have been successful. We draw upon several data sources and analytical techniques to measure and describe the history of Tor editing on Wikipedia over time and to compare contributions from Tor users to those from other groups of Wikipedia users. Fig. 1. Screenshot of the page a user is shown when they attempt to edit the Our analysis suggests that although Tor users who slip through Wikipedia article on “Privacy” while using Tor. Wikipedia’s ban contribute content that is more likely to be reverted and to revert others, their contributions are otherwise similar in quality to those from other unregistered participants and to the initial contributions of registered users. -
Minority Languages in India
Thomas Benedikter Minority Languages in India An appraisal of the linguistic rights of minorities in India ---------------------------- EURASIA-Net Europe-South Asia Exchange on Supranational (Regional) Policies and Instruments for the Promotion of Human Rights and the Management of Minority Issues 2 Linguistic minorities in India An appraisal of the linguistic rights of minorities in India Bozen/Bolzano, March 2013 This study was originally written for the European Academy of Bolzano/Bozen (EURAC), Institute for Minority Rights, in the frame of the project Europe-South Asia Exchange on Supranational (Regional) Policies and Instruments for the Promotion of Human Rights and the Management of Minority Issues (EURASIA-Net). The publication is based on extensive research in eight Indian States, with the support of the European Academy of Bozen/Bolzano and the Mahanirban Calcutta Research Group, Kolkata. EURASIA-Net Partners Accademia Europea Bolzano/Europäische Akademie Bozen (EURAC) – Bolzano/Bozen (Italy) Brunel University – West London (UK) Johann Wolfgang Goethe-Universität – Frankfurt am Main (Germany) Mahanirban Calcutta Research Group (India) South Asian Forum for Human Rights (Nepal) Democratic Commission of Human Development (Pakistan), and University of Dhaka (Bangladesh) Edited by © Thomas Benedikter 2013 Rights and permissions Copying and/or transmitting parts of this work without prior permission, may be a violation of applicable law. The publishers encourage dissemination of this publication and would be happy to grant permission. -
SOME NOTES on the ETYMOLOGY of the WORD Altai / Altay Altai/Altay Kelimesinin Etimolojisi Üzerine Notlar
........... SOME NOTES ON THE ETYMOLOGY OF THE WORD altai / altay Altai/Altay Kelimesinin Etimolojisi Üzerine Notlar Hüseyin YILDIZ*1 Dil Araştırmaları, Bahar 2017/20: 187-201 Öz: Günümüzde Altay Dağları’nı adlandırmada kullanılan altay kelimesine tarihî Türk lehçelerinde bu fonetik biçimiyle rastlanmamaktadır. Bize göre altay kelimesinin altun kelimesiyle doğrudan ilgisi vardır. Ancak bunu daha net bir şekilde görebilmek için şu iki soruyu cevaplandırmak gerekir: i) altun kelimesi, Altay dağları için kullanılan adlandırmalarda yer almakta mıdır? ii) İkinci hecedeki /u/ ~ /a/ ve /n/ ~ /y/ denklikleri fonetik olarak açıklanabilir mi? Bu makalede altay kelimesinin etimolojisi fonetik (/n/ ~ /y/ ve /a/~ /u/), semantik (altun yış, chin-shan, altun owla…) ve leksik (Altun Ğol, Altan Ḫusu Owla…) yönlerden tartışılacaktır. Bize göre hem Eski Türkçe altay kelimesinin tarihinden hem de ‘Altay Dağları’ için kullanılan kelimelerden hareketle, kelimenin en eski şekli *paltuń şeklinde tasarlanabilir. Anahtar Kelimeler: Türk yazı dilleri, Eski Türkçe, etimoloji, altay kelimesi, yer adları, Altay Dağları Abstract: The word altay is an oronym referring to Altai Mountains, but it, in this phonological form, is not attested to have existed in early Turkic works. We consider that the word altay is directly related with the word altun. However, to see this clearer we must answer following two questions: i) Is the word altun used for any denominating system of the Altay Mountains? and ii) Could the equivalencies between /u/ ~ /a/ and between /n/ ~ /y/ phonemes on the second syllable be explained phonologically? This article aims to discuss the etymology of the word altai in following respects: phonologically (/n/ ~ /y/ and /a/~ /u/), semantically (altun yış, chin- shan, altun owla…) and lexically (Altun Ğol, Altan Ḫusu Owla…). -
Critical Point of View: a Wikipedia Reader
w ikipedia pedai p edia p Wiki CRITICAL POINT OF VIEW A Wikipedia Reader 2 CRITICAL POINT OF VIEW A Wikipedia Reader CRITICAL POINT OF VIEW 3 Critical Point of View: A Wikipedia Reader Editors: Geert Lovink and Nathaniel Tkacz Editorial Assistance: Ivy Roberts, Morgan Currie Copy-Editing: Cielo Lutino CRITICAL Design: Katja van Stiphout Cover Image: Ayumi Higuchi POINT OF VIEW Printer: Ten Klei Groep, Amsterdam Publisher: Institute of Network Cultures, Amsterdam 2011 A Wikipedia ISBN: 978-90-78146-13-1 Reader EDITED BY Contact GEERT LOVINK AND Institute of Network Cultures NATHANIEL TKACZ phone: +3120 5951866 INC READER #7 fax: +3120 5951840 email: [email protected] web: http://www.networkcultures.org Order a copy of this book by sending an email to: [email protected] A pdf of this publication can be downloaded freely at: http://www.networkcultures.org/publications Join the Critical Point of View mailing list at: http://www.listcultures.org Supported by: The School for Communication and Design at the Amsterdam University of Applied Sciences (Hogeschool van Amsterdam DMCI), the Centre for Internet and Society (CIS) in Bangalore and the Kusuma Trust. Thanks to Johanna Niesyto (University of Siegen), Nishant Shah and Sunil Abraham (CIS Bangalore) Sabine Niederer and Margreet Riphagen (INC Amsterdam) for their valuable input and editorial support. Thanks to Foundation Democracy and Media, Mondriaan Foundation and the Public Library Amsterdam (Openbare Bibliotheek Amsterdam) for supporting the CPOV events in Bangalore, Amsterdam and Leipzig. (http://networkcultures.org/wpmu/cpov/) Special thanks to all the authors for their contributions and to Cielo Lutino, Morgan Currie and Ivy Roberts for their careful copy-editing. -
9 January 2015, Friday Gandhi Bhavan Gandhi Bhavan Dr. K. P
Day 1 9 January 2015, Friday 08:00 Spot Registration – Primary Interaction 09:00 Gandhi Bhavan Opening Note on behalf of Organizing Committee: Atanu Saha / Dr. Tanmay Bir Welcome Note: Prof. Asis Mazumdar, Dean, ISLM, Jadavpur University Speech: Sanjay Gopal Sarkar, Jt. Registrar, Jadavpur University Speech by WMF Member: Bishakha Datta, Member, Board of Trustees, WMF 09:00 Speech by Wikimedia India Chapter Member: Jayanta Nath, Secretary, Wikimedia India Chapter – 10:40 Keynote: Prof. Pabitra Sarkar, Former Vice-Chancellor, Rabindra Bharati University Speech by Chief Guest: Prof. Basab Chaudhuri, Registrar, Calcutta University State of Indic language Wiki projects By Wikimedia India Chapter member Valedictory Note: Prof. Mahidas Bhattacharya, Director, SLL, Jadavpur University 10:40 – Tea Break 11:00 International Dr. K. P. Basu Memorial Hall Vivekananda Hall Relations Seminar Gandhi Bhavan Engineering Science Building Subarna Jayanti Building Hall PG Arts Building Paper Presentation Workshop Track: Community Track: Bengali Chair: TBD Wikipedia Time: 30 minutes Chair: Kalyan Sarkar each Time: 30 minutes each Title: How to type in 11:00 Bengali – Presenter: Ayan 11:30 Closed Choudhury Title: Collaboration between Microsoft Title: How to edit 11:30 and Indic language Bengali Wikipedia – community Presenter: Ayan 12:00 Presenter: M. Closed Choudhury Krishna Doss, R&D, Microsoft India Paper Presentation Track: Outreach Chair: Bishakha Datta Title: How to create a Time: 20 minutes each new article Presenter: Sarban 12:00 Closed • Title: Wikipedia As A Teaching Bandyopadhyay – Learning Tool In The Field Of 12:30 Higher Education 12:30 Presenter: Dr. Indrani o Title: Advanced editing – Nath, Sarsuna College in Bengali Wikipedia 13:00 • Title: Information and Presenter: Sarban Communication Technologies Bandyopadhyay (ICTs) and Wikispaces for Engineering Students - Relevance and Utility o Presenter: Dr. -
Bringing Ol Chiki to the Digital World
Typography and Education http://www.typoday.in Bringing Ol Chiki to the digital world Saxena, Pooja, [email protected] Panigrahi, Subhashish, Programme Officer, Access to Knowledge, Center for Internet and Society, Bengaluru (India), [email protected] Abstract: Can a typeface turn the fate of an indigenous language around by making communication possible on digital platforms and driving digital activism? In 2014, a project was initiated with financial support from the Access to Knowledge programme at The Center for Internet and Society, Bangalore to look for answers to this question. The project’s goal was to design a typeface family supporting Ol Chiki script, which is used to write Santali, along with input methods that would make typing in Ol Chiki possible. It was planned that these resources would be released under a free license, with the hope to provide tools to Santali speakers to read and write in their own script online. Key words: Ol Chiki script, typeface design, minority script, Santali language 1. Introduction The main aim of this paper is to share the experiences and knowledge gained by working on a typeface and input method design project for a minority script from India, in this case Ol Chiki, which is used to write Santali. This project was initiated by the Access to Knowledge programme at the Center for Internet and Society (CIS-A2K, whose mandate is to work towards catalysing the growth of the free and open knowledge movement in South Typography Day 2016 1 Asia and in Indic languages. From September 2012, CIS has been actively involved in growing the open knowledge movement in India through a grant received from the Wikimedia Foundation (WMF). -
A Wikipedia Reader
UvA-DARE (Digital Academic Repository) Critical point of view: a Wikipedia reader Lovink, G.; Tkacz, N. Publication date 2011 Document Version Final published version Link to publication Citation for published version (APA): Lovink, G., & Tkacz, N. (2011). Critical point of view: a Wikipedia reader. (INC reader; No. 7). Institute of Network Cultures. http://www.networkcultures.org/_uploads/%237reader_Wikipedia.pdf General rights It is not permitted to download or to forward/distribute the text or part of it without the consent of the author(s) and/or copyright holder(s), other than for strictly personal, individual use, unless the work is under an open content license (like Creative Commons). Disclaimer/Complaints regulations If you believe that digital publication of certain material infringes any of your rights or (privacy) interests, please let the Library know, stating your reasons. In case of a legitimate complaint, the Library will make the material inaccessible and/or remove it from the website. Please Ask the Library: https://uba.uva.nl/en/contact, or a letter to: Library of the University of Amsterdam, Secretariat, Singel 425, 1012 WP Amsterdam, The Netherlands. You will be contacted as soon as possible. UvA-DARE is a service provided by the library of the University of Amsterdam (https://dare.uva.nl) Download date:05 Oct 2021 w ikipedia pedai p edia p Wiki CRITICAL POINT OF VIEW A Wikipedia Reader 2 CRITICAL POINT OF VIEW A Wikipedia Reader CRITICAL POINT OF VIEW 3 Critical Point of View: A Wikipedia Reader Editors: Geert Lovink