TAO Handbook

Total Page:16

File Type:pdf, Size:1020Kb

TAO Handbook TAO Handbook PDF generated using the open source mwlib toolkit. See http://code.pediapress.com/ for more information. PDF generated at: Mon, 30 Sep 2013 08:05:34 UTC Contents Articles Introduction 1 TAO 1 Handbook Overview 2 Welcome to the Handbook 5 How to Contribute to the Handbook 6 Description of the Idea 9 Examples of Users and Communities 10 Background Information 12 Target Groups of Online Communities 12 Fostering Older Adults' Online Participation 16 Older Adults and Online Communities 19 Usability 21 Accessibility 25 What is and Online Community? 27 TAO Survey Among Older Adults - Wave 1 31 TAO Survey Among Elderly - Wave 2 34 Context Analysis 37 Activities 48 Initiation and Meaningful Use 48 Working with Volunteers 49 Facebook Activities 54 Free Cruise on the Internet 55 Online Learning Activities 58 Silver Knowledge 60 Silver Knowledge: Comparison of Different Locations 65 Silver Knowledge: Information for Cooperation Partners 69 Wikimedia Seniors' Outreach 72 Online Co-Creation 76 SeniorWeb NL's Online Contact Services 79 Activities at Seniorweb.ch 85 terzLivingLab 87 Online tools 89 Online Tools 89 Online Collaboration 92 Web Conferencing 93 Virtual classrooms 96 Online Tools for people 50plus 97 General Conditions 99 Public Relations 99 Communities of Practice 101 Business Models 104 Sponsorship and Fundraising 108 Methods and Practical Tools 112 Analytical framework for the evaluation of online communities 112 Co-Creation with Older Persons 116 Methods for Cooperation and Seminars 120 Accessibility Tool 127 Problem-oriented access 128 TAO/Handbook/Problem Motivation 128 TAO/Handbook/Problem Funding 128 TAO/Handbook/Problem Volunteers 128 TAO/Handbook/Problem Communities 129 TAO/Handbook/Problem Usability Accessability 129 TAO/Handbook/Problem Evaluation 129 TAO/Handbook/Problem Research 130 References Article Sources and Contributors 131 Image Sources, Licenses and Contributors 133 Article Licenses License 134 1 Introduction TAO TAO is about engaging older adults in online activities... TAO Handbook TAO Community of Practice You're looking for information on how to involve older adults in your You want to meet and exchange with other people community's activities?' who work with online communities and older adults? In our handbook, you'll find background information, descriptions of best We're building up a community of people from various practices and other things you need to know if you want to make your backgrounds who are ready to share their experiences online community a great place for older adults and tap into older people's and to provide mutual support. experience and collaborative energy. And you can add your own Join us for common learning, research and innovation experiences! activities in order to promote the active participation of For community managers, managers of educational institutions, teachers, older people in online communities and online volunteer co-ordinators, instructors, moderators, facilitators, consultants, collaboration projects. and other people interested in getting older people to play an active role in the web2.0 world. People&Learning Activities Contents Choose your Entry Point For staff members in a For teachers, instructors, For people who would like to You don't fall in one of those managing or co-ordinating moderators, facilitators support the community categories? function Tell us about yourself TAO 2 TAO is both a placeholder for the Chinese ideograph 道 (Pinyin: Dào source), meaning way or method, and an acronym for Third Age Online. The main target of the project is to highlight the ways in which the access of older persons to the opportunities offered by online communities can be facilitated. At the same time, the project aims to profit from the growing number of older persons to advance charitable projects of online communities. - Learn more about the project by checking out its official website [1]. TAO consortium members: You can still access the outdated version of this start page. References [1] http:/ / www. thirdageonline. eu/ Handbook Overview For expanding the list click "show". You can also download a “ready for printing” version of the handbook, it always represents the current state of the work done on these pages. Introduction Learn about the Handbook concept, how to read and edit the Handbook. Welcome to the Handbook Sharing your know-how with others - How to contribute Examples of users and communities Description of the Idea Definitions Clarification of some important terms and concepts. Target groups Online communities Handbook Overview 3 Background Information General information on older adults' internet use, online communities etc. Fostering older adults' online participation Older adults and online communities Volunteer Management and Motivation Mutual Benefits of Volunteer Work Usability Accessibility TAO Survey Among Older Adults - Wave 1 TAO Survey Among Older Adults - Wave 2 TAO Context Analysis Activities Case studies of activities involving older adults and online collaboration Some notes on different types of activities: Activities initiating older adults to meaningful use of the internet Activities with volunteer instructors: practical experiences Examples of activities: Facebook Activities (online community, workshops) Free Cruise on the Internet (online skills, online collaboration, workshops, volunteers) Online learning activities (online collaboration, intergenerational, reading) Open TAO Workshop (community building, workshop) Silver Knowledge (Wikipedia, mentoring, workshops) Wikimedia Seniors Outreach Online Co-Creation (online collaboration, workshops) SeniorWebNL's online contact services (online collaboration, workshops) Seniorweb.ch (e-Learning from Seniors for Seniors) terzLivingLab Online tools Some tools that will help you implement your activities Online tools: general remarks, approach and requirements Collaboration: working together, online and in real-time Web conferencing: communicating over the web Virtual classrooms: remote teaching Online tools and people 50+ Handbook Overview 4 General Conditions Information on how to organize events, projects and find appropriate funding Business Models Public Relations Sponsorship and Fundraising Communities of Practice Methods and Practical Tools Strategies for evaluating, implementing and planning seminars Analytical framework for the evaluation of online communities Co-Creation with Older Persons (online collaboration, mentoring, workshops) Geragogical methods for cooperation and seminars The CAC Accessibility Tool Problem-oriented access Where can I read some general information about online communities? How do I get members to participate? How do I manage volunteers? How do I develop a sustainable business model? How can I increase usability and accessability? How do I evaluate the current status of my online community? What are current findings of research on participation of older adults in online communities? You want to contribute? Find Tips on how to work with Wikiversity and on Technical Issues here: • Idea: What the TAO handbook is about • Contribute: How to edit pages in the TAO handbook and what to consider when you're contributing • Languages: How to handle the different language versions of this project • Overview: List of pages belonging to of in focus of the project • Templates: Collection of templates recommended for use Choose your Entry Point For staff members in a For teachers, instructors, For people who would like to You don't fall in one of those managing or co-ordinating moderators, facilitators support the community categories? function Tell us about yourself Welcome to the Handbook 5 Welcome to the Handbook Welcome to the TAO Handbook! The TAO [1] Handbook is a collection of practical and background information on how to involve older adults in online communities and online collaboration. It was conceived primarily for two types of users: •• managers of online communities and people in charge of online collaboration (in an educational setting or other); •• trainers and multiplicators in online communities and collaboration. A special characteristic of the Handbook is that it still can be edited by anyone. As it was created within the Wikimedia project Wikiversity you possibly hold in your hands a printed edition of the Handbook, however there may already be a version containing additional material available on the web. If you want to go on online right now follow this link http:/ / en. wikiversity. org/ wiki/ TAO/ Handbook''. We would be glad to also benefit from your experience! Contributing Our Handbook was set on a collaborative platform for a purpose. We think that you have a lot of relevant experience. Help us make this Handbook better, and add your own information! Find out how in the next module. Contents The Handbook contains several chapters that treat different types of information. We start out with a chapter containing background information on several relevant topics. To start with, the chapter outlines features of the group targeted by the Handbook users: older adults. Information is given on their internet use, motivation to join an online community and how they may benefit from it. Different strategies to foster their participation are outlined. Then, the important role of volunteers and a general information on usability issues are presented. The chapter closes with a definition of online communities and communities of practice, giving ideas on how to manage both types of communities. The following chapter presents activities that
Recommended publications
  • 12€€€€Collaborating on the Sum of All Knowledge Across Languages
    Wikipedia @ 20 • ::Wikipedia @ 20 12 Collaborating on the Sum of All Knowledge Across Languages Denny Vrandečić Published on: Oct 15, 2020 Updated on: Nov 16, 2020 License: Creative Commons Attribution 4.0 International License (CC-BY 4.0) Wikipedia @ 20 • ::Wikipedia @ 20 12 Collaborating on the Sum of All Knowledge Across Languages Wikipedia is available in almost three hundred languages, each with independently developed content and perspectives. By extending lessons learned from Wikipedia and Wikidata toward prose and structured content, more knowledge could be shared across languages and allow each edition to focus on their unique contributions and improve their comprehensiveness and currency. Every language edition of Wikipedia is written independently of every other language edition. A contributor may consult an existing article in another language edition when writing a new article, or they might even use the content translation tool to help with translating one article to another language, but there is nothing to ensure that articles in different language editions are aligned or kept consistent with each other. This is often regarded as a contribution to knowledge diversity since it allows every language edition to grow independently of all other language editions. So would creating a system that aligns the contents more closely with each other sacrifice that diversity? Differences Between Wikipedia Language Editions Wikipedia is often described as a wonder of the modern age. There are more than fifty million articles in almost three hundred languages. The goal of allowing everyone to share in the sum of all knowledge is achieved, right? Not yet. The knowledge in Wikipedia is unevenly distributed.1 Let’s take a look at where the first twenty years of editing Wikipedia have taken us.
    [Show full text]
  • Europe's Online Encyclopaedias
    Europe's online encyclopaedias Equal access to knowledge of general interest in a post-truth era? IN-DEPTH ANALYSIS EPRS | European Parliamentary Research Service Author: Naja Bentzen Members' Research Service PE 630.347 – December 2018 EN The post-truth era – in which emotions seem to trump evidence, while trust in institutions, expertise and mainstream media is declining – is putting our information ecosystem under strain. At a time when information is increasingly being manipulated for ideological and economic purposes, public access to sources of trustworthy, general-interest knowledge – such as national online encyclopaedias – can help to boost our cognitive resilience. Basic, reliable background information about history, culture, society and politics is an essential part of our societies' complex knowledge ecosystem and an important tool for any citizen searching for knowledge, facts and figures. AUTHOR Naja Bentzen, Members' Research Service This paper has been drawn up by the Members' Research Service, within the Directorate-General for Parliamentary Research Services (EPRS) of the Secretariat of the European Parliament. The annexes to this paper are dedicated to individual countries and contain contributions from Ilze Eglite (coordinating information specialist), Evarts Anosovs, Marie-Laure Augere-Granier, Jan Avau, Michele Bigoni, Krisztina Binder, Kristina Grosek, Matilda Ekehorn, Roy Hirsh, Sorina Silvia Ionescu, Ana Martinez Juan, Ulla Jurviste, Vilma Karvelyte, Giorgios Klis, Maria Kollarova, Veronika Kunz, Elena Lazarou, Tarja Laaninen, Odile Maisse, Helmut Masson, Marketa Pape, Raquel Juncal Passos Rocha, Eric Pichon, Anja Radjenovic, Beata Rojek- Podgorska, Nicole Scholz, Anne Vernet and Dessislava Yougova. The graphics were produced by Giulio Sabbati. To contact the authors, please email: [email protected] LINGUISTIC VERSIONS Original: EN Translations: DE, FR Original manuscript, in English, completed in January 2018 and updated in December 2018.
    [Show full text]
  • Is Wikipedia Really Neutral? a Sentiment Perspective Study of War-Related Wikipedia Articles Since 1945
    PACLIC 29 Is Wikipedia Really Neutral? A Sentiment Perspective Study of War-related Wikipedia Articles since 1945 Yiwei Zhou, Alexandra I. Cristea and Zachary Roberts Department of Computer Science University of Warwick Coventry, United Kingdom fYiwei.Zhou, A.I.Cristea, [email protected] Abstract include books, journal articles, newspapers, web- pages, sound recordings2, etc. Although a “Neutral Wikipedia is supposed to be supporting the 3 “Neutral Point of View”. Instead of accept- point of view” (NPOV) is Wikipedia’s core content ing this statement as a fact, the current paper policy, we believe sentiment expression is inevitable analyses its veracity by specifically analysing in this user-generated content. Already in (Green- a typically controversial (negative) topic, such stein and Zhu, 2012), researchers have raised doubt as war, and answering questions such as “Are about Wikipedia’s neutrality, as they pointed out there sentiment differences in how Wikipedia that “Wikipedia achieves something akin to a NPOV articles in different languages describe the across articles, but not necessarily within them”. same war?”. This paper tackles this chal- Moreover, people of different language backgrounds lenge by proposing an automatic methodology based on article level and concept level senti- share different cultures and sources of information. ment analysis on multilingual Wikipedia arti- These differences have reflected on the style of con- cles. The results obtained so far show that rea- tributions (Pfeil et al., 2006) and the type of informa- sons such as people’s feelings of involvement tion covered (Callahan and Herring, 2011). Further- and empathy can lead to sentiment expression more, Wikipedia webpages actually allow to con- differences across multilingual Wikipedia on tain opinions, as long as they come from reliable au- war-related topics; the more people contribute thors4.
    [Show full text]
  • Parallel-Wiki: a Collection of Parallel Sentences Extracted from Wikipedia
    Parallel-Wiki: A Collection of Parallel Sentences Extracted from Wikipedia Dan Ştefănescu1,2 and Radu Ion1 1 Research Institute for Artificial Intelligence, Romanian Academy {danstef,radu}@racai.ro 2 Department of Computer Science, The University of Memphis [email protected] Abstract. Parallel corpora are essential resources for certain Natural Language Processing tasks such as Statistical Machine Translation. However, the existing publically available parallel corpora are specific to limited genres or domains, mostly juridical (e.g. JRC-Acquis) and medical (e.g. EMEA), and there is a lack of such resources for the general domain. This paper addresses this issue and presents a collection of parallel sentences extracted from the entire Wikipedia collection of documents for the following pairs of languages: English-German, English-Romanian and English-Spanish. Our work began with the processing of the publically available Wikipedia static dumps for the three languages in- volved. The existing text was stripped of the specific mark-up, cleaned of non- textual entries like images or tables and sentence-split. Then, corresponding documents for the above mentioned pairs of languages were identified using the cross-lingual Wikipedia links embedded within the documents themselves. Considering them comparable documents, we further employed a publically available tool named LEXACC, developed during the ACCURAT project, to extract parallel sentences from the preprocessed data. LEXACC assigns a score to each extracted pair, which is a measure of the degree of parallelism between the two sentences in the pair. These scores allow researchers to select only those sentences having a certain degree of parallelism suited for their intended purposes.
    [Show full text]
  • Wikipedia @ 20
    Wikipedia @ 20 Wikipedia @ 20 Stories of an Incomplete Revolution Edited by Joseph Reagle and Jackie Koerner The MIT Press Cambridge, Massachusetts London, England © 2020 Massachusetts Institute of Technology This work is subject to a Creative Commons CC BY- NC 4.0 license. Subject to such license, all rights are reserved. The open access edition of this book was made possible by generous funding from Knowledge Unlatched, Northeastern University Communication Studies Department, and Wikimedia Foundation. This book was set in Stone Serif and Stone Sans by Westchester Publishing Ser vices. Library of Congress Cataloging-in-Publication Data Names: Reagle, Joseph, editor. | Koerner, Jackie, editor. Title: Wikipedia @ 20 : stories of an incomplete revolution / edited by Joseph M. Reagle and Jackie Koerner. Other titles: Wikipedia at 20 Description: Cambridge, Massachusetts : The MIT Press, [2020] | Includes bibliographical references and index. Identifiers: LCCN 2020000804 | ISBN 9780262538176 (paperback) Subjects: LCSH: Wikipedia--History. Classification: LCC AE100 .W54 2020 | DDC 030--dc23 LC record available at https://lccn.loc.gov/2020000804 Contents Preface ix Introduction: Connections 1 Joseph Reagle and Jackie Koerner I Hindsight 1 The Many (Reported) Deaths of Wikipedia 9 Joseph Reagle 2 From Anarchy to Wikiality, Glaring Bias to Good Cop: Press Coverage of Wikipedia’s First Two Decades 21 Omer Benjakob and Stephen Harrison 3 From Utopia to Practice and Back 43 Yochai Benkler 4 An Encyclopedia with Breaking News 55 Brian Keegan 5 Paid with Interest: COI Editing and Its Discontents 71 William Beutler II Connection 6 Wikipedia and Libraries 89 Phoebe Ayers 7 Three Links: Be Bold, Assume Good Faith, and There Are No Firm Rules 107 Rebecca Thorndike- Breeze, Cecelia A.
    [Show full text]
  • The Tower of Babel Meets Web 2.0: User-Generated Content and Its Applications in a Multilingual Context Brent Hecht* and Darren Gergle*† Northwestern University *Dept
    The Tower of Babel Meets Web 2.0: User-Generated Content and Its Applications in a Multilingual Context Brent Hecht* and Darren Gergle*† Northwestern University *Dept. of Electrical Engineering and Computer Science, † Dept. of Communication Studies [email protected], [email protected] ABSTRACT the goal of this research to illustrate the splintering effect of This study explores language’s fragmenting effect on user- this “Web 2.0 Tower of Babel”1 and to explicate the generated content by examining the diversity of knowledge positive and negative implications for HCI and AI-based representations across 25 different Wikipedia language applications that interact with or use Wikipedia data. editions. This diversity is measured at two levels: the concepts that are included in each edition and the ways in We begin by suggesting that current technologies and which these concepts are described. We demonstrate that applications that rely upon Wikipedia data structures the diversity present is greater than has been presumed in implicitly or explicitly espouse a global consensus the literature and has a significant influence on applications hypothesis with respect to the world’s encyclopedic that use Wikipedia as a source of world knowledge. We knowledge. In other words, they make the assumption that close by explicating how knowledge diversity can be encyclopedic world knowledge is largely consistent across beneficially leveraged to create “culturally-aware cultures and languages. To the social scientist this notion applications” and “hyperlingual applications”. will undoubtedly seem problematic, as centuries of work have demonstrated the critical role culture and context play Author Keywords in establishing knowledge diversity (although no work has Wikipedia, knowledge diversity, multilingual, hyperlingual, yet measured this effect in Web 2.0 user-generated content Explicit Semantic Analysis, semantic relatedness (UGC) on a large scale).
    [Show full text]
  • CCURL 2014: Collaboration and Computing for Under- Resourced Languages in the Linked Open Data Era
    CCURL 2014: Collaboration and Computing for Under- Resourced Languages in the Linked Open Data Era Workshop Programme 09:15-09:30 – Welcome and Introduction 09:30-10:30 – Invited Talk Steven Moran, Under-resourced languages data: from collection to application 10:30-11:00 – Coffee break 11:00-13:00 – Session 1 Chairperson: Joseph Mariani 11:00-11:30 – Oleg Kapanadze, The Multilingual GRUG Parallel Treebank – Syntactic Annotation for Under-Resourced Languages 11:30-12:00 – Martin Benjamin, Paula Radetzky, Multilingual Lexicography with a Focus on Less- Resourced Languages: Data Mining, Expert Input, Crowdsourcing, and Gamification 12:00-12:30 – Thierry Declerck, Eveline Wandl-Vogt, Karlheinz Mörth, Claudia Resch, Towards a Unified Approach for Publishing Regional and Historical Language Resources on the Linked Data Framework 12:30-13:00 – Delphine Bernhard, Adding Dialectal Lexicalisations to Linked Open Data Resources: the Example of Alsatian 13:00-15:00 – Lunch break 13:00-15:00 – Poster Session Chairpersons: Laurette Pretorius and Claudia Soria Georg Rehm, Hans Uszkoreit, Ido Dagan, Vartkes Goetcherian, Mehmet Ugur Dogan, Coskun Mermer, Tamás Varadi, Sabine Kirchmeier-Andersen, Gerhard Stickel, Meirion Prys Jones, Stefan Oeter, Sigve Gramstad, An Update and Extension of the META-NET Study “Europe’s Languages in the Digital Age” István Endrédy, Hungarian-Somali-English Online Dictionary and Taxonomy Chantal Enguehard, Mathieu Mangeot, Computerization of African Languages-French Dictionaries Uwe Quasthoff, Sonja Bosch, Dirk Goldhahn, Morphological Analysis for Less- Resourced Languages: Maximum Affix Overlap Applied to Zulu Edward O. Ombui, Peter W. Wagacha, Wanjiku Ng’ang’a, InterlinguaPlus Machine Translation Approach for Under-Resourced Languages: Ekegusii & Swahili Ronaldo Martins, UNLarium: a Crowd-Sourcing Environment for Multilingual Resources Anuschka van ´t Hooft, José Luis González Compeán, Collaborative Language Documentation: the Construction of the Huastec Corpus Sjur Moshagen, Jack Rueter, Tommi Pirinen, Trond Trosterud, Francis M.
    [Show full text]
  • The Role of Local Content in Wikipedia: a Study on Reader and Editor Engagement1
    ARTÍCULOS Área Abierta. Revista de comunicación audiovisual y publicitaria ISSN: 2530-7592 / ISSNe: 1578-8393 https://dx.doi.org/10.5209/arab.72801 The Role of Local Content in Wikipedia: A Study on Reader and Editor Engagement1 Marc Miquel2, David Laniado3 and Andreas Kaltenbrunner4 Recibido: 1 de diciembre de 2020 / Aceptado: 15 de febrero de 2021 Abstract. About a quarter of each Wikipedia language edition is dedicated to representing “local content”, i.e. the corresponding cultural context —geographical places, historical events, political figures, among others—. To investigate the relevance of such content for users and communities, we present an analysis of reader and editor engagement in terms of pageviews and edits. The results, consistent across fifteen diverse language editions, show that these articles are more engaging for readers and especially for editors. The highest proportion of edits on cultural context content is generated by anonymous users, and also administrators engage proportionally more than plain registered editors; in fact, looking at the first week of activity of every editor in the community, administrators already engage correlatively more than other editors in content representing their cultural context. These findings indicate the relevance of this kind of content both for fulfilling readers’ informational needs and stimulating the dynamics of the editing community. Keywords: Wikipedia; Online Communities; User Engagement; Culture; Cultural Context; Online Collaboration [es] El rol del contenido local en Wikipedia: un estudio sobre la participación de lectores y editores Resumen. Aproximadamente una cuarta parte de cada Wikipedia está dedicada a representar “contenido local”, es decir, el contexto cultural correspondiente de la lengua —lugares geográficos, hechos históricos, figuras políticas, entre otros—.
    [Show full text]
  • Large SMT Data-Sets Extracted from Wikipedia
    Large SMT data-sets extracted from Wikipedia Dan Tufiş 1, Radu Ion 1, Ştefan Dumitrescu 1, Dan Ştefănescu 2 1Research Institute for Artificial Intelligence - Romanian Academy, 13 Calea 13 Septembrie, 050711, Bucharest, Romania {tufis, radu, sdumitrescu}@racai.ro 2The University of Memphis, Department of Computer Science, Memphis, TN, 38152, USA-USA [email protected] Abstract The article presents experiments on mining Wikipedia for extracting SMT useful sentence pairs in three language pairs. Each extracted sentence pair is associated with a cross-lingual lexical similarity score based on which, several evaluations have been conducted to estimate the similarity thresholds which allow the extraction of the most useful data for training three-language pairs SMT systems. The experiments showed that for a similarity score higher than 0.7 all sentence pairs in the three language pairs were fully parallel. However, including in the training sets less parallel sentence pairs (that is with a lower similarity score) showed significant improvements in the translation quality (BLEU-based evaluations). The optimized SMT systems were evaluated on unseen test-sets also extracted from Wikipedia. As one of the main goals of our work was to help Wikipedia contributors to translate (with as little post editing as possible) new articles from major languages into less resourced languages and vice-versa, we call this type of translation experiments “ in-genre” translation. As in the case of “in-domain” translation, our evaluations showed that using only “in-genre” training data for translating same genre new texts is better than mixing the training data with “out-of-genre” (even) parallel texts.
    [Show full text]
  • Global Gender Differences in Wikipedia Readership
    Global gender differences in Wikipedia readership Isaac Johnson1, Florian Lemmerich2, Diego Saez-Trumper´ 1, Robert West3, Markus Strohmaier2,4, and Leila Zia1 1Wikimedia Foundation; fi[email protected] 2RWTH Aachen University; fi[email protected] 3EPFL; fi[email protected] 4GESIS - Leibniz Institute for the Social Sciences; fi[email protected] ABSTRACT Wikipedia represents the largest and most popular source of encyclopedic knowledge in the world today, aiming to provide equal access to information worldwide. From a global online survey of 65,031 readers of Wikipedia and their corresponding reading logs, we present novel evidence of gender differences in Wikipedia readership and how they manifest in records of user behavior. More specifically we report that (1) women are underrepresented among readers of Wikipedia, (2) women view fewer pages per reading session than men do, (3) men and women visit Wikipedia for similar reasons, and (4) men and women exhibit specific topical preferences. Our findings lay the foundation for identifying pathways toward knowledge equity in the usage of online encyclopedic knowledge. Equal access to encyclopedic knowledge represents a critical prerequisite for promoting equality more broadly in society and for an open and informed public at large. With almost 54 million articles written by roughly 500,000 monthly editors across more than 160 actively edited languages, Wikipedia is the most important source of encyclopedic knowledge for a global audience, and one of the most important knowledge resources available on the internet. Every month, Wikipedia attracts users on more than 1.5 billion unique devices from across the globe, for a total of more than 15 billion monthly pageviews.1 Data about who is represented in this readership provides unique insight into the accessibility of encyclopedic knowledge worldwide.
    [Show full text]
  • Diversity of Visual Encyclopedic Knowledge Across Wikipedia Language Editions
    Proceedings of the Twelfth International AAAI Conference on Web and Social Media (ICWSM 2018) The Tower of Babel.jpg: Diversity of Visual Encyclopedic Knowledge Across Wikipedia Language Editions Shiqing He Allen Yilun Lin Eytan Adar Brent Hecht University of Michigan Northwestern University University of Michigan Northwestern University Ann Arbor, MI 48109 Evanston, IL 60208 Ann Arbor, MI 48109 Evanston, IL 60208 Abstract Knight 2017)) was first observed by examining the same AI Across all Wikipedia language editions, millions of im- technology operating on different Wikipedia language edi- ages augment text in critical ways. This visual encyclope- tions (Hecht and Gergle 2010). dic knowledge is an important form of wikiwork for edi- Research on language edition diversity has focused on tors, a critical part of reader experience, an emerging re- comparing article text and links across language editions. source for machine learning, and a lens into cultural differ- However, the encyclopedic knowledge in Wikipedia is mul- ences. However, Wikipedia research–and cross-language edi- timedia in nature and, in addition to text and links, consists tion Wikipedia research in particular–has thus far been lim- of images and other media. In this paper, we seek to extend ited to text. In this paper, we assess the diversity of visual en- the literature on Wikipedia language edition diversity to the cyclopedic knowledge across 25 language editions and com- critical medium of imagery. More formally, this paper aims pare our findings to those reported for textual content. Un- like text, translation in images is largely unnecessary. Addi- to measure and characterize the similarities and differences tionally, the Wikimedia Foundation, through the Wikipedia in visual encyclopedia knowledge across language editions.
    [Show full text]
  • Towards a Romanian End-To-End Automatic Speech Recognition Based on Deepspeech2
    THE PUBLISHING HOUSE PROCEEDINGS OF THE ROMANIAN ACADEMY, Series A, OF THE ROMANIAN ACADEMY Volume 21, Number 4/2020, pp. 395–402 TOWARDS A ROMANIAN END-TO-END AUTOMATIC SPEECH RECOGNITION BASED ON DEEPSPEECH2 Andrei-Marius AVRAM, Vasile PAIŞ, Dan TUFIŞ Research Institute for Artificial Intelligence, Romanian Academy Corresponding author: Dan Tufiş, E-mail: [email protected] Abstract. This paper presents an implementation of an ASR system for the Romanian language that uses a multi-layer neural network architecture to transcribe the input speech, augmented with a statistical language model to improve the correctness of the transcriptions. The neural model was trained on 230 hours of speech for 70 epochs and, together with the language model, it achieved a word error rate of 9.91% and a character error rate of 2.81%. We further tested the response time of the ASR and we obtained an average latency of 70ms, while running on a GPU, which can be suitable for near real-time transcriptions. Key words: automatic speech recognition, Romanian, natural language processing, deep neural networks. 1. INTRODUCTION Automatic Speech Recognition (ASR) consists in translating human spoken utterances into a textual transcript, and it is a key component in voice assistants such as Apple’s Siri, Amazon’s Alexa, Microsoft’s Cortana or Google’s Assistant (Sainath et al. [23], Lopatovska et al. [25]), in spoken language translation systems (Di Gangi et al. [8]) or in generating automatic transcriptions for audio and videos (Noda et al. [27]). Most of the ASR systems before the deep learning revolution used variations of Hidden Markov Models (HMM) (Garg et al.
    [Show full text]