Early Onset of Structural Inequality in the Formation of Collaborative Knowledge, Wikipedia
Total Page:16
File Type:pdf, Size:1020Kb

Load more
Recommended publications
-
Dictionary Users Do Look up Frequent Words. a Logfile Analysis
Erschienen in: Müller-Spitzer, Carolin (Hrsg.): Using Online Dictionaries. Berlin/ Boston: de Gruyter, 2014. (Lexicographica: Series Maior 145), S. 229-249. Alexander Koplenig, Peter Meyer, Carolin Müller-Spitzer Dictionary users do look up frequent words. A logfile analysis Abstract: In this paper, we use the 2012 log files of two German online dictionaries (Digital Dictionary of the German Language1 and the German Version of Wiktionary) and the 100,000 most frequent words in the Mannheim German Reference Corpus from 2009 to answer the question of whether dictionary users really do look up fre- quent words, first asked by de Schryver et al. (2006). By using an approach to the comparison of log files and corpus data which is completely different from that of the aforementioned authors, we provide empirical evidence that indicates - contra - ry to the results of de Schryver et al. and Verlinde/Binon (2010) - that the corpus frequency of a word can indeed be an important factor in determining what online dictionary users look up. Finally, we incorporate word dass Information readily available in Wiktionary into our analysis to improve our results considerably. Keywords: log file, frequency, corpus, headword list, monolingual dictionary, multi- lingual dictionary Alexander Koplenig: Institut für Deutsche Sprache, R 5, 6-13, 68161 Mannheim, +49-(0)621-1581- 435, [email protected] Peter Meyer: Institut für Deutsche Sprache, R 5, 6-13, 68161 Mannheim, +49-(0)621-1581-427, [email protected] Carolin Müller-Spitzer: Institut für Deutsche Sprache, R 5, 6-13, 68161 Mannheim, +49-(0)621-1581- 429, [email protected] Introduction We would like to Start this chapter by asking one of the most fundamental questions for any general lexicographical endeavour to describe the words of one (or more) language(s): which words should be included in a dictionary? At first glance, the answer seems rather simple (especially when the primary objective is to describe a language as completely as possible): it would be best to include every word in the dictionary. -
LATEX for Beginners
LATEX for Beginners Workbook Edition 5, March 2014 Document Reference: 3722-2014 Preface This is an absolute beginners guide to writing documents in LATEX using TeXworks. It assumes no prior knowledge of LATEX, or any other computing language. This workbook is designed to be used at the `LATEX for Beginners' student iSkills seminar, and also for self-paced study. Its aim is to introduce an absolute beginner to LATEX and teach the basic commands, so that they can create a simple document and find out whether LATEX will be useful to them. If you require this document in an alternative format, such as large print, please email [email protected]. Copyright c IS 2014 Permission is granted to any individual or institution to use, copy or redis- tribute this document whole or in part, so long as it is not sold for profit and provided that the above copyright notice and this permission notice appear in all copies. Where any part of this document is included in another document, due ac- knowledgement is required. i ii Contents 1 Introduction 1 1.1 What is LATEX?..........................1 1.2 Before You Start . .2 2 Document Structure 3 2.1 Essentials . .3 2.2 Troubleshooting . .5 2.3 Creating a Title . .5 2.4 Sections . .6 2.5 Labelling . .7 2.6 Table of Contents . .8 3 Typesetting Text 11 3.1 Font Effects . 11 3.2 Coloured Text . 11 3.3 Font Sizes . 12 3.4 Lists . 13 3.5 Comments & Spacing . 14 3.6 Special Characters . 15 4 Tables 17 4.1 Practical . -
Position Description Addenda
POSITION DESCRIPTION January 2014 Wikimedia Foundation Executive Director - Addenda The Wikimedia Foundation is a radically transparent organization, and much information can be found at www.wikimediafoundation.org . That said, certain information might be particularly useful to nominators and prospective candidates, including: Announcements pertaining to the Wikimedia Foundation Executive Director Search Kicking off the search for our next Executive Director by Former Wikimedia Foundation Board Chair Kat Walsh An announcement from Wikimedia Foundation ED Sue Gardner by Wikimedia Executive Director Sue Gardner Video Interviews on the Wikimedia Community and Foundation and Its History Some of the values and experiences of the Wikimedia Community are best described directly by those who have been intimately involved in the organization’s dramatic expansion. The following interviews are available for viewing though mOppenheim.TV . • 2013 Interview with Former Wikimedia Board Chair Kat Walsh • 2013 Interview with Wikimedia Executive Director Sue Gardner • 2009 Interview with Wikimedia Executive Director Sue Gardner Guiding Principles of the Wikimedia Foundation and the Wikimedia Community The following article by Sue Gardner, the current Executive Director of the Wikimedia Foundation, has received broad distribution and summarizes some of the core cultural values shared by Wikimedia’s staff, board and community. Topics covered include: • Freedom and open source • Serving every human being • Transparency • Accountability • Stewardship • Shared power • Internationalism • Free speech • Independence More information can be found at: https://meta.wikimedia.org/wiki/User:Sue_Gardner/Wikimedia_Foundation_Guiding_Principles Wikimedia Policies The Wikimedia Foundation has an extensive list of policies and procedures available online at: http://wikimediafoundation.org/wiki/Policies Wikimedia Projects All major projects of the Wikimedia Foundation are collaboratively developed by users around the world using the MediaWiki software. -
Danish Resources
Danish resources Finn Arup˚ Nielsen November 19, 2017 Abstract A range of different Danish resources, datasets and tools, are presented. The focus is on resources for use in automated computational systems and free resources that can be redistributed and used in commercial applications. Contents 1 Corpora3 1.1 Wikipedia...................................3 1.2 Wikisource...................................3 1.3 Wikiquote...................................4 1.4 ADL......................................4 1.5 Gutenberg...................................4 1.6 Runeberg...................................5 1.7 Europarl....................................5 1.8 Leipzig Corpora Collection..........................5 1.9 Danish Dependency Treebank........................6 1.10 Retsinformation................................6 1.11 Other resources................................6 2 Lexical resources6 2.1 DanNet....................................6 2.2 Wiktionary..................................7 2.3 Wikidata....................................7 2.4 OmegaWiki..................................8 2.5 Other lexical resources............................8 2.6 Wikidata examples with medical terminology extraction.........8 3 Natural language processing tools9 3.1 NLTK.....................................9 3.2 Polyglot....................................9 3.3 spaCy.....................................9 3.4 Apache OpenNLP............................... 10 3.5 Centre for Language Technology....................... 10 3.6 Other libraries................................ -
Etytree: a Graphical and Interactive Etymology Dictionary Based on Wiktionary
Etytree: A Graphical and Interactive Etymology Dictionary Based on Wiktionary Ester Pantaleo Vito Walter Anelli Wikimedia Foundation grantee Politecnico di Bari Italy Italy [email protected] [email protected] Tommaso Di Noia Gilles Sérasset Politecnico di Bari Univ. Grenoble Alpes, CNRS Italy Grenoble INP, LIG, F-38000 Grenoble, France [email protected] [email protected] ABSTRACT a new method1 that parses Etymology, Derived terms, De- We present etytree (from etymology + family tree): a scendants sections, the namespace for Reconstructed Terms, new on-line multilingual tool to extract and visualize et- and the etymtree template in Wiktionary. ymological relationships between words from the English With etytree, a RDF (Resource Description Framework) Wiktionary. A first version of etytree is available at http: lexical database of etymological relationships collecting all //tools.wmflabs.org/etytree/. the extracted relationships and lexical data attached to lex- With etytree users can search a word and interactively emes has also been released. The database consists of triples explore etymologically related words (ancestors, descendants, or data entities composed of subject-predicate-object where cognates) in many languages using a graphical interface. a possible statement can be (for example) a triple with a lex- The data is synchronised with the English Wiktionary dump eme as subject, a lexeme as object, and\derivesFrom"or\et- at every new release, and can be queried via SPARQL from a ymologicallyEquivalentTo" as predicate. The RDF database Virtuoso endpoint. has been exposed via a SPARQL endpoint and can be queried Etytree is the first graphical etymology dictionary, which at http://etytree-virtuoso.wmflabs.org/sparql. -
On the Evolution of Wikipedia
On the Evolution of Wikipedia Rodrigo B. Almeida Barzan Mozafari Junghoo Cho UCLA Computer Science UCLA Computer Science UCLA Computer Science Department Department Department Los Angeles - USA Los Angeles - USA Los Angeles - USA [email protected] [email protected] [email protected] Abstract time. So far, several studies have focused on understanding A recent phenomenon on the Web is the emergence and pro- and characterizing the evolution of this huge repository of liferation of new social media systems allowing social inter- data [5, 11]. action between people. One of the most popular of these Recently, a new phenomenon, called social systems, has systems is Wikipedia that allows users to create content in a emerged from the Web. Generally speaking, such systems collaborative way. Despite its current popularity, not much allow people not only to create content, but also to easily is known about how users interact with Wikipedia and how interact and collaborate with each other. Examples of such it has evolved over time. systems are: (1) Social network systems such as MySpace In this paper we aim to provide a first, extensive study of or Orkut that allow users to participate in a social network the user behavior on Wikipedia and its evolution. Compared by creating their profiles and indicating their acquaintances; to prior studies, our work differs in several ways. First, previ- (2) Collaborative bookmarking systems such as Del.icio.us or ous studies on the analysis of the user workloads (for systems Yahoo’s MyWeb in which users are allowed to share their such as peer-to-peer systems [10] and Web servers [2]) have bookmarks; and (3) Wiki systems that allow collaborative mainly focused on understanding the users who are accessing management of Web sites. -
User Contributions to Online Dictionaries Andrea Abel and Christian M
The dynamics outside the paper: User Contributions to Online Dictionaries Andrea Abel and Christian M. Meyer Electronic lexicography in the 21st century (eLex): Thinking outside the paper, Tallinn, Estonia, October 17–19, 2013. 18.10.2013 | European Academy of Bozen/Bolzano and Technische Universität Darmstadt | Andrea Abel and Christian M. Meyer | 1 Introduction Online dictionaries rely increasingly on their users and leverage methods for facilitating user contributions at basically any step of the lexicographic process. 18.10.2013 | European Academy of Bozen/Bolzano and Technische Universität Darmstadt | Andrea Abel and Christian M. Meyer | 2 Previous work . Mostly focused on specific type/dimension of user contribution . e.g., Carr (1997), Storrer (1998, 2010), Køhler Simonsen (2005), Fuertes-Olivera (2009), Melchior (2012), Lew (2011, 2013) . Ambiguous, partly overlapping terms: www.wordle.net/show/wrdl/7124863/User_contributions (02.10.2013) www.wordle.net/show/wrdl/7124863/User_contributions http:// 18.10.2013 | European Academy of Bozen/Bolzano and Technische Universität Darmstadt | Andrea Abel and Christian M. Meyer | 3 Research goals and contribution How to effectively plan the user contributions for new and established dictionaries? . Comprehensive and systematic classification is still missing . Mann (2010): analysis of 88 online dictionaries . User contributions roughly categorized as direct / indirect contributions and exchange with other dictionary users . Little detail given, since outside the scope of the analysis . We use this as a basis for our work We propose a novel classification of the different types of user contributions based on many practical examples 18.10.2013 | European Academy of Bozen/Bolzano and Technische Universität Darmstadt | Andrea Abel and Christian M. -
100+ Ways to Assess
1 2 100+ Ways to Assess: 1. Annotated Bibliography: a list of citations followed by a brief descriptive and evaluative paragraph, the purpose of which is to inform the reader of the relevance, accuracy, and quality of the sources cited. 2. Assessment Stations: students rotate between different stations which have different questions to answer at them; e.g. a science practical assessment where questions relate to particular artefacts at each station. 3. Book Review: gives the reader a concise summary of the content including a relevant description of the topic as well as its overall perspective, argument, or purpose. Second, and more importantly, a review offers a critical assessment of the content. 4. Business Plan: a formal statement of business goals, reasons they are attainable, and plans for reaching them. It may also contain background information about the organization or team attempting to reach those goals. 5. Capstone Project: students pursue independent research on a question or problem of their choice, engage with the scholarly debates in the relevant disciplines, and produce a substantial thesis/dissertation providing a deep understanding of the topic. 6. Case Study: detailed examination of a subject of study (the case), as well as its related contextual conditions (often associated with medicine or law). 7. Clinical Observation: observation and feedback for teaching and evaluating medical student communication skills between trainee physician and patient may involve role plays or actual trainee-patient interactions. 8. Concept Map: a type of graphic organizer that can used to organize and represent knowledge of a subject. Concept maps begin with a main idea (or concept) and then branch out to show how that main idea can be broken down into specific topics. -
Declaring Criminal Convictions Us Visa
Declaring Criminal Convictions Us Visa Saccharoid Saxon sometimes renegotiated any staining farms pianissimo. Tongue-tied Zechariah editorializing impulsively. Self-respecting and valved Ez defends so rudely that Gay cram his Somme. Some us visa issued, blanket rules which has already registered Criminal Record & Travelling Pardons Canada. Which criminal convictions affect your US visa application The tide of crimes that her cause a visa application to be denied is baffled but these. Now an unspent criminal records eligible for a gardener. Your behalf of a conviction does not have the day and the content on his way inside the. TravelStateGov US DEPARTMENT under STATE BUREAU of CONSULAR AFFAIRS. You may liquid get rich criminal driving record from VicRoads for traffic offences. And individuals subject being mandatory detention based on criminal grounds. Visas to Brazil Consulado-Geral do Brasil em Londres. Gonzalez's testimony ultimately resulted in the convictions of four. If you prepare an unspent criminal conviction there may develop to declare counter to us when we make you simply offer this you wish to accept is a bare conviction will. Policing Non-Citizens. Brent & Powell Immigration Law Success Stories. At the time a visa petition was approved on August 19 1953 declaring him to. Glyn and criminal records who must log in using the declaration, visas to violence against not. Our Blog Goldstein Immigration Lawyers. Declaring Criminal Convictions What have Those Fines. With criminal conviction? This visa application to declare any declared bankruptcy showing that they would not declaring a declaration must log in using plain text. You suck be cash that having a void conviction may result in visa restrictions. -
Wiktionary Matcher
Wiktionary Matcher Jan Portisch1;2[0000−0001−5420−0663], Michael Hladik2[0000−0002−2204−3138], and Heiko Paulheim1[0000−0003−4386−8195] 1 Data and Web Science Group, University of Mannheim, Germany fjan, [email protected] 2 SAP SE Product Engineering Financial Services, Walldorf, Germany fjan.portisch, [email protected] Abstract. In this paper, we introduce Wiktionary Matcher, an ontology matching tool that exploits Wiktionary as external background knowl- edge source. Wiktionary is a large lexical knowledge resource that is collaboratively built online. Multiple current language versions of Wik- tionary are merged and used for monolingual ontology matching by ex- ploiting synonymy relations and for multilingual matching by exploiting the translations given in the resource. We show that Wiktionary can be used as external background knowledge source for the task of ontology matching with reasonable matching and runtime performance.3 Keywords: Ontology Matching · Ontology Alignment · External Re- sources · Background Knowledge · Wiktionary 1 Presentation of the System 1.1 State, Purpose, General Statement The Wiktionary Matcher is an element-level, label-based matcher which uses an online lexical resource, namely Wiktionary. The latter is "[a] collaborative project run by the Wikimedia Foundation to produce a free and complete dic- tionary in every language"4. The dictionary is organized similarly to Wikipedia: Everybody can contribute to the project and the content is reviewed in a com- munity process. Compared to WordNet [4], Wiktionary is significantly larger and also available in other languages than English. This matcher uses DBnary [15], an RDF version of Wiktionary that is publicly available5. The DBnary data set makes use of an extended LEMON model [11] to describe the data. -
Jimmy Wales and Larry Sanger, It Is the Largest, Fastest-Growing and Most Popular General Reference Work Currently Available on the Internet
Tomasz „Polimerek” Ganicz Wikimedia Polska WikipediaWikipedia andand otherother WikimediaWikimedia projectsprojects WhatWhat isis Wikipedia?Wikipedia? „Imagine„Imagine aa worldworld inin whichwhich everyevery singlesingle humanhuman beingbeing cancan freelyfreely shareshare inin thethe sumsum ofof allall knowledge.knowledge. That'sThat's ourour commitment.”commitment.” JimmyJimmy „Jimbo”„Jimbo” Wales Wales –– founder founder ofof WikipediaWikipedia As defined by itself: Wikipedia is a free multilingual, open content encyclopedia project operated by the non-profit Wikimedia Foundation. Its name is a blend of the words wiki (a technology for creating collaborative websites) and encyclopedia. Launched in January 2001 by Jimmy Wales and Larry Sanger, it is the largest, fastest-growing and most popular general reference work currently available on the Internet. OpenOpen and and free free content content RichardRichard StallmanStallman definition definition of of free free software: software: „The„The wordword "free""free" inin ourour namename doesdoes notnot referrefer toto price;price; itit refersrefers toto freedom.freedom. First,First, thethe freedomfreedom toto copycopy aa programprogram andand redistributeredistribute itit toto youryour neighbors,neighbors, soso thatthat theythey cancan useuse itit asas wellwell asas you.you. Second,Second, thethe freedomfreedom toto changechange aa program,program, soso ththatat youyou cancan controlcontrol itit insteadinstead ofof itit controllingcontrolling you;you; forfor this,this, thethe sourcesource -
Instructor Basics: Howtouse Wikipedia As Ateaching Tool
Instructor Basics: How to use Wikipedia as a teaching tool Wiki Education Foundation Wikipedia is the free online encyclopedia that anyone can edit. One of the most visited websites worldwide, Wikipedia is a resource used by most university students. Increasingly, many instructors around the world have used Wikipedia as a teaching tool in their university classrooms as well. In this brochure, we bring together their experiences to help you determine how to use Wikipedia in your classroom. We’ve organized the brochure into three parts: Assignment planning Learn key Wikipedia policies and get more information on designing assignments, with a focus on asking students to write Wikipedia articles for class. During the term Learn about the structure of a good Wikipedia article, the kinds of articles students should choose to improve, suggestions for what to cover in a Wikipedia lab session, and how to interact with the community of Wikipedia editors. After the term See a sample assessment structure that’s worked for other instructors. 2 Instructor Basics Assignment planning Understanding key policies Since Wikipedia started in 2001, the community of volunteer editors – “Wikipedians” – has developed several key policies designed to ensure Wikipedia is as reliable and useful as possible. Any assignment you integrate into your classroom must follow these policies. Understanding these cornerstone policies ensures that you develop an assignment that meets your learning objectives and improves Wikipedia at the same time. Free content Neutral point of view “The work students contribute to “Everything on Wikipedia must be Wikipedia is free content and becomes written from a neutral point of view.