The Wiki Family of Web Sites
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Dictionary Users Do Look up Frequent Words. a Logfile Analysis
Erschienen in: Müller-Spitzer, Carolin (Hrsg.): Using Online Dictionaries. Berlin/ Boston: de Gruyter, 2014. (Lexicographica: Series Maior 145), S. 229-249. Alexander Koplenig, Peter Meyer, Carolin Müller-Spitzer Dictionary users do look up frequent words. A logfile analysis Abstract: In this paper, we use the 2012 log files of two German online dictionaries (Digital Dictionary of the German Language1 and the German Version of Wiktionary) and the 100,000 most frequent words in the Mannheim German Reference Corpus from 2009 to answer the question of whether dictionary users really do look up fre- quent words, first asked by de Schryver et al. (2006). By using an approach to the comparison of log files and corpus data which is completely different from that of the aforementioned authors, we provide empirical evidence that indicates - contra - ry to the results of de Schryver et al. and Verlinde/Binon (2010) - that the corpus frequency of a word can indeed be an important factor in determining what online dictionary users look up. Finally, we incorporate word dass Information readily available in Wiktionary into our analysis to improve our results considerably. Keywords: log file, frequency, corpus, headword list, monolingual dictionary, multi- lingual dictionary Alexander Koplenig: Institut für Deutsche Sprache, R 5, 6-13, 68161 Mannheim, +49-(0)621-1581- 435, [email protected] Peter Meyer: Institut für Deutsche Sprache, R 5, 6-13, 68161 Mannheim, +49-(0)621-1581-427, [email protected] Carolin Müller-Spitzer: Institut für Deutsche Sprache, R 5, 6-13, 68161 Mannheim, +49-(0)621-1581- 429, [email protected] Introduction We would like to Start this chapter by asking one of the most fundamental questions for any general lexicographical endeavour to describe the words of one (or more) language(s): which words should be included in a dictionary? At first glance, the answer seems rather simple (especially when the primary objective is to describe a language as completely as possible): it would be best to include every word in the dictionary. -
LATEX for Beginners
LATEX for Beginners Workbook Edition 5, March 2014 Document Reference: 3722-2014 Preface This is an absolute beginners guide to writing documents in LATEX using TeXworks. It assumes no prior knowledge of LATEX, or any other computing language. This workbook is designed to be used at the `LATEX for Beginners' student iSkills seminar, and also for self-paced study. Its aim is to introduce an absolute beginner to LATEX and teach the basic commands, so that they can create a simple document and find out whether LATEX will be useful to them. If you require this document in an alternative format, such as large print, please email [email protected]. Copyright c IS 2014 Permission is granted to any individual or institution to use, copy or redis- tribute this document whole or in part, so long as it is not sold for profit and provided that the above copyright notice and this permission notice appear in all copies. Where any part of this document is included in another document, due ac- knowledgement is required. i ii Contents 1 Introduction 1 1.1 What is LATEX?..........................1 1.2 Before You Start . .2 2 Document Structure 3 2.1 Essentials . .3 2.2 Troubleshooting . .5 2.3 Creating a Title . .5 2.4 Sections . .6 2.5 Labelling . .7 2.6 Table of Contents . .8 3 Typesetting Text 11 3.1 Font Effects . 11 3.2 Coloured Text . 11 3.3 Font Sizes . 12 3.4 Lists . 13 3.5 Comments & Spacing . 14 3.6 Special Characters . 15 4 Tables 17 4.1 Practical . -
Etytree: a Graphical and Interactive Etymology Dictionary Based on Wiktionary
Etytree: A Graphical and Interactive Etymology Dictionary Based on Wiktionary Ester Pantaleo Vito Walter Anelli Wikimedia Foundation grantee Politecnico di Bari Italy Italy [email protected] [email protected] Tommaso Di Noia Gilles Sérasset Politecnico di Bari Univ. Grenoble Alpes, CNRS Italy Grenoble INP, LIG, F-38000 Grenoble, France [email protected] [email protected] ABSTRACT a new method1 that parses Etymology, Derived terms, De- We present etytree (from etymology + family tree): a scendants sections, the namespace for Reconstructed Terms, new on-line multilingual tool to extract and visualize et- and the etymtree template in Wiktionary. ymological relationships between words from the English With etytree, a RDF (Resource Description Framework) Wiktionary. A first version of etytree is available at http: lexical database of etymological relationships collecting all //tools.wmflabs.org/etytree/. the extracted relationships and lexical data attached to lex- With etytree users can search a word and interactively emes has also been released. The database consists of triples explore etymologically related words (ancestors, descendants, or data entities composed of subject-predicate-object where cognates) in many languages using a graphical interface. a possible statement can be (for example) a triple with a lex- The data is synchronised with the English Wiktionary dump eme as subject, a lexeme as object, and\derivesFrom"or\et- at every new release, and can be queried via SPARQL from a ymologicallyEquivalentTo" as predicate. The RDF database Virtuoso endpoint. has been exposed via a SPARQL endpoint and can be queried Etytree is the first graphical etymology dictionary, which at http://etytree-virtuoso.wmflabs.org/sparql. -
User Contributions to Online Dictionaries Andrea Abel and Christian M
The dynamics outside the paper: User Contributions to Online Dictionaries Andrea Abel and Christian M. Meyer Electronic lexicography in the 21st century (eLex): Thinking outside the paper, Tallinn, Estonia, October 17–19, 2013. 18.10.2013 | European Academy of Bozen/Bolzano and Technische Universität Darmstadt | Andrea Abel and Christian M. Meyer | 1 Introduction Online dictionaries rely increasingly on their users and leverage methods for facilitating user contributions at basically any step of the lexicographic process. 18.10.2013 | European Academy of Bozen/Bolzano and Technische Universität Darmstadt | Andrea Abel and Christian M. Meyer | 2 Previous work . Mostly focused on specific type/dimension of user contribution . e.g., Carr (1997), Storrer (1998, 2010), Køhler Simonsen (2005), Fuertes-Olivera (2009), Melchior (2012), Lew (2011, 2013) . Ambiguous, partly overlapping terms: www.wordle.net/show/wrdl/7124863/User_contributions (02.10.2013) www.wordle.net/show/wrdl/7124863/User_contributions http:// 18.10.2013 | European Academy of Bozen/Bolzano and Technische Universität Darmstadt | Andrea Abel and Christian M. Meyer | 3 Research goals and contribution How to effectively plan the user contributions for new and established dictionaries? . Comprehensive and systematic classification is still missing . Mann (2010): analysis of 88 online dictionaries . User contributions roughly categorized as direct / indirect contributions and exchange with other dictionary users . Little detail given, since outside the scope of the analysis . We use this as a basis for our work We propose a novel classification of the different types of user contributions based on many practical examples 18.10.2013 | European Academy of Bozen/Bolzano and Technische Universität Darmstadt | Andrea Abel and Christian M. -
100+ Ways to Assess
1 2 100+ Ways to Assess: 1. Annotated Bibliography: a list of citations followed by a brief descriptive and evaluative paragraph, the purpose of which is to inform the reader of the relevance, accuracy, and quality of the sources cited. 2. Assessment Stations: students rotate between different stations which have different questions to answer at them; e.g. a science practical assessment where questions relate to particular artefacts at each station. 3. Book Review: gives the reader a concise summary of the content including a relevant description of the topic as well as its overall perspective, argument, or purpose. Second, and more importantly, a review offers a critical assessment of the content. 4. Business Plan: a formal statement of business goals, reasons they are attainable, and plans for reaching them. It may also contain background information about the organization or team attempting to reach those goals. 5. Capstone Project: students pursue independent research on a question or problem of their choice, engage with the scholarly debates in the relevant disciplines, and produce a substantial thesis/dissertation providing a deep understanding of the topic. 6. Case Study: detailed examination of a subject of study (the case), as well as its related contextual conditions (often associated with medicine or law). 7. Clinical Observation: observation and feedback for teaching and evaluating medical student communication skills between trainee physician and patient may involve role plays or actual trainee-patient interactions. 8. Concept Map: a type of graphic organizer that can used to organize and represent knowledge of a subject. Concept maps begin with a main idea (or concept) and then branch out to show how that main idea can be broken down into specific topics. -
Wiktionary Matcher
Wiktionary Matcher Jan Portisch1;2[0000−0001−5420−0663], Michael Hladik2[0000−0002−2204−3138], and Heiko Paulheim1[0000−0003−4386−8195] 1 Data and Web Science Group, University of Mannheim, Germany fjan, [email protected] 2 SAP SE Product Engineering Financial Services, Walldorf, Germany fjan.portisch, [email protected] Abstract. In this paper, we introduce Wiktionary Matcher, an ontology matching tool that exploits Wiktionary as external background knowl- edge source. Wiktionary is a large lexical knowledge resource that is collaboratively built online. Multiple current language versions of Wik- tionary are merged and used for monolingual ontology matching by ex- ploiting synonymy relations and for multilingual matching by exploiting the translations given in the resource. We show that Wiktionary can be used as external background knowledge source for the task of ontology matching with reasonable matching and runtime performance.3 Keywords: Ontology Matching · Ontology Alignment · External Re- sources · Background Knowledge · Wiktionary 1 Presentation of the System 1.1 State, Purpose, General Statement The Wiktionary Matcher is an element-level, label-based matcher which uses an online lexical resource, namely Wiktionary. The latter is "[a] collaborative project run by the Wikimedia Foundation to produce a free and complete dic- tionary in every language"4. The dictionary is organized similarly to Wikipedia: Everybody can contribute to the project and the content is reviewed in a com- munity process. Compared to WordNet [4], Wiktionary is significantly larger and also available in other languages than English. This matcher uses DBnary [15], an RDF version of Wiktionary that is publicly available5. The DBnary data set makes use of an extended LEMON model [11] to describe the data. -
教學大綱 098 1 2769 Public Domain 公版著作
朝陽科技大學 098學年度第1學期教學大綱 Public Domain 公版著作 當 2769 2769 期 Course Number 課 號 授 毛慶禎 Mao,Ching Chen 課 Instructor 教 師 中 公版著作 Public Domain 文 Course Name 課 名 開 資訊管理系(四日)四C 課 Department 單 位 修 選修 Elective 習 Required/Elective 別 學 2 2 分 Credits 數 1. 公版著作物就是著作財產權消滅之著作,依照著作權法 1. Public domain materials are those whose copyrights 的規定,以自由利用為原則。著作財產權的存續期,以著 have expired and can be freely used. Copyright of 課 作人之生存期間及其死亡後五十年為限,攝影、視聽、錄 creative works extend up to 50 years after the death of 程 音及表演之著作財產權的存續期,以公開發表後五十年為 Objectives the author. Copyright of photos, videos, recordings, and 目 限。 performances extend up to 50 years after the first 標 2. 檢視現有的公版著作,蒐集整理。 publication. 2. Examine and organize existing public domain materials. 參考資源 http://sites.google.com/site/maolins/teaching/pd 1. The Public Domain: Enclosing the Commons of the Mind [公領域: 納入共用的思維] / James Boyle. -- Yale University Press (December 9, 2008). -- 336 p. -- ISBN-10: 0300137400, ISBN-13: 978-0300137408 [PDF] [HTML], http://www.thepublicdomain.org/ 2. 自由資訊概論, http://www.lins.fju.edu.tw/mao/works/freeinformation4lac.htm 3. 自由資訊概論, http://www.lins.fju.edu.tw/mao/works/mtp4www.htm, 2004/7 for PCOffice 4. 公版著作物 / 毛慶禎, 2003/09/1, http://www.lins.fju.edu.tw/mao/works/fspd.htm 5. 古騰堡計畫, 2003/02/19, http://www.lins.fju.edu.tw/mao/foi/pg.htm 6. 海盜灣(Pirate Bay), http://thepiratebay.org/ 7. TPB Tracker Geo Statistic, http://geo.keff.org/ 8. 開放式課程計畫, http://www.myoops.org/twocw/ 9. 合法下載何必盜版 10. 維基百科, http://tinyurl.com/wikipediataiwan 學英文救饑荒, http://www.freerice.com/ 古騰堡計畫, http://blue.lins.fju.edu.tw/~mao/foi/pg.htm 11. -
Generating Openmath Content Dictionaries from Wikidata
Generating OpenMath Content Dictionaries from Wikidata Moritz Schubotz Dept. of Computer and Information Science, University of Konstanz, Box 76, 78464 Konstanz, Germany, [email protected] Abstract OpenMath content dictionaries are collections of mathematical sym- bols. Traditionally, content dictionaries are handcrafted by experts. The OpenMath specification requires a name and a textual description in English for each symbol in a dictionary. In our recently published MathML benchmark (MathMLBen), we represent mathematical for- mulae in Content MathML referring to Wikidata as the knowledge base for the grounding of the semantics. Based on this benchmark, we present an OpenMath content dictionary, which we generated auto- matically from Wikidata. Our Wikidata content dictionary consists of 330 entries. We used the 280 entries of the benchmark MathMLBen, as well as 50 entries that correspond to already existing items in the official OpenMath content dictionary entries. To create these items, we proposed the Wikidata property P5610. With this property, everyone can link OpenMath symbols and Wikidata items. By linking Wikidata and OpenMath data, the multilingual community maintained textual descriptions, references to Wikipedia articles, external links to other knowledge bases (such as the Wolfram Functions Site) are connected to the expert crafted OpenMath content dictionaries. Ultimately, these connections form a new content dictionary base. This provides multi- lingual background information for symbols in MathML formulae. 1 Introduction and Prior Works Traditionally, mathematical formulae occur in a textual or situational context. Human readers infer the meaning of formulae from their layout and the context. An essential task in mathematical information retrieval (MathIR) is to mimic parts of this process to automate MathIR tasks. -
February 5, 2011
February Arguing the law with Nicolaus Everardi 5, 2011 Posted by rechtsgeschiedenis under Digital editions | Tags: Bibliography,Digital libraries, Great Council of Malines, Legal history, Medieval law,Netherlands, Rare books In the early sixteenth century some changes become already visible in the way lawyers approached the law. Not only was there a growing interest in the history of Roman and canon law, but lawyers began to free themselves from the framework offered by these legal systems. One of the signs of this are the titles of legal treatises, the growth itself of this genre, and a more systematic approach of law. Nicolaus Everardi’s book on legal argumentation, his Topicorum seu de locis legalibus liber (Louvain 1516) is an example of this development. The book of this Dutch lawyer who presided the Court of Holland and the Great Council of Malines became almost a bestseller because of the reprints published everywhere in Europe. Printers in Bologna, Basel, Paris, Lyon, Strasbourg, Venice, Frankfurt am Main and Cologne printed this book until the mid-seventeenth century. I have found eight reprints of the first edition and eighteen of the second edition. On the blog of the Arbeitsgemeinschaft Frühe Neuzeit Klaus Graf recently criticized sharply the new database Early Modern Thought Online (EMTO) of the Fernuniversität Hagen that enables you to search for editions of texts in the broad field of early modern philosophy and thought. The EMTO database notes in the search results the availability of online versions. In this respect Graf saw major shortcomings, because EMTO does not harvest its results from some of the major sources for early modern texts online. -
Extracting an Etymological Database from Wiktionary Benoît Sagot
Extracting an Etymological Database from Wiktionary Benoît Sagot To cite this version: Benoît Sagot. Extracting an Etymological Database from Wiktionary. Electronic Lexicography in the 21st century (eLex 2017), Sep 2017, Leiden, Netherlands. pp.716-728. hal-01592061 HAL Id: hal-01592061 https://hal.inria.fr/hal-01592061 Submitted on 22 Sep 2017 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Extracting an Etymological Database from Wiktionary Benoît Sagot Inria 2 rue Simone Iff, 75012 Paris, France E-mail: [email protected] Abstract Electronic lexical resources almost never contain etymological information. The availability of such information, if properly formalised, could open up the possibility of developing automatic tools targeted towards historical and comparative linguistics, as well as significantly improving the automatic processing of ancient languages. We describe here the process we implemented for extracting etymological data from the etymological notices found in Wiktionary. We have produced a multilingual database of nearly one million lexemes and a database of more than half a million etymological relations between lexemes. Keywords: Lexical resource development; etymology; Wiktionary 1. Introduction Electronic lexical resources used in the fields of natural language processing and com- putational linguistics are almost exclusively synchronic resources; they mostly include information about inflectional, derivational, syntactic, semantic or even pragmatic prop- erties of their entries. -
Jimmy Wales and Larry Sanger, It Is the Largest, Fastest-Growing and Most Popular General Reference Work Currently Available on the Internet
Tomasz „Polimerek” Ganicz Wikimedia Polska WikipediaWikipedia andand otherother WikimediaWikimedia projectsprojects WhatWhat isis Wikipedia?Wikipedia? „Imagine„Imagine aa worldworld inin whichwhich everyevery singlesingle humanhuman beingbeing cancan freelyfreely shareshare inin thethe sumsum ofof allall knowledge.knowledge. That'sThat's ourour commitment.”commitment.” JimmyJimmy „Jimbo”„Jimbo” Wales Wales –– founder founder ofof WikipediaWikipedia As defined by itself: Wikipedia is a free multilingual, open content encyclopedia project operated by the non-profit Wikimedia Foundation. Its name is a blend of the words wiki (a technology for creating collaborative websites) and encyclopedia. Launched in January 2001 by Jimmy Wales and Larry Sanger, it is the largest, fastest-growing and most popular general reference work currently available on the Internet. OpenOpen and and free free content content RichardRichard StallmanStallman definition definition of of free free software: software: „The„The wordword "free""free" inin ourour namename doesdoes notnot referrefer toto price;price; itit refersrefers toto freedom.freedom. First,First, thethe freedomfreedom toto copycopy aa programprogram andand redistributeredistribute itit toto youryour neighbors,neighbors, soso thatthat theythey cancan useuse itit asas wellwell asas you.you. Second,Second, thethe freedomfreedom toto changechange aa program,program, soso ththatat youyou cancan controlcontrol itit insteadinstead ofof itit controllingcontrolling you;you; forfor this,this, thethe sourcesource -
Instructor Basics: Howtouse Wikipedia As Ateaching Tool
Instructor Basics: How to use Wikipedia as a teaching tool Wiki Education Foundation Wikipedia is the free online encyclopedia that anyone can edit. One of the most visited websites worldwide, Wikipedia is a resource used by most university students. Increasingly, many instructors around the world have used Wikipedia as a teaching tool in their university classrooms as well. In this brochure, we bring together their experiences to help you determine how to use Wikipedia in your classroom. We’ve organized the brochure into three parts: Assignment planning Learn key Wikipedia policies and get more information on designing assignments, with a focus on asking students to write Wikipedia articles for class. During the term Learn about the structure of a good Wikipedia article, the kinds of articles students should choose to improve, suggestions for what to cover in a Wikipedia lab session, and how to interact with the community of Wikipedia editors. After the term See a sample assessment structure that’s worked for other instructors. 2 Instructor Basics Assignment planning Understanding key policies Since Wikipedia started in 2001, the community of volunteer editors – “Wikipedians” – has developed several key policies designed to ensure Wikipedia is as reliable and useful as possible. Any assignment you integrate into your classroom must follow these policies. Understanding these cornerstone policies ensures that you develop an assignment that meets your learning objectives and improves Wikipedia at the same time. Free content Neutral point of view “The work students contribute to “Everything on Wikipedia must be Wikipedia is free content and becomes written from a neutral point of view.