Real-Time Knowledge Graph Generation Team Project HWS 2020

Total Page:16

File Type:pdf, Size:1020Kb

Real-Time Knowledge Graph Generation Team Project HWS 2020 Real-time Knowledge Graph Generation Team Project HWS 2020 Team Project HWS20 | Real-time Knowledge Graph Generation 1 What is a Knowledge Graph? • describes real world entities and their interrelations, organized in a graph Organization type 01.01.1962 Impossible birth date occupation Missions Tom portrayer Force Cruise Ethan gender Hunt male religion series John producer Woo Mission Scientology Impossible (film series) director Team Project HWS20 | Real-time Knowledge Graph Generation 2 Do you know DBpedia? Team Project HWS20 | Real-time Knowledge Graph Generation 3 What about long tail entities? • Benji Dunn from Mission Impossible Team Project HWS20 | Real-time Knowledge Graph Generation 4 There are more Wikis out there… • Fandom allow generation of Wiki – e.g. Mission Impossible Wiki – 385,000 Wikis – 50,000,000+ pages Team Project HWS20 | Real-time Knowledge Graph Generation 5 WikiApiary • 205 active Wiki Farms • 21,654 standalone Wikis Team Project HWS20 | Real-time Knowledge Graph Generation 6 Your mission, should you choose to accept it • is to create a real time / on demand knowledge graph • example: 1. user browse to page KGoD.dws.informatik.uni-mannheim.de/resource/Benji_Dunn 2. search a specified list of Wikis or an already given Wiki for the corresponding page • also pages which links to this page 3. fetch the MediaWiki Markup with the API 4. run the DBpedia extraction framework to extract all information 5. display the information like DBpedia 6. provide a query interface and measure useability and performance Team Project HWS20 | Real-time Knowledge Graph Generation 7 We, the customers Prof. Dr. Heiko Paulheim Chair for Data Science Phone: 0621 181-2652 E-Mail: [email protected] Sven Hertling Scientific Researcher Chair for Data Science Phone: 0621 181-2532 E-Mail: [email protected] Team Project HWS20 | Real-time Knowledge Graph Generation 8 You, the IMF Agents, should • ...be a group of 3 students – also MMDS students • … have time for 1 semester • ...have deep knowledge in Java 9 • ...already attended Data Mining 1 • ...already attended or attend Semantic Web Technologies this semester • ...have some knowledge about Web programming • ...are not afraid of programming and big data Team Project HWS20 | Real-time Knowledge Graph Generation Questions? Team Project HWS20 | Real-time Knowledge Graph Generation 10.
Recommended publications
  • Icannwiki Staff
    ICANNWiki Staff JACKIE TREIBER VIVIAN HUA RAY KING has a background in is ICANNWiki’s primary is a long-time wiki lover writing and editing for visual strategist and is who began ICANNWiki literary publications. responsible for all of the in 2005 as his first She likes writing short non-profit’s marketing wiki project. As well as stories, making art and collateral and giveaways. being the Director of learning about the city When she is not working ICANNWiki, he is also she lives in--Portland, on ICANNWiki, she the head of Top Level Oregon. She is runs a music and arts Design, LLC, which growing internet knowledge currently ICANNWiki’s publication called currently administers primary point-gal, REDEFINE, creates a number of TLDs, thus wearing many visual art, explores including .design, .ink, hats within the small esoteric knowledge, and and .wiki. organization. wanders all over. AN INTRODUCTION TO WIKI PRINCIPLES & EDITING This informative guide is co-presented by Editing Cheatsheet YOUR VOICE, YOUR INTERNET EDITING ARTICLES Wiki code you type Output on ICANNWiki Italic Bold What is ICANNWiki? Section headers ICANNWiki is a grassroots, community effort to create and curate articles describing the people and organizations, terms and topics within the ICANN community. We actively seek worldwide collaboration to increase understanding of Internal link to how policy is created for the continued development of the another ICANNWiki page Internet, a tool which we all use everyday. In particular, we focus on the Internet Corporation for Assigned Names and Internal link using Numbers (ICANN) and related international policy and alternate text WWW.ICANNWIKI.COM management bodies.
    [Show full text]
  • Wiki Software for Knowledge Management in Organisations
    Spoilt for Choice - Wiki Software for Knowledge Management in Organisations Katarzyna Grzeganek, Ingo Frost, Daphne Gross Pumacy Technologies AG EMAIL: [email protected] Abstract The article presents the most popular wiki solutions and provides an analysis of features and functionalities based on organisational needs for the management of knowledge. All wiki solutions are compared to usability, search function, structuring and validation of knowledge. Keywords Wiki, Organisation, Knowledge Management, Analysis, Assessment, Feature, Platform, Documentation, Usability, Research, Structuring, Security, Integration, Quality, Validation URL http://www.pumacy.de/en/publications/wikis_fuer_wissensmanagement.html Spoilt for choice - Wiki Software in Organisations ................................................................... 1 1. Preface................................................................................................................................ 2 2. Wiki software for organisations ......................................................................................... 2 3. Presentation of wiki solutions ............................................................................................ 5 4. Wikis & Knowledge Management: Criteria and Analysis................................................. 8 4.1. Knowledge management across the organisation—ease of use.................................. 8 4.2. Structured Knowledge Base........................................................................................ 9 4.3.
    [Show full text]
  • Using "Wikis" in Developing Writing Performance and Motivation Among EFL Students at Majmaah University
    Using "Wikis" in Developing Writing Performance and Motivation among EFL Students at Majmaah University By Dr. Eman Abdel-Reheem Amin Assistant professor, English language department, Az Zulfi College of Education, Majmaah University, KSA Journal of Faculty of Education No (105) January, Part (1), 2016 Using "Wikis" in Developing Writing Performance and Motivation among EFL Students at Majmaah University By Dr. Eman Abdel-Reheem Amin Assistant professor, English language department, Az Zulfi College of Education, Majmaah University, KSA Abstract This study aimed at developing writing performance and motivation of EFL students, Az Zulfi College of Education, Majmaah University through using Wikis. The participants were 6th level English language female students (n= 39). The study adopted the one group pre- post test design. Students were pre-tested using a Writing Performance Test (WPT) and a Writing Motivation Questionnaire (WMQ). The study was an attempt to develop students' collaborative essay writing processes as reflected in their writing performance using wikispaces software. Student' motivation towards writing was also pursued. During the sessions, students were assigned into groups and worked collaboratively on writing their assignments. Students were guided through the writing process to produce well-organized, adequately developed paragraphs and essays. After the treatment, students were post-tested on their writing performance and motivation. Data were dealt statistically using SPSS. Results showed that there was a statistically
    [Show full text]
  • What Is a Wiki? Tutorial 1 for New Wikieducators
    What is a Wiki? Tutorial 1 for new WikiEducators wikieducator.org book - Generated using the open source mwlib toolkit - see http://code.pediapress.com for more information 2 Introducing a Wiki Objectives In this tutorial we will: • provide an overview of what wikis are, • and show some examples of their different uses. • discuss the advantages and disadvantages of using wikis to develop content • describe the main features of WikiEducator What is a Wiki? The name "Wiki" was chosen by Ward Cunningham - - the creator of the first Wiki. It is a shortened form of "wiki- wiki", the Hawaiian word for quick. A wiki is a web site that is generally editable by anyone with a computer, a web browser, and an internet connection. Wikis use a quick and easy syntax to allow users to apply formatting to text and create links between pages. This simple formatting syntax means that authors no longer need to learn the complexities of HTML to create content on the web. The main strength of a wiki is that it gives people the ability to work collaboratively on the same document. The only software you need is an Wiki wiki sign outside Honolulu International Internet browser. Consequently, wikis are used Airport. (Image courtesy of A. Barataz) for a variety of purposes. If you make a mistake, it's easy to revert back to an earlier version of the document. All content sourced from WikiEducator.org and is licensed under CC-BY-SA or CC-BY where specified. 3 Examples of Wikis The largest and most talked about Wiki on the Internet is Wikipedia[1] Wikipedia is, for the most part, editable by anyone in the world with a computer and an internet connection and, at the time of this writing, contained over 1,500,000 pages.
    [Show full text]
  • The Future of the Past
    THE FUTURE OF THE PAST A CASE STUDY ON THE REPRESENTATION OF THE HOLOCAUST ON WIKIPEDIA 2002-2014 Rudolf den Hartogh Master Thesis Global History and International Relations Erasmus University Rotterdam The future of the past A case study on the representation of the Holocaust on Wikipedia Rudolf den Hartogh Master Thesis Global History and International Relations Erasmus School of History, Culture and Communication (ESHCC) Erasmus University Rotterdam July 2014 Supervisor: prof. dr. Kees Ribbens Co-reader: dr. Robbert-Jan Adriaansen Cover page images: 1. Wikipedia-logo_inverse.png (2007) of the Wikipedia user Nohat (concept by Wikipedian Paulussmagnus). Digital image. Retrieved from: http://commons.wikimedia.org (2-12-2013). 2. Holocaust-Victim.jpg (2013) Photographer unknown. Digital image. Retrieved from: http://www.myinterestingfacts.com/ (2-12-2013). 4 This thesis is dedicated to the loving memory of my grandmother, Maagje den Hartogh-Bos, who sadly passed away before she was able to see this thesis completed. 5 Abstract Since its creation in 2001, Wikipedia, ‘the online free encyclopedia that anyone can edit’, has increasingly become a subject of debate among historians due to its radical departure from traditional history scholarship. The medium democratized the field of knowledge production to an extent that has not been experienced before and it was the incredible popularity of the medium that triggered historians to raise questions about the quality of the online reference work and implications for the historian’s craft. However, despite a vast body of research devoted to the digital encyclopedia, no consensus has yet been reached in these debates due to a general lack of empirical research.
    [Show full text]
  • Wikis. Teacher Librarian, 32(4), 54-56
    Wikis, Laurel A. Clyde Clyde, L.A. (2005). Wikis. Teacher Librarian, 32(4), 54-56. Wiki is yet another new infotech term that has come to prominence in the last couple of it is said, from the Hawaiian word for "fast," "WikiWikiWeb" (or just "wiki") describe a particular type of interactive web site--a site to which users can contribute. Though the first wikis appeared about nine years ago, it has been only in the last two years that they have captured the imagination of relatively large numbers of Internet users. Wikis are being used in different contexts, including education, business, and library science and information management. Like blogging software (see "Educational Blogging," Teacher Librarian, 32(3), 43-45), wiki "engines" (the software used as the basis for a wiki) are a form of "social software," that is, "a type of software that makes it easy for groups of people to work together in a virtual environment" (Chawner & Lewis, 2004). Wikis support collaborative activity by providing a format for the submission of contributions, a way of organizing and updating the site, and automatic maintenance of the links among pages and to external sites. Users do not need to possess sophisticated web skills to take part in developing a wiki, though setting up a wiki does require more skills. In its simplest form, a wiki allows any authorized user to add new content and new pages to a wiki, and to edit, change, or delete existing content. Some allow any users to make changes, and those changes might not require moderator approval before appearing on the web site, depending on how the wiki has been set up.
    [Show full text]
  • TOOLS and METHODS for WIKI RESEARCH Jeffrey
    ABSTRACT Title of thesis: ANALYZING THE WIKISPHERE: TOOLS AND METHODS FOR WIKI RESEARCH Jeffrey Stuckman, Master of Science, 2010 Thesis directed by: Professor James Purtilo Department of Computer Science We present tools and techniques that facilitate wiki research and an analysis of wikis found on the internet. We developed WikiCrawler, a tool that downloads and analyzes wikis. With this tool, we built a corpus of 151 Mediawiki wikis. We also developed a wiki analysis toolkit in R, which, among other tasks, fits probability distributions to discrete data, and uses a Monte Carlo method to test the fit. From the corpus we determined that, like Wikipedia, most wikis were authored collaboratively, but users contributed at unequal rates. We proposed a distribution- based method for measuring wiki inequality and compared it to the Gini coefficient. We also analyzed distributions of edits across pages and users, producing data which can motivate or verify future mathematical models of behavior on wikis. Future research could also analyze user behavior and establish measurement baselines, fa- cilitating evaluation, or generalize Wikipedia research by testing hypotheses across many wikis. ANALYZING THE WIKISPHERE: TOOLS AND METHODS FOR WIKI RESEARCH by Jeffrey Stuckman Thesis submitted to the Faculty of the Graduate School of the University of Maryland, College Park in partial fulfillment of the requirements for the degree of Master of Science 2010 Advisory Committee: Professor James Purtilo, Chair/Advisor Professor Amol Deshpande Professor Adam Porter c Copyright by Jeffrey C. Stuckman 2010 Acknowledgments I thank everyone who inspired this thesis and motivated my interest in wikis and measurement. First and foremost, I would like to thank Jim Purtilo, my advisor, for giving me the freedom to pursue my interests and for clarifying the oft-mystifying practices and traditions of academia.
    [Show full text]
  • Wikipedia and Wikis Haider, Jutta
    Wikipedia and Wikis Haider, Jutta; Sundin, Olof Published in: The Handbook of Peer Production DOI: 10.1002/9781119537151.ch13 2021 Document Version: Peer reviewed version (aka post-print) Link to publication Citation for published version (APA): Haider, J., & Sundin, O. (2021). Wikipedia and Wikis. In M. O'Neil, C. Pentzold, & S. Toupin (Eds.), The Handbook of Peer Production (pp. 169-184). (Wiley Handbooks in Communication and Media Series ). Wiley. https://doi.org/10.1002/9781119537151.ch13 Total number of authors: 2 Creative Commons License: CC BY-NC-ND General rights Unless other specific re-use rights are stated the following general rights apply: Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. • Users may download and print one copy of any publication from the public portal for the purpose of private study or research. • You may not further distribute the material or use it for any profit-making activity or commercial gain • You may freely distribute the URL identifying the publication in the public portal Read more about Creative commons licenses: https://creativecommons.org/licenses/ Take down policy If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim. LUND UNIVERSITY PO Box 117 221 00 Lund +46 46-222 00 00 This is the author’s version of a chapter accepted for publication in the Handbook of Peer Production.
    [Show full text]
  • Wikipedia @ 20
    Wikipedia @ 20 Wikipedia @ 20 Stories of an Incomplete Revolution Edited by Joseph Reagle and Jackie Koerner The MIT Press Cambridge, Massachusetts London, England © 2020 Massachusetts Institute of Technology This work is subject to a Creative Commons CC BY- NC 4.0 license. Subject to such license, all rights are reserved. The open access edition of this book was made possible by generous funding from Knowledge Unlatched, Northeastern University Communication Studies Department, and Wikimedia Foundation. This book was set in Stone Serif and Stone Sans by Westchester Publishing Ser vices. Library of Congress Cataloging-in-Publication Data Names: Reagle, Joseph, editor. | Koerner, Jackie, editor. Title: Wikipedia @ 20 : stories of an incomplete revolution / edited by Joseph M. Reagle and Jackie Koerner. Other titles: Wikipedia at 20 Description: Cambridge, Massachusetts : The MIT Press, [2020] | Includes bibliographical references and index. Identifiers: LCCN 2020000804 | ISBN 9780262538176 (paperback) Subjects: LCSH: Wikipedia--History. Classification: LCC AE100 .W54 2020 | DDC 030--dc23 LC record available at https://lccn.loc.gov/2020000804 Contents Preface ix Introduction: Connections 1 Joseph Reagle and Jackie Koerner I Hindsight 1 The Many (Reported) Deaths of Wikipedia 9 Joseph Reagle 2 From Anarchy to Wikiality, Glaring Bias to Good Cop: Press Coverage of Wikipedia’s First Two Decades 21 Omer Benjakob and Stephen Harrison 3 From Utopia to Practice and Back 43 Yochai Benkler 4 An Encyclopedia with Breaking News 55 Brian Keegan 5 Paid with Interest: COI Editing and Its Discontents 71 William Beutler II Connection 6 Wikipedia and Libraries 89 Phoebe Ayers 7 Three Links: Be Bold, Assume Good Faith, and There Are No Firm Rules 107 Rebecca Thorndike- Breeze, Cecelia A.
    [Show full text]
  • The Handbook of Peer Production Chapter 13: Wikipedia and Wikis
    This is the author’s version of a chapter accepted for publication in the Handbook of Peer Production. Changes resulting from the publishing process such as copy-editing, typesetting, and other quality control mechanisms may not be reflected in this document. This author manuscript version is available for personal, non-commercial and no derivative uses only. Citation: Haider, J. & Sundin, O. (2021). Wikipedia and wikis. In: M. O’Neil, C. Pentzold & S. Toupin (Eds.), The Handbook of Peer Production (pp. 169-184). Malden, MA: Wiley-Blackwell. ISBN 9781119537106 Available at: https://www.wiley.com/en-au/The+Handbook+of+Peer+Production-p-9781119537090 The Handbook of Peer Production Chapter 13: Wikipedia and Wikis Jutta Haider, University of Borås, Sweden & Olof Sundin, Lund University, Sweden Chapter 13 – Wikis and Wikipedia 2 1. Introduction WikiWikiWebs or wikis constitute the core platform of peer production. Wikis are software programs allowing for flexible collaboration without necessarily having a defined content owner or leader. A wiki is a user-editable website or content management system. Wikis might use different programming languages and licenses, but they apply the same model for cooperation, which means that collaborators can modify content, insert hyperlinks, and change the structure of a document directly in any web browser. Edits are usually archived and open to revision for all collaborators. The most popular and successful wiki-based project by far is Wikipedia. Established in 2001, today this encyclopedia is one of the most popular sites on the Web; what is more, with its data supporting other applications and commercial platforms like Google, Wikipedia has taken on an infrastructural role in the contemporary internet.
    [Show full text]
  • Xtools Release 3.10.19
    XTools Release 3.10.19 Dec 13, 2020 Table of Contents 1 Tools 3 1.1 Edit Counter...............................................3 1.1.1 General Statistics........................................3 1.1.2 Namespace totals........................................4 1.1.3 Timecard............................................4 1.1.4 Year counts...........................................4 1.1.5 Month counts..........................................4 1.1.6 Latest global edits........................................4 1.1.7 Automated edits.........................................4 1.2 Page History...............................................4 1.2.1 General Statistics........................................5 1.2.2 Top editors...........................................5 1.2.3 Year counts...........................................6 1.2.4 Month counts..........................................6 1.2.5 (Semi-)automated edits.....................................6 1.2.6 Assessments...........................................6 1.2.7 Bugs...............................................6 1.3 Pages Created..............................................6 1.4 Top Edits.................................................7 1.5 Admin Score...............................................7 1.5.1 Algorithm............................................7 1.6 Bash Quote................................................7 1.7 Simple Counter..............................................7 2 API 9 2.1 Response format.............................................9 2.2 Errors and warnings...........................................9
    [Show full text]
  • Criticism of Wikipedia from Wikipidia.Pdf
    Criticism of Wikipedia from Wikipidia For a list of criticisms of Wikipedia, see Wikipedia:Criticisms. See also Wikipedia:Replies to common objections. Two radically different versions of a Wikipedia biography, presented to the public within days of each other: Wikipedia's susceptibility to edit wars and bias is one of the issues raised by Wikipedia critics http://medicalexposedownloads.com/PDF/Criticism%20of%20Wikipedia%20from%20Wikipidia.pdf http://medicalexposedownloads.com/PDF/Examples%20of%20Bias%20in%20Wikipedia.pdf http://medicalexposedownloads.com/PDF/Wikipedia%20is%20Run%20by%20Latent%20Homosexual%20Homophob ics.pdf http://medicalexposedownloads.com/PDF/Bigotry%20and%20Bias%20in%20Wikipedia.pdf http://medicalexposedownloads.com/PDF/Dear%20Wikipedia%20on%20Libelous%20lies%20against%20Desire%20 Dubounet.pdf http://medicalexposedownloads.com/PDF/Desir%c3%a9%20Dubounet%20Wikipidia%20text.pdf Criticism of Wikipedia—of the content, procedures, and operations, and of the Wikipedia community—covers many subjects, topics, and themes about the nature of Wikipedia as an open source encyclopedia of subject entries that almost anyone can edit. Wikipedia has been criticized for the uneven handling, acceptance, and retention of articles about controversial subjects. The principal concerns of the critics are the factual reliability of the content; the readability of the prose; and a clear article layout; the existence of systemic bias; of gender bias; and of racial bias among the editorial community that is Wikipedia. Further concerns are that the organization allows the participation of anonymous editors (leading to editorial vandalism); the existence of social stratification (allowing cliques); and over-complicated rules (allowing editorial quarrels), which conditions permit the misuse of Wikipedia. Wikipedia is described as unreliable at times.
    [Show full text]