Combining Wikidata with Other Linked Databases

Total Page:16

File Type:pdf, Size:1020Kb

Combining Wikidata with Other Linked Databases Combining Wikidata with other linked databases Andra Waagmeester, Dragan Espenschied Known variants in the CIViC database for genes reported in a WikiPathways pathway on Bladder Cancer Primary Sources: ● Wikipathways (Q7999828) ● NCBI Gene (Q20641742) ● CIViCdb (Q27612411) ● Disease Ontology (Q5282129) Example 1: Wikidata contains public data “All structured data from the main and property namespace is available under the Creative Commons CC0 License; text in the other namespaces is available under the Creative Commons Attribution-ShareAlike License; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy.” Wikidata requirement for Notability An item is acceptable if and only if it fulfills at least one of these two goals, that is if it meets at least one of the criteria below: ● It contains at least one valid sitelink to a page on Wikipedia, Wikivoyage, Wikisource, Wikiquote, Wikinews, Wikibooks, Wikidata, Wikispecies, Wikiversity, or Wikimedia Commons. ● It refers to an instance of a clearly identifiable conceptual or material entity.it can be described using serious and publicly available references. ● It fulfills some structural need, https://www.wikidata.org/wiki/Wikidata:Notability Wikidata property proposals “Before a new property is created, it has to be discussed here. When after some time there are some supporters, but no or very few opponents, the property is created by a property creator or an administrator. You can propose a property here or on one of the subject-specific pages listed below.” https://www.wikidata.org/wiki/Wikidata:Property_proposal Structure of a Federated Query Wikidata supports federation on a limited set of endpoints ● On 27 endpoints (https://www.mediawiki.org/wiki/Wikidata_query_service/User_Manual#Feder ation) ● Europeana, Biblioteca Virtual Miguel de Cervantes, Biblioteca Nacional de España, Smithsonian American Art Museum, Bibliothèque nationale de France, DBPedia, Getty Vocabularies, INSEE, Istituto per i beni artistici, culturali e naturali, Italian Chamber of Deputies, Nomisma.org, Smart Points of Interest, UK Department for Communities and Local Government, UK Office for National Statistics, UK ordnance survey, URI Burnder, WikiPathways, MW2SPARQL, Yale Center for British Art, Linked Geodata, Linked Open Street Map, Framester, AGROVOC Example 2: Integrating with more detailed data Example 3: Integrating Wikidata from a remote SPARQL endpoint “all human UniProt entries with a sequence variants that leads to a loss of function, and also physically interacts with a drug used as an enzyme inhibitor.” Making sense of Zika in the americas 1. Install a SPARQL endpoint locally 2. Model data with Wikidata properties 3. Convert data into RDF 4. Load 5. Query 6. Live: Demo Live demo.
Recommended publications
  • Position Description Addenda
    POSITION DESCRIPTION January 2014 Wikimedia Foundation Executive Director - Addenda The Wikimedia Foundation is a radically transparent organization, and much information can be found at www.wikimediafoundation.org . That said, certain information might be particularly useful to nominators and prospective candidates, including: Announcements pertaining to the Wikimedia Foundation Executive Director Search Kicking off the search for our next Executive Director by Former Wikimedia Foundation Board Chair Kat Walsh An announcement from Wikimedia Foundation ED Sue Gardner by Wikimedia Executive Director Sue Gardner Video Interviews on the Wikimedia Community and Foundation and Its History Some of the values and experiences of the Wikimedia Community are best described directly by those who have been intimately involved in the organization’s dramatic expansion. The following interviews are available for viewing though mOppenheim.TV . • 2013 Interview with Former Wikimedia Board Chair Kat Walsh • 2013 Interview with Wikimedia Executive Director Sue Gardner • 2009 Interview with Wikimedia Executive Director Sue Gardner Guiding Principles of the Wikimedia Foundation and the Wikimedia Community The following article by Sue Gardner, the current Executive Director of the Wikimedia Foundation, has received broad distribution and summarizes some of the core cultural values shared by Wikimedia’s staff, board and community. Topics covered include: • Freedom and open source • Serving every human being • Transparency • Accountability • Stewardship • Shared power • Internationalism • Free speech • Independence More information can be found at: https://meta.wikimedia.org/wiki/User:Sue_Gardner/Wikimedia_Foundation_Guiding_Principles Wikimedia Policies The Wikimedia Foundation has an extensive list of policies and procedures available online at: http://wikimediafoundation.org/wiki/Policies Wikimedia Projects All major projects of the Wikimedia Foundation are collaboratively developed by users around the world using the MediaWiki software.
    [Show full text]
  • Danish Resources
    Danish resources Finn Arup˚ Nielsen November 19, 2017 Abstract A range of different Danish resources, datasets and tools, are presented. The focus is on resources for use in automated computational systems and free resources that can be redistributed and used in commercial applications. Contents 1 Corpora3 1.1 Wikipedia...................................3 1.2 Wikisource...................................3 1.3 Wikiquote...................................4 1.4 ADL......................................4 1.5 Gutenberg...................................4 1.6 Runeberg...................................5 1.7 Europarl....................................5 1.8 Leipzig Corpora Collection..........................5 1.9 Danish Dependency Treebank........................6 1.10 Retsinformation................................6 1.11 Other resources................................6 2 Lexical resources6 2.1 DanNet....................................6 2.2 Wiktionary..................................7 2.3 Wikidata....................................7 2.4 OmegaWiki..................................8 2.5 Other lexical resources............................8 2.6 Wikidata examples with medical terminology extraction.........8 3 Natural language processing tools9 3.1 NLTK.....................................9 3.2 Polyglot....................................9 3.3 spaCy.....................................9 3.4 Apache OpenNLP............................... 10 3.5 Centre for Language Technology....................... 10 3.6 Other libraries................................
    [Show full text]
  • On the Evolution of Wikipedia
    On the Evolution of Wikipedia Rodrigo B. Almeida Barzan Mozafari Junghoo Cho UCLA Computer Science UCLA Computer Science UCLA Computer Science Department Department Department Los Angeles - USA Los Angeles - USA Los Angeles - USA [email protected] [email protected] [email protected] Abstract time. So far, several studies have focused on understanding A recent phenomenon on the Web is the emergence and pro- and characterizing the evolution of this huge repository of liferation of new social media systems allowing social inter- data [5, 11]. action between people. One of the most popular of these Recently, a new phenomenon, called social systems, has systems is Wikipedia that allows users to create content in a emerged from the Web. Generally speaking, such systems collaborative way. Despite its current popularity, not much allow people not only to create content, but also to easily is known about how users interact with Wikipedia and how interact and collaborate with each other. Examples of such it has evolved over time. systems are: (1) Social network systems such as MySpace In this paper we aim to provide a first, extensive study of or Orkut that allow users to participate in a social network the user behavior on Wikipedia and its evolution. Compared by creating their profiles and indicating their acquaintances; to prior studies, our work differs in several ways. First, previ- (2) Collaborative bookmarking systems such as Del.icio.us or ous studies on the analysis of the user workloads (for systems Yahoo’s MyWeb in which users are allowed to share their such as peer-to-peer systems [10] and Web servers [2]) have bookmarks; and (3) Wiki systems that allow collaborative mainly focused on understanding the users who are accessing management of Web sites.
    [Show full text]
  • Jimmy Wales and Larry Sanger, It Is the Largest, Fastest-Growing and Most Popular General Reference Work Currently Available on the Internet
    Tomasz „Polimerek” Ganicz Wikimedia Polska WikipediaWikipedia andand otherother WikimediaWikimedia projectsprojects WhatWhat isis Wikipedia?Wikipedia? „Imagine„Imagine aa worldworld inin whichwhich everyevery singlesingle humanhuman beingbeing cancan freelyfreely shareshare inin thethe sumsum ofof allall knowledge.knowledge. That'sThat's ourour commitment.”commitment.” JimmyJimmy „Jimbo”„Jimbo” Wales Wales –– founder founder ofof WikipediaWikipedia As defined by itself: Wikipedia is a free multilingual, open content encyclopedia project operated by the non-profit Wikimedia Foundation. Its name is a blend of the words wiki (a technology for creating collaborative websites) and encyclopedia. Launched in January 2001 by Jimmy Wales and Larry Sanger, it is the largest, fastest-growing and most popular general reference work currently available on the Internet. OpenOpen and and free free content content RichardRichard StallmanStallman definition definition of of free free software: software: „The„The wordword "free""free" inin ourour namename doesdoes notnot referrefer toto price;price; itit refersrefers toto freedom.freedom. First,First, thethe freedomfreedom toto copycopy aa programprogram andand redistributeredistribute itit toto youryour neighbors,neighbors, soso thatthat theythey cancan useuse itit asas wellwell asas you.you. Second,Second, thethe freedomfreedom toto changechange aa program,program, soso ththatat youyou cancan controlcontrol itit insteadinstead ofof itit controllingcontrolling you;you; forfor this,this, thethe sourcesource
    [Show full text]
  • Instructor Basics: Howtouse Wikipedia As Ateaching Tool
    Instructor Basics: How to use Wikipedia as a teaching tool Wiki Education Foundation Wikipedia is the free online encyclopedia that anyone can edit. One of the most visited websites worldwide, Wikipedia is a resource used by most university students. Increasingly, many instructors around the world have used Wikipedia as a teaching tool in their university classrooms as well. In this brochure, we bring together their experiences to help you determine how to use Wikipedia in your classroom. We’ve organized the brochure into three parts: Assignment planning Learn key Wikipedia policies and get more information on designing assignments, with a focus on asking students to write Wikipedia articles for class. During the term Learn about the structure of a good Wikipedia article, the kinds of articles students should choose to improve, suggestions for what to cover in a Wikipedia lab session, and how to interact with the community of Wikipedia editors. After the term See a sample assessment structure that’s worked for other instructors. 2 Instructor Basics Assignment planning Understanding key policies Since Wikipedia started in 2001, the community of volunteer editors – “Wikipedians” – has developed several key policies designed to ensure Wikipedia is as reliable and useful as possible. Any assignment you integrate into your classroom must follow these policies. Understanding these cornerstone policies ensures that you develop an assignment that meets your learning objectives and improves Wikipedia at the same time. Free content Neutral point of view “The work students contribute to “Everything on Wikipedia must be Wikipedia is free content and becomes written from a neutral point of view.
    [Show full text]
  • The Free Encyclopedia General Overview
    WWiikkiippeeddiiaa The free Encyclopedia General overview WWhathat isis WWikikipedia?ipedia? ·Wikipedia is freely licensed encyclopedia founded on 15 January 2001 by Jimmy Wales ·Wikipedia and all sister projects are run by the Wikimedia Foundation ·Wikipedia is a website that anyone can edit written by thousands of volunteers run by MediaWiki software ·Wikipedia is avalible in many languages, roughly over 200 languages. ·Some of the sister projects: Wikipedia, Wiktionary, Wikibooks, Wikisource, Wikiquote, Wikispecies, Wikinews ·Free license allows others to freely copy, redistribute, and modify our work commercially or non-commercially Languages English Turkish 1,107,419 articles 21,706 articles 4,068,322 total pages 61.405 total pages 1,345,073 registered 18,480 registered user accounts user accounts 896 administrators 15 administrators 52,617,849 edits 360,211 edits Main Pag e Articles In other languages Anybody can edit How Editing Works Article History An error has been corrected Maintaining article integrity Vandalism VVaannddaalliissmm Vandalism is any addition, deletion, or change to content made in a deliberate attempt to reduce the quality of the encyclopedia. A 2002 study by IBM found that most vandalism on the English Wikipedia is reverted within five minutes. Only a minority of the edits are vandalism Reliability · A study by Nature suggests among 42 entries tested ± Wikipedia contained around four inaccuracies ± Britannica contained around three inaccuracies · Nature conducted this study by mailing fifty entries from the websites of Wikipedia and Encyclopaedia Britannica on subjects that represented a broad range of scientific disciplines. ± Only entries that were approximately the same length in both encyclopaedias were selected.
    [Show full text]
  • Openstreetmap and Wikimedia: a Quick Overview
    OpenStreetMap and Wikimedia: A quick overview State of the Map 2018 Eugene Alvin Villar [[User:seav]] OpenStreetMap is like Wikipedia for maps OpenStreetMap is like Wikidata for geographical data OpenStreetMap has nodes, ways, relations, tags, keys, values, etc. Wikidata has items, statements, properties, values, qualifiers, etc. Data modeling discussions on the Wikidata:Project chat page are actually quite similar to discussions on OSM’s tagging mailing list. Wikimedia in OSM The OSM Wiki is powered by MediaWiki, the wiki engine developed by Wikimedia, and this also provides access to Wikimedia Commons images. Tag definitions on the OSM Wiki link to Wikipedia and Wikidata to help clarify features. OSM objects can link to corresponding Wikipedia articles and Wikidata items using the wikipedia=* and wikidata=* tags respectively. The OpenStreetMap Foundation has derived its Local Chapters agreement and Trademark Policy from corresponding documents from the Wikimedia Foundation. OSM in Wikimedia OSM has been used to create maps to illustrate Wikipedia articles and populate Wikimedia Commons. OSM has been used to create maps to illustrate Wikipedia articles and populate Wikimedia Commons. OSM powers the Wikimedia Foundation’s Kartotherian map tile service, which is used by the Kartographer MediaWiki extension and almost all other interactive maps on Wikimedia projects. The Wikimedia Foundation recently released internationalized map tiles for Kartotherian, leveraging OSM’s name:*=* tags. WikiMiniAtlas, an older MediaWiki plugin still in use in many Wikipedias, is also powered by OSM data, including 3D building data. Wikidata items on places can link to OSM relations using the OSM relation ID (P402) property. Wikidata items about features can link to equivalent OSM features using the OSM tag or key (P1282) property.
    [Show full text]
  • S Wiktionary Wikisource Wikibooks Wikiquote Wikimedia Commons
    SCHWESTERPROJEKTE Wiktionary S Das Wiktionary ist der lexikalische Partner der freien Enzyklopädie Wikipedia: ein Projekt zur Erstellung freier Wörterbücher und Thesau- ri. Während die Wikipedia inhaltliche Konzepte beschreibt, geht es in ihrem ältesten Schwester- projekt, dem 2002 gegründeten Wiktionary um Wörter, ihre Grammatik und Etymologie, Homo- nyme und Synonyme und Übersetzungen. Wikisource Wikisource ist eine Sammlung von Texten, die entweder urheberrechtsfrei sind oder unter ei- ner freien Lizenz stehen. Das Projekt wurde am 24. November 2003 gestartet. Der Wiktionary-EIntrag zum Wort Schnee: Das Wörterbuch präsen- Zunächst mehrsprachig auf einer gemeinsamen tiert Bedeutung, Deklination, Synonyme und Übersetzungen. Plattform angelegt, wurde es später in einzel- ne Sprachversionen aufgesplittet. Das deutsche Teilprojekt zählte im März 2006 über 2000 Texte Wikisource-Mitarbeiter arbeiten an einer digitalen, korrekturge- und über 100 registrierte Benutzer. lesenen und annotierten Ausgabe der Zimmerischen Chronik. Wikibooks Das im Juli 2003 aus der Taufe gehobene Projekt Wikibooks dient der gemeinschaftlichen Schaf- fung freier Lehrmaterialien – vom Schulbuch über den Sprachkurs bis zum praktischen Klet- terhandbuch oder der Go-Spielanleitung Wikiquote Wikiquote zielt darauf ab, auf Wiki-Basis ein freies Kompendium von Zitaten und Das Wikibooks-Handbuch Go enthält eine ausführliche Spielanleitung Sprichwörtern in jeder Sprache zu schaffen. Die des japanischen Strategiespiels. Artikel über Zitate bieten (soweit bekannt) eine Quellenangabe und werden gegebenenfalls in die deutsche Sprache übersetzt. Für zusätzliche Das Wikimedia-Projekt Wikiquote sammelt Sprichwörter und Informationen sorgen Links in die Wikipedia. Zitate, hier die Seite zum Schauspieler Woody Allen Wikimedia Commons Wikimedia Commons wurde im September 2004 zur zentralen Aufbewahrung von Multime- dia-Material – Bilder, Videos, Musik – für alle Wi- kimedia-Projekte gegründet.
    [Show full text]
  • Interview with Sue Gardner, Executive Director WMF October 1, 2009 510 Years from Now, What Is Your Vision?
    Interview with Sue Gardner, Executive Director WMF October 1, 2009 5-10 years from now, what is your vision? What's different about Wikimedia? Personally, I would like to see Wikimedia in the top five for reach in every country. I want to see a broad, rich, deep encyclopedia that's demonstrably meeting people's needs, and is relevant and useful for people everywhere around the world. In order for that to happen, a lot of things would need to change. The community of editors would need to be healthier, more vibrant, more fun. Today, people get burned out. They get tired of hostility and endless debates. Working on Wikipedia is hard, and it does not offer many rewards. Editors have intrinsic motivation not extrinsic, but even so, not much is done to affirm or thank or recognize them. We need to find ways to foster a community that is rich and diverse and friendly and fun to be a part of. That world would include more women, more newly-retired people, more teachers ± all different kinds of people. There would be more ways to participate, more affirmation, more opportunities to be social and friendly. We need a lot of tools and features to help those people be more effective. Currently, there are tons of hacks and workarounds that experienced editors have developed over time, but which new editors don©t know about, and would probably find difficult to use. We need to make those tools visible and easier to use, and we need to invent new ones where they are lacking.
    [Show full text]
  • Wikipedia Sociographics
    Wikipedia Sociographics Jimmy Wales President, Wikimedia Foundation Wikipedia Founder Today’s Talk Quick introduction to who we are and what we are doing Two views of how Wikipedia works Details about the Community What is the Wikimedia Foundation? Non-profit foundation Aims to distribute a free encyclopedia to every single person on the planet in their own language Wikipedia and its sister projects Funded by public donations Applying for grants wikimediafoundation.org What is Wikipedia? Wikipedia is a freely licensed encyclopedia written by thousands of volunteers in many languages Free license allows others to freely copy, redistribute, and modify our work commercially or non-commercially Founded January 15, 2001 wikipedia.org Advantages of Freely Licensed Content GNU Free Documentation Licence Allows authors to retain attribution Remains non-proprietary Enhances the popularity of Wikipedia Decreases individual sense of ownership Increases a sense of shared ownership Free Software MediaWiki is GPL We use all free software on the website GNU/Linux Apache MySQL Php How big is Wikipedia? English Wikipedia is largest and has over 130 million words English Wikipedia larger than Britannica and Microsoft Encarta combined In 15 months the publicly distributed compressed database dumps may reach 1 terabyte total size How big is Wikipedia Globally? English – 412,000 articles German – 172,000 articles Japanese – 87,000 articles French – 66,000 articles Swedish –53,000 articles Over 1.2 million across 200 languages
    [Show full text]
  • The Future of Mediawiki and the Wikimedia Projects Erik Möller – August 6, 2005 the Purpose of Technology Research
    phase iv The Future of MediaWiki and the Wikimedia projects Erik Möller – August 6, 2005 The Purpose of Technology Research ● Many (thousands) very active content producers ● Very few (less than 10) very active developers ● New projects with specific needs ● Research can – Identify useful software enhancements – Write specifications and make recommendations – Supervise and review implementation process – Get the community involved in technical processes Wikimania – August 6, 2005 Wikimedia Research Network ● Attempt to bring indidividuals together to – work on specs – study Wikimedia content and communities – coordinate external contacts – organize community meetings ● Current activities – Single login specs – Development tasks – User survey Wikimania – August 6, 2005 Why peer review? ● Beyond existing mechanisms ● Main criticism against Wikipedia – From academia – From search engines – From pundits ● Fact-checking is a collaborative process ● As much work as the encyclopedia itself ● First step: Article survey Wikimania – August 6, 2005 Article survey Wikimania – August 6, 2005 Page protection ● Pages only editable by sysops ● Edit warring or distributed vandalism, decided by sysop ● English Wikipedia: avg. 12 protections per day ● However, some pages stay protected very long – Lack of processes or responsibility – e.g. Sexual abuse of children Wikimania – August 6, 2005 Alternatives ● Code which exists (recent, not in use): – User edits invisible copy of page – Sysops can “verify” a revision – Displayed copy is last verified one during period of protection ● Ideal solution: – If no sysop “verifies”, page is automatically published if no activity for n minutes Wikimania – August 6, 2005 My thoughts on peer review ● Must be “wiki-like” – Fast and easy – Consensus-based ● One basic concept for Wikipedia, Wikinews, etc.
    [Show full text]
  • Sourcing Images Online Page No. 1 / 12
    Sourcing Images Online Page No. 1 / 12 Social Media is Visual Media | Sourcing Images Online Categories of copyright for online content Copyright - is all content that can Creative Commons - in order to contribute not be reused or remixed (even if to keeping the Internet free and open, you properly attribute it) because authors can choose to license their material its authors have chosen exclusive under a CC license, this means the content rights for its distribution. can be legally shared and reused. But always remember this; it should always be attributed! CC licenses provide a flexible range of protections and freedoms for Public Domain - is any content authors, artists, and educators; we present that is not subjected to copyright below the main four licenses, you can read laws and which can be used more about how you can make your own without permission. combination on the Creative Commons website here. Page No. 2 / 12 Social Media is Visual Media | Sourcing Images Online Categories of Creative Commons Attribution (BY) Noncommercial (NC) Content can be reused for any purpose Content can be reused and redistributed providing that the author is appropriately solely for noncommercial purposes cited ShareAlike (SA) No Derivative Works (ND) Content must be distributed under the same Content can be reused as long as it is passed license as the original along unchanged and in whole Page No. 3 / 12 Social Media is Visual Media | Sourcing Images Online The following pages include some examples of websites where you can source images licensed by Creative Commons or even without image licenses.
    [Show full text]