Ontologies, Knowledge Bases, MPRI 2.26.2: Web Data Management

Antoine Amarilli Friday, January 11th

1/31 Reminder

• Ontology: vocabulary (classes and relations) to describe things
• Knowledge base: set of facts in one or several ontologies
→ Focus on Wikidata: a general-purpose knowledge base and ontology

2/31 Ontologies Ontologies

• Various domain-specific vocabularies used across knowledge bases
• One general-purpose ontology used by Google, Microsoft, Yahoo, Yandex: schema.org
• Other ontologies that come together with a knowledge base

3/31 Friend of a friend (FOAF)

Describe people, relationships, profiles, activities (social network)

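@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix foaf: <http://xmlns.com/foaf/0.1/> .

<#JW>
    a foaf:Person ;
    foaf:name "" ;
    foaf:mbox <…> ;
    foaf:homepage <…> ;
    foaf:nick "Jimbo" ;
    foaf:depiction <…> ;
    foaf:interest <…> ;
    foaf:knows [
        a foaf:Person ;
        foaf:name "Angela Beesley"
    ] .

4/31 Creative Commons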

Describe the license and rights on documents

This page, by Lawrence Lessig, is licensed under a Creative Commons Attribution License.

• Many content providers add this kind of markup (e.g., Flickr)
• Search engines can use it (e.g., Google)

5/31 Other domain-specific ontologies

• Dublin Core (DC): describe digital resources (videos, images, etc.) and physical resources (books, CDs, etc.)
• Simple Knowledge Organization System (SKOS): describe thesauri, taxonomies, etc.
• Open Graph Protocol: for Web pages to be integrated in Facebook’s social graph; also Twitter Cards for Twitter
• DOAP (Description of a Project): describe software projects
• VoID (Vocabulary of Interlinked Datasets): describe a linked dataset
• Countless others

6/31 Schema.org: a general-purpose ontology

• General-purpose ontology: 598 types and 862 properties in version 3.5
• Intended to be used on Web pages to annotate the semantics of elements
• Used by search engines for rich search results
• Used in over 10 million sites¹

¹ Source: https://schema.org/

7/31 Format: Microdata

Sat Sep 14
Typhoon with Radiation City
The Hi-Dive
7 S. Broadway
Denver, CO 80209
9:30 PM
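
The HTML tags of this listing are not visible here, only its text. As a rough sketch of what Microdata annotations on such a listing could look like, and of how a consumer might read them, the snippet below uses Python's standard `html.parser`. The markup in `EVENT_HTML` is an assumption (a simplified guess in the style of the schema.org Event vocabulary, including the year in `startDate`), and the `MicrodataSketch` class is a hypothetical helper that ignores item nesting.

```python
from html.parser import HTMLParser

# Assumed markup: a simplified guess, not the markup of the original slide.
EVENT_HTML = """
<div itemscope itemtype="http://schema.org/Event">
  <div itemprop="startDate" content="2013-09-14T21:30">Sat Sep 14</div>
  <div itemprop="name">Typhoon with Radiation City</div>
  <div itemprop="location" itemscope itemtype="http://schema.org/Place">
    <span itemprop="name">The Hi-Dive</span>
  </div>
</div>
"""


class MicrodataSketch(HTMLParser):
    """Collects item types and (property, value) pairs; nesting is ignored."""

    def __init__(self):
        super().__init__()
        self.types = []       # values of itemtype, in document order
        self.props = []       # (itemprop, value) pairs, in document order
        self._pending = None  # itemprop still waiting for its text value

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)  # boolean attributes such as itemscope map to None
        if "itemscope" in attrs and "itemtype" in attrs:
            self.types.append(attrs["itemtype"])
        prop = attrs.get("itemprop")
        if prop is None or "itemscope" in attrs:
            return  # a property whose value is a nested item is skipped here
        if "content" in attrs:
            self.props.append((prop, attrs["content"]))
        else:
            self._pending = prop

    def handle_data(self, data):
        if self._pending and data.strip():
            self.props.append((self._pending, data.strip()))
            self._pending = None


def extract(html):
    """Return (item types, property/value pairs) found in the HTML string."""
    parser = MicrodataSketch()
    parser.feed(html)
    return parser.types, parser.props


types, props = extract(EVENT_HTML)
print(types)
print(props)
```

A real consumer would also attach each property to the item that encloses it; this sketch only shows where the three attributes appear.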
• itemscope creates an item and itemtype gives its type
• itemprop gives values for properties of the item

8/31 Format: RDFa

Competing format to Microdata, seems less common²

Sat Sep 14
Typhoon with Radiation City
The Hi-Dive
7 S. Broadway
Denver, CO 80209
9:30 PM

² http://webdatacommons.org/structureddata/index.html#toc2

9/31 Format: JSON-LD

Alternative approach: give the structured data separately in JSON
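
As a hedged illustration of this approach, the sketch below builds a JSON-LD description of the same event with Python's `json` module and wraps it in the `<script type="application/ld+json">` element that pages typically embed. The property names follow the schema.org Event vocabulary; the concrete values, in particular the year in `startDate`, are assumptions based on the event listing, and `as_script_tag` is a hypothetical helper.

```python
import json

# Assumed values: taken from the event listing; the year is a guess.
event = {
    "@context": "https://schema.org",
    "@type": "Event",
    "name": "Typhoon with Radiation City",
    "startDate": "2013-09-14T21:30",
    "location": {
        "@type": "Place",
        "name": "The Hi-Dive",
        "address": {
            "@type": "PostalAddress",
            "streetAddress": "7 S. Broadway",
            "addressLocality": "Denver",
            "addressRegion": "CO",
            "postalCode": "80209",
        },
    },
}


def as_script_tag(data):
    """Serialize the JSON-LD document and wrap it in a script element."""
    body = json.dumps(data, indent=2)
    return '<script type="application/ld+json">\n' + body + "\n</script>"


print(as_script_tag(event))
```

Unlike Microdata and RDFa, the structured data here is not interleaved with the visible HTML, so the page layout can change without touching the annotations.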

10/31 Web Data Commons Structured Data

• Extraction of semantic content from the Common Crawl
• Also useful to measure usage of structured data:
    • In November 2017, the Common Crawl contained 66 TB (compressed), 260 TB (uncompressed), 3.2G pages
    • 39% of pages (and 28% of domains) contained semantic data
    • 9G entities and 38G triples
• http://webdatacommons.org/structureddata/
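
To put these figures in perspective, here is a small back-of-the-envelope computation. It assumes that "G" stands for 10^9 and simply combines the numbers quoted above; the variable names are illustrative.

```python
# Figures quoted above for the November 2017 crawl (G assumed to mean 10**9).
pages_total = 3.2e9
share_with_semantic_data = 0.39
entities = 9e9
triples = 38e9
compressed_tb = 66
uncompressed_tb = 260

# Derived quantities.
pages_with_data = share_with_semantic_data * pages_total
triples_per_entity = triples / entities
compression_ratio = uncompressed_tb / compressed_tb

print(f"Pages with semantic data: {pages_with_data / 1e9:.2f}G")
print(f"Average triples per entity: {triples_per_entity:.1f}")
print(f"Compression ratio: {compression_ratio:.1f}x")
```

So roughly 1.2G pages carry semantic data, with only a handful of triples per entity on average.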

11/31 Knowledge bases Common Knowledge bases

• General-purpose: DBpedia, YAGO, Freebase (defunct), Wikidata
• Proprietary: Google Knowledge Graph, Bing Knowledge Graph (aka Satori)
• Domain-specific
• We will focus afterwards on Wikidata

12/31 DBpedia

• Started in 2007
• License: CC-BY-SA
• Code license: GPLv2
• Actors: Leipzig University, University of Mannheim, OpenLink Software
• Latest release: 2016-10
• Extracted from Wikimedia projects
• 6M entities and 10G triples in 2016-04³

³ https://blog.dbpedia.org/2016/10/19/yeah-we-did-it-again-new-2016-04--release/

13/31 YAGO

• Started in 2008
• License: CC-BY
• Code license: GPLv3
• Actors: Max Planck Institute for Informatics, Télécom ParisTech
• Latest release: YAGO 3.1 (2017)
• Extracted from Wikipedia and other sources; manual evaluation
• 10M entities and 120M triples⁴

⁴ http://yago-knowledge.org/

14/31 Freebase

• Started in 2007, discontinued in 2016
• License: CC-BY
• Code license: Apache2 (provided after-the-fact by Google)
• Actors: Metaweb, acquired by Google in 2010
• Initially imported from various sources
• Could be edited by anyone
• Partially imported into Wikidata (but not completely)
• Last release: 2016
• Last dump has 1.9G triples

15/31 Wikidata

• Started in 2012
• License: CC0 (public domain)
• Code license: GPLv2
• Actors: Wikimedia Deutschland, Wikimedia
• Last release: weekly
• Around 650M statements and 54M items
• Can be edited by anyone! Around 20k active users.

16/31 Domain-specific

• MusicBrainz, for CDs and music in general (20 million recordings)
• British National Bibliography: bibliographic details about books published in the UK since 1950
• data.bnf.fr, data from the French national library
• OpenStreetMap and GeoNames
• Medicine and chemistry with SNOMED CT, and others: DrugBank, KEGG, UniProt, ChEMBL, etc.
• Linguistic resources, e.g., BabelNet
• Bibliography, e.g., DBLP, Crossref

17/31 Linked Open Data

[Figure: the Linked Open Data Cloud, from lod-cloud.net. Legend categories include Cross Domain, Geography, Government, Life Sciences, Media, Publications, Social Networking, User Generated.]

18/31 Gathering Data

• Browsing online versions of KBs
• Using ad-hoc APIs to retrieve relevant triples
• Using a SPARQL endpoint
• Downloading a dump
• Crawling other knowledge bases, e.g., dereferencing Cool URIs
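
Dereferencing a Cool URI amounts to an HTTP GET on the entity's URI with an `Accept` header naming the desired RDF serialization (content negotiation). The sketch below only prepares such a request with Python's standard `urllib`, without sending it. The helper name `build_deref_request` and the choice of Turtle as the format are assumptions made for illustration.

```python
import urllib.request


def build_deref_request(uri, media_type="text/turtle"):
    """Prepare (but do not send) a GET request asking for an RDF serialization."""
    return urllib.request.Request(uri, headers={"Accept": media_type})


# Example with the Wikidata concept URI of entity Q42.
req = build_deref_request("http://www.wikidata.org/entity/Q42")
print(req.get_method(), req.full_url, req.get_header("Accept"))

# To actually fetch the data (requires network access):
# with urllib.request.urlopen(req) as response:
#     turtle = response.read().decode("utf-8")
```

A crawler would repeat this for every URI found in the retrieved triples, which is why dumps or SPARQL endpoints are usually preferable for bulk access.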

19/31 Systems

• RDF stores (triplestores) with relational or native backend, open-source or commercial, related to graph databases
    • Apache Jena
    • Virtuoso
    • Blazegraph, essentially acquired by Amazon
    • Amazon Neptune
• SPARQL engines, usually on top of a triplestore. http://en.wikipedia.org/wiki/SPARQL
• Tool to view semantic data in Web pages: http://www.google.com/webmasters/tools/richsnippets

20/31 Semantic Web challenges

• Complexity:
    • Writing structured content is harder than writing text!
    • Using structured content (with heterogeneous schema) is complicated!
    • Discoverability problem for knowledge bases, vocabularies
• Performance:
    • Data is large
    • Running queries on graphs is tricky
    • Reasoning makes it even worse
    • Federation makes things worse again

21/31 Semantic Web challenges, cont’d

: • Vagueness and modeling issues • Trust (anyone can add a triple) • Canonicity and alignment • Temporality, sources often complicated to represent • Open-world semantics: missing values vs no values • Incentives: many data providers do not want to be eaten by others

22/31 Wikidata Why Wikidata matters

• Backed by the Wikimedia Foundation: credible and noncommercial
• Not run by academics, but some academics are involved
• Genuine uses on Wikipedia (to some extent)
• Centralized model, which is a good idea for now
• Good tradeoffs in terms of expressiveness, scope...
• Uses the successful Wikipedia model

23/31 Wikidata basics

• Entities: Q1, Q2, Q3, ..., Q60527475 and beyond
• Properties: P1, P2, P3, ..., P6343 and beyond
• Entities and properties have a label and short description in each language, along with aliases (search engine)
• Entities can also have sitelinks to Wikimedia projects (e.g., the corresponding Wikimedia pages)
• For each entity and property, we can have facts (or claims) with different objects
• Everyone can create and edit entities and facts
• Discussion is needed before creating a property
• Software: Wikibase, a set of extensions to MediaWiki

24/31 Qualifiers, references, ranks, data types

• Each fact can have qualifiers to indicate things like start/end time, details (e.g., major/degree for P69 “educated at”)
• Each fact can also have sources to indicate where it comes from (a source is a set of key–value pairs)
• Each fact can have a rank among “normal”, “preferred” (e.g., for the current value), or “deprecated”
• Literal values can have data types https://www.wikidata.org/wiki/Special:ListDatatypes
• Also two special values:
    • “unknown value” (a value exists but is unknown)
    • “no value” (it is known that there is no value)
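
One way to picture this model is as a small data structure. The sketch below is an illustrative Python rendering, not Wikibase's actual data model: the class and field names, the sentinel objects, and the identifiers `Q_town`, `P_population`, `P_mayor` are all hypothetical. The helper `best_statements` implements the usual reading of ranks: use the preferred statements if there are any, otherwise the normal ones, and never the deprecated ones.

```python
from dataclasses import dataclass, field
from enum import Enum


class Rank(Enum):
    DEPRECATED = 0
    NORMAL = 1
    PREFERRED = 2


# Sentinels standing for the two special values (hypothetical names).
UNKNOWN_VALUE = object()  # a value exists but is unknown
NO_VALUE = object()       # it is known that there is no value


@dataclass
class Statement:
    subject: str
    prop: str
    value: object
    rank: Rank = Rank.NORMAL
    qualifiers: dict = field(default_factory=dict)
    references: list = field(default_factory=list)  # each one: dict of key-value pairs


def best_statements(statements, subject, prop):
    """Preferred statements if any exist, else normal ones; never deprecated."""
    candidates = [
        s for s in statements
        if s.subject == subject and s.prop == prop and s.rank is not Rank.DEPRECATED
    ]
    if not candidates:
        return []
    top = max(s.rank.value for s in candidates)
    return [s for s in candidates if s.rank.value == top]


# Made-up data for illustration only.
statements = [
    Statement("Q_town", "P_population", 900, rank=Rank.DEPRECATED),
    Statement("Q_town", "P_population", 1000, qualifiers={"point in time": "2010"}),
    Statement("Q_town", "P_population", 1200, rank=Rank.PREFERRED,
              qualifiers={"point in time": "2020"},
              references=[{"stated in": "hypothetical census"}]),
    Statement("Q_town", "P_mayor", NO_VALUE),
]

print([s.value for s in best_statements(statements, "Q_town", "P_population")])
```

Keeping "no value" distinct from a missing statement matters under open-world semantics: the absence of a mayor statement says nothing, whereas `NO_VALUE` asserts that there is no mayor.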

25/31 Constraints

• Wikidata has constraints which are only advisory (= you can create violations) and are quite simple. Main ones:
    • “single (best) value constraint”
    • “inverse constraint” (mother vs child), “symmetric constraint”
    • “type constraint”, or requiring/disallowing certain facts
    • “range constraint”, “contemporary constraint”, “format constraint”
    • “one-of/none-of constraint” (list of allowed/forbidden values)
    • Requiring/allowing qualifiers or units
    • Allowing use as a qualifier/unit
• There is a mechanism for exceptions
• Many constraint violations in practice
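
Because constraints are advisory, checking them means scanning existing facts for violations rather than rejecting edits. The sketch below shows how two of the constraints above could be checked over a list of (subject, property, object) triples. It is a simplified illustration: the function names and the sample facts are made up, ranks and exceptions are ignored, and this is not how Wikidata's own constraint checker is implemented.

```python
from collections import defaultdict


def single_value_violations(facts, prop):
    """Subjects that have more than one distinct value for `prop`."""
    values = defaultdict(set)
    for s, p, o in facts:
        if p == prop:
            values[s].add(o)
    return {s for s, objs in values.items() if len(objs) > 1}


def inverse_violations(facts, prop, inverse_prop):
    """Facts (s, prop, o) for which (o, inverse_prop, s) is missing.

    A symmetric constraint is the special case inverse_prop == prop.
    """
    fact_set = set(facts)
    return [
        (s, p, o) for s, p, o in facts
        if p == prop and (o, inverse_prop, s) not in fact_set
    ]


# Made-up facts for illustration only.
facts = [
    ("alice", "mother", "carol"),
    ("carol", "child", "alice"),
    ("bob", "mother", "carol"),        # carol has no matching "child" fact for bob
    ("alice", "date of birth", "1990"),
    ("alice", "date of birth", "1991"),  # two values for a single-value property
    ("alice", "spouse", "dave"),         # dave has no matching "spouse" fact
]

print(single_value_violations(facts, "date of birth"))
print(inverse_violations(facts, "mother", "child"))
print(inverse_violations(facts, "spouse", "spouse"))
```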

26/31 Usage on Wikipedia

• Used for interwiki links, i.e., the links between Wikipedia pages across languages
• Used in some infoboxes on Wikipedia, e.g., to automatically populate some fields
• Can be used for other things, e.g., filling tables, or external links to other sources
• Policy depends on each Wikipedia: some communities are more welcoming than others...

27/31 Ongoing Wikidata discussions

• Project scope: what belongs in Wikidata?
• The public domain license is a strong requirement
• Concerns, e.g., about the high number of bibliographic entities (almost half of the entities)
• Some external datasets are imported, but Wikipedia (historically) gave much importance to human validation of imports
• Some support for federation in queries; and many external links
• Notability: essentially no policy currently
• Managing vandalism?
• Importance of references?

28/31 Accessing Wikidata data

• Simply by browsing
• Can retrieve in multiple formats, e.g., https://www.wikidata.org/wiki/Special:EntityData/Q42.json
• For simple queries (triple patterns), Linked Data Fragments: https://query.wikidata.org/bigdata/ldf
• Wikimedia API, e.g., API for recent changes
• SPARQL queries, https://query.wikidata.org/ (and API)
• Weekly dumps in JSON, RDF, XML (around 50 GB compressed)
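
Two of these access paths can be sketched with the Python standard library alone. The code below builds the URLs without sending any request: the helper names are hypothetical, the `format=json` parameter and the `https://query.wikidata.org/sparql` endpoint address are stated here as assumptions about the query service API, and `SAMPLE_RESPONSE` is a heavily trimmed, hand-written stand-in for an entity document rather than real output.

```python
import json
from urllib.parse import urlencode

ENTITY_DATA = "https://www.wikidata.org/wiki/Special:EntityData/{id}.{fmt}"
SPARQL_ENDPOINT = "https://query.wikidata.org/sparql"  # assumed API address


def entity_data_url(entity_id, fmt="json"):
    """URL of one entity's data in the given format (e.g. json, ttl)."""
    return ENTITY_DATA.format(id=entity_id, fmt=fmt)


def sparql_url(query):
    """GET URL that submits a SPARQL query and asks for JSON results."""
    return SPARQL_ENDPOINT + "?" + urlencode({"query": query, "format": "json"})


def english_label(entity_json, entity_id):
    """Read the English label out of a parsed entity document."""
    return entity_json["entities"][entity_id]["labels"]["en"]["value"]


# Where was Q42 educated? (P69 is "educated at".)
QUERY = "SELECT ?school WHERE { wd:Q42 wdt:P69 ?school }"

# Hand-written, trimmed stand-in for an entity document.
SAMPLE_RESPONSE = json.loads(
    '{"entities": {"Q42": {"labels": {"en": {"language": "en", "value": "Douglas Adams"}}}}}'
)

print(entity_data_url("Q42"))
print(sparql_url(QUERY))
print(english_label(SAMPLE_RESPONSE, "Q42"))
```

For anything beyond occasional lookups, the weekly dumps avoid putting load on the live services.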

29/31 Other cool Wikidata stuff

• Distributed Wikidata Game: edits on Wikidata https://tools.wmflabs.org/wikidata-game/distributed/
• Reasonator: automatically generate a Wikipedia-like page from a Wikidata entity https://tools.wmflabs.org/reasonator/
• Lexemes: ongoing effort to add linguistic data to Wikidata
• OWL ontology: http://wikiba.se/ontology
• askplatyp.us: natural language tool
• File captions on Wikimedia Commons to have a structured way to give labels to images (deployed on January 10)
• OpenRefine to reconcile datasets with Wikidata and add Wikidata facts https://www.wikidata.org/wiki/Wikidata:Tools/OpenRefine/Editing/Tutorials/Video

30/31 Slide acknowledgements

• Many thanks to Thomas Pellissier-Tanon for his helpful feedback
• Slide 4: https://en.wikipedia.org/wiki/FOAF_(ontology)
• Slide 5: https://www.w3.org/Submission/ccREL/
• Slide 8–10: https://schema.org/Event
• Slide 13: https://commons.wikimedia.org/wiki/File:DBpediaLogo.svg
• Slide 14: https://en.wikipedia.org/wiki/File:YAGO.svg
• Slide 15: https://commons.wikimedia.org/wiki/File:Freebase_Logo_optimised.svg
• Slide 16, 23: https://en.wikipedia.org/wiki/File:Wikidata-logo-en.svg

31/31