Using Wikidata for Video Game Research
Total Page:16
File Type:pdf, Size:1020Kb
Using Wikidata for Video Game Research Experiences, Opportunities and Challenges for Research Data Management Using Wikidata for Video Game Research Vienna, 18.10.2019 Tracy Hoffmann Leipzig University Library Using Wikidata for Video Game Research The research project ● diggr (Databased Infrastructure for Global Games Research) ● Collaborative research project funded by the German Research Foundation ● Duration: 01/2017 - 07/2020 ● The Team: ○ Interdisciplinary (Information Science, Librarianship, Cultural Studies, [Japan|Area] Studies) ○ Library's IT department ○ Institute for Japanese Studies of Leipzig University Leipzig University Library Using Wikidata for Video Game Research Support Research Data Lifecycle Research data lifecycle diagram by Jisc CC BY-NC-ND Leipzig University Library Using Wikidata for Video Game Research For Science! Metadata about Video Games and Companies Using Wikidata for Video Game Research Data about Video Game Companies ● No database (only entities inside video game databases) ● No common identifier ● Little information Approach ● Data curation by hand Leipzig University Library Data-driven Perspectives on FromSoftware Videogames Data about Video Games ● A lot of databases (Mobygames, Media Art DB, GameFAQs, IGDB, …) ● No common identifier ● Different data coverage / specialization ● Conceptual differences ● Completeness ● Errors ● Bias ● … Approach ● Linking databases Universitätsbibliothek Leipzig Data-driven Perspectives on FromSoftware Videogames Two Main Tasks ● Data curation … of public available information ● Linking databases … to aggregate information + Cooperation with other researchers Universitätsbibliothek Leipzig Data-driven Perspectives on FromSoftware Videogames Two Main Tasks ● Data curation … of public available information ● Linking databases … to aggregate information + Cooperation with other researchers Own infrastructure?! ● Development, maintenance, long term service, time, human resource, ... Universitätsbibliothek Leipzig Data-driven Perspectives on FromSoftware Videogames Two Main Tasks ● Data curation … of public available information ● Linking databases … to aggregate information + Cooperation with other researchers Own infrastructure?! ● Development, maintenance, long term service, time, human resource, ... There's gotta be something there…. Universitätsbibliothek Leipzig Using Wikidata for Video Game Research Data Curation with Wikidata Simple as that Leipzig University Library Using Wikidata for Video Game Research Wikidata as THE Linking Hub for Video Games 100 identifier properties for video games! https://www.wikidata.org/wiki/Wikidata:WikiProject_Video_games/Identifiers Leipzig University Library Using Wikidata for Video Game Research Wikidata as THE Linking Hub for Video Games Example: Dark Souls MusicBrainz Work ID Soundtrack Metacritic ID Reviews and scores subreddit Fan culture (e.g. memes) Behind The Voice Actors Video Game ID Voice Actors speedrun.com game id Speedruns Bibliothèque nationale de France ID Library ID IGN game ID News https://www.wikidata.org/wiki/Wikidata:WikiProject_Video_games/Identifiers Leipzig University Library Using Wikidata for Video Game Research Other Examples for Wikidata + Research Data Bio (GeneDB) Waagmeester A, Schriml L, Su A (2019) Wikidata as a linked-data hub for Biodiversity data . Biodiversity Information Science and Standards 3: e35206. https://doi.org/10.3897/biss.3.35206 Sebastian Burgstaller-Muehlbacher, Andra Waagmeester, Elvira Mitraka, Julia Turner, Tim Putman, Justin Leong, Chinmay Naik, Paul Pavlidis, Lynn Schriml, Benjamin M Good, Andrew I Su, Wikidata as a semantic framework for the Gene Wiki initiative, Database, Volume 2016, 2016, baw015, https://doi.org/10.1093/database/baw015 Manske M, Böhme U, Püthe C and Berriman M. GeneDB and Wikidata [version 1; peer review: 1 approved, 1 approved with reservations]. Wellcome Open Res 2019, 4:114 (https://doi.org/10.12688/wellcomeopenres.15355.1) History FactGrid https://blog.factgrid.de/welcome General Mietchen D, Hagedorn G, Willighagen E, Rico M, Gómez-Pérez A, Aibar E, Rafes K, Germain C, Dunning A, Pintscher L, Kinzler D (2015) Enabling Open Science: Wikidata for Research (Wiki4R). Research Ideas and Outcomes 1: e7573. https://doi.org/10.3897/rio.1.e7573 Leipzig University Library Using Wikidata for Video Game Research Advantages ● Other people's infrastructure ● Easy access (for researchers, student assistants, developers) ● Community knowledge ● Provenance and history of statements ● Open Refine integration -> easy bulk imports ● Collaborative data curation tool -> international collaboration Leipzig University Library Using Wikidata for Video Game Research Learning by doing Activities Using Wikidata for Video Game Research Timeline Company data enhancement since 2019-01 Company data 2017 Import Link Start 2018-05 Import Project 2019-08 Try own Wikibase OLAC Video 2018-01 Game Genre 2019-01 Curation of links since 2019-08 Leipzig University Library Using Wikidata for Video Game Research Activities Bulk Imports − Video game companies (names, locations, inception, industry, identifiers) − Links of video game databases (Media Art DB, GameFAQs) Ongoing data enhancement − Locations and Mobygames ID for video game companies − New company items − Links to video game database Mobygames (via Mix 'n' Match) − New game items Discussion − Community-driven Initiatives and Research Avenues Workshop (July 2019) Leipzig University Library Using Wikidata for Video Game Research Pitfalls and Confusions Challenges Using Wikidata for Video Game Research Challenges Company Data − Company data often doesn't match the principles of notability -> some were deleted :( Does not meet the notability policy: content was: "Beyond Interactive, Inc." − small companies − perhaps doesn't exist anymore − little information about it − Complex relationships between or inside companies (subsidiaries, branches, successors, merges, …) Leipzig University Library Using Wikidata for Video Game Research Challenges Video Game Data − Concepts, vocabulary, inconsistencies, ... Leipzig University Library Using Wikidata for Video Game Research Wikidata and Research Data Management? Using Wikidata for Video Game Research Support Research Data Lifecycle − Wikidata for − Collect and capture data − Collaborate and analyse data − Store data − Share data − Discover and reuse data Research data lifecycle diagram by Jisc CC BY-NC-ND Leipzig University Library Using Wikidata for Video Game Research Is it Findable, Accessible, Interoperable, and Reusable (FAIR) − yes … Leipzig University Library Using Wikidata for Video Game Research Is it Findable, Accessible, Interoperable, and Reusable (FAIR) − yes … − … but, we cannot import everything: research dataset contains data derived from a 3rd party or which is not suitable with Wikidata − copyright (databases) − personal information (credit data) − Approach a: Own Wikibase instance − we failed with that in 2018, but maybe in the future it gets easier for research projects − Approach b: Data dumps in repositories (also needed for publications!) Leipzig University Library Using Wikidata for Video Game Research Résumé/Resümee #sharingiscaring − After two years: − it's getting better and better − active community − data is (mostly) stable − We are convinced that: − Having research data on Wikidata increases its sustainability − When research project end, data remain on an independently funded infrastructure − Wikidata allows rapid integration with other domains − The project activities are well documented inside Wikidata − Open = many eyes = fast curation loops = less errors Leipzig University Library Using Wikidata for Video Game Research Next steps − WikidataCon 2019 − "Sum of All video games − 2019 edition" (Jean-Frédéric Berthelot, Envel Le Hir, Tracy Hoffmann) − Work on data model − Further data curation Leipzig University Library Using Wikidata for Video Game Research Thanks! A big thank you goes out to @Jean-Fred ( ´ ▽ ` )b https://www.wikidata.org/wiki/Wikidata:WikiProject_Video_games https://diggr.link/ https://github.com/diggr/ The contents of this presentation - except where otherwise noted and with the exception of the graphic design elements and the logos of the Leipzig University and its institutions - are licensed under Creative Commons Attribution 4.0 International. Leipzig University Library.