Creating Library Linked Data with Wikibase
OCLC RESEARCH REPORT

Creating Library Linked Data with Wikibase: Lessons Learned from Project Passage

Jean Godby, OCLC Research
Karen Smith-Yoshimura, OCLC Research
Bruce Washburn, OCLC Research
Kalan Knudson Davis, University of Minnesota
Karen Detling, National Library of Medicine
Christine Fernsebner Eslao, Harvard University
Steven Folsom, Cornell University
Xiaoli Li, University of California, Davis
Marc McGee, Harvard University
Karen Miller, Northwestern University
Honor Moody, Harvard University
Craig Thomas, Harvard University
Holly Tomren, Temple University

© 2019 OCLC Online Computer Library Center, Inc. This work is licensed under a Creative Commons Attribution 4.0 International License. http://creativecommons.org/licenses/by/4.0/

August 2019
OCLC Research
Dublin, Ohio 43017 USA
www.oclc.org

ISBN: 978-1-55653-134-7
DOI: 10.25333/faq3-ax08
OCLC Control Number: 1110105996

ORCID iDs
Jean Godby https://orcid.org/0000-0003-0085-2134
Karen Smith-Yoshimura https://orcid.org/0000-0002-8757-2962
Bruce Washburn http://orcid.org/0000-0003-4396-7345
Kalan Knudson Davis https://orcid.org/0000-0002-1032-6042
Christine Fernsebner Eslao https://orcid.org/0000-0002-7621-916X
Steven Folsom https://orcid.org/0000-0003-3427-5769
Xiaoli Li https://orcid.org/0000-0001-5362-2151
Marc McGee https://orcid.org/0000-0001-5757-1494
Karen Miller https://orcid.org/0000-0002-9597-2376
Craig Thomas https://orcid.org/0000-0002-4027-7907
Holly Tomren https://orcid.org/0000-0002-6062-1138

Please direct correspondence to: OCLC Research [email protected]

Suggested citation: Godby, Jean, Karen Smith-Yoshimura, Bruce Washburn, Kalan Knudson Davis, Karen Detling, Christine Fernsebner Eslao, Steven Folsom,
Xiaoli Li, Marc McGee, Karen Miller, Honor Moody, Craig Thomas, and Holly Tomren. 2019. Creating Library Linked Data with Wikibase: Lessons Learned from Project Passage. Dublin, OH: OCLC Research. https://doi.org/10.25333/faq3-ax08.

CONTENTS

Executive Summary
Introduction
    Barriers to adoption of linked data in the library resource-description workflow
    Project Passage: The OCLC Research Linked Data Wikibase Prototype
    The terms of the pilot
    The pilot study participants
    How this report is organized
An Overview of the MediaWiki/Wikibase Platform and OCLC Project Prototypes
    A Wikibase article, page, or “entity”
    The Wikibase entity ecosystem
    MediaWiki and Wikibase
    The Triplestore, SPARQL Query API, and the SPARQL Query user interface
    Applications that use the MediaWiki/Wikibase environment and the SPARQL Query service: OpenRefine and the Passage Explorer
    Data import with PyWikibot and the Passage Retriever
    The baseline dataset
    Three phases of user-centered development and testing
    Phase 1: Introducing Wikibase, evaluating the initial dataset, promoting SPARQL queries
    Phase 2: Implementing requested features, testing constraints, and promoting the Explorer
    Phase 3: Enlarging the set of participants, developing documentation, holding office hours, and testing the Retriever
    A robust software foundation
Studies of Resource Description in Passage
    Use cases in library metadata creation using the Wikibase editing interface
    Works, translations, and Chinese-language descriptions
    A translated work of philosophy
    Wikibase “person” and “place” entities in Chinese
    A person: Sun Yat-sen
    A place: Bianliang
    Visual resources
    A historical map
    Photographs, a postcard, and a poster
    Establishing a reference with uniquely identifying data
    Defining and naming events
    Building a context for interpretation
    Archives
    A musical work associated with an event
    Creating and editing Items in Passage
    Summary of use cases and first impressions
Lessons Learned and Reflections
    How Passage differs from other linked data projects in the library community
    Seven lessons learned
    Reflections and perspectives
    The transition from human-readable records to knowledge graphs as a paradigm shift
    New tasks
    Tasks that become obsolete
    What is still necessary
    Best practices are still important
    Librarians will still uphold the values of authoritativeness and quality
    Reinventing crowd-sourcing
    Interoperability with Wikidata and other sources of linked data
    Local Wikibase instances
    Direct collaboration with Wikidata
Looking Forward
Acknowledgements
Notes

FIGURES

FIGURE 1 Structural components in the Wikibase editing page, Charlie Kritschmar (WMDE) [CC0], from Wikimedia Commons
FIGURE 2 MediaWiki and Wikibase environment
FIGURE 3 MediaWiki and Wikibase environment with a parallel Triplestore, SPARQL Query API, and SPARQL Query user interface
FIGURE 4 Applications