Wikidata for Digital Preservation

Total Page:16

File Type:pdf, Size:1020Kb

Wikidata for Digital Preservation Wikidata for Digital Preservation Katherine Thornton Kenneth Seals-Nutt Yale University Yale University [email protected] Euan Cochrane Carl Wilson Yale University Open Preservation Foundation ABSTRACT CCS CONCEPTS The Wikidata knowledge base provides a public infrastructure for • Human-centered computing → Wikis; • Information sys- machine-readable metadata about computing resources. To opti- tems → Digital libraries and archives; mize the Wikidata knowledge base for digital preservation pro- fessionals, we need to create additional structured descriptions KEYWORDS about file formats and software titles. To facilitate this process, we Wikidata, digital preservation, metadata introduce the Wikidata for Digital Preservation portal. This free ACM Reference Format: software portal allows people to browse, and to contribute data to, Katherine Thornton, Kenneth Seals-Nutt, Euan Cochrane, and Carl Wilson. the Wikidata knowledge base. The interface guides users in con- 2018. Wikidata for Digital Preservation. In Proceedings of iPres Conference tributing data in alignment with current data models for the domain (iPres’18). ACM, New York, NY, USA, 10 pages. https://doi.org/ of computing, which are collaboratively created by members of the Wikidata community. 1 INTRODUCTION Structured data about file formats, the many versions of soft- Wikidata is becoming a centralized repository for machine-readable, ware titles, and computing environments, are already available in structured data about resources in the domain of computing. Wiki- Wikidata. The content of Wikidata is licensed under the Creative data allows us to store cross-domain data, and provides a powerful Commons Zero (CC0) license, meaning that anyone can reuse the query interface. data for any purpose. The content in Wikidata is available in more than 350 human languages. The data in Wikidata is FAIR data, and it is linked open data. Our portal provides a streamlined interface designed for the needs of the digital preservation community. When using the Wikidata for Digital Preservation portal, users will be enriching a multilingual repository of data that is open for reuse across institutional boundaries. The vision we share of using Wiki- data as a registry of technical metadata for the domain of computing is a vision of semantic integration of data from multiple sources [6]. We publish the source code for the Wikidata for Digital Preser- vation portal under the Gnu General Public License, v31 to ensure that this tool will remain in the public domain. We aligned our license decision with those of other tools in the Wikidata ecosys- tem, and the enabling software infrastructure of Wikidata itself, to ensure the sustainability of this infrastructure over time. Active developer communities maintain and provision this public infras- tructure, and cultural heritage organizations are able to freely reuse the descriptive and technical metadata for software, file formats, Figure 1: A screenshot of the Wikidata item for Matroska. and all other resources in the domain of digital preservation in hundreds of human languages. Metadata about software, file formats and computing environ- ments is necessary for the identification and management of these 1https://github.com/WikiDP/portal-proto/blob/master/LICENSE.GPL entities. Creating machine-readable metadata about resources in the domain of computing allows digital preservation practitioners to then automate programmatic interactions with these entities. Permission to make digital or hard copies of part or all of this work for personal or People working in digital preservation have a shared need for accu- classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation rate, reusable, technical and descriptive metadata about the domain on the first page. Copyrights for third-party components of this work must be honored. of computing. For all other uses, contact the owner/author(s). The data about resources in the domain of computing within iPres’18, September, Cambridge, MA USA © 2018 Copyright held by the owner/author(s). Wikidata fulfills the functions of a metadata registry [33]. A tech- nical registry in the domain of digital preservation is a data store iPres’18, September, Cambridge, MA USA Thornton et al. of descriptions of file formats, software used to create or interact resources in a flexible way. SPARQL queries enable us to search for with files, configured environments, operating systems and sus- file formats by media type, to search for software titles by their read- tainability factors. The Wikidata community has added structural, able file formats, or to search for software titles published within descriptive and technical metadata about these resources to the specific windows of time. We can use SPARQL queries to search for knowledge base. The combination of these different types of meta- software titles by genre, to search for technical specifications that data in a single repository enables us to query the data in more describe a particular file format, or to search for a digitized copy flexible ways. In Figure 1, there is a screenshot of a Wikidata item of a user guide for a piece of legacy software. Because Wikidata is and the properties used to describe the item. The metadata for file a cross-domain knowledge base, the range of data combinations formats in Wikidata includes information about who developed allow users to query data that span technical metadata as well as the format, how it is related to other formats, and the file format descriptive metadata aspects of these resources. Wikidata is the first signature. We contribute data about the domain of computing to Wikidata because it is multilingual, and anyone can contribute to the project. Many researchers and practitioners in the field of digital preserva- tion have identified the fragmented nature of technical registries [13]. When registry data is stored in multiple systems, communi- cation also takes place in distributed settings. In response to the fragmented landscape, several groups have presented plans for how to centralize and unify technical registry information [14, 27, 28]. Both GDFR and UDFR have concluded work. The sustainability of a technical registry is important set of factors to consider. The creation of a sustainability plan for the infrastructure of the registry is a crucial component to consider. In this paper, we explore an alternative to creating infrastructure for a repository of technical registry data through use of existing, independently supported, infrastructure provided by the knowledge base of structured data, Figure 2: Wikidata item and data organization. Wikidata. information system with a graphical user interface that supports 2 WHY WIKIDATA? public read and write access to the semantic web [4, 9]. This means Wikidata went live in late 2012 [30]. The infrastructure of Wikidata that anyone with information to contribute about file formats, soft- is collaboratively built via commons-based peer production [2, 3, ware, etc. can add that information directly to Wikidata, and it will 16]. Commons-based peer production is the name given to open be available as linked open data for others to use immediately after collaboration systems where users are creating content under the clicking the publish button. This also means that rather than in- agreement that all content will remain in the public domain. This dependently curating this information within our own institutions, means that all of the work products of the community are free to be we can collaboratively curate these resources, which reduces the reused by others. The peer production aspect refers to how users metadata workload for all. coordinate work themselves. Wikidata is edited by volunteers from In our previous work, we described the data modeling process all over the world in more than 350 languages [7]. that we participate in to collaboratively create data models for the In addition to a free software infrastructure, the Wikidata com- domain of computing [29]. The Wikidata for Digital Preservation munity also publishes all content in the knowledge base under a portal mirrors these data models, and we update the portal on a Creative Commons Zero License. The Wikidata community makes monthly basis as new properties get added to Wikidata, or changes dumps of previous versions of the content of the knowledge base are made to the constraint system. available. The infrastructure of the Wikidata knowledge base is maintained by an international community of people. For cultural 3 DATA MODEL heritage institutions who find the structured data in Wikidata use- The Wikidata data model is minimal, consisting of items and proper- ful for work flows, this mans that there will be much less staff ties. Properties are used to express statements about items. In Figure time necessary to design, build and maintain infrastructure for this 2, we see a screenshot of the Wikidata page for entity Q1676669 data. For cultural heritage institutions will limited digital preserva- JPEG File Interchange Format, version 1.02. Each item is tion budgets, this means that they can now access descriptive and allotted a page in Wikidata and has a unique identifier, with prefix technical metadata for tens of thousands software titles and more Q plus a string of numbers, ex. Q1676669, which is assigned to this than two thousand file formats without having to create manage item. The two properties instance of, and is part of, are used or maintain that data locally. to assert statements about the item. A claim and its references are The Wikidata community maintains a public SPARQL endpoint. considered to be a statement. A description of the Wikidata data As of October, 2017 the endpoint was consistently handling 8.5 model can be found in [31]. Sets of properties are not pre-defined for million SPARQL queries per day [20]. Writing SPARQL queries classes in Wikidata. Editors may freely select properties to express for the Wikidata endpoint allows users to search for data about statements on Wikidata items.
Recommended publications
  • 1 Wikipedia: an Effective Anarchy Dariusz Jemielniak, Ph.D
    Wikipedia: An Effective Anarchy Dariusz Jemielniak, Ph.D. Kozminski University [email protected] Paper presented at the Society for Applied Anthropology conference in Baltimore, MD (USA), 27-31 March, 2012 (work in progress) This paper is the first report from a virtual ethnographic study (Hine, 2000; Kozinets, 2010) of Wikipedia community conducted 2006-2012, by the use of participative methods, and relying on an narrative analysis of Wikipedia organization (Czarniawska, 2000; Boje, 2001; Jemielniak & Kostera, 2010). It serves as a general introduction to Wikipedia community, and is also a basis for a discussion of a book in progress, which is going to address the topic. Contrarily to a common misconception, Wikipedia was not the first “wiki” in the world. “Wiki” (originated from Hawaiian word for “quick” or “fast”, and named after “Wiki Wiki Shuttle” on Honolulu International Airport) is a website technology based on a philosophy of tracking changes added by the users, with a simplified markup language (allowing easy additions of, e.g. bold, italics, or tables, without the need to learn full HTML syntax), and was originally created and made public in 1995 by Ward Cunningam, as WikiWikiWeb. WikiWikiWeb was an attractive choice among enterprises and was used for communication, collaborative ideas development, documentation, intranet, knowledge management, etc. It grew steadily in popularity, when Jimmy “Jimbo” Wales, then the CEO of Bomis Inc., started up his encyclopedic project in 2000: Nupedia. Nupedia was meant to be an online encyclopedia, with free content, and written by experts. In an attempt to meet the standards set by professional encyclopedias, the creators of Nupedia based it on a peer-review process, and not a wiki-type software.
    [Show full text]
  • From Faculty Enemy to Faculty Enabler
    Wikipedia @ 20 • ::Wikipedia @ 20 9 The First Twenty Years of Teaching with Wikipedia: From Faculty Enemy to Faculty Enabler Robert E. Cummings Published on: Oct 15, 2020 Updated on: Nov 16, 2020 License: Creative Commons Attribution 4.0 International License (CC-BY 4.0) Wikipedia @ 20 • ::Wikipedia @ 20 9 The First Twenty Years of Teaching with Wikipedia: From Faculty Enemy to Faculty Enabler Wikipedia jumbles the faculty roles of teaching, researching, and service by challenging traditional notions of faculty expertise, but a more integrated approach for these roles is also possible. I have never been able to see Wikipedia without the lens of a faculty member. On the one hand, I subconsciously carry with me perpetual concerns about accuracy and reliability. I am often trying to prove to myself that Wikipedia is legitimate work: it is, after all, the world’s largest encyclopedia. Writing to Wikipedia follows a set of complicated rules. So if it is a serious project with rules and enforcers and the world relies on its information, it ought to be universally accepted. And on the other hand? I want Wikipedia to be a lark. I also want Wikipedia to be a place where I wander freely and learn lots and lots of information. A guilty pleasure, not unlike pulling a book off a library shelf and simply reading it. I suspect that these anxieties I experience might speak to inherent conflict in being a faculty member while engaging Wikipedia. Reading Wikipedia, contributing to Wikipedia, and certainly teaching with Wikipedia jumble and reconfigure the faculty identities of teacher and researcher because they recontextualize our relationships with expertise itself.
    [Show full text]
  • Bridging the Gap Between Wikipedia and Academia
    Bridging the Gap Between Wikipedia and Academia Dariusz Jemielniak New Research on Digital Societies (NeRDS) group, Kozminski University, Jagiellonska 59, 03-301 Warszawa, Poland. E-mail: [email protected] Eduard Aibar Research Group on Open Science & Innovation, Universitat Oberta de Catalunya, Av. Tibidabo, 39-43, 08035 Barcelona, Spain. E-mail: [email protected] In this opinion piece, we would like to present a short liter- isms, and a community whose enthusiasm will inevitably ature review of perceptions and reservations towards wane: Since 2005 Eric Goldman, a professor of law at Santa Wikipedia in academia, address the common questions Clara University, keeps predicting that “Wikipedia Will Fail about overall reliability of Wikipedia entries, review the actual practices of Wikipedia usage in academia, and con- Within 5 Years” (Goldman, 2005), changing only the clude with possible scenarios for a peaceful coexistence. expected demise date (Anderson, 2009). Because Wikipedia is a regular topic of JASIST publica- Even people sympathetic to open collaboration models, tions (Lim, 2009; Meseguer-Artola, Aibar, Llados, consider Wikipedia to rely mainly on the wisdom of crowds, Minguillon, & Lerga, 2015; Mesgari, Okoli, Mehdi, Nielsen, & Lanamaki,€ 2015; Okoli, Mehdi, Mesgari, Nielsen, & not necessarily on actual expertise (Surowiecki, 2004). It is Lanamaki,€ 2014), we hope to start a useful discussion with called a “flawed knowledge community” (Roberts & Peters, the right audience. 2011, p. 36), a broken surrogate. And the common view
    [Show full text]
  • State of the Community.Pdf
    State of the community Dariusz Jemielniak, “pundit” (the WMF Trustee) Wikimania 2016 http://tiny.cc/wikimania Wikimedia community logo attribution: Artur Jan Fijałkowski pl.wiki/Commons: WarX, [email protected] [Public domain], via Wikimedia Commons https://commons.wikimedia.org/wiki/File:Wikimedia_Community_Logo.svg State of the community? Wikimedia community logo attribution: Artur Jan Fijałkowski pl.wiki/Commons: WarX, [email protected] [Public domain], via Wikimedia Commons https://commons.wikimedia.org/wiki/File:Wikimedia_Community_Logo.svg I do not know. And I am not sure anyone does Disclaimer User digest presentations are talks which provide an overview of a topic and are meant to be useful to anyone, without any particular background knowledge Read more here: https://wikimania2016.wikimedia.org/wiki/User_digest_presentations [[Wikimedia projects]] From https://wikimediafoundation.org/wiki/Our_projects ● Thus 16 different projects* ● 11 of them are content projects (Wikipedia, Wikibooks, Wikiversity, Wikinews, Wiktionary, Wikisource, Wikiquote, Wikivoyage, Wikimedia Commons, Wikidata, Wikispecies) ● 8 of them can be in different languages... *A full listing of Wikimedia projects is available here: https://wikimediafoundation.org/wiki/Special:SiteMatrix ● Wikipedias exist in almost 300 different languages... ○ But only around 100 of them are active and mature enough to be useful as encyclopedias *A list of Wikipedias is available here: https://meta.wikimedia.org/wiki/List_of_Wikipedias Structure ● A core element and basis is Volunteers
    [Show full text]
  • Arxiv:1509.04206V1 [Cs.CY] 14 Sep 2015 Enth¨Alt Kleinere Korrekturen, Die in Fassung 3 Bis Einschließlich 12
    Nicht-propositionales Wissen aus Literaturlekture¨ und Bedingungen seiner Darstellbarkeit in Wikipedia-Eintr¨agen zu literarischen Werken von Claudia Koltzenburg Dieses Werk hat die Lizenz CC BY-SA 4.0 (Creative Commons Attribution-Share Alike 4.0 Unported) Wo diese Arbeit in welchem Ausgabeformat und welcher Fassung zu finden ist: Belegfassung (Print-pdf) (Fassung 1, time stamp: 3. September 2015, enth¨alt 7 Anh¨ange und tr¨agt einen l¨angeren Titel: The sum of all human knowledge\: Nicht- " propositionales Wissen aus Literaturlekture¨ und Bedingungen seiner Darstellbarkeit in Wikipedia-Eintr¨agen zu literarischen Werken), Dissertation Universit¨at Tubingen,¨ Philo- sophische Fakult¨at, Open Access, doi: 10.15496/publikation-6100 als pdf mit Verlinkungen (Fassung 2, uberarbeitet¨ und gekurzt,¨ time stamp: 7. September 2015) 1. bei eprints.rclis.org (pdf) 2. bei commons.wikimedia.org (pdf) 3. bei arxiv.org (pdf) arXiv:1509.04206v1 [cs.CY] 14 Sep 2015 Enth¨alt kleinere Korrekturen, die in Fassung 3 bis einschließlich 12. September 2015 vor- genommen worden sind, mit Dank an User:95.113.139.178, User:Chricho und User:Cethegus bei Wikiversity zum Kommentieren und Weiterschreiben (Fassung 3, mit Er- weiterungen in Kapitel Schreibweisen bei Ulrike Draesner, Hans Ulrich Gumbrecht und Ina Hartwig, time stamp der ersten Version: 12:13:47 UTC am 1. September 2015, Antikriegstag) Mail: <[email protected]> 2 Abstract (English title: Wikipedia entries on fiction and non-propositional knowledge representation) Given that Wikipedia entries are likely to extert a strong influence on how literary texts are perceived { due to their preferential ranking in Google { there is some de- mand that research dealing with the transfer of knowledge on literature to the public be more concerned with looking into both the content that is available for free on the Web and any aspect that may come with writing about literature for free.
    [Show full text]
  • ABSTRACT HOW WIKIPEDIA EDITORS COLLABORATE on ARTICLE “TALK” PAGES Victor Magnuson, M.A. Department of English Northern Illi
    ABSTRACT HOW WIKIPEDIA EDITORS COLLABORATE ON ARTICLE “TALK” PAGES Victor Magnuson, M.A. Department of English Northern Illinois University, 2018 Dr. Jessica Reyman, Director This thesis is a case study of how Wikipedia editors use article “talk” pages to collaborate. A review of related literature provides background on how Wikipedia has rapidly grown into a community of volunteers to make it the largest online collaboration project ever. To perform the analysis, four diverse articles were chosen to examine the discussions and dialogue from their most recent or selected “talk” page. The information resulting from this study has provided both qualitative and quantitative data for analysis. By codifying terms deemed as potential barriers to participation (expertise requirements for Wikipedia policy and subject-matter) Wikipedia’s low-barrier-to-entry, which is articulated through its open editor policy, can be more clearly determined. Furthermore, this study seeks qualitative data regarding Wikipedia’s civility policy as demonstrated by editors themselves, as well as strategies for reaching consensus. In doing so, future contributors can be informed to better understand how Wikipedia’s editors act as a community in attaining the goal of continuous improvement. NORTHERN ILLINOIS UNIVERSITY DE KALB, ILLINOIS MAY 2018 HOW WIKIPEDIA EDITORS COLLABORATE ON ARTICLE “TALK” PAGES BY VICTOR MAGNUSON ©2018 Victor Magnuson A THESIS SUBMITTED TO THE GRADUATE SCHOOL IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE MASTER OF ARTS DEPARTMENT OF ENGLISH Thesis Director: Jessica Reyman ACKNOWLEDGEMENTS I would like to acknowledge and thank those who have supported me in the process of completing this project. In particular, I would like to thank my thesis director, Dr.
    [Show full text]
  • Wikipedia @ 20
    Wikipedia @ 20 Wikipedia @ 20 Stories of an Incomplete Revolution Edited by Joseph Reagle and Jackie Koerner The MIT Press Cambridge, Massachusetts London, England © 2020 Massachusetts Institute of Technology This work is subject to a Creative Commons CC BY- NC 4.0 license. Subject to such license, all rights are reserved. The open access edition of this book was made possible by generous funding from Knowledge Unlatched, Northeastern University Communication Studies Department, and Wikimedia Foundation. This book was set in Stone Serif and Stone Sans by Westchester Publishing Ser vices. Library of Congress Cataloging-in-Publication Data Names: Reagle, Joseph, editor. | Koerner, Jackie, editor. Title: Wikipedia @ 20 : stories of an incomplete revolution / edited by Joseph M. Reagle and Jackie Koerner. Other titles: Wikipedia at 20 Description: Cambridge, Massachusetts : The MIT Press, [2020] | Includes bibliographical references and index. Identifiers: LCCN 2020000804 | ISBN 9780262538176 (paperback) Subjects: LCSH: Wikipedia--History. Classification: LCC AE100 .W54 2020 | DDC 030--dc23 LC record available at https://lccn.loc.gov/2020000804 Contents Preface ix Introduction: Connections 1 Joseph Reagle and Jackie Koerner I Hindsight 1 The Many (Reported) Deaths of Wikipedia 9 Joseph Reagle 2 From Anarchy to Wikiality, Glaring Bias to Good Cop: Press Coverage of Wikipedia’s First Two Decades 21 Omer Benjakob and Stephen Harrison 3 From Utopia to Practice and Back 43 Yochai Benkler 4 An Encyclopedia with Breaking News 55 Brian Keegan 5 Paid with Interest: COI Editing and Its Discontents 71 William Beutler II Connection 6 Wikipedia and Libraries 89 Phoebe Ayers 7 Three Links: Be Bold, Assume Good Faith, and There Are No Firm Rules 107 Rebecca Thorndike- Breeze, Cecelia A.
    [Show full text]
  • Postfassung 2015-09-01.Pdf
    The sum of all human knowledge\ " Nicht-propositionales Wissen aus Literaturlekture¨ und Bedingungen seiner Darstellbarkeit in Wikipedia-Eintr¨agen zu literarischen Werken DISSERTATION zur Erlangung des akademischen Grades Doktorin der Philosophie in der Philosophischen Fakult¨at der Eberhard Karls Universit¨at Tubingen¨ vorgelegt von Claudia Koltzenburg aus Osnabruck¨ 2015 Gedruckt mit Genehmigung der Philosophischen Fakult¨at der Eberhard Karls Universit¨at Tubingen¨ Dekan: Prof. Dr. Jurgen¨ Leonhardt Hauptberichterstatter: Prof. Dr. Jurgen¨ Wertheimer Mitberichterstatterin: Prof. Dr. Ingrid Hotz-Davies Tag der mundlichen¨ Prufung:¨ 21.07.2015 Universit¨atsbibliothek Tubingen:¨ TOBIAS-lib Abstract Given that Wikipedia entries are likely to extert a strong influence on how literary texts are perceived { due to their preferential ranking in Google { there is some de- mand that research dealing with the transfer of knowledge on literature to the public be more concerned with looking into both the content that is available for free on the Web and any aspect that may come with writing about literature for free. This con- tribution argues from within Wikipedia's multidisciplinary consensus-driven space in which propositional knowledge is given priority that it would be essential for ent- ries on fiction to present non-propositional knowledge as being one of its hallmarks. For this aim, a special concept is developed that is designed to function as the stu- dy's formal object: Erlesnis\ (which in German is a pun that combines Erlebnis\ " " { adventure experience { and lesen\ { reading). It is defined as non-propositional " knowledge that has been acquired in an individual reading process. Writing about one's own Erlesnis in new ways is being tried out in essays on Traveling on One Leg (1989) by Herta Mueller, Save the Reaper (1998) by Alice Munro, Alfred and Emily (2008) by Doris Lessing and rein GOLD (2013) by Elfriede Jelinek respectively.
    [Show full text]
  • Wikimania Is an Annual Global Event Devoted to Wikimedia Projects
    Wikimania is an annual global event devoted to Wikimedia projects around the globe (including Wikipedia, Wikibooks, Wikinews, Wiktionary, Wikispecies, and Wikimedia Commons). The conference is a community gathering, giving the editors and users of Wikimedia projects an opportunity to meet each other, exchange ideas, report on research and projects, and collaborate on the future of the projects. Maps The Accommodation Guide 1. Medical Univ. Dormitory 1 5. Gdansk Univ. Dormitory 6 9. Hotel Gdansk 13. Hotel Hanza 2. SSM Youth Hostel 1 6. Old Town Hostel 10. Villa Biala Lilia 3. Railway Station 7. Music Academy Dormitory 2 11. Hotel Parnas W - WIKIMANIA 4. Hotel Hilton 8. Dom Muzyka (Musician House) 12. Holland House The Map of the Neighbourhood of the Venue 1. Conference Venue 3. A good place to buy souvenirs Green line: a route to the Venue 2. Areas with bars and restaurants 4. Ferry across the river You can find more information here: http://bit.ly/buTMGR 2 | Page Program at a Glance Friday, July 9 Friday, July 9 - Baltic Philharmonic Concert hall Jazz hall Green hall Oak hall 08.00 - 09.00 On-Site Registration Opening Ceremony (10 min for organizers) + Opening Plenary (Sue Gardner + WMF Board) 09.00 - 11.00 + Wikimania Madness (last 15 min) 11.00 - 11.30 Coffee Break + Spotlight on Posters Panels: Chapters: Collaboration Workshop: Brainstorming Talks: Wikiversity Talk+Workshop: Accessibility 11:30 - 13:00 and Coordination Wikimedia and Social Media Workshop 13.00 - 14.30 Lunch Break at Baltic Philharmonic: Gdańsk salon and foyer level
    [Show full text]
  • Ways of Knowing When Research Subjects Care
    Ways of Knowing When Research Subjects Care Dorothy Howard Lilly Irani Communication & Design Lab Communication & Design Lab San Diego, CA San Diego, CA [email protected] [email protected] ABSTRACT 1 INTRODUCTION This paper investigates a hidden dimension of research with Research subjects rarely have the chance to reflect on their real world stakes: research subjects who care – sometimes experiences of participating in research in the worlds where deeply – about the topic of the research in which they partic- researchers discuss, debate, and reflect. The voices and re- ipate. They manifest this care, we show, by managing how flections of researchers, on the other hand, are common. they are represented in the research process, by exercising Yet subjects of research certainly have situated knowledge politics in shaping knowledge production, and sometimes in – knowledge born of their practices and social positionality experiencing trauma in the process. We draw first-hand re- [70]. They also have their own projects – projects in which flections on participation in diversity research on Wikipedia, researchers may be an instrument. transforming participants from objects of study to active ne- There are many reasons why participants in human-computer gotiators of research process. We depict how care, vulnerabil- interaction (HCI) research may have interests, needs, and ity, harm, and emotions shape ethnographic and qualitative desires beyond offering themselves as a repository of expe- data. We argue that, especially in reflexive cultures, research rience or a specimen of practice. People might participate subjects are active agents with agendas, accountabilities, and because they are passionate or curious about technological political projects of their own.
    [Show full text]
  • Wikipedia Editor Drop-Off a Framework to Characterize Editors’ Inactivity
    Wikipedia Editor Drop-Off A Framework to Characterize Editors’ Inactivity Marc Miquel-Ribé Cristian Consonni David Laniado Eurecat - Centre Tecnològic de Eurecat - Centre Tecnològic de Eurecat - Centre Tecnològic de Catalunya Catalunya Catalunya [email protected] [email protected] [email protected] ABSTRACT spread to its adoption even among professionals and academia - it While there is extensive literature both on the motivations of has issues in being able to renew or grow its communities [3, 6]. Wikipedia’s editors and on newcomers’ retention, less is known In many Wikipedia language communities, the decline or stabi- about the process by which experienced editors leave. In this paper, lization in the number of editors was detected more than a decade we present an approach to characterize Wikipedia’s editor drop-off ago [3]. Despite dedicated interventions by the Wikimedia Foun- 1 2 3 as the transitional states from activity to inactivity. Our approach dation and other organizations, and groups of editors, it has 4 is based on the data that can be collected or inferred about editors’ not been possible to reverse this trend. While one can speculate activity within the project, namely their contributions to encyclope- whether the project is not as attractive to newcomers as in the dic articles, discussions with other editors, and overall participation. early days, we can observe that there are both language editions Along with the characterization, we want to advance three main and topics with very different
    [Show full text]
  • Wikidata Exploration for Social Science
    Wikidata exploration for Social Science Deadly death metal or rap yourself to death Leon Ciechanowski a Anna Kovbasiuk b Dariusz Jemielniak c a Management in Networked and Digital Societies (MINDS) Department, Kozminski University, Jagiellonska 57/59 Warsaw, Poland; Department of Psy- chology, University of Social Sciences and Humanities, Chodakowska 19/31, War- saw, Poland; Email: [email protected] b Faculty of Neurocognitive Science, University of Social Sciences and Humani- ties SWPS, Chodakowska 19/31, Warsaw, Poland; Email: akovba- [email protected] c Management in Networked and Digital Societies (MINDS) Department, Kozminski University, Jagiellonska 57/59 Warsaw, Poland; Email: [email protected] Keywords: Wikidata, social studies, Social Network Analysis Extended abstract In this presentation we are going to show the possibilities and limitations of Wiki- data [http://www.wikidata.org] – “Wikipedia for data”, or in other words, a sister service of Wikipedia providing tags and collaborative, shareable knowledgebase. Wikidata is, or at least aims to be, the ultimate source of structured data scraped from the biggest Internet encyclopedia. In this way, Wikidata allows for easy ex- ploration of various topics, that may answer academic hypotheses in a data-driven way [Vrandečić & Krötzsch 2014]. As an exemplary project based on data generated with the use of one of Wikidata tools: Wikidata Query Service, we will present a study inspired by the research of longevity of life of jazz musicians [Kinra & Okasha 1999]. The research compared the average duration of life of musicians playing on different instruments. The re- sults have shown that woodwind instrument players (especially saxophonists) lived the shortest, which may be caused by the overstrain of their respiratory and cere- brovascular systems.
    [Show full text]