Using Linked Data in Purposive Social Networks


UNIVERSITY OF SOUTHAMPTON
FACULTY OF PHYSICAL AND APPLIED SCIENCES
Electronics and Computer Science

by Priyanka Singh

Thesis for the degree of Doctor of Philosophy
September 2016

ABSTRACT

The Web has provided a platform for people to collaborate by using collective intelligence. Messaging boards and Q&A forums are examples where people broadcast their issues and other people provide solutions. Such communities are defined as a Purposive Social Network (PSN) in this thesis. A PSN is a community where people with similar interests and varied expertise come together, use collective intelligence to solve common problems in the community, and build tools for a common purpose.

Usually, Q&A forums are closed or semi-open, and the data are controlled by the websites. Difficulty in searching for and discovering information is one issue: people searching for answers or experts on a website can only see results from its own network, and so lose access to a whole community of experts on other websites. Another issue in Q&A forums is not getting any response from the community. There is a long tail of questions that get no answer.

The thesis introduces the Suman system, which utilises Semantic Web (SW) and Linked Data technologies to solve the above challenges. SW technologies are used to structure the community data so that it can be decentralized and used across platforms. Linked Data helps to find related information about linked resources. The Suman system uses available tools to solve the named entity disambiguation problem and to add semantics to the PSN data. It uses a novel combination of semantic keyword search and traditional text search techniques to find similar, already answered questions for unanswered ones, expands the query terms with added semantics, and uses crowdsourced data to rank the results. Furthermore, the Suman system also recommends experts who can answer those questions. This helps to narrow down the long tail of unanswered questions in such communities.

The Suman system is designed using the Design Science methodology and was evaluated by users in two experiments. The results were statistically analysed to show that the keywords generated by the Suman system were rated higher than the original keywords from the websites. They also showed that the participants agreed with the algorithm's rating of the answers provided by the Suman system. StackOverflow and Reddit are used as examples of PSNs and to build an application as a proof of concept of the Suman system.

Contents

Declaration of Authorship
Acknowledgements
Nomenclature
1 Introduction
  1.1 Overview
  1.2 Research Questions
  1.3 Research Methodology
    1.3.1 Scientific Steps to Answer KQ1 and KQ2
      1.3.1.1 Knowledge Problem Investigation
      1.3.1.2 Research Design
      1.3.1.3 Research Design Validation
      1.3.1.4 Research Execution
      1.3.1.5 Analysis of Results
    1.3.2 Scientific Steps to Answer DP1 and DP2
      1.3.2.1 Problem Investigation
      1.3.2.2 Treatment Design
      1.3.2.3 Design Validation
      1.3.2.4 Treatment Implementation
      1.3.2.5 Implementation Evaluation
    1.3.3 Scientific Steps to Answer KQ3, KQ4 and KQ5
      1.3.3.1 Knowledge Problem Investigation
      1.3.3.2 Research Design
      1.3.3.3 Research Design Validation
      1.3.3.4 Research Execution
      1.3.3.5 Analysis of Results
  1.4 Research Contribution
  1.5 Structure of Thesis
2 Background
  2.1 Emergence of Social Web
    2.1.1 Web 2.0 and Social Media
    2.1.2 Content-specific Social Networking Services
  2.2 Collective Intelligence and Crowdsourcing
    2.2.1 Information Sharing
    2.2.2 Human Computation
    2.2.3 Quality Management and User Recommendation
  2.3 Semantic Web and Linked Data
    2.3.1 Semantic Web Technology
    2.3.2 Linked Data Technology
      2.3.2.1 Benefits of Linked Data
      2.3.2.2 Linking the Datasets
      2.3.2.3 Linked Data Cloud
    2.3.3 Semantic Web Vocabularies
      2.3.3.1 FOAF
      2.3.3.2 SIOC
    2.3.4 Social Semantic Applications
      2.3.4.1 Semantic Tagging
      2.3.4.2 Semantic blogging and microblogging
      2.3.4.3 Semantic Wiki
    2.3.5 Semantic Search and Query
      2.3.5.1 Information Retrieval
      2.3.5.2 Web Search
      2.3.5.3 Semantic Search
        2.3.5.3.1 Keyword Disambiguation
        2.3.5.3.2 Concept mapping
        2.3.5.3.3 Semantic Queries
        2.3.5.3.4 Expert Recommendation
3 Purposive Social Network
  3.1 What is Purposive Social Network
  3.2 Different types of communities in Purposive Social Network
    3.2.1 Information Based Community
    3.2.2 Interest Based Community
    3.2.3 Expert Based Community
    3.2.4 Location Based Community
  3.3 Properties of Purposive Social Network
    3.3.1 Community Size
    3.3.2 Focused Interest
    3.3.3 Direct Communication
    3.3.4 Active Participation
    3.3.5 Short Lifespan
    3.3.6 Strong Incentive
  3.4 Benefits of Purposive Social Network
    3.4.1 Information Exchange and Self-interest
    3.4.2 Symbiotic Relation and Social Exchange
    3.4.3 Social Recognition and Personal Satisfaction
    3.4.4 Recommendation System
    3.4.5 Expert Finder
  3.5 Challenges in Purposive Social Network
    3.5.1 Recruiting and Retaining Users
    3.5.2 Incentive Model
    3.5.3 Quality Control
    3.5.4 Search and Discovery of Quality Content
  3.6 Case Study of Purposive Social Network
    3.6.1 StackOverflow Analysis
      3.6.1.1 Question and Answers
      3.6.1.2 Users
      3.6.1.3 Tags
      3.6.1.4 Votes and Reputation
      3.6.1.5 Communication network structure
      3.6.1.6 Tags network
      3.6.1.7 Incentive Design
      3.6.1.8 Quality Control
      3.6.1.9 Community Moderation
    3.6.2 Reddit Analysis
      3.6.2.1 Communication network structure
      3.6.2.2 Subreddit network
      3.6.2.3 Incentive Design
      3.6.2.4 Quality Control
      3.6.2.5 Community Moderation
      3.6.2.6 Discourse Analysis
4 The Suman System
  4.1 Use of Semantic Web and Linked Data in Purposive Social Network
    4.1.1 Research problem and challenges
    4.1.2 Benefit of Using Semantic Web and Linked Data in Purposive Social Network
      4.1.2.1 Structured Data
      4.1.2.2 Linking People to People and People to Data
      4.1.2.3 Multidimensional Network and Graph
      4.1.2.4 Integrated Knowledge
      4.1.2.5 Smart Query and Search
      4.1.2.6 Social Network Analysis
    4.1.3 Research Validation
    4.1.4 Research execution
    4.1.5 Result evaluation
  4.2 The Suman System
    4.2.1 Problem Investigation
    4.2.2 Treatment Design of the Suman System
      4.2.2.1 Data Collection
      4.2.2.2 Data Structuring
      4.2.2.3 Keyword Annotation and Linking
      4.2.2.4 Database and Query
        4.2.2.4.1 Database Indexing and Configuration
      4.2.2.5 Suman Search Algorithm
        4.2.2.5.1 Detailed Explanation of Each Step
      4.2.2.6 Expert Finder
        4.2.2.6.1 Detailed Explanation of Each Step
      4.2.2.7 Design Innovation in the Suman System
        4.2.2.7.1 Special feature of the Suman algorithms
    4.2.3 Design Validation
      4.2.3.1 Validating the Suman system design
    4.2.4