A Comparison of Self-Organizing Maps and Pathfinder Networks for the Mapping of Co-Cited Authors Jan William Buzydlowski Howard D

A Comparison of Self-Organizing Maps and Pathfinder Networks for the Mapping of Co-Cited Authors

A Thesis
Submitted to the Faculty of Drexel University
by Jan William Buzydlowski
in partial fulfillment of the requirements for the degree of Doctor of Philosophy
December 2002

© Copyright 2003 Jan William Buzydlowski. All Rights Reserved

Dedications

To my father.

Acknowledgements

Howard D. White, PhD, thesis advisor, who never treated me as a student, but from whom I learned more than any other teacher.

The distinguished members of my committee:
· Xia Lin, PhD, who treated me as a colleague and friend.
· Katherine W. McCain, PhD, who always knew what to do.
· Richard Heiberger, PhD, who showed me fascinating new worlds.
· Stephen Kimbrough, PhD, for his invaluable contribution.

J. Jeffery Hand, Linda Marion, Joan Lussky for their camaraderie.

All of the experts involved with this study who were more than generous with their time and expertise.

Cheryl L. Berringer for her emotional support, without which none of this would have been possible.

Table of Contents

List of Tables
List of Figures
Abstract
1. INTRODUCTION AND STATEMENT OF PROBLEM
   1.1. Goal of the Research
   1.2. Research Questions
   1.3. Context of the Research
   1.4. Interpreting AuthorLink Maps: An Example
   1.5. Setting the Stage
2. RELEVANT LITERATURE
3. PATHFINDER NETWORKS
   3.1. Computing Pathfinder Networks
   3.2. Displaying Pathfinder Networks
4. SELF-ORGANIZING MAPS
   4.1. Computing Self-Organizing Maps
   4.2. Displaying Self-Organizing Maps
5. METHODOLOGY
   5.1. Overview
      5.1.1. Data
      5.1.2. The Computer System Used to Generate the Maps
   5.2. Procedure
      5.2.1. Selecting the Experts
      5.2.2. Selecting Author Names Used for the Experiment
      5.2.3. Extracting the Experts’ Mental Maps
      5.2.4. Creating a Data Structure to Represent the Groupings
      5.2.5. Determining the Groupings that Exist on Generated Visual Maps
      5.2.6. Combining the Co-Membership Matrices
   5.3. Research Questions
      5.3.1. Research Question 1
      5.3.2. Research Question 2
      5.3.3. Research Question 3
6. RESULTS
   6.1. Research Question 1
   6.2. Research Question 2
   6.3. Research Question 3
      6.3.1. Effects of Demographics
7. SUMMARY OF INTERVIEWS
8. THE EXPERTS’ COMPOSITE MAP OF PLATO
9. FINAL SUMMARY AND FUTURE WORK
   9.1. Database Development
   9.2. Enrichment of PFNETs
   9.3. Further Improvement of AuthorLink
BIBLIOGRAPHY
APPENDIX A: SOURCE AND DERIVED DATA
APPENDIX B: A CO-OCCURRENCE CALCULUS
APPENDIX C: THE EXPERTS’ AREAS OF EXPERTISE AND NAMESEEDS
APPENDIX D: INTRODUCTORY LETTER
APPENDIX E: INSTRUCTIONS TO EXPERT
APPENDIX F: AFFILIATION OF EXPERTS
APPENDIX G: DATA FROM EXPERTS
APPENDIX H: EXPLANATION OF THE SYSTEM
APPENDIX I: THE INTERVIEW QUESTIONS
APPENDIX J: INTERVIEW EXAMPLE
APPENDIX K: ACCUMULATED MATRICES BASED ON PLATO
APPENDIX L: MAP PREFERENCES OF EXPERTS
APPENDIX M: AUTHORS ASSOCIATED WITH THE INTEREST NAMESEEDS
Vita

List of Tables

1. 24 Authors Associated with Plato
2. Grouping Statistics for the Card Sorts
3. A Possible Grouping of the 25 Authors Associated with Plato
4. A Possible Grouping of the 25 Authors Associated with Plato
5. Example of a Card Sort
6. Two Abridged Co-Membership Matrices
7. Scoring the Difference of Two Vectors
8. Fictitious Scores of 5 Experts
9. QAP Correlation between SOM Circling and Card Sort
10. QAP Correlation between PFNET Circling and Card Sort
11. Co-membership Agreement Measures and Average
12. Results of the t-test
13. Distribution of Map Preference
14. Counts of Area of Expertise by Map Preference
15. Statistics of Area of Expertise by Map Preference
16. Counts of Gender by Map Preference
17. Statistics of Gender by Map Preference
18. Counts of Institutional Affiliation by Map Preference
19. Statistics of Institution by Map Preference
20. 24 Associated Authors with Plato
21. Accumulated Parings by Experts

List of Figures

1. Initial AuthorLink Screen
2. Authors with Counts
3. SOM of Plato
4. PFNET of Plato
5. Retrieved Articles
6. SOM of Descartes
7. PFNET of Descartes