Leveraging Open Source Web Resources to Improve Retrieval of Low Text Content Items

Total Page:16

File Type:pdf, Size:1020Kb

Leveraging Open Source Web Resources to Improve Retrieval of Low Text Content Items Leveraging open source web resources to improve retrieval of low text content items A THESIS SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY Ayush Singhal IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY JAIDEEP SRIVASTAVA August, 2014 c Ayush Singhal 2014 ALL RIGHTS RESERVED Acknowledgements There are many people to whom I feel grateful. First of all, I would like to thank my advisor Prof. Jaideep Srivastava for being such a nice and open-minded advisor. He always encouraged me to pursue my research interests. I am grateful to him for taking me as his PhD student even after one year of graduate studies. I also express my special gratitude to Prof. Vipin Kumar for giving me the opportunity to come to Univesity of Minnesota for my PhD. His advice about PhD and his understanding nature helped me so much in times of difficulties. I am grateful to him for that. I sincerely thanks Prof. Nikolaos Papanikolopoulos and Prof. Adam Rothman for spending their valuable time to serve on my thesis committee. I also thanks Prof Maria Gini and Prof. Abhishek Chandra for being in my preliminary exam committee. The way I have developed during my PhD work and the strength to be able to face several challenges is a result of the wonderful teachings of Dr. Krishnan. I feel indebted to him for his constant efforts to encourage me to do this work with the right motivation– for the cause of education and welfare of the society. Because of him, today I feel that all the dynamics in my life have a clear purpose and I can be always joyful to pursue that purpose. I am extremely grateful to Dr. Ankush Mittal, who introduced me to research during my undergraduate studies at IIT -Roorkee. He has given me several loving suggestions both before and during my PhD which I would like to use in time to come. I am most delighted to express my loving feelings towards my dear friends Sanyam and Ashish. They have been like my brothers throughout my PhD work. I also thank Shashank, Ankit and Dr. Saket for being such nice friends. I would like to share my appreciation for my family members, especially my mother, who cooperateed very favorably with me. She always encouraged me to fulfill the aim with which I have come to US. I am grateful for their love and sacrifices. I extend a loving thanks to my brothers Arpit and Kushagra for just being good brothers. i Dedication To my teacher Dr. Krishnan, who always keeps me up... with his teachings, precept and encouragements. ii Abstract With the exponential increase in the amount of digital information in the world, search en- gines and recommendation systems have become the most convenient ways to find relevant information. As an example, the number of web pages on the world wide web was estimated to be over a trillion mark by the year 2008. However, search today is no longer limited to docu- ments on the world wide web. The new “information needs” such as multimedia items (images, videos) opens up challenging avenues for scientific research. Thus the search techniques used to find items which are content rich ( e.g documents) no longer holds for items with low-text content. In the literature, several solutions are proposed for developing search framework for multimedia item search which includes using the visual or audio content of such items for re- trieval purposes. However, there is little research on this problem in the domain of scientific research artifacts. This thesis investigates the problem of retrieval of low-text content items for search and recommendation purposes and propose novel techniques to improve retrieval of such items. In particular, we focus on scientific research datasets owing to their importance and exponential growth in the last decade. One of the main challenges in searching research datasets is the lack of text content or the meta-information about the datasets. While the datasets themselves have raw content, the problem of low text content makes the conventional text based search techniques inadequate for their retrieval. In comparison to the multimedia items, where visual and audio features have been utilized to enhance search or recommendation based retrieval, scientific research datasets lack a uniform schema for representing their raw content. Although solutions such as curation and annotation by experts/data scientists exists but these are infeasible for practical operation on a large scale. As a solution, this thesis provides a computational and an efficient framework for retrieving such low-text content items. We primarily present two retrieval models, namely, (1) a user profile based search, and (2) keyword based search. For the user profile based search model, we show that the text content of the item can be derived from the user’s profile and the relevance ranking can also be derived based on the information in the users profile. We find that the proposed approach which lever- ages the information from open source web resources for item extraction outperforms the local content based extraction approaches. For the keyword based search model, we have developed a iii content rich database for research datasets. We use novel content generation techniques to over- come the low-text content challenge for datasets. The content information is extracted from open source and crowd sourced web resources like academic search engines and Wikipedia. In addition to the stand-alone quantitative assessment of the content generated, we evaluate the efficiency of the entire keyword based search framework via a user study. Based on user re- sponses, the thesis reports positive evidence that the proposed search framework is better than the popular general purpose search engine for searching research datasets with a context based queries. The ideas developed in this thesis are implemented in a real search tool- DataGopher.org: an open source search engine for scientific research datasets. Moreover, the approaches de- veloped for research datasets can have application to other low-content items such as short text document, news feeds, Twitter and Facebook postings. In summary, the computational approaches proposed in this work advance the state-of-the-art in retrieval of low-text content items. Whereas the extensive evaluations that are performed on items like scientific research datasets and low text content documents demonstrate the validity of the findings. iv Contents Acknowledgements i Dedication ii Abstract iii List of Tables ix List of Figures xi 1 Introduction 1 1.1 Information retrieval . .1 1.1.1 Basic definitions and concepts . .1 1.1.2 A brief history . .4 1.2 Research challenges and thesis problem . .5 1.3 Overview of the thesis . .6 1.3.1 Organization of the thesis . .6 2 Leveraging user profile to enhance search of low-content items 9 2.1 Background and Problem Statement . 11 2.2 Related Work . 11 2.3 Proposed Approach . 12 2.3.1 Context creation from user’s interest . 13 2.3.2 Item finding . 15 2.3.3 Item ranking . 17 v 2.4 Experimental Analysis . 18 2.4.1 Experimental analysis for item finding algorithm . 18 2.4.2 Experimental analysis for overall framework . 20 2.5 Experimental results and discussion . 22 2.6 Conclusions and Future Work . 24 3 Automating keyword assignment to short text documents 25 3.1 Related Work . 28 3.2 Background of open source web resources . 29 3.2.1 WikiCFP . 29 3.2.2 Crowd-sourced knowledge . 30 3.2.3 Academic search engines . 31 3.3 Overview of our work . 31 3.4 Web context extraction . 33 3.5 Keyword abstraction . 33 3.5.1 Finding keywords for documents with 5-10 text words . 33 3.5.2 Automating keyword abstraction and de-noising . 37 3.6 Keyword extraction from the summary text of a document . 40 3.6.1 TextRank[1] . 40 3.6.2 Rake[2] . 40 3.6.3 Alchemy[3] . 41 3.6.4 Adding web context . 41 3.7 Experimental analysis for keyword abstraction for documents with 5-10 text words ....................................... 42 3.7.1 Test dataset description . 42 3.7.2 Ground truth . 42 3.7.3 Baseline approach . 42 3.7.4 Evaluation metrics . 43 3.7.5 Results and discussion for keyword abstraction without de-noising . 44 3.7.6 Results and discussion for keyword abstraction with de-noising . 46 3.8 Experimental analysis of the keyword extraction approach . 50 3.8.1 Dataset description . 50 vi 3.8.2 Preliminary analysis of the data . 51 3.8.3 Experimental design parameters . 51 3.8.4 Results and discussion . 54 3.9 Conclusions and future work . 58 4 Automating annotation for research datasets 59 4.1 Problem description . 62 4.2 Proposed Approach . 63 4.2.1 Context generation . 64 4.2.2 Identifying data type labels . 64 4.2.3 Concept generation . 65 4.2.4 Finding similar datasets . 66 4.3 Experiments . 67 4.3.1 Dataset Used . 68 4.3.2 Experiments for data type labelling . 69 4.3.3 Experiments for concept generation . 70 4.3.4 Experiments for similar dataset assignment . 72 4.4 Experimental results and discussion . 73 4.4.1 Data type labelling . 73 4.4.2 Concept generation . 74 4.4.3 Similar dataset assignment . 77 4.5 Use case study . 78 4.5.1 Qualitative evaluation . 79 4.5.2 Quantitative evaluation . 80 4.6 Related work . 81 4.7 Conclusions and future work . 83 5 Annotation expansion for low text content items 84 5.1 Problem Setting . 86 5.2 Proposed Approach .
Recommended publications
  • Alan Emtage Coach Nick's Fav Tech Innovators
    Coach Nick’s Fav Tech Innovators Alan Emtage Innovation - The Search Engine: In 1989, Alan Emtage conceived of and implemented Archie, the world’s first Internet search engine. In doing so, he pioneered many of the techniques used by public search engines today. Coach Nick Says: “The search engine was named after, Archie, a famous comic book that I read as a kid! It changed the way people learn.” Coach Nick’s Fav Tech Innovators Ray Tomlinson Innovation - E-mail: In 1971, Ray Tomlinson developed an e-mail program and the @ sign. He is internationally known and credited as the inventor of email. WVMEN Coach Nick Says: “When I worked in the WV State Department of Education, we started using e-mail in our high schools in 1984 as part of WVMEN, the first statewide micro-computing network in the Nation!” Coach Nick’s Fav Tech Innovators Doug Engelbart Innovation - The Computer Mouse: Doug Engelbart invented the computer mouse in the early 1960s in his research lab at Stanford Research Institute (now SRI International). The first prototype was built in 1964, the patent application was filed in 1967, and US Patent was awarded in 1970. Coach Nick Says: “Steve Jobs acquired — some say stole — the mouse concept from the Xerox PARC Labs in Palo Alto in 1979 and changed the world.” Coach Nick’s Fav Tech Innovators Steve Wozniak & Steve Jobs Innovation - The First Apple: The Apple I, also known as the Apple-1, was designed and hand-built by Steve Wozniak. Wozniak’s friend Steve Jobs had the idea of selling the computer.
    [Show full text]
  • 2020 Vision: Info Pro Skills for a New Decade
    2020 Vision: Info Pro Skills for a New Decade Search Skills for Today’s Info Pros and Thriving in the New Information Landscape Presented by: Mary Ellen Bates Bates Information Services BatesInfo.com Presented for: Initiative Fortbildung e.V. 9 and 10 May 2019 2020 VISION DAY 1: Search Skills for Today’s Info Pros INSIDE A SEARCHER’S MIND: BRINGING THE DETECTIVE TO THE SEARCH ..........................................1 TECHNIQUES OF A DETECTIVE ......................................................................................................................2 DIFFERENT SEARCH APPROACHES.................................................................................................................3 GETTING CREATIVE....................................................................................................................................5 WHAT’S NEW (OR AT LEAST USEFUL) WITH GOOGLE: TIPS AND TOOLS FOR TODAY’S GOOGLE .........6 GOOGLE TRICKS........................................................................................................................................6 SEARCHING THE DEEP WEB / GREY LITERATURE ................................................................................8 SEARCH STRATEGIES FOR GREY LITERATURE....................................................................................................9 SOME GREY LIT/DEEP WEB TOOLS.............................................................................................................10 GLEANING INSIGHT FROM SOCIAL MEDIA........................................................................................12
    [Show full text]
  • Ciência De Dados Na Ciência Da Informação
    Ciência da Informação v. 49 n.3 set./dez. 2020 ISSN 0100-1965 eISSN 1518-8353 Edição especial temática Special thematic issue / Edición temática especial Ciência de dados na ciência da informação Data science in Information Sience Ciencia de datos en la Ciencia de la Información Instituto Brasileiro de Informação em Ciência e Tecnologia (Ibict) Diretoria Indexação Cecília Leite Oliveira Ciência da Informação tem seus artigos indexados ou resumidos. Coordenação-Geral de Pesquisa e Desenvolvimento de Novos Produtos (CGNP) Bases Internacionais Anderson Luis Cambraia Itaborahy Directory of Open Access Journals - DOAJ. Paschal Thema: Science de L’Information, Documentation. Library and Coordenação-Geral de Pesquisa e Manutenção de Produtos Consolidados (CGPC) Information Science Abstracts. PAIS Foreign Language Bianca Amaro Index. Information Science Abstracts. Library and Literature. Páginas de Contenido: Ciências de la Información. Coordenação-Geral de Tecnologias de Informação e Informática EDUCACCION: Notícias de Educación, Ciencia y Cultura (CGTI) Tiago Emmanuel Nunes Braga Iberoamericanas. Referativnyi Zhurnal: Informatika. ISTA Information Science & Technology Abstracts. LISTA Library, Coordenação de Ensino e Pesquisa, Ciência e Tecnologia da Information Science & Technology Abstracts. SciELO Informação (COEPPE) Scientific Electronic Library On-line. Latindex – Sistema Gustavo Saldanha Regional de Información em Línea para Revistas Científicas Coordenação de Planejamento, Acompanhamento e Avaliação de América Latina el Caribe, España y Portugal, México. (COPAV) INFOBILA: Información Bibliotecológica Latinoamericana. José Luis dos Santos Nascimento Indexação em Bases de Dados Nacionais Coordenação de Administração (COADM) Reginaldo de Araújo Silva Portal de Periódicos LivRe – Portal de Periódicos de Livre Acesso. Comissão Divisão de Editoração Científica Nacional de Energia Nuclear (Cnen). Portal Periódicos Ramón Martins Sodoma da Fonseca da Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (Capes).
    [Show full text]
  • Tim Berners-Lee
    Le Web Quelques repères Informations compilées par Omer Pesquer - http://infonum.omer.mobi/ - @_omr Internet Mapping Project, Kevin Kelly, 1999, https://kk.org/ct2/the-internet-mapping-project La proposition d'Alan Levin pour l'Internet Mapping Project https://www.flickr.com/photos/cogdog/24868066787/ 3 Hypertexte «Une structure de fchier pour l'information complexe, changeante et indéterminée » Ted Nelson, 1965 http://www.hyperfiction.org/texts/whatHypertextIs.pdf + https://fr.wikipedia.org/wiki/Hypertexte http://xanadu.com.au/ted/XUsurvey/xuDation.html THE INTERNET 2015 - THE OPTE PROJECT (juillet 2015) - Barrett Lyon http://www.opte.org/the-intern et/ 5 1990... Un accès universel à un large univers de documents En mars 1989, Tim Berners-Lee soumettait une Le 6 août 1991, Tim Berners-Lee annonce proposition d'un nouveau système de gestion de publiquement sur « alt.hypertext » l'existence du l'information à son supérieur. WorldWideWeb. http://info.cern.ch/Proposal.html http://info.cern.ch Le Web « Les sites (Web) doivent être en mesure d'interagir dans un espace unique et universel. » Tim Berners-Lee http://fr.wikiquote.org/wiki/Tim_Berners-Lee +http://www.w3.org/People/Berners-Lee/WorldWideWeb.html 8 9/73 Les Horribles Cernettes - première photo publiée sur World Wide Web en 1992. http://en.wikipedia.org/wiki/Les_Horribles_Cernettes Logo historique du World Wide Web par Robert Cailliau http://fr.wikipedia.org/wiki/Fichier:WWW_logo_by_Robert_Cailliau.svg Le site Web du Ministère de la Culture en novembre 1996 Le logo est GIF animé : https://twitter.com/_omr/status/711125480477487105 "Une archéologie des premiers sites web de musées en France" https://www.facebook.com/804024616337535/photos/?tab=album&album_id=1054783704594957 11/73 1990..
    [Show full text]
  • Cc5212-1 Procesamiento Masivo De Datos Otoño 2020
    CC5212-1 PROCESAMIENTO MASIVO DE DATOS OTOÑO 2020 Lecture 4.5 Projects, Practice with Pig/Hadoop Aidan Hogan [email protected] Course Marking (Revised) • 75% for Weekly Labs (~9% a lab) – 4/4 obligatory, 4/7 optional • 25% for Class Project • Need to pass in overall grade Assignments each week Hands-on each week! Working in groups Working in groups! CLASS PROJECTS Class Project • Done in threes • Goal: Use what you’ve learned to do something cool/fun (hopefully) • Process: – Form groups of three (in the forum, before April 30th) – On April 30th we will assign the rest automatically – Start thinking up topics / find interesting datasets! – Register topic (deadline around May 21st) – Work on projects during semester – Deliverables will due be around week 13 • Deliverables: 4 minute presentation (video) & short report • Marked on: Difficulty, appropriateness, scale, good use of techniques, presentation, coolness, creativity, value – Ambition is appreciated, even if you don’t succeed Desiderata for project • Must focus around some technique from the course! • Expected difficulty: similar to a lab, but without any instructions • Data not too small: – Should have >250,000 tuples/entries • Data not too large: – Should have <1,000,000,000 tuples/entries – If very large, perhaps take a sample? • In case of COVID-19 data, we can make exceptions Where to find/explore data? • Kaggle: – https://www.kaggle.com/ • Google Dataset Search: – https://datasetsearch.research.google.com/ • Datos Abiertos de Chile: – https://datos.gob.cl/ – https://es.datachile.io/ • … PRACTICE WITH HADOOP/PIG Practice with Hadoop • Optional Assignment 1 (not evaluated): – Hadoop: Find the number of good movies in which each actor/actresses has starred.
    [Show full text]
  • DRI2020 Rettskilder Og Informasjonssøking
    DRI2020 Rettskilder og informasjonssøking Søkemotorer: Troverdighet og synlighet Gisle Hannemyr Ifi, høstsemesteret 2014 Opprinnelsen til fritekstsøk • Vedtak i Pennsylvania en gang på slutten av 1950-tallet om å bytte ut begrepet “retarded child” i diverse lover med det mer politisk korrekte “special child”. • Uoverkommelig å finne alle forekomster manuellt. • Deler av lovsamlingen var tastet inn på hullkort. John F. Horty fikk utvided databasen til å omfatte hele lovsamlingen med komemntarer. • Kilde: John F. Horty, “Experience with the Application of Electronic Data Processing Systems in General Law”, Modern Uses of Logic in Law, December 1960. 2014 Gisle Hannemyr Side #2 1 The Internet: The Resource discovery problem • The existence of digital resources on the Internet led to formulation of “The Resource Discovery Problem”. • First formulated by Alan Emtage and Peter Deutsch in Archie - an Electronic Directory Service for the Internet1 (1992) . «Before a user can effectively exploit any of the services offered by the Internet community or access any information provided by such services, that user must be aware of both the existence of the service and the host or hosts on which it is available.» 1) Archie was a search engine into ftp-space that pre-dated any web-oriented search engines. 2014 Gisle Hannemyr Page #3 The Resource discovery problem • So the resorce discovery problem encompasses not only to establish the existence and location of resources, but: . If the discovery process yields pointers to several alternative resources, some means to qualify them and to identify the resource or resoures that provides the “best fit” for the problem at hand.
    [Show full text]
  • Dataset Search: a Lightweight, Community-Built Tool to Support Research Data Discovery
    Journal of eScience Librarianship Volume 10 Issue 1 Special Issue: Research Data and Article 3 Preservation (RDAP) Summit 2020 2021-01-19 Dataset Search: A lightweight, community-built tool to support research data discovery Sara Mannheimer Montana State University Et al. Let us know how access to this document benefits ou.y Follow this and additional works at: https://escholarship.umassmed.edu/jeslib Part of the Scholarly Communication Commons Repository Citation Mannheimer S, Clark JA, Hagerman K, Schultz J, Espeland J. Dataset Search: A lightweight, community- built tool to support research data discovery. Journal of eScience Librarianship 2021;10(1): e1189. https://doi.org/10.7191/jeslib.2021.1189. Retrieved from https://escholarship.umassmed.edu/jeslib/ vol10/iss1/3 Creative Commons License This work is licensed under a Creative Commons Attribution 4.0 License. This material is brought to you by eScholarship@UMMS. It has been accepted for inclusion in Journal of eScience Librarianship by an authorized administrator of eScholarship@UMMS. For more information, please contact [email protected]. ISSN 2161-3974 JeSLIB 2021; 10(1): e1189 https://doi.org/10.7191/jeslib.2021.1189 Full-Length Paper Dataset Search: A lightweight, community-built tool to support research data discovery Sara Mannheimer, Jason A. Clark, Kyle Hagerman, Jakob Schultz, and James Espeland Montana State University, Bozeman, MT, USA Abstract Objective: Promoting discovery of research data helps archived data realize its potential to advance knowledge. Montana
    [Show full text]
  • The Commodification of Search
    San Jose State University SJSU ScholarWorks Master's Theses Master's Theses and Graduate Research 2008 The commodification of search Hsiao-Yin Chen San Jose State University Follow this and additional works at: https://scholarworks.sjsu.edu/etd_theses Recommended Citation Chen, Hsiao-Yin, "The commodification of search" (2008). Master's Theses. 3593. DOI: https://doi.org/10.31979/etd.wnaq-h6sz https://scholarworks.sjsu.edu/etd_theses/3593 This Thesis is brought to you for free and open access by the Master's Theses and Graduate Research at SJSU ScholarWorks. It has been accepted for inclusion in Master's Theses by an authorized administrator of SJSU ScholarWorks. For more information, please contact [email protected]. THE COMMODIFICATION OF SEARCH A Thesis Presented to The School of Journalism and Mass Communications San Jose State University In Partial Fulfillment of the Requirement for the Degree Master of Science by Hsiao-Yin Chen December 2008 UMI Number: 1463396 INFORMATION TO USERS The quality of this reproduction is dependent upon the quality of the copy submitted. Broken or indistinct print, colored or poor quality illustrations and photographs, print bleed-through, substandard margins, and improper alignment can adversely affect reproduction. In the unlikely event that the author did not send a complete manuscript and there are missing pages, these will be noted. Also, if unauthorized copyright material had to be removed, a note will indicate the deletion. ® UMI UMI Microform 1463396 Copyright 2009 by ProQuest LLC. All rights reserved. This microform edition is protected against unauthorized copying under Title 17, United States Code. ProQuest LLC 789 E.
    [Show full text]
  • Talks & Abstracts
    TALKS & ABSTRACTS MONDAY, MAY 13 Welcome & Opening Remarks: Keith Webster, Dean, Carnegie Mellon University Libraries Michael McQuade, Vice President for Research, Carnegie Mellon University Beth A. Plale, Science Advisor, CISE/OAC, National Science Foundation Keynote 1: Tom Mitchell, Interim Dean, E. Fredkin University Professor, School of Computer Science, Carnegie Mellon University Title: Discovery from Brain Image Data Abstract: How does the human brain use neural activity to create and represent meanings of words, phrases, sentences and stories? One way to study this question is to collect brain image data to observe neural activity while people read text. We have been doing such experiments with fMRI (1 mm spatial resolution) and MEG (1 msec time resolution) brain imaging, and developing novel machine learning approaches to analyze this data. As a result, we have learned answers to questions such as "Are the neural encodings of word meaning the same in your brain and mine?", and "What sequence of neurally encoded information flows through the brain during the half-second in which the brain comprehends a word?" This talk will summarize our machine learning approach to data discovery, and some of what we have learned about the human brain as a result. We will also consider the question of how reuse and aggregation of such scientific data might change the future of research in cognitive neuroscience. Session 1: Automation in data curation and metadata generation Session Chair: Paola Buitrago, Director for AI and Big Data, Pittsburgh Supercomputing Center 1.1 Keyphrase extraction from scholarly documents for data discovery and reuse Cornelia Caragea and C.
    [Show full text]
  • Issue 23, 2019
    UWI Cave Hill Campus ISSUE 23 September 2019 Student-Athlete Extraordinaire Redonda Restoration Lessons in the Key of Life ISSUE 23 : SEPTEMBER 2019 Contents DISCOURSE 47 ‘Workload’ of Diabetes Greater than 1 Education: A Renewable Resource that of HIV NEWS PUBLICATIONS A PUBLICATION OF 2 Highest Seal of Approval 48 Kamugisha Goes Beyond Coloniality THE UNIVERSITY OF THE WEST INDIES, 49 Nuts and Bolts of Researching CAVE HILL CAMPUS, BARBADOS. 3 Transformative Education Key to Economic Growth 50 Stronger Together We welcome your comments and feedback which 4 Promoting Homegrown IT Solutions 51 Urging a Bigger Role from Civil Society can be directed to [email protected] 5 Caribbean Science Foundation Attends 52 Watson Interrogates Barrow or CHILL c/o Office of Public Information Clinton Foundation Conference The UWI, Cave Hill Campus, Bridgetown BB11000 53 Management Under Scrutiny Barbados. 6 Supporting Regional Civil Servant 54 Pan-Africanism: A History Development Tel: (246) 417-4076/77 54 Pioneering a Shipshape Enterprise 7 Expanding Vistas into Japan EDITOR: 8 A Call to Come Home OUTREACH Chelston Lovell 10 Student-Athlete Extraordinaire 55 Regional Ministers Discuss Climate Change Threats CONSULTANT EDITOR: 12 Youth Internet Forum Ann St. Hill 57 Law for Development 13 Senior Staff on the Move 58 Blackbirds Conquer Ocean Challenge PHOTO EDITORS: IN FOCUS Rasheeta Dorant 60 SALISES Conference Celebrates & Brian Elcock 14 Cave Hill Provides Medical Cannabis Rethinks Caribbean Futures ......................................................................... Training AWARDS CONTRIBUTORS: 15 More Dorm Space on the Cards 61 Dr. Madhuvanti Murphy’s Research Win Professor Eudine Barriteau, PhD, GCM 16 People Empowerment Dwayne Devonish, PhD 62 Recognising Unsung Heroes Franchero Ellis 17 Writing Across the Curriculum 63 Six Certified in Ethereum Blockchain Caribbean Science Foundation 19 Transport Gift Strengthens All-inclusive April A.
    [Show full text]
  • Internet Accuracy Is Information on the Web Reliable?
    Researcher Published by CQ Press, a division of SAGE Publications CQ www.cqresearcher.com Internet Accuracy Is information on the Web reliable? he Internet has been a huge boon for information- seekers. In addition to sites maintained by newspa- pers and other traditional news sources, there are untraditional sources ranging from videos, personal TWeb pages and blogs to postings by interest groups of all kinds — from government agencies to hate groups. But experts caution that determining the credibility of online data can be tricky, and that critical-reading skills are not being taught in most schools. In the new online age, readers no longer have the luxury of Wikipedia banned Stephen Colbert, host of Comedy Central’s “The Colbert Report,” from editing articles on the popular online site after he made joke edits. depending on a reference librarian’s expertise in finding reliable sources. Anyone can post an article, book or opinion online with I no second pair of eyes checking it for accuracy, as in traditional N publishing and journalism. Now many readers are turning to THIS REPORT S user-created sources like Wikipedia, or powerful search engines THE ISSUES ......................627 I like Google, which tally how many people previously have BACKGROUND ..................634 D CHRONOLOGY ..................635 accessed online documents and sources — a process that is E open to manipulation. CURRENT SITUATION ..........640 AT ISSUE ..........................641 OUTLOOK ........................643 CQ Researcher • Aug. 1, 2008 • www.cqresearcher.com Volume 18, Number 27 • Pages 625-648 BIBLIOGRAPHY ..................646 RECIPIENT OF SOCIETY OF PROFESSIONAL JOURNALISTS AWARD FOR THE NEXT STEP ................647 EXCELLENCE N AMERICAN BAR ASSOCIATION SILVER GAVEL AWARD INTERNET ACCURACY CQ Researcher Aug.
    [Show full text]
  • Search, Reuse and Sharing of Research Data in Materials Science and Engineering—A Qualitative Interview Study
    PLOS ONE RESEARCH ARTICLE Search, reuse and sharing of research data in materials science and engineeringÐA qualitative interview study Bettina SuhrID*, Johanna Dungl, Alexander StockerID Virtual Vehicle Research GmbH, Graz, Austria * [email protected] a1111111111 a1111111111 a1111111111 Abstract a1111111111 a1111111111 Open research data practices are a relatively new, thus still evolving part of scientific work, and their usage varies strongly within different scientific domains. In the literature, the inves- tigation of open research data practices covers the whole range of big empirical studies covering multiple scientific domains to smaller, in depth studies analysing a single field of OPEN ACCESS research. Despite the richness of literature on this topic, there is still a lack of knowledge on Citation: Suhr B, Dungl J, Stocker A (2020) the (open) research data awareness and practices in materials science and engineering. Search, reuse and sharing of research data in While most current studies focus only on some aspects of open research data practices, we materials science and engineeringÐA qualitative aim for a comprehensive understanding of all practices with respect to the considered scien- interview study. PLoS ONE 15(9): e0239216. https://doi.org/10.1371/journal.pone.0239216 tific domain. Hence this study aims at 1) drawing the whole picture of search, reuse and sharing of research data 2) while focusing on materials science and engineering. The cho- Editor: Marco Lepidi, University of Genova, ITALY sen approach allows to explore the connections between different aspects of open research Received: February 27, 2020 data practices, e.g. between data sharing and data search. In depth interviews with 13 Accepted: September 1, 2020 researchers in this field were conducted, transcribed verbatim, coded and analysed using Published: September 15, 2020 content analysis.
    [Show full text]