Leveraging Open Source Web Resources to Improve Retrieval of Low Text Content Items
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Alan Emtage Coach Nick's Fav Tech Innovators
Coach Nick’s Fav Tech Innovators Alan Emtage Innovation - The Search Engine: In 1989, Alan Emtage conceived of and implemented Archie, the world’s first Internet search engine. In doing so, he pioneered many of the techniques used by public search engines today. Coach Nick Says: “The search engine was named after, Archie, a famous comic book that I read as a kid! It changed the way people learn.” Coach Nick’s Fav Tech Innovators Ray Tomlinson Innovation - E-mail: In 1971, Ray Tomlinson developed an e-mail program and the @ sign. He is internationally known and credited as the inventor of email. WVMEN Coach Nick Says: “When I worked in the WV State Department of Education, we started using e-mail in our high schools in 1984 as part of WVMEN, the first statewide micro-computing network in the Nation!” Coach Nick’s Fav Tech Innovators Doug Engelbart Innovation - The Computer Mouse: Doug Engelbart invented the computer mouse in the early 1960s in his research lab at Stanford Research Institute (now SRI International). The first prototype was built in 1964, the patent application was filed in 1967, and US Patent was awarded in 1970. Coach Nick Says: “Steve Jobs acquired — some say stole — the mouse concept from the Xerox PARC Labs in Palo Alto in 1979 and changed the world.” Coach Nick’s Fav Tech Innovators Steve Wozniak & Steve Jobs Innovation - The First Apple: The Apple I, also known as the Apple-1, was designed and hand-built by Steve Wozniak. Wozniak’s friend Steve Jobs had the idea of selling the computer. -
2020 Vision: Info Pro Skills for a New Decade
2020 Vision: Info Pro Skills for a New Decade Search Skills for Today’s Info Pros and Thriving in the New Information Landscape Presented by: Mary Ellen Bates Bates Information Services BatesInfo.com Presented for: Initiative Fortbildung e.V. 9 and 10 May 2019 2020 VISION DAY 1: Search Skills for Today’s Info Pros INSIDE A SEARCHER’S MIND: BRINGING THE DETECTIVE TO THE SEARCH ..........................................1 TECHNIQUES OF A DETECTIVE ......................................................................................................................2 DIFFERENT SEARCH APPROACHES.................................................................................................................3 GETTING CREATIVE....................................................................................................................................5 WHAT’S NEW (OR AT LEAST USEFUL) WITH GOOGLE: TIPS AND TOOLS FOR TODAY’S GOOGLE .........6 GOOGLE TRICKS........................................................................................................................................6 SEARCHING THE DEEP WEB / GREY LITERATURE ................................................................................8 SEARCH STRATEGIES FOR GREY LITERATURE....................................................................................................9 SOME GREY LIT/DEEP WEB TOOLS.............................................................................................................10 GLEANING INSIGHT FROM SOCIAL MEDIA........................................................................................12 -
Ciência De Dados Na Ciência Da Informação
Ciência da Informação v. 49 n.3 set./dez. 2020 ISSN 0100-1965 eISSN 1518-8353 Edição especial temática Special thematic issue / Edición temática especial Ciência de dados na ciência da informação Data science in Information Sience Ciencia de datos en la Ciencia de la Información Instituto Brasileiro de Informação em Ciência e Tecnologia (Ibict) Diretoria Indexação Cecília Leite Oliveira Ciência da Informação tem seus artigos indexados ou resumidos. Coordenação-Geral de Pesquisa e Desenvolvimento de Novos Produtos (CGNP) Bases Internacionais Anderson Luis Cambraia Itaborahy Directory of Open Access Journals - DOAJ. Paschal Thema: Science de L’Information, Documentation. Library and Coordenação-Geral de Pesquisa e Manutenção de Produtos Consolidados (CGPC) Information Science Abstracts. PAIS Foreign Language Bianca Amaro Index. Information Science Abstracts. Library and Literature. Páginas de Contenido: Ciências de la Información. Coordenação-Geral de Tecnologias de Informação e Informática EDUCACCION: Notícias de Educación, Ciencia y Cultura (CGTI) Tiago Emmanuel Nunes Braga Iberoamericanas. Referativnyi Zhurnal: Informatika. ISTA Information Science & Technology Abstracts. LISTA Library, Coordenação de Ensino e Pesquisa, Ciência e Tecnologia da Information Science & Technology Abstracts. SciELO Informação (COEPPE) Scientific Electronic Library On-line. Latindex – Sistema Gustavo Saldanha Regional de Información em Línea para Revistas Científicas Coordenação de Planejamento, Acompanhamento e Avaliação de América Latina el Caribe, España y Portugal, México. (COPAV) INFOBILA: Información Bibliotecológica Latinoamericana. José Luis dos Santos Nascimento Indexação em Bases de Dados Nacionais Coordenação de Administração (COADM) Reginaldo de Araújo Silva Portal de Periódicos LivRe – Portal de Periódicos de Livre Acesso. Comissão Divisão de Editoração Científica Nacional de Energia Nuclear (Cnen). Portal Periódicos Ramón Martins Sodoma da Fonseca da Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (Capes). -
Tim Berners-Lee
Le Web Quelques repères Informations compilées par Omer Pesquer - http://infonum.omer.mobi/ - @_omr Internet Mapping Project, Kevin Kelly, 1999, https://kk.org/ct2/the-internet-mapping-project La proposition d'Alan Levin pour l'Internet Mapping Project https://www.flickr.com/photos/cogdog/24868066787/ 3 Hypertexte «Une structure de fchier pour l'information complexe, changeante et indéterminée » Ted Nelson, 1965 http://www.hyperfiction.org/texts/whatHypertextIs.pdf + https://fr.wikipedia.org/wiki/Hypertexte http://xanadu.com.au/ted/XUsurvey/xuDation.html THE INTERNET 2015 - THE OPTE PROJECT (juillet 2015) - Barrett Lyon http://www.opte.org/the-intern et/ 5 1990... Un accès universel à un large univers de documents En mars 1989, Tim Berners-Lee soumettait une Le 6 août 1991, Tim Berners-Lee annonce proposition d'un nouveau système de gestion de publiquement sur « alt.hypertext » l'existence du l'information à son supérieur. WorldWideWeb. http://info.cern.ch/Proposal.html http://info.cern.ch Le Web « Les sites (Web) doivent être en mesure d'interagir dans un espace unique et universel. » Tim Berners-Lee http://fr.wikiquote.org/wiki/Tim_Berners-Lee +http://www.w3.org/People/Berners-Lee/WorldWideWeb.html 8 9/73 Les Horribles Cernettes - première photo publiée sur World Wide Web en 1992. http://en.wikipedia.org/wiki/Les_Horribles_Cernettes Logo historique du World Wide Web par Robert Cailliau http://fr.wikipedia.org/wiki/Fichier:WWW_logo_by_Robert_Cailliau.svg Le site Web du Ministère de la Culture en novembre 1996 Le logo est GIF animé : https://twitter.com/_omr/status/711125480477487105 "Une archéologie des premiers sites web de musées en France" https://www.facebook.com/804024616337535/photos/?tab=album&album_id=1054783704594957 11/73 1990.. -
Cc5212-1 Procesamiento Masivo De Datos Otoño 2020
CC5212-1 PROCESAMIENTO MASIVO DE DATOS OTOÑO 2020 Lecture 4.5 Projects, Practice with Pig/Hadoop Aidan Hogan [email protected] Course Marking (Revised) • 75% for Weekly Labs (~9% a lab) – 4/4 obligatory, 4/7 optional • 25% for Class Project • Need to pass in overall grade Assignments each week Hands-on each week! Working in groups Working in groups! CLASS PROJECTS Class Project • Done in threes • Goal: Use what you’ve learned to do something cool/fun (hopefully) • Process: – Form groups of three (in the forum, before April 30th) – On April 30th we will assign the rest automatically – Start thinking up topics / find interesting datasets! – Register topic (deadline around May 21st) – Work on projects during semester – Deliverables will due be around week 13 • Deliverables: 4 minute presentation (video) & short report • Marked on: Difficulty, appropriateness, scale, good use of techniques, presentation, coolness, creativity, value – Ambition is appreciated, even if you don’t succeed Desiderata for project • Must focus around some technique from the course! • Expected difficulty: similar to a lab, but without any instructions • Data not too small: – Should have >250,000 tuples/entries • Data not too large: – Should have <1,000,000,000 tuples/entries – If very large, perhaps take a sample? • In case of COVID-19 data, we can make exceptions Where to find/explore data? • Kaggle: – https://www.kaggle.com/ • Google Dataset Search: – https://datasetsearch.research.google.com/ • Datos Abiertos de Chile: – https://datos.gob.cl/ – https://es.datachile.io/ • … PRACTICE WITH HADOOP/PIG Practice with Hadoop • Optional Assignment 1 (not evaluated): – Hadoop: Find the number of good movies in which each actor/actresses has starred. -
DRI2020 Rettskilder Og Informasjonssøking
DRI2020 Rettskilder og informasjonssøking Søkemotorer: Troverdighet og synlighet Gisle Hannemyr Ifi, høstsemesteret 2014 Opprinnelsen til fritekstsøk • Vedtak i Pennsylvania en gang på slutten av 1950-tallet om å bytte ut begrepet “retarded child” i diverse lover med det mer politisk korrekte “special child”. • Uoverkommelig å finne alle forekomster manuellt. • Deler av lovsamlingen var tastet inn på hullkort. John F. Horty fikk utvided databasen til å omfatte hele lovsamlingen med komemntarer. • Kilde: John F. Horty, “Experience with the Application of Electronic Data Processing Systems in General Law”, Modern Uses of Logic in Law, December 1960. 2014 Gisle Hannemyr Side #2 1 The Internet: The Resource discovery problem • The existence of digital resources on the Internet led to formulation of “The Resource Discovery Problem”. • First formulated by Alan Emtage and Peter Deutsch in Archie - an Electronic Directory Service for the Internet1 (1992) . «Before a user can effectively exploit any of the services offered by the Internet community or access any information provided by such services, that user must be aware of both the existence of the service and the host or hosts on which it is available.» 1) Archie was a search engine into ftp-space that pre-dated any web-oriented search engines. 2014 Gisle Hannemyr Page #3 The Resource discovery problem • So the resorce discovery problem encompasses not only to establish the existence and location of resources, but: . If the discovery process yields pointers to several alternative resources, some means to qualify them and to identify the resource or resoures that provides the “best fit” for the problem at hand. -
Dataset Search: a Lightweight, Community-Built Tool to Support Research Data Discovery
Journal of eScience Librarianship Volume 10 Issue 1 Special Issue: Research Data and Article 3 Preservation (RDAP) Summit 2020 2021-01-19 Dataset Search: A lightweight, community-built tool to support research data discovery Sara Mannheimer Montana State University Et al. Let us know how access to this document benefits ou.y Follow this and additional works at: https://escholarship.umassmed.edu/jeslib Part of the Scholarly Communication Commons Repository Citation Mannheimer S, Clark JA, Hagerman K, Schultz J, Espeland J. Dataset Search: A lightweight, community- built tool to support research data discovery. Journal of eScience Librarianship 2021;10(1): e1189. https://doi.org/10.7191/jeslib.2021.1189. Retrieved from https://escholarship.umassmed.edu/jeslib/ vol10/iss1/3 Creative Commons License This work is licensed under a Creative Commons Attribution 4.0 License. This material is brought to you by eScholarship@UMMS. It has been accepted for inclusion in Journal of eScience Librarianship by an authorized administrator of eScholarship@UMMS. For more information, please contact [email protected]. ISSN 2161-3974 JeSLIB 2021; 10(1): e1189 https://doi.org/10.7191/jeslib.2021.1189 Full-Length Paper Dataset Search: A lightweight, community-built tool to support research data discovery Sara Mannheimer, Jason A. Clark, Kyle Hagerman, Jakob Schultz, and James Espeland Montana State University, Bozeman, MT, USA Abstract Objective: Promoting discovery of research data helps archived data realize its potential to advance knowledge. Montana -
The Commodification of Search
San Jose State University SJSU ScholarWorks Master's Theses Master's Theses and Graduate Research 2008 The commodification of search Hsiao-Yin Chen San Jose State University Follow this and additional works at: https://scholarworks.sjsu.edu/etd_theses Recommended Citation Chen, Hsiao-Yin, "The commodification of search" (2008). Master's Theses. 3593. DOI: https://doi.org/10.31979/etd.wnaq-h6sz https://scholarworks.sjsu.edu/etd_theses/3593 This Thesis is brought to you for free and open access by the Master's Theses and Graduate Research at SJSU ScholarWorks. It has been accepted for inclusion in Master's Theses by an authorized administrator of SJSU ScholarWorks. For more information, please contact [email protected]. THE COMMODIFICATION OF SEARCH A Thesis Presented to The School of Journalism and Mass Communications San Jose State University In Partial Fulfillment of the Requirement for the Degree Master of Science by Hsiao-Yin Chen December 2008 UMI Number: 1463396 INFORMATION TO USERS The quality of this reproduction is dependent upon the quality of the copy submitted. Broken or indistinct print, colored or poor quality illustrations and photographs, print bleed-through, substandard margins, and improper alignment can adversely affect reproduction. In the unlikely event that the author did not send a complete manuscript and there are missing pages, these will be noted. Also, if unauthorized copyright material had to be removed, a note will indicate the deletion. ® UMI UMI Microform 1463396 Copyright 2009 by ProQuest LLC. All rights reserved. This microform edition is protected against unauthorized copying under Title 17, United States Code. ProQuest LLC 789 E. -
Talks & Abstracts
TALKS & ABSTRACTS MONDAY, MAY 13 Welcome & Opening Remarks: Keith Webster, Dean, Carnegie Mellon University Libraries Michael McQuade, Vice President for Research, Carnegie Mellon University Beth A. Plale, Science Advisor, CISE/OAC, National Science Foundation Keynote 1: Tom Mitchell, Interim Dean, E. Fredkin University Professor, School of Computer Science, Carnegie Mellon University Title: Discovery from Brain Image Data Abstract: How does the human brain use neural activity to create and represent meanings of words, phrases, sentences and stories? One way to study this question is to collect brain image data to observe neural activity while people read text. We have been doing such experiments with fMRI (1 mm spatial resolution) and MEG (1 msec time resolution) brain imaging, and developing novel machine learning approaches to analyze this data. As a result, we have learned answers to questions such as "Are the neural encodings of word meaning the same in your brain and mine?", and "What sequence of neurally encoded information flows through the brain during the half-second in which the brain comprehends a word?" This talk will summarize our machine learning approach to data discovery, and some of what we have learned about the human brain as a result. We will also consider the question of how reuse and aggregation of such scientific data might change the future of research in cognitive neuroscience. Session 1: Automation in data curation and metadata generation Session Chair: Paola Buitrago, Director for AI and Big Data, Pittsburgh Supercomputing Center 1.1 Keyphrase extraction from scholarly documents for data discovery and reuse Cornelia Caragea and C. -
Issue 23, 2019
UWI Cave Hill Campus ISSUE 23 September 2019 Student-Athlete Extraordinaire Redonda Restoration Lessons in the Key of Life ISSUE 23 : SEPTEMBER 2019 Contents DISCOURSE 47 ‘Workload’ of Diabetes Greater than 1 Education: A Renewable Resource that of HIV NEWS PUBLICATIONS A PUBLICATION OF 2 Highest Seal of Approval 48 Kamugisha Goes Beyond Coloniality THE UNIVERSITY OF THE WEST INDIES, 49 Nuts and Bolts of Researching CAVE HILL CAMPUS, BARBADOS. 3 Transformative Education Key to Economic Growth 50 Stronger Together We welcome your comments and feedback which 4 Promoting Homegrown IT Solutions 51 Urging a Bigger Role from Civil Society can be directed to [email protected] 5 Caribbean Science Foundation Attends 52 Watson Interrogates Barrow or CHILL c/o Office of Public Information Clinton Foundation Conference The UWI, Cave Hill Campus, Bridgetown BB11000 53 Management Under Scrutiny Barbados. 6 Supporting Regional Civil Servant 54 Pan-Africanism: A History Development Tel: (246) 417-4076/77 54 Pioneering a Shipshape Enterprise 7 Expanding Vistas into Japan EDITOR: 8 A Call to Come Home OUTREACH Chelston Lovell 10 Student-Athlete Extraordinaire 55 Regional Ministers Discuss Climate Change Threats CONSULTANT EDITOR: 12 Youth Internet Forum Ann St. Hill 57 Law for Development 13 Senior Staff on the Move 58 Blackbirds Conquer Ocean Challenge PHOTO EDITORS: IN FOCUS Rasheeta Dorant 60 SALISES Conference Celebrates & Brian Elcock 14 Cave Hill Provides Medical Cannabis Rethinks Caribbean Futures ......................................................................... Training AWARDS CONTRIBUTORS: 15 More Dorm Space on the Cards 61 Dr. Madhuvanti Murphy’s Research Win Professor Eudine Barriteau, PhD, GCM 16 People Empowerment Dwayne Devonish, PhD 62 Recognising Unsung Heroes Franchero Ellis 17 Writing Across the Curriculum 63 Six Certified in Ethereum Blockchain Caribbean Science Foundation 19 Transport Gift Strengthens All-inclusive April A. -
Internet Accuracy Is Information on the Web Reliable?
Researcher Published by CQ Press, a division of SAGE Publications CQ www.cqresearcher.com Internet Accuracy Is information on the Web reliable? he Internet has been a huge boon for information- seekers. In addition to sites maintained by newspa- pers and other traditional news sources, there are untraditional sources ranging from videos, personal TWeb pages and blogs to postings by interest groups of all kinds — from government agencies to hate groups. But experts caution that determining the credibility of online data can be tricky, and that critical-reading skills are not being taught in most schools. In the new online age, readers no longer have the luxury of Wikipedia banned Stephen Colbert, host of Comedy Central’s “The Colbert Report,” from editing articles on the popular online site after he made joke edits. depending on a reference librarian’s expertise in finding reliable sources. Anyone can post an article, book or opinion online with I no second pair of eyes checking it for accuracy, as in traditional N publishing and journalism. Now many readers are turning to THIS REPORT S user-created sources like Wikipedia, or powerful search engines THE ISSUES ......................627 I like Google, which tally how many people previously have BACKGROUND ..................634 D CHRONOLOGY ..................635 accessed online documents and sources — a process that is E open to manipulation. CURRENT SITUATION ..........640 AT ISSUE ..........................641 OUTLOOK ........................643 CQ Researcher • Aug. 1, 2008 • www.cqresearcher.com Volume 18, Number 27 • Pages 625-648 BIBLIOGRAPHY ..................646 RECIPIENT OF SOCIETY OF PROFESSIONAL JOURNALISTS AWARD FOR THE NEXT STEP ................647 EXCELLENCE N AMERICAN BAR ASSOCIATION SILVER GAVEL AWARD INTERNET ACCURACY CQ Researcher Aug. -
Search, Reuse and Sharing of Research Data in Materials Science and Engineering—A Qualitative Interview Study
PLOS ONE RESEARCH ARTICLE Search, reuse and sharing of research data in materials science and engineeringÐA qualitative interview study Bettina SuhrID*, Johanna Dungl, Alexander StockerID Virtual Vehicle Research GmbH, Graz, Austria * [email protected] a1111111111 a1111111111 a1111111111 Abstract a1111111111 a1111111111 Open research data practices are a relatively new, thus still evolving part of scientific work, and their usage varies strongly within different scientific domains. In the literature, the inves- tigation of open research data practices covers the whole range of big empirical studies covering multiple scientific domains to smaller, in depth studies analysing a single field of OPEN ACCESS research. Despite the richness of literature on this topic, there is still a lack of knowledge on Citation: Suhr B, Dungl J, Stocker A (2020) the (open) research data awareness and practices in materials science and engineering. Search, reuse and sharing of research data in While most current studies focus only on some aspects of open research data practices, we materials science and engineeringÐA qualitative aim for a comprehensive understanding of all practices with respect to the considered scien- interview study. PLoS ONE 15(9): e0239216. https://doi.org/10.1371/journal.pone.0239216 tific domain. Hence this study aims at 1) drawing the whole picture of search, reuse and sharing of research data 2) while focusing on materials science and engineering. The cho- Editor: Marco Lepidi, University of Genova, ITALY sen approach allows to explore the connections between different aspects of open research Received: February 27, 2020 data practices, e.g. between data sharing and data search. In depth interviews with 13 Accepted: September 1, 2020 researchers in this field were conducted, transcribed verbatim, coded and analysed using Published: September 15, 2020 content analysis.