Information Retrieval, Part 1

11/4/2019

Information Retrieval
Deepak Kumar

Information Retrieval
Searching within a document collection for a particular piece of needed information.

Query

Search Engines…
Altavista, Entireweb, Leapfish, Spezify, Ask, Excite, Lycos, Stinky Teddy, Baidu, Faroo, Maktoob, Stumpedia, Bing, Info.com, Miner.hu, Swisscows, Blekko, Fireball, Monster Crawler, Teoma, ChaCha, Gigablast, Naver, Walla, Dogpile, Google, Omgili, WebCrawler, Daum, Go, Rediff, Yahoo!, Dmoz, Goo, Scrub The Web, Yandex, DuckDuckGo, Hakia, Seznam, Yippy, Egerin, HotBot, Sogou, Youdao, Soso

[Figure: Search Engine Marketshare 2019]

[Figure: Search Engine Marketshare 2017]

Matching & Ranking
query (muddy waters) → matching → matched pages (“hits”) → ranking → ranked pages (1., 2., 3.)

Index

Inverted Index
• A mapping from content (words) to location.
• Example:

    Document 1: the cat sat on the mat
    Document 2: the dog stood on the mat
    Document 3: the cat stood while a dog sat

    Word     Documents
    a        3
    cat      1 3
    dog      2 3
    mat      1 2
    on       1 2
    sat      1 3
    stood    2 3
    the      1 2 3
    while    3

Every word in every web page is indexed!

Searching
Look up each query word in the index; the hits are the documents listed for it.

    Query: cat
    Hits:  1 (the cat sat on the mat), 3 (the cat stood while a dog sat)

    Query: dog
    Hits:  2 (the dog stood on the mat), 3 (the cat stood while a dog sat)

    Query: cat dog
    Hits:  3 (the cat stood while a dog sat)

    Query: cat the sat
    Hits:  ???

Phrase Queries

    Query: “cat sat”
    Hits from the index above: 1 (the cat sat on the mat), 3 (the cat stood while a dog sat)

How to tell if two words occur next to each other? EFFICIENTLY???

Inverted Index with Location
Each entry is written document-position, so 3-5 means word 5 of document 3.

    Word     Locations
    a        3-5
    cat      1-2 3-2
    dog      2-2 3-6
    mat      1-6 2-6
    on       1-4 2-4
    sat      1-3 3-7
    stood    2-3 3-3
    the      1-1 1-5 2-1 2-5 3-1
    while    3-4

    Query: “cat sat”
    cat:   1-2, 3-2
    sat:   1-3, 3-7
    Match: 1-2 and 1-3 are adjacent
    Hits:  1 (the cat sat on the mat)

NEAR* Queries

    Query: cat NEAR dog
    cat:   3-2
    dog:   3-6
    Hits:  3 (the cat stood while a dog sat)

*NEAR: distance <= 5

Useful in ranking!

Ranking & Relevance

    1. By far the most common cause of malaria is being bitten by an infected mosquito, but there are also other ways to contract the disease.
    2. Our cause was not helped by the poor health of the troops, many of whom were suffering from malaria and other tropical…
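The inverted index with locations and the three query types from the slides (all words, phrase, NEAR) can be sketched in Python. This is a minimal illustration, not the method of any real search engine: the function names (build_index, search_all, search_phrase, search_near) are hypothetical, words are split on whitespace only, and a multi-word query without quotes is assumed to mean "documents containing every word".

```python
from collections import defaultdict

# The three example documents from the slides, keyed by document number.
DOCS = {
    1: "the cat sat on the mat",
    2: "the dog stood on the mat",
    3: "the cat stood while a dog sat",
}

def build_index(docs):
    """Map each word to {document number: [1-based word positions]}."""
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for pos, word in enumerate(text.split(), start=1):
            index[word].setdefault(doc_id, []).append(pos)
    return index

def search_all(index, words):
    """Documents that contain every query word (assumed AND semantics)."""
    doc_sets = [set(index.get(w, {})) for w in words]
    return sorted(set.intersection(*doc_sets)) if doc_sets else []

def search_phrase(index, first, second):
    """Documents where `second` occurs immediately after `first`."""
    hits = []
    for doc_id in search_all(index, [first, second]):
        second_positions = set(index[second][doc_id])
        if any(p + 1 in second_positions for p in index[first][doc_id]):
            hits.append(doc_id)
    return hits

def search_near(index, a, b, max_distance=5):
    """Documents where `a` and `b` occur within max_distance words."""
    hits = []
    for doc_id in search_all(index, [a, b]):
        if any(abs(p - q) <= max_distance
               for p in index[a][doc_id]
               for q in index[b][doc_id]):
            hits.append(doc_id)
    return hits

index = build_index(DOCS)
print(index["the"])                        # {1: [1, 5], 2: [1, 5], 3: [1]}
print(search_all(index, ["cat"]))          # [1, 3]
print(search_all(index, ["cat", "dog"]))   # [3]
print(search_phrase(index, "cat", "sat"))  # [1]
print(search_near(index, "cat", "dog"))    # [3]
```

Storing positions alongside document numbers is what makes the phrase check cheap: only documents that already contain both words are examined, and adjacency reduces to comparing position numbers rather than rescanning the text.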