Web Searcher Interactions with the Dogpile.Com Meta-Search Engine

Total Page:16

File Type:pdf, Size:1020Kb

Web Searcher Interactions with the Dogpile.Com Meta-Search Engine Web Searcher Interaction With the Dogpile.com Metasearch Engine Bernard J. Jansen College of Information Sciences and Technology, The Pennsylvania State University, 329F IST Building, University Park, PA 16802. E-mail: [email protected] Amanda Spink Faculty of Information Technology, Queensland University of Technology, Gardens Point Campus, 2 George Street, GPO Box 2434, Brisbane QLD 4001, Australia. E-mail: [email protected] Sherry Koshman School of Information Sciences, University of Pittsburgh, 610 IS Building, 135 N. Bellefield Avenue, Pittsburgh, PA 15260. E-mail: [email protected] Metasearch engines are an intuitive method for improving Unlike single source Web search engines, metasearch the performance of Web search by increasing coverage, engines do not crawl the Internet themselves to build an returning large numbers of results with a focus on rele- index of Web documents. Instead, a metasearch engine sends vance, and presenting alternative views of information needs. However, the use of metasearch engines in an queries simultaneously to multiple other Web search en- operational environment is not well understood. In this gines, retrieves the results from each, and then combines the study, we investigate the usage of Dogpile.com, a major results from all into a single results listing, at the same time Web metasearch engine, with the aim of discovering how avoiding redundancy. In effect, Web metasearch engine Web searchers interact with metasearch engines. We users are not using just one engine, but many search engines report results examining 2,465,145 interactions from 534,507 users of Dogpile.com on May 6, 2005 and com- at once to effectively utilize Web searching. The ultimate pare these results with findings from other Web searching purpose of a metasearch engine is to diversify the results of studies. We collect data on geographical location of the queries by utilizing the innate differences of single searchers, use of system feedback, content selection, source Web search engines and provide Web searchers with sessions, queries, and term usage. Findings show that the highest ranked search results from the collection of Web Dogpile.com searchers are mainly from the USA (84% of searchers), use about 3 terms per query (mean 2.85), search engines. Although one could certainly query multiple implement system feedback moderately (8.4% of users), search engines, a metasearch engine distills these top results and generally (56% of users) spend less than one automatically, giving the searcher a comprehensive set of minute interacting with the Web search engine. Overall, search results within a single listing, all in real time. metasearchers seem to have higher degrees of interac- We know that there is little overlap among typical search tion than searchers on non-metasearch engines, but their sessions are for a shorter period of time. These aspects of engine result listings (Ding & Marchionini, 1996), and sin- metasearching may be what define the differences from gle search engines index a relatively small percentage of the other forms of Web searching. We discuss the implica- Web (Lawrence & Giles, 1999). Research shows that results tions of our findings in relation to metasearch for Web retrieved from multiple sources have a higher probability of searchers, search engines, and content providers. being relevant to the searcher’s information needs (Gauch, Wang, & Gomez, 1996). Finally, a single search engine may Introduction have inherent biases that influence what results are returned Metasearch engines have an intuitive appeal as a method (Gerhart, 2004; Introna & Nissenbaum, 2000). By combin- of improving the retrieval performance for Web searches. ing results from several sources, a metasearch engine ad- dresses all three concerns. Chignell, Gwizdka, and Bodner (1999) found little overlap Received October 25, 2005; revised May 18, 2006; accepted May 18, 2006 in the results returned by various Web search engines. They © 2007 Wiley Periodicals, Inc. • Published online 2 February 2007 in describe a metasearch engine as useful, since different engines Wiley InterScience (www.interscience.wiley.com). DOI: 10.1002/asi.20555 employ different means of matching queries to relevant items, JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 58(5):744–755, 2007 and also have different indexing coverage. Selberg and load as the metric of comparison. Searching characteristics Etzioni (1997) further suggested that no single search engine were not presented. is likely to return more than 45% of the relevant results. Sub- Developers of the Mearf metasearch engine (Oztekin, sequently, the design and performance of metasearch en- Karypis, & Kumar, 2002) collected transaction logs from gines have become an ongoing area of study (Buzikashvili, November 22, 2000 to November 10, 2001, using click- 2002; Chignell, Gwizdka & Bodner, 1999; Dreilinger & through as a mechanism for evaluating Mearf performance. Howe, 1997; Meng, Yu, & Lui, 2002; Selberg & Etzioni, They report on the mean documents returned per query, user 1997; Spink, Lawrence, & Giles, 2000). reranking of results, and the number of documents clicked However, there has been little investigation into how on by searchers. Approximately 64% of queries included a searchers interact with Web metasearch engines. If metasearch click on a document, with a mean of 2.02 clicks per query. provides an improved Web searching environment, one may However, there were a total of 17,055 queries submitted expect differences in interactions when compared to Web during the one year period, so this may not be a representa- searching on other search engines. What are the interaction tive sample of metasearch engine users. patterns between searchers and a metasearch engine? This Many studies have examined the performance of single question motivates our research. Web search engines such as AltaVista, Excite, AlltheWeb In the following sections, we review the related studies (Spink & Jansen, 2004), and NAVER (Park, Bae, & Lee, and list our research questions. We then discuss the Dog- 2005). Spink, Jansen, Blakely, and Koshman (2006) found pile.com Web metasearch engine and the research design little results overlap and uniqueness among major Web that was used in our study. We then discuss the findings from search engines. However, limited large-scale studies have multiple levels of analysis, concluding with implications for examined how searchers interact with Web metasearch Web metasearching. engines. An understanding of how searchers utilize these systems is critical for the future refinement of metasearch Related Studies engine design and the evaluation of Web metasearch engine Web research is now a major interdisciplinary area of performance. These are the motivators for our research. study, including the modeling of user behavior and Web search engine performance (Spink & Jansen, 2004). Web Research Questions search engine crawling and retrieving studies have evolved The research questions driving our study are as follows: as an important area of Web research since the mid-1990s. Many metasearch tools have been developed and commer- 1. What are the characteristics of search interactions on the cially implemented, but little research has investigated the Dogpile.com metasearch engine? To address this re- usage and performance of Web metasearch engines. Selberg search question, we investigated session length, query and Etzioni (1997) developed one of the first metasearch length, query structure, query formulation, result pages engines, Metacrawler (http://www.metacrawler.com). Largely viewed and term usage of these Web searchers. focusing on the system design, the researchers discuss usage, 2. What are the temporal characteristics of metasearching reporting on 50,878 queries submitted between July 7 and on Dogpile.com? For this research question, we investi- September 30, 1995, with 46.67% (24,253 queries) being gated the duration of sessions and the frequency of inter- actions during these sessions. unique. The top 10 queries represented 3.37% (1,716) of all 3. What are the topical characteristics of searches on the queries. The top queries were all one term in length, and com- Dogpile.com metasearch engine? To address this re- monly occurring natural language terms (e.g., the, of, and, or) search question, we investigated a subset of queries reported in later Web user studies were not present. submitted by searchers on Dogpile.com to gain insight Gauch, Wang, and Gomez (1996) designed the ProFusion into the nature of their search topics using a qualitative metasearch engine and evaluated its performance in a lab analysis. setting. The researchers used 12 students who submitted queries and compared ProFusion to the six underlying search engines using the number of relevant documents Research Design retrieved, the number of irrelevant documents retrieved, the Dogpile.com number of broken links, the number of duplicates, the num- ber of unique relevance documents and precision. How the Dogpile.com (http://www.Dogpile.com/) is owned by In- study participants utilized the metasearch engine was not fospace, a market leader in the metasearch engine business. discussed. Dogpile.com incorporates into its search result listings the The SavvySearch (Dreilinger & Howe, 1997; Howe & results from other search engines, including results from the Dreilinger, 1997) is a metasearch engine that selects the most four leading Web search indices (i.e., Ask Jeeves, Google, promising search engines
Recommended publications
  • Internet Training: the Basics. INSTITUTION Temple Univ., Philadelphia, PA
    DOCUMENT RESUME ED 424 360 CE 077 241 AUTHOR Gallo, Gail; Wechowski, Chester P. TITLE Internet Training: The Basics. INSTITUTION Temple Univ., Philadelphia, PA. Center for Vocational Education Professional Personnel Development. PUB DATE 1998-04-00 NOTE 27p. PUB TYPE Guides Non-Classroom (055) EDRS PRICE MF01/PCO2 Plus Postage. DESCRIPTORS Adult Education; Educational Needs; *Information Sources; *Online Systems; Postsecondary Education; Secondary Education; Teaching Methods; Vocational Education Teachers; *World Wide Web IDENTIFIERS Web Sites ABSTRACT This paper outlines the basic information teachers need to . know to use the World Wide Web for research and communication, using Netscape 3.04. Topics covered include the following: what is the World Wide Web?; what is a browser?; accessing the Web; moving around a web document; the Uniform Resource Locator (URL); Bookmarks; saving and printing a web page; Netscape's options preferences menu; searching the Web; and interesting places (websites of interest to vocational educators) .Two appendixes contain the following: "Infoseek: How Do I Search? 'Ruby Slippers'" and Guidelines for Completing Abstracts. (KC) ******************************************************************************** * Reproductions supplied by EDRS are the best that can be made * * from the original document. * ***********************************************************************i******** t. The Center for Vocational Education \I/ Professional Personnel Development/ \ INTERNET TRAINING: THE BASICS Developed by Gail Gallo Project Director Chester P. Wichowski Temple University April, 1998 ./1.S. DEPARTMENT OFEDUCATION /ffice of Educational Research and Improvement PERMISSION TO REPRODUCE AND E UCATIONAL RESOURCES INFORMATION DISSEMINATE THIS MATERIAL HAS CENTER (ERIC) BEEN GRANTED BY This document has been reproduced as received from the person or organization originating it. 0 Minor changes have been made to improve reproduction quality.
    [Show full text]
  • Dining Hall,” “Cafeteria,” and “Campus Food Service” • Be Specific As You Learn More – E.G
    THE INTERNET Conducting Internet Research Computer Applications I Martin Santos Jorge Cab Objectives • After completing this section, students will be able to: • Understand the internet • Identify the different tools for research • Use and cite references from the internet Lecturers: Martin Santos/Jorge Cab (S.P.J.C.) 2 Vocabulary List • Internet (the Net): a global connection of millions of computer networks • Browser: software that helps a user access web sites (Internet Explorer and Netscape) • Server: a computer that runs special software and sends information over the Internet when requested • World Wide Web (the Web or www.): multimedia portion of the Internet consisting of text, graphics, audio and video • URL: stands for Uniform Resource Locator. It is the website's “address” or what the user types in to make the connection • Web site: a “virtual” place on the Internet with a unique URL • Virtual: “mental” replica of something - you can’t “touch” it – need a “tool” to get to it • Web page: a place on a web site where specific information is located • Home page: main page of a web site and first page to load when a site is accessed • Hyperlink: “clickable” text or graphics – takes you from one place to another – usually underlined and shows a hand shaped icon • Hypertext: capability to “link” or “jump” to other references or cross references by clicking • Cyberspace: “electronic” universe where information from one computer connects with another • Upload: process of transferring information to a page/site on the internet • Download: process of transferring information to a computer • Search engine: a site that scans the contents of other web sites to create a large index of information • Domain (top level): code located in the URL representing the type of organization (i.e., .gov (government), .edu (education), .mil (military), .org (organization – non-profit), .com (commercial – a business – for profit) • Internet Service Provider (ISP): a company with direct connection to the Internet that grants subscribers access to various Internet services.
    [Show full text]
  • Market Research SD-5 Gathering Information About Commercial Products and Services
    Market Research SD-5 Gathering Information About Commercial Products and Services DEFENSE STANDARDIZATION PROGRA M JANUARY 2008 Contents Foreword 1 The Market Research Other Considerations 32 Background 2 Process 13 Amount of Information Strategic Market Research to Gather 32 What Is Market Research? 2 (Market Surveillance) 14 Procurement Integrity Act 32 Why Do Market Research? 2 Identify the Market or Market Paperwork Reduction Act 33 Segment of Interest 14 When Is Market Research Cost of Market Research 34 Done? 5 Identify Sources of Market Information 16 Who Should Be Involved In Market Research? 7 Collect Relevant Market Other Information Information 17 Technical Specialist 8 Document the Results 18 on Market Research 35 User 9 Logistics Specialist 9 Tactical Market Research Appendix A 36 (Market Investigation) 19 Testing Specialist 9 Types of Information Summarize Strategic Market Available on the Internet Cost Analyst 10 Research 19 Legal Counsel 10 Formulate Requirements 20 Appendix B 39 Contracting Officer 10 Web-Based Information Identify Sources of Sources Information 21 Guiding Principles 11 Collect Product or Service Appendix C 47 Examples of Tactical Start Early 11 Information from Sources 22 Collect Information from Information Define and Document Product or Service Users 26 Requirements 11 Evaluate the Data 27 Refine as You Proceed 12 Document the Results 30 Tailor the Investigation 12 Repeat as Necessary 12 Communicate 12 Involve Users 12 Foreword The Department of Defense (DoD) relies extensively on the commercial market for the products and services it needs, whether those products and services are purely commercial, modified for DoD use from commercial products and services, or designed specifically for DoD.
    [Show full text]
  • Dogpile.Com First to Combine Search Results from MSN Search with Google, Yahoo and Ask Jeeves
    Dogpile.com First to Combine Search Results From MSN Search with Google, Yahoo and Ask Jeeves With the Addition of MSN Search, Dogpile.com Users Can Efficiently Find More of the Web's Most Relevant Search Results in One Place BELLEVUE, Wash. – August 2, 2005 – Dogpile.com today announced that search results from MSN Search are now available on the Web's leading metasearch engine. By becoming the first to combine results from the four leading search sites—MSN Search, Google, Yahoo and Ask Jeeves—Dogpile.com gives consumers the most comprehensive view of the Web and helps them efficiently retrieve the most relevant results. The addition of MSN Search to Dogpile.com further extends Dogpile.com's differentiation from any single search engine. Most people believe search results across all four engines are the same, when, in fact, the vast majority of the results from each engine are different. According to a new study, researchers at the University of Pittsburgh and the Pennsylvania State University evaluated 12,570 random queries run on MSN Search, Google, Yahoo and Ask Jeeves. They found only 1.1 percent of the first page results are the same across all four engines. The full results of the study can be found at http://CompareSearchEngines.dogpile.com/whitepaper. "By bringing together the best results from the top engines on Dogpile.com, consumers can be confident they are receiving the most relevant results," said Brian Bowman, vice president of marketing and product management for InfoSpace Search & Directory. Dogpile.com has built a tool that allows consumers to compare the results of the leading engines for themselves.
    [Show full text]
  • Evaluation of Web-Based Search Engines Using User-Effort Measures
    Evaluation of Web-Based Search Engines Using User-Effort Measures Muh-Chyun Tang and Ying Sun 4 Huntington St. School of Information, Communication and Library Studies Rutgers University, New Brunswick, NJ 08901, U.S.A. [email protected] [email protected] Abstract This paper presents a study of the applicability of three user-effort-sensitive evaluation measures —“first 20 full precision,” “search length,” and “rank correlation”—on four Web-based search engines (Google, AltaVista, Excite and Metacrawler). The authors argue that these measures are better alternatives than precision and recall in Web search situations because of their emphasis on the quality of ranking. Eight sets of search topics were collected from four Ph.D. students in four different disciplines (biochemistry, industrial engineering, economics, and urban planning). Each participant was asked to provide two topics along with the corresponding query terms. Their relevance and credibility judgment of the Web pages were then used to compare the performance of the search engines using these three measures. The results show consistency among these three ranking evaluation measures, more so between “first 20 full precision” and search length than between rank correlation and the other two measures. Possible reasons for rank correlation’s disagreement with the other two measures are discussed. Possible future research to improve these measures is also addressed. Introduction The explosive growth of information on the World Wide Web poses a challenge to traditional information retrieval (IR) research. Other than the sheer amount of information, some structural factors make searching for relevant and quality information on the Web a formidable task.
    [Show full text]
  • How to Choose a Search Engine Or Directory
    How to Choose a Search Engine or Directory Fields & File Types If you want to search for... Choose... Audio/Music AllTheWeb | AltaVista | Dogpile | Fazzle | FindSounds.com | Lycos Music Downloads | Lycos Multimedia Search | Singingfish Date last modified AllTheWeb Advanced Search | AltaVista Advanced Web Search | Exalead Advanced Search | Google Advanced Search | HotBot Advanced Search | Teoma Advanced Search | Yahoo Advanced Web Search Domain/Site/URL AllTheWeb Advanced Search | AltaVista Advanced Web Search | AOL Advanced Search | Google Advanced Search | Lycos Advanced Search | MSN Search Search Builder | SearchEdu.com | Teoma Advanced Search | Yahoo Advanced Web Search File Format AllTheWeb Advanced Web Search | AltaVista Advanced Web Search | AOL Advanced Search | Exalead Advanced Search | Yahoo Advanced Web Search Geographic location Exalead Advanced Search | HotBot Advanced Search | Lycos Advanced Search | MSN Search Search Builder | Teoma Advanced Search | Yahoo Advanced Web Search Images AllTheWeb | AltaVista | The Amazing Picture Machine | Ditto | Dogpile | Fazzle | Google Image Search | IceRocket | Ixquick | Mamma | Picsearch Language AllTheWeb Advanced Web Search | AOL Advanced Search | Exalead Advanced Search | Google Language Tools | HotBot Advanced Search | iBoogie Advanced Web Search | Lycos Advanced Search | MSN Search Search Builder | Teoma Advanced Search | Yahoo Advanced Web Search Multimedia & video All TheWeb | AltaVista | Dogpile | Fazzle | IceRocket | Singingfish | Yahoo Video Search Page Title/URL AOL Advanced
    [Show full text]
  • Analysis of Query Keywords of Sports-Related Queries Using Visualization and Clustering
    Analysis of Query Keywords of Sports-Related Queries Using Visualization and Clustering Jin Zhang and Dietmar Wolfram School of Information Studies, University of Wisconsin—Milwaukee, Milwaukee, WI 53201. E-mail: {jzhang, dwolfram}@uwm.edu Peiling Wang School of Information Sciences, College of Communication and Information, University of Tennessee at Knoxville, Knoxville, TN 37996–0341. E-mail: [email protected] The authors investigated 11 sports-related query key- the user’s request includes the client Internet Protocol (IP) words extracted from a public search engine query log address, request date/time, page requested, HTTP code, bytes to better understand sports-related information seeking served, user agent, referrer, and so on. The data are kept in on the Internet. After the query log contents were cleaned and query data were parsed, popular sports-related key- a standard format in a transaction log file (Hallam-Baker & words were identified, along with frequently co-occurring Behlendorf, 2008). query terms associated with the identified keywords. Although a transaction log comprises rich data, including Relationships among each sports-related focus keyword browsing times and traversal paths, it is the queries directly and its related keywords were characterized and grouped submitted by users that have attracted the most research using multidimensional scaling (MDS) in combination with traditional hierarchical clustering methods. The two attention. Query data contain keywords that reflect users’ approaches were synthesized in a visual context by high- wide-ranging information needs. Query logs have been ana- lighting the results of the hierarchical clustering analysis lyzed from a variety of sources with different emphases and in the visual MDS configuration.
    [Show full text]
  • Multitasking Web Search on Alta Vista
    1 Multitasking Web Search on Alta Vista Amanda Spink* & Bernard J. Jansen Minsoo Park School of Info Sciences and Jan Pedersen School of Information Technology Chief Scientist Sciences The Pennsylvania State Overture Web Search University of Pittsburgh University Division 610 IS Building, 135 N. 4P Thomas Building 1070 Arastradero Road Bellefield Avenue University Park PA 16802 Palo Alto, CA 94304 Pittsburgh PA 15260 Tel: (814) 865-6459 [email protected] Tel: (412) 624-9454 Fax: (814) 865-6424 Fax: (412) 648-7001 E-mail: [email protected] E-mail: [email protected],edu * To whom all correspondence should be addressed. Ozmutlu [8] show that IR searches often include Abstract multiple topics, during a single search session or A user’s single session with a Web search engine multitasking search. Spink, Batemen and Greisdorf may consist of seeking information on single or [9] found that eleven (3.8%) of the 287 Excite users multiple topics. Most Web search sessions consist of responding to a Web-based survey reported two queries of two words. We present findings from a multitasking searches. However, limited knowledge study of two-query search sessions on the Alta Vista exists on the characteristics and patterns of Web search engine to examine the degree of multitasking searches. Recent studies have examined multitasking search by typical web searchers. A multitasking searching on the Excite and sample of two-query length search sessions were AlltheWeb.com Web search engines [10, 11]. filtered from Alta Vista transaction logs from 2003. Ozmutlu, Ozmutlu and Spink [10] provide a Findings include: (1) 81% of two-query sessions detailed analysis of multitasking sessions on were multitasking searches, and (2) there are a broad AlltheWeb.com.
    [Show full text]
  • The Deep Web: Surfacing Hidden Value
    White Paper The Deep Web: Surfacing Hidden Value BrightPlanet.com LLC July 2000 The author of this study is Michael K. Bergman. Editorial assistance was provided by Mark Smither; analysis and retrieval assistance was provided by Will Bushee. This White Paper is the property of BrightPlanet.com LLC. Users are free to distribute and use it for personal use.. Some of the information in this document is preliminary. BrightPlanet plans future revisions as better information and documentation is obtained. We welcome submission of improved information and statistics from others involved with the “deep” Web. Mata Hari® is a registered trademark and BrightPlanet™, CompletePlanet™, LexiBot™, search filter™ and A Better Way to Search™ are pending trademarks of BrightPlanet.com LLC. All other trademarks are the respective property of their registered owners. © 2000 BrightPlanet.com LLC. All rights reserved. Summary BrightPlanet has uncovered the “deep” Web — a vast reservoir of Internet content that is 500 times larger than the known “surface” World Wide Web. What makes the discovery of the deep Web so significant is the quality of content found within. There are literally hundreds of billions of highly valuable documents hidden in searchable databases that cannot be retrieved by conventional search engines. This discovery is the result of groundbreaking search technology developed by BrightPlanet called a LexiBot™ — the first and only search technology capable of identifying, retrieving, qualifying, classifying and organizing “deep” and “surface” content from the World Wide Web. The LexiBot allows searchers to dive deep and explore hidden data from multiple sources simultaneously using directed queries. Businesses, researchers and consumers now have access to the most valuable and hard- to-find information on the Web and can retrieve it with pinpoint accuracy.
    [Show full text]
  • A Meta Search Engine Based on a New Result Merging Strategy
    Usearch: A Meta Search Engine based on a New Result Merging Strategy Tarek Alloui1, Imane Boussebough2, Allaoua Chaoui1, Ahmed Zakaria Nouar3 and Mohamed Chaouki Chettah3 1MISC Laboratory, Department of Computer Science and its Applications, Faculty of NTIC, University Constantine 2 Abdelhamid Mehri, Constantine, Algeria 2LIRE Laboratory, Department of Software Technology and Information Systems, Faculty of NTIC, University Constantine 2 Abdelhamid Mehri, Constantine, Algeria 3Department of Computer Science and its Applications, Faculty of NTIC, University Constantine 2, Abdelhamid Mehri, Constantine, Algeria Keywords: Meta Search Engine, Ranking, Merging, Score Function, Web Information Retrieval. Abstract: Meta Search Engines are finding tools developed for improving the search performance by submitting user queries to multiple search engines and combining the different search results in a unified ranked list. The effectiveness of a Meta search engine is closely related to the result merging strategy it employs. But nowadays, the main issue in the conception of such systems is the merging strategy of the returned results. With only the user query as relevant information about his information needs, it’s hard to use it to find the best ranking of the merged results. We present in this paper a new strategy of merging multiple search engine results using only the user query as a relevance criterion. We propose a new score function combining the similarity between user query and retrieved results and the users’ satisfaction toward used search engines. The proposed Meta search engine can be used for merging search results of any set of search engines. 1 INTRODUCTION advantages of MSEs are their abilities to combine the coverage of multiple search engines and to reach Nowadays, the World Wide Web is considered as the deep Web.
    [Show full text]
  • Internet Research
    Discipline Specific Researching on the Internet The internet is growing exponentially, and thousands of new web pages are being added each day. The upside is that you have an enormous amount of information only a few mouse clicks away. The downside is that you must refine your approach to online research in order to target the handful of websites that may be useful to you. The following search tops are designed to get you stared and save you time. Happy hunting! Finding Sources Go to www.library.unh.edu The University of New Hampshire Library has access to dozens of online databases catering to nearly every subject you may be studying. The site also has several online research guides and tools to ensure you'll find what you need. The library’s web site is a good first choice to help narrow your research. Try several search engines Google is the most popular search engine in the world, so that is a good place to start. There are, however, other search engines that might be of use to you: Altavista.com askjeeves.com ditto.com excite.com metacrawler.com dogpile.com Alltheweb.com yahoo.com Use the advanced search function Most search engines have a link for “advanced search.” This can be extremely useful in narrowing your search, as the additional features enable you to search by exact phrase, date range, domain, language, date of web page update, and more. Choose your search words carefully Use words that are specific, or unique, to what you are looking for. Some words are very common and will lead to far too many hits.
    [Show full text]
  • Meta Search Engine with an Intelligent Interface for Information Retrieval on Multiple Domains
    International Journal of Computer Science, Engineering and Information Technology (IJCSEIT), Vol.1, No.4, October 2011 META SEARCH ENGINE WITH AN INTELLIGENT INTERFACE FOR INFORMATION RETRIEVAL ON MULTIPLE DOMAINS D.Minnie1, S.Srinivasan2 1Department of Computer Science, Madras Christian College, Chennai, India [email protected] 2Department of Computer Science and Engineering, Anna University of Technology Madurai, Madurai, India [email protected] ABSTRACT This paper analyses the features of Web Search Engines, Vertical Search Engines, Meta Search Engines, and proposes a Meta Search Engine for searching and retrieving documents on Multiple Domains in the World Wide Web (WWW). A web search engine searches for information in WWW. A Vertical Search provides the user with results for queries on that domain. Meta Search Engines send the user’s search queries to various search engines and combine the search results. This paper introduces intelligent user interfaces for selecting domain, category and search engines for the proposed Multi-Domain Meta Search Engine. An intelligent User Interface is also designed to get the user query and to send it to appropriate search engines. Few algorithms are designed to combine results from various search engines and also to display the results. KEYWORDS Information Retrieval, Web Search Engine, Vertical Search Engine, Meta Search Engine. 1. INTRODUCTION AND RELATED WORK WWW is a huge repository of information. The complexity of accessing the web data has increased tremendously over the years. There is a need for efficient searching techniques to extract appropriate information from the web, as the users require correct and complex information from the web. A Web Search Engine is a search engine designed to search WWW for information about given search query and returns links to various documents in which the search query’s key words are found.
    [Show full text]