The Structuration of Traffic on the World-Wide Web

Total Page:16

File Type:pdf, Size:1020Kb

The Structuration of Traffic on the World-Wide Web The London School of Economics and Political Science Search Engine Bias The Structuration of Traffic on the World-Wide Web Elizabeth Jane Van Couvering A thesis submitted to The Department Media and Communications London School of Economics and Political Science for the degree of Doctor of Philosophy London, December 2009 1 Declaration I certify that the thesis I have presented for examination for the MPhil/PhD degree of the London School of Economics and Political Science is solely my own work other than where I have clearly indicated that it is the work of others (in which case the extent of any work carried out jointly by me and any other person is clearly identified in it). The copyright of this thesis rests with the author. Quotation from it is permitted, provided that full acknowledgement is made. This thesis may not be reproduced without the prior written consent of the author. I warrant that this authorization does not, to the best of my belief, infringe the rights of any third party. 2 Abstract Search engines are essential components of the World Wide Web; both commercially and in terms of everyday usage, their importance is hard to overstate. This thesis examines the question of why there is bias in search engine results – bias that invites users to click on links to large websites, commercial websites, websites based in certain countries, and websites written in certain languages. In this thesis, the historical development of the search engine industry is traced. Search engines first emerged as prototypical technological startups emanating from Silicon Valley, followed by the acquisition of search engine companies by major US media corporations and their development into portals. The subsequent development of pay-per-click advertising is central to the current industry structure, an oligarchy of virtually integrated companies managing networks of syndicated advertising and traffic distribution. The study also shows a global landscape in which search production is concentrated in and caters for large global advertising markets, leaving the rest of the world with patchy and uneven search results coverage. The analysis of interviews with senior search engine engineers indicates that issues of quality are addressed in terms of customer service and relevance in their discourse, while the analysis of documents, interviews with search marketers, and participant observation within a search engine marketing firm showed that producers and marketers had complex relationships that combine aspects of collaboration, competition, and indifference. The results of the study offer a basis for the synthesis of insights of the political economy of media and communication and the social studies of technology tradition, emphasising the importance of culture in constructing and maintaining both local structures and wider systems. In the case of search engines, the evidence indicates that the culture of the technological entrepreneur is very effective in creating a new mega- business, but less successful in encouraging a debate on issues of the public good or public responsibility as they relate to the search engine industry. Abstract 3 Acknowledgements This thesis is dedicated to my readers, past, present and future. It is a pleasure to thank those who have helped me in this work. First and foremost I would like to thank my husband, Henrik Örnebring, who has been an unwavering and uncomplaining source of support and my daughter Charlotte, who has given me perspective. My interviewees and informants, without whom this work would not have been possible, have my sincere gratitude. Thanks to my colleagues at the LSE who have helped me so much while working on their own projects by listening, reading and commenting: Evangelia Berdou, Ellen Helsper, Zoe Sujon, David Brake, Shani Orgad, Anita Howarth, Patrick McCurdy and Yukie Hori (who also generously helped to translate figures from Nielsen NetRatings Japanese home page). I also owe a debt of gratitude to the more senior scholars at the LSE, especially Sonia Livingstone, Nick Couldry, Terhi Rantanen, Damian Tambini and the late Roger Silverstone, for criticism and encouragement in equal measures. Thanks also go to Jean Morris, whose administrative support was always prompt, efficient, and pleasant. I would like to thank my parents who inspired me to begin this work and, particularly, my mother, Judith Harris, who has been a wonderful support and confidante throughout the work. And finally I would like to specially thank my supervisor, Robin Mansell, since I have benefited very greatly from her scholarship, her help and her commitment to me and to this project from the very first. Thank you all very much. 4 Search Engine Bias: The Structuration of Traffic on the World-Wide Web Table of Contents I. Bias in Internet Search Engines................................................... 9 1.1 Introduction...................................................................................................9 1.2 Researching search engines and identifying a gap.................................. 10 1.3 Investigating bias in internet search engines........................................... 18 1.4 Conclusion.................................................................................................. 27 II. The Dynamics of Technological Structuration ........................ 29 2.1 Introduction................................................................................................ 29 2.2 Bias as a normative conflict ...................................................................... 31 2.3 Technology as a social practice................................................................. 36 2.4 Conflict in context ..................................................................................... 50 2.5 Conceptual framework.............................................................................. 59 2.6 Conclusion.................................................................................................. 63 III. Follow the Results....................................................................... 64 3.1 Introduction................................................................................................ 64 3.2 Development of the research question ................................................... 65 3.3 Research design.......................................................................................... 67 3.4 Level A: Agents and their actions ............................................................ 71 3.5 Level B: Structures..................................................................................... 83 3.6 Alternative research designs...................................................................... 88 3.7 Conclusion.................................................................................................. 90 IV. The History of the Internet Search Engine ...............................91 4.1 Introduction................................................................................................ 91 4.2 The development of the search engine industry .................................... 93 4.3 Technological entrepreneurs (1994-1997)............................................... 95 4.4 Portals and vertical integration (1997-2001) ......................................... 100 4.5 Syndication and consolidation (2002-?)................................................. 114 4.6 Conclusion................................................................................................ 125 V. Finding the Centre..................................................................... 129 5.1 Introduction.............................................................................................. 129 5.2 The four country cases............................................................................ 132 5.3 Search Engines in Japan.......................................................................... 134 5.4 Search Engines in Germany ................................................................... 137 5.5 Search Engines in China ......................................................................... 138 5.6 Search Engines in South Africa.............................................................. 143 5.7 International search engine roduction................................................... 147 5.8 Conclusions .............................................................................................. 158 VI. Is Relevance Relevant?................................................................161 6.1 Introduction.............................................................................................. 161 6.2 The market schema.................................................................................. 163 6.3 The science-technology schema............................................................. 166 5 6.4 The war schema ....................................................................................... 169 6.5 The strategic use of technological schemas .......................................... 171 6.6 The difficulty of articulating the public good ....................................... 177 6.7 Conclusions .............................................................................................. 179 VII. Ranking Highly............................................................................181 7.1 Introduction.............................................................................................. 181 7.2
Recommended publications
  • Internet Economy 25 Years After .Com
    THE INTERNET ECONOMY 25 YEARS AFTER .COM TRANSFORMING COMMERCE & LIFE March 2010 25Robert D. Atkinson, Stephen J. Ezell, Scott M. Andes, Daniel D. Castro, and Richard Bennett THE INTERNET ECONOMY 25 YEARS AFTER .COM TRANSFORMING COMMERCE & LIFE March 2010 Robert D. Atkinson, Stephen J. Ezell, Scott M. Andes, Daniel D. Castro, and Richard Bennett The Information Technology & Innovation Foundation I Ac KNOW L EDGEMEN T S The authors would like to thank the following individuals for providing input to the report: Monique Martineau, Lisa Mendelow, and Stephen Norton. Any errors or omissions are the authors’ alone. ABOUT THE AUTHORS Dr. Robert D. Atkinson is President of the Information Technology and Innovation Foundation. Stephen J. Ezell is a Senior Analyst at the Information Technology and Innovation Foundation. Scott M. Andes is a Research Analyst at the Information Technology and Innovation Foundation. Daniel D. Castro is a Senior Analyst at the Information Technology and Innovation Foundation. Richard Bennett is a Research Fellow at the Information Technology and Innovation Foundation. ABOUT THE INFORMATION TECHNOLOGY AND INNOVATION FOUNDATION The Information Technology and Innovation Foundation (ITIF) is a Washington, DC-based think tank at the cutting edge of designing innovation policies and exploring how advances in technology will create new economic opportunities to improve the quality of life. Non-profit, and non-partisan, we offer pragmatic ideas that break free of economic philosophies born in eras long before the first punch card computer and well before the rise of modern China and pervasive globalization. ITIF, founded in 2006, is dedicated to conceiving and promoting the new ways of thinking about technology-driven productivity, competitiveness, and globalization that the 21st century demands.
    [Show full text]
  • Internet Pro Za Č Ínající Uživatele
    Gymnázium Františka Živného INFORMAČNÍ CENTRUM 2/2004 INTERNET PRO ZAČ ÍNAJÍCÍ UŽIVATELE DISTANČ NÍ TEXT Bohumín 2004 TOMÁŠ HUDEC OBSAH PŘ EDMĚ TU Úvod........................................................................................................................................... 3 1. Připojme se..................................................................................................................... 4 2. Prohlížíme stránky.......................................................................................................... 6 3. Hledání v kupce sena.................................................................................................... 10 4. Elektronická pošta ....................................................................................................... 16 2 ÚVOD Text je určen široké pedagogické veřejnosti z řad učitelů „ neinformatiků“, kteří chtějí získat základní informace z oblasti Internetu a chtějí pokud možnost studovat sami v čase, který jim vyhovuje a v prostředí, kde se cítí dobře, nemusí se stresovat a mohou pracovat svým tempem.. Text je napsán distanční formou a může být studován jak samostatně, tak v případě otevření příslušného kurzu pod vedením lektora. Text je oproštěn od obecných informací a je směrován k praktické činnosti čtenáře či frekventanta kurzu. Hlubší studium a někdy i odpovědi na otázky předpokládají další studium literatury nebo vyhledávání informací právě z Internetu. Po prostudování textu budete znát: • Co je to Internet a jak se připojit • Jaké jsou základní
    [Show full text]
  • Dining Hall,” “Cafeteria,” and “Campus Food Service” • Be Specific As You Learn More – E.G
    THE INTERNET Conducting Internet Research Computer Applications I Martin Santos Jorge Cab Objectives • After completing this section, students will be able to: • Understand the internet • Identify the different tools for research • Use and cite references from the internet Lecturers: Martin Santos/Jorge Cab (S.P.J.C.) 2 Vocabulary List • Internet (the Net): a global connection of millions of computer networks • Browser: software that helps a user access web sites (Internet Explorer and Netscape) • Server: a computer that runs special software and sends information over the Internet when requested • World Wide Web (the Web or www.): multimedia portion of the Internet consisting of text, graphics, audio and video • URL: stands for Uniform Resource Locator. It is the website's “address” or what the user types in to make the connection • Web site: a “virtual” place on the Internet with a unique URL • Virtual: “mental” replica of something - you can’t “touch” it – need a “tool” to get to it • Web page: a place on a web site where specific information is located • Home page: main page of a web site and first page to load when a site is accessed • Hyperlink: “clickable” text or graphics – takes you from one place to another – usually underlined and shows a hand shaped icon • Hypertext: capability to “link” or “jump” to other references or cross references by clicking • Cyberspace: “electronic” universe where information from one computer connects with another • Upload: process of transferring information to a page/site on the internet • Download: process of transferring information to a computer • Search engine: a site that scans the contents of other web sites to create a large index of information • Domain (top level): code located in the URL representing the type of organization (i.e., .gov (government), .edu (education), .mil (military), .org (organization – non-profit), .com (commercial – a business – for profit) • Internet Service Provider (ISP): a company with direct connection to the Internet that grants subscribers access to various Internet services.
    [Show full text]
  • To E-Commerce EC4E Ch 01 WA 11-23.Qxd 12/10/2007 5:16 PM Page 2
    EC4E_Ch_01_WA_11-23.qxd 12/10/2007 5:16 PM Page 1 PART 1 CHAPTER 1 The Revolution Is Just Beginning CHAPTER 2 E-commerce Business Models and Concepts Introduction to E-commerce EC4E_Ch_01_WA_11-23.qxd 12/10/2007 5:16 PM Page 2 CHAPTER11 The Revolution Is Just Beginning LEARNING OBJECTIVES After reading this chapter, you will be able to: ■ Define e-commerce and describe how it differs from e-business. ■ Identify and describe the unique features of e-commerce technology and discuss their business significance. ■ Recognize and describe Web 2.0 applications. ■ Describe the major types of e-commerce. ■ Discuss the origins and growth of e-commerce. ■ Understand the evolution of e-commerce from its early years to today. ■ Identify the factors that will define the future of e-commerce. ■ Describe the major themes underlying the study of e-commerce. ■ Identify the major academic disciplines contributing to e-commerce. EC4E_Ch_01_WA_11-23.qxd 12/10/2007 5:16 PM Page 3 MySpace and Facebook: It’s All About You ow many people watched the final episode of the most popular American Htelevision show in history, the Sopranos? Answer: about 12 million (out of a total television audience size of 111 million). Only once in American history has a television show drawn more simultaneous viewers—13 million for NBC’s “America’s Got Talent” premiere in 2006. How many people visit MySpace each month? About 70 million. There are now more than 100 million personal profiles on MySpace. Almost 40 million visit MySpace’s closest social network rival, Facebook, each month.
    [Show full text]
  • Market Research SD-5 Gathering Information About Commercial Products and Services
    Market Research SD-5 Gathering Information About Commercial Products and Services DEFENSE STANDARDIZATION PROGRA M JANUARY 2008 Contents Foreword 1 The Market Research Other Considerations 32 Background 2 Process 13 Amount of Information Strategic Market Research to Gather 32 What Is Market Research? 2 (Market Surveillance) 14 Procurement Integrity Act 32 Why Do Market Research? 2 Identify the Market or Market Paperwork Reduction Act 33 Segment of Interest 14 When Is Market Research Cost of Market Research 34 Done? 5 Identify Sources of Market Information 16 Who Should Be Involved In Market Research? 7 Collect Relevant Market Other Information Information 17 Technical Specialist 8 Document the Results 18 on Market Research 35 User 9 Logistics Specialist 9 Tactical Market Research Appendix A 36 (Market Investigation) 19 Testing Specialist 9 Types of Information Summarize Strategic Market Available on the Internet Cost Analyst 10 Research 19 Legal Counsel 10 Formulate Requirements 20 Appendix B 39 Contracting Officer 10 Web-Based Information Identify Sources of Sources Information 21 Guiding Principles 11 Collect Product or Service Appendix C 47 Examples of Tactical Start Early 11 Information from Sources 22 Collect Information from Information Define and Document Product or Service Users 26 Requirements 11 Evaluate the Data 27 Refine as You Proceed 12 Document the Results 30 Tailor the Investigation 12 Repeat as Necessary 12 Communicate 12 Involve Users 12 Foreword The Department of Defense (DoD) relies extensively on the commercial market for the products and services it needs, whether those products and services are purely commercial, modified for DoD use from commercial products and services, or designed specifically for DoD.
    [Show full text]
  • Dogpile.Com First to Combine Search Results from MSN Search with Google, Yahoo and Ask Jeeves
    Dogpile.com First to Combine Search Results From MSN Search with Google, Yahoo and Ask Jeeves With the Addition of MSN Search, Dogpile.com Users Can Efficiently Find More of the Web's Most Relevant Search Results in One Place BELLEVUE, Wash. – August 2, 2005 – Dogpile.com today announced that search results from MSN Search are now available on the Web's leading metasearch engine. By becoming the first to combine results from the four leading search sites—MSN Search, Google, Yahoo and Ask Jeeves—Dogpile.com gives consumers the most comprehensive view of the Web and helps them efficiently retrieve the most relevant results. The addition of MSN Search to Dogpile.com further extends Dogpile.com's differentiation from any single search engine. Most people believe search results across all four engines are the same, when, in fact, the vast majority of the results from each engine are different. According to a new study, researchers at the University of Pittsburgh and the Pennsylvania State University evaluated 12,570 random queries run on MSN Search, Google, Yahoo and Ask Jeeves. They found only 1.1 percent of the first page results are the same across all four engines. The full results of the study can be found at http://CompareSearchEngines.dogpile.com/whitepaper. "By bringing together the best results from the top engines on Dogpile.com, consumers can be confident they are receiving the most relevant results," said Brian Bowman, vice president of marketing and product management for InfoSpace Search & Directory. Dogpile.com has built a tool that allows consumers to compare the results of the leading engines for themselves.
    [Show full text]
  • The Social Economy
    McKinsey Global Institute McKinsey Global Institute The social economy: Unlocking value and productivity through social technologies social through productivity and value Unlocking economy: The social July 2012 The social economy: Unlocking value and productivity through social technologies The McKinsey Global Institute The McKinsey Global Institute (MGI), the business and economics research arm of McKinsey & Company, was established in 1990 to develop a deeper understanding of the evolving global economy. Our goal is to provide leaders in the commercial, public, and social sectors with the facts and insights on which to base management and policy decisions. MGI research combines the disciplines of economics and management, employing the analytical tools of economics with the insights of business leaders. Our “micro-to-macro” methodology examines microeconomic industry trends to better understand the broad macroeconomic forces affecting business strategy and public policy. MGI’s in-depth reports have covered more than 20 countries and 30 industries. Current research focuses on six themes: productivity and growth; the evolution of global financial markets; the economic impact of technology and innovation; urbanization; the future of work; and natural resources. Recent reports have assessed job creation, resource productivity, cities of the future, and the impact of big data. MGI is led by three McKinsey & Company directors: Richard Dobbs, James Manyika, and Charles Roxburgh. Susan Lund serves as director of research. Project teams are led by a group of senior fellows and include consultants from McKinsey’s offices around the world. These teams draw on McKinsey’s global network of partners and industry and management experts. In addition, leading economists, including Nobel laureates, act as research advisers.
    [Show full text]
  • Internet and Information About Nuclear Sciences
    SK00K0107 INTERNET AND INFORMATION ABOUT NUCLEAR SCIENCES. THE WORLD WIDE WEB VIRTUAL LIBRARY: NUCLEAR SCIENCES Jozef KURUC Department of Nuclear Chemistry, Faculty of Natural Sciences, Comenius University, Mlynska dolina CH-1, SK-842 15 Bratislava, Slovak Republic, E-mail: [email protected] Similarly as in other areas, as well as in chemistry, INTERNET has brought revolutionary changes in searching, processing of information and in.the teaching of chemistry [1], The powerful instrument in searching of information in INTERNET are different browsers of the web-pages (www.altavista.com, www.yahoo.com, search.excite.com, www.webcrawier.com, www.iycos.com, infoseeek.go.com, www.hotbot.com, www.zoznam.sk, www.kompas.sk www.seznam.cz and other) [2], but they usually give over-much number of web-pages. Sometimes is ill and slowly to obtain necessary information from so over- much number searched and offered web-pages. By searching in the INTERNET assists the advanced searching, but sometimes it does not conduce to searched information. For assistance by the solving of these problems and for speeding-up of the work serve specialised servers, which give grouped information from certain scientific area and first and foremost links for next relative relevant web-links and web-pages, which are in the area of chemistry, for example, Yahoo- Chemistry-Server [3], list of Mendeleev periodic tables of elements [4], from which each provides certain supplementary information about properties of individual elements, isotopes, occasionally radionuclides. Some of them provide more detail information about radioisotopes [5-7], in nuclear physics it is, for example, Nuclear Info WWW Server [8}.
    [Show full text]
  • Multimedia, Internet, On-Line
    Section IV: Multimedia, the Internet, and On-Line Services High-End Digital Video Applications Larry Amiot Electronic and Computing Technologies Division Argonne National Laboratory The emphasis of this paper is on the high-end applications Internet and Intranet that are driving digital video. The research with which I am involved at Argonne National Laboratory is not done on dig- The packet video networks which currently support many ital video per se, but rather on how the research applications applications such as file transfer, Mbone video (talking at the laboratory drive its requirements for digital video. The heads), and World Wide Web browsing are limiting for high- paper will define what digital video is, what some of its com- quality video because of the low throughput one can achieve ponents are, and then discuss a few applications that are dri- via the Internet or intranets. Examples of national packet ving the development of these components. The focus will be switched networks developed in the last several years include on what digital video means to individuals in the research the National Science Foundation Network (NSFNet). The and education community. Department of Energy had its own network called ESNET, and the National Aeronautics and Space Administration The Digital Video Environment (NASA) had a network as well. Recently, the NSFNet was de- commissioned, and commercial interests are now starting to In 1996, a group of people from several universities in the fill that void. Research and education communities are find- Midwest and from Argonne formed a Video Working Group. ing, however, that this new commercial Internet is too re- This body tried to define the areas of digital video of impor- stricting and does not meet their throughput requirements; it tance to their institutions.
    [Show full text]
  • Natural Language Processing Technique for Information Extraction and Analysis
    International Journal of Research Studies in Computer Science and Engineering (IJRSCSE) Volume 2, Issue 8, August 2015, PP 32-40 ISSN 2349-4840 (Print) & ISSN 2349-4859 (Online) www.arcjournals.org Natural Language Processing Technique for Information Extraction and Analysis T. Sri Sravya1, T. Sudha2, M. Soumya Harika3 1 M.Tech (C.S.E) Sri Padmavati Mahila Visvavidyalayam (Women’s University), School of Engineering and Technology, Tirupati. [email protected] 2 Head (I/C) of C.S.E & IT Sri Padmavati Mahila Visvavidyalayam (Women’s University), School of Engineering and Technology, Tirupati. [email protected] 3 M. Tech C.S.E, Assistant Professor, Sri Padmavati Mahila Visvavidyalayam (Women’s University), School of Engineering and Technology, Tirupati. [email protected] Abstract: In the current internet era, there are a large number of systems and sensors which generate data continuously and inform users about their status and the status of devices and surroundings they monitor. Examples include web cameras at traffic intersections, key government installations etc., seismic activity measurement sensors, tsunami early warning systems and many others. A Natural Language Processing based activity, the current project is aimed at extracting entities from data collected from various sources such as social media, internet news articles and other websites and integrating this data into contextual information, providing visualization of this data on a map and further performing co-reference analysis to establish linkage amongst the entities. Keywords: Apache Nutch, Solr, crawling, indexing 1. INTRODUCTION In today’s harsh global business arena, the pace of events has increased rapidly, with technological innovations occurring at ever-increasing speed and considerably shorter life cycles.
    [Show full text]
  • How to Choose a Search Engine Or Directory
    How to Choose a Search Engine or Directory Fields & File Types If you want to search for... Choose... Audio/Music AllTheWeb | AltaVista | Dogpile | Fazzle | FindSounds.com | Lycos Music Downloads | Lycos Multimedia Search | Singingfish Date last modified AllTheWeb Advanced Search | AltaVista Advanced Web Search | Exalead Advanced Search | Google Advanced Search | HotBot Advanced Search | Teoma Advanced Search | Yahoo Advanced Web Search Domain/Site/URL AllTheWeb Advanced Search | AltaVista Advanced Web Search | AOL Advanced Search | Google Advanced Search | Lycos Advanced Search | MSN Search Search Builder | SearchEdu.com | Teoma Advanced Search | Yahoo Advanced Web Search File Format AllTheWeb Advanced Web Search | AltaVista Advanced Web Search | AOL Advanced Search | Exalead Advanced Search | Yahoo Advanced Web Search Geographic location Exalead Advanced Search | HotBot Advanced Search | Lycos Advanced Search | MSN Search Search Builder | Teoma Advanced Search | Yahoo Advanced Web Search Images AllTheWeb | AltaVista | The Amazing Picture Machine | Ditto | Dogpile | Fazzle | Google Image Search | IceRocket | Ixquick | Mamma | Picsearch Language AllTheWeb Advanced Web Search | AOL Advanced Search | Exalead Advanced Search | Google Language Tools | HotBot Advanced Search | iBoogie Advanced Web Search | Lycos Advanced Search | MSN Search Search Builder | Teoma Advanced Search | Yahoo Advanced Web Search Multimedia & video All TheWeb | AltaVista | Dogpile | Fazzle | IceRocket | Singingfish | Yahoo Video Search Page Title/URL AOL Advanced
    [Show full text]
  • United States Patent (19) 11 Patent Number: 6,094,649 Bowen Et Al
    US006094649A United States Patent (19) 11 Patent Number: 6,094,649 Bowen et al. (45) Date of Patent: Jul. 25, 2000 54) KEYWORD SEARCHES OF STRUCTURED “Charles Schwab Broadens Deployment of Fulcrum-Based DATABASES Corporate Knowledge Library Application', Uknown, Full 75 Inventors: Stephen J Bowen, Sandy; Don R crum Technologies Inc., Mar. 3, 1997, pp. 1-3. Brown, Salt Lake City, both of Utah (List continued on next page.) 73 Assignee: PartNet, Inc., Salt Lake City, Utah 21 Appl. No.: 08/995,700 Primary Examiner-Hosain T. Alam 22 Filed: Dec. 22, 1997 Assistant Examiner Thuy Pardo Attorney, Agent, or Firm-Computer Law---- (51) Int. Cl." ...................................................... G06F 17/30 52 U.S. Cl. ......................................... 707/3; 707/5; 707/4 (57 ABSTRACT 58 Field of Search .................................... 707/1, 2, 3, 4, 707/5, 531, 532,500 Methods and Systems are provided for Supporting keyword Searches of data items in a structured database, Such as a 56) References Cited relational database. Selected data items are retrieved using U.S. PATENT DOCUMENTS an SQL query or other mechanism. The retrieved data values 5,375,235 12/1994 Berry et al. ................................. is are documented using a markup language such as HTML. 5,469,354 11/1995 Hatakeyama et al. ... 707/3 The documents are indexed using a web crawler or other 5,546,578 8/1996 Takada ................. ... 707/5 indexing agent. Data items may be Selected for indexing by 5,685,003 11/1997 Peltonen et al. .. ... 707/531 5,787.295 7/1998 Nakao ........... ... 707/500 identifying them in a data dictionary. The indexing agent 5,787,421 7/1998 Nomiyama ..
    [Show full text]