1. the Content Score Is the Resulting Score Provided by the Indexing Engine Avail

Total Page:16

File Type:pdf, Size:1020Kb

1. the Content Score Is the Resulting Score Provided by the Indexing Engine Avail INSTITUTO TECNOLÓGICO Y DE ESTUDIOS SUPERIORES DE MONTERREY CAMPUS MONTERREY PROGRAMA DE GRADUADOS EN ELECTRÓNICA, COMPUTACIÓN, INFORMACIÓN Y COMUNICACIONES TECNOLÓGICO DE MONTERREY CONCEPT BASED RANKING IN PERSONAL REPOSITORIES TESIS PRESENTADA COMO REQUISITO PARCIAL PARA OBTENER EL GRADO ACADÉMICO DE: MAESTRÍA EN CIENCIAS EN TECNOLOGÍA INFORMÁTICA POR ALBERTO JOAN SAAVEDRA CHOQUE MONTERREY, N. L., DICIEMBRE 2007 Concept Based Ranking in Personal Repositories by Alberto Joan Saavedra Choque Thesis Presented to the Gradúate Program in Electronics, Computing, Information and Communications School This thesis is a partial requirement to obtain the academic degree of Master of Science in Information Technology Instituto Tecnológico y de Estudios Superiores de Monterrey Campus Monterrey December 2007 A Gary, mi hermano Que me motiva a superarme cada día y a mirar las circunstancias desde diversos puntos de vista. A Yolanda, mi mamá Que me apoya incondicionalmente y me acompaño presencialmente durante un período de mi recorrido en este país que me acogió. A Alberto, mi papá Que me orienta cuando lo necesito y es mi punto de referencia para superar los obstáculos que se presentan en el camino. A Pilar, mi mejor amiga Que conoce mejor que nadie el sacrificio realizado durante esta aventura y me mantuvo enfocado en cumplir exitosamente la trayectoria iniciada. A mi abuelito, tíos y primos Que me hicieron sentir a la distancia, el respaldo y el apoyo de toda la familia. Reconocimientos Al Dr. Juan Carlos Lavariega, mi asesor Por su confianza y apoyo en la realización de esta investigación. Al Dr. Raúl Pérez, sinodal y coordinador de la maestría Por su disponibilidad constante y orientación para responder mis consultas Al Dr. David Garza, Por su confianza para participar en iniciativas de investigación como el proyecto PDLib A los compañeros del proyecto PDLib y del ITESM, Por darme la oportunidad de aprender y trabajar en conjunto con personas muy ca- paces. A los profesores del proyecto PDLib y del ITESM, Por la comprensión y paciencia para transmitir sus conocimientos y aclarar algunas dudas. A LASPAU/OEA Por darme la oportunidad de conocer y ser parte de esta prestigiosa institución: el Tec de Monterrey. ALBERTO JOAN SAAVEDRA CHOQUE Instituto Tecnológico y de Estudios Superiores de Monterrey December 2007 iv Concept Based Ranking in Personal Repositories Summary We are storing an increasing amount of digital information to support our work and life activities. Emails, documents, articles, bookmarks, e-books, images, audio and video are being persisted not only in our hard drives but also in the storage provided by the newer Web 2.0 services. All this information constitutes some sort of personal digital memory, where the key feature is to be able to find what we are looking for as fast as possible. Information requests in these personal repositories are mostly related to the re- finding of documents we have already read/seen. Pulí text search will do the job when we remember the exact words, which is getting harder nowadays; so we may remember a concept associated in our memory to the information object we are looking for. To sol ve this problem, concept based query expansión is the solution. With query expansión, our chances to find the document requested increase. But if the list of results is big, probably it will take some time to lócate it. In repositories like the Personal Digital Library system, which targets an optimal use in portable devices, it is important to have an adequate ranking of the documents to provide a better searching experience. The analysis of the linking structure of documents is a good ranking factor for general search engines. But in the context of personal collections, this factor is not available; so, more information is needed about the document and the tagging approach seems like the best option to provide more criteria to rank appropiately the list of results. This research proposes an architecture of a conceptual information retrieval system for personal repositories, which includes tagging as the semantics provider mechanism and query expansión as the recall enabler. The architecture includes an additional feature to gather more semantics which is the indexing of concepts using latent semantic analysis. Contents Reconocimientos iv Summary v List of Tables ix List of Figures x Chapter 1 Introduction 1 1.1 Personal Digital Repositories 2 1.1.1 Personal Digital Libraries 3 1.1.2 Search in Personal Repositories 3 1.2 Problem Statement 4 1.2.1 Research Goal 5 1.2.2 Hypothesis 5 1.2.3 Contributions 6 1.3 Thesis Outline 6 Chapter 2 Literature Review 8 2.1 Concept based Information Retrieval 8 2.1.1 Set-theoretic conceptual models 10 2.1.2 Algebraic conceptual models 12 2.1.3 Probabilistic conceptual models 15 2.2 Concept Based Organization 17 2.2.1 Tagging 18 2.2.2 Information Retrieval in Folksonomies 20 2.3 Personal Information Re-Use 21 2.3.1 Personal Digital Libraries Retrieval 23 2.4 Concept Based Ranking 24 2.4.1 Search Engine Ranking 25 2.4.2 Domain Specific Ranking 25 2.4.3 Personalized Ranking 26 vi 2.4.4 Query expansión 27 2.4.5 Relevance Feedback 28 2.4.6 Retrieval Performance 29 2.5 Summary 31 Chapter 3 Personal Conceptual Architecture 32 3.1 Architecture of a Conceptual Information Retrieval System 32 3.2 Conceptual Modules 36 3.2.1 Concepts Labeling 36 3.2.2 Concepts Indexing 38 3.2.3 Concepts Ranking 39 3.2.4 Indexing Engine 41 3.2.5 Tagging Store 42 3.2.6 Document Retrieval 42 3.3 Integration Modules 43 Chapter 4 Prototype 44 4.1 Technologies used 44 4.1.1 Tagging Store datábase: db4o 44 4.1.2 Tagging interface: del.icio.us API 46 4.1.3 Concepts Indexing: Lingo 48 4.1.4 Indexing Engine: Lucene 49 4.1.5 Text Summarization: blindLight 51 4.1.6 Query Expansión: WordNet 52 4.2 The Experiment 52 4.2.1 Experiment documents selection 53 4.2.2 índices structure 54 4.2.3 Querying and Ranking 58 4.3 Integration with other systems 60 4.3.1 Search services 60 4.3.2 Upload service 61 4.3.3 Indexing service 62 4.4 Considerations for PDLib 62 4.4.1 Advantages 62 4.4.2 Considerations 63 4.5 Summary 63 Chapter 5 Results, Conclusions and Future Work 65 5.1 Results 65 5.2 Conclusions 67 vii 5.3 Futurework 69 5.3.1 Tagging interface and usability 69 5.3.2 Attention metadata for relevance feedback 70 5.3.3 Probabilistic Latent Semantic Analysis for Concepts Indexing . 70 Bibliography 71 Appendix A XML Files 80 Appendix B UML diagrams 83 B.l Class diagrams 83 B.2 Sequence diagrams 86 viii List of Tables 2.1 Constraints and Retrieval Formulas 30 4.1 Comparison of datábase technologies 46 4.2 Test set summary 54 4.3 Set features for concepts indexing 57 5.1 Comparison of precisión and recall averages 66 ix List of Figures 2.1 Information Retrieval Models 9 3.1 Information Retrieval System 33 3.2 The Retrieval Process 34 3.3 Architecture of a conceptual IR 35 3.4 Concepts Labeling Module 37 3.5 Concepts Indexing Module 38 3.6 Concepts Ranking Module 40 4.1 Tagging Store Data Model 45 4.2 Delicious Class Diagram 47 4.3 Lingo Algorithm 49 4.4 Lucene Index Update and Search Configuration 50 4.5 Automatic Text Summarization with blindLight 52 4.6 WordNet Lucene índex 53 4.7 del.icio.us user interface 54 4.8 Test set tag cloud 55 4.9 Lucene index document example 56 4.10 Concepts in db4o datábase 57 4.11 Bayesian Network for Ranking 59 4.12 delPDLib Web Service WSDL diagram 60 5.1 Google Custom Search Engine Results 66 5.2 Expanded Relevant Ranking Results 67 B.l Package Dependency 84 B.2 Package edu.itesm.pdlib.delicious 85 B.3 Package edu.itesm.pdlib.lucene - index process 86 B.4 Package edu.itesm.pdlib.lucene - query process 87 B.5 Package edu.itesm.pdlib.model - Documents model 88 B.6 Package edu.itesm.pdlib.model - Results model 89 B.7 retrieveAUPosts method - sequence diagram 90 Chapter 1 Introduction Our society's progress has been dependent on access to the knowledge generated by our ancestors; throughout our history the libraries had been gathering, archiving and preserving the written knowledge, classifying and organizing these collections to make them available to the public. Nowadays, since an important amount of our inter- actions can be digitized, we can even have access to the information generated by our colleagues almost in an on-needed basis, it is the digital libraries' job to organize this information. A digital library is a collection of digital information objects and services which support users in dealing with management, access, storage and manipulation of these objects, taking special consideration in the organization and presentation of the objects so users find them useful and have fewer problems in understanding them [78]. One of the key requirements for a digital library is to provide adequate search tools to allow agüe access to the information requested [55]. The amount of information available is huge. According to the EMC's Digital Universe Whitepaper [51] 161 exabytes where used to store digital information in 2006, only 25% of the 161 exabytes is original data. These numbers are going to keep grow- ing as more information becomes digitally available - like digital TV broadcasting or the digitization of books collections by the Open Content Alliance and Google - and archival efforts like the Internet Archive or the Televisión Broadcast archiving [66] bring memory capabilities to the most powerful médiums in our culture; digital libraries have to take into consideration this constant growth.
Recommended publications
  • Hacking the Master Switch? the Role of Infrastructure in Google's
    Hacking the Master Switch? The Role of Infrastructure in Google’s Network Neutrality Strategy in the 2000s by John Harris Stevenson A thesis submitteD in conformity with the requirements for the Degree of Doctor of Philosophy Faculty of Information University of Toronto © Copyright by John Harris Stevenson 2017 Hacking the Master Switch? The Role of Infrastructure in Google’s Network Neutrality Strategy in the 2000s John Harris Stevenson Doctor of Philosophy Faculty of Information University of Toronto 2017 Abstract During most of the decade of the 2000s, global Internet company Google Inc. was one of the most prominent public champions of the notion of network neutrality, the network design principle conceived by Tim Wu that all Internet traffic should be treated equally by network operators. However, in 2010, following a series of joint policy statements on network neutrality with telecommunications giant Verizon, Google fell nearly silent on the issue, despite Wu arguing that a neutral Internet was vital to Google’s survival. During this period, Google engaged in a massive expansion of its services and technical infrastructure. My research examines the influence of Google’s systems and service offerings on the company’s approach to network neutrality policy making. Drawing on documentary evidence and network analysis data, I identify Google’s global proprietary networks and server locations worldwide, including over 1500 Google edge caching servers located at Internet service providers. ii I argue that the affordances provided by its systems allowed Google to mitigate potential retail and transit ISP gatekeeping. Drawing on the work of Latour and Callon in Actor– network theory, I posit the existence of at least one actor-network formed among Google and ISPs, centred on an interest in the utility of Google’s edge caching servers and the success of the Android operating system.
    [Show full text]
  • Google Pagerank and Markov Chains
    COMP4121 Lecture Notes The PageRank, Markov chains and the Perron Frobenius theory LiC: Aleks Ignjatovic [email protected] THE UNIVERSITY OF NEW SOUTH WALES School of Computer Science and Engineering The University of New South Wales Sydney 2052, Australia Topic One: the PageRank Please read Chapter 3 of the textbook Networked Life, entitled \How does Google rank webpages?"; other references on the material covered are listed at the end of these lecture notes. 1 Problem: ordering webpages according to their importance Setup: Consider all the webpages on the entire WWW (\World Wide Web") as a directed graph whose nodes are the web pages fPi : Pi 2 WWW g, with a directed edge Pi ! Pj just in case page Pi points to page Pj, i.e. page Pi has a link to page Pj. Problem: Rank all the webpages of the WWW according to their \importance".1 Intuitively, one might feel that if many pages point to a page P0, then P0 should have a high rank, because if one includes on their webpage a link to page P0, then this can be seen as their recommendation of P0. However, this is not a good criterion for several reasons. For example, it can be easily manipulated to increase the rating of any webpage P0, simply by creating a lot of silly web pages which just point to P0; also, if a webpage \generously" points to a very large number of webpages, such \easy to get" recommendation is of dubious value. Thus, we need to refine our strategy how to rank webpages according to their \importance", by doing something which cannot be easily manipulated \locally" (i.e., by any group of people who might collude, even if such a group is sizeable).
    [Show full text]
  • Google: the World's First Information Utility?
    BISE – STATE OF THE ART Google: The World’s First Information Utility? The idea of information utilities came to naught in the 70s and 80s, but the world wide web, the Internet, mobile access, and a plethora of access devices have now made information utilities within reach. Google’s business model, IT infrastructure, and innovation strategies have created the capabilities needed for Google to become the world’s first global information utility. But, the difficulty of monetizing utility services and concerns about privacy, which could bring government regulation, might stymie Google’s growth plans. DOI 10.1007/s12599-008-0011-6 a national scale (Sackmann and Nie 1970; the number of users to the point where it The Authors Sackman and Boehm 1972). Not to be left is the dominant search player in both the out, the idea was also promoted by the U.S. and the world. This IT-based strategy Rex Chen Computer Usage Development Institute has allowed Google to reach more than Prof. Kenneth L. Kraemer, PhD in Japan, the British Post Office in the UK, $16 billion in annual revenue, a stock price Prakul Sharma and Bell Canada and the Telecommunica- over $600 per share and a market capital- University of California, Irvine tions Board in Canada (Press 1974). ization of nearly $200 billion. Center for Research on Information The idea was so revolutionary at the Over the next decade, Google plans to Technology and Organization (CRITO) time that at least one critic called for a extend its existing services to the wire- 5251 California Ave., Ste.
    [Show full text]
  • The Commercial Search Engine Industry and Alternatives to The
    and Yahoo!), works successfully to hide an increasingly profitable information and advertising industry. Search engine companies, of which there are really only three, have morphed into advertising conglomerates and now serve advertisers, not users, in a mutual, rather delightful, relationship. The advertiser pays the search engine to be affiliated with certain key words; the search company The Commercial Search Engine Industry and provides the sponsored links, which users click on; and traffic is driven to the sites of advertisers. Indeed, users are now universally described as consumers in the Alternatives to the Oligopoly marketplace rhetoric of search engine enterprise; they are the pawns who the business world seeks to manipulate into clicking those links that will ultimately Bettina Fabos - Interactive Media Studies and Journalism, Miami lead to the most profits. University of Ohio In this arrangement, helping users find relevant information is a priority only in This essay details the search engine industry’s transformation into an that, like other commercial media systems (think radio or television), there has to advertising oligopoly. It discusses how librarians, educators, archivists, be some decent content to create a perception that Internet users matter. In fact, activists, and citizens, many of whom are the guardians of indispensable users only matter to the extent that they participate in the commercial system by noncommercial websites and portals, can band together against a sea of knowingly—or unknowingly—clicking on sponsored links.1 Keyword advertising advertising interests and powerful and increasingly overwhelming online generated an unprecedented $3.9 billion in 2004. By 2005 Google’s advertising marketing strategies.
    [Show full text]
  • Link Analysis Google
    Introduction to the Use of Link Analysis by Google Amy Langville Carl Meyer Department of Mathematics North Carolina State University Raleigh, NC SIAM Annual Meeting 7/15/04 Outline • Introduction to Information Retrieval (IR) • Link Analysis • PageRank Algorithm Short History of IR IR = search within doc. coll. for particular info. need (query) B. C. cave paintings 7-8th cent. A.D. Beowulf 12th cent. A.D. invention of paper, monks in scriptoriums 1450 Gutenberg’s printing press 1700s Franklin’s public libraries 1872 Dewey’s decimal system 1900s Card catalog 1940s-1950s Computer 1960s Salton’s SMART system 1989 Berner-Lee’s WWW the pre-1998 Web Yahoo • hierarchies of sites • organized by humans Best Search Techniques • word of mouth • expert advice Overall Feeling of Users • Jorge Luis Borges’ 1941 short story, The Library of Babel When it was proclaimed that the Library contained all books, the first impression was one of extravagant happiness. All men felt themselves to be the masters of an intact and secret treasure. There was no personal or world problem whose eloquent solution did not exist in some hexagon. ... As was natural, this inordinate hope was followed by an excessive depression. The certitude that some shelf in some hexagon held precious books and that these precious books were inaccessible, seemed almost intolerable. 1998 ... enter Link Analysis Change in User Attitudes about Web Search Today • “It’s not my homepage, but it might as well be. I use it to ego-surf. I use it to read the news. Anytime I want to find out anything, I use it.” - Matt Groening, creator and executive producer, The Simpsons • “I can’t imagine life without Google News.
    [Show full text]
  • Ambitious Form a General Theory of Visual Culture
    68 Art Ambitious Form A General Theory Giambologna, Ammanati, and Danti in Florence of Visual Culture Michael W. Cole Whitney Davis Ambitious Form describes the transformation of Italian What is cultural about vision—or visual about culture? sculpture during the neglected half-century between In this ambitious book, Whitney Davis provides new the death of Michelangelo and the rise of Bernini. The answers to these difficult and important questions by book follows the Florentine careers of three major presenting an original framework for understanding sculptors—Giambologna, Bartolomeo Ammanati, and visual culture. Grounded in the theoretical traditions Vincenzo Danti—as they negotiated the politics of the of art history, A General Theory of Visual Culture argues Medici court and eyed one another’s work, setting new that, in a fully consolidated visual culture, artifacts and aims for their art in the process. Only through a com- pictures have been made to be seen in a certain way; parative look at Giambologna and his contemporaries, what Davis calls “visuality” is the visual perspective it argues, can we understand them individually—or from which certain culturally constituted aspects of understand the period in which they worked. artifacts and pictures are visible to informed viewers. Michael Cole shows how the concerns of central In this book, Davis provides a systematic analysis of Italian artists changed during the last decades of the visuality and describes how it comes into being as a Cinquecento. Whereas their predecessors had focused historical form of vision. on specific objects and on the particularities of materi- Expansive in scope, A General Theory of Visual als, late sixteenth-century sculptors turned their atten- Culture draws on art history, aesthetics, the psychology tion to models and design.
    [Show full text]
  • Eric Miller Interview
    Volume 1 Issue 2 SIGSEMIS July 2004 Bulletin Theme: “SW Challenges http://www.sigsemis.org for KM” The Official Bimonthly Newsletter of AIS Special Interest Group on Semantic Web and Information Systems SIGSEMIS © 2004 Inside Bulletin EDITORIAL… Editorial................................1 For the second time AIS SIGSEMIS Bulletin is on air. We would like to thank SIGSEMIS Activities ...........2 Eric Miller Interview...........3 you for your warm receipt of the first issue which gives us a motive to IJSWIS Announcement ....11 continue our volunteer hard work towards more activities and services for our Dialogue Column..............18 research community. Our portal site at www.sigsemis.org had 1650 unique Special Issue Theme..........23 visitors (and same number of AIS SIGSEMIS Bulletin 1(1) downloads) from Research in Progress over 50 countries. Inside this issue you can find many interesting things. A Column...............................68 call for papers for our SIG Official Peer Reviewed Journal which will be Special Section: DERI........73 published by IDEA Group (inaugural issue 1/2005). I invite you to consider Chris Bussler Interview....74 this publication outlet as an interesting and high quality journal in which we Regular Columns ..............93 will put all of our efforts in order to meet your high standards for quality and Semantic Search to communicate high impact research. Technologies ......................93 SW Technologies ...............99 Methodologies for SW....103 The special theme that is discussed in this issue is Semantic Web Challenges SW Basics..........................113 for Knowledge Management: towards the Knowledge Web. We would like to SW Calendar ....................115 thank all the contributors of the short articles and especially the directors and SW Research Centers......119 Professors Christoph Bussler and Dieter Fensel of Digital Enterprise Research Projects Corner ................123 Institute (DERI) who contributed to the special section of DERI presentation.
    [Show full text]
  • Chenlamkraemer2007crito.Pdf
    INTRODUCTION Arguably the most popular search engine available today, Google is widely known for its unparalleled search engine technology, embodied in the web page ranking algorithm, PageRanki and running on an efficient distributed computer system. In fact, the verb “to Google” has ingrained itself in the vernacular as a synonym of “[performing] a web search.”1 The key to Google’s success has been its strategic use of both software and hardware information technologies. The IT infrastructure behind the search engine includes huge storage databases and numerous server farms to produce significant computational processing power. These critical IT components are distributed across multiple independent computers that provide parallel computing resources. This architecture has allowed Google’s business to reach a market capital over $100 billion and become one of the most respected and admirable companies in the world. MARKET ENVIRONMENTS Search Engine Internet search engines were first developed in the early 1990s to facilitate the sharing of information among researchers. The original effort to develop a tool for information search occurred simultaneously across multiple universities is shown in Table 1. Although functionalities of these systems were very limited, they provided the foundation for future web- based search engines. TABLE 1. Early Search Engines Search Engine Name University Year Inventor Archie McGill University 1990 Alan Emtage Veronica University of Nevada 1993 Many students WWW Wanderer Massachusetts Institute of Technology 1993 Matthew Gray Source: Battelle, 2005. Search Industry During the 1990s, the Internet experienced exponential growth with thousands of new web pages being created daily. Online document search became the chief method of navigating the ever- expanding World Wide Web, as Internet users sought useful information among the largely disorganized pages.
    [Show full text]
  • Ranking Pages and the Topology of The
    Ranking pages and the topology of the web Argimiro Arratia∗ Departament de Llenguatges i Sistemes Inform`atics, Universitat Polit`ecnica de Catalunya, Barcelona, Spain [email protected] Carlos Mariju´an † Departamento de Matem´atica Aplicada Universidad de Valladolid, Valladolid, Spain, [email protected] Abstract This paper presents our studies on the rearrangement of links in the structure of websites for the purpose of improving the valuation of a page or group of pages as established by a ranking function as Google’s PageRank. We build our topo- logical taxonomy starting from unidirectional and bidirectional rooted trees, and up to more complex hierarchical structures as cyclical rooted trees (obtained by closing cycles on bidirectional trees) and PR–digraph rooted trees (digraphs whose condensation digraph is a rooted tree that behave like cyclical rooted trees). We give different modifications on the structure of these trees and its effect on the valuation given by the PageRank function. We derive closed formulas for the PageRank of the root of various types of trees, and establish a hierarchy of these topologies in terms of PageRank. Keywords: PageRank, world wide web topology, link structure. AMS Math. Subject Classification: 05C05, 05C99, 05C40, 68R10, 94C15. 1 Introduction arXiv:1105.1595v2 [cs.DM] 23 Apr 2012 Google is still today’s most popular search engine for the World Wide Web, and the key to its success has been its PageRank algorithm [4], which ranks documents based primarily on the link structure of the web. Simply put, PageRank considers a link from a page H to another page J as a weighted vote from H in favour of the importance of J, where the weight of the vote of H is itself determined by the number of links (or voters) to H.
    [Show full text]
  • Link Analysis Web Search Engines
    Introduction to the Use of Link Analysis by Web Search Engines Amy Langville Carl Meyer Department of Mathematics North Carolina State University Raleigh, NC Mt. St. Mary’s College 4/19/04 Outline • Introduction to Information Retrieval (IR) • Link Analysis • HITS Algorithm • PageRank Algorithm Short History of IR IR = search within doc. coll. for particular info. need (query) B. C. cave paintings 7-8th cent. A.D. Beowulf 12th cent. A.D. invention of paper, monks in scriptoriums 1450 Gutenberg’s printing press 1700s Franklin’s public libraries 1872 Dewey’s decimal system Card catalog 1940s-1950s Computer 1960s Salton’s SMART system 1989 Berner-Lee’s WWW the pre-1998 Web Yahoo • hierarchies of sites • organized by humans Best Search Techniques • word of mouth • expert advice Overall Feeling of Users • Jorge Luis Borges’ 1941 short story, The Library of Babel When it was proclaimed that the Library contained all books, the first impression was one of extravagant happiness. All men felt themselves to be the masters of an intact and secret treasure. There was no personal or world problem whose eloquent solution did not exist in some hexagon. ... As was natural, this inordinate hope was followed by an excessive depression. The certitude that some shelf in some hexagon held precious books and that these precious books were inaccessible, seemed almost intolerable. 1998 ... enter Link Analysis Change in User Attitudes about Web Search Today • “It’s not my homepage, but it might as well be. I use it to ego-surf. I use it to read the news. Anytime I want to find out anything, I use it.” - Matt Groening, creator and executive producer, The Simpsons • “I can’t imagine life without Google News.
    [Show full text]
  • Discoveries That Will Change Our World
    discoveries that will change our world OFFICE OF TECHNOLOGY LICENSING ANNUAL REPORT 2002 - 2003 OFFICE OF TECHNOLOGY LICENSING HAT WILL LIFE BE LIKE WHEN OUR COVER BABY TURNS 21? WILL LIFE BE WEASIER, WITH UNIMAGINED CONVENIENCES, ENVIRONMENTALLY-FRIENDLY PRODUCTS, ENOUGH FOOD AND CLEAN WATER FOR ALL? ALL ABOUT US OR WILL IT BE A TIME OF GRAVE DECISIONS ABOUT HOW AND FOR WHOM AND FOR WHAT AVERAGE AGE 38 PURPOSE OUR LIMITED RESOURCES WILL BE USED, WHEN TECHNOLOGICAL ADVANCES HAVE AVERAGE YEARS AT OTL 7 COMPLICATED OUR LIVES AND WHEN THE TECHNOLOGICAL GAP BETWEEN “HAVES” AND CUMULATIVE EXPERIENCE 246.5 YEARS “HAVE-NOTS” IS AN UNCROSSABLE CHASM? NUMBER OF EMPLOYEES 24 OUR BABY’SFUTURE IS BEING CREATED NOW, BY RESEARCH AND INNOVATIONS AT STANFORD AND ELSEWHERE. MOST OF STANFORD’S INVENTIONS ARE EMBRYONIC, EARLY-STAGE DEVEL- OPMENTS FOR WHICH COMMERCIAL PRODUCTS ARE LIKELY TO BE 10-15 YEARS AWAY. YET THESE INNOVATIONS HAVE THE POTENTIAL TO CHANGE OUR LIVES IN WAYS WE CAN ONLY IMAGINE NOW. TAKE A PEAK INTO THE FUTURE…. DISCOVERIES THAT WILL CHANGE OUR WORLD | 1 IMAGINE A WORLD OF VISION FOR THE BLIND, WHERE IMAGINE IF DOCTORS COULD SEE CELLS AT THE MOLECULAR DISEASES SUCH AS MACULAR DEGENERATION CAN BE CURED. LEVEL, ENABLING THEM TO DIAGNOSE AND TREAT DISEASES Age-related macular degeneration is the leading cause of blind- SUCH AS CANCER AND HEART DISEASE. Molecular Imaging, or ness in people over the age of 65. Currently, there is no effec- MI, is a young research area, just 10 years old, that combines tive treatment for most patients with AMD, a disease that often contrast agents and/or pharmacologically active diagnostic results in permanent damage to photoreceptors.
    [Show full text]
  • Strategic Use of Information Technology - Google
    UC Irvine I.T. in Business Title Strategic Use of Information Technology - Google Permalink https://escholarship.org/uc/item/0nj460xn Authors Chen, Rex Lam, Oisze Kraemer, Kenneth Publication Date 2007-02-01 eScholarship.org Powered by the California Digital Library University of California INTRODUCTION Arguably the most popular search engine available today, Google is widely known for its unparalleled search engine technology, embodied in the web page ranking algorithm, PageRanki and running on an efficient distributed computer system. In fact, the verb “to Google” has ingrained itself in the vernacular as a synonym of “[performing] a web search.”1 The key to Google’s success has been its strategic use of both software and hardware information technologies. The IT infrastructure behind the search engine includes huge storage databases and numerous server farms to produce significant computational processing power. These critical IT components are distributed across multiple independent computers that provide parallel computing resources. This architecture has allowed Google’s business to reach a market capital over $100 billion and become one of the most respected and admirable companies in the world. MARKET ENVIRONMENTS Search Engine Internet search engines were first developed in the early 1990s to facilitate the sharing of information among researchers. The original effort to develop a tool for information search occurred simultaneously across multiple universities is shown in Table 1. Although functionalities of these systems were very limited, they provided the foundation for future web- based search engines. TABLE 1. Early Search Engines Search Engine Name University Year Inventor Archie McGill University 1990 Alan Emtage Veronica University of Nevada 1993 Many students WWW Wanderer Massachusetts Institute of Technology 1993 Matthew Gray Source: Battelle, 2005.
    [Show full text]