Thème : Le Web Les Moteurs De Recherche Moteur De Recherche

Total Page:16

File Type:pdf, Size:1020Kb

Thème : Le Web Les Moteurs De Recherche Moteur De Recherche Thème : Le Web Les moteurs de recherche Capacités attendues : - Comprendre le fonctionnement des moteurs de recherche - Mener une analyse critique des résultats d’un moteur de recherche - Comprendre les enjeux de la publication d’informations Moteur de recherche : application informatique permettant de rechercher une ressource (page Web, image, vidéo, fichier…) à partir d’une requête sous forme de mots-clés. 1. Le fonctionnement des moteurs de recherche * Au brouillon, essayez de schématiser le fonctionnement d’un moteur de recherche. Vocabulaire à retenir : - crawlers : robot d’indexation qui explore automatiquement le Web en suivant les liens entre les différentes pages pour collecter les ressources. - indexation : les mots-clés sont listés, classés et enregistrés sur des serveurs qui stockent les données. - pertinence : répond au besoin au moment où il est exprimé. Le Web est un immense graphe : chaque page est un noeud relié à d’autres nœuds par des liens hypertextes. On pourrait le schématiser de façon très simplifiée, comme ci-contre. Mme Suaudeau SNT 2019-2020 * A partir du nœud E, parcourez le graphe précédant en listant les pages consultées et les pages à visiter à chaque étape : Action Pages visitées Pages à visiter On visite E, lié à A et H E A H On visite A , lié à B E A H B On visite H , lié à J E A H B J On visite B , lié à A et C E A H B J C (A déjà visité) On visite J , lié à F E A H B J C F On visite C , lié à A E A H B J C C F (A déjà visité) On visite F , lié à C D G H E A H B J C F D G (C et H déjà visités) On visite D , lié à / E A H B J C F D G On visite G , lié à / E A H B J C F D G * Que se passe-t-il si l’on commence l’exploration par le nœud A ? La visite est terminée en 3 étapes mais on a « raté » une grande partie du graphe. 2. Les différents moteurs de recherche * Quels moteurs de recherche connaissez-vous ? * Le tableau ci-dessous compare l’utilisation des moteurs de recherche selon différentes zones géographiques pour le mois d’août 2019. Google Bing Yahoo ! DuckDuckGo Baidu Yandex Ecosia MSN Sogou Qwant Afrique 96,89 % 1,70 % 1,11 % 0,12 % 0,05 % 0,04 % Amérique du Nord 89,29 % 5,84 % 3,43 % 1,10 % 0,11 % 0,07 % Amérique du Sud 97,64 % 1,14 % 0,93 % 0,10 % 0,12 % 0,03 % Asie 92,27 % 1,04 % 1,72 % 2,58 % 0,82 % 0,36 % Europe 93,03 % 3,02 % 0,89 % 0,47 % 1,72 % 0,27 % France 84,74 % 11,80 % 1,06 % 0,71 % 0,73 % 0,66 % USA 88,01 % 6,40 % 3,94 % 1,30 % 0,10 % 0,07 % Chine 3,18 % 2,23 % 76,69 % 10,67 % Russie 51,65 % 3,23 % 0,29 % 0,25 % 45,16 % Monde entier 92,37 % 4,23 % 1,80 % 0,55 % 1,10 % 0,51 % https://gs.statcounter.com/search-engine-market-share/ * Entourez en rouge le chiffre le plus élevé de chaque colonne, en bleu le moins élevé. → Dans quelle région Google est-il le plus utilisé ? Amérique du Sud Et le moins ? Chine et Russie → Quelle est à votre avis la nationalité des moteurs de recherche Baidu (Chinois), Yandex (Russe) et Sogou (Chinois) ? Mme Suaudeau SNT 2019-2020 * Google 1995 : rencontre de Larry Page et Sergey Brin à l’Université de Stanford, Californie. Ils travaillent à la mise au point d'un moteur de recherche qui utilise des liens pour déterminer l'importance de pages individuelles sur le Web. Ils l'appellent « BackRub » avant de le renommer « Google » en référence au terme mathématique « gogol » qui désigne le nombre 1 suivi de 100 zéros. 1998 : Naissance de Google Inc. Après l’investissement du cofondateur de Sun Microsystems. 2010 : Google traite plus d’un milliard de requêtes par jour Aujourd’hui : Google emploie plus de 60 000 personnes dans 50 pays différents et conçoit des centaines de produits à travers différentes applications comme Youtube, Android, Smartbox… Les informations susceptibles d’être collectées par Google : - langue - les annonces - vidéos Youtube - services utilisés - données sur l’appareil (modèle, système d’exploitation…) - position - nom profil Google - photo compte Google * Qwant Conçu et basé en France, Qwant est un moteur de recherche qui repose sur deux piliers : - respect de la vie privée : vise à offrir les meilleurs résultats possibles sans chercher à savoir qui est l’internaute ou ce qu’il a cherché précédemment. Les requêtes sont chiffrées pour qu’un tiers ne puisse y avoir accès, l’historique des requêtes n’est pas enregistré et aucun cookie ou traceur sont installé sur le navigateur. - neutralité et impartialité garanties : indexe l’ensemble du Web sans discrimination, et applique partout ses algorithmes de classement sans chercher à mettre certains sites en avant résultats parce qu’ils répondraient à des préoccupations commerciales, politiques ou morales. Fonctionne en partenariat avec Bing pour compléter ses résultats en attendant que leurs crawlers indexent une plus grande partie du Web. * Ecosia 2009 : naissance d’Ecosia suite à un voyage permettant à son fondateur de prendre conscience des problèmes liés à la déforestation. 2019 : 60 millions d’arbres plantés et construction de sa propre centrale solaire pour alimenter les serveurs. Les principes : - Les données de recherche sont anonymisées au bout d’une semaine. - Les données des internautes ne sont pas vendues aux annonceurs publicitaires. - Les recherches sont chiffrées pour qu’un tiers n’y ait pas accès. Fonctionne lui aussi en partenariat avec Bing. Mme Suaudeau SNT 2019-2020 Test avec Turing Enigma Google Qwant Ecosia 1 360 000 résultats 310 000 résultats Vidéos Wikipédia Wikipédia Wikipédia France culture Vidéos Le journal CNRS Youtube France culture Interstices Imperial War Museums Imperial War Museums Hgerodote.net Le journal CNRS Le journal CNRS 3. Fiabilité des résultats et référencement La popularité d’une page dépend de plusieurs facteurs : - le nombre de pages qui ont des liens vers cette page - le nombre de fois que les utilisateurs choisissent cette page en réponse à une requêtes - l’achat de mots-clés au moteur de recherche par un site afin que ses pages soient mieux classées Les moteurs de recherche combinent ces facteurs selon des algorithmes gardés secrets et qui évoluent régulièrement. C’est pourquoi les réponses varient d’un moteur à l’autre. Certains sites tentent de tromper les moteurs de recherche en utilisant des textes cachés ou en plaçant exprès certains mots-clés dans les zones de titres afin d’améliorer le référencement. L’algorithme le plus connu est le PageRank de Google qui attribue à chaque page un nombre entre 0 et 1 à partir de deux règles : - score élevé si la page est référencée dans une autre page bien notée - score baissé si la page est référencée dans une page parmi plein d’autres références 4. Bonus : comparaison entre les moteurs de recherche et e-Sidoc e-Sidoc est le portail documentaire du CDI. Il fonctionne un peu comme un moteur de recherche mais en cherchant dans une base limitée (celle du CDI) ≠ à l’univers infini du Web. Les réponses qu’il fournit à une recherche ont été validées en amont par le professeur-documentaliste ≠ aux résultats des moteurs de recherche qu’il faut valider soi-même. Fonctionnement des deux avec des mots-clés que l’on peut combiner entre eux : ET / OU = opérateurs booléens. Faire se lever les élèves en fonction de consignes différentes pour déterminer lequel donne le plus de résultats. Mme Suaudeau SNT 2019-2020.
Recommended publications
  • Microsoft Advertising Client Brochure
    EXPANDMicrosoft YOUR Advertising. CUSTOMER Intelligent AUDIENCE connections. WITH Microsoft Advertising Thank you for considering Microsoft Advertising Though you may already advertise through other platforms such as Google Ads, the Microsoft Search Network can boost traffic by offering an additional customer audience and increase diversity, growth and profits for your business. Globally, the Microsoft 11 billion monthly Search Network is continually searches 1 expanding its reach in 37 markets The Microsoft Search Network powers millions of searches in Canada:2 14 MILLION 296 MILLION 22% unique searchers monthly of the PC who represent searches search market Reach a diverse audience in Canada3 34% 56% 43% have a household income are under the age of have graduated college of $80K+ CAD 45 (16-44 years old) (university/postgraduate degree) High-quality partnerships and The Microsoft Search Network integration add value to the reaches people across multiple Microsoft Search Network devices and platforms4 • Bing powers AOL web, mobile and tablet • Bing search is built into Windows 10, which is now search, providing paid search ads to AOL on over 800 million devices. properties worldwide. • Bing powers Microsoft search, which is a unified • Windows 10 drives more engagement and delivers search experience for enterprises including Office, more volume to the Microsoft Search Network. SharePoint and Microsoft Edge. • Our partnerships with carefully vetted search • Bing is on phones, tablets, PCs and across many partners, like The Wall Street
    [Show full text]
  • Ecosia Launches First Ever Out-Of-Home and TV Advertising Campaign to Let New Users Know They Can Plant Trees with Their Searches - No Matter How Weird
    Ecosia launches first ever Out-of-Home and TV advertising campaign to let new users know they can plant trees with their searches - no matter how weird The green search engine is launching its first international campaign on April 5th across 12 European cities with JCDecaux and Sky Media ● Ecosia’s ‘Weird Search Requests’ campaign will be live in France, Germany, the Netherlands and the UK ● Its concept expands on award-winning brand video created by students at Germany’s Filmakademie Baden-Wuerttemberg ● The campaign has been developed together with JCDecaux’s Nurture Programme - which helps start-ups and scale-ups across Europe to scale their brand - and Sky Media in the UK ● With more than 120 million trees already funded by users’ searches globally, Ecosia expects the campaign across 12 European cities to generate significant growth for tree planting Paris, France, 31 st March, 2021: Green search engine Ecosia is launching its first major international brand campaign in cities across Europe, to let millions of potential new users know that they can turn their searches, no matter how weird, into trees. The user-generated campaign speaks directly to users in each city, by displaying searches they have shared via social media. It has been developed together with JCDecaux, the world’s leading out-of-home advertising company, and Sky Media in the UK. The concept stems from an award-winning brand video created by students at Germany’s Filmakademie Baden-Wuerttemberg, originally the students’ own initiative for a class assignment. It shows Ecosia users making unusual searches on the app, which leads to a tree sprouting up wherever the search was made - ranging from a bus to a lecture hall - and ending with the words: “No matter how weird your search request, we’ll plant the trees anyway .” The film won a number of awards, including silver in the prestigious Young Director Award award at Cannes.
    [Show full text]
  • PRESS RELEASE for IMMEDIATE RELEASE Media Contact Name: Kendall Garifo Email: [email protected]
    PRESS RELEASE FOR IMMEDIATE RELEASE Media Contact Name: Kendall Garifo Email: [email protected] ​ Ecosia and Trees for the Future announce partnership to plant trees in West Africa SILVER SPRING, MARCH 2, 2018— Trees for the Future (TREES) and Ecosia join forces to plant trees ​ ​ to combat climate change, revitalize degraded lands and improve livelihoods. TREES fully supports Ecosia in its ambitious journey to plant one billion trees by 2020. Ecosia has committed to fund a new Forest Garden site in Kaffrine, Senegal and plant 1.2 Million trees over a 46 month period. An estimated 300 farming families will benefit from this project. About Trees for the Future Trees for the Future (TREES) is an international development non profit that meets a triple bottom line: poverty alleviation, hunger eradication, and healing the environment. Through our Forest Garden Approach they train farmers to plant and manage Forest Gardens that sustainably feed families and raise their incomes by 400 percent. TREES receives donations to implement their work in areas where they can have the greatest impact. TREES currently works across five countries in Sub-Saharan Africa: Cameroon, Kenya, Senegal, Tanzania, and Uganda. Since 1989, TREES has planted nearly 150 million trees.​ Learn ​ more at trees.org. ​ ​ ​ About Ecosia Ecosia is the search engine that plants trees with its ad revenue. They are committed to donating at least ​ 80 percent of its sponsored links income to tree planting projects around the world. It is Ecosia’s mission to plant 1 billion trees by 2020. Ecosia looks to support poor agricultural communities by planting a trees.
    [Show full text]
  • The Ecosia on Campus Movement Tree Planting Quiz Q&A
    Baden-Württemberg! ● What is Ecosia? ● The Ecosia on Campus movement ● Tree planting quiz ● Q&A Fred Henderson Project coordinator Business development team ● Ecosia on Campus co-founder ○ Supporting students at 250 universities ● Organizational support ○ Corporate companies switching to Ecosia ● User support ○ Automating and answering user questions Where does Ecosia plant trees? ● 60 projects across 30 countries ● Focus on biodiversity hotspots ● New contracts signed in India, Australia + Malawi Why trees? ● Ecosia’s 100 million trees are sequestering between 1 - 3.5M of tons of carbon dioxide each year ● Trees benefit both people and planet ● Each project serves a unique purpose Your trees in Ethiopia ● Project with environmental and humanitarian NGO Green Ethiopia ● Over 9 million trees planted Video caption if needed Your trees in Australia ● In response to the bushfires in 2020. ● Ecosia dedicates search revenue from a day to finance a project with ReforestNOW ● 26,000 trees planted in a subtropical area Your trees in South America How does Ecosia make money? ● Ecosia generates revenue from users clicking on ads ● Ecosia is completely transparent with its finances. ● 80% of Ecosia’s profits each month goes towards financing tree-planting projects. ● 20% is put into a reserve to ensure Ecosia is always sustainable. Video caption if needed “Ecosia is not just producing enough solar energy to power all Ecosia searches with renewables - we are producing twice as much.” Wolfgang Oels Ecosia’s COO Video caption if needed Which Universities have switched? ● A global movement of students campaigning to make Ecosia the default search engine at their university; ● More than 250 active campaigns worldwide; What is Ecosia on ● Over 200,000 trees financed by Campus? students searches.
    [Show full text]
  • Download on Our Platform and We Have Obtained Licenses from Many Content Providers
    Table of Contents UNITED STATES SECURITIES AND EXCHANGE COMMISSION Washington, D.C. 20549 Form 20-F (Mark One) ¨ REGISTRATION STATEMENT PURSUANT TO SECTION 12(b) OR 12(g) OF THE SECURITIES EXCHANGE ACT OF 1934 or x ANNUAL REPORT PURSUANT TO SECTION 13 OR 15(d) OF THE SECURITIES EXCHANGE ACT OF 1934 For the fiscal year ended December 31, 2013. or ¨ TRANSITION REPORT PURSUANT TO SECTION 13 OR 15(d) OF THE SECURITIES EXCHANGE ACT OF 1934 For the transition period from to or ¨ SHELL COMPANY REPORT PURSUANT TO SECTION 13 OR 15(d) OF THE SECURITIES EXCHANGE ACT OF 1934 Date of event requiring this shell company report Commission file number: 000-51469 Baidu, Inc. (Exact name of Registrant as specified in its charter) N/A (Translation of Registrant’s name into English) Cayman Islands (Jurisdiction of incorporation or organization) Baidu Campus No. 10 Shangdi 10th Street Haidian District, Beijing 100085 The People’s Republic of China (Address of principal executive offices) Jennifer Xinzhe Li, Chief Financial Officer Telephone: +(86 10) 5992-8888 Email: [email protected] Facsimile: +(86 10) 5992-0000 Baidu Campus No. 10 Shangdi 10th Street, Haidian District, Beijing 100085 The People’s Republic of China (Name, Telephone, Email and/or Facsimile number and Address of Company Contact Person) Securities registered or to be registered pursuant to Section 12(b) of the Act: Title of Each Class Name of Each Exchange on Which Registered American depositary shares (ten American depositary shares representing one Class A ordinary share, The NASDAQ Stock Market LLC par value US$0.00005 per share) (The NASDAQ Global Select Market) Class A ordinary shares, par value US$0.00005 per share* The NASDAQ Stock Market LLC (The NASDAQ Global Select Market) * Not for trading, but only in connection with the listing on The NASDAQ Global Select Market of American depositary shares.
    [Show full text]
  • Applications: S
    Applications: S This chapter contains the following sections: • Sabah, on page 9 • Safari, on page 10 • SAFT, on page 11 • Sage, on page 12 • Sahibinden, on page 13 • Saks Fifth Avenue, on page 14 • Salesforce.com, on page 15 • Salesforce.com Live Agent, on page 16 • Sam's Club, on page 17 • Sametime, on page 18 • SAMR, on page 19 • Samsung, on page 20 • Samsung Push Notification, on page 21 • SANity, on page 22 • Sanook.com, on page 23 • SAP, on page 24 • SAP HostControl, on page 25 • SASCDN, on page 26 • SATNET, on page 27 • SATNET and Backroom EXPAK, on page 28 • SATNET Monitoring, on page 29 • SaveFrom, on page 30 • Sberbank of Russia, on page 31 • SBS, on page 32 • SCC Security, on page 33 • SCCM, on page 34 • SCCM Remote Control, on page 35 • SCCP, on page 36 • Schedule Transfer Protocol, on page 37 • schuelerVZ, on page 38 • Schwab, on page 39 • ScienceDirect, on page 40 Applications: S 1 Applications: S • SCO Desktop Administration Server, on page 41 • Sco I2 Dialog Daemon, on page 42 • SCO System Administration Server, on page 43 • SCO Web Server Manager 3, on page 44 • SCO WebServer Manager, on page 45 • scohelp, on page 46 • Scopia, on page 47 • Scopia Audio, on page 48 • Scopia Video, on page 49 • Scorecard Research, on page 50 • Scottrade, on page 51 • SCPS, on page 52 • Scribd, on page 53 • Scribd Upload, on page 54 • Scribol, on page 55 • SCSI-ST, on page 56 • SCTP, on page 57 • scx-proxy, on page 58 • SDNS-KMP, on page 59 • SDRP, on page 60 • Seamonkey, on page 61 • Search-Result.com, on page 62 • Searchnu, on page 63 •
    [Show full text]
  • Ecosia at Your Faculty
    Ecosia at your Faculty Imagine your faculty could contribute to reforestation around the world with every internet search that is being done at the faculty. This opportunity can easily be offered by Ecosia. Ecosia is a search engine, just like Google and was founded in 2009 in Berlin. Like most search engines, it makes its money by clicks on advertisements. However in contrast to other search engines, Ecosia uses 80% or more of its profit to plant trees. This way Ecosia is effectively fighting deforestation worldwide. Ecosia calculated that on average every 45 searches are sufficient to plant one tree. So far, they were able to plant over 120 Million trees (February 2021). Ecosia is very transparent when it comes to the usage/storage of data, about privacy and financial reporting. Ecosia on campus is an initiative led by students around the world to implement Ecosia as the standard search engine at their Universities. Over 175,000 trees have already been financed by students' searches! With your help as faculty we’ll be able to plant thousands more! In January 2021, Maastricht University (as the first University in The Netherlands) has set the sustainable search engine Ecosia as the default search engine on 800 Student Desktops (SDT) in both library locations and the Randwyck Computer Facilities. To make UM even more sustainable we are planning to extend this project across all faculties. The sustainable search engine is easy to implement and it is possible to install it on every other UM workstation. For more information about the change at UM please refer to: https://www.maastrichtuniversity.nl/news/ecosia-now-default-800-student-desktops How to switch your standard search engine to Ecosia? The switch is very easy.
    [Show full text]
  • User Browsing Graph: Structure, Evolution and Application1
    User Browsing Graph: Structure, Evolution and Application1 Yiqun Liu, Min Zhang, Shaoping Ma, Liyun Ru State Key Lab of Intelligent Technology and Systems Tsinghua National Laboratory for Information Science and Technology Department of Computer Science and Technology Tsinghua University, Beijing, 100084, China P.R. [email protected] ABSTRACT don’t work very well in this new situation. This paper focuses on ‘user browsing graph’ which is constructed Recently, the wisdom of the crowd is paid much attention in with users’ click-through behavior modeled with Web access logs. Web search researches, e.g. [7], [8] and [9]. In their work, users’ User browsing graph has recently been adopted to improve Web browsing behavior is usually considered as implicit feedback search performance and the initial study shows it is more reliable information for page relevance and importance. For example, Liu than hyperlink graph for inferring page importance. However, et. al. constructed ‘user browsing graph’ with search log data [10]. structure and evolution of the user browsing graph haven’t been They proposed a page importance estimation algorithm called fully studied and many questions remain to be answered. In this BrowseRank which performs on the user browsing graph. It is paper, we look into the structure of the user browsing graph and believed that the link structure in user browsing graph is more its evolution over time. We try to give a quantitative analysis on reliable than hyperlink graph because users actually follow links the difference in graph structure between hyperlink graph and in the browsing graph. user browsing graph, and then find out why link analysis Liu’s initial study shows that the BrowseRank algorithm works algorithms perform better on the browsing graph.
    [Show full text]
  • Sogou Announces Fourth Quarter and Full Year 2020 Results BEIJING, China, February 4, 2021 – Sogou Inc. (NYSE: SOGO) ("So
    Sogou Announces Fourth Quarter and Full Year 2020 Results BEIJING, China, February 4, 2021 – Sogou Inc. (NYSE: SOGO) ("Sogou" or the “Company"), an innovator in search and a leader in China's internet industry, today announced its unaudited financial results for the fourth quarter and full year, ended December 31, 2020. Fourth Quarter 2020 Financial Results Total revenues1 were $189.5 million, a 37% decrease year-over-year. The decrease was primarily driven by uncertainties with respect to Sogou’s business policies among certain advertisers as a result of the previously-announced proposal by Tencent Holdings Limited (“Tencent”) to take Sogou private, as well as reduced traffic acquisition activity. • Search and search-related revenues were $166.7 million, a 39% decrease year- over-year. Auction-based pay-for-click services decreased year-over-year, accounting for 84.0% of search and search-related revenues, compared to 88.2% in the corresponding period in 2019. • Other revenues were $22.8 million, a 14% decrease year-over-year. The decrease was primarily due to decreased revenues from non-core businesses. Cost of revenues was $151.2 million, a 10% decrease year-over-year. Traffic acquisition cost, a primary driver of cost of revenues, was $115.5 million, a 10% decrease year-over-year, representing 60.9% of total revenues, compared to 42.8% in the corresponding period in 2019. The decrease in traffic acquisition costs was driven by decreased traffic acquisition from third parties. Gross profit and non-GAAP2 gross profit were both $38.3 million, a 71% decrease year- over-year for both.
    [Show full text]
  • Is Google the Next Microsoft? Competition, Welfare and Regulation in Online Search
    IS GOOGLE THE NEXT MICROSOFT? COMPETITION, WELFARE AND REGULATION IN ONLINE SEARCH RUFUS POLLOCK UNIVERSITY OF CAMBRIDGE Abstract. The rapid growth of online search and its centrality to the ecology of the Internet pose important questions: why is the search engine market so concentrated and will it evolve towards monopoly? What implications does this concentration have for consumers, search engines, and advertisers? Does search require regulation and if so in what form? This paper supplies empirical and theoretical material with which to examine these questions. In particular, we (a) show that the already large levels of concentration are likely to continue (b) identify the consequences, negative and positive, of this outcome (c) discuss the regulatory interventions that policy-makers could use to address these. Keywords: Search Engine, Regulation, Competition, Antitrust, Platform Markets JEL Classification: L40 L10 L50 Emmanuel College and Faculty of Economics, University of Cambridge. Email: [email protected] or [email protected]. I thank the organizers of 2007 IDEI Toulouse Conference on the Economics of the Software and Internet Industries whose search round-table greatly stimulated my thoughts on this topic as well as participants at seminars in Cambridge and at the 2009 RES conference. Particular thanks is also due to the editor and anonymous referee who provided an excellent set of comments that served to greatly improve the paper. Finally, I would like to thank http://searchenginewatch.com and Hitwise who both generously provided me with access to some of their data. This work is licensed under a Creative Commons Attribution Licence (v3.0, all jurisdictions).
    [Show full text]
  • Thiswaifudoesnotexist.Net: Google Analytics: All Traffic 20190218-20190720
    This Waifu Does Not Exist Analytics All Web Site Data Go to report All Traffic Feb 18, 2019 - Jul 20, 2019 All Users 100.00% Users Explorer Summary Users 600,000 400,000 200,000 March 2019 April 2019 May 2019 June 2019 July 2019 Acquisition Behavior Conversions Source / Medium Avg. Goal Bounce Pages / Goal Goal Users New Users Sessions Session Conversion Rate Session Completions Value Duration Rate 1,161,978 1,170,333 1,384,602 46.58% 8.74 00:01:48 0.00% 0 $0.00 % of Total: % of Total: % of Total: Avg for Avg for Avg for Avg for % of Total: % of 100.00% 100.07% 100.00% View: View: View: View: 0.00% Total: (1,161,978) (1,169,566) (1,384,602) 46.58% 8.74 00:01:48 0.00% (0) 0.00% (0.00%) (0.00%) (0.00%) (0.00%) ($0.00) 1. (direct) / (none) 966,071 967,968 1,121,738 45.78% 8.39 00:01:47 0.00% 0 $0.00 (82.06%) (82.71%) (81.02%) (0.00%) (0.00%) 2. google / organic 30,702 28,990 44,887 48.86% 15.63 00:02:49 0.00% 0 $0.00 (2.61%) (2.48%) (3.24%) (0.00%) (0.00%) 3. m.facebook.com / referral 22,693 22,544 24,304 46.93% 4.77 00:00:51 0.00% 0 $0.00 (1.93%) (1.93%) (1.76%) (0.00%) (0.00%) 4. away.vk.com / referral 20,111 19,178 26,667 47.13% 13.97 00:02:31 0.00% 0 $0.00 (1.71%) (1.64%) (1.93%) (0.00%) (0.00%) 5.
    [Show full text]
  • Information Rereival, Part 1
    11/4/2019 Information Retrieval Deepak Kumar Information Retrieval Searching within a document collection for a particular needed information. 1 11/4/2019 Query Search Engines… Altavista Entireweb Leapfish Spezify Ask Excite Lycos Stinky Teddy Baidu Faroo Maktoob Stumpdedia Bing Info.com Miner.hu Swisscows Blekko Fireball Monster Crawler Teoma ChaCha Gigablast Naver Walla Dogpile Google Omgili WebCrawler Daum Go Rediff Yahoo! Dmoz Goo Scrub The Web Yandex Du Hakia Seznam Yippy Egerin HotBot Sogou Youdao ckDuckGo Soso 2 11/4/2019 Search Engine Marketshare 2019 3 11/4/2019 Search Engine Marketshare 2017 Matching & Ranking matched pages ranked pages 1. 2. query 3. muddy waters matching ranking “hits” 4 11/4/2019 Index Inverted Index • A mapping from content (words) to location. • Example: the cat sat on the dog stood on the cat stood 1 2 3 the mat the mat while a dog sat 5 11/4/2019 Inverted Index the cat sat on the dog stood on the cat stood 1 2 3 the mat the mat while a dog sat a 3 cat 1 3 dog 2 3 mat 1 2 on 1 2 sat 1 3 stood 2 3 the 1 2 3 while 3 Inverted Index the cat sat on the dog stood on the cat stood 1 2 3 the mat the mat while a dog sat a 3 cat 1 3 dog 2 3 mat 1 2 Every word in every on 1 2 web page is indexed! sat 1 3 stood 2 3 the 1 2 3 while 3 6 11/4/2019 Searching the cat sat on the dog stood on the cat stood 1 2 3 the mat the mat while a dog sat a 3 cat 1 3 query dog 2 3 mat 1 2 cat on 1 2 sat 1 3 stood 2 3 the 1 2 3 while 3 Searching the cat sat on the dog stood on the cat stood 1 2 3 the mat the mat while a dog sat a 3 cat
    [Show full text]