Web Search Algorithms

Web Search Algorithms

- 1 -

Why web search in this module?
• WWW is the delivery platform and the interface
• How do we find information and services on the web … we try to generate a URL that seems sensible
  – Dell Computers – www.dell.ie, Ford Ireland – www.ford.ie
  – But products?
    • GPS Devices – www.gps.ie is not ok
• Or, we use a Search Engine
  – So we rely on Search Engines - we even use them to look up spellings and as a calculator!
• Search Engines bring people to a website
  – For most, such as Google, the ranking algorithm is closely guarded, wholesome, true, uncorrupted, and not paid for
    • Advertisements are merely sold based on similarity to query keywords.
• This leads to the industry of Search Engine Optimisation (SEO) ... the “Google Dance”

- 2 -

Text IR - Google as example
• Google has been operational since 1998
  – Two PhD students from Stanford
• ?? Billion documents
  – Early search engines competed on size of index, related to how powerful their infrastructure was. Not an issue now.
  – Stopped advertising its index size after 8,168,684,336 pages in Aug 2005
  – Size now, effectively unknown
• Also has ??? billion images
  – not all unique images
  – Flickr has about 2B (Nov 2007); Facebook had 4.1B at that time

- 3 -

Searching or Marketing?
• However, Search Engines must make a profit!
  – Advertisement Sales
  – Marketing
  – Paid Listings
  – And selling their indexes
• A lot of Search Engines are also marketing companies…
  – This is at odds with the idea that a search engine is a page you visit on the way elsewhere.
    • The less time you spend there the better!
• But, many people ‘pass through the doors’, so they sell query-focussed advertisements
  – You can estimate by looking at the main page of the search engine.

- 4 -

How do SEs help user searches
• It is known that we search for
  – people / home pages.
  – companies / company HPs (or guess from URLs).
  – a particular product or service.
  – a fact, buried in one or more documents, any one of which will do…
  – a document, an entire document, with text/image, and nothing smaller will do.
  – an overview on a broad or narrow topic
  – Media Search
    • an MPEG-4 file.
    • Through image databases.
    • Through (digital) video library, and/or through a video.
• If the SE knows the type of query, then ranking can be tailored to that query, because different search types can be satisfied by different search algorithms.

- 5 -

Search Engines
• Originally SEs were web directories
  – Manually generated (e.g. Yahoo!)
• Then automatic crawler-based Search Engines developed
  – The web got big and manual categorisation was becoming too difficult (e.g. Lycos)
  – Today the large SEs index over ?? billion web pages.
  – The first crawler-based SE was the WWWW in 1994

- 6 -

Architecture of a Search Engine

- 7 -

My Google!

- 8 -

Bing

- 9 -

Facebook? Is it a Search Engine?

- 10 -

Facebook Social Graph
[Figure labels: A Previous Class; College Friends; Friends; IR Research Community]

- 11 -

TWITTER

- 12 -

The Landscape is changing

- 13 -

Web 1.0 to Web 3.0
• Web 1.0
  – Static content... Companies created content
  – We were consumers
• Web 2.0
  – User generated content
  – Communities and creators... We create, filter, recommend the content
• Web 3.0
  – UGC and... Semantic Web... Life streams?
  – Social and Location
  – What is the next big thing?

- 14 -

Web 1.0
• Search engines over prepared and planned content
• Organisations and some users
• SEO was the way to optimise Web 1.0
• HTML and static content

- 15 -

- 16 -

Web 2.0
• User and Organisation Generated Content
• Social Graphs
• Social Filtering and Social Ranking
• Examples:
  – Social networks: Facebook, Twitter, LinkedIn
  – Shared bookmarks: Digg, Delicious, Reddit, StumbleUpon
  – Social media sharing: Flickr, YouTube
  – Blogs (MSN Spaces, WordPress, Blogger)
  – Even 3D social worlds... Social gaming?

- 17 -

- 18 -

Web 3.0
• Semantic Web
  – Many media types... Integrated for smarter uses
• Rich media integration
• Personalisation to the user context
• Life streaming of content
  – We are integrated into our own entertainment

- 19 -

What is Web 3.0 about?

- 20 -

The Search Landscape Changing enormously

- 21 -

Continuous Partial Attention
• Be aware of Continuous Partial Attention... a kind of multitasking
• Skimming the surface of the incoming data, picking out the relevant details, and moving on to the next stream.
• Continuous, not episodic
• Cast a wider net, but never full attention
• So... how does this impact on search?
http://www.wisegeek.com/what-is-continuous-partial-attention.htm

- 22 -

And don’t forget the twitter curve…
http://headrush.typepad.com/creating_passionate_users/2006/12/httpwww37signal.html

- 23 -

Google AdSense

- 24 -

Spamming
• Spamming is a technique based on the manipulation of content in order to affect ranking by search engines
  – Bogus meta tags, hidden text, plan text…
  – Also link spamming…
• Huge SE resources are used in defeating spamming - more than in search quality improvement!
• Getting in the top 10 is essential for businesses
  – 85% of users only look at the top 10.
  – Led to the business of Search Engine Optimisation

- 25 -

Search Engine Ranking
As we all know, simply examining web page content as text is not enough. We need to examine ranking factors, positive and negative.

- 26 -

Positive Ranking Factors: Term Location
• In the TITLE of the page, most important
• In the body of the text, but must MAKE SENSE
• In the Heading text (H1, H2…)
• In the Domain Name
  – Also in page URL
• In the ALT attribute and image title
• In BOLD/STRONG tags
• Terms near the top are likely ranked higher than other terms

- 27 -

Positive Ranking Factors: Page Attributes
• Importance of the page in the Website
  – Number of links to it from the same website
• Quality of links to other pages
• Age of a document
  – Older may be more authoritative
    • We will see authorities later!
  – Newer may be better for some queries (e.g. news)
• Amount of text on the page
• Structure of the page
• Frequency of updates
• Spelling and correctness of HTML

- 28 -

SE Ranking +: Website Issues
• Linkage of the Website
  – Global link popularity of the website
    • Like a global PageRank (SiteRank)
  – Relevance of the links into the website
  – Link popularity of the site in a topical community
  – Rate of new inbound links to a website
• Age of a website (older is better)
• Freshness of a website (new pages are better)
• Relevancy of the website (as well as the page)
• Clickthrough rate for the website
• Reputation of the top-level domain
  – E.g. .GOV & .EDU … cannot easily be bought

- 29 -

SE Ranking +: Linkage Issues
• Anchor text of inbound links as a description of the WWW page
  – Also text surrounding the link into the webpage
• Topical relationship between source and target of link
• Link popularity of the page in a topical community
• Age of links
  – The older the better, i.e. long-lasting links
• PageRank of the webpage
  – Google’s PageRank algorithm
• Number of links into a web page

- 30 -

Positive Ranking Factors: Images
• Images on a web page
  – Can provide a chance to express ideas in a visual way that can convey a considerable amount of information
  – Add to the attractiveness and perceived quality of a site.
  – Recent Microsoft Patent on “Scoring Relevance of a Document Based on Image Text”
  – Also, remember to name the image properly and include an alt attribute

- 31 -

Negative Ranking Factors
• Link Farm Participation
  – Tries to artificially increase PageRank
• Proportion of links to or from known Spamming sites
• Duplicate Content to already indexed content
• Server Errors or server down-time
• External links to low-quality content
• Low level of visitors to the website
• Attempts to include hidden text on the page

- 32 -

Using the Ranking Factors…
[Diagram labels: User Query; Term Location Factors; Page Factors; Website Factors; Linkage Factors; PageRank Factors; Negative Factors; Result]
The Search Engine ranking process is a closely guarded trade secret of the search engines.

- 33 -

So let’s look in some detail at some of these ranking factors…
Linkage-based Search

- 34 -

The Shape of the WWW
This is based on a study of 200 million web pages. Scale up to WWW scale.

- 35 -

Spidering: finding WWW content
• A Search Engine needs to find WWW content for its index
  – This is done by the spidering software
• Starting from some ‘seed’ WWW pages, the spider software downloads these pages and extracts the links, thereby learning about new pages to crawl.
• WWW-scale crawling means crawling thousands of pages per second

- 36 -

A Basic Crawling Algorithm
• You need to be linked to from the main WWW. Remember the shape!
• Given a set of ‘seed’ URLs (WWW page addresses):
  – Add them to a (priority) queue of URLs
  – While the queue is not empty (!empty)
    • Take the first URL (u) off the queue
    • Download the WWW page for u
    • Store the URL in a list of seen URLs
    • Index it
    • If u is an HTML page, extract the links (y)
      – For each y, add it to the queue if it has not been visited before

- 37 -

Spiders must behave!
• Most crawlers/spiders will follow some rules:
  – A spider must never request large numbers of documents from the same host sequentially… change the target website as often as is feasible.
  – A spider must never (for whatever reason) repeatedly request the same document. If a document is unavailable, … its position in the queue must be penalized … Repeated failures must be taken into account and the document flagged as unavailable and taken off the queue.
  – A spider must respect the author’s wishes as expressed using the robots exclusion protocol

- 38 -

Robots Exclusion
Robots Exclusion allows Web site administrators to indicate to visiting robots which parts of their site should not be visited by the robot.
Most good robots will process it… BUT it makes a crawler less efficient… more explorative crawling required…

To exclude all robots from the entire server
    User-agent: *
    Disallow: /

To allow all robots complete access
    User-agent: *
    Disallow:

To exclude all robots from part of the server
    User-agent: *
    Disallow: /cgi-bin/
    Disallow: /private/

To exclude a single robot
    User-agent: BadBot
    Disallow: /

To allow a single robot
    User-agent: WebCrawler
    Disallow:

    User-agent: *
    Disallow: /

- 39 -

Robots.txt example

- 40 -

Another Example

- 41 -

And one more…

- 42 -

Simple Overview WWW 1.
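
The basic crawling algorithm and the robots exclusion rules described in the slides can be sketched together in a few lines of Python. This is only a minimal sketch under stated assumptions, not any search engine's actual crawler: the in-memory FAKE_WEB dictionary, the example.ie URLs, the "SketchBot" agent name and the crawl function are all hypothetical and stand in for real HTTP downloads, and a plain FIFO queue is used where the slides allow a priority queue. The robots.txt rules are the "exclude part of the server" example from the slides, checked with the standard library's urllib.robotparser.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.robotparser import RobotFileParser

# Hypothetical stand-in for the web: URL -> HTML body (no network access needed).
FAKE_WEB = {
    "http://example.ie/": '<a href="/a.html">A</a> <a href="/private/x.html">X</a>',
    "http://example.ie/a.html": '<a href="/">home</a> <a href="b.html">B</a>',
    "http://example.ie/b.html": "<p>no links here</p>",
    "http://example.ie/private/x.html": "<p>should not be crawled</p>",
}

# robots.txt rules in the form shown in the slides: exclude part of the server.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /private/
"""


class LinkExtractor(HTMLParser):
    """Collects the href value of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(seeds, robots_txt, agent="SketchBot"):
    """Breadth-first crawl over FAKE_WEB; returns URLs in the order indexed."""
    robots = RobotFileParser()
    robots.parse(robots_txt.splitlines())

    queue = deque(seeds)   # FIFO queue of URLs still to visit
    seen = set(seeds)      # URLs already queued, so nothing is requested twice
    indexed = []

    while queue:
        url = queue.popleft()            # take the first URL (u) off the queue
        if not robots.can_fetch(agent, url):
            continue                     # respect the robots exclusion protocol
        page = FAKE_WEB.get(url)         # "download" the page for u
        if page is None:
            continue                     # unavailable: dropped here, not retried
        indexed.append(url)              # "index" it
        extractor = LinkExtractor()
        extractor.feed(page)
        for href in extractor.links:
            link = urljoin(url, href)    # resolve relative links against u
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return indexed


if __name__ == "__main__":
    for u in crawl(["http://example.ie/"], ROBOTS_TXT):
        print(u)
```

With these assumed pages, the crawl indexes the home page, a.html and b.html, and skips /private/x.html because of the Disallow rule. One design choice differs slightly from the slide wording: URLs are marked as seen when they are queued rather than when they are downloaded, which keeps the same page from being queued twice. A real spider would also need the per-host politeness and failure handling listed under "Spiders must behave!".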