Hypertext Databases

Total Page:16

File Type:pdf, Size:1020Kb

Hypertext Databases Hypertext databases • Academia – Digital library, web publication • Consumer – Newsgroups, communities, product reviews • Industry and organizations – Health care, customer service – Office documents, email The Web is bigger than the sum of its parts Search products and services • Verity • Inktomi (HotBot) •Fulcrum • Alta Vista •PLS • Google! • Oracle text extender • Yahoo! • DB2 text extender • Infoseek Internet • Infoseek Intranet •Lycos • SMART (academic) • Excite • Glimpse (academic) Map Local data FTP Gopher HTML More structure Indexing Crawling Search WebSQL WebL Relevance Ranking Social Latent Semantic Indexing Network XML of Hyperlinks Clustering Web Communities Collaborative Scatter- Filtering Gather Web Servers Topic Topic Directories Distillation Monitor User Mine Profiling Modify Semi-supervised Automatic Focused Learning Classification Crawling Web Browsers Keyword indexing • Boolean search My care is loss of care D1 – care AND NOT old with old care done • Stemming Your care is gain of D2 care with new care won – gain* • Phrases and proximity care D1: 1, 5, 8 – “new care” D2: 1, 5, 8 – loss NEAR/5 care new D2: 7 – <SENTENCE> old D1: 7 loss D1: 3 Tables and queries 1 POSTING select distinct did from POSTING where tid = ‘care’ except tid did pos select distinct did from POSTING where tid like ‘gain%’ care d1 1 care d1 5 with care d1 8 TPOS1(did, pos) as care d2 1 (select did, pos from POSTING where tid = ‘new’), care d2 5 TPOS2(did, pos) as care d2 8 (select did, pos from POSTING where tid = ‘care’) select distinct did from TPOS1, TPOS2 new d2 7 where TPOS1.did = TPOS2.did old d1 7 and proximity(TPOS1.pos, TPOS2.pos) loss d1 3 ……… proximity(a, b) ::= a + 1 = b abs(a - b) < 5 Relevance ranking • Recall = coverage Query “True response” – What fraction of relevant documents Search Compare were reported Consider Output sequence • Precision = accuracy prefix k 1 – What fraction of 0.8 reported documents 0.6 0.4 were relevant Precision 0.2 • Trade-off 0 0 0.2 0.4 0.6 0.8 1 Recall Vector space model and TFIDF • Some words are more important than others • W.r.t. a document collection D – d+ have a term, d- do not d + d – “Inverse document frequency” 1+ log + − • “Term frequency” (TF) d + – Many variants: n(d,t) n(d,t) , ∑ n(d,t) max n(d,t) t t • Probabilistic models Tables and queries 2 VECTOR(did, tid, elem) ::= With TEXT(did, tid, freq) as (select did, tid, count(distinct pos) from POSTING group by did, tid), LENGTH(did, len) as (select did, sum(freq) from TEXT group by did), DOCFREQ(tid, df) as (select tid, count(distinct did) from POSTING group by tid) select did, tid, (freq / len) * (1 + log((select count(distinct did from POSTING))/df)) from TEXT, LENGTH, DOCFREQ where TEXT.did = LENGTH.did and TEXT.tid = DOCFREQ.tid Similarity • Direct similarity … auto …car … auto …car … car… auto… auto …car – Cosine, normalized distance … car … auto … car … auto • Indirect similarity – auto and car co-occur often – They must be related car ≅ auto – Documents having related words are related … auto … • Useful for search and clustering ≅ … car … Latent semantic indexing Term Document k d Documents D V A t SVD U Terms d r k-dim vector Supervised learning (classification) • Many forms – Content: automatically organize the web per Yahoo! – Type: faculty, student, staff – Intent: education, discussion, comparison, advertisement • Applications – Relevance feedback for re-scoring query responses – Filtering news, email, etc. – Narrowing searches and selective data acquisition Difficulties • Dimensionality – Decision tree classifiers: dozens of columns – Vector space model: 50,000 ‘columns’ • Context-dependent noise – ‘Can’ (v.) considered a ‘stopword’ – ‘Can’ (n.) may not be a stopword in /Yahoo/SocietyCulture/Environment/Recycling • Need for scalability – High dimension needs more data to learn Techniques • Nearest neighbor + Standard keyword index also supports classification – How to define similarity? (TFIDF may not work) – Wastes space by storing individual document info • Rule-based, decision-tree based – Very slow to train (but quick to test) + Good accuracy (but brittle rules) • Model-based + Fast training and testing with small footprint The “bag-of-words” document model • Decide topic; topic c is picked with prior π ∑ π probability (c); c (c) = 1 • Each topic c has parameters θ(c,t) for terms t ∑ θ • Coin with face probabilities t (c,t) = 1 • Fix document length and keep tossing coin •Given c, probability of document is n(d) Pr[d | c] = ∏θ (c,t)n(d ,t) {n(d,t)} t∈d Feature selection Model with unknown parameters Confidence intervals T T p1 p1 p2 ... q1 q2 ... q1 Observed data N 0 1 ... ⊆ N Pick F T such that models built over F have high separation confidence Tables and queries 3 TAXONOMY pcid kcid kcname EGMAPR(did, kcid) ::= ((select did, kcid from EGMAP) union all 1 (select e.did, t.pcid from 12Arts EGMAPR as e, TAXONOMY as t 1 3 Science where e.kcid = t.kcid)) 34Math 35Physics STAT(pcid, tid, kcid, kdoc, knum) ::= (select pcid, tid, TAXONOMY.kcid, 1 count(distinct TEXT.did), sum(freq) from EGMAPR, TAXONOMY, TEXT 2 3 EGMAP where TAXONOMY.kcid = EGMAPR.kcid did kcid 4 5 and EGMAPR.did = TEXT.did group by pcid, tid, TAXONOMY.kcid) TEXT did tid freq Clustering • Standard notion from structured data analysis • Techniques – Agglomerative, k-means – Mixture models and Expectation Maximization • How to reduce distance computation time? – Sample points or directions at random – Pre-cluster (eliminate redundancy) – Project remaining points on to subspace Collaborative filtering • People=record, movies=features, cluster people • Both people and features can be clustered • For hypertext access, time of access is a feature • Need advanced models Batman Rambo Andre Hiver Whispers StarWars Lyle Ellen Jason Fred Dean Karen Hypertext models • c=class, t=text, N=neighbors • Text-only model: Pr[t|c] • Using neighbors’ text to judge my topic: " Pr[t, t(N) | c] • Better model: Pr[t, c(N) | c] • Non-linear relaxation Result of exploiting link features • Pretend to know only a % 40 of neighborhood topics 35 • Bootstrap using text-only 30 25 classifier 20 • Use non-linear relaxation %Error 15 10 to update topic 5 assignment iteratively 0 0 50 100 ☛Link information reduces %Neighborhood known error significantly Text Link Text+Link Hyperlink graph analysis • Hypermedia is a social network – Telephoned, advised, co-authored, paid, cited • Social network theory (cf. Wasserman & Faust) – Extensive research applying graph notions – Centrality – Prestige – Reflected prestige • Can be applied directly to Web search – HIT, Google, CLEVER, topic distillation Google and HITS • In-degree ≈ prestige • High prestige ⇔ good • Not all votes worth authority the same • High reflected • Prestige of a page is prestige ⇔ good hub the sum of prestige of • Bipartite iteration citing pages: p = Ep – a = Eh • Pre-compute query – h = ETa independent prestige – h = ETEh score Tables and queries 4 delete from HUBS; insert into HUBS(url, score) HUBS (select urlsrc, sum(score * wtrev) from AUTH, LINK url score where authwt is not null and type = non-local AUTH and ipdst <> ipsrc and url = urldst url score group by urlsrc); update HUBS set (score) = score / (select sum(score) from HUBS); update LINK as X set (wtfwd) = 1. / wgtfwd (select count(ipsrc) from LINK HUBS.score AUTH.score where ipsrc = X.ipsrc urlsrc urldst and urldst = X.urldst) @ipsrc @ipdst where type = non-local; LINK wgtrev urlsrc urldst ipsrc ipdst wgtfwd wtrev type Resource discovery ([DPSOH )HHGEDFN 7RSLF 7D[RQRP\ 6FKHGXOHU (GLWRU %URZVHU 'LVWLOOHU &UDZOHU 7D[RQRP\ &UDZO :RUNHUV 'DWDEDVH 'DWDEDVH +\SHUWH[W +\SHUWH[W &ODVVLILHU 7RSLF &ODVVLILHU /HDUQ 0RGHOV $SSO\ Resource discovery results Harvest Rate (Cycling, Unfocused) H arvest R ate (C ycling , S oft F ocus) 1 1 Avg over 100 0.9 0.9 •High rate of Avg over 1000 0.8 0.8 0.7 0.7 “harvesting” 0.6 0.6 0.5 0.5 0.4 0.4 Average Relevance relevant pages Average Relevance 0.3 0.3 0.2 0.2 0.1 Avg over 100 0.1 0 0 0 5000 10000 0 2000 4000 6000 • Robust to #U R Ls fetched # U R Ls fetch ed URL Coverage Distance to top authorities perturbations of 0.9 18 0.8 16 starting URLs 0.7 14 0.6 12 0.5 10 0.4 8 • Great resources Frequency 0.3 6 0.2 4 found 10 links Fraction of reference crawl 0.1 2 0 0 0 1000 2000 3000 123456789101112 from start set #URLs crawled by test crawler Shortest distance found (#links) Database issues • Useful features + Concurrency and recovery (for crawling) + I/O-efficient representation of mining algorithms + Enables ad-hoc semi-structured queries • Would help to have – Unlogged tablespaces, flexible choice of recovery – Index (-ed scans) over temporary table expressions – Efficient string storage and operations – Answering multiple queries approximately.
Recommended publications
  • Rdfa in XHTML: Syntax and Processing Rdfa in XHTML: Syntax and Processing
    RDFa in XHTML: Syntax and Processing RDFa in XHTML: Syntax and Processing RDFa in XHTML: Syntax and Processing A collection of attributes and processing rules for extending XHTML to support RDF W3C Recommendation 14 October 2008 This version: http://www.w3.org/TR/2008/REC-rdfa-syntax-20081014 Latest version: http://www.w3.org/TR/rdfa-syntax Previous version: http://www.w3.org/TR/2008/PR-rdfa-syntax-20080904 Diff from previous version: rdfa-syntax-diff.html Editors: Ben Adida, Creative Commons [email protected] Mark Birbeck, webBackplane [email protected] Shane McCarron, Applied Testing and Technology, Inc. [email protected] Steven Pemberton, CWI Please refer to the errata for this document, which may include some normative corrections. This document is also available in these non-normative formats: PostScript version, PDF version, ZIP archive, and Gzip’d TAR archive. The English version of this specification is the only normative version. Non-normative translations may also be available. Copyright © 2007-2008 W3C® (MIT, ERCIM, Keio), All Rights Reserved. W3C liability, trademark and document use rules apply. Abstract The current Web is primarily made up of an enormous number of documents that have been created using HTML. These documents contain significant amounts of structured data, which is largely unavailable to tools and applications. When publishers can express this data more completely, and when tools can read it, a new world of user functionality becomes available, letting users transfer structured data between applications and web sites, and allowing browsing applications to improve the user experience: an event on a web page can be directly imported - 1 - How to Read this Document RDFa in XHTML: Syntax and Processing into a user’s desktop calendar; a license on a document can be detected so that users can be informed of their rights automatically; a photo’s creator, camera setting information, resolution, location and topic can be published as easily as the original photo itself, enabling structured search and sharing.
    [Show full text]
  • A Comparison of Two Navigational Aids for Hypertext Mark Alan Satterfield Iowa State University
    Iowa State University Capstones, Theses and Retrospective Theses and Dissertations Dissertations 1992 A comparison of two navigational aids for hypertext Mark Alan Satterfield Iowa State University Follow this and additional works at: https://lib.dr.iastate.edu/rtd Part of the Business and Corporate Communications Commons, and the English Language and Literature Commons Recommended Citation Satterfield, Mark Alan, "A comparison of two navigational aids for hypertext" (1992). Retrospective Theses and Dissertations. 14376. https://lib.dr.iastate.edu/rtd/14376 This Thesis is brought to you for free and open access by the Iowa State University Capstones, Theses and Dissertations at Iowa State University Digital Repository. It has been accepted for inclusion in Retrospective Theses and Dissertations by an authorized administrator of Iowa State University Digital Repository. For more information, please contact [email protected]. A Comparison of two navigational aids for h3q5ertext by Mark Alan Satterfield A Thesis Submitted to the Gradxiate Facultyin Partial Fulfillment ofthe Requirements for the Degree of MASTER OF ARTS Department: English Major; English (Business and Technical Communication) Signatureshave been redactedforprivacy Iowa State University Ames, Iowa 1992 Copyright © Mark Alan Satterfield, 1992. All rights reserved. u TABLE OF CONTENTS Page ACKNOWLEDGEMENTS AN INTRODUCTION TO USER DISORIENTATION AND NAVIGATION IN HYPERTEXT 1 Navigation Aids 3 Backtrack 3 History 4 Bookmarks 4 Guided tours 5 Indexes 6 Browsers 6 Graphic browsers 7 Table-of-contents browsers 8 Theory of Navigation 8 Schemas ^ 9 Cognitive maps 9 Schemas and maps in text navigation 10 Context 11 Schemas, cognitive maps, and context 12 Metaphors for navigation ' 13 Studies of Navigation Effectiveness 15 Paper vs.
    [Show full text]
  • The Web 2.0 Way of Learning with Technologies Herwig Rollett
    Int. J. Learning Technology, Vol. X, No. Y, xxxx 1 The Web 2.0 way of learning with technologies Herwig Rollett* Know-Center, Inffeldgasse 21a A-8010 Graz, Austria E-mail: [email protected] *Corresponding author Mathias Lux Department for Information Technology University of Klagenfurt Universitätsstraße 65–67 A-9020 Klagenfurt, Austria E-mail: [email protected] Markus Strohmaier Department of Computer Science University of Toronto 40 St. George Street Toronto, Ontario M5S 2E4, Canada Know-Center, Inffeldgasse 21a A-8010 Graz, Austria E-mail: [email protected] Gisela Dösinger Know-Center, Inffeldgasse 21a A-8010 Graz, Austria E-mail: [email protected] Klaus Tochtermann Know-Center Institute of Knowledge Management Graz University of Technology Inffeldgasse 21a, A-8010 Graz, Austria E-mail: [email protected] Copyright © 200x Inderscience Enterprises Ltd. 2 H. Rollett et al. Abstract: While there is a lot of hype around various concepts associated with the term Web 2.0 in industry, little academic research has so far been conducted on the implications of this new approach for the domain of education. Much of what goes by the name of Web 2.0 can, in fact, be regarded as a new kind of learning technologies, and can be utilised as such. This paper explains the background of Web 2.0, investigates the implications for knowledge transfer in general, and then discusses its particular use in eLearning contexts with the help of short scenarios. The main challenge in the future will be to maintain essential Web 2.0 attributes, such as trust, openness, voluntariness and self-organisation, when applying Web 2.0 tools in institutional contexts.
    [Show full text]
  • Hypertext Newswriting Effects on Satisfaction, Comprehension and Attitudes João Canavilhas Universidade Da Beira Interior Key-W
    Hypertext newswriting effects on satisfaction, comprehension and attitudes João Canavilhas Universidade da Beira Interior Key-words: hypertext; web journalism; online jornalism Abstract More than a decade after the beginning of online journalism, the great expectations of this field of expertise remain unfulfilled. Hypertextuality, multimediality, and interactivity remain original marks as yet unexplored by journalism made on the web (though not for the web). In the specific case of hypertextuality, the subject of this study, its little use is linked with journalists’ time constraints, the absence of an organizational model not limited to copying printed papers, and the fear that readers may reject non-linear reading (Paul, 2005). The latter reason is likely what is most hindering the evolution of web journalism, despite the great potential of online news formed by networks of linked texts. To begin with, it frees the reader, giving them the option to create their own path of reading. Additionally, it offers extra contextual information about the subject, allowing readers to increase their knowledge on the subject according to their own interests. Despite the apparent advantages, such textual networks demand an additional effort of readers to interact with the theme. They must follow links, and skip from text to text, which may be perceived as obstructive to their reading. The passivity imposed by the most powerful medium, television, and the recipients’ deeply-rooted habit of linear readings raise an important question: Will readers value the hypertext contexts to such an extent that they will make that additional effort, or are the newspapers right in their fears of the readers’s reaction? In an attempt to answer this question, an experimental study was carried out assessing the effects of hypertext on levels satisfaction, perception, comprehension, and attitudes 1 of online news readers.
    [Show full text]
  • An Analysis of Dewey Linked Data
    Palabra Clave (La Plata) ISSN: 1853-9912 [email protected] Universidad Nacional de La Plata Argentina Classification on the Web: an analysis of Dewey Linked Data Tomoyose, Kazumi; Simionato Arakaki, Ana Carolina Classification on the Web: an analysis of Dewey Linked Data Palabra Clave (La Plata), vol. 9, núm. 2, 2020 Universidad Nacional de La Plata, Argentina Disponible en: http://www.redalyc.org/articulo.oa?id=350562513005 DOI: https://doi.org/10.24215/18539912e092 PDF generado a partir de XML-JATS4R por Redalyc Proyecto académico sin fines de lucro, desarrollado bajo la iniciativa de acceso abierto Palabra Clave (La Plata), 2020, 9(2), Abril-Septiembre, ISSN: 1853-9912 Avances de investigación Classification on the Web: an analysis of Dewey Linked Data La Clasificación en la web: un análisis de Dewey Linked Data Kazumi Tomoyose DOI: https://doi.org/10.24215/18539912e092 Universidade Federal de São Carlos, Brasil Redalyc: http://www.redalyc.org/articulo.oa? [email protected] id=350562513005 Ana Carolina Simionato Arakaki Universidade Federal de São Carlos, Brasil [email protected] Recepción: 23 Agosto 2019 Aprobación: 12 Noviembre 2019 Abstract: With the availability of information in the World Wide Web its access and retrieval by the users is facilitated, and the Library and Information Science (LIS) field’s knowledge and techniques can be applied to this environment in order to help with the process. e present study is descriptive, qualitative and exploratory, based on bibliographical sources, in which it was explored how the Classification discipline interacts with Linked Data, focusing on the analysis of Dewey Linked Data.
    [Show full text]
  • XHTML Rdfa Modules XHTML Rdfa Modules
    XHTML RDFa Modules XHTML RDFa Modules XHTML RDFa Modules Modules to support RDF annotation of elements W3C Editor’s Draft 11 August 2007 This version: http://www.w3.org/MarkUp/2007/ED-xhtml-rdfa-20070811 Latest version: http://www.w3.org/TR/xhtml-rdfa Previous Editor’s Draft: http://www.w3.org/MarkUp/2007/ED-xhtml-rdfa-20070705 Diff from previous Editor’s Draft: xhtml-rdfa-diff.html Editors: Mark Birbeck, x-port.net Ltd. Shane McCarron, Applied Testing and Technology, Inc. This document is also available in these non-normative formats: PostScript version, PDF version, ZIP archive, and Gzip’d TAR archive. The English version of this specification is the only normative version. Non-normative translations may also be available. Copyright © 2007 W3C® (MIT, ERCIM, Keio), All Rights Reserved. W3C liability, trademark and document use rules apply. Abstract The XHTML RDFa (RDF using attributes) modules define a collection of elements and attributes that enhance a document authors ability to annotate the relationships of content within and among documents. These modules can be integrated into any markup language based upon XHTML Modularization [XHTMLMOD [p.45] ]. Status of This Document This section describes the status of this document at the time of its publication. Other documents may supersede this document. A list of current W3C publications and the latest revision of this technical report can be found in the W3C technical reports index at http://www.w3.org/TR/. - 1 - Table of Contents XHTML RDFa Modules This document is an internal editors draft for development purposes. However, its content is based upon mature materials from [XHTML2 [p.45] ] and is therefore considered nearly complete.
    [Show full text]
  • An Overview of Hypertext Transfer Protocol Service Security on Business Domain
    2012 International Conference on Management and Education Innovation IPEDR vol.37 (2012) © (2012) IACSIT Press, Singapore An Overview of Hypertext Transfer Protocol service Security on Business Domain + Burra Venkata Durga Kumar Taylor’s Business School, Taylor’s University-Malaysia Abstract. Network protocols define the way data is transported between computers in a networked environment. Network protocols completely hide business functionality from higher level services and protocols, which can simply assume that, by providing the assigned name of another computer and company can transit a message or open a continuous communication stream without dealing with the intricacies of data transport. Populates Internet began to grow rapidly like expands with new tools, that is the new standard Hypertext Transfer Protocol and Hypertext Markup Language were introduced to the public. Hypertext Transfer Protocol to make accessing information through the Transfer Control Protocol or Internet Protocol is easier than ever. Hypertext Markup Language allows people to present information that is visually more interesting. Appearance of Hypertext Transfer Protocol and Hypertext Markup Language made people knows so popular, that is often considered synonymous with the Internet itself to the World Wide Web. The purpose of this paper is that knowing what kinds of security methods are appropriate for this Hypertext Transfer Protocol service and then explanations on World Wide Web and Hypertext Transfer Protocol works mechanisms. Proving the security methods of the answers obtained from what are assumed from the issues and included all models. Keywords: Network Protocols, Hypertext Transfer Protocol, Hypertext Markup Language, Transfer Control Protocol, Security, Business. 1. Introduction Nowadays, Internet service become very widely in business purpose and functionality, and for two popular internet services that common used by people, such as FTP (File Transfer Protocol) and HTTP (Hypertext Transfer Protocol).
    [Show full text]
  • Web 2.0: Hypertext by Any Other Name?
    Web 2.0: Hypertext by Any Other Name? David E. Millard and Martin Ross Electronics and Computer Science University of Southampton Southampton, UK +44 (0) 23 8059 3255 {dem, mar302}@ecs.soton.ac.uk ABSTRACT be the inventor of hypertext, and ever since his original work Web 2.0 is the popular name of a new generation of Web others have been refining his approach, both in terms of what applications, sites and companies that emphasis openness, hypertext should offer, how it is interpreted by readers, and also community and interaction. Examples include technologies such in concrete systems and studies. as Blogs and Wikis, and sites such as Flickr. In this paper we Walker has noted that the new generation of Web applications compare these next generation tools to the aspirations of the has created a new form of feral hypertext, unrestrained by early Hypertext pioneers to see if their aims have finally been systems or ownership [22]. In this paper we analyse the field of realized. hypertext research in order to draw out the aspirations of its pioneers and their subsequent refinements by the community, we Categories and Subject Descriptors then compare these aspirations with a number of Web 2.0 H.5.4 [ Hypertext/Hypermedia ]: Theory systems in order to draw conclusions about how well those earlier ideas have been realised in the modern Web. General Terms Design, Theory 2. ASPIRATIONS In 1987, Halasz took the aspirations of the earlier pioneers and, based on the systems around at the time, described seven issues Keywords that had to be considered and resolved in order to progress Web 2.0, Hypertext Pioneers, Hypertext Functionality towards the systems envisioned; these included composite structures, versioning, collaboration and search [12].
    [Show full text]
  • Urls Hypertext
    Universal Resource Locator (URL) Mendel Rosenblum CS142 Lecture Notes - URLs Hypertext ● Text with links to other text ○ Click on links takes you somewhere else ○ Old idea: ■ Ted Nelson coined the term (early '60s), built Xanadu system ■ Doug Englebart: "Mother of all demos" in 1968 ■ HyperCard for the Macintosh: 1987 ● Web adapted the idea, link specification: ○ Uniform Resource Locators (URL) - Provided names for web content <a href="https://en.wikipedia.org/wiki/URL">URL</a> CS142 Lecture Notes - URLs Parts of an URL http://host.company.com:80/a/b/c.html?user=Alice&year=2008#p2 Scheme (http:): identifies protocol used to fetch the content. Host name (//host.company.com): name of a machine to connect to. Server's port number (80): allows multiple servers to run on the same machine. Hierarchical portion (/a/b/c.html): used by server to find content. Query parameters (?user=Alice&year=2008): provides additional parameters Fragment (#p2): Have browser scroll page to fragment (html: p2 is anchor tag) Used on the browser only; not sent to the server. CS142 Lecture Notes - URLs URL: schemes (e.g. http) http: is the most common scheme; it means use the HTTP protocol https: is similar to http: except that it uses SSL encryption file: means read a file from the local disk mailto: means open an email program composing a message There are several other schemes, such as ftp:, but they aren't used much anymore. CS142 Lecture Notes - URLs URL: Hierarchical portion (/a/b/c.html) ● Passed to the web server for interpretation. Early web servers: ○ Path name for a static HTML file.
    [Show full text]
  • On Using JSON-LD to Create Evolvable Restful Services
    On Using JSON-LD to Create Evolvable RESTful Services Markus Lanthaler 1, 2 Christian Gütl 1, 2 1 Institute for Information Systems and Computer Media 2 School of Information Systems Graz University of Technology Curtin University of Technology Graz, Austria Perth, Australia [email protected] [email protected] ABSTRACT being solved, issues on how to integrate these hugely hetero- As the amount of data and devices on the Web experiences geneous datasets into a scalable system become increasingly exponential growth issues on how to integrate such hugely important. Reusing the Web’s underlying architectural style, i.e., heterogeneous components into a scalable system become increa- REST [1], has proven to be a viable solution to transform islands singly important. REST has proven to be a viable solution for of data into an integrated Web of Data. It provides a set of archi- such large-scale information systems. It provides a set of archi- tectural constraints that, when applied as a whole, result in a tectural constraints that, when applied as a whole, result in bene- concrete system architecture that “emphasizes scalability of com- fits in terms of loose coupling, maintainability, evolvability, and ponent interactions, generality of interfaces, independent scalability. Unfortunately, some of REST’s constraints such as the deployment of components, and intermediary components to ones that demand self-descriptive messages or require the use of reduce interaction latency, enforce security, and encapsulate hypermedia as the engine of application state are rarely imple- legacy systems” [1]. mented correctly. This results in tightly coupled and thus brittle While some of REST’s constraints such as stateless interaction, systems.
    [Show full text]
  • A Study of Hypertext Reading on the Internet ૺࢢቶŎ Yu-Wei Chang
    ௐ4ഇĞ2004.3ğࢱ59-79סဦ३ྤੈጯΏ ௐ1 შྮ෹͛ώዦ᝝̝ଣ੅ A Study of Hypertext Reading on the Internet ૺࢢቶŎ Yu-Wei Chang ၡࢋ multiple developments of reading and provide ቢّ̈́ another choice for the readers. Linear reading isܧ͛ώӮΞͽଳϡה෹͛ώ̈́О .ቢّ͞ёซҖዦ᝝ĂΪߏྤੈࡊԫ۞ᐹ๕Ξ still the main reading style in the society ͽឰ෹͛ώ۞পᕇдშྮ˯Ξͽ൴೭۞Հ However, information technology changes the рĂ೩ֻ᝝۰Ω˘჌ዦ᝝͞ё۞ᏴፄĄቢّ way of communication and even changes the .ዦ᝝̪ߏۤົ̂ி౵͹ࢋ۞ዦ᝝ݭၗĂҭЯ peopleŅs culture including the reading habit ࠎྤੈࡊԫԼត˞ˠᙷ̝็ᇫ͞ёĂซ҃Լ This paper will focus on the difference between តΒ߁ዦ᝝͞ёд̰۞ˠᙷ̼͛ݭၗĂ߇შ the hypertext reading on the Internet and the ͛ώዦ᝝̝मளĂߏώ printed text reading. Although Internet brings aה෹͛ώዦ᝝ᄃОྮ ͛ᙯ͕۞ࢦᕇĄშྮ۞΍னឰዦ᝝ѣາ۞ࢬ new reading style, the interaction between the ᄻ΍னĂҭৌϒ׶᝝۰யϠ̢જ۞ߏ̰टĂ text and the reader is the information content γдྶវĂ߇̙გߏშྮ෹͛ώዦ᝝ٕ instead of the carrier. Therefore, the importantܧ҃ ௚ዦ᝝Ăࢦࢋ۞ኝᗟӮߏтң೩̿ዦ᝝̝ issue for the hypertext reading on the Internet็ ݡኳĂͽ႕֖᝝۰̝ዦ᝝ᅮՐĄ or the conventional reading is the improvement on the reading quality in order to satisfy the Abstract reader's need. The hypertext and the printed text can be read in a non-linear or a linear way. The ᙯᔣෟĈშᅫშྮă෹͛ώăዦ᝝ advantages of information technology favor the KeywordsĈInternet; Hypertext; Reading ϲέ៉̂ጯဦ३ྤੈࡁտٙ౾̀Ϡ઼+ (Doctoral Student, Graduate School of Library and Information Science, National Taiwan University) 59 ௐ4ഇĞ2004.3ğסဦ३ྤੈጯΏ ௐ1 ಥă݈֏ ಫ̬۞ତৼĂซ҃யϠາ۞৵ዳĞliteracyğ ߏ׽ؠ̙ត۞Ăܧၹ۞࿅඀Ă˵ಶߏྋ યᗟĂЯѩ৵ዳ۞ؠཌྷ֭ޙዦ᝝ߏ˘჌ຍཌྷ ᛖဦ͛ຍཌྷ۞જүĞො1ğĂ౅࿅ዦ᝝ˠࣇ଀ ҃ߏᄃ༊ॡ̝ࡊԫᒖဩ૜̷࠹ᙯĄĞො3ğ םͽӛќۢᙊซ҃౅࿅็ᇫ็ٚۢᙊĂЯѩዦ ࢬ၆შྮ็ᇫॡ΃۞ֽᓜĂ઼ᅫዦ᝝ ᝝˘ۡᄃۢᙊܲ޺૜̷ᙯܼĂ͍׎Վˢۢᙊ
    [Show full text]
  • WWW, HTML, and URLS Thomas Weise · 汤卫思 [email protected] ·
    Distributed Computing Lesson 9: WWW, HTML, and URLS Thomas Weise · 汤卫思 [email protected] · http://www.it-weise.de Hefei University, South Campus 2 合肥学院 南艳湖校区/南2区 Faculty of Computer Science and Technology 计算机科学与技术系 Institute of Applied Optimization 应用优化研究所 230601 Shushan District, Hefei, Anhui, China 中国 安徽省 合肥市 蜀山区 230601 Econ. & Tech. Devel. Zone, Jinxiu Dadao 99 经济技术开发区 锦绣大道99号 Outline 1 World Wide Web 2 Web Pages and Sites 3 URLs & Co. website Distributed Computing Thomas Weise 2/22 Overview • Understand the concepts of static HTML and URLs • Understand closely related standards such as CSS and JavaScript Distributed Computing Thomas Weise 3/22 World Wide Web (History) 89-03 First proposal for HTML/WWW by Tim Berners-Lee at CERN in Geneva for a network of documents [1] in a Hypertext [2] format Distributed Computing Thomas Weise 4/22 World Wide Web (History) 89-03 First proposal for HTML/WWW by Tim Berners-Lee at CERN in Geneva for a network of documents [1] in a Hypertext [2] format 91-12 First text-based prototype shown at the Hypertext’91 conference [3] Distributed Computing Thomas Weise 4/22 World Wide Web (History) 89-03 First proposal for HTML/WWW by Tim Berners-Lee at CERN in Geneva for a network of documents [1] in a Hypertext [2] format 91-12 First text-based prototype shown at the Hypertext’91 conference [3] 92-12 50 WWW server wordwide Distributed Computing Thomas Weise 4/22 World Wide Web (History) 89-03 First proposal for HTML/WWW by Tim Berners-Lee at CERN in Geneva for a network of documents [1] in a Hypertext [2] format
    [Show full text]