An Extensible Framework for Efficient Document Management Using RDF and OWL

Erica Meena, Laboratory LORIA, Vandoeuvre-les-Nancy, France
Ashwani Kumar, M.I.T, Cambridge, MA, USA
Laurent Romary, Laboratory LORIA, Vandoeuvre-les-Nancy, France

Abstract

In this paper, we describe an integrated approach towards dealing with various semantic and structural issues associated with document management. We provide motivations for using XML, RDF and OWL in building a seamless architecture to serve not only as a document exchange service but also to enable higher-level services such as annotations, metadata access and querying. The key idea is to treat separately the actual document structure, the semantic content of the document and the ontological document organization. The deployment of this architecture in the PROTEUS project¹ provides an industrial setting for evaluation and further specification.

1. Introduction

Digital documents are ubiquitously used to encode, preserve and exchange useful information in order to accomplish information sharing across the community. As the growth in volumes of digital data is exponential, it is necessary to adopt a principled way of managing these documents. Besides, due to the distributed nature of information, it is also imperative to take into account the geographical and enterprise-level barriers to uniform data access and retrieval.

The ITEA (Information Technology for European Advancement) project Proteus² has similar objectives. Proteus is a collaborative initiative of French, German and Belgian companies, universities and research institutes aimed at developing a European generic software platform usable for the implementation of web-based e-maintenance centers. It implements a generic architecture for integrated document management using enabling technologies such as XML, RDF and OWL. Most of the existing document management systems ([1], [2]) limit themselves in the scope of application or document formats, or simply neglect any structure-based analysis. However, considering our requirements, it is obvious that only a multi-layered functional architecture can cover the various issues related to distributed document management, such as localized vs. global structural constraints, conceptual definition of documents, reasoning-based discovery, etc.

Indeed, evolving technologies such as XML (eXtensible Markup Language), RDF (Resource Description Framework) and OWL (Web Ontology Language) provide us with a rich set of application frameworks that, if applied intelligently, can help a great deal in solving these problems. XML ([3]) is primarily designed for low-level structural descriptions. It provides a tree of structured nodes, which can be efficiently used to describe documents and check their models using DTDs (Document Type Definitions) or XML Schemas. Besides, XML enables easy human readability as well as efficient machine interpretability. However, there are issues if we deal only with the structural aspect. If one wants to pick some semantic information from a document, there is no straightforward way other than to constrain it by a schema or to hand-program an application to recognize certain document-specific semantics. Furthermore, if the schema changes over time, it could typically introduce new intermediate elements. This might have the consequence of invalidating certain queries and creating incoherencies in the semantic data model of the document.

RDF (Resource Description Framework) and OWL (Web Ontology Language) build upon the XML syntax to describe the actual semantics of a document and provide useful reasoning and inference mechanisms. RDF ([4]) specifies graphs of nodes, which are connected by directed arcs representing relational predicates, such as URIs (Uniform Resource Identifiers), and which encode the conceptual model of the real world. Unlike XML, an RDF schema is a simple vocabulary language. The parse of the semantic graph results in a set of triples, which mimic predicate-argument conceptual structures. OWL can be used on top of these semantic structures to do logical reasoning and discover relations that are not explicit and obvious.

In the following sections we discuss how we use these technologies to enable a generic document management system. Firstly, in Section 2 we describe document management and the Proteus architecture, followed by a discussion of annotations in Section 3. Section 4 provides a brief account of the model-theoretic access mechanisms enabled by OWL, followed by a description of data categories in Section 5.

¹ This material is based upon work supported by the ITEA (Information Technology for European Advancement) programme under Grant 01011 (2002).
² http://www.proteus-iteaproject.com/

2. Document Management Architecture

Without differentiating at the level of content, layout and formats, we treat documents as information resources. These information resources can potentially be distributed across various document repositories called e-Doc servers. Figure 2.1 demonstrates a simplified distributed document management system. The architecture shows how three different document repositories could co-exist functionally along with the Annotea-enabled annotation framework ([5]). These servers implement procedural mechanisms for query access and retrieval of documents. Besides, these documents can be annotated, and the annotations reside on an independent server known as the annotation server, which also serves as a document server. Principally, annotations can be viewed as information resources, which are described in RDF.

[Figure: an Annotea RDF annotation server alongside Server 1, Server 2 and Server 3, each holding TEI/XML documents made up of a Header and a Body.]
Figure 2.1: Simplified view of the distributed document server architecture

The e-Doc server consists of several functional layers that inter-communicate and, holistically, serve the cumulative purpose of document management. These layers, though distinct at the level of data flow and individual processing of information, afford functionalities that are exploited by the e-Doc server.

Figure 2.2 shows various such layers of the e-Doc server. On the foundation level, it is assumed that every document on the e-Doc server adheres to a single syntax, i.e. XML, which represents the topmost layer in the architecture. The second layer depicts the access points, which are broadly categorized along various dimensions such as metadata, conceptual/ontology system and terminology. A detailed description of the access points is given in Section 4. The e-Doc server is assumed to be flexible enough to handle all possible ontology formats/standards, whether it is a native XML document, a text, or picture/video data coming from some streaming application. This forms the third important layer of the e-Doc server. The bottom layer represents Annotations [6], which adhere to the RDF [4] syntax. This layer forms an integral part of the e-Doc server as it provides annotation capability and RDF-describable semantics for the actively retrieved document or for existing documents in the server [7]. Besides, RDF also provides the opportunity to utilize annotations as access points for the documents.

[Figure: the user queries the server, retrieves documents and views them. Layers shown: XML Syntax; Access points (Meta-data (1 doc), Conceptual system (n ontologies), Terminology); Documents (Native XML document, Misc. textual format, Picture/Videos, Application Data (XML)); RDF Annotations (Comment).]
Figure 2.2: General Organization of the e-Doc Server

As can be seen from Figure 2.2, a user interacts with the server through a client interface by issuing queries. The architecture gives the user ample flexibility in utilizing different levels of description for retrieving documents by providing a variety of access points. In the following sections, we describe each of these access layers in more detail.

3. Annotations: Specified as RDF Model

Annotations form the most abstract layer within the e-Doc architecture. They can be broadly defined as comments, notes, explanations, or other types of external remarks that can be attached to either a document or a sub-portion of a document. As annotations are considered external, it is possible to annotate a document as a whole or in part without actually editing its content or structure. Conceptually, annotations can be considered as metadata, as they give additional information about an existing piece of data. Annotations can have many distinguishing properties, which can be broadly classified as:

• Physical location: An annotation can be stored locally or on one or more annotation servers;
• Scope: An annotation can be associated with a document as a whole or with a sub-portion of a document;
• Annotation type: Annotations can have various functional types such as "Comment", "Remark", "Query", etc.

Due to this abstract nature and multiplicity of functional types, a formal treatment of annotations is often unwieldy. Therefore, it is desirable to have a semantically driven structural representation for annotations, which we describe below.

Annotation Semantics

Annotations are stored in one or multiple annotation servers. These servers support the exchange protocols specified by Annotea [5]. Essentially, the Annotation Server can be regarded as a general-purpose RDF store, with additional mechanisms for optimized queries and access. This RDF store is built on top of a general SQL store.

    …>
      <dc:creator>Ashwani</dc:creator>
      <rdf:type rdf:resource='http://www.w3.org/2000/10/annotation-ns#Annotation'/>
      <NS1:origin rdf:nodeID='A0'/>
      <NS0:created>2004-05-24T01:11Z</NS0:created>
      <NS0:annotates rdf:resource='http://docB4.teiSpec.org'/>
      <rdf:type rdf:resource='http://www.w3.org/2000/10/annotationType#Comment'/>
      <NS0:body rdf:resource='Please review this document.'/>
      <dc:title>review</dc:title>
      <dc:date>2004-05-24T01:11Z</dc:date>
    </rdf:Description>
    <rdf:Description rdf:nodeID='A1'>
      ….
    </rdf:Description>
    </rdf:RDF>

Figure 3.2: An abridged Annotation in RDF

XPointers are used to point to the annotated portions within the document, while XLinks [11] are used to set up a link between the document and its annotation.
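
To make concrete how an annotation like the one in Figure 3.2 reduces to a set of triples, and how those triples can then serve as an access point for a document, here is a minimal Python sketch. It is an illustration under stated assumptions, not the e-Doc server's implementation: the `rdf:about` URI, the `a:` prefix and the function names are hypothetical, only a small subset of RDF/XML is handled (flat `rdf:Description` elements), and a real deployment would use a full RDF parser and store.

```python
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
ANNOTATES = "http://www.w3.org/2000/10/annotation-ns#annotates"

# Hypothetical, self-contained annotation modelled on Figure 3.2.
# The prefixes and the rdf:about URI are assumptions made for this example.
ANNOTATION = """\
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:a="http://www.w3.org/2000/10/annotation-ns#"
         xmlns:dc="http://purl.org/dc/elements/1.1/">
  <rdf:Description rdf:about="http://example.org/annotations/1">
    <rdf:type rdf:resource="http://www.w3.org/2000/10/annotation-ns#Annotation"/>
    <a:annotates rdf:resource="http://docB4.teiSpec.org"/>
    <dc:creator>Ashwani</dc:creator>
    <dc:title>review</dc:title>
    <dc:date>2004-05-24T01:11Z</dc:date>
  </rdf:Description>
</rdf:RDF>
"""


def expand(tag):
    """Turn ElementTree's '{namespace}local' tag into a plain URI."""
    namespace, _, local = tag[1:].partition("}")
    return namespace + local


def triples(rdf_xml):
    """Return (subject, predicate, object) tuples for each rdf:Description."""
    root = ET.fromstring(rdf_xml)
    result = []
    for description in root.findall(f"{{{RDF}}}Description"):
        # Named resources use rdf:about; anonymous ones fall back to a blank node label.
        subject = description.get(f"{{{RDF}}}about")
        if subject is None:
            subject = "_:" + description.get(f"{{{RDF}}}nodeID", "blank")
        for prop in description:
            # The object is either a resource URI or the literal text of the element.
            obj = prop.get(f"{{{RDF}}}resource")
            if obj is None:
                obj = (prop.text or "").strip()
            result.append((subject, expand(prop.tag), obj))
    return result


def annotations_of(all_triples, document_uri):
    """Use the 'annotates' predicate as an access point: list annotations of a document."""
    return [s for s, p, o in all_triples if p == ANNOTATES and o == document_uri]


if __name__ == "__main__":
    parsed = triples(ANNOTATION)
    for subject, predicate, obj in parsed:
        print(subject, predicate, obj)
    print(annotations_of(parsed, "http://docB4.teiSpec.org"))
```

Each child element of a description becomes one predicate, which mirrors the predicate-argument reading of triples given in the Introduction; looking up annotations by the document they annotate is one simple way the annotation layer can act as an access point.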