A Knowledge Discovery Approach

Total Page:16

File Type:pdf, Size:1020Kb

A Knowledge Discovery Approach Semantic XML Tagging of Domain-Specific Text Archives: A Knowledge Discovery Approach Dissertation zur Erlangung des akademisches Grades Doktoringenieur (Dr.-Ing.) angenommen durch die Fakult¨at fur¨ Informatik der Otto-von-Guericke-Universit¨at Magdeburg von Diplom-Kaufmann Peter Karsten Winkler, geboren am 1. Oktober 1971 in Berlin Gutachterinnen und Gutachter: Prof. Dr. Myra Spiliopoulou Prof. Dr. Gunter Saake Prof. Dr. Stefan Conrad Ort und Datum des Promotionskolloquiums: Magdeburg, 22. Januar 2009 Karsten Winkler. Semantic XML Tagging of Domain-Specific Text Archives: A Knowl- edge Discovery Approach. Dissertation, Faculty of Computer Science, Otto von Guericke University Magdeburg, Magdeburg, Germany, January 2009. Contents List of Figures v List of Tables vii List of Algorithms xi Abstract xiii Zusammenfassung xv Acknowledgments xvii 1 Introduction 1 1.1 TheAbundanceofText ............................ 1 1.2 Defining Semantic XML Markup . 3 1.3 BenefitsofSemanticXMLMarkup . 9 1.4 ResearchQuestions ............................... 12 1.5 ResearchMethodology ............................. 14 1.6 Outline...................................... 16 2 Literature Review 19 2.1 Storage, Retrieval, and Analysis of Textual Data . ....... 19 2.1.1 Knowledge Discovery in Textual Databases . 19 2.1.2 Information Storage and Retrieval . 23 2.1.3 InformationExtraction. 25 2.2 Discovering Concepts in Textual Data . 26 2.2.1 Topic Discovery in Text Documents . 27 2.2.2 Extracting Relational Tuples from Text . 31 2.2.3 Learning Taxonomies, Thesauri, and Ontologies . 35 2.3 Semantic Annotation of Text Documents . 39 2.3.1 Manual Semantic Text Annotation . 40 2.3.2 Semi-Automated Semantic Text Annotation . 43 2.3.3 Automated Semantic Text Annotation . 47 2.4 Schema Discovery in Marked-Up Text Documents . 50 2.5 Summary .................................... 53 Contents 3 DIAsDEM Framework 57 3.1 Terminology................................... 57 3.2 ObjectivesandOverview. 62 3.3 Knowledge Discovery Phase . 65 3.4 Knowledge Application Phase . 67 3.5 Summary .................................... 68 4 DIAsDEM Knowledge Discovery Process 71 4.1 Terminology................................... 71 4.2 Pre-ProcessingofTextDocuments . 75 4.2.1 Creating and Tokenizing Text Units . 75 4.2.2 ExtractingNamedEntities. 80 4.2.3 Lemmatizing Words and Word Sense Disambiguation . 83 4.2.4 Establishing a Controlled Vocabulary . 86 4.2.5 Mapping Text Units onto Text Unit Vectors . 94 4.3 Clustering of Text Unit Vectors . 99 4.3.1 Clustering Textual Data: An Overview . 99 4.3.2 Selecting a Clustering Algorithm . 107 4.3.3 Ranking Clusters of Text Unit Vectors . 115 4.3.4 Iterative Clustering of Text Unit Vectors . 124 4.4 Post-Processing of Discovered Patterns . 132 4.4.1 Recommending Semantic Cluster Labels . 133 4.4.2 Establishing a Concept-Based XML DTD . 138 4.4.3 Semantic XML Tagging of Text Documents . 143 4.5 Bridging Knowledge Discovery and Knowledge Application . ........148 4.6 Process Automation vs. Expert Involvement . 152 4.7 Summary ....................................154 5 DIAsDEM Workbench 157 5.1 Key Characteristics and Architecture . 157 5.2 OverviewofCoreTasks ............................160 5.2.1 Pre-Processing of Text Documents . 161 5.2.2 Iterative Clustering of Text Unit Vectors . 166 5.2.3 Post-Processing of Discovered Patterns . 169 5.3 Summary ....................................171 6 Experimental Evaluation 173 6.1 Assessing the Quality of Semantic XML Markup . 173 6.1.1 Quality Criteria for Semantic XML Markup . 174 6.1.2 Extending DIAsDEM Workbench . 179 6.2 Real-World Applications of the DIAsDEM Framework . 180 6.2.1 Semantic XML Markup for Competitive Intelligence . 180 6.2.2 German Commercial Register Entries . 181 ii Contents 6.2.3 News about U.S. Mergers and Acquisitions . 194 6.3 Summary ....................................206 7 Conclusions 209 7.1 SummaryandContribution . 209 7.2 FutureResearch.................................212 7.2.1 Structuring the Concept-Based XML DTD . 212 7.2.2 Temporal Aspects of Discovered Knowledge . 213 7.2.3 Towards Automated Knowledge Discovery . 214 7.3 ConcludingRemarks ..............................215 A Contents of the Supplementary Web Site 217 B Specifications of Abstract Data Types 219 B.1 ADT Notation, Primitive Data Types, and Arrays . 219 B.2 ADTforStrings.................................220 B.3 ADTforVectors ................................220 B.4 ADTforTextDocuments . .220 B.5 ADTforTextArchives.............................221 B.6 ADTforTextUnits...............................221 B.7 ADTforTextUnitLayers .... ....... ....... ....... ..222 B.8 ADTforConcepts ...............................222 B.9 ADTforSetsofConcepts ...........................223 B.10ADTforNamedEntityTypes . 223 B.11 ADT for Sets of Named Entity Types . 224 B.12ADTforNamedEntities . .224 B.13ADTforSetsofNamedEntities. 225 B.14 ADT for Semantically Marked-Up Text Units . 225 B.15 ADT for Semantically Marked-Up Text Unit Layers . 226 B.16 ADT for Semantically Marked-Up Text Documents . 226 B.17 ADT for Semantically Marked-Up Text Archives . 227 B.18 ADT for Conceptual Document Structures . 227 B.19ADTforKDTAlgorithms . .228 B.20ADTforKDTProcessFlows . 229 B.21ADTforTokens.................................229 B.22 ADT for Tokenized Text Units . 229 B.23 ADT for Intermediate Named Entities . 230 B.24 ADT for Sets of Intermediate Named Entities . 231 B.25ADTforTextUnitVectors . .231 B.26 ADT for Intermediate Text Units . 232 B.27 ADT for Intermediate Text Unit Layers . 233 B.28 ADT for Intermediate Text Documents . 234 B.29 ADT for Intermediate Text Archives . 234 iii Contents B.30 ADT for Controlled Vocabulary Terms . 235 B.31 ADT for Controlled Vocabularies . 235 B.32ADTforTextUnitClusters . 236 B.33 ADT for Text Unit Clusterings . 238 B.34 ADT for Descriptor Weighting Schemata . 239 B.35 ADT for Clustering Algorithms . 240 B.36 ADT for Cluster Quality Criteria . 241 B.37ADTforIterationMetadata . 241 B.38ADTforKDTProcessMetadata . 242 C List of Relevant German Vocabulary 245 List of Abbreviations 251 Notation and List of Symbols 253 Bibliography 257 Erkl¨arung 295 iv List of Figures 1.1 Taxonomy of Markup in Text Documents . 4 1.2 TriangleofReference ............................. 5 1.3 Systems Development Research Process and Specific Research Process . 15 2.1 Process of Knowledge Discovery in Textual Databases . ...... 22 2.2 Fundamental and Complementary Research Areas . 54 3.1 Outline of the Two-Phase DIAsDEM Framework. 64 3.2 Knowledge Discovery Process of the DIAsDEM Framework . 65 3.3 Knowledge Application Process of the DIAsDEM Framework . 67 4.1 Text Document ˇtE3 Decomposed into Three Distinct Text Unit Layers . 76 4.2 Iterative Clustering in the DIAsDEM Knowledge Discovery Process . 126 4.3 Illustration of Pattern Discovery in Four Clustering Iterations . 128 4.4 Iterative Classification in the DIAsDEM Knowledge Application Process . 150 4.5 Generic Process of Knowledge Discovery in Textual Databases .......155 5.1 Architectural Overview of DIAsDEM Workbench ............158 5.2 Screen Shot of DIAsDEM Workbench GUI Client ..........159 5.3 Screen Shot of the Replace Named Entities 2.1 Task..........160 5.4 Screen Shot of the Batch Script Editor Tool ..............161 5.5 Screen Shot of the Thesaurus Editor Tool ................165 5.6 Screen Shot of the Cluster Text Unit Vectors (hypKNOWsys) Task .......................................167 5.7 Visualization of Text Unit Clustering Created by the Monitor Cluster Quality 2.2 Task ...............................168 5.8 Visualization of Text Unit Cluster Created by the Monitor Cluster Quality 2.2 Task ...............................169 5.9 Screen Shot of the Derive Conceptual DTD 2.2 Task . 170 6.1 Screen Shot of the Tagging Quality Evaluator 2.2 Tool . 180 List of Tables 1.1 Excerpt of XML Document Containing Semantically Marked-Up Text . 8 2.1 Knowledge Discovery in Textual Databases: Tasks and Applications . 21 4.1 Five Reuters News Items Used in Examples . 72 4.2 Text Document ˇtE1 Decomposed into Text Unit Layer ˇrE1 .......... 76 4.3 Tokenized Text Units after Tokenizing the Elements of Text Unit Layer ˇrE1 79 4.4 Excerpt from the Extended Named Entity Hierarchy and Exemplary Named Entities ..................................... 80 4.5 Processed Text Units of Intermediate Text Unit Layer ¯rE1 after Extracting NamedEntities ................................. 82 4.6 Intermediate Named Entities Identified in Tokenized Text Units of Inter- mediate Text Unit Layer ¯rE1 .......................... 83 4.7 Processed Text Units of Intermediate Text Unit Layer ¯rE1 after Lemmati- zation and Word Sense Disambiguation . 86 4.8 Excerpt of ISO-2788 Thesaurus for Text Documents ˇtE1 through ˇtE5 and Corresponding DIAsDEM-Specific Controlled Vocabulary Terms . 92 4.9 Text Unit Descriptors of the Controlled Vocabulary VE and Weighting Components for Exemplary Text Documents ˇtE1 through ˇtE5 ........ 97 4.10 Text Unit Vectors of Intermediate Text Unit Layer ¯rE1 ........... 97 4.11 Common Proximity Measures between Two Text Documents Represented by m-Dimensional Property Vectors t1 and t2 ................102 4.12 Five Relative Cluster Validity Indices . .......106 4.13 Summary of Three Proposed Clustering Algorithms w.r.t. the Fulfillment of DIAsDEM-Specific Selection Criteria . 110 4.14 Sentences Assigned to Qualitatively Acceptable Text Unit Cluster 4 in Iteration1....................................116 4.15 Sentences Assigned to Qualitatively Unacceptable, Inhomogeneous Text UnitCluster7inIteration1
Recommended publications
  • D3.3 Initial Technical & Logical Architecture and Future Work
    D3.3 Initial Technical & Logical Architecture and future work recommendations ECP 2008 DILI 558001 Europeana v1.0 D3.3 Initial Technical & Logical Architecture and future work recommendations Deliverable number D3.3 Dissemination level Public Delivery date 30 July 2010 Status Final Makx Dekkers, Stefan Gradmann, Jan Author(s) Molendijk eContentplus This project is funded under the eContentplus programme1, a multiannual Community programme to make digital content in Europe more accessible, usable and exploitable. 1 OJ L 79, 24.3.2005, p. 1. 1/14 D3.3 Initial Technical & Logical Architecture and future work recommendations 1. Introduction This deliverable has two tasks: To characterise the technical and logical architecture of Europeana as a system in its 1.0 state (that is to say by the time of the ‘Rhine’ release To outline the future work recommendations that can reasonably be made at that moment. This also provides a straightforward and logical structure to the document: characterisation comes first followed by the recommendations for future work. 2. Technical and Logical Architecture From a high-level architectural point of view, Europeana.eu is best characterized as a search engine and a database. It loads metadata delivered by providers and aggregators into a database, and uses that database to allow users to search for cultural heritage objects, and to find links to those objects. Various methods of searching and browsing the objects are offered, including a simple and an advanced search form, a timeline, and an openSearch API. It is also important to describe what Europeana.eu does not do, even though people sometimes expect it to.
    [Show full text]
  • Bibliography [Abiteboul and Kanellakis, 1989] Serge Abiteboul and Paris Kanellakis
    507 Bibliography [Abiteboul and Kanellakis, 1989] Serge Abiteboul and Paris Kanellakis. Object identity as a query language primitive. In Proc. of the ACM SIGMOD Int. Conf. on Management of Data, pages 159–173, 1989. [Abiteboul et al., 1995] Serge Abiteboul, Richard Hull, and Victor Vianu. Foundations of Databases. Addison Wesley Publ. Co., Reading, Massachussetts, 1995. [Abiteboul et al., 1997] Serge Abiteboul, Dallan Quass, Jason McHugh, Jennifer Widom, and Janet L. Wiener. The Lorel query language for semistructured data. Int. J. on Digital Libraries, 1(1):68–88, 1997. [Abiteboul et al., 2000] Serge Abiteboul, Peter Buneman, and Dan Suciu. Data on the Web: from Relations to Semistructured Data and XML. Morgan Kaufmann, Los Altos, 2000. [Abiteboul, 1997] Serge Abiteboul. Querying semi-structured data. In Proc. of the 6th Int. Conf. on Database Theory (ICDT’97), pages 1–18, 1997. [Abrahams et al., 1996] Merryll K. Abrahams, Deborah L. McGuinness, Rich Thomason, Lori Alperin Resnick, Peter F. Patel-Schneider, Violetta Cavalli-Sforza, and Cristina Conati. NeoClassic tutorial: Version 1.0. Technical report, Artificial Intelligence Prin- ciples Research Department, AT&T Labs Research and University of Pittsburgh, 1996. Available as http://www.bell-labs.com/project/classic/papers/NeoTut/NeoTut. html. [Abrett and Burstein, 1987] Glen Abrett and Mark H. Burstein. The KREME knowledge editing environment. Int. J. of Man-Machine Studies, 27(2):103–126, 1987. [Abrial, 1974] J. R. Abrial. Data semantics. In J. W. Klimbie and K. L. Koffeman, editors, Data Base Management, pages 1–59. North-Holland Publ. Co., Amsterdam, 1974. [Achilles et al., 1991] E. Achilles, B.
    [Show full text]
  • Ontology-Based Methods for Analyzing Life Science Data
    Habilitation a` Diriger des Recherches pr´esent´ee par Olivier Dameron Ontology-based methods for analyzing life science data Soutenue publiquement le 11 janvier 2016 devant le jury compos´ede Anita Burgun Professeur, Universit´eRen´eDescartes Paris Examinatrice Marie-Dominique Devignes Charg´eede recherches CNRS, LORIA Nancy Examinatrice Michel Dumontier Associate professor, Stanford University USA Rapporteur Christine Froidevaux Professeur, Universit´eParis Sud Rapporteure Fabien Gandon Directeur de recherches, Inria Sophia-Antipolis Rapporteur Anne Siegel Directrice de recherches CNRS, IRISA Rennes Examinatrice Alexandre Termier Professeur, Universit´ede Rennes 1 Examinateur 2 Contents 1 Introduction 9 1.1 Context ......................................... 10 1.2 Challenges . 11 1.3 Summary of the contributions . 14 1.4 Organization of the manuscript . 18 2 Reasoning based on hierarchies 21 2.1 Principle......................................... 21 2.1.1 RDF for describing data . 21 2.1.2 RDFS for describing types . 24 2.1.3 RDFS entailments . 26 2.1.4 Typical uses of RDFS entailments in life science . 26 2.1.5 Synthesis . 30 2.2 Case study: integrating diseases and pathways . 31 2.2.1 Context . 31 2.2.2 Objective . 32 2.2.3 Linking pathways and diseases using GO, KO and SNOMED-CT . 32 2.2.4 Querying associated diseases and pathways . 33 2.3 Methodology: Web services composition . 39 2.3.1 Context . 39 2.3.2 Objective . 40 2.3.3 Semantic compatibility of services parameters . 40 2.3.4 Algorithm for pairing services parameters . 40 2.4 Application: ontology-based query expansion with GO2PUB . 43 2.4.1 Context . 43 2.4.2 Objective .
    [Show full text]
  • Description Logics
    Description Logics Franz Baader1, Ian Horrocks2, and Ulrike Sattler2 1 Institut f¨urTheoretische Informatik, TU Dresden, Germany [email protected] 2 Department of Computer Science, University of Manchester, UK {horrocks,sattler}@cs.man.ac.uk Summary. In this chapter, we explain what description logics are and why they make good ontology languages. In particular, we introduce the description logic SHIQ, which has formed the basis of several well-known ontology languages, in- cluding OWL. We argue that, without the last decade of basic research in description logics, this family of knowledge representation languages could not have played such an important rˆolein this context. Description logic reasoning can be used both during the design phase, in order to improve the quality of ontologies, and in the deployment phase, in order to exploit the rich structure of ontologies and ontology based information. We discuss the extensions to SHIQ that are required for languages such as OWL and, finally, we sketch how novel reasoning services can support building DL knowledge bases. 1 Introduction The aim of this section is to give a brief introduction to description logics, and to argue why they are well-suited as ontology languages. In the remainder of the chapter we will put some flesh on this skeleton by providing more technical details with respect to the theory of description logics, and their relationship to state of the art ontology languages. More detail on these and other matters related to description logics can be found in [6]. Ontologies There have been many attempts to define what constitutes an ontology, per- haps the best known (at least amongst computer scientists) being due to Gruber: “an ontology is an explicit specification of a conceptualisation” [47].3 In this context, a conceptualisation means an abstract model of some aspect of the world, taking the form of a definition of the properties of important 3 This was later elaborated to “a formal specification of a shared conceptualisation” [21].
    [Show full text]
  • Will the Semantic Web Scale?
    Panel Discussion P3: Will the Semantic Web scale? New York Sheraton May 19th, 2004 Proposers: Panelists: Organizers: - Raphael Volz - Dr. Cathy Marshall - Raphael Volz - Carole Goble - Prof. Dr. Alon Y. Halevy - Daniel Oberle - Rudi Studer - Prof. Dr. Jürgen Angele - Prof. Dr. Ian Horrocks Panelist 1 Dr. Cathy Marshall Microsoft Corporation Texas A+M University Why the Semantic Web won’t scale the scaled semantic web seen as mass-market product “the Flowbee uses the suction power of your household vacuum to draw the hair up to the desired length, and then gives it a perfect cut.....every time.” Three important questions: • Will it really work? • Who needs it? • Is it safe? 3 will it work? evaluating the semantic web as metadata • compare the semantic web to a widely adopted metadata scheme like the MARC record used for library cataloging – MARC practitioners are members of a community and are trained to create metadata – MARC reduces interpretive load by careful choice of attributes, authority lists, & cataloging rules (AACR, e.g.) to constrain values – MARC records are controlled for interoperability and consistency in various ways (e.g. by clearinghouses like OCLC) – so... on-line catalog (OPAC) users know what to expect 4 will it work? evaluating the semantic web as metadata • by contrast, the semantic web is subject to the following pitfalls as it scales: – social structures for creating universal semantic web metadata are missing (local culture/practices/needs prevail) – semantic web metadata requires substantial interpretation of domain knowledge; underlying assumptions about use are highly situated – no way of ensuring interoperability, consistency, accuracy • e.g.
    [Show full text]
  • 3Rd NASA/IEEE Workshop on Formal Approaches to Agent-Based Systems
    Proceedings 3rdNASA/IEEE Workshop on Formal Approaches to Agent-Based Systems Springer-Verlag LNCS vol. 3228 Edited by Michael G. Hinchey, James L. Rash, Walter F. Truszkowski and Christopher A. Rouff Please note: This is a complete draft of the text; the publisher will add the correct page numbers, etc. The current formatting, although not very neat, I is the way the publisher requires it from us. Preface Greenbelt, MD October 2004 Organizing Committee Mike Hinchey, NASA Goddard Space Flight Center Jim Rash, NASA Goddard Space Flight Center Walt Truszkowski, NASA Goddard Space Flight Center Chris Rouff, SAIC I- _- Contents Ecology Based Decentralized Agent Management System Maim D. Pqirakhov, VincentA. Cicirello and William C. Regli ........................ 1 From Abstract to Concrete Nomin Agent Institutions Daide Grossi and Frank Dipm ............................................................ 12 Meeting the Deadline: Why, When and How Frank Dignum, Jan Broersen, Virginia Dignurn and John-Jules Meyer ................ 30 Multi-Agent Systems Reliability, Fuzziness, and Deterrence Michel Rudnimki and H&ne Bestougefl.. ................................................ 41 Formalism Challenges of the Cougaar Model Driven Architecture Shawn A. Bohner. Boby George. Denis Gracirnin andMichael G. Hinchey .......... 58 Facilitating the Specification Capture and Transformation Process in the Development of Multi-Agent Systems Aluizio Haendchen Filho, Nuno Caminah, Edward Hermann Haeusler and Arndt von Staa.. ............................................................................ 73 Using Ontologies to Formalize Services Specifications in Multi-Agent Systems Karin Koogan Breitman, Aluizio Haendchen Filho, Edward Hermann Haeusler and Am& von Staa ............................................................................. 93 Two Formal Gas Models For Multi-Agent Sweeping and Obstacle Avoidance Wesley Ken, Diana Spears, William Spears and David Thayer ...................... 1 13 A Formal kysisof Potential Energy in a Multi-agent System William M Spears, Diana F.
    [Show full text]
  • Toward an Open Knowledge Research Graph.Pdf
    THE SERIALS LIBRARIAN https://doi.org/10.1080/0361526X.2019.1540272 Toward an Open Knowledge Research Graph Sören Auera and Sanjeet Mann b aPresenter; bRecorder ABSTRACT KEYWORDS Knowledge graphs facilitate the discovery of information by organizing it into Knowledge graph; scholarly entities and describing the relationships of those entities to each other and to communication; Semantic established ontologies. They are popular with search and e-commerce com- Web; linked data; scientific panies and could address the biggest problems in scientific communication, research; machine learning according to Sören Auer of the Technische Informationsbibliothek and Leibniz University of Hannover. In his NASIG vision session, Auer introduced attendees to knowledge graphs and explained how they could make scientific research more discoverable, efficient, and collaborative. Challenges include incentiviz- ing researchers to participate and creating the training data needed to auto- mate the generation of knowledge graphs in all fields of research. Change in the digital world Thank you to Violeta Ilik and the NASIG Program Planning Committee for inviting me to this conference. I would like to show you where I come from. Leibniz University of Hannover has a castle that belonged to a prince, and next to the castle is the Technische Informationsbibliothek (TIB), responsible for supporting the scientific and technology community in Germany with publications, access, licenses, and digital information services. Figure 1 is an example of a knowledge graph about TIB. The basic ingredients of a knowledge graph are entities and relationships. We are the library of Leibniz University of Hannover and we are a member of Leibniz Association (a German research association).
    [Show full text]
  • Scalable Approaches for Auditing the Completeness of Biomedical Ontologies
    University of Kentucky UKnowledge Theses and Dissertations--Computer Science Computer Science 2021 Scalable Approaches for Auditing the Completeness of Biomedical Ontologies Fengbo Zheng University of Kentucky, [email protected] Author ORCID Identifier: https://orcid.org/0000-0001-5902-0186 Digital Object Identifier: https://doi.org/10.13023/etd.2021.128 Right click to open a feedback form in a new tab to let us know how this document benefits ou.y Recommended Citation Zheng, Fengbo, "Scalable Approaches for Auditing the Completeness of Biomedical Ontologies" (2021). Theses and Dissertations--Computer Science. 105. https://uknowledge.uky.edu/cs_etds/105 This Doctoral Dissertation is brought to you for free and open access by the Computer Science at UKnowledge. It has been accepted for inclusion in Theses and Dissertations--Computer Science by an authorized administrator of UKnowledge. For more information, please contact [email protected]. STUDENT AGREEMENT: I represent that my thesis or dissertation and abstract are my original work. Proper attribution has been given to all outside sources. I understand that I am solely responsible for obtaining any needed copyright permissions. I have obtained needed written permission statement(s) from the owner(s) of each third-party copyrighted matter to be included in my work, allowing electronic distribution (if such use is not permitted by the fair use doctrine) which will be submitted to UKnowledge as Additional File. I hereby grant to The University of Kentucky and its agents the irrevocable, non-exclusive, and royalty-free license to archive and make accessible my work in whole or in part in all forms of media, now or hereafter known.
    [Show full text]
  • Owl2, Rif, Sparql1.1)
    http://www.csd.abdn.ac.uk/ http://www.deri.ie The Role of Logics and Logic Programming in Semantic Web Standards (OWL2, RIF, SPARQL1.1) Axel Polleres Digital Enterprise Research Institute (DERI) National University of Ireland, Galway 1 Outline ESWC2010 Quick intro of RDF/OWL/Linked Open Data OWL2 Overview: from OWL 1 to OWL 2 Reasoning services in OWL 2 OWL2 Treactable Fragments (OWL2RL, OWL2EL, OWL2QL) OWL2 and RIF OWL2 and SPARQL1.1 Time allowed: Implementing SPARQL, OWL2RL, RIF on top of DLV – The GiaBATA system 2 Example: Finding experts/reviewers? ESWC2010 Tim Berners-Lee, Dan Connolly, Lalana Kagal, Yosi Scharf, Jim Hendler: N3Logic: A logical framework for the World Wide Web. Theory and Practice of Logic Programming (TPLP), Volume 8, p249-269 Who are the right reviewers? Who has the right expertise? Which reviewers are in conflict? Observation: Most of the necessary data already on the Web, as RDF! More and more of it follows the Linked Data principles, i.e.: 1. Use URIs as names for things 2. Use HTTP dereferenceable URIs so that people can look up those names. 3. When someone looks up a URI, provide useful information. 4. Include links to other URIs so that they can discover more things. 3 RDF on the Web ESWC2010 (i) directly by the publishers (ii) by e.g. transformations, D2R, RDFa exporters, etc. FOAF/RDF linked from a home page: personal data (foaf:name, foaf:phone, etc.), relationships foaf:knows, rdfs:seeAlso ) 4 RDF on the Web ESWC2010 (i) directly by the publishers (ii) by e.g.
    [Show full text]
  • 599, Giese, M., Skjaeveland, M
    City Research Online City, University of London Institutional Repository Citation: Soylu, A., Kharlamov, E., Zheleznyakov, D., Jimenez-Ruiz, E. ORCID: 0000- 0002-9083-4599, Giese, M., Skjaeveland, M. G., Hovland, D., Schlatte, R., Brandt, S., Lie, H. and Horrocks, I. (2018). OptiqueVQS: A visual query system over ontologies for industry. Semantic Web, 9(5), doi: 10.3233/SW-180293 This is the accepted version of the paper. This version of the publication may differ from the final published version. Permanent repository link: https://openaccess.city.ac.uk/id/eprint/22928/ Link to published version: http://dx.doi.org/10.3233/SW-180293 Copyright: City Research Online aims to make research outputs of City, University of London available to a wider audience. Copyright and Moral Rights remain with the author(s) and/or copyright holders. URLs from City Research Online may be freely distributed and linked to. Reuse: Copies of full items can be used for personal research or study, educational, or not-for-profit purposes without prior permission or charge. Provided that the authors, title and full bibliographic details are credited, a hyperlink and/or URL is given for the original metadata page and the content is not changed in any way. City Research Online: http://openaccess.city.ac.uk/ [email protected] Semantic Web 0 (0) 1–35 1 IOS Press OptiqueVQS: a Visual Query System over Ontologies for Industry 1 Editor(s): Freddy Lecue, Accenture Tech Labs, Ireland Solicited review(s): Vanessa Lopez, IBM Research, Ireland; Two anonymous reviewers Ahmet Soylu a;∗, Evgeny Kharlamov b, Dmitriy Zheleznyakov b, Ernesto Jimenez-Ruiz c, Martin Giese c, Martin G.
    [Show full text]
  • Submission Data for 2017 CORE Conference Re-Ranking Process
    Submission Data for 2017 CORE conference Re-ranking process International Semantic Web Conference (Submitted as a comparator for Semantics - International Conference on Semantic Systems) Submitted by: Tassilo Pellegrini [email protected] Supported by: Van Harmelen Frank, Sack Harald, Fensel Dieter, Auer Sören Conference Details Conference Title: International Semantic Web Conference Acronym : ISWC Rank: A Recent Years Most Recent Year Year: 2016 URL: http://iswc2016.semanticweb.org/ Papers submitted: 326 Papers published: 75 Acceptance rate: 23 Source for acceptance rate: http://www.springer.com/de/book/9783319465227 Program Chairs Name: Paul Groth Affiliation: Elsevier Labs Netherlands H index: 39 Google Scholar URL: https://scholar.google.at/citations?user=0tHSHCIAAAAJ&hl=de DBLP URL: Name: Elena Simperl Affiliation: University of Southampton UK H index: 28 Google Scholar URL: DBLP URL: General Chairs Name: Yolanda Gil Affiliation: Information Sciences Institute, University of Southern California United States H index: 57 Google Scholar URL: https://scholar.google.at/citations?user=NDmGsCgAAAAJ&hl=de DBLP URL: Second Most Recent Year Year: 2015 URL: http://iswc2015.semanticweb.org/ Papers submitted: 172 Papers published: 38 Acceptance rate: 22 Source for acceptance rate: http://www.springer.com/de/book/9783319250090 1 Program Chairs Name: Marcelo Arenas Affiliation: Pontificia Universidad Catolica de Chile Chile H index: 38 Google Scholar URL: https://scholar.google.at/citations?user=YHR0wkkAAAAJ&hl=de DBLP URL: Name: Oscar Corcho
    [Show full text]
  • Submission Data for 2020-2021 CORE Conference Ranking Process (Submitted As a Comparator for International Conference on Automated Deduction )
    Submission Data for 2020-2021 CORE conference Ranking process (Submitted as a comparator for International Conference on Automated Deduction ) Philipp Ruemmer, Christoph Weidenbach Conference Details Conference Title: International Joint Conference on Automated Reasoning Acronym : IJCAR Rank: A* DBLP Link DBLP url: https://dblp.org/db/conf/cade/index.html Recent Years Proceedings Publishing Style Proceedings Publishing: series Link to most recent proceedings: https://www.springer.com/gp/book/9783030510732 Further details: Published in the LNCS series of Springer. The proceedings mainly consist of contributed research papers and tool papers. Most Recent Years Most Recent Year Year: 2018 URL: http://ijcar2018.org/ Location: Oxford, United Kingdom Papers submitted: 108 Papers published: 46 Acceptance rate: 43 Source for numbers: http://ijcar.org//conferences General Chairs Name: Ian Horrocks Affiliation: University of Oxford, UK Gender: M H Index: 96 GScholar url: https://scholar.google.com/citations?user=0ypdmcYAAAAJ&hl=en DBLP url: Program Chairs 1 Name: Didier Galmiche Affiliation: UniversitÃľ de Lorraine, France Gender: M H Index: 19 GScholar url: DBLP url: https://dblp.org/pid/82/76.html Name: Stephan Schulz Affiliation: DHBW Stuttgart, Germany Gender: M H Index: 25 GScholar url: https://scholar.google.com/citations?user=Ebf2ARQAAAAJ&hl=en DBLP url: Name: Roberto Sebastiani Affiliation: University of Trento, Italy Gender: M H Index: 45 GScholar url: https://scholar.google.com/citations?user=qmnmdYsAAAAJ&hl=en DBLP url: Second Most Recent Year
    [Show full text]