Embedding a Multimedia Metadata Model Into a Workflow-Driven Environment Using Idiomatic Semantic Web Technologies

Total Page:16

File Type:pdf, Size:1020Kb

Embedding a Multimedia Metadata Model Into a Workflow-Driven Environment Using Idiomatic Semantic Web Technologies emanuel berndl EMBEDDINGAMULTIMEDIAMETADATAMODEL INTOAWORKFLOW-DRIVENENVIRONMENT USINGIDIOMATICSEMANTICWEB TECHNOLOGIES EMBEDDINGAMULTIMEDIAMETADATAMODELINTOA WORKFLOW-DRIVENENVIRONMENTUSINGIDIOMATIC SEMANTICWEBTECHNOLOGIES emanuel berndl master of science (m.sc.) Doctoral Thesis December 2018 University of Passau Faculty of Computer Science and Mathematics Chair of Distributed Informations Systems Emanuel Berndl: Embedding a Multimedia Metadata Model into a Workflow- Driven Environment Using Idiomatic Semantic Web Technologies, © De- cember 2018 In many ways, people growing up with the Web and now the Semantic Web take the power at their fingertips for granted. — Tim Berners-Lee, 2005 ABSTRACT The Semantic Web exists for about 20 years by now, but its appli- cability as well as its presence does not live up to the standards of its original idea. Incorporated Semantic Web Technologies do have an initial barrier to learn and apply, which can discourage many po- tential users. This leads to less available data overall in addition to decreased data quality. This work solves parts of the aforementioned problem by support- ing idiomatic entry to those Semantic Web Technologies, allowing for "easier" accessibility and usability. Anno4j is a Java library that im- plements a form of Object-Relational Mapping for RDF data. With its application, RDF data can be created via a mapping by simply instan- tiating Java objects - an object-oriented programming concept the user is familiar with. On the other side, requesting persisted data is sup- ported by a path-based querying possibility, while other features like transactional behaviour, code generation, and automated validation of input contribute to a more effective, comprehensive, and straight- forward usage. A use-case is provided by the MICO Platform, a centralised soft- ware instance that connects autonomous multimedia extractors in a workflow-driven fashion. This leads to a rich metadata background for the inserted multimedia files, enabling them to be used in diverse scenarios as well as unlocking yet hidden semantics. For this task it was necessary to design and implement a metadata model that is able to aggregate and merge the varying extractor results under a common “headline”: the MICO Metadata Model. The results of this work allow the use case to incorporate idiomatic Semantic Web Technologies which are then usable natively by non- Semantic Web experts. Additionally, an increase has been achieved in forms of data integration, synchronisation, integrity and validity, as well as an overall more comprehensive and rich implementation of the multimedia extractors. vii ZUSAMMENFASSUNG Das Semantic Web existiert nun seit rund 20 Jahren, jedoch ist des- sen Anwendbarkeit und Präsenz nicht auf dem Stand, der in der ur- sprünglichen Idee angestrebt wurde. Die verwendeten semantischen Technologien besitzen eine Einstiegshürde die potenzielle Nutzer ab- schrecken kann. Dies führt sowohl zu weniger verfügbaren Daten so- wie einer schlechteren Datenqualität. Diese Arbeit löst einen Teil dieses Problems, indem diese Technolo- gien ëinfacheränwendbar für den Benutzer gemacht werden. Anno4j ist eine Java Bibliothek die eine Form einer objektrelationalen Ab- bildung für RDF Daten umsetzt. Dadurch können RDF Daten über ein Mapping erstellt werden, indem lediglich einfache Java Objekte erstellt werden müssen - ein objektorientiertes Konzept das dem Be- nutzer bekannt ist. Eine Abfrage der Daten wird durch Pfad-basiertes Querying vereinfacht, während andere Eigenschaften wie transaktio- nales Verhalten, Code-Generierung und automatisierte Validierung weitere Beiträge zur effektiveren, umfassenderen und unkomplizier- teren Benutzung liefern. Den Anwendungsfall stellt die MICO Platform dar, eine zentrale Softwareinstanz die über Workflow-orientierte Arbeitsweise autono- me Multimedia Extraktoren miteinander verbindet, um eingegebene Multimediainhalte auf ihren Metadatenhintergrund zu untersuchen. Dies erlaubt die Inhalte in einem breiterem Spektrum sowie eventuell unvorhergesehenen Szenarien zu erschließen. Ein eigens entwickeltes RDF Metadatenmodell, das MICO Metadata Model, bildet dafür ein uniformes Datenformat, um alle erstellten Ergebnisse unter einen ge- meinsamen Hut zu bringen. Die Ergebnisse dieser Arbeit ermöglichen in diesem Anwendungs- fall eine Vereinfachung der semantischen Technologien für Semantic Web-fremde Benutzer, eine Verbesserung in Form von Datenintegra- tion, Synchronisation und Integrität, sowie eine funktional reichere Implementierung der Multimediaextraktoren. viii CONTENTS i preface1 1 the semantic web3 1.1 The Vision and Idea of the Semantic Web 4 1.1.1 A Semantic Web Use Case 4 1.1.2 A Look at the Current State of the Semantic Web 7 1.1.3 Open Issues of the Semantic Web 9 1.1.4 The Semantic Gap - A Metadata Problem in the Domain of Multimedia 10 1.2 A Technical Setting for this Thesis - The MICO Project 12 1.3 Research Questions and Contributions 13 1.4 Structure of this Thesis 16 ii modelling metadata - classic approaches and a multimedia context 19 2 principles of metadata modeling and querying 21 2.1 The Semantic Web and Other Familiar Concepts 22 2.2 The “Resource Description Framework” - The Back- bone of Metadata Modeling 29 2.2.1 Information Units in RDF and Their Structure 30 2.2.2 RDF Basics and Concepts for Metadata Mod- elling 32 2.2.3 RDF Vocabularies and Ontologies 37 2.2.4 RDF Datasets and Named Graphs 41 2.2.5 Expressing and Transporting RDF Data - RDF Documents and Their Serialisations 42 2.2.6 Advanced RDF Features - Inferencing, Reason- ing, and Reification 48 2.2.7 Querying and Manipulating RDF Data - SPARQL, the SPARQL Protocol and RDF Query Language 50 3 combining metadata modeling with multimedia 55 3.1 Related Work - Multimedia Metadata Modeling 57 3.2 The Web Annotation Data Model 64 3.2.1 Web Annotation Structure 65 3.2.2 RDF Classes for Body and Target Components 67 3.2.3 Fragmentation of Resource IRIs and Selectors 69 3.2.4 Additional Information for Web Annotation Nodes 73 3.2.5 Complete Exemplary Web Annotation 75 3.3 The MICO Metadata Model - Connecting Annotations 76 3.3.1 Composition Module 78 3.3.2 Context Module 79 ix x contents 3.3.3 The Content Module / The Body of the Part An- notation 80 3.3.4 The Selection Module / The Target of the Part Annotations 81 3.3.5 Provenance Module 82 3.3.6 Multimedia Ontology Requirements Check 87 iii the application of multimedia metadata in a workflow-driven approach and idiomatic seman- tic web technologies 91 4 increasing the usability and applicability of se- mantic web technologies for multimedia meta- data modeling and workflows - anno4j 93 4.1 Related Work - Object/Relational and Object/RDF Map- pings 95 4.2 Anno4j - An Object-RDF-Mapping Library 102 4.3 Creation of Metadata with Anno4j 110 4.4 Querying of Metadata with Anno4j 117 4.5 Established Database Concepts of the Anno4j Library 123 4.5.1 Supporting Transactional Behaviour in Anno4j 125 4.5.2 Validate Database Input with Validated Trans- actions 126 4.5.3 Schema Annotations for Data Validity 127 4.6 Automated Domain Model Generation through Anno4j RDF Schema Parsing 130 4.6.1 Domain Model Generation Functionality 131 4.6.2 Generation Process Internals and Algorithms 133 4.6.3 Generation of a Web Component for a Metadata Model 141 4.7 Additional Anno4j Database Features 143 4.8 Anno4j Conclusion, Outlook, and Envisioned Addi- tions 146 5 a workflow-driven approach for multimedia meta- data application 153 5.1 RW - Multimedia Metadata Platforms, Metadata Life- cycles 155 5.2 Embedding the Multimedia Metadata Workflow - MICO 165 5.2.1 Accessing the Produced Results of the MICO Platform 168 5.2.2 MICO Extractor Description 170 5.2.3 Implementing Own MICO Extractors 171 5.2.4 MICO Orchestration Service - The MICO Bro- ker 173 5.3 From a Multimedia Metadata Workflow to a Self-Sustaining Metadata Cycle 174 5.4 An Extension of the Workflow Environment 179 contents xi 5.5 Multimedia Metadata Application Conclusion 184 iv experiments and evaluations 187 6 experiments and evaluations 189 6.1 Related Work - Overall ORdfM Evaluations 190 6.1.1 Quasthoff: General ORdfM Comparison 191 6.1.2 Quasthoff: ORdfM Implementation and Run- time Evaluations 192 6.2 Anno4j and Ontology Structure Experiments 196 6.2.1 Runtime Experiments with Generated Anno4j Domain Models 197 6.2.2 Runtime Experiments with Pre-Created Proxy Classes 205 6.2.3 Runtime Experiments with Generated Anno4j Domain Models and Multiple Varying Parame- ters 209 6.2.4 Conclusion of the Anno4j Evaluations 217 6.3 ViSIT - A Cultural Heritage Use Case for the ORdfM Library Anno4j 218 6.4 Recapitulation of Posed Research Questions 226 v summary and conclusion 233 7 résumé 235 7.1 Conclusion 235 7.2 Future Work and Outlook 239 vi appendix 241 a appendix 243 a.1 The MICO Project 243 a.1.1 Background of the MICO Project 243 a.1.2 Development Cycle of the MICO Broker 245 a.2 Further Informations and Appended Data 247 bibliography 285 LISTOFFIGURES Figure 1 The Role of Semantic Web Users that Partic- ipated in the Survey of Cardoso [41] (From Which the Numbers and the Table Are Adopted). 8 Figure 2 Visualisation of the Semantic Gap Problem. 11 Figure 3 The Evolution of a Web of Documents to a Web of Data. 23 Figure 4 The Evolution of the Semantic Web. 27 Figure 5 Interaction Between the Concepts Around the Semantic Web. 28 Figure 6 Two Simple Statements Presented as Graph. 31 Figure 7 Combined Statements Presented as Graph. 32 Figure 8 Exemplary Blank Node. 35 Figure 9 Example RDF Dataset with Two Named Graphs “cities:NewYork” and “cities:Chicago”. The Exemplary Namespace “http://examplecities.- com/” is Assumed for the Prefix “cities:”. 41 Figure 10 Exemplary RDF Graph Used for Showing the Different RDF Serialisation Formats. 43 Figure 11 Exemplary Web Annotation Illustrating the In- formation That the Human Face Displayed on a Given Picture is Barack Obama. 66 Figure 12 Exemplary Web Annotation from Figure 11 with more Metadata Information, Especially the Ad- dition of the Datatypes Text and Image.
Recommended publications
  • Domain Landscape Review and Data Value Chain Definition
    Ref. Ares(2018)2305100 - 30/04/2018 H2020 - INDUSTRIAL LEADERSHIP - Information and Communication Technologies (ICT) ICT-14-2016-2017: Big Data PPP: cross-sectorial and cross-lingual data integration and experimentation ICARUS: “Aviation-driven Data Value Chain for Diversified Global and Local Operations” D1.1 – Domain Landscape Review and Data Value Chain Definition Workpackage: WP1 – ICARUS Data Value Chain Elaboration Authors: University of Cyprus (UCY), Suite5, UBITECH, CINECA, AIA, PACE, ISI, CELLOCK, TXT, OAG Status: Final Classification: Public Date: 30/04/2018 Version: 1.00 Disclaimer: The ICARUS project is co-funded by the Horizon 2020 Programme of the European Union. The information and views set out in this publication are those of the author(s) and do not necessarily reflect the official opinion of the European Communities. Neither the European Union institutions and bodies nor any person acting on their behalf may be held responsible for the use which may be made of the information contained therein. © Copyright in this document remains vested with the ICARUS Partners. D1.1 - Domain Landscape Review and Data Value Chain Definition ICARUS Project Profile Grant Agreement No.: 780792 Acronym: ICARUS Aviation-driven Data Value Chain for Diversified Global and Local Title: Operations URL: http://www.icarus2020.aero Start Date: 01/01/2018 Duration: 36 months Partners UBITECH (UBITECH) Greece ENGINEERING - INGEGNERIA INFORMATICA SPA (ENG) Italy PACE Aerospace Engineering and Information Technology GmbH (PACE) Germany SUITE5 DATA
    [Show full text]
  • Qualifikationsprofil #10309
    QUALIFIKATIONSPROFIL #10309 ALLGEMEINE DATEN Geburtsjahr: 1972 Ausbildung: Abitur Diplom, Informatik, (TU, Kaiserslautern) Fremdsprachen: Englisch, Französisch Spezialgebiete: Kubernetes KENNTNISSE Tools Active Directory Apache Application Case CATIA CVS Eclipse Exchange Framework GUI Innovator ITIL J2EE JMS LDAP Lotus Notes make MS Exchange MS Outlook MS-Exchange MS-Office MS-Visual Studio NetBeans OSGI RACF SAS sendmail Together Turbine UML VMWare .NET ADS ANT ASP ASP.NET Flash GEnie IDES Image Intellij IDEA IPC Jackson JBOSS Lex MS-Visio ODBC Oracle Application Server OWL PGP SPSS SQS TesserAct Tivoli Toolbook Total Transform Visio Weblogic WebSphere YACC Tätigkeiten Administration Analyse Beratung Design Dokumentation KI Konzeption Optimierung Support Vertrieb Sprachen Ajax Basic C C# C++ Cobol Delphi ETL Fortran Java JavaScript Natural Perl PHP PL/I PL-SQL Python SAL Smalltalk SQL ABAP Atlas Clips Delta FOCUS HTML Nomad Pascal SPL Spring TAL XML Detaillierte Komponenten AM BI FS-BA MDM PDM PM BW CO FI LO PP Datenbanken Approach IBM Microsoft Object Store Oracle Progress Sybase DMS ISAM JDBC mySQL DC/Netzwerke ATM DDS Gateway HBCI Hub Internet Intranet OpenSSL SSL VPN Asynchronous CISCO Router DNS DSL Firewall Gateways HTTP RFC Router Samba Sockets Switches Finance Business Intelligence Excel Konsolidierung Management Projektleiter Reporting Testing Wertpapiere Einkauf CAD Systeme CATIA V5 sonstige Hardware Digital HP PC Scanner Siemens Spark Teradata Bus FileNet NeXT SUN Switching Tools, Methoden Docker Go Kubernetes Rational RUP
    [Show full text]
  • Integration of Web Apis and Linked Data Using SPARQL Micro-Services
    Integration of Web APIs and Linked Data Using SPARQL Micro-Services - Application to Biodiversity Use Cases Franck Michel, Catherine Faron Zucker, Olivier Gargominy, Fabien Gandon To cite this version: Franck Michel, Catherine Faron Zucker, Olivier Gargominy, Fabien Gandon. Integration of Web APIs and Linked Data Using SPARQL Micro-Services - Application to Biodiversity Use Cases. Information, MDPI, 2018, Special Issue ”Semantics for Big Data Integration”, 9 (12), 10.3390/info9120310. hal- 01947589 HAL Id: hal-01947589 https://hal.archives-ouvertes.fr/hal-01947589 Submitted on 7 Dec 2018 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Distributed under a Creative Commons Attribution| 4.0 International License information Article Integration of Web APIs and Linked Data Using SPARQL Micro-Services—Application to Biodiversity Use Cases † Franck Michel 1,*, Catherine Faron Zucker 1, Olivier Gargominy 2 and Fabien Gandon 1 1 I3S laboratory, University Côte d’Azur, CNRS, Inria, 930 route des Colles, 06903 Sophia Antipolis, France; [email protected] (C.F.Z.); [email protected] (F.G.) 2 Muséum National d’Histoire Naturelle, 36 rue Geoffroy Saint-Hilaire, 75005 Paris, France; [email protected] (O.G.) * Correspondence: [email protected] (F.M.) † This paper is an extended version of our paper published in Michel F., Faron Zucker C.
    [Show full text]
  • EAGLE EAGLE OER Learning Platform
    Deliverable Nature P Document Title Dissemination level EAGLE OER Learning Platform PU Contract Number Version 619347 1.0 EAGLE EnhAnced Government Learning www.fp7-eagle.eu FP7-ICT-2013-11 Objective 8.2 Technology-enhanced learning; Target outcome c): Holistic learning solutions for managing, reaching and engaging learners in the public administrations Deliverable D5.6 EAGLE OER Learning Platform WP 5 – OER Learning Platform Development Lead Participant: Fraunhofer FOKUS Approval Name / Department / Date Panel Partner short name Function Arun Prakash Juhi Gaba, System Quality Engineering Horst Friedrich, (SQC), Digital Public Services Author 01/07/2016 Thomas-Frederick (DPS) / WP5, T6.1, T6.2 Leader, Gordon / Developers Fraunhofer Dietmar Glachs / Internet of Things (IoT) / Author 01/07/2016 SRFG T5.3, T5.4 Leader, Developer Eric Tobias / Embedded Assessment Research 01/07/2016 Author LIST Group / T5.5 Leader, Developer Kheira Acem, Embedded Assessment Research Reviewer Céline Thomase / 26/07/2016 Group / Developers LIST This project has received funding from the European Union’s Seventh Framework Programme for research, technological development and demonstration under grant agreement N°310806. Deliverable Nature P Document Title Dissemination level EAGLE OER Learning Platform PU Contract Number Version 619347 1.0 Table of Contents 1 LIST OF FIGURES ........................................................................................................... II 2 GLOSSARY .................................................................................................................
    [Show full text]
  • Linked Data Architectural Components How-To Attach Linked Data Services to Legacy Infrastructure?
    Linked Data architectural components How-to attach linked data services to legacy infrastructure? Daniel Martini, Mario Schmitz, Günter Engelhardt Berlin, 27th of June, 2017 Organization ● Registered Association (non-profit): Funded ~ 2/3 by the german ministry for nutrition and agriculture ~ 400 members: experts from research, industry, extension… ~ 70 employees working in Darmstadt Managing lots of working groups, organizing expert workshops, represented in other committees, maintaining an expert network ● Tasks: Knowledge transfer from research into agricultural practice Supporting policy decision making by expertises Evaluating new technologies: economics, ecological impact… Providing planning data (investment, production processes…) to extension and farmers ● Role of Information Technology: Data acquisition: harvesting open data sources Data processing: calculating planning data from raw data Information provision: delivery to clients via ebooks, web, apps 3 Goals and requirements Deliver KTBL planning data in human and machine readable form alike: ● Machine classes: purchase prices, useful life, consumption of supplies… ● Standard field work processes: working time, machines commonly used under different regimes… ● Operating supplies: average prices, contents… ● Facilities and buildings: stables, milking machines and their properties ● … to reach a broader audience and enable further processing within software applications for the use of farmers, extension… 4 Problem statement ● There‘s data that wants to get shared available
    [Show full text]
  • Code Smell Prediction Employing Machine Learning Meets Emerging Java Language Constructs"
    Appendix to the paper "Code smell prediction employing machine learning meets emerging Java language constructs" Hanna Grodzicka, Michał Kawa, Zofia Łakomiak, Arkadiusz Ziobrowski, Lech Madeyski (B) The Appendix includes two tables containing the dataset used in the paper "Code smell prediction employing machine learning meets emerging Java lan- guage constructs". The first table contains information about 792 projects selected for R package reproducer [Madeyski and Kitchenham(2019)]. Projects were the base dataset for cre- ating the dataset used in the study (Table I). The second table contains information about 281 projects filtered by Java version from build tool Maven (Table II) which were directly used in the paper. TABLE I: Base projects used to create the new dataset # Orgasation Project name GitHub link Commit hash Build tool Java version 1 adobe aem-core-wcm- www.github.com/adobe/ 1d1f1d70844c9e07cd694f028e87f85d926aba94 other or lack of unknown components aem-core-wcm-components 2 adobe S3Mock www.github.com/adobe/ 5aa299c2b6d0f0fd00f8d03fda560502270afb82 MAVEN 8 S3Mock 3 alexa alexa-skills- www.github.com/alexa/ bf1e9ccc50d1f3f8408f887f70197ee288fd4bd9 MAVEN 8 kit-sdk-for- alexa-skills-kit-sdk- java for-java 4 alibaba ARouter www.github.com/alibaba/ 93b328569bbdbf75e4aa87f0ecf48c69600591b2 GRADLE unknown ARouter 5 alibaba atlas www.github.com/alibaba/ e8c7b3f1ff14b2a1df64321c6992b796cae7d732 GRADLE unknown atlas 6 alibaba canal www.github.com/alibaba/ 08167c95c767fd3c9879584c0230820a8476a7a7 MAVEN 7 canal 7 alibaba cobar www.github.com/alibaba/
    [Show full text]
  • Smart Campus Based on Linked Data Phd Thesis
    Smart Campus based on Linked Data PhD thesis Barnabás Szász Supervisor: Dr. András Micsik UNIVERSITY OF DEBRECEN Doctoral School of Informatics Debrecen, 2019 Okos egyetemi szolgáltatásokat megalapozó kapcsolt adathalmazok Egyetemi doktori (PhD) értekezés Szász Barnabás Témavezető: Dr. Micsik András DEBRECENI EGYETEM Természettudományi és Informatikai Doktori Tanács Informatikai Tudományok Doktori Iskola Debrecen, 2019 Ezen értekezést a Debreceni Egyetem Természettudományi Doktori Tanács a Informatikai Tudományok Doktori Iskola Elméleti számítástudomány, adatvédelem és kriptográfia doktori programja keretében készítettem a Debreceni Egyetem természettudományi doktori (PhD) fokozatának elnyerése céljából. Nyilatkozom arról, hogy a tézisekben leírt eredmények nem képezik más PhD disszertáció részét. Debrecen, 2019. ………………………… a jelölt aláírása Tanúsítom, hogy Szász Barnabás doktorjelölt 2014-2019 között a fent megnevezett Doktori Iskola Elméleti számítástudomány, adatvédelem és kriptográfia doktori programjának keretében irányításommal végezte munkáját. Az értekezésben foglalt eredményekhez a jelölt önálló alkotó tevékenységével meghatározóan hozzájárult. Nyilatkozom továbbá arról, hogy a tézisekben leírt eredmények nem képezik más PhD disszertáció részét. Az értekezés elfogadását javasolom. Debrecen, 2019. ………………………… a témavezető aláírása OKOS EGYETEMI SZOLGÁLTATÁSOKAT MEGALAPOZÓ KAPCSOLT ADATHALMAZOK SMART CAMPUS BASED ON LINKED DATA Értekezés a doktori (Ph.D.) fokozat megszerzése érdekében Írta: Szász Barnabás okleveles Programtervező
    [Show full text]
  • Testsmelldescriber Enabling Developers’ Awareness on Test Quality with Test Smell Summaries
    Bachelor Thesis January 31, 2018 TestSmellDescriber Enabling Developers’ Awareness on Test Quality with Test Smell Summaries Ivan Taraca of Pfullendorf, Germany (13-751-896) supervised by Prof. Dr. Harald C. Gall Dr. Sebastiano Panichella software evolution & architecture lab Bachelor Thesis TestSmellDescriber Enabling Developers’ Awareness on Test Quality with Test Smell Summaries Ivan Taraca software evolution & architecture lab Bachelor Thesis Author: Ivan Taraca, [email protected] URL: http://bit.ly/2DUiZrC Project period: 20.10.2018 - 31.01.2018 Software Evolution & Architecture Lab Department of Informatics, University of Zurich Acknowledgements First of all, I like to thank Dr. Harald Gall for giving me the opportunity to write this thesis at the Software Evolution & Architecture Lab. Special thanks goes out to Dr. Sebastiano Panichella for his instructions, guidance and help during the making of this thesis, without whom this would not have been possible. I would also like to express my gratitude to Dr. Fabio Polomba, Dr. Yann-Gaël Guéhéneuc and Dr. Nikolaos Tsantalis for providing me access to their research and always being available for questions. Last, but not least, do I want to thank my parents, sisters and nephews for the support and love they’ve given all those years. Abstract With the importance of software in today’s society, malfunctioning software can not only lead to disrupting our day-to-day lives, but also large monetary damages. A lot of time and effort goes into the development of test suites to ensure the quality and accuracy of software. But how do we elevate the quality of test code? This thesis presents TestSmellDescriber, a tool with the ability to generate descriptions detailing potential problems in test cases, which are collected by conducting a Test Smell analysis.
    [Show full text]
  • Waternomics Deliverable Template
    Ref. Ares(2015)1164079 - 17/03/2015 D3.1.1 Linked Water Dataspace Project Acronym: Waternomics Project Title: ICT for Water Resource Management Project Number: 619660 Instrument: Collaborative project Thematic Priority: FP7-ICT-2013.11 D3.1.1 © Waternomics consortium Page 1 of 84 Work Package: WP3 Due Date: 31/12/2014 Submission Date: Start Date of Project: 01/02/2014 Duration of Project: 36 Months Organisation Responsible of Deliverable: NUIG Version: 1.3 Status: Draft Author name(s): Daniele Bortoluzzi R2M Edward Curry NUIG Wassim Derguech NUIG Souleiman Hasan NUIG Fadi Maali NUIG Diego Reforgiato R2M Arkadiusz Stasiewicz NUIG Umair Ul Hassan NUIG Reviewer(s): Christos Kouroupetroglou Ultra4 Peter O’Donovan NUIG Schalk-Jan van Andel UNESCO-IHE Nick van de Giesen TU Delft Nature: R – Report P – Prototype D – Demonstrator O – Other Dissemination level: PU – Public CO – Confidential, only for members of the consortium (including the Commission) RE – Restricted to a group specified by the consortium (including the Commission Services) Project co-funded by the European Commission within the Seventh Framework Programme (2007-2013) © Waternomics consortium Page 2 of 84 619660 Revision history Version Date Modified by Comments 0.1 22/07/14 Edward Curry Initial Version of Document 0.2 18/12/2014 Wassim Derguech Add/Edit section minimal vocabulary 0.2 Dec 2014 Wassim Derguech Dataspace functional requirements 0.3 Dec 2014 Souleiman Hassan Linked Data 0.3 Dec 2014 Umair Ul Hasan Entity Management 0.3 Jan 2015 Souleiman Hasan Real-time data 0.3
    [Show full text]
  • RDF Triplestores and SPARQL Endpoints
    RDF triplestores and SPARQL endpoints Lecturer: Mathias Bonduel [email protected] LDAC summer school 2019 – Lisbon, Portugal Lecture outline • Storing RDF data: RDF triplestores o Available methods to store RDF data o RDF triplestores o Triplestore applications – databases – default graph – named graphs o List of triplestore applications o Comparing triplestores o Relevant triplestore settings o Communication with triplestores • Distributing RDF data: SPARQL endpoints o Available methods to distribute RDF data o SPARQL endpoints o Reuse of SPARQL queries o SPARQL communication protocol: requests and responses June 18, 2019 RDF triplestores and SPARQL endpoints | Mathias Bonduel 2 Storing RDF data June 18, 2019 RDF triplestores and SPARQL endpoints | Mathias Bonduel 3 Available methods to store RDF data • In-memory storage (local RAM) o Working memory of application (e.g. client side web app, desktop app) o Frameworks/libraries: RDFLib (Python), rdflib.js (JavaScript), N3 (JavaScript), rdfstore-js (JavaScript), Jena (Java), RDF4J (Java), dotNetRDF (.NET), etc. o Varied support for SPARQL querying • Persistent storage (storage drive) o RDF file/dump (diff. RDF serializations): TTL, RDF/XML, N-Quads, JSON-LD, N-triples, TriG, N3, TriX, RDFa (RDF embedded in HTML), etc. o RDF triplestore o (ontology editing applications: Protégé, Topbraid Composer, etc.) June 18, 2019 RDF triplestores and SPARQL endpoints | Mathias Bonduel 4 RDF triplestores “a database to store and query RDF triples” • Member of the family of graph/NoSQL databases • Data structure: RDF • Main query language: SPARQL standards • Oftentimes support for RDFS/OWL/rules reasoning • Data storage is typically persistent June 18, 2019 RDF triplestores and SPARQL endpoints | Mathias Bonduel 5 Triplestore applications – databases - default graph - named graphs • An RDF triplestore instance (application) can have one or multiple databases (repositories) • Each database has one default graph and zero or more named graphs o a good practice is to place TBox in a separate named graph.
    [Show full text]
  • Linked Data Notifications and Activitypub Client and Server Student: Bc
    ASSIGNMENT OF MASTER’S THESIS Title: Linked Data Notifications and ActivityPub Client and Server Student: Bc. Antonín Karola Supervisor: RNDr. Jakub Klímek, Ph.D. Study Programme: Informatics Study Branch: Web and Software Engineering Department: Department of Software Engineering Validity: Until the end of summer semester 2019/20 Instructions The student will get familiar with Linked Data, the RDF data model, the recent W3C Recommendations [1][2][3] and the Solid project [4], a recent activity of the inventor of the Web, Sir Tim Berners-Lee. The student will implement a client and a server supporting decentralized messaging on the Web according to the Linked Data Notifications [2] and ActivityPub [3] W3C Recommendations in support of the Web re- decentralization. The client part will be a new, user friendly messaging application. Based on the analysis of existing Solid server implementations, the student will determine what is missing in the existing implementations for the given task. The missing features will be implemented either as a new Solid server, or an existing implementation will be enhanced. The client and the server will be documented, evaluated, tested and published as open-source on GitHub. The tests will consist of unit tests and tests of compatibility with existing tools implementing the Recommendations. References [1] Linked Data Platform 1.0, W3C Recommendation, 2015, https://www.w3.org/TR/ldp/ [2] Linked Data Notifications, W3C Recommendation, 2017, https://www.w3.org/TR/ldn/ [3] ActivityPub, W3C Recommendation, 2018, https://www.w3.org/TR/activitypub/ [4] Solid, MIT, https://solid.mit.edu/ Ing. Michal Valenta, Ph.D. doc.
    [Show full text]
  • A Geosparql Compliance Benchmark Which Aims to Measure the Extent to Which an RDF Triplestore Complies with the Requirements Specified in the Geosparql Standard
    AGEOSPARQL COMPLIANCE BENCHMARK APREPRINT Milos Jovanovik Timo Homburg Ss. Cyril and Methodius Univesity in Skopje, N. Macedonia Mainz University Of Applied Sciences, Germany OpenLink Software, London, UK [email protected] [email protected] Mirko Spasic´ University of Belgrade, Serbia OpenLink Software, London, UK [email protected] February 12, 2021 ABSTRACT We propose a series of tests that check for the compliance of RDF triplestores with the GeoSPARQL standard. The purpose of the benchmark is to test how many of the requirements outlined in the standard a tested system supports and to push triplestores forward in achieving a full GeoSPARQL compliance. This topic is of concern because the support of GeoSPARQL varies greatly between different triplestore implementations, and such support is of great importance for the domain of geospatial RDF data. Additionally, we present a comprehensive comparison of triplestores, providing an insight into their current GeoSPARQL support. Keywords GeoSPARQL · Benchmarking · RDF · SPARQL 1 Introduction The geospatial Semantic Web [1] as part of the Semantic Web [2] represents an ever-growing semantically interpreted wealth of geospatial information. The initial research [3] and the subsequent introduction of the OGC GeoSPARQL standard [4] formalized geospatial vector data representations (WKT [5] and GML [6]) in ontologies, and extended the SPARQL query language [7] with support for spatial relation operators. arXiv:2102.06139v1 [cs.DB] 11 Feb 2021 Several RDF storage solutions have since adopted GeoSPARQL to various extents as features of their triplestore implementations [8, 9]. These varying levels of implementation may lead to some false assumptions of users when choosing an appropriate triplestore implementation for their project.
    [Show full text]