Ref. Ares(2015)1164079 - 17/03/2015

D3.1.1 Linked Water Dataspace

Project Acronym: Waternomics
Project Title: ICT for Water Resource Management
Project Number: 619660
Instrument: Collaborative project
Thematic Priority: FP7-ICT-2013.11


Work Package: WP3
Due Date: 31/12/2014
Submission Date:
Start Date of Project: 01/02/2014
Duration of Project: 36 Months
Organisation Responsible for Deliverable: NUIG
Version: 1.3
Status: Draft
Author name(s): Daniele Bortoluzzi (R2M), Edward Curry (NUIG), Wassim Derguech (NUIG), Souleiman Hasan (NUIG), Fadi Maali (NUIG), Diego Reforgiato (R2M), Arkadiusz Stasiewicz (NUIG), Umair Ul Hassan (NUIG)
Reviewer(s): Christos Kouroupetroglou (Ultra4), Peter O’Donovan (NUIG), Schalk-Jan van Andel (UNESCO-IHE), Nick van de Giesen (TU Delft)
Nature: R – Report | P – Prototype | D – Demonstrator | O – Other
Dissemination level: PU – Public | CO – Confidential, only for members of the consortium (including the Commission) | RE – Restricted to a group specified by the consortium (including the Commission Services)
Project co-funded by the European Commission within the Seventh Framework Programme (2007-2013)


Revision history

Version | Date | Modified by | Comments
0.1 | 22/07/2014 | Edward Curry | Initial version of document
0.2 | 18/12/2014 | Wassim Derguech | Add/edit section on minimal vocabulary
0.2 | Dec 2014 | Wassim Derguech | Dataspace functional requirements
0.3 | Dec 2014 | Souleiman Hasan |
0.3 | Dec 2014 | Umair Ul Hassan | Entity management
0.3 | Jan 2015 | Souleiman Hasan | Real-time data
0.3 | Jan 2015 | Arkadiusz Stasiewicz | OpenCube
0.4 | Jan 2015 | Christos Kouroupetroglou | Data inputs from WP4
0.4 | Jan 2015 | Wassim Derguech | Review + executive summary
0.5 | Jan 2015 | Domenico Perfido | Appendix C – Survey of Open Data for Water Management
0.6 | Jan 2015 | Peter O’Donovan | Review comments
0.6 | Jan 2015 | Schalk-Jan van Andel | Review comments
0.7 | Jan 2015 | Wassim Derguech | Updated deliverable based on reviews
0.8 | Jan 2015 | Wassim Derguech | Changed conclusion of Section 5 and removed Appendix B
1.0 | Jan 2015 | Wassim Derguech | Final version 1.0
1.1 | Feb 2015 | Wassim Derguech | Updated executive summary and introduction
1.2 | Feb 2015 | Wassim Derguech | Added section “About Waternomics”
1.3 | Feb 2015 | Souleiman Hasan | Updated Sections 5 and 6 to match latest findings after the February hackathon
1.4 | Mar 2015 | Wassim Derguech | Updates based on reviews


Copyright © 2015, Waternomics Consortium

The Waternomics Consortium (http://www.waternomics.eu/) grants third parties the right to use and distribute all or parts of this document, provided that the Waternomics project and the document are properly referenced.

THIS DOCUMENT IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS DOCUMENT, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.


Executive Summary

This deliverable reports on the current efforts towards designing a Linked Water Dataspace. A dataspace is an emerging information management approach for tackling heterogeneous data sources that supports requirements such as standardization, enrichment, and linking of data in an incremental manner. Designing and implementing a dataspace is the core of the Waternomics information platform and one of the major objectives of the Waternomics project. In the Waternomics vision, collecting, standardizing, enriching and linking water usage data coming from sensors, together with other relevant contextual data sources, is a key step needed for effective analytics to drive decision making (e.g., planning, adjustments and predictions) and to raise user awareness of water consumption. In this report, we detail the current design and implementation of the dataspace using Linked Data, a set of W3C-recommended best practices for exposing, sharing and connecting pieces of data, information and knowledge. The deliverable introduces the concepts of Linked Data, the Semantic Web and the Web of Data and how these are applied in the context of the Waternomics dataspace. The primary contribution of this report is a customized architecture for implementing a Linked Water Dataspace. The deliverable also presents a first version of a water data ontology used for semantically describing sensors and their data. The design of the ontology is informed by the requirements of the pilots for importing or exporting data from/to the dataspace.


Table of Contents

Executive Summary
Table of Contents
List of Figures
List of Tables
List of Listings
1 Introduction
  1.1 Work Package 3 Objectives
  1.2 Purpose and Target Group of the Deliverable
  1.3 Relations to other Activities in the Project
  1.4 Document Outline
  1.5 About Waternomics
2 Functional Dataspace Requirements
3 Linked Data
  3.1 Overview
  3.2 The Semantic Web
  3.3 Web of Data
  3.4 Linked Water Data
  3.5 Conclusion
4 State of the Art Analysis
  4.1 Data Adapters
  4.2 Linked Data Platforms
    4.2.1 Virtuoso
    4.2.2 LinDA
    4.2.3 Pubby
    4.2.4 Apache Marmotta
    4.2.5 LDP4j
    4.2.6 OpenCube Toolkit
    4.2.7 LOD2 Linked Data Stack
    4.2.8 CKAN
  4.3 Entity Management
    4.3.1 Master Data Management
    4.3.2 Crowdsourced Entity Management
    4.3.3 Collaborative Entity Management
  4.4 Lambda Architecture
    4.4.1 DRUID Platform
    4.4.2 Apache Spark
  4.5 Conclusion
5 A Realtime Linked Water Dataspace
  5.1 Dataspace Overview
  5.2 The Dataspace CATALOG Service
    5.2.1 Managed Entities
    5.2.2 Information Sources
  5.3 The Dataspace QUERY Service and the Realtime Aspect
    5.3.1 Data Cube Vocabulary
    5.3.2 Data quality and Data Linking
    5.3.3 Data exploration
    5.3.4 OpenCube for Water Data
    5.3.5 Data Publishing
    5.3.6 Data Reuse
  5.4 Joining the Dataspace and Adapters
  5.5 Conclusion and Plan
6 Concrete Cases
  6.1 The Building Management System Joins the Dataspace
  6.2 A Water Sensor Joins the Dataspace
7 Summary
Appendix A: Minimal Vocabulary
Appendix B: Survey of Open Data for Water Management
8 References


List of Figures

Figure 1: Relationships between D3.1.1 and other activities in Waternomics
Figure 2: Relationships between identifier, resource, and representation [7]
Figure 3: Basic HTTP architecture
Figure 4: An example HTML Web page
Figure 5: The Semantic Web layer cake diagram [9]
Figure 6: An example RDF graph [11]
Figure 7: An example RDF N3 serialization [11]
Figure 8: An example SPARQL query
Figure 9: Content negotiation in Linked Data [11]
Figure 10: 5 stars scheme to publish Linked Open Data on the Web [3]
Figure 11: Dereferencing http://dbpedia.org/resource/Water [15]
Figure 12: Linked Open Data cloud [16]
Figure 13: Ultrawrap architecture
Figure 14: Google Refine RDF extension user interface
Figure 15: Mapping tabular data to RDF using Grafter DSL
Figure 16: Sample SPARQL query in TARQL
Figure 17: Equivalent SPARQL query
Figure 18: Linked Data Platform types
Figure 19: Virtuoso Architecture Diagram
Figure 20: Pubby Architecture Diagram
Figure 21: OpenCube Toolkit sample page
Figure 22: Statistical Workbench dashboard
Figure 23: CKAN dashboard
Figure 24: Illustration of master data management based solution for entity data management
Figure 25: Relative volumes of data relevant to an enterprise, where MDM manages small part of data available in internal and external sources
Figure 26: In the continuum of entity management approaches the number of contributors increases from top-down to bottom-up. A hybrid approach would help expand the boundaries of enterprise entity management
Figure 27: Community collaboration based approach for management of entity data
Figure 28: An architectural overview of the DRUID [56]
Figure 29: The Apache Spark family [44]
Figure 30: Querying different data sources with Spark SQL [44]
Figure 31: Integrating Spark SQL with Hive [44]
Figure 32: Standard data connectivity [44]
Figure 33: Standard data connectivity [44]
Figure 34: Spark cluster deployment [44]
Figure 35: System Architecture
Figure 36: Linked Water Dataspace Overview – Components View
Figure 37: Linked Water Dataspace – The CATALOG Service View
Figure 38: Waternomics Dataspace – Data sets related to Galway
Figure 39: NEB Historical Logs in Waternomics dataspace
Figure 40: Linked Water Dataspace Lambda Serving Layer – The QUERY Service View
Figure 41: Outline of the Data Cube vocabulary
Figure 42: TARQL Data Provider configuration screen
Figure 43: OpenCube Chart Visualization widget
Figure 44: RETLEMM process for exposing entities as Linked data
Figure 45: Linked Water Dataspace Lambda Architecture Realization
Figure 46: An example realization of Lambda Batch Layer in Linked Water Dataspace
Figure 47: Linked Water Dataspace Lambda Speed Layer
Figure 48: Phase 1 and 2 of the design of the RDF minimal vocabulary for Waternomics
Figure 49: Phase 3 of the design of the RDF minimal vocabulary for Waternomics
Figure 50: Entity Relationship Model: Sensor
Figure 51: RDF graph example for Sensor
Figure 52: Entity Relationship Model: Observation_Type
Figure 53: RDF graph example for Observation_Type
Figure 54: Entity Relationship Model: Sensor_Observation_Type captured as Observes relation
Figure 55: RDF graph example for Sensor_Observation_Type
Figure 56: Entity Relationship Model: Observation
Figure 57: RDF graph example for Observation
Figure 58: Entity Relationship Model: Aggregated_Observation
Figure 59: RDF graph example for Aggregated_Observation
Figure 60: GEOSS logo [47]
Figure 61: GEOSS Portal [47]
Figure 62: RECODE logo
Figure 63: Look and feel of Vowl
Figure 64: Screenshot of the WISDOM project website
Figure 65: Architecture of the sMAP project
Figure 66: Pachube old website
Figure 67: Architecture of the SMART Knowledge Base [61]


List of Tables

Table 1: Analysis of the State of the Art
Table 2: Proposed Approaches for the Dataspace
Table 3: Linked Water Dataspace Support Services and Requirements
Table 4: Required attributes for the entity Sensor
Table 5: Required attributes for the entity Outlet
Table 6: Required attributes for the entity Site
Table 7: Summary of Requirements and Approaches
Table 8: Mapping from entity relationship item to RDF concepts
Table 9: Mapping Sensor Entity Relationship Model to RDF
Table 10: Sensor Table Example
Table 11: Mapping Observation_Type Entity Relationship Model to RDF
Table 12: Observation_Type Table Example
Table 13: Mapping Sensor_Observation_Type Entity Relationship Model to RDF
Table 14: Sensor_Observation_Type Table Example
Table 15: Mapping Observation Entity Relationship Model to RDF
Table 16: Observation Table Example
Table 17: Mapping Aggregated_Observation Entity Relationship Model to RDF
Table 18: Aggregated_Observation Table Example


List of Listings

Listing 1: Sensor JSON Data
Listing 2: Sensor Dimensional JSON Data
Listing 3: Sensor Example RDF N3
Listing 4: Observation_Type Example RDF N3
Listing 5: Sensor_Observation_Type Example RDF N3
Listing 6: Observation Example RDF N3
Listing 7: Aggregated_Observation Example RDF N3


1 Introduction

The goal of Waternomics is to explore how ICT can help households, businesses and municipalities reduce their water consumption and losses. A key component of the Waternomics information platform aims at collecting water consumption and contextual information from different sources, to be used for effective data analytics to drive decision making (e.g., planning, adjustments and predictions) and to raise user awareness of water consumption. A key outcome of the work carried out in Work Package 3 is the design of a Linked Water Dataspace. A dataspace is an emerging information management approach for tackling heterogeneous data sources that supports requirements such as standardization, enrichment, and linking of data in an incremental manner. The primary contribution of this report is a first version of a customized architecture for implementing such a Linked Water Dataspace. This architecture will be further refined depending on the requirements of the other tasks and work packages. The final version of the Linked Water Dataspace will be reported in D3.1.2.

1.1 Work Package 3 Objectives

The objective of Work Package 3 (WP3) is to develop the project software and user environment that will deliver water information services to various targeted stakeholders. It allows linking sensors, data management systems and water meters. More specifically, the objectives of WP3 are as follows:
• To develop a linked dataspace able to capture and store data from various sources
• To develop a set of services based on the web as a platform, allowing applications to interact with and use the data stored in the linked dataspace
• To provide a set of applications consisting of various customisable and personalised components
• To deploy and customize the developed applications in appropriate sites for validation and evaluation

WP3 will receive as input the high-level system architecture, usage and exploitation cases, and KPIs for reporting from WP1. WP3 will provide as output the water information services platform for the pilots and future exploitation, as a primary project outcome/result. In particular, WP3 will develop and provide user-friendly content, applications, and platforms that enable all stakeholders (utilities, commercial users and domestic users) to take decisions that result in reduced water consumption and losses, and in increased overall awareness of drinking water supply issues.

1.2 Purpose and Target Group of the Deliverable

The objective of this deliverable is to report on the progress of Task 3.1 (Data source adapters customization / development) and Task 3.2 (Linked water data infrastructure design). Both tasks contribute to the management of a dataspace containing water data that is later used by support services and dedicated applications. We call this dataspace the Linked Water Dataspace. This dataspace will be further refined as the project evolves, and a final version will be available in D3.1.2.


A Linked Water Dataspace is a central technological concept of the Waternomics information platform. Its role consists of providing an interoperability space with common access mechanisms to the data. Important components of the dataspace are the data source adapters, which take sensors and other data sources as input and provide a linked data cloud rich in knowledge and semantics about water consumption. These adapters require a predefined semantic model for describing both sensor data and metadata. The semantic model for sensor data is a formal ontology developed within the Waternomics project. This model has been designed with respect to the data models used by existing management systems. The ontologies and adapters of the Waternomics project are proposed in order to:
• create a linked data cloud,
• facilitate data integration across multiple platforms,
• facilitate data access and interoperability.
The main target groups for this deliverable are designers of water management systems as well as ontology designers and technicians. The minimal vocabulary proposed in this document is also of interest to domain experts in water management.

1.3 Relations to other Activities in the Project

Figure 1 illustrates the relations of this deliverable to other activities in the Waternomics project. These relations are represented as links numbered from 1 to 5 and are described as follows:
Link 1: The design of the Linked Water Dataspace follows the set of dataspace requirements identified in Section 2.4.2 of Deliverable D1.3. These requirements are further refined and explained in Section 2.
Links 2 and 3: This deliverable reports on the design of the Linked Water Dataspace. This feeds into D3.2 (Support Services APIs and Components Libraries) and D3.3 (Waternomics Apps), as both require the technical details of the dataspace for the development of support services and applications.
Link 4: Pilot planning in WP5 also uses this deliverable, as it constitutes a detailed background analysis and the technical design work towards the Linked Water Dataspace.
Link 5: A revised version of the Linked Water Dataspace proposed in this deliverable (D3.1.1) will be detailed in D3.1.2.


Figure 1: Relationships between D3.1.1 and other activities in Waternomics

1.4 Document Outline

The remainder of this document is organised as follows:
• Section 2: Functional Dataspace Requirements – defines the set of requirements for building the Linked Water Dataspace.
• Section 3: Linked Data – introduces the concepts of the Semantic Web and the Web of Data and how they can be applied in the context of linked water data.
• Section 4: State of the Art Analysis – discusses relevant contributions that can be used for designing and implementing the Waternomics dataspace.
• Section 5: A Realtime Linked Water Dataspace – defines the proposed architecture for managing water data and exposing it in the Linked Water Dataspace.
• Section 6: Concrete Cases – presents two concrete cases of datasets joining the dataspace: the NUIG engineering building's BMS and the VTEC water sensor.
• Section 7: Summary – concludes the deliverable.
• Appendix A: Minimal Vocabulary – maps the entity relationship model of D1.3 to ontological concepts as a proposal for a minimal vocabulary for modelling sensors and their observations.
• Appendix B: Survey of Open Data for Water Management – investigates relevant contributions that use open data for water management.

1.5 About Waternomics

Climate change, increased urbanization and a growing world population are several of the factors driving global challenges for water management. In fact, the World Economic Forum has cited “The Water Supply Crises” as a major risk to global economic growth and environmental policies in the next 10 years. In parallel, the United Nations has called for intensified international collaboration. To help reduce water shortages, Waternomics will explore the technologies and methodologies needed to successfully reduce water consumption and losses in households, companies and municipalities. Waternomics is a three-year EU-funded project, started in February 2014, that will develop and introduce ICT as an enabling technology to manage water as a resource, increase end-user conservation awareness and effect behavioural changes, and avoid waste through leak detection. In saving water, energy will also be conserved (treatment and pumping), as will the CO2 associated with energy production. Unique aspects of WATERNOMICS include personalized feedback about end-user water consumption, the development of a methodology for the design and implementation of systematic and standards-based water resource management systems, new sensor hardware developments to make water metering more economic and easier to install, and the introduction of forecasting and fault detection diagnosis to the analysis of water consumption data.

WATERNOMICS will be demonstrated in three high-impact pilots that target three different end users/stakeholders:
• Domestic users in Greece, implemented by a water utility
• A corporate operator in Italy, provided by a major EU airport
• A public and mixed-use demonstration in Ireland

Through these contributions, WATERNOMICS will pioneer a new dialogue between water stakeholders. It will enable the introduction of Demand Response principles and open business models through an innovative human centric approach that uses personalized water data, water availability based pricing, and gamification of water usage statistics. To maximize impact, the project highlights business development, exploitation planning, and outcome oriented dissemination.


2 Functional Dataspace Requirements

Deliverable D1.3, Section 2, identified the functional requirements as well as the set of generic architecture functions that the Linked Water Dataspace should respect when dealing with input data collected from sensors and other data sources. These requirements also specify how the output data is made available to support services and applications. In this section, we carry out a further analysis of these requirements and propose the following list of functional requirements for the dataspace:

• Standardisation: The system will serve as a common dataspace for different stakeholders. Consequently, the data exchanged and published by the system should be standardised. The same data and support services will be available to all applications. With respect to this requirement, we propose to use RDF for describing the various entities managed within the system. An analysis of existing RDF data adapters is reported in Section 4.1. Details about the proposed RDF vocabulary and its alignment with existing ontologies are described in Appendix A: Minimal Vocabulary.

• Consuming Open Data: By definition, Open Data “can be freely used, modified, and shared by anyone for any purpose” [1]. In our context, the system should be able to make use of relevant open data assets for proper analytics. Possible scenarios for consuming open data include the prediction of water consumption using open weather data. Consuming open data requires a proper selection and evaluation of data sources in order to select the most suitable ones for decision support [2]. An initial survey of Open Data sources for water management is presented in Appendix B: Survey of Open Data for Water Management.

• Publishing Linked Data: The data produced by adapters or support services should be published in the dataspace with respect to the linked data principles1: available on the web, structured, not using a proprietary format, using Uniform Resource Identifiers (URIs) to denote entities, and linked to other data sets [3]. These principles are discussed in Section 3 and Section 5.3.

• Data Linking: When water data is published to the dataspace, it has to be linked to other data sets. This linking is very useful for ensuring optimal data management and integration. It enhances the (re)use of the data and supports discovering new knowledge from water data put into a wider context. It is important to assess and determine which data sets are relevant to be linked with water data. Section 5.3.2 provides further details about this requirement and how it is covered by the dataspace.

• Real-time data / events: The system will be handling continuous streams of data coming from multiple sensors and data sources. The system should be able to manage large quantities of data in real-time. Real-time processing of data requires the development of algorithms and tools for parallel processing of simple and complex events (see Section 6.2).

• Real-time Analytics: Data analytics needs to run continuously so that consumer applications can make timely decisions, i.e. the analytics should make use of the latest data available. The speed layer of the Lambda architecture covers this requirement. Details are given in Section 6.2.

• Data integration: The system analytics should integrate both real-time and historical data for effective decision support. Therefore, the system should be capable of seamlessly analysing water consumption and providing an integrated view of the data being analysed. The use of the Lambda Architecture covers this requirement as, in essence, it facilitates the integration of real-time (speed layer) and historical (batch layer) data. For a discussion of existing implementations of the Lambda architecture, we refer to Section 4.4.

• Heterogeneity of Sensor Data Events: The system will be handling data from a wide variety of sensors and consequently a wide variety of data formats. The dataspace needs to be able to handle applications’ queries across data formats with respect to their semantic similarity. Additionally, the dataspace should manage data produced by services developed in other work packages, such as leakage detection data from WP4.

• Enrichment of Sensor Data Events with Open Data: Raw sensor data mainly reports the observed values of a particular property. This data requires additional contextual information, such as the location of the sensor, in order to deliver accurate decision analytics. The dataspace needs to enrich sensor data with the relevant information required by support services and applications.

In order to cover these requirements, we propose to use Linked Data as a set of best practices for publishing, sharing and interlinking structured data on the web. In such a context, if all the data of our infrastructure is open and linked to other open data sets from the web, it becomes easier to create a water information system combining various distributed data repositories. This enables access to and sharing of water data without barriers for building effective services and applications.

1 http://5stardata.info/
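To make the enrichment requirement concrete, the following minimal Turtle sketch shows a sensor reading linked to contextual and open data; all namespaces, instance names and properties here are hypothetical placeholders rather than the project vocabulary:

@prefix ex:  <http://example.org/waternomics#> .
@prefix geo: <http://www.w3.org/2003/01/geo/wgs84_pos#> .

# The raw sensor reading: a flow observation produced by a sensor
ex:observation1 ex:observedBy ex:flowSensor42 ;
                ex:litresPerMinute "12.4" .

# Contextual enrichment: where the sensor is installed
ex:flowSensor42 ex:locatedIn ex:engineeringBuilding ;
                geo:lat "53.279" ; geo:long "-9.060" .

# Link into open data, putting the reading in a wider context
ex:engineeringBuilding ex:locatedInCity <http://dbpedia.org/resource/Galway> .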


3 Linked Data

In the following we give an account of the evolution towards Linked Data. That evolution can be traced back to the early days of the Web, and then to the realisation that the Web had to be extended with structured data and semantics, giving birth to the Semantic Web. Linked Data then appeared within the Semantic Web realm to give more attention to the importance of publishing data on the Web and to how Semantic Web technologies can serve that goal. Finally, this section discusses how water data can benefit from this technology.

3.1 Overview

The World Wide Web was conceived and first implemented at CERN by Tim Berners-Lee in 1989 [4]. The Web builds upon the Internet and uses a system of hypertext documents interlinked via hyperlinks. A piece of information in such a system is the Web document, which may contain text, images, videos and other multimedia content. Web documents, or pages, are typically hosted by Web servers, which can be queried to retrieve a page. Typically, a web page can be abstracted and looked at as a resource, which may or may not exist in the real world. For example, Figure 2 illustrates the relationships between a set of concepts that form the basics of the Web architecture. The resource in Figure 2 is a weather report, which has an identifier expressed in the Web's standard identification system, namely the Uniform Resource Identifier (URI) [5], in this case "http://weather.example.com/oaxaca". The resource itself is not an information resource but rather just a resource. Nonetheless, it can be captured in information systems, e.g. the Web architecture, using an information resource such as a textual document. The Web uses the HyperText Markup Language (HTML) [6] as the main way to convey informational representations of resources in order to serve them on the Web, as shown in Figure 2.

Figure 2: Relationships between identifier, resource, and representation [7]

For web pages represented as HTML to be served to the user, the Web relies on the HyperText Transfer Protocol (HTTP), which defines a set of steps that govern the conversation between a Web client and a Web server to deliver the Web page to the client side, as shown in Figure 3. In an HTTP session, the client asks the server for the HTML representation of a resource. For this to take place, the client uses the identifier of the resource, based on its URI. This URI is encapsulated in an HTTP request with an appropriate HTTP method, mainly the GET method.


The whole request/response session takes place over an Internet connection, such as a TCP connection. The Web server hosting the document replies to the GET request with an HTTP response, which is a textual message containing, among other things, the HTML content.
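As an illustration of such a session, a minimal request/response exchange for the weather report resource used above might look as follows (the body is abbreviated):

GET /oaxaca HTTP/1.1
Host: weather.example.com
Accept: text/html

HTTP/1.1 200 OK
Content-Type: text/html

<html> ...the HTML representation of the weather report... </html>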


Figure 3: Basic HTTP architecture

HTML is a language for defining the layout of content. The essence of HTML Web content is that the text contains tags that govern the way it is displayed on the client side. Figure 4, for example, shows a simple HTML document. The <title> tag tells the client application what the title of the window displaying the content should be. The <h1> tag tells the client to display its content as a heading, possibly with a bigger and thicker font, while the <p> tag displays its content as a paragraph beneath the heading.

<html>
  <head>
    <title>Example Page Title</title>
  </head>
  <body>
    <h1>A Heading</h1>
    <p>A paragraph.</p>
  </body>
</html>

Figure 4: An example HTML Web page

A client that can establish HTTP sessions and interpret and display HTML content accordingly is a universal client, as it works for all applications conforming to the Web standards. Such a client is called a browser, with many available options including Firefox, Chrome, Safari, and Internet Explorer. As can be seen in the above discussion, the Web is designed to convey textual content mainly to humans. It uses a set of standards to make this scalable, as well as a set of standards such as HTML and CSS to display content to the end client in the way the original author intended. The Web also employs hyperlinks to allow users to navigate through the content from one page to another. Nonetheless, the Web is not a friendly place for machines. A software agent cannot make sense of the content. Thus, scalable automated understanding of Web content is not possible in its original form.

3.2 The Semantic Web

In 2001, Berners-Lee et al. addressed the problem of automated processing of Web content by software agents through a radical reconsideration of the Web's stack of standards, in a seminal paper in Scientific American [8].


Figure 5: The Semantic Web layer cake diagram [9]

Figure 5 illustrates the main components of what the Semantic Web was conceived to be. One can still recognize the URI as the main means of identifying resources on the Semantic Web. HTTP is also an important part, as it can deliver any content and not necessarily HTML alone. Nonetheless, a drastic evolution can be seen in the new architecture concerning the information representation of resources. In the Semantic Web, resources are represented by structured data. The structured data follow a graph-based representational framework called the Resource Description Framework (RDF) [10]. In RDF, a piece of information is a triple of the form (subject, predicate, object). Subjects, predicates and objects are typically URIs of resources. URIs can point at resources in the same domain or in other domains. The result is a big graph of interconnected data on the Semantic Web. Figure 6 shows an example RDF graph that describes the resource cygri as being of type Person, with the name Richard Cyganiak and a location somewhere near Berlin. One can notice the use of prefixes to abbreviate URIs, as in the original use of URIs on the Web.

Figure 6: An example RDF graph [11]

RDF can be serialized in various formats, including XML and N3, as shown in Figure 7.

# RDF links taken from Tim Berners-Lee's FOAF profile
<http://www.w3.org/People/Berners-Lee/card#i>
    owl:sameAs <http://dbpedia.org/resource/Tim_Berners-Lee> ;
    foaf:knows <http://richard.cyganiak.de/foaf.rdf#cygri> .

# RDF links taken from Richard Cyganiak's FOAF profile
<http://richard.cyganiak.de/foaf.rdf#cygri>
    foaf:knows <http://www.w3.org/People/Berners-Lee/card#i> ;
    foaf:topic_interest <http://dbpedia.org/resource/Database_management_system> .

Figure 7: An example RDF N3 serialization [11]


To define the semantics of terms used in RDF graphs, ontology languages such as RDF Schema [12] and OWL [13] are used. Ontologies are a way to explicitly define semantics where every concept or property has a URI. The language has constructs to specify relationships between concepts, such as rdfs:subClassOf, which defines that a class A is a kind of another class B. A substantial part of the Semantic Web that has been actively developed is the SPARQL query language for RDF [14], which is similar to SQL. In SPARQL, users can specify queries composed mainly of graph patterns that can then be submitted to an RDF data store. The RDF data store matches the query against its internal data and returns a set of results as specified. For example, Figure 8 shows an example SPARQL query that asks for resources of type Person together with their acquaintances.

PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?name ?anotherPerson
WHERE {
  ?person a foaf:Person .
  ?person foaf:name ?name .
  ?person foaf:knows ?anotherPerson .
}

Figure 8: An example SPARQL query

The original Semantic Web vision defines upper layers in the stack, including rules which add to the capacity and expressivity of ontology languages. Proofs, trust and other aspects are also part of the Semantic Web, inherited from the knowledge engineering community rather than from the Web community. There has been a realisation that the upper layers of the stack are harder to tackle, especially with little data available on the Web. As a result, the focus has shifted to the lower layers, to publish data on the Web using the most basic principles of the Web and Semantic Web standards, resulting in a new chapter of the Semantic Web called Linked Data.
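For instance, a single triple in Turtle suffices to state such a subclass relationship; the ex: namespace below is a hypothetical placeholder:

@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix ex:   <http://example.org/vocab#> .

# Every flow sensor is also a sensor
ex:FlowSensor rdfs:subClassOf ex:Sensor .

Given this triple, a client or reasoner can infer that any resource typed as ex:FlowSensor is also an ex:Sensor.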

3.3 Web of Data

Linked Data targets the exposure of the data hidden in servers behind the tiers of Web applications, making it available through the Web. That serves the important goal of integrating data silos all over the Web through Web standards. The key observation is that most of that data is already structured, and thus making it conform to the standards of the Web and the Semantic Web is a relatively easy step. Berners-Lee [3] summarizes the Linked Data principles in four steps:
1. Use URIs as names for things.
2. Use HTTP URIs so that people can look up those names.
3. When someone looks up a URI, provide useful information, using the standards (RDF, SPARQL).
4. Include links to other URIs, so that they can discover more things.

Extending the abstract view of the Web architecture, URIs can still be used to identify entities or resources. Representations of such resources, though, are now extended with RDF content. When a software agent (client) asks the Web server for a resource, it can ask for the RDF representation of that resource through what is called content negotiation; see Figure 9.

Once the RDF representation is returned to the client, the client can parse it according to the serialization used and then make use of the links, as well as the references to ontologies and vocabularies, in order to make sense of the relationships between entities and perform higher-level reasoning.


Figure 9: Content negotiation in Linked Data [11]

Linked Data has been seen as a cornerstone of the Open Data initiative, and thus Berners-Lee [3] defined a 5-star scheme that can be used to publish Linked Data on the Web, as shown in Figure 10.

★ Available on the web (whatever format) but with an open licence, to be Open Data
★★ Available as machine-readable structured data (e.g. Excel instead of an image scan of a table)
★★★ As (2) plus non-proprietary format (e.g. CSV instead of Excel)
★★★★ All the above plus: use open standards from W3C (RDF and SPARQL) to identify things, so that people can point at your stuff
★★★★★ All the above plus: link your data to other people’s data to provide context

Figure 10: 5 stars scheme to publish Linked Open Data on the Web [3]

Many projects have been launched in response to the need for making as much Linked Data available as possible and realising the Semantic Web vision through the Linked Data principles. One notable project is DBpedia [15], an online repository of structured content available in RDF that can be queried through SPARQL. DBpedia is extracted from Wikipedia and thus contains a large amount of broad and comprehensive structured information. If a resource is looked up in DBpedia, content negotiation determines whether an HTML representation or an RDF representation is returned, and the content is then sent back in the HTTP response. Figure 11 shows the response to a request for the URI http://dbpedia.org/resource/Water.

Figure 11: Dereferencing http://dbpedia.org/resource/Water [15]
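As a sketch of what such a dereferencing exchange looks like at the HTTP level (headers abbreviated; the redirect target follows DBpedia's usual pattern but is shown here only as an assumption):

GET /resource/Water HTTP/1.1
Host: dbpedia.org
Accept: text/turtle

HTTP/1.1 303 See Other
Location: http://dbpedia.org/data/Water.ttl

The client then follows the redirect and receives the RDF description of the resource in Turtle.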


Projects like DBpedia have emerged over time, with resources being cross-used throughout all of them. As a result, a big cloud of inter-linked data stores has emerged, as illustrated in Figure 12.

Figure 12: Linked Open Data cloud [16]

3.4 Linked Water Data

Based on the discussion above, drinking water supply spans multiple activities, starting from extraction and production, through distribution and consumption, and ending with recycling. Thus, water data is likely to exist in many data silos, which makes Linked Data a good candidate for tackling water data integration and allowing a holistic view of all aspects related to water. By projecting the previous Linked Data principles onto the water case, we arrive at the main steps that shall guide the publication of Linked Water Data:

1. Use URIs as names for things: where a thing here can refer to sensors monitoring water flow, locations where water is generated, spaces where water is consumed, people dealing with the water lifecycle, taps, pipes, buildings, etc.

2. Use HTTP URIs so that people can look up those names: software clients shall use HTTP to negotiate content with water data servers. Such clients shall be able to establish HTTP sessions with servers. Clients can be user interfaces, dashboards, data analytics tools, stream processing engines, collectors for data management systems, etc.

3. When someone looks up a URI, provide useful information, using the standards (RDF, SPARQL): RDF shall be used to model water data; thus a proper ontological model for water needs to be defined using Web ontology languages, with concepts such as WaterFlow or CubicMeter, and predicates such as Consumes or FlowsThrough. SPARQL can then be used by software clients to query the resulting RDF data.

4. Include links to other URIs, so that they can discover more things: URIs used by different parties in a water lifecycle shall refer to each other, as would be the case for multiple departments in an airport, or multiple partners in the project. By doing this, a holistic view of water management can be achieved.


It is also important to reuse existing vocabularies that have proven effective and are compliant with most sensor modelling standards. Indeed, we plan to use the Semantic Sensor Network (SSN) ontology [17] for modelling sensors and their observations. In fact, SSN is built to be compliant with the W3C and Open Geospatial Consortium (OGC) [18] standard SensorML [19].
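As a minimal sketch of how a water flow sensor and one of its observations could be modelled with SSN (only the ssn: terms come from the SSN ontology; the waternomics: namespace and instance names are hypothetical placeholders):

@prefix ssn: <http://purl.oclc.org/NET/ssnx/ssn#> .
@prefix waternomics: <http://example.org/waternomics#> .

# A sensor and the property it observes
waternomics:flowSensor1 a ssn:Sensor ;
    ssn:observes waternomics:waterFlow .

# One observation produced by that sensor
waternomics:obs1 a ssn:Observation ;
    ssn:observedBy waternomics:flowSensor1 ;
    ssn:observedProperty waternomics:waterFlow .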

3.5 Conclusion

We have seen how Linked Data came into existence and how it can benefit the process of publishing water data. In the following section we review relevant technologies that, together with the Linked Data principles, can be used to systematically build a Linked Water Dataspace.


4 State of the Art Analysis

In the following we review important and relevant contributions in four main areas: data adapters, platforms for linked dataspaces, entity management, and architectures for providing real-time decision support when sensor data is a part of the dataspace.

4.1 Data Adapters

There exist a number of tools and standards to transform structured data into RDF. For relational databases, R2RML [20] is a W3C Recommendation that describes a language for expressing customised mappings from relational databases to RDF datasets. An R2RML mapping is itself written in RDF and provides powerful capabilities to transform relational databases into RDF data. D2RQ1 and Ultrawrap [21] are two popular implementations of the R2RML standard. D2RQ is free and open source, available under the Apache License V2.0. Ultrawrap is a commercial product from Capsenta. As shown in Figure 13, Ultrawrap has two parts: compile and server. The compile component is responsible for the RDF mapping, while the server component is responsible for managing SPARQL queries.
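To give a flavour of R2RML, the following minimal mapping sketch turns rows of a hypothetical SENSOR table into RDF resources; the table, column and ex: vocabulary names are placeholders:

@prefix rr: <http://www.w3.org/ns/r2rml#> .
@prefix ex: <http://example.org/waternomics#> .

<#SensorMapping>
    rr:logicalTable [ rr:tableName "SENSOR" ] ;
    # One subject URI per row, built from the primary key
    rr:subjectMap [
        rr:template "http://example.org/sensor/{SENSOR_ID}" ;
        rr:class ex:Sensor
    ] ;
    # Each mapped column becomes a predicate-object pair
    rr:predicateObjectMap [
        rr:predicate ex:location ;
        rr:objectMap [ rr:column "LOCATION" ]
    ] .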

Figure 13: Ultrawrap architecture2

When it comes to general CSV and Excel data, there is no standard to describe the mapping. RML [22] defines an extension of R2RML to support transforming tabular data into RDF. RML abstracts the items that are specific to relational databases in R2RML and proposes a more general specification that can support tabular data in general. The recently formed W3C CSV on the Web working group3 is working on defining a standard for mapping CSV to RDF; the final version of the standard has not been released yet. Basic conversion of tabular data into RDF produces a single resource per row. Properties of the produced resources are derived from the column headers and have cell contents as their literal values. All generated RDF resources have the same structure, a star-shaped graph, and custom graphs cannot be produced. Convert2RDF [23] is an example of a tool supporting this type of naive conversion.

1 http://d2rq.org/
2 http://capsenta.com/architecture/
3 http://www.w3.org/2013/csvw/wiki/Main_Page


Figure 14: Google Refine RDF extension user interface1

The approach described in [24] is to obtain basic RDF data by automatic conversion and then gradually improve its quality. Quality improvement is achieved by repeatedly applying a selection of predefined “enhancement operations”. Examples of enhancement operations are converting literals into resources and defining rdfs:subClassOf or rdfs:subPropertyOf relations. Applying enhancement operations, while enabling the export of rich RDF data, results in new versions of the data and adds the overhead of managing multiple versions.

RDF123 [25] and XLWrap [26] support rich conversion and full control over the shape of the produced RDF data. Both use a template graph to describe the structure of the intended RDF data, with predefined expressions to enable the use of cell contents in the result. XLWrap supports not only tabular data, but also more complex data layouts such as multiple tables and multiple sheets.

The RDF Extension of Google Refine1 provides a graphical user interface (GUI) that allows easy definition of mappings into RDF. It builds on top of the powerful capabilities of Google Refine to support cleansing as well as transforming tabular data. It is available as open source under the BSD License. Figure 14 shows the main interface of the RDF Extension.

Grafter2 is a product developed by Swirrl. It defines a Domain Specific Language (DSL) that allows the specification of transformation pipelines converting tabular data into linked data formats. Grafter can perform the transformation to RDF in a streaming fashion, allowing large amounts of data to be processed. Grafter is an open source project available under the Eclipse Public License. Figure 15 shows an example mapping using the Grafter DSL.

1 http://refine.deri.ie
2 http://grafter.org


Figure 15: Mapping tabular data to RDF using Grafter DSL2

TARQL1 (Transformation SPARQL) is a SPARQL-based data mapping language. It uses SPARQL syntax to transform CSV/TSV data into RDF. The transformation in TARQL is expressed as a SPARQL CONSTRUCT query (see Figure 16 and Figure 17).

Figure 16: Sample SPARQL query in TARQL
Figure 17: Equivalent SPARQL query

The input data is used as a table of bindings. This means that the data itself can be modified using SPARQL syntax (the tool supports SPARQL 1.1). The conversion process has multiple configuration options and supports the conversion of large files through streaming. TARQL is available as a command-line tool. It is written in Java, based on Apache ARQ2, and released as an open source project under the BSD License.
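As a minimal sketch of such a mapping for a CSV file with columns sensorId and flow (file, column and ex: names are hypothetical; TARQL binds each column header to a SPARQL variable of the same name):

PREFIX ex: <http://example.org/waternomics#>

CONSTRUCT {
  ?sensorUri ex:flowReading ?flow .
}
WHERE {
  # Build a URI for the sensor from the sensorId column
  BIND (URI(CONCAT('http://example.org/sensor/', ?sensorId)) AS ?sensorUri)
}

This would be invoked from the command line as, for example: tarql mapping.sparql readings.csv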

1 https://github.com/cygri/tarql
2 http://jena.apache.org/documentation/query/


4.2 Linked Data Platforms

A Linked Data Platform is a set of patterns for building RESTful HTTP services that describe their resources as Linked Data. As shown in Figure 18, a resource may be fully represented in RDF (a Linked Data Platform RDF Source (LDP-RS)) or have a representation that is not RDF, such as a binary file (a Linked Data Platform Non-RDF Source (LDP-NR)). While being managed by an LDP, each resource is referred to as a Linked Data Platform Resource (LDPR).

Figure 18: Linked Data Platform types

The W3C provides a recommendation for the architecture of the Linked Data Platform 1.01 (LDP). LDP defines a set of rules for HTTP read-write operations (accessing, updating, creating and deleting resources on the server) on web servers.

Most Linked Data Platform Resources (LDPRs) are domain-specific and contain data about an entity in some domain (e.g. governmental or scientific). To group collections of linked documents or information resources, LDP defines the Linked Data Platform Container (LDPC). This idea is very beneficial for application models that use collections of resources. In the following, we describe existing projects that support the implementation of LDP.
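As an illustrative sketch of the read-write interaction defined by LDP (all URLs are hypothetical), creating a new member resource amounts to POSTing an RDF document to a container:

POST /sensors/ HTTP/1.1
Host: example.org
Content-Type: text/turtle

<> a <http://example.org/waternomics#Sensor> .

HTTP/1.1 201 Created
Location: http://example.org/sensors/sensor42

The newly minted URI in the Location header can subsequently be read with GET, modified with PUT, and removed with DELETE.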

4.2.1 Virtuoso

Virtuoso Universal Server2 is a multi-model data system. A single universal server combines the functionality of a file server, a virtual database, a relational database management system and an object-relational database, as well as RDF graph data management.

As shown in Figure 19, Virtuoso has built-in support for LDP functionality and can be used both as an LDP client and as an LDP server. The Virtuoso LDP client generates HTTP requests and processes HTTP responses according to the defined rules; the Virtuoso LDP server processes HTTP requests and generates HTTP responses according to those rules.

Virtuoso serves RDF data via a Linked Data interface and a SPARQL endpoint. The data can be stored in the Virtuoso data store or created on the fly from relational databases based on a predefined mapping. Virtuoso Universal Server has a proprietary license, but there is an Open Source version available under the GPLv2 license.

1 http://www.w3.org/TR/ldp/
2 http://virtuoso.openlinksw.com/


Figure 19: Virtuoso Architecture Diagram2

4.2.2 LinDA

LinDA is an acronym for LINked Data Analysis [1]. The general idea of the project is to provide a central platform that allows the Linked Open Data (LOD) community to analyse the LOD Cloud. The first implementation includes four tools for Linked Data analysis:
• a VoID description generator for RDF datasets,
• a formal concept analysis tool,
• a schema index generator,
• a schema information analysis tool.

LinDA is a data-driven platform. All results and datasets are cached in the system. The LinDA web interface provides information about all uploaded datasets and an overview of the available services and computed structure analyses.

4.2.3 Pubby

Pubby1 is a Linked Data frontend for SPARQL endpoints, implemented as a Java web application. The main idea of the project is to provide a Linked Data interface for endpoints that are otherwise accessible only by SPARQL client applications using the SPARQL protocol.

1 http://wifo5-03.informatik.uni-mannheim.de/pubby/


Figure 20: Pubby Architecture Diagram1

Pubby's configuration uses Turtle syntax. It consists of a mapping that translates the endpoint's resource URIs into URIs handled by Pubby. The tool requests the mapped URI from the SPARQL endpoint, converts the URIs, and passes the response back to the client. An overview of the Pubby architecture is depicted in Figure 20. There are a few limitations of Pubby: for example, it requires the SPARQL endpoint to answer DESCRIBE queries, multiple datasets may not work as expected, and it does not support hash URIs. The last official release was Pubby version 0.3.3 in 2011, but the code base is regularly updated. Pubby is available as an Open Source project licensed under the Apache License, Version 2.0.

4.2.4 Apache Marmotta

Apache Marmotta1 is an Open Platform for Linked Data. It is implemented as a Service-Oriented Architecture and consists of a collection of modules, one of which is the LDP module, whose goal is to provide an Open Source implementation of an LDP. The current Apache Marmotta version (3.3) is compliant with Linked Data Platform 1.0 but is not yet complete, which means that there are some implementation restrictions. The main features of the platform are:
• Read-Write Linked Data
• RDF triple store
• SPARQL endpoint
• Data caching
Marmotta is one of the most active projects of the Apache Software Foundation2. The project is available under the Apache 2.0 license.

4.2.5 LDP4j

LDP4j3 is a Java-based Open Source framework for the development of read-write Linked Data applications. The framework provides components useful for the development of LDP client and LDP server applications. It handles LDP communication, letting developers focus on the application-specific business logic. Moreover, LDP4j provides a set of middleware services for the development of read-write Linked Data applications. LDP4j is available under the Apache 2.0 license.

1 http://marmotta.apache.org/
2 https://blogs.apache.org/foundation/entry/the_asf_asks_have_you2
3 http://www.ldp4j.org

4.2.6 OpenCube Toolkit

OpenCube1 is the acronym of a project funded by the EU Seventh Framework Programme; the full name of the project is Publishing and Enriching Linked Open Statistical Data for the Development of Data Analytics and Enhanced Visualization Services. The project aims at developing software tools that facilitate the publishing of high-quality Linked Statistical Data and the reuse of distributed Linked Statistical Data in data analytics and visualizations. OpenCube is an ongoing project, and the final results are expected by the end of 2015. A beta version of the OpenCube Toolkit2 (see Figure 21 for a sample page) is already available at the project website. The software is built upon the Open Source Community Edition of the Information Workbench3 platform, which was used to ensure generic low-level functionality such as shared data access, logging and monitoring. The OpenCube Toolkit is still in development.

Figure 21: OpenCube Toolkit sample page

The OpenCube Toolkit functionalities can be categorised as follows:
• Data Publishing: The platform allows data conversion to RDF from multiple formats: legacy tabular data (such as CSV/TSV files), relational databases, XML files and more. Moreover, data can be imported from external portals, such as a CKAN instance (an Open Source data management system). RDF data can be uploaded directly to the system or imported from a selected URL or SPARQL endpoint.
• Data Reusing: The idea behind the project is to develop tools that allow easy data integration, analysis and visualisation. The OpenCube Toolkit user interface is based on a template system and widgets. A widget is an interface element which can be configured and embedded into a wiki page; for example, it can display a list of the corresponding semantic resources.
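The statistical data handled by the toolkit is represented with the W3C RDF Data Cube vocabulary (discussed further in Section 5.3.1). As a minimal sketch of a single observation, where the ex: dataset, dimension and measure names are hypothetical placeholders:

@prefix qb: <http://purl.org/linked-data/cube#> .
@prefix ex: <http://example.org/waternomics#> .

# One cell of a statistical cube: consumption for one site in one period
ex:obs1 a qb:Observation ;
    qb:dataSet ex:waterConsumptionDataset ;
    ex:refPeriod "2014-12" ;
    ex:site ex:engineeringBuilding ;
    ex:consumedLitres 10432 .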

1 http://opencube-project.eu/
2 http://opencube-toolkit.eu/
3 http://www.fluidops.com/en/portfolio/information_workbench/


Figure 22: Statistical Workbench dashboard

4.2.7 LOD2 Linked Data Stack

The LOD2 Linked Data Stack1 is a large-scale project funded by the EU Seventh Framework Programme. The project aims at providing a variety of tools for Linked Data publication and visualization. A unified environment and ease of use shall improve the quality and coherence of the published data. Moreover, the expected impact is a lower entry barrier for data publishers and users and high-performance RDF data management. Above all, it aims to establish trust on the Linked Data Web. The LOD2 Linked Data Stack defines the following steps for the Linked Data life cycle:
• Storage
• Authoring
• Interlinking
• Classification
• Quality
• Evolution / Repair
• Search / Browsing / Exploration

Based on the needs of the statistical domain, the LOD2 Statistical Workbench (see the dashboard in Figure 22) was released. It is a specific configuration of the LOD2 Linked Data Stack. The platform provides an interface that allows users to find information, transform it to RDF, refine it and visualise it once the analysis is made. The acquired data can also be published. The platform is available as a collection of Debian packages and as pre-installed virtual machine images. The LOD2 Linked Data Stack targets Ubuntu 12.04 LTS as its operating system.

1 http://lod2.eu/


4.2.8 CKAN

CKAN1 is an Open Source data portal software (see the dashboard in Figure 23). CKAN provides a set of tools for sharing, publishing and finding data. Moreover, it provides a rich RESTful JSON API for querying and accessing dataset information.

Figure 23: CKAN dashboard

The main functionalities of the platform are:
• Data Storage
  - Raw data
  - Metadata
  - Data versioning
  - Data access restrictions (public / private)
• Data Catalog
  - Searching
  - Tagging
• Visualization
  - Table view
  - Graphing
  - Mapping
  - Imaging
  - Custom solutions / extensions
• Theming & CMS integration
CKAN is released as Open Source software.
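For example, datasets can be searched through CKAN's action API with a plain HTTP request (the host below is a placeholder for a concrete CKAN instance):

GET http://ckan.example.org/api/3/action/package_search?q=water

The response is a JSON object whose "result" field lists the matching datasets together with their metadata.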

4.3 Entity Management

Information about the core entities within water management processes is critical to support accurate analysis and decision making. In the following we summarize different entity management approaches.

1 http://ckan.org/


4.3.1 Master Data Management

Master Data Management (MDM) [27]–[30] is a process-oriented approach that involves both technical implementation and organizational support for creating a definitive source of reference data about enterprise entities. MDM involves the implementation of strict control over business processes through policies defined for data creation, update and removal. This control is achieved under the guidance of an MDM council, which includes members from senior management, business managers and data stewards [31]. The MDM council not only defines procedures for data acquisition but also defines precise business rules to gauge and improve the quality of entity data.

Figure 24: Illustration of master data management based solution for entity data management

Recently, master data management has become a buzzword in the enterprise software industry because of its potential benefits, if successfully implemented. However, this approach is heavily centralized and labour intensive, and the cost and effort in terms of expertise can become very high, as depicted in Figure 24. MDM councils are responsible for the inclusion or exclusion of sources from integration efforts, as well as for the definition of business rules guaranteeing certain levels of data quality, in a top-down manner. Personnel within the organization are given stewardship roles to run frequent quality checks. The end result of all these upfront efforts is an authoritative source of enterprise master data with a single version of the truth about entities, which is utilized by end users in their daily reporting and analysis tasks. The main benefit of a successful MDM implementation is readily available high-quality data about enterprise entities, in effect reducing the work of business analysts. Standardization of information development across lines of business is a secondary benefit. Although the notion of MDM has been very attractive for enterprises, the actual success rate of these projects has fallen short of expectations. Recent studies estimate that more than 80% of data integration projects in enterprises either fail or overrun their budget [32], [33]. The significant upfront costs in terms of development effort and organizational change make the undertaking difficult to achieve across large enterprises. The concentration of data management and stewardship among a few highly skilled individuals, such as developers and data experts, also proves to be a bottleneck. Due to the limited number of skilled human resources, only a small percentage of enterprise data comes under management. Subsequently, the scalability of MDM becomes a major issue when new sources of information are added over time. Furthermore, as the cost of data storage decreases day by day, the amount of data relevant to an enterprise and its entities is increasing exponentially [32], [34]. Enterprises are unable to cope with the scale of data generated within their boundaries. Similarly, as the Web of Data becomes important and as enterprises collaborate more, there will be a need for enterprises to manage relevant data existing outside their boundaries, as illustrated in Figure 25. Moreover, external data sources from business partners further exacerbate this problem.

© Waternomics consortium Page 34 of 84 619660

Figure 25: Relative volumes of data relevant to an enterprise, where MDM manages a small part of the data available in internal and external sources

4.3.2 Crowdsourced Entity Management

Web 2.0 has changed the role of humans on the Internet from information consumers to information contributors. Owing to this change, there has been an explosion in the number of systems that gather the people most suitable to solve a problem through open calls. In this direction, crowdsourcing [35] techniques have been applied to recommendation systems [36], question answering [37]–[39], and knowledge creation [40], [41]. For example, Wikipedia1 follows a document-centric approach for collaborative knowledge creation and management [40]. Likewise, there have been efforts to develop entity-centric repositories of authoritative structured information for web-scale use, such as Freebase2 [42], UniProt3 [43] and OKKAM4 [44]. In this regard, Freebase and OKKAM are targeted specifically towards the creation of authoritative knowledge bases of mostly structured entities. Both approaches rely on contributions from a community of web users for long-term success. Freebase is a web-based entity store of mostly structured data, initially created by extracting information from Wikipedia and other sources [42], [45]. Users are allowed to create their own entity types and entities in their personal space. Freebase can be considered a large curated database with strict administrative controls over public entities. It is supported by a question answering service called RABJ [46]. The objective of this service is to support data processing activities with human supervision by allowing users to distribute specific data quality questions to a community of end-users. The OKKAM project focuses on creating a web-scale entity naming system (ENS) for linked data [47], similar in design to the domain name system responsible for web address management. Since the RDF data representation relies on URIs for identifying entities, OKKAM uses a set of minimal descriptors of entities to maintain uniqueness. A canonical URI is created for each entity, along with the set of all URIs describing the same entity in different sources. Entity linkage based on a name-feature-matching algorithm is central to successful quality management [48] in OKKAM. Furthermore, web users are allowed to subscribe to entities of their interest, in turn monitoring changes and contributing to data quality efforts.

1 http://www.wikipedia.org 2 http://www.freebase.com 3 http://www.uniprot.org 4 http://www.okkam.org


Moving away from the public domain, the Cimple1 project concentrates on information management for specialized communities, for example database researchers or bio-informatics [49]. As a proof of concept, a prototype (DBLife2) was created following a top-down, compositional and incremental approach [50]. Developers initially design the system by specifying the entities to be extracted and integrated from unstructured sources, following an operator-based design approach. The system then automatically expands its coverage based on mentions of other sources in the data, while managing quality with the help of end users.

4.3.3 Collaborative Entity Management

Considering the limitations of master data management and large-scale crowdsourcing, a hybrid approach for the management of entities was proposed [51]–[53], as shown in Figure 26.

Figure 26: In the continuum of entity management approaches, the number of contributors increases from top-down to bottom-up. A hybrid approach helps expand the boundaries of enterprise entity management

The hybrid approach utilizes community participation to increase the quality of data lying outside of existing MDM efforts. In the proposed hybrid setting, experts (e.g. developers, data stewards, and data analysts) are able to specify the data integration and quality requirements of entity consolidation, with the objective of outsourcing recurring tasks to non-experts within or outside the enterprise. Non-experts are defined as people with limited technical knowledge of data management and integration; however, they can be knowledgeable within their particular domain. Figure 27 highlights the distribution of integration and stewardship efforts from experts to non-experts, which differentiates the proposed approach from existing expert-dependent approaches.

Figure 27: Community collaboration based approach for management of entity data

Web data management techniques are generally designed to perform better in terms of the volume of data and the number of users. On the other hand, master data management technologies underline the

1 http://pages.cs.wisc.edu/~anhai/projects/cimple/ 2 http://dblife.cs.wisc.edu/

significance of tight control over the data acquisition, integration and management process, due to the criticality of business decision making as well as audit requirements. CAMEE aims to exploit the strengths of collaborative entity management, combined with top-down control of master data quality, to expand the boundaries of data under management [51], [54], [55]. CAMEE builds on the concept of human computation to formalize explicit feedback for data quality and uncertainty management tasks within entity consolidation.

4.4 Lambda Architecture

The Lambda architecture was proposed with the aim of allowing seamless ingestion and processing of streaming event data. The stream of events can be sourced from a variety of systems such as sensors, database logs, website logs, etc. In the following, we discuss two popular reference implementations of the Lambda architecture.

4.4.1 DRUID Platform

DRUID [56] is a data store designed to be deployed as the serving layer in an implementation of the Lambda architecture. The primary design objective of the data store is to ingest large-scale streaming data and enable real-time queries over it. Query processing is seamless across real-time and historical data. Figure 28 gives a high-level overview of the components of Druid. There are four types of processing nodes in a Druid cluster, as described below:
• Real-time nodes are mainly concerned with ingesting streaming event data. The real-time nodes aim to provide real-time data for immediate query processing. They also submit data chunks for long-term storage.
• Historical nodes maintain the historical data gathered over time by the real-time nodes. Additionally, these nodes allow the ingestion of batch data as a bulk load. Essentially, these nodes are responsible for supporting queries spanning multiple dimensions and time periods.
• Coordination nodes are responsible for managing the data on the historical nodes. These nodes decide when to load new data, how to partition data, when to discard old data, etc.

Figure 28: An architectural overview of the DRUID [56]


• Broker nodes manage the query processing functionality by sending data requests to the appropriate real-time and historical nodes. These nodes use metadata information to locate the required data and perform a merge when receiving results from different nodes.
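To make the interaction with the broker node concrete, the following is a minimal Python sketch of a Druid timeseries query posted to a broker over HTTP. The data source name, metric name, and broker host and port are illustrative assumptions, not part of any deployed system.

import json
import requests  # assumes the requests library is installed

# Hypothetical data source and metric names, for illustration only.
query = {
    "queryType": "timeseries",
    "dataSource": "water_consumption",
    "granularity": "hour",
    "intervals": ["2015-02-01/2015-03-01"],
    "aggregations": [
        {"type": "doubleSum", "name": "consumption", "fieldName": "consumption"}
    ]
}

# The broker locates the relevant real-time and historical nodes and
# merges their partial results into a single response.
response = requests.post("http://broker.example.org:8080/druid/v2/",
                         data=json.dumps(query),
                         headers={"Content-Type": "application/json"})
for row in response.json():
    print(row["timestamp"], row["result"]["consumption"])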

4.4.2 Apache Spark

Spark was initially introduced as a framework for distributed parallel programming for tasks whose requirements differ from those of common MapReduce applications, mainly analytics and machine learning jobs that need iterative reuse of a working set of data objects [43]. Spark achieves this through an abstraction called resilient distributed datasets (RDDs), which are distributed, read-only collections of objects that are recoverable. Apache Spark is an open source engine for large-scale data processing [44]. Spark is designed to beat popular MapReduce frameworks such as Hadoop, especially for iterative jobs such as machine learning and interactive analytics [43]. It is also important to note that Apache Spark is not meant as a replacement for the existing stack; it rather fulfils a different use case for real-time processing. Figure 29 illustrates the main components and libraries that form the overall Apache Spark family.
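The following minimal PySpark sketch illustrates the RDD abstraction and the in-memory reuse of a working set across several passes; the input path and record layout are assumptions for illustration.

from pyspark import SparkContext

sc = SparkContext(appName="rdd-sketch")

# Parse a (hypothetical) CSV file of sensor readings and cache the
# resulting RDD so that iterative jobs can reuse the working set in memory.
readings = sc.textFile("hdfs:///data/sensor_readings.csv") \
             .map(lambda line: float(line.split(",")[2])) \
             .cache()

# Two passes over the same cached working set, as in iterative
# analytics or machine learning jobs.
total = readings.sum()
count = readings.count()
print("mean reading:", total / count)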

Figure 29: The Apache Spark family [44].

Spark SQL allows the querying of Spark RDDs using the standard SQL language. It provides interfaces in multiple programming languages such as Java and Scala, allowing the execution of SQL queries as well as complex algorithms for data analytics. Thanks to Schema RDDs, Spark SQL gives standard access to multiple formats such as relational data, Hive data, and JSON files, as shown in Figure 30. It also allows full integration with Hive, so Hive tables can be plugged into the Spark data warehouse, as shown in Figure 31. Spark SQL also provides seamless integration with existing Business Intelligence tools through standard data access connectivity such as JDBC and ODBC, as shown in Figure 32.
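As a brief illustration of this capability, the sketch below registers JSON sensor events as a table and queries it with standard SQL using the Spark 1.x Python API; the file path and field names are assumptions.

from pyspark import SparkContext
from pyspark.sql import SQLContext

sc = SparkContext(appName="sparksql-sketch")
sqlContext = SQLContext(sc)

# Load JSON events into a SchemaRDD (Spark 1.x API) and register it
# as a temporary table for SQL querying.
events = sqlContext.jsonFile("hdfs:///data/sensor_events.json")
events.registerTempTable("events")

# Standard SQL over the registered table.
totals = sqlContext.sql(
    "SELECT SensorID, SUM(consumption) AS total "
    "FROM events GROUP BY SensorID")
for row in totals.collect():
    print(row)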

Figure 30: Querying different data sources with Spark SQL [44].

Figure 31: Integrating Spark SQL with Hive [44].


Figure 32: Standard data connectivity [44].

MLlib is a scalable machine learning library in Spark, allowing the seamless execution of machine learning and data analytics algorithms over a Spark data warehouse. MLlib 1.1 contains algorithms such as linear SVM and logistic regression, classification and regression trees, k-means clustering, etc. It is usable from Java, Scala and Python and is also executable over Hadoop data sources. GraphX is the Spark library for unifying ETL, data analysis and graph data processing. It allows the manipulation of graph data as graphs of collections, as well as transforming, joining, and writing custom graph algorithms using the Pregel API. GraphX is a good candidate for manipulating Linked Data in the Waternomics dataspace. Spark Streaming is the Spark library to manipulate, query, and transform data on the fly, which is important for the speed layer in the Lambda architecture adopted for the Waternomics Linked Dataspace. Spark Streaming allows the developer to write stream applications in the same way as batch applications, which enables code reusability. It provides high-level operators as well as window operators. Spark Streaming is fault tolerant and follows exactly-once semantics. Spark Streaming operates over DStreams, which are streams of RDDs, as shown in Figure 33.
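The sketch below illustrates the DStream model and a window operator in the Spark 1.x Python API: per-sensor totals over the last minute, recomputed every ten seconds. The socket source and the line format are assumptions made to keep the example self-contained; in the dataspace, events arrive from Kafka instead.

from pyspark import SparkContext
from pyspark.streaming import StreamingContext

sc = SparkContext(appName="streaming-sketch")
ssc = StreamingContext(sc, 10)  # 10-second micro-batches

# A DStream is a sequence of RDDs; here events arrive as CSV lines
# of the (assumed) form "sensor_id,consumption" on a socket source.
lines = ssc.socketTextStream("localhost", 9999)
pairs = lines.map(lambda l: (l.split(",")[0], float(l.split(",")[1])))

# Window operator: per-sensor consumption over the last 60 seconds,
# recomputed every 10 seconds -- batch-style code over a stream.
totals = pairs.reduceByKeyAndWindow(lambda a, b: a + b, None, 60, 10)
totals.pprint()

ssc.start()
ssc.awaitTermination()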

Figure 33: Spark Streaming DStreams as streams of RDDs [44].

Spark can be deployed standalone or in a cluster deployment mode, as shown in Figure 34.

Figure 34: Spark cluster deployment [44].

4.5 Conclusion

Throughout the review of related work above, we have seen different technologies that can be used to implement the adapters for Linked Dataspaces. A data catalog is one of the primary requirements for realizing a dataspace; therefore, we propose to use CKAN for this purpose. The catalog serves as a registry of data sources, schemas, and entities. A summary of the challenges behind the remaining functional requirements of a Linked Dataspace is provided in Table 1.

Table 1: Analysis of the State of the Art
Functional Requirement | Challenge
Standardisation | Critical water entities need to be standardized and their identifiers mapped across different data sources.
Consuming Open Data | Various open data sources can have various types/formats.
Publishing Linked Data | Published entities, schemas, open data, and aggregate data may not be known to interested applications.
Data Linking | Mappings between entities and schemas with RDF and OWL relationships can be implicit and difficult to manage and maintain.
Real-time data / events | Events need to be consumed in a reliable, decoupled mode, with low latency.
Real-time Analytics | The need to serve real-time and historical data in aggregated form to applications.
Data integration | Full data integration is time consuming and forms an overhead that delays putting applications into use.
Heterogeneity of Sensor Data Events | Events are semantically heterogeneous, and agreeing on a universal schema to support exact event processing is not scalable.
Enrichment of Sensor Data Events with Open Data | Sensor data events can lack data and need to be complemented dynamically with open data, rather than dedicating ad-hoc enrichment logic to event engines.


5 A Realtime Linked Water Dataspace

Deliverable D1.3-Section 3 discusses an overall system architecture that contains three main layers: the hardware, the data, and the software, as shown in Figure 35.

Figure 35: System Architecture.

In this section, we dig deeper into the data layer, which is represented by the Linked Water Dataspace. Based on the review of the current state of the art, we propose the following approaches for handling the requirements of the Linked Dataspace, as shown in Table 2.

Table 2: Proposed Approaches for the Dataspace
Requirement | Approach
Standardisation | Standard-compliant ontologies such as SSN will be used for describing sensors and their data. An entity management service standardizes critical entities and maps their identifiers in different data sources. We propose to extend the CKAN data catalog for this purpose.
Consuming Open Data | One of the support services included in the dataspace crawls weather data and integrates it in prediction services. Various open data sources can have various types/formats, and an app might not know which open dataset is relevant to a particular task. Each source is registered in the CKAN data catalog as an explicit indication of joining the dataspace. We propose to provide custom adapters for each open data source to make them available in the dataspace.
Publishing Linked Data | The data produced by every support service will be published in RDF using Linked Data principles. Entities, schemas, open data, and aggregate data are published according to Linked Data principles, in RDF format. We propose to register each RDF source in the CKAN data catalog as an explicit indication of joining the dataspace.
Data Linking | Relevant open data sources will be integrated in the dataspace through explicit links between entities. Mappings between entities and schemas are added as RDF and OWL relationships. We propose to maintain high-level mappings in the CKAN data catalog, with low-level mappings maintained by each source.
Real-time data / events | The dataspace is designed with respect to the Lambda architecture, which covers real-time event processing. We propose a complex event processing engine for managing live sensor events, and the deployment of a scalable message-oriented middleware for passing data between real-time sources and applications. We propose to use middleware based on the publish/subscribe pattern, such as Apache Kafka.
Real-time Analytics | Real-time analytics is considered in the platform as part of the speed layer of the Lambda architecture, seamlessly serving real-time and historical data in aggregated form to applications. We propose to implement the serving layer of the Lambda architecture using a Metamarkets DRUID cluster.
Data integration | The Lambda architecture was in essence proposed to deal with both historical and real-time data; this data integration is ensured by relevant support services. We follow a pay-as-you-go paradigm and ingest, integrate, and aggregate real-time data. We propose to implement the speed and batch layers of the Lambda architecture by employing Apache Spark for the ingestion and aggregation of both real-time and historical data.
Heterogeneity of Sensor Data Events | The platform is designed to handle a variety of sensor types. Consequently, for each sensor type a dedicated collector is designed that collects data and converts it into RDF for further processing. An approximate semantic matching model is used to process sensor events.
Enrichment of Sensor Data Events with Open Data | Sensor readings are further enriched by dedicated support services using open data, dynamically enriching sensor events with open data sources to provide context to the data.


5.1 Dataspace Overview

Figure 36: Linked Water Dataspace Overview - Components View

The Linked Water Dataspace shown in Figure 36 is a collection of water datasets along with a set of services that support the dataspace. The dataspace is designed as an incremental view of how water datasets join the computational space targeted by applications. In contrast to the classical one-time integration of datasets, which causes a significant overhead, the Linked Water Dataspace adopts a pay-as-you-go paradigm. Water datasets join the space in an incremental manner: the more interfaces they expose, the more links they provide, and the more linked dataspace services they support, the more integrated into the dataspace they become. The diagram in Figure 36 illustrates the components view of the Linked Water Dataspace. We can recognize three main concepts:
1. Datasets, such as weather data, water sensor data, building management system data, etc. These form the actual content of the water dataspace and are the basis for all the insights that can be drawn from it. Datasets can join and leave the dataspace; in fact, this can be very dynamic, as in the case of dynamic sensor environments. Joining the dataspace requires some "cost" to be paid. This cost takes various forms: registration into a catalog so the dataset becomes visible to others, conformance to a schema or mapping to other schemas, exposure of data in a set of formats such as the RDF serialization JSON-LD, etc. In the Linked Water Dataspace we adopt a pattern in which the publisher of the data is mainly responsible for paying this cost. This is a very pragmatic feature, as it allows the dataspace to grow and improve gradually. It is a quite scalable concept, and is followed in large-scale environments such as the Web.
2. Adapters (or Interfaces) are the technical facades of the datasets that other members of the dataspace can talk to. Such facades are a way to quantify the degree of involvement of a dataset in the dataspace. For example, a dataset that provides a JSON-LD interface allows


structured queries to be executed, and is thus superior to, and more integrated into the dataspace than, a document that is only exposed through keyword search.
3. Linked Dataspace Services, which form the backbone of the dataspace. While data is crucial for the space, the support services are the platform that allows datasets to be visible, queryable, integratable, searchable, monitorable, and curatable. For instance, for a sensor to become visible to the dataspace, it must register itself in a catalog along with some information on how to get its data, how often it reports, how precise it is, etc. Such a service is provided by the CATALOG service in Figure 36.
Besides these three concepts, there is also the concept of a Relationship between two datasets, or between a dataset and a service. Relationships are omitted from Figure 36 for clarity, but an example would be a "replica" relationship between Druid and Data Cube for some datasets, as will be discussed later in this section. Applications surround the dataspace and make use of its support services to interact with the datasets. For instance, a realtime dashboard checks out the realtime sources from the CATALOG service, and then consumes data from the EVENT service. A data analytics app discovers data from the CATALOG service and runs analytics algorithms over dimensional data queried through the QUERY service. The Linked Water Dataspace emphasizes six main services: two of them are core and the focus of this document, while the others are support services and the focus of the future Deliverable D3.2. Services align with the requirements in Section 2, as shown in Table 3.
Table 3: Linked Water Dataspace Support Services and Requirements

Service | Type | Enabling Technologies
CATALOG | Core | CKAN
QUERY | Core | DRUID, SPARQL
EVENT | Support | COLLIDER, SPARK Streaming
MONITORING | Support | Event Processing Engine
HUMAN | Support | IMIRT Task Manager
SEARCH | Support | Entity and keyword search engines
Each service addresses a subset of the dataspace requirements of Section 2: Standardisation, Consuming Open Data, Publishing Linked Data, Data Linking, Real-time data / events, Real-time Analytics, Data integration, Heterogeneity of Sensor Data Events, and Enrichment of Sensor Data Events with Open Data.


The CATALOG and QUERY services are core to the Linked Water Dataspace, as is usually emphasized in the dataspaces literature [59], [60]. We developed the concepts and implementation of the Linked Water Dataspace based on these two services, as described in this document. Further development of the remaining services is the subject of the future Deliverable D3.2, as discussed in Section 5.5.

5.2 The Dataspace CATALOG Service

The catalog is the central registry of the dataspace. Within the catalog, all datasets and data sources are declared along with several items of metadata about them. This includes (i) the list of entities managed by the system, such as sensors or locations, and (ii) open data sources that are relevant to water management, such as weather observation stations or forecast services. More specifically, and as illustrated in Figure 37, the Waternomics catalog service gives access to data sets from various sources:
• Historical sensor data from the NEB (Galway pilot) BMS system, originally stored in an SQL database
• Real-time VTEC sensors installed in the NEB
• Aggregated data from OpenCube
• Open Data such as the weather data
• Real-time data from other pilot sites that is sent to the dataspace through a RESTful API1 to the Kafka middleware
The Linked Water Dataspace is built upon the CKAN dataset portal. In essence, CKAN is made for creating a data portal that is open and available online. In the Linked Water Dataspace, CKAN has been extended with the ability to manage water entities such as sensors, locations and users. Registering at the catalog is the first sign of a dataset joining the dataspace, as described in Section 5.4. Figure 38 shows a screen capture of the Linked Water Dataspace. A search request with the term "Galway" reveals 4 data sets. We describe here only the first 3 data sets as they appear in Figure 38. The first one concerns the NEB (Galway pilot) water sensors. The data format for this dataset is SQL; indeed, it describes the access to the NEB SQL database for collecting water sensor data. The second data set relates to the Galway city current weather observations taken from openweathermap.org. Access to the data, as described in the portal, is done through HTTP, and the returned data is in JSON format. The third data set is the weather forecast for Galway from met.no. It describes and provides the API link to the weather forecast data to be collected by the relevant open data services.
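Since CKAN exposes its catalog through an action API, the search above can also be reproduced programmatically. The following is a minimal Python sketch using CKAN's standard package_search action; the portal URL is an illustrative assumption.

import requests  # assumes the requests library is installed

# Query the dataspace catalog with CKAN's action API; the host name
# below is an assumption for illustration.
resp = requests.get(
    "http://linkeddataspace.waternomics.eu/api/3/action/package_search",
    params={"q": "Galway"})

# CKAN wraps results as {"success": ..., "result": {"count": ..., "results": [...]}}.
for dataset in resp.json()["result"]["results"]:
    print(dataset["name"], "-", dataset.get("notes", ""))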

1 Example can be seen here: http://linkeddataspace.waternomics.eu:8003/message?topic=vtec.eindhoven.json


Figure 37: Linked Water Dataspace - The CATALOG Service View

Figure 38: Waternomics Dataspace - Data sets related to Galway

Details about these data sets are also available. For example, Figure 39 shows the description of the NEB historical logs data set. This description reveals the URL to access this data, with additional information such as the date the data entered the dataspace, its status, etc. An important feature of the catalog within the Waternomics Linked Dataspace is the ability to manage entities. Managing information about the critical entities in an information system is one of the primary requirements, since all other components rely on this information for their functionality. Understandably, the real-time Linked Water Dataspace


(LWD) is no exception to this requirement. The entity management process is concerned with the maintenance of information about entities critical to water data management and analysis. The expected outcome of this process is a database that serves as the canonical source of metadata for sensing data. In the case of the Linked Water Dataspace, the primary set of entities includes the sensors and their physical locations.

Figure 39: NEB Historical Logs in the Waternomics dataspace

Besides these entities, the dataspace applications might also require information about people, groups, buildings, and outlets. In short, all information that can help in understanding water consumption, through association with real-world objects, is included in the entity management process. However, the level of quality control might be differentiated depending on the criticality of an entity for dataspace applications. This highlights the need for an appropriate entity management process, both in terms of software tools and operational guidelines. In the rest of this section, we start by listing the entities that are required to be managed in the LWD. Then, we summarize the expected sources of entity information in the LWD. The entity management process workflow is partially addressed in Section 5.4 on joining the dataspace, but will be further investigated through crowdsourced entity management, represented by the HUMAN dataspace service, and discussed in the future Deliverable D3.2.

5.2.1 Managed Entities

In the Linked Water Dataspace, the most important entities concern the sensors and their locations. Additionally, information about water outlets can also be useful for applications. In the following, we describe the entities and the minimal set of attributes required for them in the LWD.
• Sensors: The sensors that measure the flow of water generate streams of data at different intervals. This data is used to calculate the water consumption levels of the area covered by a sensor. Generally, different types and forms of sensors are installed for metering water consumption. Therefore, it is necessary to describe exactly each sensor, its capabilities, and its coverage. The sensor description may include identifiers and labels. The sensor capabilities can include information like the units of measurement and the rate of measurement. In Table 4 we describe some example attributes of a sensor entity.

© Waternomics consortium Page 47 of 84 619660

The attributes are stated in RDF using the Dublin Core Metadata Initiative Terms (dct) vocabulary.
Table 4: Required attributes for the entity Sensor
RDF property | Example Value
dct:type | water:Sensor
dct:identifier | 12
dct:title | Sensor 3
dct:description | Sensor 3 on Second Floor
dct:coverage | F2

• Outlets: Besides sensors, information about the actual physical water outlets is also important for analysis and decision making. It is possible that a single sensor is installed for a set of outlets; in such cases, a cumulative assessment of water consumption is needed. Table 5 shows the minimum set of attributes required for each entity that represents an outlet.
Table 5: Required attributes for the entity Outlet
RDF property | Example Value
dct:type | water:Outlet
dct:identifier | CWTF2
dct:title | CW Tap - Floor 2
dct:description | Cold water tap on Floor 2

• Sites: Information about sensors and outlets is not useful without information about their associated spatial locations. For instance, it is difficult to perform meaningful analysis when only the water consumption of a pipe or a tap is known; the water flow of a pipe along with the locations serviced by the pipe constitutes more actionable information. Therefore, we consider sites as the third critical set of entities in the LWD. For simplicity, we assume that each sensor covers one or more sites and each outlet is installed at a single site. Table 6 lists the minimum set of attributes required for each site entity (a serialization sketch follows the table).
Table 6: Required attributes for the entity Site
RDF property | Example Value
dct:type | water:Site
dct:identifier | F2
dct:title | Floor 2
dct:description | The Second Floor in NEB
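As an illustration, the following Python sketch serializes the sensor entity of Table 4 as JSON-LD with Dublin Core terms. The water: namespace URI and the entity URIs are assumptions; only the dct: properties are taken from the tables above.

import json

sensor = {
    "@context": {
        "dct": "http://purl.org/dc/terms/",
        "water": "http://waternomics.eu/ns/water#"  # hypothetical namespace
    },
    "@id": "water:sensor/12",
    "dct:type": {"@id": "water:Sensor"},
    "dct:identifier": "12",
    "dct:title": "Sensor 3",
    "dct:description": "Sensor 3 on Second Floor",
    "dct:coverage": {"@id": "water:site/F2"}  # links the sensor to its site
}
print(json.dumps(sensor, indent=2))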

5.2.2 Information Sources

The information about entities is generally spread across various systems in an organization. The LWD is no exception to this observation, but the majority of data about primary entities can be found with the facilities management department or a similar unit. Building management systems (BMS) are a prime example of such an information source. Modern BMS include a variety of sensors and actuators to control the behaviour of a building depending on usage and environment. If available, the BMS can be considered the main source of information about

sensors, outlets, and sites. However, this assumption might not be correct in various cases. It is not common to have a BMS installed in large and old buildings. Similarly, airports and cities may also lack the infrastructure required to install a BMS. In such cases, it is common to have individual databases for sensors, outlets, and sites. Each of those databases might have different data formats and management processes. It is essential to have such information integrated and stored in a common format with the same semantics. The Linked Water Dataspace consists of information related to water consumption as well as information that supports the decision making process. In this regard, information from other sources is brought within the dataspace. Such sources generally exist outside of the organization and require source-specific data collection processes. For instance, it is now common for weather websites and meteorological services to provide interfaces for exporting their data. The interface can take the form of a RESTful API or comma-separated text files. Essentially, these sources provide information about entities which might not be critical to the dataspace but provide useful contextual information. For example, an open day at the university might explain higher than usual water consumption; a BMS does not have such event information in its database. In short, applications might require entity data from external sources as well. So far we have discussed the entity information that is stored in databases. However, the information contained in databases might become inaccurate with the passage of time. Most importantly, the databases might not contain complete information about entities. To counteract such issues, it has been proposed to include the users of the system in the entity management process. However, this is not a trivial task and needs careful consideration. Nonetheless, the inclusion of the user community in the information curation process not only helps in maintaining accurate data but might also help in building user trust in, and ownership of, the system. The community of users can help describe additional properties of entities, such as the working state of water outlets. An appropriate process is required to source data from a community of users, which is the focus of the HUMAN dataspace service discussed in the future Deliverable D3.2.

5.3 The Dataspace QUERY Service and the Realtime Aspect

The architecture of the Linked Water Dataspace shown in Figure 36 is designed to meet the requirements for the integration of multiple sources for water management, with various levels of data updates, coming from legacy data or from sensors. The QUERY service as defined here covers the various aspects and processes of the dataspace that lead to the data being queryable in a structured way by the applications. The QUERY service also addresses low latency and fault-tolerant data analysis. The proposed architecture to realize the QUERY service follows the Lambda architecture recommendations [45]. The Lambda architecture [45] is a concept which is gaining acceptance among Big Data researchers and practitioners. Herein, the Lambda architecture realises the need for integrating water data within a data warehouse that is processed by batch processes, with views that are pre-computed for fast access by applications. The Linked Data principles serve as a mediator to improve the integration within the batch layer for relatively less fresh data. Besides, the architecture realises the need for a real-time view of the water data. Such a view can be crucial to support decisions relevant to the fast detection of situations of interest, such as leakage, and to react accordingly. That is achieved by the speed layer, which effectively works over streams of linked water data. Streams are not actually stored but rather processed in flow to guarantee a low-latency view of the data that complements the older views achieved by the batch layer.


Figure 40: Linked Water Dataspace Lambda Serving Layer - The QUERY Service View

To provide applications with a single interface for data access, a serving layer is provided. This forms the query interface for the applications and is the visible facet of the dataspace QUERY service, as shown in Figure 40. This layer splits queries to the batch and speed layers in order to combine pre-computed results from the batch layer with near-time, fresh, and possibly approximate results from the speed layer. The query results are combined and transparently returned to the user. The following components are the defining keys of the Water Lambda Architecture, as will be discussed further in Section 6:
• Water Datasets: these include the building management systems, legacy data such as water consumption logs or bills, as well as sensors installed on the pipes, for instance.
• Adapters: these serve as the first step to normalise the water data into the Linked Data realm. They respect the semantic model and convert data items, such as sensor events, into their corresponding canonical Linked Data form. Details about the first version of the vocabulary used in this project are discussed in Appendix A: Minimal Vocabulary.
• Management layer: it is responsible for providing the functionalities and services essential for managing metadata, such as ontologies, as well as entity data such as people, places, etc. This layer is handled within the CATALOG service, as discussed in Section 5.2.
• Batch layer: it provides batch-based processing of the Linked Dataspace for accurate, but somewhat delayed, data views, such as aggregation-based water data views, and basic analytics. This layer is exemplified in the processing of water consumption data from the building management system, as discussed in Section 6.1.


• Speed layer: it provides real-time processing of water data with low-latency processing such as approximate event matching, data enrichment and complex event processing. Details about real-time sensor data processing are provided in Section 6.2.
• Serving layer: it provides a transparent query of data from the batch and speed layers, along with mid-level services such as activity detection and prediction services. This serving layer is realized via Druid and described at a high level in Section 6, while it will be detailed as part of the future deliverables D3.2 and D3.3.
• Applications: applications interact with the architecture via the serving layer and gain access to multiple water data views, so their processing can take place starting from there, whether simply presenting data in user interfaces, or providing further processing capabilities and control mechanisms. Applications will also be covered by the future deliverable D3.3.
An important aspect of the QUERY dataspace service is making data available in statistical form through Linked Open Data principles. This is similar to making a data warehouse query endpoint available to applications. This is done via the OpenCube platform, which is based on the Open Data principles. The Open Knowledge Foundation1 defines Open Data as: "data that can be freely used, reused and redistributed by anyone - subject only, at most, to the requirement to attribute and sharealike". It means that the data should be available free of charge to everyone, for personal as well as commercial usage, without any restrictions. This idea is not new and is similar to other "open" movements such as Open Source or Open Hardware. The growth of the World Wide Web and easier access to it allowed the Open Data movement to gain momentum. Moreover, some governments are willing to make their data available to the public in open formats. This ensures government transparency and increases trust. Furthermore, it has an impact on the development of services and applications addressing public demands. The best-known examples of government open initiatives are data.gov and data.gov.uk. The Waternomics platform should be able to make use of relevant Open Data assets for proper analytics. Possible scenarios for consuming Open Data include the prediction of water consumption using open weather data. Consuming data requires a proper selection and evaluation of data sources in order to select the most suitable one for decision support. The more "linked" datasets are available, the more accurate the predictions and analysis can be. After analysing, in Section 4.2, the existing and well-established Open Source projects that support operations on Linked Data, in particular Linked Open Statistical Data, we note that the described platforms may be used as an element of the Waternomics Batch Layer, which provides batch-based processing of the Linked Dataspace for accurate data views, such as aggregation-based water data views, and basic analytics, with some delay. The RDF store should support fast batch ingest. The number of entities is expected to be large, so the Batch Layer should be fault free and relatively simple. We propose to use the OpenCube toolkit, as it offers a complete set of tools that can be reused and adapted for the Waternomics project.

1 http://okfn.org/


5.3.1 Data Cube Vocabulary

Statistical data is modelled as data cubes, which describe multi-dimensional data. Since 2014, the W3C Data Cube Vocabulary1 is a W3C standard for modelling multi-dimensional data and is the most popular and stable vocabulary for this purpose (see the vocabulary outline in Figure 41). Data cubes are characterised by dimensions and measures; as shown in Figure 41, the Data Cube Vocabulary also allows additional attributes. The cube dimensions define what each observation is about, while the measures define what is being measured.

Figure 41: Outline of the Data Cube vocabulary2
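To make the dimension/measure distinction concrete, the Python sketch below builds a single water-consumption observation in the Data Cube vocabulary as JSON-LD; the dataset URI and the water: dimension and measure properties are assumptions for illustration.

import json

observation = {
    "@context": {
        "qb": "http://purl.org/linked-data/cube#",
        "water": "http://waternomics.eu/ns/water#"  # hypothetical namespace
    },
    "@id": "water:obs/neb-f2-2015-02-24",
    "@type": "qb:Observation",
    "qb:dataSet": {"@id": "water:dataset/neb-consumption"},
    # Dimensions: what the observation is about.
    "water:refPeriod": "2015-02-24",
    "water:site": {"@id": "water:site/F2"},
    # Measure: what is being measured.
    "water:consumption": 5.99
}
print(json.dumps(observation, indent=2))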

5.3.2 Data quality and Data Linking

Data linking is the fundamental idea of LOD, as described in Section 4. The goal of the Waternomics project is to allow the linking of sensor data, data management systems and water meters using the benefits and potential of Linked Data. Water data published to the dataspace becomes more valuable when it is linked to other data sets. This linking is very useful for ensuring optimal data management and integration: it enhances re-use and allows discovering new knowledge from water data. It is important to determine which data sets are relevant to be linked with water data. The developed linked dataspace will be able to capture and store data from various sources. To ensure high data quality, the following functionalities need to be supported:

1 http://www.w3.org/TR/vocab-data-cube/ 2 http://www.w3.org/TR/vocab-data-cube/


• Data refining: Data refining is a process of data cleaning and normalisation to reduce information duplication. It is a very important process, as distortions in the input data affect the results of the analysis. Data refining removes data variability and redundancy; as a result, it develops an integrated data resource.
• Data enrichment: This functionality aims at enriching unstructured content with additional content from other data sources. In the Waternomics implementation, we may link the sensor data with, e.g., geographical, fact and entity datasets (places, organisations, things).
• Data interlinking: This is the process of creating links with relevant web entities. This activity reveals relations with the various topics published on the Semantic Web.

5.3.3 Data exploration

Data exploration is a feature of the platform that allows navigation through the metadata of the collection of datasets in the knowledge base. The most important features include:
• Data Catalog – this includes a user interface that allows browsing through the collection of datasets. It should include browsing by categories and all types of sorting and filtering.
• Searching – this includes full-text search over the data as well as the metadata. It may also include vocabulary search.
• Graph-based navigation – a way to navigate through related datasets and triples. It may be implemented, e.g., by widgets displaying related entities.
• Data Discovery – a feature that allows finding the relationships and links between entities in related RDF data.
Data exploration is a very important part of the system, as it allows finding new insights and relations in the data.

5.3.4 OpenCube for Water Data

At the moment, the OpenCube Toolkit seems to be the best candidate for the Waternomics batch layer needs. The platform will be used for exposing the sensor and water data as cubes of Open Data. Currently, the tool supports both the publishing and the reusing phases of the Linked Data lifecycle.

5.3.5 Data Publishing

Data publishing is implemented via the Data Provider1 concept. The platform gathers the data according to the provider configuration and stores it in the data store. The import / conversion process can be scheduled. The platform allows data importing from tabular data (such as CSV/TSV files), relational databases and XML files. RDF data can be uploaded directly to the system or imported from a selected URL or SPARQL endpoint. For the Waternomics needs, the TARQL component seems the most useful. It is integrated with the OpenCube Toolkit as a data provider (see Figure 42). It enables data cube construction

1 http://sdk.fluidops.net/resource/Help:ProviderSDK

from CSV/TSV files. During the OpenCube project, the TARQL Open-Source technology was redesigned (API) and a streaming evaluation mode was included.

Figure 42: TARQL Data Provider configuration screen

5.3.6 Data Reuse

The Waternomics system should support integrated analytics of both real-time and historical data for effective decision support. For that purpose, the OpenCube Toolkit provides various widgets:
• R statistical analysis module: Based on the input data, parameters and an R script, it generates a chart in R. The result can be displayed as an image on a wiki page. The widget provides full support of the R language.
• OpenCube Browser: A table-based visualization of RDF data cubes that supports the exploration of an RDF data cube by displaying a two-dimensional slice of the cube as a table. It can use the data stored in the platform as well as data from remote SPARQL end-points. The user defines the two dimensions that are included in the browser table.
• OpenCube Aggregation component: The role of this component is to precompute aggregations of the data cubes in order to enable OLAP operations. As a result, it creates a set of data cubes out of an existing cube by summarizing observations across one or more dimensions. The results can be used by the other components, e.g. the OpenCube Browser.
• Interactive chart visualization widgets: This component allows visualization of RDF data cube slices using chart-based functionality, as shown in Figure 43.

Figure 43: OpenCube Chart Visualization widget


5.4 Joining the Dataspace and Adapters

The Linked Water Dataspace is composed of multiple data sources. The types of data sources can include, but are not limited to, sensors, databases, text files, and Excel sheets. To enable a data source to be part of the Linked Water Dataspace, it must be discoverable and should conform to the 5-star scheme of Linked Data. The critical entities in the data source should be exposed as Linked Data. We propose a seven-step approach for managing the entities of a data source according to the principles of Linked Data. The approach allows the conversion and publishing of managed entities in standard Web formats for Linked Data such as RDF and JSON-LD. It further facilitates the open availability of entity data in support of various applications. Figure 44 provides an overview of the joining process for data sources.

Figure 44: RETLEMM process for exposing entities as Linked data

• Register: A new data source joining the dataspace is required to be registered in the dataspace catalog. Registration means that the catalog contains an entry describing the data source, at minimum in terms of its type and its format. Further information about the data source can include the physical address of files or a query endpoint. After this information is provided, a dataset or data source is considered part of the dataspace, since it can be accessed and used by applications.
• Extract: The second step of the management process is to extract the entity information from the data source. For the sake of simplicity, it is assumed that such information will be extracted in the form of CSV1 files. Besides the data extraction from databases, this step can also include data collection from a community of users. Essentially, the purpose of this step is to source relevant information about entities from either digital or human sources.
• Transform: Given the CSV representation of entities, the next step is to convert the data into an appropriate format for publishing. In this regard, we propose to adopt a simple semi-automated process for transforming the CSV files into RDF files using appropriate tools such as Microsoft Excel and Google Refine (see the sketch after this list). This process is supported by a schema mapping document that defines the correspondence between the attributes in the sources and the RDF properties in the target Linked Data representation of entities. It is recommended to map each source attribute to a property in the WATER or DCMI vocabularies that are defined later in this report.

1 Comma Separated Values


• Load: Once the data has been converted and represented in RDF format, the next step is to store it in an appropriate data store. For this purpose, any general-purpose triple store may serve. However, it is necessary for the RDF store to have the publishing, querying, and search functionality needed to support applications. Additionally, the RDF store should support batch ingest. Since the number of entities can be large, the batch ingest should be fault free and simple. We recommend following the batch layer approach of the Lambda1 Architecture for this purpose.
• Enrich: Generally, the above steps are sufficient to support analytical and decision support applications. Nevertheless, it is desirable to enhance the metadata with additional information, such as links to external datasets and additional attributes extracted from non-critical data sources. Therefore, the purpose of this optional step is to add additional RDF triples to the store so that the overall vision of the Linked Dataspace is supported. Essentially, this step should be driven by the needs of individual applications to support their functionality.
• Map: Similar to the enrich step, the entities and schema of a data source may be mapped to other data entities and data sources in the catalog. This facilitates integration and de-duplication of classes and entities. Additionally, it allows automated processing of data collected from multiple datasets using advanced reasoning and schema-agnostic query tools.
• Monitor: As discussed earlier, the entity metadata is considered to be immutable and append-only. However, it is not unusual for organizations to change or update the definitions and attributes of their core entities. We propose to handle this challenge through an appropriate monitoring process. At this point in time, we keep the monitoring process to simple scheduled checks for changes in attribute types and values. We expect this to change as the project matures and more insights are gained into the data management process.
At the end of the process, application developers should be able to find and understand the data source. Generally, when a data source joins, the above process can be performed manually by the data source owners with the help of the dataspace administrators. However, the ETL steps can be automated to speed up the process. In the following, we discuss two alternatives for automation:
• Adapters: Adapters can be considered non-materialized views of a data source. They encode the ETL process in the form of mapping queries between the source data format and the target data format. Essentially, the data resides in the source and the ETL is performed on the fly every time a request is posted for entities.
• Scheduled Jobs: The ETL process is performed at pre-determined intervals to keep the entities in the source and the triple store synced. This also allows reflecting any changes in the data source in terms of addition, deletion, or update of entity metadata.
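The following sketch illustrates the Transform step with the rdflib Python library: a CSV file of sensors is turned into RDF triples using the Dublin Core terms of Table 4. The CSV column names and the WATER namespace URI are assumptions standing in for the schema mapping document.

import csv
from rdflib import Graph, Literal, Namespace
from rdflib.namespace import DCTERMS, RDF

# Hypothetical namespace; the WATER vocabulary itself is defined in Appendix A.
WATER = Namespace("http://waternomics.eu/ns/water#")
g = Graph()

# Assumed CSV columns: id,title,description,coverage
with open("sensors.csv") as f:
    for row in csv.DictReader(f):
        s = WATER["sensor/" + row["id"]]
        g.add((s, RDF.type, WATER.Sensor))
        g.add((s, DCTERMS.identifier, Literal(row["id"])))
        g.add((s, DCTERMS.title, Literal(row["title"])))
        g.add((s, DCTERMS.description, Literal(row["description"])))
        g.add((s, DCTERMS.coverage, WATER["site/" + row["coverage"]]))

# Serialize the triples for the Load step.
g.serialize("sensors.rdf", format="xml")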

5.5 Conclusion and Plan

In this section, we have provided an overview of the Linked Water Dataspace. The proposed architecture is a first glance at how the dataspace will be realised. We propose to follow the Lambda architecture and extend it to facilitate the Linked Data approach. The dataspace is used for ingesting, managing, and publishing water data, including both sensor data and entity data.

1 http://lambda-architecture.net/


First, a simple entity management process is proposed that leverages non-experts. Second, a complex event processing approach is proposed for real-time sensor data. This approach enables semantic processing with in-flow data enrichment and approximate matching. Third, we propose to employ the OpenCube toolkit to publish aggregated sensor data as open data. In short, this section provided a glimpse into the implementation of the Linked Water Dataspace with respect to the requirements introduced in Section 2, as shown in Table 7.

Table 7: Summary of Requirements and Approaches
Requirement | Approach
Standardisation | Standard-compliant ontologies such as SSN will be used for describing sensors and their data.
Consuming Open Data | One of the support services included in the dataspace crawls weather data and integrates it in prediction services.
Publishing Linked Data | The data produced by every support service will be published in RDF using Linked Data principles.
Data Linking | Relevant open data sources will be integrated in the dataspace through explicit links between entities.
Real-time data / events | The dataspace is designed with respect to the Lambda architecture, which covers real-time event processing. We propose a complex event processing engine for managing live sensor events.
Real-time Analytics | Real-time analytics is considered in the platform as part of the speed layer of the Lambda architecture.
Data integration | The Lambda architecture was in essence proposed to deal with both historical and real-time data; this data integration is ensured by relevant support services.
Heterogeneity of Sensor Data Events | The platform is designed to handle a variety of sensor types. Consequently, for each sensor type a dedicated collector is designed that collects data and converts it into RDF for further processing.
Enrichment of Sensor Data Events with Open Data | Sensor readings are further enriched by dedicated support services using open data.

While the basis and infrastructure of the Linked Water Dataspace have been developed, as discussed in this document, this work is in progress, with plans to complement the Linked Water Dataspace in D3.1.2 in five main directions:
• Core Dataspace Services:
a. The CATALOG service: we plan to investigate the suitability of the CKAN framework for playing the role of a catalog for the Linked Water Dataspace and whether an extension shall be developed. One aspect is whether it can provide a granular catalog for entities, not just for datasets. Another aspect relates to the ability of the catalog to host master data, stick to a replica-based paradigm, or host links to actual data and support a redirection of queries and requests to the datasets. The last aspect is the investigation of the role of the catalog as a guide to real-time


datasets, which become volatile after some time, and what would be the best way to catalog such data sources.
b. The QUERY service: applications may query dimensional data from Druid or from the OpenCube as RDF. We plan to investigate other types of data which are not purely dimensional but rather consist of linkage between entities, real-time data, and legacy data at lower levels of granularity. Other investigations include the suitability of the Druid deep storage for Linked Water Data, with possibilities to adopt other deep storage mechanisms.
• Dataspace Support Services:
a. The EVENT service: the event processing service is planned to be added to the dataspace to meet the requirements of Heterogeneity of Sensor Data Events and Enrichment of Sensor Data Events with Open Data, based on models such as approximation and dynamic enrichment [62]–[65].
b. The MONITORING service: the updates of the dataspace and the propagation of such updates shall be investigated within the support services realm. The suitability of the publish/subscribe middleware will be investigated, with focus on integration with the CATALOG service.
c. The HUMAN service: inclusion of the community of users and possible crowds in the entity management process is one possible approach to reducing the costs and time of entity management. This approach requires outsourcing specific entity management operations to non-technical people. In the case of crowds, these operations, such as providing the location of a sensor, will be performed by people who do not have database management expertise but do have the basic skills to perform repeated tasks.
d. The SEARCH service: to ease the accessibility of data in the dataspace, users and developers may perform keyword-based search over the datasets. We plan to investigate this requirement within the Linked Water Dataspace and develop a suitable and effective method to shorten the time needed by users and developers to reach the data relevant to their needs.
• Ontology and Data Models: the ontology developed in Appendix A is on-going work which requires refinement. That is planned through versioning and as a result of validation that can become apparent throughout the integration process between the various Linked Water Dataspace components and the integration with other pilots.
• Pay-as-you-go Integration of Datasets: we plan to enhance the involvement of datasets and sensors in the dataspace by providing new interfaces (adapters), providing links to other datasets, conforming to a common data model, and mapping between entities or schemas.
• Evaluation and Validation: evaluation and validation is an on-going process which targets testing various qualitative and quantitative aspects of the dataspace. We plan to test the suitability of the data model and the catalog to accommodate various types of sensors and datasets in the water space. Also, the effectiveness and efficiency of the various dataspace services, including query time, event processing latency, precision, recall, throughput, and other factors of the support services, will be investigated.


6 Concrete Cases

The architecture has been implemented mainly through an Apache stack of technologies, as shown in Figure 45.

Figure 45: Linked Water Dataspace Lambda Architecture Realization

Figure 45 shows that entities from the BMS are exported into the CKAN dataspace catalog, as well as entities from other sources such as newly installed sensors. Batch data is then fed into map/reduce jobs on the Spark SQL node, while real-time data from sensors is fed into the Kafka message bus, widely used for Big Data distribution, and then into map/reduce jobs on the Spark Streaming node. The result from the batch nodes is fed into a Druid indexer node as dimensional data, while the streaming data goes into Kafka and then into a Druid real-time node. The Druid nodes can use a Cassandra deep storage data store. Both batch data and real-time data are exposed transparently via a Druid broker node, which can be queried by applications in JSON format. In the following two sections, we present two concrete cases that have been implemented within the Linked Water Dataspace in NUIG: the building management system, and a water sensor joining the dataspace.

6.1 The Building Management System Joins the Dataspace

The New Engineering Building (NEB) at NUIG has a commercial-grade Building Management System (BMS) that collects and stores data from existing water sensors. The NEB BMS contains information about events and entities. Entities include sensors, rooms, water outlets, etc. Events include metering data generated by sensors, which record information such as water flow. The NEB BMS joins the Linked Water Dataspace by following the process described in Section 5.4, which makes its entities visible in the dataspace. The events data collected over many months is included in the dataspace through batch processing, as shown in Figure 46.


Figure 46: An example realization of the Lambda Batch Layer in the Linked Water Dataspace

The batch processing aspect of the dataspace is realized via the following elements:
• A set of SQL scripts used to extract the historical events data from the database of the NEB BMS.
• A set of Spark Core map/reduce jobs used to transform and aggregate low-level events to a suitable granularity (a sketch follows this list).
• A Druid indexing node used to move the dimensional aggregated historical data into the Druid deep storage and make it available for further querying.
• Replication of the aggregated dimensional data in OpenCube to expose it as Linked Data.
The above process is performed once, to ingest the large number of events collected over many months. Afterwards, a set of scheduled Spark jobs will be deployed to ingest daily events from the NEB BMS.
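The following is a minimal sketch of such a batch aggregation job, assuming the SQL scripts export historical events as CSV lines of "sensor_id,timestamp_ms,consumption_m3"; the file paths and CSV layout are assumptions for illustration only.

from operator import add
from pyspark import SparkContext

sc = SparkContext(appName="neb-bms-batch-aggregation")

def to_minute_record(line):
    sensor, ts_ms, consumption = line.split(",")
    minute = int(ts_ms) // 60000  # truncate the timestamp to the minute
    return ((sensor, minute), float(consumption))

events = sc.textFile("hdfs:///waternomics/neb-bms/events.csv")
per_minute = events.map(to_minute_record).reduceByKey(add)

# Dimensional per-minute records in a shape suitable for Druid batch indexing.
per_minute.map(lambda kv: "%s,%d,%f" % (kv[0][0], kv[0][1] * 60000, kv[1])) \
          .saveAsTextFile("hdfs:///waternomics/neb-bms/per-minute")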

6.2 A Water Sensor Joins the Dataspace

The real-time aspect of the dataspace is realized via the following elements:
• A RESTful API adapter for the sensor.
• The Kafka middleware, which forms the backbone of event distribution and provides reliability guarantees for production/consumption staging.
• The Spark Streaming map/reduce jobs, which perform the first stage of aggregation and processing.


• The Druid real-time node, which moves the dimensional aggregated stream data from the Kafka bus into the Druid deep storage and makes it available for further querying.

Figure 47: Linked Water Dataspace Lambda Speed Layer

The new sensor installed by VTEC in the NUIG engineering building is capable of producing JSON data that describes the readings associated with the sensor at a specified timestamp. The following listing shows example JSON data from the sensor.

Listing 1: Sensor JSON Data

{ "href": "#", "SensorID": "IRE01", "description": "This is NUI Galway Engineering Building", "date": "15-02-24", "time": "11:16:07", "data": [ { "type": "fVel", "metric": "m/s", "value": "1409.03" }, { "type": "today", "metric": "m3", "value": "5.99064" }, { "type": "net", "metric": "m/s", "value": "257" }, { "type": "flow", "metric": "m3/h", "value": "0.775783" }, {

© Waternomics consortium Page 61 of 84 619660

"type": "vel", "metric": "m/s", "value": "0.17148" } ] }

The JSON data is sent to the Linked Water Dataspace via a RESTful API as an HTTP POST to http://linkeddataspace.waternomics.eu/sendsensordata. The RESTful API, implemented with the dropwizard-kafka-http client, then forwards the JSON messages into a Kafka topic. Other processes can consume from the Kafka topic, including an adapter process that performs a JSON to JSON-LD RDFization of the data according to the ontology. Another important process, responsible for aggregation, is implemented with Spark Streaming. Spark Streaming jobs are map/reduce jobs following a distributed processing model. For example, because an event does not contain the absolute water consumption between two time points, a Spark Streaming job is responsible for producing a stream of events with such absolute values by processing the raw "today" value in the JSON data and calculating the successive differences. Another Spark Streaming job aggregates the data by minute, as several readings can arrive within a minute. The result is a dimensional event of the form shown in the following listing:

Listing 2: Sensor Dimensional JSON Data

{
  "year": "2015",
  "month": "2",
  "day": "24",
  "hour": "11",
  "minute": "16",
  "sensor": "IRE01",
  "consumption": "0.00053",
  "metric": "m3"
}
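The following is a minimal sketch of such a differencing job, under assumed ZooKeeper host and topic names: it consumes sensor JSON of the shape of Listing 1 and keeps the previous cumulative "today" reading per sensor in order to emit successive consumption differences, from which Listing 2-shaped events can be derived. It is an illustrative reimplementation of the logic described above, not the project's exact job.

import json
from pyspark import SparkContext
from pyspark.streaming import StreamingContext
from pyspark.streaming.kafka import KafkaUtils

sc = SparkContext(appName="sensor-consumption-diff")
ssc = StreamingContext(sc, 10)  # 10-second micro-batches
ssc.checkpoint("hdfs:///waternomics/checkpoints")  # required for stateful ops

raw = KafkaUtils.createStream(ssc, "zookeeper.waternomics.eu:2181",
                              "diff-job", {"sensor-events": 1})

def today_value(msg):
    # Extract (sensor id, cumulative m3 reading) from a Listing 1-style event.
    event = json.loads(msg)
    today = next(d for d in event["data"] if d["type"] == "today")
    return (event["SensorID"], float(today["value"]))

def diff(new_values, state):
    # state = (last cumulative reading, consumption since previous batch)
    last = state[0] if state else None
    if not new_values:
        return (last, 0.0)
    consumed = new_values[-1] - last if last is not None else 0.0
    return (new_values[-1], consumed)

consumption = raw.map(lambda kv: today_value(kv[1])).updateStateByKey(diff)
consumption.map(lambda kv: {"sensor": kv[0], "consumption": kv[1][1],
                            "metric": "m3"}).pprint()

ssc.start()
ssc.awaitTermination()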

The new dimensional aggregated JSON data is published to a new Kafka topic. A Druid real-time node is configured to persist this data into the deep storage and make it queryable in the same way as the batch data. Applications can thus query the batch and speed layers via the serving layer and use the results either to present the data visually or to conduct various types of analytics over the data.
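The following is a minimal sketch of how an application might query the serving layer, assuming a Druid broker at the hypothetical host below and a datasource named "water-consumption"; Druid brokers accept JSON queries over HTTP POST and return one result entry per time bucket, spanning both batch and speed layers.

import json
import requests

query = {
    "queryType": "timeseries",
    "dataSource": "water-consumption",       # assumed datasource name
    "granularity": "minute",
    "intervals": ["2015-02-24T00:00:00/2015-02-25T00:00:00"],
    "aggregations": [
        {"type": "doubleSum", "name": "consumption", "fieldName": "consumption"}
    ]
}

resp = requests.post("http://druid-broker.waternomics.eu:8082/druid/v2/",
                     data=json.dumps(query),
                     headers={"Content-Type": "application/json"})
for row in resp.json():  # per-minute consumption across batch and speed layers
    print(row["timestamp"], row["result"]["consumption"])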


7 Summary

Water management is a challenging issue given the growing demand for water resources due to increasing urbanisation, climate change, economic growth, etc. In this context, the Waternomics project aims to develop and introduce ICT as an enabling technology for managing water as a resource and for increasing end-user conservation awareness. A major step towards achieving the Waternomics objectives is collecting water usage data from sensors, together with other relevant open data sources, to enable effective analytics that drive decision making: e.g., planning, adjustments, and predictions. The data collected from these various sources needs to be standardised, enriched, interlinked, and shared among services and applications. We call this process Linked Water Dataspace management.

This deliverable reports on the initial design efforts towards this Linked Water Dataspace with respect to the requirements defined in Section 2. After analysing relevant technological contributions that can be used in this context (see Section 4), we propose to design a dataspace following the principles of the lambda architecture, which facilitates the management of both historical and real-time data. The result of our research is reported in Section 5.

We propose in this project to reuse existing open source tools that have proven effective in practice. Indeed, the availability of high-quality open source tools that support the process of publishing data as Linked Data reduces the cost of implementation. Making the water data available as Linked Open Data creates new opportunities: data may be combined across multiple datasets and manipulated more efficiently.

The primary contribution of this report is a proposed architecture for implementing the Linked Water Dataspace. The proposed architecture is aimed at managing water and other relevant open data and exposing it as Linked Data. While the basis and infrastructure of the Linked Water Dataspace have been developed, as discussed in this document, this work is in progress and will be complemented in D3.1.2.

The rest of this document consists of two appendices. Appendix A presents a minimal vocabulary for describing sensors and their readings. It is based on an analysis of the entity relationship model provided in Deliverable D1.3 and its mapping into ontological concepts. Appendix B presents a brief survey of open data for water management. It constitutes our initial efforts towards the investigation of open data sources that are relevant in the context of water management.


Appendix A: Minimal Vocabulary

This appendix reports on our efforts towards designing the vocabulary that we use for modelling sensors and their observations in the proposed dataspace.

1. Methodology

In order to develop the required minimal vocabulary for describing sensors and their readings, we proceeded in two phases of work. As shown in Figure 48, phase 1 consisted of analysing existing data from the NEB (Galway pilot) and VTEC sensors in order to identify a general entity relationship model for modelling sensors and their data. This work was carried out in WP1, and details about the entity relationship model are available in D1.3. This appendix is concerned with phase 2 of this process, which consists of (1) analysing the existing entity relationship model provided in Deliverable D1.3, (2) creating a direct mapping from the existing data model to RDF concepts (classes and properties in RDF), and finally (3) determining a mapping/matching to existing ontological concepts, resulting in a second version of the minimal vocabulary.

Figure 48 – Phases 1 and 2 of the design of the RDF minimal vocabulary for Waternomics

• Direct mapping: a direct mapping consists of identifying the set of classes and properties required to model the concepts of the entity relationship model taken from D1.3. We use the mapping shown in Table 8 to translate entity relationship model concepts into RDF. The output of this work is a first version of the minimal vocabulary. Details of this mapping follow in this appendix.
• Matching to existing ontologies: we reuse existing vocabularies that have proven effective and are compliant with most sensor modelling standards. We focus mainly on reusing concepts from the Semantic Sensor Network (SSN) ontology [17] for modelling sensors and their observations. SSN was built to be compliant with the W3C and Open Geospatial Consortium (OGC) [18] standard SensorML [19]. In addition to SSN, other ontologies were also considered, such as gr¹ and dul². The output of this work is a second version of the minimal vocabulary. Details of this matching follow in this appendix.

1 gr stands for the GoodRelations ontology, available at http://purl.org/goodrelations/v1#
2 dul stands for the DOLCE+DnS Ultralite ontology, available at http://www.loa-cnr.it/ontologies/DUL.owl#


Table 8: Mapping from entity relationship items to RDF concepts

Item | Graphical representation | RDF type
Entity | Rectangle | rdf:Class
Attribute | Oval | rdf:Property
Relationship without attributes | Diamond | rdf:Property
Relationship with attributes | Diamond | rdf:Class
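As an illustration of how such a direct mapping can be mechanised, the following is a minimal sketch using the rdflib library, translating one row of the Sensor table (cf. Tables 9 and 10 below) into RDF triples. The ssn and dul namespace URIs are taken from the standards and footnotes above; the water namespace URI is an assumption, since it is not fixed in this document.

from rdflib import Graph, Literal, Namespace, URIRef
from rdflib.namespace import RDF, RDFS, XSD

WATER = Namespace("http://waternomics.eu/ontology/water#")  # assumed URI
SSN = Namespace("http://purl.oclc.org/NET/ssnx/ssn#")
DUL = Namespace("http://www.loa-cnr.it/ontologies/DUL.owl#")
BASE = "http://waternomics.eu/data/"

g = Graph()
row = {"Sensor_ID": "1", "Sensor_Name": "Vtech Sensor",
       "Site_Name": "VTEC", "Description": "This is VTEC building"}

# Apply the mappings of Table 9 to one table row.
sensor = URIRef(BASE + "sensor/" + row["Sensor_ID"])
g.add((sensor, RDF.type, SSN.Sensor))
g.add((sensor, WATER.sensor_ID, Literal(row["Sensor_ID"], datatype=XSD.string)))
g.add((sensor, RDFS.label, Literal(row["Sensor_Name"], datatype=XSD.string)))
g.add((sensor, RDFS.comment, Literal(row["Description"], datatype=XSD.string)))
g.add((sensor, DUL.hasLocation, Literal(row["Site_Name"], datatype=XSD.string)))

print(g.serialize(format="turtle"))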

Designing the minimal vocabulary for the Waternomics project is continuous work that will be carried out throughout the WP3 tasks. First, the constraints, models, and sensors of the other pilot sites should be covered by this vocabulary. Second, the development of support services and applications might require additional data sources, such as new entities, new sensors, and other open data. This leads to phase 3 of the design of the Waternomics minimal vocabulary (see Figure 49). Similar to phase 2 (see Figure 48), two steps are required: (1) identifying relevant ontological concepts and including them in the minimal vocabulary, then (2) matching the new concepts to existing ontologies. The final output of these updates will be included in D3.1.2.

Figure 49 – Phase 3 of the design of the RDF minimal vocabulary for Waternomics

In the rest of the document, we use water as a prefix to refer to the water vocabulary that contains all the classes and properties generated from the entity relationship model.



Please note that throughout this appendix we use an online RDF translator (http://rdf-translator.appspot.com/) to validate the syntax of the proposed examples. We also use the W3C RDF Validation Service (http://www.w3.org/RDF/Validator/) for creating the corresponding RDF graphs.

2. Sensor to RDF

2.2 Entity relationship to RDF mapping

Figure 50 – Entity Relationship Model: Sensor

Table 9: Mapping Sensor Entity Relationship Model to RDF

Item | RDF Concept | Mapping to an existing ontology concept
Sensor | water:Sensor (rdf:Class) | ssn:Sensor
Sensor_ID | water:sensor_ID (rdf:Property) |
Sensor_Name | water:sensorName (rdf:Property) | rdfs:label
Description | water:description (rdf:Property) | rdfs:comment
Site_Name | water:site_Name (rdf:Property) | dul:hasLocation

2.3 Example

In our example we will use the following base URI: http://waternomics.eu/data/

Table 10: Sensor Table Example

Sensor_ID | Sensor_Name | Site_Name | Description
1 | Vtech Sensor | VTEC | This is VTEC building
2 | NUIG Sensor | NUIG | This sensor is installed in the NUIG Engineering Building

Listing 3: Sensor Example RDF (N3)

@base <http://waternomics.eu/data/> .
@prefix ssn: <http://purl.oclc.org/NET/ssnx/ssn#> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .
@prefix dul: <http://www.loa-cnr.it/ontologies/DUL.owl#> .
@prefix water: <…> .

<sensor/1> a ssn:Sensor ;
    water:sensor_ID "1"^^xsd:string ;
    rdfs:label "Vtech Sensor"^^xsd:string ;
    rdfs:comment "This is VTEC building"^^xsd:string ;
    dul:hasLocation "Vtech"^^xsd:string .

<sensor/2> a ssn:Sensor ;
    water:sensor_ID "2"^^xsd:string ;
    rdfs:label "NUIG Sensor"^^xsd:string ;
    rdfs:comment "This sensor is installed in the NUIG Engineering Building"^^xsd:string ;
    dul:hasLocation "NUIG"^^xsd:string .

Figure 51 – RDF graph example for Sensor

3. Observation_Type to RDF

3.2 Entity relationship to RDF mapping

Figure 52 – Entity Relationship Model: Observation_Type

Table 11: Mapping Observation_Type Entity Relationship Model to RDF

Item | RDF Concept | Mapping to an existing ontology concept
Observation_Type | water:Observation_Type (rdf:Class) | ssn:Property
Observed_Property | water:observed_Property (rdf:Property) |
Observation_Type_ID | water:observation_Type_ID (rdf:Property) |
Unit_Of_Measurement | water:unit_Of_Measurement (rdf:Property) | gr:hasUnitOfMeasurement


3.3 Example

Table 12: Observation_Type Table Example

Observation_Type_ID | Observed_Property | Unit_Of_Measurement
fvel | Flow Velocity | m/s
flow | Flow | m3/h
waterc | Water Consumption | m3

Listing 4: Observation_Type Example RDF (N3)

@base <http://waternomics.eu/data/> .
@prefix ssn: <http://purl.oclc.org/NET/ssnx/ssn#> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .
@prefix gr: <http://purl.org/goodrelations/v1#> .
@prefix water: <…> .

<observationType/fvel> a ssn:Property ;
    water:observation_Type_ID "fvel"^^xsd:string ;
    water:observed_Property "Flow Velocity"^^xsd:string ;
    gr:hasUnitOfMeasurement "m/s"^^xsd:string .

<observationType/flow> a ssn:Property ;
    water:observation_Type_ID "flow"^^xsd:string ;
    water:observed_Property "Flow"^^xsd:string ;
    gr:hasUnitOfMeasurement "m3/h"^^xsd:string .

<observationType/waterc> a ssn:Property ;
    water:observation_Type_ID "waterc"^^xsd:string ;
    water:observed_Property "Water Consumption"^^xsd:string ;
    gr:hasUnitOfMeasurement "m3"^^xsd:string .


Figure 53 – RDF graph example for Observation_Type

4. Sensor_Observation_Type to RDF

4.2 Entity relationship to RDF mapping

Figure 54 – Entity Relationship Model: Sensor_Observation_Type captured as the Observes relation

Table 13: Mapping Sensor_Observation_Type Entity Relationship Model to RDF

Item | RDF Concept | Mapping to an existing ontology concept
Observes (Sensor_Observation_Type) | water:Sensor_Observation_Type (rdf:Class) | ssn:MeasurementProperty
Sensor_ID | water:associated_Sensor (rdf:Property) |
Observation_Type_ID | water:associated_Observation_Type (rdf:Property) |
Observation_Interval | water:observation_Interval (rdf:Property) |
Accuracy | water:accuracy (rdf:Property) |
Range_Min | water:range_Min (rdf:Property) | gr:hasMinValue
Range_Max | water:range_Max (rdf:Property) | gr:hasMaxValue

4.3 Example

Table 14: Sensor_Observation_Type Table Example

Sensor_ID | Observation_Type_ID | Range_Min | Range_Max | Accuracy | Observation_Interval
1 | fvel | 0 | 200 | 10 | 20
1 | flow | 0 | 200 | 1 | 20

Listing 5: Sensor_Observation_Type Example RDF (N3)

@base <http://waternomics.eu/data/> .
@prefix ssn: <http://purl.oclc.org/NET/ssnx/ssn#> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .
@prefix gr: <http://purl.org/goodrelations/v1#> .
@prefix water: <…> .

<sensorObservationType/1-fvel> a ssn:MeasurementProperty ;
    gr:hasMinValue "0"^^xsd:double ;
    gr:hasMaxValue "200"^^xsd:double ;
    water:accuracy "10"^^xsd:double ;
    water:observation_Interval "20"^^xsd:integer ;
    water:associated_Sensor <sensor/1> ;
    water:associated_Observation_Type <observationType/fvel> .

<sensorObservationType/1-flow> a ssn:MeasurementProperty ;
    gr:hasMinValue "0"^^xsd:double ;
    gr:hasMaxValue "200"^^xsd:double ;
    water:accuracy "1"^^xsd:double ;
    water:observation_Interval "20"^^xsd:integer ;
    water:associated_Sensor <sensor/1> ;
    water:associated_Observation_Type <observationType/flow> .


Figure 55 – RDF graph example for Sensor_Observation_Type

5. Observation to RDF

5.2 Entity relationship to RDF mapping

Figure 56 – Entity Relationship Model: Observation


Table 15: Mapping Observation Entity Relationship Model to RDF

Item | RDF Concept | Mapping to an existing ontology concept
Observation | water:Observation (rdf:Class) | ssn:Observation
Observation_ID | water:observation_ID (rdf:Property) |
DateTime | water:dateTime (rdf:Property) |
Value | water:value (rdf:Property) | owl:hasValue
Observed_By | water:observed_By (rdf:Property) | ssn:observedBy
Observation_Property | water:observation_Property (rdf:Property) |

5.3 Example

Table 16: Observation Table Example

Observation_ID | Sensor_ID | Observation_Type_ID | DateTime | Value
0 | 1 | fvel | 1417786723000 | 1429.6
1 | 1 | flow | 1417786723000 | 1.32423

Listing 6: Observation Example RDF (N3)

@base <http://waternomics.eu/data/> .
@prefix ssn: <http://purl.oclc.org/NET/ssnx/ssn#> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .
@prefix owl: <http://www.w3.org/2002/07/owl#> .
@prefix water: <…> .

<observation/0> a ssn:Observation ;
    water:observation_ID "0"^^xsd:string ;
    ssn:observedBy <sensor/1> ;
    water:observation_Property <observationType/fvel> ;
    water:dateTime "1417786723000"^^xsd:double ;
    owl:hasValue "1429.6"^^xsd:double .

<observation/1> a ssn:Observation ;
    water:observation_ID "1"^^xsd:string ;
    ssn:observedBy <sensor/1> ;
    water:observation_Property <observationType/flow> ;
    water:dateTime "1417786723000"^^xsd:double ;
    owl:hasValue "1.32423"^^xsd:double .


Figure 57 – RDF graph example for Observation

6. Aggregated_Observation to RDF

6.1 Entity relationship to RDF mapping

Figure 58 – Entity Relationship Model: Aggregated_Observation

Table 17: Mapping Aggregated_Observation Entity Relationship Model to RDF

Item | RDF Concept | Mapping to an existing ontology concept
Aggregated_Observation | water:Aggregated_Observation (rdf:Class) | ssn:Observation
Aggregation_ID | water:aggregation_ID (rdf:Property) |
Aggregation_Label | water:aggregation_Label (rdf:Property) | rdfs:label
Aggregation_Function | water:aggregation_Function (rdf:Property) |
Aggregation_StartTime | water:aggregation_StartTime (rdf:Property) |
Aggregation_EndTime | water:aggregation_EndTime (rdf:Property) |
Value | water:value (rdf:Property) | owl:hasValue
Sensor_ID | water:associated_Sensor (rdf:Property) |
Observation_Type_ID | water:associated_Observation_Type (rdf:Property) |

6.2 Example

Table 18: Aggregated_Observation Table Example

Aggregation_ID | Sensor_ID | Observation_Type_ID | Aggregation_Function | Aggregation_Label | Aggregation_StartTime | Aggregation_EndTime | Value
123 | 1 | waterc | sum | today | 1417737600000 | 1417786723000 | 7.1783

Listing 7: Aggregated_Observation Example RDF (N3)

@base <http://waternomics.eu/data/> .
@prefix ssn: <http://purl.oclc.org/NET/ssnx/ssn#> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .
@prefix owl: <http://www.w3.org/2002/07/owl#> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix water: <…> .

<aggregatedObservation/123> a ssn:Observation ;
    water:aggregation_ID "123"^^xsd:string ;
    water:associated_Sensor <sensor/1> ;
    rdfs:label "today"^^xsd:string ;
    water:aggregation_Function "sum"^^xsd:string ;
    water:associated_Observation_Type <observationType/waterc> ;
    water:aggregation_StartTime "1417737600000"^^xsd:double ;
    water:aggregation_EndTime "1417786723000"^^xsd:double ;
    owl:hasValue "7.1783"^^xsd:double .


Figure 59 – RDF graph example for Aggregated_Observation


Appendix B: Survey of Open Data for Water Management

As a result of developments in computing (e.g., data storage) and new devices, the ability to acquire and store data has grown considerably in recent years. Many organisations, companies, and agencies can now collect data on a scale never seen before [66]. In particular, within the goal of Waternomics (water saving), data on water (e.g., water consumption, leakages, and flow rate and pressure in water networks) are collected and made available through a publishing platform. This matters to several stakeholders and researchers, as the ability to store new kinds of data can generate new revenue opportunities in both research and consulting services. As an example, these datasets allow researchers to track and observe water across scales and timeframes, making it possible to reconstruct and understand end users' water consumption processes, to engage users, to stimulate water usage efficiency, and to share water information at time scales relevant for decision making, helping to increase end users' awareness and converge towards an ecosystem of responsible water usage.

Another important task within the management of water datasets arises from the possibility to acquire and analyse data from satellites. Such data can provide, for example:
• information on groundwater levels;
• assessments of water quality across large areas.
Regarding the first point, the related data may allow researchers to identify potential challenges for people and ecosystems. Through the second point, satellite data may identify changes in water quality more quickly than on-the-ground sampling, with great benefit for human health through continuous and near real-time monitoring. Thus, the collection and analysis of data about water, besides connecting researchers and governments with individual water users, also raises awareness of water management challenges and supports transparency in water governance.

An example of the effectiveness of such datasets is the Global Earth Observation System of Systems (GEOSS) [67]. It will be a global and flexible network of content providers allowing decision makers to access an extraordinary range of information at their desks, simultaneously addressing nine areas of critical importance to people and society (see Figure 60). This system links together existing and planned observing systems around the world and will support the development of new systems where gaps currently exist. The related GEOSS Portal (see Figure 61) offers a single entry point for users looking for data, imagery, and analytical software packages relevant to all parts of the globe (both with and without Internet access), connecting users to existing databases and portals and providing reliable, up-to-date, and user-friendly information, vital for the work of decision makers, planners, and emergency managers.


Figure 60: GEOSS logo [67]

Figure 61: GEOSS Portal [67]

Nowadays, many government agencies around the world are promoting and encouraging the development of open data about water resources and related aspects, e.g.:
• the AfriGEOSS programme, part of the GEOSS project, which is expected to help African agencies respond to natural disasters such as floods and forest fires by supporting the direct download of satellite data;
• the US Geological Survey;
• the WISDOM project [75];
• the Kenya Open Data Initiative.
On the one hand, the above approaches, with the large number of datasets they create, offer a potential tool for both research and evidence-based management of water resources; on the other hand, they raise issues related to the accessibility of the data itself. These potentials can be fully realised only if data are available and accessible in a form that supports their use. That means data have to be published following a certain structure or schema. Basically, the challenge is to store the data in a format that can be easily read and understood by other ICT tools. This can be ensured by using Semantic Web best practices to define and use proper ontologies and to publish the data following those schemas, thus producing Linked Open Data.


These aspects are summarised in the "Data Sharing Principles" established by GEOSS [68]. In brief, these principles pay special attention to:
• metadata: descriptive information about data that supports their use;
• data interoperability: a functional level of consistency that allows systems to work together.
Related to these aspects, a more difficult and delicate challenge is posed by legal restrictions and data ownership, which vary across countries; sometimes conflicts or issues arise even within the same country. Moreover, data collection is often expensive and time consuming, so some agencies and companies that collect data need to sell the collected datasets to partially or totally cover their costs. In other words, only some of the published data are freely and openly available to end users. These restrictions constitute a real barrier for researchers and stakeholders, limiting knowledge sharing and potentially hindering the development of tools for water users. Within the scientific community, attitudes toward data sharing vary widely because, as already observed, collecting and organising environmental monitoring data or observations can take a lot of time and effort. As one countermeasure to protect data while at the same time incentivising their production, scientific journals are establishing policies for data access, supporting data storage and access, and even providing dedicated venues for data publication, such as Earth System Science Data (see [69] for more details). In Europe, consolidated policy recommendations for open access to research data are being developed within the RECODE project (Figure 62); see [70] for more details.

Figure 62: RECODE logo Another correlated problem is represented by the way this data are presented to the user. It is a challenging problem to present advanced GUI to the end-user, and a proper user-friendly way to show linked data should be individuated and drawn[51]. Tools that can be used for appealing linked data visualization are proposed in the literature[52][53][54].Figure 63 shows the look and feel of [53] that can be used for any ontology.

Figure 63: Look and feel of VOWL


Examples of published open data on top of which Semantic Web best practices have been applied are:

• U.S. Geological Survey real-time flow data: http://waterdata.usgs.gov/nwis/rt
• INSPIRE: http://inspire.ec.europa.eu/index.cfm
• EU Open Data Portal: http://open-data.europa.eu/en/data/
• Integrated Earth Data Applications (IEDA): http://www.iedadata.org/
• Kenya Open Data: https://opendata.go.ke
• Open Water Foundation: http://openwaterfoundation.org/
• Water data catalogue, City of Toronto: http://www1.toronto.ca/wps/portal/contentonly?vgnextoid=8517e03bb8d1e310VgnVCM10000071d60f89RCRD
• Open Water Data Initiative: http://acwi.gov/acwi-minutes/acwi2014/Open_water_data_proposal_to_acwi-7-21-14.pdf
• UK WaterAid: http://www.wateraid.org/uk/who-we-are/open-data

One more example is the WISDOM project [75], a bilateral FP7 research project between Germany and Vietnam, started in 2007, whose focus is the creation of a water-related information system for the Mekong Delta (Figure 64 shows a screenshot of the project home page). In more detail, its main objective is the collection of data on the Mekong Delta covering several fields, such as hydrology, geography, Earth observation, and sociology. This data is then integrated into a single information system. The rationale is to have a system able to combine information on different factors and provide decision support for questions such as: in case of a flood, how much agricultural land has been affected? Ontologies describing spatial, thematic, and temporal reference aspects have been implemented in order to provide semantic information that can be added to the datasets. This also allows quicker data retrieval via meaningful search attributes, e.g., looking for all data in a certain administrative area with respect to a certain theme. The WISDOM project aims to save water and energy through the integration of innovative Information and Communication Technology (ICT) frameworks to optimise water distribution networks and to enable change in consumer behaviour through innovative demand management and adaptive pricing schemes. To this end, the approach used within the WISDOM project combines sensor monitoring and communication systems with semantic modelling and control capabilities to provide near real-time management of urban water resources.

Figure 64: Screenshot of the WISDOM project website


As far as ICT tools are concerned, an existing solution that can be used for open data management within the water saving domain is the sMAP project ([76], [77]). It provides: (i) a specification for transmitting physical data and describing its contents, (ii) a wide choice of open drivers to communicate with devices using native protocols, and (iii) everything needed to query large repositories of physical data.

Figure 65: Architecture of the sMAP project (http://www.cs.berkeley.edu/~stevedh/smap2/)

Figure 65 shows the architecture of sMAP, which consists of several components for capturing, transmitting, and storing time series data. sMAP sources collect time series data using different protocols (such as Modbus, BACnet, or HTTP/XML), using the sMAP library to communicate the data to consumers. The sMAP archiver provides an interface to store data and to use real-time data, represented in JSON format. It makes use of SQL and a REST API to select time series. Applications provide appealing visualisations of time series, compute control strategies, and provide user feedback. The basic way to use sMAP is to create and configure data sources, which feed into the overall system. The sMAP source library is developed in the Python programming language and makes use of Twisted [78], an asynchronous event system which serves sMAP resources over HTTP and manages data forwarding. The sMAP archiver is a separate script, also developed in Python. It is built on top of readingdb [79], an optimised time series database.

Pachube [80] is a hosted web application for collecting and cataloguing a large variety of sensor data, with an HTTP/JSON interface similar to sMAP's. It enables people to monitor and share real-time environmental data from sensors that are connected to the Internet. It works between environments: it can capture input data from remote sensors and serve output data to remote actuators. It can be used in physical environments, and it also allows people to embed sensor data in web pages. After data is loaded into the system, different visualisation and analytics plugins are available for examining readings. The system can be considered an example of the type of universal application that can be built on top of sMAP. It was acquired by LogMeIn in July 2011 and is now known as Xively.com. Figure 66 shows the old website where Pachube was hosted.



Figure 66: Pachube old website

Researchers in [81] have shown a use case, related to integrated water resources, on how to connect data sources available on the Web using Linked Data. Specifically, they formalised the Integrated Water Resources Management (IWRM) knowledge and decision support domain in an OWL ontology that reuses the RDF Data Cube Vocabulary [82]. Linked Data is used to represent, extract, integrate, and load community-created research and sensor data into a knowledge base where data can be queried and browsed. Finally, the authors presented consumption tools on top of the IWRM knowledge base allowing scientists to share and reuse research data. The work was carried out within the SMART II project [83]. Figure 67 shows the architecture of the SMART Knowledge Base (SKB), which is divided into data sources and data consumption tools. A triple store allows the consumption tools to access data from the data sources; the data is then further processed and shown to the users. The data sources consist of: (i) an IWRM ontology, developed in (ii) Dropedia [84], the collaborative knowledge management system of the SMART project, based on the open source semantic wiki software Semantic MediaWiki, and (iii) the SMART-DB, whose data is published as Linked Data for integration in the SMART Knowledge Base using a Google App Engine service which translates XML into an RDF representation using the IWRM ontology.

Figure 67: Architecture of the SMART Knowledge Base [81]


8 References

[1] "The Open Definition." [Online]. Available: http://opendefinition.org/.
[2] E. Bruke, "An Autonomic Approach to Real-Time Predictive Analytics using Open Data and the Web of Things," National University of Ireland, Galway, 2013.
[3] T. Berners-Lee, "Linked Data - Design Issues," 2006. [Online]. Available: http://www.w3.org/DesignIssues/LinkedData.html. [Accessed: 12-May-2013].
[4] CERN, "The birth of the Web."
[5] T. Berners-Lee, R. Fielding, and L. Masinter, "Uniform Resource Identifier (URI): Generic Syntax," 2005. [Online]. Available: http://merlot.tools.ietf.org/html/rfc3986. [Accessed: 01-Mar-2012].
[6] T. Berners-Lee and D. Connolly, "Hypertext Markup Language - 2.0," 1995.
[7] T. Berners-Lee, T. Bray, D. Connolly, P. Cotton, R. Fielding, M. Jeckle, C. Lilley, N. Mendelsohn, D. Orchard, N. Walsh, and others, "Architecture of the World Wide Web, Volume One," 2004. [Online]. Available: http://www.w3.org/TR/2004/REC-webarch-20041215/.
[8] T. Berners-Lee, J. Hendler, and O. Lassila, "The Semantic Web," Sci. Am., vol. 284, no. 5, pp. 34–43, 2001.
[9] W3C, "Semantic Web Layer Cake," 2007. [Online]. Available: http://www.w3.org/2007/03/layerCake.png.
[10] G. Klyne and J. J. Carroll, "Resource Description Framework (RDF): Concepts and Abstract Syntax," 2004. [Online]. Available: http://www.w3.org/TR/2004/REC-rdf-concepts-20040210/. [Accessed: 25-Feb-2013].
[11] C. Bizer, R. Cyganiak, T. Heath, and others, "How to Publish Linked Data on the Web," 2007. [Online]. Available: http://wifo5-03.informatik.uni-mannheim.de/bizer/pub/LinkedDataTutorial/.
[12] D. Brickley and R. V. Guha, "RDF Vocabulary Description Language 1.0: RDF Schema," 2004.
[13] D. L. McGuinness, F. van Harmelen, and others, "OWL Web Ontology Language Overview," W3C Recomm., vol. 10, pp. 2003–2004, 2004.
[14] E. Prud'hommeaux and A. Seaborne, "SPARQL Query Language for RDF," W3C Work. Draft, vol. 4, no. January, 2008.
[15] S. Auer, C. Bizer, G. Kobilarov, J. Lehmann, R. Cyganiak, and Z. Ives, "DBpedia: A Nucleus for a Web of Open Data," Semant. Web, vol. 4825, pp. 722–735, 2007.
[16] M. Schmachtenberg, C. Bizer, A. Jentzsch, and R. Cyganiak, "Linking Open Data Cloud Diagram 2014," 2014. [Online]. Available: http://lod-cloud.net/.
[17] M. Compton, P. Barnaghi, L. Bermudez, L. Lefort, M. Leggieri, H. Neuhaus, A. Nikolov, K. Page, A. Passant, and A. Sheth, "The SSN Ontology of the W3C Semantic Sensor Network Incubator Group," 2011.
[18] "OGC - Open Geospatial Consortium."
[19] "SensorML - Sensor Model Language."
[20] S. Das, S. Sundara, and R. Cyganiak, "R2RML: RDB to RDF Mapping Language (W3C Recommendation 27 September 2012)," 2011.
[21] J. F. Sequeda and D. P. Miranker, "Ultrawrap: SPARQL Execution on Relational Data," Web Semant., vol. 22, pp. 19–39, 2013.
[22] A. Dimou, M. Vander Sande, P. Colpaert, R. Verborgh, E. Mannens, and R. Van de Walle, "RML: A Generic Language for Integrated RDF Mappings of Heterogeneous Data," in Proceedings of the 7th Workshop on Linked Data on the Web, 2014.
[23] J. Golbeck, M. Grove, B. Parsia, A. Kalyanpur, and J. A. Hendler, "New Tools for the Semantic Web," in Proceedings of the 13th International Conference on Knowledge Engineering and Knowledge Management. Ontologies and the Semantic Web, 2002, pp. 392–400.
[24] T. Lebo and G. T. Williams, "Converting governmental datasets into linked data," in I-SEMANTICS, 2010.
[25] L. Han, T. Finin, C. Parr, J. Sachs, and A. Joshi, "RDF123: From Spreadsheets to RDF," in The Semantic Web - ISWC 2008, 2008, vol. 5318, pp. 451–466.
[26] A. Langegger and W. Wöß, "XLWrap - Querying and Integrating Arbitrary Spreadsheets with SPARQL," in The Semantic Web - ISWC 2009, 2009, vol. 5823, pp. 359–374.
[27] D. Loshin, Master Data Management. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc., 2008.
[28] A. Berson and L. Dubov, Master Data Management and Customer Data Integration for a Global Enterprise, 1st ed. New York, NY, USA: McGraw-Hill, Inc., 2007.
[29] X. Wang, X. Sun, F. Cao, L. Ma, N. Kanellos, K. Zhang, Y. Pan, and Y. Yu, "SMDM: Enhancing Enterprise-Wide Master Data Management Using Semantic Web Technologies," Proc. VLDB Endow., vol. 2, no. 2, pp. 1594–1597, 2009.
[30] B. Otto and A. Reichert, "Organizing Master Data Management: Findings from an Expert Survey," in Proceedings of the 2010 ACM Symposium on Applied Computing - SAC '10, 2010, pp. 106–110.
[31] B. Otto, "Data Governance," Bus. Inf. Syst. Eng., vol. 3, no. 4, pp. 241–244, Jun. 2011.
[32] L. Haas, "Beauty and the Beast: The Theory and Practice of Information Integration," in Database Theory – ICDT 2007, vol. 4353, T. Schwentick and D. Suciu, Eds. Berlin, Heidelberg: Springer Berlin Heidelberg, 2006, pp. 28–43.
[33] A. Haug and J. S. Arlbjørn, "Barriers to master data quality," J. Enterp. Inf. Manag., vol. 24, no. 3, pp. 288–303, 2011.
[34] P. A. Bernstein and L. M. Haas, "Information integration in the enterprise," Commun. ACM, vol. 51, no. 9, p. 72, Sep. 2008.
[35] A. Doan, R. Ramakrishnan, and A. Y. Halevy, "Crowdsourcing systems on the World-Wide Web," Commun. ACM, vol. 54, no. 4, p. 86, Apr. 2011.
[36] B. Sarwar, G. Karypis, J. Konstan, and J. Riedl, "Item-based collaborative filtering recommendation algorithms," New York, NY, USA: ACM Press, 2001, pp. 285–295.
[37] M. A. Suryanto, E. P. Lim, A. Sun, and R. H. L. Chiang, "Quality-aware collaborative question answering," in Proceedings of the Second ACM International Conference on Web Search and Data Mining - WSDM '09, 2009, pp. 142–151.
[38] J. Bian, Y. Liu, E. Agichtein, and H. Zha, "Finding the right facts in the crowd: factoid question answering over social media," in Proceedings of the 17th International Conference on World Wide Web - WWW '08, 2008, pp. 467–476.
[39] K. K. Nam, M. S. Ackerman, and L. A. Adamic, "Questions in, knowledge in?: a study of Naver's question answering community," in Proceedings of the 27th International Conference on Human Factors in Computing Systems - CHI '09, 2009, pp. 779–788.
[40] J. Swarts, "The collaborative construction of 'fact' on Wikipedia," in Proceedings of the 27th ACM International Conference on Design of Communication - SIGDOC '09, 2009, pp. 281–288.
[41] A. Kittur, E. Chi, B. A. Pendleton, and T. Mytkowicz, "Power of the Few vs. Wisdom of the Crowd: Wikipedia and the Rise of the Bourgeoisie," Algorithmica, pp. 1–9.
[42] K. Bollacker, C. Evans, P. Paritosh, T. Sturge, and J. Taylor, "Freebase: a collaboratively created graph database for structuring human knowledge," in Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data - SIGMOD '08, 2008, pp. 1247–1250.
[43] M. Magrane and U. Consortium, "UniProt Knowledgebase: a hub of integrated protein data," Database, vol. 2011, Mar. 2011.
[44] T. Palpanas, J. Chaudhry, P. Andritsos, and Y. Velegrakis, "Entity Data Management in OKKAM," in 2008 19th International Conference on Database and Expert Systems Applications, 2008, pp. 729–733.
[45] K. Bollacker, P. Tufts, T. Pierce, and R. Cook, "A Platform for Scalable, Collaborative, Structured Information Integration," in Sixth International Workshop on Information Integration on the Web, AAAI-07, 2007.
[46] S. Kochhar, S. Mazzocchi, and P. Paritosh, "The anatomy of a large-scale human computation engine," in Proceedings of the ACM SIGKDD Workshop on Human Computation, 2010, pp. 10–17.
[47] H. Stoermer, T. Palpanas, and G. Giannakopoulos, "The Entity Name System: Enabling the Web of Entities," in IEEE 26th International Conference on Data Engineering Workshops (ICDEW), 2010, pp. 227–232.
[48] H. Stoermer and P. Bouquet, "A novel approach for entity linkage," 2009 IEEE Int. Conf. Inf. Reuse Integr., pp. 151–156, Aug. 2009.
[49] A. Doan, R. Ramakrishnan, F. Chen, P. DeRose, Y. Lee, R. McCann, M. Sayyadian, and W. Shen, "Community Information Management," IEEE Data Eng. Bull., vol. 29, no. 1, pp. 64–72, 2006.
[50] P. DeRose, W. Shen, F. Chen, A. Doan, and R. Ramakrishnan, "Building Structured Web Community Portals: A Top-Down, Compositional, and Incremental Approach," in Proceedings of the 33rd International Conference on Very Large Data Bases, 2007, pp. 399–410.
[51] U. Ul Hassan, S. O'Riain, and E. Curry, "Towards Expertise Modelling for Routing Data Cleaning Tasks within a Community of Knowledge Workers," in Proceedings of the 17th International Conference on Information Quality, 2012.
[52] U. Ul Hassan, S. O'Riain, and E. Curry, "Leveraging Matching Dependencies for Guided User Feedback in Linked Data Applications," in 9th International Workshop on Information Integration on the Web (IIWeb 2012), 2012.
[53] U. Ul Hassan, M. Bassora, A. H. Vahid, S. O'Riain, and E. Curry, "A collaborative approach for metadata management for Internet of Things: Linking micro tasks with physical objects," in 9th International Conference on Collaborative Computing: Networking, Applications and Worksharing, 2013, pp. 593–598.
[54] U. Ul Hassan, S. O'Riain, and E. Curry, "Effects of Expertise Assessment on the Quality of Task Routing in Human Computation," in Proceedings of the 2nd International Workshop on Social Media for Crowdsourcing and Human Computation, 2013.
[55] U. Ul Hassan and E. Curry, "A capability requirements approach for predicting worker performance in crowdsourcing," in 9th International Conference on Collaborative Computing: Networking, Applications and Worksharing, 2013, pp. 429–437.
[56] F. Yang, E. Tschetter, X. Léauté, N. Ray, G. Merlino, and D. Ganguli, "Druid: a real-time analytical data store," in Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data - SIGMOD '14, 2014, pp. 157–168.
[57] M. Zaharia, M. Chowdhury, M. J. Franklin, S. Shenker, and I. Stoica, "Spark: Cluster Computing with Working Sets," in Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing, 2010, p. 10.
[58] "Apache Spark," 2015. [Online]. Available: https://spark.apache.org/.
[59] M. Franklin, A. Halevy, and D. Maier, "From Databases to Dataspaces: A New Abstraction for Information Management," SIGMOD Rec., vol. 34, no. 4, 2005.
[60] A. Halevy, M. Franklin, and D. Maier, "Principles of dataspace systems," in Proceedings of the Twenty-Fifth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems - PODS '06, 2006, pp. 1–9.
[61] N. Marz and J. Warren, Big Data: Principles and Best Practices of Scalable Realtime Data Systems. O'Reilly Media, 2013.
[62] S. Hasan and E. Curry, "Approximate Semantic Matching of Events for the Internet of Things," ACM Trans. Internet Technol., vol. 14, no. 1, pp. 1–23, Aug. 2014.
[63] S. Hasan and E. Curry, "Thematic event processing," in Proceedings of the 15th International Middleware Conference - Middleware '14, 2014, pp. 109–120.
[64] S. Hasan and E. Curry, "Thingsonomy: Tackling Variety in Internet of Things Events," IEEE Internet Comput., vol. PP, no. 99, p. 1, 2015.
[65] S. Hasan, S. O'Riain, and E. Curry, "Towards unified and native enrichment in event processing systems," in Proceedings of the 7th ACM International Conference on Distributed Event-Based Systems - DEBS '13, 2013, p. 171.
[66] J. Hering, "A virtual flood of information: open data for sustainable water management," 2014.
[67] "GEOSS - The Global Earth Observation System of Systems."
[68] "GEO Data Sharing Principles Implementation."
[69] "Earth System Science Data - The Data Publishing Journal," 2014.
[70] "Policy Recommendations for Open Access to Research Data in Europe."
[71] "Linked-data UIs: a response to David Karger," 2013.
[72] S. Peroni, D. Shotton, and F. Vitali, "Making Ontology Documentation with LODE," in Proceedings of the I-SEMANTICS 2012 Posters & Demonstrations Track, Graz, Austria, September 5-7, 2012, vol. 932, pp. 63–67.
[73] S. Lohmann, S. Negru, F. Haag, and T. Ertl, "VOWL 2: User-Oriented Visualization of Ontologies," in Knowledge Engineering and Knowledge Management - 19th International Conference, EKAW 2014, Linköping, Sweden, November 24-28, 2014, Proceedings, vol. 8876, pp. 266–281.
[74] C. Baldassarre, E. Daga, A. Gangemi, A. M. Gliozzo, A. Salvati, and G. Troiani, "Semantic Scout: Making Sense of Organizational Knowledge," in EKAW, 2010, pp. 272–286.
[75] "The WISDOM Project."
[76] "sMAP: the Simple Measurement and Actuation Profile."
[77] "sMAP 2.0 Documentation."
[78] "Twisted."
[79] "readingdb time series database."
[80] "Pachube."
[81] B. Kämpgen, D. Riepl, and J. Klinger, "SMART Research using Linked Data - Sharing Research Data for Integrated Water Resources Management in the Lower Jordan Valley," in SePublica, 2014.
[82] R. Cyganiak and D. Reynolds, "The RDF Data Cube Vocabulary."
[83] "The SMART II Project."
[84] "Dropedia, the collaborative knowledge management platform about Integrated Water Resources Management in the Lower Jordan Rift Valley."
