Towards a Web of Archaeological Linked Open Data

Total Page:16

File Type:pdf, Size:1020Kb

Towards a Web of Archaeological Linked Open Data WP15 Study: Towards a Web of Archaeological Linked Open Data Author: Guntram Geser, Salzburg Research Ariadne is funded by the European Commission’s 7th Framework Programme. ARIADNE WP15 Study: Towards a Web of Archaeological Linked Open Data Guntram Geser Salzburg Research Version 1.0 6 October 2016 ARIADNE is a project funded by the European Commission under the Community’s Seventh Framework Programme, contract no. FP7-INFRASTRUCTURES-2012-1-313193. The views and opinions expressed in this report are the sole responsibility of the author and do not necessarily reflect the views of the European Commission. ARIADNE – WP15 Study: Towards a Web of Archaeological Linked Open Data Prepared by SRFG Table of content 1 Executive Summary ................................................................................................................. 5 2 Vision, study summaries, and recommendations ...................................................................... 7 2.1 Archaeological Linked Open Data – a vision ............................................................................... 7 2.2 Study summaries and recommendations ................................................................................... 8 2.2.1 Linked Open Data: Background and principles .............................................................................. 8 2.2.2 The Linked Open Data Cloud ......................................................................................................... 9 2.2.3 Adoption of the Linked Data approach in archaeology ............................................................... 10 2.2.4 Requirements for wider uptake of the Linked Data approach .................................................... 11 3 Linked Open Data: Background and principles ........................................................................ 17 3.1 LOD – A brief introduction ........................................................................................................ 17 3.2 Historical and current background ........................................................................................... 18 3.3 Linked Data principles and standards....................................................................................... 19 3.3.1 Linked Data basics ....................................................................................................................... 19 3.3.2 Linked Open Data ........................................................................................................................ 20 3.3.3 Metadata and vocabulary as Linked Data ................................................................................... 21 3.3.4 Good practices for Linked Data vocabularies .............................................................................. 22 3.3.5 Metadata for sets of Linked Data ................................................................................................ 23 3.4 What adopters should consider first ........................................................................................ 24 3.5 Mastering the Linked Data lifecycle ......................................................................................... 25 3.6 Brief summary and recommendations ..................................................................................... 26 4 The Linked Open Data Cloud .................................................................................................. 28 4.1 LOD Cloud figures ..................................................................................................................... 28 4.2 (Mis-)reading the LOD diagram ................................................................................................ 29 4.3 Cultural heritage in the LOD Cloud ........................................................................................... 31 4.4 Brief summary and recommendations ..................................................................................... 34 5 Adoption of the Linked Data approach in archaeology ............................................................ 35 5.1 Adoption by cultural heritage institutions ............................................................................... 35 5.2 Low uptake for archaeological research data .......................................................................... 36 5.3 The Ancient World research community as a front-runner ..................................................... 38 5.4 Brief summary and recommendations ..................................................................................... 41 6 Requirements for wider uptake of the Linked Data approach.................................................. 43 6.1 Raise awareness of Linked Data ............................................................................................... 43 6.1.1 Fragmentation of archaeological data ......................................................................................... 43 6.1.2 Current awareness of Linked Data............................................................................................... 44 6.1.3 Brief summary and recommendations ........................................................................................ 46 6.2 Clarify the benefits and costs of Linked Data ........................................................................... 47 6.2.1 The notion of an unfavourable cost/benefit ratio ....................................................................... 47 ARIADNE 2 October 2016 ARIADNE – WP15 Study: Towards a Web of Archaeological Linked Open Data Prepared by SRFG 6.2.2 Lack of cost/benefit evaluation ................................................................................................... 48 6.2.3 Collecting examples of benefits and costs ................................................................................... 50 6.2.4 Brief summary and recommendations ........................................................................................ 54 6.3 Enable non-IT experts use Linked Data tools ........................................................................... 55 6.3.1 Linked Data tools: there are many and most are not useable..................................................... 55 6.3.2 Need of expert support ............................................................................................................... 56 6.3.3 The case of CIDOC CRM: from difficult to doable ........................................................................ 56 6.3.4 Progress through data mapping tools and templates ................................................................. 57 6.3.5 Need to integrate shared vocabularies into data recording tools ............................................... 58 6.3.6 Brief summary and recommendations ........................................................................................ 60 6.4 Promote Knowledge Organization Systems as Linked Open Data ........................................... 61 6.4.1 Knowledge Organization Systems (KOSs) .................................................................................... 61 6.4.2 Cultural heritage vocabularies in use .......................................................................................... 62 6.4.3 Development of KOSs as Linked Open Data ................................................................................ 63 6.4.4 KOSs registries ............................................................................................................................. 66 6.4.5 Brief summary and recommendations ........................................................................................ 68 6.5 Foster reliable Linked Data for interlinking .............................................................................. 69 6.5.1 Current lack of interlinking .......................................................................................................... 69 6.5.2 Why is there a lack of interlinking? ............................................................................................. 70 6.5.3 Need of reliable Linked Data resources ....................................................................................... 70 6.5.4 Foster a community of archaeological LOD curators ................................................................... 72 6.5.5 Brief summary and recommendations ........................................................................................ 72 6.6 Promote Linked Open Data for research .................................................................................. 73 6.6.1 A Linked Open Data vision (2010) ................................................................................................ 74 6.6.2 LOD for research: The current state of play ................................................................................ 74 6.6.3 Search vs. research ...................................................................................................................... 76 6.6.4 Examples of research-oriented Linked Data projects .................................................................. 77 6.6.5 CIDOC CRM as a basis for research applications ......................................................................... 78 6.6.6 Brief summary and recommendations ........................................................................................ 80 7 References and other sources ...............................................................................................
Recommended publications
  • Off the Beaten Track
    Off the Beaten Track To have your recording considered for review in Sing Out!, please submit two copies (one for one of our reviewers and one for in- house editorial work, song selection for the magazine and eventual inclusion in the Sing Out! Resource Center). All recordings received are included in “Publication Noted” (which follows “Off the Beaten Track”). Send two copies of your recording, and the appropriate background material, to Sing Out!, P.O. Box 5460 (for shipping: 512 E. Fourth St.), Bethlehem, PA 18015, Attention “Off The Beaten Track.” Sincere thanks to this issue’s panel of musical experts: Richard Dorsett, Tom Druckenmiller, Mark Greenberg, Victor K. Heyman, Stephanie P. Ledgin, John Lupton, Angela Page, Mike Regenstreif, Seth Rogovoy, Ken Roseman, Peter Spencer, Michael Tearson, Theodoros Toskos, Rich Warren, Matt Watroba, Rob Weir and Sule Greg Wilson. that led to a career traveling across coun- the two keyboard instruments. How I try as “The Singing Troubadour.” He per- would have loved to hear some of the more formed in a variety of settings with a rep- unusual groupings of instruments as pic- ertoire that ranged from opera to traditional tured in the notes. The sound of saxo- songs. He also began an investigation of phones, trumpets, violins and cellos must the music of various utopian societies in have been glorious! The singing is strong America. and sincere with nary a hint of sophistica- With his investigation of the music of tion, as of course it should be, as the Shak- VARIOUS the Shakers he found a sect which both ers were hardly ostentatious.
    [Show full text]
  • Archaeology and the Big Data Challenge
    Think big about data: Archaeology and the Big Data challenge Gabriele Gattiglia Abstract – Usually defined as high volume, high velocity, and/or high variety data, Big Data permit us to learn things that we could not comprehend using smaller amounts of data, thanks to the empowerment provided by software, hardware and algorithms. This requires a novel archaeological approach: to use a lot of data; to accept messiness; to move from causation to correlation. Do the imperfections of archaeological data preclude this approach? Or are archaeological data perfect because they are messy and difficult to structure? Normal- ly archaeology deals with the complexity of large datasets, fragmentary data, data from a variety of sources and disciplines, rarely in the same format or scale. If so, is archaeology ready to work more with data-driven research, to accept predictive and probabilistic techniques? Big Data inform, rather than explain, they expose patterns for archaeological interpretation, they are a resource and a tool: data mining, text mining, data visualisations, quantitative methods, image processing etc. can help us to understand complex archaeological information. Nonetheless, however seductive Big Data appear, we cannot ignore the problems, such as the risk of considering that data = truth, and intellectual property and ethical issues. Rather, we must adopt this technology with an appreciation of its power but also of its limitations. Key words – Big Data, datafication, data-led research, correlation, predictive modelling Zusammenfassung – Üblicherweise als Hochgeschwindigkeitsdaten (high volume, high velocity und/oder high variety data) bezeichnet, machen es Big Data möglich, dank dem Einsatz von Software, Hardware und Algorithmen historische Prozesse zu studieren, die man anhand kleinerer Datenmengen nicht verstehen kann.
    [Show full text]
  • Data Mining and Data Warehousing Unit 1: Introduction
    https://genuinenotes.com Data Mining and Data Warehousing Unit 1: Introduction Data Mining Databases today can range in size into the terabytes — more than 1,000,000,000,000 bytes of data. Within these masses of data lies hidden information of strategic importance. But when there is such enormous amount of data how could we draw any meaningful conclusion? We are data rich, but information poor. The abundance of data, coupled with the need for powerful data analysis tools, has been described as a data rich but information poor situation. Data mining is being used both to increase revenues and to reduce costs. The potential returns are enormous. Innovative organizations worldwide are using data mining to locate and appeal to higher-value customers, to reconfigure their product offerings to increase sales, and to minimize losses due to error or fraud. What is data mining? Mining is a brilliant term characterizing the process that finds a small set of precious bits from a great deal of raw material. Definition: Data mining is the process of discovering meaningful new correlations, patterns and trends by sifting through large amounts of data stored in repositories, using pattern recognition technologies as well as statistical and mathematical techniques. Data mining is the analysis of (often large) observational data sets to find unsuspected relationships and to summarize the data in unique ways that are both understandable and useful to the data owner. Data mining is an interdisciplinary field bringing together techniques from machine learning, pattern recognition, statistics, databases, and visualization to address the issue of information extraction from large database.
    [Show full text]
  • Renewal Journal Vol 4
    Renewal Journal Volume 4 (16-20) Vision – Unity - Servant Leadership - Church Life Geoff Waugh (Editor) Renewal Journals Copyright © Geoff Waugh, 2012 Renewal Journal articles may be reproduced if the copyright is acknowledged as Renewal Journal (www.renewaljournal.com). Articles of everlasting value ISBN-13: 978-1466366442 ISBN-10: 1466366443 Free airmail postage worldwide at The Book Depository Renewal Journal Publications www.renewaljournal.com Brisbane, Qld, 4122 Australia Logo: lamp & scroll, basin & towel, in the light of the cross 2 Contents 16 Vision 7 Editorial: Vision for the 21st Century 9 1 Almolonga, the Miracle City, by Mell Winger 13 2 Cali Transformation, by George Otis Jr 25 3 Revival in Bogatá, by Guido Kuwas 29 4 Prison Revival in Argentina, by Ed Silvoso 41 5 Missions at the Margins, Bob Ekblad 45 5 Vision for Church Growth, by Daryl & Cecily Brenton 53 6 Vision for Ministry, by Geoff Waugh 65 Reviews 95 17 Unity 103 Editorial: All one in Christ Jesus 105 1 Snapshots of Glory, by George Otis Jr. 107 2 Lessons from Revivals, by Richard Riss 145 3 Spiritual Warfare, by Cecilia Estillore Oliver 155 4 Unity not Uniformity, by Geoff Waugh 163 Reviews 191 Renewal Journals 18 Servant Leadership 195 Editorial: Servant Leadership 197 1 The Kingdom Within, by Irene Alexander 201 2 Church Models: Integration or Assimilation? by Jeanie Mok 209 3 Women in Ministry, by Sue Fairley 217 4 Women and Religions, by Susan Hyatt 233 5 Disciple-Makers, by Mark Setch 249 6 Ministry Confronts Secularisation, by Sam Hey 281 Reviews 297 19
    [Show full text]
  • Data Lakes, Data Hubs and AI
    Data Lakes, Data Hubs and AI Dan McCreary Distinguished Engineer in Artificial Intelligence Optum Advanced Applied Technologies Background for Dan McCreary • Co-founder of "NoSQL Now!" conference • Coauthor (with Ann Kelly) of "Making Sense of NoSQL" • Guide for managers and architects • Focus on NoSQL architectural tradeoff analysis • Basis for 40 hour course on database architectures • How to pick the right database architecture • http://manning.com/mccreary • Focus on metadata and IT strategy (capabilities) • Currently focused on NLP and Artificial Intelligence Training Data Management 2 The Impact of Deep Learning Predictive Precision Deep Learning Traditional Machine Learning (Linear Regression) Training Set Size Large datasets create competitive advantage 3 High Costs of Data Sourcing for Deep Learning $ % OF TIME % OF THE BILLION IN 80 WASTED 60 COST 3.5 SPENDING By data scientists just getting Of data warehouse projects In 2016 on data integration access to data and preparing is on ETL software data for analysis Six Database Core Architecture Patterns Relational Analytical (read-mostly OLAP) Key-Value key value key value key value key value Column-Family Graph Document Which architectures are best for data lakes, data hubs and AI? 5 Role of the Solution Architect Relational Analytical (OLAP) Key-Value Help! k v k v k v k v Column-Family Graph Document Business Unit Sally Solutions Title: Solution Architect • Non-bias matching of business problem to the right data architecture before we begin looking at a specific products 6 Google Trends Data Lake Data Hub 7 Data Science and Deep Learning Data Science Deep Learning 8 Data Lake Definition ~10 TB and up $350/10TB Drive A storage repository that holds a vast amount of raw data in its native format until it is needed.
    [Show full text]
  • FI-WARE Product Vision Front Page Ref
    FI-WARE Product Vision front page Ref. Ares(2011)1227415 - 17/11/20111 FI-WARE Product Vision front page Large-scale Integrated Project (IP) D2.2: FI-WARE High-level Description. Project acronym: FI-WARE Project full title: Future Internet Core Platform Contract No.: 285248 Strategic Objective: FI.ICT-2011.1.7 Technology foundation: Future Internet Core Platform Project Document Number: ICT-2011-FI-285248-WP2-D2.2b Project Document Date: 15 Nov. 2011 Deliverable Type and Security: Public Author: FI-WARE Consortium Contributors: FI-WARE Consortium. Abstract: This deliverable provides a high-level description of FI-WARE which can help to understand its functional scope and approach towards materialization until a first release of the FI-WARE Architecture is officially released. Keyword list: FI-WARE, PPP, Future Internet Core Platform/Technology Foundation, Cloud, Service Delivery, Future Networks, Internet of Things, Internet of Services, Open APIs Contents Articles FI-WARE Product Vision front page 1 FI-WARE Product Vision 2 Overall FI-WARE Vision 3 FI-WARE Cloud Hosting 13 FI-WARE Data/Context Management 43 FI-WARE Internet of Things (IoT) Services Enablement 94 FI-WARE Applications/Services Ecosystem and Delivery Framework 117 FI-WARE Security 161 FI-WARE Interface to Networks and Devices (I2ND) 186 Materializing Cloud Hosting in FI-WARE 217 Crowbar 224 XCAT 225 OpenStack Nova 226 System Pools 227 ISAAC 228 RESERVOIR 229 Trusted Compute Pools 230 Open Cloud Computing Interface (OCCI) 231 OpenStack Glance 232 OpenStack Quantum 233 Open
    [Show full text]
  • Off the Beaten Track
    Off the Beaten Track To have your recording considered for review in Sing Out!, please submit two copies (one for one of our reviewers and one for in- house editorial work, song selection for the magazine and eventual inclusion in the Sing Out! Resource Center, our multimedia, folk-related archive). All recordings received are included in Publication Noted (which follows Off the Beaten Track). Send two copies of your recording, and the appropriate background material, to Sing Out!, P.O. Box 5460 (for shipping: 512 E. Fourth St.), Bethlehem, PA 18015, Attention Off The Beaten Track. Sincere thanks to this issues panel of musical experts: Roger Dietz, Richard Dorsett, Tom Druckenmiller, Mark Greenberg, Victor K. Heyman, Stephanie P. Ledgin, John Lupton, Andy Nagy, Angela Page, Mike Regenstreif, Peter Spencer, Michael Tearson, Rich Warren, Matt Watroba, Elijah Wald, and Rob Weir. liant interpretation but only someone with not your typical backwoods folk musician, Jodys skill and knowledge could pull it off. as he studied at both Oberlin and the Cin- The CD continues in this fashion, go- cinnati College Conservatory of Music. He ing in and out of dream with versions of was smitten with the hammered dulcimer songs like Rhinordine, Lord Leitrim, in the early 70s and his virtuosity has in- and perhaps the most well known of all spired many players since his early days ballads, Barbary Ellen. performing with Grey Larsen. Those won- To use this recording as background derful June Appal recordings are treasured JODY STECHER music would be a mistake. I suggest you by many of us who were hearing the ham- Oh The Wind And Rain sit down in a quiet place, put on the head- mered dulcimer for the first time.
    [Show full text]
  • Supervised Machine Learning Models for Fake News Detection
    CCT College Dublin ARC (Academic Research Collection) ICT Student Achievement Summer 6-15-2019 Supervised Machine Learning Models for Fake News Detection Andrea Lopez CCT College Dublin Adelo Vieira CCT College Dublin Zafar Ahsan CCT College Dublin Farooq Sabib CCT College Dublin Shirley Marinho CCT College Dublin Follow this and additional works at: https://arc.cct.ie/ict Part of the Graphics and Human Computer Interfaces Commons, Numerical Analysis and Scientific Computing Commons, and the Theory and Algorithms Commons Recommended Citation Lopez, Andrea; Vieira, Adelo; Ahsan, Zafar; Sabib, Farooq; and Marinho, Shirley, "Supervised Machine Learning Models for Fake News Detection" (2019). ICT. 5. https://arc.cct.ie/ict/5 This Dissertation is brought to you for free and open access by the Student Achievement at ARC (Academic Research Collection). It has been accepted for inclusion in ICT by an authorized administrator of ARC (Academic Research Collection). For more information, please contact [email protected]. CCT COLLEGE Supervised Machine Learning Models for Fake News Detection A Supervised Machine Learning Models for Fake News Detection by Gofaas group by GoSupervisedfaas group Machine Learning Models for Fake News Detection [Figure.1]by Gofaas group A Supervised Machine Learning by GoModelsfaas group for Fake News Detection 1 INDEX: Executive Summary Chapter 1. Introduction ................................................................................................. 6 Initial Proposal ..............................................................................................................
    [Show full text]
  • A Data Lake for Archaeological Data Management and Analytics
    ArchaeoDAL: A Data Lake for Archaeological Data Management and Analytics Pengfei Liu Camille Noûs Sabine Loudcher [email protected] Jérôme Darmont Laboratoire Cogitamus [email protected] France [email protected] [email protected] Université de Lyon, Lyon 2, UR ERIC Lyon, France ABSTRACT 1 INTRODUCTION With new emerging technologies, such as satellites and drones, Over the past decade, new forms of data such as geospatial data and archaeologists collect data over large areas. However, it becomes aerial photography have been included in archaeology research [8], difficult to process such data in time. Archaeological data also have leading to new challenges such as storing massive, heterogeneous many different formats (images, texts, sensor data) and canbe data, high-performance data processing and data governance [4]. structured, semi-structured and unstructured. Such variety makes As a result, archaeologists need a platform that can host, process, data difficult to collect, store, manage, search and analyze effectively. analyze and share such data. A few approaches have been proposed, but none of them covers the In this context, a multidisciplinary consortium of archaeologists full data lifecycle nor provides an efficient data management system. and computer scientists proposed the HyperThesau project1, which Hence, we propose the use of a data lake to provide centralized data aims at designing a data management and analysis platform. Hyper- stores to host heterogeneous data, as well as tools for data quality Thesau has two main objectives: 1) the design and implementation checking, cleaning, transformation and analysis. In this paper, we of an integrated platform to host, search, analyze and share archaeo- propose a generic, flexible and complete data lake architecture.
    [Show full text]
  • Report of the Committee on Data Management
    REPORT OF THE COMMITTEE ON DATA MANAGEMENT GOVERNMENT OF INDIA NATIONAL STATISTICAL COMMISSION MINISTRY OF STATISTICS AND PROGRAMME IMPLEMENTATION NEW DELHI CONTENTS Page Number Constitution, Terms of Reference and Members‘ List (i) Acknowledgements (vii) List of Abbreviations (ix) Introduction, Executive Summary, and Main observations (xviii) Chapter - 1 Indian Statistical system, Challenges in Official Statistics & data management and remedial measures 1. The importance of official statistics 1 2. Current scenario of National Statistical System in India 3 3. Need to know and right to share 4 4. Issues of decentralized system 5 5. Vertical and horizontal issues 7 6. Priority 9 7. Inter operability 9 8. Need for assessability and communicability 10 9. Need for scanning of published data before conversion into defined data format 11 10. Non-official data providers 12 11. Consistency in definition and standards 13 12. Draw upon insights into problems 13 13. Technological and technical issues 15 14. Training intervention/ capacity building 17 15. Technology versus Applications 19 16. Legal framework for providing data 19 17. Setting up of Data Management Division 20 Chapter - 2 Conceptual framework on Data management 1. Data Management 22 2. Data governance 25 3. Data Architecture, Analysis and Design 26 4. Database Management 29 5. Data Security Management 30 6. Data Quality Management 32 7. Reference and Master Data Management 34 8. Data Warehousing and Business Intelligence Management 36 9. Document, Record and Content Management 39 10. Meta Data Management 40 11. Contact Data Management 42 12. Data Movement 44 13. Statistical Data Metadata Exchange (SDMX) 44 14. Secured Data Centers 46 Chapter -3 Recommendations: 1.
    [Show full text]
  • Archaeological Archives
    Archaeological Archives A guide to best practice in creation, compilation, transfer and curation Archaeological Archives A guide to best practice in creation, compilation, transfer and curation Duncan H. Brown Supported by: Archaeological Archives Forum Archaeology Data Service Association of Local Government Archaeological Officers UK Department of Environment for Northern Ireland English Heritage Historic Scotland Institute for Archaeologists Royal Commission on the Ancient and Historical Monuments of Scotland Society of Museum Archaeologists This guide has been published by the Institute for Archaeologists on behalf of the Archaeological Archives Forum. Development and production of this guide was funded by grant-aid from English Heritage, Historic Scotland, the Royal Commission on the Ancient and Historical Monuments of Scotland, the Environment and Heritage Service (Northern Ireland) and the Society of Museum Archaeologists, with support in kind from the Archaeology Data Service and the Association of Local Government Archaeological Officers (UK). Original photographs by John Lawrence. Others reproduced with the kind permission of Mark Bowden (page 5), English Heritage (page 40), the Environment and Heritage Service (Northern Ireland) (page 1), Southampton City Council (page 29), and the Royal Commission on the Ancient and Historical Monuments of Scotland (page 26). First published July 2007. Second Edition September 2011. ISBN 0948393912 Design and layout by Maria Geals. Foreword The creation of stable, consistent, logical, and accessible archives from fieldwork is a fundamental building block of archaeological activity. Since the discipline emerged in the late 19th and early 20th centuries, it has been recognised that the process of excavation is destructive and that no archaeological interpretations are sustainable unless they can be backed up with the evidence of field records and post-excavation analysis.
    [Show full text]
  • D2.1.1 Methods & Algorithms for Traffic Density Measurement
    INSIST Deliverable D2.1.1 Methods & Algorithms for Traffic Density Measurement Editor: [Çiğdem Çavdaroğlu] D2.1.1 Methods & Algorithms for Traffic Density Measurement Version 1.0 ITEA 3 Project 13021 Date : 19.12.2016 ITEA3: INSIST 2 D2.1.1 Methods & Algorithms for Traffic Density Measurement Version 1.0 Document properties Security Confidential Version Version 1.0 Author Çiğdem Çavdaroğlu Pages History of changes Version Author, Institution Changes 0.1 Çiğdem Çavdaroğlu, KoçSistem Initial Version 0.2 0.3 0.4 ITEA3: INSIST 3 D2.1.1 Methods & Algorithms for Traffic Density Measurement Version 1.0 Abstract This document describes existing traffic density measurement methods and algorithms, compares this existing methods, and finally suggests the most suitable ones. In order to measure traffic density information, implementing traffic surveillance methods was planned in the initial version of INSIST project (2013). When the project is started in the beginning of 2016, two new methods were introduced. This document covers information about all these three methods. ITEA3: INSIST 4 D2.1.1 Methods & Algorithms for Traffic Density Measurement Version 1.0 Table of contents Document properties ................................................................................................................. 3 History of changes ...................................................................................................................... 3 Abstract .....................................................................................................................................
    [Show full text]