Semantic Web 0 (0) 1 – IOS Press

A Linked Data extraction architecture for engineering tools

Jad El-khoury a,∗, Andrii Berezovskyi a, Mattias Nyberg b
a Department of Machine Design, KTH Royal Institute of Technology, Stockholm, Sweden
b Scania CV AB, Södertälje, Sweden

Abstract. New industrial initiatives such as Industrie 4.0 rely on digital end-to-end engineering across the entire product lifecycle, which in turn depends on the ability of the supporting software tools to interoperate. A tool interoperability approach based on Linked Data has the potential to provide such integration, where each software tool can expose its information and services to other tools using the web as a common technology base. In this paper, we report on our negative findings when attempting to use existing Linked Data extraction techniques to expose the structured content managed in engineering tools. Such techniques typically target the storage facility to extract and transform its content, with the assumption that sufficient information is available within the facility to automate the process. Based on a case study with the truck manufacturer Scania CV AB, our study finds that an engineering tool manages its artefacts using business logic that is not necessarily reflected in the storage facility. This renders existing extraction techniques inadequate. Instead we propose an alternative Linked Data extraction architecture that can mitigate the identified shortcomings. While less automated compared to the existing solutions, the proposed architecture is realised as a standalone library that can still facilitate the extraction process.

Keywords: Linked Data, OSLC, Tool Integration, Tool Interoperability, SQL to RDF, Resource Shapes, Relational Databases to Linked Data Transformation

1. Introduction

Advances in information and communication technology (ICT), and in particular internet technologies such as the Internet of Things (IoT) and cloud computing, are enabling factors for new industrial initiatives such as Industrie 4.0 [1] and the Industrial Internet [2]. In the interconnected manufacturing environment of the near future, machinery, storage and production facilities will be able to intelligently and autonomously exchange information and trigger actions among each other. This can lead to more dynamic manufacturing and optimised decision-making processes, which will allow for individual customer requirements to be met, while maintaining efficiency and productivity.

According to the Recommendations for Implementing the Strategic Initiative INDUSTRIE 4.0 [1, p. 30], one of the important features of this strategy is the ability to provide “digital end-to-end engineering across the entire value chain of both the product and the associated manufacturing system”. That is, it is necessary to provide digital integration throughout the product development and manufacturing processes, as well as across the different technical disciplines and organisations involved.

Given that engineering processes are supported through a number of software tools, this need partially translates to the necessity to integrate these support tools, as well as the product information residing within each tool. However, such support tools are not necessarily designed to work together, since they are developed to focus primarily on a specific activity, development phase, or discipline within the complete product life cycle. In general, software tools will most likely adopt different interface technologies (if any), making their interoperability difficult.

How can digital integration be achieved across software tools that may or may not be designed to work together? A tool interoperability approach based on Linked Data could well provide such a solution. With Linked Data, each tool can expose its information and services using the ubiquitous web as a common technology base, through which other tools can interoperate.

The very first step to achieve such integration is for each tool to extract and transform its internally managed artefacts into RDF resources. In this paper, we investigate the suitability of existing information extraction technologies from the Linked Data domain, when applied in the context of software tools typically used in an industrial setting. We here focus on the extraction of structured engineering information that resides in a relational database. Yet, the experiences gained are intended to apply to other data management technologies.

We report negative findings based on our investigations. We conclude that existing Linked Data extraction technologies are not appropriate to expose information from engineering tools of the complexity we studied. As will be detailed in this paper, two fundamental shortcomings are identified in our study. First, current solutions tend to provide read-only access to the extracted information. This is not sufficient, since end-to-end engineering integration requires read-write access between the tools. Second, extraction solutions target data storage facilities such as relational databases. However, in software tools, considerable logic resides in the application software itself, which is not taken into consideration when analysing the tool’s storage alone.

Nevertheless, the conducted experiments were instrumental for us to better understand and define the data extraction needs of the industry domain. As a second contribution of this paper, we also propose an alternative Linked Data extraction architecture that can mitigate the identified shortcomings. While less automated compared to the existing solutions, the proposed architecture is realised as a stand-alone library that can still facilitate the extraction process.

In the next section, we give further background information on the tool interoperability problem, and how Linked Data can be a promising solution to this problem. In Section 3, we present a case study from the truck manufacturer Scania CV AB that illustrates the data extraction needs of the industry. The case study is also used to evaluate the chosen extraction technique. Section 4 reports on our evaluation experiments based on the D2RQ platform. An analysis of the negative results obtained leads to the definition of the necessary requirements on a suitable solution, which will be discussed in Section 5. Our proposed extraction architecture is then presented in Section 6, with further implementation and evaluation detailed in Sections 7 and 8 respectively. Reflections and analysis of other related solutions are discussed in Section 9. The article is then concluded in Section 10.

∗Corresponding author. Email address: [email protected]

1570-0844/0-1900/$35.00 © 0 – IOS Press and the authors. All rights reserved

2. Background

The recommendations for implementing the Industrie 4.0 strategy identify model-based development (MBD) as the enabling methodology to cover the needed digital end-to-end engineering. This makes sense, since MBD is leading the effort of migrating the engineering focus from text-based documentation to a digital representation of product data. Besides a model’s ability to facilitate communication between individuals and teams, the structured information conveyed in a model – when made electronically accessible – can now be processed by IT systems to facilitate the information flow across processes, disciplines and organisations.

However, the complete adoption of MBD in an industrial context remains somewhat limited. Considering the variety of modelling technologies available, considerable effort needs to be invested in integrating modelling tools that do not necessarily share common technologies (such as a modelling framework, storage technologies, etc.). As a result, MBD and its benefits are typically constrained to a subset of the development life cycle [3].

Besides MBD tools, an equally relevant set of software tools are those supporting engineering activities based on structured digital information. Typical tools such as issue tracking, computer-aided design (CAD), code analysis, ALM and PLM systems all rely on information being stored in some digitally manageable format, such as a relational database or versioned XML files. If well integrated, such tools – together with MBD tools – can contribute to the Industrie 4.0 strategy of digital end-to-end engineering.

One alternative to provide digital integration across software tools is to impose the same technological space on tools throughout the life cycle, leading to the adoption of a more centralised platform (such as PTC Integrity [4] or MSR-Backbone [5]). While this may be feasible at a smaller scale, such centralised platforms cannot scale to handle the complete heterogeneous set of software tools normally found in a large organisation. Centralised platforms may also be less flexible for changes over time, when additional tools need to be introduced.

Instead, a more sustainable integration approach would acknowledge the existence of distributed, heterogeneous and independent data sources within the environment. A Linked Data approach to tool interoperability would promote a distributed architecture, in which each tool autonomously manages its own product data while providing RESTful services through which other tools can interoperate. This leads to low coupling between tools, by reducing the need for one tool to understand the deep data of another. Moreover – like the web – Linked Data is a technology-agnostic way of sharing information between different sources, which are developed using different technologies, and with their data being managed internally using any suitable storage mechanism.

Such an approach to tool interoperability is manifested in the OASIS OSLC [6] interoperability standard that targets the integration of heterogeneous software tools, with a focus on the linking of data from independent sources. It builds upon the Linked Data principles, and its accompanying standards, by defining common mechanisms and patterns to access, manipulate and query resources managed by the different tools in the toolchain. In particular, OASIS OSLC is based on the W3C Linked Data Platform (LDP) [7] and follows the Representational State Transfer (REST) architectural pattern.

For a tool to provide a Linked Data interface, it is necessary to transform its internal product data into RDF representations. Such a mapping needs to deal with potential differences between the internal and external data structures, since the vocabulary of the resources being exposed is not necessarily the same as the internal schema used to manage the data. In addition, the mapping needs to deal with the differences between the technologies used by the tool and those of Linked Data. Extraction technologies are available that can facilitate this mapping process, assuming that the internal artefacts are handled in a structured digital format such as a relational database. This paper analyses the suitability of such extraction technologies – and the overall approach behind them – when applied in the context of software tools typically used in an industrial setting.

3. Case Study

3.1. Architecture Design – Activities, Artefacts and Support Tools

During the architectural design of an embedded system, one of the architect’s main responsibilities is to capture the product’s desired functionalities as a set of system requirements. The requirements then need to be structured so that end-user features can be allocated to the available software and hardware, in order to achieve desired system characteristics such as performance, reliability and safety. Such work entails the definition and management of artefacts such as functions, software components, communication channels, processing units, sensors, actuators, etc.

To support the architect’s activities at the truck manufacturer Scania, SESAMMTOOL is a proprietary software tool that is used to capture the above-mentioned artefacts and structure them across three dimensions:

1. Functional Structure – Structuring the system requirements into end-user features, called User Functions (UF). A UF can also be seen as a collection of requirements.
2. Physical Structure – Defining the computer hardware elements such as processing units, physical communication channels, sensors and actuators.
3. Logic Structure – Defining the basic building blocks of the system software, namely Allocation Elements (AE) and Function Variables (FV). The AE and FV elements are the atomic software components that can be allocated to processing units and physical communication channels. A UF is broken down into AE and FV elements.

These three dimensions make up the system state. SESAMMTOOL also maintains a fourth – time – dimension that keeps track of the change history of every artefact. This time dimension is not to be confused with that of a traditional version control system. Instead, the time dimension reflects the state of the product at different releases, giving the architect exact knowledge of what versions of each artefact are available for any specific product release. The information model in Fig. 1 represents the main artefacts managed by SESAMMTOOL and their relationships.

Fig. 1. The SESAMMTOOL Information Model, showing the main artefacts at each of the three dimensions.

3.2. Tool Integration with Linked Data

The information available in SESAMMTOOL is central to many other activities, beyond those of the architect. Development activities such as requirements management, verification and validation are centred around the artefacts defined in SESAMMTOOL, with stakeholders (such as function owners and test managers) being consumers of this information, even though they do not necessarily directly modify its content. As a consequence, other software tools in the environment need to integrate with the information managed by SESAMMTOOL. For example, MSC-DRAWER is another graphical modelling tool, which is used to define the expected sequence of communication messages that may be sent between processing units, to satisfy the needs of a specific User Function. The models defined in MSC-DRAWER then form a basis when constructing integration tests. MSC-DRAWER relies heavily on the artefacts from SESAMMTOOL.

Similarly, SESAMMTOOL is itself dependent on information in other systems. For example, SESAMMTOOL enforces a process whereby architecture activities need to be associated with a particular Change Request, to ensure that architectural changes are motivated and approved by the appropriate stakeholders. To make any changes to an artefact in SESAMMTOOL, a Change Request needs first to be registered and approved by a Release Coordinator. Change Requests are managed externally in the Change Management tool – JIRA.

An integration of the information between SESAMMTOOL and other support tools is hence necessary. Such integration is to some extent already supported in the current development environment, mainly through an import/export of data across tools. For example, SESAMMTOOL regularly imports Change Requests from JIRA and maintains its own copy of the information within its internal repository. Similarly, MSC-DRAWER accesses the SESAMMTOOL information when a communication model is created. Both ad-hoc integrations are developed to satisfy particular needs, and are difficult to maintain, leading to discrepancies between the tools.
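To make this integration goal concrete, the sketch below renders a single SESAMMTOOL artefact as a Linked Data resource in Turtle. The sesamm: vocabulary, the resource URIs and the changeRequest link into JIRA are illustrative assumptions, not the actual SESAMMTOOL terms.

```python
# Sketch: a hypothetical UserFunction artefact exposed as a Linked Data
# resource (Turtle serialisation). All names below - the sesamm:
# vocabulary, the URIs, the changeRequest link - are invented for
# illustration.

def to_turtle(subject, props):
    """Serialise one resource as a Turtle snippet."""
    body = " ;\n".join(f"    {p} {o}" for p, o in props)
    return f"<{subject}>\n{body} .\n"

user_function = [
    ("a", "sesamm:UserFunction"),
    ("dcterms:title", '"Adaptive Cruise Control"'),
    # Cross-tool link: the governing Change Request lives in JIRA
    ("sesamm:changeRequest", "<https://jira.example.com/browse/CR-1042>"),
]

print(to_turtle("https://sesamm.example.com/uf/42", user_function))
```

Exposed this way, a consumer such as MSC-DRAWER could follow the changeRequest link directly, instead of relying on periodically imported copies of the data.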

For a more systematic approach to data integration across these tools, a Linked Data approach is found to be potentially very suitable at Scania. Linked Data is an attractive solution for an environment where many in-house software tools are independently developed to support specific stakeholders.

As a first step in realising this Linked Data approach, SESAMMTOOL needs to expose its artefacts as Linked Data resources. SESAMMTOOL stores its information in a Microsoft SQL database. The database schema contains just under 100 tables in total. The software makes extensive use of the Entity Framework [8] to perform an object-relational mapping (ORM) to software objects in the application software.

Given the rich vocabulary of SESAMMTOOL (as reflected in the number of tables in the database), the development effort for an equally rich Linked Data interface can be costly. In this work, we investigate the applicability and maturity of using existing Linked Data extraction technologies to support the development of the SESAMMTOOL interface.

4. Experiments and Lessons Learnt

This section presents the results of applying a conventional Linked Data extraction technology to support the development of the SESAMMTOOL interface. We choose to experiment with D2RQ [9], being a mature platform for accessing relational databases as RDF graphs. The study does not aim to specifically evaluate the D2RQ platform, but instead views it as a representative solution of the various SQL-to-RDF mapping solutions, such as DB2OWL [10], FDR2 [11], etc.

In the first experiment (Section 4.1), we attempt to use the functionality of D2RQ as-is. Having discovered certain limitations, we complement D2RQ in the second experiment (Section 4.2) with additional features that try to mitigate the identified limitations. Each subsection below includes an analysis of the experiment results based on the presented case study. These analyses then lead to the set of requirements summarised in Section 5.

4.1. Experiment 1 - D2RQ Evaluation

In this initial investigation, we use the D2RQ Platform to expose the SESAMMTOOL content residing in the relational database as an RDF graph that is ultimately saved in a triplestore.

D2RQ is relatively convenient to use to obtain an initial standard mapping from the relational database to an RDF graph, where (a) each table is mapped to an RDFS class that is based on the table’s name, and (b) each column is mapped to a property based on the column’s name. The default mapping can be customised with more advanced mapping rules, such as joining properties from two different tables into a single RDF resource. D2RQ can be used to transform the SQL database into an RDF dump that can then be loaded into a triplestore. Alternatively, the SQL database can be queried in real time, using SPARQL. In this experiment, we choose the former alternative.

After transforming the SESAMMTOOL database with D2RQ, it is soon realised that D2RQ provides a unidirectional mapping, where only a read-only RDF representation of the database content is available. That is, changes to the RDF information are not reflected back in the relational database. While read-only access is a minimal requirement, it is not sufficient for the desired integrated engineering environment, where a tool is expected to expose services that allow other tools to modify its managed artefacts. This is reflected in the interoperability scenarios acknowledged in the OSLC and LDP standards, which motivated the specification of services such as creation factories and delegated user interface dialogues for resource creation.

In addition, even after considerable customisation effort, the mapping remained not completely satisfactory. SESAMMTOOL’s database schema is highly normalised, based on Anchor Modelling [12]. The technique is adopted in order to efficiently handle the time dimension, and capture the time intervals during which any entity version is valid. Moreover, the technique allows for the evolution of the information model over time, without a substantial change in the database schema. As a result, information about a specific entity (such as a UF) is in practice distributed across many tables. Figure 2 shows an extract of the database schema of relevance to the UF artefact. The UserFunctionAnchor is the central access table, yet it itself contains no properties of a UF artefact. Table UserFunctionHistory handles all properties of a UserFunction that may change over time, while UserFunctionMeta handles its constant properties. The UserFunctionAnchor, UserFunctionMeta and UserFunctionHistory tables each inherit from the tables AnchorBase, MetaBase and HistoryBase, which themselves hold properties that are common across all other artefacts (such as FunctionCategory or Use Case). In total, information about a UserFunction is distributed across six tables.
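The default table-to-class mapping behaviour described in this section can be sketched as follows. This is a minimal illustration of the idea, not D2RQ's actual implementation; the namespaces and table names are invented.

```python
# Sketch of a D2RQ-style default mapping: each table becomes an RDFS
# class named after the table, and each column becomes a property named
# after the column. Illustrative only - not D2RQ's actual code.

VOCAB = "http://example.com/vocab/"  # assumed vocabulary namespace
DATA = "http://example.com/data/"    # assumed resource namespace

def default_mapping(table, pk, rows):
    """Yield (subject, predicate, object) triples for one table."""
    cls = VOCAB + table
    for row in rows:
        subj = f"{DATA}{table}/{row[pk]}"
        yield (subj, "rdf:type", cls)          # table name -> class
        for col, val in row.items():
            if col != pk:
                yield (subj, f"{VOCAB}{table}#{col}", val)  # column -> property

rows = [{"id": 1, "name": "Cruise Control"}]
triples = list(default_mapping("UserFunction", "id", rows))
```

A mapping like this works well when one table corresponds to one business object; the Anchor-Modelled schema above breaks exactly that assumption.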

Fig. 2. A subset of the SESAMMTOOL database schema, centred on the tables relevant for the UserFunction artefact.
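The consequence of this normalisation for a default table-per-class mapping can be illustrated with a toy schema. The three tables below are a simplified, hypothetical stand-in for the six tables of Fig. 2, with invented column names; the point is that the anchor table alone carries no properties, and a meaningful UserFunction only emerges after explicit joins.

```python
import sqlite3

# Toy Anchor-Modelling schema: one UserFunction is spread over an
# anchor, a meta and a history table. The schema and data are
# hypothetical illustrations, not the actual SESAMMTOOL tables.
db = sqlite3.connect(":memory:")
db.executescript("""
CREATE TABLE UserFunctionAnchor  (uf_id INTEGER PRIMARY KEY);
CREATE TABLE UserFunctionMeta    (uf_id INTEGER, category TEXT);
CREATE TABLE UserFunctionHistory (uf_id INTEGER, name TEXT,
                                  valid_from TEXT, valid_to TEXT);
INSERT INTO UserFunctionAnchor  VALUES (42);
INSERT INTO UserFunctionMeta    VALUES (42, 'Comfort');
INSERT INTO UserFunctionHistory VALUES (42, 'Cruise Control v1',
                                        '2015-01-01', '2016-01-01');
INSERT INTO UserFunctionHistory VALUES (42, 'Cruise Control v2',
                                        '2016-01-01', '9999-12-31');
""")

# Reconstructing the business object for a given release date requires
# joining all tables and selecting the history row valid at that date.
row = db.execute("""
    SELECT a.uf_id, m.category, h.name
    FROM UserFunctionAnchor a
    JOIN UserFunctionMeta    m ON m.uf_id = a.uf_id
    JOIN UserFunctionHistory h ON h.uf_id = a.uf_id
    WHERE ? >= h.valid_from AND ? < h.valid_to
""", ("2016-06-01", "2016-06-01")).fetchone()

print(row)  # one joined business object, not three table rows
```

A default per-table mapping would instead emit one (largely meaningless) resource per row of each of these tables.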

So, it is theoretically possible to define D2RQ mappings to join the necessary tables and produce a meaningful RDF graph representing the UF entities. But this essentially equates to the effort of designing the schema of the RDF graph, with little benefit gained from the D2RQ automatic mapping capabilities. Moreover, a manual mapping would need to be maintained and updated over time, whenever the SESAMMTOOL information model changes. Such a restriction is deemed unacceptable, since it contradicts the intention of the Anchor Modelling, initially adopted to facilitate model changes over time.

Alternatively, an extension of the D2RQ mapping generator could be developed to generate a more precise mapping. It would take the heuristics of the Anchor Model into account to produce a mapping that would require few manual modifications. Such an approach is feasible for SESAMMTOOL, but would no longer be a general approach that can be reused for other software tools.

4.2. Experiment 2 - Complementing D2RQ with OSLC RESTful services

To remedy the limitations identified in the first experiment, we decide to complement the D2RQ approach with a mechanism that additionally supports write access to the tool data. This missing mechanism is precisely that supported by the OASIS OSLC standard, where the REST architectural pattern is used to handle create, read, update, and delete (C.R.U.D.) requests on Linked Data resources. We here detail how D2RQ can be complemented to also provide an OSLC-compliant RESTful interface for its Linked Data resources.

In the previous experiment, the D2RQ platform was used for two purposes:

1. Vocabulary Mapping - To define the RDF vocabulary of SESAMMTOOL, based on an analysis and mapping of the SQL database schema. D2RQ’s “generate-mapping” tool produces a default mapping file, which can then be manually adjusted.
2. Data Exposure - To expose the structured SQL content as RDF data, according to the defined mapping. This can be exposed either through a one-off transformation into an RDF graph or through real-time translation of SPARQL queries to SQL.

The mapping was useful to a certain extent to automate the definition of the SESAMMTOOL vocabularies. The data exposure mechanism needs to be replaced with an OSLC RESTful interface in order to provide read-write access.

4.2.1. Implementation
We use and extend the existing OSLC interface code generator [13] that automatically generates an almost complete REST interface for RDF resources, based on a graphical specification of the vocabulary and its desired RESTful services.

The current generator produces OSLC4J-annotated Java classes that model the RDF vocabularies as Plain Old Java Objects (POJOs). At runtime, helper methods use the OSLC4J annotations to create RDF graphs from class instances. An RDF graph can then be further serialised to JSON-LD or other representations. In reverse, the helper methods can unmarshal RDF representations (normally received as REST requests using RDF/XML or Turtle) to construct class instances of the expected vocabulary, which can then be further manipulated by the application software.

The generator also produces the JAX-RS classes to handle the resource REST services, as defined in the model. Each JAX-RS class contains the necessary methods to handle the C.R.U.D. requests to interact with resources.

The graphical model is typically defined manually. In our case, given that the vocabulary is now available in a D2RQ mapping file, we transform the D2RQ mapping file to an equivalent OSLC adaptor specification model. From this model, the OSLC generator can further generate the corresponding adaptor interface code.

With the current generator, it also remains the task of a developer to manually implement the code that communicates with the source tool to access its internal data. In our case, given that the source data resides in a relational database, we take advantage of ORM techniques - in particular Hibernate [14] - to automate this access. We transform the mapping information in the mapping file to a set of classes with appropriate Hibernate annotations that represent the mapping from Java classes to database tables. Through this mapping, Hibernate can then deal with the read and write access to the database entities. That is, the OSLC4J-annotated resource classes are now further augmented with Hibernate annotations.

In summary, an extension component of the OSLC code generator - SQL4OSLC - is developed to (1) automatically transform the D2RQ mapping file into an OSLC interface model, and (2) generate POJOs with both OSLC4J and Hibernate annotations to handle the end-to-end transition between REST client requests on RDF resources and their corresponding SQL queries on the destination database. Figure 3 illustrates this process.

[Fig. 3 depicts a tool’s SQL database (tables users(name) and orders(amount, user_id)) mapped, through Hibernate and an OSLC interface model (resources User and Order, related via orderedBy), onto an OSLC REST server that serves RDF representations to an OSLC client over HTTP. The annotated Java classes shown in the figure read:]

    @Table(name="dbo.orders")
    @OslcName("http://example.com/ns/#order")
    class Order {
        @Column(name="amount")
        @OslcName("amount")
        @OslcOccurs(Occurs.ExactlyOne)
        @OslcReadOnly(false)
        Double getAmount() {…}

        @ManyToOne(fetch = FetchType.LAZY)
        @JoinColumn(name = "user_id")
        @OslcName("placedBy")
        @OslcOccurs(Occurs.ExactlyOne)
        @OslcRange({User.class})
        @OslcReadOnly(false)
        User getUser() {…}
    }

    @Table(name="dbo.users")
    @OslcName("http://example.com/ns/#user")
    class User {
        @Column(name="name")
        @OslcName("foaf:givenName")
        @OslcOccurs(Occurs.ZeroOrOne)
        @OslcReadOnly(false)
        String getName() {…}
    }

[The resulting RDF served to the OSLC client reads as follows; the prefix URIs are elided in the figure:]

    @prefix : <…> .
    @prefix ex: <…> .

    :a rdf:type ex:User ;
       ex:name "User A" .
    :b rdf:type ex:Order ;
       ex:amount 1.00 ;
       ex:placedBy :a .

Fig. 3. The transition process from an SQL schema to RDF representations, through Hibernate and OSLC4J-annotated Java classes.

From the developer perspective, the process of developing an adaptor consists of:

1. SQL to RDF mapping generation - Using the D2RQ “generate-mapping” tool.
2. Manual configuration of the default mapping file - Typical changes include removing tables that need not be exposed, or renaming the RDF resources.
3. OSLC interface model generation - The SQL4OSLC generator extension transforms the D2RQ mapping file into an OSLC adaptor specification model.
4. Manual configuration of services - Once the initial model is defined, it is necessary to define the OSLC services (such as query capabilities, creation factories, and the RESTful operations) that are needed for the relevant resources. Not all resources would require such services, and the identification of the necessary services would depend on the usage scenario of the interfaces. For this reason, it remains a manual step to define the required services, through the graphical modelling support already provided by the base Lyo code generator.
5. Automatic generation of the OSLC interface - Once the resources and their corresponding services are configured, the OSLC adaptor is generated through the SQL4OSLC code generator extension.

Upon generation, the adaptor is complete, ready-to-run, and does not need to be modified nor complemented with much additional manual code. The developer can still manually insert additional Java code and customise the default functionality where desired. The developer can safely add manual code in designated areas of the code. The generator ensures that such manual code remains intact after subsequent generations. This allows for the incremental development of the adaptor model and its resulting code. This overall development process is illustrated in Fig. 4.

[Fig. 4 shows the iterative development steps, from the tool’s SQL schema to the RESTful OSLC server: (1) GENERATE - the SQL-to-RDF mapping file and the OSLC interface model; (2) RECONFIGURE - the SQL-to-RDF mapping and the OSLC interface model; (3) GENERATE - the complete OSLC interface, with placeholders for flexible adjustment; (4) FEEDBACK - add custom code, with safe regeneration without losing the added code.]

Fig. 4. Development steps of the adaptor.

4.2.2. Analysis
We apply this second solution to SESAMMTOOL, and the resulting interface was capable of supporting the expected C.R.U.D. services. This allows us to proceed further with the case study, and leads to the discovery of more substantial limitations with the overall approach of both experiments.

Not unexpectedly, considerable effort is still needed to configure the default mapping, in order to join a number of database tables, so that meaningful resources can be exposed at the interface.

Moreover, now that we are capable of modifying the content of SESAMMTOOL, we realise that a client change request that results in changes in the database - through Hibernate - bypasses a substantial amount of the business logic, which would otherwise be handled by the SESAMMTOOL application. This issue is best illustrated through Fig. 5, which shows the typical three-layer architectural pattern of any large application, in which the end-user interface, application business logic, and data management functionality are separated into layers. The middle Controller layer controls an application’s functionality, and performs the logical decisions before data from the Presentation layer is made persistent in the Data layer. The Controller also controls and manipulates data in the Data layer before it is passed further up to the Presentation layer. In both experiments with D2RQ, by directly accessing the Data layer to expose Linked Data resources, we are bypassing any business logic necessary to control the correct handling of the stored artefacts.

[Fig. 5 shows three stacked layers: Presentation, Controller, Data.]

Fig. 5. Three-layered application architecture.

The need for the business logic is more acutely apparent upon modification requests to the tool artefacts, since these directly affect the internal state of the tool. For example, a client request to modify a User Function should only be approved if its associated Change Request is in an active state. Such a condition is clearly controlled when the user performs the change via the Presentation layer. When directly accessing the Data layer, the implemented Linked Data interface needs to perform the same control. This leads to additional development effort, as well as an increased risk of inconsistencies between the two resulting controller logics.

Business logic is also necessary when reading artefacts, yet such a need is generally late to identify. When publishing read-only content, it is likely that the publisher is less concerned in case the data is misunderstood or misused by a client. It is also likely that a client misinterprets the provided information, with no opportunities available to obtain feedback from the publisher. Only when an external client tries to modify the publisher’s content does the need for business-logic control arise.

Accessing the data through the application controller ensures that the internal state of the application remains consistent, through any of the established mechanisms used by the controller, such as transaction handling, authentication, logging, etc. Additionally, access through the controller will ensure the correct handling of any external actions towards other software systems, which may be bypassed otherwise. For example, the JIRA bug tracking tool allows external services to subscribe to events via webhooks. Changes to JIRA via the controller will ensure the triggering of such webhooks, and hence the correct interoperability with other software tools. However, a direct change to its database content will simply not be detected.

Finally, with the solution of this second experiment, clients now lose the ability to perform advanced SPARQL queries at the interface. This was previously available through the D2RQ platform, by saving the RDF graph in a triplestore with a SPARQL endpoint. The resulting generated interface does provide an alternative, based on the OSLC Query Capability [15]. Relative to SPARQL, this is however deemed to be insufficient, since the language is less expressive. More importantly, one needs to implement specific OSLC query support for each interface, and cannot take advantage of off-the-shelf SPARQL engines that are readily available.

From these early experiments, the overall conclusion is that conventional Linked Data extraction techniques, such as D2RQ, may work well for relational databases that capture relatively well the business objects of the application. Many engineering software tools do not, however, fall into this category, making the application of D2RQ less valuable. In the next section, we compile the set of requirements that an end-to-end tool interoperability context places on extraction techniques for Linked Data.

5. Requirements on Linked Data extraction of structured content

Our experiences with D2RQ and OSLC lead us to identify and better formulate the necessary requirements on a suitable solution for the extraction of structured content as Linked Data:

1. Read-write RESTful access - It is necessary to support client requests to modify (create, delete and update), as well as read, tool artefacts through their exposed Linked Data resources.
2. Access to business objects - The Linked Data resources to be exposed ought to represent the business objects handled and defined by the tool. These do not necessarily map to entities at the Data Access layer, and will most likely be reflected at the Controller layer.
3. Access controlled by business logic - Client requests to modify a business object need to be controlled by the business logic of the application, as defined in the Controller. Such control can also be relevant for client read access.
4. Loose coupling to the software tool - A sustainable RDF interface shall not be dependent on internal components of the software tool, including the

database schema used to internally manage the Controller. This ensures that SPARQL query re- tool data. Such a dependency restricts the devel- sponses are consistent with the state of the busi- opment of the tool software itself. ness objects of the tool. 5. SPARQL query capabilities - In order to search – A Lyo Store library provides a simplified Triple- and explore resources, a query interface needs to store interface for the OSLC Server and handles be provided as a complement to the read-write the update requests from the Tool Controller. access interface to resources. The standardized With this architecture, RDF representations of busi- SPARQL query interface is to be preferred over ness objects are exposed to clients via one of three re- custom query interfaces, such as OSLC’s limited quest flows: query capability. 6. Interface development support - given the rich 1. Read operations (from OSLC client GET re- and domain-specific resources expected to be quests) are delegated to Lyo Store from the exposed from the different software tools, sup- OSLC Server. Lyo Store then retrieves the re- port with the development of tool interfaces is quested resources from the Triplestore. needed to reduce the cost of development. Sup- 2. Write operations (from OSLC client PUT, port can take the form of Software Development POST, DELETE requests) are directly delegated Kits (SDK), libraries, code generators, graphical to the Tool Controller via the OSLC Server. models, analysis tools, etc. Development support 3. Read-only SPARQL queries are directly exe- should take advantage of the fact that tools may cuted by the Triplestore. share similar technologies, and that the content The delegation of the read operations occurs with- of each tool is available in a structured digital out the involvement of the tool Controller. This im- form. 
proves the overall response performance of the adap- tor, and reduces the load on the Controller, particu- larly in usage scenarios dominated by intensive read 6. Proposed Architecture requests from clients. Read operations are specifi- cally handled by the Store Interface (Fig. 7). Store In- In response to the requirements identified in Sec- terface is part of the Lyo Store library and simplifies tion 5, an architecture evolved from the solution dis- storing and retrieving the OSLC resources from the cussed in Section 4.2 to provide a RESTful tool inter- triplestore. When write operations are performed, the face, which: Triplestore is not updated simultaneously upon such – Communicates with the tool via its Controller in- requests. Instead, the Triplestore will be eventually up- stead of directly accessing its Data layer. dated by the tool Controller - as detailed later in this – Offers clients read and write access through a subsection. This ensures that the business logic in the REST interface. tool Controller is not bypassed and that the triplestore – Offers clients a capability to perform SPARQL contents will not conflict with the internal state of the queries. tool. – Handles the specific business objects of any tool, The ability to maintain a synchronised set of RDF yet interacts with both the tool Controller and its resources representing the business objects of the tool external clients in a generic way. domain is essential to this architecture. The architec- ture supports two alternatives for updating the Triple- The tool Adaptor consists of three main components store (Fig. 7): (Fig. 6): – A push notification is sent to the Update Service – An OSLC Server handles RESTful C.R.U.D. re- by the Tool Controller, upon any change to its in- quests, as well as other OSLC-specific services ternal state. The Update Service then submits an (such as Delegated User Interface dialogues) update request to the Update Manager. from OSLC Clients. 
– A poll is periodically performed by the Update – An RDF Triplestore handles SPARQL queries Manager. from the OSLC Server and the Clients. The Triplestore maintains a synchronised set of RDF In both cases, the update process consists of the fol- representations of all business objects from the lowing steps ( See Fig. 7): / 11

[Figure: the Engineering Tool (Presentation, Tool Controller, Data) connected to the Tool Adaptor (OSLC Server, Lyo Store, RDF Triplestore), which serves OSLC REST and SPARQL clients.]

Fig. 6. The overall Tool Adaptor architecture, showing how C.R.U.D. REST client operations are handled, while also providing advanced SPARQL query capabilities to clients.
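As an informal illustration of the delegation rules in Fig. 6, the Java sketch below uses hypothetical `ToolController` and `Triplestore` stand-ins (the actual Eclipse Lyo and OSLC4J APIs differ): reads are answered from the triplestore replica without involving the Controller, writes are delegated to the Controller so that its business logic is not bypassed, and a sync step lets the Controller refresh the replica.

```java
import java.util.HashMap;
import java.util.Map;

/**
 * A minimal sketch of the Tool Adaptor's request flows. All types and
 * signatures are illustrative assumptions, not the Eclipse Lyo API.
 */
public class AdaptorSketch {

    /** Stand-in for the tool Controller, guarding the business logic. */
    public interface ToolController {
        boolean update(String uri, String value);   // may reject a change
        Map<String, String> businessObjects();      // current internal state
    }

    /** Stand-in for the triplestore holding the RDF representations. */
    public static class Triplestore {
        final Map<String, String> graph = new HashMap<>();
        public String query(String uri) { return graph.get(uri); }
    }

    private final ToolController controller;
    private final Triplestore store = new Triplestore();

    public AdaptorSketch(ToolController controller) { this.controller = controller; }

    /** GET: answered from the triplestore replica, bypassing the Controller. */
    public String get(String uri) { return store.query(uri); }

    /** PUT/POST/DELETE: delegated to the Controller's business logic. */
    public boolean put(String uri, String value) { return controller.update(uri, value); }

    /** Update step: copy the Controller's current business objects into the
     *  replica (a real implementation would also handle deletions). */
    public void sync() { store.graph.putAll(controller.businessObjects()); }
}
```

The point of the sketch is only the routing: the replica is never written by clients directly, so its content can only lag behind, never contradict, the Controller's state.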

[Figure: inside the Tool Adaptor, the OSLC Server delegates READ requests to the Store Interface and WRITE requests to the tool Controller; the Update Manager obtains changes from a Change Provider, either by POLL or via PUSH messages through the Update Service, and dispatches them to Handlers, which update the RDF Triplestore (Java to RDF) through SPARQL query and update requests.]

Fig. 7. Details of the Lyo Store component, showing how the C.R.U.D. operations are handled, as well as the internal triplestore updates via push and pull mechanisms.
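One way to picture the update cycle of Fig. 7 is the following Java sketch, which mimics the Change Provider contract and the handler dispatch. All names and signatures here are illustrative assumptions rather than Lyo Store's actual interface; in the sketch, a change pairs a business object with its metadata (a timestamp, and whether the object was created, updated or deleted).

```java
import java.util.List;
import java.util.Map;

/**
 * A sketch of the triplestore update cycle around a Change Provider.
 * Hypothetical signatures; Lyo Store's real API differs.
 */
public class UpdateCycleSketch {

    public enum Kind { CREATED, UPDATED, DELETED }

    /** A business object plus its accompanying change metadata. */
    public static class Change {
        public final String uri, object;
        public final long timestamp;
        public final Kind kind;
        public Change(String uri, String object, long timestamp, Kind kind) {
            this.uri = uri; this.object = object;
            this.timestamp = timestamp; this.kind = kind;
        }
    }

    /** Implemented by the adaptor developer against the tool Controller. */
    public interface ChangeProvider {
        List<Change> changesSince(long lastUpdate);
    }

    /** A registered handler; the default one updates the triplestore. */
    public interface Handler { void handle(Change c); }

    /** Steps 1-3: request the changes since the last update and dispatch
     *  each one to every registered handler; returns the new watermark. */
    public static long runUpdate(ChangeProvider provider,
                                 List<Handler> handlers, long lastUpdate) {
        long newest = lastUpdate;
        for (Change c : provider.changesSince(lastUpdate)) { // fetch via provider
            for (Handler h : handlers) h.handle(c);          // dispatch
            newest = Math.max(newest, c.timestamp);
        }
        return newest; // remembered until the next poll or push
    }

    /** Step 4: a default handler mirroring changes into a store. */
    public static Handler storeHandler(Map<String, String> store) {
        return c -> {
            if (c.kind == Kind.DELETED) store.remove(c.uri);
            else store.put(c.uri, c.object);
        };
    }
}
```

The same `runUpdate` loop serves both triggering alternatives: a periodic poll simply calls it on a timer, while a push notification from the Update Service calls it immediately.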

1. The Update Manager requests changes from the Change Provider since the last update.
2. The Change Provider fetches the data from the tool Controller and provides a collection of changes. A change consists of a business object to be stored, as well as accompanying metadata (such as the timestamp, and whether the object is created, updated or deleted). Lyo Store defines the Change Provider as an interface that the adaptor developer is expected to implement. This standardises the way tool updates are fetched from an arbitrary tool Controller with a specific vocabulary.
3. The Update Manager sends the changes to all registered Handlers.
4. A Handler updates the resources in the Triplestore according to the changes.

Finally, Lyo Store interacts with the tool Controller and OSLC Server based on instances of OSLC4J-annotated classes, which are expected to represent the tool's business objects. This allows the Adaptor components to work generically with any tool Controller, while at the same time being able to handle tool-specific domain objects. When interacting with the RDF triplestore, Lyo Store eventually marshals/unmarshals such instances into an RDF graph, through the available OSLC4J helper methods (as shown in step 3 of Fig. 3).

7. Reference Implementation

The Lyo Store component of this architecture is implemented as an open-source library, as part of the Eclipse Lyo [16] project. It makes use of the Jena Framework API [17] to interact with any RDF Triplestore that can provide a SPARQL endpoint over HTTP.

Naturally, all components of this architecture need to share the common set of OSLC4J-annotated classes. To facilitate this effort, the adaptor developer can take advantage of the OSLC Lyo Modelling tool [18] in order to graphically model the business objects to be exposed, as well as the services of the OSLC Server component. From such a model, the classes and services can be generated [13].

In summary, with the libraries and adaptor development support available through Lyo, the following steps need to be performed for a developer to produce a functioning tool interface:

– Graphically define the tool's vocabulary to be exposed, and its OSLC services.
– Generate the OSLC Server, including OSLC4J-annotated classes.
– Implement the Change Provider interface to collect the set of changed business objects since the last update.
– Implement either a push or poll mechanism to trigger the Triplestore update process.

8. Evaluation

8.1. Requirements Analysis

We here analyse the proposed architecture with respect to the identified requirements of Section 5:

1. Read-write RESTful access - The proposed architecture is capable of providing RESTful read and write access to resources. Similar to the D2RQ solution, read access is offered via the RDF Triplestore, which will hold all the tool resources according to the defined vocabulary. The architecture additionally supports write access. Unlike the second experiment, write access is no longer automated, but is instead delegated to the tool Controller. From a client perspective, the read-write access is defined to comply with the OSLC standard, which in turn builds upon the LDP recommendation. In this way, clients can interact with an engineering tool in a fashion standardized with other OSLC-compliant interfaces.
2. Access to business objects - In this architecture, the exposed resources are expected to reflect the business objects of the application. This is guaranteed, given that the interactions between the tool Controller and the adaptor components (OSLC Server and Lyo Store) are defined in terms of such entities. Unlike D2RQ, defining the business objects cannot be automated, and will most likely need to be performed manually by the interface designer.
3. Access controlled by business logic - This requirement is satisfied by restricting read-write access to the engineering tool's artefacts solely via the tool's Controller. For read access, the business logic is applied when storing artefacts in the Triplestore. The architecture cannot, however, enforce this same business logic when clients make the actual read requests.
4. Loose coupling to software tool - This requirement is satisfied by restricting the interaction with the engineering tool to its Controller.
5. SPARQL query capabilities - This requirement is satisfied through the RDF Triplestore query interface, whose content is ultimately updated and controlled by the tool Controller.
6. Interface development support - The automation features of D2RQ to generate the tool vocabulary and provide access to RDF resources are lost with the proposed architecture. As discovered in our experiments, such automation did not result in practically valuable results anyway. Instead, the proposed architecture can benefit from the Lyo modelling tool [18], which allows the interface designer to graphically define the interface vocabulary to match the expected business objects. This is in turn complemented with the corresponding code generator [13], which produces the necessary classes to represent the business objects.

8.2. Implementation Evaluation

The proposed architecture was successfully applied in the development of three tool adaptors. The software tools differed in their capabilities to interact with an external adaptor, allowing us to validate and test various parts of the architecture:

– Bugzilla - An adaptor was developed to extract Bugzilla bug reports as OSLC Change Request resources [19]. Since it was not easily possible to detect changes to artefacts within the Bugzilla tool, the poll mechanism was deployed by the Update Manager to update the Triplestore.
– JIRA - An adaptor was similarly developed for JIRA to expose OSLC Change Request resources. In this case, JIRA webhooks could be used to push changes via the Update Service.
– ProR - Besides the standard handler to update the Triplestore, this adaptor included an additional handler extension to support the Tracked Resource Set protocol [20]. Besides the REST and SPARQL query access to clients, the TRS protocol provides a third interaction point that allows clients to track additions, changes and removals from all resources in the set.

During these implementations, Lyo Store was also validated using two different triplestores through the same interface - the NoSQL database MarkLogic [21] and the Apache Jena Fuseki SPARQL server [22].

9. Related Work

There are many tools implementing a range of techniques for mapping relational databases into Linked Data. A comprehensive survey of such tools is presented in [23]. In the paper, Spanos et al. present a classification of the approaches, broadly grouping the tools by whether an ontology is strictly predefined [24,25,26] or is a product of the mapping.

The tools that belong to the latter group can be further divided into three more categories:

1. Tools that generate an ontology with a basic mapping [27,28,29,11]. Basic mapping assumes that the concepts of a database schema are mapped directly to an ontology (a one-to-one mapping between tables and classes, as well as between entity attributes and properties).
2. Tools that can map the relational data onto a domain-specific ontology that is not limited to a simple mapping of the concepts [27,30,31]. The adjustment of the ontology, from the simple mapping to one that reflects the domain, is performed with the active involvement of an expert.
3. Tools that apply a range of heuristics on the database schema, especially on the primary and foreign keys, to generate an ontology that reflects the domain [32,33].

The authors conclude with the opinion that "purely manual tools, where mappings are given by the human user, capture by definition the true meaning of the logical schema, whereas purely automatic tools can rarely achieve such levels of successful interpretation". This further confirms our experiences when trying to automate the bidirectional mapping between Linked Data and relational artefacts.

Spanos et al. left Linked Data based updates as future work, highlighting that the early work in this area has focused only on a basic mapping [34,35,36]. Spanos et al. consider the problem of updating the relational database similar to the database view update problem [37,38,39], and therefore suggest simply porting solutions from that field to Linked Data updates. As we have discussed before, even an advanced RDBMS update approach will not satisfy the Access controlled by business logic requirement. Read-write support for applications based on RDBMS is, however, very important, as argued in Applicability assessment of Semantic Web technologies, because it allows for broader tool integration [40].

Mapping languages underpin the tools exposing Linked Data from relational sources. Hert et al. presented a comparison of various mapping languages and the respective systems that are based on those languages [41]. The comparison was performed using a list of 15 criteria evaluating the capabilities of the respective systems. The last criterion assessed the presence of bidirectional mapping, which supports updating the relational database through Linked Data updates. It was concluded that a basic mapping (in the paper referred to as a direct mapping) does not need support in the mapping language to be possible to implement. The survey also highlighted the mapping languages used by Relational.OWL [24] and OntoAccess (the R3M language) [36], which include support for bidirectional mapping constructs. However, support for the advanced mapping scenarios that these languages theoretically allow was not implemented in the respective tools.

Several improvements have been made over the mapping systems mentioned above. In D2RQ++, Ramanujam et al. added update functionality to the D2RQ system [35], which maps updated triples to SQL queries and stores the triples that cannot be written back to the relational database in a native triplestore. Work by Eisenberg et al. presents D2RQ/Update, which builds upon D2RQ++ [34]. It allows updates of tables that have various constraints, by using an algorithm that performs a topological sort of triples to correctly group them into the corresponding SQL queries. In OntoAccess, Hert et al. presented R3M, an update-aware mapping language that allows specifying database constraints [36]. The 2014 survey of RDB to RDF translation approaches and tools by Michel et al. concludes that rich RDB-to-RDF mappings are often one-way only, while tools with bi-directional mapping support, such as OntoAccess, will have to "find a compromise between the expressiveness of RDB-to-RDF mapping languages and the need for updating relational data using protocols of the semantic web" [42].

A new RML specification is being drafted to extend the R2RML mapping to other data sources [43]. While it allows for more uniform and reusable definitions, it does not solve the problem of query rewriting that is crucial for bidirectional access. We were unable to find research around RML that has focused on applying it to tool integration scenarios in particular, or to any bi-directional Linked Data mapping in general.

Within the industrial setting, multiple research projects have attempted to use Linked Data as the integration technology [44,45,46]. In these projects, the authors did not identify the need for updating source tool data via Linked Data requests, probably because their efforts focused primarily on the integration of the tool datasets, and not the underlying tools directly. Frischmuth et al. have explored the challenges that a large enterprise faces in Linked Data integration [47]. Among the biggest challenges identified were: (a) performance and scalability issues, (b) tool support, and (c) read-write integration. The authors identified D2RQ/Update as an initial solution for solving their read-write integration challenge with RDBMS, but did not evaluate it [34].

10. Summary

This paper studies the challenges of information extraction in an industrial setting, in which structured product data residing in conventional databases needs to be exposed as Linked Data resources, in order to be integrated across the organisation.

As the first main contribution, we report on our experiments in using existing information extraction techniques from the Linked Data domain, when applied to an industrial context. From our experiences, we conclude that such techniques cannot be applied to the software tools from which one needs to extract product data. Besides not supporting write access to the exposed resources, existing approaches tend to focus on extracting information directly from the data storage facilities, bypassing the important controller logic of the software tool.

These negative results lead to a clearer understanding of the data extraction needs. To satisfy these needs, we present - as the second contribution - a proposal for an alternative Linked Data extraction architecture that can mitigate the identified shortcomings. The proposed approach targets the controller layer of the software tool, allowing the interface to capture the semantics of the tool's domain entities, as opposed to the underlying schema of its database. While less automated compared to the existing techniques, the proposed architecture is realised as a standalone library that can still facilitate the extraction process.

References

[1] Henning Kagermann, Wolfgang Wahlster, and Johannes Helbig. Recommendations for implementing the strategic initiative Industrie 4.0 – securing the future of German manufacturing industry. Final report of the Industrie 4.0 working group, acatech – National Academy of Science and Engineering, 2013. URL http://forschungsunion.de/pdf/industrie_4_0_final_report.pdf.
[2] Peter C. Evans and Marco Annunziata. Industrial internet: Pushing the boundaries. General Electric Reports, 2012. URL https://www.ge.com/docs/chapters/Industrial_Internet.pdf.
[3] Jad El-Khoury, Cecilia Ekelin, and Christian Ekholm. Supporting the linked data approach to maintain coherence across rich EMF models. In European Conference on Modelling Foundations and Applications, pages 36–47. Springer, 2016.
[4] PTC Integrity. http://www.ptc.com/application-lifecycle-management/integrity. (Accessed on 04/20/2017).
[5] Bernhard Weichel and Martin Herrmann. A backbone in automotive software development based on XML and ASAM/MSR. SAE technical paper, 03 2004. doi:10.4271/2004-01-0295.
[6] Open Services for Lifecycle Collaboration. http://www.oasis-oslc.org/. (Accessed on 04/20/2017).
[7] Ashok Malhotra, Steve Speicher, and John Arwe. Linked data platform 1.0. W3C recommendation, W3C, February 2015. http://www.w3.org/TR/2015/REC-ldp-20150226/.
[8] Atul Adya, José A. Blakeley, Sergey Melnik, and S. Muralidhar. Anatomy of the ADO.NET entity framework. In Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data, SIGMOD '07, pages 877–888, New York, NY, USA, 2007. ACM. ISBN 978-1-59593-686-8. doi:10.1145/1247480.1247580.
[9] Christian Bizer and Andy Seaborne. D2RQ – treating non-RDF databases as virtual RDF graphs. In Proceedings of the 3rd International Semantic Web Conference (ISWC2004), volume 2004. Citeseer Hiroshima, 2004.
[10] Raji Ghawi and Nadine Cullot. Database-to-ontology mapping generation for semantic interoperability. In VDBL'07 conference, VLDB Endowment ACM, pages 1–8, 2007.
[11] Maksym Korotkiy and Jan L. Top. From relational data to RDFS models. In International Conference on Web Engineering, pages 430–434. Springer, 2004.
[12] Lars Rönnbäck, Olle Regardt, Maria Bergholtz, Paul Johannesson, and Petia Wohed. Anchor modeling – agile information modeling in evolving data environments. Data & Knowledge Engineering, 69(12):1229–1253, 2010.
[13] Jad El-khoury. Lyo code generator: A model-based code generator for the development of OSLC-compliant tool interfaces. SoftwareX, 5:190–194, 2016. ISSN 2352-7110. doi:10.1016/j.softx.2016.08.004.
[14] Christian Bauer and Gavin King. Hibernate in Action. 2005.
[15] OSLC Core version 3.0. OASIS committee specification, OASIS, April 2017. http://docs.oasis-open.org/oslc-core/oslc-core/v3.0/oslc-core-v3.0-part1-overview.html.
[16] Eclipse Lyo – enabling tool integration with OSLC. https://www.eclipse.org/lyo/. (Accessed on 04/20/2017).
[17] Apache Jena. https://jena.apache.org/index.html. (Accessed on 04/20/2017).
[18] Jad El-khoury, Didem Gürdür, and Mattias Nyberg. A model-driven engineering approach to software tool interoperability based on linked data. International Journal On Advances in Software, 9(3 & 4):248–259, 2016.
[19] OSLC change management specification, version 3.0. URL https://tools.oasis-open.org/version-control/browse/wsvn/oslc-ccm/trunk/specs/change-mgt/change-mgt.html. (Accessed on 04/28/2017).
[20] OSLC tracked resource set specification. https://tools.oasis-open.org/version-control/svn/oslc-core/trunk/specs/tracked-resource-set.html. (Accessed on 03/26/2017).
[21] MarkLogic. http://www.marklogic.com/. (Accessed on 04/20/2017).
[22] Apache Jena Fuseki. https://jena.apache.org/documentation/fuseki2/index.html. (Accessed on 04/20/2017).
[23] Dimitrios-Emmanuel Spanos, Periklis Stavrou, and Nikolas Mitrou. Bringing relational databases into the semantic web: A survey. Semantic Web, 3(2):169–209, 2012.
[24] Cristian Pérez de Laborda and Stefan Conrad. Relational.OWL: a data and schema representation format based on OWL. In Proceedings of the 2nd Asia-Pacific Conference on Conceptual Modelling – Volume 43, pages 89–96. Australian Computer Society, Inc., 2005.
[25] Yuan An, Alex Borgida, and John Mylopoulos. Discovering the Semantics of Relational Tables Through Mappings, pages 1–32. Springer Berlin Heidelberg, Berlin, Heidelberg, 2006. ISBN 978-3-540-46330-6. doi:10.1007/11890591_1.
[26] Wei Hu and Yuzhong Qu. Discovering Simple Mappings Between Relational Database Schemas and Ontologies, pages 225–238. Springer Berlin Heidelberg, Berlin, Heidelberg, 2007. ISBN 978-3-540-76298-0. doi:10.1007/978-3-540-76298-0_17.
[27] Christian Bizer and Andy Seaborne. D2RQ – treating non-RDF databases as virtual RDF graphs. In Proceedings of the 3rd International Semantic Web Conference (ISWC2004), volume 2004. Citeseer Hiroshima, 2004.
[28] Cristian Pérez de Laborda and Stefan Conrad. Relational.OWL – a data and schema representation format based on OWL. In Conceptual Modelling 2005, Second Asia-Pacific Conference on Conceptual Modelling (APCCM2005), Newcastle, NSW, Australia, January/February 2005, pages 89–96, 2005.
[29] Matt Fisher, Mike Dean, and Gregory Joiner. Use of OWL and SWRL for semantic relational database translation. In Proceedings of the Fourth OWLED Workshop on OWL: Experiences and Directions, Washington, DC, USA, 1-2 April 2008, 2008.
[30] Andy Seaborne, Damian Steer, and Stuart Williams. SQL-RDF. In W3C Workshop on RDF Access to Relational Databases, 2007.
[31] Kate Byrne. Having triplets – holding cultural data as RDF. In Proceedings of the ECDL 2008 Workshop on Information Access to Cultural Heritage, 2008.
[32] Nasser Alalwan, Hussein Zedan, and François Siewe. Generating OWL ontology for database integration. In Advances in Semantic Processing, 2009. SEMAPRO'09. Third International Conference on, pages 22–31. IEEE, 2009.
[33] Farid Cerbah. Mining the content of relational databases to learn ontologies with deeper taxonomies. In Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology – Volume 01, pages 553–557. IEEE Computer Society, 2008.
[34] Vadim Eisenberg and Yaron Kanza. D2RQ/Update: updating relational data via virtual RDF. In Proceedings of the 21st International Conference on World Wide Web, pages 497–498. ACM, 2012.
[35] Sunitha Ramanujam, Vaibhav Khadilkar, Latifur Khan, Steven Seida, Murat Kantarcioglu, and Bhavani Thuraisingham. Bi-directional translation of relational data into virtual RDF stores. In Semantic Computing (ICSC), 2010 IEEE Fourth International Conference on, pages 268–276. IEEE, 2010.
[36] Matthias Hert, Gerald Reif, and Harald C. Gall. Updating relational data via SPARQL/Update. In Proceedings of the 2010 EDBT/ICDT Workshops, EDBT '10, pages 24:1–24:8, New York, NY, USA, 2010. ACM. ISBN 978-1-60558-990-9. doi:10.1145/1754239.1754266.
[37] François Bancilhon and Nicolas Spyratos. Update semantics of relational views. ACM Transactions on Database Systems (TODS), 6(4):557–575, 1981.
[38] Rom Langerak. View updates in relational databases with an independent scheme. ACM Transactions on Database Systems (TODS), 15(1):40–66, 1990.
[39] Arthur Michael Keller. Updating relational databases through views. PhD thesis, Citeseer, 1995.
[40] Valentina Janev and Sanja Vraneš. Applicability assessment of semantic web technologies. Information Processing & Management, 47(4):507–517, 2011.
[41] Matthias Hert, Gerald Reif, and Harald C. Gall. A comparison of RDB-to-RDF mapping languages. In Proceedings of the 7th International Conference on Semantic Systems, pages 25–32. ACM, 2011.
[42] Franck Michel, Johan Montagnat, and Catherine Faron-Zucker. A survey of RDB to RDF translation approaches and tools. Research report, I3S, May 2014. URL https://hal.archives-ouvertes.fr/hal-00903568. ISRN I3S/RR 2013-04-FR, 24 pages.
[43] Anastasia Dimou, Miel Vander Sande, Pieter Colpaert, Ruben Verborgh, Erik Mannens, and Rik Van de Walle. RML: A generic language for integrated RDF mappings of heterogeneous data. In Proceedings of the 7th Workshop on Linked Data on the Web, April 2014.
[44] Markus Graube, Johannes Pfeffer, Jens Ziegler, and Leon Urbas. Linked data as integrating technology for industrial data. International Journal of Distributed Systems and Technologies (IJDST), 3(3):40–52, 2012. doi:10.4018/jdst.2012070104.
[45] Jan Hladik, Conny Christl, Frank Haferkorn, and Markus Graube. Improving industrial collaboration with linked data, OWL. In OWLED, 2013.
[46] Shusaku Egami, Takahiro Kawamura, Akihiro Fujii, and Akihiko Ohsuga. Building of industrial parts LOD for EDI – a case study. In Joint International Semantic Technology Conference, pages 146–161. Springer, 2014.
[47] Philipp Frischmuth, Jakub Klímek, Sören Auer, Sebastian Tramp, Jörg Unbehauen, Kai Holzweissig, and Carl-Martin Marquardt. Linked data in enterprise information integration. Semantic Web, pages 1–17, 2012.