A Lightweight Data Interchange Format for Internet of Things in the Palcom Middleware Framework

Total Page:16

File Type:pdf, Size:1020Kb

A Lightweight Data Interchange Format for Internet of Things in the Palcom Middleware Framework Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 56 ( 2015 ) 284 – 291 The 10th International Conference on Future Networks and Communications (FNC 2015) A lightweight data interchange format for Internet of Things in the PalCom middleware framework. Mattias Nordahla,*, Boris Magnussona aLund University, Computer Science, Lund, Sweden Abstract We present the PalCom Object Notation, a textual data representation format for communication between internet of things which support binary and textual data. The format does not require parsing of user data (or the “payload”) and is thus efficient to use also for large binary values such as digital images, audio and video as well as for short textual values. These can be mixed in the same messages and thus transported over the same communication link. Its structure is influenced by JSON, making it easy to translate between the two formats. The evaluation show that its size and processing efficiency is comparable to that of JSON for small data, but becomes both smaller and more efficient as data grows, and can yield a tenfold performance increase for binary payloads. ©© 20120155 TheThe Authors.Authors. Published Published by by Elsevier Elsevier B.V. B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). PeerPeer-review-review under under responsibility responsibility of ofthe the Conference Conference Program Program Chairs Chairs. Keywords: Lightweight protocol; Data interchange format; PON; JSON; Textual format; Efficient binary handling; PalCom; Java 1. Introduction In this paper we introduce the PalCom Object Notation (PON), a lightweight format for data interchange developed for use in PalCom. PalCom is a pervasive middleware framework implemented in Java that aims to simplify the creation of dynamic networks between devices in distributed systems, and to give an easy and intuitive * Corresponding author. Tel.: +46-46-2220000 E-mail address: [email protected] 1877-0509 © 2015 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). Peer-review under responsibility of the Conference Program Chairs doi: 10.1016/j.procs.2015.07.210 Mattias Nordahl and Boris Magnusson / Procedia Computer Science 56 ( 2015 ) 284 – 291 285 way for combining services provided by the devices.1 It allows users to combine services from any device on the network into larger units with more complex functionality. PalCom has automatic device and service discovery, as described by a discovery protocol, which entails data of various types being sent between devices. Communication between services themselves are governed by a service-interaction protocol, but what information are sent with it are up to the service implementer. There is thus a need to be able to send a mixture of types of information (text, structured information, binary information such as images, audio and video) between devices in a general and homogeneous way. This is exemplified well in the itACiH project, which develops, on top of PalCom, a system to support advanced care in the home with support for visiting nurses, on-line equipment in the patients’ home, and process support at hospital wards. Previously all PalCom protocols has been represented in XML3, with only limited type support for service communications, as everything was represented as strings. One part of the itACiH project is a web-server with PalCom as backend8, in which JSON2 has been used for communication with clients, which gives direct support for different data types. But as the project progressed a requirement arose for being able to send binary information as well. However, JSON, just as XML, lack the direct ability to represent such data, and must resort to special encodings which increase their size and adds extra processing. This motivated the development of a new compact, textual format, similar to JSON, but which could directly handle binary information. In order to outline the design space we take examples from early language design which also illustrate that these problems are nothing new. 1.1. String handling in FORTRAN Handling of strings in Standard FORTRAN4, Hollerith constants, was an important influence in designing the PalCom Object Notation. Its format statement was extended to include strings: FORMAT(13H HELLO, WORLD) In this format the programmer was to count the number of characters in the string and give the compiler the length before the ‘H’ which indicated the string type. Although the manual counting could be a problem, this format had the advantages that 1) the FORTRAN complier did not need to parse the string itself, but simply copy the length of characters, and 2) the string could contain any character in the character set, with no exception for a special ending delimiter. When computers generate the information, the counting of characters ceases to be a problem, while the two advantages remain. In FORTRAN 775 the modern notation for representing strings was introduced: FORMAT(‘HELLO, WORLD’) Here the compiler had to scan the otherwise uninteresting string in order to count the characters, and the string could not contain single quotes. In situations where hardware are to interpret data, such as in CPUs and network equipment, it is common to use fixed format layouts, as illustrated by the header part of IPv4 packets6 in Figure 1. Also, here the length (in this case of the packet with payload) is given explicitly as in Standard FORTRAN, but note that the “Total Length” attribute is limited to 16 bit numbers and the address fields are limited to 32 bits. Although practical in situations with dedicated hardware, such limitations create problems over time when the length of the fields become too short and new extended protocols such as IPv6 needs to be introduced. It is thus wise to avoid fixed formats and limited lengths of size fields. 286 Mattias Nordahl and Boris Magnusson / Procedia Computer Science 56 ( 2015 ) 284 – 291 Figure 1. IPv4 header. More recent formats, geared towards human readability and easy construction, such as XML and JSON, have chosen representations in line with the FORTRAN 77 format, with explicit delimiters around strings. This notation has the drawback that all user data need to be parsed in order to understand the structure of the information. Additionally, binary information, e.g. images, which are often large by comparison (several MB) needs to be converted to readable text (e.g. through base64 encoding) before inclusion in such messages which increase their size by 1/3, adds more work for the programmer and slows down generation and parsing of messages. With PON we set out to define a format for structured data that combines the structure and compactness offered by JSON with more efficient handling for both textual and binary data. 1.2. Previous work Since JSON was popularized, several new formats have been designed that borrow from or build upon its structure, but simultaneously try to improve certain aspects where JSON falls short. The perhaps most apparent shortcoming of JSON, as suggested by the numeral binary JSON-like formats, is its lack of direct support for handling binary information. BSON11, BJSON12 and Universal Binary JSON13 are all examples of such formats. These formats share the general structuring of data with JSON, but are fully binary rather than textual, which enabled them to makes use of byte or bit manipulation to greatly decrease the number of bytes needed to represent the same data and to speed up processing. Like in Standard FORTRAN, these formats include the length of strings and other data types before their actual value, meaning that they do not need to parse the data itself, looking for an ending delimiter. 2. The PalCom Object Notation The PalCom Object Notation is also a JSON-like format, and can easily be translated to and from it. Like FORTRAN and the previously mentioned binary JSON-like formats, each value is prepended with its own length. Unlike those formats, however, PON is a fully textual format, like JSON, and can therefore be seen as mix of the two variations. Due to knowing the length of values and thus not needing to parse them, PON can directly handle binary information, while still maintaining a textual and thus readable format. 2.1. PON structure and node syntax The overall structure of PON is borrowed directly from JSON, and data is thus organized by objects and arrays, where objects are lists of key-value pairs and arrays are list of values.2 Objects and arrays are themselves also considered values, which enables nesting. In PON all values are represented as nodes that consists of three fields; the length of the value, a type identifier and the actual data value itself: <Length><Type><Data> Mattias Nordahl and Boris Magnusson / Procedia Computer Science 56 ( 2015 ) 284 – 291 287 Length is an integer stating the length of the data, the type identifier is a one byte ASCII character that determines the value type and on which the format of the data field depend. The three fields are written in sequence without spaces or delimiters, as are all nodes themselves. For instance, three consecutive string values are written: 15sMy first string16sMy second string15sMy third string 2.2. Objects, arrays and keys Since objects and arrays are also values they too follow the node structure. An array is a list of values, and its data field is thus a list of nodes, written in sequence, and their combined length is that of the array’s length field. Similarly, objects are lists of key-value pairs. Keys are the only exception to the node structure, instead being more closely represented as they are in JSON, i.e.
Recommended publications
  • Arxiv:2103.14485V2 [Physics.Med-Ph] 12 Apr 2021
    ADWI-BIDS: AN EXTENSION TO THE BRAIN IMAGING DATA STRUCTURE FOR ADVANCED DIFFUSION WEIGHTED IMAGING James Gholam1,2, Filip Szczepankiewicz3, Chantal M.W. Tax1,2,4, Lars Mueller2,5, Emre Kopanoglu2,5, Markus Nilsson3, Santiago Aja-Fernandez6, Matt Griffin1, Derek K. Jones2,5, and Leandro Beltrachini1,2 1School of Physics and Astronomy, Cardiff University, Cardiff, United Kingdom 2Cardiff Univeristy Brain Research Imaging Centre (CUBRIC), Cardiff, United Kingdom 3Department of Diagnostic Radiology, Lund University, Lund, Sweden 4Image Sciences Institute, University Medical Center Utrecht, Utrecht, Netherlands 5School of Psychology, Cardiff University, Cardiff, United Kingdom 6Universidad de Valladolid, Valladolid, Spain ABSTRACT Diffusion weighted imaging techniques permit us to infer microstructural detail in biological tissue in vivo and noninvasively. Modern sequences are based on advanced diffusion encoding schemes, allowing probing of more revealing measures of tissue microstructure than the standard apparent diffusion coefficient or fractional anisotropy. Though these methods may result in faster or more revealing acquisitions, they generally demand prior knowledge of sequence-specific parameters for which there is no accepted sharing standard. Here, we present a metadata labelling scheme suitable for the needs of developers and users within the diffusion neuroimaging community alike: a lightweight, unambiguous parametric map relaying acqusition parameters. This extensible scheme supports a wide spectrum of diffusion encoding methods, from single diffusion encoding to highly complex sequences involving arbitrary gradient waveforms. Built under the brain imaging data structure (BIDS), it allows storage of advanced diffusion MRI data comprehensively alongside any other neuroimaging information, facilitating processing pipelines and multimodal analyses. We illustrate the usefulness of this BIDS-extension with a range of example data, and discuss the extension’s impact on pre- and post-processing software.
    [Show full text]
  • A Coap Server with a Rack Interface for Use of Web Frameworks Such As Ruby on Rails in the Internet of Things
    A CoAP Server with a Rack Interface for Use of Web Frameworks such as Ruby on Rails in the Internet of Things Diploma Thesis Henning Muller¨ Matriculation No. 2198830 March 10, 2015 Supervisor Prof. Dr.-Ing. Carsten Bormann Reviewer Dr.-Ing. Olaf Bergmann Adviser Dipl.-Inf. Florian Junge Faculty 3: Computer Science and Mathematics 2afc1e5 cbna This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License. http://creativecommons.org/licenses/by-nc-sa/4.0/ Henning Muller¨ [email protected] Abstract We present a Constrained Application Protocol (CoAP) server with a Rack interface to enable application development for the Internet of Things (or Wireless Embedded Internet) using frameworks such as Ruby on Rails. Those frameworks avoid the need for reinvention of the wheel, and simplify the use of Test-driven Development (TDD) and other agile software development methods. They are especially beneficial on less constrained devices such as infrastructure devices or application servers. Our solution supports development of applications almost without paradigm change compared to HTTP and provides performant handling of numerous concurrent clients. The server translates transparently between the protocols and also supports specifics of CoAP such as service and resource discovery, block-wise transfers and observing resources. It also offers the possibility of transparent transcoding between JSON and CBOR payloads. The Resource Directory draft was implemented by us as a Rails application running on our server software. Wir stellen einen Constrained Application Protocol (CoAP) Server mit einem Rack In- terface vor, der Anwendungsentwicklung fur¨ das Internet der Dinge (bzw. das Wireless Embedded Internet) mit Frameworks wie Ruby on Rails ermoglicht.¨ Solche Framworks verhindern die Notwendigkeits, das Rad neu zu erfinden und vereinfachen die Anwen- dung testgetriebener Entwicklung (TDD) und anderer agiler Methoden der Softwareen- twicklung.
    [Show full text]
  • Fullpaper Format
    Virtual ITS European Congress, 9-10 November 2020 Paper number ITS-TP2189 TPEG3: Dynamic Information for Automated Vehicles Matthias Unbehaun1, Teun Hendriks2, Martin Pfeifle3, Pieter Pauwels4, 1 TISA – Traveller Information Services Association, Belgium, [email protected] 2 TISA – Traveller Information Services Association, The Netherlands, [email protected] 3 NNG, Hungary, [email protected] , 4 Be-Mobile, Belgium, [email protected] Abstract The TPEG21 protocol suite for traffic and traveller information is developed and maintained by the Traveller Information Services Association (TISA) for delivering real-time services to support drivers. With the advent of automated vehicles, the role of the drivers and their need for information are changing. As the vehicle will be controlling its own trajectory, it will need more detailed and precise external Information for Automated Driving (I4AD) to do this safely. This paper focusses on traffic-related aspects of I4AD; supporting AD vehicles and passengers to stay safe while driving. This paper outlines technical solutions that solve relevant technical and business issues in context with automated driving, addressing also that not all vehicles are automated. The paper describes example use cases and ‘TPEG3 as a product’ in terms of functional, non-functional and business requirements. Basic ideas and concepts that address these requirements are presented. Keywords: Traffic Information, Automated Vehicles, TPEG Introduction Real-time information to support Automated Vehicles (AD) is very relevant for AD vehicles to be able to properly drive through non-standard situations, such as roadworks where roadworkers may move about and dynamic changes in road layouts are to be expected.
    [Show full text]
  • PURDUE UNIVERSITY GRADUATE SCHOOL Thesis/Dissertation Acceptance
    Graduate School Form 30 Updated 12/26/2015 PURDUE UNIVERSITY GRADUATE SCHOOL Thesis/Dissertation Acceptance This is to certify that the thesis/dissertation prepared By GEETHA R SATYANARAYANA Entitled A QUANTITATIVE COMPARISON & EVALUATION OF PROMINENT MARSHALLING/UN-MARSHALLING FORMATS IN DISTRIBUTED REAL-TIME & EMBEDDED SYSTEMS For the degree of Master of Science Is approved by the final examining committee: JAMES H HILL Chair RAJEEV RAJE MIHRAN TUCERYAN To the best of my knowledge and as understood by the student in the Thesis/Dissertation Agreement, Publication Delay, and Certification Disclaimer (Graduate School Form 32), this thesis/dissertation adheres to the provisions of Purdue University’s “Policy of Integrity in Research” and the use of copyright material. Approved by Major Professor(s): JAMES H HILL SHIAOFEN FANG 7/12/2016 Approved by: Head of the Departmental Graduate Program Date A QUANTITATIVE COMPARISON & EVALUATION OF PROMINENT MARSHALLING/UN-MARSHALLING FORMATS IN DISTRIBUTED REAL-TIME & EMBEDDED SYSTEMS A Thesis Submitted to the Faculty of Purdue University by Geetha R. Satyanarayana In Partial Fulfillment of the Requirements for the Degree of Master of Science August 2016 Purdue University Indianapolis, Indiana ii To Amma & Appa iii ACKNOWLEDGMENTS \All that I am, or hope to be, I owe it to my angel mother." I cannot begin to imagine how life would have been if it weren't for all the hardships my mother had to go through, and all the sacrifices she had to make as a single mother to bring my brother and me to where we are today. I cannot thank her enough for her love, her support, her belief in me and her constant motivation to give my best in everything I do.
    [Show full text]
  • Author(S) Hill, Bruce W. Title Evaluation of Efficient XML
    Author(s) Hill, Bruce W. Evaluation of efficient XML interchange (EXI) for large datasets and as an alternative Title to binary JSON encodings Publisher Monterey, California: Naval Postgraduate School Issue Date 2015-03 URL http://hdl.handle.net/10945/45196 This document was downloaded on June 12, 2015 at 11:53:11 NAVAL POSTGRADUATE SCHOOL MONTEREY, CALIFORNIA THESIS EVALUATION OF EFFICIENT XML INTERCHANGE (EXI) FOR LARGE DATASETS AND AS AN ALTERNATIVE TO BINARY JSON ENCODINGS by Bruce W. Hill March 2015 Thesis Advisor: Don Brutzman Co-Advisor: Don McGregor Approved for public release; distribution is unlimited THIS PAGE INTENTIONALLY LEFT BLANK REPORT DOCUMENTATION PAGE Form Approved OMB No. 0704–0188 Public reporting burden for this collection of information is estimated to average 1 hour per response, including the time for reviewing instruction, searching existing data sources, gathering and maintaining the data needed, and completing and reviewing the collection of information. Send comments regarding this burden estimate or any other aspect of this collection of information, including suggestions for reducing this burden, to Washington headquarters Services, Directorate for Information Operations and Reports, 1215 Jefferson Davis Highway, Suite 1204, Arlington, VA 22202-4302, and to the Office of Management and Budget, Paperwork Reduction Project (0704-0188) Washington, DC 20503. 1. AGENCY USE ONLY (Leave blank) 2. REPORT DATE 3. REPORT TYPE AND DATES COVERED March 2015 Master’s Thesis 4. TITLE AND SUBTITLE 5. FUNDING NUMBERS EVALUATION OF EFFICIENT XML INTERCHANGE (EXI) FOR LARGE W4V02 DATASETS AND AS AN ALTERNATIVE TO BINARY JSON ENCODINGS 6. AUTHOR Bruce W. Hill 7. PERFORMING ORGANIZATION NAME AND ADDRESS 8.
    [Show full text]
  • Network Working Group C. Bormann Internet-Draft Universitaet Bremen TZI Obsoletes: 7049 (If Approved) P
    Network Working Group C. Bormann Internet-Draft Universitaet Bremen TZI Obsoletes: 7049 (if approved) P. Hoffman Intended status: Standards Track ICANN Expires: 3 April 2021 30 September 2020 Concise Binary Object Representation (CBOR) draft-ietf-cbor-7049bis-16 Abstract The Concise Binary Object Representation (CBOR) is a data format whose design goals include the possibility of extremely small code size, fairly small message size, and extensibility without the need for version negotiation. These design goals make it different from earlier binary serializations such as ASN.1 and MessagePack. This document is a revised edition of RFC 7049, with editorial improvements, added detail, and fixed errata. This revision formally obsoletes RFC 7049, while keeping full compatibility of the interchange format from RFC 7049. It does not create a new version of the format. Status of This Memo This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79. Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet- Drafts is at https://datatracker.ietf.org/drafts/current/. Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress." This Internet-Draft will expire on 3 April 2021. Copyright Notice Copyright (c) 2020 IETF Trust and the persons identified as the document authors.
    [Show full text]
  • JSON Tiles: Fast Analytics on Semi-Structured Data
    JSON Tiles: Fast Analytics on Semi-Structured Data Dominik Durner Viktor Leis Thomas Neumann Technische Universität München Friedrich-Schiller-Universität Jena Technische Universität München [email protected] [email protected] [email protected] ABSTRACT ery Sinew [57] Access Developers often prefer flexibility over upfront schema design, mak- Optimization Performance (Statistics) JSON Tiles Raw ing semi-structured data formats such as JSON increasingly popular. Parser Large amounts of JSON data are therefore stored and analyzed by Dremel [6, 42] [37, 39] relational database systems. In existing systems, however, JSON’s Robustness JSONB [44, 52] lack of a fixed schema results in slow analytics. In this paper, we Update Friendliness present JSON tiles, which, without losing the flexibility of JSON, en- Figure 1: Classification of existing work. ables relational systems to perform analytics on JSON data at native speed. JSON tiles automatically detects the most important keys and data that is stored in relational systems. Analytics on large JSON extracts them transparently – often achieving scan performance data is valuable but expensive. Specialized tools for log file analysis, similar to columnar storage. At the same time, JSON tiles is capable such as Splunk [55] exist, but lack the flexibility and functionality of handling heterogeneous and changing data. Furthermore, we of general-purpose data management systems. automatically collect statistics that enable the query optimizer to To speed up analytical processing of JSON data, a number of find good execution plans. Our experimental evaluation compares approaches have been proposed. Figure 1 classifies them with re- against state-of-the-art systems and research proposals and shows spect to access performance, robustness to heterogeneous data, and that our approach is both robust and efficient.
    [Show full text]
  • Network Working Group C. Bormann Internet-Draft Universitaet Bremen TZI Intended Status: Informational B
    Network Working Group C. Bormann Internet-Draft Universitaet Bremen TZI Intended status: Informational B. Gamari Expires: December 16, 2017 Well-Typed H. Birkholz Fraunhofer SIT June 14, 2017 Concise Binary Object Representation (CBOR) Tags for Time, Duration, and Period draft-bormann-cbor-time-tag-01 Abstract The Concise Binary Object Representation (CBOR, RFC 7049) is a data format whose design goals include the possibility of extremely small code size, fairly small message size, and extensibility without the need for version negotiation. In CBOR, one point of extensibility is the definition of CBOR tags. RFC 7049 defines two tags for time: CBOR tag 0 (RFC3339 time) and tag 1 (Posix time [TIME_T], int or float). Since then, additional requirements have become known. The present document defines a CBOR tag for time that allows a more elaborate representation of time, and anticipates the definition of related CBOR tags for duration and time period. It is intended as the reference document for the IANA registration of the CBOR tags defined. Note to Readers Version -00 of the present draft opened up the possibilities provided by extended representations of time in CBOR. The present version -01 consolidates this draft to non-speculative content, the normative parts of which are believed will stay unchanged during further development of the draft. This version is provided to aid the registration of the CBOR tag immediately needed. Further versions will re-introduce some of the material from -00, but in a more concrete form. Status of This Memo This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79.
    [Show full text]
  • XML Interoperability Standards for Seamless Communication: An
    Computers in Industry 92 (2017) 118–136 Contents lists available at ScienceDirect Computers in Industry journal homepage: www.elsevier.com/locate/compind XML interoperability standards for seamless communication: An analysis of industry-neutral and domain-specific initiatives Claudia-Melania Chituc Eindhoven University of Technology, Eindhoven, The Netherlands A R T I C L E I N F O A B S T R A C T Article history: Received 20 October 2016 Attaining seamless interoperability among heterogeneous communication systems and technologies Received in revised form 12 June 2017 remains a great challenge in todays’ networked world. Real time information exchange among Accepted 20 June 2017 heterogeneous and geographically distributed systems is required to support the execution of complex e- Available online xxx business scenarios supported by cross-organizational business processes. XML emerged as a foundation for performing e-business, receiving an increasing adoption in the new market economies. Several Keywords: interoperability standards emerged, which provide specifications on performing business-to-business e- Interoperability business, what information to share, when and how. The aim of this article is to present an up-to-date Standard review of current XML-based industry-neutral and domain-specific standardization initiatives towards Industry-neutral standardization initiative seamless communication, discussing their commonalities and differences, and highlighting directions Domain-specific standardization initiatives ’ XML for further research and development work. An overview of the main standards elements is also made, analyzing how different initiatives address them. As numerous standardization initiatives are quickly emerging and changing, it is not easy to understand them. By analyzing the most commonly referred XML-based standards for interoperability, this article has two main contributions.
    [Show full text]
  • Universal Binary JSON Documentation Release 0.9-Dev
    Universal Binary JSON Documentation Release 0.9-dev UBJSON Community October 13, 2014 Contents 1 Specification 3 1.1 Data Format...............................................3 1.2 Value Types................................................3 1.3 Container Types.............................................8 1.4 Streaming Types............................................. 11 1.5 Size Requirements............................................ 13 1.6 Endianness................................................ 13 1.7 MIME Type............................................... 13 1.8 File Extension.............................................. 14 1.9 Best Practices............................................... 14 2 Type reference 15 2.1 Numeric Types.............................................. 15 2.2 String Encoding............................................. 16 2.3 Arrays & Objects............................................. 16 2.4 Support for ‘huge’ Numeric Type.................................... 16 2.5 Optimized Storage Size......................................... 17 2.6 noop and Streaming Support....................................... 17 2.7 Examples................................................. 17 3 Libraries 19 3.1 D..................................................... 19 3.2 Java.................................................... 19 3.3 .NET................................................... 19 3.4 Node.js.................................................. 19 3.5 Python.................................................. 19 4 Thanks
    [Show full text]
  • Reusable Components, Techniques and Standards for the Ciaas
    FP7-ICT-2013-EU-Japan ClouT: Cloud of Things for empowering the citizen clout in smart cities FP7 contract number: 608641 NICT management number: 167ア Project deliverable D2.1 – Reusable components, techniques and standards for the CIaaS ABSTRACT This document contains a collection of reusable objects (hardware and software components, techniQues, protocols and standards) candidate to be part of the Infrastructure layer in the ClouT Reference Architecture. All these objects - described in detail within the document - derive from the experience of each project’s partner, from pilot cities existing assets and from the state of the art. The final result will be the input – together with other deliverables, namely D1.1 (“Use Cases & User Requirements”) and D3.1 (“Reusable Components and techniques for CPaaS”) - for the definition of the ClouT Reference Architecture that will be described in the next deliverables, being D1.2 (“First version of Reference Architecture and Reusable Components”) and D1.3 (“Final Requirements and Reference Architecture”). Only some of the illustrated objects in this document will be inserted in the ClouT Reference Architecture: the choice depends on the features and advantages related to each component, and especially from User and system requirements that will be extracted from the collection of the use cases. D2.1 - Reusable components, techniQues and standards for the CIaaS Disclaimer This document has been produced in the context of the ClouT Project which is jointly funded by the European Commission (grant agreement n° 608641) and NICT from Japan (management number 167ア). All information provided in this document is provided "as is" and no guarantee or warranty is given that the information is fit for any particular purpose.
    [Show full text]
  • A Gentle Introduction to Document Stores and Querying with the SQL/JSON Path Language
    Faculty of Computer Science Database and Software Engineering Group A Gentle Introduction to Document Stores and Querying with the SQL/JSON Path Language Marcus Pinnecke Advanced Topics in Databases, 2019/June/21 Otto-von-Guericke University of Magdeburg Thanks to! Prof. Dr. Bernhard Seeger & Nikolaus Glombiewski, M.Sc. (University Marburg), and Prof. Dr. Anika Groß (University Leipzig) ● For their support and slides on NoSQL/Document Store topics Prof. Dr. Kai-Uwe Sattler (University Ilmenau), and The SQL-Standardisierungskomitee ● For their pointers to JSON support in the SQL Standard David Broneske , M.Sc. (University Magdeburg) Gabriel Campero, M.Sc. (University Magdeburg) ● For feedback and proofreading Marcus Pinnecke | Physical Design for Document Store Analytics 2 About Myself Marcus Pinnecke, M.Sc. (Computer Science) ● Full-time database research associate ● Information technology system electronics engineer Faculty of Computer Science Datenbanken & Software Engineering Universitätsplatz 2, G29-125 39106, Magdeburg, Germany Marcus Pinnecke | Physical Design for Document Store Analytics 3 About Myself /marcus_pinnecke /pinnecke /in/marcus-pinnecke-459a494a/ /pers/hd/p/Pinnecke:Marcus marcus.pinnecke{at-ovgu} /citations?user=wcuhwpwAAAAJ&hl=en /profile/Marcus_Pinnecke www.pinnecke.info Marcus Pinnecke | Physical Design for Document Store Analytics 4 4 There’s a lot to come, fast. The Matrix (1999). Warner Bros. 5 Make notes and visit these slides twice. Rough Outline - What you’ll learn The Case for Semi-Structured Data ● Semi-structured data, arguments and implications ● Overview of database systems, and rankings ● Document Database Model Document Stores ● Document Stores Overview and Comparison ● CRUD (Create, Read, Update, Delete) Operations in mongoDB and CouchDB Storage Engine Overview ● Insights into CouchDBs Append-Only storage engine ● Insights into mongoDBs Update-In-Place storage engine ● Physical Record Organization (JSON, UBJSON, BSON, CARBON) JSON Documents in Rel.
    [Show full text]