Distributed Data Analytics Thorsten Papenbrock Encoding and Evolution G-3.1.09, Campus III Hasso Plattner Institut
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Arxiv:2103.14485V2 [Physics.Med-Ph] 12 Apr 2021
ADWI-BIDS: AN EXTENSION TO THE BRAIN IMAGING DATA STRUCTURE FOR ADVANCED DIFFUSION WEIGHTED IMAGING James Gholam1,2, Filip Szczepankiewicz3, Chantal M.W. Tax1,2,4, Lars Mueller2,5, Emre Kopanoglu2,5, Markus Nilsson3, Santiago Aja-Fernandez6, Matt Griffin1, Derek K. Jones2,5, and Leandro Beltrachini1,2 1School of Physics and Astronomy, Cardiff University, Cardiff, United Kingdom 2Cardiff Univeristy Brain Research Imaging Centre (CUBRIC), Cardiff, United Kingdom 3Department of Diagnostic Radiology, Lund University, Lund, Sweden 4Image Sciences Institute, University Medical Center Utrecht, Utrecht, Netherlands 5School of Psychology, Cardiff University, Cardiff, United Kingdom 6Universidad de Valladolid, Valladolid, Spain ABSTRACT Diffusion weighted imaging techniques permit us to infer microstructural detail in biological tissue in vivo and noninvasively. Modern sequences are based on advanced diffusion encoding schemes, allowing probing of more revealing measures of tissue microstructure than the standard apparent diffusion coefficient or fractional anisotropy. Though these methods may result in faster or more revealing acquisitions, they generally demand prior knowledge of sequence-specific parameters for which there is no accepted sharing standard. Here, we present a metadata labelling scheme suitable for the needs of developers and users within the diffusion neuroimaging community alike: a lightweight, unambiguous parametric map relaying acqusition parameters. This extensible scheme supports a wide spectrum of diffusion encoding methods, from single diffusion encoding to highly complex sequences involving arbitrary gradient waveforms. Built under the brain imaging data structure (BIDS), it allows storage of advanced diffusion MRI data comprehensively alongside any other neuroimaging information, facilitating processing pipelines and multimodal analyses. We illustrate the usefulness of this BIDS-extension with a range of example data, and discuss the extension’s impact on pre- and post-processing software. -
Msgpack Documentation Release 1.0
msgpack Documentation Release 1.0 Author 2021-03-18 Contents 1 API reference 3 2 Advanced usage 9 Python Module Index 11 Index 13 i ii msgpack Documentation, Release 1.0 MessagePack is a efficient format for inter language data exchange. Contents 1 msgpack Documentation, Release 1.0 2 Contents CHAPTER 1 API reference msgpack.pack(o, stream, **kwargs) Pack object o and write it to stream See Packer for options. dump() is alias for pack() msgpack.packb(o, **kwargs) Pack object o and return packed bytes See Packer for options. dumps() is alias for packb() msgpack.unpack(stream, **kwargs) Unpack an object from stream. Raises ExtraData when stream contains extra bytes. See Unpacker for options. load() is alias for unpack() msgpack.unpackb(packed, *, object_hook=None, list_hook=None, bool use_list=True, bool raw=False, int timestamp=0, bool strict_map_key=True, unicode_errors=None, object_pairs_hook=None, ext_hook=ExtType, Py_ssize_t max_str_len=-1, Py_ssize_t max_bin_len=-1, Py_ssize_t max_array_len=-1, Py_ssize_t max_map_len=-1, Py_ssize_t max_ext_len=-1) Unpack packed_bytes to object. Returns an unpacked object. Raises ExtraData when packed contains extra bytes. Raises ValueError when packed is incomplete. Raises FormatError when packed is not valid msgpack. Raises StackError when packed contains too nested. Other exceptions can be raised during unpacking. See Unpacker for options. max_xxx_len options are configured automatically from len(packed). loads() is alias for unpackb() 3 msgpack Documentation, Release 1.0 class msgpack.Packer(default=None, *, bool use_single_float=False, bool autoreset=True, bool use_bin_type=True, bool strict_types=False, bool datetime=False, uni- code_errors=None) MessagePack Packer Usage: packer= Packer() astream.write(packer.pack(a)) astream.write(packer.pack(b)) Packer’s constructor has some keyword arguments: Parameters • default (callable) – Convert user type to builtin type that Packer supports. -
Advanced JSON Handling in Go 19:40 05 Mar 2020 Jonathan Hall Devops Evangelist / Go Developer / Clean Coder / Salsa Dancer About Me
Advanced JSON handling in Go 19:40 05 Mar 2020 Jonathan Hall DevOps Evangelist / Go Developer / Clean Coder / Salsa Dancer About me Open Source contributor; CouchDB PMC, author of Kivik Core Tech Lead for Lana Former eCommerce Dev Manager at Bugaboo Former backend developer at Teamwork.com Former backend developer at Booking.com Former tech lead at eFolder/DoubleCheck 2 Show of hands Who has... ...used JSON in a Go program? ...been frustrated by Go's strict typing when dealing with JSON? ...felt limited by Go's standard JSON handling? What have been your biggest frustrations? 3 Today's Topics Very brief intro to JSON in Go Basic use of maps and structs Handling inputs of unknown type Handling data with some unknown fields 4 A brief intro to JSON JavaScript Object Notation, defined by RFC 8259 Human-readable, textual representation of arbitrary data Limted types: null, Number, String, Boolean, Array, Object Broad applications: Config files, data interchange, simple messaging 5 Alternatives to JSON YAML, TOML, INI BSON, MessagePack, CBOR, Smile XML ProtoBuf Custom/proprietary formats Many principles discussed in this presentation apply to any of the above formats. 6 Marshaling JSON Creating JSON from a Go object is (usually) very straight forward: func main() { x := map[string]string{ "foo": "bar", } data, _ := json.Marshal(x) fmt.Println(string(data)) } Run 7 Marshaling JSON, #2 Creating JSON from a Go object is (usually) very straight forward: func main() { type person struct { Name string `json:"name"` Age int `json:"age"` Description string `json:"descr,omitempty"` secret string // Unexported fields are never (un)marshaled } x := person{ Name: "Bob", Age: 32, secret: "Shhh!", } data, _ := json.Marshal(x) fmt.Println(string(data)) } Run 8 Unmarshaling JSON Unmarshaling JSON is often a bit trickier. -
A Coap Server with a Rack Interface for Use of Web Frameworks Such As Ruby on Rails in the Internet of Things
A CoAP Server with a Rack Interface for Use of Web Frameworks such as Ruby on Rails in the Internet of Things Diploma Thesis Henning Muller¨ Matriculation No. 2198830 March 10, 2015 Supervisor Prof. Dr.-Ing. Carsten Bormann Reviewer Dr.-Ing. Olaf Bergmann Adviser Dipl.-Inf. Florian Junge Faculty 3: Computer Science and Mathematics 2afc1e5 cbna This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License. http://creativecommons.org/licenses/by-nc-sa/4.0/ Henning Muller¨ [email protected] Abstract We present a Constrained Application Protocol (CoAP) server with a Rack interface to enable application development for the Internet of Things (or Wireless Embedded Internet) using frameworks such as Ruby on Rails. Those frameworks avoid the need for reinvention of the wheel, and simplify the use of Test-driven Development (TDD) and other agile software development methods. They are especially beneficial on less constrained devices such as infrastructure devices or application servers. Our solution supports development of applications almost without paradigm change compared to HTTP and provides performant handling of numerous concurrent clients. The server translates transparently between the protocols and also supports specifics of CoAP such as service and resource discovery, block-wise transfers and observing resources. It also offers the possibility of transparent transcoding between JSON and CBOR payloads. The Resource Directory draft was implemented by us as a Rails application running on our server software. Wir stellen einen Constrained Application Protocol (CoAP) Server mit einem Rack In- terface vor, der Anwendungsentwicklung fur¨ das Internet der Dinge (bzw. das Wireless Embedded Internet) mit Frameworks wie Ruby on Rails ermoglicht.¨ Solche Framworks verhindern die Notwendigkeits, das Rad neu zu erfinden und vereinfachen die Anwen- dung testgetriebener Entwicklung (TDD) und anderer agiler Methoden der Softwareen- twicklung. -
Performance Comparison of Data Serialization Schemes for ETSI ITS Car-To-X Communication Systems
International Journal on Advances in Telecommunications, vol 8 no 1 & 2, year 2015, http://www.iariajournals.org/telecommunications/ 48 Performance Comparison of Data Serialization Schemes for ETSI ITS Car-to-X Communication Systems Sebastian Bittl, Arturo A. Gonzalez, Michael Spahn,¨ and Wolf A. Heidrich Fraunhofer ESK Munich, Germany Email: fsebastian.bittl, arturo.gonzalez, michael.spaehn, [email protected] Abstract—Wireless Car-to-X communication is about to enter sense collision avoidance mechanism (CSMA-CA) for multi- the mass market in upcoming years. The non-licensed, but user channel access, longer packets imply longer transmission reserved bandwidth for such communications is relatively narrow times and hence a higher channel busy ratio. This means that in terms of the few usable channels and the expected number nodes near a transmitter, which are willing to transmit as well, of communicating entities. Among other communication aspects, must wait longer until the channel is free to use. Moreover, data serialization schemes that allow compact encoding as well as the wireless channel has a limited capacity allowing only a fast encoding and decoding are required. Compact representation of data helps in keeping a minimal bandwidth overhead in limited number of transmitted bits per unit of time. The lower the wireless channel. Moreover, since delay is an important the number of transmitted messages, the higher the number of performance parameter in Car-to-X communication, the pro- users that could use the given frequency spectrum in parallel. cessing time of the serialization/deserialization schemes shall be kept to a minimum. So far, no detailed analysis of different The data serialization schemes also influence processing data serialization schemes for Car-to-X communication regarding power requirements in both ETSI Intelligent Transport Sys- important properties such as runtime, memory consumption tems (ITS) in Europe [2] and Wireless Access in Vehicular and encoded output length has been published. -
A Lightweight and Efficient Document Representation
docrep: A lightweight and efficient document representation framework Tim Dawborn and James R. Curran -lab,e School of Information Technologies University of Sydney NSW 2006, Australia tim.dawborn,james.r.curran @sydney.edu.au { } Abstract Modelling linguistic phenomena requires highly structured and complex data representations. Document representation frameworks (DRFs) provide an interface to store and retrieve multiple annotation layers over a document. Researchers face a difficult choice: using a heavy-weight DRF or implement a custom DRF. The cost is substantial, either learning a new complex system, or continually adding features to a home-grown system that risks overrunning its original scope. We introduce DOCREP, a lightweight and efficient DRF, and compare it against existing DRFs. We discuss our design goals and implementations in C++, Python, and Java. We transform the OntoNotes 5 corpus using DOCREP and UIMA, providing a quantitative comparison, as well as discussing modelling trade-offs. We conclude with qualitative feedback from researchers who have used DOCREP for their own projects. Ultimately, we hope DOCREP is useful for the busy researcher who wants the benefits of a DRF, but has better things to do than to write one. 1 Introduction Computational Linguistics (CL) is increasingly a data-driven research discipline with researchers us- ing diverse collections of large-scale corpora (Parker et al., 2011). Representing linguistic phenomena can require modelling intricate data structures, both flat and hierarchical, layered over the original text; e.g. tokens, sentences, parts-of-speech, named entities, coreference relations, and trees. The scale and complexity of the data demands efficient representations. A document representation framework (DRF) should support the creation, storage, and retrieval of different annotation layers over collections of hetero- geneous documents. -
Fullpaper Format
Virtual ITS European Congress, 9-10 November 2020 Paper number ITS-TP2189 TPEG3: Dynamic Information for Automated Vehicles Matthias Unbehaun1, Teun Hendriks2, Martin Pfeifle3, Pieter Pauwels4, 1 TISA – Traveller Information Services Association, Belgium, [email protected] 2 TISA – Traveller Information Services Association, The Netherlands, [email protected] 3 NNG, Hungary, [email protected] , 4 Be-Mobile, Belgium, [email protected] Abstract The TPEG21 protocol suite for traffic and traveller information is developed and maintained by the Traveller Information Services Association (TISA) for delivering real-time services to support drivers. With the advent of automated vehicles, the role of the drivers and their need for information are changing. As the vehicle will be controlling its own trajectory, it will need more detailed and precise external Information for Automated Driving (I4AD) to do this safely. This paper focusses on traffic-related aspects of I4AD; supporting AD vehicles and passengers to stay safe while driving. This paper outlines technical solutions that solve relevant technical and business issues in context with automated driving, addressing also that not all vehicles are automated. The paper describes example use cases and ‘TPEG3 as a product’ in terms of functional, non-functional and business requirements. Basic ideas and concepts that address these requirements are presented. Keywords: Traffic Information, Automated Vehicles, TPEG Introduction Real-time information to support Automated Vehicles (AD) is very relevant for AD vehicles to be able to properly drive through non-standard situations, such as roadworks where roadworkers may move about and dynamic changes in road layouts are to be expected. -
Package 'Rcppmsgpack'
Package ‘RcppMsgPack’ November 18, 2018 Type Package Title 'MsgPack' C++ Header Files and Interface Functions for R Version 0.2.3 Date 2018-11-18 Author Travers Ching and Dirk Eddelbuettel; the authors and contributors of MsgPack Maintainer Dirk Eddelbuettel <[email protected]> Description 'MsgPack' header files are provided for use by R packages, along with the ability to access, create and alter 'MsgPack' objects directly from R. 'MsgPack' is an efficient binary serialization format. It lets you exchange data among multiple languages like 'JSON' but it is faster and smaller. Small integers are encoded into a single byte, and typical short strings require only one extra byte in addition to the strings themselves. This package provides headers from the 'msgpack-c' implementation for C and C++(11) for use by R, particularly 'Rcpp'. The included 'msgpack-c' headers are licensed under the Boost Software License (Version 1.0); the code added by this package as well the R integration are licensed under the GPL (>= 2). See the files 'COPYRIGHTS' and 'AUTHORS' for a full list of copyright holders and contributors to 'msgpack-c'. Copyright file inst/COPYRIGHTS License GPL (>= 2) Imports Rcpp LinkingTo Rcpp, BH BugReports https://github.com/eddelbuettel/rcppmsgpack/issues RoxygenNote 6.0.1 Suggests knitr, rmarkdown, microbenchmark VignetteBuilder knitr NeedsCompilation yes Repository CRAN Date/Publication 2018-11-18 21:10:03 UTC 1 2 RcppMsgPack-package R topics documented: RcppMsgPack-package . .2 arrayEx . .3 enumEx . .4 msgpack_format . .4 msgpack_map . .5 msgpack_pack . .6 msgpack_read . .6 msgpack_simplify . .7 msgpack_timestamp_decode . .8 msgpack_timestamp_encode . .8 msgpack_unpack . .9 msgpack_write . 10 Index 11 RcppMsgPack-package ’MsgPack’ C++ Header Files and Interface Functions for R Description ’MsgPack’ header files are provided for use by R packages, along with the ability to access, create and alter ’MsgPack’ objects directly from R. -
PURDUE UNIVERSITY GRADUATE SCHOOL Thesis/Dissertation Acceptance
Graduate School Form 30 Updated 12/26/2015 PURDUE UNIVERSITY GRADUATE SCHOOL Thesis/Dissertation Acceptance This is to certify that the thesis/dissertation prepared By GEETHA R SATYANARAYANA Entitled A QUANTITATIVE COMPARISON & EVALUATION OF PROMINENT MARSHALLING/UN-MARSHALLING FORMATS IN DISTRIBUTED REAL-TIME & EMBEDDED SYSTEMS For the degree of Master of Science Is approved by the final examining committee: JAMES H HILL Chair RAJEEV RAJE MIHRAN TUCERYAN To the best of my knowledge and as understood by the student in the Thesis/Dissertation Agreement, Publication Delay, and Certification Disclaimer (Graduate School Form 32), this thesis/dissertation adheres to the provisions of Purdue University’s “Policy of Integrity in Research” and the use of copyright material. Approved by Major Professor(s): JAMES H HILL SHIAOFEN FANG 7/12/2016 Approved by: Head of the Departmental Graduate Program Date A QUANTITATIVE COMPARISON & EVALUATION OF PROMINENT MARSHALLING/UN-MARSHALLING FORMATS IN DISTRIBUTED REAL-TIME & EMBEDDED SYSTEMS A Thesis Submitted to the Faculty of Purdue University by Geetha R. Satyanarayana In Partial Fulfillment of the Requirements for the Degree of Master of Science August 2016 Purdue University Indianapolis, Indiana ii To Amma & Appa iii ACKNOWLEDGMENTS \All that I am, or hope to be, I owe it to my angel mother." I cannot begin to imagine how life would have been if it weren't for all the hardships my mother had to go through, and all the sacrifices she had to make as a single mother to bring my brother and me to where we are today. I cannot thank her enough for her love, her support, her belief in me and her constant motivation to give my best in everything I do. -
Enabling Digital Twins – a Comparative Study on Messaging Protocols and Serialization Formats for Digital Twins in Iov
Linköping University | Department of Computer and Information Science Master’s thesis, 30 ECTS | Datateknik 2019 | LIU-IDA/LITH-EX-A--19/066--SE Enabling Digital Twins – A comparative study on messaging protocols and serialization formats for Digital Twins in IoV Att möjliggöra digitala tvillingar Daniel Persson Proos Supervisor : Patrick Lambrix Examiner : Niklas Carlsson Linköpings universitet SE–581 83 Linköping +46 13 28 10 00 , www.liu.se Upphovsrätt Detta dokument hålls tillgängligt på Internet - eller dess framtida ersättare - under 25 år från publicer- ingsdatum under förutsättning att inga extraordinära omständigheter uppstår. Tillgång till dokumentet innebär tillstånd för var och en att läsa, ladda ner, skriva ut enstaka ko- pior för enskilt bruk och att använda det oförändrat för ickekommersiell forskning och för undervis- ning. Överföring av upphovsrätten vid en senare tidpunkt kan inte upphäva detta tillstånd. All annan användning av dokumentet kräver upphovsmannens medgivande. För att garantera äktheten, säker- heten och tillgängligheten finns lösningar av teknisk och administrativ art. Upphovsmannens ideella rätt innefattar rätt att bli nämnd som upphovsman i den omfattning som god sed kräver vid användning av dokumentet på ovan beskrivna sätt samt skydd mot att dokumentet ändras eller presenteras i sådan form eller i sådant sammanhang som är kränkande för upphovsman- nens litterära eller konstnärliga anseende eller egenart. För ytterligare information om Linköping University Electronic Press se förlagets hemsida http://www.ep.liu.se/. Copyright The publishers will keep this document online on the Internet - or its possible replacement - for a period of 25 years starting from the date of publication barring exceptional circumstances. The online availability of the document implies permanent permission for anyone to read, to down- load, or to print out single copies for his/hers own use and to use it unchanged for non-commercial research and educational purpose. -
Author(S) Hill, Bruce W. Title Evaluation of Efficient XML
Author(s) Hill, Bruce W. Evaluation of efficient XML interchange (EXI) for large datasets and as an alternative Title to binary JSON encodings Publisher Monterey, California: Naval Postgraduate School Issue Date 2015-03 URL http://hdl.handle.net/10945/45196 This document was downloaded on June 12, 2015 at 11:53:11 NAVAL POSTGRADUATE SCHOOL MONTEREY, CALIFORNIA THESIS EVALUATION OF EFFICIENT XML INTERCHANGE (EXI) FOR LARGE DATASETS AND AS AN ALTERNATIVE TO BINARY JSON ENCODINGS by Bruce W. Hill March 2015 Thesis Advisor: Don Brutzman Co-Advisor: Don McGregor Approved for public release; distribution is unlimited THIS PAGE INTENTIONALLY LEFT BLANK REPORT DOCUMENTATION PAGE Form Approved OMB No. 0704–0188 Public reporting burden for this collection of information is estimated to average 1 hour per response, including the time for reviewing instruction, searching existing data sources, gathering and maintaining the data needed, and completing and reviewing the collection of information. Send comments regarding this burden estimate or any other aspect of this collection of information, including suggestions for reducing this burden, to Washington headquarters Services, Directorate for Information Operations and Reports, 1215 Jefferson Davis Highway, Suite 1204, Arlington, VA 22202-4302, and to the Office of Management and Budget, Paperwork Reduction Project (0704-0188) Washington, DC 20503. 1. AGENCY USE ONLY (Leave blank) 2. REPORT DATE 3. REPORT TYPE AND DATES COVERED March 2015 Master’s Thesis 4. TITLE AND SUBTITLE 5. FUNDING NUMBERS EVALUATION OF EFFICIENT XML INTERCHANGE (EXI) FOR LARGE W4V02 DATASETS AND AS AN ALTERNATIVE TO BINARY JSON ENCODINGS 6. AUTHOR Bruce W. Hill 7. PERFORMING ORGANIZATION NAME AND ADDRESS 8. -
Parsing Gigabytes of JSON Per Second
Parsing Gigabytes of JSON per Second Geoff Langdale · Daniel Lemire Abstract JavaScript Object Notation or JSON is a The JSON syntax can be viewed as a restricted form ubiquitous data exchange format on the Web. Ingesting of JavaScript, but it is used in many programming lan- JSON documents can become a performance bottleneck guages. JSON has four primitive types or atoms (string, due to the sheer volume of data. We are thus motivated number, Boolean, null) that can be embedded within to make JSON parsing as fast as possible. composed types (arrays and objects). An object takes Despite the maturity of the problem of JSON pars- the form of a series of key-value pairs between braces, ing, we show that substantial speedups are possible. where keys are strings (e.g., f"name":"Jack","age":22g). We present the first standard-compliant JSON parser An array is a list of comma-separated values between to process gigabytes of data per second on a single brackets (e.g., [1,"abc",null]). Composed types can core, using commodity processors. We can use a quar- contain primitive types or arbitrarily deeply nested com- ter or fewer instructions than a state-of-the-art refer- posed types as values. See Fig. 1 for an example. The ence parser like RapidJSON. Unlike other validating JSON specification defines six structural characters (`[', parsers, our software (simdjson) makes extensive use of `f', `]', `g', `:', `,'): they serve to delimit the locations Single Instruction, Multiple Data (SIMD) instructions. and structure of objects and arrays. To ensure reproducibility, simdjson is freely available as To access the data contained in a JSON document open-source software under a liberal license.