The Life-Cycle of Data

With an eye on integration of embedded devices with the Cloud, Raima CTO Wayne Warren differentiates between live, actionable information and data with ongoing value, and argues that to realise the true power of the Cloud, businesses must utilize the power of the collecting and controlling computers on the edge of the grid.

The rise of the Cloud has presented companies of all sizes with new opportunities to store, manage and analyze data – easily, effectively and at low cost. Data management in the Cloud has enabled these companies to reduce their in-house systems costs and complexity, while actually gaining increased visibility on plant and processes. At the same time, third party service organisations have emerged, providing data dashboards that give companies 'live real time control' of their assets, often from remote locations, as well as historical trend analysis.

Consider, for example, a company at a central location with a key asset in an entirely different or isolated location. It may be advantageous to monitor key operational data to ensure the equipment itself is not trending towards some catastrophic fault, and some performance data to ensure output is optimal. That might be relatively few sensors overall, and perhaps some diagnostics feedback from onboard control systems. But getting at that data directly might mean setting up embedded web servers or establishing some form of telemetry, and then getting that data into management software and delivering it in a form that enables it to be acted upon.

How much easier to simply provide those same outputs to a Cloud-based data management provider, and then log in to a customized dashboard that provides visualization and control, complete with alarms, actions, reports and more? And all for a nominal monthly fee.
Further, with virtually unlimited storage in the Cloud, all data can be stored, mined, analyzed and disseminated as reports that provide unprecedented levels of traceability (important to many sectors of industry) and long term trend analysis that can really help companies to boost performance and, ultimately, improve profitability.

As our data output increases, it might seem reasonable to expect that the quality of information being returned from the Cloud should improve as well, enabling us to make better operational decisions that improve performance still further. And to an extent, this is true. But there is also danger on that path, because as we move into an era of 'big data', it is becoming increasingly difficult to pull meaningful, 'actionable information' out from the background noise.

Where once a data analyst might simply have been interested in production line quotas and the link to plant or asset uptime, today they may also be interested in accessing the data generated by the myriad automated devices along the production line, because that raw data may well hold the key to increased productivity, reduced energy consumption, elimination of waste, reduction in down time, improved overall equipment effectiveness, and ultimately a better bottom line.

Date: 09/10/2014 RaimaDMA025D page 1 / 6

And we really are talking about huge amounts of data. The rise of the 'Internet of Things' and machine-to-machine (M2M) communications, combined with the latest GSM networks that deliver high-speed, bi-directional transfer without the limitations of range, power, data size and network infrastructure that held back traditional telematics solutions, has seen data transmission increase exponentially in the last few years. As of 2012, across the globe over 2.5 exabytes (2.5 × 10^18 bytes) of data were being created every day, and it is certainly not unusual for individual companies to be generating hundreds of gigabytes of data.
Importantly, different types of data will have different lifecycles, and this impacts on how that data needs to be managed. Phasor measurement devices, for example, which monitor variables on the power grid that highlight changes in frequency, power, voltage and so on, might generate perhaps a few terabytes of information per month. Certainly this is a lot of data, and it has a mixture of lifecycles: long term information indicative of trends, and live data that can flag up an immediate fault. A complex product test, by contrast, might generate the same volume of information in an hour or less, but again there will be a mixture of data lifecycles: the complex information that provides a pass/fail output for the test needs to be immediately available to optimise production cycles, but has no value subsequently, while the overview information might be important to store for traceability reasons.

The common thread, however, is the large amount of data being generated. Indeed, there is so much information that it is no longer meaningful to measure today's data in terms of the number of records, but rather by the velocity of the stream.

Live data – that is, captured data about something happening right now – is available in great quantities and at low cost. Sensors on embedded and real-time computers are able to capture information at a rate that exceeds our ability to use it. That means that the moment for which any given volume of data has real value may well come and go faster than we can actually exploit it. If our only response is simply to send all of that data to the Cloud, with no regard for the lifecycle of the data, then the Cloud becomes little more than a dumping ground for data that may well have no ongoing value. It is vital, then, to consider the life-cycle of live data, and how that data is best distributed between embedded devices and the Cloud.
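To put the idea of stream velocity into numbers, the two examples above can be compared with some back-of-envelope arithmetic. This is only a rough sketch: the figure of 3 TB is an assumed stand-in for "a few terabytes", a 30-day month is assumed, and the function name is purely illustrative.

```python
# Back-of-envelope stream velocity for the two examples in the text.
# ASSUMPTION: 3 TB stands in for "a few terabytes"; a month is taken as 30 days.

TB = 10**12  # decimal terabyte, in bytes
SECONDS_PER_HOUR = 3600
SECONDS_PER_MONTH = 30 * 24 * SECONDS_PER_HOUR


def velocity_mb_per_s(volume_bytes, duration_s):
    """Average stream velocity in megabytes per second."""
    return volume_bytes / duration_s / 10**6


volume = 3 * TB
grid = velocity_mb_per_s(volume, SECONDS_PER_MONTH)  # grid monitoring, over a month
test = velocity_mb_per_s(volume, SECONDS_PER_HOUR)   # product test, over an hour

print(f"grid monitoring: {grid:.2f} MB/s")
print(f"product test:    {test:.0f} MB/s")
print(f"ratio:           {test / grid:.0f}x")
```

Under these assumed figures the grid monitoring stream averages a little over 1 MB/s, while the product test runs at roughly 830 MB/s: the same volume of data, but a velocity some 720 times higher, and so a very different data management problem.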
For Cloud resources to be truly optimized while enabling meaningful operational decisions to be made locally, in the moment, the power of embedded systems on the edge of the grid must be fully utilized. Only by delegation of responsibilities for data collection, filtering and decision making to the increasingly powerful computers deployed within the 'Internet of Things' can we have effective management of data from its inception to disposal.

The embedded database industry has responded to this requirement with data management products that deliver the requisite performance and availability in products that are readily scalable. These data management products can take the captured live data, process it (aggregating and simplifying the data as required) and then distribute it to deliver the visualization and analytics that will enable meaningful decisions to be made. The ability to do all of this locally within embedded systems – acting on data that is only of real value in the moment – has a huge impact on the performance of plant and assets, while the data that has ongoing value can be sorted and sent to the Cloud.

Consider, for example, the testing of consumer products where the way the product sounds or feels is taken as an indicator of its quality. Such quality testing is common in a host of domestic and automotive products, which possess intrinsic vibration and sound characteristics that may be used as indicators of mechanical integrity. A part under test might be subjected to a period of controlled operation while measuring millions of data points. A multitude of metrics and algorithms need to be applied to this data to create a 'signature', which determines whether the product passes or fails the quality check. Raima was involved in just such an application, in a market where production cycle times were critical and where new data sets were being generated every two seconds.
The live data had to be acted on in real time to match the required production cycle time while providing reliable pass/fail information. At the same time, it is important to aggregate, manage and store the essential test information for the long term so that in the event of an operational fault or a customer complaint, the product serial number can be quickly checked against the test history. It is important to be able to reprocess the historical data when considering warranty costs or perhaps even the need for a product batch recall. This is a very clear differentiation of data lifecycles – historical data that can be aggregated, sorted and then stored for the long term (ideal for the Cloud), and live data that impacts directly on production performance.

When we talk about performance, we do not necessarily have to think about 'real time' response in a deterministic sense for streaming data, but we must have 'live real time' response that is simply fast enough to work with live information that appears quickly and has a short life-cycle. The database might need to keep up with data rates that may measure thousands of events per minute, with burst rates many times higher, and must be able to raise alarms or trigger additional actions when particular conditions are met. Those conditions might involve the presence or absence of data in the database, so quick lookups must be performed. They may also depend on connections between records in the database, so the database system needs to be able to maintain associations and lookups that can be quickly created or queried.

The high-speed processors in modern computer systems play a part, but increasingly meeting performance requirements depends on scalability, which comes from the ability to distribute the database operations across multiple CPUs and multiple processor cores.
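The alarm conditions described above, based on the presence or absence of data and on associations between records, might be sketched as follows. This is a minimal illustration only: Python's built-in sqlite3 module is used as a stand-in for an embedded database and is not Raima's API, and the table names, column names, serial numbers and alarm messages are all hypothetical.

```python
import sqlite3

# ASSUMPTION: sqlite3 stands in for an embedded database; this is not the RDM API.
# Table and column names are hypothetical.
db = sqlite3.connect(":memory:")
db.execute("PRAGMA foreign_keys = ON")
db.executescript("""
    CREATE TABLE unit (
        serial TEXT PRIMARY KEY          -- indexed: quick lookup by serial number
    );
    CREATE TABLE test_result (
        id     INTEGER PRIMARY KEY,
        serial TEXT NOT NULL REFERENCES unit(serial),  -- association to the unit
        passed INTEGER NOT NULL,
        ts     REAL NOT NULL
    );
    CREATE INDEX idx_result_serial ON test_result(serial);
""")


def record_test(serial, passed, ts):
    """Store one pass/fail result and link it to its unit."""
    with db:  # commits on success, rolls back on error
        db.execute("INSERT OR IGNORE INTO unit(serial) VALUES (?)", (serial,))
        db.execute(
            "INSERT INTO test_result(serial, passed, ts) VALUES (?, ?, ?)",
            (serial, int(passed), ts),
        )


def alarms(serial):
    """Return alarm messages for one unit, based on what the database holds."""
    rows = db.execute(
        "SELECT passed FROM test_result WHERE serial = ? ORDER BY ts",
        (serial,),
    ).fetchall()
    if not rows:
        return [f"{serial}: no test record found"]  # absence of data
    if not rows[-1][0]:
        return [f"{serial}: latest test failed"]    # presence of a failure
    return []


record_test("SN-001", True, 1.0)
record_test("SN-002", True, 2.0)
record_test("SN-002", False, 3.0)

for sn in ("SN-001", "SN-002", "SN-003"):
    print(sn, alarms(sn))
```

The same indexed lookup by serial number is what would let a test history be checked quickly in the event of a customer complaint; a production system would also need to consider retention, concurrency and the burst rates mentioned above.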
Distributing database operations across multiple cores not only makes best use of available resources, but also opens up possibilities for parallel data access, allowing very fast throughput.

Consider the example of wind turbine control, where operators need to constantly monitor variables such as wind speed, vibration and temperature. Because wind turbines are often in remote locations and are unmanned, a database is required that can store large amounts of data – perhaps in the order of terabytes per day – and that will continue to operate reliably 24/7 without intervention.
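As a rough illustration of how the two data lifecycles might be separated on a turbine's embedded computer, the sketch below acts on each live reading locally and reduces a window of readings to a small summary of the kind that could be kept for the long term. The alarm limits, field names and sample values are assumptions made for the example, not figures from any real turbine.

```python
from dataclasses import dataclass
from statistics import mean

# ASSUMPTION: hypothetical alarm limits; real values depend on the turbine.
VIBRATION_LIMIT = 8.0     # mm/s
TEMPERATURE_LIMIT = 90.0  # degrees C


@dataclass
class Reading:
    wind_speed: float   # m/s
    vibration: float    # mm/s
    temperature: float  # degrees C


def check_live(r):
    """Act on live data locally: return any alarms for a single reading."""
    out = []
    if r.vibration > VIBRATION_LIMIT:
        out.append("vibration high")
    if r.temperature > TEMPERATURE_LIMIT:
        out.append("temperature high")
    return out


def summarize(window):
    """Reduce a window of readings to a small record with ongoing value."""
    return {
        "samples": len(window),
        "wind_speed_mean": mean(r.wind_speed for r in window),
        "vibration_max": max(r.vibration for r in window),
        "temperature_max": max(r.temperature for r in window),
    }


window = [
    Reading(11.0, 2.1, 61.0),
    Reading(12.0, 2.4, 62.0),
    Reading(13.0, 9.5, 63.0),  # vibration spike
]

for reading in window:
    for alarm in check_live(reading):
        print("ALARM:", alarm)  # handled at the edge, in the moment

print(summarize(window))        # only a summary like this would go to the Cloud
```

The design choice here is simply that the raw readings never need to leave the device: alarms are raised where the data is captured, and only the aggregate is passed on.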