The Evolution of System Reliability Optimization David Coit, Enrico Zio
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
An Architect's Guide to Site Reliability Engineering Nathaniel T
An Architect's Guide to Site Reliability Engineering Nathaniel T. Schutta @ntschutta ntschutta.io https://content.pivotal.io/ ebooks/thinking-architecturally Sofware development practices evolve. Feature not a bug. It is the agile thing to do! We’ve gone from devs and ops separated by a large wall… To DevOps all the things. We’ve gone from monoliths to service oriented to microserivces. And it isn’t all puppies and rainbows. Shoot. A new role is emerging - the site reliability engineer. Why? What does that mean to our teams? What principles and practices should we adopt? How do we work together? What is SRE? Important to understand the history. Not a new concept but a catchy name! Arguably goes back to the Apollo program. Margaret Hamilton. Crashed a simulator by inadvertently running a prelaunch program. That wipes out the navigation data. Recalculating… Hamilton wanted to add error-checking code to the Apollo system that would prevent this from messing up the systems. But that seemed excessive to her higher- ups. “Everyone said, ‘That would never happen,’” Hamilton remembers. But it did. Right around Christmas 1968. — ROBERT MCMILLAN https://www.wired.com/2015/10/margaret-hamilton-nasa-apollo/ Luckily she did manage to update the documentation. Allowed them to recover the data. Doubt that would have turned into a Hollywood blockbuster… Hope is not a strategy. But it is what rebellions are built on. Failures, uh find a way. Traditionally, systems were run by sys admins. AKA Prod Ops. Or something similar. And that worked OK. For a while. But look around your world today. -
Chapter 6 Structural Reliability
MIL-HDBK-17-3E, Working Draft CHAPTER 6 STRUCTURAL RELIABILITY Page 6.1 INTRODUCTION ....................................................................................................................... 2 6.2 FACTORS AFFECTING STRUCTURAL RELIABILITY............................................................. 2 6.2.1 Static strength.................................................................................................................... 2 6.2.2 Environmental effects ........................................................................................................ 3 6.2.3 Fatigue............................................................................................................................... 3 6.2.4 Damage tolerance ............................................................................................................. 4 6.3 RELIABILITY ENGINEERING ................................................................................................... 4 6.4 RELIABILITY DESIGN CONSIDERATIONS ............................................................................. 5 6.5 RELIABILITY ASSESSMENT AND DESIGN............................................................................. 6 6.5.1 Background........................................................................................................................ 6 6.5.2 Deterministic vs. Probabilistic Design Approach ............................................................... 7 6.5.3 Probabilistic Design Methodology..................................................................................... -
Existing Cybernetics Foundations - B
SYSTEMS SCIENCE AND CYBERNETICS – Vol. III - Existing Cybernetics Foundations - B. M. Vladimirski EXISTING CYBERNETICS FOUNDATIONS B. M. Vladimirski Rostov State University, Russia Keywords: Cybernetics, system, control, black box, entropy, information theory, mathematical modeling, feedback, homeostasis, hierarchy. Contents 1. Introduction 2. Organization 2.1 Systems and Complexity 2.2 Organizability 2.3 Black Box 3. Modeling 4. Information 4.1 Notion of Information 4.2 Generalized Communication System 4.3 Information Theory 4.4 Principle of Necessary Variety 5. Control 5.1 Essence of Control 5.2 Structure and Functions of a Control System 5.3 Feedback and Homeostasis 6. Conclusions Glossary Bibliography Biographical Sketch Summary Cybernetics is a science that studies systems of any nature that are capable of perceiving, storing, and processing information, as well as of using it for control and regulation. UNESCO – EOLSS The second title of the Norbert Wiener’s book “Cybernetics” reads “Control and Communication in the Animal and the Machine”. However, it is not recognition of the external similaritySAMPLE between the functions of animalsCHAPTERS and machines that Norbert Wiener is credited with. That had been done well before and can be traced back to La Mettrie and Descartes. Nor is it his contribution that he introduced the notion of feedback; that has been known since the times of the creation of the first irrigation systems in ancient Babylon. His distinctive contribution lies in demonstrating that both animals and machines can be combined into a new, wider class of objects which is characterized by the presence of control systems; furthermore, living organisms, including humans and machines, can be talked about in the same language that is suitable for a description of any teleological (goal-directed) systems. -
Introduction to Cybernetics and the Design of Systems
Introduction to Cybernetics and the Design of Systems © Hugh Dubberly & Paul Pangaro 2004 Cybernetics named From Greek ‘kubernetes’ — same root as ‘steering’ — becomes ‘governor’ in Latin Steering wind or tide course set Steering wind or tide course set Steering wind or tide course set Steering wind or tide course set correction of error Steering wind or tide course set correction of error Steering wind or tide course set correction of error correction of error Steering wind or tide course set correction of error correction of error Steering wind or tide course set correction of error correction of error Cybernetics named From Greek ‘kubernetes’ — same root as ‘steering’ — becomes ‘governor’ in Latin Cybernetic point-of-view - system has goal - system acts, aims toward the goal - environment affects aim - information returns to system — ‘feedback’ - system measures difference between state and goal — detects ‘error’ - system corrects action to aim toward goal - repeat Steering as a feedback loop compares heading with goal of reaching port adjusts rudder to correct heading ship’s heading Steering as a feedback loop detection of error compares heading with goal of reaching port adjusts rudder feedback to correct heading correction of error ship’s heading Automation of feedback thermostat heater temperature of room air Automation of feedback thermostat compares to setpoint and, if below, activates measured by heater raises temperature of room air The feedback loop ‘Cybernetics introduces for the first time — and not only by saying it, but -
Training-Sre.Pdf
C om p lim e nt s of Training Site Reliability Engineers What Your Organization Needs to Create a Learning Program Jennifer Petoff, JC van Winkel & Preston Yoshioka with Jessie Yang, Jesus Climent Collado & Myk Taylor REPORT Want to know more about SRE? To learn more, visit google.com/sre Training Site Reliability Engineers What Your Organization Needs to Create a Learning Program Jennifer Petoff, JC van Winkel, and Preston Yoshioka, with Jessie Yang, Jesus Climent Collado, and Myk Taylor Beijing Boston Farnham Sebastopol Tokyo Training Site Reliability Engineers by Jennifer Petoff, JC van Winkel, and Preston Yoshioka, with Jessie Yang, Jesus Climent Collado, and Myk Taylor Copyright © 2020 O’Reilly Media. All rights reserved. Printed in the United States of America. Published by O’Reilly Media, Inc., 1005 Gravenstein Highway North, Sebastopol, CA 95472. O’Reilly books may be purchased for educational, business, or sales promotional use. Online editions are also available for most titles (http://oreilly.com). For more infor‐ mation, contact our corporate/institutional sales department: 800-998-9938 or [email protected]. Acquistions Editor: John Devins Proofreader: Charles Roumeliotis Development Editor: Virginia Wilson Interior Designer: David Futato Production Editor: Beth Kelly Cover Designer: Karen Montgomery Copyeditor: Octal Publishing, Inc. Illustrator: Rebecca Demarest November 2019: First Edition Revision History for the First Edition 2019-11-15: First Release See http://oreilly.com/catalog/errata.csp?isbn=9781492076001 for release details. The O’Reilly logo is a registered trademark of O’Reilly Media, Inc. Training Site Reli‐ ability Engineers, the cover image, and related trade dress are trademarks of O’Reilly Media, Inc. -
A Quantitative Reliability, Maintainability and Supportability Approach for NASA's Second Generation Reusable Launch Vehicle
A Quantitative Reliability, Maintainability and Supportability Approach for NASA's Second Generation Reusable Launch Vehicle Fayssai M. Safie, Ph. D. Marshall Space Flight Center Huntsville, Alabama Tel: 256-544-5278 E-mail: Fayssal.Safie @ msfc.nasa.gov Charles Daniel, Ph.D. Marshall Space Flight Center Huntsville, Alabama Tel: 256-544-5278 E-mail: Charles.Daniel @msfc.nasa.gov Prince Kalia Raytheon ITSS Marshall Space Flight Center Huntsville, Alabama Tel: 256-544-6871 E-mail: Prince.Kalia @ msfc.nasa.gov ABSTRACT The United States National Aeronautics and Space Administration (NASA) is in the midst of a 10-year Second Generation Reusable Launch Vehicle (RLV) program to improve its space transportation capabilities for both cargo and crewed missions. The objectives of the program are to: significantly increase safety and reliability, reduce the cost of accessing low-earth orbit, attempt to leverage commercial launch capabilities, and provide a growth path for manned space exploration. The safety, reliability and life cycle cost of the next generation vehicles are major concerns, and NASA aims to achieve orders of magnitude improvement in these areas. To get these significant improvements, requires a rigorous process that addresses Reliability, Maintainability and Supportability (RMS) and safety through all the phases of the life cycle of the program. This paper discusses the RMS process being implemented for the Second Generation RLV program. 1.0 INTRODUCTION The 2nd Generation RLV program has in place quantitative Level-I RMS, and cost requirements [Ref 1] as shown in Table 1, a paradigm shift from the Space Shuttle program. This paradigm shift is generating a change in how space flight system design is approached. -
3 System Design 71 NYS Project Management Guidebook
Section III:3 System Design 71 NYS Project Management Guidebook 3 SYSTEM DESIGN Purpose The purpose of System Design is to create a technical solution that satisfies the functional requirements for the system. At this point in the project lifecycle there should be a Functional Specification, written primarily in business terminology, con- taining a complete description of the operational needs of the various organizational entities that will use the new system. The challenge is to translate all of this information into Technical Specifications that accurately describe the design of the system, and that can be used as input to System Construction. The Functional Specification produced during System Require- ments Analysis is transformed into a physical architecture. System components are distributed across the physical archi- tecture, usable interfaces are designed and prototyped, and Technical Specifications are created for the Application Developers, enabling them to build and test the system. Many organizations look at System Design primarily as the preparation of the system component specifications; however, constructing the various system components is only one of a set of major steps in successfully building a system. The prepara- tion of the environment needed to build the system, the testing of the system, and the migration and preparation of the data that will ultimately be used by the system are equally impor- tant. In addition to designing the technical solution, System Design is the time to initiate focused planning efforts for both the testing and data preparation activities. List of Processes This phase consists of the following processes: N Prepare for System Design, where the existing project repositories are expanded to accommodate the design work products, the technical environment and tools needed to support System Design are established, and training needs of the team members involved in System Design are addressed. -
Collection System Manager
CITY OF DALY CITY JOB SPECIFICATION EXEMPT POSITION COLLECTION SYSTEM MANAGER DEFINITION Under the general direction of the Director of Water and Wastewater Resources or authorized representative, supervises all activities of the Collection System Maintenance Division, including wastewater collection and recycled water systems, and performs other duties as assigned. EXAMPLES OF DUTIES Manage, supervise, schedule, and review the work of maintenance personnel in the maintenance, inspection and repair of wastewater collection and recycled water distribution systems in the Collection System Maintenance Division of the Department of Water and Wastewater Resources; ensure safe, efficient and effective compliance with local, state and federal laws, rules and regulations, and industrial practices. Implement City, Department and Division Rules and Regulations, policies, procedures and practices. Ensure that required records are accurately maintained. Oversee a comprehensive preventive maintenance program of facilities. Ensure that supervisors conduct regular safety training and maintain a safe work environment. Develop and maintain training programs for employees; ensure that employees maintain required certification. Formally evaluate or ensure the evaluation and satisfactory job performance of assigned employees. Manage crews in support of City departments in maintenance operations, such as the storm drain collection system maintenance and construction program. Work to complete Department and Division goals and objectives. Prepare and manage the Division budget; propose, develop and manage capital projects and manage contracts for the Division. Prepare reports, and make presentations to the City Manager, City Council or others as required. When required, maintain 24-hour availability to address Division emergencies. Develop and maintain good working relationships with supervisors, coworkers, elected officials, professional peer groups and the public. -
Reliability: Software Software Vs
Reliability Theory SENG 521 Re lia bility th eory d evel oped apart f rom th e mainstream of probability and statistics, and Software Reliability & was usedid primar ily as a tool to h hlelp Software Quality nineteenth century maritime and life iifiblinsurance companies compute profitable rates Chapter 5: Overview of Software to charge their customers. Even today, the Reliability Engineering terms “failure rate” and “hazard rate” are often used interchangeably. Department of Electrical & Computer Engineering, University of Calgary Probability of survival of merchandize after B.H. Far ([email protected]) 1 http://www. enel.ucalgary . ca/People/far/Lectures/SENG521/ ooene MTTF is R e 0.37 From Engineering Statistics Handbook [email protected] 1 [email protected] 2 Reliability: Natural System Reliability: Hardware Natural system Hardware life life cycle. cycle. Aging effect: Useful life span Life span of a of a hardware natural system is system is limited limited by the by the age (wear maximum out) of the system. reproduction rate of the cells. Figure from Pressman’s book Figure from Pressman’s book [email protected] 3 [email protected] 4 Reliability: Software Software vs. Hardware So ftware life cyc le. Software reliability doesn’t decrease with Software systems time, i.e., software doesn’t wear out. are changed (updated) many Hardware faults are mostly physical faults, times during their e. g., fatigue. life cycle. Each update adds to Software faults are mostly design faults the structural which are harder to measure, model, detect deterioration of the and correct. software system. Figure from Pressman’s book [email protected] 5 [email protected] 6 Software vs. -
Work System Theory: Overview of Core Concepts, Extensions, and Challenges for the Future Steven Alter University of San Francisco, [email protected]
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by University of San Francisco The University of San Francisco USF Scholarship: a digital repository @ Gleeson Library | Geschke Center Business Analytics and Information Systems School of Management February 2013 Work System Theory: Overview of Core Concepts, Extensions, and Challenges for the Future Steven Alter University of San Francisco, [email protected] Follow this and additional works at: http://repository.usfca.edu/at Part of the Business Administration, Management, and Operations Commons, Management Information Systems Commons, and the Technology and Innovation Commons Recommended Citation Alter, Steven, "Work System Theory: Overview of Core Concepts, Extensions, and Challenges for the Future" (2013). Business Analytics and Information Systems. Paper 35. http://repository.usfca.edu/at/35 This Article is brought to you for free and open access by the School of Management at USF Scholarship: a digital repository @ Gleeson Library | Geschke Center. It has been accepted for inclusion in Business Analytics and Information Systems by an authorized administrator of USF Scholarship: a digital repository @ Gleeson Library | Geschke Center. For more information, please contact [email protected]. Research Article Work System Theory: Overview of Core Concepts, Extensions, and Challenges for the Future Steven Alter University of San Francisco [email protected] Abstract This paper presents a current, accessible, and overarching view of work system theory. WST is the core of an integrated body of theory that emerged from a long-term research project to develop a systems analysis and design method for business professionals called the work system method (WSM). -
Software Reliability and Dependability: a Roadmap Bev Littlewood & Lorenzo Strigini
Software Reliability and Dependability: a Roadmap Bev Littlewood & Lorenzo Strigini Key Research Pointers Shifting the focus from software reliability to user-centred measures of dependability in complete software-based systems. Influencing design practice to facilitate dependability assessment. Propagating awareness of dependability issues and the use of existing, useful methods. Injecting some rigour in the use of process-related evidence for dependability assessment. Better understanding issues of diversity and variation as drivers of dependability. The Authors Bev Littlewood is founder-Director of the Centre for Software Reliability, and Professor of Software Engineering at City University, London. Prof Littlewood has worked for many years on problems associated with the modelling and evaluation of the dependability of software-based systems; he has published many papers in international journals and conference proceedings and has edited several books. Much of this work has been carried out in collaborative projects, including the successful EC-funded projects SHIP, PDCS, PDCS2, DeVa. He has been employed as a consultant to industrial companies in France, Germany, Italy, the USA and the UK. He is a member of the UK Nuclear Safety Advisory Committee, of IFIPWorking Group 10.4 on Dependable Computing and Fault Tolerance, and of the BCS Safety-Critical Systems Task Force. He is on the editorial boards of several international scientific journals. 175 Lorenzo Strigini is Professor of Systems Engineering in the Centre for Software Reliability at City University, London, which he joined in 1995. In 1985-1995 he was a researcher with the Institute for Information Processing of the National Research Council of Italy (IEI-CNR), Pisa, Italy, and spent several periods as a research visitor with the Computer Science Department at the University of California, Los Angeles, and the Bell Communication Research laboratories in Morristown, New Jersey. -
Rocket D3 Database Management System
datasheet Rocket® D3 Database Management System Delivering Proven Multidimensional Technology to the Evolving Enterprise Ecient Performance: C#, and C++. In addition, Java developers can access Delivers high performance via Proven Technology the D3 data files using its Java API. The MVS Toolkit ecient le management For over 30 years, the Pick Universal Data Model (Pick provides access to data and business processes via that requires minimal system UDM) has been synonymous with performance and and memory resources both SOAP and RESTful Web Services. reliability; providing the flexible multidimensional Scalability and Flexibility: database infrastructure to develop critical transac- Scales with your enterprise, tional and analytical business applications. Based on from one to thousands of Why Developers the Pick UDM, the Rocket® D3 Database Manage- users ment System offers enterprise-level scalability and Choose D3 D3 is the choice of more than a thousand applica- Seamless Interoperability: efficiency to support the dynamic growth of any tion developers world-wide—serving top industries Interoperates with varied organization. databases and host including manufacturing, distribution, healthcare, environments through government, retail, and other vertical markets. connectivity tools Rapid application development and application Rocket D3 database-centric development environ- customization requires an underlying data structure Data Security: ment provides software developers with all the that can respond effectively to ever-changing Provides secure, simultaneous necessary tools to quickly adapt to changes and access to the database from business requirements. Rocket D3 is simplistic in its build critical business applications in a fraction of the remote or disparate locations structure, yet allows for complex definitions of data time as compared to other databases; without worldwide structures and program logic.