Uncertainty Modelling for Scarce and Imprecise Data in Engineering Applications
Total Page:16
File Type:pdf, Size:1020Kb
Uncertainty Modelling for Scarce and Imprecise Data in Engineering Applications Thesis submitted in accordance with the requirements of the University of Liverpool for the degree of Doctor in Philosophy by Jonathan Cyrus Sadeghi January 2020 Uncertainty Modelling for Scarce and Imprecise Data in Engineering Applications Abstract In this thesis, models for uncertainty quantification in the case of scarce and imprecise data are described, and the computational efficiency of simulations with these models is improved. Specifically, probability boxes are used to describe imprecision in cumulative distribution functions. This may be the case when imprecise data is used to train a model, or the prior knowledge regarding a property of the system being studied is very weak. Performing simulations with probability boxes is often computationally expensive, because an optimisation program must be solved to obtain each sample in a Monte Carlo simulation. When the system model is known analytically, it is possible to significantly reduce the cost of the analysis. However, the system model is often a black box which can only be queried for a particular point value of the input. Each evaluation or query of the system model is often computationally expensive in itself. Currently, few efficient methods exist to perform computations with probability boxes, and the techniques which exist do not provide rigorous bounds on the obtained probability of failure. Interval Predictor Models are a technique to create an approximate representation of a function, where the uncertainty in the true function is described as an interval, with statistical guarantees on the coverage of the true function. This thesis proposes the use of Interval Predictor Models to create an approximate surrogate model for the true black box system model and hence obtain rigorous bounds on the probability of failure of a system. Techniques are described to create Interval Predictor Models which are tailored to model the performance of a system for reliability analysis. This thesis also describes analytical techniques which can be used for probabilistic safety analysis, in the case that the system model is not a black box. This is advantageous as it enables engineers to perform calculations without spending time programming complex Monte Carlo simulations. A technique is presented to efficiently create Interval Predictor Models for datasets of arbitrary complexity and size, which may contain imprecise data, and we call these models Interval Neural Networks. Case studies or numerical examples are presented to demonstrate the performance of the proposed techniques, including some common benchmarks and a finite element model. The interval predictor models used in this thesis were implemented in the open source uncertainty quantification software OpenCossan, and are now freely i Jonathan Cyrus Sadeghi available.∗ ∗https://github.com/cossan-working-group/OpenCossan/tree/development/+opencossan/ +metamodels/@IntervalPredictorModel ii Acknowledgements I am grateful for the academic support of my supervisors Dr. Edoardo Patelli and Dr. Marco de Angelis. Edoardo Patelli has introduced me to the field of uncertainty quantification, whilst always giving me the opportunity to grow as a researcher. In particular, he has allowed me to contribute to the software development of OpenCossan, which has provided me with an invaluable educational experience. I am very grateful to him for having taken me on board. Marco de Angelis has provided excellent guidance, even before he joined the supervisory team for my PhD. His explanations of concepts in imprecise probability have had a profound impact on my understanding of the field, and these conversations have shaped my opinion of how reliable engineering can be achieved. In addition, he has frequently provided practical assistance to assist me with the OpenCossan software. I gratefully acknowledge the financial support of Wood Plc (formerly Amec Foster Wheeler) and the Engineering and Physical Sciences Research Council (EPSRC Grant reference: EP/L015390/1), which enabled me to undertake my studies. In addition, the Next Generation Nuclear centre for doctoral training provided many opportunities to develop my knowledge of the nuclear industry. The Institute for Risk and Uncertainty at the University of Liverpool was also an important part of doctoral studies, where I benefited from frequent seminars and events. The welcoming atmosphere of the Institute provided an enjoyable environment in which to conduct my studies. During my time there, I met many collaborators and friends, who are too numerous to mention on this page and hence I apologise to anyone who I have not mentioned. The contribution of Prof. Nawal Prinja (Wood Plc) to the work in Chapter 5 is gratefully acknowledged. Specifically, Prof. Prinja was responsible for the description of the structural experiments and devising the structural model to which he applied the First Order Reliability Method. Conversations with Prof. Prinja also inspired many of the developments in Chapter 6. His decades of experience in the nuclear industry have been a great inspiration. The contribution of Dr. Matthias Faes to Chapter 7, is gratefully acknowledged. Specifically, Dr. Faes contributed the benchmarks to which the Interval Predictor Model methodology was compared in the case study, in addition to a large amount of the text for the original paper, on which this chapter is based (not including novel contributions related to Interval Predictor Models by the present author). iii Jonathan Cyrus Sadeghi I owe a debt of gratitude to Phoebe, my partner and closest friend, who has been beside me to continually raise my spirits during my PhD. Finally, I wish to thank my family, and in particular my parents, for their invaluable support over the years. They have always pushed me forward at difficult times throughout my studies. iv Declaration I hereby confirm that the results presented in this dissertation are from my own work and that I have not presented anyone else's work and that full and appropriate acknowledgements have been given where references have been made to the work of others. This dissertation contains 202 pages, 59 figures, 23 tables and 38362 words. Jonathan Sadeghi v Jonathan Cyrus Sadeghi vi Contents Abstract i Acknowledgements iii Declaration v Contents x List of Figures xv List of Tables xvii List of Published Work xviii Nomenclature xx 1 Introduction 1 1.1 Motivation . .1 1.1.1 Uncertainty in engineering simulation . .1 1.1.2 Reliability engineering . .2 1.1.3 Models of uncertainty . .3 1.2 Problem definition and objectives . .4 1.3 Structure of this thesis . .5 2 Models of Uncertainty 9 2.1 Overview of uncertainty models . .9 2.1.1 Probabilistic models of uncertainty . 10 2.1.2 Set-based models of uncertainty . 14 2.1.3 Imprecise probabilistic models of uncertainty . 17 2.2 Creating models in practice . 20 2.2.1 Choosing a model . 20 2.2.2 Training models from data . 21 vii Jonathan Cyrus Sadeghi 2.2.3 Creating uncertainty models without data . 25 2.2.4 Validating a trained model . 27 2.3 Chapter summary . 28 3 Machine Learning of Regression Models 30 3.1 Parametric regression models . 30 3.1.1 Bayesian parameter learning . 33 3.1.2 Validation . 36 3.1.3 The bias-variance tradeoff . 37 3.2 Non-parametric models . 37 3.3 Learning bounds on a model . 38 3.3.1 Training interval predictor models . 39 3.3.2 Validating models with the scenario approach . 44 3.3.3 Software for interval predictor models . 48 3.4 Chapter summary . 48 4 Reliability Analysis 49 4.1 Reliability analysis with random variables . 49 4.1.1 Problem definition . 49 4.1.2 Methods to compute the failure probability . 52 4.2 Convex set models for reliability . 58 4.2.1 Problem definition . 58 4.3 Reliability analysis with probability boxes . 59 4.3.1 Problem definition . 59 4.3.2 Methods to compute the failure probability . 60 4.4 Chapter summary . 62 5 Structural Reliability of Pre-stressed Concrete Containments under Dis- tributional Uncertainty 63 5.1 Introduction . 63 5.2 Structural model . 64 5.3 Analysis . 68 5.4 Discussion . 70 5.5 Chapter summary . 79 6 Analytic Imprecise Probabilistic Safety Analysis 81 6.1 Introduction . 81 6.2 Probabilistic safety analysis . 82 6.3 Probability bounds analysis . 85 6.3.1 Fragility curve . 85 6.3.2 Product of log-normally distributed random variables . 89 viii Uncertainty Modelling for Scarce and Imprecise Data in Engineering Applications 6.3.3 Failure probability . 90 6.3.4 Summary of failure probability expressions . 93 6.3.5 Imprecise FORM . 98 6.4 Numerical examples . 99 6.4.1 Reliability analysis of a simple concrete containment . 99 6.4.2 Containment with additive component strengths . 101 6.5 Chapter summary . 103 7 Interval Predictor Models for Reliability Analysis 104 7.1 Introduction . 104 7.2 Interval failure probability . 105 7.3 Uncertain surrogate model predictions . 106 7.3.1 Kriging . 107 7.3.2 Interval predictor models . 108 7.4 Case study . 110 7.4.1 Advanced Monte Carlo sampling . 111 7.4.2 Surrogate model based estimation . 115 7.4.3 When is creating an IPM surrogate worthwhile? . 117 7.5 Conclusions . 119 8 Interval Predictor Models for Propagation of Probability Boxes 120 8.1 Introduction . 120 8.2 Proposed approaches: obtaining bounds on the failure probability . 121 8.2.1 Approach 1: metamodels for na¨ıve double loop approach . 122 8.2.2 Approach 2: IPMs trained on propagated focal elements . 123 8.2.3 Approach 3: metamodels for non-na¨ıve approach . 124 8.2.4 Confidence bounds on failure probability . 126 8.3 Numerical examples . 127 8.3.1 Cantilever beam . 127 8.3.2 Dynamic response of a non-linear oscillator . 129 8.3.3 Small satellite . 137 8.4 Chapter summary . 139 9 Interval Neural Networks 143 9.1 Introduction .