Industrial Big Data Analytics for Quality Improvement in Complex Systems
Dr. Kaibo Liu
Department of Industrial and Systems Engineering
University of Wisconsin-Madison
Lab for System Informatics and Data Analytics (SIDA)

Background
• A.P. 2013-now, Department of Industrial and Systems Engineering, UW-Madison
• Ph.D. 2013, Industrial Engineering (Minor: Machine Learning), Georgia Institute of Technology
• M.S. 2011, Statistics, Georgia Institute of Technology
• B.S. 2009, Industrial Engineering and Engineering Management, Hong Kong University of Science and Technology, Hong Kong

My Research & Expertise
Research Interests
System Informatics and data analytics:
• Complex system modeling and performance assessment
• Data fusion for online process monitoring, diagnosis and prognostics
• Statistical learning, data mining, and decision making
Expertise
Multidisciplinary approach: Engineering; Operation Research/Control; Statistics/Machine Learning
Multi-disciplinary Research
• Sensor Measurement and Monitoring Strategy
• System Degradation Analysis and Prognostics
• Spatiotemporal Field Modeling and Prediction
Overall, my research goal is to make sense of big data for better decision making!

Sensor Measurement and Monitoring Strategy

Objective-oriented sensor system designs in complex systems
Objective
• Obtain an optimal sensor allocation design at minimum cost under different user-specified quality requirements
Approaches
• The best allocation subsets by intelligent search (BASIS) algorithm, which intelligently searches for the optimal sensor allocation solution
• Features
  o Consider the trade-off of detection speed, fault diagnosis accuracy, and cost savings
Results Summary
• Ensure customer satisfaction by optimally designing the sensor allocation strategy
• The average cycle time, cost and inventory level can be greatly reduced
• Algorithms have been tested in several applications, e.g., the hot forming and the cap alignment processes
• Supported several students
(Figure: Effectively search for optimal sensor system design solutions)

Causation-based monitoring, diagnosis and control
Objective
• Transform from existing correlation-based techniques into a new causation-based quality control paradigm to achieve effective online quality monitoring and inference, root cause diagnosis, and proactive process control
Approaches
• Features
  o Engineering knowledge enhanced causal modeling
  o Causation-based online quality monitoring, inference, and diagnosis
  o Causation-based online feed-forward and feed-back process control
Results Summary
• Establish a series of causation-based monitoring, diagnosis and control techniques for quality improvement in complex systems
• Algorithms have been tested in the hot forming, the cap alignment, and the rolling processes
• Supported several students
(Figure: improved efficiency, yield, and quality)

Online monitoring of Big Data Streams
Objective
• Create a new paradigm of dynamic data-driven modeling, sampling and monitoring schemes for Big Data Streams (e.g., video streams)
Approaches
• A self-updated statistical model to fully characterize the changing background
• A dynamic, data-driven sampling strategy subject to practical resource constraints
• A scalable and robust statistical process control method tailored for Big Data Streams
• Features
  o Scalability: linear complexity that ensures practical implementation
  o Adaptability: automatically localize the anomaly regions without any prior knowledge
Results Summary
• Establish a series of real-time monitoring methodologies that are tailored for Big Data Streams for quick anomaly detection (either cyber or physical) and localization
• Algorithms have been tested in various applications, e.g., diaper manufacturing, climate monitoring and solar flare detection
• Supported several students
(Figure: Examples of thermal profiles on the polishing pad during CMP process under different conditions)
(Figure: Maximize the detection capability with practical resource constraints)

Dynamic Data-Driven Modeling, Sampling and Monitoring for Real-Time Solar Flare Detection
• A dynamically updated spatial-temporal statistical model that fully characterizes the changing background
• A dynamic sampling algorithm that actively decides which data streams to observe given the resource constraints
• A scalable and robust SPC that effectively combines the information from significant data streams to produce an overall global monitoring system
(Figure: original and updated solar images; DDDAS framework loop: update model, sample data, update sampling, dynamic sampling, SPC chart, update SPC; panels (a) Applications, (b) Applications modeling, (c) Application measurement systems and methods, (d) Mathematical and statistical algorithms)

Sensor Measurement and Monitoring Strategy
• Objective-Oriented Optimal Sensor Allocation Strategy: determine the minimum number of sensors needed given user-specified requirements
• Adaptive Sensor Allocation Strategy: adaptively adjust sensor allocation in a Bayesian Network to enhance monitoring and diagnosis
• A Top-r based Adaptive Sampling Strategy: monitor normally distributed big data streams online in the context of limited resources
• A Nonparametric Adaptive Sampling Strategy: monitor non-normal big data streams online in the context of limited resources
• Effective Online Data Monitoring and Saving Strategy: intelligently select and record the most informative extreme values in the simulation data
• A Spatial Adaptive Sampling Procedure: leverage the spatial information and adaptively and intelligently integrate two seemingly contradictory ideas (wide and deep searches)
• A Rank-based Sampling Algorithm by Data Augmentation: automatically augment information for unobservable variables based on the online observations

System Degradation Modeling and Prognostics

Internet of Things-enabled Condition-based Monitoring, Diagnosis, and Prognostics
Objective
• Leverage condition monitoring signals collected from multiple and heterogeneous sensors to better visualize and assess the current system health status and predict its future behavior in real time
Approaches
• Novel data fusion methods that select the best sensors and combine their information to construct health indices for system performance assessment and visualization, $h_{i,t} = f(\mathbf{x}_{i,\cdot,t})$
• Features
  o Combine data-driven approaches and engineering principles governing the underlying failure mechanism to ensure satisfactory performance
Results Summary
• Establish a series of data fusion methodologies that are tailored for IoT-enabled service systems for health status visualization, characterization and prediction
• Algorithms have been tested in various applications, e.g., engine health monitoring, Alzheimer's disease and forklift management
• Supported several students
(Figure: Aircraft engine diagram)
(Figure: Better health status characterization; Better fault diagnosis; Better RUL prediction)

Case Study – Engine RUL prediction
• Optimal weights $\mathbf{w}^*$: $h_i(t) = \mathbf{L}_i(t)\,\mathbf{w}^*$

Name    T24   T50   P30    Nf     Ps30  phi    NRf    BPR   htBleed  W31    W32
Value   0.13  0.37  -0.03  -0.05  0.23  -0.21  -0.08  0.16  0.12     -0.05  -0.16

(Figure: real-time sensor information (T24, …, W32) is fused into a health index, which feeds the stochastic degradation models and Bayesian updating methods (Gebraeel, 2006) to produce the remaining life prediction)
• The developed HI-QL improved the RUL prediction accuracy
  o by 64.83% compared with the best single sensor
  o by 20.7% compared with existing HI-based models

System Degradation Modeling and Prognostics
• Non-parametric data fusion model: does not need to know the parametric form of the degradation signal
• Semi-parametric data fusion model: combines degradation modeling and prognostics in an integrated manner
• SNR-based data fusion model: immune to the heterogeneous sensor challenges in terms of signal scales and measurement units
• Quantile regression-based data fusion model: ensures recovery of the underlying degradation status, with estimated fusion coefficients converging to the true values
• Sensory-based Failure Threshold Estimation: update the failure threshold estimate of the in-field unit online
• Kernel-trick for nonlinear data fusion model
• Generic data fusion model with automatic sensor selection
• Data fusion model for multiple failure modes
• Data fusion model when there are multiple environmental conditions
• Generic data fusion model when multisensor signals are asynchronous
• Dynamic control of degradation speed and RLD via workload adjustment

Smart Monitoring of Alzheimer’s Disease via Data Fusion, Personalized Prognostics, and Selective Sensing
Effectiveness
• Existing Screening Methodology, Biomarkers: expensive, e.g., $5000 per scan for PiB-PET
• Existing Screening Methodology, Screening Tests: passive information collection, with burden and complexity
• New Approaches, Smart Monitoring: proactive information collection driven by accurate statistical models
(Figure: Proposed Smart Monitoring Method)
(Figure: The model of AD trajectory [3])

Data-Driven Failure Predictive Analytics
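
As a minimal sketch of the health-index construction from the engine RUL case study, $h_i(t) = \mathbf{L}_i(t)\,\mathbf{w}^*$, the Python below fuses one time step of sensor readings into a single health index using the weights listed in that case study. The function names, the assumption that readings are already standardized, and the example readings are illustrative only and are not taken from the slides; the downstream degradation model and Bayesian updating are not shown.

```python
# Fusion weights w* as listed in the engine RUL case study table.
WEIGHTS = {
    "T24": 0.13, "T50": 0.37, "P30": -0.03, "Nf": -0.05, "Ps30": 0.23,
    "phi": -0.21, "NRf": -0.08, "BPR": 0.16, "htBleed": 0.12,
    "W31": -0.05, "W32": -0.16,
}


def health_index(readings, weights=WEIGHTS):
    """Return h(t) = L(t) w for one time step.

    `readings` maps sensor name -> value. The values are ASSUMED to be
    standardized already; the slides do not state the preprocessing.
    """
    missing = set(weights) - set(readings)
    if missing:
        raise KeyError(f"missing sensors: {sorted(missing)}")
    # Weighted linear combination of the sensor signals.
    return sum(weights[name] * readings[name] for name in weights)


def health_index_series(history, weights=WEIGHTS):
    """Apply `health_index` to each time step of one unit's history."""
    return [health_index(row, weights) for row in history]


if __name__ == "__main__":
    # Hypothetical readings, made up purely to exercise the functions.
    step_1 = {name: 0.0 for name in WEIGHTS}
    step_2 = dict(step_1, T50=1.0, Ps30=0.5)
    print(health_index_series([step_1, step_2]))
```

A linear fusion like this keeps the index easy to interpret: each weight shows how strongly, and in which direction, a sensor contributes to the fused health index.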