processes
Review Applications of Machine Learning to Reciprocating Compressor Fault Diagnosis: A Review
Qian Lv 1, Xiaoling Yu 1,*, Haihui Ma 1, Junchao Ye 1, Weifeng Wu 1 and Xiaolin Wang 2
1 School of Energy and Power Engineering, Xi’an Jiaotong University, Xi’an 710049, China; [email protected] (Q.L.); [email protected] (H.M.); [email protected] (J.Y.); [email protected] (W.W.) 2 School of Engineering, University of Tasmania, Hobart, TAS 7001, Australia; [email protected] * Correspondence: [email protected]
Abstract: Operating condition detection and fault diagnosis are very important for reliable operation of reciprocating compressors. Machine learning is one of the most powerful tools in this field. However, there are very few comprehensive reviews which summarize the current research of machine learning in monitoring reciprocating compressor operating condition and fault diagnosis. In this paper, the recent application of machine learning techniques in reciprocating compressor fault diagnosis is reviewed. The advantages and challenges in the detection process, based on three main monitoring parameters in practical applications, are discussed. Future research direction and development are proposed.
Keywords: reciprocating compressor; condition monitoring; fault diagnosis; machine learning
Citation: Lv, Q.; Yu, X.; Ma, H.; Ye, J.; 1. Introduction Wu, W.; Wang, X. Applications of The reciprocating compressor (RC) is a key piece of equipment in petroleum and Machine Learning to Reciprocating chemical industries. If the RC does not operate in the rated efficiency, it will lead to great Compressor Fault Diagnosis: A economic loss to the company. Sometimes RCs are used to compress inflammable and ex- Review. Processes 2021, 9, 909. plosive gases working under high pressures and temperatures, such as hydrogen, ethylene, https://doi.org/10.3390/pr9060909 and natural gas, which would threat human life once the machine malfunctions [1]. Fur- thermore, due to the intricate structure of the compressor, a large amount of wearing parts, Academic Editor: Jie Zhang and the complicated interactional relationship between moving parts of the compressor, it is essential to monitor the compressor operating condition and detect failures of RCs Received: 21 April 2021 accurately and in a timely manner [2]. Accepted: 18 May 2021 Traditionally, the fault diagnosis of RCs was carried out by engineers who had exten- Published: 21 May 2021 sive experience and knowledge. As an example, an experienced engineer is able to detect the faults of RCs according to the abnormal noise occurring in sound signals [3,4]. However, Publisher’s Note: MDPI stays neutral users still tend to predict faults more accurately before the RC faults occur. This will reduce with regard to jurisdictional claims in published maps and institutional affil- the operation and maintenance (O&M) costs by using an automatic condition classification iations. system [5]. Recently, with the rapid development of artificial intelligence (AI), machine learning (ML) has turned out to make a great difference in machine fault diagnosis and prediction [6]. Zhang et al. [6] discussed the computational intelligence techniques utilized in machinery condition monitoring and fault diagnosis. A comprehensive overview of some common natural computing methods and their applications in mechanical systems Copyright: © 2021 by the authors. was made by Worden et al. in [7]. Some other researchers [8] intend to develop a review Licensee MDPI, Basel, Switzerland. of bearing fault diagnosis based on deep learning (DL), whereas very few reviews were This article is an open access article distributed under the terms and conducted for intelligent fault diagnosis of reciprocating compressors. conditions of the Creative Commons This paper reviews the recent research and development of fault diagnosis technolo- Attribution (CC BY) license (https:// gies for reciprocating compressors utilizing machine learning in terms of theoretical studies creativecommons.org/licenses/by/ and applications. To finish this review, search terms such as artificial intelligence, machine 4.0/). learning, reciprocating compressor, fault diagnosis, fault detection, monitoring system, etc.,
Processes 2021, 9, 909. https://doi.org/10.3390/pr9060909 https://www.mdpi.com/journal/processes Processes 2021, 9, x FOR PEER REVIEW 2 of 15
machine learning, reciprocating compressor, fault diagnosis, fault detection, monitoring system, etc., were used in the process of searching literature. The filtering mechanism is mainly based on multiple combinations of the search terms mentioned above. Most of the literature surveyed is from journals, in which the English literature was searched using Web of Science and the Chinese literature was searched using China National Knowledge Infrastructure (CNKI). The theories of RCs and ML methods were mostly from books. After the searching process, the literature was classified into several categories based on Processes 2021, 9, 909 ML methods applied and the main monitoring parameters. The following reviewing2 sec- of 14 tions are also introduced according to the categories. After the review of literature, the advantages and limitations of the ML technologies, comparison of different monitoring parameters,were used in and the an process overlook of searching of future literature.research are The also filtering discussed. mechanism is mainly based on multiple combinations of the search terms mentioned above. Most of the literature sur- 2.veyed Overview is from of journals, Reciprocating in which Compressors the English literature was searched using Web of Science andFigure the Chinese 1 shows literature a single-stage was searched reciprocating using China compressor, National which Knowledge is mainly Infrastructure made up of two(CNKI). valves, The a theories piston, a of cylinder, RCs and a ML piston methods rod, a were crosshead, mostly froma connecting books. After rod, theand searching a crank- shaft.process, The the crankshaft literature wasis driven classified by intomotor, several then categories the crankshaft based onreciprocates ML methods the appliedpiston throughand the mainthe slide-crank monitoring mechanism, parameters. so The that following the piston reviewing can compress sections gas are in alsothe cylinder introduced to aaccording designated to high the categories. pressure [9]. After RCs the can review be applied of literature, in chemical, the advantages refining, and and petrochem- limitations icalof the plants, ML technologies,and they can compress comparison almost of different any gas monitoring mixture from parameters, vacuum to and over an 3000 overlook atm. of futureThe faults research of arecompressors also discussed. are caused by failures of different components. In [10], Kostyukov performed a survey into the fault causes of reciprocating compressors based on2. Overviewconsumers of and Reciprocating manufacturers Compressors of RCs. The results showed that one of the main reasons for compressorFigure1 shows failure a is single-stage valves, and reciprocating it makes up 36%. compressor, Piston-cylinder which units is mainly also constitute made up overof two 30% valves, of all afaults, piston, where a cylinder, the failures a piston of rings rod, account a crosshead, for 25%. a connecting Failures in rod, the andslide- a crankcrankshaft. mechanism The crankshaft and cranking is driven mechanism by motor, are thenalso thesignificant crankshaft [10]. reciprocates To monitor thecompres- piston sorthrough conditions, the slide-crank many kinds mechanism, of sensors so were that theused piston in fault can detection compress systems, gas in the such cylinder as vibra- to a tiondesignated sensors, high temperature pressure sensor, [9]. RCs pressure can be applied sensor, indisplacement chemical, refining, sensor, andacoustic petrochemical emission sensor,plants, and so they on. can compress almost any gas mixture from vacuum to over 3000 atm.
Figure 1. Principle of a one-stage reciprocating compressor. 1—suction valve; 2—discharge valve; Figure3—piston; 1. Principle 4—cylinder; of a one-stage 5—piston reciprocating rod; 6—crosshead; compressor. 7—connecting 1—suction rod; valve; 8—crankshaft. 2—discharge valve; 3—piston; 4—cylinder; 5—piston rod; 6—crosshead; 7—connecting rod; 8—crankshaft. The faults of compressors are caused by failures of different components. In [10], 3.Kostyukov Overview performed of the Four a surveyMajor intoMachine the fault Learning causes Methods of reciprocating compressors based on consumers and manufacturers of RCs. The results showed that one of the main reasons for Machine learning is a subject that focuses on research of learning algorithms by compressor failure is valves, and it makes up 36%. Piston-cylinder units also constitute over which a machine can learn from the data nearly as well as people do [11]. Up to now, there 30% of all faults, where the failures of rings account for 25%. Failures in the slide-crank are a lot of machine learning methods that have been applied in RC fault diagnosis. In this mechanism and cranking mechanism are also significant [10]. To monitor compressor section,conditions, the four many most kinds prevalent of sensors algorithms were used in in machine fault detection learning systems, are reviewed. such as The vibration artifi- cialsensors, neural temperature network, support sensor, pressure vector mach sensor,ine, displacement and Bayesian sensor, network acoustic are emissionthree common sensor, traditionaland so on. machine learning methods, and the deep learning method is one of the latest machine learning algorithms. 3. Overview of the Four Major Machine Learning Methods Machine learning is a subject that focuses on research of learning algorithms by which a machine can learn from the data nearly as well as people do [11]. Up to now, there are a lot of machine learning methods that have been applied in RC fault diagnosis. In this section, the four most prevalent algorithms in machine learning are reviewed. The artificial neural network, support vector machine, and Bayesian network are three common traditional machine learning methods, and the deep learning method is one of the latest machine learning algorithms.
3.1. Artificial Neural Network (ANN) The artificial neural network (ANN) is a mathematical model, inspired by biological neural networks, which consists of a supply of interconnected basic processing elements, Processes 2021, 9, x FOR PEER REVIEW 3 of 15
3.1. Artificial Neural Network (ANN) The artificial neural network (ANN) is a mathematical model, inspired by biological neural networks, which consists of a supply of interconnected basic processing elements, called artificial neurons. Artificial neurons are connected with each other by connection links integrated with different weights. Figure 2 shows an ANN with four layers which Processes 2021, 9, 909 are input layer, output layer, and two hidden layers (layers 1 and 2). Each hidden layer 3 of 14 includes several neurons, and each neuron is connected to each element of the output vector of the last layer through the weight matrix 𝑊 (the weight matrix for the 𝑖th hid- den layer is written as 𝑊 ). Besides, each neuron has a bias 𝑏 (the bias for the 𝑗th neuron in the called𝑖th hidden artificial layer neurons. is written Artificial as 𝑏 ), neuronsa summer, are a connected transfer function with each 𝑓 other (the bytransfer connection links integrated with different weights. Figure2 shows an ANN with four layers which function for the 𝑗th neuron in the 𝑖th hidden layer is written as 𝑓 ), and an output 𝑎 (the are input layer, output layer, and two hidden layers (layers 1 and 2). Each hidden layer output for the 𝑗th neuron in the 𝑖th hidden layer is written as 𝑎 ). Therefore, the calculat- includes several neurons, and each neuron is connected to each element of the output ing functionvector of of each the last neuron layer is through indicated the by weight Equation matrix (1).W i (the weight matrix for the ith hidden i i layer is written as W ). Besides,𝑎 each 𝑓 𝑊 neuron𝑎 has 𝑏 a bias, bj (the bias for the jth neuron(1) in the ith hidden layer is written as bi), a summer, a transfer function f i (the transfer function for where 𝑎 is the output vector of thej (𝑖 1)th hidden layer (note thatj when 𝑖 1, 𝑎 i i is the inputthe jth vector neuron of inthe the inputith hiddenlayer of layer the whole is written network). as fj ), and an output aj (the output for the i Typically,jth neuron in inan theANNith model, hidden the layer transfer is written functions as aj). are Therefore, selected theby the calculating designer, function and of the weightseach neuron and biases is indicated are adjustable by Equation parameters (1). which can be adjusted by the learning means such as error back propagation algorithm. Therefore, the input and output rela- i i i i−1 i tionship of the network can meet a specificaj goal= fj ([12,13].W a Thus,+ bj) ,the ANN model can be used (1) to deduce a function from the observations, which is helpful in solving complex problems. i−1 i−1 Hence,where it cana be broadlyis the output applied vector in fault of the diagnosis, (i − 1)th hiddenwhich layeris an (noteessential that classification when i = 1, a is problem.the input vector of the input layer of the whole network).
Figure Figure2. Graphical 2. Graphical model of model an ANN. of an ANN.
3.2. BayesianTypically, Network in(BN) an ANN model, the transfer functions are selected by the designer, and the weights and biases are adjustable parameters which can be adjusted by the learning means The Bayesian network (also called belief network) [14] is a directed acyclic graph (as such as error back propagation algorithm. Therefore, the input and output relationship of shown in Figure 3) where the nodes, such as 𝑧 ,𝑧 ,…,𝑧 , are perceived to be the propo- the network can meet a specific goal [12,13 ]. Thus, the ANN model can be used to deduce sitional variables. The arrow between two nodes means that the two nodes are related a function from the observations, which is helpful in solving complex problems. Hence, it directly, and the weight therein is quantified by a conditional probability. The two essen- can be broadly applied in fault diagnosis, which is an essential classification problem. tial natures of these networks are consistency and completeness, while the chain-rule rep- resentation3.2. Bayesian of the joint Network distributions (BN) is employed to guarantee the two natures for its form [15,16]: The Bayesian network (also called belief network) [14] is a directed acyclic graph
(as shown in Figure𝑃 𝑧 ,𝑧3 ),…,𝑧 where the 𝑃 nodes, 𝑧 |𝑧 such,…,𝑧 as ⋯𝑃{z1, 𝑧z 2|,...,𝑧 𝑃 𝑧z 7 }, , are perceived(2) to be the propositional variables. The arrow between two nodes means that the two nodes are related directly, and the weight therein is quantified by a conditional probability. The two essential natures of these networks are consistency and completeness, while the chain-rule representation of the joint distributions is employed to guarantee the two natures for its form [15,16]:
P(z1, z2,..., zn) = P(zn|zn−1,..., z1) ··· P(z2|z1)P(z1), (2) ProcessesProcesses 2021, 92021, x FOR, 9, 909 PEER REVIEW 4 of 154 of 14
It can be seenIt can that be in seen the right-chained that in the right-chained formula, each formula, variable each appears variable once appears on the left once side on the of theleft conditioning side of the bar, conditioning which can bar, facilitate which the can dependence facilitate the quantification dependence quantificationof the network. of the For instance,network. the For chain instance, rule representation the chain rule representationfor the network for shown the network in Figure shown 3 is: in Figure3 is:
𝑃 𝑧 ,𝑧 ,…,𝑧 𝑃 𝑧 |𝑧 ,𝑧 𝑃 𝑧 |𝑧 𝑃 𝑧 |𝑧 ,𝑧 𝑃 𝑧 𝑧 ,𝑧 ,𝑧 𝑃 𝑧 𝑃 𝑧 𝑃 𝑧 (3) P(z1, z2,..., z7) = P(z7|z5, z4)P(z6|z4)P(z5|z3, z1)P(z4|z3, z2,z1)P(z3)P(z2)P(z1) (3) The Bayesian network is a methodology integrating the probability theory and graph theory. NotThe only Bayesian can it visually network exhibit is a methodology the structure integrating of real tasks the by probability graph, but theory it can and also graph exploittheory. the structure Not only based can on it visuallythe principle exhibit of the the probability structure oftheory, real tasks which by would graph, dimin- but it can ish thealso complexity exploit the of structurereasoning. based Therefore, on the the principle Bayesian of network the probability is applied theory, in many which var- would ious domains.diminish theThe complexity Bayesian network of reasoning. also Therefore,provides a the framework Bayesian networkfor new ismodels, applied and in many thereinvarious a naive domains. Bayes model The is Bayesian normally network selected alsofor classification provides a frameworkand prediction for of new multi- models, dimensionaland therein discrete a naive time Bayes series model [17,18]. is normally selected for classification and prediction of multi-dimensional discrete time series [17,18].
Figure 3. A typical Bayesian network. Figure3.3. 3. A Support typical VectorBayesian Machine network. (SVM) The support vector machine (SVM) is a supervised learning technique developed 3.3. Support Vector Machine (SVM) based on a statistical learning theory, which aims to find a hyperplane (see Figure4). It can Theseparate support n-dimensional vector machine inputs (SVM) into two is a parts supervised associated learning with the technique real distinct developed classes. The basedhyperplane on a statistical can belearning depicted theory, as [19 which]: aims to find a hyperplane (see Figure 4). It can separate n-dimensional inputs into two parts associated with the real distinct classes. The hyperplane can be depicted as [19]: wT x + b = 0, (4)