energies
Article Accuracy Improvement of Transformer Faults Diagnostic Based on DGA Data Using SVM-BA Classifier
Youcef Benmahamed 1 , Omar Kherif 1 , Madjid Teguar 1, Ahmed Boubakeur 1 and Sherif S. M. Ghoneim 2,*
1 Research Laboratories, National Polytechnic School (ENP), B.P 182, El-Harrach, Algiers 16200, Algeria; [email protected] (Y.B.); [email protected] (O.K.); [email protected] (M.T.); [email protected] (A.B.) 2 Electrical Engineering Department, College of Engineering, Taif University, Taif 21944, Saudi Arabia * Correspondence: [email protected]
Abstract: The main objective of the current work was to enhance the transformer fault diagnostic accuracy based on dissolved gas analysis (DGA) data with a proposed coupled system of support vector machine (SVM)-bat algorithm (BA) and Gaussian classifiers. Six electrical and thermal fault classes were categorized based on the IEC and IEEE standard rules. The concentration of five main combustible gases (hydrogen, methane, ethane, ethylene, and acetylene) was utilized as an input vector of the two classifiers. Two types of input vectors have been tested; the first input type considered the five gases in ppm, and the second input type considered the gases introduced in the percentage of the sum of the five gases. An extensive database of 481 had been used for training and testing phases (321 data samples for training and 160 data samples for testing). The SVM model conditioning parameter “λ” and penalty margin parameter “C” were adjusted through the bat algorithm to develop a maximum accuracy rate. The SVM-BA and Gaussian classifiers’ accuracy was
Citation: Benmahamed, Y.; Kherif, evaluated and compared with several DGA techniques in the literature. O.; Teguar, M.; Boubakeur, A.; Ghoneim, S.S.M. Accuracy Keywords: transformer faults; SVM-BA classifier; DGA; DGALab Improvement of Transformer Faults Diagnostic Based on DGA Data Using SVM-BA Classifier. Energies 2021, 14, 2970. https://doi.org/10.3390/ 1. Introduction en14102970 The insulation system state of the power transformers is responsible for determining the transformers’ lifetime. It is generally exposed to a couple of defects arising from Academic Editor: Ayman El-Hag overheating, paper carbonization, arcing, and discharges of low or high energy [1–3]. These faults might accelerate the insulation degradation, affecting the transformer reliability and Received: 15 April 2021 lifetime [4]. Early detection of these faults can avoid the undesired abnormal operating Accepted: 19 May 2021 Published: 20 May 2021 conditions or transformer outages [5,6]. Several DGA techniques in the literature were proposed to detect the faults in trans-
Publisher’s Note: MDPI stays neutral formers, but in some cases, these DGA techniques’ diagnostic accuracy is inadequate. with regard to jurisdictional claims in The dissolved gas analysis (DGA) technique considers one of the fastest and economical published maps and institutional affil- techniques widely used to diagnose the transformer fault types of the insulation system [7]. iations. The insulating oil decomposes into hydrocarbon products, which are categorized as com- bustible and incombustible gases. The five main combustible gases are Hydrogen (H2), Methane (CH4), Acetylene (C2H2), Ethylene (C2H4), and Ethane (C2H6), which might be generated within the oil during a faulty mode [1]. The concentrations of these gases were used as an input vector to interpret the DGA results in transformer oil, associated with six Copyright: © 2021 by the authors. Licensee MDPI, Basel, Switzerland. basic electrical and thermal faults [4,8]. Different DGA techniques have been developed This article is an open access article to diagnose the transformer faults, including graphical DGA methods (e.g., [1,9–11]) and distributed under the terms and artificial intelligence techniques (e.g., [12,13]). Improved coupled techniques have also conditions of the Creative Commons been developed to diagnose multiple transformer faults and quantitatively indicate each Attribution (CC BY) license (https:// fault’s likelihood (e.g., [14]). creativecommons.org/licenses/by/ Artificial intelligence techniques such as artificial neural networks (ANN) can combine 4.0/). with the traditional DGA techniques to enhance the diagnostic accuracy of the transformer
Energies 2021, 14, 2970. https://doi.org/10.3390/en14102970 https://www.mdpi.com/journal/energies Energies 2021, 14, 2970 2 of 17
faults, such as the California State University Sacramento artificial neural network method (CSUS-ANN) [13]. The CSUS-ANN DGA technique used the gas concentration percentage from the five main combustible gases as inputs to the backpropagation neural network to determine the transformer faults based on the training process of DGA samples with knowing transformer fault types. Ghoneim and Taha [15] proposed a new approach (clus- tering) to enhance the diagnostic transformer faults by developing new gas ratios with the IEC ratios and defining its limits to improve diagnostic accuracy. The traditional IEC code 60599 and Rogers’ four ratios gave a poor diagnostic accuracy of the transformer faults. Enhancing the diagnostic accuracy by modifying the two previous DGA methods’ ratio limits using the particle swarm optimization with fuzzy logic is presented [6]. The conditional probability in [16] introduced a new concept using the likelihood of the faults’ occurrence and the likelihood of un-occurrence of the fault via the mean and standard deviation of the two events’ DGA samples. The conditional probability of the fault oc- currence is identified using the multivariate normal probability density function. Three scenarios were developed depending on how to separate among the different faults. All these techniques are merged into one software package (DGALab), which is own as in [17] to facilitate the comparison process between them and any new proposed DGA techniques with the advantage of using an extensive database of DGA samples [17,18]. In this paper, SVM-BA and Gaussian classifiers have been used to detect faults within an oil-immersed power transformer. The concentration of gases in the ppm and percentage of the sum of the five main combustible gases have been used as an input vector for Gaussian and SVM classifiers. Kernel parameter λ and penalty margin C of the SVM model have been optimized by a Bat algorithm (SVM-BA) to adjust the model, getting a high diagnostic accuracy. Electrical and thermal transformer faults have characterized the output of each classifier including partial discharge (PD), low energy discharges (D1), high energy discharges (D2), thermal faults < 300 ◦C (T1), thermal faults of 300 ◦C to 700 ◦C (T2), and thermal faults > 700 ◦C (T3) [1]. The performance of each classifier has been investigated in terms of accuracy rate. A total of 481 sample datasets have been considered, where two-thirds were used for the training process (321 samples) while the rest was used for the testing process (160 samples). A comparative study was accomplished with the other DGA techniques in the literature to identify the proposed DGA technique’s diagnostic improvement. The current work presents a classification technique (SVM-BAT and Gaussian classi- fiers) to enhance the transformer faults’ diagnostic accuracy, which considers one of the new trends in condition monitoring and diagnostics of power system assets.
2. Problem Formulation Highly reliable transformers are mainly made of iron core and windings; both are placed in the oil tank filled with insulating oil, as shown in Figure1. Mineral insulating oil is the most common type of oil used in outdoor transformers [19]. This insulating oil has significant dielectric strength so that it can withstand a pretty high voltage. It also reduces heat generated by transformer windings employing the cooler (radiators, air fans, ... ). Therefore, the heat generated in the transformer results in a temperature rise in the internal transformer structures. Under electrical and thermal stresses, different hydrocarbon gases are liberated due to the insulating oil decomposition. Particular gases characterize each type of fault. For instance, hydrogen concentration, produced by ionic bombardment, increases with partial discharges within a transformer oil. In this context, a general review about the gases produced during the deterioration of mineral oil and their interpretation has been detailed in [10]. Energies 20212021,, 1414,, 2970x FOR PEER REVIEW 3 of 16 17
Figure 1. Oil-immersedOil-immersed power power transformer cross-section.
MineralEarly-stage insulating detection oil is of the these most faults common should ty bepe carriedof oil used out in to outdoor avoid the transformers undesired abnormal operating conditions or transformer outages. For this purpose, periodic monitor- [19]. This insulating oil has significant dielectric strength so that it can withstand a pretty ing of the oil should be conducted during transformer service, whether in-situ or at the high voltage. It also reduces heat generated by transformer windings employing the laboratory, using a multi-stage gas-extractor (a device for sampling transformer oil) [10]. In cooler (radiators, air fans, …). Therefore, the heat generated in the transformer results in general, the most important gases are Hydrogen (H ), Methane (CH ), Acetylene (C H ), a temperature rise in the internal transformer structures.2 Under electrical4 and thermal2 2 Ethylene (C H ), and Ethane (C H ). The distribution of these gases is related to the type of stresses, different2 4 hydrocarbon 2gases6 are liberated due to the insulating oil decomposition. transformer fault, and the rate of gas generation can indicate the severity of the fault [5,20]. Particular gases characterize each type of fault. For instance, hydrogen concentration, pro- In [6], the authors have collected 481 samples associating with the six different faults duced by ionic bombardment, increases with partial discharges within a transformer oil. as indicated in the Introduction (i.e., PD, D1, D2, T1, T2, and T3). The number of samples In this context, a general review about the gases produced during the deterioration of associated with each fault is given in Table1. mineral oil and their interpretation has been detailed in [10]. TableEarly-stage 1. Database distribution.detection of these faults should be carried out to avoid the undesired ab- normal operating conditions or transformer outages. For this purpose, periodic monitor- ing of theDefect oil should be conducted Interpretationduring transformer service, whether Number ofin-situ Samples or at the laboratory, PDusing a multi-stage gas-extractor Partial discharge (a device for sampling transformer 48 oil) [10]. In general, the most important gases are Hydrogen (H2), Methane (CH4), Acetylene (C2H2), D1 Low energy discharges 79 Ethylene (C2H4), and Ethane (C2H6). The distribution of these gases is related to the type of transformerD2 fault, and the rate High of energygas gene dischargesration can indicate the severity 126 of the fault [5,20]. T1 Thermal faults of <300 ◦C 95 In [6], theT2 authors have Thermal collected faults 481 of samples 300 ◦C to associating 700 ◦C with the six 48 different faults as indicated in the Introduction (i.e., PD, D1, D2, T1, T2, and T3). The number of samples T3 Thermal faults of >700 ◦C 85 associated with each fault is given in Table 1. All 481 Table 1. Database distribution.
DefectThedatabase set has beenInterpretation exploited in the present investigationNum tober detect of Samples and identify faults.PD As shown in this table,Partial only discharge separated faults (no combined faults) have48 been consid- ered. The fault detection has been examined using the concentration of each dissolved gas. D1 Low energy discharges 79 Since the weight percent of the gases as mentioned earlier would result in an inopportunely D2 High energy discharges 126 small number, concentration in parts per million, or ppm, has been considered for each gas. T1 Thermal faults of < 300 °C 95 Furthermore, percent concentration of the total sum was also used, where each sample T2 Thermal faults of 300 °C to 700 °C 48 X = [x , x ,..., x ] is scaled as follows: T31 2 5 Thermal faults of > 700 °C 85 All X 481 = × X 5 100% (1) ∑i=1 xi
Energies 2021, 14, x FOR PEER REVIEW 4 of 16
The database set has been exploited in the present investigation to detect and identify faults. As shown in this table, only separated faults (no combined faults) have been con- sidered. The fault detection has been examined using the concentration of each dissolved gas. Since the weight percent of the gases as mentioned earlier would result in an inop- portunely small number, concentration in parts per million, or ppm, has been considered for each gas. Furthermore, percent concentration of the total sum was also used, where Energies 2021, 14, 2970 each sample X = [x1, x2, …, x5] is scaled as follows: 4 of 17 𝑋 X = 100% (1) ∑ 𝑥 TheThe faultsfaults diagnostic diagnostic method method has has been been carried carried out out elaborating elaborating two two different different classifiers, classifi- namelyers, namely Gaussian Gaussian and SVM-BA.and SVM-BA. The flowchartThe flowchart given given in Figure in Figure2 summarizes 2 summarizes the various the var- stagesious stages of the of diagnostic the diagnostic approach. approach.
FigureFigure 2.2.Flowchart Flowchart ofof thethe problemproblem formulation.formulation.
3.3. ClassificationClassification Approach Approach ForFor both,both, Gaussian Gaussian and and SVM-BA, SVM-BA, classifier, classifier the, concentrationsthe concentrations in percentages in percentages and ppm and ofppm the fiveof the dissolved five dissolved gases have gases been have used been as anused input as an vector, input denoted vector, bydenotedX = [x 1by, x 2X,..., = [x1x, 5x]2,, associated…, x5], associated with a particular with a particular class of faultclass (denoted of fault (denoted by y) representing by y) representing the classifier the decisionclassifier (classifierdecision (classifier output). output).
3.1.3.1. GaussianGaussian ClassifierClassifier InIn thisthis part,part, the Gaussian Gaussian classification classification is is used used as as a aprobabilistic probabilistic learning learning method method for forconstructing constructing a classifier a classifier by applying by applying Bayes’ Bayes’ theorem. theorem. It concerns It concerns the conditional the conditional and mar- and marginal probabilities of two random events. The classifier is based on the comparison of ginal probabilities of two random events. The classifier is based on the comparison of the the posterior probability P (wi|x): posterior probability P (wi|x):
P(𝑃x(|w𝑥|i𝑤) P )( w𝑃i()𝑤 ) P(w𝑃(i|𝑤x)|𝑥=) = , i =, 𝑖1, 2, = . 1,2 .,...,6 . , 6 (2)(2) P(𝑃x)(𝑥)
wherewhereP P (x|w (x|wii)) isis thethe conditionalconditional probabilityprobability (likelihood)(likelihood) givengiven by:by:
6 P(x|wi) = ∏ P(xk|wi) (3) k=1
and P(x) is the unconditional density that normalizes the posteriors, computed as follows:
6 P(x) = ∑ P(x|wi)P(wi) (4) i=1
in which P(wi) is the prior probability of each class. Energies 2021, 14, 2970 5 of 17
Firstly, the training phase has been carried out for constructing the parameters of the Gaussian model. In this phase, 321 samples of the data set have been reserved to determine the Gaussian distributions, consisting of the mean value (µ) and the matrix covariance (σ) of the gas concentration for each defect class. Since the number of samples differs from one fault to another, every distribution is multiplied by a weight corresponding to its samples’ number on the database’s total size. In the next step, Gaussian has been employed to compute the conditional probability P (x|wi) as indicated in Equation (3), where the posterior probability is calculated using the probability density function of a univariate normal distribution as follows:
2 1 (xk−m) 1 − 2 2 P(xij|wk) = √ e σ (5) 2πσ2 Since it is required to know the likelihood of observing the k-th sample while con- sidering all the different distributions, one can sum the likelihood of observing the given sample from each possible Gaussian, using:
exp[− 1 X − µ)Tσ−1(X − µ) P(x |w ) = 2 (6) k i 6/2p (2π) |σ|
in which, |σ| and σ−1 denote the determinant and inverse of the covariance matrix σ. Each Gaussian model’s parameters (i.e., variance, mean, and weight) have been addressed to cluster the data and estimate those having the same parameters. Moreover, a maximum likelihood estimate (MLE) was used to find the optimal mean and variance, maximizing the data’s likelihood. After training the model, the classifier output ideally ends up with six distributions on the same axis. Depending on the axis’s location, each Energies 2021, 14, x FOR PEER REVIEWtesting sample (a total of 160 testing ones) is placed in one of the defect classes.6 of 16 Figure3
illustrates the different steps of the Gaussian classifier.
FigureFigure 3. Flowchart 3. Flowchart formulating formulating the problem the problem using using Gaussian Gaussian classifier. classifier.
3.2. SVM Classifier Coupled with BA SVMs techniques are used in the problem of classification, regression, and prediction models [21]. For the classification problems, hyperplanes are required in a multidimen- sional space separating data points of both fault classes. These hyperplanes are used to distinguish between every two classes (yi and yj) of faults associated with two different input vectors (Xi and Xj) [22–24]. Among these hyperplanes, it is suggested to find the one that has the maximum margin (denoted by M). In this light, the classification becomes an optimization problem where hyperplanes represent the decision boundaries that help classify the data points. Usually, an orthogonal vector (denoted by ω) to the hyperplane defined by:
𝜔=[𝜔 ,𝜔 ,...,𝜔 ] (7) which is used in combination with an input vector (Xi) to define the hyperplane function, h, as follows [22]: