<<

Latin Hypercube and Support Vector Regression-Assisted Analysis of Electromagnetic Scattering Characteristics of Targets

Dong-Hai Xiao, Li-Xin Guo, Wei Liu, Wan-Qiang Qin, Mu-Yu Hou School of Physics and Optoelectronic Engineering Xidian University Xi’an 710071, China

Abstract-Reliable and efficient analysis of electromagnetic and eliminates the effect of various experimental scattering from targets is important for target recognition and factors without increasing the number of subjects in the stealth design. In this paper, a method combining sampling design with machine learning is proposed. Latin hypercube [10]. Kent R. Davey [11] proposed a hybrid sampling is adopted to sample backscattering data in hopes of approach in magnetic field optimization problems combining improving sampling , and support vector regression Latin hypercube sampling and pattern search and achieved (SVR) is utilized to train the model so as to obtain an efficient remarkable acceleration effect. Fei Kang et al. [12] developed and accurate nonlinear regressor. With the aid of multilevel fast a system probabilistic stability evaluation method for slopes multipole method (MLFMM), the feasibility of the proposed method is validated by the simulation data of a SLICY model. It based on Gaussian process regression (GPR) and Latin also shows the potential of sampling design in analysis of hypercube sampling (LHS) and obtained very reliable electromagnetic scattering from a complex target. performance. Index Terms—Latin hypercube sampling (LHS); support vector In this paper, SVR is selected as the regression model for its regression (SVR); electromagnetic scattering; sampling design; good applicability to small datasets[13]. And Latin hypercube machine learning sampling (LHS) and uniformly-spaced sampling (USS) are applied to obtained required datasets for two variables: zenith I. INTRODUCTION angle  and azimuth angle  . Experimental results show that How to efficiently and accurately acquire the LHS can effectively improve the accuracy of regression electromagnetic scattering characteristics of targets has always compared with USS. been a research hotspot in computational electromagnetics. It II. METHODOLOGY plays a vital role in target recognition and stealth structure design. Traditional numerical methods, such as MOM, have A. Latin hypercube sampling (LHS) high accuracy, but their computational efficiency is limited [1]. To tackle this difficulty, a large number of acceleration Latin hypercube sampling (LHS), essentially, is a form of algorithms have emerged in the past decades [2],[3]. The core . The detailed procedure for producing a idea of these algorithms is to reduce computational complexity s-dimensional Latin hypercube sample of size N can be and realize parallel computing. In recent years, machine described as follow [14]: 1) divide the continuous of learning has made remarkable achievements in many fields, each variable into N equally spaced intervals, and mark the such as image recognition [4], medical diagnosis [5] and cell interval (i 1) N , i N with i ; 2) randomly arrange labels natural language processing [6]. Researchers have also  (1,,) N for each dimension, such as 1 j,, Nj  denotes a extended machine learning to computational electromagnetics. For instance, Du et al. [7] proposed a machine learning random arrangement of the j-dimensional coordinates; 3) scheme for the analysis of polarimetric bistatic scattering from randomly disrupt the order of s dimensions and obtain a random matrix  ; 4) the final Latin hypercube sample a finite dielectric cylinder, and experimental results have  ij ns demonstrated its feasibility. Li et al. [8] proposed a 3D CC  ij  can be expressed as Poisson's equation solver based on deep learning technique, ns 1 taking advantage of the power of CNN in approximation of Cij( ij  u ij ) N , i  1, , N , j  1, , s (1) highly nonlinear functions and prediction of the potential 2 distribution of the electrostatic field. On the other hand, Latin where uij denotes an i.i.d. sample uniformly distributed on hypercube sampling (LHS) [9] is widely used in regression [ 0.5,0.5] and independent of  . Fig. 1 shows a Latin analysis for its advantages that it produces samples with high hypercube sample when N  7 and s  2 . Obviously, each representativeness in the domain. It ensures that each value (or row and each column of the constructed table contain only one a range of values) of a variable is represented in the samples sample. That is, each of those components is represented in an y fx()  fxwx()T 

fx() 

x o

Figure 1. A Latin hypercube sample with N 10 , Figure 2. the -insensitive . Figure 3. the SLICY model s  2 for X  [,]XX Distributed Uniformly on the 12 Unit Square. perfectly conducting objects, and the frequency was set to undifferentiated and fully stratified manner, which eliminates 1GHz. In addition, sampling operations to make datasets were the confounding effect of various experimental factors. carried out only in the upper space for the practical significance of the aforementioned model. More details about B. Support Vector Regression(SVR) these datasets can be found in TABLE I. Support vector machine (SVM) was first used to solve the problem of pattern recognition [15]. Since Vapnik introduced TABLE I the - i n s e n s i t i v e loss function (Fig. 2) into support vector DATASETS INFORMATION Properties machines, this algorithm has been used to solve the problem Datasets of non-, which is called support vector Sampling Number Polarization regression (SVR) [13]. U1387H USS 1387 HH The solution of SVR is to transform the original L1387H LHS 1387 HH minimization problem into the dual maximization problem. For the sample set ( ,xy )ii, i 1,2,, , the dual problem can be expressed as follows: III. RESULTS AND DISCUSSIONS In this section, two subgoals are established here for max ()()yiiiiiˆ  ˆ α,αˆ  comprehensively analyzing the advantages of this method: 1) i1 to investigate the regression precision of the proposed method; 1 ˆ ˆ ()() (,) iijjij k xx 2) to analyze the error distribution. 2 ij,1 (2) A. Performance Verification ˆ s.t. ()0 ii i1 The SVR model is trained on two data sets U1387H and L1387H, respectively. They are divided into a training dataset ,[0,]iiˆ  C and a test dataset in proportion to 9:1, and therefore the ˆ where i and i denote the Lagrange multipliers, and training dataset contains 1248 HH-polarized monostatic k(,)xxij represents the underlying kernel function such as scattering data about SLICY model. K- f o l d cross validation linear kernel function, Gaussian kernel function and so on. (CV) is applied to adjust hyper-parameters, and K is set to 7 Gaussian kernel function earns our preference because of the for ln(1248)7.13 as recommended in [16]. Grid search as nonlinearity of scattering characteristics. Thus the SVR one of the commonly used search algorithms for hyper- approximation can be obtained by parameters is also taken into account. ˆ Fig. 4 and Fig. 5 are the test results of training SVR with f()()(,).x i  i k x i x  b (3) i1 datasets obtained by different sampling methods in HH polarization. yˆ and y represent the predicted value and the C. Datasets ground truth. It can be seen that the RMSE of SVR trained on In order to make a comprehensive investigation into the U1387H is 1.343 dBsm but on L1387H is 1.182 dBsm, and 2 presented method, two datasets about monostatic scattering the R of the former is 0.949 less than the latter 0.951. It from metal objects were selected to do experimental that LHS can effectively improve the regression researches, wherein a SLICY model (Fig. 3) was involved. accuracy of SVR for electromagnetic scattering regression One was sampled with LHS, and the other with uniformly- tasks. spaced sampling (USS) that the interval of  and  was 5 . Considering the computational efficiency and fidelity, the B. Performance Verification multilevel fast multipole method (MLFMM) was employed to Error analysis is of positive significance for the analysis of calculate the monostatic radar cross section (RCS) of the

Figure 4. test results of training SVR with U1387H. Figure 5. test results of training SVR with L1387H. Figure 6. the spherical distribution of regression errors where the SVR model is trained with U1387H.

ACKNOWLEDGMENT This research was supported by the National Natural Science Foundation of China (Grant No.61871457, Grant No.61431010) and the Foundation for Innovative Research Groups of the National Natural Science Foundation of China (Grant No.61621005).

REFERENCES [1] C. C. Lu and C. C. Weng, "A multilevel algorithm for solving a boundary integral equation of wave scattering," Microwave & Optical Technology Figure 7. the spherical distribution of regression errors Letters, vol. 7, no. 10, pp. 466-470, 2010. where the SVR model is trained with L1387H. [2] J. Guan, S. Yan, and J. M. Jin, "An OpenMP-CUDA Implementation of Multilevel Fast Multipole Algorithm for Electromagnetic Simulation on causes and further improvement. Therefore, it is necessary to Multi-GPU Computing Systems," IEEE Transactions on Antennas & analyze the spherical distribution of regression errors in our Propagation, vol. 61, no. 7, pp. 3607-3616, 2013. . Here, the error between the predicted value yˆ [3] Z. Peng, R. Hiptmair, Y. Shao, and B. Mackie-Mason, "Domain Decomposition Preconditioning for Surface Integral Equations in Solving and the ground truth y is defined as Challenging Electromagnetic Scattering Problems," IEEE Transactions err  yy ˆ . (4) on Antennas & Propagation, vol. 64, no. 1, pp. 210-223, 2015. [4] J. Shuiwang, Y. Ming, and Y. Kai, "3D convolutional neural networks Fig. 6 and Fig. 7 show us the spherical distribution of for human action recognition," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 35, no. 1, pp. 221-231, 2013. regression errors in HH polarization. Clearly, the larger errors [5] H. Liang et al., "Evaluation and accurate diagnoses of pediatric diseases almost distribute near the equator and the smaller errors near using artificial intelligence," Nature Medicine, vol. 25, no. 3, pp. 433-438, the poles when SVR is trained with the data sampled by USS, 2019/03/01 2019. [6] A. Vaswani et al., "Attention Is All You Need," 2017. but the error distribution is fairly uniform when trained with [7] C. Hao, Y. Chao, and D. Yang, "Machine Learning-Assisted Analysis of the data sampled by LHS. It is well proved that LHS can bring Polarimetric Scattering From Cylindrical Components of Vegetation," higher regression accuracy to the regression task of IEEE Transactions on Geoscience & Remote Sensing, vol. 57, no. 1, pp. 1-11, 2018. electromagnetic scattering characteristics for its high [8] W. Tang et al., Study on a Poisson's equation solver based on deep representativeness. learning technique. 2017, pp. 1-3. [9] M. D. McKay, R. J. Beckman, and W. J. Conover, "Comparison of three methods for selecting values of input variables in the analysis of output IV. CONCLUSION from a computer code," Technometrics, vol. 21, no. 2, pp. 239-245, 1979. [10] J. Cheng and M. J. Druzdzel, "Latin hypercube sampling in Bayesian In this study, we have combined the advantages of LHS and networks," in FLAIRS Conference, 2000, pp. 287-292. SVR to fully exploit the intrinsic correlation between data, and [11] K. R. Davey, "Latin hypercube sampling and pattern search in magnetic established the regression model of monostatic scattering from field optimization problems," IEEE Transactions on Magnetics, vol. 44, no. 6, pp. 974-977, 2008. complex objects. With these monostatic RCS datasets sampled [12] F. Kang, S. Han, R. Salgado, and J. Li, "System probabilistic stability from a perfectly conducting SLICY model, we found that LHS analysis of soil slopes using Gaussian process regression with Latin could help the SVR scheme get 12% performance gains than hypercube sampling," Computers and Geotechnics, vol. 63, pp. 13-25, 2015. USS. This leads us to a conclusion that the combination of [13] G. W. Flake and S. Lawrence, "Efficient SVM Regression Training with SVR and LHS does make sense in assistantly improving the SMO," Machine Learning, vol. 46, no. 1-3, pp. 271-290, 2002. efficiency of analysis of electromagnetic scattering from [14] M. Petelet, B. Iooss, O. Asserin, and A. Loredo, "Latin hypercube sampling with inequality constraints," AStA Advances in Statistical complex targets with considerable reliability. Analysis, vol. 94, no. 4, pp. 325-339, 2010. [15] V. N. Vapnik, The nature of statistical learning theory. 1995. [16] Y. Jung, "Multiple predicting K-fold cross-validation for ," Journal of , pp. 1-19, 2018.