S S symmetry
Article Landslide Susceptibility Mapping Using the Slope Unit for Southeastern Helong City, Jilin Province, China: A Comparison of ANN and SVM
Chenglong Yu 1,2 and Jianping Chen 1,*
1 College of Construction Engineering, Jilin University, Changchun 130026, China; [email protected] 2 Jilin Team of Geological Survey Center of China Building Materials Industrial, Changchun 130026, China * Correspondence: [email protected]; Tel.: +86-0431-8850-2353
Received: 1 June 2020; Accepted: 21 June 2020; Published: 23 June 2020
Abstract: The purpose of this study is to produce a landslide susceptibility map of Southeastern Helong City, Jilin Province, Northeastern China. According to the geological hazard survey (1:50,000) project of Helong city, a total of 83 landslides were mapped in the study area. The slope unit, which is classified based on the curvature watershed method, is selected as the mapping unit. Based on field investigations and previous studies, three groups of influencing Factors—Lithological factors, topographic factors, and geological environment factors (including ten influencing factors)—are selected as the influencing factors. Artificial neural networks (ANN’s) and support vector machines (SVM’s) are introduced to build the landslide susceptibility model. Five-fold cross-validation, the receiver operating characteristic curve, and statistical parameters are used to optimize model. The results show that the SVM model is the optimal model. The landslide susceptibility maps produced using the SVM model are classified into five grades—very high, high, moderate, low, and very low—and the areas of the five grades were 127.43, 151.60, 198.77, 491.19, and 506.91 km2, respectively. The very high and high susceptibility areas included 79.52% of the total landslides, demonstrating that the landslide susceptibility map produced in this paper is reasonable. Consequently, this study can serve as a guide for landslide prevention and for future land planning in the southeast of Helong city.
Keywords: landslide susceptibility mapping; artificial neural networks; support vector machines; five-fold cross-validation; receiver operating characteristic curve; statistical parameters
1. Introduction Landslides are one of the most common geological disasters in the mountainous areas of China [1–3]. In recent years, the Chinese economy has developed steadily and rapidly, and has become the second largest economy in the world. With such rapid development of the economy, many previously inaccessible areas have carried out corresponding infrastructure construction and various engineering activities in succession. Therefore, it is necessary to evaluate the risk of natural disasters such as landslides in these areas. Landslide susceptibility mapping is the basis of landslide hazard and risk assessment [4]. The purpose of landslide susceptibility mapping is to answer the question “what is the geological background of landslides and where are the areas which are most prone to them?” [2]. Based on the review of relevant papers, it was found that landslide susceptibility mapping mainly includes the following [5,6]: (a) landslide inventory data acquisition; (b) mapping unit selection; (c) influencing factor selection; (d) establishment of the evaluation model; and (e) production of the landslide susceptibility map. Landslide inventory data is the basis of landslide susceptibility mapping. The early acquisition of landslide inventory data mainly depends on field geological disaster surveys. With the development
Symmetry 2020, 12, 1047; doi:10.3390/sym12061047 www.mdpi.com/journal/symmetry Symmetry 2020, 12, 1047 2 of 23 of remote sensing technology, landslide interpretation based on remote sensing technology has become one of the most popular tools to quickly acquire landslide inventory data [7,8]. Remote sensing techniques commonly used in landslide interpretation mainly include (a) visible optical remote sensing and (b) InSAR (Interferometric Synthetic Aperture Radar) remote sensing. The key to remote sensing interpretation of landslides based on visible optics is to interpret the landslide based on the special topographic and geomorphic features of the landslide in the remote sensing image. However, this method is greatly affected by the spatial resolution of remote sensing data and is unable to identify the surface deformation characteristics (i.e., the potential and slow deformation of the landslide); thus, it has certain limitations. However, landslide interpretation based on InSAR technology can effectively identify the surface deformation and identify the potential landslides that are under deformation [9], which makes up for the limitations of visible optical remote sensing interpretation. The combination of these two methods has become a common method for landslide interpretation. As for mapping units, they are the basic units in landslide susceptibility mapping [10]. At present, the commonly used mapping units are the grid unit, watershed unit, slope unit, region unit, and uniform condition unit [4]. Among these, the grid unit is the most widely used but, due to its lack of connection with the topography and other geological information, the slope unit [11,12] has become more and more respected by relevant scholars. The selection of influencing factors is one of the key issues in the study of landslide susceptibility mapping [13]. The choice of influencing factors used in the evaluation will affect the final mapping results. The selection of influencing factors mostly relies on field investigation, cause analysis, expert experience, and related research of landslides; the factors with the highest correlation with the occurrence of landslides in the study area are typically selected as the influencing factors to evaluate the susceptibility to landslides [2]. At present, many scholars also use methods of automatic forward selection of parameters that weight and evaluate the effectiveness of each influencing factor and make use only of the “best” ones [14,15]. The evaluation models applied to landslide susceptibility mapping are mainly divided into qualitative and quantitative evaluation [6,16]. Qualitative evaluation is mainly based on the knowledge and experience of relevant experts, where the susceptibility to geological hazards such as landslides is determined by means of grading and weighting [17,18]. Quantitative evaluation methods include statistics, machine learning, deep mining, and other methods [6]. With the rapid improvement of calculation ability, an increasing number of quantitative evaluation methods have been applied to landslide susceptibility mapping. In addition, evaluation methods of the performance of landslide susceptibility models have also been developed rapidly [16,19]. Finally, the landslide susceptibility map can be produced according to the established model. According to the basic steps of landslide susceptibility mapping, this paper carries out a relative evaluation of the southeast of Helong City, Jilin Province, northeastern China. Firstly, according to the geological hazard survey (1:50,000) project, 83 landslides were mapped in the study. Then, the slope unit classified by the curvature watershed method was applied as the mapping unit, which can identify the horizontal surface and the inclined surface. Moreover, the area of the slope unit is more concentrated, the shapes mostly range between isosceles triangles and squares, and there are few elongated units. Next, ten influencing factors were selected. Artificial neural networks (ANN) and support vector machines (SVM) were introduced to build the landslide susceptibility model. Five-fold cross-validation, the receiver operating characteristic curve (ROC), and statistical parameters were used to optimize the model. Finally, the landslide susceptibility map was divided into five classes. This study can serve as a guide for landslide prevention and for future land planning in the southeast of Helong city.
2. Study Area The study area is in the Chongshan and Nanping counties of the southeast Helong city, Jinlin province, China, along the Tumen river. The study area is marked by valleys and rugged topography, and covers an area of 1476 km2 (Figure1a–c). More than 90% of the total area of Symmetry 2020, 12, 1047 3 of 23 the study area consists of mountainous areas. In the study area, the maximum elevation is 1450 m, and the minimum elevation is 350 m. The southern part of the study area is the Chinese and Korean quasi-platform, and the northern part is the Jihei fold system in the Tianshan-Xingan geosyncline fold area, which are bounded by the deep and large fault of Gudong River. The study area belongs to the temperate monsoon sub-humid climate zone. Based on rainfall data collected from 1960 to 2012, the maximum daily rainfall of the study area is 164.8 mm. The perennial average temperature is 5.6 ◦C. The vegetation coverage in the study area is relatively high. The seismic intensity of the study area has a degree of VI on the modified Mercalli index. At present, no earthquake-induced landslides have been detected in the study area. According to the field investigation, the landslides in the study area are mainly distributed along the Tumen River. According to a geological map (downloaded using the 91 Weitu software, with a scale of 1:500,000) (Figure1d), it can be seen that the mainly exposed strata in the study area are Quaternary (Q), Neogene (N), Cretaceous (K), Jurassic (J), Middle Proterozoic (Pt), and New Archean (Ar). The lithological information of the study area is listed in Table1. There are several reverse faults in the study area, which affect the regional stability. The Tumen river and its tributariesSymmetry flow through2020, 12, x FOR the PEER study REVIEW area, which has a significant impact on landslide risk.4 of 24
Figure 1. GeographicalFigure 1. Geographical position position and and landslide landslide inventory inventory of the study study area. area. (a) and (a,b (b):): geographicalgeographical position position of the study area; (c) landslide inventory of the study area; (c) geological map of the study of the study area; (c) landslide inventory of the study area; (d) geological map of the study area; area; (e) and (f) typical landslides of the study area. (e,f) typical landslides of the study area. 3. Methodology Figure 2 shows the three main steps of this paper, namely (a) data preparation, (b) landslide susceptibility modelling, and (c) validation and selection of the optimal models.
Symmetry 2020, 12, 1047 4 of 23
Table 1. Description of the Lithology.
System Code Lithology Quaternary Q Alluvial–Diluvial, Gravel, Sub-Sandy Soil, Sub-Clay, and Basalt Neogene N Sandstone, Conglomerate, and Siltstone with Basalt Cretaceous K Sandstone, Conglomerate, Siltstone with Limestone, and Oil Limestone Jurassic J Andesite and Tuff Middle Proterozoic Pt Marble New Archean Ar Black Cloud Amphibolic Granulite and Granulite
3. Methodology Figure2 shows the three main steps of this paper, namely (a) data preparation, (b) landslide susceptibility modelling, and (c) validation and selection of the optimal models. Symmetry 2020, 12, x FOR PEER REVIEW 5 of 24
Original data Influencing factors DEM Google images 5-cross-validation geological map Field surveying and repots Slope unit map
Data preparation
ANN model SVM model
Landslide susceptibility modeling
Statistical Analyzing Method Receiver Operating Characteristic Curve
Validation and selection of the optimal models
Figure 2.2. Flowchart of this study.
3.1.3.1. The Mapping Unit ReasonableReasonable mapping units shouldshould bebe determineddetermined beforebefore landslidelandslide susceptibilitysusceptibility mappingmapping isis carriedcarried outout [20[20].]. WhetherWhether the the mapping mapping unit unit is reasonableis reasonable or notor not directly directly affects affects the accuracythe accuracy of the of final the evaluationfinal evaluation result. result. Slope unitsSlope are units often are used often to studyused to landslide study landslide susceptibility susceptibility mapping duemapping to their due close to relationshiptheir close relationship with the topography. with the topography. The dividing The principle dividing of slopeprinciple units of is slope to divide units the is researchto divide area the intoresearch many area small into areas many with small diff erentareas sizeswith bydifferent cutting sizes along by the cutting ridge andalong valley the ridge lines [and21]. valley At present, lines hydrologic[21]. At present, analysis hydrologic is the most analysis used is method the most for us dividinged method slope for units. dividing However, slope units. this method However, cannot this identifymethod thecannot boundary identify between the boundary the horizontal between and the thehorizontal inclined and surface the inclined and generates surface a and large generates number ofa large parallel number river of networks, parallel river which networks, increases which the di ffiincreasesculty of the manual difficulty modification of manual [21 modification]. By comparing [21]. theBy comparing slope unit division the slope results unit division of the curvature results of watershed the curvature method watershed and the hydrologicmethod and analysis the hydrologic method, itanalysis was found method, that it the was slope found unit that divided the slope based unit on divided the curvature based on watershed the curvature method watershed has a uniform method size,has a auniform regular size, shape, a regular and small shape, terrain and small variation terrain inside, variation which inside, is obviously which is betterobviously than better the result than basedthe result on thebased hydrological on the hydrological method. Therefore, method. Ther we choseefore, thewe curvaturechose the curvature watershed watershed method to method divide theto divide slope the units slope in thisunits study. in this Slopestudy. units Slope are unit divideds are divided by valley by valley and ridge and ridge lines, lines, where where there there are abruptare abrupt changes changes in slope in slope angle angle and and slope slope aspect. aspect Therefore,. Therefore, slope slope units units can can be dividedbe divided according according to theto the change change of slopeof slope angle angle and and slope slope aspect. aspect. Profile Profile curvature curvature and and plan plan curvature curvature are are the the derivatives derivatives of slopeof slope angle angle and and slope slope aspect, aspect, respectively. respectively. Their Thei maximumr maximum and and minimum minimum values values cancan reflectreflect abruptabrupt changes of slope angle and slope aspect. Therefore, the curvature can be used to divide the slope units. The specific process of dividing slope units based on curvature is shown in Figure 3. It is mainly used to identify the boundary of concave terrain and convex terrain by the curvature and reverse curvature, respectively, to divide the slope units. The classification steps of the slope units can be divided into two parts: positive relief extraction and negative relief extraction. The original curvature was used to extract the positive relief, and the flip curvature was used to extract the negative relief, which can be obtained by multiplying the curvature by −1. Before calculating the curvature, to remove the influence of the roughness of the original Digital Elevation Model (DEM) on the curvature calculation results, focal statistics were made on the DEM with three pixels as the radius. After the flow direction and sink calculations, the positive and negative relief boundary can be obtained. The results of dividing slope units can be obtained by merging the positive and negative relief boundaries and manually modifying the unreasonable units.
Symmetry 2020, 12, 1047 5 of 23 changes of slope angle and slope aspect. Therefore, the curvature can be used to divide the slope units. The specific process of dividing slope units based on curvature is shown in Figure3. It is mainly used to identify the boundary of concave terrain and convex terrain by the curvature and reverse curvature, respectively, to divide the slope units. The classification steps of the slope units can be divided into two parts: positive relief extraction and negative relief extraction. The original curvature was used to extract the positive relief, and the flip curvature was used to extract the negative relief, which can be obtained by multiplying the curvature by 1. Before calculating the curvature, to remove the influence − of the roughness of the original Digital Elevation Model (DEM) on the curvature calculation results, focal statistics were made on the DEM with three pixels as the radius. After the flow direction and sink calculations, the positive and negative relief boundary can be obtained. The results of dividing slope units can be obtained by merging the positive and negative relief boundaries and manually modifying Symmetrythe unreasonable 2020, 12, x FOR units. PEER REVIEW 6 of 24
DEM
Focal statistics
Calculator Curvature Flip curvature
Flow direction map
Sink map
Catchment map Flip Catchment map
Raster to Polygon Polygon merging
Modify unreasonable cell
Slope units
Figure 3. ConstructionConstruction process process of of the slope units based on the curvature watershed method. 3.2. Landslide Inventory 3.2. Landslide Inventory Landslide inventory data is the basis of landslide susceptibility mapping. In order to grasp Landslide inventory data is the basis of landslide susceptibility mapping. In order to grasp the the spatial distribution characteristic information and develop a mechanism of landslide hazards spatial distribution characteristic information and develop a mechanism of landslide hazards in the in the study area, the Jilin team of the Geological Survey Center of China Industrial Building study area, the Jilin team of the Geological Survey Center of China Industrial Building Materials Materials carried out the geological hazard survey (1:50,000) project of Helong city. The characteristic carried out the geological hazard survey (1:50,000) project of Helong city. The characteristic geomorphological features of the landslides, such as the chair-like landform of the landslide, a color geomorphological features of the landslides, such as the chair-like landform of the landslide, a color difference between the landslide and the surrounding environment, and a change of vegetation before difference between the landslide and the surrounding environment, and a change of vegetation and after the landslide, can be clearly identified in a remote sensing image. Thus, the project team before and after the landslide, can be clearly identified in a remote sensing image. Thus, the project preliminarily identified 112 landslides in the study area using the visible optical remote sensing team preliminarily identified 112 landslides in the study area using the visible optical remote sensing technology and Google images. Then, during the field geological survey, the remote sensing technology and Google images. Then, during the field geological survey, the remote sensing interpretation results were reviewed, and 42 interpretation results were excluded (such as exposed interpretation results were reviewed, and 42 interpretation results were excluded (such as exposed steep cliffs of bedrock and a waste rock dump in a quarry) and the uninterpreted landslides were steep cliffs of bedrock and a waste rock dump in a quarry) and the uninterpreted landslides were supplemented. Finally, a total of a total of 83 landslides were mapped in the study area (Figure1c), supplemented. Finally, a total of a total of 83 landslides were mapped in the study area (Figure 1c), including soil and rock slope deformation, soil slide, and collapse. Figure1e,f are examples of including soil and rock slope deformation, soil slide, and collapse. Figure 1e,f are examples of the the landslides. Table A1 shows the basic information for the landslides. According to Table A1, landslides. Table A1 shows the basic information for the landslides. According to Table A1, the the landslides in the study area are mostly rock landslides, which are mainly developed in crystalline landslides in the study area are mostly rock landslides, which are mainly developed in crystalline rocks. These landslides are mainly controlled by the rock mass structural plane, and a small number of landslides are controlled by the contact surface between the overburden layer and the rock mass. The scale of landslide is mainly small and medium size. The failure mode of the landslides is mainly pull-type and toppling-type.
3.3. Influencing Factors The occurrence of a landslide is a complicated process. Therefore, the factors influencing the occurrence of a landslide are also varied. Pourghasemi and Rossi [13] reviewed a total of 220 related papers and found that the factors with the highest application frequency in landslide susceptibility mapping were slope angle, lithology, slope aspect, land use, distance to river, elevation, distance to faults, curvature, distance to road, and soil type, among others. The selection of influencing factors for the landslide susceptibility mapping in the study area should be based on the understanding of the characteristics of the landslide and the geological environment. Therefore, the relationship between the landslide and the geological environment in the study area was analyzed as follows.
Symmetry 2020, 12, 1047 6 of 23 rocks. These landslides are mainly controlled by the rock mass structural plane, and a small number of landslides are controlled by the contact surface between the overburden layer and the rock mass. The scale of landslide is mainly small and medium size. The failure mode of the landslides is mainly pull-type and toppling-type.
3.3. Influencing Factors The occurrence of a landslide is a complicated process. Therefore, the factors influencing the occurrence of a landslide are also varied. Pourghasemi and Rossi [13] reviewed a total of 220 related papers and found that the factors with the highest application frequency in landslide susceptibility mapping were slope angle, lithology, slope aspect, land use, distance to river, elevation, distance to faults, curvature, distance to road, and soil type, among others. The selection of influencing factors for the landslide susceptibility mapping in the study area should be based on the understanding of the characteristics of the landslide and the geological environment. Therefore, the relationship between theSymmetry landslide 2020, 12 and, x FOR the PEER geological REVIEW environment in the study area was analyzed as follows. 7 of 24
3.3.1.3.3.1. RelationshipRelationship betweenBetween GeologicalGeological EnvironmentEnvironment and Landslides Relationship between Topographic Features and Landslides Relationship Between Topographic Features and Landslides The topography is the basic condition of a landslide and determines the formation and development The topography is the basic condition of a landslide and determines the formation and of landslide to a great extent. For example, convex and bedding slopes are prone to landslides. Based development of landslide to a great extent. For example, convex and bedding slopes are prone to onlandslides. field investigation Based on andfield analysis, investigation it was and found analysis, that most it was landslides found that in the most study landslides area occurred in the instudy hills andarea low occurred mountains in hills with and an low elevation mountains range with of 200–1000 an elevation m and range a slope of angle200–1000m more and than a 30 slope◦ (Figure angle4). Furthermore,more than 30° the (Figure number 4). ofFurthermore, landslides increasedthe number with of landslides the increase increased of the slope with angle.the increase The number of the ofslope landslides angle. The in a number convex slopeof landslides was much in a largerconvex than slope that was in much concave larger slope. than The that micro-landform in concave slope. of theThe landslide micro-landform site was of a steepthe landslide slope or site steep wa coasts a steep (Table slope A1 or). steep coast (Table A1).
FigureFigure 4.4. ((a)) Elevation map; ( b)) slope slope angle angle map. map.
RelationshipRelationship between Between Lithological Lithological FeaturesFeatures andand LandslidesLandslides TheThe lithology lithology of of landslides landslides in in the the study study area area was was mainly mainly crystalline crystalline rocks, rocks, such assuch granite, as granite, diorite, anddiorite, other and magmatic other magmatic rocks (Table rocks A1 (Table). The A1). number The nu ofmber landslides of landslides in the in crystalline the crystalline rock inrock the in study the areastudy was area the was largest the and largest was and obviously was obviously higher than high thater inthan other that geotechnical in other geotechnical rock type distribution rock type areas.distribution This is areas. because This the is crystallinebecause the rock crystalline is usually rock strongly is usually influenced strongly influenced by joints, cracks,by joints, and cracks, faults. Theand presence faults. The of presence these structural of these planesstructural greatly planes reduces greatly the reduces shear strengththe shear ofstrength the rock of mass,the rock resulting mass, inresulting the sliding in the of thesliding upper of the rock upper mass rock or overlying mass or overlying loose rock. loose rock.
Relationship Between Geologic Features and Landslides The number of landslides in fault geological tectonic units was the largest in the study area. This is mainly because the in-situ stress of the fracture is stronger than that of other parts, and its influence range is larger and wider. The rock in these parts has deformations such as bending, squeezing, and tearing, which reduces the structural mechanical properties of the rock mass.
Relationship Between Rainfall Features and Landslides According to statistics, the landslides that occurred in the study area were mostly caused by heavy rainfall, and the two are positively correlated. Continuous heavy rainfall, particularly during the flood season, results in a high incidence of landslides.
Relationship Between Other Features and Landslides Landslides occurred more frequently in areas with low vegetation coverage in the study area. Landslides occurred extensively along the Tumen river, which is the main river in the study area. Earthquakes are an important trigger of landslides. They damage the integrity of rock and soil mass and make slopes prone to landslide. However, there is no historical record of landslides triggered by
Symmetry 2020, 12, 1047 7 of 23
Relationship between Geologic Features and Landslides The number of landslides in fault geological tectonic units was the largest in the study area. This is mainly because the in-situ stress of the fracture is stronger than that of other parts, and its influence range is larger and wider. The rock in these parts has deformations such as bending, squeezing, and tearing, which reduces the structural mechanical properties of the rock mass.
Relationship between Rainfall Features and Landslides According to statistics, the landslides that occurred in the study area were mostly caused by heavy rainfall, and the two are positively correlated. Continuous heavy rainfall, particularly during the flood season, results in a high incidence of landslides.
Relationship between Other Features and Landslides Landslides occurred more frequently in areas with low vegetation coverage in the study area. Landslides occurred extensively along the Tumen river, which is the main river in the study area. Earthquakes are an important trigger of landslides. They damage the integrity of rock and soil mass and make slopes prone to landslide. However, there is no historical record of landslides triggered by earthquakes in the study area. Thus, seismic activity has had little effect on the occurrence of landslides in the study area.
3.3.2. Selection of Influencing Factors Based on the above analysis, and the statistical results of Pourghasemi and Rossi [13], three groups of influencing factors were selected for this study: geologic factors, topographic factors, and environment factors. The geologic factors only consist of geology, while there were five topographic factors: elevation, slope angle, slope aspect, topographic relief, and curvature. Finally, there were four environment factors: land use, rainfall, distance to river, and distance to faults. In this study, all the influencing factor maps were extracted using the slope unit. Furthermore, the dominant category of lithology, land use, was estimated as the influencing factor value of each slope unit, with the average value used for the other factors.
3.3.3. Extraction of Influencing Factors
Geologic Factor The study area is covered with six types of geologic formations, composed of different lithologies pertaining to different geologic ages. The combination of these two characteristics is one of the most important factors influencing the occurrence of landslides [22]. The shear strength, weathering resistance, and crushing degree of different lithologies are greatly different [23]. In addition, the same lithology with different structural planes will have different effects on the stability of the slope. A rock mass with joint and fissure development is more prone to landslides than a complete rock mass. A highly weathered rock mass is also less stable than a fresh rock mass. In this study, the geology map was obtained based on a geological map with a scale of 1:500,000 (downloaded using 91 Weitu software); the geology map of the study area is shown in Figure5a. Symmetry 2020, 12, 1047 8 of 23 Symmetry 2020, 12, x FOR PEER REVIEW 9 of 24
Figure 5. Influencing factor maps of the study area: (a) lithology (downloaded using 91 Weitu software), (b)Figure elevation, 5. Influencing (c) slope angle, factor and maps (d) slopeof the aspect.study area: (a) lithology (downloaded using 91 Weitu software), (b) elevation, (c) slope angle, and (d) slope aspect. Topographic Factors Topographic Factors Elevation usually affects the depth of the water table aquifer [1,24]. In addition, the higher the elevation,Elevation the usually less human affects activitythe depth takes of the place, water which table aquifer is also more[1,24]. conduciveIn addition, to the the higher stability the of theelevation, slope. Regarding the less human the slope activity angle, takes a place, change which of slope is also angle more changes conducive the to original the stability stress of state the of slope. Regarding the slope angle, a change of slope angle changes the original stress state of the slope the slope [25]. Within a certain range of slope, the self-gravity stress and shear stress in the slope [25]. Within a certain range of slope, the self-gravity stress and shear stress in the slope generally generally increase with an increase of the slope angle, and the probability of slope instability increases increase with an increase of the slope angle, and the probability of slope instability increases accordingly. The intensity of light exposure, the type and extent of vegetation cover, and the supply of accordingly. The intensity of light exposure, the type and extent of vegetation cover, and the supply surface water vary greatly with differing slope aspect [2,26]. For example, the illumination time on a of surface water vary greatly with differing slope aspect [2,26]. For example, the illumination time on sunny slope is much longer than that on a shady slope; therefore, the temperature difference between a sunny slope is much longer than that on a shady slope; therefore, the temperature difference daybetween and night day on and a sunnynight on slope a sunny is also slope larger is also than larger that on than a shady that on slope a shady and theslope dry–wet and the cycle dry–wet is also faster.cycle In is thisalso case, faster. the In weathering this case, the strength weathering of the strength rock mass of the on rock a sunny mass slope on a sunny is larger slope than is thatlarger on a shadythan slope, that on which a shady reduces slope, the which strength reduces and stabilitythe strength of rock and and stability soil mass of rock on theand sunny soil mass slope, on in the turn increasingsunny slope, the probability in turn increasing of landslide the [4 ].probabilit The topographicy of landslide Relief—The [4]. The diff topographicerence between Relief—The the highest anddifference lowest Points—Can between the reflect highest the and degree lowest of Points—Can relief in a specific reflect area the [degree4]. Curvature of relief can in a reflect specific the area shape of[4]. a slope Curvature body. Accordingcan reflect tothe the shape shape of characteristics,a slope body. According a slope can to be the divided shape characteristics, into three forms: a slope convex slope,can be straight divided slope, into three and concaveforms: convex slope slope, [27]. Astraight DEM (Digitalslope, and Elevation concave Model)slope [27]. with A DEM a resolution (Digital of 10 Elevation10 m was Model) used with to extract a resolution the five of influencing10 × 10 m was factor used mapsto extract through the five the influencing slope unit factor using maps ArcGis × software.through The the fiveslope influencing unit using ArcGis factor mapssoftware. are shownThe five in influencing Figures5b–d factor and maps6a,b. are shown in Figure 5b–d and Figure 6a,b.
Symmetry 2020, 12, 1047 9 of 23 Symmetry 2020, 12, x FOR PEER REVIEW 10 of 24
FigureFigure 6. 6.Influencing Influencing factors factors maps maps of of the the studystudy area:area: ( (aa)) topographic topographic relief, relief, (b (b) curvature,) curvature, (c) ( cland-use,) land-use, andand (d )(d rainfall.) rainfall.
EnvironmentEnvironment Factors Factors VegetationVegetation has has a positivea positive e ffeffectect on on the the stabilitystability ofof aa slope, which can can enhance enhance the the resistance resistance of of thethe slope slope surface surface to to water water erosion erosion and and influence influence rainfallrainfall infiltration.infiltration. Rainfall Rainfall has has a asignificant significant impact impact onon slope slope risk. risk. Rainwater Rainwater infiltration infiltration willwill increaseincrease the gravity of of rock rock and and soil soil mass, mass, reducing reducing their their shearshear strength strength parameters, parameters, which which will will have have anan adverseadverse effect effect on slope slope stability. stability. The The distance distance to toriver river factorfactor has has an importantan important influence influence on the on occurrencethe occurre ofnce landslides of landslides [28]. The[28]. downcut The downcut of a river of increasesa river theincreases slope angle the alongslope angle the river. along To the adapt river. to To the adapt rapid to downcut, the rapid disastersdowncut, such disast asers landslides such as landslides often occur alongoften the occur river, along thus the reducing river, thethus slope reducing angle. the The slope existence angle. ofThe faults existence leads of to faults the development leads to the of jointsdevelopment and fissures of joints in the and surrounding fissures in rockthe surrounding mass, which rock leads mass, to fragmentation which leads to offragmentation the rock mass of andthe a reductionrock mass of weatheringand a reduction resistance of weathering [29]. Therefore, resistance many [29]. Therefore, geological many hazards geological such as hazards landslides such often as landslides often develop around deep and large fissures. These four influencing factors are extracted develop around deep and large fissures. These four influencing factors are extracted through the slope through the slope unit using ArcGis software. The five influencing factor maps are shown in Figure unit using ArcGis software. The five influencing factor maps are shown in Figure6c,d and Figure7a,b. 6c,d and Figure 7a,b.
Symmetry 2020, 12, 1047 10 of 23
Symmetry 2020, 12, x FOR PEER REVIEW 11 of 24
FigureFigure 7. 7.Influencing Influencing factors factors maps maps of of the the studystudy area:area: ( a)) distance distance to to river river and and (b (b) distance) distance to tofault. fault.
3.4.3.4. Landslide Landslide Susceptibility Susceptibility Modeling Modeling AtAt present, present, an increasingan increasing number number of models of models are being are applied being toapplied the study to the of landslide study of susceptibility landslide mappingsusceptibility [30–33 ].mapping Each of[30–33]. these modelsEach of hasthes itse models advantages has its and advantages disadvantages; and disadvantages; combined with thecombined complexity with of the the complexity factors aff ofecting the factors slope stabilityaffecting andslope the stability different and geologicalthe different environments geological in dienvironmentsfferent regions, in different it is impossible regions, to it applyis impossi one modelble to apply to the one study model of landslide to the study susceptibility of landslide in all areas.susceptibility Therefore, inin all the areas. relevant Therefore, evaluation in the releva of thent studyevaluation area, of the the evaluation study area, model the evaluation adopted model should beadopted optimized should to determine be optimized the mostto determine suitable the landslide most suitable susceptibility landslide evaluation susceptibility model. evaluation Recently, somemodel. machine Recently, learning some approachesmachine learning have approaches been developed have been with developed the aim ofwith evaluating the aim of the evaluating landslide susceptibilitythe landslide [21 susceptibility]. Due to its [21]. strong Due learning to its strong ability, learning fast calculation ability, fast speed, calculation and strong speed, fault and tolerance, strong thefault artificial tolerance, neural the network artificial (ANN) neural model network has (ANN) become model one ofhas the become most used one of machine the most learning used machine methods learning methods in landslide susceptibility mapping. By comparison, the support vector machine in landslide susceptibility mapping. By comparison, the support vector machine (SVM) model requires (SVM) model requires fewer modeling data and is suitable for binary classification problems. Thus, fewer modeling data and is suitable for binary classification problems. Thus, the SVM model may be the SVM model may be more suitable for landslide susceptibility prediction with fewer data, such as more suitable for landslide susceptibility prediction with fewer data, such as in the case of this study. in the case of this study. Therefore, we selected the artificial neural network (ANN) and support Therefore, we selected the artificial neural network (ANN) and support vector machine (SVM) models vector machine (SVM) models to optimize the evaluation model of landslide susceptibility in the to optimizestudy area. the evaluation model of landslide susceptibility in the study area. 3.4.1. Artificial Neural Network (ANN) 3.4.1. Artificial Neural Network (ANN) AsAs a a machine machine learninglearning method, method, the the artificial artificial neural neural network network model model has been has widely been widelyused in the used instudy the study of landslide of landslide susceptibility susceptibility mapping mapping [26,34,35]. [26 The,34 ,ANN35]. model The ANN has the model following has the advantages following advantages[36]: (a) better [36]: non-linear (a) better non-linearmapping capability; mapping (b) capability; highly self-learning (b) highly self-learningand adaptive; and (c) adaptive;strong (c)generalization strong generalization ability; and ability; (d) strong and (d)fault strong tolera faultnce. Thus, tolerance. the ANN Thus, model the can ANN simulate model the can complex simulate thenon-linear complex non-linearinteractions interactions between influencing between influencing factors and factors landslides and through landslides the through interaction the interactionbetween betweenneurons. neurons. Moreover, Moreover, in an ANN in an model ANN it model is not itnece is notssary necessary to describe to describethe interactions the interactions between factors between factorsby complex by complex mathematical mathematical formulae formulae [37]. [ 37The]. Thefitting fitting effect eff ectis relatively is relatively good, good, which which is isespecially especially suitablesuitable for for the the simulation simulation of complex of complex geological geological phenomena phenomena influenced influenced by many by many factors factors and presents and advantagespresents advantages for the simulation for the simulation of phenomena of phenom suchena as landslides,such as landslides, in which in variouswhich various factors factors interact andinteract havecomplex and have relationships. complex relationships. A complete A artificial complete neural artificial network neural model network is usually model composedis usually of ancomposed input layer, of an an output input layer, andan output one or layer, more and hidden one layers.or more Figure hidden8 shows layers. a schematicFigure 8 shows diagram a of aschematic simple artificial diagram neuralof a simple network artificial model. neural Each network layercontains model. Each several layer of contains the model’s several basic of units:the neurons.model’s The basic data units: processing neurons. and The storage data processing of an artificial and neural storage network of an artificial are represented neural network as the mutual are relationshipsrepresented and as the connections mutual relationships between neurons. and conn Throughections between training neurons. using samples, Through the training ANN changesusing samples, the ANN changes the weights of its internal connections to minimize the error between the the weights of its internal connections to minimize the error between the output value and the target output value and the target value, to achieve the purpose of accurate modeling. Each node in the value, to achieve the purpose of accurate modeling. Each node in the input layer corresponds to a input layer corresponds to a predictive variable, while the node in the output layer corresponds to predictive variable, while the node in the output layer corresponds to the target variable. A hidden the target variable. A hidden layer is a regular layer connecting the input layer and output layer, and
Symmetry 2020, 12, 1047 11 of 23 Symmetry 2020, 12, x FOR PEER REVIEW 12 of 24 thelayer number is a regular of the layerhidden connecting layers and the the input number layer of andnodes output in each layer, layer and determine the number the complexity of the hidden of thelayers network. and the The number process of of nodes forecasting in each with layer an determine ANN can the be complexity divided into of thea learning network. process The process and a predictionof forecasting process with [38]. an ANNThe learni canng be dividedprocess, intoin which a learning a large process number and of learning a prediction samples process and [the38]. iterativeThe learning function process, of the in network which a largeare used number to train of learning the network samples by andoptimizing the iterative the process function via of minimizingthe network the are usednetwork to train error, the includes network bythe optimizing forward transmission the process via of minimizing input information the network and error, the reverseincludes transmission the forward of transmission error [34]. The of prediction input information process is and the the process reverse of substituting transmission the of unknown error [34]. sampleThe prediction into the processmodel after is the training. process According of substituting to the the rules unknown of the learning sample into sample, the model the process after training. finally obtainsAccording the tooutput the rules result of theof learningthe sample sample, through the processthe forward finally transmission obtains the output of the resultinput ofinformation the sample [39].through the forward transmission of the input information [39].
1 Landslide
0 Non-Landslide Output layers
Input layers Hidden layers
FigureFigure 8. 8. SchematicSchematic diagram diagram of of a a simple simple artificial artificial neural neural network network (ANN) (ANN) model. model.
3.4.2.3.4.2. Support Support Vector Vector Machine Machine (SVM) (SVM) TheThe support support vector machine (SVM)(SVM) modelmodel isis alsoalso aa machine machine learning learning model. model. Its Its theoretical theoretical basis basis is isstatistical statistical learning learning theory theory [40 [40–43].–43]. The The SVM SVM model model has has the the following following advantages advantages [40, 44[40,44]:]: (a) low (a) datalow datavolume volume requirement; requirement; (b) strong (b) generalizationstrong generalization ability; (c)ability; strong (c) optimization strong optimization ability; (d) adaptabilityability; (d) adaptabilityto high-dimensional to high-dimensional samples; and samples; (e) strong and (e) learning strong abilitylearning and ability fast and convergence. fast convergence. It can realize It can realizethe linear the segmentationlinear segmentation of data of by data transforming by transfor eachming evaluation each evaluation index from index low from dimensional low dimensional space to spacehigh dimensionalto high dimensional space. Thus, space. it can Thus, analyze it can and analyze evaluate and non-linear evaluate problemsnon-linear in problems low dimensional in low dimensionalspace. Due to space. its low Due requirement to its low inrequirement terms of data in term volume,s of data it has volume, also been it widelyhas also used been in widely the study used of inlandslide the study susceptibility of landslide mapping.susceptibility The mapping. process of Th modelinge process for of landslidemodeling susceptibilityfor landslide susceptibility evaluation by evaluationSVM is summarized by SVM is assummarized follows [40 ,as44 ]:follows [40,44]: Consider some linearly separable data points x (i = 1, 2, ... ,n) that fall into two different classes Consider some linearly separable data points xii (i = 1, 2,…,n) that fall into two different classes y = 1. The goal of SVM is to find a hyperplane in n-dimensional data space which can separate the two yi i= ±1.± The goal of SVM is to find a hyperplane in n-dimensional data space which can separate the twoclasses classes of data of data based based on on the the maximum maximum interval. interval. The The hyperplane hyperplane cancan bebe expressed mathematically mathematically asas follows: follows: 1 2 L = w , (1) 𝐿=2||𝑤|||| || , (1) which should satisfy the following constraint conditions: which should satisfy the following constraint conditions:
yi((w xi) + b) 1, (2) 𝑦 ((𝑤∙𝑥· ) +𝑏)≥1≥ , (2) wherewhere ||ww||||is theis the norm norm of theof the normal normal vector vector of the of hyperplane,the hyperplane,b is ab scalar,is a scalar, and represents and represents the scalar the scalarproduct. product. Based Based on the on Lagrange the Lagrange multiplier, multiplier, the cost the function cost function can be can expressed be expressed as follows: as follows: