Next Article in Journal
Editorial for the Special Issue: “Human-Environment Interactions Research Using Remote Sensing”
Next Article in Special Issue
Detecting Urban Floods with Small and Large Scale Analysis of ALOS-2/PALSAR-2 Data
Previous Article in Journal
Convolutional Neural Networks for Automated Built Infrastructure Detection in the Arctic Using Sub-Meter Spatial Resolution Satellite Imagery
Previous Article in Special Issue
On Transfer Learning for Building Damage Assessment from Satellite Imagery in Emergency Contexts
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Evaluation and Prediction of Landslide Susceptibility in Yichang Section of Yangtze River Basin Based on Integrated Deep Learning Algorithm

1
Institute of Geophysics and Geomatics, China University of Geosciences, Wuhan 430079, China
2
Land Satellite Remote Sensing Application Center, Ministry of Natural Resources of the People’s Republic of China, Beijing 100048, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2022, 14(11), 2717; https://doi.org/10.3390/rs14112717
Submission received: 25 April 2022 / Revised: 25 May 2022 / Accepted: 2 June 2022 / Published: 6 June 2022

Abstract

:
Landslide susceptibility evaluation (LSE) refers to the probability of landslide occurrence in a region under a specific geological environment and trigger conditions, which is crucial to preventing and controlling landslide risk. The mainstream of the Yangtze River in Yichang City belongs to the largest basin in the Three Gorges Reservoir area and is prone to landslides. Affected by global climate change, seismic activity, and accelerated urbanization, geological disasters such as landslide collapses and debris flows in the study area have increased significantly. Therefore, it is urgent to carry out the LSE in the Yichang section of the Yangtze River Basin. The main results are as follows: (1) Based on historical landslide catalog, geological data, geographic data, hydrological data, remote sensing data, and other multi-source spatial-temporal big data, we construct the LSE index system; (2) In this paper, unsupervised Deep Embedding Clustering (DEC) algorithm and deep integration network (Capsule Neural Network based on SENet: SE-CapNet) are used for the first time to participate in non-landslide sample selection, and LSE in the study area and the accuracy of the algorithm is 96.29; (3) Based on the constructed sensitivity model and rainfall forecast data, the main driving mechanisms of landslides in the Yangtze River Basin were revealed. In this paper, the study area’s mid-long term LSE prediction and trend analysis are carried out. (4) The complete results show that the method has good performance and high precision, providing a reference for subsequent LSE, landslide susceptibility prediction (LSP), and change rule research, and providing a scientific basis for landslide disaster prevention.

1. Introduction

In recent years, affected by global climate change, seismic activities, and accelerated urbanization, geological disasters such as landslides, collapses, and debris flows have increased significantly [1]. As a common geological disaster, landslides cause severe economic losses and unfortunate casualties and seriously block transport lines and waterways [2,3]. Especially in recent years, accelerated global change and rapid urbanization and industrialization have increased the likelihood of landslides, leading to more casualties and property losses [4,5,6]. The mountainous and hilly landforms in the Yichang section of the Yangtze River Basin are widely distributed. The geological environment is very fragile, and there are many geological disasters such as landslides, collapses, and debris flow [7,8,9]. Especially along the Three Gorges Reservoir area, the development of geological disasters is more intense, which poses a threat to urban construction and residents’ production and life [10]. In recent years, accelerated global change, urbanization and industrialization have increased the likelihood of landslides, resulting in more casualties and property damage [11]. Landslide susceptibility evaluation (LSE) is the basis for landslide disaster risk assessment and prediction prevention, which can help relevant departments take preventive measures to reduce casualties and economic losses caused by landslides. Therefore, it is necessary to carry out LSE in the Yichang section of the Yangtze River Basin.
With the development of 3S technology, extensive data mining, and artificial intelligence, these technologies have been widely used in all walks of life [12,13,14,15,16,17]. LSE combined with data mining technology can significantly improve the efficiency of data acquisition, analysis, and processing and quickly and accurately establish a practical set of influencing factors to promote the efficient application of model methods [18,19]. Research on LSE is gradually divided into three categories: traditional regression analysis, machine learning and its integration method, and deep learning [20,21,22]. The traditional regression analysis methods mainly include frequency ratio, exponential entropy model, landslide density, logical regression, evidence weight, and Fisher discriminant analysis [23,24]. These methods generally use the landslide list as the prediction variable and establish a statistical regression model to predict the probability of landslide occurrence. However, there is a certain degree of subjectivity in factor selection and weight (or other parameters) allocation, so this method relies on expert experience to a certain extent [25]. Traditional machine learning methods and ensemble techniques mainly include Artificial Neural Networks, Random Forest (RF), Support Vector Machine, Classification, and Regression Tree Bag Algorithm [26,27,28]. These methods have essential similarities in selecting key influencing factors, reducing the influence of highly correlated factors on model generalization ability. In addition, these methods can support the comprehensive analysis of various influencing factors and better depict the nonlinear correlation between influencing factors and landslide susceptibility [29,30]. Therefore, they can achieve relatively high LSE accuracy. The most widely used deep learning methods include Convolutional Neural Networks (CNN) and Support Vector Machine, RF and Logistic Regression, CNN and Support Vector Machine, RF or Logistic Regression, Recurrent Neural Networks, CNN and Spatial Explicit Deep Learning Neural Networks [31,32,33]. Compared with traditional machine learning methods, deep learning has a more complex structure, which is more competitive in describing complex nonlinear problems [34]. In addition, due to solid learning strategies, deep learning can obtain better generalization ability than traditional machine learning.
However, in the LSE study, there is a problem that the accuracy of model training is susceptible to sample quality [35,36]. The samples include landslide samples with very high susceptibility or probability of 100% and non-landslide samples with shallow susceptibility or probability of 0 [37]. Landslides samples can be determined by field investigation and remote sensing interpretation of the landslide that has occurred, which is authentic. However, most non-landslide samples are randomly or subjectively selected, and the selected non-landslide grid cells cannot be well guaranteed to be “non-landslide” with high probability. Therefore, the correct selection of non-landslide samples is essential in ensuring sample quality and model accuracy [38]. Non-landslide samples are generally selected by manual screening and statistical methods, including random, buffer, slope, clustering. [39,40]. The random method is to generate non-landslide samples randomly outside the landslide range [41]; like the random method, the buffering method buffers the landslide and generates non-landslide samples in the buffer [42]; the slope method sets the slope threshold to generate non-landslide samples in areas less than the threshold [43]. However, the non-landslides selected by these methods are only subjective speculation or random selection of non-landslides, which cannot guarantee the low susceptibility of the selected non-landslides. Compared with the first three methods, the clustering method pre-classifies the study area according to certain rules and selects non-landslide samples in areas with extremely low susceptibility with high rationality [44]. The non-landslide samples selected by the clustering method are not very close to the landslide samples in space, which improves the quality of the non-landslide samples, avoids the over-fitting of the susceptibility model, and improves the evaluation accuracy.
To sum up, in order to overcome the lack of accuracy in the selection of non-landslide samples of landslides and to take into account a large number of influencing factors, comprehensive sources, and high dimensions, we have carried out three aspects of research on LSE: (1) Non-landslide samples selection based on DEC unsupervised learning; (2) Establishment of a landslide susceptibility evaluation model based on SE-CapNet deep learning technology; (3) Based on the analysis of the established LSE model and driving mechanism, mid- and long-term prediction of landslide susceptibility to rainfall under global change. Figure 1 shows the technical route of this paper.

2. Study Area and Data

2.1. Study Area

Mountainous and hilly areas in the Yangtze River Basin, interlaced plains and lakes, rich and varied topography, and abundant water resources are the basis for our survival and economic development [45]. Yichang is located in the junction of the middle and lower reaches of the Yangtze River at the lower end of the Three Gorges Reservoir area (Figure 2). The famous Three Gorges Dam is built in Yichang City [46,47]. Therefore, the research on resource protection and geological disaster prevention in the Yichang area of the Yangtze River Basin has significant practical value. The high mountains, hills, and plains in the Yichang area of the Yangtze River Basin have complex geological conditions and frequent geological disasters. It is one of the areas with severe geological disasters such as landslides, debris flow, collapse, and ground collapse in Hubei Province [48,49]. According to statistics, about 599 geological disasters occur in the study area [50,51]. Frequent geological disasters seriously threaten people’s lives and property safety and significantly affect social stability and local economic development [52]. Therefore, it is of great significance to strengthen the monitoring and early warning research of the geological disaster susceptibility evaluation in the Yichang section of the Yangtze River Basin if we are to protect people’s life and material property from loss and the safety of the Three Gorges Reservoir area.

2.2. Data

2.2.1. Landslide Inventory

As the first step of LSE, landslide investigation is necessary for modeling [53]. We used the global landslide catalog (GLC) opened by NASA as the data source, which can be downloaded and used free on Cooperative Open Online Landslide Repository (COOLR). The landslide catalog included 7556 records of training, testing, and prediction data. The data existed in the form of landslide points from 1996 to 2019. Among them, the landslide data in the Yichang section of the Yangtze River Basin contain 1270 records, and each record contains fields such as disaster type, county, township, village group, inducement, and area. Figure 2c shows the distribution of landslide data in the Yichang section of the Yangtze River Basin.

2.2.2. Data Sources

LSE predicts the probability of geological disasters in an area according to the historical disaster data given in the list. The data used usually include a variety of environmental factors and historical records of geological disasters [53,54]. Therefore, studying various influencing factors of landslide genesis is also necessary for LSM research. Based on previous LSM studies in the Yangtze River Basin, we use elevation data, primary geological and topographic data, and primary geographic data to produce LSE factors [55,56,57]. The ground resolution of the Landsat-8 remote sensing image is up to 15 m, and its wavelength range can be from 0.43 μ m to 2.29 μ m , including nine bands [58]. This paper mainly extracts soil vegetation index, land use, road, and water distance index factors based on Landsat-8 remote sensing image. Primary geological data from the National Geological Database is used to draw stratigraphic lithology and the geological structure fault index factor. Topographic data come from the free DEM products with a ground resolution of 12.5 m provided by Japan’s Advanced Land Observing Satellite (ALOS), which extracts slope, aspect, and topographic factors. Table 1 shows the data in this paper.

2.2.3. Indicator System

The geological conditions of the Yichang section in the Yangtze River Basin are complex, and the occurrence of landslides is affected by many factors. By analyzing the development law, temporal and spatial distribution characteristics of landslide disasters in the Yichang section of the Yangtze River Basin, it is known that the landslide disasters in the study area are mainly caused by the interaction of triggering factors such as topography and geomorphology conditions, vegetation coverage and human activities. According to previous research and survey results, 12 factors were selected from five categories: topography, introductory geology, hydrological and soil conditions, surface coating, and disaster-causing factors (Figure 3). Figure 4 shows a thematic map of the 12 factors in the study area.
(1)
Topography
The slope is considered the critical topographic factor that directly affects the slope stability [59]. The slope will affect the seepage process and stress field distribution. The statistical relationship between slope and landslide in the study area is shown in Figure 3a. About 80% of landslides are distributed in the range of 15–35°, and the frequency ratio is more significant than 100%, indicating that the landslide mainly occurs on the slope with medium slope.
Aspect is another crucial factor affecting the development of landslides, which will affect rainfall leakage and runoff and the absorption of solar radiation, thereby indirectly affecting the occurrence of landslides [60]. The slope direction of the study area is shown in Figure 3b. The frequency ratios of the sunny slope, semi-sunny slope, and flat slope are all greater than 1, indicating that the slope direction interval has a specific effect on landslide occurrence.
The landslide development is closely related to landform, and their relationship is shown in Figure 3c. The hilly area of the study area is the smallest, but the landslide disaster point is about 35%, and the frequency ratio is about 2. It shows that hills have the most significant impact on landslide development in the topography of the study area.
(2)
Geological factors
Formation lithology plays an essential role in developing landslides, which mainly affects the bedrock’s physical and mechanical properties and is an essential internal factor for landslide disasters [61]. The strata in the study area are exposed from Paleozoic to Quaternary, and the lithology can be summarized as hard rock, harder rock, hard-soft-integrated, weak rock, and extremely weak rock (Figure 3d). Spatial statistical analysis of disaster points and lithological layers shows that landslides are developed in all strata, but weak rock, and extremely weak rock and other low-strength strata are the most developed. The development density of hard-soft-integrated is second, and the distribution of landslides in hard rock is tiny.
The distance from the fault is also a common geological factor for LSE. The fractured zone formed by the fault control has a soft surface or weak zone locally, and the rock and soil strength are low and easy to be weathered and denuded. In particular, there are cut slopes caused by artificial activities at the fractures, which are prone to landslide disasters under the action of rainfall, weathering, and erosion. Therefore, the area close to the fault fracture zone can easily become the most developed area of regional geological disasters and hidden dangers within a specific range. Figure 3e shows that the area proportion and landslide proportion of the study area from the fault distance between 0.5–1.0 km are the highest, and the frequency ratio is greater than 1. We see that the closer the distance from the fault, the higher the probability of landslide occurrence.
(3)
Hydrological and soil conditions
The Yangtze River Basin is rich in water systems, and the river is densely distributed. When the river erodes at the bottom of the slope, the bottom of the slope will be filled with pore water, resulting in a decrease in slope stability. Therefore, the distance from the water system is also an essential factor for LSE in this study area. The results in Figure 3f further show that when the distance to the water system is less than 0.5 km, the maximum frequency ratio is 2.47, followed by 0.5–1.0 km, and the frequency ratio continues to decrease with the increase of the distance to the water body.
Due to the different domsoil, the characteristics of soil water content and viscosity are also different, and so the friction degree between the affected body and the surface of different soil types is significantly different. Figure 3g shows the soils in the study area, which are mainly summarized into eleven categories, including Haplic Luvisols (LVh), Haplic Alisols (ALh), Dystric Cambisols (CMd), Cumulic Anthrosols (ATc), Calcaric Regosols (RGc), Eutric Planosols (Ple), Inland Water (WR), Ferric Alisols (Alf), Calcaric Fluvisols (FLc), Eutric Fluvisols (Fle), and Humic Acrisols (Acu). Figure 3g shows that the area ratio and landslide ratio of the LVh are the largest, which were 36.7% and 32.3%, respectively, but the frequency ratio is only 0.8. The highest frequency ratio of RGc is 1.5, and the next highest frequency ratio of ALh is 1.3. Although the area of the two soil types and the proportion of landslides are not high, the occurrence of landslides is more frequent, indicating that it has an important impact on the development of landslides.
(4)
Surface coverage
The vegetation index has a specific influence on slope stability, and its roots can improve the shear strength of the soil. Leaf transpiration can promote groundwater discharge and soil slope protection. Therefore, generally, the denser the vegetation, the better the slope stability. It can be seen from Figure 3h that the area of NDVI in the study area was the highest between 0.8 and 1, accounting for about 50%. The landslide frequency ratio of NDVI < 0.6 is the highest, showing that landslide probability is higher in the low vegetation area.
(5)
Disaster-causing factors
Rainfall
Rainfall is one of the most critical factors causing slope landslides. Since the study area is in the subtropical climate zone with a mild and humid climate, abundant rainfall, continuous rainfall, and other related functions are the main causing factors for landslide development in this area. Based on the statistical analysis in Figure 3i, the average annual rainfall is the highest in the range of 1000–1300 mm, and the accumulation of rainfall will aggravate the occurrence of landslides.
Human engineering activities
With the development of the social economy, the scale and intensity of human activities have become larger and larger, and their speed has exceeded the development of natural geology, becoming a vast force affecting the development of landslides. Human activities such as urban construction, highway reconstruction, and mineral exploitation in the Yichang section of the Yangtze River Basin are directly or indirectly related to landslide development. Therefore, we used land use, distance from roads and distance from mines in LSE.
Land use types in the Yichang section of the Yangtze River Basin can be divided into five categories: cultivated land, water system, forest land, unused land, and buildings (Figure 3j). The statistical results show that the number of cultivated land landslides is the least, and the frequency ratio is only 0.18. The highest proportion of landslides in unused land is about 50%, and the frequency ratios to buildings are 1.37 and 1.41, respectively. The frequency of landslides in a specific area is the highest, indicating that landslides in woodland and buildings are the most developed.
The road is a kind of widely distributed artificial building. Road construction will interfere with earthwork to a certain extent and indirectly lead to slope instability. The frequency of landslides between different distances from the road in the study area is shown in Figure 3k. According to the statistical results, the landslide accounts for a relatively high proportion at 5 km and 30–50 km from the road. However, when it is farther away from the road, the frequency ratio does not appear to be significantly reduced, indicating that the road distance in the study area is not apparent for landslide development, which can be used as an indirect adjustment factor.
The Yichang section of the Yangtze River Basin is rich in mineral resources generated by intense geological activities in the geological history period, accounting for 62% and 51.2% of the discovered mineral types in the whole province and the whole country, respectively. The risk of geological disasters caused by mining is severe. The landslide frequency of the distance from the mine in the study area is shown in Figure 3l. We see that the landslide occurs most frequently in the study area with a distance less than 7 km from the mine, and the frequency ratio is as high as about 2.0, followed by the relatively developed within the range of 7–13 km. The smaller the distance from the mine, the lower the landslide frequency is, indicating that mining has played a crucial role in landslide development.

3. Methodology

3.1. Non-Landslide Samples Selection Network Based on DEC

To overcome the limitations of large randomness and strong human dependence in the selection of non-landslide samples in traditional methods, we use a typical unsupervised DEC to select non-landslide samples. DEC is composed of stacked autoencoders and soft allocation models [62,63]. DEC can learn latent feature representation and clustering allocation as a joint model of dimensionality reduction and clustering. Compared with K-means, Space Optimal Aggregation Model (SOAM), Density-Based Spatial Clustering of Applications with Noise (DBSACAN), and other clustering methods, the stacked automatic encoder architecture helps to reduce redundant information and high-dimensional noise, which is convenient to solve the high-dimensional problem of multi-source heterogeneous data [64]. As shown in Figure 5, the unsupervised DEC is composed of a fully connected stacked automatic encoder (SAE) and soft allocation model of T distribution measurement, which is trained by matching soft allocation with target distribution.
The specific process of non-landslide sample selection is as follows. First, the DEC neural network is used to cluster the landslide susceptibility in the study area systematically. Then the non-landslides are selected from the extremely low prone areas of the preliminary clustering results to ensure that the selected grid units have a very low probability of landslide occurrence. Finally, LSE model based on SE-CapNet network is constructed based on the sample data set composed of non-landslides classified and selected.

3.2. Capsule Neural Network Based on SENet

CNN can maintain the spatial invariance of input data or tolerate small spatial changes through operations in the convolution and pooling layers [65]. Nevertheless, it loses detailed information to a certain extent, especially for remote sensing images with rich detail texture information. Aiming at the problem of feature information loss and poor generalization ability caused by CNN classification of remote sensing images, we use an improved Capsule Neural Network model (CapNet) based on Squeeze-and-Excitation Model (SENet) model. CapNet used capsule to replace neurons in CNN, so that the network can retain detailed attitude information and hierarchical spatial relationship between objects and make up for the defects of CNN [66]. At the same time, the SENet model can obtain the importance of each feature channel through learning in the training process of CapNet, and it will automatically improve the valuable features according to the importance and suppress the features that are not very useful for the current task, to obtain more accurate susceptibility results [67]. As shown in Figure 6, the SE-CapNet network structure consists of the SENet feature extraction part, the Capsule neuron part, the dynamic routing part, and the classification capsule network part.
(1) SENet Feature Extraction Network
SENet won the first place in the ImageNet2017 classification task. It adds processing between adjacent two layers, making the information interaction between channels possible and further improving the accuracy of the network [68]. As shown in Figure 7, SENet mainly consists of two parts. In the Squeeze part, the global vision is obtained by increasing the sensory area while reducing the dimensionality of the image data. The Excitation section adds a fully connected layer to predict the importance of each channel, get the importance of different channels, then act on the compressed feature map, and finally input the feature map into the CapNet network.
(2) Capsule neurons
The purpose is to fuse the features extracted from the previous convolution layer and input them into the dynamic routing layer. The essence of the capsule neuron is similar to the convolution layer, and each capsule layer corresponds to different input layers. In the inner layer, the m-dimensional space of the input is mapped to the n-dimensional space of the output by weight matrix processing.
(3) Dynamic routing
The dynamic routing algorithm mechanism measures the similarity between input and output by calculating the dot product of the input and output of the capsule and then updates the routing coefficient bij of the neural network according to the dot product value [69]. Firstly, all bij is initialized to 0, and iterative calculation is started. Each iteration first calculates the cij value by softmax function and then combines Uij, wij, and cij to do linear summation to obtain Sj. Then Sj is input into the activation function Squash to obtain Vj. Finally, U j | i ^ and Vj is used to update the bij value. After all, calculations start the next iteration and use three iterations for best practices in practice. The update expression for bij is shown in Equation (1):
b i j b i j + U j | i ^ V j
The coupling coefficient cij is updated through the dynamic routing algorithm, but the other convolution parameters of the entire network and wij in the capsule need to be updated according to the loss function.
(4) Classification capsule
The multidimensional vector output by the classification capsule can be reduced in the fully connectional layer and converted from vector to scalar. In the capsule network, the fully connectional layer will integrate all the features obtained previously and enhance the robustness of the network. Multiple fully connectional layers can also increase the nonlinear expression ability of the network, which is more conducive to network learning. However, the number of fully connectional layers and the number of neurons in each layer will increase the number of parameters in the network and lead to over-fitting. Therefore, we set the fully connectional layer to three layers, and the number of neurons in the last layer should be consistent with the number of different categories in the classification results of the selected dataset. The vector value from the classification capsule layer is converted into scalar data in the fully connectional layer after the operation. After integration, it is mapped to n classification nodes to realize the spatial transformation of features and output the classification results.

3.3. Precision Evaluation Indicators

Usually, the magnitude of landslide and non-landslide samples is not the same. For such unbalanced learning problems, it is challenging to obtain ideal results by using classification accuracy alone to evaluate the model’s performance, leading to high accuracy and low recall rate [70,71]. Therefore, in this study, we used the four statistical indicators of accuracy, precision, susceptibility, and specificity, as well as the Receiver Operating Characteristic (ROC) curve and the curve below, and Area Under Curve (AUC) based on susceptibility and specificity to evaluate the performance of LSE model [72,73,74]. Among them, sensitivity is the proportion of landslide samples in the correct classification to all landslide samples; specificity is the proportion of non-landslide samples in the correct classification to all non-landslide samples. Equations (2)–(5) shows how the indicator is calculated.
Accurary = TP + TN TP + FP + TN + FN
Precision = TP TP + FP
Sensitivity = TP TP + FN
Specificity = TN FP + TN
TP (True Positive) and TN (True Negative) are the numbers of grids correctly classified, while FP (False Positive) and FN (False Negative) are the numbers of grids wrongly classified [75]. ROC curve is often used to evaluate the performance of diagnostic signals and prediction models, with sensitivity as the Y-axis and Specificity as the X-axis. AUC represents the ability of the model to predict landslide and non-landslide grids. When its value is 1, it represents the perfect model, while 0 represents the invalid model.

4. Results

4.1. Training Based on Integrated Deep Learning Algorithm

4.1.1. Non-Landslide Samples Set Selection

In this paper, the frequency ratios of 12 environmental factors in the index system are standardized as input variables of DEC, and the preliminary classification results are shown in Figure 8. The natural breakpoint method was used to classify the susceptibility results of DEC landslide, and five categories were obtained: extremely high, high, medium, low, and extremely low. The frequency ratio statistical results are shown in Table 2. It can be seen from Table 2 that the proportions of each grade of LSE are extremely low, low, medium, high and extremely high respectively. The extremely low susceptibility area accounts for 18.07% of the total area of the study area but only 8.46% of the total number of landslide grid units. At the same time, the extremely high and high landslide susceptibility areas in the study area contain about 62.11% of the landslide grid units, and the frequency ratio of the extremely high and high landslide susceptibility areas accounts for 59.52% of the total frequency ratio. The validity of the DEC-based LSE results has been shown.
Then the non-landslide grid cells were selected from the extremely low area of DEC preliminary LSE results. To verify the rationality of the non-landslide sample selection, some sample images were randomly selected and projected onto Google Earth (Figure 9). Figure 9(a1) and Figure 9(a2) show that landslide samples are generally close to roads and water systems, while Figure 9(a3) shows that landslides develop more widely in areas with frequent human activities. It can be seen from the image of the example in Figure 9b that the non-landslide samples are evenly distributed in urban areas, mountainous areas, water systems, and so on. With gentle terrain, dense vegetation, and less human activities. Combined with prior knowledge, topographic data and hydrological data, it can be seen that the locations of non-landslide samples and landslide samples are quite different, indicating that the selection method of non-landslide samples based on DEC clustering is reasonable.

4.1.2. Environment and Training Parameters

In this study, 5621 landslide points in the whole province were randomly selected as positive samples, and the training samples of SE-CapNet model were constructed by using the selected 10,242 non-landslide points. At the same time, the study area was selected as the verification sample of the model.
The experiment is carried out in a 16 GB Windows 64bit operating system. The intel CORE i59th Gen CPU is configured, and the GeForce GTX 1650TI card is mounted. We select TensorFlow as a learning framework, mainly relying on libraries such as TensorFlow, Keras, OpenCV, and PIL. The specific experimental environment configuration is shown in Table 3.

4.2. LSE Results from Integrated Deep Learning Algorithm

4.2.1. Accuracy Assessment and Algorithm Comparison

To verify the effectiveness of the proposed method, experiments were carried out on the constructed landslide data set in RF, CNN, CapNet, and the SE-CapNet deep learning algorithm integrated with this paper. Table 4 lists the precision index values used to assist in evaluating the model’s performance. From the table, it can be seen that in terms of sensitivity, the integrated algorithm in this paper obtains the maximum value (95.12%), followed by the convolution neural network CapNet algorithm (91.37%), CNN (89.02%), and RF (82.60%), which shows that the proportion of positive samples of landslide can be detected by this method is high. In terms of specificity, RF achieved the value (89.27%), followed by CNN (91.42%), CapNet (94.05%), and SE-CapNet (96.83%). In terms of accuracy and precision, the integrated model in this paper still achieves the maximum value: 96.06% and 96.82%. The proposed algorithm is superior to all models based on several important metrics, except that RF exceeds the specific value. It indicates that the method in this paper can meet the LSE.
The ROC curve analysis of the model is shown in Figure 10. We use AUC value as the primary criterion for evaluating various models. The comparison of AUC values showed that all application models performed well in LSE (accuracy > 0.8). Compared with the other three methods, the AUC value in this paper is the highest, reaching 0.973, indicating that the method in this paper has the best performance in LSE. For CapNet and CNN, their performance is as good as ever, 0.965 and 0.931, respectively. RF also showed good generalization ability, and its AUC value was as high as 0.897. The method has a strong classification ability, but it is not stable because the variables are randomly selected and reclassified before each model training. The comparison results of the above indicators show that the deep learning integration scheme makes the model produce a diversified learning process, effectively reduces the generalization error caused by various preferences, and improves the prediction ability of the vulnerability evaluation model.

4.2.2. Verification and Algorithm Comparison

According to the trained model, the data of the whole study area is used as the model’s input, and the final susceptibility results obtained by the integrated algorithm are shown in Figure 11d. Each grid point in the study area has the corresponding probability value of landslide susceptibility. The probability of landslide susceptibility is divided into five categories by using the natural breakpoint method: extremely low, low, medium, high, and extremely high. According to the evaluation result map, the area with high landslide susceptibility is consistent with the existing landslide area, mainly distributed in the Yangtze River Mainstream Region, and the landslide susceptibility on both sides of the watershed is generally high. The susceptibility evaluation results of RF, CNN, and CapNet are shown in Figure 11a–c. The graph shows that the regions with high susceptibility to the three methods are consistent with the distribution of known landslide points, which is consistent with the results of this method. However, the regions with high susceptibility in the final results of the above three methods are smaller than those of the artificial neural network.
At the same time, to further compare the performance of the four methods, the historical landslide disaster points in the study area are used to verify the results. The statistical table is shown in Figure 12. It can be seen from the table that 1270 landslide points in the known landslide points are located in the areas above the medium susceptibility predicted by the model, including 412 high susceptibility areas and 512 extremely high susceptibility areas, accounting for 73.05% of the total number of known landslides, and the susceptibility evaluation effect is good. Compared with this method, there are 471 landslide points in the high-prone areas of RF, accounting for 37.07% of the total landslides. There are 751 landslide points in the high-prone areas of CNN, accounting for 59.12% of the total number of landslides. There are 855 landslide points in CapNet, accounting for 67.35% of the total number of landslides. The above four methods have certain applicability in LSE, among which the method in this paper has the highest accuracy, indicating that the LSE model based on integrated deep learning has the best effect. This is because after the integration of SENet network, this method will obtain the importance of each feature channel through learning in the training process, and automatically improve the valuable features according to the importance while suppressing the features that are not very useful for the current task. As a result, the extremely high-prone area is large. At the same time, the DEC is used to make the recognition rate of unbalanced landslide samples higher, so the final result accuracy is higher than other algorithms.

5. Discussions

5.1. LSE Results and Influencing Factors

5.1.1. Topography

(1) Slope: In the extremely low, low, medium, high, and extremely high-prone areas in the study area, the slope of different regions is affected by the slope of different regions, and the slope of different intervals is distributed in each grade. In this paper, the most significant proportion of slope intervals in each grade is taken as the main influencing factor (other factors are used in the same statistical method). The statistical analysis results are shown in Table 5. It can be seen from the table that the slope ranges of the high and extremely high-prone areas in this study area are 15–25° and 25–35°, and the slope ranges of the low and extremely low prone areas are >35° and <5°, which is consistent with the statistical characteristics that the landslide mainly occurs on the slope with the medium slope in the evaluation index.
(2) Slope aspect: Slope aspect factors mainly control the degree of rock weathering, which is one of the parameters that cause the development characteristics of geomorphological differences. According to the statistical results, the areas with high and extremely high susceptibility are mainly affected by the West, Southwest, North, Northwest, and Northeast, indicating that the slope direction interval has a specific effect on landslide occurrence.
(3) Geomorphology: The study area in this paper generally forms three basic geomorphologic types, namely, mountains, hills, and plains. The mountains and hills are mainly distributed in the northwest of the study area, and the central and southeast of the study area are mainly plains. The development of landslides is closely related to topography. The hilly landform area in the study area is the smallest, but the proportion is the highest in the highly prone area to landslides, followed by low and moderate mountains.

5.1.2. Geological Factors

(1) Engineering rock group: The stratigraphic lithology in the geological environment is divided into five types according to the lithologic combination: hard-soft-integrated, weak rock, extremely weak rock, harder rock, hard rock. The extremely weak rock group in the study area is mainly distributed in the southeast, mainly composed of Quaternary clay and sandy clay, and the hard rock group composed of volcanic rocks and metamorphism is mainly distributed in the central and northern parts of the study area. After statistical analysis, the high-prone areas and high-prone engineering rock groups in the study area are mainly hard-soft-integrated and Weak rock, which are distributed in relatively weak or weak rock groups such as Quaternary clay, Cretaceous Glutenite and Devonian shale and limestone. In low and very low susceptibility areas, engineering rock groups are mainly harder rock and hard rock, and hard or hard engineering rock groups such as volcanic rock, metamorphic rock, dolomite, or limestone. The engineering rock group plays an essential role in developing landslides, mainly by affecting the physical and mechanical properties of bedrock and the accumulation body.
(2) Faults: The study area is located in the composite part of the southern section of the third uplift belt of the first-order structure of the Neocathaysian system and the Huaiyangshan type structure system in geological structure, and the fault structure is mainly developed in the northern and northwestern parts of the study area. The closer the distance from the fault, the larger the statistical area of the extremely high and high-prone areas is, indicating that the closer the fault, the more unstable is the rock, and the looser the soil, the higher is the possibility of a landslide.

5.1.3. Hydrology and Rainfall

(1) Hydrology: The Yangtze River is the main river in Yichang City. The river network is dense, and the water is abundant. The distance from the water body can be used to express the information about river development and river basin erosion, which is an essential factor in regional ecological stability. The extremely high and high-prone areas in this study area showed a high correlation with the distance from the river, and the area less than 0.5 km was the most prone, followed by the 0.5–1.5 interval. The greater the distance from the river, the lower the susceptibility.
(2) Rainfall: The rainfall index factor in the ecological environment is the crucial factor affecting the shear strength of engineering slope in evaluating geological environment carrying capacity. The rainfall in the study area varies from 1000 mm to 1500 mm, and the rainfall in the Northwest and Northeast is less than 1000 mm. The middle and south area is more prominent, about 1300 mm, and shows a trend of more in the middle and less on both sides. After statistical analysis, the areas with extremely high and high susceptibility in this study area are mainly distributed in areas more fabulous than 1200 mm.

5.1.4. Landcover

(1) NDVI: The results of LSE in the study area are approximately positively correlated with the distribution of NDVI. The NDVI index of high and high-prone areas is generally 0.2–0.6, and the NDVI index of low and very low prone areas is generally between B0.8–1.0 and 0–0.2.
(2) Soil types: The soil types in the study area are complex. In general, Dystric Cambisols (CMd) and Calcaric Regosols (RGc) are the leading soil carriers in the high-risk areas of the study area, while Eutric Planosols (Ple), Calcaric Fluvisols (FLc), Eutric Fluvisols (Fle) and Humic Acrisols (Acu) do not use landslide development. The susceptibility is low.
(3) Land use: Land use classification is a unit to distinguish the spatial and geographical composition of land use, showing the way and results of land use and transformation and reflecting the form and use of land. The construction land in the study area is mainly distributed in the Yangtze River and its tributaries, and woodland accounts for more than 69% of the total area. When the land use types in the study area are Building and Unutilized land, the susceptibility is high.

5.1.5. Human Engineering Activity

(1) Road: The relevant factors of human engineering activities are negative factors for LSE. In road construction, the damage to the slope and after the completion of road construction, a series of transportation processes make the surrounding geological environment destroyed. The central and eastern parts of the road distribution are dense, the road density in the region is large, and the susceptibility is the highest when the distance from the road is between 2.0 and 5.0.
(2) Mines: The study area is rich in mineral resources, and mining activities are becoming increasingly intense, which has caused a certain degree of damage to topography. Waste rock and slag piles, tailings ponds, dumps, and transit sites will also produce environmental problems such as land occupation, and waste gas and wastewater will also be generated in production activities. The significant influence of the susceptibility is the highest nearest distance from the mine, and the susceptibility gradually decreases with the increase of distance.
According to the above statistical analysis, when the slope is 15–25°, the slope direction is West, Southwest, and North. The topography is Hill, the lithology is hard-soft-integrated, the distance from the fault is 0.5–1.0, the distance from the water system is less than 0.5, the soil types are CMd and RGc, the NDVI is 0.4–0.6, the rainfall is 1200–1400, the distance from the road is 2.0–5.0, and the distance from the mine is less than 7.0, the landslide disaster is most likely to occur in this study area.

5.2. LSE Driving Mechanism

5.2.1. Rainfall

Rainfall is one of the most critical factors causing slope landslides [76,77]. Since the study area is located in the subtropical climate zone with a mild and humid climate, abundant rainfall, and heavy rain or continuous rainfall, rainfall and other related functions are one of the main external forces for slope deformation and failure in this area. The rainfall driving factors in the study area can be mainly divided into early adequate rainfall and current instantaneous rainfall.
(1) Pre-effective rainfall: pre-effective rainfall is adequate before landslide formation and ultimately retained in the soil [78]. In the early stage, adequate rainfall enters the soil and remains stagnant. This rainfall process produces changes in pore water pressure and affects the stability of the slope. For soil landslides, rainfall is easy to penetrate the slope because of its loose composition. Accordingly, soil water content increases bulk density while reducing the shear strength of the sliding surface, and it is easy to induce a landslide.
(2) Instantaneous rainfall: The effect of rainfall on landslide is mainly manifested in that a large amount of rainwater infiltrates, resulting in the saturation of the soil and rock layer on the slope, and even the accumulation of water on the aquifuge below the slope, thereby increasing the weight of the landslide and reducing the shear strength of the soil and rock layer or the deformation characteristics of accumulation landslide induced by the rise of library water level [79].

5.2.2. Human Engineering Activity

The basic idea of land use prediction is to analyze and detect the driving factors of land use change between different land use types according to the characteristics of historical land use distribution and to predict the land use distribution in a certain period by using the law of land use change in the past and the demand of land use in the future. The current land use situation reflects the trend of landslides under the interference of the geological environment on human economic and social activities. The magmatic activity in the study area formed a variety of acidic to ultrabasic magmatic rocks in the pre-Sinian period and produced a series of metamorphic rock series. In addition, the tectonic activity is vigorous, and Yichang City is rich in mineral resources. Mineral development, road construction, and other engineering activities cause land occupation and topographic and geomorphological damage, which dramatically impacts the geological environment, making the LSE gradually increasing trend [80].

5.3. Landslide Susceptibility Prediction

It can be seen from the above that, besides the relatively static disaster-causing factors, the landslide susceptibility is also significant for its development. A landslide disaster is a dynamic evolution system. In the context of climate changes, human engineering activities, and other induced factors, the susceptibility of landslides itself will also change accordingly. Therefore, it is a new challenge to analyze the impact of changes in induced factors on landslide susceptibility and predict the trend of long-term landslide susceptibility in response to global changes. Therefore, based on the above LSE, this chapter predicts the rainfall information of the study area in 2035 and 2055 through rainfall change analysis and takes the future rainfall as a new influencing factor of landslide susceptibility analysis to predict and analyze the trend of long-term landslide susceptibility.
The original rainfall data in this paper comes from the Chinese meteorological data website. There is a high correlation between the rainfall prediction value and the historical rainfall by the atmospheric circulation model–AGCM model [81,82]. Therefore, based on the annual rainfall from 2000–to 2019 (as shown in Figure 13), we used the AGCM model to predict the annual rainfall in 2035 and 2055. The results are shown in Figure 14. The spatial distribution shows a gradual decrease in rainfall from west to east. In 2055, the overall annual rainfall increased, and the spatial distribution trend was the opposite in 2035. The rainfall showed a trend of high in the east and low in the west. Overall, in the early 21st century, the annual rainfall in the region has decreased and will increase in the middle and late stages. The trend is consistent with the impact assessment report of climate change results in the Yichang area of the Three Gorges Reservoir area.
Taking the rainfall prediction data as a new influencing factor and based on the vulnerability model established in this paper, the LSE prediction results in 2035 and 2055 can be obtained (Figure 15). The high susceptibility areas in 2035 are still concentrated in the coastal areas of the Yangtze River Basin and the main tributaries, while the low susceptibility areas are mainly distributed in high mountains and woodland areas far from rivers and low human activities. Figure 15 shows that the landslide susceptibility in 2050 has a significant change compared to 2030. The landslide susceptibility in Zigui area in the western part of the study area and Zhijiang area in the eastern part of the study area is indigenous. The landslide susceptibility in the southern part of Zigui will decrease, while the landslide susceptibility in the southeast of Zhijiang will increase. In general, the future landslide susceptibility in the study area has an overall increasing trend with the change of rainfall, while it will decrease in some areas. In the early stage, the landslide susceptibility is mainly concentrated in the western Zigui area, while in the late stage, it is transferred to the eastern Zhijiang area.
The results show that the landslide disaster system is a dynamic evolution system. Under the background of climate, human engineering activities, and other environmental factors, landslide susceptibility will also change accordingly. Therefore, analyzing the impact of environmental factors on landslide susceptibility and predicting the trend of long-term landslide susceptibility can play a guiding role in dealing with the new challenges faced by landslide disaster prevention and mitigation under the conditions of global change.

6. Conclusions

We construct an index system to simulate the landslide susceptibility in the Yichang section based on multi-source spatio-temporal big data such as historical landslide catalog, geological data, geographical data, hydrological data, and remote sensing data of the Yangtze River Basin. For the first time, DEC and SE-CapNet deep integration networks are involved in selecting landslide samples and the training of susceptibility evaluation, revealing the primary driving mechanism of landslides in the Yangtze River Basin. At the same time, based on the constructed susceptibility model and rainfall prediction data, we carried out the mid-long term LSP and trend analysis in the study area. The conclusions and results of this paper are as follows:
(1) Based on the SE-CapNet vulnerability evaluation model after DEC non-landslide samples selection, we ensure the quality of non-landslide samples selection, retains the hierarchical relationship between factors, and automatically learns the importance of factors to enhance valuable features. Therefore, the experimental results are better than other models. The four precision index values of sensitivity, specificity, accuracy, and AUC all reach the highest values in the method comparison, which are 95.12%, 96.83%, 96.06%, and 97.3%, respectively, showing the best performance in LSE.
(2) Based on the SE-CapNet susceptibility results, the study area’s hazard-causing factors and hazard-causing factors were extracted and statistically analyzed. The effects of each factor on the landslide susceptibility in the study area were evaluated, providing a reference for the subsequent LSE and variation study and providing a scientific basis for the prevention and control of landslide disasters.
(3) Based on the predicted future rainfall data as a new factor for LSE, we carried out the prediction and variation trend analysis of medium and long-term landslide susceptibility in the Yichang section of the Yangtze River Basin. The results show that with the change in rainfall in global change, the landslide susceptibility will also change accordingly. In other words, the landslide susceptibility in the study area will increase while it will decrease in some areas. In the early stage, the susceptibility is mainly concentrated in the eastern part, and in the late stage, it will be transferred to Zigui area.
Overall, this method has good performance and high precision, providing a reference for subsequent landslide susceptibility mapping, prediction and change rule research, and providing a scientific basis for landslide disaster prevention. However, many environmental and social factors affect the changes of future climate and human engineering activities. There is a certain degree of uncertainty in the future scenario predicted based on its historical change rule. Therefore, the uncertainty of predicting landslide disaster risk remains inevitable under the background of future changes.

Author Contributions

Data curation, C.W.; Formal analysis, R.Z.; Funding acquisition, R.Z.; Methodology, L.C.; Writing—original draft, L.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Due to the nature of this research, participants of this study did not agree for their data to be shared publicly, so supporting data is not available.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Reichenbach, P.; Rossi, M.; Malamud, B.D.; Mihir, M.; Guzzetti, F. A review of statistically-based landslide susceptibility models. Earth-Sci. Rev. 2018, 180, 60–91. [Google Scholar] [CrossRef]
  2. Huang, Y.; Zhao, L. Review on landslide susceptibility mapping using support vector machines. Catena 2018, 165, 520–529. [Google Scholar] [CrossRef]
  3. Yuan, L.; Xudong, Y.; Hui, M. National Analysis Report on Casualties Caused by Sudden Geological Hazards. Chin. J. Geol. Hazards Prev. 2006, 17, 146–147. [Google Scholar]
  4. Yan, L.; Xu, W.; Wang, H.; Wang, R.; Meng, Q.; Yu, J.; Xie, W.C. Drainage controls on the Donglingxing landslide (China) induced by rainfall and fluctuation in reservoir water levels. Landslides 2019, 16, 1583–1593. [Google Scholar] [CrossRef]
  5. Yueping, Y. Preliminary study on the mitigation strategy of geological disasters in China. J. Geol. Disasters Prev. 2004, 15, 1–8. [Google Scholar]
  6. Chuanzheng, L.I.U.; Chunli, C. Achievements and countermeasures in risk reduction of geological disasters in China. Eng. Geol. 2020, 28, 375–383. [Google Scholar]
  7. Gao, Y.; Cao, G.; Ni, P.; Tang, Y.; Liu, Y.; Bi, J.; Ma, Z. Natural hazard triggered technological risks in the Yangtze River Economic Belt, China. Sci. Rep. 2021, 11, 13842. [Google Scholar] [CrossRef] [PubMed]
  8. Wang, J.; Schweizer, D.; Liu, Q.; Su, A.; Hu, X.; Blum, P. Three-dimensional landslide evolution model at the Yangtze River. Eng. Geol. 2021, 292, 106275. [Google Scholar] [CrossRef]
  9. Zhang, L.; Xiao, T.; He, J.; Chen, C. Erosion-based analysis of breaching of Baige landslide dams on the Jinsha River, China, in 2018. Landslides 2019, 16, 1965–1979. [Google Scholar] [CrossRef]
  10. Lu, S.; Yi, Q.; Yi, W.U.; Zhang, G.; He, X. Study on dynamic deformation mechanism of landslide in drawdown of reservoir water leveltake Baishuihe landslide in Three Gorges Reservoir area for example. J. Eng. Geol. 2014, 22, 869–875. [Google Scholar]
  11. Wang, F.; Li, T. Landslide Disaster Mitigation in Three Gorges Reservoir, China; Springer: Berlin, Germany, 2009. [Google Scholar]
  12. Tan, K.; Qiao, J. Development history and prospect of remote sensing technology in coal geology of China. Int. J. Coal Sci. Technol. 2020, 7, 311–319. [Google Scholar] [CrossRef]
  13. Ranjan, A.K.; Anand, A.; Vallisree, S.; Singh, R.K. LU/LC change detection and forest degradation analysis in Dalma wildlife sanctuary using 3S technology: A case study in Jamshedpur-India. Aims Geosci. 2016, 2, 273–285. [Google Scholar] [CrossRef]
  14. Velickov, S.; Solomatine, D.P.; Yu, X.; Price, R.K. Application of data mining techniques for remote sensing image analysis. In Proceedings of the 4th International Conference on Hydroinformatics, Cedar Rapids, IA, USA, 23–27 August 2000. [Google Scholar]
  15. Lary, D.J.; Alavi, A.H.; Gandomi, A.H.; Walker, A.L. Machine learning in geosciences and remote sensing. Geosci. Front. 2016, 7, 3–10. [Google Scholar] [CrossRef] [Green Version]
  16. Mellit, A.; Kalogirou, S. Artificial intelligence and internet of things to improve efficacy of diagnosis and remote sensing of solar photovoltaic systems: Challenges, recommendations and future directions. Renew. Sustain. Energy Rev. 2021, 143, 110889. [Google Scholar] [CrossRef]
  17. Haefner, N.; Wincent, J.; Parida, V.; Gassmann, O. Artificial intelligence and innovation management: A review, framework, and research agenda✰. Technol. Forecast. Soc. Chang. 2021, 162, 120392. [Google Scholar] [CrossRef]
  18. Hong, H.; Pourghasemi, H.R.; Pourtaghi, Z.S. Landslide susceptibility assessment in Lianhua County (China): A comparison between a random forest data mining technique and bivariate and multivariate statistical models. Geomorphology 2016, 259, 105–118. [Google Scholar] [CrossRef]
  19. Lee, S.; Lee, M.J.; Jung, H.S. Data mining approaches for landslide susceptibility mapping in Umyeonsan, Seoul, South Korea. Appl. Sci. 2017, 7, 683. [Google Scholar] [CrossRef] [Green Version]
  20. Shano, L.; Raghuvanshi, T.K.; Meten, M. LSE and hazard zonation techniques—A review. Geoenviron. Disasters 2020, 7, 1–19. [Google Scholar] [CrossRef]
  21. Pourghasemi, H.R.; Teimoori Yansari, Z.; Panagos, P.; Pradhan, B. Analysis and evaluation of landslide susceptibility: A review on articles published during 2005–2016 (periods of 2005–2012 and 2013–2016). Arab. J. Geosci. 2018, 11, 193. [Google Scholar] [CrossRef]
  22. Chen, Y.; Dong, J.L.; Guo, F.; Tong, B.; Zhou, T.; Fang, H.; Wang, L.; Zhan, Q.H. Review of landslide susceptibility assessment based on knowledge mapping. Stoch. Environ. Res. Risk Assess. 2022, 2022, 1–19. [Google Scholar]
  23. Süzen, M.L.; Doyuran, V. A comparison of the GIS based landslide susceptibility assessment methods: Multivariate versus bivariate. Environ. Geol. 2004, 45, 665–679. [Google Scholar] [CrossRef]
  24. Lee, S.; Min, K. Statistical analysis of landslide susceptibility at Yongin, Korea. Environ. Geol. 2001, 40, 1095–1113. [Google Scholar] [CrossRef]
  25. Ercanoglu, M. An Overview on the Landslide Susceptibility Assessment Techniques. In Proceedings of the 1st WSEAS International Conference on Environmental and Geological Science and Engineering (EG’08), Malta, 11–13 September 2008. [Google Scholar]
  26. Marjanović, M.; Kovačević, M.; Bajat, B.; Voženílek, V. Landslide susceptibility assessment using SVM machine learning algorithm. Eng. Geol. 2011, 123, 225–234. [Google Scholar] [CrossRef]
  27. Pham, B.T.; Pradhan, B.; Bui, D.T.; Prakash, I.; Dholakia, M.B. A comparative study of different machine learning methods for landslide susceptibility assessment: A case study of Uttarakhand area (India). Environ. Model. Softw. 2016, 84, 240–250. [Google Scholar] [CrossRef]
  28. Thai Pham, B.; Shirzadi, A.; Shahabi, H.; Omidvar, E.; Singh, S.K.; Sahana, M.; Talebpour Asl, D.; Bin Ahmad, B.; Kim Quoc, N.; Lee, S. Landslide susceptibility assessment by novel hybrid machine learning algorithms. Sustainability 2019, 11, 4386. [Google Scholar] [CrossRef] [Green Version]
  29. Kavzoglu, T.; Colkesen, I.; Sahin, E.K. Machine learning techniques in landslide susceptibility mapping: A survey and a case study. In Landslides: Theory, Practice and Modelling; Springer: Berlin, Germany, 2019; pp. 283–301. [Google Scholar]
  30. Marjanovic, M.; Bajat, B.; Kovacevic, M. Landslide susceptibility assessment with machine learning algorithms. In Proceedings of the International Conference on Intelligent Networking and Collaborative Systems, Barcelona, Spain, 4–6 November 2009; pp. 273–278. [Google Scholar]
  31. Bui, D.T.; Tsangaratos, P.; Nguyen, V.T.; Van Liem, N.; Trinh, P.T. Comparing the prediction performance of a Deep Learning Neural Network model with conventional machine learning models in landslide susceptibility assessment. Catena 2020, 188, 104426. [Google Scholar] [CrossRef]
  32. Xiao, L.; Zhang, Y.; Peng, G. Landslide susceptibility assessment using integrated deep learning algorithm along the China-Nepal highway. Sensors 2018, 18, 4436. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Habumugisha, J.M.; Chen, N.; Rahman, M.; Islam, M.M.; Ahmad, H.; Elbeltagi, A.; Sharma, G.; Liza, S.N.; Dewan, A. Landslide susceptibility mapping with deep learning algorithms. Sustainability 2022, 14, 1734. [Google Scholar] [CrossRef]
  34. Azarafza, M.; Azarafza, M.; Akgün, H.; Atkinson, P.M.; Derakhshani, R. Deep learning-based landslide susceptibility mapping. Sci. Rep. 2021, 11, 1–16. [Google Scholar] [CrossRef] [PubMed]
  35. Falaschi, F.; Giacomelli, F.; Federici, P.R.; Puccinelli, A.; D’Amato Avanzi, G.; Pochini, A.; Ribolini, A. Logistic regression versus artificial neural networks: LSE in a sample area of the Serchio River valley, Italy. Nat. Hazards 2009, 50, 551–569. [Google Scholar] [CrossRef]
  36. Kornejady, A.; Ownegh, M.; Bahremand, A. Landslide susceptibility assessment using maximum entropy model with two different data sampling methods. Catena 2017, 152, 144–162. [Google Scholar] [CrossRef]
  37. Huang, F.; Cao, Z.; Jiang, S.H.; Zhou, C.; Huang, J.; Guo, Z. Landslide susceptibility prediction based on a semi-supervised multiple-layer perceptron model. Landslides 2020, 17, 2919–2930. [Google Scholar] [CrossRef]
  38. Huang, F.M.; Yin, K.L.; Jiang, S.H.; Huang, J.S.; Cao, Z.S. Landslide susceptibility evaluation based on cluster analysis and support vector machine. Chin. J. Rock Mech. Eng. 2018, 37, 156–167. [Google Scholar]
  39. Nefeslioglu, H.A.; Gokceoglu, C.; Sonmez, H. An assessment on the use of logistic regression and artificial neural networks with different sampling strategies for the preparation of landslide susceptibility maps. Eng. Geol. 2008, 97, 171–191. [Google Scholar] [CrossRef]
  40. Mingoti, S.A.; Lima, J.O. Comparing SOM neural network with Fuzzy c-means, K-means and traditional hierarchical clustering algorithms. Eur. J. Oper. Res. 2006, 174, 1742–1759. [Google Scholar] [CrossRef]
  41. Melchiorre, C.; Matteucci, M.; Azzoni, A.; Zanchi, A. Artificial neural networks and cluster analysis in landslide susceptibility zonation. Geomorphology 2008, 94, 379–400. [Google Scholar] [CrossRef]
  42. Xu, S.H.; Liu, J.P.; Wang, X.H.; Zhang, Y.; Lin, R.F.; Zhang, M.; Liu, M.; Jiang, T. Landslide Susceptibility Assessment Method Incorporating Index of Entropy Based on Support Vector Machine: A Case Study of Shaanxi Province. Geomat. Inf. Sci. Wuhan Univ. 2020, 45, 1214–1222. [Google Scholar]
  43. Kavzoglu, T.; Sahin, E.K.; Colkesen, I. Landslide susceptibility mapping using GIS-based multi-criteria decision analysis, support vector machines, and logistic regression. Landslides 2014, 11, 425–439. [Google Scholar] [CrossRef]
  44. Shuai, B.; Jiping, L.; Liang, W. Evaluation of landslide susceptibility combined with DBSCAN clustering sampling and SVM classification. Disaster Prev. Technol. 2021, 16, 12. [Google Scholar]
  45. Jiang, Y.H.; Lin, L.J.; Ni, H.Y.; Ge, W.Y.; Cheng, H.X.; Zhai, G.Y.; Wang, G.L.; Ban, Y.Z.; Li, Y.; Lei, M.T.; et al. An overview of the resources and environment conditions and major geological problems in the Yangtze River economic zone, China. China Geol. 2018, 1, 435–449. [Google Scholar] [CrossRef]
  46. Xiang, F.; Zhu, L.; Wang, C.; Zhao, X.; Chen, H.; Yang, W. Quaternary sediment in the Yichang area: Implications for the formation of the Three Gorges of the Yangtze River. Geomorphology 2007, 85, 249–258. [Google Scholar] [CrossRef]
  47. Cao, Z.; Tang, J.; Zhao, X.; Zhang, Y.; Wang, B.; Li, L.; Guo, F. Failure Mechanism of Colluvial Landslide Influenced by the Water Level Change in the Three Gorges Reservoir Area. Geofluids 2021, 2021, 6865129. [Google Scholar] [CrossRef]
  48. Runqing, Y.E.; Xiaolin, F.U.; Fei, G.U.O.; Qinglin, Y.I.; Junyi, Z.H.A.N.G.; Changming, L.I.; Shiping, H.O.U.; Na, L.I.U. Deformation characteristics and mechanism analysis of geological hazards during operation period of three gorges reservoir. J. Eng. Geol. 2021, 29, 680–692. [Google Scholar]
  49. Li, T.; Chen, H.; Wang, R. Formation mechanism of Yanchihe landslide in Yichang city, Hubei province. J. Eng. Geol. 2016, 24, 578–583. [Google Scholar]
  50. Jun, Y.; Huali, X. Research on regional vulnerability of geological disasters based on HOP model-Taking Yichang area of Hubei Province as an example. Disastery 2014, 29, 131–138. [Google Scholar]
  51. Jinlin, Z.; Wei, W.; Mingzheng, L. Main mine geological environment problems and control measures and achievements in Yichang. Sci. Technol. Inf. 2018, 16, 4. [Google Scholar]
  52. Wang, J.P.; Ding, H.R. Practice and thinking of emergency prevention and control of geological disasters in Yichang Three Gorges Reservoir Area. Emerg. Manag. China 2014, 10, 52–55. [Google Scholar]
  53. Du, J.; Glade, T.; Woldai, T.; Chai, B.; Zeng, B. Landslide susceptibility assessment based on an incomplete landslide inventory in the Jilong Valley, Tibet, Chinese Himalayas. Eng. Geol. 2020, 270, 105572. [Google Scholar] [CrossRef]
  54. Frattini, P.; Crosta, G.; Carrara, A. Techniques for evaluating the performance of landslide susceptibility models. Eng. Geol. 2010, 111, 62–72. [Google Scholar] [CrossRef]
  55. Peng, L.; Niu, R.; Huang, B.; Wu, X.; Zhao, Y.; Ye, R. Landslide susceptibility mapping based on rough set theory and support vector machines: A case of the Three Gorges area, China. Geomorphology 2014, 204, 287–301. [Google Scholar] [CrossRef]
  56. Zhu, A.X.; Wang, R.; Qiao, J.; Qin, C.Z.; Chen, Y.; Liu, J.; Du, F.; Lin, Y.; Zhu, T. An expert knowledge-based approach to landslide susceptibility mapping using GIS and fuzzy logic. Geomorphology 2014, 214, 128–138. [Google Scholar] [CrossRef]
  57. Zhang, H.; Song, Y.; Xu, S.; He, Y.; Li, Z.; Yu, X.; Liang, Y.; Wu, W.; Wang, Y. Combining a class-weighted algorithm and machine learning models in landslide susceptibility mapping: A case study of Wanzhou section of the Three Gorges Reservoir, China. Comput. Geosci. 2022, 158, 104966. [Google Scholar] [CrossRef]
  58. Roy, D.P.; Wulder, M.A.; Loveland, T.R.; Woodcock, C.E.; Allen, R.G.; Anderson, M.C.; Helder, D.; Irons, J.R.; Johnson, D.M.; Kennedy, R.; et al. Landsat-8: Science and product vision for terrestrial global change research. Remote Sens. Environ. 2014, 145, 154–172. [Google Scholar] [CrossRef] [Green Version]
  59. Keles, F.; Nefeslioglu, H.A. Infinite slope stability model and steady-state hydrology-based shallow LSEs: The Guneysu catchment area (Rize, Turkey). Catena 2021, 200, 105161. [Google Scholar] [CrossRef]
  60. Zêzere, J.L. Landslide susceptibility assessment considering landslide typology. A case study in the area north of Lisbon (Portugal). Nat. Hazards Earth Syst. Sci. 2002, 2, 73–82. [Google Scholar] [CrossRef]
  61. Bednarik, M.; Magulová, B.; Matys, M.; Marschalko, M. Landslide susceptibility assessment of the Kraľovany–Liptovský Mikuláš railway case study. Phys. Chem. Earth Parts A/B/C 2010, 35, 162–171. [Google Scholar] [CrossRef]
  62. Ren, Y.; Hu, K.; Dai, X.; Pan, L.; Hoi, S.C.; Xu, Z. Semi-supervised deep embedded clustering. Neurocomputing 2019, 325, 121–130. [Google Scholar] [CrossRef]
  63. Guo, X.; Liu, X.; Zhu, E.; Yin, J. Deep Clustering with Convolutional Autoencoders. In Neural Information Processing. ICONIP 2017. Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2017; pp. 373–382. [Google Scholar]
  64. Obeid, A.; Elfadel, I.M.; Werghi, N. Unsupervised Land-Cover Segmentation Using Accelerated Balanced Deep Embedded Clustering. IEEE Geosci. Remote Sens. Lett. 2021, 19, 1–5. [Google Scholar] [CrossRef]
  65. Shin, H.C.; Roth, H.R.; Gao, M.; Lu, L.; Xu, Z.; Nogues, I.; Yao, J.; Mollura, D.; Summers, R.M. Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning. IEEE Trans. Med. Imaging 2016, 35, 1285–1298. [Google Scholar] [CrossRef] [Green Version]
  66. Mukhometzianov, R.; Carrillo, J. CapNet comparative performance evaluation for image classification. arXiv 2018, arXiv:1805.11195, 2018. [Google Scholar]
  67. Hu, J.; Shen, L.; Sun, G. Squeeze-and-excitation networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–22 June 2018; pp. 7132–7141. [Google Scholar]
  68. Ma, H.; Han, G.; Peng, L.; Zhu, L.; Shu, J. Rock thin sections identification based on improved squeeze-and-Excitation Networks model. Comput. Geosci. 2021, 152, 104780. [Google Scholar] [CrossRef]
  69. Feng, W.Y.; Liao, K.F.; Ou, Y.S.; Niu, Y. The synthetic aperture radar image classification method based on capsule neural network. Sci. Technol. Eng. 2019, 19, 203–207. [Google Scholar]
  70. Zhao, N.; Zhang, X.; Zhang, L. Overview of imbalanced data classification. Comput. Sci. 2018, 45, 22–27. [Google Scholar]
  71. Chawla, N.V.; Japkowicz, N.; Kotcz, A. Special issue on learning from imbalanced data sets. ACM SIGKDD Explor. Newsl. 2004, 6, 1–6. [Google Scholar] [CrossRef]
  72. Metz, C.E. Basic principles of ROC analysis[C]//Seminars in nuclear medicine. WB Saunders 1978, 8, 283–298. [Google Scholar]
  73. Fawcett, T. An introduction to ROC analysis. Pattern Recognit. Lett. 2006, 27, 861–874. [Google Scholar] [CrossRef]
  74. Huang, J.; Ling, C.X. Using AUC and Accuracy in evaluating learning algorithms. IEEE Trans. Knowl. Data Eng. 2005, 17, 299–310. [Google Scholar] [CrossRef] [Green Version]
  75. García, V.; Mollineda, R.A.; Sánchez, J.S. Index of balanced Accuracy: A performance measure for skewed class distributions. In Proceedings of the Iberian Conference on Pattern Recognition and Image Analysis, Madrid, Spain, 1–4 July 2019; Springer: Berlin/Heidelberg, Germany, 2009; pp. 441–448. [Google Scholar]
  76. Lepore, C.; Kamal, S.A.; Shanahan, P.; Bras, R.L. Rainfall-induced landslide susceptibility zonation of Puerto Rico. Environ. Earth Sci. 2012, 66, 1667–1681. [Google Scholar] [CrossRef]
  77. Hong, H.; Chen, W.; Xu, C.; Youssef, A.M.; Pradhan, B.; Tien Bui, D. Rainfall-induced landslide susceptibility assessment at the Chongren area (China) using frequency ratio, certainty factor, and index of entropy. Geocarto Int. 2017, 32, 139–154. [Google Scholar] [CrossRef]
  78. Wang, X.; Wang, C.; Zhang, C. Early warning of debris flow using optimized self-organizing feature mapping network. Water Supply 2020, 20, 2455–2470. [Google Scholar] [CrossRef]
  79. Montrasio, L.; Valentino, R.; Losi, G.L. Towards a real-time susceptibility assessment of rainfall-induced shallow landslides on a regional scale. Nat. Hazards Earth Syst. Sci. 2011, 11, 1927–1947. [Google Scholar] [CrossRef] [Green Version]
  80. Chen, L.; Guo, Z.; Yin, K.; Shrestha, D.P.; Jin, S. The influence of land use and land cover change on landslide susceptibility: A case study in Zhushan Town, Xuan’en County (Hubei, China). Nat. Hazards Earth Syst. Sci. 2019, 19, 2207–2228. [Google Scholar] [CrossRef] [Green Version]
  81. Hamilton, K. Numerical resolution and modeling of the global atmospheric circulation: A review of our current understanding and outstanding issues. High Resolut. Numer. Model. Atmos. Ocean. 2008, 1, 7–27. [Google Scholar]
  82. He, J.; Soden, B.J.; Kirtman, B. The robustness of the atmospheric circulation and precipitation response to future anthropogenic surface warming. Geophys. Res. Lett. 2014, 41, 2614–2622. [Google Scholar] [CrossRef]
Figure 1. Technical Route.
Figure 1. Technical Route.
Remotesensing 14 02717 g001
Figure 2. Geographical location of the study area. (a) Hubei Province in China; (b) Study area in the Hubei Province; (c) Yangtze River in the Yichang City.
Figure 2. Geographical location of the study area. (a) Hubei Province in China; (b) Study area in the Hubei Province; (c) Yangtze River in the Yichang City.
Remotesensing 14 02717 g002
Figure 3. Analysis of influence factor and landslide frequency ratio. (a) Slope. (b) Aspect. (c) Landform. (d) Lithology. (e) Distance from fault. (f) Distance from water. (g) Domsoil. (h) NDVI. (i) Rainfall. (j) Landuse. (k) Distance from road. (l) Distance from mine. Where, the Landslides ratio is the gray histogram, indicating the proportion of landslides in each factor interval to all landslides; the Area ratio is the blue histogram, indicating the proportion of the area of each factor interval to the total area; the Frequency ratio is the yellow line graph, indicating the ratio of landslide proportion to area proportion in each factor interval.
Figure 3. Analysis of influence factor and landslide frequency ratio. (a) Slope. (b) Aspect. (c) Landform. (d) Lithology. (e) Distance from fault. (f) Distance from water. (g) Domsoil. (h) NDVI. (i) Rainfall. (j) Landuse. (k) Distance from road. (l) Distance from mine. Where, the Landslides ratio is the gray histogram, indicating the proportion of landslides in each factor interval to all landslides; the Area ratio is the blue histogram, indicating the proportion of the area of each factor interval to the total area; the Frequency ratio is the yellow line graph, indicating the ratio of landslide proportion to area proportion in each factor interval.
Remotesensing 14 02717 g003aRemotesensing 14 02717 g003b
Figure 4. Thematic map of factors. (a) Slope; (b) Aspect; (c) Landform; (d) Lithology; (e) Distance from fault; (f) Distance from water; (g) Domsoil; (h) NDVI; (i) Rainfall; (j) Landuse; (k) Distance from road; (l) Distance from mine.
Figure 4. Thematic map of factors. (a) Slope; (b) Aspect; (c) Landform; (d) Lithology; (e) Distance from fault; (f) Distance from water; (g) Domsoil; (h) NDVI; (i) Rainfall; (j) Landuse; (k) Distance from road; (l) Distance from mine.
Remotesensing 14 02717 g004
Figure 5. The framework of DEC.
Figure 5. The framework of DEC.
Remotesensing 14 02717 g005
Figure 6. The framework of SE-CapNet.
Figure 6. The framework of SE-CapNet.
Remotesensing 14 02717 g006
Figure 7. The framework of SENet.
Figure 7. The framework of SENet.
Remotesensing 14 02717 g007
Figure 8. LSE result based on DEC.
Figure 8. LSE result based on DEC.
Remotesensing 14 02717 g008
Figure 9. Examples of non-landslide sample. (a) Examples of landslide sample; (b) Examples of Non-landslide sample; (a1a3) Landslide samples from different regions; (b1,b2) Landslide samples from different towns; (b3,b4) Landslide samples from different mountains; (b5,b6) Landslide samples from different mountains.
Figure 9. Examples of non-landslide sample. (a) Examples of landslide sample; (b) Examples of Non-landslide sample; (a1a3) Landslide samples from different regions; (b1,b2) Landslide samples from different towns; (b3,b4) Landslide samples from different mountains; (b5,b6) Landslide samples from different mountains.
Remotesensing 14 02717 g009
Figure 10. ROC Curve.
Figure 10. ROC Curve.
Remotesensing 14 02717 g010
Figure 11. LSE Based on various methods. (a) RF based LSE result; (b) CNN based LSE result; (c) CapNet based LSE result; (d) SE-CapNet based LSE result.
Figure 11. LSE Based on various methods. (a) RF based LSE result; (b) CNN based LSE result; (c) CapNet based LSE result; (d) SE-CapNet based LSE result.
Remotesensing 14 02717 g011
Figure 12. Frequency Based on Various Methods.
Figure 12. Frequency Based on Various Methods.
Remotesensing 14 02717 g012
Figure 13. Annual rainfall change.
Figure 13. Annual rainfall change.
Remotesensing 14 02717 g013
Figure 14. Rainfall Forecast Results. (a) Annual rainfall predication for 2035; (b) Annual rainfall predication for 2055.
Figure 14. Rainfall Forecast Results. (a) Annual rainfall predication for 2035; (b) Annual rainfall predication for 2055.
Remotesensing 14 02717 g014
Figure 15. LSE Forecast Results. (a) LSP result for 2035; (b) LSP result for 2055.
Figure 15. LSE Forecast Results. (a) LSP result for 2035; (b) LSP result for 2055.
Remotesensing 14 02717 g015
Table 1. Details of the dataset used for LSE.
Table 1. Details of the dataset used for LSE.
Data NameData SourceResolutionPurpose
GF-1
GF-2
Natural Resources Satellite Remote Sensing Cloud Service Platform2 m
1 m
Landslide Inventory
Google Earth Local Space Viewer2 m
Landsat-8 USGS30 mExtraction of Vegetation Index, Road, Water System, Land Use.
Fundamental terrain dataNASA30 mExtraction of topography, slope, and aspect.
Fundamental geological dataNational Geological Archives of ChinaDraw stratigraphic lithology, geological disasters, and geological structure.
Fundamental geographic dataChina Meteorological Data Network and Hubei Provincial Geological SurveyPrecipitation and mine data sources.
Administrative division dataGlobal Administrative Division DatabaseExtraction of administrative boundaries.
Table 2. DEC-Result.
Table 2. DEC-Result.
LSEArea Ratio/%Landslide Ratio/%Frequency Ratio/%
Extremely high29.2432.35110.64
High15.3929.76193.37
Moderate25.1119.4177.30
Low12.4010.2582.66
Extremely low18.078.4646.82
Table 3. Experimental environment configuration.
Table 3. Experimental environment configuration.
Hardware DeviceCPU: Intel CORE i5 9th Gen
GPU: NVIDIA GeForce GTX 1650TI
System platformWindows10 64-bit
Development environmentPython 3.6.5, TensorFlow-GPU 1.9.0, Keras 2.1.6
Compile environmentAnaconda3, Jupyter
Table 4. Comparison of precision index results.
Table 4. Comparison of precision index results.
MethodsAccuracy (%)Precision (%)Sensitive (%)Specificity (%)
SE-CapNet96.0696.8295.1296.83
CapNet93.3094.2991.3794.05
CNN91.5792.3689.0291.42
RF87.2388.4182.6089.27
Table 5. LSE results and influencing factors.
Table 5. LSE results and influencing factors.
LSEExtremely HighHighModerateLowExtremely Low
Area ratio14.67%29.47%30.24%17.29%8.33%
Slope15–25°25–35°5–15°>35°<5°
AspectWest, Southwest, NorthNorthwest, NortheastSoutheast, EastPlaneSouth
LandformHillLow mountainsModerate mountainsPlaneHigh mountains
LithologyHard-soft-integratedWeak rockExtremely weak rockHarder rockHard rock
Fault line0.5–1.0 km<0.5 km1.0–1.5 km1.5–2.0 km >5.0 km
Distance from water<0.5 km0.5–1.5 km1.5–2.5 km2.5–10.0 km>10.0 km
DomsoilCMd, RGcAlh, LVhAtc, Alf, WRPleFLc, Fle, Acu
NDVI0.4–0.60.2–0.40.6–0.80.8–1.00–0.2
Rainfall1200–1400 mm1100–1200 mm1000–1100 mm<1000 mm1400–1500 mm
LanduseBuildingUnutilized landWoodlandWaterCultivated land
Distance from road 2.0–5.0 km5.0–10.0 km0.5–2.0 km<0.5 km>10.0 km
Distance from mine <7.0 km7–13, 26–32 km13–26 km32–40 km>40 km
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Chang, L.; Zhang, R.; Wang, C. Evaluation and Prediction of Landslide Susceptibility in Yichang Section of Yangtze River Basin Based on Integrated Deep Learning Algorithm. Remote Sens. 2022, 14, 2717. https://doi.org/10.3390/rs14112717

AMA Style

Chang L, Zhang R, Wang C. Evaluation and Prediction of Landslide Susceptibility in Yichang Section of Yangtze River Basin Based on Integrated Deep Learning Algorithm. Remote Sensing. 2022; 14(11):2717. https://doi.org/10.3390/rs14112717

Chicago/Turabian Style

Chang, Lili, Rui Zhang, and Chunsheng Wang. 2022. "Evaluation and Prediction of Landslide Susceptibility in Yichang Section of Yangtze River Basin Based on Integrated Deep Learning Algorithm" Remote Sensing 14, no. 11: 2717. https://doi.org/10.3390/rs14112717

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop