Next Article in Journal
Does Gamifying Homework Influence Performance and Perceived Gameful Experience?
Next Article in Special Issue
Performance Evaluation of Five GIS-Based Models for Landslide Susceptibility Prediction and Mapping: A Case Study of Kaiyang County, China
Previous Article in Journal
Extraction of Essential Oil from River Tea Tree (Melaleuca bracteata F. Muell.): Antioxidant and Antimicrobial Properties
Previous Article in Special Issue
Surface and Underground Geomechanical Characterization of an Area Affected by Instability Phenomena in Zaruma Mining Zone (Ecuador)
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Landslide Geo-Hazard Risk Mapping Using Logistic Regression Modeling in Guixi, Jiangxi, China

by
Wenchao Huangfu
1,
Weicheng Wu
1,*,
Xiaoting Zhou
1,
Ziyu Lin
1,
Guiliang Zhang
2,
Renxiang Chen
2,
Yong Song
2,
Tao Lang
2,
Yaozu Qin
1,
Penghui Ou
1,
Yang Zhang
1,
Lifeng Xie
1,
Xiaolan Huang
1,
Xiao Fu
1,
Jie Li
1,
Jingheng Jiang
1,
Ming Zhang
1,
Yixuan Liu
1,
Shanling Peng
1,
Chongjian Shao
1,
Yonghui Bai
1,
Xiaofeng Zhang
3,
Xiangtong Liu
4 and
Wenheng Liu
1
add Show full author list remove Hide full author list
1
Key Laboratory of Digital Lands and Resources and Faculty of Earth Sciences, East China University of Technology, Nanchang 330013, China
2
264 Geological Team of Jiangxi Nuclear Industry, Ganzhou 341000, China
3
School of Geophysics and Measurement-Control Technology, East China University of Technology, Nanchang 330013, China
4
Faculty of Geomatics, East China University of Technology, Nanchang 330013, China
*
Author to whom correspondence should be addressed.
Sustainability 2021, 13(9), 4830; https://doi.org/10.3390/su13094830
Submission received: 19 March 2021 / Revised: 13 April 2021 / Accepted: 15 April 2021 / Published: 25 April 2021
(This article belongs to the Special Issue Geo-Hazards and Risk Reduction Approaches)

Abstract

:
Reliable prediction of landslide occurrence is important for hazard risk reduction and prevention. Taking Guixi in northeast Jiangxi as an example, this research aimed to conduct such a landslide risk assessment using a multiple logistic regression (MLR) algorithm. Field-investigated landslides and non-landslide sites were converted into polygons. We randomly generated 50,000 sampling points to intersect these polygons and the intersected points were divided into two parts, a training set (TS) and a validation set (VT) in a ratio of 7 to 3. Thirteen geo-environmental factors, including elevation, slope, and distance from roads were employed as hazard-causative factors, which were intersected by the TS to create the random point (RP)-based dataset. The next step was to compute the certainty factor (CF) of each factor to constitute a CF-based dataset. MLR was applied to the two datasets for landslide risk modeling. The probability of landslides was then calculated in each pixel, and risk maps were produced. The overall accuracy of these two models versus VS was 91.5% and 90.4% with a Kappa coefficient of 0.814 and 0.782, respectively. The RP-based MLR modeling achieved more reliable predictions and its risk map seems more plausible for providing technical support for implementing disaster prevention measures in Guixi.

1. Introduction

Landslide hazard is one of the frequent geo-hazards, especially in south China, and it has brought severe damages and losses to human society, as well as to the environment [1,2,3]. According to incomplete statistics, from the beginning of the 21st century to present time, landslide hazards in China have caused as many as 1200 casualties and the estimated economic loss is as high as 5–10 billion yuan, which further restricts the socioeconomic development in the hazard-affected regions [2]. It is therefore essential to conduct an accurate and reliable risk prediction for landslide prevention and early warning.
For this purpose, it is necessary to understand the physical process, or rather, the mechanism of landslides. In general, studied landslides are presumably triggered by precipitation, which may cause landslides when the rainfall is intense, soil resistance is relatively small, and the slope inclination is relatively high. The landslides cover a larger area if the slip length is longer [3,4,5,6,7]. However, the works of Zhang et al. [8], Ou et al. [9], and Zhou et al. [10] also revealed the important role of road construction and housing development in landslide events. Hence, it is critical to consider geological conditions, environmental conditions, and human activities in landslide risk assessment.
With the advent of the big data era, traditional geological methods have gradually become unable to meet current needs to deal with the more and more abundant big geodata. Artificial intelligence (AI) has long been used in scientific research but it was further promoted by machine learning (ML) and especially by deep learning (DL) techniques because of the capacity for processing hyper-dimensional data and for accurate prediction [11]. AI has not yet been widely applied in geoscience; it may therefore be of critical importance to develop and apply relevant algorithms or adapt the available algorithms to tackle big geodata for ore mineralization prediction or geo-hazard risk assessment to serve our society [12,13]. This will clearly be a meaningful exploratory experiment [13]. Landslide risk prediction is one of the important tasks for ML application in environmental geoscience.
As early as the 1970s, developed countries started to study the risk of geo-hazards, mainly landslides, in the world. In 1999, the United Nations General Assembly proposed an initiative on “the international strategy for disaster reduction” to reduce social vulnerability and the risk of natural disasters. In response to this initiative and based on previous research, countries in North America and Europe carried out studies on geo-hazard risk and its relationship with land use, making great progress in the domain of hazard mitigation and risk reduction [14]. For example, Lumb [15] analyzed the relationship between rock-soil types of slope mass and the occurrence of landslide. Collison et al. [16] suggested that the incline stability is associated with the vegetation coverage in tropical regions. Evans [17] revealed the fact that rainfall distribution in mountainous areas was affected to a certain extent by the elevation of the incline. Mark et al. [18] used 1500 landslide data to study the relationship between the shallow landslide frequency and terrain, and concluded that the steeper the incline, the greater the probability of a landslide.
Studies have also been undertaken to determine the concrete landslide causative factors. For example, Jiang and Eastman [19] used genetic algorithms to optimize the hazard-causative factors in landslide susceptibility mapping; and in terms of information entropy, Wan et al. [20] harnessed a data mining technique to determine the key controlling factors and their thresholds to provoke landslide hazards. At present, there are in fact a number of methods for landslide risk zoning, e.g., analytic hierarchy process (AHP), Fuzzy comprehensive evaluation, information quantity, certainty factor (CF) approach, and multiple logistic regression (MLR) analysis [21,22,23,24]. However, there are problems with the single evaluation model, e.g., its limitation in objectively determining the weight of the causative factors, subjective disturbances to the processing of variables in the modeling process, and negligence of the auto-correlation of the causative factors. It is thus a challenge to objectively, quantitatively, and accurately conduct hazard risk assessment at a regional scale [25,26]. Machine learning algorithms have also been utilized for landslide research [8,10], and a series of models are available, such as support vector machines (SVMs) [27], artificial neural networks (ANNs) [28], random forests (RFs) [29], boosted regression trees (BRTs) [30,31], and limit learning machine (LLM) [32]. These algorithms have been reported to have high accuracy for landslide risk assessment, though they have clear disadvantages such as a complex modeling process, unstable model performance, and weak explanatory capability [33,34]. For example, it is difficult to select the penalty coefficient c and kernel function g for SVM models, and to determine the number of neuron nodes in the hidden layer of backpropagation (BP) neural networks, etc. [35]. At present, the selection of these parameters lacks a systematic method and largely relies on the experience of researchers, which brings great uncertainty and difficulty for machine learning modeling. The combination of the MLR method based on GIS and some quantitative evaluation methods may have great potential in geo-hazard risk zoning and assessment. For example, Xu et al. [36] evaluated the landslide susceptibility provoked by an earthquake based on GIS and certainty coefficient analysis, and Miao et al. [37] conducted the susceptibility analysis of Lanzhou landslide hazard based on GIS and a MLR model. Qian et al. [38] and Ou et al. [9] used a combination of binary MLR and an information value (IV) approach to study the geo-hazard risk of Renhe watershed and Jiangxi, respectively. It is seen that MLR has a high potential for this kind of risk assessment. For this reason, this paper intends to utilize MLR to establish the functional relation between the landslide hazard (probability) and its geo-environmental factors, and to produce a risk map to provide technical support for disaster reduction and prevention action taking Guixi as an example.

2. Methodology

2.1. Study Area

Situated in the northeast of Jiangxi, Guixi is a county covering a surface area of 2292 km2, extending geographically between 116°57′43″ E and 117°28′06″ E and between 27°50′53″ N and 28°37′3″ N in space (Figure 1). Dominated by the humid subtropical monsoon climate, Guixi is characterized by a hot summer and mild winter, plentiful precipitation, and four distinct seasons. The mean annual temperature is 18.2 °C and the average annual precipitation is 1790 mm with a maximum of 2761.2 mm and a minimum of 1056.8 mm. The rainfall period is mainly concentrated in March to July, with an accumulated average of 1229 mm, accounting for 68.7% of the annual precipitation. However, it is less humid in the period from September to January. The terrain is complex in the study area, with an elevation ranging from 20 to 1504 m and a slope from 0 to 73°. Guixi is generally high in altitude in the south as a part of the Wuyishan Mountains (Mts), which is composed of granitic massif, and low in the north as a plain, with Red Basin and hills in the center. Xinjiang is the main river in the central part of Guixi flowing from east to west, 60 km in length of which passes through the study area. The basin encompasses more than 50 km2 in surface and includes 11 tributaries, e.g., Baita and Luotang Rivers from the south, and Yingshi, Taqiao, and Sili Rivers from the north.
Landslides are the most widespread and harmful geo-hazard in Guixi. According to field investigation, this hazard occurs frequently in spring and summer, especially in June and July; they are mostly small in scale and distributed on both sides of the roads. Most of the landslides contain a clear sliding surface, i.e., soil-rock interface, and are shallow slides (<10 m, mostly <3 m in thickness).

2.2. Data

The selection of landslide causative factors based on field observation is the first key step in risk zoning and assessment. In combination with the results of other authors [8,9,10,39,40,41,42,43,44], the following geo-environmental factors were obtained and prepared in order to achieve our objectives: elevation, slope, aspect, proximity to roads, proximity to rivers, proximity to faults, lithology of strata, land cover, average annual precipitation, accumulated average precipitations of March–July, May–July, and March–June, and the normalized difference vegetation index (NDVI) standing for the condition of vegetation development,. The sources of these data are listed in Table 1 and details of their quantification are presented in the following sections.
In the study area, 273 landslides were recorded based on the field survey, and they were divided randomly into two parts in a ratio of 7:3: training set (TS, 191 landslides) and validation set (VS, 82 landslides). Additionally, 514 non-landslide points (i.e., no risk) were identified and selected in the areas where slope is less than 3°, e.g., urban areas or cropland where there is low probability of landslide occurrence. A total of 360 non-landslide points were added to the TS for the successive risk modeling and 154 points were incorporated into the VS for verification and validation purposes.

2.3. Geo-Environmental Factors

The geological and environmental factors involved in the study are presented in the following.

2.3.1. Elevation

Elevation is a fundamental characteristic of landform. A number of studies have revealed that there is an obvious relationship between geo-hazards and elevation [45,46,47,48,49,50], as the latter is an important indicator of the potential energy of a slope. The higher the incline in elevation, the higher the potential of energy.

2.3.2. Slope

Slope is a critical environmental factor that affects the occurrence of geo-hazards as it determines the possibility of slope mass displacement, or rather, soils in the weathered crust that have vertically different layers. The slope indirectly affects the relative stability between the layers and this determines, to a large extent, the mechanism of the incline deformation and formation of tension joints or fissures that make the slope massif unstable [51,52]. Slope may also influence rainwater permeation and water erosion. However, landslides may occur only when the slope reaches certain thresholds, e.g., 28–38° [53], in natural conditions. Generally, bank slopes of large rivers, lakes (reservoirs), seas, and ditches, open front hillsides, railways, highways, and slope-cutting for building, etc., are all prone to landslides (Figure 2b).

2.3.3. Aspect

Different aspects result in different exposure to sunlight, which brings about differences in vegetation coverage, evapotranspiration, soil erosion, and especially, a different weathering process. These will, in turn, influence the slope runoff, rainfall penetration, the physical and mechanical properties of slope mass, and finally, the stability of the incline [8,54]. Aspects are presented in Figure 2c.

2.3.4. Lithology

Stratigraphic lithology is an important factor for all geo-hazards. On the one hand, lithology determines the development of joints and fractures of rocks under stress or tension. On the other hand, if the lithological characteristics of a given slope are different from those of the underlying rocks, there will be a high potential to form an interbedding or unconformity surface favorable for landslide. Given the same other condition, different lithologies may have different resistance to landslide events, e.g., rigid granitic massif would be more resistant than shale and mudstone. Instead of using AHP as in Roccati et al. [24], we followed Zhang et al. [8] in assigning a proneness weight to each category of lithology shown in Table 2. The assignment rule is that the higher the value, the higher the tendency of the lithology to landslides.

2.3.5. Proximity to Faults

The influence of the fault structure on the landslide mainly lies in that whether it is a normal or a reverse fault, a certain width of the fracture zone is generated on both sides of it. In areas where fractures or joint fissures are well developed, dense, weak, and vulnerable zones may be formed [37]. Thus, the nearer to the faults, the higher susceptibility to landslide; and the bigger the scale of the faults, the greater the impacts they exert. In this study, the distance to faults was calculated by a buffer operation within GIS, and the fault buffers were defined as 0–30 m, 30–60 m, 60–90 m, 90–120 m, and >120 m for small scale faults (<10 km in length), which were assigned a weight value of 13, 10, 7, 4, and 1, respectively, and 0–60 m, 60–120 m, 120–180 m, 180–240 m, and >240 m for bigger faults (≥10 km in length), which were assigned a weight value of 20, 15, 10, 5, and 1, respectively [8,10] (Figure 3c).

2.3.6. Proximity to Roads

Slope cutting and excavation for road construction and housing development destroyed the stability of the incline, provoking landslides [8,10,24]. In terms of our understanding about the impacts of road construction on landslides in the field, it was noted that distance plays a key role in such hazard events, i.e., the closer to the road, the higher the propensity of landslide. Thus, the distance to roads was regarded as a critical factor which was calculated by a buffering operation. The road buffers were defined with intervals of 0–30 m, 30–60 m, 60–90 m, and 90–120 m, and assigned, respectively, a weight value of 10, 8, 4, and 1 [8,10] (Figure 3a).

2.3.7. Proximity to Rivers

River development is also an unignorable factor for geo-hazards because it erodes and liquidizes its banks or bottom slopes on the two sides, causing collapses and landslides [8]. The development of a river system favors the occurrence of the geo-hazard. Similar to other linear factors, the closer to the rivers, the higher the propensity of landslides. Hence, the distance to rivers was also used as an environmental factor. It was buffered into different zones, e.g., 0–30 m, 30–60 m, 60–90 m, 90–120 m, and 120–150 m for small rivers and streams, and 0–60 m, 60–120 m, 120–180 m, 180–240 m, and 240–300 m for the main river, i.e., Xinjiang River. A weight was allocated to each buffer zone based on its proclivity to landslide, e.g., zones 0–30 m, 30–60 m, 60–90 m, 90–120 m, and 120–150 m were assigned, respectively, 12, 10, 6, 3, and 1, and zones 0–60 m, 60–120 m, 120–180 m, 180–240 m, and 240–300 m, respectively, 20, 16, 10, 5, and 1 [8,10] (Figure 3b).

2.3.8. Precipitation

A large number of landslides show that precipitation is the first important cause of their occurrence. There are generally two cases: either sufficient precipitation duration or intensity [37]. The rainy season in southern China goes from March to July. During this period, there is usually a lot of rainfall and heavy periods of rainfall occur in June–July of each year, and landslide hazards frequently take place in these two months. Therefore, the annual mean precipitation, the accumulated average precipitation of March–July, March–June, and May–July of the past 20 years were obtained from 40 ground weather stations, and then calculated and interpolated with the inverse distance weighting (IDW) method into a grid. These precipitation layers were selected for our risk modeling to investigate which months’ combination of rainfall would best reveal its role in the landslide events.

2.3.9. Land Cover Types

Land use refers to all activities that humans impose on land for survival and development. This can be not only production activities such as planting crops, trees, pasture, construction of factories, roads, etc., but also non-production activities such as construction of scenic tourist areas and nature reserves. With increasing demand from human beings, land use has become more and more intensive, resulting in a decline in woodlands/forests and grasslands in some regions or countries [55]. Such land use and change may exert a significant impact on soil erosion, especially, when there is concentrated rainfall [55,56]. The development of burn-and-slash agriculture makes land prone to landslide hazards. In fact, there are at present six land cover types in the study area, namely farmlands, water bodies (rivers and lakes), built-up areas, forests, shrubs, and finally bare land, and these were allocated a proclivity weight of 0, 0, 0, 1, 4, and 10, respectively [8,10] (Figure 2a). Here, forest cover has the least proclivity while bare land without vegetation protection is the most vulnerable category when the other conditions are the same.
The land cover map was produced from Landsat TM images dated 31 May (Spring) and 7 November 2010 (Autumn) based on the approach proposed by Wu et al. [57].

2.3.10. NDVI

The NDVI (normalized difference vegetation index) reveals information about vegetation cover, e.g., vegetation vigor, density, biomass, and its root system. Roots are able to hold soil and stabilize the hillside mass, and reduce the impact of precipitation on the soil. The level of vegetation coverage, in particular forest cover, reflects the degree of vegetation protection of the land surface. Under the same conditions, areas with greater vegetation coverage are less prone to landslide hazards.
The NDVI of the study area was generated from Landsat 5 TM images acquired in late autumn (late October and early November) in the five-year period 2005–2010. After atmospheric correction using the COST model [55,58], an averaged five-year NDVI was produced to represent the natural vegetation (e.g., forests, woodlands, and shrublands) as croplands were harvested and herbaceous vegetation became withered at that time. This mean NDVI image ranges from −0.297 to 0.823 (Figure 2d).

2.4. Methods

2.4.1. MLR Model

Based on the linear regression in which the dependent variables are binomial, the MLR model applies a logical function to solve the nonlinear problems and this conversion has made MLR highly successful in the field of machine learning. For this reason, it has been increasingly drawing the attention of geographers, biologists, and environmentalists in recent decades thanks to its ability to analyze spatially explicit causes and predict the potential changes and events in the ecosystem [55]. It has hence been widely applied in deforestation [59,60], land use change [61,62,63], and landslide prediction [64]. Within MLR, independent variables can be continuous or discrete and do not necessarily meet the requirement of normal distribution. Supposing that the probability of occurrence of geo-hazards is P, and the probability of non-occurrence is 1 − P, then ln[P/(1 − P)] is taken as the dependent variable, the hazard-causative factors xi (i = 1, 2, 3, .... n) as the independent variables, and the MLR model can be expressed as a linear regression in Equation (1).
ln [ P / ( 1 P ) ] = α 0 + α 1 x 1 + α 2 x 2 + + α n x n
The probability of landslide occurrence can be computed with Equation (2).
Ζ = α 0 + α 1 x 1 + α 2 x 2 + + α n x n Ρ = exp Ζ 1 + exp Ζ
where P is the calculated probability of landslide occurrence, xi (i = 1, 2, 3, ... n) are the geo-environmental factors as independent predictive variables, αi (i = 1, 2, 3, ... n) are the coefficients of these predictive variables, and α0 is a constant.
After calculation of the probability of landslide occurrence, the landslide risk zoning map can be created.

2.4.2. Modeling Procedures

For landslide risk zoning, it is requisite to determine the potential role of the geo-environmental factors in the landslide event, in other words, to understand which factors have a greater effect on this hazard occurrence than others. In this research, the prediction and assessment of landslide hazards in Guixi was conducted using two MLR models, i.e., CF-based and RP-based MLR models.

CF Calculation

Proposed by Shortliffe and Buchanan for certainty reasoning in medical diagnosis in 1975 [65], the CF was redefined and improved by Heckerman in 1986 [66]. It is a probability function [67], which is used to examine the tendency of factors governing the occurrence of an event. As the evaluation process is relatively simple and accurate, it is widely used in geo-hazard risk assessment [68,69,70,71,72]. The premise is to assume that the geo-hazards that have occurred and those that will probably take place in the future are under the same geological and environmental conditions. The calculation formula is shown in Equation (3):
C F = p p a p p s p p a 1 p p s , p p a p p s p p a p p s p p s 1 p p a , p p a < p p s
where PPa is the conditional probability, or rather, the ratio of the area or number of landslide hazards appearing in subset a to the area of subset a. PPs is the probability of occurrence of the whole event, and in our case, it can be represented as the ratio of the landslide area or number occurring in the whole study area to the surface area of the study area.
According to Equation (3), the CF varies from −1 to 1. The closer the CF to 1, the greater the possibility of a landslide hazard or vice versa. When CF is 0, it indicates the same probability of landslide occurrence and non-occurrence.
Superimposing the training samples (landslides and non-landslides) on each factor layer that has been divided into subsets, the CF approach was applied to calculate the CF value of each subset of the 13 geo-environmental factors. The CF-based dataset was hence produced and the areas and numbers of the subsets of a part of the factors are presented in Table 3.

RP Resampling

Within GIS, 50,000 RPs were generated and used to intersect samples (i.e., landslide event polygons for training), and 1858 points were retained in the polygons (0 or 1) in the dependent variable layer, in which 589 points fell in the landslide areas and 1269 points in the no-risk areas. These points were superimposed and intersected with each geo-environmental factor layer to obtain the values corresponding to the landslides and no-risk areas. Thus, an integrated dataset including the training set, i.e., RP-based dataset, was produced.

MLR Modeling

The RP-based and CF-based datasets were input into SPSS20.0 for binary MLR analysis, where landslide events were treated as a dependent variable and other geo-environmental factors as independent (predictive) variables. The first and second landslide risk models, i.e., RP-based MLR-1 and CF-based MLR-2, were thus obtained. The regression coefficient of each factor was considered as its contribution weight to the landslide event.

Risk Mapping

After modeling, the two models were applied back to the predictive variables to calculate the landslide hazard risk of the whole study area. Landslide risk zoning was conducted based on the calculated probability, which was further graded into different risk levels, e.g., when the probability is 0–0.2, the zone is considered stable (no risk), 0.2–0.4 low, 0.4–0.6 moderate, 0.6–0.8 high, and 0.8–1.0 extremely high risk.

2.4.3. Verification and Reliability Analysis

Overall accuracy (OA) and Kappa coefficient [8,11,57,73,74] were the metrics to evaluate the accuracy and reliability of the landslide risk models produced by MLR modeling using RP- and CF-based datasets against the VS. As Landis and Koch [74] interpreted, if KC reaches 0.81–1.00, the prediction reliability is “almost perfect” [8].

2.4.4. Multicollinearity Diagnosis

Collinearity means that two predictive variables are linearly correlated to each other in a regression model, and if there are more than two variables involved, we call it multicollinearity. In such a case, it is not possible to conduct regression modeling as independent variables cannot independently predict the dependent variable. MLR modeling is susceptible to such collinearities among the predictive variables. Tolerance (TOL) and variance inflation factor (VIF) are two criteria for diagnosing multicollinearity [75,76]. If TOL is <0.1 or VIF > 10, it indicates a strong multicollinearity among the independent variables [77].

3. Results

The intermediate processing and final results are presented in Table 4, Table 5, Table 6 and Table 7, and Figure 4 in this section for further discussion.

3.1. MLR Models

RP-based and CF-based MLR models are shown in Table 4 and Table 5, respectively, where the estimated coefficients of the predictive factors (independent variables) are demonstrated. These models were utilized to compute the probability of landslide risk for the whole study area. The observation accuracy of the two modeling results are illustrated in Table 6. The two obtained models MLR-1 and MLR-2 are expressed in Equations (4) and (5):
P = exp ( 6.675 0.007 x 1 + 0.135 x 2 + + 0.310 x 10 ) 1 + exp ( 6.675 0.007 x 1 + 0.135 x 2 + + 0.310 x 10 )
P = exp ( 2.999 + 1.720 x 1 + 2.645 x 2 + + 0.908 x 13 ) 1 + exp ( 2.999 + 1.720 x 1 + 2.645 x 2 + + 0.908 x 13 )
In Equations (4) and (5), x1—elevation, x2—slope, x3—aspect, x4—proximity to roads, x5—proximity to rivers, x6—proximity to faults, x7—NDVI, x8—lithology, x9—land cover, x10—accumulated average precipitation of March–July, x11—accumulated average precipitation of May–July, x12—accumulated average precipitation of March–June, and x13—average annual rainfall.
It is worth mentioning that in Table 4 and Table 5, we can calculate the odds ratio (OR) [55,62] using exp(β) between the geo-environmental factors to understand the likelihood contribution to the hazard event of one factor over the other.
Evaluated versus the VS, the OA and KC of the two MLR models are 91.5% and 0.814 and 90.4% and 0.782, respectively (Table 6). Based on the interpretation of Landis and Koch (1976), the modeling result of Model MLR-1 reaches “almost perfect” while that of MLR-2 is “substantially” good, though the observation accuracy of the latter appears to be slightly higher as more field-observed landslides are located in the predicted high and extremely high zones (Table 7 and Figure 4).

3.2. Collinearity

The results of the collinearity diagnosis are demonstrated in Table 8. For model MLR-1, the TOL of all rainfall factors was <0.1 or VIF > 10, which means that they are correlated and it is not necessary to use all of them for MLR modeling. However, for model MLR-2, the TOL was >0.1 with VIF < 10, indicating that there was not a serious multicollinearity problem among the independent factors for CF-based MLR modeling.

3.3. Landslide Risk Maps

The calculated risk probability from the two models, MLR-1 (Equation (4)) and MLR-2 (Equation (5)), ranging from 0 to 1.0, was classified into five grades: stable (no risk, 0–0.2), low (0.2–0.4), moderate (0.4–0.6), high (0.6–0.8), and extremely high risk (0.8–1.0). The zonation map of landslide hazard was thus created and presented in Figure 4.
The two maps show that the areas of Guixi susceptible to landslide hazard are largely spread alongside the roads. The high risk and extremely high risk areas are mainly distributed in the towns of Wenfang, Jintun, and Zhangping in the south and Baitian in the north of the study area. The central part of Guixi is a relatively flat plain with a low risk of landslides.
As seen in Table 4 and Table 5, the regression coefficients of the geo-environmental factors obtained from MLR analysis denote the order of importance. For example, the importance of roads, slope, and rainfall has been well illustrated in MLR-1 and MLR-2. Road construction and housing development destroy the original stability of the slope massif, triggering the downward slide of the overlying geological strata and the weathered crust along the bedding surface or the downhill slide of the slope massif on the unconformed surface with the underlying rocks. Hence, these human activities form a key part of the landslide disaster. Precipitation is considered the most important triggering factor of this kind of disaster, especially in south China.

4. Discussion

4.1. Digitization and Weight Assignment

In order to model and predict the landslide risk in Guixi by the MLR algorithm, we employed 13 geological and environmental factors either as continuous or discrete variables, together with field observation data. The non-numeric factors (e.g., geological strata, faults, road, and rivers) were first digitized and then allocated a propensity weight either to different lithology or to buffers of the linear features in line with their proximities to the latter to make these factors digitized. Thence, such processing made data-driven approaches possible and successful in the landslide risk modeling of this study.

4.2. Importance of the Variables in Landslide Events

As revealed in Table 4 and Table 5, the first three important geo-environmental factors of Model 1 (MLR-1) are the proximity to roads, the accumulated average precipitation of March–July, and the slope, and of Model 2 (MLR-2) are the slope, the proximity to roads, and the accumulated average precipitation of March–June. Both models have illustrated more or less the similar correlation of the hazard-causative factors such as roads, slope, and accumulated average rainfall of March–June or March–July with landslide events. The accuracy of the RP-based Model 1 seems to be higher than the CF-based Model 2 (Table 6). As already mentioned earlier, most of the observed landslides were distributed along the roads or behind the houses, and resulted directly from human activities, i.e., road construction and housing development. This is the same as what we discovered in Jiangxi [8,9,10].
Collinearity analysis reveals that, for RP-based MLR modeling, one of the rainfall-related factors is enough, e.g., annual average precipitation or accumulated average precipitation of March–July or May–July. As demonstrated in Table 4, the accumulated average precipitation of March–July is more important than other rainfall factors, and hence can be selected for final risk mapping.

4.3. Lowering the Threshold of Slope

We noted that 234 landslides, taking up 85.9% of the total, took place in zones with a slope ranging from 3° to 21°. Yet, Fan et al. suggested that the sliding threshold be 28°–38° for natural landslides [53]. This means that the threshold has been lowered by the intervention of human activities. This finding is also supported by two other parallel studies [8,9,10] in the same province.

4.4. Comparison with the Results of Others

Another study [8] employed an RF algorithm to predict the landslide risk in Guixi and achieved the mapping work with an OA of 91.23% and KC of 0.82. This is close to those of Model 1 of this study, indicating that RP-based MLR modeling and prediction can achieve the same satisfactory results as RF algorithm. Ou et al. [9] conducted a comprehensive regional-scale study on landslide hazard mapping in Jiangxi using MLR based on an information value (IV) approach and achieved an accuracy of 86%. This is also comparable to what we have effectuated. In future work, a comparative analysis is to be conducted using SVM and BRT algorithms, as [31] reported that these algorithms may lead to relevant risk prediction and evaluation.

5. Conclusions

The prediction of landslide geo-hazards is a complex nonlinear process. This study has not only indicated ways of processing geo-environmental data, especially non-digital data for quantitative analysis, but also exhibits the fact that even the same modeling algorithm may reach different results if the sampling scheme is different. It is therefore necessary to select a reasonable one for achieving reliable risk prediction. The RP-based MLR approach seems more reasonable than the CF-based one and will be recommended for extension to similar landslide hazard prediction elsewhere. The prediction and its results may provide technical support and relevant advice to decision-makers for disaster reduction and prevention management in the study area.
This study also illustrates the unfavorable role of anthropogenic activities in causing such geological disasters, confirming the results of Zhang et al., Ou et al. and Zhou et al. [8,9,10]. Man exploits natural resources, modifies the land surface, and, at the same time, directly breaks the original balance of the landscape. This is the origin of the artificial landslides that are related to the road system development and urbanization. Apart from a better and more comprehensive geological survey before road construction and housing development, man has to think of optimal land use planning to minimize the man-made disasters.

Author Contributions

Conceptualization, W.H. and W.W.; methodology, W.W. and W.H.; software, W.W.; validation, W.H., X.Z. (Xiaoting Zhou), T.L., J.J., M.Z. and Y.L.; formal analysis, W.H., X.Z. (Xiaoting Zhou), P.O., and Y.Z.; investigation, W.H., X.Z. (Xiaofeng Zhang), W.W., Y.S., Z.L., P.O., Y.Z., L.X., Y.L., Y.Q., S.P., C.S., Y.B., X.Z. (Xiaofeng Zhang), X.L., and W.L. and X.H.; resources, W.W., G.Z., R.C., Y.S., Z.L., X.Z. (Xiaofeng Zhang), Y.L. and T.L.; data curation, W.H., X.Z. (Xiaoting Zhou), P.O., and Y.Z.; writing—original draft preparation, W.H. and W.W; writing—review and editing, W.W.; visualization, W.H, X.F. and J.L.; supervision, W.W.; project administration, W.W. and G.Z.; funding acquisition, W.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Jiangxi Talent Program (Grant No. 900/2120800004) and by the Start-up Fund for Scientific Research from the East China University of Technology (Grant No. DHTP2018001) granted to Weicheng Wu.

Data Availability Statement

The data that support the findings of this study are available the corresponding author, Weicheng Wu, upon reasonable request.

Acknowledgments

A part of the field survey data and the Geological Map of Guixi on scale of 1/50,000 were supplied by the 264 Geological Team of Jiangxi Nuclear Industry. DEM data (ASTGTMV003 30m) were obtained from NASA (www.earthdata.nasa.gov, accessed on 11 August 2020) and Landsat data from USGS data server (https://glovis.usgs.gov, accessed on 8 May 2020).

Conflicts of Interest

No potential conflict of interest was reported by the authors.

References

  1. Wu, W.; Ai, G. Risk assessment of natural disasters in the course of selection of nuclear waste disposal. J. East China Geol. Inst. 1995, 18, 260–265, (In Chinese with English abstract). [Google Scholar]
  2. Li, X. Study on Logistic Regression Model Applied to Regional Slope Stability Evaluation Based on GIS. Ph.D. Thesis, China University of Geosciences, Wuhan, China, 2005. (In Chinese with English abstract). [Google Scholar]
  3. Bordoni, M.; Valentino, R.; Meisina, C.; Bittelli, M.; Chersich, S. A Simplified Approach to Assess the Soil Saturation Degree and Stability of a Representative Slope Affected by Shallow Landslides in Oltrepò Pavese (Italy). Geosciences 2018, 8, 472. [Google Scholar] [CrossRef] [Green Version]
  4. Crosta, G.B.; Frattini, P. Rainfall thresholds for the triggering of soil slips and debris flows. In Proceedings of the 2nd Plinius Conference on Mediterranean Storms, Siena, Italy, 16–18 October 2000; GNDCI: Perugia, Italy, 2001; pp. 463–488. [Google Scholar]
  5. Muntohar, A.S.; Liao, H.-J. Rainfall infiltration: Infinite slope model for landslides triggering by rainstorm. Nat. Hazards 2010, 54, 967–984. [Google Scholar] [CrossRef]
  6. Stamatopoulos, C.A.; Di, B. Analytical and approximate expressions predicting post-failure landslide displacement using the multi-block model and energy methods. Landslides 2015, 12, 1207–1213. [Google Scholar] [CrossRef]
  7. Take, W.A.; Bolton, M.D.; Wong, P.C.P.; Yeung, F.J. Evaluation of landslide triggering mechanisms in model fill slopes. Landslides 2004, 1, 173–184. [Google Scholar] [CrossRef]
  8. Zhang, Y.; Wu, W.; Qin, Y.; Lin, Z.; Zhang, G.; Chen, R.; Song, Y.; Lang, T.; Zhou, X.; Huangfu, W.; et al. Mapping Landslide Hazard Risk Using Random Forest Algorithm in Guixi, Jiangxi, China. ISPRS Int. J. Geo-Inf. 2020, 9, 695. [Google Scholar] [CrossRef]
  9. Ou, P.; Wu, W.; Qin, Y.; Zhou, X.; Huangfu, W.; Zhang, Y.; Xie, L.; Huang, X.; Fu, X.; Li, J.; et al. Assessment of landslide hazard in Jiangxi using geo-information technology. Front. Earth Sci. Geohazards Georisks 2021, 9. [Google Scholar] [CrossRef]
  10. Zhou, X.; Wu, W.; Lin, Z.; Zhang, G.; Chen, R.; Song, Y.; Wang, Z.; Lang, T.; Qin, Y.; Ou, P.; et al. Zonation of landslide susceptibility in Ruijin, Jiangxi, China. Int. J. Environ. Res. Public Health 2020. under review. [Google Scholar]
  11. Wu, W.; Zucca, C.; Muhaimeed, A.S.; Al-Shafie, W.M.; Al-Quraishi, A.M.F.; Nangia, V.; Zhu, M.; Liu, G. Soil salinity prediction and mapping by machine learning regression in Central Mesopotamia, Iraq. Land Degrad. Dev. 2018, 29, 4005–4014. [Google Scholar] [CrossRef]
  12. Qin, Y.; Liu, L. Quantitative 3D Association of Geological Factors and Geophysical Fields with Mineralization and Its Significance for Ore Prediction: An Example from Anqing Orefield, China. Minerals 2018, 8, 300, (In Chinese with English abstract). [Google Scholar] [CrossRef] [Green Version]
  13. Zhou, Y.; Chen, S.; Zhang, Q.; Xiao, F.; Wang, S.; Liu, Y.; Jiao, S. Advances in Big Data and Mathematical Earth Science Research-Preface to Big Data and Mathematical Earth Science Topics. Acta Petrol. Sin. 2018, 34, 255–263, (In Chinese with English abstract). [Google Scholar]
  14. Liu, X. Risk zoning of avalanche and slip flow based on logistic regression model and certainty coefficient. J. Eng. Geol. 2014, 22, 1250–1256, (In Chinese with English abstract). [Google Scholar]
  15. Lumb, P. Slope failures in Hong Kong. Q. J. Eng. Geol. Hydrogeol. 1975, 8, 31–65. [Google Scholar] [CrossRef]
  16. Collison, A.J.C.; Anderson, M.G. Using a combined slope hydrology—Stability model to identify suitable conditions for landslide prevention by vegetation in the humid tropics. Earth Surf. Process. Landf. 1996, 21, 737–747. [Google Scholar] [CrossRef]
  17. Evans, N.C. Natural Terrain Landslide Study: Preliminary Assessment of the Influence of Rainfall on Natural Terrain Landslide Initiation; Geotechnical Engineering Office: Hong Kong, China, 1997; pp. 28–29. [Google Scholar]
  18. Mark, R.K.; Ellen, S.D. Statistical and Simulation Models for Mapping Debris Flow Hazard; Carrara, A., Guzetti, F., Eds.; Geographical Information System in Assessing Natural Hazard; Academic Publishers: Amsterdam, The Netherlands, 1995; pp. 93–106. [Google Scholar]
  19. Jiang, H.; Eastman, J.R. Application of fuzzy measures in multi-criteria evaluation in GIS. Int. J. Geogr. Inf. Sci. 2000, 14, 173–184. [Google Scholar] [CrossRef]
  20. Wan, E.T.; Mitani, Y.; Xie, M.; Mori, J. Spatial probabilistic modeling of slope failure using an integrated GIS Monte Carlo simulation approach. Eng. Geol. 2003, 68, 373–386. [Google Scholar]
  21. Chau, K.T.; Sze, Y.L.; Fung, M.K.; Wong, W.Y.; Fong, E.L.; Chan, L.C.P. Landslide hazard analysis for Hong Kong using landslide inventory and GIS. Comput. Geosci. 2004, 30, 429–443. [Google Scholar] [CrossRef]
  22. Meng, Q.; Sun, W.; Wang, T. Research on geological disasters susceptibility evaluation in Fengxian, Shaanxi. J. Eng. Geol. 2011, 19, 388–396, (In Chinese with English abstract). [Google Scholar]
  23. Zhang, C.; Chen, Y.; Zhang, Y.; Sun, X. Evaluation of the susceptibility of geological disasters in Zhaotong, Yunnan based on multiple linear regression model. Hydrogeol. Eng. Geol. 2016, 43, 159–163, (In Chinese with English abstract). [Google Scholar] [CrossRef]
  24. Roccati, A.; Paliaga, G.; Luino, F.; Faccini, F.; Turconi, L. GIS-Based Landslide Susceptibility Mapping for Land Use Planning and Risk Assessment. Land 2021, 10, 162. [Google Scholar] [CrossRef]
  25. Tian, C.; Liu, X.; Wang, J. Evaluation of the susceptibility of geological disasters in Guangdong Province based on CF and Logistic regression models. Hydrogeol. Eng. Geol. 2016, 43, 154–161, (In Chinese with English abstract). [Google Scholar] [CrossRef]
  26. Yue, X.; Huang, M.; Xu, Q.; Chen, C.; Gu, X. Susceptibility evaluation of debris flow disaster in Karst area of Guizhou Province. J. Earth Inf. Sci. 2015, 17, 1396–1403, (In Chinese with English abstract). [Google Scholar]
  27. Huang, F.; Yin, K.; Jiang, S.; Huang, J.; Cao, Z. Assessment of landslide susceptibility based on cluster analysis and support vector machine. J. Rock Mech. Eng. 2018, 37, 156–167, (In Chinese with English abstract). [Google Scholar] [CrossRef]
  28. Yang, H.; Wu, B.; Wang, L. Application of PSO-RBF coupling model of chaotic sequence in landslide displacement prediction. Sci. Technol. Eng. 2013, 13, 9118–9121, 9126, (In Chinese with English abstract). [Google Scholar]
  29. Catani, F.; Lagomarsino, D.; Segoni, S.; Tofani, V. Landslide susceptibility estimation by random forests technique: Sensitivity and scaling issues. Nat. Hazards Earth Syst. Sci. 2013, 13, 2815–2831. [Google Scholar] [CrossRef] [Green Version]
  30. Pourghasemi, H.R.; Kornejady, A.; Kerle, N.; Shabani, F. Investigating the effects of different landslide positioning techniques, landslide partitioning approaches, and presence-absence balances on landslide susceptibility mapping. Catena 2020, 187, 104364. [Google Scholar] [CrossRef]
  31. Xiong, K.; Adhikari, B.R.; Stamatopoulos, C.A.; Zhan, Y.; Wu, S.; Dong, Z.; Di, B. Comparison of Different Machine Learning Methods for Debris Flow Susceptibility Mapping: A Case Study in the Sichuan Province, China. Remote Sens. 2020, 12, 295. [Google Scholar] [CrossRef] [Green Version]
  32. Huang, F.; Yin, K.; Huang, J.; Gui, L.; Wang, P. Landslide susceptibility mapping based on self-organizing-map network and extreme learning machine. Eng. Geol. 2017, 223, 11–22. [Google Scholar] [CrossRef]
  33. Xia, H.; Yin, K.; Liang, X.; Ma, F. Evaluation of landslide susceptibility based on SVM-ANN model-Taking Wushan County in the Three Gorges Reservoir Area as an example. Chin. J. Geol. Hazard Control 2018, 29, 13–19, (In Chinese with English abstract). [Google Scholar] [CrossRef]
  34. Li, Y.; Mei, H.; Ren, X.; Hu, X.; Li, M. Evaluation of geological hazard susceptibility based on deterministic coefficient and support vector machine. J. Geo-Inf. Sci. 2018, 20, 1699–1709, (In Chinese with English abstract). [Google Scholar]
  35. Feng, F.; Wu, X.; Niu, R.; Xu, S.; Yu, X. Landslide sensitivity evaluation of BP neural network based on particle swarm optimization. Surv. Mapp. Sci. 2017, 42, 170–175, (In Chinese with English abstract). [Google Scholar] [CrossRef]
  36. Xu, C.; Dai, F.; Yao, X.; Zhao, Z.; Xiao, J. GIS platform and certainty factor analysis method based Wenchuan earthquake-induced landslide susceptibility evaluation. J. Eng. Geol. 2010, 18, 15–26, (In Chinese with English abstract). [Google Scholar]
  37. Fang, M.; Zhang, J.; Xu, Y. Research on Lanzhou City Landslide Disaster Sensitivity Based on GIS and Logistic Regression Model. Remote Sens. Technol. Appl. 2011, 26, 845–853, (In Chinese with English abstract). [Google Scholar]
  38. Du, Q.; Fan, W.; Li, K.; Yang, D.; Lv, J. The Application of Binary Logistic Regression and Information Model in Geological Hazard Zoning. Disaster Sci. 2017, 32, 220–226, (In Chinese with English abstract). [Google Scholar] [CrossRef]
  39. Aditian, A.; Kubota, T.; Shinohara, Y. Comparison of GIS-based landslide susceptibility models using frequency ratio, logistic regression, and artificial neural network in a tertiary region of Ambon, Indonesia. Geomorphology 2018, 318, 101–111. [Google Scholar] [CrossRef]
  40. Feby, B.; Achu, A.; Jimnisha, K.; Ayisha, V.; Reghunath, R. Landslide susceptibility modelling using integrated evidential belief function based logistic regression method: A study from Southern Western Ghats, India. Remote Sens. Appl. Soc. Environ. 2020, 20, 100411. [Google Scholar] [CrossRef]
  41. Mandal, K.; Saha, S.; Mandal, S. Applying deep learning and benchmark machine learning algorithms for landslide susceptibility modelling in Rorachu river basin of Sikkim Himalaya, India. Geosci. Front 2021, 101203, in press. [Google Scholar] [CrossRef]
  42. Shahri, A.A.; Spross, J.; Johansson, F.; Larsson, S. Landslide susceptibility hazard map in southwest Sweden using artificial neural network. Catena 2019, 183, 104225. [Google Scholar] [CrossRef]
  43. Othman, A.A.; Gloaguen, R.; Andreani, L.; Rahnama, M. Improving landslide susceptibility mapping using morphometric features in the Mawat area, Kurdistan Region, NE Iraq: Comparison of different statistical models. Geomorphology 2018, 319, 147–160. [Google Scholar] [CrossRef]
  44. Nicu, I.C. Frequency ratio and GIS-based evaluation of landslide susceptibility applied to cultural heritage assessment. J. Cult. Heritage 2017, 28, 172–176. [Google Scholar] [CrossRef]
  45. Chen, X.; Ran, H. Triggering factors susceptibility of earthquake-induced Landslides in 1976 Longling earthquake. Acta Sci. Nat. Univ. Pekin. 2009, 45, 104–110, (In Chinese with English abstract). [Google Scholar]
  46. Yang, G.; Xu, P.; Cao, S.; Zhang, W.; Lan, Z.; Chen, J.; Dong, X. Evaluation of regional landslide susceptibility based on certainty factor combination model. J. Eng. Geol. 2019, 27, 1153–1163. [Google Scholar] [CrossRef]
  47. Wu, S.; Li, H.; Zhang, K.; Chen, G. Susceptibility analysis of elevation factors to the development of landslide disaster in Pingchang county. J. Three Gorges Univ. 2015, 37, 62–66. [Google Scholar] [CrossRef]
  48. Bălteanu, D.; Chendeş, V.; Sima, M.; Enciu, P. A country-wide spatial assessment of landslide susceptibility in Romania. Geomorphology 2010, 124, 102–112. [Google Scholar] [CrossRef]
  49. Kamp, U.; Growley, B.J.; Khattak, G.A.; Owen, L.A. GIS-based landslide susceptibility mapping for the 2005 Kashmir earthquake region. Geomorphology 2008, 101, 631–642. [Google Scholar] [CrossRef]
  50. Süzen, M.L.; Doyuran, V. Data driven bivariate landslide susceptibility assessment using geographical information systems: A method and application to Asarsuyu catchment, Turkey. Eng. Geol. 2004, 71, 303–321. [Google Scholar] [CrossRef]
  51. Yang, D.; Fan, W. ArcGIS-based zoning assessment of geological hazards susceptibility—Taking Xunyang County as an example. Chin. J. Geol. Hazards Prev. 2015, 26, 82–86, (In Chinese with English abstract). [Google Scholar] [CrossRef]
  52. Zhao, Y.; Niu, R. Exploration of landslide risk zoning based on evidence weight method. Geogr. Geogr. Inf. Sci. 2010, 26, 19–23. [Google Scholar]
  53. Fan, L.; Lehmann, P.; Or, D. Effects of soil spatial variability at the hillslope and catchment scales on characteristics of rainfall-induced landslides. Water Resour. Res. 2016, 52, 1781–1799. [Google Scholar] [CrossRef] [Green Version]
  54. Lan, H.; Wu, F.; Zhou, C.; Wang, S. Sensitivity analysis of landslide factors in the Xiao Jiang River Basin in Yunnan based on GIS. J. Rock Mech. Eng. 2002, 10, 1500–1506, (In Chinese with English abstract). [Google Scholar]
  55. Wu, W. Application de la Géomatique au Suivi de la Dynamique Environnementale en Zones Arides. Ph.D. Thesis, Université de Paris 1-Panthéon-Sorbonne, Paris, France, 2003. (In French). [Google Scholar]
  56. Wu, W.; De Pauw, E.; Helldén, U. Assessing woody biomass in African tropical savannahs by multiscale remote sensing. Int. J. Remote Sens. 2013, 34, 4525–4549. [Google Scholar] [CrossRef]
  57. Wu, W.; Zucca, C.; Karam, F.; Liu, G. Enhancing the performance of regional land cover mapping. Int. J. Appl. Earth Obs. Geoinf. 2016, 52, 422–432. [Google Scholar] [CrossRef]
  58. Wu, W.; De Pauw, E.; Zucca, C. Using remote sensing to assess impacts of land management policies in the Ordos rangelands in China. Int. J. Digit. Earth 2013, 6, 1–22. [Google Scholar] [CrossRef]
  59. Nualchawee, K.; Miller, L.; Tom, C.; Christenson, J.; Williams, D. Spatial Inventory and Modelling of Shifting Cultivation and Forest Land Cover of Northern Thailand, with Inputs from Maps, Airphotos and Landsat; Remote Sensing Center Technical Report, No. 4177; Texas A&M University: College Station, TX, USA, 1981. [Google Scholar]
  60. Lambin, E.F. Modelling Deforestation Processes: A Review; Tropical Ecosystem Environment Observations by Satellites; TREES Series B: Research Report No. 1, EUR15744EN; European Commission: Luxembourg, 1994; pp. 45–101. [Google Scholar]
  61. Mertens, B.; Lambin, E.F. Land-Cover-Change Trajectories in Southern Cameroon. Ann. Assoc. Am. Geogr. 2000, 90, 467–494. [Google Scholar] [CrossRef]
  62. Serneels, S.; Lambin, E.F. Proximate causes of land-use change in Narok District, Kenya: A spatial statistical model. Agric. Ecosyst. Environ. 2001, 85, 65–81. [Google Scholar] [CrossRef] [Green Version]
  63. Verburg, P.H.; Soepboer, W.; Veldkamp, A.; Limpiada, R.; Espaldon, V.; Mastura, S.S.A. Modelling the spatial dynamics of regional land use: The CLUE-S model. Environ. Manag. 2002, 30, 391–405. [Google Scholar] [CrossRef]
  64. Wang, X.; Huang, H.; Ban, Y.; Li, J. Drawing of landslide hazard zoning map with GIS assistance—A case study of Panzhihua City, Sichuan Province. Bull. Surv. Mapp. 2015, 2, 46–50, (In Chinese with English abstract). [Google Scholar] [CrossRef]
  65. Shortliffe, E.H.; Buchanan, B.G. A model of inexact reasoning in medicine. Math. Biosci. 1975, 23, 351–379. [Google Scholar] [CrossRef]
  66. Heckerman, D. Probabilistic interpretation for MYCIN’s certainty factors. Mach. Intell. Pattern Recognit. 1986, 4, 167–196. [Google Scholar]
  67. Xiao, T. GIS-Based Research on Spatial Simulation of Landslide in Lanzhou. Ph.D. Thesis, College of Earth and Environmental Science, Lanzhou University, Lanzhou, China, 2006. (In Chinese with English abstract). [Google Scholar]
  68. Arabameri, A.; Pradhan, B.; Rezaei, K. Gully erosion zonation mapping using integrated geographically weighted regression with certainty factor and random forest models in GIS. J. Environ. Manag. 2019, 232, 928–942. [Google Scholar] [CrossRef]
  69. Azareh, A.; Rahmati, O.; Rafiei-Sardooi, E.; Sankey, J.B.; Lee, S.; Shahabi, H.; Bin Ahmad, B. Modelling gully-erosion susceptibility in a semi-arid region, Iran: Investigation of applicability of certainty factor and maximum entropy models. Sci. Total Environ. 2019, 655, 684–696. [Google Scholar] [CrossRef]
  70. Costache, R.; Pham, Q.B.; Avand, M.; Linh, N.T.T.; Vojtek, M.; Vojteková, J.; Lee, S.; Khoi, D.N.; Nhi, P.T.T.; Dung, T.D. Novel hybrid models between bivariate statistics, artificial neural networks and boosting algorithms for flood susceptibility assessment. J. Environ. Manag. 2020, 265, 110485. [Google Scholar] [CrossRef] [PubMed]
  71. Chen, W.; Xie, X.; Peng, J.; Shahabi, H.; Hong, H.; Bui, D.T.; Duan, Z.; Li, S.; Zhu, A. GIS-based landslide susceptibility evaluation using a novel hybrid integration approach of bivariate statistical based random forest method. Catena 2018, 164, 135–149. [Google Scholar] [CrossRef]
  72. Juliev, M.; Mergili, M.; Mondal, I.; Nurtaev, B.; Pulatov, A.; Hübl, J. Comparative analysis of statistical methods for landslide susceptibility mapping in the Bostanlik District, Uzbekistan. Sci. Total Environ. 2019, 653, 801–814. [Google Scholar] [CrossRef] [PubMed]
  73. Cohen, J. A Coefficient of Agreement for Nominal Scales. Educ. Psychol. Meas. 1960, 20, 37–46. [Google Scholar] [CrossRef]
  74. Landis, J.R.; Koch, G.G. The measurement of observer agreement for categorical data. Biometrics 1977, 33, 159–174. [Google Scholar] [CrossRef] [Green Version]
  75. Zhu, L.; Huang, J.F. GIS-based logistic regression method for landslide susceptibility mapping in regional scale. J. Zhejiang Univ. Sci. A 2006, 7, 2007–2017, (In Chinese with English abstract). [Google Scholar] [CrossRef]
  76. Ozdemir, A.; Altural, T. A comparative study of frequency ratio, weights of evidence and logistic regression methods for landslide susceptibility mapping: Sultan Mountains, SW Turkey. J. Asian Earth Sci. 2013, 64, 180–197. [Google Scholar] [CrossRef]
  77. Belsley, D.A. Conditioning Diagnostics: Collinearity and Weak Data in Regression; Wiley: New York, NY, USA, 1991. [Google Scholar]
Figure 1. Location map of the study area, Guixi, Jiangxi, China.
Figure 1. Location map of the study area, Guixi, Jiangxi, China.
Sustainability 13 04830 g001
Figure 2. Geo-environmental factors used in the study: (a) land cover, (b) slope, (c) aspect, (d) NDVI.
Figure 2. Geo-environmental factors used in the study: (a) land cover, (b) slope, (c) aspect, (d) NDVI.
Sustainability 13 04830 g002
Figure 3. Geo-environmental factors used in the study: (a) roads, (b) rivers, (c) faults, (d) annual average precipitation.
Figure 3. Geo-environmental factors used in the study: (a) roads, (b) rivers, (c) faults, (d) annual average precipitation.
Sustainability 13 04830 g003
Figure 4. Landslide risk maps: (a) from model MLR-1, and (b) from model MLR-2.
Figure 4. Landslide risk maps: (a) from model MLR-1, and (b) from model MLR-2.
Sustainability 13 04830 g004
Table 1. Data types and sources.
Table 1. Data types and sources.
Data TypeSourceType
Geological map1/50,000 scale geological mapshapefile
Rivers and roadsGoogle Earthshapefile
Meteorological dataGround stationsshapefile and grid
Topographical dataASTGTMV003 30m from NASA (www.earthdata.nasa.gov, accessed on 11 August 2020)grid (30 m)
Land cover types and NDVILandsat data from USGS data server (https://glovis.usgs.gov, accessed on 8 May 2020)grid (30 m)
Table 2. Proneness weight of the different lithological formations.
Table 2. Proneness weight of the different lithological formations.
LithologyGranitic and Volcanic RocksSandstoneMetamorphic RocksLimestone and CarbonatiteMudstone and Shale
Weight157810
Table 3. Division and CF values of the main geo-environmental factors.
Table 3. Division and CF values of the main geo-environmental factors.
VariableSubsetArea of Subset (km2)Area of Landslide (km2)CF
Elevation20–2671658.7360.3951−0.268
267–515355.5970.1134−0.018
515–762160.4390.0135−0.742
762–100973.5260.0063−0.735
1009–125732.5520.0018−0.831
1257–15048.9550.0000−1
Slope<3271.5310.0234−0.735
3–9774.3070.1395−0.446
9–15448.9840.18000.188
15–21296.9660.10620.092
21–27220.8820.0567−0.209
27–33144.2430.0171−0.634
33–3979.5660.0063−0.757
39–4536.9400.0009−0.926
>4516.3860.0000−1
AspectFlat23.67540−1.000
North143.120.0261−0.44
Northeast299.6950.063−0.354
East313.2120.0576−0.434
Southeast302.9420.0621−0.369
South281.110.0873−0.043
Southwest301.7160.0909−0.074
West323.2070.0684−0.348
Northwest311.5660.0549−0.459
North146.1090.0198−0.582
Proximity to roads0–3073.52460.3060.922
30–6071.16030.12510.815
60–9069.03360.03690.393
90–12066.83220.0117−0.462
Proximity to rivers0–3050.44950.0540.696
30–6048.58290.0450.649
60–9047.01870.04590.667
90–12045.98730.0270.446
0–6069.45480.02430.071
60–12062.42040.0126−0.379
120–18057.97080.0135−0.283
180–24054.74520.02430.268
Proximity to faults0–3037.45260.01980.386
30–6038.05920.01890.346
60–9038.98530.01530.171
90–12039.22650.01890.326
0–6017.56980.0045−0.212
60–12017.90280.0090.354
120–18017.70840.0054−0.062
180–24017.18730.00630.114
NDVI−0.297–−0.07310.0340.0000−1
−0.073–0.15130.3110.0027−0.726
0.151–0.375687.5690.0954−0.572
0.375–0.599737.2380.34290.301
0.599–0.823824.7460.0891−0.668
Annual average precipitation1604.7–1683.8160.93170.0144−0.723
1683.8–1762.8204.51240.0081−0.877
1762.8–1841.9947.53980.1251−0.594
1841.9–1920.9698.87610.1863−0.179
1920.9–2000277.96410.19620.54
Table 4. Results of random points (RP)-based MLR modeling (Model MLR-1).
Table 4. Results of random points (RP)-based MLR modeling (Model MLR-1).
VariableβSEWalsdfSigexp(β)Order
Elevation−0.0070.00155.01510.0000.9937
Slope0.1350.01675.71210.0001.1453
Aspect0.0030.0017.85410.0051.0036
Proximity to roads0.4400.025298.67410.0001.5521
Proximity to rivers0.0240.0211.25910.2621.0244
Proximity to faults−0.0070.0330.03910.8430.9937
NDVI−2.1551.0284.38910.0360.11612
Lithology−0.1450.03715.26910.0000.86510
Land use0.0100.0780.01610.8991.0105
Accumulated average precipitation of March–July0.3100.06125.91710.0001.3642
Accumulated average precipitation of May–July−0.1190.03214.12310.0000.8889
Accumulated average precipitation of March–June−0.2060.02854.25910.0000.81411
Annual average precipitation−0.0320.0163.92210.0480.9698
Constant−6.6753.8852.95210.0860.001
Table 5. Results of certainty factor (CF)-based MLR modeling (Model MLR-2).
Table 5. Results of certainty factor (CF)-based MLR modeling (Model MLR-2).
VariableβSEWalsdfSigexp(β)Order
Elevation1.7200.7285.58410.0185.5855
Slope2.6450.35854.52410.00014.0831
Aspect0.7820.6001.69910.1922.1868
Proximity to roads2.6080.150301.48010.00013.5722
Proximity to rivers0.5920.2585.26610.0221.80810
Proximity to faults−0.0850.5730.02210.8820.91812
NDVI1.4030.25430.58010.0004.0676
Lithology2.2250.31350.62310.0009.2564
Land use0.7290.5701.64010.2002.0749
Accumulated average precipitation of March–July−1.5820.8133.78710.0520.20613
Accumulated average precipitation of May–July0.5530.7570.53310.4651.73811
Accumulated average precipitation of March–June2.4000.67412.68110.00011.0193
Annual average precipitation0.9080.5992.29410.1302.4797
Constant2.9990.38959.51510.00020.058
Note: β—regression coefficient of each predictive factor in the model, SE—standard error, Wals—Chi-square value, df—degree of freedom, and Sig—significance.
Table 6. Overall accuracy (OA) and Kappa coefficient (KC) of the two MLR models versus the validation set (VS).
Table 6. Overall accuracy (OA) and Kappa coefficient (KC) of the two MLR models versus the validation set (VS).
ModelOverall Accuracy (%)Kappa Coefficient
RP-based MLR-191.5%0.814
CF-based MLR-290.4%0.782
Table 7. Observation accuracy of the predicted risk map versus the observed landslides.
Table 7. Observation accuracy of the predicted risk map versus the observed landslides.
Risk LevelDistribution of the Field Observed Landslides
MLR-1MLR-2
No risk4.76%6.60%
Low risk3.66%1.83%
Median risk4.76%2.93%
High risk9.89%6.60%
Extremely high risk76.92%82.05%
Table 8. Collinearity of the geo-environmental factors of the two models: MLR-1 and MLR-2.
Table 8. Collinearity of the geo-environmental factors of the two models: MLR-1 and MLR-2.
VariableMLR-1MLR-2
Collinearity StatisticsCollinearity Statistics
ToleranceVIFToleranceVIF
Elevation0.4032.4790.9191.088
Slope0.5411.8470.7051.419
Aspect0.9191.0880.8571.167
Proximity to roads0.5961.6780.5871.702
Proximity to rivers0.8631.1590.8711.148
Proximity to faults0.9371.0670.9281.078
NDVI0.3892.5700.8511.176
Lithology0.6521.5340.6861.458
Land use0.9321.0730.8471.181
Accumulated average precipitation of March–June0.01473.4950.1785.623
Accumulated average precipitation of May–July0.01192.4000.1297.745
Annual average precipitation0.009106.5640.1248.074
Accumulated average precipitation of March–July0.002571.3620.1168.649
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Huangfu, W.; Wu, W.; Zhou, X.; Lin, Z.; Zhang, G.; Chen, R.; Song, Y.; Lang, T.; Qin, Y.; Ou, P.; et al. Landslide Geo-Hazard Risk Mapping Using Logistic Regression Modeling in Guixi, Jiangxi, China. Sustainability 2021, 13, 4830. https://doi.org/10.3390/su13094830

AMA Style

Huangfu W, Wu W, Zhou X, Lin Z, Zhang G, Chen R, Song Y, Lang T, Qin Y, Ou P, et al. Landslide Geo-Hazard Risk Mapping Using Logistic Regression Modeling in Guixi, Jiangxi, China. Sustainability. 2021; 13(9):4830. https://doi.org/10.3390/su13094830

Chicago/Turabian Style

Huangfu, Wenchao, Weicheng Wu, Xiaoting Zhou, Ziyu Lin, Guiliang Zhang, Renxiang Chen, Yong Song, Tao Lang, Yaozu Qin, Penghui Ou, and et al. 2021. "Landslide Geo-Hazard Risk Mapping Using Logistic Regression Modeling in Guixi, Jiangxi, China" Sustainability 13, no. 9: 4830. https://doi.org/10.3390/su13094830

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop