Machine-Learning-Based Prediction Modeling for Debris Flow Occurrence: A Meta-Analysis

Yang, Lianbing; Ge, Yonggang; Chen, Baili; Wu, Yuhong; Fu, Runde

doi:10.3390/w16070923

Open AccessReview

Machine-Learning-Based Prediction Modeling for Debris Flow Occurrence: A Meta-Analysis

by

Lianbing Yang

^1,2,3,

Yonggang Ge

^1,2,*

,

Baili Chen

^3,4,

Yuhong Wu

^2,3 and

Runde Fu

^3,5

¹

Key Laboratory of Mountain Hazards and Earth Surface Process, Chinese Academy of Sciences, Chengdu 610299, China

²

Institute of Mountain Hazards and Environment, Chinese Academy of Sciences, Chengdu 610299, China

³

University of Chinese Academy of Sciences, Beijing 101408, China

⁴

Northwest Institute of Eco-Environment and Resources, Chinese Academy of Sciences, Lanzhou 730000, China

⁵

Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China

^*

Author to whom correspondence should be addressed.

Water 2024, 16(7), 923; https://doi.org/10.3390/w16070923

Submission received: 24 February 2024 / Revised: 13 March 2024 / Accepted: 19 March 2024 / Published: 22 March 2024

(This article belongs to the Special Issue Monitoring, Modelling, Assessment and Mitigation of Debris Flow Hazards)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Machine learning (ML) has become increasingly popular in the prediction of debris flow occurrence, but the various ML models utilized as baseline predictors reported in previous studies are typically limited to individual case bases. A comprehensive and systematic evaluation of existing empirical evidence on the utilization of ML as baseline predictors for debris flow occurrence is lacking. To address this gap, we conducted a meta-analysis of ML-based prediction modeling of debris flow occurrence by retrieving papers that were published between 2000 and 2023 from the Scopus and Web of Science databases. The general findings were as follows: (1) A total of 84 papers, distributed across 37 different journals in this time period, reflecting an overall upward trend. (2) Debris flow disasters occur throughout the world, and a total of 13 countries carried out research on the prediction of debris flow occurrence based on ML; China made significant contributions, but more research efforts in African countries should be considered. (3) A total of 36 categories of ML models were utilized as baseline predictors for debris flow occurrence, with logistic regression (LR) and random forest (RF) emerging as the most popular choices. (4) Feature engineering and model comparison were the most commonly utilized strategies in predicting debris flow occurrence based on ML (53 and 46 papers, respectively). (5) Interpretation methods were rarely utilized in predicting debris flow occurrence based on ML, with only 16 papers reporting their utilization. (6) In the prediction of debris flow occurrence based on ML, interpretation methods were rarely utilized, searching by data materials was the most important sample data source, the topographic factors were the most commonly utilized category of candidate variables, and the area under the ROC curve (AUROC) was the most frequently reported evaluation metric. (7) LR’s prediction performance for debris flow occurrence was inferior to that of RF, BPNN, and SVM; SVM was comparable to RF, and all superior to BPNN. (8) The application process for the prediction of debris flow occurrence based on ML consisted of three main steps: data preparation, model construction and evaluation, and prediction outcomes. The research gaps in predicting debris flow occurrence based on ML include utilizing new ML techniques and enhancing the interpretability of ML. Consequently, this study contributes both to academic ML research and to practical applications in the prediction of debris flow occurrence.

Keywords:

machine learning; debris flow; occurrence; prediction; meta-analysis

1. Introduction

Debris flow is a frequent natural geological phenomenon in valleys, which is a three-phase saturated fluid composed of solids, liquids, and gases [1,2]. Its formation is catalyzed by triggering conditions such as heavy rains, glacial and snowmelt waters, and dam failures [3,4]. In recent years, the occurrence of debris flow disasters has risen due to extreme weather, earthquakes, forest fires, and human engineering activities [5]. Debris flow is characterized by sudden and rapid movement, which can quickly erode, transport, accumulate, and impact the earth’s surface [6,7]. This phenomenon has posed a serious threat to human life and property, and the ecological environment of mountainous regions, and therefore, it has become a major disaster factor hindering the social and economic development of mountain areas around the world [8,9]. Proactive deployment of disaster prevention and mitigation measures based on predictive information regarding debris flow occurrence can minimize the impact of such disasters [10,11,12]. Hence, the accurate and scientific prediction of debris flow occurrence holds immense significance in effectively preventing and managing debris flow disasters.

The genesis of debris flow is fundamentally governed by three primary factors: material source, water source, and topography [13,14,15]. Moreover, the causative factors behind debris flow are intricate, involving a multitude of elements such as geology, topography, landform, soil, vegetation, rainfall, and temperature [16,17,18,19,20]. Moreover, the formation process of debris flow involves theoretical knowledge of many disciplines, and shows complex nonlinear characteristics [21]. Consequently, numerous researchers have endeavored to construct qualitative or quantitative models to comprehensively simulate the intricate mechanisms underlying debris flow formation, thereby enhancing the accuracy of prediction of debris flow occurrence [22,23,24,25,26]. In this study, the prediction of debris flow occurrence refers to whether debris flow occurs or the possibility of debris flow occurrence within a certain area based on the prediction model. The possibility of debris flow occurrence is generally characterized by the susceptibility [27,28] or hazard [29,30] level of the debris flow. Susceptibility refers to the possibility of debris flow occurrence in a certain evaluation unit, considering non-triggering factors such as topography, geomorphology, and surface cover characteristics. Hazard, on the other hand, incorporates triggering factors like rainfall into the susceptibility assessment. Debris flow prediction models fall into four main categories: knowledge-driven models (analytic hierarchy process [31,32], rainfall threshold method [33,34,35], geomorphic information entropy [36], etc.), traditional statistical models (weight of evidence method [37,38], certainty factor [39,40], frequency ratio [40,41], etc.), numerical simulation models (FLO-2D [42], Flow-3D [43,44], Debris2D [45], etc.), and ML models (LR [46,47], RF [24,48], convolutional neural networks [49,50], etc.).

ML employs specific algorithms within computers to discern patterns within data and construct models [51,52]. Owing to its rapid advancement in recent years and its robust capability to capture intricate relationships between predictors and response variables, ML has gained substantial traction in predicting debris flow occurrence [22,53,54]. While numerous studies have affirmed the effectiveness, applicability, and advantages of utilizing ML models as baseline predictors for debris flow occurrence on an individual case basis, these singular instances provide limited reference information. Therefore, there is a need to systematically summarize the research findings related to the utilization of ML models as baseline predictors for debris flow occurrence. To address these issues, this study collated journal papers published between 2000 and 2023 from the Scopus and Web of Science databases to consolidate the collective knowledge concerning the prediction of debris flow occurrence based on ML. Subsequently, a meta-analysis was conducted within this domain.

2. Data Processing Workflow

2.1. Literature Retrieval and Selection Criteria

To comprehensively retrieve literature highly relevant to the research topic, while reducing the subsequent manual screening workload, this study utilized a three-level search approach, incorporating conditions related to the problem, model, and other factors, and linking them using the logical operator “and” (Figure 1). At the problem level, the article’s title was required to incorporate words characterizing debris flow and terms associated with the prediction of debris flow occurrence. For this purpose, synonymous terms for debris flow, such as debris slide, debris flood, and mudflow, were considered. Additionally, words related to debris flow prediction, including occurrence, initiation, prediction, assessment, warning, and modeling, were also included in the search criteria. At the modeling level, the abstract was mandated to include terms related to ML, considering aspects like commonly used ML models and terminologies. At the other level, we included English-language papers published between 2000 and 2023 (as there were fewer studies before 2000); we excluded review articles and conference papers.

Scopus and Web of Science, renowned literature databases covering diverse fields, were utilized for the literature search. Based on the criteria outlined in Figure 1, this study systematically searched for literature related to ML-based prediction of debris flow occurrence in the Scopus and Web of Science databases on 27 December 2023. Afterward, following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses methodology (PRISMA) [55], papers for inclusion in this study were selected. The PRISMA framework comprises four key phases: “identification”, “screening”, “eligibility”, and “included”. In addition, in this study, the PRISMA selection criteria primarily encompassed two aspects: (1) inclusion of papers featuring one or more ML models as a baseline predictor for debris flow occurrence, with a clear training and verification process, and quantitative prediction performance evaluation; (2) exclusion of papers focused on purely experimental methods for predicting debris flow occurrence.

Firstly, in the “identification” phase, we retrieved 410 journal papers from the two databases and removed 189 duplicate papers. Secondly, during the “screening” phase, we assessed the titles and abstracts of 221 papers, eliminating 64 papers that did not meet the PRISMA selection criteria. Thirdly, in the “eligibility” phase, we scrutinized the 157 remaining papers in detail, and after eligibility assessment, we identified 73 papers as unrelated to this meta-analysis and removed them. Finally, in the “included” phase, 84 papers were deemed suitable for inclusion in this meta-analysis (Figure 2).

2.2. Data Extraction

The data used for the meta-analysis in this study were collected from the 84 papers, and the results extracted from each paper were compiled into a structured database. As outlined in Table 1, the database encompassed 15 attribute fields derived from the fundamental aspects of papers, and the modeling and prediction of debris flow occurrence using ML. The attribute fields included journal, title, year, study area, institution, type of occurrence, mapping unit, baseline model, improvement strategy, sample data, candidate variable, validation technique, evaluation metric, area under the curve, and case number. The specific meanings of each attribute field are shown in the attribute description column in Table 1.

The determination of the number of cases in each paper was based on the quantity of data sets used for ML training in the respective articles. For instance, if an article incorporated three training sets, all utilized for ML modeling in the prediction of debris flow occurrence with quantitative results, then the article was recorded as contributing three cases. The area under the curve was recorded on a case-by-case basis.

3. Results

In the research on ML-based prediction modeling of debris flow occurrence, both whether debris flow occurs or the possibility of debris flow occurrence essentially rely on ML classification models [21,28,29]. Therefore, based on the 84 papers selected in Section 2, this study performed a meta-analysis on the prediction of debris flow occurrence based on ML from an overall perspective, and did not distinguish between the types of debris flow occurrence in detail. Firstly, the general characteristics of the 84 papers were analyzed, covering the annual number of papers, the published journals, and the geographical distribution of the study areas and first institutions. Subsequently, the fundamental characteristics of the ML application in the prediction of debris flow occurrence were elaborated, considering ML categories, strategies for improving prediction performance, model interpretation, sample data sources, evaluation units, candidate variable categories, validation techniques, evaluation metrics, prediction performance, and application processes.

3.1. General Characteristics of Studies

During the literature search timeframe for this study (2000–2023), papers on the prediction of debris flow occurrence based on ML emerged in 2006, and continued to appear each year from 2006 to 2023, which indicated an overall upward trend (Figure 3). A total of 84 papers were published, with the lowest numbers recorded in 2008 and 2010 (1 paper each), and the highest in 2022 (18 papers). Notably, approximately 58.33% (49 out of 84) of the papers were published within the last five years (2018–2023), underscoring the increasing attention given to the prediction of debris flow occurrence based on ML in recent years.

The 84 papers were distributed across 37 journals (Table 2), with 13 journals publishing 2 or more papers. Among these, 9 publishers were involved, with significant contributions from Springer, Elsevier, and MDPI. The journal with the highest number of published papers was Natural Hazards, totaling 15, followed by Remote Sensing with 8 papers. Except for Disaster Advances, which is currently not included in the Science Citation Index (SCI), the remaining 12 journals are SCI journals, with Engineering Geology having the highest impact factor (7.4). In terms of subject types, these 13 journals primarily cover earth science, remote sensing science, disaster science, hydrology science, geological science, and mountain science. This diversity underscores that the prediction of debris flow involves knowledge from many subject fields.

In this study, the geographical distribution of the research areas and the first institutions of the 84 papers was determined, according to the country reported in the papers, with the exclusion of the two intercontinental-scale papers [56,57]. Out of the 84 papers, the research regions of the paper [58] corresponded to 8 countries, with the remaining papers each corresponding to a single country. The count encompassed both the country in which the research area of the paper was situated and the country in which the first institution of the paper was located. The results are illustrated in Figure 4. The research areas of the 84 papers spanned 18 countries, distributed across 6 continents (Asia, Africa, North America, South America, Europe, and Oceania), indicating the widespread occurrence of debris flow disasters around the world. Studies on the prediction of debris flow occurrence based on ML were conducted in a total of 13 countries, showing a global interest in this field. China led in both the numbers of papers by the research areas and the first institutions, at 53 and 54, respectively, suggesting a significant contribution to this field. Notably, there was a lack of studies from African countries, with no first institution from this continent among the studies. Moreover, in the geographical distribution of the research areas, Ethiopia was the only African country reported [59]. This indicated that more study efforts on the prediction of debris flow occurrence based on ML in African countries should be considered, especially considering the severity of debris flow disasters in this region.

3.2. General Characteristics of ML Applications

3.2.1. ML Categories

Since various ML models have been utilized as baseline predictors of debris flow occurrence in different years, relying solely on the number of papers may lead to significant errors in judging the popularity or application rate of a particular ML model in predicting debris flow occurrence. To address this, this study adopted the concept of mean value and calculated the relative annual number of papers reported for a specific ML model. The calculation formula is as follows:

μ = \frac{F}{Y_{e n d} - Y_{b e g i n} + 1}

(1)

In the formula,

μ

is the relative annual number of papers reported, F is the number of papers reported for a specific ML model in the literature search period,

Y_{b e g i n}

is the initial year of application of a specific ML model in a certain field, and

Y_{e n d}

is the last year of the literature search period.

Based on analysis of the 84 papers, the ML models utilized as baseline predictors of debris flow occurrence were summarized (Figure 5). A total of 36 categories of ML models were utilized as baseline predictors of debris flow occurrence, which were further grouped into 10 broad categories: generalized linear models, ensemble models, shallow neural networks, discriminant analysis, tree models, kernel models, Bayesian models, evolutionary models, instance-based models, and deep learning. It is worth noting that compared with other broad categories, within deep learning, only convolutional neural network was utilized as a baseline predictor of debris flow occurrence, in only five studies. This underscores the need for further study on the applicability of complex network structures in deep learning for predicting debris flow occurrence [50,60].

Examining the initial years of utilization for the 36 categories of ML models, BPNN (2006) and SVM (2006) were the first ML models utilized in the prediction of debris flow occurrence, followed by genetic algorithm (2007), LR (2008), linear discriminant analysis (2008), and so on. In the recent 5 years (2018–2023), 19 categories of ML models were utilized as baseline predictors of debris flow occurrence, accounting for 52.778% of the total number of ML categories retrieved in this study (2000–2023). This suggested a growing trend of utilizing a diverse range of ML models in recent years, possibly influenced by the rapid development and popularization of ML technologies [61,62].

Among the 36 categories of ML models, 4 categories were reported in more than 20 papers, in the following order: LR (43), SVM (24), BPNN (21), and RF (21). Additionally, nine categories had a relative annual number of papers greater than or equal to one, in the following order: RF (3.5), LR (2.688), SVM (1.333), extreme gradient boosting (1.2), BPNN (1.167), decision tree (1.091), gradient tree boosting (1), convolutional neural network (1), and multilayer perceptron (1). Among these, extreme gradient boosting, gradient tree boosting, convolutional neural network, and multilayer perceptron were the most utilized ML models in the prediction of debris flow in the recent 5 years, indicating their strong popularity in predictive modeling of debris flow occurrence. Considering the number of papers and the relative annual number of papers for the 36 categories of ML models, LR and RF emerged as the most popular models for predicting debris flow occurrence based on ML.

3.2.2. Prediction Performance Improvement Strategies

In the study of prediction modeling of debris flow occurrence based on ML, researchers have employed various performance improvement strategies to obtain a better model. This study categorized the prediction performance improvement strategies utilized in the 84 studies into five groups: feature engineering, model comparison, hyperparameter tuning, model coupling, and structure optimization. Feature engineering encompasses methods such as feature selection, dimensionality reduction, and weighting. Within feature selection, the methods include stepwise feature screening, multicollinearity analysis, and importance analysis. Model comparison involves comparisons of different ML models or comparisons between ML models and non-ML models. Hyperparameter tuning focuses on optimizing ML hyperparameters using various optimization algorithms. Model coupling involves integrating different ML models or combining ML models with non-ML models. Structure optimization includes enhancing the network structure of deep learning models.

Out of the 84 studies, the number of studies using each of the five categories of ML prediction performance improvement strategies was statistically analyzed, with the results presented in Figure 6. Feature engineering and model comparison were the most commonly utilized strategies in predicting debris flow occurrence based on ML, with 53 and 46 studies, respectively. Out of the 84 studies, 72 used one or more of these strategies, constituting 85.71% of all studies, highlighting the widespread utilization of these strategies in predicting debris flow occurrence based on ML. Among these 72 studies, 36 used only one strategy, 30 used two strategies, 5 used three strategies, and 1 reported four strategies. The statistical results revealed that the most common method of model improvement in predicting debris flow occurrence based on ML involved the utilization of one or two of these strategies, while fewer studies utilized three or four of these strategies.

From the 53 studies utilizing feature engineering, the number of studies using each method of feature engineering was statistically analyzed. The feature selection methods within feature engineering were further subdivided, with the results presented in Figure 7. Feature selection was the most popular method of feature engineering in the prediction modeling of debris flow occurrence based on ML, with 49 studies. Among these 49 studies, the number of studies utilizing each method of feature selection was as follows: multicollinearity analysis (26), stepwise feature screening (18), and importance analysis (13). Single methods were the most commonly utilized, rather than combinations of multiple methods for feature selection in predicting debris flow occurrence based on ML. Only a few studies used a combination of multiple methods such as multicollinearity analysis and stepwise feature screening (four) and multicollinearity analysis and importance analysis (four).

Among the 26 studies utilizing multicollinearity analysis, the used algorithms included Pearson correlation analysis [41], Spearman correlation analysis [63], variance inflation factor [64], and the tolerance method [65]. Among the 18 studies using stepwise feature screening, the methods mainly included forward selection [28], back selection [66], and artificial stepwise feature combination [67]. Among the 13 studies using importance analysis, the algorithms used included Pearson correlation analysis [68] and information gain ratio [69]. In the two studies using feature dimensionality reduction, the algorithms utilized were principal component analysis [70,71]. Among the two papers using feature weighting, the algorithms utilized were certainty factor and genetic algorithm.

Among the 46 studies using model comparison, the number of papers comparing ML models (36) was significantly higher than that of those comparing ML models and non-ML models (10), as shown in Figure 8. Among the 36 papers comparing ML models, the number of papers comparing shallow ML models (32) was much higher than that of those comparing shallow ML models and deep learning models (4). This discrepancy may be attributed to the fact that deep learning (2019) emerged much later than shallow ML (2006), and convolutional neural network was the only deep learning model utilized as a baseline predictor in the prediction of debris flow occurrence based on ML.

Among the eight studies using hyperparameter tuning, some studies utilized multiple optimization algorithms; the number of studies using each of the different optimization algorithms was as follows: grid search algorithm (four), particle swarm optimization algorithm (three), genetic algorithm (two), cuckoo optimization algorithm (one) and gray wolf optimization algorithm (one). Among the eight studies using model coupling, the number of studies using each of the model coupling methods was as follows: coupling of ML model and traditional statistical model (three), coupling of ML and mechanism model (two), and coupling between ML models (two). In one paper using structure optimization, the author improved the structure of the convolutional neural network according to the characteristics of debris flow [50].

3.2.3. Model Interpretation

ML is an inexplicable “black-box” model, so enhancing its interpretability is important to the scientific understanding of prediction outcomes of debris flow occurrence based on ML [72]. Of the 84 papers, 16 papers reported interpretation methods of ML, constituting 19.05% of all papers (Figure 9). This indicated that explanations for ML outputs were provided in only a few papers. The interpretation methods utilized in the 16 studies included tree-based feature importance (TFI) [73], sensitivity analysis (SA) [74], permutation feature importance (PFI), partial dependence plot (PDP) [75], and Shapley additive explanations (SHAP) [76]. TFI calculates the contribution of each feature and is exclusively applicable to tree models, unlike the other four interpretation methods. Among the 16 studies, 10 studies interpreted ML using TFI, while PFI, PDP, and SHAP were rarely utilized.

3.2.4. Sample Sources

In the prediction of debris flow occurrence based on ML, the sample data, which serve as the output of the predictor, primarily consist of debris flow events or non-debris flow events in the evaluation unit. The sample data sources in this study were categorized into three groups: searching by data materials (including historical records, official announcements, related websites, etc.), remote sensing interpretation (utilizing high-resolution satellite images and aerial photographs), and field survey; the results are presented in Figure 10. In this study, “searching by data materials” refers to extracting sample data from relevant texts containing information about debris flow events. These texts encompass electronic texts downloaded from online sources and paper texts. Searching by data materials was the most important sample data source in predicting debris flow occurrence based on ML, with 58 studies. Of the 84 studies, 52 used a single sample data source, and 32 utilized a combination of sample data sources. This suggested that a single method was more commonly utilized than a combination of multiple sample data sources in predicting debris flow occurrence based on ML.

3.2.5. Evaluation Units and Candidate Variable Categories

From the 84 papers, the types of evaluation units were classified and summarized, as depicted in Figure 11. The evaluation unit categories were classified into two broad categories, with 49 studies using surface evaluation units and 36 using point evaluation units. Only one paper [77] utilized both categories of evaluation units, and the remaining studies all utilized only one category of evaluation units. The point evaluation units correspond to the grid cell. Regarding the surface evaluation unit, watershed was the most important evaluation unit in predicting debris flow occurrence based on ML, with 21 papers. Only one paper selected village with social property as the evaluation unit, while the remaining studies selected evaluation unit with natural property in predicting debris flow occurrence based on ML.

In the prediction of debris flow occurrence, the depiction of candidate variables is intricately linked to the author’s cognition, the research domain, and the chosen evaluation unit. Notably, the representation of identical candidate variables varies across the papers, reflecting the nuances of individual research intentions. Therefore, with reference to relevant studies [24,78,79,80,81,82], the candidate variables from the 84 papers were systematically categorized into 12 groups: topography factors, morphology factors, geomorphology factors, geology factors, meteorology factors, hydrology factors, soil factors, vegetation factors, fire factors, material source factors, human activity factors, and past debris flow characteristic factors (Table 3).

The studies using factors from each of the 12 categories were counted from the two aspects of the point evaluation units and surface evaluation units, and the results are shown in Figure 12. Among the 49 papers utilizing the surface evaluation units, all of the 12 categories were used, with the top three being topography factors (45), meteorology factors (40), and morphology factors (35). By contrast, in the 36 studies utilizing point evaluation units, morphology factors, fire factors, material source factors, and characteristics factors of past debris flow were not used; the top three with the highest number of papers reported were topography factor (34), hydrology factor (27), and human activity factor (24). This discrepancy may be due to morphology factors, material source factors, and characteristic factors of past debris flow being associated with surface evaluation units, and researchers were more inclined to utilize the surface evaluation units to predict the occurrence of post-fire debris flow (fire factor utilized) based on ML. In the studies utilizing point evaluation units and surface evaluation units, there was a difference in the ranking of the number of studies using each variable category, with topographic factors being the most popular, with a total of 78 papers. This preference can be attributed to its commendable capacity to represent potential material source and energy, rendering it suitable for predicting diverse debris flow types, and its data accessibility.

3.2.6. Validation Techniques and Evaluation Metrics

In the prediction of debris flow occurrence based on ML, it is necessary to select validation techniques and evaluation metrics to evaluate the performance of the model. Among the 84 papers reviewed, two primary validation techniques were utilized: hold-out (61) and cross-validation (23). Notably, hold-out emerged as the most prevalent validation technique in predicting debris flow occurrence based on ML, as illustrated in Figure 13. A total of 21 evaluation metrics were grouped into model fitting metrics (root mean square error (RMSE), mean absolute percentage error (MAPE), R-squared (R²), etc.) and prediction performance metrics (AUROC, overall accuracy (ACC), Kappa coefficient(kappa), etc.). Among the 21 evaluation metrics, 12 evaluation metrics were reported in 2 or more papers, with 3 evaluation metrics reported in over 20 papers, in the following order: AUROC (57), ACC (49), and sensitivity (21), as shown in Figure 12. The prominence of AUROC can be attributed to the fact that ML-based prediction modeling of debris flow occurrence essentially involves classification models, for which AUROC serves as a robust performance measure.

3.2.7. Prediction Performance

According to Section 3.2.1, the baseline predictors of debris flow occurrence in more than 20 papers were as follows: LR, SVM, BPNN, and RF. Additionally, as indicated in Section 3.2.6, the most utilized evaluation metric was AUROC. Given the sample size, in this study, the prediction performances of the main baseline predictors (LR, SVM, BPNN, and RF) were analyzed based on AUROC.

Figure 14 shows the number of sample data of the four baseline predictors, with the following distribution: LR (38), RF (22), SVM (21), and BPNN (13). The sample data for the four baseline predictors were extracted from their cases using AUROC as the evaluation metric in the 84 studies. The average AUROC for the four baseline predictors was greater than 81%, with the following specific values: RF (0.870), LR (0.859), BPNN (0.845), and SVM (0.816). These results indicated that the four ML models as baseline predictors exhibited good performance in the prediction of debris flow occurrence.

Among the 84 studies, different studies utilized different ML models, input variables, and data sets. To compare the prediction performances of the four models, pairs of baseline predictors were selected, and the results are illustrated in Figure 14. The number of sample data of each pairwise comparison was as follows: LR and RF (6), LR and BPNN (3), LR and SVM (5), SVM and RF (10), BPNN and RF (6), BPNN and SVM (10). The sample data of six pairs of baseline predictors were extracted from their cases using AUROC as an evaluation metric in the 84 studies. In Figure 15a, two points are above the 1:1 line, and the remaining points are below. In Figure 15b, the three points are all below, but very close to the 1:1 line. In Figure 15c, only one point is above the 1:1 line, and the remaining points are below. Therefore, on the whole, the prediction performance of LR was worse than that of RF, BPNN, and SVM in the prediction of debris flow occurrence. In Figure 15d, five points are above the 1:1 line, four points are below, and one point is on the line, indicating that the prediction performance of SVM as a baseline predictor was comparable to that of RF in the prediction of debris flow occurrence. In Figure 15e, only one point is above the 1:1 line, and the remaining points are below, indicating that the prediction performance of BPNN as the baseline predictor was worse than RF in the prediction of debris flow occurrence. In Figure 15f, three points are above the 1:1 line, six points are below, and one point is on the line, indicating that the prediction performance of SVM as a baseline predictor was better than that of BPNN in the prediction of debris flow occurrence, but there was no absolute advantage.

3.2.8. Application Processes

Figure 16 summarizes the processes involved in the prediction of debris flow occurrence based on ML, as extracted from the 84 papers. The processes consisted of three main steps: data preparation, model construction and evaluation, and prediction outcomes. In the data preparation process, first, the evaluation unit for the study area is chosen, such as a watershed, catchment, or grid cell. Then, debris flow sample data are collected as output from one or more sample data sources, including searching by data materials, field survey, and remote sensing interpretation. Next, candidate variables containing geo-environmental information are extracted based on the relevant literature and expert experience, considering the availability and reliability of data (remote sensing data, digital elevation models, thematic maps, etc.). Finally, the raw dataset of prediction of debris flow occurrence is constructed. In the model construction and evaluation process, first, the raw dataset is split into training and testing datasets (few studies split the raw dataset sets into training, validation, and testing datasets), with some studies utilizing only cross-validation to evaluate model performance. Then, suitable ML models (BPNN, SVM, CNN, etc.) are selected as baseline models based on different research purposes and the characteristics of the ML models. Next, certain improvement strategies are implemented, such as feature engineering, hyperparameter tuning, and model coupling to improve the baseline model’s prediction performance, or some studies directly utilized the original ML models. Finally, suitable evaluation metrics (AUROC, ACC, RMSE, etc.) are selected to evaluate the performance of the model, and the optimal model is selected according to the evaluation results. In the prediction outcome process, the statement of theory assumes that “past events have a great influence on the future” [83] is necessary. The prediction of debris flow occurrence in future situations based on the optimal model is performed, in other words, determining whether debris flow will occur or the possibility of debris flow occurrence (susceptibility assessment or hazard assessment).

4. Discussion

4.1. Challenges and Future Trends

Without a doubt, ML is a promising approach for the prediction of debris flow occurrence, as evidenced by the analysis of 84 papers. However, because of the limited skills of debris flow disaster researchers in ML and the lack of modeling data, there are still shortcomings and challenges in the current research. First, the utilization of new ML techniques in the prediction of debris flow occurrence has an obvious lag. For instance, according to Section 3, reinforcement learning and transfer learning were rarely used, and compared with shallow ML, deep learning was also less commonly utilized. Second, the application of interpretation methods in the prediction of debris flow occurrence based on ML lacked breadth and depth. For example, as indicated in Section 3.2.3, most of the selected papers did not interpret the ML results, and the few that did employ limited categories of interpretation methods to analyze feature importance did not use model visualization or provide post-hoc explanations. To address these problems, we propose the following two general recommendations for future research to seek suitable solutions. The details of the recommendations are as follows:

ML is evolving rapidly, and the utilization of new ML techniques may revitalize the research of prediction of debris flow occurrence. On the one hand, educating geoscientists on the advantages of utilizing new techniques, such as deep learning, reinforcement learning, and transfer learning, to predict debris flow occurrence. On the other hand, the integration of domain knowledge of debris flow occurrence with the new techniques should be further explored.
Comparing various features of explainable frameworks, such as SHAP and local interpretable model-agnostic explanations (LIME) [84], and selecting suitable interpretation methods could improve the transparency and credibility of ML in the prediction of debris flow occurrence. Model visualization and post-hoc explanations should be given more attention to provide insights into the utilization of ML as a predictor of debris flow occurrence. Furthermore, through mechanism-learning coupling methods, such as mechanism cascaded learning, learning-embedded mechanisms, and mechanism-integrated learning, mechanism models and ML models can be combined to improve the physical interpretability for prediction outcomes of debris flow occurrence [85].

4.2. Uncertainties and Limitations

The potential uncertainties in the results and limitations of this meta-analysis are outlined below.

Collection of papers: While a considerable effort was invested in defining the search criteria for ML and debris flow occurrence, we may omit certain papers. Additionally, the scope of this study was confined to papers published in English-language journals. It is worth acknowledging that numerous studies, particularly in regions susceptible to debris flow that are non-English speaking, may have been published in other languages such as Chinese, Japanese, and Portuguese. This language restriction could potentially exclude relevant contributions in languages other than English.
Prediction performance of ML: Given the sample size, the quantitative analysis of prediction performance was limited to the four most frequently reported ML models (LR, SVM, BPNN, and RF), neglecting potential insights from less-reported models. In addition, variations in the evaluation units, study areas, and types of debris flow were not accounted for, potentially influencing the results of the quantitative analyses.

5. Conclusions

In this study, a meta-analysis of the research on the prediction of debris flow occurrence based on ML was conducted by reviewing the relevant papers from the Scopus and Web of Science databases. A summary of this study’s content and crucial findings are presented below.

A total of 84 papers were published from 2006 to 2023, with an overall rising trend, particularly in recent years (2018–2023), suggesting an increasing interest in predicting debris flow occurrence based on ML. Debris flow disasters occur throughout the world, and many countries have carried out research on the prediction of debris flow occurrence based on ML; China has made significant contributions, but more research efforts in African countries should be considered.
A total of 36 categories of ML models were utilized as baseline predictors for debris flow occurrence. Notably, extreme gradient boosting, gradient tree boosting, convolutional neural network, and multilayer perceptron had strong popularity in predictive modeling of debris flow occurrence. Additionally, LR and RF emerged as the most popular ML models in predicting debris flow occurrence.
In the prediction of debris flow occurrence based on ML, a variety of prediction performance improvement strategies, including feature engineering, model comparison, hyperparameter tuning, model coupling, and structure optimization, were widely utilized. Among these strategies, feature engineering and model comparison emerged as the most common strategies; the most common approach for model improvement in predicting debris flow occurrence based on ML involved the utilization of one or two of these strategies, while fewer studies utilized three or four of these strategies.
In the prediction of debris flow occurrence based on ML, few papers provided interpretation methods of ML; searching by data materials emerged as the most crucial debris flow sample data source. There was a difference in the ranking of the number of studies using each candidate variable category between the studies utilizing point evaluation units and those using surface evaluation units, but the number of topographic factors was the highest. Two validation techniques, hold-out and cross-validation, were utilized. AUROC was the most frequently reported evaluation metric, followed by ACC, sensitivity, and specificity.
The four ML models (RF, LR, BPNN, and SVM) used as baseline predictors exhibited good prediction performance in the prediction of debris flow occurrence. LR’s prediction performance for debris flow occurrence was inferior to RF, BPNN, and SVM; SVM was comparable to RF, and all were superior to BPNN.
The process of predicting debris flow occurrence based on ML consisted of three main steps: data preparation, model construction and evaluation, and prediction outcomes.
Future work on the prediction of debris flow occurrence based on ML can focus on two aspects: utilizing new ML techniques, and enhancing the interpretability of the ML models.

Author Contributions

Conceptualization, L.Y. and Y.G.; Supervision, Y.G.; Formal Analysis, L.Y. and Y.G.; Data Collection, L.Y.; Visualization, L.Y., B.C. and Y.W.; Writing—Original Draft Preparation L.Y., B.C. and R.F.; Writing—Review and Editing, L.Y., Y.G., B.C., Y.W. and R.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Second Tibetan Plateau Scientific Expedition and Research Program (STEP) (Grant No. 2019QZKK0902).

Data Availability Statement

The data presented in this study can be made available upon request from the authors. The data are not publicly available due to privacy restrictions.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Iverson, R.M. Debris flows: Behaviour and hazard assessment. Geol. Today 2014, 30, 15–20. [Google Scholar] [CrossRef]
Hungr, O.; Evans, S.G.; Bovis, M.J.; Hutchinson, J.N. A review of the classification of landslides of the flow type. Environ. Eng. Geosci. 2001, 7, 221–238. [Google Scholar] [CrossRef]
Xu, L.; Nie, L.; Yu, Y. Study on Formation Mechanism of Gunmaling Debris Flow in China. Disaster Adv. 2012, 5, 1059–1062. [Google Scholar]
Jiang, X.; Cui, P.; Chen, H.; Guo, Y. Formation conditions of outburst debris flow triggered by overtopped natural dam failure. Landslides 2017, 14, 821–831. [Google Scholar] [CrossRef]
Liang, X.; Segoni, S.; Yin, K.; Du, J.; Chai, B.; Tofani, V.; Casagli, N. Characteristics of landslides and debris flows triggered by extreme rainfall in Daoshi Town during the 2019 Typhoon Lekima, Zhejiang Province, China. Landslides 2022, 19, 1735–1749. [Google Scholar] [CrossRef]
Wieczorek, G.F.; Naeser, N.D. Debris-flow hazards mitigation: Mechanics, prediction and assessment. In Proceedings of the Second International Conference on Debris-Flow Hazards Mitigation, Taipei, Taiwan, 16–18 August 2000. [Google Scholar]
De Haas, T.; Braat, L.; Leuven, J.R.; Lokhorst, I.R.; Kleinhans, M.G. Effects of debris flow composition on runout, depositional mechanisms, and deposit morphology in laboratory experiments. J. Geophys. Res. Earth Surf. 2015, 120, 1949–1972. [Google Scholar] [CrossRef]
Jakob, M.; Hungr, O.; Jakob, M. Debris-flow hazard analysis. In Debris-Flow Hazards and Related Phenomena; Springer: Berlin/Heidelberg, Germany, 2005; pp. 411–443. [Google Scholar] [CrossRef]
Jakob, M. Debris-flow hazard assessments: A practitioner’s view. Environ. Eng. Geosci. 2021, 27, 153–166. [Google Scholar] [CrossRef]
Xiong, M.; Meng, X.; Wang, S.; Guo, P.; Li, Y.; Chen, G.; Qing, F.; Cui, Z.; Zhao, Y. Effectiveness of debris flow mitigation strategies in mountainous regions. Prog. Phys. Geogr. 2016, 40, 768–793. [Google Scholar] [CrossRef]
Vagnon, F. Design of active debris flow mitigation measures: A comprehensive analysis of existing impact models. Landslides 2020, 17, 313–333. [Google Scholar] [CrossRef]
McCoy, K.; Krasko, V.; Santi, P.; Kaffine, D.; Rebennack, S. Minimizing economic impacts from post-fire debris flows in the western United States. Nat. Hazards 2016, 83, 149–176. [Google Scholar] [CrossRef]
Ma, C.; Hu, K.; Tian, M. Comparison of debris-flow volume and activity under different formation conditions. Nat. Hazards 2013, 67, 261–273. [Google Scholar] [CrossRef]
Liu, W.; He, S. Comprehensive modelling of runoff-generated debris flow from formation to propagation in a catchment. Landslides 2020, 17, 1529–1544. [Google Scholar] [CrossRef]
Yang, W.; Wu, S.; Zhang, Y.; Shi, J.; Xiang, L. Research on formation mechanism of the debris flow on slope induced by rainfall. Earth Sci. Front. 2007, 14, 197–204. [Google Scholar] [CrossRef]
Zhang, J.; Liu, J.; Li, Y.; Wang, J.; Chen, L.; Gao, B. Effects of Glacier and Geomorphology on the Mechanism Difference of Glacier-Related Debris Flow on the South and North Banks of Parlung Zangbo River, Southeastern Tibetan Plateau. Adv. Civ. Eng. 2022, 2022, 3510944. [Google Scholar] [CrossRef]
Yu, B.; Wang, T.; Zhu, Y.; Zhu, Y. Topographical and rainfall factors determining the formation of gully-type debris flows caused by shallow landslides in the Dayi area, Guizhou Province, China. Environ. Earth Sci. 2016, 75, 551. [Google Scholar] [CrossRef]
Yu, B.; Li, L.; Wu, Y.; Chu, S. A formation model for debris flows in the Chenyulan River Watershed, Taiwan. Nat. Hazards 2013, 68, 745–762. [Google Scholar] [CrossRef]
Tie, Y.B.; Xu, R.G.; Ba, R.J. The formation of runoff-generated debris flow in Southwestern of China: Take Gangou as an example. Environ. Earth Sci. 2014, 72, 1479–1490. [Google Scholar] [CrossRef]
Tang, H.; McGuire, L.A.; Rengers, F.K.; Kean, J.W.; Staley, D.M.; Smith, J.B. Developing and Testing Physically Based Triggering Thresholds for Runoff-Generated Debris Flows. Geophys. Res. Lett. 2019, 46, 8830–8839. [Google Scholar] [CrossRef]
Ponziani, M.; Ponziani, D.; Giorgi, A.; Stevenin, H.; Ratto, S.M. The use of machine learning techniques for a predictive model of debris flows triggered by short intense rainfall. Nat. Hazards 2023, 117, 143–162. [Google Scholar] [CrossRef]
Zhao, Y.; Meng, X.; Qi, T.; Li, Y.; Chen, G.; Yue, D.; Qing, F. AI-based rainfall prediction model for debris flows. Eng. Geol. 2022, 296, 106456. [Google Scholar] [CrossRef]
Xiong, K.; Adhikari, B.R.; Stamatopoulos, C.A.; Zhan, Y.; Wu, S.; Dong, Z.; Di, B. Comparison of Different Machine Learning Methods for Debris Flow Susceptibility Mapping: A Case Study in the Sichuan Province, China. Remote Sens. 2020, 12, 295. [Google Scholar] [CrossRef]
Jiang, H.; Zou, Q.; Zhou, B.; Hu, Z.; Li, C.; Yao, S.; Yao, H. Susceptibility Assessment of Debris Flows Coupled with Ecohydrological Activation in the Eastern Qinghai-Tibet Plateau. Remote Sens. 2022, 14, 1444. [Google Scholar] [CrossRef]
Giuseppe, C.; Marco, M.; Alessandro, C. Combining spatial modelling and regionalization of rainfall thresholds for debris flows hazard mapping in the Emilia-Romagna Apennines (Italy). Landslides 2021, 18, 3513–3529. [Google Scholar] [CrossRef]
Liu, K.-F.; Huang, M.C. Numerical simulation of debris flow with application on hazard area mapping. Comput. Geosci. 2006, 10, 221–240. [Google Scholar] [CrossRef]
Si, A.; Zhang, J.; Zhang, Y.; Kazuva, E.; Dong, Z.; Bao, Y.; Rong, G. Debris Flow Susceptibility Assessment Using the Integrated Random Forest Based Steady-State Infinite Slope Method: A Case Study in Changbai Mountain, China. Water 2020, 12, 2057. [Google Scholar] [CrossRef]
Cama, M.; Conoscenti, C.; Lombardo, L.; Rotigliano, E. Exploring relationships between grid cell size and accuracy for debris-flow susceptibility models: A test in the Giampilieri catchment (Sicily, Italy). Environ. Earth Sci. 2016, 75, 238. [Google Scholar] [CrossRef]
Lin, J.W. Neural network model and geographic grouping for risk assessment of debris flow. Int. J. Phys. Sci. 2011, 6, 1374–1378. [Google Scholar]
Liang, W.-J.; Zhuang, D.-f.; Jiang, D.; Pan, J.-J.; Ren, H.-Y. Assessment of debris flow hazards using a Bayesian Network. Geomorphology 2012, 171, 94–100. [Google Scholar] [CrossRef]
Shen, S.; Liao, W.; Nie, L.; Xu, Y.; Zhang, M. Debris flow hazard assessment at Dongmatun Village in Laomao mountainous area of Dalian, Northeast China. Arab. J. Geosci. 2018, 11, 648. [Google Scholar] [CrossRef]
Li, D.; Zhang, H.; Li, Y.; Zhen, Z.; Bu, S.; Tang, X.; Chen, S.; Luo, S.; Tian, S.; Xiong, M. Hazard assessment of debris flow in Guangxi, China based on hydrodynamics mechanism. Environ. Earth Sci. 2019, 78, 50. [Google Scholar] [CrossRef]
Hirschberg, J.; Badoux, A.; McArdell, B.W.; Leonarduzzi, E.; Molnar, P. Evaluating methods for debris-flow prediction based on rainfall in an Alpine catchment. Nat. Hazards Earth Syst. Sci. 2021, 21, 2773–2789. [Google Scholar] [CrossRef]
Long, K.; Zhang, S.; Wei, F.; Hu, K.; Zhang, Q.; Luo, Y. A hydrology-process based method for correlating debris flow density to rainfall parameters and its application on debris flow prediction. J. Hydrol. 2020, 589, 125124. [Google Scholar] [CrossRef]
Zhao, Y.; Meng, X.; Qi, T.; Chen, G.; Li, Y.; Yue, D.; Qing, F. Estimating the daily rainfall thresholds of regional debris flows in the Bailong River Basin, China. Bull. Eng. Geol. Environ. 2023, 82, 46. [Google Scholar] [CrossRef]
Lin, R.; Mei, G.; Liu, Z.; Xi, N.; Zhang, X. Susceptibility Analysis of Glacier Debris Flow by Investigating the Changes in Glaciers Based on Remote Sensing: A Case Study. Sustainability 2021, 13, 7196. [Google Scholar] [CrossRef]
Sun, Y.; Ge, Y.; Chen, X.; Zeng, L.; Liang, X. Risk assessment of debris flow along the northern line of the Sichuan-Tibet highway. Geomat. Nat. Hazards Risk 2023, 14, 2195531. [Google Scholar] [CrossRef]
Chen, X.; Chen, H.; You, Y.; Chen, X.; Liu, J. Weights-of-evidence method based on GIS for assessing susceptibility to debris flows in Kangding County, Sichuan Province, China. Environ. Earth Sci. 2016, 75, 70. [Google Scholar] [CrossRef]
Li, Y.; Chen, J.; Tan, C.; Li, Y.; Gu, F.; Zhang, Y.; Mehmood, Q. Application of the borderline-SMOTE method in susceptibility assessments of debris flows in Pinggu District, Beijing, China. Nat. Hazards 2021, 105, 2499–2522. [Google Scholar] [CrossRef]
Dash, R.K.; Falae, P.O.; Kanungo, D.P. Debris flow susceptibility zonation using statistical models in parts of Northwest Indian Himalayas-implementation, validation, and comparative evaluation. Nat. Hazards 2022, 111, 2011–2058. [Google Scholar] [CrossRef]
Esper Angillieri, M.Y. Debris flow susceptibility mapping in a portion of the Andes and Preandes of San Juan, Argentina using frequency ratio and logistic regression models. Earth Sci. Res. J. 2013, 17, 159–167. [Google Scholar]
Quan Luna, B.; Blahut, J.; Van Westen, C.; Sterlacchini, S.; van Asch, T.W.; Akbas, S. The application of numerical debris flow modelling for the generation of physical vulnerability curves. Nat. Hazards Earth Syst. Sci. 2011, 11, 2047–2060. [Google Scholar] [CrossRef]
Wang, F.; Wang, J.; Chen, X.; Zhang, S.; Qiu, H.; Lou, C. Numerical Simulation of Boulder Fluid-Solid Coupling in Debris Flow: A Case Study in Zhouqu County, Gansu Province, China. Water 2022, 14, 3884. [Google Scholar] [CrossRef]
Huang, X.; Zhang, Z.; Xiang, G. Sensitivity analysis of a built environment exposed to the synthetic monophasic viscous debris flow impacts with 3-D numerical simulations. Nat. Hazards Earth Syst. Sci. 2023, 23, 871–889. [Google Scholar] [CrossRef]
Wu, Y.-H.; Liu, K.-F.; Chen, Y.-C. Comparison between FLO-2D and Debris-2D on the application of assessment of granular debris flow hazards with case study. J. Mt. Sci. 2013, 10, 293–304. [Google Scholar] [CrossRef]
Cabral, V.; Reis, F.; Veloso, V.; Ogura, A.; Zarfl, C. A multi-step hazard assessment for debris-flow prone areas influenced by hydroclimatic events. Eng. Geol. 2023, 313, 106961. [Google Scholar] [CrossRef]
Gu, F.; Chen, J.; Sun, X.; Li, Y.; Zhang, Y.; Wang, Q. Comparison of Machine Learning and Traditional Statistical Methods in Debris Flow Susceptibility Assessment: A Case Study of Changping District, Beijing. Water 2023, 15, 705. [Google Scholar] [CrossRef]
Nikolopoulos, E.I.; Destro, E.; Bhuiyan, M.A.E.; Borga, M.; Anagnostou, E.N. Evaluation of predictive models for post-fire debris flow occurrence in the western United States. Nat. Hazards Earth Syst. Sci. 2018, 18, 2331–2343. [Google Scholar] [CrossRef]
Zhang, Y.; Ge, T.; Tian, W.; Liou, Y.A. Debris flow susceptibility mapping using machine-learning techniques in Shigatse area, China. Remote Sens. 2019, 11, 2801. [Google Scholar] [CrossRef]
Xu, F.; Wang, B. Debris flow susceptibility mapping in mountainous area based on multi-source data fusion and CNN model—Taking Nujiang Prefecture, China as an example. Int. J. Digit. Earth 2022, 15, 1967–1989. [Google Scholar] [CrossRef]
Reichstein, M.; Camps-Valls, G.; Stevens, B.; Jung, M.; Denzler, J.; Carvalhais, N.; Prabhat, F. Deep learning and process understanding for data-driven Earth system science. Nature 2019, 566, 195–204. [Google Scholar] [CrossRef]
Domingos, P. A Few Useful Things to Know about Machine Learning. Commun. ACM 2012, 55, 78–87. [Google Scholar] [CrossRef]
Li, X.; Kong, J. Application of Support Vector Machine with Posterior Probability Estimates in Debris Flow Hazard Assessment. Disaster Adv. 2011, 4, 38–44. [Google Scholar]
Chang, T.-C.; Chao, R.-J. Application of back-propagation networks in debris flow prediction. Eng. Geol. 2006, 85, 270–280. [Google Scholar] [CrossRef]
Moher, D.; Liberati, A.; Tetzlaff, J.; Altman, D.G.; Grp, P. Preferred Reporting Items for Systematic Reviews and Meta-Analyses: The PRISMA Statement. PLoS Med. 2009, 6, 336–341. [Google Scholar] [CrossRef] [PubMed]
Kurilla, L.J.; Fubelli, G. Global debris flow susceptibility based on a comparative analysis of a single global model versus a continent-by-continent approach. Nat. Hazards 2022, 113, 527–546. [Google Scholar] [CrossRef]
Kurilla, L.J.; Fubelli, G. Impact and a Novel Representation of Spatial Data Uncertainty in Debris Flow Susceptibility Analysis. Appl. Sci. 2022, 12, 6697. [Google Scholar] [CrossRef]
Bertrand, M.; Liebault, F.; Piegay, H. Debris-flow susceptibility of upland catchments. Nat. Hazards 2013, 67, 497–511. [Google Scholar] [CrossRef]
Märker, M.; Hochschild, V.; Maca, V.; Vilímek, V. Stochastic assessment of landslides and debris flows in the Jemma basin, Blue Nile, Central Ethiopia. Geogr. Fis. Din. Quat. 2016, 39, 51–58. [Google Scholar] [CrossRef]
LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef]
Soori, M.; Arezoo, B.; Dastres, R. Artificial intelligence, machine learning and deep learning in advanced robotics, A review. Cogn. Robot. 2023, 3, 54–70. [Google Scholar] [CrossRef]
Sharifani, K.; Amini, M. Machine Learning and Deep Learning: A Review of Methods and Applications. World Inf. Technol. Eng. J. 2023, 10, 3897–3904. [Google Scholar]
Fan, J.C.; Huang, H.Y.; Liu, C.H.; Yang, C.H.; Guo, J.J.; Chang, C.F.; Chang, Y.C. Effects of landslide and other physiographic factors on the occurrence probability of debris flows in central Taiwan. Environ. Earth Sci. 2015, 74, 1785–1801. [Google Scholar] [CrossRef]
Heckmann, T.; Gegg, K.; Gegg, A.; Becht, M. Sample size matters: Investigating the effect of sample size on a logistic regression susceptibility model for debris flows. Nat. Hazards Earth Syst. Sci. 2014, 14, 259–278. [Google Scholar] [CrossRef]
Sun, J.; Qin, S.; Qiao, S.; Chen, Y.; Su, G.; Cheng, Q.; Zhang, Y.; Guo, X. Exploring the impact of introducing a physical model into statistical methods on the evaluation of regional scale debris flow susceptibility. Nat. Hazards 2021, 106, 881–912. [Google Scholar] [CrossRef]
Di, B.; Zhang, H.; Liu, Y.; Li, J.; Chen, N.; Stamatopoulos, C.A.; Luo, Y.; Zhan, Y. Assessing Susceptibility of Debris Flow in Southwest China Using Gradient Boosting Machine. Sci. Rep. 2019, 9, 12532. [Google Scholar] [CrossRef] [PubMed]
Chang, F.J.; Tseng, K.J.; Chaves, P. Shared near neighbours neural network model: A debris flow warning system. Hydrol. Process. 2007, 21, 1968–1976. [Google Scholar] [CrossRef]
Tang, W.; Ding, H.-T.; Chen, N.-S.; Ma, S.-C.; Liu, L.-H.; Wu, K.-L.; Tian, S.-F. Artificial Neural Network-based prediction of glacial debris flows in the ParlungZangbo Basin, southeastern Tibetan Plateau, China. J. Mt. Sci. 2021, 18, 51–67. [Google Scholar] [CrossRef]
Gao, R.; Wang, C.; Han, S.; Liu, H.; Liu, X.; Wu, D. A Research on Cross-Regional Debris Flow Susceptibility Mapping Based on Transfer Learning. Remote Sens. 2022, 14, 4829. [Google Scholar] [CrossRef]
Qian, X.; Chen, J.P.; Xiang, L.J.; Zhang, W.; Niu, C.C. A novel hybrid KPCA and SVM with PSO model for identifying debris flow hazard degree: A case study in Southwest China. Environ. Earth Sci. 2016, 75, 991. [Google Scholar] [CrossRef]
Shen, C.-W.; Lo, W.-C.; Chen, C.-Y. Evaluating Susceptibility of Debris Flow Hazard using Multivariate Statistical Analysis in Hualien County. Disaster Adv. 2012, 5, 743–755. [Google Scholar]
Barredo Arrieta, A.; Diaz-Rodriguez, N.; Del Ser, J.; Bennetot, A.; Tabik, S.; Barbado, A.; Garcia, S.; Gil-Lopez, S.; Molina, D.; Benjamins, R.; et al. Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI. Inf. Fusion 2020, 58, 82–115. [Google Scholar] [CrossRef]
Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
Ankenbrand, M.J.; Shainberg, L.; Hock, M.; Lohr, D.; Schreiber, L.M. Sensitivity analysis for interpretation of machine learning based segmentation models in cardiac MRI. BMC Med. Imaging 2021, 21, 27. [Google Scholar] [CrossRef] [PubMed]
Greenwell, B.M. pdp: An R package for constructing partial dependence plots. R J. 2017, 9, 421. [Google Scholar] [CrossRef]
Lundberg, S.M.; Lee, S.-I. A Unified Approach to Interpreting Model Predictions. In Proceedings of the 31st Annual Conference on Neural Information Processing Systems (NIPS), Long Beach, CA, USA, 4–9 December 2017. [Google Scholar]
Carrara, A.; Crosta, G.; Frattini, P. Comparing models of debris-flow susceptibility in the alpine environment. Geomorphology 2008, 94, 353–378. [Google Scholar] [CrossRef]
Yuan, L.; Zhang, Y. Debris flow hazard assessment based on support vector machine. Wuhan Univ. J. Nat. Sci. 2006, 11, 897–900. [Google Scholar] [CrossRef]
Huang, H.; Wang, Y.; Li, Y.; Zhou, Y.; Zeng, Z. Debris-Flow Susceptibility Assessment in China: A Comparison between Traditional Statistical and Machine Learning Methods. Remote Sens. 2022, 14, 4475. [Google Scholar] [CrossRef]
Jin, T.; Hu, X.; Liu, B.; Xi, C.; He, K.; Cao, X.; Luo, G.; Han, M.; Ma, G.; Yang, Y.; et al. Susceptibility Prediction of Post-Fire Debris Flows in Xichang, China, Using a Logistic Regression Model from a Spatiotemporal Perspective. Remote Sens. 2022, 14, 1306. [Google Scholar] [CrossRef]
Moss, R.E.S.; Lyman, N. Incorporating shear stiffness into post-fire debris flow statistical triggering models. Nat. Hazards 2022, 113, 913–932. [Google Scholar] [CrossRef]
Zhou, Y.; Yue, D.; Liang, G.; Li, S.; Zhao, Y.; Chao, Z.; Meng, X. Risk Assessment of Debris Flow in a Mountain-Basin Area, Western China. Remote Sens. 2022, 14, 2942. [Google Scholar] [CrossRef]
Varnes, D.J. Landslide Hazard Zonation: A Review of Principles and Practice; United Nations: Paris, France, 1984.
Ribeiro, M.T.; Singh, S.; Guestrin, C.; Association for Computing Machinery (ACM). “Why Should I Trust You?” Explaining the Predictions of Any Classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), San Francisco, CA, USA, 13–17 August 2016; pp. 1135–1144. [Google Scholar]
Shen, H.; Zhang, L. Mechanism-learning coupling paradigms for parameter inversion and simulation in earth surface systems. Sci. China-Earth Sci. 2023, 66, 568–582. [Google Scholar] [CrossRef]

Figure 1. Article search query design. The logical relationship of the words in the square brackets is OR.

Figure 2. PRISMA flowchart demonstrating the selection of papers.

Figure 3. Annual and cumulative number of papers on the prediction of debris flow occurrence based on ML.

Figure 4. Distribution of study areas and first research institutions, according to the countries reported in the papers. The number before the slash reflects the number of papers based on the study areas; the number after slash reflects the number of papers based on first research institutions.

Figure 5. Broad categories of ML models, categories of ML models, the initial year of utilizing each category of ML model as baseline predictor, number of papers of ML for each category of ML model, and relative annual number of papers reported for each category of ML model. The three numbers in curly braces indicate the initial year, number of papers, and relative annual number of papers, respectively.

Figure 6. Combination of ML prediction performance improvement strategies. Numbers indicate the number of studies that utilized each strategy.

Figure 7. The number of studies using each feature engineering method (feature selection methods were further subdivided).

Figure 8. The number of studies using each type of model comparison (the comparisons between ML models were further subdivided).

Figure 9. The number of studies using each interpretation method for ML.

Figure 10. The number of studies utilizing each category of sample data sources.

Figure 11. The number of studies using each evaluation unit, according to the surface evaluation unit and point evaluation unit reported in the papers.

Figure 12. The number of studies using each candidate variable category, according to the surface evaluation units and point evaluation units reported in the papers.

Figure 13. The number of studies using each validation technique and evaluation metric (only those evaluation metrics reported in 2 or more papers were included).

Figure 14. AUROC of different baseline predictors. The baseline predictors were logistic regression (LR), support vector machine (SVM), back-propagation neural network (BPNN), and random forest (RF).

Figure 15. Comparison of selected pairs of baseline predictors based on AUROC.

Figure 16. Three main steps in predicting debris flow occurrence based on ML.

Table 1. Database attribute fields for meta-analysis.

ID	Field Name	Description	Type
1	Journal	Name of journal	Text
2	Title	Title of paper	Text
3	Year	Year of publication	Numeric
4	Study area	Country of study area of paper	Text
5	Institution	Country of first research institution	Text
6	Type of occurrence	Examples include occurrence or nonoccurrence, susceptibility assessments, hazard assessment of debris flow	Text
7	Evaluation unit	Unit utilized for prediction of debris flow occurrence	Text
8	Baseline model	ML utilized as baseline predictor of debris flow occurrence	Text
9	Improvement strategy	Modeling strategy for improving performance of ML utilized for predictor of debris flow occurrence	Text
10	Sample data	Source of debris flow sample data	Text
11	Candidate variables	Candidate feature utilized for prediction of debris flow occurrence based on ML	Text
12	Validation technique	Method used to divide the training set and test set utilized for prediction of debris flow occurrence based on ML	Text
13	Evaluation metric	Metrics utilized to report performance of the ML model utilized for prediction of debris flow occurrence	Text
14	Area under the curve	Prediction area under the ROC curve of debris flow occurrence based on ML	Numeric
15	Number of cases	Number of combinations of training and test sets utilized for debris flow occurrence based on ML	Numeric

Table 2. Journals that published papers on the prediction of debris flow occurrence based on ML used in meta-analysis (only those journals that published 2 or more papers were included).

Journal	Science Citation Index	Publisher	Impact Factor (2022)	Number of Papers
Natural Hazards	Yes	Springer	3.7	15
Remote Sensing	Yes	MDPI	5.0	8
Engineering Geology	Yes	Elsevier	7.4	5
Water	Yes	MDPI	3.4	5
Environmental Earth Sciences	Yes	Springer	2.8	5
Natural Hazards and Earth System Sciences	Yes	Copernicus Gesellschaft MBH	4.6	4
Geomorphology	Yes	Elsevier	3.9	3
Bulletin of Engineering Geology and the Environment	Yes	Springer	4.2	3
Landslides	Yes	Springer	6.7	2
Hydrological Processes	Yes	Wiley	3.2	2
Journal of Mountain Science	Yes	Science Press	2.5	2
Open Geosciences	Yes	De Gruyter Poland SP Z O O	2.0	2
Disaster Advances	No	Disaster Advances	None	2
Natural Hazards and Earth System Sciences	Yes	Copernicus Gesellschaft MBH	4.6	4

Table 3. Classification criteria for candidate variables.

Category	Description
Topography	Factors related to topography, such as slope, curvature, main channel length, etc.
Morphology	Factors related to the morphology of the surface evaluation unit, such as area, shape coefficient, perimeter, etc.
Geomorphology	Factors related to geomorphic type and evolution, such as landform, hypsometric integra, geomorphic information entropy, etc.
Geology	Factors related to geological structure, geological movement, and geological type, such as active fault density, seismic intensity, lithology, etc.
Meteorology	Factors related to meteorological factors such as rainfall, temperature, snow cover, etc.
Hydrology	Factors related to water flow movement, such as flow accumulation, stream power index, distance to rivers, etc.
Soil	Factors related to soil type, property, and thickness, such as soil texture, soil types, soil depth, etc.
Vegetation	Factors related to vegetation type and state, such as vegetation coverage index, normalized difference vegetation index, forest density, etc.
Fire	Factors related to forest fires, such as fire severity (low, moderate, high), proportion of watersheds burned at high or moderate severity, etc.
Material source	Factors related to loosen accumulation of internal solids, such as collapsed areas, landslide areas, debris reserves, etc.
Human activity	Factors that directly or indirectly characterize human behavior, such as land use, population density, distance to road, etc.
Past debris flow	Factors related to past debris flows in the evaluation unit, such as maximum volume, occurrence frequency, etc.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Yang, L.; Ge, Y.; Chen, B.; Wu, Y.; Fu, R. Machine-Learning-Based Prediction Modeling for Debris Flow Occurrence: A Meta-Analysis. Water 2024, 16, 923. https://doi.org/10.3390/w16070923

AMA Style

Yang L, Ge Y, Chen B, Wu Y, Fu R. Machine-Learning-Based Prediction Modeling for Debris Flow Occurrence: A Meta-Analysis. Water. 2024; 16(7):923. https://doi.org/10.3390/w16070923

Chicago/Turabian Style

Yang, Lianbing, Yonggang Ge, Baili Chen, Yuhong Wu, and Runde Fu. 2024. "Machine-Learning-Based Prediction Modeling for Debris Flow Occurrence: A Meta-Analysis" Water 16, no. 7: 923. https://doi.org/10.3390/w16070923

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Machine-Learning-Based Prediction Modeling for Debris Flow Occurrence: A Meta-Analysis

Abstract

1. Introduction

2. Data Processing Workflow

2.1. Literature Retrieval and Selection Criteria

2.2. Data Extraction

3. Results

3.1. General Characteristics of Studies

3.2. General Characteristics of ML Applications

3.2.1. ML Categories

3.2.2. Prediction Performance Improvement Strategies

3.2.3. Model Interpretation

3.2.4. Sample Sources

3.2.5. Evaluation Units and Candidate Variable Categories

3.2.6. Validation Techniques and Evaluation Metrics

3.2.7. Prediction Performance

3.2.8. Application Processes

4. Discussion

4.1. Challenges and Future Trends

4.2. Uncertainties and Limitations

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI