Next Article in Journal
Bayesian Linear Seismic Inversion Integrating Uncertainty of Noise Level Estimation and Wavelet Extraction
Next Article in Special Issue
Study on Key Parameter Design and Adaptability Technology of the 110 Mining Method for the Yuwang NO.1 Coal Mine in the Diandong Mining Area
Previous Article in Journal
Three-Dimensional Lithospheric Electrical Structure beneath the Handan-Xingtai District, North China: Implications for Tectonic Control of Skarn-Iron Mineralization
Previous Article in Special Issue
Applying Machine Learning and Automatic Speech Recognition for Intelligent Evaluation of Coal Failure Probability under Uniaxial Compression
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Classification of Coal Bursting Liability Based on Support Vector Machine and Imbalanced Sample Set

1
Faculty of Land Resources Engineering, Kunming University of Science and Technology, Kunming 650093, China
2
Key Laboratory of Geohazard Forecast and Geoecological Restoration in Plateau Mountainous Area, Ministry of Natural Resources of the People’s Republic of China, Kunming 650093, China
*
Author to whom correspondence should be addressed.
Minerals 2023, 13(1), 15; https://doi.org/10.3390/min13010015
Submission received: 18 November 2022 / Revised: 14 December 2022 / Accepted: 17 December 2022 / Published: 23 December 2022

Abstract

:
As an inherent property of the accumulation of elastic energy and the sudden instability failure of coal, coal bursting liability (CBL) is the basis of the research on the early warning and prevention of coal burst. To accurately classify the CBL level, the support-vector-machine (SVM) method was introduced in this paper, and the dynamic failure time (DT), elastic energy index (WET), impact energy index (KE) and uniaxial compressive strength (RC) were selected as the classification indexes. An imbalanced sample set, containing 95 groups of measured data of CBL, was established, and eight SVM classification models were constructed, based on different kernel functions and swarm-intelligence-optimization algorithms. Focusing on the problem of sample imbalance, the classification accuracy, A, F1-score and kappa coefficient were used to comprehensively evaluate the classification performance of SVM models, and the grey-wolf-optimizer SVM (GWO-SVM) model was selected as the best model in this paper, reaching the highest accuracy of 98.9%. The GWO-SVM was applied to identify the CBL level of the 4# coal seam in Xiaozhuang Coal Mine and the 1# coal seam in the Wanfeng Coal Mine. The results of the engineering application are consistent with those from the engineering field, and show that the proposed model is scientific and practical, and can be a new method for CBL classification.

1. Introduction

Coal burst is a kind of dynamic disaster in coal mining, and its harm is mainly manifested in roadway destruction, causing casualties and inducing secondary disasters [1,2,3,4,5]. Figure 1 shows the field damage of coal bursts in Wudong Coal Mine, China [6]. The coal bursting liability (CBL) is an internal cause of coal burst [7,8]. Almost all coal seams of coal-burst mines in China have bursting liability [9], so an accurate evaluation of CBL is the basis of coal-burst risk assessment.
Scholars have carried out quantitative research on CBL classification from different angles, and the current approaches [10] can be divided into two types, namely the single-index criterion classification method and the multi-index comprehensive classification method. The former type refers to the use of a single index to determine the CBL level, based on the index classification standard, mainly using the energy index [11,12,13], time index [14,15,16], strain index [17,18], stiffness index [19,20,21], and so on. This method is intuitive, but as an inherent property of coal, CBL is affected by many factors. Considering only a single index will cause some errors. Therefore, researchers generally adopt the multi-index comprehensive classification method to classify the CBL, for example, the Chinese national standard Classification and Laboratory Test Method on Bursting Liability of Coal (GB/T 25217.2-2010) comprehensively adopts the dynamic failure time (DT), elastic energy index (WET), impact energy index (KE), and uniaxial compressive strength (RC), and divides the CBL level into three categories of strong (I), weak (II), and none (III). In the view of the different categories of bursting liability of the four indexes, the standard uses the fuzzy-comprehensive-evaluation method to list the classification results of 73 different combinations, but the remaining eight combinations do not give clear categories.
Looking at this problem, previous researchers have extensively conducted multi-index classification research. Xu et al. [22] established a comprehensive-evaluation model of CBL by introducing the unascertained measure theory. Wang et al. [23] observed a CBL discrimination model based on the theory of information entropy and the method of entropy-weight ideal point. Jia et al. [24] established the mathematical attribute model of CBL classification. Guo [25] constructed a grey prediction model of CBL according to the variable weight thought and grey-correlation analysis. Wang et al. [26] introduced the Mahalanobis distance-discriminant-analysis (DDA) method to establish a DDA model for CBL classification. The above research focuses on the mathematical methods (e.g., the fuzzy-comprehensive-evaluation method, the unascertained-measurement model, attribute mathematics, etc.), taking into account the degree of importance and correlation of evaluation indexes, and achieves some results, but there are also existing problems, such as complex calculation, and a great influence of subjective factors. How to better solve the problem of CBL classification has become a top priority, and will be a feasible answer to developing a scientific, practical, accurate, efficient and intelligent classification method.
As a supervised-machine-learning algorithm for classification problems, the support vector machine (SVM) has been widely used in engineering, and its black box feature can skip the complex mechanisms and conditions of engineering problems. SVM can summarize the commonness of data, and the established classification model has strong applicability. At present, the application of SVM in CBL classification has been rarely reported. In view of this, the SVM method was introduced in this paper to establish eight SVM models, and the comprehensive performance of each model was evaluated using the classification accuracy A, F1-score and kappa coefficient. Finally, the best SVM model was selected and used in the engineering application for CBL classification.

2. Methods

2.1. SVM

SVM is based on the Vapnik–Chervonenkis dimension theory of statistical learning and the principle of structural risk minimization; it has a good generalization ability for unknown samples, which are classified using the optimal hyperplane [27,28]. In this paper, eight SVM models are established in the MATLAB 2018b environment, including Linear SVM (LSVM), Quadratic SVM (QSVM), Cubic SVM (CSVM), Fine Gaussian SVM (FG-SVM), Medium Gaussian SVM (MG-SVM), Coarse Gaussian SVM (CG-SVM), Adaptive Particle Swarm Optimization SVM (APSO-SVM) and Grey Wolf Optimization SVM (GWO-SVM). Among them, APSO-SVM is an adaptive-particle-swarm-optimization algorithm model with compression factor and asynchronous learning factor, and GWO-SVM is a grey-wolf-algorithm-optimization model simulating the group predation of gray wolves.
This paper takes SVM based on the RBF kernel as an example, to introduce its classification principle [29].
Sampling in n-dimensional space, setting dataset E = [(x1, y1), …, (xl, yl)], decision function f(x) = ω·k(x) + b, where ω is the weight vector, b is the threshold, k(x) is a non-linear mapping function, and the optimization of classification plane-constraints should satisfy the Formula (1):
y i ( ω T k ( x i ) + b ) 1
The optimization problem is converted to Formulas (2) and (3) by introducing a non-negative relaxation variable ξ i :
min 1 2 ω 2 + c i = 1 n ξ i , c 0
s . t . y i ( ω T k ( x i ) + b ) 1 ξ i
where c is a penalty factor. When the c value is large, the penalty value for misclassification will increase; otherwise, it will be small. The Lagrange multiplier algorithm is introduced to obtain Formula (4):
L ( ω , b , α i ) = 1 2 ω 2 + i = 1 n α i ( 1 y i ( ω T k ( x i ) + b ) ) , α i = ( α 1 , α 2 , α n )
The partial derivatives of the Lagrange function L with respect to ω and b, respectively, can be obtained by Formulas (5) and (6):
ω = i = 1 m α i y i x i
i = 1 m α i y i = 0
Formula (7) can be obtained by substituting the Lagrange multiplier algorithm:
L ( ω , b , α i ) = i = 1 n α i 1 2 i = 1 n j = 1 n α i α j y i y j x i T x j
That is min ω , b L ( ω , b , α i ) = i = 1 n α i 1 2 i = 1 n j = 1 n α i α j y i y j x i x j . Where ω is the weight vector, and b is the threshold. Thus, the optimization problem is transformed into a dual problem, namely Formulas (8) and (9):
min 1 2 i = 1 n j = 1 n y i y j x i x j K ( x i , x j ) i = 1 n α i
s . t . i = 1 n y i α i = 0 , 0 α i c
where K ( x i , x j ) = ( k ( x i ) k ( x j ) ) , the RBF kernel is Formula (10):
K ( x i , x j ) = exp ( g x i x j ) 2
where, g is the kernel function parameter, so the above optimization problem is converted to Formulas (11) and (12):
min 1 2 i = 1 n j = 1 n y i y j α i α j exp ( g x i x j ) 2 i = 1 n α i
s . t . i = 1 n y i α i = 0 , 0 α i c

2.2. Data Processing

(1)
Sample Dataset
In accordance with the standard GB/T 25217.2-2010 and Reference [26], DT, WET, KE and RC are selected as the classification indexes of CBL, and the bursting-liability grade is divided into three categories of strong (I), weak (II), and none (III). DT is the time from ultimate strength to complete failure of coal samples under the uniaxial-compression-test conditions. WET is the ratio of elastic deformation energy to plastic deformation energy of coal specimens unloaded when the stress reaches a certain value under uniaxial compression. KE is the ratio of deformation energy accumulated before the maximum peak to that dissipated after the maximum peak, in the complete stress–strain curve of a specimen under uniaxial compression. RC is the maximum load on unit area of the specimen under the condition of infinite-side and axial-pressure for failure. A total of 95 groups of CBL samples in Table 1 are chosen as the training samples and the test samples.
The number of samples with strong (I), weak (II) and none (III) bursting liability in Table 1 is 44, 45 and 6, respectively. Figure 2 shows the proportion of the three categories of CBL samples, which clearly exhibits the imbalance of samples.
The imbalanced data will have a direct impact on the quality of modeling. Ensuring the rationality of data is an important condition for modeling, and data analysis of sample sets can determine the rationality of data. Table 2 lists the statistical information of each classification index, including minimum, maximum, median, interquartile range (IQR), lower quartile, upper quartile, mean, standard deviation (SD) and standard error (SE). Figure 3 shows the boxplot of the corresponding sample set.
Index correlation analysis is also an important part of data analysis. The correlation information among the classification indexes selected in this paper is shown in Figure 4, in which the correlation curve is fitted, where r is the correlation coefficient and P is the significance level. As can be seen from the figure, the correlation between any two of the four indexes is weak, which indicates that the four indexes are very suitable for the establishment of the classification model.
(2)
Data Standardization
Table 1 shows that the dimensions and orders of magnitude of the four classification indexes are different. This can also be roughly seen from the boxplot in Figure 3. The value ranges of the vertical axes of the boxplot of different indexes are significantly different, and the difference is even more obvious under different levels of bursting liability. In order to clearly reflect the dimensional differences among indexes, the data of different indexes are plotted into ridge plots, as shown in Figure 5, so that the variation trend of each group of data and the data comparison between groups can be intuitively seen.
Figure 5 shows that the orders of magnitude of the four classification indexes are very different, resulting in a large gap in data distribution. For example, in the sample of strong bursting liability, the data distribution of WET, KE and RC are concentrated in the area with small values, while the data distribution of DT is relatively scattered. If the original index value is directly used in the classification, the function of the index with the higher value will be highlighted, and the function of the index with the lower value will be relatively weakened. Therefore, it is necessary to standardize the original data. In this paper, the mapminmax function is selected for normalization processing, and the principle is shown in Formula (13):
x i j * = x i j min ( x j ) max ( x j ) min ( x j ) ( i = 1 , 2 , , m ; j = 1 , 2 , , n )
where, xij is the original data of jth index of ith sample; xij* is the processed data, whose value is between 0 and 1, and the data distribution is consistent with that processed before; max(xj) and min(xj) are the maximum and minimum value of the original data of index j, respectively.
Table 3 shows the standardized data. Draw the above standardized data into ridge plots, as shown in Figure 6. X1~X4 correspond to the standardized data labels of the four indexes respectively. It can be seen that the standardized data reduces the difference of orders of magnitude between indexes, due to dimensional differences, which is most obvious in the samples of weak bursting-liability.
(3)
Data Segmentation
Currently, there are two ways to divide samples in classification research: one is where there is no need to divide them into the training set and the test set, with small samples, and the other is to divide them according to a fixed proportion. The existing research on CBL classification mainly focuses on small-scale data sets, which is prone to the problem of unreasonable data division, leading to the overfitting of the model on the training set. To solve this problem, this paper adopts a 5-fold cross-validation method to divide the training set and the test set, which can avoid the limitations and particularity of the fixed-division dataset, and this advantage is more obvious in the small-scale dataset. The 5-fold cross-validation divides the data set into five parts in equal proportion, four of which are used as training data, and the remaining one is used as test data. After five operations, each data was tested once, and the average of the results of the five models was used as the final result.

2.3. Parameters Optimization

The penalty factor, c, and the kernel-function parameter, g, are essential parameters for SVM classification, and determine the classification effect of SVM. In this paper, three optimization methods were used to obtain these two parameters, among which the LSVM, QSVM, CSVM, FG-SVM, MG-SVM and CG-SVM models were optimized by a grid-search algorithm, the APSO-SVM model was optimized by the APSO algorithm, and the GWO-SVM model was optimized by the GWO algorithm. The optimization process of the three methods is shown in Figure 7.

2.4. Research Route

In this paper, an imbalanced sample set of CBL, containing 95 groups of measured data, is constructed. The sample set is divided into training set and test set, in accordance with the 5-fold cross-validation. Eight SVM models are used for classification research, and the best SVM model is selected for the engineering application. The research flowchart is shown in Figure 8.

3. Results and Discussion

3.1. Classified Results

To better evaluate the performance of the eight classification models, it is necessary to consider not only the overall classification effect of the models, but also the classification effect of the models on different levels of bursting liability. Sorting the classification results of the models built in this paper into a confusion matrix graph, the classification of samples of each category by different models can be obtained, as shown in Figure 9.

3.2. Performance Evaluation

3.2.1. Evaluation Index

For the imbalanced sample dataset established in this paper, the classification accuracy A, F1-score and kappa coefficient are comprehensively selected, to evaluate the performance of the classification models. The three indexes are all positive indexes, that is, the larger the value, the better the performance of the model.
(1)
Classification Accuracy A
The classification accuracy A is the ratio of the number of accurately classified samples (Naccurate) to the total number of samples (Ntotal) in the modeling process. It represents the overall classification accuracy of the model and is the most commonly used evaluation index in classification problems. Its calculation formula is given as follows:
A = N a c c u r a t e N t o t a l
(2)
F1-Score
As an index used to measure the accuracy of classification models in machine learning, the F1-score is often used to evaluate models based on imbalanced dataset. This index weighs the accuracy rate and recall rate of models.
(3)
Kappa Coefficient
The kappa coefficient is an index adopted to measure whether the classification result of the model is consistent with the actual classification result, and it has certain penalty attribute. When the classification result of the model built based on the imbalanced dataset favors the category with a large number and ignores the category with a small number, the kappa coefficient value of the model is low. This biased evaluation mechanism ensures the rationality of the model performance evaluation.

3.2.2. Evaluation Result

In this paper, eight SVM models are adopted to classify the CBL grade, and three indexes are used to evaluate the classification performance of the models. The results are shown in Table 4 and Figure 10.

3.3. Model Analysis and Optimization

(1)
Model Analysis
It can be seen from Table 4 and Figure 10 that the SVM models established in this paper have excellent classification performance, and are suitable for CBL classification under an imbalanced dataset. The following analysis is of the impact of data quality, model selection and data-processing methods on the SVM model performance.
In machine-learning research, the quality of data used for modeling often has a great effect on the classification results. The indexes used in modeling are an essential part of the dataset, and the scientific and reasonable selection of the indexes is the key to subsequent research. The CBL is affected by water, gas, temperature, size effect, loading rate, pore structure of coal and many other factors, which are finally reflected in the forms of strength and energy. The DT and RC selected in this paper are strength indexes, and WET and KE are energy indexes. These four indexes are more consistent with the nature of CBL, so the dataset established in this paper is of high quality.
Model selection is also a key factor affecting the classification effect. The SVM model constructed in this paper has a high matching degree with the imbalanced sample set, and the SVM model based on the principle of structural risk minimization can avoid the problem of over-learning, enhance the generalization ability, and reduce the requirement of data size and distribution.
The CBL data contains information about energy and strength. In this paper, two kinds of data processing methods, normalization and 5-fold cross-validation, are carried out on the sample data. The 5-fold cross-validation can avoid the limitations and particularities of a fixed-partition dataset, thus improving the generalization ability of the model, and is very suitable for the imbalanced sample set established in this paper.
(2)
Model Optimization
The classification accuracy A, F1-score and kappa coefficient selected in this paper comprehensively consider the overall and local classification effects of the model, and are suitable for the classification of imbalanced sample sets. As can be seen from Table 3, from the overall classification effect, the performance of the GWO-SVM model is the best, with a classification accuracy of 98.9%, higher than that of the other seven SVM models. The F1-score value (0.993) and kappa coefficient value (0.980) of this model are also the largest among all models, indicating that the model has the best classification effect for each level of CBL classification. As can be seen from Figure 10, the classification accuracy of this model in different levels of bursting liability is the largest, so the GWO-SVM model is selected as the optimal SVM model built in this paper.

4. Engineering Application

In order to verify the practicability of the GWO-SVM model, the model was applied to the 4# coal seam of the Xiaozhuang coal mine [30] and the 1# coal seam of Wanfeng coal mine for CBL classification [31].
(1)
4# Coal Seam of Xiaozhuang Coal Mine
The 4# coal seam is the main coal seam, and the coal pillar of the 40214 working face will cause stress concentration. The buried depth of the coal seam is approximately 430~600 m, the average buried depth is 560 m, and the mining depth exceeds the critical depth of coal burst of 380 m. During the mining of the 40203 working surface, strong mine-pressure behaviors appeared several times. The optimized GWO-SVM model was applied to classify the CBL of three groups of samples from the 4# coal seam. In accordance with the GB/T 25217.2-2010, engineering-guidance-level determination results are shown in Table 5; the single index in the table determines the results on the right-hand side of the index value, and the results are shown in Table 5.
(2)
1# Coal Seam of Wanfeng Coal Mine
The Wanfeng coal mine adopts the underground mining method: the mining level is +430~−10 m, and no coal burst occurred in the mining process. The engineering-guidance level obtained from the single-index discrimination result and the actual situation are consistent with the engineering guidance level. The GWO-SVM model is adopted to classify the bursting liability of the coal seam and compare it with the actual situation. See Table 6 for details.
It can be seen from Table 5 and Table 6 that the classification results of the GWO-SVM model selected in this paper are consistent with the actual situation of the mine, and it is an intelligent model that can manage the classification of CBL.

5. Conclusions

The existing CBL classification methods are challenging for giving clear results when identifying specific types of coal, and the calculations are complicated and subject to subjective factors. In view of these deficiencies, DT, WET, KE and RC were selected as classification indexes, an imbalanced sample set was constructed based on 95 groups of measured data, and eight SVM models were established to carry out intelligent classification and optimization of CBL. The following conclusions were reached:
(1) Differently from the existing data segmentation methods, this paper adopted 5-fold cross-validation to divide the dataset into training set and test set. The classification accuracy of the eight SVM models is above 87.4%, with an average of 94.0%.
(2) Aiming at the problem of sample imbalance at different levels, the classification accuracy A, F1-score and kappa coefficient were comprehensively used to evaluate the performance of the established models, and the best model was selected as the GWO-SVM model, with a classification accuracy of 98.9%, F1-score of 0.993, and kappa coefficient of 0.980. All of these were the highest values in the eight SVM models.
(3) The GWO-SVM model was applied to the working surfaces of the Xiaozhuang Coal Mine and Wanfeng Coal Mine, for CBL classification. The results were consistent with the actual situation on site, indicating that the model has good practicability.

Author Contributions

Y.L. (Yuefeng Li): data curation, writing–original draft, methodology, validation, investigation; C.W.: funding acquisition, supervision, conceptualization, methodology, software, writing–review and editing; Y.L. (Yv Liu): software, writing–review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Science and Research Fund from the Educational Department of Yunnan Province (2021J0060), the National Natural Science Foundation of China (52264019), the Major Science and Technology Special Project of Yunnan Province (202202AG050014), and the Yunnan Innovation Team (202105AE160023).

Data Availability Statement

The data used to support the findings of this study are included within the article.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Cao, A.Y.; Liu, Y.Q.; Jiang, S.Q.; Hao, Q.; Peng, Y.J.; Bai, X.X.; Yang, X. Numerical investigation on influence of two combined faults and its structure features on rock burst mechanism. Minerals 2021, 11, 1438. [Google Scholar] [CrossRef]
  2. Mottahedi, A.; Ataei, M. Fuzzy fault tree analysis for coal burst occurrence probability in underground coal mining. Tunn. Undergr. Space Technol. 2019, 83, 165–174. [Google Scholar] [CrossRef]
  3. Vardar, O.; Zhang, C.G.; Canbulat, I.; Hebblewhite, B. A semi-quantitative coal burst risk classification system. Int. J. Min. Sci. Technol. 2018, 28, 721–727. [Google Scholar] [CrossRef]
  4. Zhang, C.G.; Canbulat, I.; Hebblewhite, B.; Ward, C.R. Assessing coal burst phenomena in mining and insights into directions for future research. Int. J. Coal Geol. 2017, 179, 28–44. [Google Scholar] [CrossRef]
  5. Mark, C. Coal bursts that occur during development: A rock mechanics enigma. Int. J. Min. Sci. Technol. 2018, 28, 35–42. [Google Scholar] [CrossRef]
  6. He, S.Q.; He, X.Q.; Song, D.Z.; Li, Z.L.; Chen, J.Q.; Xue, Y.R.; Li, Y. Multi-parameter integrated early warning model and an intelligent identification cloud platform of rockburst. J. China Univ. Min. Technol. 2022, 51, 850–862. [Google Scholar]
  7. Bieniawski, Z.T.; Denkhaus, H.G.; Vogler, U.W. Failure of fractured rock. Int. J. Min. Sci. Technol. 1969, 6, 323–330. [Google Scholar] [CrossRef]
  8. Bieniawsk, Z.T. Mechanism of brittle fracture of rocks (Part I, П and Ш). Int. J. Min. Sci. Technol. 1967, 4, 395–406, 425–426. [Google Scholar]
  9. Ju, W.J.; Lu, Z.G.; Gao, F.Q.; Zhao, Y.X.; Li, W.Z.; Sun, Z.Y.; Hao, X.J. Research progress and comprehensive quantitative evaluation index of coal rock bursting liability. Chin. J. Rock Mech. Eng. 2021, 40, 1839–1856. [Google Scholar]
  10. Li, Y.F. Comparison and Optimization of Multi-Index Evaluation Models for Coal Sample Bursting Liability Classification. Master’s Thesis, Kunming University of Science and Technology, Kunming, China, 2022. [Google Scholar]
  11. Su, C.D.; Yuan, R.F.; Zhai, X.X. Experimental research on bursting liability index of coal samples of Chengjiao coal mine. Chin. J. Rock Mech. Eng. 2013, 32, 3696–3704. [Google Scholar]
  12. Yang, D.S.; Chen, W.Z.; Yang, W.M.; Li, S.C.; Li, Y.C.; Zhun, W.S. Stability analysis of surrounding rock mass of Longtan underground caverns. Rock Soil Mech. 2004, 3, 391–395. [Google Scholar]
  13. Cook, N.G.W.; Hoek, E.; Pretorius, J.P.G.; Ortlepp, W.D.; Salamon, H.D.G. Rock mechanics applied to the study of rockbursts. J. S. Afr. Inst. Min. Metall. 1966, 66, 435–528. [Google Scholar]
  14. GT/T25217.2-2010; The Professional Standards Compilation Group of People’s Republic of China. Classification and Laboratory Test Method on Bursting Liability of Coal. Standards Press of China: Beijing, China, 2010.
  15. Wang, C. Research of Rockburst Risk Comprehensive Evaluation Method Based on Unascertained Measurement Model and Application. Ph.D. Thesis, China University of Mining and Technology, Xuzhou, China, 2011. [Google Scholar]
  16. Zhang, X.Y.; Kang, L.X.; Yang, S.S. Study on the distribution situation of REE in some tailing ore and its synthctic recovery. Saf. Coal Mines 2009, 40, 74–76. [Google Scholar]
  17. Zhou, X.J.; Xian, X.F. Relationship between rock burst tendency and residual energy release rate. West-China Explor. Eng. 1999, 1, 46–50. [Google Scholar]
  18. Wang, H.T.; Xu, J.; Wei, F.S.; Xian, X.F. Evalutaion of tendency indexes of coal or rock burst. J. Min. Saf. Eng. 1999, Z1, 204–207+210–239. [Google Scholar]
  19. Zhou, X.J. Study on the Rockburst Conditions and It’s Control Theory and Application. Ph.D. Thesis, Chongqing University, Chongqing, China, 1997. [Google Scholar]
  20. Wu, S.C.; Li, L.P.; Zhang, X.P. Rock Mechanics; Higher Education Press: Beijing, China, 2021. [Google Scholar]
  21. Wang, J.A.; Park, H.D. Comprehensive prediction of rock burst based on analysis of strain energy in rock. Tunn. Undergr. Space Technol. 2001, 16, 49–57. [Google Scholar] [CrossRef]
  22. Xu, J.K.; Wang, E.Y.; Wang, C. Study of rock burst tendency of coal based on uncertainty measurement theory. Saf. Coal Mines 2011, 42, 19–22. [Google Scholar]
  23. Wang, C.; Wang, E.Y.; Liu, X.F. Classification of rock burst tendency of coal seam based on entropy and ideal point method. J. Liaoning Tech. Univ. Nat. Sci. Ed. 2012, 31, 838–841. [Google Scholar]
  24. Jia, X.W.; Wang, E.Y. Coal burst tendency classification based on attribute mathematical model. Saf. Coal Mines 2014, 31, 838–841. [Google Scholar]
  25. Guo, J.D. Application of variable weight and grey classification recognition model in rock burst tendency evaluation of coal seam. J. North China Inst. Sci. Technol. 2017, 14, 44–49. [Google Scholar]
  26. Wang, C.; Song, D.Z.; Zhang, C.L.; Liu, L.; Zhou, Z.H.; Huang, X.C. Research on the classification model of coal’s bursting liability based on database with large samples. Arab. J. Geosci. 2019, 12, 411. [Google Scholar] [CrossRef]
  27. Brierley, S.D.; Chiasson, J.N.; Lee, E.B.; Zak, S. On stability independent of delay for linear systems. IEEE Trans. Autom. Control 1982, 27, 252–254. [Google Scholar] [CrossRef]
  28. Park, S.; Jung, D.; Nguyen, H.; Choi, Y. Diagnosis of problems in truck ore transport operations in underground mines using various machine learning models and data collected by internet of things systems. Minerals 2021, 11, 1128. [Google Scholar] [CrossRef]
  29. Li, Y.F.; Wang, C.; Xu, J.K.; Zhou, Z.H.; Xu, J.H.; Chen, J.W. Rockburst prediction based on the KPCA-APSO-SVM model and its engineering application. Shock Vib. 2021, 2021, 7968730. [Google Scholar] [CrossRef]
  30. Zhao, H. Research on rockburst risk assessment and prevention and control design in Xiaozhuang coal mine. Shaanxi Coal 2022, 41, 100–104+148. [Google Scholar]
  31. Du, X.J. Wanfeng coal mine 1# coal seam burst tendency determination and analysis of research. Energy Technol. Manag. 2022, 47, 108–110. [Google Scholar]
Figure 1. Field damage of coal bursts in Wudong Coal Mine, China.
Figure 1. Field damage of coal bursts in Wudong Coal Mine, China.
Minerals 13 00015 g001
Figure 2. The proportion of three categories of CBL samples: (I) strong; (II) weak; (III) none.
Figure 2. The proportion of three categories of CBL samples: (I) strong; (II) weak; (III) none.
Minerals 13 00015 g002
Figure 3. Boxplot of the classification indexes: (a) DT; (b) WET; (c) KE; (d) RC.
Figure 3. Boxplot of the classification indexes: (a) DT; (b) WET; (c) KE; (d) RC.
Minerals 13 00015 g003
Figure 4. Correlation analysis of classification indexes: (a) DT and WET; (b) DT and KE; (c) DT and RC; (d) WET and KE; (e) WET and RC; (f) KE and RC.
Figure 4. Correlation analysis of classification indexes: (a) DT and WET; (b) DT and KE; (c) DT and RC; (d) WET and KE; (e) WET and RC; (f) KE and RC.
Minerals 13 00015 g004
Figure 5. Ridge plots of classification index data. (a) Strong CBL; (b) Weak CBL; (c) None CBL.
Figure 5. Ridge plots of classification index data. (a) Strong CBL; (b) Weak CBL; (c) None CBL.
Minerals 13 00015 g005
Figure 6. Ridge plots of classification index data after standardization: (a) strong CBL; (b) weak CBL; (c) No CBL.
Figure 6. Ridge plots of classification index data after standardization: (a) strong CBL; (b) weak CBL; (c) No CBL.
Minerals 13 00015 g006
Figure 7. SVM parameter optimization.
Figure 7. SVM parameter optimization.
Minerals 13 00015 g007
Figure 8. Research flowchart.
Figure 8. Research flowchart.
Minerals 13 00015 g008
Figure 9. Classification results of eight SVM models: (a) LSVM; (b) QSVM; (c) CSVM; (d) FG-SVM; (e) MG-SVM; (f) CG-SVM; (g) APSO-SVM; (h) GWO-SVM.
Figure 9. Classification results of eight SVM models: (a) LSVM; (b) QSVM; (c) CSVM; (d) FG-SVM; (e) MG-SVM; (f) CG-SVM; (g) APSO-SVM; (h) GWO-SVM.
Minerals 13 00015 g009
Figure 10. Comparison of model-classification performance.
Figure 10. Comparison of model-classification performance.
Minerals 13 00015 g010
Table 1. Measurement data and classification of coal samples.
Table 1. Measurement data and classification of coal samples.
No.IndexEngineering Guidance LevelNo.IndexEngineering Guidance Level
DT/msWETKERC/MpaDT/msWETKERC/Mpa
11617.17.0825.6I491931.145.7412.5II
2335.2692.84I5069.11.142.1II
344.851126.3I5156.31.92.01II
474.8965.8I526641.71.8II
575.9675.4I53775.84.61.4II
6136.51235.1I543002.762.7411.8II
7143.4665.67I553512.631.6413II
8144.11123.23I562553.43.711.2II
9228.2874.3I572124.340.887.31II
102246.446.3218.6I583636.021.348.61II
11345.156.517.4I591403.193.3413.3II
1226712.40.8724.8I602492.151.8412II
134214.43.6329I61536.624.329.52II
14133.675.6729.2I621380.9683.5410II
15454.345.9922.6I63824.7810.213.9II
163168.11.416.7I643404.581.2615.8II
176812.76.7314.6I652753.533.5613.1II
1834611.36.4515.2I663061.632.069.93II
194512.312.618.8I67554.674.9316II
203310.39.8412.1I683013.842.4710.2II
21209.438.7216.5I69924.34.5314.7II
225419.61.2917.3I702523.111.936.51II
23123.62.337.7I712162.621.8113.1II
244111.911.85.49I723163.472.274.93II
25479.24.1317.4I732843.961.8410.5II
263117.55.4211.9I742563.962.3117II
276610.46.328.6I753916.291.489.4II
282442.8523.9I762881.632.3415.4II
29427.395.6720.5I772391.42.0311.1II
30455.134.9618.5I781194.552.9515.3II
31468.1210.625.9I79583.772.1319.8II
32406.497.7324.3I801564.193.6714.4II
33344.4512.624.4I813752.11.9311.4II
34197.4213.79.72I821375.284.1513.8II
354314.611.834.1I832582.012.0512.5II
3616717.615.722.6I841852.783.2613.2II
376912.535.733.9I854643.161.58.9II
383010.914.829.3I862132.062.6821.4II
39325.355.2519.2I872879.184.999.53II
40445.035.9618.4I88902.52.69.51II
41333.632.2529.4I892601.881.6712.9II
42153.375.2530I9029431.12.172.19III
43488.069.410.6I91831.21.31.5III
441896.056.4918.6I9214143.292.385.26III
454612.231.457.3II937602.131.175.92III
464092.161.398.16II945201.61.90.4III
473065.912.488.86II957251.581.45.36III
481022.672.2613.3II
Table 2. Information statistics of CBL classification indexes.
Table 2. Information statistics of CBL classification indexes.
IndexCBL LevelMinimumMaximumMedianIQRLower QuartileUpper QuartileMeanSDSE
DT (ms)I434640.52821.549.565.88682.26712.402
II5464249182119301221.8120.57217.974
III832943742.5679.25571.251250.51074.1671011.746413.044
WETI3.3719.67.2456.0154.985118.2474.210.635
II0.9689.183.472.352.234.583.7471.8890.282
III1.13.291.590.7031.2951.9981.8170.8080.33
KEI0.871236.9058.5975.37713.97523.08534.3435.177
II0.8810.22.271.731.813.542.7691.630.243
III1.172.381.650.7771.3252.1031.720.50.204
RC (MPa)I2.8437.718.5513.42511.5752518.4429.3521.41
II1.421.411.44.48.913.311.0754.3770.653
III0.45.923.7253.6631.6725.3353.4382.3540.961
Table 3. Standardized data.
Table 3. Standardized data.
No.IndexEngineering Guidance LevelNo.IndexEngineering Guidance Level
X1X2X3X4X1X2X3X4
10.0534 0.3291 0.0508 0.6756 I490.0643 0.0092 0.0399 0.3244 II
20.0099 0.2271 0.5578 0.0654 I500.0007 0.4365 0.0022 0.0456 II
30.0000 0.2084 0.9099 0.1582 I510.0003 0.2862 0.0084 0.0432 II
40.0010 0.2057 0.7789 0.1448 I520.0211 0.1627 0.0068 0.0375 II
50.0010 0.2647 0.5415 0.1340 I530.0248 0.2593 0.0305 0.0268 II
60.0031 0.2969 1.0000 0.1260 I540.1007 0.0962 0.0153 0.3056 II
70.0034 0.1305 0.5333 0.1413 I550.1181 0.0892 0.0063 0.3378 II
80.0034 0.1681 0.9099 0.0759 I560.0854 0.1305 0.0232 0.2895 II
90.0061 0.3881 0.7052 0.1046 I570.0708 0.1810 0.0001 0.1853 II
100.0749 0.2937 0.0446 0.4879 I580.1222 0.2711 0.0038 0.2201 II
110.0102 0.2245 0.0461 0.4558 I590.0463 0.1193 0.0202 0.3458 II
120.0895 0.6136 0.0000 0.6542 I600.0834 0.0634 0.0079 0.3110 II
130.0129 0.7209 0.0226 0.7668 I610.0167 0.3033 0.0282 0.2445 II
140.0031 0.1450 0.0393 0.7721 I620.0456 0.0000 0.0219 0.2574 II
150.0140 0.1810 0.0419 0.5952 I630.0265 0.2046 0.0764 0.3619 II
160.1062 0.3828 0.0043 0.4370 I640.1143 0.1939 0.0032 0.4129 II
170.0218 0.6297 0.0480 0.3807 I650.0922 0.1375 0.0220 0.3405 II
180.1164 0.5545 0.0457 0.3968 I660.1028 0.0355 0.0097 0.2555 II
190.0140 0.6082 0.0960 0.4933 I670.0174 0.1987 0.0332 0.4182 II
200.0099 0.5009 0.0734 0.3137 I680.1011 0.1541 0.0131 0.2627 II
210.0054 0.4542 0.0643 0.4316 I690.0299 0.1788 0.0300 0.3834 II
220.0170 1.0000 0.0034 0.4531 I700.0844 0.1150 0.0087 0.1638 II
230.0027 0.1413 0.0117 1.0000 I710.0721 0.0887 0.0077 0.3405 II
240.0126 0.5867 0.0895 0.1365 I720.1062 0.1343 0.0115 0.1214 II
250.0146 0.4418 0.0267 0.4558 I730.0953 0.1606 0.0079 0.2708 II
260.0092 0.8873 0.0373 0.3083 I740.0857 0.1606 0.0118 0.4450 II
270.0211 0.5062 0.0445 0.7560 I750.1317 0.2856 0.0050 0.2413 II
280.0068 0.1627 0.0162 0.6300 I760.0966 0.0355 0.0120 0.4021 II
290.0129 0.3447 0.0393 0.5389 I770.0800 0.0232 0.0095 0.2869 II
300.0140 0.2234 0.0335 0.4853 I780.0391 0.1922 0.0170 0.3995 II
310.0143 0.3839 0.0797 0.6836 I790.0184 0.1504 0.0103 0.5201 II
320.0122 0.2964 0.0562 0.6408 I800.0517 0.1729 0.0229 0.3753 II
330.0102 0.1869 0.0960 0.6434 I810.1262 0.0608 0.0087 0.2949 II
340.0051 0.3463 0.1051 0.2499 I820.0453 0.2314 0.0269 0.3592 II
350.0133 0.7316 0.0895 0.9035 I830.0864 0.0559 0.0097 0.3244 II
360.0555 0.8927 0.1214 0.5952 I840.0616 0.0973 0.0196 0.3432 II
370.0221 0.6189 0.2852 0.8981 I850.1565 0.1176 0.0052 0.2279 II
380.0088 0.5331 0.1141 0.7748 I860.0711 0.0586 0.0148 0.5630 II
390.0095 0.2352 0.0359 0.5040 I870.0963 0.4407 0.0337 0.2448 II
400.0136 0.2180 0.0417 0.4826 I880.0293 0.0822 0.0142 0.2442 II
410.0099 0.1429 0.0113 0.7775 I890.0871 0.0489 0.0066 0.3351 II
420.0037 0.1289 0.0359 0.7936 I901.0000 0.0071 0.0106 0.0480 III
430.0150 0.3806 0.0698 0.2735 I910.0269 0.0125 0.0035 0.0295 III
440.0629 0.2728 0.0460 0.4879 I920.4798 0.1246 0.0124 0.1303 III
450.1555 0.0677 0.0047 0.1850 II930.2572 0.0624 0.0025 0.1480 III
460.1378 0.0640 0.0043 0.2080 II940.1756 0.0339 0.0084 0.0000 III
470.1028 0.2652 0.0132 0.2268 II950.2453 0.0328 0.0043 0.1330 III
480.0333 0.0913 0.0114 0.3458 II
Table 4. Evaluation performance of SVM models.
Table 4. Evaluation performance of SVM models.
No.SVM ModelAccuracy (%)F1-ScoreKappa Coefficient
1LSVM91.60.8040.843
2QSVM97.90.9600.962
3CSVM97.90.9600.962
4FG-SVM88.40.8170.786
5MG-SVM95.80.9200.923
6CG-SVM87.40.7740.765
7APSO-SVM93.70.8780.884
8GWO-SVM98.90.9930.980
Table 5. CBL classification results of 4# Coal Seam.
Table 5. CBL classification results of 4# Coal Seam.
No.DT (ms)Level
(DT)
WETLevel
(WET)
KELevel
(KE)
RC (MPa)Level
(RC)
Engineering Guidance LevelGWO-SVM
1208.8II7.316I1.526II13.232IIIIII
2172.8II14.948I1.561II18.105IIIII
3201.6II6.529I1.724II20.615IIIII
Table 6. CBL classification results of 1# Coal Seam.
Table 6. CBL classification results of 1# Coal Seam.
No.DT (ms)Level
(DT)
WETLevel
(WET)
KELevel
(KE)
RC (MPa)Level
(RC)
Engineering Guidance LevelGWO-SVM
1463II1.216III1.266III4.133IIIIIIII
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, Y.; Wang, C.; Liu, Y. Classification of Coal Bursting Liability Based on Support Vector Machine and Imbalanced Sample Set. Minerals 2023, 13, 15. https://doi.org/10.3390/min13010015

AMA Style

Li Y, Wang C, Liu Y. Classification of Coal Bursting Liability Based on Support Vector Machine and Imbalanced Sample Set. Minerals. 2023; 13(1):15. https://doi.org/10.3390/min13010015

Chicago/Turabian Style

Li, Yuefeng, Chao Wang, and Yv Liu. 2023. "Classification of Coal Bursting Liability Based on Support Vector Machine and Imbalanced Sample Set" Minerals 13, no. 1: 15. https://doi.org/10.3390/min13010015

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop