Next Article in Journal
Neural Network-Based Analysis and Its Application to Spectroscopy for Mango
Previous Article in Journal
Modeling and Analysis of a Long-Range Target Localization Problem Based on an XS Anode Single-Photon Detector
Previous Article in Special Issue
Randomly Distributed Passive Seismic Source Reconstruction Record Waveform Rectification Based on Deep Learning
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Machine Learning-Based Precursor Detection Using Seismic Multi-Parameter Data

1
China Earthquake Networks Center, Beijing 100045, China
2
Seismological Bureau of Xinjiang Wulumuqi Autonomous Region, Urumqi 830011, China
3
Institute of Earthquake Forecasting, China Earthquake Administration, Beijing 100036, China
4
Shanghai Artificial Intelligence Laboratory, Shanghai 200232, China
*
Authors to whom correspondence should be addressed.
Appl. Sci. 2024, 14(6), 2401; https://doi.org/10.3390/app14062401
Submission received: 6 February 2024 / Revised: 7 March 2024 / Accepted: 9 March 2024 / Published: 13 March 2024
(This article belongs to the Special Issue Machine Learning Applications in Seismology)

Abstract

:
The application of certain mathematical–statistical methods can quantitatively identify and extract the abnormal characteristics from the observation data, and the comprehensive analysis of seismic multi-parameters can study and judge the risk of the tectonic regions better than a single parameter. In this study, the machine learning-based detection of seismic multi-parameters using the sliding extreme value relevancy method, based on the earthquake-corresponding relevancy spectrum, was calculated in the tectonic regions in the western Chinese mainland, and the R-value evaluation was completed. Multi-parameter data included the b value, M value (missing earthquakes), ƞ value (the relationship between seismic magnitude and frequency), D value (seismic hazard), Mf value (intensity factor), N value (earthquake frequency), and Rm value (modulation parameter). The temporal results showed that the high-value anomalies appeared before most target earthquakes during the training period. Moreover, some target earthquakes also occurred during the advantageous extrapolation period with high-value anomalies. The spatial results showed that some months before the target earthquakes, there was indeed a significant abnormal enhancement area that appeared near the epicenter, and the anomaly gradually disappeared after the earthquakes. This study demonstrated that machine learning techniques for detecting earthquake anomalies using seismic multi-parameter data were feasible.

1. Introduction

Various seismological parameters may exhibit different degrees of precursor anomaly characteristics before major earthquakes and reveal certain laws of earthquake development from different perspectives. There were many studies on the b value within seismological parameters [1,2,3]. The b value is a critical parameter in seismic hazard studies [4], and a high b value indicates a larger proportion of small earthquakes, and vice versa. In addition, earthquake frequency is also a commonly used seismological parameter internationally [5,6]. What would be the effect of integrating these seismic parameters for analysis? There has been relatively little comprehensive study of seismic parameters, which could solve the limitations of single parameter and method approaches [7]. The comprehensive analysis of multiple methods based on the quantitative description of a single anomaly is of great significance in earthquake research. Based on this idea, the study was conducted on the quantitative identification and extraction of anomalies from observation data using mathematical–statistical methods. For example, Wang Haitao [8] transformed the time series of original data into a probability time series based on the corresponding relevancy spectrum in different investigation durations, and used the multi-point cumulative moving average method to obtain the time curve of average probability; then, the earthquake precursor anomalies were identified, providing quantitative single factor data for the comprehensive method. Bo [9] used the multi-point group slope method and composite information flow method to convert the deformation data into the ‘standardized information curve’. Wang [10] quantitatively identified and extracted anomalies from various seismic parameter data in the western section of the southern Tianshan of Xinjiang by using the multi-parameter sliding extreme-value relevancy based on the earthquake-corresponding relevancy spectrum. Lu also conducted a long-term tracking study on the central and southern sections of the Tanlu fault, utilizing the earthquake-corresponding relevancy spectrum of seismic multi-parameters, and found that the earthquake-corresponding efficiency calculated by combining multiple advantageous parameters was indeed better than that of a single parameter [11].
Some studies also found that although some calculated factors and models were different, the regional scales of the predicted earthquake were consistent [12,13,14,15]. Perhaps more earthquake factors could more easily predict the spatial–temporal and intensity information of earthquakes. Therefore, it is possible to try combining multiple different factors within a unified and reasonable physical framework for the extraction of seismic anomalies. For example, establishing a physically reasonable framework that combines the pattern informatics (PI) prediction method [16,17] with the load/unload response ratio (LURR), state vector (SV), and accelerating moment release (AMR) method [18] could be considered. Alternatively, the LURR method could be combined with other methods for seismic hazard assessment [19]. Scholars have also attempted to use the LURR method to study the electromechanical coupling process before large earthquakes, using geoelectric data and Benioff strain data from small earthquakes as input [20], and the results were both well. These studies fully demonstrated the advantages of using multiple different factors and models for comprehensive calculation. Recently, there have been many studies on the application of machine learning for earthquake precursor mining and analysis. Possible precursors from the surface to the ionosphere using machine learning techniques were analyzed [21]. And earthquake precursors could be detected by a novel machine learning-based technique with GPS data [22].
The strongest earthquakes in China occur in the western part of the Chinese mainland. This paper focuses on analyzing the characteristics of comprehensive anomalies before and after target earthquakes in the study area through the calculation of the comprehensive probability of seismic multi-parameters in several tectonic areas in the western Chinese mainland. Moreover, the evaluations of the R-value for these tectonic areas were calculated, and the effectiveness of corresponding earthquakes during the advantageous prediction time for the regions with anomalies was tested.

2. Materials and Methods

2.1. Materials

The study areas mainly encompassed 12 tectonic regions in the western Chinese mainland, including the western section of southern Tianshan, the middle of Tianshan, the northwest of Yunnan, the east of the border between Sichuan and Yunnan, the western portions of southern Xizang, Yutian, Xinjiang, and so on (Figure 1). The blue lines in Figure 1 represents the faults. Meanwhile, a completeness analysis and collation of earthquakes since 1991 were carried out. The completeness of earthquake catalogs directly affects the understanding of seismic activity patterns. For a certain tectonic zone, if the earthquake catalog is basically complete in a certain period, the annual frequency of earthquakes in a certain magnitude segment should basically be similar [23]. The seismic data in this study primarily relied on the local magnitude (ML) and adopted the ≥ML3.0 earthquake catalog from January 1991 to December 2021 provided by the China Earthquake Networks Center, with good completeness.
The appropriate earthquake catalogs for each region in the training database were selected, and the multi-parameter sliding extreme-value relevancy of the training database was calculated, combined with the target earthquakes. Then, the study extrapolated the probability and advantageous time range of target earthquakes in each study area and completed the calculation of the R-value evaluation for each study area.

2.2. Method

Assuming the observation sequence was x 1 , x 2 , …, x n , x max was the maximum value in the sequence, x min was the minimum value. The mean value, x ¯ , and standard deviation, σ , of the sequence x i (i = 1,2,…, n) could be calculated as follows:
x ¯ = 1 n i = 1 n x i
σ = i = 1 n ( x i x ¯ ) 2 / ( n + 1 )
Based on the calculation of the mean value and standard deviations, the rules for the value range distribution interval are defined as follows:
D x 0 ( x ¯ k σ , x ¯ + k σ )
D x 1 [ x ¯ + k σ , x ¯ + 2 k σ )
D x m [ x ¯ + m k σ , x max ]
D x 1 ( x ¯ 2 k σ , x ¯ k σ ]
D x m [ x min , x ¯ m k σ ]
D x 0 , D x 1 , …, D x m in the formula represent the observed values in different value ranges. Formula (3) calculates the frequency of the observed value sequence, x i , distributed across different intervals, thereby constructing a range spectrum curve to define the distribution interval of the value ranges. Taking the range spectrum curve of the seismic parameter, b, in the western section of southern Tianshan as an example (Figure 2), 0 on the horizontal axis corresponds to the value range of D x 0 , which was near the mean. Value 4 corresponds to the value range D x 4 , −4 corresponds to the value range D x 4 , and so on. The vertical axis represents the frequency of data in the corresponding range.
The range spectrum curve is similar to a normal distribution, indicating that the range selection is appropriate. k and m are both parameters of the range spectrum curve, where k could adjust the size of each range interval and m could adjust the number of range intervals. This study conducted extensive data calculations and found that the best results were achieved by uniformly using k = 0.35 and m = 6 to eliminate errors caused by calculating with different parameters of the value range.
Based on the analysis of the value range spectrum, the earthquake-corresponding relevancy spectra (ECRSs) in different value ranges could be solved. By using the ECRS, one can analyze the basic abnormal characteristics of the original observation value sequence and determine the abnormal reliability attributes of data in different value ranges. Firstly, the target earthquakes for retrospective testing in different study areas must be determined. Then, the data falling into the value range from D x m to D x m (from low to high) are counted, point by point, according to the time series of observation data. At the same time, the method needs to count whether there are target earthquakes in different inspection periods t separately.
According to the above rules, the corresponding number of occurred earthquakes, n D x m , as well as the total number, N D x m , in the corresponding value range are determined through the count of the observation value sequence, point by point. Then, the ratio is calculated by P D x m = n D x m / N D x m . The ECRSs in different value ranges of the observation value sequence could be obtained by counting the corresponding results of all value ranges.
The time series of different seismic parameters based on the ECRS is x i j (i = 1, 2, …, n; j = 1, 2, …, k; k represent the different seismic parameter numbers). The relevancy time sequence, p i j , corresponding to different parameters, j, could be converted from the P D x m in different value ranges, point-to-point. Then, the sliding average relevancy sequence values, p ¯ i j , of different seismic parameters, according to the lengths of different inspection periods, t, are obtained by the multi-point cumulative average and point-by-point sliding calculation methods.
Setting t = 12 (month) represents the different investigation durations. The sliding average relevancy sequence, p ¯ i j , for different seismic parameters is as follows:
p ¯ i j = ( p i j + p ( i + 1 ) j + + p ( i + t 1 ) j ) / t ,   i = 1 , 2 , , n t + 1
By analyzing the sliding average relevancy sequences, p ¯ i j , of different seismic parameters, the precursor anomaly characteristics of individual seismic parameters could be quantitatively identified and studied.
Based on the above analysis, we could obtain the multi-point sliding extreme-value relevancy sequence, M i j , of the sliding average relevancy sequence, p ¯ i j , of different parameters, point by point, and finally, calculate the average value of the sliding extreme-value relevancy sequences, M j , of different parameters to obtain the multi-parameter sliding extreme-value relevancy value, p c . In order to observe the curve changes more clearly, 6 points are selected as the sliding window lengths in this paper, and the comprehensive results of sliding, point by point, are marked on the time coordinates of the last point.
Suppose that j = 1, 2, …, k (k represents the different seismic parameters), w = 6 (w represents the sliding window lengths of different points).
M i j = M a x { p ¯ i j , p ¯ ( i + 1 ) j , , p ¯ ( i + w 1 ) j } , i = 1 , 2 , , n w + 1
M j = ( M 1 j , M 2 j , , M ( n w + 1 ) j ) , j = 1 , 2 , , k
In the formula, M i j represents the multi-point sliding extreme-value relevancy for different parameters; M j represents the multi-point sliding extreme-value relevancy of a single parameter, j.
Then, the multi-parameter sliding extreme-value relevancy value, p c , is calculated by averaging the values of M j from different parameters.
p c = j = 1 k M j / k

3. Results

3.1. Multi-Parameter Relevancy Spectrum Calculation

The most reasonable single parameter and its threshold settings, within the probability spectrum of earthquake occurrence and the value range spectrum in each study area, were analyzed through time scanning with different statistical window lengths and step sizes. After years of analysis and experience summaries, the optimal seven parameters were ultimately selected to participate in the calculation of each research area, namely, the b value, M value (missing earthquake), and η Value—these three parameters could describe the relationship between seismic magnitude and frequency; the D value describes seismic hazard; the Mf value denotes the intensity factor; the N value denotes the earthquake frequency, which describes the intensity changes of seismic activity; and the Rm value is the environmental modulation parameter.
A large portion of the computational analysis was conducted across different research areas. Using the three study areas of border region as examples, e.g., Qinghai and Xizang, the western section of southern Tianshan, and the central and western sections of the Qilian Mountains, different time scanning scales were applied for calculation: 3 months, 6 months, and 12 months. The results showed that the curve did not change much, but the anomaly and earthquake-corresponding rates of the three study areas were relatively high at the 12-month scale (Table 1). Therefore, this study uniformly used a 12-month time scanning scale for all study areas, and the seismic parameters and thresholds used in all study areas were also the same, which could help to avoid errors caused by differences in the time scale and parameters.
Taking the abnormal changes in the border areas of Qinghai and Xizang, where the Maduo M7.4 earthquake occurred on 22 May 2021, as an example, and based on the previous summary and analysis that 12 months was a good sliding window length for calculation, the window lengths of the time calculations for seven parameters were all selected as 12 months, and the calculation step sizes were all 1 month. The 6-point sliding average and 3-point sliding extreme-value algorithms were used to calculate the multi-parameter sliding extreme-value relevancy of earthquake data from January 1991 to December 2021. According to the time distribution of target earthquakes in the study area, it is necessary to ensure that the training period had sufficient sample data and a certain number of target earthquakes to test the extrapolation effect. Therefore, the deadline for the training period here was selected as December 2020, and the target earthquakes in the study area were set as ML ≥ 6.3 earthquakes. The results that began in January 2021 were extrapolation values after the machine learning, and the threshold of the probability curve was taken as 0.5 times the standard deviation. The threshold line was obtained through a large number of calculations in 12 study areas in the western Chinese mainland. It was found that when 0.5 times the standard deviation of the multi-parameter probability result was taken as the threshold line, the corresponding effect between the probability value anomaly and the target earthquake was the best. Therefore, in order to eliminate the error caused by using different threshold lines, all study areas uniformly used 0.5 times the standard deviation as the threshold line.
Figure 3 shows the study area of the border regions of Qinghai and Xizang. Figure 4 shows the MSER results of the study area. The results show that the earthquake-corresponding rate was good during the training period from January 1991 to December 2020. During the training period, a total of seven target earthquakes occurred, all with high-value anomalies exceeding the threshold line before the earthquakes. The extrapolation period began in January 2021. The probability curve had been in an abnormally high-value state and reached its peak in January 2021. On 19 March 2021, the Xizang Biru ML6.4 earthquake occurred in the study area, proving that the extrapolation effect was good. According to the retrospective study of earthquakes in this region, there was also an anomaly peak before the Qinghai Yushu ML7.4 earthquake on 14 April 2010; the anomaly peak occurred in September 2009 and the Xizang Nierong ML6.6 earthquake occurred on 24 March 2010 (about one month prior). The Maduo ML7.9 earthquake was similar to the Yushu earthquake. In the process of the curve turning and falling, the Xizang Biru ML6.4 earthquake first occurred on 19 March, then the Qinghai Maduo ML7.9 earthquake occurred on 22 May (about two months later).
The spatial scanning calculation of the border regions of Qinghai and Xizang adopted the same seven dominant seismic parameters, which were used in the time scanning. ML ≥ 3.0 earthquakes from January 1991 to December 2020 were selected as the training data. The probability spectrum calculation analysis was carried out with a spatial window length of 2° × 2° and a step size of 0.2° × 0.2°, with a duration of 1 year and a step size of 1 month. January 2021 was the starting time for extrapolation testing. The Xizang Biru ML6.4 earthquake on 19 March 2021 and the Qinghai Maduo ML7.9 earthquake on 22 May 2021 both occurred in the study area. From the results in Figure 5, it could be seen that there were almost no high-value anomalies in the study area from January to May 2019. In June 2019, sporadic high-value anomalies began to appear near the Xizang Biru region. Afterward, the abnormal area and amplitude gradually expanded and strengthened, reaching their maximum value in December 2020. The Biru ML6.4 earthquake occurred in the abnormal area in March 2021. At the same time, abnormal high points began to appear between Yushu and Maduo in Qinghai from March 2021 (inside the red dashed box in Figure 5). Although the abnormal points were small, they showed a gradually increasing trend. The Maduo ML7.9 earthquake occurred to the northeast of the abnormal points in May 2021. Afterward, the abnormal values in both regions gradually weakened, and the high-value abnormal points near the Maduo earthquake completely disappeared in December 2021.
It was found that there were similar highly abnormal changes in 2010. From January 2008 to 2009, the anomaly gradually increased, and almost reached its maximum value in December 2009. In March 2010, the Xizang Nierong ML6.6 earthquake occurred near the anomaly concentration area. At the same time, the abnormal high-value points also gradually increased near the Yushu region in December 2009 (inside the red dashed box in Figure 6). In April 2010, the Qinghai Yushu ML7.4 earthquake occurred near the high-value points. Afterward, the abnormal areas gradually weakened, and almost all of them disappeared in December 2010 (Figure 6).

3.2. Risk Assessment of the Multi-Parameter Relevancy Spectrum in the Western Chinese Mainland

The R-value evaluation is a widely recognized method used for evaluating the effectiveness of earthquake prediction [24,25]. The R-value evaluation is calculated by the difference between the earthquake prediction accuracy rate and the time occupancy rate of prediction. The specific formula is as follows:
R = A O = N a N t S a S t
A represents the earthquake prediction accuracy rate, O represents the time occupancy rate of the predicted area, N a represents the number of predicted earthquakes successfully, N t represents the total number of earthquakes that should be predicted, S a represents the time occupied by prediction, and S t represents the total time spent on prediction research.
Assuming that earthquakes occur independently of each other, the number of earthquakes that occur within a certain period should follow the discretization state of the Poisson distribution. The probability of a single earthquake occurring during this period is represented by the time occupancy rate, O , of the predicted area, and the corresponding significance level should be satisfied as follows [26]:
α = i = N a N t [ ( N t i O i ( 1 O ) N t i ) ] N t i = N t ! i ! ( N t i ) !
In the formula, α represents the significance level, taken as 5%. Based on the known A value, the value O is calculated, and the corresponding R score for the significance level α is obtained and represented by R . When R > R 0 , the predicted result is considered to have high statistical significance [27,28].
In this paper, the R-values of the multi-parameter calculation results of 12 tectonic regions in the western Chinese mainland were evaluated, and finally, 7 tectonic regions passed the test (Table 2), namely, central Tianshan, the western section of southern Tianshan, the border regions of Qinghai and Xizang, Yutian in Xinjiang, Nima in Xizang, the central and western sections of the Qilian Mountains, and the eastern section of southern Xizang. The criterion for determining anomalies was that the relevancy value exceeded the threshold line, and an abnormal peak appeared. The time among the three elements was considered the advantageous extrapolation time when calculating the optimal R-value by different times (Table 2). The extrapolation time period started with the abnormal peak time. Taking central Tianshan in Table 2 as an example, if its relevancy value exceeded the threshold line and reached an abnormal peak in October 2019, its advantageous extrapolation time was 290 days. Therefore, the validity period should be until July 2020. The intensity of the predicted earthquake was based on the minimum magnitude of target earthquakes used for simulation training in the training period. The predicted location refers to the area studied and calculated.

3.3. Extrapolation Inspection

The research data used in this paper were all up to December 2021. Among the seven areas that passed the R-value evaluation test, in addition to the border regions of Qinghai and Xizang, there were also areas with high anomalies, such as the western section of southern Tianshan and the central and western sections of the Qilian Mountains. The advantageous extrapolation times for the western section of southern Tianshan and the central and western sections of the Qilian Mountains were 380 days and 170 days after the multi-parameter relevancy value exceeded the threshold line and experienced abnormal peaks, respectively (Table 2).
Figure 7 shows the study area of the western section of southern Tianshan. Figure 8 shows that the target earthquakes in the western section of southern Tian Shan had good earthquake-corresponding rates during the training period (January 1991 to October 2016). There were seven target earthquakes during the training period, and all high-value anomalies exceeding the threshold line appeared before the earthquakes. Taking November 2016 as the starting time for extrapolation, the relevancy curve was still at a high value. The Aktao ML7.0 earthquake in Xinjiang occurred in the study area on 25 November 2016, which showed that the extrapolation effect was good. However, there was no high-value anomaly before the Jiashi ML6.7 earthquake in Xinjiang on 19 January 2020, which was considered a missed earthquake. Afterward, the relevancy curve continued to rise, and it exceeded the threshold line, reaching an abnormal peak in February 2021, indicating the possibility of ≥ML6.7 earthquakes within the study area. According to Table 2, the advantageous time for the multi-parameter relevancy spectrum in this region was 380 days. Assuming the peak time of the anomaly in February 2021 as the starting time, the validity period would be until 16 February 2022. On 13 February 2022, the Tajikistan MS6.1 earthquake occurred near the anomaly area, corresponding to this high-value anomaly.
The Aktao ML7.0 earthquake in Xinjiang on 25 November 2016 occurred during the extrapolation test period. The spatial evolution of the MSER anomaly in the western section of the southern Tianshan Mountains (Figure 9) showed that the anomaly gradually increased from June 2016, reaching its maximum intensity in September 2016, and the Aktao ML7.0 earthquake occurred near the anomaly area in November. Afterward, the anomaly gradually weakened until it disappeared.
The training period in the central and western sections of the Qilian Mountains (Figure 10) was from January 1991 to December 2015, during which, a total of 10 target earthquakes occurred. Among them, eight target earthquakes had high-value anomalies exceeding the threshold line before the earthquakes, and two target earthquakes were missed (Figure 11). In January 2016, as the starting time for extrapolation, the relevancy curve continued to decline after reaching the abnormal peak in July 2018, and reached its lowest point in February 2019. During this period, there was no corresponding target earthquake. Afterward, the abnormal curve rose again and reached its peak in July 2019. Two months later, on 16 September 2019, the Gansu Zhangye ML5.5 earthquake occurred. The relevancy curve had been continuously decreasing and remained below the threshold line after the Zhangye ML5.5 earthquake in Gansu. The curve began to turn and rise in March 2021, and the Aksai ML6.0 earthquake in Gansu occurred on 26 August 2021 during the curve ascent process, exceeding the threshold line. The curve reached its peak again in January 2022. Table 2 shows that the advantageous extrapolation time in the central and western sections of the Qilian Mountains was 170 days, so the validity period should be until 20 June 2022. From January 2022 to 20 June 2022, the study region experienced the Qinghai Menyuan ML7.1 earthquake on 8 January 2022, the Gansu Sunan ML5.5 earthquake on 17 March 2022, and the Qinghai Delingha ML6.3 earthquake on 26 March 2022, respectively, proving that the extrapolation effect in this study area was very good.
The extrapolation test period for the central and western sections of the Qilian Mountains began in January 2016. The spatial evolution of the MSER anomaly before the 8 January 2022 Qinghai Menyuan ML7.1 earthquake showed that the anomaly amplitude in the study area began to increase from August 2020, with the largest anomaly increase in November 2021 (Figure 12). The red pentagram in Figure 12 represents the epicenter position of the Qinghai Menyuan ML7.1 earthquake in January 2022.

4. Discussion

The abnormal patterns of different parameters before the target earthquakes were different. There were differences in the starting times, peak values, and end times of the anomalies. The comprehensive analysis method of seismic parameters can combine single parameters related to physical processes at different stages of earthquake preparation, which could extract the comprehensive abnormal characteristics during the earthquake preparation process more accurately. Wang [10] conducted a comprehensive multi-parameter study on earthquakes from 1979 to 2008 in the western section of southern Tianshan, and the results showed that target earthquakes in the study area exhibited significant high-probability anomalies of multiple parameters 1–2 years prior to the earthquakes. This research conclusion was consistent with the advantageous extrapolation time of 380 days in the western section of southern Tianshan in this paper. In addition, the anomaly-corresponding rate is the ratio of the number of anomalies corresponding to the target earthquake to the total number of anomalies. The earthquake-corresponding rate is the ratio of the number of target earthquakes with anomalies before the earthquake to the number of target earthquakes. By statistically analyzing the anomaly-corresponding rates and earthquake-corresponding rates of single parameters and multiple parameters in different inspection periods and regions of Xinjiang (Table 3), it was found that the anomaly-corresponding rates and earthquake-corresponding rates of multiple parameters were indeed higher than the predictive efficiency of a single parameter.
Related studies had also found that—when combined with methods such as PI, LURR, SV, and AMR—the earthquake prediction performance was better than that of a single method [18]. At present, this study mainly applied different seismological parameters for comprehensive analysis through machine learning. This method could also comprehensively calculate other observation data of earthquake precursors, such as deformation, electromagnetic, and underground fluid anomalies. It was even possible to merge and summarize all seismic observation data to complete a comprehensive probability analysis. At the same time, by analyzing the spatial evolution of the MSER during the extrapolation test periods, it was found that the target earthquakes generally occurred after the maximum amplitude of anomaly enhancement was reached, and the epicenter was located in or near the anomaly concentration area. This indicated that the model obtained by using this method for machine learning of historical earthquakes had a good extrapolation effect, which can provide a reference for the seismic hazard of the study area.
In addition, the R-value in this manuscript is different from the linear correlation coefficient, R, which measures the relationship between two sets of data. The R-value evaluation is calculated by the difference between the earthquake prediction accuracy rate and the time occupancy rate of prediction, as outlined in Formula (8). The best prediction performance corresponds to paying the minimum cost (O→0) at the highest accuracy rate (A→1), that is, R→1. On the contrary, the worst prediction performance corresponds to paying the maximum cost (O→1) at the lowest accuracy rate (A→0), that is, R→−1. It is generally believed that when R > 0, the success rate of the forecasting is higher than the random probability, and this result has a certain predictive effectiveness [25]. R0 represents the derived R-value based on the significance level of 5% (95% confidence level). Therefore, when R > R 0 , the predicted result is considered to have high statistical significance [27,28].

5. Conclusions

Twelve regions in the western Chinese mainland were studied using a comprehensive multi-parameter method, selecting sufficient earthquake events for machine learning to extrapolate and predict the risk of the study areas. The results showed that there were three structural regions with obvious anomalies during the extrapolation period, namely the border regions of Qinghai and Xizang, the western section of southern Tianshan, and the central and western sections of the Qilian Mountains. And the three structural regions all had corresponding target earthquakes within the advantageous extrapolation time. A total of seven advantageous seismological parameters were selected for training and calculation. In order to eliminate errors, all 12 study areas in this paper used the same seismic parameters, parameter thresholds, spatiotemporal scanning scales, and anomaly threshold lines for their calculation data.
In the calculated results, the extrapolation times of the border regions of Qinghai and Xizang started from January 2021, and the results showed that the relevancy curve of extrapolation was in a highly abnormal state. During the advantageous extrapolation time, the Xizang Biru ML6.4 earthquake on 19 March 2021 and the Qinghai Maduo ML7.9 earthquake on 22 May 2021 occurred, which were similar to the change states of the previous Xizang Nierong ML6.4 earthquake on 24 March 2010 and the Qinghai Yushu ML7.4 earthquake on 14 April 2010. Both spatial evolution processes were also very similar, with earthquakes occurring in areas with abnormally high values and their vicinity. This indicated that the seismic activity in this area had a certain regularity. If similar anomalies occur again in the future, it will be necessary to be vigilant about the risk of short-term continuous occurrences of earthquakes in this region.
The extrapolation effects in the western section of southern Tianshan and the central and western sections of the Qilian Mountains were both highly well through the machine learning-based detection of seismic multi-parameters. The anomaly-corresponding rate and earthquake-corresponding rate were all very high in the calculated results. The application of the comprehensive analysis of seismic multi-parameters in the western Chinese mainland could not only infer the urgency of the target earthquakes (in terms of time) but also distinguish the possible areas where the target earthquakes occurred from the spatial distribution of abnormal areas. In all, the results showed that the extrapolation effect using the machine learning method in the western Chinese mainland was very good.

Author Contributions

Conceptualization, H.W. and X.Z.; methodology, X.L. and Q.W.; validation, W.Y. and L.M.; writing—original draft preparation, X.L.; writing—review and editing, X.Z.; supervision, H.W.; project administration, L.M.; funding acquisition, X.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Key Research and Development Program of China, grant number 2021YFC3000700, and supported by the Shanghai Artificial Intelligence Laboratory; the Open Fund for Earthquake Prediction, grant number XH24013D; the National Nature Science Youth Fund, grant number 41704062; and the Project of Earthquake Tracking, grant number 2024020509.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data source of this manuscript is from the China Earthquake Networks Center (https://news.ceic.ac.cn/, accessed on 1 March 2024).

Acknowledgments

Thanks go out to Tao Xie and Xianghua Jiang from the China Earthquake Networks Center for their help in the research process of this paper.

Conflicts of Interest

The authors declare no conflicts of interest.

Correction Statement

This article has been republished with a minor correction to the existing affiliation information. This change does not affect the scientific content of the article.

References

  1. Wiemer, S.; Katsumata, K. Spatial variability of seismicity parameters in aftershock zones. J. Geophys. Res. 1999, 104, 13135–13151. [Google Scholar] [CrossRef]
  2. Ethen, G. Analysis of the b-values before and after the 23 October 2011 Mw7.2 Van-Erciş, Turkey earthquake. Tectonophysics 2013, 603, 213–221. [Google Scholar]
  3. Laura, G.; Stefan, W. Real-time discrimination of earthquake foreshocks and aftershocks. Nature 2019, 574, 193–200. [Google Scholar]
  4. Danijel, S.; Stefan, W.; Max, W. Variations in earthquake size distribution across different stress regimes. Nature 2005, 437, 539–542. [Google Scholar]
  5. Liu, Z.R. Earthquake frequency and prediction. Bull. Seismol. Soc. Am. 1984, 74, 255–265. [Google Scholar] [CrossRef]
  6. Markus, B. Earthquake frequency and energy in Greece. Tectonophysics 1983, 95, 233–252. [Google Scholar]
  7. Tang, L.L.; Wang, H.T.; Wang, Q. Anomaly study of ECRS method before mid-strong earthquakes in Xinjiang. China Earthq. Eng. J. 2011, 33, 159–165. [Google Scholar]
  8. Wang, H.T.; Wang, Q.; Tang, L.L. Comprehensive precursor anomaly research based on earthquake corresponding relevancy spectrum. Earthq. Sci. 2009, 22, 675–681. [Google Scholar] [CrossRef]
  9. Bo, W.J.; Wu, Y.L. Standardization methods for abnormal information flow and their applications. Crustal Deform. Earthq. 1993, 13, 9–15. [Google Scholar]
  10. Wang, Q.; Wang, H.T.; Tang, L.L. Research on multi-parameter comprehensive anomaly based on earthquake corresponding probability spectrum. Earthq. Res. China 2009, 25, 294–302. [Google Scholar]
  11. Lu, X.; Wang, Q.; Li, G.; Li, M.X.; Tang, L.L. Tracking study for earthquakes of the middle-south part of the Tancheng-Lujiang fault zone based on the ECRS method. Earthq. Res. China 2016, 25, 294–302. [Google Scholar]
  12. Bowman, D.D.; King, G.C.P. Accelerating Seismicity and stress Accumulation Before large earthquake. Geophys. Res. Lett. 2001, 28, 4039–4042. [Google Scholar] [CrossRef]
  13. Yin, X.C.; Mora, P.; Peng, K.Y.; Wang, Y.C.; Weatherly, D. Load-Unload Response Ratio and Accelerating Moment/Energy Release, Critical Region Scaling and Earthquake prediction. Pure Appl. Geophys. 2002, 159, 2511–2524. [Google Scholar]
  14. Zhang, Y.X.; Wu, Y.J.; Yin, X.C.; Peng, K.Y.; Zhang, L.P.; Yu, A.Q.; Zhang, X.T. Comparison Between LURR and State Vector Analysis Before Strong Earthquakes in Southern California Since 1980. Pure Appl. Geophys. 2008, 165, 737–748. [Google Scholar] [CrossRef]
  15. Rhoades, D.A.; Gerstenberger, M.C. Mixture models for improved short-term earthquake forecasting. Bull. Seismol. Soc. Am. 2009, 99, 636–646. [Google Scholar] [CrossRef]
  16. Rundle, J.B.; Tiampo, K.F.; Klein, W.; Martins, J.S.S. Self-organization in leaky threshold systems: The influence of near-mean field dynamics and its implications for earthquakes, neurobiology, and forecasting. Proc. Natl. Acad. Sci. USA 2002, 99, 2514–2521. [Google Scholar] [CrossRef] [PubMed]
  17. Rundle, J.B.; Turcotte, D.L.; Shcherbakov, R.; Klein, W.; Sammis, C. Statistical physics approach to understanding the multiscale dynamics of earthquake fault systems. Rev. Geophys. 2003, 41, 1019. [Google Scholar] [CrossRef]
  18. Yu, H.Z.; Zhu, Q.Y.; Zhou, F.R.; Tian, L.; Zhang, Y.X. An ensemble approach for improved short-to-intermediate-term seismic potential evaluation. Pure Appl. Geophys. 2017, 174, 2381–2399. [Google Scholar] [CrossRef]
  19. Zakupin, A.S.; Bogomolov, L.M.; Boginskaya, N.V. Using the load/unload response ratio and self-developing processes methods of analyzing seismic sequences to predict earthquakes in Sakhalin. Izv. Atmos. Ocean. Phys. 2020, 56, 693–705. [Google Scholar] [CrossRef]
  20. Yu, C.; Wang, S.Y.; Yu, H.Z. Study of the electromechanical coupling process before the 2020 MS6.4 Yutian, China Earthquake. Sci. Rep. 2022, 12, 17622. [Google Scholar] [CrossRef]
  21. Draz, M.U.; Shah, M.; Jamjareegulgarn, P.; Shahzad, R.; Hasan, A.M.; Ghamry, N.A. Deep machine learning based possible atmospheric and ionospheric precursors of the 2021 MW7.1 Japan earthquake. Remote Sens. 2023, 15, 1904. [Google Scholar] [CrossRef]
  22. Akyol, A.A.; Arikan, O.; Arikan, F. A machine learning-based detection of earthquake precursors using ionospheric data. Radio Sci. 2020, 55, e2019RS006931. [Google Scholar] [CrossRef]
  23. Huang, W.Q.; Li, W.X.; Cao, X.F. Study on Seismic Data Integrity in Chinese Mainland (II). Acta Seismol. Sin. 1994, 16, 423–432. [Google Scholar]
  24. Xu, S.X. Scoring of Earthquake Prediction Ability, Collection of Practical Researches on Earthquake Prediction Methods: Seismological Album, 1st ed.; Academic Book and Journal Publishing House: Beijing, China, 1989; pp. 586–589. [Google Scholar]
  25. Zhang, G.M.; Liu, J.; Shi, Y.L. A scientific evaluation of annual earthquake prediction ability. Acta Seismol. Sin. 2002, 24, 525–532. [Google Scholar] [CrossRef]
  26. Yuan, Z.Y.; Jiang, X.H.; Liu, Q. Assessment of the number of annual seismic risk areas based on R-value and statistical models. Earthq. Res. China 2023, 39, 88–97. [Google Scholar]
  27. Zechar, J.D.; Jordan, T.H. Testing alarm-based earthquake predictions. Geophys. J. Int. 2008, 172, 715–724. [Google Scholar] [CrossRef]
  28. Jiang, C.; Wu, Z. PI forecast for the Sichuan-Yunnan region: Retrospective test after the May 12, 2008, Wenchuan earthquake. Pure Appl. Geophys. 2010, 167, 751–761. [Google Scholar] [CrossRef]
Figure 1. Research regions in the western Chinese mainland.
Figure 1. Research regions in the western Chinese mainland.
Applsci 14 02401 g001
Figure 2. The value range spectrum curve.
Figure 2. The value range spectrum curve.
Applsci 14 02401 g002
Figure 3. Border regions of Qinghai and Xizang (within the dashed box) and ML ≥ 6.3 earthquakes from January 2021 to December 2021.
Figure 3. Border regions of Qinghai and Xizang (within the dashed box) and ML ≥ 6.3 earthquakes from January 2021 to December 2021.
Applsci 14 02401 g003
Figure 4. The multi-point sliding extreme-value relevancy (MSER) of the border regions of Qinghai and Xizang.
Figure 4. The multi-point sliding extreme-value relevancy (MSER) of the border regions of Qinghai and Xizang.
Applsci 14 02401 g004
Figure 5. The spatial evolution of the MSER of the border regions of Qinghai and Xizang from January 2019 to September 2021.
Figure 5. The spatial evolution of the MSER of the border regions of Qinghai and Xizang from January 2019 to September 2021.
Applsci 14 02401 g005
Figure 6. The spatial evolution of the MSER of the border regions of Qinghai and Xizang from January 2008 to September 2010.
Figure 6. The spatial evolution of the MSER of the border regions of Qinghai and Xizang from January 2008 to September 2010.
Applsci 14 02401 g006
Figure 7. Western section of southern Tianshan (within the dashed box) and ML ≥ 6.3 earthquakes from November 2016 to December 2021.
Figure 7. Western section of southern Tianshan (within the dashed box) and ML ≥ 6.3 earthquakes from November 2016 to December 2021.
Applsci 14 02401 g007
Figure 8. The MSER changes in the western section of southern Tianshan.
Figure 8. The MSER changes in the western section of southern Tianshan.
Applsci 14 02401 g008
Figure 9. The spatial evolution of the MSER of the western section of southern Tianshan from January 2016 to October 2017.
Figure 9. The spatial evolution of the MSER of the western section of southern Tianshan from January 2016 to October 2017.
Applsci 14 02401 g009
Figure 10. Central and western sections of the Qilian Mountains (within the dashed box) and ML ≥ 5.5 earthquakes from January 2016 to December 2021.
Figure 10. Central and western sections of the Qilian Mountains (within the dashed box) and ML ≥ 5.5 earthquakes from January 2016 to December 2021.
Applsci 14 02401 g010
Figure 11. The MSER changes of the central and western sections of the Qilian Mountains.
Figure 11. The MSER changes of the central and western sections of the Qilian Mountains.
Applsci 14 02401 g011
Figure 12. The spatial evolution of the MSER of central and western sections of the Qilian Mountains from June 2020 to September 2021.
Figure 12. The spatial evolution of the MSER of central and western sections of the Qilian Mountains from June 2020 to September 2021.
Applsci 14 02401 g012
Table 1. Anomaly-corresponding rate and earthquake-corresponding rate of different time scanning scales.
Table 1. Anomaly-corresponding rate and earthquake-corresponding rate of different time scanning scales.
NameAnomaly-Corresponding Rate (%)Earthquake-Corresponding Rate (%)
Time scanning scale (month)36123612
Study regionBorder regions of Qinghai and Xizang50.0066.6775.0071.43100.00100.00
Western section of southern Tianshan50.0066.67100.0071.4385.71100.00
Central and western sections of the Qilian Mountains77.7885.7185.7180.0080.0080.00
Table 2. Characteristics of multi-parameter relevancy spectrum anomalies in the western Chinese mainland.
Table 2. Characteristics of multi-parameter relevancy spectrum anomalies in the western Chinese mainland.
No.RegionThree Elements of PredictionR-Value
Time/DayIntensity/MLLocation
1Central Tianshan290≥6.0Central TianshanR = 0.55
R0 = 0.38
2Western section of southern Tianshan380≥6.3Western section of southern TianshanR = 0.44
R0 = 0.29
3Border regions of Qinghai and Xizang260≥6.3Border area of Qinghai and XizangR = 0.68
R0 = 0.35
4Yutian, Xinjiang190≥5.5Yutian, XinjiangR = 0.58
R0 = 0.29
5Nima in Xizang100≥5.5Nyima in XizangR = 0.32
R0 = 0.28
6The central and western section of the Qilian Mountains170≥5.5The central and western section of the Qilian MountainsR = 0.54
R0 = 0.35
7Eastern of Southern Xizang370≥6.0Eastern of Southern XizangR = 0.51
R0 = 0.36
Table 3. The anomaly-corresponding rates and earthquake-corresponding rates of single parameters and multiple parameters in different regions of Xinjiang [10].
Table 3. The anomaly-corresponding rates and earthquake-corresponding rates of single parameters and multiple parameters in different regions of Xinjiang [10].
Study RegionSingle ParameterMulti-Parameter
Anomaly-Corresponding Rate (%)Earthquake-Corresponding Rate (%)Anomaly-Corresponding Rate (%)Earthquake-Corresponding Rate (%)
West section of Tianshan54.3246.8866.6768.75
Baicheng-Kuche region49.6658.2970.5095.00
Korla region55.6463.6970.5983.33
Keping region73.1255.71100.0077.50
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lu, X.; Wang, Q.; Zhang, X.; Yan, W.; Meng, L.; Wang, H. Machine Learning-Based Precursor Detection Using Seismic Multi-Parameter Data. Appl. Sci. 2024, 14, 2401. https://doi.org/10.3390/app14062401

AMA Style

Lu X, Wang Q, Zhang X, Yan W, Meng L, Wang H. Machine Learning-Based Precursor Detection Using Seismic Multi-Parameter Data. Applied Sciences. 2024; 14(6):2401. https://doi.org/10.3390/app14062401

Chicago/Turabian Style

Lu, Xian, Qiong Wang, Xiaodong Zhang, Wei Yan, Lingyuan Meng, and Haitao Wang. 2024. "Machine Learning-Based Precursor Detection Using Seismic Multi-Parameter Data" Applied Sciences 14, no. 6: 2401. https://doi.org/10.3390/app14062401

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop