Microseismic Velocity Inversion Based on Deep Learning and Data Augmentation

Li, Lei; Zeng, Xiaobao; Pan, Xinpeng; Peng, Ling; Tan, Yuyang; Liu, Jianxin

doi:10.3390/app14052194

Open AccessArticle

Microseismic Velocity Inversion Based on Deep Learning and Data Augmentation

by

Lei Li

^1,2

,

Xiaobao Zeng

^1,2,

Xinpeng Pan

^1,2,*,

Ling Peng

^1,2,

Yuyang Tan

³ and

Jianxin Liu

^1,2

¹

Key Laboratory of Metallogenic Prediction of Nonferrous Metals and Geological Environment Monitoring (Central South University), Ministry of Education, Changsha 410083, China

²

School of Geosciences and Info-Physics, Central South University, Changsha 410083, China

³

Frontiers Science Center for Deep Ocean Multispheres and Earth System, Key Lab of Submarine Geosciences and Prospecting Techniques MOE, College of Marine Geosciences, Ocean University of China, Qingdao 266100, China

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2024, 14(5), 2194; https://doi.org/10.3390/app14052194

Submission received: 20 February 2024 / Revised: 1 March 2024 / Accepted: 4 March 2024 / Published: 6 March 2024

(This article belongs to the Special Issue Machine Learning Applications in Seismology)

Download

Browse Figures

Versions Notes

Abstract

:

Microseismic monitoring plays an essential role for reservoir characterization and earthquake disaster monitoring and early warning. The accuracy of the subsurface velocity model directly affects the precision of event localization and subsequent processing. It is challenging for traditional methods to realize efficient and accurate microseismic velocity inversion due to the low signal-to-noise ratio of field data. Deep learning can efficiently invert the velocity model by constructing a mapping relationship from the waveform data domain to the velocity model domain. The predicted and reference values are fitted with mean square error as the loss function. To reduce the feature mismatch between the synthetic and real microseismic data, data augmentation is also performed using correlation and convolution operations. Moreover, a hybrid training strategy is proposed by combining synthetic and augmented data. By testing real microseismic data, the results show that the Unet is capable of high-resolution and robust velocity prediction. The data augmentation method complements more high-frequency components, while the hybrid training strategy fully combines the low-frequency and high-frequency components in the data to improve the inversion accuracy.

Keywords:

microseismic velocity inversion; deep learning; data augmentation; hybrid training; Unet

1. Introduction

Microseismic monitoring plays an important role for both fault/fracture characterization and seismic risk analysis in unconventional reservoirs and rock masses [1,2,3,4,5]. Most current microseismic inversion procedures require realistic velocity models. For example, the reliability of microseismic inversion and interpretation depends heavily on the accuracy of the velocity model [6,7]. However, most microseismic velocity models used in production are directly adapted from the well-logging curves, which are generally approximate to simplified models and may be contaminated by noise. Various velocity model calibration methods have been proposed based on traveltime (difference)-based inversion [8,9,10]. Additionally, full waveform inversion (FWI), as a strong inversion tool, has also been introduced to microseismic inversion [11,12]. However, FWI usually involves a higher computational demand and is also affected by cycle skipping due to the sinusoidal nature of the wavefield and complex scattering [13]. Cycle skipping can lead convergence at local minima and thus yield incorrect velocity models.

Traditional traveltime-based velocity inversion and full-waveform inversion rely on data quality, such as signal-to-noise ratio (SNR) [14]. However, the real microseismic data are usually of low SNR, which largely affects the accuracy of the inversion. In addition, traditional velocity inversion methods rely on the accuracy of the initial velocity. Recently, deep learning (DL) has shown excellent capabilities for nonlinear mapping function approximation in computer vision, especially in the tasks of reconstructing models and high-resolution images [15,16]. The development of DL has also brought new opportunities to seismic and microseismic data processing and inversion [17], such as signal denoising [18], signal identification and classification [19,20], first-arrival picking [21,22,23], source location [24], and velocity model building and calibration [25]. Using seismic waveforms as the feature input and velocity models as the labels, the trained models with the nonlinear mapping capability of neural networks can effectively predict velocity models from seismic waveforms. There are already several studies on using DL algorithms to invert velocity models. Araya-Polo et al. [26] extracted features from the acquired seismic data and proposed using deep convolutional neural networks (DCNNs), instead of seismic tomography, to reconstruct velocity models. Yang et al. [27] proposed a supervised deep fully convolutional neural network (FCN) approach to build velocity models directly from raw seismic data.

However, there are only a few studies on DL-based downhole microseismic velocity inversion to take advantage of the nonlinear mapping ability of deep neural networks (DNNs) to carry out velocity inversion tasks [28,29]. Unlike velocity model inversion in active seismology, there is generally only one velocity model corresponding to hundreds, possibly even thousands, of microseismic events. The combination of abundant microseismic events within restricted regions and limited velocity model information hinders dataset construction and network performance. Additionally, microseismic processing and interpretation is dependent on activities and geology in the region of interest, which may limit the availability of past microseismic events for DL algorithms. In this sense, the training data play a vital role to ensure the learning performance of the network. FWI in active seismology relies heavily on low-frequency components [30], while field microseismic data generally contain higher frequency contents than active seismic data, and the high-frequency information might be missing in synthetic data considering the computational expense. Yang et al. [31] found that integrating physical information with synthetic data can improve the effectiveness of the training data and network performance. Alkhalifah et al. [32] employed the domain adaptation approach to introduce real signal features into the synthetic data by correlation and convolution operations. They demonstrated the effectiveness of domain adaptation by applying it to seismic imaging problems. Wu et al. [33] proposed to integrate domain knowledge to impose prior constraints for geophysical problems, which can improve the generalizability and interpretability of DNN models.

In this study, we adopt the Unet model to construct a mapping relationship between microseismic waveform data and the velocity model. The data augmentation is implemented by correlation and convolution operations to alleviate the feature differences between the training and real data. We also propose a hybrid training strategy to better integrate the low-frequency feature in synthetic data and high-frequency feature in augmented data. By testing real data of downhole microseismic monitoring, we demonstrate that the proposed data augmentation and hybrid training strategy is reliable and effective in predicting microseismic velocity models.

2. Methodology

2.1. Velocity Inversion and Network Architecture

The velocity inversion can be expressed as the minimization of the following objective function:

J = {‖d^{s y n} - d^{o b s}‖}_{2}

(1)

where

J

is the objective function,

{‖‖}_{2}

denotes the Euclidean norm,

d^{s y n}

is the synthetic data vector, and

d^{o b s}

is the recorded data vector.

Conventional methods for velocity inversion include seismic tomography and full-waveform inversion, which are based on travel time and waveform, respectively. As mentioned before, the two methods rely on the data quality and the setting of the initial velocity model, both of which cannot be well satisfied in microseismic monitoring. In this paper, we use neural networks to solve this nonlinear function. Neural networks can create strongly nonlinear mappings between microseismic gathers and velocities by building multiple hidden layers:

v = N e t (d; θ)

(2)

where

v \equiv [v_{p}, v_{s}]

denotes the predicted velocity value, and

θ

indicates the total weight in the network. The training process of the network is realized through forward propagation and back propagation in the network models to update the

θ

. The testing process involves directly predicting the velocity model by inputting waveform data to the trained model.

We adopt the Unet (Figure 1), as it has shown great potential for many geophysical inversion tasks [34,35]. We make microseismic data and the associated velocity model

\{d, v\}

in pairs as the network input. We use the leaky rectified linear unit (LeakyReLU) activation function, which alleviates the problems of gradient vanishing and allows for a better fitting of the model [36].

2.2. Data Augmentation

Domain adaptation refers to learning when the feature distributions of the source and target domains are inconsistent [37]. It aims to narrow the distribution gap between the two domains to achieve a better learning performance in the target domain. Based on the idea of domain adaptation, data augmentation is achieved by linear operations of correlation and convolution operations between synthetic and real data [38]:

\bar{d_{s}^{i}} (t) = d_{s}^{i} (t) \otimes d_{s}^{k} (t) * d_{r}^{i j} (t) \otimes d_{r}^{i j} (t)

(3)

where

i

is the index of the single trace,

j

is the index of an arbitrary event of the real data,

k

is the index of the reference trace and we set

k = 1

,

\bar{d_{s}^{i}} (t)

is the new augmented data,

d_{s}^{i} (t)

is the single trace of the synthetic data,

d_{s}^{k} (t)

is the reference trace of the synthetic data,

d_{r}^{i j} (t)

is the single trace of the real data,

\otimes

is the correlation operator, and

*

is the convolution operator.

Here, we randomly select one reference field event for each synthetic event corresponding to each stage and set the first trace as the reference trace. The high-frequency information in the real data can be implicitly introduced through the operations in Equation (3). The correlation operation can eliminate the effects of recording time delays between the synthetic and real data. The data augmentation operation can reduce the feature difference between the training (source) synthetic data and the (target) real data and will finally contribute to enhancing the performance of the neural network model when applying to the real data.

2.3. Loss Functions and Quantitative Metrics

Deep learning-based microseismic velocity inversion is a regression problem. We use

M S E

as the loss function to fit the reference velocity model and the predicted values:

L_{M S E} (x_{i}, {\bar{x}}_{i}) = \frac{1}{N} \sum_{i = 1}^{N} {(x_{i} - {\bar{x}}_{i})}^{2}

(4)

where

N

is the total number of pixels in a single velocity image;

x_{i}

and

{\bar{x}}_{i}

are a reference velocity value and a predicted value, respectively.

We use the regression metrics peak signal-to-noise ratio

(P S N R)

, structural similarity

(S S I M)

, and mean absolute error

(M A E)

to quantify the prediction results and evaluate the inversion performance [39,40,41].

P S N R

reflects the degree of global reconstruction of the velocity image. The

P S N R

unit is dB, and the larger the value, the better the inversion performance:

P S N R (x, \bar{x}) = 20 \times \log_{10} (\frac{M a x (x)}{\sqrt{M S E (x, \bar{x})}})

(5)

where

x

and

\bar{x}

denote the velocity label and inverted velocity, respectively.

Local structure and detail are important factors when recovering a velocity model. To evaluate the performance of the network model in reconstructing the local details, we use

S S I M

to characterize the similarity between the predicted velocity model and the reference velocity model. The values range from 0 to 1. The higher the value, the lower the image distortion, indicating that the predicted velocity model is closer to the ground truth:

S S I M (x, \bar{x}) = \frac{(2 μ_{x} μ_{\bar{x}} + G_{1}) (2 σ_{x \bar{x}} + G_{2})}{(μ_{x}^{2} + μ_{\bar{x}}^{2} + G_{1}) (σ_{x}^{2} + σ_{\bar{x}}^{2} + G_{2})}

(6)

where

μ_{x}

and

μ_{\bar{x}}

represent the mean values of

x_{i}

and

{\bar{x}}_{i}

values, respectively,

σ_{x}

and

σ_{\bar{x}}

are their standard deviations,

σ_{x \bar{x}}

denotes their covariance, and

G_{1}

and

G_{2}

represent the constants to avoid a zero denominator.

M A E

is utilized to evaluate the variation in velocity across the stratigraphic interface. The lower the value, the lower the error:

M A E (x_{i}, {\bar{x}}_{i}) = \frac{1}{N} \sum_{i = 1}^{N} |x_{i} - {\bar{x}}_{i}|

(7)

2.4. Training Procedure

We investigate three different training strategies, training only the synthetic dataset, training only the augmented dataset, and the hybrid training strategy:

l o s s = \{\begin{cases} l o s s_s y n, e p o c h < e p o c h s_s y n \\ w \times l o s s_a u g, e l s e \end{cases}

(8)

where

e p o c h s_s y n

is the number of epochs when training the synthetic data, and

w

is a weight coefficient that indicates the smoothness of the loss curve, enabling the loss value to have a smooth transition from the synthetic data training stage to the augmented data training stage.

In our single-stage and multi-stage examples, we use different parameter settings. The optimizers are Adam. After many rounds of parameter tuning and tests, we finally select the following hyperparameters: the batch sizes are 32, and

w

values are 0.1, while the learn rates are 0.001 and 0.0001, training epochs are 200 and 300, and

e p o c h s_s y n

has values of 80 and 140, respectively.

We work with a PyTorch implementation of the neural network [42]. All network training and testing in this study was performed on a CPU with a frequency of 2.90 GHz and 512 GB RAM.

3. Data

To generate more training data, we prepare a horizontally layered model adapted from a field downhole monitoring of five-stage hydraulic fracturing [10], as shown in Figure 2 and Figure 3a. There are 395 events in total and the event numbers from stage 1 to stage 5 are 105, 116, 48, 66, and 60, respectively. The field microseismic data contain three components and we consider only the Z component to reduce the number of operations. The acquisition system consists of 15 receivers (black reverse triangles) placed at a constant spacing of 20 m in a vertical linear array. Each trace has 1201 samples with a time interval of 0.5 ms. Four-layer velocity models are constructed referring to the velocity model from traveltime inversion with eight ball-hit events [10]. We obtain 200 velocity models by adding random ±10% perturbations to the P- and S-wave velocities with fixed layer depths. We randomly set 30 source locations in the source region (Figure 3a) for each velocity model. The velocity model has a size of 64 × 200, with a grid spacing of 5 m. A Ricker wavelet with a peak frequency of 100 Hz is used as the source function. We use 6000 synthetic gathers (200 models × 30 sources) as the initial training dataset. The testing dataset included 105 field microseismic events from stage 1 (corresponding to a single reference velocity model).

Figure 3b shows the results of the power spectra comparison. The augmented data approaches the real data in terms of energy distribution by retaining more high-frequency contents of the real data. The exemplary synthetic and real microseismic waveform data are shown in Figure 3c–f.

4. Result

4.1. Single-Stage Examples

For single-stage examples, we focus on the feasibility of the network and training strategy. The overall quantitative metrics are listed in Table 1. As indicated in Equation (8), the hybrid dataset here denotes a hybrid strategy involving both synthetic and augmented data. It shows that the hybrid training strategy outperforms the other two training strategies for almost all metrics in the velocity inversion task under the same conditions.

The predicted one-dimensional velocity profiles of the Unet model using the three training strategies are shown in Figure 4. The displayed velocity values correspond to two arbitrary events and are averaged along the horizontal direction. We can find that augmented data and the hybrid training strategy yield better fittings to the reference velocity model. Figure 5 shows the two-dimensional profiles corresponding to Figure 4b by the hybrid training strategy. Training with the synthetic data involves first learning the low-frequency information in the data, and then it can provide an initial velocity model (Figure 5c,d). The model obtained by training the synthetic data (low frequency) may also predict high-frequency velocity components with the real data (with high frequency), but the results have a large error since the model did not learn these high-frequency features. After training with the augmented data containing high-frequency information, the model improves the precision of the predicted velocity models (Figure 5e,f).

4.2. Robustness Testing

In order to further evaluate the superiority of the proposed data augmentation method and hybrid training strategy, we carry out robustness tests on the real data of the first stage. We denoise the real data by wavelet filtering to obtain the clean signals, and then calculate the SNR of the real data [43]:

S / N = 10 \times \log_{10} (\frac{S_{c}}{S_{n}}) = 10 \times \log_{10} (\frac{S_{c}}{S_{r} - S_{c}})

(9)

where

S / N

is the SNR,

S_{r}

is the real data signal,

S_{c}

is the clean signal after denoising the real data signal, and

S_{n}

is the noise of the real data signal.

The distribution of the SNRs for all events in the first stage is shown in Figure 6. Most of the SNRs of the real events are lower than 5 dB. We select a sample event (

S / N = 3.44

) to quantitatively evaluate the stability and robustness of the network. The predicted two-dimensional and one-dimensional velocity profiles of the Unet model using the three training strategies are shown in Figure 7 and Figure 8. The detailed values of quantitative metrics are listed in Figure 7. The results suggested that the data augmentation method can significantly improve the prediction accuracy of purely synthetic data by introducing real data information. Moreover, the hybrid training strategy effectively utilizes the useful information of the synthetic data in the low-frequency components and yields the best inversion results.

4.3. Multi-Stage Examples

From the results of single-stage examples, we believe that the augmented data and hybrid training strategy have higher accuracies for efficient velocity inversion. Therefore, we try to expand the research area to consider more fracturing stages. We consider all five stages, corresponding to five reference velocity models. We generate 12,000 gathers (1000 models × 12 sources) as the initial training dataset. The quantitative metrics are shown in Table 2. Compared to single-stage examples, the predictions are generally worse due to the combined effects of increased area and characteristics and limited field samples. Please also note that these metrics are mean values for all the predictions in five stages. The one-dimensional velocity profiles and the loss curves are shown in Figure 9 and Figure 10, respectively. The predictions for the first stage (Figure 9a) are better than other stages (Figure 9b), especially for the two deep layers, mainly due to the largest number and best coverage of the microseismic events in the first stage. The hybrid training strategy can achieve slightly faster convergence rates than the other two strategies.

5. Discussion and Conclusions

We attempt to directly invert the velocity models from microseismic waveforms in this study. The testing results with purely synthetic data demonstrate the Unet model can predict the layered velocity model quite well and in an efficient manner. Since the predicted velocity models are almost the same as the real ones and thus do not contain much information, we do not show those simple results in this manuscript. Zhou et al. [29] demonstrated the effectiveness of a modified Attention Unet in predicting complex synthetic velocity models with microseismic records. They did not consider field microseismic data and adopted Gaussian noise to evaluate the robustness of the model, while we used field data to enhance the synthetic data by data augmentation operations. We also investigate and test many other scenarios by considering different SNRs, source locations, source mechanisms, and model numbers and sizes to mimic the field cases. Specially, the number and coverage of real microseismic events largely determine the features and constraints that can be extracted by the network model. However, these cases just introduce more complicated features which require a larger training dataset and computation expense. Further investigation of the influential factors on deep learning-based microseismic velocity inversion is out of the scope of the current study.

The disadvantage of most current deep learning algorithms is the heavy dependence on the training dataset and weak generalization capability. The introduced data augmentation method and hybrid training strategy proved to be effective in alleviating the feature gap in data domains and improving the generalization ability of the network model, which may provide guidance for other deep learning-based seismic inversion tasks. Transfer learning is also helpful to fill the feature gap, but also relies on the scale of the training data. Another feasible approach to realize seismic inversion with a limited training dataset is combing data-driven algorithms with the physical laws of seismic wave propagation, to provide more physical constraints and optimize the learning performance. In this work, we only consider a horizontally layered model, which is the most-commonly used model in microseismic processing. We will investigate the performance of the proposed method on heterogeneous models and compare it with conventional velocity inversion methods (e.g., FWI method). One of the advantages of deep learning methods is the weak dependence on the raypath coverage since we can train the model with a large and complete dataset.

In this paper, we propose an improved deep learning method for microseismic velocity inversion. The synthetic data are augmented to incorporate the features of the real data, and a hybrid training strategy that integrates the synthetic and augmented data is introduced. The Unet model can directly predict the layered velocity model from microseismic waveforms. Training the synthetic data involves first learning the low-frequency information in the data, and then it can provide an initial velocity model. Then, the augmented data are trained to learn the high-frequency information, which can improve the precision of the predicted velocity model. The hybrid training strategy makes better use of the data and enables the model to learn more imbedded connections between the waveforms and velocity models. Field downhole microseismic examples demonstrate the feasibility and superiority of the proposed method for efficient inversion of microseismic velocity models.

Author Contributions

Formal analysis, L.L. and X.Z.; Investigation, L.L., X.Z. and Y.T.; Methodology, L.L., X.Z. and L.P.; Supervision, X.P. and J.L.; Writing—original draft, L.L. and X.Z.; Writing—review and editing, X.P., L.P., Y.T. and J.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported the National Natural Science Foundation of China, grant number 42374076; Natural Science Foundation for Excellent Young Scholars of Hunan Province, China, grant number 2022JJ20057; Central South University Innovation-Driven Research Programme, grant number 2023CXQD063.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The dataset for this research is available by contacting the corresponding author. The data are not publicly available due to privacy.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Eaton, D.W. Passive Seismic Monitoring of Induced Seismicity: Fundamental Principles and Application to Energy Technologies, 1st ed.; Cambridge University Press: Cambridge, UK, 2018; ISBN 978-1-107-14525-2. [Google Scholar]
Li, L.; Tan, J.; Wood, D.A.; Zhao, Z.; Becker, D.; Lyu, Q.; Shu, B.; Chen, H. A Review of the Current Status of Induced Seismicity Monitoring for Hydraulic Fracturing in Unconventional Tight Oil and Gas Reservoirs. Fuel 2019, 242, 195–210. [Google Scholar] [CrossRef]
Meng, X.-B.; Chen, H.-C.; Niu, F.-L.; Du, Y.-J. Master Event Based Backazimuth Estimation and Its Application to Downhole Microseismic Monitoring. Pet. Sci. 2022, 19, 2675–2682. [Google Scholar] [CrossRef]
Li, L.; Tan, J.; Tan, Y.; Pan, X.; Zhao, Z. Chapter Eight: Microseismic Analysis to Aid Gas Reservoir Characterization. In Sustainable Natural Gas Reservoir and Production Engineering; Wood, D.A., Cai, J., Eds.; Elsevier: Amsterdam, The Netherlands, 2022; pp. 219–242. [Google Scholar]
Tomassi, A.; Milli, S.; Tentori, D. Synthetic Seismic Forward Modeling of a High-Frequency Depositional Sequence: The Example of the Tiber Depositional Sequence (Central Italy). Mar. Pet. Geol. 2024, 160, 106624. [Google Scholar] [CrossRef]
Jansky, J.; Plicka, V.; Eisner, L. Feasibility of Joint 1D Velocity Model and Event Location Inversion by the Neighbourhood Algorithm. Geophys. Prospect. 2010, 58, 229–234. [Google Scholar] [CrossRef]
Li, L.; Tan, J.; Schwarz, B.; Staněk, F.; Poiata, N.; Shi, P.; Diekmann, L.; Eisner, L.; Gajewski, D. Recent Advances and Challenges of Waveform-based Seismic Location Methods at Multiple Scales. Rev. Geophys. 2020, 58, e2019RG000667. [Google Scholar] [CrossRef]
Warpinski, N.R.; Sullivan, R.B.; Uhl, J.E.; Waltman, C.K.; Machovoe, S.R. Improved Microseismic Fracture Mapping Using Perforation Timing Measurements for Velocity Calibration. SPE J. 2005, 10, 14–23. [Google Scholar] [CrossRef]
Pei, D.; Quirein, J.A.; Cornish, B.E.Q.; Quinn, D.; Warpinski, N.R. Velocity Calibration for Microseismic Monitoring: A Very Fast Simulated Annealing (VFSA) Approach for Joint-Objective Optimization. Geophysics 2009, 74, WCB47–WCB55. [Google Scholar] [CrossRef]
Tan, Y.; He, C.; Mao, Z. Microseismic Velocity Model Inversion and Source Location: The Use of Neighborhood Algorithm and Master Station Method. Geophysics 2018, 83, 1JA-Z18. [Google Scholar] [CrossRef]
Igonin, N.; Innanen, K.A. Analysis of Simultaneous Velocity and Source Parameter Updates in Microseismic FWI. In Proceedings of the SEG Technical Program Expanded Abstracts 2018, Anaheim, CA, USA, 27 August 2018; Society of Exploration Geophysicists: Houston, TX, USA, 2018; pp. 1033–1037. [Google Scholar]
Wang, H.; Alkhalifah, T. Microseismic Imaging Using a Source Function Independent Full Waveform Inversion Method. Geophys. J. Int. 2018, 214, 46–57. [Google Scholar] [CrossRef]
Virieux, J.; Operto, S. An Overview of Full-Waveform Inversion in Exploration Geophysics. Geophysics 2009, 74, WCC1–WCC26. [Google Scholar] [CrossRef]
Guitton, A.; Díaz, E. Attenuating Crosstalk Noise with Simultaneous Source Full Waveform Inversion. Geophys. Prospect. 2012, 60, 759–768. [Google Scholar] [CrossRef]
LeCun, Y.; Bengio, Y.; Hinton, G. Deep Learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef] [PubMed]
Mousavi, S.M.; Beroza, G.C. Deep-Learning Seismology. Science 2022, 377, eabm4470. [Google Scholar] [CrossRef] [PubMed]
Anikiev, D.; Birnie, C.; bin Waheed, U.; Alkhalifah, T.; Gu, C.; Verschuur, D.J.; Eisner, L. Machine Learning in Microseismic Monitoring. Earth-Sci. Rev. 2023, 239, 104371. [Google Scholar] [CrossRef]
Zhang, H.; Ma, C.; Pazzi, V.; Zou, Y.; Casagli, N. Microseismic Signal Denoising and Separation Based on Fully Convolutional Encoder–Decoder Network. Appl. Sci. 2020, 10, 6621. [Google Scholar] [CrossRef]
Shang, G.; Li, L.; Zhang, L.; Liu, X.; Li, D.; Qin, G.; Li, H. Research on Automatic Classification of Coal Mine Microseismic Events Based on Data Enhancement and FCN-LSTM Network. Appl. Sci. 2023, 13, 11158. [Google Scholar] [CrossRef]
Ma, C.; Ran, X.; Xu, W.; Yan, W.; Li, T.; Dai, K.; Wan, J.; Lin, Y.; Tong, K. Fine Classification Method for Massive Microseismic Signals Based on Short-Time Fourier Transform and Deep Learning. Remote Sens. 2023, 15, 502. [Google Scholar] [CrossRef]
Liu, N.; Chen, J.; Wu, H.; Li, F.; Gao, J. Microseismic First-Arrival Picking Using Fine-Tuning Feature Pyramid Networks. IEEE Geosci. Remote Sens. Lett. 2022, 19, 7505105. [Google Scholar] [CrossRef]
Yuan, S.-Y.; Zhao, Y.; Xie, T.; Qi, J.; Wang, S.-X. SegNet-Based First-Break Picking via Seismic Waveform Classification Directly from Shot Gathers with Sparsely Distributed Traces. Pet. Sci. 2022, 19, 162–179. [Google Scholar] [CrossRef]
Zhang, Y.; Leng, J.; Dong, Y.; Yu, Z.; Hu, T.; He, C. Phase Arrival Picking for Bridging Multi-Source Downhole Microseismic Data Using Deep Transfer Learning. J. Geophys. Eng. 2022, 19, 178–191. [Google Scholar] [CrossRef]
Wamriew, D.; Dorhjie, D.B.; Bogoedov, D.; Pevzner, R.; Maltsev, E.; Charara, M.; Pissarenko, D.; Koroteev, D. Microseismic Monitoring and Analysis Using Cutting-Edge Technology: A Key Enabler for Reservoir Characterization. Remote Sens. 2022, 14, 3417. [Google Scholar] [CrossRef]
Wamriew, D.; Pevzner, R.; Maltsev, E.; Pissarenko, D. Deep Neural Networks for Detection and Location of Microseismic Events and Velocity Model Inversion from Microseismic Data Acquired by Distributed Acoustic Sensing Array. Sensors 2021, 21, 6627. [Google Scholar] [CrossRef] [PubMed]
Araya-Polo, M.; Jennings, J.; Adler, A.; Dahlke, T. Deep-Learning Tomography. Lead. Edge 2018, 37, 58–66. [Google Scholar] [CrossRef]
Yang, F.; Ma, J. Deep-Learning Inversion: A next-Generation Seismic Velocity Model Building Method. Geophysics 2019, 84, R583–R599. [Google Scholar] [CrossRef]
Wamriew, D.; Charara, M.; Pissarenko, D. Joint Event Location and Velocity Model Update in Real-Time for Downhole Microseismic Monitoring: A Deep Learning Approach. Comput. Geosci. 2022, 158, 104965. [Google Scholar] [CrossRef]
Zhou, Y.; Han, L.; Zhang, P.; Zeng, J.; Shang, X.; Huang, W. Microseismic Data-Direct Velocity Modeling Method Based on a Modified Attention U-Net Architecture. Appl. Sci. 2023, 13, 11166. [Google Scholar] [CrossRef]
Xu, X.; Guo, P.; Yang, J.; Xu, W.; Tong, S. Compensating Low-Frequency Signals for Prestack Seismic Data and Its Applications in Full-Waveform Inversion. IEEE Trans. Geosci. Remote Sens. 2023, 61, 5920814. [Google Scholar] [CrossRef]
Yang, Y.; Zhang, X.; Guan, Q.; Lin, Y. Enhancing Data-Driven Seismic Inversion Using Physics-Guided Spatiotemporal Data Augmentation. In Proceedings of the First International Meeting for Applied Geoscience & Energy Expanded Abstracts, Denver, CO, USA, 1 September 2021; Society of Exploration Geophysicists: Houston, TX, USA, 2021; pp. 1395–1399. [Google Scholar]
Alkhalifah, T.; Wang, H.; Ovcharenko, O. MLReal: Bridging the Gap between Training on Synthetic Data and Real Data Applications in Machine Learning. Artif. Intell. Geosci. 2022, 3, 101–114. [Google Scholar] [CrossRef]
Wu, X.; Ma, J.; Si, X.; Bi, Z.; Yang, J.; Gao, H.; Xie, D.; Guo, Z.; Zhang, J. Sensing Prior Constraints in Deep Neural Networks for Solving Exploration Geophysical Problems. Proc. Natl. Acad. Sci. USA 2023, 120, e2219573120. [Google Scholar] [CrossRef]
Ronneberger, O.; Fischer, P.; Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation. In Medical Image Computing and Computer-Assisted Intervention—MICCAI 2015; Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F., Eds.; Lecture Notes in Computer Science; Springer International Publishing: Cham, Switzerland, 2015; Volume 9351, pp. 234–241. ISBN 978-3-319-24573-7. [Google Scholar]
Zhang, S.-B.; Si, H.-J.; Wu, X.-M.; Yan, S.-S. A Comparison of Deep Learning Methods for Seismic Impedance Inversion. Pet. Sci. 2022, 19, 1019–1030. [Google Scholar] [CrossRef]
Xu, B.; Wang, N.; Chen, T.; Li, M. Empirical Evaluation of Rectified Activations in Convolutional Network. arXiv 2015, arXiv:1505.00853. [Google Scholar]
Glorot, X.; Bordes, A.; Bengio, Y. Domain Adaptation for Large-Scale Sentiment Classification: A Deep Learning Approach. In Proceedings of the 28th International Conference on Machine Learning (ICML-11), Bellevue, WA, USA, 28 June–2 July 2011; pp. 513–520. [Google Scholar]
Wang, H.; Alkhalifah, T. Direct Microseismic Event Location and Characterization from Passive Seismic Data Using Convolutional Neural Networks. Geophysics 2021, 86, KS109–KS121. [Google Scholar] [CrossRef]
Zhang, Z.; Lin, Y. Data-Driven Seismic Waveform Inversion: A Study on the Robustness and Generalization. IEEE Trans. Geosci. Remote Sens. 2020, 58, 6900–6913. [Google Scholar] [CrossRef]
Li, F.; Guo, Z.; Pan, X.; Liu, J.; Wang, Y.; Gao, D. Deep Learning with Adaptive Attention for Seismic Velocity Inversion. Remote Sens. 2022, 14, 3810. [Google Scholar] [CrossRef]
Li, S.; Liu, B.; Ren, Y.; Chen, Y.; Yang, S.; Wang, Y.; Jiang, P. Deep-Learning Inversion of Seismic Data. IEEE Trans. Geosci. Remote Sens. 2020, 58, 2135–2149. [Google Scholar] [CrossRef]
Paszke, A.; Gross, S.; Massa, F.; Lerer, A.; Bradbury, J.; Chanan, G.; Killeen, T.; Lin, Z.; Gimelshein, N.; Antiga, L.; et al. PyTorch: An Imperative Style, High-Performance Deep Learning Library. In Proceedings of the 33rd Conference on Neural Information Processing Systems, Vancouver, BC, Canada, 8–14 December 2019; pp. 1–12. [Google Scholar]
Zhao, R.; Cui, H. Improved Threshold Denoising Method Based on Wavelet Transform. In Proceedings of the 7th International Conference on Modelling, Identification and Control (ICMIC 2015), Sousse, Tunisia, 18–20 December 2015; p. 1. [Google Scholar]

Figure 1. Unet network architecture. Gathers are input features, and the outputs are velocity models. Each box represents the output feature map of the convolutional layer. The number at the top of each box indicates the channel number in the corresponding feature map. The encoder consists of a convolution layer with a 3 × 3 convolution kernel size (blue arrow), a batch normalization (BN) layer, a leaky rectified linear unit (LeakyReLU), and a 2 × 2 maximum pooling layer and the Dropout layer (yellow arrow). Each decoder replaces the maximum pooling layer with a 5 × 5 transposed convolution layer (black arrows). Skip connections indicate the corresponding channel feature maps connecting the encoder and decoder sections (green arrows).

Figure 2. The layout of a real downhole microseismic monitoring project. (a) Three-dimensional view. (b) Side view of (a). Black reverse triangles indicate the receivers and the dots are microseismic events.

Figure 3. Model and data. (a) A horizontally layered model for downhole microseismic monitoring. The black rectangle indicates the region where the sources are located, the red pentagram indicates an arbitrary source, and black reverse triangles indicate the receivers. (b) Power spectra comparison. (c) The original noise-free synthetic waveforms generated by ray-tracing. (d) Real microseismic data. (e) Result of the real data autocorrelation. (f) The augmented data for the synthetic waveforms in (c).

Figure 4. One-dimensional profiles of the reference and predicted velocity values of two arbitrary events from the first stage. (a) Velocity curves of one sample event. The red solid and dashed lines indicate the reference velocity for P- and S-wave, respectively, and the blue, magenta, and black dashed lines indicate the results of training with synthetic, augmented, and hybrid dataset. (b) Velocity curves of another sample event. The meanings of the symbols and colors are the same with (a).

Figure 5. Two-dimensional profiles of the reference and predicted velocity values of an arbitrary event from the first stage. (a,b) The reference P- and S-wave velocities. (c,d) Predictions of P- and S-wave velocities trained with the synthetic dataset (when the epoch is 80). (e,f) Predictions of P- and S-wave velocities trained with both the synthetic and the augmented dataset (when the epoch is 200).

Figure 6. The distribution of SNRs of the events of the first stage.

Figure 7. Two-dimensional profiles of the predicted velocity values of the sample event. (a,b) Predictions of P- and S-wave velocities trained with the synthetic dataset only. (c,d) Predictions of P- and S-wave velocities trained with the augmented dataset only. (e,f) Predictions of P- and S-wave velocities trained with the hybrid strategy involving both synthetic and augmented data. The reference velocity models are shown in Figure 5a,b.

Figure 8. One-dimensional profiles of the reference and predicted velocity values of the sample event. The meanings of the symbols and colors are the same as Figure 4.

Figure 9. One-dimensional profiles of the reference and predicted velocity values of two arbitrary events from two stages. (a) Velocity curves of one sample event. The red solid and dashed lines indicate the reference velocity for the P- and S-wave, respectively, and the blue, magenta, and black dashed lines indicate the results of training with synthetic, augmented, and hybrid dataset. (b) Velocity curves of another sample event. The meanings of the symbols and colors are the same as Figure 4.

Figure 10. The loss curves for three different training strategies.

Table 1. The mean values of quantitative metrics for single-stage examples.

Training Dataset	Phase	PSNR	SSIM	MAE
Synthetic	P	19.88	0.7097	272.243
Synthetic	S	19.92	0.8139	133.958
Augmented	P	27.90	0.8644	113.514
Augmented	S	27.71	0.8912	57.431
Hybrid	P	30.04	0.8591	93.143
Hybrid	S	29.60	0.8911	48.308

Table 2. The mean values of quantitative metrics for multi-stage examples.

Training Dataset	Phase	PSNR	SSIM	MAE
Synthetic	P	18.75	0.7094	314.843
Synthetic	S	19.16	0.7209	155.461
Augmented	P	21.50	0.7030	228.985
Augmented	S	17.89	0.6759	152.441
Hybrid	P	22.24	0.7478	221.382
Hybrid	S	18.46	0.6369	165.114

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Li, L.; Zeng, X.; Pan, X.; Peng, L.; Tan, Y.; Liu, J. Microseismic Velocity Inversion Based on Deep Learning and Data Augmentation. Appl. Sci. 2024, 14, 2194. https://doi.org/10.3390/app14052194

AMA Style

Li L, Zeng X, Pan X, Peng L, Tan Y, Liu J. Microseismic Velocity Inversion Based on Deep Learning and Data Augmentation. Applied Sciences. 2024; 14(5):2194. https://doi.org/10.3390/app14052194

Chicago/Turabian Style

Li, Lei, Xiaobao Zeng, Xinpeng Pan, Ling Peng, Yuyang Tan, and Jianxin Liu. 2024. "Microseismic Velocity Inversion Based on Deep Learning and Data Augmentation" Applied Sciences 14, no. 5: 2194. https://doi.org/10.3390/app14052194

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Microseismic Velocity Inversion Based on Deep Learning and Data Augmentation

Abstract

1. Introduction

2. Methodology

2.1. Velocity Inversion and Network Architecture

2.2. Data Augmentation

2.3. Loss Functions and Quantitative Metrics

2.4. Training Procedure

3. Data

4. Result

4.1. Single-Stage Examples

4.2. Robustness Testing

4.3. Multi-Stage Examples

5. Discussion and Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI