Next Article in Journal
Boundary-Aware Deformable Spiking Neural Network for Hyperspectral Image Classification
Previous Article in Journal
Mangrove Forest Cover Change in the Conterminous United States from 1980–2020
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Technical Note

Automatic Detection of VLF Tweek Signals Based on the YOLO Model

1
Department of Space Physics, School of Electronic Information, Wuhan University, Wuhan 430072, China
2
Hubei Luojia Laboratory, Wuhan 430072, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2023, 15(20), 5019; https://doi.org/10.3390/rs15205019
Submission received: 23 August 2023 / Revised: 19 September 2023 / Accepted: 14 October 2023 / Published: 19 October 2023

Abstract

:
Tweek signals are a special type of VLF (very low frequency) pulse, originally produced by lightning discharge, which becomes dispersive after repetitive bounces within the waveguide between the Earth’s surface and lower ionosphere. As such, tweek signals carry critical information about the region near the reflection height of the VLF waves, namely the D-region ionosphere. Although tweek measurements have been widely utilized in studies of the D-region ionosphere and lightning discharge, few statistical studies have been conducted, mainly due to the difficulty of manually identifying tweek signals from the enormous amount of VLF data with heavy noise. Considering the importance of tweek signals and the lack of a high-precision detection model, in this study, we propose a method to automatically and accurately pick out tweek signals from VLF measurements. This method is explicitly developed based on the you only look once (YOLO) model and a post-tracing process. Using a total of 2495 randomly selected VLF spectrogram images as the testing set, we evaluated the performance of this method. The precision and recall are found to be 92.0% and 89.2% for the first-order mode, and 97.5% and 86.7% for the first-two-order mode tweek, respectively. The time needed to process 10-s VLF measurements with a cadence of 4 μs is only 6.5 s, allowing for identifying the tweek signals from continuous VLF measurements in real time. Therefore, this method represents a reliable means to automatically detect tweek signals and enables the opportunity to statistically investigate the D-region ionosphere and lightning discharge via these signals.

1. Introduction

Very low frequency (VLF, 3–30 kHz) waves in the Earth’s atmosphere mainly originate from lightning discharge or man-made navy transmitters. These waves can travel long distances within the Earth–ionosphere waveguide (EIWG) with a typical attenuation coefficient of ~2–3 dB/Mm [1], being well trapped between the ground and the D-region ionosphere [2,3]. The D-region ionosphere refers to the partially ionized atmosphere at altitudes of 60–100 km. Being too low for satellite-based instruments and too high for balloon-based instruments, this region still remains insufficiently investigated. The electron density in the D-region ionosphere is too low to be effectively measured by instruments that are specifically designed for other regions of the ionosphere [4,5]. Sudden perturbations in the D-region ionosphere could lead to abnormal changes in the VLF transmitter signals; for example, the electron density change at D-region altitudes produced by solar flares [6,7,8,9], solar eclipses [10,11], lightning discharge [12,13,14] and electron precipitation from the radiation belts [15]. Therefore, the VLF technique has been widely utilized to remotely sense the lower ionosphere and represents one of the most reliable and effective approaches for long-term observation of the D-region ionosphere [16,17,18,19].
Lightning-generated radio signals, known as sferics, can propagate in the EIWG with different transmission waveguide modes [20]. When the wave frequency is lower than the cutoff frequency of the waveguide mode, it becomes dissipative and is shortly attenuated. Therefore, the higher-order modes cannot propagate below the cutoff frequencies of EIWG [20]. Tweeks, named after their unique clicking sound, are a special type of VLF signal produced by the dispersion of lightning sferics propagating in the EIWG [21,22,23], recorded as the lower frequency component arriving later in time. Due to the strong attenuation of lightning sferics during daytime conditions, tweeks are mainly recorded at night [24,25]. However, Singh et al. [24] reported tweek measurements at the Allahabad (25.40°N, 81.93°E) and Nanital (29.35°N, 79.45°E) station in India, when partial nighttime conditions were produced by the obscuration of solar fluxes during solar eclipses. Ohya et al. [25] first reported the measurement of tweek signals during daytime conditions and without solar eclipses. The daytime tweeks were observed under geomagnetically quiet conditions and had an average duration of approximately 12 ms, notably shorter than that of nighttime tweeks. Moreover, the daytime reflection height was found to be similar to that of nighttime but with more significant variation [25].
Tweek signals provide direct information about the source of lightning discharges, as well as the D-region ionosphere from which they are reflected [26,27]. Moreover, the dispersion feature and the cutoff frequency of tweek signals can be used to inversely sense the reflection height of VLF waves [28], namely the height of the D-region ionosphere and the corresponding electron density [29,30]. Yusop et al. [30] investigated tweeks received at the Gakona station (62.71°N, 143.99°W) from January to April 2011 and found that the occurrence rate was higher during the equinox compared to the winter season. The ionospheric reflection height was found to be almost invariant in the high-, mid-, and low-latitude regions, mostly between 75–91 km, with a typical electron density of 22–27 el/cm3. The tweek signals can be further utilized to estimate the distance from the source lightning discharge and potentially used in lightning identification [28]. Kumar et al. [28] analyzed the propagation distances of tweeks observed at the Suva station (18.2°S, 178.3°E) in Fiji. For 90% of the observed tweeks, the propagation distance was found to be between 1000–5000 km.
Despite their importance explained above, few statistical studies of the D-region ionosphere have been conducted using tweek measurements, mainly due to the difficulty of identifying tweek signals, particularly if one needs to pick out these signals manually from enormous VLF data with heavy noise. Therefore, it is of great importance to develop a reliable method to automatically detect the tweek signals and then conduct statistical studies. Some methods exist that are dedicated to solving this problem, most of which, however, first process/filter the VLF data, and then pick out tweek signals based on the dispersive feature [31,32,33]. More recently, Zhou et al. [34] proposed an automatic tweek detection model, based on the maximum entropy spectral estimation (MESE) method, by setting some thresholds for the morphological feature of tweek signals and achieved a detection accuracy of 77.4%. Nevertheless, as it was developed based on the traditional image processing method by setting threshold values, the applicability of this model in identifying target signals other than tweeks is limited due to the variation of background noise.
Artificial neural networks (ANNs) and deep learning methods have been greatly developed over the past decade and widely used in VLF studies [35,36,37,38]. Golden et al. [35] trained ANNs using 19 hand-selected features to classify the VLF spectrograms measured at the Palmer station into hiss, chorus, or noise. An accuracy of 92% was achieved after training using 10,109 hiss or chorus events. To identify whistler signals, Konan et al. [36] utilized the you only look once (YOLO) model, and the misdetection and false alarm of the trained model were 0.082 and 0.002, respectively. Harid et al. [37] used 1038 VLF spectrogram images collected in Antarctica to train the mask-scoring regional convolutional neural network (MSRCNN), and an accuracy of 92.14% was obtained when picking out whistler waves from VLF measurements. Moreover, Maslej-Krešňáková et al. [38] developed a method based on YOLO-v5 to distinguish between lightning sferics and tweek signals. The entire structure of tweek signals was used as the target objection, and the precision of this model in identifying tweek signals was found to be 58%, with a recall of 56%.
Given the importance of tweek signals and the lack of high-precision models to automatically identify those signals, in this study, we propose an automatic detection model. The goal is to pick out tweek signals from long-term VLF measurements and, therefore, better understand the D-region ionosphere and lightning discharge via statistical studies. In the following, Section 2 introduces the VLF measurements, data processing, and how the training, validation, and testing datasets are built. Section 3 describes the structure and details of the tweek detection method. Section 4 presents the evaluation results of model performance, and the conclusion and discussion are given in Section 5.

2. VLF Data

2.1. VLF Measurements and Data Processing

In this paper, we use the data collected using the VLF detection system developed by Wuhan University [39,40,41]. The receiver consists of two orthogonal magnetic loop antennas, set up in both East–West (EW) and North–South (NS) directions. It specifically uses a 16-bit A/D converter chip with 250 kSPS (kilo samples per second) and can detect radio waves with frequencies of 1–50 kHz, with a dynamic range of ~110 dB and a timing accuracy of ~100 ns. More details of this detection system can be found in Chen et al. [42]. A set of this receiver was deployed in Suizhou, China (31.57°N, 113.32°E) in 2016, a location with a quiet electromagnetic environment. This station has been operational since January 2016 and has been measuring ELF/VLF emissions in two alternative modes: daytime mode (08:00–18:00 LT, first 10 s of every minute) and nighttime mode (18:00–08:00 LT, 10 s on and 10 s off). Tweeks were mostly measured during nighttime conditions rather than daytime [29,30,43], and in this study, we mainly focused on the VLF data collected during local nighttime.
We perform a short-time Fourier transform (STFT) on VLF measurements and calculate the spectrograms with a time resolution of 2 ms and a frequency resolution of 24 Hz. According to the waveguide mode theory [20], the cutoff frequency of the second-order mode tweek is ~3.4 kHz for a reflection height of 88 km. To ensure the integrity of the first two modes, we use spectrogram images between 1–5 kHz for the training and testing of our model. Considering that the duration of each tweek signal is approximately 50 ms [44,45], we divide the 10-s raw VLF data into 20 segments, with a duration of 0.5 s for each segment. This duration is chosen since a clear tweek structure can be observed in spectrograms with 0.5-s duration, and using a shorter duration would unnecessarily increase the computation time.
Figure 1 shows a typical spectrogram of VLF data measured at the Suizhou station between 00:50:00 and 00:50:10 LT (local time in Suizhou, UTC+8) on 1 November 2020. The VLF data are normalized so that the maximum value is unity. Figure 1a shows the spectrogram of the 10-s VLF data at frequencies of 1–50 kHz. The horizontal lines correspond to VLF signals from man-made transmitters at discrete frequencies. The frequencies of major operating transmitters are mainly between 10–30 kHz, e.g., 19.8 kHz for the NWC transmitter in Australia, 22.2 kHz for the JJI transmitter in Japan, and 25.2 kHz for the NLM transmitter in the United States. The vertical lines are lightning sferics, typically centered at frequencies from a few to a few tens of kilohertz. Unlike tweeks, sferics exhibit no time dispersion. Its time domain waveform rapidly decays to the background level and has no dispersive feature in the frequency domain. Figure 1b is a zoom-in view of Figure 1a in the frequency range from 1 to 5 kHz. We see two tweek signals at ~1.5 and ~3.5 s, as marked using red boxes. These two tweeks are shown separately in Figure 1c and 1d, respectively. The tweek at ~1.5 s contains the first- and second-order modes, whereas the one at ~3.5 s only contains the first-order mode.

2.2. VLF Dataset

To build the training, validation, and testing datasets, we first calculate the spectrogram images using the VLF data recorded at the Suizhou station. Spectrogram images with dimensions of 224 × 224 pixels are obtained using the STFT method. Second, these spectrograms are divided into three groups: the training set, the validation set, and the testing set. The training and validation sets are used to train the model, while the testing set is used to evaluate the performance of the model. For the training and validation sets, we manually draw the ground truth bounding boxes using LabelImg v1.8.1 (available at https://github.com/tzutalin/labelImg, accessed on 15 March 2021). The annotated files are then converted into the PASCAL VOC format [46] for ease of validation. As for the testing set, the spectrograms are not labeled but manually inspected in order to evaluate the effectiveness of our method. A total of 2275 images are labeled and then divided into the training and validation sets with a ratio of 9:1, corresponding to 2047 and 228 images in the training and validation sets, respectively. On the other hand, the testing set contains another 2495 spectrogram images. Table 1 presents the total number of spectrogram images and tweek images contained in the training, validation, and testing datasets.
A main obstacle in identifying tweek signals from the enormous VLF data is that the noise signal has a similar temporal structure to tweeks. The two most frequent types are band-limited noise and environmental interference, as illustrated in Figure 2c,d. The performance of the signal detection model would be highly limited if the trained model cannot distinguish the target signal from those noise signals with similar structures. As such, in this study, we use a dataset that contains both tweeks and tweek-like noise signals in order to unbiasedly train the detection model.
The most prominent feature of tweeks is the tail-like structure (denoted as the “tail” in the following) caused by the temporal dispersion of electromagnetic waves with different frequencies propagating within EIWG. Different from the model of Maslej-Krešňáková et al. [38], which uses the entire signal as the detection target, we use the tail as the detection target in this study. Figure 2a,b shows typical tweek signals; the dispersive tails are manually labeled and used as the training object in the detection model. In addition to tweeks, we have included some common types of VLF spectrograms during quiet-time conditions as negative samples, as well as noise signals, in the dataset. Figure 2c shows a spectrogram image of band-limited noise, which is often observed at frequencies of 1–2 kHz in the VLF data. Figure 2d shows a spectrogram image of wave-like noise, which most likely originates from some artificial interference. Although this kind of noise is uncommon, a quarter of this signal appears to be similar to tweeks after a rotation of 90° in either the clockwise or counterclockwise direction. This could somehow mislead the detection model. Figure 2e shows a spectrogram image of the tweek signals, but, different from Figure 2a,b, this signal is distorted and, thus, not considered a target signal. Figure 2f shows a typical spectrogram of lightning sferics. Note that lightning sferics are not as dispersive and do not contain the tail of tweek signals. These types of signals are included in the dataset in order to enhance the robustness of our model.

3. Automatic Tweek Detection Method

The method that we propose to automatically detect tweek signals mainly consists of two parts: an object detection part and a post-tracing part. The object detection part, as developed based on the YOLO model, is firstly utilized to provide raw predictions of tweek signals from VLF spectrogram images. The post-tracing part is then applied to reexamine the predicted tweek signal, and to depict the spectral peaks for the next-step analysis. In the following, we describe the details of the object detection part, the post-tracing process, and the hyperparameters in Section 3.1, Section 3.2, and Section 3.3, respectively.
The architecture of our detection method is shown explicitly in Figure 3a. The VLF spectrogram images are first processed using the YOLO model, which provides a raw prediction of tweek signals. In order to extract the dispersive feature, we then trace the tail of the predicted tweek signals in order to find out the spectral peak. Finally, low-quality tweek signals are discarded based on the spectral peaks, and results such as the occurrence time, number of modes, and cutoff frequency are derived.

3.1. The Object Detection Model

Object detection is a classic problem in the research area of computer vision, and various algorithms have been employed in machine learning studies to classify objects and draw bounding boxes. YOLO is one of the most widely used object-detection models, and it stands out mainly because of its relatively simpler structure and faster detection speed: the detection accuracy of YOLO could be largely improved after several iterations, and its effectiveness in detecting small targets is especially noteworthy.
The YOLO-v5 model is an improved version of the widely used YOLO-v4 model [47] and has many different versions: YOLO-v5x, v5l, v5m, and v5s, based on the complexity of the networks. Compared with other versions, YOLO-v5x uses a more complex structure and can achieve better performance, but it is extremely computationally demanding, and the training and detection time is considerably longer than other versions. The main goal of this study is to detect tweek signals, in real time, from continuous VLF measurements, which requires the processing time to be sufficiently short. Tweek signals contain a clear tail structure and are not difficult to detect if one uses object detection algorithms. Therefore, as a compromise between the accuracy and processing time, the YOLO-v5s model is chosen in the present study.
Figure 3b shows the block diagram of the object detection model developed based on YOLO-v5s, corresponding to the second box in the architecture shown in Figure 3a. This model inputs spectrogram images with sizes of 224 × 224 pixels. It consists of the backbone network, neck, and prediction parts. The training images are first cut, rotated, and merged by mosaic data augmentation (MDA). The backbone part is the main component of the detection network, and its main purpose is to extract features from the input images to generate a feature pyramid. A pre-trained cross-stage partial dark-net53 (CSPDarknet53) network is used for the backbone, which is an updated version of the cross stage partial network (CSPN) [48]. To preserve useful information from the input images, a Focus module is used to slice the input images and increase the number of input channels while down-sampling the input. The output of the Focus module is a 28 × 28 × 256 feature, which is further converted into two higher-level feature (14 × 14 × 512 and 7 × 7 × 2048) via convolution.
The neck part is composed of a feature pyramid network (FPN) [49] and path aggregation network (PAN) [50]. FPN enhances the entire feature pyramid by combining low-resolution and high-level features with high-resolution and low-level features using a top-down architecture with lateral connections. The top-down architecture contains two up-sampling processes (the red arrows in Figure 3b), which ultimately generate 14 × 14 × 256 and 28 × 28 × 256 features and combines them with the original feature pyramid. PAN adds a bottom-up pathway to FPN for feature fusion, which in turn enhances the feature pyramid by incorporating accurate localization signals in lower-level features. Two down-sampling processes (the green arrows in Figure 3b) are applied in the bottom-up pathway and this generates 14 × 14 × 512 and 7 × 7 × 1024 features. The main purpose of the neck part is to build a feature pyramid with high-resolution and semantically strong features.
The prediction part divides the features into grids and draws bounding boxes of dispersive tails as the raw predictions. The outputs of the detection model are the coordinates, height, and width of the bounding box for the dispersive tail and the probability of prediction. The predicted tweek signals are further examined using a post-tracing process.

3.2. The Post-Tracing Process

Based on the bounding boxes provided by the detection model in the first step, we then depict the spectral peaks along the tweek tail, namely a post-tracing process. The main purpose is to (1) reexamine the predicted tweek signal; (2) determine if a first-order or second-order mode tweek belongs to the same lightning discharge; (3) extract information such as the occurrence time and cutoff frequency.
In order to provide sufficient data points for the analysis of VLF reflection height, we have recalculated the spectrogram images with higher temporal and frequency resolutions. The time resolution of VLF spectrograms in the tracing process is 0.4 ms with a frequency resolution of 4 Hz, compared to 2 ms and 24 Hz for the spectrogram images used in the first step (see the difference between Figure 2 and Figure 4). Because of the relatively lower resolution, some first two-order-mode tweeks could be mistakenly detected as first-order mode by the detection model. This error can be corrected thanks to the higher resolution used in the post-tracing process. Note that this resolution is not used in the first step of the tweek detection since it would significantly increase the computation time.
Before tracing the tweek signal, we first pick out those first two-order-mode tweeks, namely if a first-order tweek and a second-order tweek that occur close in time belong to the same lightning discharge. The time corresponding to the left edge of the bounding box is regarded as the start time of the n-th mode tweeks:  t s n . The time corresponding to the right edge of the bounding box is the end time of the n-th mode tweeks:  t e n . The frequency corresponding to the lower edge of the bounding box is recorded as  f c n . Figure 4a shows an example of a first-order mode tweek with  t s 1  = 40.58 s and  f c 1  = 1.78 kHz, while Figure 4b shows an example of a first two-order-mode tweek, with  t s 1  = 21.46 s,  f c 1  = 1.76 kHz,  t s 2  = 21.46 s, and  f c 2  = 3.61 kHz.
We use the following criteria to pick out first two-order-mode tweeks: (1) the time difference between the first-order and second-order mode is smaller than T; (2)  | f c 2 f c 1 2 |  is smaller than α, meaning that  f c 2 f c 1  ∈ (2 − α, 2 + α). The start time of the first two-order- mode tweek is set to the smaller value between  t s 1  and  t s 2 , and the end time is set to be the larger value between  t e 1  and  t e 2 .
The parameters T and α can somewhat influence the effectiveness in detecting first-two-order-mode tweeks. To check the dependence of model performance on these two parameters, we analyzed the VLF dataset recorded on 5 March 2020, excluding the error samples provided by the detection model, using different combinations of T and α values. Table 2 summarizes the testing results obtained using five different combinations of T and α values. The detection model found 162 dispersive tails, of which 60 were first-order mode and 51 were first-two-order mode. In Table 2, the tweeks predicted correctly by the above-mentioned criteria are marked as “positive”, otherwise as “negative”. When T increased to 10 ms and α increased to 0.2, the ratio between positive and negative cases increased notably. However, larger values of T and α could lead to unnecessary errors. Therefore, in this study, we use the following values: T = 10 ms and α = 0.3.
Typical altitudes at which VLF waves are reflected are in the range of 60–100 km. According to the waveguide mode theory (Budden, 1961), for these reflection altitudes, the cutoff frequency of the first-order and second-order mode tweeks are  f c 1  = 1.5–2.5 kHz and  f c 2  = 3.0–5.0 kHz, respectively. These two frequency ranges are specifically used for the tracing of the first- and second-order mode tweeks in this study. We trace the tweeks predicted in the first step in order to find out the spectral peak dataset ( t i n ,   f i n ), where n represents the n-th order of waveguide mode,  f i n  is the frequency at different arrival times  t i n  along the tweek tail.
The tracing process is explicitly conducted with the following steps: (1) for the first-order mode tweeks, the first time stamp, at which the maximum spectral intensity in the range of 1.5–3.0 kHz is greater than a threshold value of −20 dB, is recorded as  t 1 1 . The first spectral peak is recorded as ( t 1 1 ,   f 1 1 ), where  f 1 1  is the frequency corresponding to the maximum spectral intensity at  t 1 1 ; (2) we then search forward in time and find out all pairs of spectral peaks ( t i 1 ,   f i 1 ) until the spectral intensity is less than −20 dB. The mean value of the frequencies corresponding to the last five data points is chosen to be the cutoff frequency for the n-th mode. An example of this sequence of spectral peaks is shown using black points in Figure 4a. In this case, 28 spectral peaks were extracted for a first-order mode tweek and the cutoff frequency was 1.89 kHz.
Similarly, we traced the second-order mode tweeks. First, the time with the highest frequency in the range of 3.0–5.5 kHz and within five time-steps of  t 1 1  was regarded as the start time  t 1 2 . We then search forward in time and find out all the pairs of spectral peaks until the spectral intensity is less than −20 dB. Figure 4b shows an example of the tracing results for both the first-order and second-order tweeks; 49 and 37 spectral peaks were found for the first-order and second-order modes, respectively. The cutoff frequency was found to be 1.81 kHz and 3.72 kHz for the first-order and second-order modes, respectively.

3.3. Hyperparameters

In general, neural network models have two kinds of parameters: model parameters and hyperparameters. The model parameters are those learned from the dataset, while hyperparameters are those set manually. To achieve optimal training performance, we conducted a parametric study to evaluate the dependence of model performance on hyperparameters, especially the learning rate, optimizer, and epoch. The default learning rate was 0.1, the default optimizers of YOLO-v5 were the stochastic gradient descent (SGD) and the adaptive moment estimation (Adam), and the default epoch was 300.
The learning rate determines if and how fast the model converges to the optimum condition. The training loss obtained with the default hyperparameters is considered as the baseline and shown as the green curve in Figure 5a. With a learning rate of 0.1, the training loss decreased slowly after 150 epochs, reaching approximately 0.0333 after 300 epochs. The training loss corresponding to the learning rate of 0.01 and 0.001 are shown as the blue and orange lines, respectively, in Figure 5a. Compared with the default learning rate (0.1), the training loss corresponding to the learning rate of 0.01 and 0.001 decreased faster before 150 epochs, reaching 0.0280 and 0.0276 after 300 epochs, respectively. After 200 epochs, the training loss obtained with the learning rate of 0.01 was always slightly lower than that of 0.001. Therefore, the learning rate of 0.01 provided the best performance since the convergence speed was faster, and the training loss was lower compared with the other two values. Thus, the learning rate was set to be 0.01 in this study.
Optimizer is the method used to update parameters by backpropagation during the training process. The training loss for the SGD and Adam optimizers are shown as green and orange curves, respectively, in Figure 5b. It is clear that the training loss of the Adam optimizer converges much slower compared to SGD, while the training loss of Adam is notably higher than SGD. Thus, the SGD optimizer is used in this study.
One epoch refers to the process of forward and backward propagation of all data input into the neural network. While training, the model performance gradually changes from the initial unfitting state to the overfitting state, as shown in Figure 5c. Figure 5c shows the training loss (green curves) and validation loss (blue curves) versus epochs. The training loss first decreased rapidly with increasing epochs before ~100 epochs. As for the validation loss, it first decreased before 400 epochs and then increased after 400 epochs. An increase in the validation loss is clearly evident around 400 and 600 epochs, indicating that the model was overfitting. Thus, 400 was chosen as the final epoch in this study.
After a series of tests, we adopted the following hyperparameters in this study: batch = 16, epoch = 400, learning rate = 0.01, optimizer = SGD, and intersection over union (IoU) threshold = 0.45. The training, testing, and detection of our model were conducted on a computer with the following specifications: Intel(R) Core (TM) i9-10850K CPU @ 3.60 GHz, 32 GB RAM, NVIDIA GTX1080Ti GPU (12 GB). It takes approximately 1.8 h to train the detection model with a single GPU using the above-mentioned datasets and hyperparameters. The time needed for processing 10-s of raw VLF data and providing the predicted tweeks is approximately ~6.5 s, which fulfills the need to identify tweek signals in real time.

4. Model Performance

To evaluate the performance of this model, we use the occurrence rate of tweek signals as a means of evaluating the detection accuracy. A sample is regarded as a true positive only if it is correctly detected. If a tweek signal containing the first two-order-mode is detected as a first-order mode, it is considered a false positive for the first-order mode tweek and a false negative for the first two-order-mode tweek. A sample is also considered as a false positive if this method detects a tweek signal, while the VLF spectrogram does not contain any tweek. This is manually checked using the testing set, which includes 2495 randomly selected 0.5-s spectrogram images.
For the 2495 spectrogram images in the testing dataset, we first manually checked and found 129 first-order and 181 first-two-order mode tweeks. On the other hand, the detection method found 115 and 157 true positive cases for the first-order and first-two-order mode tweeks, respectively. Ten first-order and four first two-order-mode tweeks identified by our method turn out to be false positives. Thus, the precision and recall for the first-order-mode tweeks are 92.0% (115/125) and 89.2% (115/129), respectively. The precision and recall of the first two-order-mode tweeks are 97.5% (157/161) and 86.7% (157/181), respectively. In addition, to examine the influence of the ratio between training and validation datasets on the model performance, another model was trained, with the ratio being 8:2 between the training and the validation datasets. With this new model and the same testing dataset, 117 and 152 true positive cases were found for the first-order and first-two-order mode tweeks, respectively. Fourteen first-order and six first-two-order mode tweeks by this new model are actually false positives. Thus, the precision and recall for the first-order-mode tweek are 89.3% (117/131) and 90.7% (117/129), respectively. The precision and recall of the first two-order-mode tweeks are 96.2% (152/158) and 84.0% (152/181), respectively. The comparison of model performance between the old (ratio 9:1) and new (ratio 8:2) models are shown in Table 3. The performance of the new model trained with the ratio of 8:2 was not significantly different from that of the old model, which was due to the decrease of training data. If the size of the training dataset remains unchanged, the precision and recall rates could be even closer.
Figure 6 shows the cases that were missed and mistakenly detected by our method. The lightning sferics shown in Figure 6a were incorrectly detected by our method since, due to the interference with the narrowband noise, lightning sferics turned out to be similar to the tweek signals. In general, the duration of these signals was short, resulting in insufficient spectral peaks for the tracing process, which could be eliminated by setting a threshold for the number of spectral peaks in the tracing process. Figure 6b shows an example in which the detection model only identified the first-order mode and missed the second-order mode. This is mainly because the amplitude of those tweek signals that are not captured by our method was too weak. Figure 6c shows an example in which the duration of the second-order mode was too short and, thus, missed by manual labeling and our model.
Figure 6d shows the tracing results for the tweek signal shown in Figure 6c; the second-order mode tweek can be correctly detected in the post-tracing process but cannot be detected in the 0.5-s spectrogram by the YOLO model. Figure 6e shows a case in which the tweek signals were mistakenly detected during the tracing process. This spectrogram image contained intense background noise with decreasing frequency, which was mistakenly detected as a second-order mode tweek. This type of misdetection mainly occurs for the second-order mode tweek, which may cause some first-order modes to be misclassified as first two-order-mode tweeks. However, we have checked that this scenario accounts for less than 1% of all the tests that we performed. Figure 6f shows an example in which a first two-order-mode tweek was missed in the tracing process, and the second-order-mode tweek was discontinuous in the spectrogram image, only with 14 spectral peaks.

5. Discussion and Conclusions

In this study, we have proposed a method to automatically identify tweek signals from continuous VLF measurements. This method was developed based on image processing and machine learning algorithms and explicitly uses the YOLO-v5 model to identify the dispersive tail of the tweek signals. A post-tracing process is applied in order to reexamine the prediction results and depict the spectral peak of tweek signals, thereby providing the occurrence time, the modes, and the cutoff frequencies. Moreover, we have evaluated the performance of this method using randomly selected spectrogram images.
To check the effectiveness, this method has been employed to identify tweek signals from a testing set that consists of 2495 randomly selected spectrogram images. This method found 115 and 157 true positive cases for the first-order and first two-order-mode tweeks, respectively, corresponding to a precision of 92.0% and 97.5%. The recall was found to be 89.2% and 86.7% for the first-order and first two-order-mode tweeks, respectively. The time needed to process 10-s continuous VLF measurements with a cadence of 4 μs cadence was, on average, 6.5 s. With this level of precision and detection time, our method represents a reliable approach to identify tweek signals from continuous VLF measurements in real time.
Compared with previously proposed models for tweek detection [34,38], the detection accuracy of our method is largely improved, as well as the efficiency. The model of Zhou et al. [34] utilized the traditional image processing method and the detection accuracy was 77.4%, compared to 92.0% for the present method (first-order mode tweeks). Although the model of Maslej-Krešňáková et al. [38] was also built upon YOLO-v5, the detection accuracy and recall for the tweek signals were 58.0% and 56.0%. Note that this model is not particularly developed for tweek signals, but to distinguish between lightning sferics and tweeks. As such, this model utilized the entire lightning signal as the detection target, while we have only used the tweek tail as the detection target in this study. Therefore, the precision and recall rate of our model is notably improved. However, it is important to note that the performance of our method in detecting tweeks with higher-order modes could be limited if the energy density is too low and the pulse duration is too short. To further improve the accuracy of our method in identifying weak tweek signals is complicated and, thus, left for future studies.
Another advantage of our method is that it is adaptive for the detection of other types of VLF signatures. We have demonstrated the effectiveness of YOLO-v5 in detecting tweek signals in this study. One advantage of the ANNs module is that the physical model is unconstrained. After manually labeling the data, this method is also applicable to other types of VLF signals that are of critical importance in space physics research, for example, the chorus, hiss, and lightning-generated whistler waves [35].
The Suizhou station has been operational since 2016, and we have collected more than 7 years of VLF data at this location. More recently, a VLF receiver has been deployed at the Great Wall station in Antarctica. Our next-step study is to apply the method explained in this manuscript and to build large datasets of tweek signals from the measurements in Suizhou and Antarctica, which is almost impossible to be accomplished manually. Such tweek datasets would be helpful for studies related to the D-region ionosphere and lightning discharge. Moreover, given the detection efficiency of our method, we plan on incorporating this method as a built-in feature in VLF receivers, with the goal of providing an estimate of the condition of the D-region ionosphere in real time.

Author Contributions

Data curation, X.G., S.W., Q.W. and M.H.; methodology, X.G., W.M., W.X. and S.W.; validation, X.G., W.M., W.X. and W.C.; investigation, W.X., X.G., W.M. and J.F.; writing—original draft preparation, X.G., W.M. and W.X.; writing—review and editing, X.G., W.X., W.M. and B.N.; project administration, X.G., W.X., S.W. and B.N.; funding acquisition, X.G., B.N. and W.X. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Key R&D Program of China (Grant No. 2022YFF0503700), the National Natural Science Foundation of China (Grant Nos. 42188101, 42025404, 41874195, 42274205), and the open fund of Hubei Luojia Laboratory (Grant No. 220100051).

Data Availability Statement

The data and codes used to generate all figures and results in this paper are available at https://doi.org/10.5281/zenodo.7964529. The data manager for the WHU VLF detection system is Xudong Gu (guxudong@whu.edu.cn).

Acknowledgments

We are grateful to the Editor and anonymous reviewers for their assistance in evaluating this paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Yamashita, M. Propagation of tweek atmospherics. J. Atmos. Terr. Phys. 1978, 40, 151–156. [Google Scholar] [CrossRef]
  2. Barr, R.; Llanwyn, J.D.; Rodger, C.J. ELF and VLF radio waves. J. Atmos. Sol. Terr. Phys. 2000, 62, 1689–1718. [Google Scholar] [CrossRef]
  3. Cohen, M.B.; Inan, U.S.; Gołkowski, M.; McCarrick, M.J. ELF/VLF wave generation via ionospheric HF heating: Experimental comparison of amplitude modulation, beam painting, and geometric modulation. J. Geophys. Res. Space Phys. 2010, 115, A02302. [Google Scholar] [CrossRef]
  4. Thomson, N.R.; Clilverd, M.A.; McRae, W.M. Nighttime ionospheric D region parameters from VLF phase and amplitude. J. Geophys. Res. Space Phys. 2007, 112, A07304. [Google Scholar] [CrossRef]
  5. Gasdia, F.; Marshall, R.A. Assimilating VLF Transmitter Observations With an LETKF for Spatial Estimates of the D-Region Ionosphere. IEEE Trans. Geosci. Remote Sens. 2020, 58, 3526–3543. [Google Scholar] [CrossRef]
  6. Thome, G.D.; Wagner, L.S. Electron density enhancements in the E and F regions of the ionosphere during solar flares. J. Geophys. Res. 1971, 76, 6883–6895. [Google Scholar] [CrossRef]
  7. Gu, X.; Luo, F.; Peng, R.; Li, G.J.; Chen, H.; Wang, S.W.; Yi, J.; Li, Z.P.; Ni, B.B.; Zhao, Z.Y.; et al. Response characteristics of very low frequency signals from JJI transmitter to solar flare events. Chin. J. Geophys. 2021, 64, 1508–1517. [Google Scholar] [CrossRef]
  8. Gu, X.; Yi, J.; Wang, S.; Hu, Z.; Xu, W.; Ni, B.; Li, B.; He, F.; Chen, X.; Hu, H. Comparison of VLF Signal Responses to Solar Flares along Daytime and Nighttime Propagation Paths. Remote Sens. 2023, 15, 1018. [Google Scholar] [CrossRef]
  9. Xu, W.; Gu, X.; Ni, B.; Wang, S.; Yang, Z.; Cheng, W.; Hu, Z.J.; He, F.; Li, B.; Chen, X.C.; et al. Measurements and modeling of the responses of VLF transmitter signals to X-class solar flares at the Great Wall Station in Antarctica. Space Weather 2023, 21, e2022SW003249. [Google Scholar] [CrossRef]
  10. Xu, W.; Marshall, R.A.; Kero, A.; Turunen, E.; Drob, D.; Sojka, J.; Rice, D. Vlf measurements and modeling of the d-region response to the 2017 total solar eclipse. IEEE Trans. Geosci. Remote Sens. 2019, 57, 7613–7622. [Google Scholar] [CrossRef]
  11. Gu, X.; Peng, R.; Wang, S.W.; Ni, B.B.; Luo, F.; Li, G.J.; Li, Z.P. Responses of the very low frequency transmitter signals during the solar eclipse on Dec 26, 2019 over a North-South propagation path. IEEE Trans. Geosci. Remote Sens. 2022, 60, 2000207. [Google Scholar] [CrossRef]
  12. Marshall, R.A.; Inan, U.S.; Chevalier, T.W. Early VLF perturbations caused by lightning EMP-driven dissociative attachment. Geophys. Res. Lett. 2008, 35, L21807. [Google Scholar] [CrossRef]
  13. Marshall, R.A.; Inan, U.S. Two-dimensional frequency domain modeling of lightning EMP-induced perturbations to VLF transmitter signals. J. Geophys. Res. Space Phys. 2010, 115, A00E29. [Google Scholar] [CrossRef]
  14. Gołkowski, M.; Renick, C.; Cohen, M.B. Quantification of ionospheric perturbations from lightning using overlapping paths of VLF signal propagation. J. Geophys. Res. Space Phys. 2021, 126, e2020JA028540. [Google Scholar] [CrossRef]
  15. Rodger, C.J.; Clilverd, M.A.; Thomson, N.R.; Gamble, R.J.; Seppälä, A.; Turunen, E.; Meredith, N.P.; Parrot, M.; Sauvaud, J.-A.; Berthelier, J.J. Radiation belt electron precipitation into the atmosphere: Recovery from a geomagnetic storm. J. Geophys. Res. Space Phys. 2007, 112, A11307. [Google Scholar] [CrossRef]
  16. Cummer, S.A.; Inan, U.S.; Bell, T.F. Ionospheric D region remote sensing using VLF radio atmospherics. Radio Sci. 1998, 33, 1781–1792. [Google Scholar] [CrossRef]
  17. Gołkowski, M.; Sarker, S.R.; Renick, C.; Moore, R.C.; Cohen, M.B.; Kułak, A.; Młynarczyk, J.; Kubisz, J. Ionospheric D region remote sensing using ELF sferic group velocity. Geophys. Res. Lett. 2018, 45, 12739–12748. [Google Scholar] [CrossRef]
  18. Gross, N.C.; Cohen, M.B. VLF remote sensing of the D region ionosphere using neural networks. J. Geophys. Res. Space Phys. 2020, 125, e2019JA027135. [Google Scholar] [CrossRef]
  19. Xu, W.; Marshall, R.A.; Bortnik, J.; Bonnell, J.W. An electron density model of the D- and E-region ionosphere for transionospheric VLF propagation. J. Geophys. Res. Space Phys. 2021, 26, e2021JA029288. [Google Scholar] [CrossRef]
  20. Budden, K.G. The Wave-Guide Mode Theory of Wave Propagation; Logos Press: Moscow, Russia, 1961. [Google Scholar]
  21. Outsu, J. Numerical study of tweeks based on wave-guide mode theory. J. Proc. Res. Inst. Atmos. 1960, 7, 58–71. [Google Scholar]
  22. Kumar, S.; Dixit, S.K.; Gwal, A.K. Propagation of tweek atmospherics in the earth-ionosphere wave guide. II Nuovo Cim. C 1994, 17, 275–280. [Google Scholar] [CrossRef]
  23. Shvets, A.V.; Serdiuk, T.M.; Gorishnyaya, Y.V.; Hobara, Y.; Hayakawa, M. Estimating the lower ionosphere height and lightning location using multimode “tweek” atmospherics. J. Atmos. Sol. Terr. Phys. 2014, 108, 1–9. [Google Scholar] [CrossRef]
  24. Singh, R.; Veenadhari, B.; Maurya, A.K.; Cohen, M.B.; Kumar, S.; Selvakumaran, R.; Pant, P.; Singh, A.K.; Inan, U.S. D-region ionosphere response to the total solar eclipse of Jul 22 2009 deduced from ELF-VLF tweek observations in the Indian sector. J. Geophys. Res. Space Phys. 2011, 116, A10301. [Google Scholar] [CrossRef]
  25. Ohya, H.; Shiokawa, K.; Miyoshi, Y. Daytime tweek atmospherics. J. Geophys. Res. Space Phys. 2015, 120, 654–665. [Google Scholar] [CrossRef]
  26. Yu, G.; Alexander, S. Statistical study of multimodal tweek-atmospherics. In Proceedings of the 2010 International Conference on Mathematical Methods in Electromagnetic Theory, Kyiv, Ukraine, 6–8 September 2010. [Google Scholar]
  27. Maurya, A.K.; Singh, R.; Veenadhari, B.; Kumar, S.; Cohen, M.B.; Selvakumaran, R.; Pant, P.; Singh, A.K.; Siingh, D.; Inan, U.S. Morphological features of tweeks and nighttime D region ionosphere at tweek reflection height from the observations in the low-latitude Indian sector. J. Geophys. Res. Space Phys. 2012, 117, A05301. [Google Scholar]
  28. Kumar, S.; Deo, A.; Ramachandran, V. Nighttime D-region equivalent electron density determined from tweek sferics observed in the South Pacific Region. Earth Planets Space 2009, 61, 905–911. [Google Scholar] [CrossRef]
  29. Maurya, A.K.; Veenadhari, B.; Singh, R.; Kumar, S.; Cohen, M.B.; Selvakumaran, R.; Gokani, S.; Pant, P.; Singh, A.K.; Inan, U.S. Nighttime D region electron density measurements from ELF-VLF tweek radio atmospherics recorded at low latitudes. J. Geophys. Res. Space Phys. 2012, 117, A11308. [Google Scholar] [CrossRef]
  30. Yusop, N.; Nor Azlan, M.A.; Chachuli, S.; Meor Said, M.A. Characteristic of Tweek Atmospherics Observed in Mid-latitude using AWESOME VLF Receiver. Res. J. Appl. Sci. Eng. Technol. 2014, 7, 2502–2508. [Google Scholar] [CrossRef]
  31. Lichtenberger, J.; Ferencz, C.; Bodnar, L.; Hamar, D.; Steinbach, P. Automatic Whistler Detector and Analyzer system: Automatic Whistler Detector. J. Geophys. Res. Space Phys. 2008, 113, A12201. [Google Scholar] [CrossRef]
  32. Hosseini, P.; Gołkowski, M.; Turner, D.L. Unique concurrent observations of whistler mode hiss, chorus, and triggered emissions. J. Geophys. Res. Space Phys. 2017, 122, 6271–6282. [Google Scholar] [CrossRef]
  33. Ali Ahmad, U.; Kasahara, Y.; Matsuda, S.; Ozaki, M.; Goto, Y. Automatic Detection of Lightning Whistlers Observed by the Plasma Wave Experiment Onboard the Arase Satellite Using the OpenCV Library. Remote Sens. 2019, 11, 1785. [Google Scholar] [CrossRef]
  34. Zhou, R.; Gu, X.; Yang, K.; Li, G.; Ni, B.; Yi, J.; Chen, L.; Zhao, F.; Zhao, Z.; Wang, Q.; et al. A detailed investigation of low latitude tweek atmospherics observed by the WHU ELF/VLF receiver: I. Automatic detection and analysis method. Earth Planet. Phys. 2020, 4, 120–130. [Google Scholar] [CrossRef]
  35. Golden, D.I.; Spasojevic, M.; Inan, U.S. Determination of solar cycle variations of midlatitude ELF/VLF chorus and hiss via automated signal detection. J. Geophys. Res. Space Phys. 2011, 116, A03225. [Google Scholar] [CrossRef]
  36. Konan, O.J.; Mishra, A.K.; Lotz, S. Machine learning techniques to detect and characterise whistler radio waves. arXiv 2020, arXiv:2002.01244. [Google Scholar]
  37. Harid, V.; Liu, C.; Pang, Y.; Alvina, A.J.; Golkowski, M.; Hosseini, P.; Cohen, M.B. Automated large-scale extraction of whistlers using mask-scoring regional convolutional neural network. Geophys. Res. Lett. 2021, 48, e2021GL093819. [Google Scholar] [CrossRef]
  38. Maslej-Krešňáková, V.; Kundrát, A.; Mackovjak, Š.; Butka, P.; Jaščur, S.; Kolmašová, I.; Santolík, O. Automatic detection of atmospherics and tweek atmospherics in radio spectrograms based on a deep learning approach. Earth Space Sci. 2021, 8, e2021EA002007. [Google Scholar] [CrossRef]
  39. Gu, X.; Li, G.J.; Pang, H.; Wang, S.W.; Ni, B.B.; Luo, F.; Peng, R.; Chen, L. Statistical analysis of very low frequency atmospheric noise caused by the global lightning using ground-based observations in China. J. Geophys. Res. Space Phys. 2021, 126, e2020JA029101. [Google Scholar] [CrossRef]
  40. Gu, X.; Chen, H.; Wang, S.; Lu, Z.; Ni, B.; Li, G.; Cheng, W. Extraction of Alpha transmitter signals from single-station observations using the direction-finding method. Sci. China Technol. Sci. 2022, 65, 1727–1737. [Google Scholar] [CrossRef]
  41. Gu, X.; Wang, Q.; Ni, B.; Xu, W.; Wang, S.; Yi, J.; Hu, Z.J.; Li, B.; He, F.; Chen, X.C.; et al. First results of the wave measurements by the WHU VLF wave detection system at the Chinese Great Wall station in Antarctica. J. Geophys. Res. Space Phys. 2022, 127, e2022JA030784. [Google Scholar] [CrossRef]
  42. Chen, Y.; Ni, B.; Gu, X.; Zhao, Z.; Yang, G.; Zhou, C.; Zhang, Y. First observations of low latitude whistlers using WHU ELF/VLF receiver system. Sci. China Technol. Sci. 2017, 60, 166–174. [Google Scholar] [CrossRef]
  43. Yi, J.; Gu, X.; Cheng, W.; Tang, X.Y.; Chen, L.; Ni, B.B.; Zhou, R.X.; Zhao, Z.Y.; Wang, Q.; Zhou, L.Q. A detailed investigation of low latitude tweek atmospherics observed by the WHU ELF/VLF receiver: 2. Occurrence features and associated ionospheric parameters. Earth Planet. Phys. 2020, 4, 238–245. [Google Scholar] [CrossRef]
  44. Ryabov, B.S. Tweek propagation peculiarities in the Earth-ionosphere waveguide and low ionosphere parameters. Adv. Space Res. 1992, 12, 255–258. [Google Scholar] [CrossRef]
  45. Kumar, S.; Kishore, A.; Ramachandran, V. Higher harmonic tweek sferics observed at low latitude: Estimation of VLF reflection heights and tweek propagation distance. Ann. Geophys. 2008, 26, 1451–1459. [Google Scholar] [CrossRef]
  46. Everingham, M.; Van Gool, L.; Williams, C.K.; Winn, J.; Zisserman, A. The PASCAL Visual Object Classes (VOC) Challenge. Int. J. Comput. Vis. 2010, 88, 303–338. [Google Scholar] [CrossRef]
  47. Bochkovskiy, A.; Wang, C.; Liao, H.M. YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv 2020, arXiv:2004.10934. [Google Scholar]
  48. Wang, C.; Liao, H.M.; Yeh, I.; Wu, Y.; Chen, P.; Hsieh, J. CSPNet: A New Backbone that can Enhance Learning Capability of CNN. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, USA, 14–19 June 2020; pp. 1571–1580. [Google Scholar]
  49. Lin, T.; Dollár, P.; Girshick, R.B.; He, K.; Hariharan, B.; Belongie, S.J. Feature Pyramid Networks for Object Detection. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 2117–2125. [Google Scholar]
  50. Liu, S.; Qi, L.; Qin, H.; Shi, J.; Jia, J. Path Aggregation Network for Instance Segmentation. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 8759–8768. [Google Scholar]
Figure 1. A typical spectrogram of VLF data measured at the Suizhou station between 00:50:00 and 00:50:10 LT (local time in Suizhou, UTC+8) on 1 November 2020. (a) Spectrogram of the 10-s raw VLF data. Zoom-in view of the VLF spectrogram at frequencies of 1–5 kHz (b) between 0 and 10 s; (c) between 1.5 and 2.0 s; (d) between 3.5 and 4.0 s. The tweek signals are marked using red boxes in the upper two panels.
Figure 1. A typical spectrogram of VLF data measured at the Suizhou station between 00:50:00 and 00:50:10 LT (local time in Suizhou, UTC+8) on 1 November 2020. (a) Spectrogram of the 10-s raw VLF data. Zoom-in view of the VLF spectrogram at frequencies of 1–5 kHz (b) between 0 and 10 s; (c) between 1.5 and 2.0 s; (d) between 3.5 and 4.0 s. The tweek signals are marked using red boxes in the upper two panels.
Remotesensing 15 05019 g001
Figure 2. Spectrogram images contained in the VLF dataset. Spectrogram images showing (a) a first-order mode tweek signal; (b) a first-two-order mode tweek signal; (c) a band-limited noise signal; (d) a wave-like noise signal; (e) a distorted tweek signal; (f) lightning sferics. The red boxes mark the first- and second-order tweeks labelled using LabelImg.
Figure 2. Spectrogram images contained in the VLF dataset. Spectrogram images showing (a) a first-order mode tweek signal; (b) a first-two-order mode tweek signal; (c) a band-limited noise signal; (d) a wave-like noise signal; (e) a distorted tweek signal; (f) lightning sferics. The red boxes mark the first- and second-order tweeks labelled using LabelImg.
Remotesensing 15 05019 g002
Figure 3. (a) The architecture of the proposed tweek detection method. (b) The block diagram of the detection model, corresponding to the second box in the architecture.
Figure 3. (a) The architecture of the proposed tweek detection method. (b) The block diagram of the detection model, corresponding to the second box in the architecture.
Remotesensing 15 05019 g003
Figure 4. (a) The tracing results for a first-order mode tweek, as marked using the black points. (b) The tracing result for a first-two-order mode tweek, as marked using black points.
Figure 4. (a) The tracing results for a first-order mode tweek, as marked using the black points. (b) The tracing result for a first-two-order mode tweek, as marked using black points.
Remotesensing 15 05019 g004
Figure 5. Loss of training obtained with different hyperparameters. The training loss is estimated using the training set, and the validation loss is estimated using the validation set. (a) The training loss for the learning rates of 0.1, 0.01, and 0.001; (b) the training loss for the SDG and Adam optimizers; (c) the training and validation loss for epochs of 1000 with the red dot line marking the 400 epoch.
Figure 5. Loss of training obtained with different hyperparameters. The training loss is estimated using the training set, and the validation loss is estimated using the validation set. (a) The training loss for the learning rates of 0.1, 0.01, and 0.001; (b) the training loss for the SDG and Adam optimizers; (c) the training and validation loss for epochs of 1000 with the red dot line marking the 400 epoch.
Remotesensing 15 05019 g005
Figure 6. (a) An example mistakenly identified by the detection model; (b) a second-order mode tweek missed by the detection model; (c) a second-order mode tweek missed by manual labeling; (d) the tracing results of the tweek signal in panel (c); (e) an example mistakenly detected during the tracing process; (f) a second-order mode tweek missed during the tracing process.
Figure 6. (a) An example mistakenly identified by the detection model; (b) a second-order mode tweek missed by the detection model; (c) a second-order mode tweek missed by manual labeling; (d) the tracing results of the tweek signal in panel (c); (e) an example mistakenly detected during the tracing process; (f) a second-order mode tweek missed during the tracing process.
Remotesensing 15 05019 g006
Table 1. The composition of the training, validation, and testing datasets.
Table 1. The composition of the training, validation, and testing datasets.
SetSpectrogram ImagesTweeks
Training20471656
Validation228212
Testing2495310
Table 2. Positive and negative cases for the detected first-order-mode and first two-order-mode tweek signals using different combinations of T and α.
Table 2. Positive and negative cases for the detected first-order-mode and first two-order-mode tweek signals using different combinations of T and α.
T (ms), (2 − α, 2 + α)First-Order-Mode TweekFirst Two-Order-Mode Tweek
PositiveNegativePositiveNegative
5, (1.9, 2.1)60331833
10, (1.9, 2.1)60232823
15, (1.9, 2.1)60232823
10, (1.8, 2.2)603483
10, (1.7, 2.3)601501
Table 3. Detection results of the detection method.
Table 3. Detection results of the detection method.
Training Data: Validation DataFirst-Order-Mode Tweek First Two-Order-Mode Tweek
PrecisionRecallPrecisionRecall
9:192%89.2%97.5%86.7%
8:289.3%90.7%96.2%84.0%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Xu, W.; Ma, W.; Wang, S.; Gu, X.; Ni, B.; Cheng, W.; Feng, J.; Wang, Q.; Hu, M. Automatic Detection of VLF Tweek Signals Based on the YOLO Model. Remote Sens. 2023, 15, 5019. https://doi.org/10.3390/rs15205019

AMA Style

Xu W, Ma W, Wang S, Gu X, Ni B, Cheng W, Feng J, Wang Q, Hu M. Automatic Detection of VLF Tweek Signals Based on the YOLO Model. Remote Sensing. 2023; 15(20):5019. https://doi.org/10.3390/rs15205019

Chicago/Turabian Style

Xu, Wei, Wenchen Ma, Shiwei Wang, Xudong Gu, Binbin Ni, Wen Cheng, Jingyuan Feng, Qingshan Wang, and Mengyao Hu. 2023. "Automatic Detection of VLF Tweek Signals Based on the YOLO Model" Remote Sensing 15, no. 20: 5019. https://doi.org/10.3390/rs15205019

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop