Extreme-Low-Speed Heavy Load Bearing Fault Diagnosis by Using Improved RepVGG and Acoustic Emission Signals

Jiang, Peng; Sun, Wenyu; Li, Wei; Wang, Hongyu; Liu, Cong

doi:10.3390/s23073541

Open AccessArticle

Extreme-Low-Speed Heavy Load Bearing Fault Diagnosis by Using Improved RepVGG and Acoustic Emission Signals

by

Peng Jiang

¹,

Wenyu Sun

^1,*

,

Wei Li

¹,

Hongyu Wang

² and

Cong Liu

¹

School of Mechanical Science and Engineering, Northeast Petroleum University, Daqing 163318, China

²

Beijing Tongtai Hengsheng Technology Co., Ltd., Beijing 100020, China

^*

Author to whom correspondence should be addressed.

Sensors 2023, 23(7), 3541; https://doi.org/10.3390/s23073541

Submission received: 7 March 2023 / Revised: 22 March 2023 / Accepted: 24 March 2023 / Published: 28 March 2023

(This article belongs to the Special Issue Sensors for Wind Turbine Fault Diagnosis and Prognosis)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

With the worldwide carbon neutralization boom, low-speed heavy load bearings have been widely used in the field of wind power. Bearing failure generates impulses when the rolling element passes the cracked surface of the bearing. Over the past decade, acoustic emission (AE) techniques have been used to detect failure signals. However, the high sampling rates of AE signals make it difficult to design and extract fault features; thus, deep neural network-based approaches have been proposed. In this paper, we proposed an improved RepVGG bearing fault diagnosis technique. The normalized and noise-reduced bearing signals were first converted into Mel frequency cepstrum coefficients (MFCCs) and then inputted into the model. In addition, the exponential moving average method was used to optimize the model and improve its accuracy. Data were extracted from the test bench and wind turbine main shaft bearing. Four damage classes were studied experimentally. The experimental results demonstrated that the improved RepVGG model could be employed for classifying low-speed heavy load bearing states by using MFCCs. Furthermore, the effectiveness of the proposed model was assessed by performing comparisons with existing models.

Keywords:

intelligent fault diagnosis; deep learning; extreme-low-speed heavy load bearing; MFCCs; EMA; RepVGG; wind turbine

1. Introduction

With changes in the world’s energy structures, installation capacity and investments in renewable energy have witnessed a stable and sustained growth. In July 2020, the EU announced the goal of achieving carbon neutrality by 2050 [1]. In September 2020, China proposed to peak carbon dioxide emissions by 2030 and to achieve carbon neutrality by 2060; in addition, China aims to achieve 1200 GW of wind power by 2030, which is equivalent to quadrupling the installation target for 2019 [2]. Therefore, new challenges are being put forth for the maintenance and fault detection of wind turbines.

Bearings are essential constituents of heavy rotating machinery; they mitigate the effects of friction between rotating shafts and stationary components, such as the bearing housing [3]. Bearing defects account for a significant portion (45–55%) of rotating machinery failures. Therefore, detecting incipient bearing faults at an early stage is crucial to preventing consequential failures of the manufacturing equipment [4]. In recent decades, many bearing fault diagnosis techniques have been developed based on acoustic emission (AE). AE-based analysis enables detection of very low-energy signals caused by bearing failures at an early stage or during low-speed operation [5,6,7]. However, because the sampling rate used for AE signal collection is usually greater than 1 MHz, analysis of AE signals is difficult because of the tremendous amount of data in the collected time series and the computational time required for analysis [8].

With the advent of computer technology and machine learning (ML), signal processing technologies, such as fast Fourier transform (FFT) [9], kurtosis spectrum analysis [10], envelope analysis [11], and variational mode decomposition (VMD) [12], have been developed for bearing fault diagnosis. In the field of ML, artificial intelligence (AI) algorithms, such as support vector machine (SVM) [13], random forest (RF) [14], and backpropagation neural network (BPNN) [15], have been developed. In 2006, Hinton proposed deep learning (DL) technology; subsequently, it was widely employed for machine fault diagnosis [16]. Grezmak et al. directly converted one-dimensional (1D) vibration signals into two-dimensional images and then used the convolutional neural network (CNN) to diagnose the images [17]. Guo et al. used the CNN to diagnose 1D vibration signals after Fourier transformation and achieved ideal classification results [18]. However, two problems are encountered with low-speed heavy load bearings: first, AI algorithms, such as SVM, do not fully utilize the powerful feature self-extraction ability of DL, and the extremely low rotation rate makes feature extraction challenging. Second, the working conditions of heavy-duty bearings are relatively bad, with a lot of noise and interference; as such, the fault signal is difficult to detect at low speeds.

This study primarily aimed to examine a novel dataset designed for detecting faults in low-speed heavy bearings used in industrial settings [19]. The commonly used CWRU dataset has been subject to various concerns, which were discussed in 2015 by Smith and Randall [20] and, more recently, in 2022 by Hendriks et al. [20,21]. These studies reported that the CWRU dataset may not accurately represent bearing faults in general and that it is even less representative of the specific industrial case analyzed in this study. Moreover, this study is motivated by the fact that although the use of CNNs for bearing fault diagnosis has been analyzed in the literature, CNNs are difficult to deploy in industry due to the requirement for a large number of model parameters. RepVGG’s structural reparameterization technique can greatly reduce the number of model parameters during reasoning, thus making it suitable for industrial applications [22].

In this study, a DL method was designed for AE signals for extreme-low-speed heavy load bearings for the use of large-sized bearings in industrial wind turbines. This is the first work that includes experiments conducted on large-size industrial bearings with localized faults. First, considering the low frequency of bearing fault signals at low speeds, the proposed method uses denoised Mel frequency cepstrum coefficients (MFCCs) as the input of the model. VMD-kurtosis denoising is employed as the denoising method to reduce high-frequency signals and to remove a large amount of background noise. Second, for industrial applications, the RepVGG convolutional network is employed for bearing fault diagnosis. The RepVGG network uses structure reparameterization technology to increase the reasoning speed of the model and to decrease the memory requirement, thereby enabling the remote deployment of the RepVGG for industrial applications. The experimental results corroborated this hypothesis and demonstrated that the feature extraction capabilities of the RepVGG network can be effectively utilized for fault diagnosis. Furthermore, for the same reasoning sample and approximate prediction performance, the reasoning speed of RepVGG is considerably better than that of other DL models.

This paper comprises five sections. In Section 1, the topic of intelligent fault diagnosis of industrial bearings is presented, and the authors’ motivations for undertaking this investigation in light of the existing literature are outlined. In Section 2, details regarding the AI methodologies utilized in this study, particularly VMD-kurtosis denoising MFCCs, CNNs, and RepVGG, are presented. In Section 3, the test rig employed for industrial bearings is described, and the results of a wind turbine bearing fault experiment conducted with AE signals are presented. In Section 4, the findings, discussions, and implications of the investigation are presented. Finally, the concluding remarks are provided in Section 5.

2. Proposed Method

In this section, a summary of the main method employed in this study is provided. VMD-kurtosis denoising, MFCCs, CNNs, and the RepVGG feature extractor are discussed.

2.1. VMD-Kurtosis Denoising

The VMD method treats the problem of mode decomposition as an optimization problem by decomposing the 1D input signal into a specified number of modes. The signal is fully reproduced by summing up the K number of decomposition modes:

H (t) = \sum_{k = 1}^{K} u_{k} (t),

(1)

where k is the index of modes, K is the total number of modes, and

u_{k} (t)

is the k-th mode and is an amplitude-modulated–frequency-modulated signal.

The decomposition of the time series by using the VMD method can be considered a constrained variational problem (Dragomiretskiy and Zosso, 2014) [23], whose objective function can be expressed as follows:

\{\begin{matrix} \min_{\{u_{k}\}, \{w_{k}\}} \{\sum_{k} ‖ \partial_{t}  [(δ (t) + \frac{j}{π t}) {* u}_{k} (t)] e^{{- jw}_{k} t} ‖_{2}^{2}\} \\ s . t . \sum_{k} u_{k} = f \end{matrix},

(2)

where

\{u_{k}\} = \{u_{1}, \dots {, u}_{k}\}

and

\{w_{k} {= w}_{1}, \dots {, w}_{k}\}

are, respectively, the sets of all modes and their related center frequencies,

δ (t)

is the Dirac function, * denotes the convolution, and j =

\sqrt{- 1}

.

Dragomiretskiy and Zosso (2014) used a quadratic penalty term and Lagrangian multiplier to transform Equation (2) into an unconstrained optimization problem:

L (\{u_{k}\}, \{w_{k}\}, λ) ≔ α \sum_{k} ‖ \partial_{t}  [(δ (t) + \frac{j}{π t}) {* u}_{k} (t)] e^{{- jw}_{k} t} ‖_{2}^{2} + ‖ f (t) - \sum_{k} u_{k} (t) ‖_{2}^{2} + 〈 λ (t), f (t) - \sum_{k} u_{k (t)} 〉,

(3)

where α is the regularization parameter representing the variance of the white noise, and λ(t) is the Lagrangian multiplier.

The solution of Equation (3) is obtained using the alternate direction method of multipliers (Dragomiretskiy and Zosso, 2014). By iteratively optimizing the saddle points of

u_{k}

,

w_{k}

, and

λ n

+ 1 search in Equation (3), the optimal solution of the variational model is obtained, and the center frequency and signal bandwidth of each IMF are continuously updated until the termination conditions are satisfied.

The concept of kurtosis was proposed by the British statistician Karl Pearson in 1887. The term “kurtosis” describes the degree of peaks of the data distribution and is an indicator of the skewness and dispersion of the data distribution. It is widely used in statistics, finance, physics, and other disciplines. Nowadays, kurtosis is also often used in signal time–frequency processing [24,25].

The kurtosis factor K is defined as follows:

KU = \frac{1}{N} \sum_{i = 1}^{N} {(\frac{x (i) - \bar{x}}{σ})}^{4},

(4)

where

\bar{x}

is the mean value of signal x(t), and

σ

is the standard deviation of signal x(t).

When the rolling bearing fails, the amplitude of the bearing vibration signal increases greatly, and the value of the kurtosis factor increases; the larger the amplitude of the fault signal, the more obvious the fault characteristics. IMF components with a kurtosis factor greater than 3 are usually screened out because these components can retain a large number of fault features in the fault signal and abandon the interference of other IMF components to the reconstructed signal (Figure 1).

2.2. MFCCs

MFCC was proposed in the 1980s by Davis and Mermelstein [26]. MFCCs are more effective for identification purposes than other parameters. As such, MFCCs have been extensively employed in various speech recognition applications, including instruction recognition, emotion recognition, and person recognition. Moreover, MFCCs are being increasingly used for the status and fault identification of mechanical equipment, such as transformers and UAVs [27,28].

The fundamental concept of MFCC refers to a cepstral coefficient calculated in the Mel-scale frequency domain. MFCC is a parameter that corresponds to the energy of the audio signal in different frequency ranges within the cepstrum. By capturing both the low-frequency envelope and high-frequency detail information, MFCCs can effectively reflect the acoustic characteristics of the signal. The Mel-scale frequency domain simulates the human ear’s perception of audio frequencies and is defined as follows:

M (f) = 2595 \lg (1 + \frac{f}{700}),

(5)

where M is the Mel frequency, and f is the frequency.

2.3. CNNs

CNNs are feed-forward neural networks that include convolution calculations. CNNs greatly reduce the number of network parameters through local connections and weight sharing, and have thus been widely used in the field of computer vision [29,30]. Local connections mean that neurons do not need to be connected one-to-one but only need to be connected in groups. The structure of the image and the pixels in the adjacent area are more closely connected; thus, a single neuron only needs to perceive the local area of the image and then combine the local features of the image obtained by the lower layers in the higher layer. Weight sharing means that each channel uses the same convolution kernel to deconvolute the image at all positions [31,32,33]. CNNs have many structures, but as shown in the LeNet-5 network in Figure 2, CNNs typically include convolutional layers, pooling layers, fully connected layers, and activation functions [34].

In the training process, the loss function, which serves as a metric for evaluating the discrepancy between the model’s predictions and the ground truth, is optimized. This loss function is typically specific to the given task and can take various forms. The cross-entropy function (Equation (6)) is a common example of loss functions used for classification tasks with mutually exclusive classes [19]:

Loss = - \frac{1}{M} \sum_{n = 1}^{N} \sum_{m = 1}^{M} y_{mn} \ln \hat{y_{mn}},

(6)

where M is the number of observations, N is the number of classes,

\hat{y_{mn}}

is the network output for the m-th observation and n-th class, and y_mn is the ground truth for the m-th observation and n-th class.

2.4. EMA-RepVGG

In 2021, Ding [22] proposed a CNN architecture known as RepVGG, which is characterized by a simple yet effective design that incorporates VGG-style deep convolutional networks comprising a sequence of 3 × 3 convolutions and rectified linear unit activations. The architecture is further enhanced by the use of structural reparameterization technology, which enables efficient training and inference on GPUs and specialized chips. To achieve superior training results and feature extraction capability, RepVGG employs a multibranching residual structure similar to that of ResNet. Notably, the training and inference stages of the network utilize different architectures, with training emphasizing accuracy and inference prioritizing speed [34]. The RepVGG architecture comprises five stages, with downsampling achieved via stride-2 convolution at the start of each stage. Figure 3 illustrates the first four layers of a specific stage.

In the field of statistics, the moving average (MA) computation method is used for creating a series of averages for various subsets of a complete dataset and is typically applied to time series data for smoothing out short-term fluctuations [35]. Exponential moving average (EMA), a type of MA, assigns greater weight and importance to the most recent data points, which is consistent with the principles of the human auditory system, placing greater emphasis on recent information. In deep learning, optimization algorithms are a crucial component. Traditional optimization algorithms, such as SGD (Stochastic Gradient Descent) [36] and Adam [37], only use the gradient information from the current time step for updating parameters, which can result in non-smooth updates and oscillations in the parameter space. To address this issue, exponential moving average (EMA) can be introduced to smooth out the gradient information. Specifically, EMA calculates the exponential weighted average of gradients, which makes the updating process smoother. In addition, EMA can be used to calculate the average accuracy or average loss in the evaluation of DL models to better evaluate the performance of models.

The EMA for a series signal can be formulated as follows:

θ_{E M A, t + 1} = (1 - λ) \cdot θ_{E M A, t} + λ \cdot θ_{t},

(7)

where

θ_{t}

is the parameter at time t,

θ_{E M A, t}

is the network parameter after moving average at time t, and

λ

is the weighted weight value (usually takes a number close to 1, such as 0.9995).

The working environment of most heavy load bearings in industrial applications is challenging, especially in wind turbines and even offshore wind turbines; thus, the most reasonable monitoring method is to deploy the processing system locally. For this purpose, we used EMA-RepVGG for identifying the condition of the load bearings. MFCCs and VMD-kurtosis denoising technology greatly reduce high-frequency signals and increase the proportion of fault signals under low-speed rotations. The structural reparameterization technique adopted by RepVGG greatly reduces the cost of reasoning, which is conducive to remote deployment and the application of specialized inference chips. In addition, because at very low speeds not every segment of the signal contains bearing fault information, the use of EMA can improve the robustness of model prediction and reduce errors caused by data instability. Table 1 reports the set of hyperparameters adopted in this work.

3. Experimental Setup and Data Acquisition

To validate the proposed method, a bearing fault simulator and wind turbine spindle bearing were used for measuring healthy and faulty AE signals of the rolling element bearing.

3.1. Bearing Fault Simulator Experiment

As shown in Figure 4, the bearing fault simulator comprises a main bench, a main shaft, an induction motor, and a test bearing. The bench is made of welded section steel and is used to support the main shaft and its bearings; it also bears the force under radial and axial compression. The inner race is installed on the main shaft to realize radial and axial positioning. The outer race is prevented from rotating by a stop (not shown) installed on the bench or a radial compression jack. The main shaft is installed on the bench by using support bearings at both ends. The motor reducer installed at one end drives the main shaft to transfer and drive the inner ring of the tested bearing to rotate. The motor reducer adopts a variable frequency drive, which can adjust the speed from 1 to 40 rpm.

Double-row tapered roller bearings were used. The bearing parameters are presented in Table 2.

Four health states were considered: normal, rolling element (RE) damage (Figure 5a), outer race (OR) damage, and multirolling element (MRE) damage (Figure 5b). Faults (diameter: 5 mm, depth: 0.5 mm) were mechanically machined on the race that was most loaded in the case of an axial load. The bearings were dismounted, and faults were drilled on the race of interest by using a solid carbide drill with a diameter of 2 mm. Although the produced faults represent localized defects in rolling bearings, the AE data extracted cannot provide a comprehensive scenario of defects detectable in rolling bearings.

Four load cases at four shaft speeds were analyzed (Table 3 and Table 4). Next, 200 signals were extracted for each health state, totaling 800 signals. The AE signals were acquired using the Express-8 AE acquisition system (American Physical Acoustics Company) and sampled at 25,000 Hz. Each round of acquisition lasted 10 s.

Non-overlapping chunks of 1 s duration were extracted from the AE signals to construct the dataset (Table 5). Each signal yielded 10 chunks, resulting in a balanced dataset of 8000 samples across the four classes: normal, RE, OR, and MRE. The experimental results provided labeled data for use in supervised learning. The dataset size was sufficient for use in large DL architectures.

The proposed method’s applicability was analyzed using a randomly split dataset. The data were divided using a common DL splitting method, with 80% used for training the RepVGG model, 10% used for validation, and the remaining 10% used for testing. Details regarding the data split are presented in Table 6.

3.2. Wind Turbine Spindle Bearing on-Line Experiment

The experimental process is illustrated in Figure 6. The AE signals of the main bearing of a variable-speed constant-frequency doubly fed wind turbine unit were monitored. The main components of the transmission system were the main shaft, bearing, gearbox, coupling, generator, and main frame. To avoid the interference of the gearbox noise signal, the sensors were placed on the front bearing (near the hub side). Four DP15I AE sensors with a frequency of 60–400 kHz were used on each wind turbine, arranged on the inner wall of the main bearing counterclockwise. The diameter of the inner wall of the main bearing was approximately 50 cm, and the arc distance between each sensor was approximately 39 cm.

For the on-line experiment, the 24 h real-time monitoring method was adopted to conduct 24 h long-term monitoring on the main bearings of the three wind turbines. The bearings were classified according to the health status as healthy and faulty based on the characteristics of the distribution law of AE signals during the rotation of the main bearings of the fans. The waveform flow technology was used to collect and process the waveform flow data (duration: 60 s) with 24 h as the cycle, 1 h as the time interval, and 1 min of data in each hour as the time length.

4. Experimental Results and Discussion

We studied the ability of EMA-RepVGG, especially RepVGG A0, to perform bearing fault diagnosis. To validate the performance of this method, AE signals from the bearing fault simulator and wind turbine spindle bearing on-line experiment were acquired using the testbed. As discussed in Section 2.4, the MEA-RepVGG convolutional architecture can achieve feature extraction and fault diagnosis of extreme-low-speed heavy load bearings. Signal processing examples of normalized AE signals for the normal, RE, OR, and MRE damage states are shown in Figure 7, Figure 8, Figure 9 and Figure 10, where (a) is the original signal, (b) is the signal after denoising, and (c) are MFCCs, respectively. The corresponding feature map output from the EMA-RepVGG feature extractor is shown in Figure 10a–c. The information contained in the multifaceted MFCC spectrograms was translated and synthesized in a low-dimensional feature space through feature embedding. The classifier discerns classes by learning the differences between feature embeddings.

The model was fine-tuned using the hyperparameters listed in Table 1. The training time was 1611 s on a standard laptop with Nvidia 1650Ti GPU. The model was implemented in Python 3.7 and PyTorch. The feature map in stage 4 of RepVGG with the three damage classes are shown in Figure 11, from which it can be seen that the model extracts fault information. The loss functions of the proposed model and RepVGG during training are shown in Figure 12. The use of EMA prevented model fluctuations caused by data containing noise and large data gaps and model fluctuations near the optimal solution. The accuracies and reasoning times presented in Table 7 indicate the applicability of the diagnosis model to the test data. The reasoning time of the proposed model was much shorter than that of other models under the same accuracy rate. The complete confusion matrices obtained from the laboratory and wind turbine are shown in Figure 13. The diagnostic accuracy of the proposed model on the bearing of the industrial wind turbine was 86.25%. Thus, the results demonstrate that the classifier yields a high precision and recall rate, as reported in Table 8.

Furthermore, the proposed model was compared with ResNet-50 [38], EfficientNet V1 [39], EMA-RepVGG with FFT, and EMA-RepVGG without VMD-kurtosis denoising. The accuracies obtained using different models are presented in Table 7, and the precision and recall for different classes are listed in Table 8. Although ResNet and RepVGG use a similar residual technique, the use of the structural reparameterization technique greatly increased the reasoning speed of RepVGG. The EfficientNet V1 model exhibited satisfactory accuracies and reduced training times; however, some amount of overfitting was observed. EMA-RepVGG with FFT yielded lower accuracies than the proposed model because MFCCs can better reflect the low-frequency part of the signal; thus, RepVGG with MFCCs is more suitable for low-speed industrial equipment. EMA-RepVGG without VMD-kurtosis denoising yielded a lower accuracy than the proposed model under the same training cycle; although its overall training duration and reasoning duration were less than that of the proposed model, subsequent experiments revealed that it took more time to reach the same accuracy by improving the number of training cycles alone without preprocessing and denoising, which also demonstrates that for a certain understanding of signal fault information, proper pretreatment can optimize the fault diagnosis process.

However, this poses a problem in the interpretation of diagnosis results; the characteristic graph of each convolution kernel in the convolution layer has no clear physical interpretation. In contrast to traditional signal processing tools, where some parameters (e.g., kurtosis, crest factor, and ball passing frequencies) have a physical meaning, the user does not know what the features represent for data-driven fault diagnosis, even though they may work perfectly. Thus, estimating the feature variability with respect to the changes in the input signals is challenging. Furthermore, the development of appropriate tools for interpretability is crucial to accurately visualize the alignment of domains in DL [18].

5. Conclusions

An extreme-low-speed heavy load bearing fault diagnosis model was proposed using AE signals and DL. VMD-kurtosis denoising was used for data preprocessing, and MFCCs were used as the model input. An improved model, EMA-RepVGG, was then established for feature extraction and classification. Experimental data were obtained from the experimental set-up at the NEPU and wind turbine main shaft bearing of the wind turbine. The experiment involved four bearing states, four rotational speeds, and four loads, with a total of 8000 signals. The wind turbine spindle bearing experiment involved two wind turbines with different conditions in the same wind farm. AE signals were classified with an accuracy of 98.57%. The training time was 1611 s. Based on the results, the following conclusions can be drawn:

AE technology can be employed for the signal acquisition of low-speed heavy load bearings.
DL CNNs can be used for industrial condition monitoring.
The EMA-RepVGG model is an industrial application model suitable for remote deployment and embedded development.
MFCCs focus on low-frequency signals in AE signals, which is beneficial for low-speed bearing failure detection.
Denoising the model input aids in the optimization of the training process under specific circumstances.

Author Contributions

Data curation, W.S. and H.W.; formal analysis, W.S.; investigation, H.W. and C.L.; resources, W.L. and P.J.; conceptualization, W.L.; supervision, W.L. and P.J.; writing—review and editing, C.L.; writing—original draft, W.S. and H.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Natural Science Foundation of Heilongjiang Province, grant number LH2022E029, and the Postdoctoral Research Foundation of Heilongjiang Province, grant number LBH-Q21083.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data can be provided upon request from the correspondence author.

Conflicts of Interest

The authors declare no conflict of interest.

References

Rogelj, J.; Schaeffer, M.; Meinshausen, M.; Knutti, R.; Alcamo, J.; Riahi, K.; Hare, W. Zero emission targets as long-term global goals for climate protection. Environ. Res. Lett. 2015, 10, 105007. [Google Scholar] [CrossRef]
Chen, H.; Chen, J.; Han, G.; Cui, Q. Winding down the wind power curtailment in China: What made the difference? Renew. Sustain. Energy Rev. 2022, 167, 112725. [Google Scholar] [CrossRef]
Brusa, E. Design of a kinematic vibration energy harvester for a smart bearing with piezoelectric/magnetic coupling. Mech. Adv. Mater. Struct. 2020, 27, 1322–1330. [Google Scholar] [CrossRef]
Garcia-Calva, T.; Morinigo-Sotelo, D.; Fernandez-Cavero, V.; Romero-Troncoso, R. Early detection of faults in induction motors—A review. Energies 2022, 15, 7855. [Google Scholar] [CrossRef]
Bharadwaj, S.; Singh, S.; Kumar, R. A review on acoustic emission analysis for bearing fault detection and classification. Measurement 2019, 145, 800–819. [Google Scholar] [CrossRef]
Ali, Y.H.; Abd Rahman, R.; Hamzah, R.I.R. Acoustic emission signal analysis and artificial intelligence techniques in machine condition monitoring and fault diagnosis: A review. J. Teknol. 2014, 69, 121–126. [Google Scholar] [CrossRef] [Green Version]
Wu, G.; Yan, T.; Yang, G.; Chai, H.; Cao, C. A review on rolling bearing fault signal detection methods based on different sensors. Sensors 2022, 22, 8330. [Google Scholar] [CrossRef]
Kim, J.Y.; Kim, J.M. Bearing fault diagnosis using grad-CAM and acoustic emission signals. App. Sci. 2020, 10, 2050. [Google Scholar] [CrossRef] [Green Version]
Cooley, J.W.; Tukey, J.W. An algorithm for the machine calculation of complex Fourier series. Math. Comput. 1965, 19, 297–301. [Google Scholar] [CrossRef]
Kraśny, M.J.; Bowen, C.R. A system for characterisation of piezoelectric materials and associated electronics for vibration powered energy harvesting devices. Measurement 2021, 168, 108285. [Google Scholar] [CrossRef]
Zhang, L.; Zhang, X.; Lu, G. Application of improved Hilbert envelope analysis for incipient fault diagnosis of rolling bearings. Mech. Syst. Signal Process. 2018, 99, 731–747. [Google Scholar] [CrossRef]
Lu, Q.; Shen, X.; Wang, X.; Li, M.; Li, J.; Zhang, M. Fault diagnosis of rolling bearing based on improved VMD and KNN. Math. Probl. Eng. 2021, 2021, 2530315. [Google Scholar] [CrossRef]
Baccarini, L.M.R.; e Silva, V.V.R.; de Menezes, B.R.; Caminhas, W.M. SVM practical industrial application for mechanical faults diagnostic. Expert Syst. Appl. 2011, 38, 6980–6984. [Google Scholar] [CrossRef]
Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
Rumelhart, D.E.; Hinton, G.E.; Williams, R.J. Learning representations by back-propagating errors. Nature 1986, 323, 533–536. [Google Scholar] [CrossRef]
Hinton, G.E.; Osindero, S.; Teh, Y.-W. A fast learning algorithm for deep belief nets. Neural Comput. 2006, 18, 1527–1554. [Google Scholar] [CrossRef]
Grezmak, J.; Zhang, J.; Wang, P.; Loparo, K.A.; Gao, R.X. Interpretable convolutional neural network through layer-wise relevance propagation for machine fault diagnosis. IEEE Sens. J. 2019, 20, 3172–3181. [Google Scholar] [CrossRef]
Li, G.; Deng, C.; Wu, J.; Chen, Z.; Xu, X. Rolling bearing fault diagnosis based on wavelet packet transform and convolutional neural network. Appl. Sci. 2020, 10, 770. [Google Scholar] [CrossRef] [Green Version]
Di Maggio, L.G. Intelligent fault diagnosis of industrial bearings using transfer learning and CNNs pre-trained for audio classification. Sensors 2022, 23, 211. [Google Scholar] [CrossRef]
Smith, W.A.; Randall, R.B. Rolling element bearing diagnostics using the Case Western Reserve University data: A benchmark study. Mech. Syst. Signal Process. 2015, 64, 100–131. [Google Scholar] [CrossRef]
Hendriks, J.; Dumond, P.; Knox, D.A. Towards better benchmarking using the CWRU bearing fault dataset. Mech. Syst. Signal Process. 2022, 169, 108732. [Google Scholar] [CrossRef]
Ding, X.; Zhang, X.; Ma, N.; Han, J.; Ding, G.; Sun, J. RepVGG: Making VGG-style ConvNets Great Again. arXiv 2021, arXiv:2101.03697. [Google Scholar] [CrossRef]
Dragomiretskiy, K.; Zosso, D. Variational mode decomposition. IEEE Trans. Signal Process. 2013, 62, 531–544. [Google Scholar] [CrossRef]
Westfall, P.H. Kurtosis as peakedness, 1905–2014. RIP. Am. Stat. 2014, 68, 191–195. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Gu, X.; Yang, S.; Liu, Y.; Hao, R. Rolling element bearing faults diagnosis based on kurtogram and frequency domain correlated kurtosis. Meas. Sci. Technol. 2016, 27, 125019. [Google Scholar] [CrossRef]
Davis, S.; Mermelstein, P. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoust. Speech Signal Process. 1980, 28, 357–366. [Google Scholar] [CrossRef] [Green Version]
Cai, R.; Wang, Q.; Hou, Y.; Liu, H. Event monitoring of transformer discharge sounds based on voiceprint. J. Phys. Conf. Ser. 2021, 2078, 012066. [Google Scholar] [CrossRef]
Shi, L.; Ahmad, I.; He, Y.; Chang, K. Hidden Markov model based drone sound recognition using MFCC technique in practical noisy environments. J. Commun. Netw. 2018, 20, 509–518. [Google Scholar] [CrossRef]
Krizhevsky, A.; Sutskever, I.; Hinton, G.E. Imagenet classification with deep convolutional neural networks. In Proceedings of the Advances in Neural Information Processing Systems 25 (NIPS 2012), Lake Tahoe, NV, USA, 3–6 December 2012; pp. 1097–1105. [Google Scholar]
LeCun, Y.; Boser, B.; Denker, J.; Henderson, D.; Howard, R.; Hubbard, W.; Jackel, L. Handwritten digit recognition with a back-propagation network. In Proceedings of the Advances in Neural Information Processing Systems 25 (NIPS 2012), Denver, CO, USA, 27–30 November 1989; pp. 396–404. [Google Scholar]
Zhao, R.; Yan, R.; Chen, Z.; Mao, K.; Wang, P.; Gao, R.X. Deep learning and its applications to machine health monitoring. Mech. Syst. Signal Process. 2019, 115, 213–237. [Google Scholar] [CrossRef]
Lei, Y.; Yang, B.; Jiang, X.; Jia, F.; Li, N.; Nandi, A.K. Applications of machine learning to machine fault diagnosis: A review and roadmap. Mech. Syst. Signal Process. 2020, 138, 106587. [Google Scholar] [CrossRef]
Girshick, R.; Donahue, J.; Darrell, T.; Malik, J. Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 23–28 June 2014; pp. 580–587. [Google Scholar]
Lecun, Y.; Bottou, L.; Bengio, Y.; Haffner, P. Gradient-based learning applied to document recognition. Proc. IEEE 1998, 86, 2278–2324. [Google Scholar] [CrossRef] [Green Version]
Liu, Y.; Zhang, C.; Hang, B.; Wang, S.; Chao, H.-C. An audio attention computational model based on information entropy of two channels and exponential moving average. Hum. Cent. Comput. Inf. Sci. 2019, 9, 7. [Google Scholar] [CrossRef] [Green Version]
Bottou, L. Stochastic Gradient Descent Tricks. In Neural Networks: Tricks of the Trade; Springer: Berlin/Heidelberg, Germany, 2012; pp. 421–436. [Google Scholar] [CrossRef] [Green Version]
Kingma, D.P.; Ba, J.L. Adam: A Method for Stochastic Optimization. In Proceedings of the 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, 7–9 May 2015; pp. 1–15. [Google Scholar]
He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
Tan, M.; Le, Q.V. EfficientNet: Rethinking model scaling for convolutional neural networks. arXiv 2019, arXiv:1905.11946. [Google Scholar] [CrossRef]

Figure 1. Flowchart of the VMD-kurtosis denoising method. First, VMD decomposition is applied to the acoustic emission (AE) signal to convert the signal into eight IMFs. Next, the kurtosis factor value of each IMF is calculated. Finally, the IMF with kurtosis factor > 3 is reconstructed.

Figure 2. Typical CNN network: LeNet-5.

Figure 3. RepVGG network architecture. Left image: residual structure in ResNet. Middle image: training phase, using a similar multibranch residual structure inspired by ResNet. Right image: inference stage, converting all the network layers to Conv 3 × 3 through the reparameterization process. The rectified linear unit is a piecewise linear activation function that outputs the input directly if it is positive and zero otherwise [22].

Figure 4. Bearing fault simulator: (a) diagram of the bearing fault simulator; (b) photographs of the bearing fault simulator.

Figure 5. SKF 3519/560/HCYA6: (a) rolling element damage with 2 mm diameter and 0–5 mm depth; (b) multirolling element damage with 2 mm diameter and 0.5 mm depth.

Figure 6. The wind turbine spindle bearing on-line experimental setup.

Figure 7. AE signals of the normal health state at 20 rpm and 100 kN radial load: (a) original signal; (b) signal after denoising; (c) MFCCs.

Figure 8. AE signals of the RE damage state at 20 rpm and 100 kN radial load: (a) original signal; (b) signal after denoising; (c) MFCCs.

Figure 9. AE signals of the OR damage state at 20 rpm and 100 kN radial load: (a) original signal; (b) signal after denoising; (c) MFCCs.

Figure 10. AE signals of the MRE damage state at 20 rpm and 100 kN radial load: (a) original signal; (b) signal after denoising; (c) MFCCs.

Figure 11. Feature map in stage 4 of RepVGG: (a) RE damage; (b) OR damage; (c) MRE damage.

Figure 12. Loss functions.

Figure 13. Test confusion matrix: (a) bearing fault simulator; (b) wind turbine spindle bearing.

Table 1. Hyperparameters for EMA-RepVGG transfer learning.

Hyperparameters	Value
Optimizer	Adam [37]
L2 regularization	1 × 10⁻⁶
Mini batch size	32
Initial learning rate	5 × 10⁻⁴
Max epochs	30
$λ$	0.999

Table 2. Bearing parameters of indoor acoustic emission test bearing.

Bearing Outer Diameter D	Bearing Inner Diameter d	Thickness	Number of Rolling Elements	Contact Angle
733.4 mm	509.94 mm	200 mm	2 × 35	45°

Table 3. Parameter setting of the AE monitoring system.

Project	Channel	Threshold (dB)	PDT (μs)	HDT (μs)	HLT (μs)
parameter	4	35	1000	2000	2000

PDT (Peak Definition Time), HDT (Hit Definition Time), HLT (Hit Locking Time).

Table 4. Test conditions.

	Load Case 1	Load Case 1	Load Case 1	Load Case 1
Radial load (kN)	0	50	100	100
Axial load (kN)	0	0	0	50
Nominal speeds (rpm)	5, 10, 15, 20

Table 5. Signal extraction.

Total acquisition duration (s)	10
Sampling frequency (Hz)	25,000
Chunk length (samples)	25,000
Chunk length (s)	1.0
Number of chunks per signal	10

Table 6. Details of the dataset.

Classes	Label	Training Samples (80%)	Validation Samples (10%)	Test Samples (10%)
4	Normal	1600	200	200
	IR	1600	200	200
	OR	1600	200	200
	RE	1600	200	200
	Total	8000	800	800

Table 7. Diagnosis accuracy and reasoning time.

Model	Training Accuracy	Test Accuracy	Training Time (s)	Reasoning Time (s)	Peak Memory Allocated (MB)
EMA-RepVGG with MFCCs	98.88%	97.57%	1611	9.08	305
EMA-RepVGG with FFT	95.46%	95.05%	1688	9.12	305
EMA-RepVGG without VMD-Kurtosis Denoising	91.48%	92.68%	1679	9.49	305
ResNet [38]	96.44%	96.25%	4640	15.27	477
EfficientNet V1 [39]	90.12%	92.96%	1501	13.99	430

Table 8. Precision and recall of diagnosis models.

Model	Label	Precision	Recall	Specificity
EMA-RepVGG with MFCCs	Normal	93.1%	100%	97.9%
	ER	100%	92.3%	100%
	OR	97.0%	99.2%	99.1%
	MER	99.3%	97.3%	99.8%
EMA-RepVGG with FFT	Normal	90.8%	93.2%	92.1%
	ER	91.7%	91.6%	94.7%
	OR	91.3%	92.7%	93.2%
	MER	93.4%	94.1%	94.1%
EMA-RepVGG without VMD-Kurtosis Denoising	Normal	89.1%	90.8%	92.6%
	ER	92.6%	88.9%	89.7%
	OR	92.4%	89.7%	91.3%
	MER	93.4%	91.7%	93.4%
ResNet 50 [38]	Normal	94.4%	92.0%	98.2%
	ER	100%	94.5%	100%
	OR	99.0%	99.0%	99.7%
	MER	92.1%	99.5%	97.2%
EfficientNet V1 [39]	Normal	91.3%	93.9%	97.0%
	ER	93.5%	90.7%	98.0%
	OR	93.9%	92.5%	98.1%
	MER	93.4%	94.6%	97.5%

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Jiang, P.; Sun, W.; Li, W.; Wang, H.; Liu, C. Extreme-Low-Speed Heavy Load Bearing Fault Diagnosis by Using Improved RepVGG and Acoustic Emission Signals. Sensors 2023, 23, 3541. https://doi.org/10.3390/s23073541

AMA Style

Jiang P, Sun W, Li W, Wang H, Liu C. Extreme-Low-Speed Heavy Load Bearing Fault Diagnosis by Using Improved RepVGG and Acoustic Emission Signals. Sensors. 2023; 23(7):3541. https://doi.org/10.3390/s23073541

Chicago/Turabian Style

Jiang, Peng, Wenyu Sun, Wei Li, Hongyu Wang, and Cong Liu. 2023. "Extreme-Low-Speed Heavy Load Bearing Fault Diagnosis by Using Improved RepVGG and Acoustic Emission Signals" Sensors 23, no. 7: 3541. https://doi.org/10.3390/s23073541

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Extreme-Low-Speed Heavy Load Bearing Fault Diagnosis by Using Improved RepVGG and Acoustic Emission Signals

Abstract

1. Introduction

2. Proposed Method

2.1. VMD-Kurtosis Denoising

2.2. MFCCs

2.3. CNNs

2.4. EMA-RepVGG

3. Experimental Setup and Data Acquisition

3.1. Bearing Fault Simulator Experiment

3.2. Wind Turbine Spindle Bearing on-Line Experiment

4. Experimental Results and Discussion

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI