Article

A Transformer-Based Bridge Structural Response Prediction Framework

School of Civil Engineering, Dalian University of Technology, Dalian 116024, China
* Author to whom correspondence should be addressed.
Sensors 2022, 22(8), 3100; https://doi.org/10.3390/s22083100
Submission received: 31 March 2022 / Revised: 13 April 2022 / Accepted: 15 April 2022 / Published: 18 April 2022
(This article belongs to the Section Intelligent Sensors)

Abstract
Structural response prediction with desirable accuracy is essential for the health monitoring of bridges. However, it is difficult to accurately extract structural response features on account of the complex on-site environment and noise disturbance, resulting in poor prediction accuracy of the response values. To address this issue, a Transformer-based bridge structural response prediction framework is proposed in this paper. The framework contains multi-layer encoder modules and attention modules that can precisely capture the history-dependent features in time-series data. The effectiveness of the proposed method was validated using six months of strain response data from a concrete bridge, and the results were also compared with those of the most commonly used Long Short-Term Memory (LSTM)-based structural response prediction framework. The analysis indicated that the proposed method is effective in predicting structural response, with a prediction error less than 50% of that of the LSTM-based framework. The proposed method can be applied in damage diagnosis and disaster warning for bridges.

1. Introduction

Response data of bridge structures can be used to assess structural health: when the response values deviate beyond a certain range from the norm, the monitored bridge structure is at risk of abnormality or damage [1,2,3,4]. In this context, there is a need to accurately predict the response of bridge structures. Structural response data are time-series data, in which data from a certain period in the past are typically used to predict data in a future period. Such data are widely encountered in economic and social science fields, for example in weather prediction and stock prediction. Traditional structural response prediction methods mainly rely on linear and nonlinear models [5,6,7,8], which perform well for some simple systems; however, for high-order nonlinear systems with long-term time dependence and spatial correlation, these methods suffer from huge computational effort and insufficient accuracy [9].
Early neural networks mainly refer to Back Propagation (BP) neural networks [10], which were used in areas such as predicting financial markets [11,12], electrical loads [13,14,15], and traffic accidents [16,17]. These networks share a common problem: the output depends only on the input data, not on the order of the input data. The neurons themselves cannot store information, and the whole network has no “memory” capability. With the rise of deep learning, deep networks have shown good performance in speech recognition and image processing [18,19,20], and LSTM is considered to be the best network model for processing time-series data [21,22]. Therefore, LSTM has gradually become a research hotspot in the field of engineering structures, especially in the operation and maintenance phase of bridges, where it can predict the response of bridges over a short period of time, and many researchers have proposed LSTM-based response prediction models. Zhang et al. [23] established a convolutional long short-term memory (ConvLSTM) network to learn spatiotemporal latent features from data and thus build a surrogate model for structural response forecasting. Li et al. [24] applied LSTM to model the bridge aerodynamic system with the potential fluid memory effect. Ahmed et al. [25] developed an LSTM network with overlapping data to evaluate important response data after earthquakes. However, LSTM still has its limitations; its ability to capture the semantics of time-series data is insufficient, which may lead to poor prediction accuracy. To remedy this deficiency, researchers proposed the Transformer structure [26], which is better than LSTM at capturing the semantics of time-series data owing to its use of an attention mechanism as the underlying network.
Therefore, in this paper, a Transformer-based bridge structural response prediction framework is proposed for improving the accuracy of bridge structural response prediction, and the performance of the proposed framework is tested on a concrete bridge. To the best of our knowledge, this is the first Transformer framework that has been used for bridge structural response prediction. The paper is organized as follows. Section 2 provides the details of the proposed framework. Section 3 provides basic information about the bridge response prediction experiments, containing the dataset, training parameters, etc. Section 4 provides the experimental results. Section 5 provides the discussion. Section 6 provides the limitations of the proposed method. Section 7 provides the conclusion.

2. Framework for Structural Response Prediction

The traditional CNN and RNN are discarded in the Transformer; the whole network structure is composed entirely of attention mechanisms. The original Transformer consists solely of Self-Attention and feed-forward neural networks. After years of development, the Transformer has produced many variants.
The Transformer structure used in this paper is shown in Figure 1 and has an encoder–decoder structure. The encoder consists of 6 encoding blocks, and similarly the decoder is composed of 6 decoding blocks. As with other generative sequence models, the output of the encoder is used as an input to the decoder.

2.1. Attention Mechanism

Attention mechanisms [27] have been widely used in various areas of deep learning in recent years and are easily encountered in many different types of tasks, be it image processing, speech recognition, or natural language processing. Their inspiration comes from the human attention mechanism. The visual attention mechanism is a signal-processing mechanism in the brain that is unique to human vision: by quickly scanning the global image, human vision identifies the target area to focus on, generally known as the focus of attention, and then devotes more attention resources to this area to obtain more detailed information about the target while suppressing other, less useful information [28,29]. Its purpose is to select the information that is most critical to the current task from among the many pieces of information available.
The input vector is denoted by $X = [X_1, X_2, \ldots, X_n]$. In order to assign a weight to the input vector, the attention weight $\mathrm{Attention} = [a_1, a_2, \ldots, a_n]$ is calculated as follows:

$$\mathrm{Attention} = \mathrm{Softmax}\!\left(\frac{QK^{T}}{\sqrt{d_k}}\right)V$$

where $Q$, $K$, and $V$ denote “query”, “key”, and “value”, respectively, and $d_k$ is the scaling factor, i.e., the dimensionality of $K$. For large values of $d_k$, the dot products grow large in magnitude, pushing the Softmax function into regions with very small gradients. To counteract this effect, the dot products are scaled by $\frac{1}{\sqrt{d_k}}$.
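For illustration, a minimal sketch of this scaled dot-product attention is given below in PyTorch (the framework used in Section 3.2); the batch and sequence dimensions in the comments are assumptions for the example, not values taken from the paper.

```python
import math
import torch

def scaled_dot_product_attention(Q, K, V):
    """Compute Softmax(Q K^T / sqrt(d_k)) V for batched inputs.

    Q, K, V: tensors of shape (batch, seq_len, d_k); the shapes here are
    illustrative assumptions, not the exact ones used in the paper.
    """
    d_k = K.size(-1)
    # Scale the dot products by 1/sqrt(d_k) to avoid pushing Softmax
    # into regions with very small gradients.
    scores = torch.matmul(Q, K.transpose(-2, -1)) / math.sqrt(d_k)
    weights = torch.softmax(scores, dim=-1)
    return torch.matmul(weights, V), weights
```

For example, self-attention inside an encoder block corresponds to calling this function with Q, K, and V all derived from the same block input.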
After the attention output is obtained, it is sent to the next module of the encoder, i.e., the feed-forward neural network. This network is fully connected and has two layers: the first layer uses a ReLU activation function, and the second layer uses a linear activation function. It can be expressed as:

$$\mathrm{FFN}(\mathrm{Attention}) = \max(0,\ \mathrm{Attention}\,W_1 + b_1)\,W_2 + b_2$$
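A corresponding sketch of this two-layer feed-forward block might look as follows; the hidden width d_hidden is an assumed value, since the paper does not state it.

```python
import torch
import torch.nn as nn

class FeedForward(nn.Module):
    """Two-layer feed-forward block: ReLU after the first layer,
    linear output after the second."""

    def __init__(self, d_model=250, d_hidden=1024):  # d_hidden is an assumption
        super().__init__()
        self.linear1 = nn.Linear(d_model, d_hidden)  # W1, b1
        self.linear2 = nn.Linear(d_hidden, d_model)  # W2, b2

    def forward(self, attention_output):
        return self.linear2(torch.relu(self.linear1(attention_output)))
```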
As shown in Figure 2, two kinds of attention mechanism are used in the Transformer: Self-Attention and Encoder–Decoder attention. Both are computed in a multi-head way, but Encoder–Decoder attention follows the traditional attention mechanism, in which the Query comes from the decoder state computed by the preceding Self-Attention sublayer, while both the Key and the Value are the output of the encoder. Self-Attention only calculates the attention (or weight matrix) inside the encoder or decoder, without reference to the current state on the decoder side.

2.2. Positional Encoding

Since the Transformer contains neither recurrence nor convolution, in order for the model to make use of the order of the sequence, it is necessary to inject some information that represents the relative or absolute position of the elements in the sequence [30]. Positional encoding is not part of the model architecture; it is actually part of the preprocessing. For each position, it provides a unique encoding. The dimensionality of the encoding used in this paper is 250.
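As an illustration, a sinusoidal positional encoding of the kind used in the original Transformer [26] can be precomputed as below; the paper only specifies the dimensionality (250), so the sinusoidal form itself is an assumption.

```python
import math
import torch

def positional_encoding(max_len, d_model=250):
    """Sinusoidal positional encoding (assumed form; the paper only
    specifies the dimensionality, 250)."""
    pe = torch.zeros(max_len, d_model)
    position = torch.arange(0, max_len, dtype=torch.float).unsqueeze(1)
    # Frequencies decrease geometrically along the encoding dimension.
    div_term = torch.exp(torch.arange(0, d_model, 2).float()
                         * (-math.log(10000.0) / d_model))
    pe[:, 0::2] = torch.sin(position * div_term)
    pe[:, 1::2] = torch.cos(position * div_term)
    return pe  # added to the inputs before the first encoder block
```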

2.3. Multi-Head Attention

Multi-Head attention [31] provides multiple “representation subspaces” for the attention mechanism. A different set of Query/Key/Value weight matrices is used in each head, and each matrix is generated by random initialization; through training, the response values are projected into different “representation subspaces”. The Multi-Head attention in this paper is formed by stacking self-attention to create a deep structure. The calculation is as follows:
$$Q_i = Q W_i^{Q}, \quad K_i = K W_i^{K}, \quad V_i = V W_i^{V}, \quad i = 1, \ldots, 8$$

$$\mathrm{head}_i = \mathrm{Attention}(Q_i, K_i, V_i), \quad i = 1, \ldots, 8$$

$$\mathrm{MultiHead}(Q, K, V) = \mathrm{Concat}(\mathrm{head}_1, \ldots, \mathrm{head}_8)\,W^{O}$$

In this paper, $Q, K, V \in \mathbb{R}^{250}$, $W_i^{Q}, W_i^{K}, W_i^{V} \in \mathbb{R}^{250 \times 64}$, $W^{O} \in \mathbb{R}^{250 \times 250}$, and $\mathrm{head}_i \in \mathbb{R}^{64}$.
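A hedged sketch of this multi-head attention is given below. It follows the formulas above with 8 heads of size 64 and a model dimension of 250; the output projection here maps the 512-dimensional concatenation back to 250, which is the standard Concat-then-$W^{O}$ arrangement and is an interpretation of the stated shapes rather than the authors' exact implementation.

```python
import torch
import torch.nn as nn

class MultiHeadAttention(nn.Module):
    """Multi-head attention with 8 heads of size 64 and model dimension 250.
    The output projection shape (512 -> 250) is an assumption based on the
    standard Concat-then-W^O formulation."""

    def __init__(self, d_model=250, n_heads=8, d_head=64):
        super().__init__()
        self.n_heads, self.d_head = n_heads, d_head
        self.W_q = nn.Linear(d_model, n_heads * d_head, bias=False)
        self.W_k = nn.Linear(d_model, n_heads * d_head, bias=False)
        self.W_v = nn.Linear(d_model, n_heads * d_head, bias=False)
        self.W_o = nn.Linear(n_heads * d_head, d_model, bias=False)

    def forward(self, Q, K, V):
        # Q, K, V: (batch, seq_len, d_model)
        B = Q.size(0)

        def split(x):  # (B, L, n_heads*d_head) -> (B, n_heads, L, d_head)
            return x.view(B, -1, self.n_heads, self.d_head).transpose(1, 2)

        q, k, v = split(self.W_q(Q)), split(self.W_k(K)), split(self.W_v(V))
        scores = q @ k.transpose(-2, -1) / self.d_head ** 0.5
        heads = torch.softmax(scores, dim=-1) @ v       # (B, n_heads, L, d_head)
        concat = heads.transpose(1, 2).reshape(B, Q.size(1), -1)  # Concat(head_1..head_8)
        return self.W_o(concat)
```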

3. Experiment

In this paper, our objective is to predict the bridge response for a specific period in the future based on the bridge response for a given period in the past, where the main load on the bridge is the vehicle load and the bridge response used in this paper is the strain. We established a dataset of bridge strains, described in Section 3.1. To show the performance of our proposed method, we compare it with previous LSTM-based methods.

3.1. Songhua River Bridge Structural Response Dataset

This bridge is located in Tonghe County, Heilongjiang Province. The total length of the bridge is 2578.28 m. The main bridge structure is a prestressed concrete continuous box girder divided into two links. Each link span arrangement is 1132 m, and 14 sensors were placed on each link span (from left to right, S01–S14). The strain responses of four sensors (S01, S02, S03, and S09) were selected as the dataset for this paper. The 14 sensors were installed for the needs of other projects; that many sensors were not required for the present study. Since the method in this paper predicts well for each sensor, only four sensors were randomly selected. The strain response was monitored for 6 months and measured every half hour. The sensor layout is shown in Figure 3.

3.2. Training Platform

The training process was performed on a single workstation with a high-performance GPU (NVIDIA RTX 2080Ti) and CPU (AMD Ryzen 2700X, 3.7 GHz). The code was written in Python 3.6, the framework was built using PyTorch 1.8.0, and training was performed on Windows 10. The optimizer was Adam with a learning rate of 0.001 and a decay rate of 0.0001. The number of epochs was set to 100.
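For completeness, the optimizer setup described above could be written in PyTorch roughly as follows; the stand-in model and the train_loader are placeholders, and the stated decay rate is interpreted here as Adam's weight_decay argument.

```python
import torch
import torch.nn as nn

# Stand-in for the Transformer of Section 2; any nn.Module with matching
# input/output shapes would slot in here (this small MLP is a placeholder).
model = nn.Sequential(nn.Linear(4, 64), nn.ReLU(), nn.Linear(64, 1))

optimizer = torch.optim.Adam(model.parameters(), lr=0.001, weight_decay=0.0001)
criterion = nn.MSELoss()  # RMSE is the square root of this loss

def train(train_loader, epochs=100):
    """train_loader is assumed to yield (input window, target) pairs
    as described in Section 3.4."""
    for epoch in range(epochs):
        for x_batch, y_batch in train_loader:
            optimizer.zero_grad()
            loss = criterion(model(x_batch), y_batch)
            loss.backward()
            optimizer.step()
```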

3.3. Loss Function and Evaluation Metrics

RMSE (Root Mean Square Error): RMSE is the most commonly used regression loss function as well as evaluation metric; it is calculated as the square root of the mean of the squared differences between the predicted and true values:

$$\mathrm{RMSE} = \sqrt{\frac{1}{n}\sum_{i=1}^{n}\left(y_i - \hat{y}_i\right)^2}$$

where $y_i$ is the true value, $\hat{y}_i$ is the predicted value, and $n$ is the number of samples.
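A direct implementation of this metric, assuming NumPy arrays of true and predicted values, is sketched below.

```python
import numpy as np

def rmse(y_true, y_pred):
    """Root mean square error between true and predicted responses."""
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    return np.sqrt(np.mean((y_true - y_pred) ** 2))
```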

3.4. Experimental Design

In this paper, two deep learning methods were used for strain response prediction: the first is the proposed Transformer-based method and the second is the LSTM-based method. The dataset used was the bridge strain response dataset described in Section 3.1. Both methods used the same training parameters, with the strain response of the previous two hours as input and the strain response of the next half hour as the prediction target, and the number of epochs was set to 100. The ratio of training set, validation set, and test set was 0.7:0.1:0.2.
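As a sketch of this sliding-window setup (four half-hour readings as input, the next reading as the target) and the 0.7:0.1:0.2 split, the following helpers could be used; the chronological ordering of the split and all variable names are assumptions.

```python
import numpy as np

def make_windows(series, n_in=4, n_out=1):
    """Slide over the strain series: n_in past points -> n_out future points."""
    X, y = [], []
    for i in range(len(series) - n_in - n_out + 1):
        X.append(series[i:i + n_in])
        y.append(series[i + n_in:i + n_in + n_out])
    return np.array(X), np.array(y)

def chronological_split(X, y, ratios=(0.7, 0.1, 0.2)):
    """Split windows into training, validation, and test sets in time order
    (chronological ordering is an assumption)."""
    n = len(X)
    n_train, n_val = int(ratios[0] * n), int(ratios[1] * n)
    return ((X[:n_train], y[:n_train]),
            (X[n_train:n_train + n_val], y[n_train:n_train + n_val]),
            (X[n_train + n_val:], y[n_train + n_val:]))
```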

4. Experimental Results

Figure 4 shows the strain response prediction results of the two methods. From the Transformer's results, the strain response was successfully predicted and the predicted values basically match the field test values, indicating that the proposed method is able to predict the strain response in the short term. From the comparison, the agreement of the proposed method is better than that of the LSTM. The LSTM, in general, predicts the trend of the structural strain response, but in detail there are still significant differences between the predicted strain response and the test values at some time points, indicating that the stability of the LSTM prediction is not as good as that of the proposed method. To better show the details of the errors, the variation of the errors with time and the probability density function are calculated so that the temporal variation and the distribution of the errors can be reflected more accurately.
Figure 5 shows the errors of the Transformer-based method and the LSTM, as well as the fitted normal distribution curves of their errors. From the error curves, it can be seen that the error of the Transformer is smaller than that of the LSTM, and both have the same trend, with jitter occurring at the 200th time point. Table 1 shows the mean error and the 95% confidence interval (CI). In terms of the mean error, that of the Transformer is about 19.2–55.5% of that of the LSTM. In terms of the 95% CI, the Transformer's CI is much narrower, approximately 59.0–87.7% of that of the LSTM. The strain responses used in this study are all raw data without pre-processing such as filtering, and the error is controlled within an acceptable range in the presence of noise, which shows the engineering feasibility of the proposed method.

5. Discussion

5.1. Impact of Different Number of Prediction Points

To test the performance of the proposed method in predicting different numbers of strain responses, we changed the parameters of the sliding window and adjusted the number of prediction points to two, four, six, eight, ten, and twelve, respectively; the results are shown in Figure 6. In particular, when the number of prediction points is greater than one, multiple batches of prediction values are generated for each time point, and the mean of these values is used as the final prediction in this paper (see the sketch after this paragraph). The RMSE for each of the six cases is calculated; it indicates the magnitude of the prediction error, and the larger the RMSE, the larger the prediction error.
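The averaging of overlapping predictions described above can be sketched as follows: each window contributes a multi-step prediction covering several future time points, and the mean over all contributions at a given time point is taken as the final value there. The array layout and function names are assumptions.

```python
import numpy as np

def average_overlapping_predictions(window_preds, start_indices, series_len):
    """window_preds[j] holds the multi-step prediction made from window j,
    covering time points start_indices[j] .. start_indices[j] + n_out - 1.
    Overlapping predictions for the same time point are averaged."""
    sums = np.zeros(series_len)
    counts = np.zeros(series_len)
    for pred, start in zip(window_preds, start_indices):
        n_out = len(pred)
        sums[start:start + n_out] += pred
        counts[start:start + n_out] += 1
    counts[counts == 0] = 1  # avoid division by zero for uncovered points
    return sums / counts
```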
The prediction errors are summarized in Figure 7. The results show that the proposed method can predict the strain response when the number of prediction points is less than twelve, but the accuracy varies widely. As the number of prediction points increases, the prediction error grows. When the number of prediction points is four, the RMSE increases significantly; when it increases to six, the growth in RMSE flattens; when it reaches ten, the RMSE increases faster again. Combining the results of Figure 6 and Figure 7: when the number of prediction points is within four, the prediction error is small and the results are credible; between four and ten points, the prediction error is moderate and the results are less credible; beyond ten points, the prediction error is relatively large and the results are not credible.

5.2. Impact of Different Time Intervals

In this section, the effect of different time intervals on the prediction results is discussed, and the dataset is set to a uniform length of 2000 samples to ensure a fair comparison. Our experiments found that the batch size had a large impact on the training results. In the previous tests, the batch size was set to 32, and the previous conclusions are based on this setting. In this section, when the batch size was kept at 32, the prediction accuracy dropped sharply. As a result, the batch size was reduced to six for multiple tests, and the prediction accuracy then dropped more slowly as the time interval was increased.
Figure 8 shows the prediction results and errors of the Transformer at different time intervals, which were set to 0.5 h, 1 h, 1.5 h, and 2 h. The results show that, in general, the prediction errors gradually increase as the time interval increases. Figure 9 shows the prediction errors for the different time intervals; it can be seen that the prediction results are more reliable for time intervals of 0.5 h and 1 h, with errors mainly distributed in (−3, 3) and (−6, 3), while the results are less reliable for time intervals of 1.5 h and 2 h, with errors mainly distributed in (−15, 15) and (−12, 16). The time interval thus has a large influence on the prediction accuracy, which decreases significantly when the interval grows from 1 to 1.5 h, indicating that the response data of the bridge are more regular when the time interval is small; this is related to the form of external loads on the bridge and the local traffic conditions. Therefore, in practical applications, it is best to keep the time interval small, preferably within 1 h.

6. Limitations

This section analyzes the limitations of the proposed method. Since there are various options for the number of input points and output points in structural response prediction, the case presented in the previous sections, with four input points and one prediction point, is the best application case. However, as the number of input points changes, the Transformer is not more accurate than the LSTM in all cases, so this section reduces the number of input points to three to compare the two prediction methods. The results are shown in Figure 10. With three input points, the prediction accuracy of the Transformer for S01, S02, and S09 is slightly lower than that of the LSTM, and its accuracy for S03 is almost the same as that of the LSTM. Compared with the case of four input points, the prediction accuracy of the Transformer for all sensors with three input points is lower, while the prediction accuracy of the LSTM decreases for S01, S02, and S09 and increases for S03. After several tests, the prediction results tend to stabilize as the number of input points increases, becoming similar to the results in Figure 4, so it is recommended to use at least four input points in order to ensure stable prediction results.

7. Conclusions

In this paper, a Transformer-based time-series prediction framework is proposed for predicting the structural response of bridges with time dependence. The proposed framework contains multiple encoder modules and attention modules; this structure enhances the semantic recognition of time-series data and is more conducive to extracting features of the structural response. The accuracy of the proposed framework is verified using six months of strain response data from a concrete bridge. The proposed framework is compared with the most commonly used LSTM-based structural response prediction framework, and the results are as follows:
  • In terms of the mean error, that of the Transformer is about 19.2–55.5% of that of the LSTM.
  • In terms of the 95% CI, the Transformer's CI is much narrower, approximately 59.0–87.7% of that of the LSTM.
Deep learning-based structural response prediction has the drawback of poor interpretability. Compared with traditional methods, deep learning-based methods rely more on the temporal regularity features of the data itself, which mainly reflects the approximation ability of deep learning and cannot correspond to the real physical features. In the future, we will pay more attention to the interpretability of time-series prediction, and calculate which time points of data are more valuable for predicting future responses.

Author Contributions

Conceptualization, Z.L. and D.L.; methodology, Z.L.; software, Z.L.; validation, Z.L., D.L. and T.S.; formal analysis, Z.L.; investigation, D.L.; resources, D.L.; data curation, Z.L.; writing—original draft preparation, Z.L.; writing—review and editing, T.S.; visualization, T.S.; supervision, D.L.; project administration, Z.L.; funding acquisition, D.L. All authors have read and agreed to the published version of the manuscript.

Funding

The authors are grateful for the financial support from the National Natural Science Foundation of China (NSFC) under Grant No. 51778104 and the Fundamental Research Funds for the Central Universities (Project No. DUT19LAB26).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Yi, T.; Li, H.; Gu, M. Recent research and applications of GPS based technology for bridge health monitoring. Sci. China Technol. Sci. 2010, 53, 2597–2610. [Google Scholar] [CrossRef]
  2. Zhang, X.-J.; Zhang, C. Study of seismic performance and favorable structural system of suspension bridges. Struct. Eng. Mech. 2016, 60, 595–614. [Google Scholar] [CrossRef]
  3. Zheng, X.; Yang, D.-H.; Yi, T.-H.; Li, H.-N. Bridge influence line identification from structural dynamic responses induced by a high-speed vehicle. Struct. Control. Health Monit. 2020, 27, e2554. [Google Scholar] [CrossRef]
  4. Zhu, X.Q.; Law, S.S. Structural Health Monitoring Based on Vehicle-Bridge Interaction: Accomplishments and Challenges. Adv. Struct. Eng. 2015, 18, 1999–2015. [Google Scholar] [CrossRef]
  5. Fujino, Y.; Iwamoto, M.; Ito, M.; Hikami, Y. Wind-Tunnel Experiments Using 3D Models and Response Prediction for a Long-Span Suspension Bridge. J. Wind. Eng. Ind. Aerodyn. 1992, 42, 1333–1344. [Google Scholar] [CrossRef]
  6. Jakobsen, J.B.; Tanaka, H. Modelling uncertainties in prediction of aeroelastic bridge behavior. J. Wind. Eng. Ind. Aerodyn. 2003, 91, 1485–1498. [Google Scholar] [CrossRef]
  7. Lee, J.S.; Lee, S.S.; Comp, S.O.C.I. Computational method for the prediction of dynamic response of long-span bridges due to unsteady wind load. In Proceedings of the Conference on High Performance Computing on the Information Superhighway (HPC Asia 97), Seoul, Korea, 28 April–2 May 1997; pp. 419–424. [Google Scholar]
  8. Yan, G.-Y.; Zhang, Z. Predictive Control of a Cable-Stayed Bridge under Multiple-support Excitations. In Proceedings of the International Conference on Mechanical Materials and Manufacturing Engineering (ICMMME 2011), Nanchang, China, 20–22 June 2011; Volume 66–68, pp. 268–272. [Google Scholar]
  9. Ding, Y.; Ren, P.; Zhao, H.; Miao, C. Structural health monitoring of a high-speed railway bridge: Five years review and lessons learned. Smart Struct. Syst. 2018, 21, 695–703. [Google Scholar]
  10. MacKay, D.J. A practical Bayesian framework for backpropagation networks. Neural Comput. 1992, 4, 448–472. [Google Scholar] [CrossRef]
  11. Ma, X.; Lv, S. Financial credit risk prediction in internet finance driven by machine learning. Neural Comput. Appl. 2019, 31, 8359–8367. [Google Scholar] [CrossRef]
  12. Wang, W.; Zheng, H.; Wu, Y.J. Prediction of fundraising outcomes for crowdfunding projects based on deep learning: A multimodel comparative study. Soft Comput. 2020, 24, 8323–8341. [Google Scholar] [CrossRef]
  13. Hartono, A.M.; Sadikin, M. Comparison methods of short term electrical load forecasting. In Proceedings of the 1st International Conference on Industrial, Electrical and Electronics (ICIEE), Anyer, Indonesia, 4–5 September 2018; Volume 218. [Google Scholar]
  14. Khanesar, M.A.; Lu, J.; Smith, T.; Branson, D. Electrical Load Prediction Using Interval Type-2 Atanassov Intuitionist Fuzzy System: Gravitational Search Algorithm Tuning Approach. Energies 2021, 14, 3591. [Google Scholar] [CrossRef]
  15. Peng, W.; Xu, L.; Li, C.; Xie, X.; Zhang, G. Stacked autoencoders and extreme learning machine based hybrid model for electrical load prediction. J. Intell. Fuzzy Syst. 2019, 37, 5403–5416. [Google Scholar] [CrossRef]
  16. Li, W.; Zhao, X.; Liu, S. Traffic Accident Prediction Based on Multivariable Grey Model. Information 2020, 11, 184. [Google Scholar] [CrossRef]
  17. Zheng, M.; Li, T.; Zhu, R.; Chen, J.; Ma, Z.; Tang, M.; Cui, Z.; Wang, Z. Traffic Accident’s Severity Prediction: A Deep-Learning Approach-Based CNN Network. IEEE Access 2019, 7, 39897–39910. [Google Scholar] [CrossRef]
  18. Zhang, Y.; Wang, T.; Ka-Veng, Y. Construction site information decentralized management using blockchain and smart contracts. Comput. Aided Civ. Infrastruct. Eng. 2021. [Google Scholar] [CrossRef]
  19. Zhang, Y.; Yuen, K.V. Crack detection using fusion features-based broad learning system and image processing. Comput. Aided Civ. Infrastruct. Eng. 2021, 36, 1568–1584. [Google Scholar] [CrossRef]
  20. Li, Z.; Li, D. Action recognition of construction workers under occlusion. J. Build. Eng. 2022, 45, 103352. [Google Scholar] [CrossRef]
  21. Chen, S.; Ge, L. Exploring the attention mechanism in LSTM-based Hong Kong stock price movement prediction. Quant. Financ. 2019, 19, 1507–1515. [Google Scholar] [CrossRef]
  22. Li, G.; Zhao, X.; Fan, C.; Fang, X.; Li, F.; Wu, Y. Assessment of long short-term memory and its modifications for enhanced short-term building energy predictions. J. Build. Eng. 2021, 43, 103182. [Google Scholar] [CrossRef]
  23. Zhang, R.; Meng, L.; Mao, Z.; Sun, H. Spatiotemporal Deep Learning for Bridge Response Forecasting. J. Struct. Eng. 2021, 147, 04021070. [Google Scholar] [CrossRef]
  24. Li, S.; Li, S.; Laima, S.; Li, H. Data-driven modeling of bridge buffeting in the time domain using long short-term memory network based on structural health monitoring. Struct. Control. Health Monit. 2021, 28, e2772. [Google Scholar] [CrossRef]
  25. Ahmed, B.; Mangalathu, S.; Jeon, J.-S. Seismic damage state predictions of reinforced concrete structures using stacked long short-term memory neural networks. J. Build. Eng. 2022, 46, 103737. [Google Scholar] [CrossRef]
  26. Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. In Proceedings of the 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA, 4–9 December 2017; Volume 30. [Google Scholar]
  27. Mnih, V.; Heess, N.; Graves, A. Recurrent models of visual attention. In Proceedings of the 27th International Conference on Neural Information Processing Systems, Montreal, Canada, 8–13 December 2014; Volume 27. [Google Scholar]
  28. Sutskever, I.; Vinyals, O.; Le, Q.V. Sequence to sequence learning with neural networks. In Proceedings of the 27th International Conference on Neural Information Processing Systems, Montreal, Canada, 8–13 December 2014; Volume 27. [Google Scholar]
  29. Xu, K.; Ba, J.; Kiros, R.; Cho, K.; Courville, A.; Salakhudinov, R.; Zemel, R.; Bengio, Y. Show, attend and tell: Neural image caption generation with visual attention. In Proceedings of the 32nd International Conference on Machine Learning, PMLR, Lille, France, 6–11 July 2015; pp. 2048–2057. [Google Scholar]
  30. Takase, S.; Okazaki, N. Positional encoding to control output sequence length. arXiv 2019, arXiv:1904.07418. [Google Scholar]
  31. Li, J.; Tu, Z.; Yang, B.; Lyu, M.R.; Zhang, T. Multi-head attention with disagreement regularization. arXiv 2018, arXiv:1810.10183. [Google Scholar]
Figure 1. Structure of the used Transformer.
Figure 2. Internal Structure of Encoder and Decoder.
Figure 3. Location of Songhua River Bridge and sensor layout.
Figure 4. Strain response prediction results of the two methods: (a) Transformer prediction for S01. (b) LSTM prediction for S01. (c) Transformer prediction for S02. (d) LSTM prediction for S02. (e) Transformer prediction for S03. (f) LSTM prediction for S03. (g) Transformer prediction for S09. (h) LSTM prediction for S09.
Figure 5. Prediction error statistics and error distribution for Transformer and LSTM: (a) Error curve of sensor S01. (b) Fitting curve of normal distribution of sensor S01. (c) Error curve of sensor S02. (d) Fitting curve of normal distribution of sensor S02. (e) Error curve of sensor S03. (f) Fitting curve of normal distribution of sensor S03. (g) Error curve of sensor S09. (h) Fitting curve of normal distribution of sensor S09.
Figure 6. Prediction results for different numbers of prediction points: (a) Two-point prediction. (b) Four-point prediction. (c) Six-point prediction. (d) Eight-point prediction. (e) Ten-point prediction. (f) Twelve-point prediction.
Figure 7. Prediction error for different numbers of prediction points.
Figure 8. Prediction results at different time intervals. (a) 0.5 h. (b) 1 h. (c) 1.5 h. (d) 2 h.
Figure 9. Prediction errors at different time intervals. (a) 0.5 h. (b) 1 h. (c) 1.5 h. (d) 2 h.
Figure 10. Prediction results of Transformer and LSTM with three input points: (a) Transformer prediction for S01. (b) LSTM prediction for S01. (c) Transformer prediction for S02. (d) LSTM prediction for S02. (e) Transformer prediction for S03. (f) LSTM prediction for S03. (g) Transformer prediction for S09. (h) LSTM prediction for S09.
Table 1. Mean error and 95% CI statistical results.

Sensor No. | Mean Error (Transformer) | Mean Error (LSTM) | 95% CI (Transformer) | 95% CI (LSTM)
S01        | 0.48                     | 0.90              | (−1.29, 1.28)        | (−0.79, 2.14)
S02        | 0.48                     | 0.85              | (−1.29, 1.28)        | (−0.75, 2.26)
S03        | 0.37                     | 1.12              | (−1.03, 0.89)        | (−0.35, 2.64)
S09        | 0.31                     | 1.61              | (−1.04, 0.70)        | (0.35, 3.30)