Convolutional LSTM Architecture for Precipitation Nowcasting Using Satellite Data

Gamboa-Villafruela, Carlos Javier; Fernández-Alvarez, José Carlos; Márquez-Mijares, Maykel; Pérez-Alarcón, Albenis; Batista-Leyva, Alfo José

doi:10.3390/ecas2021-10340

Open AccessProceeding Paper

Convolutional LSTM Architecture for Precipitation Nowcasting Using Satellite Data^†

by

Carlos Javier Gamboa-Villafruela

¹

,

José Carlos Fernández-Alvarez

^1,2

,

Maykel Márquez-Mijares

^1,*

,

Albenis Pérez-Alarcón

^1,2

and

Alfo José Batista-Leyva

¹

Instituto Superior de Tecnologías y Ciencias Aplicadas (InSTEC), Universidad de la Habana, La Habana 10400, Cuba

²

Centro de Investigación Marina, Environmental Physics Laboratory (EPhysLab), Campus as Lagoas s/n, Universidad de Vigo, 32004 Ourense, Spain

^*

Author to whom correspondence should be addressed.

^†

Presented at the 4th International Electronic Conference on Atmospheric Sciences, 16–31 July 2021; Available online: https://ecas2021.sciforum.net.

Environ. Sci. Proc. 2021, 8(1), 33; https://doi.org/10.3390/ecas2021-10340

Published: 22 June 2021

(This article belongs to the Proceedings of The 4th International Electronic Conference on Atmospheric Sciences)

Download

Browse Figures

Versions Notes

Abstract

:

The short-term prediction of precipitation is a difficult spatio-temporal task due to the non-uniform characterization of meteorological structures over time. Currently, neural networks such as convolutional LSTM have shown ability for the spatio-temporal prediction of complex problems. In this research, we propose an LSTM convolutional neural network (CNN-LSTM) architecture for immediate prediction of various short-term precipitation events using satellite data. The CNN-LSTM is trained with NASA Global Precipitation Measurement (GPM) precipitation data sets, each at 30-min intervals. The trained neural network model is used to predict the sixteenth precipitation data of the corresponding fifteen precipitation sequence and up to a time interval of 180 min. The results show that the increase in the number of layers, as well as in the amount of data in the training data set, improves the quality of the forecast.

Keywords:

convolutional LSTM; nowcasting; precipitation; GPM

1. Introduction

Precipitation nowcasting refers to the prediction of rainfall in a local region over a short period of time, generally up to six hours [1]. Short-term prediction of weather events is important for public safety from high-impact meteorological events such as flash floods, tropical cyclones, thunderstorms, lightning, high-speed wind, etc. which can affect large populations or areas of significant economic investment. Precipitation nowcasting is also useful for weather forecasts and guidance in aviation, marine safety, ground traffic control, and construction industries. Nowcasting is one of the most challenging problems in weather forecasting because of the non-uniform and flawed characterization of the meteorological structures over time. Traditional methods for forecasting based on numerical weather prediction (NWP) are not suitable for short-term predictions because they are highly computationally expensive, sensitive to noise, and they depend a lot on initial conditions of the event [2]. They cause a delay in short-term predictions because of data assimilation and simulation steps required in NWP models which make the forecast irrelevant by the time it is made.

Existing methods for precipitation nowcasting can roughly be categorized into two classes [3], namely, NWP based methods and radar echo extrapolation-based methods. For the NWP approach, making predictions at the nowcasting timescale requires a complex simulation of the physical equations in the atmosphere model. Thus, the current state-of-the-art operational precipitation nowcasting systems [4,5] often adopt the faster and more accurate extrapolation-based methods. Some computer vision techniques, especially optical flow-based methods, have proven useful for making accurate extrapolation of radar maps [4,6,7]. However, the success of these optical flow-based methods is limited because the flow estimation step and the radar echo extrapolation step are separated and it is challenging to determine the model parameters to give good prediction performance. These technical issues may be addressed by viewing the problem from the machine learning perspective. In essence, precipitation nowcasting is a spatiotemporal sequence forecasting problem with the sequence of past satellite images as input and the sequence of a fixed number of future satellite images as output. However, such learning problems, regardless of their exact applications, are nontrivial in the first place due to the high dimensionality of the spatiotemporal sequences especially when multi-step predictions have to be made, unless the spatiotemporal structure of the data is captured well by the prediction model. Moreover, building an effective prediction model for the radar echo data is even more challenging due to the chaotic nature of the atmosphere.

Recent advances in deep learning, especially recurrent neural network (RNN) and long short-term memory (LSTM) models [8,9,10,11,12,13,14,15,16], provide some useful insights on how to tackle this problem. According to the philosophy underlying the deep learning approach, if we have a reasonable end-to-end model and sufficient data for training it, we are close to solving the problem. In this paper, we propose a novel convolutional LSTM network for precipitation nowcasting. We formulate precipitation nowcasting as a spatiotemporal sequence forecasting problem that can be solved under the general sequence-to-sequence learning framework proposed in [15].

2. Methodology

2.1. IMERG Dataset

IMERG is the unified algorithm that provides multi-satellite precipitation data. The data is obtained from passive microwave sensors of the precipitation measuring satellites comprising the global precipitation measurement (GPM) constellation [17]. The IMERG dataset is available in temporal resolutions of 30 min, 3 h, 1 day, 7 days, and 30 days. All IMERG datasets have a spatial resolution of 0.1. Since our goal is short-term forecasting of precipitation, we use the IMERG datasets with a temporal resolution of 30 min. The datasets with a temporal resolution of 30 min have been available since March 2014. IMERG datasets with a temporal resolution of 30 min are available in HDF5, GeoTIFF, NetCDF, ASCII, PNG, KMZ, OpenDAP, GrADS and THREDDS data formats. For our research, we use the HDF5 format IMERG dataset [18] for all subsequent analysis. We only use the ‘precipitatonCal’ field from the HDF5 dataset, which is multi-satellite precipitation data with gauge calibration and has a unit of mm/h.

2.2. Nowcasting Problem and Training Data

In a precipitation nowcasting problem using satellite data, the spatial region is represented by a M x N grid with Z measurement values varying over time. At any time (t), the observation is a tensor X where X ∈ R^MxNxZ where R is the observed feature (precipitation). If the observation is recorded periodically, we get a sequence of observed features X^<1>, X^<2>, X^<3>, …, X^<t>. The nowcasting problem is then to predict the next sequence X<t+1> given the previous observations. In this research, we choose a square grid (M = N = 120) from the IMERG dataset as shown in Figure 1.

In our study, we would like to predict the sequence X^<1>, X^<2>, X^<3>, …, X^<15> from the previous fifteen observations at an interval of 30 min. For each input precipitation data, we use the subsequent precipitation data as the output precipitation in the training set. Therefore, we prepare 4000 examples in the training set, 1000 examples in the validation set, and 24 examples for the test set. All two sets in training and validation have diverse sets of precipitation examples such as hurricanes, storms, tropical depressions, etc.

2.3. Development of the Convolutional LSTM Network Architecture

We develop convolutional neural networks by stacking one, two and three LSTM layers for spatial and temporal feature learning which is followed by a 3D convolutional layer for the next 30 min of precipitation prediction, as shown in Figure 2. In the last layer of the architecture, we use ReLU as the activation layer. This is because precipitation nowcasting is a regression problem where the output of the convolutional neural network is a precipitation value. Since precipitation cannot take negative values, we choose ReLU to turn any negative activations into zeros (i.e., no rain).

3. Results and Discussion

Below is an example in the 29 April 2015 test dataset of a predicted storm from t + 30 min to t + 180 min illustrated below using the convolutional LSTM network with three layers (Figure 3). The model predicts that the precipitation values are initially good up to t + 180 min, although for the last intervals the precision decreases slightly. Interestingly, in all cases, the model preserves the direction and speed of the storm.

From the images, we find that the model slightly underestimates precipitation values above 20 mm/h as we rarely find predicted precipitation above 20mm/h. The reason for this is that the number of training samples is much smaller for higher precipitation values and so the neural network is more biased towards the prediction of lower precipitation values. This is also a general problem with the unbalanced dataset in deep learning-based techniques [19]. The neural network, however, estimates the speed and direction of storms accurately from past precipitation data, and the shape of the predicted precipitation corresponds to the observed precipitation. This is because the network has learned the spatial correlations between different timestamps of the previous sequences during end-to-end training.

In Figure 4, Figure 5, Figure 6 and Figure 7 we see how the Convolutional LSTM neural network with three layers exceeds the ones with one and two layersby having smaller RMSE and MAE. We also compared the prediction results of each model using the correlation at each time step in the prediction. Although the accuracy of the prediction decreases as the prediction time step progresses, the Convolutional LSTM network with more stacked layers continues to perform better at each time step.

4. Conclusions

In this article, we presented a new convolutional LSTM architecture for forecasting precipitation from space satellite data. We found that the LSTM model with three layers obtained the best results and predicts precipitation with good accuracy even for a lead time of 180 min. We conclude that convolutional LSTM is very suitable for capturing spatiotemporal relations in the satellite-based precipitation dataset for short-term forecasting. The model preserves the speed and directions of the precipitation in the forecasted results well. Satellite-based precipitation nowcasting is quite important, as radar data has limitations of not being available in all regions. A significant improvement in results could be expected by using a larger training set, using a convolutional LSTM neural network with four layers, by performing hyperparameter tuning, by pre-classifying the storm type with geographic information, and through the use of a weighted loss function.

Author Contributions

C.J.G.-V., J.C.F.-A. and M.M.-M. conceived the idea of the study; C.J.G.-V., J.C.F.-A. and M.M.-M. processed the data and made the figures; C.J.G.-V., J.C.F.-A., M.M.-M., A.P.-A. and A.J.B.-L. analyzed the results and wrote the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets to training and prove the neural network was obtained from NASA GESDISC DATA ARCHIVE in website https://gpm1.gesdisc.eosdis.nasa.gov/data/ (Accessed on 18 February 2021).

Conflicts of Interest

The authors declare no conflict of interest.

References

Smith, K.; Austin, G. Nowcasting precipitation—A proposal for a way forward. J. Hydrol. 2000, 239, 34–45. [Google Scholar] [CrossRef]
Heye, A.; Venkatesan, K.; Cain, J. Precipitation nowcasting: Leveraging deep recurrent convolutional neural networks. In Proceedings of the Cray User Group (CUG), Redmond, WA, USA, 7–11 May 2017. [Google Scholar]
Sun, J.; Xue, M.; Wilson, J.W.; Zawadzki, I.; Ballard, S.P.; Onvlee-Hooimeyer, J.; Joe, P.; Barker, D.M.; Li, P.W.; Golding, B.; et al. Use of NWP for nowcasting convective precipitation: Recent progress and challenges. Bull. Am. Meteorol. Soc. 2014, 95, 409–426. [Google Scholar] [CrossRef] [Green Version]
Cheung, P.; Yeung, H.Y. Application of optical-flow technique to significant convection nowcast for terminal areas in Hong Kong. In Proceedings of the 3rd WMO International Symposium on Nowcasting and Very Short-Range Forecasting (WSN12), Rio de Janeiro, Brazil, 6–10 August 2012; pp. 6–10. [Google Scholar]
Reyniers, M. Quantitative Precipitation Forecasts Based on Radar Observations: Principles, Algorithms and Operational Systems; Institut Royal Météorologique de Belgique: Brussels, Belgium, 2008. [Google Scholar]
Germann, U.; Zawadzki, I. Scale-dependence of the predictability of precipitation from continental radar images. Part I: Description of the methodology. Mon. Weather. Rev. 2002, 130, 2859–2873. [Google Scholar] [CrossRef]
Sakaino, H. Spatio-temporal image pattern prediction method based on a physical model with time-varying optical flow. IEEE Trans. Geosci. Remote Sens. 2013, 51, 3023–3036. [Google Scholar] [CrossRef]
Cho, K.; van Merrienboer, B.; Gulcehre, C.; Bougares, F.; Schwenk, H.; Bengio, Y. Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv 2014, arXiv:1406.1078. [Google Scholar]
Donahue, J.; Hendricks, L.A.; Guadarrama, S.; Rohrbach, M.; Venugopalan, S.; Saenko, K.; Darrell, T. Long-term recurrent convolutional networks for visual recognition and description. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015. [Google Scholar]
Graves, A. Generating sequences with recurrent neural networks. arXiv 2013, arXiv:1308.0850. [Google Scholar]
Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef]
Karpathy, A.; Li, F.F. Deep visual-semantic alignments for generating image descriptions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015. [Google Scholar]
Ranzato, M.; Szlam, A.; Bruna, J.; Mathieu, M.; Collobert, R.; Chopra, S. Video (language) modeling: A baseline for generative models of natural videos. arXiv 2014, arXiv:1412.6604. [Google Scholar]
Srivastava, N.; Mansimov, E.; Salakhutdinov, R. Unsupervised learning of video representations using LSTMs. In Proceedings of the 32nd International Conference on International Conference on Machine Learning, Lille, France, 6–11 July 2015. [Google Scholar]
Sutskever, I.; Vinyals, O.; Le, Q.V. Sequence to sequence learning with neural networks. In Proceedings of the 27th International Conference on Neural Information Processing Systems, Montreal, QC, Canada, 8–13 December 2014; pp. 3104–3112. [Google Scholar]
Bergstra, J.S.; Bardenet, R.; Bengio, Y.; Ke’gl, B. Algorithms for hyper-parameter optimization. In Proceedings of the 24th International Conference on Neural Information Processing Systems, Granada, Spain, 12–15 December 2011; pp. 2546–2554. [Google Scholar]
Huffman, G.J.; Bolvin, D.T.; Nelkin, E.J. Integrated Multi-SatellitE Retrievals for GPM (IMERG) technical documentation. NASA/GSFC Code 2015, 612, 47. [Google Scholar]
Huffman, G.J.; Bolvin, D.T.; Nelkin, E.J. Integrated Multi-SatellitE Retrievals for GPM (IMERG). 2014. Available online: ftp://arthurhou.pps.eosdis.nasa.gov/gpmdata/ (accessed on 21 June 2021).
Wang, S.; Liu, W.; Wu, J.; Cao, L.; Meng, Q.; Kennedy, P.J. Training deep neural networks on imbalanced data sets. In Proceedings of the 2016 International Joint Conference on Neural Networks (IJCNN), Vancouver, BC, Canada, 24–29 July 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 4368–4374. [Google Scholar]

Figure 1. Intensity of the precipitation variable of the IMERG HDF5 file.

Figure 2. Convolutional LSTM architecture for precipitation nowcasting using the IMERG dataset.

Figure 3. Nowcasting of a storm that occurred on 29 April 2015 for (a) t + 30, (b) t + 60, (c) t + 90, (d) Table 120, (e) t + 150 and (f) t + 180 min using Convolutional LSTM.

Figure 4. Plots Bias of magnitude of observed precipitation and predicted precipitation for the three layers.

Figure 5. Plots Correlation of magnitude of observed precipitation and predicted precipitation for the three layers.

Figure 6. Plots Mean Absolute Error of magnitude of observed precipitation and predicted precipitation for the three layers.

Figure 7. Plots Root Mean Square Error of magnitude of observed precipitation and predicted precipitation for the three layers.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Gamboa-Villafruela, C.J.; Fernández-Alvarez, J.C.; Márquez-Mijares, M.; Pérez-Alarcón, A.; Batista-Leyva, A.J. Convolutional LSTM Architecture for Precipitation Nowcasting Using Satellite Data. Environ. Sci. Proc. 2021, 8, 33. https://doi.org/10.3390/ecas2021-10340

AMA Style

Gamboa-Villafruela CJ, Fernández-Alvarez JC, Márquez-Mijares M, Pérez-Alarcón A, Batista-Leyva AJ. Convolutional LSTM Architecture for Precipitation Nowcasting Using Satellite Data. Environmental Sciences Proceedings. 2021; 8(1):33. https://doi.org/10.3390/ecas2021-10340

Chicago/Turabian Style

Gamboa-Villafruela, Carlos Javier, José Carlos Fernández-Alvarez, Maykel Márquez-Mijares, Albenis Pérez-Alarcón, and Alfo José Batista-Leyva. 2021. "Convolutional LSTM Architecture for Precipitation Nowcasting Using Satellite Data" Environmental Sciences Proceedings 8, no. 1: 33. https://doi.org/10.3390/ecas2021-10340

Article Menu

Convolutional LSTM Architecture for Precipitation Nowcasting Using Satellite Data^†

Abstract

1. Introduction

2. Methodology

2.1. IMERG Dataset

2.2. Nowcasting Problem and Training Data

2.3. Development of the Convolutional LSTM Network Architecture

3. Results and Discussion

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Convolutional LSTM Architecture for Precipitation Nowcasting Using Satellite Data †

Abstract

1. Introduction

2. Methodology

2.1. IMERG Dataset

2.2. Nowcasting Problem and Training Data

2.3. Development of the Convolutional LSTM Network Architecture

3. Results and Discussion

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Convolutional LSTM Architecture for Precipitation Nowcasting Using Satellite Data^†