Next Article in Journal
Experimental Investigation of the Mechanical and Tribological Properties of Jute Fiber Composites with Nano-Sized Al2O3 Ceramic Particle Reinforcement
Previous Article in Journal
Synthesis and Characterization of Starch-Based Bioplastics: A Promising Alternative for a Sustainable Future
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Proceeding Paper

Statistical Downscaling of Global Climate Models for Temperature Trend Analysis in Calgary †

by
Mahdi Alipour
1,*,
Mohammad Bejani
2 and
Arman Hosseinpour Salehi
1
1
Department of Civil and Environmental Engineering, University of Tabriz, Tabriz 51368, Iran
2
Center of Excellence in Hydroinformatics, University of Tabriz, Tabriz 51368, Iran
*
Author to whom correspondence should be addressed.
Presented at the 4th International Electronic Conference on Applied Sciences, 27 October–10 November 2023; Available online: https://asec2023.sciforum.net/.
Eng. Proc. 2023, 56(1), 320; https://doi.org/10.3390/ASEC2023-15976
Published: 14 November 2023
(This article belongs to the Proceedings of The 4th International Electronic Conference on Applied Sciences)

Abstract

:
Climate change, particularly global warming, is a significant environmental issue that has gained widespread attention in recent decades. This study aimed to complement the model for the future by utilizing Global Climate Models (GCMs) data. The shallow-layered Artificial Neural Network (ANN) and deep-based Long Short-Term Memory (LSTM) network was applied to extract the historical temperature trend of Calgary, Canada. Mutual Information (MI) was employed for screening purposes to ensure the quality of the input variables. The results of the study indicate that the LSTM model, which relied on the data screening method using MI, achieved RMSE of 0.01 °C, DC of 0.93, a CC of 0.75 and a Bias of 1.89, and has superiority over the ANN method in the Alberta region.

1. Introduction

With the development of technology and industrialization of human societies, along with the increase in greenhouse gases in the past decades, the temperature of the Earth has risen, and other climate parameters have changed. This phenomenon is known as climate change. Accurate prediction of these climate parameters is vital for proper planning for future statistical downscaling methods, including dynamic and interpolation approaches, which are commonly utilized to simulate weather and climate at different scales, ranging from global to regional. These downscaling methods aim to bridge the spatial gap between large-scale climate data and the specific requirements of local and regional applications. However, there are still challenges in applying statistical downscaling methods across different climate regions, ranging from cold regions to tropical regions. Understanding these challenges and addressing the gaps in the methodology is crucial for improving the performance of statistical downscaling and making it a preferred approach over other methods [1]. Short and long-term weather forecasting is one of the fundamental challenges for researchers in the field of water resources and climate. To address this challenge, various tools such as atmospheric and oceanic General Circulation Models (GCMs), prediction scenarios, and downscaling models are used. By combining large-scale data from GCMs with small-scale synoptic data, these tools help predict and provide a more accurate trend of weather and climate in the future. Climate change encompasses various parameters such as temperature and precipitation. Therefore, knowledge of the future changes of these parameters as well as temporal representations of them, can be a useful tool for water resource management in any region [2]. To investigate climate change at the local and regional scale, it is necessary to use climate data and downscale them to a finer scale. The information available on climate data is at a large scale and is not suitable for local and regional applications. Therefore, to assess the impacts of the future climate at a smaller scale, there is a need to downscale these scenarios [3]. Among the statistical downscaling methods, downscaling through the use of generators, classification, and regression are more commonly utilized due to their practicality and low cost. These downscaling methods are used to simulate weather and climate in three categories: weather and climate generators, weather and climate classification, and regression-based methods [4]. simulated the temperature and precipitation using GCM models, over two lakes in China until 2100. The results showed that in the future, temperature and precipitation fluctuations in these regions will rise [5]. Two methods, ANN and statistical downscaling model, were compared to downscale daily precipitation and temperature data in the Saguenay River basin in northern Quebec, Canada. The study concluded that ANNs perform better than SDSMs in downscaling daily temperature data [6]. The long-term trend of temperature and its related factors in the North China region using EOF analysis and statistical dimensionality reduction, two temperature variables at the height of 850 hPa and the combination of geographic height and temperature at the height of 850 hPa, have been used as predictors [7]. The results show that the temperature trend in July is related to the main EOF pattern of the two composite fields, and the statistical dimensionality reduction method is able to correctly reproduce the long-term temperature trend in North China. The use of these methods can help to better understand the temperature trend in different geographical areas, even if the main statistical models do not have enough accuracy in simulating the trend. Statistical downscaling of climate scenarios in Scandinavia is investigated by Bauren [8]. This method derives accurate and local climate information from discovery data by using statistical relationships of big data and local conditions. The importance of having accurate climate information to analyze the effects of climate change on the environment and societies makes it mandatory to use statistical downscaling in Scandinavian climate studies. The results show how the rate of warming in this region will change over the course of the 21st century and the performance between different scenarios. The prediction of the future maximum and minimum temperature in the Boston Lake basin has been made using SDSMs [9]. The models used data from the United States Centers for Environmental Prediction (NCEP) analysis and observations at four stations. The data series from the first thirty years (1961–1990) have been used to determine and calibrate the model, and the data series of the remaining ten years (1991–2001) have been used to validate the model. The results show that in the Boston Lake basin, the maximum and minimum temperatures will increase significantly in the future on a daily, monthly, seasonal, and annual scale. The temperature increase in scenario A2 is higher than scenario B2, and the highest increase is observed in summer and the lowest increase is observed in winter [10].
In this study, 14 different geographical patterns were used as inputs, with the aim to simulate historical temperature data using Long-short Term Memory (LSTM) and Artificial Neural Network (ANN) models that utilize large-scale GCMs and assess the suitability of these methods in simulating purposes.

2. Method and Materials

2.1. Study Area and Data Set

The study area for this research is Calgary, located in the province of Alberta, Canada. Calgary is situated at coordinates 51.0447 degrees north and 114.0719 degrees west, with an elevation of 1100 m above sea level. The city experiences cold winters and mild to warm summers, with an average annual temperature of around 9 degrees Celsius. Annual precipitation in Calgary is approximately 425 mm, distributed throughout the year. The land use distribution in Calgary is as follows: agricultural activities occupy about 30% of the total area, while forested areas constitute around 20%, urban areas cover approximately 40% of the study region, consisting of residential, commercial, and industrial zones. Barren land, such as rocky terrain or areas with limited vegetation, accounts for roughly 5% of the total area. The remaining 5% includes water bodies, wetlands, parks, and protected areas. In terms of the socioeconomic context, Calgary has a diverse economy with key sectors including oil and gas, technology, finance, tourism, and transportation. Figure 1 shows the study site with the elevation map. The data used in this research is from the Can_ESM5, which is from www.canada.ca (accessed on 30 January 2023).This information includes variables such as climatic, humidity, temperature and pressure type predictors for the period from 1979 to 2014, as well as observational data from www.acis.alberta.ca (accessed on 20 January 2023); which process is depict in the Figure 2.

2.2. Data Preprocessing and Model Calibration

Data preprocessing plays a crucial role in increasing the accuracy and reliability of statistical downscaling models. MI was utilized as a preprocessing technique to screen the input variables and ensure their quality. MI helps identify the relevant predictors and their relationships with the temperature of the study area.

2.3. Statistical Downscaling Methods

In order to extract the historical temperature trend of Calgary, statistical downscaling methods based on ANN and LSTM networks were employed. These advanced machine learning methods were used as simulation tools to capture complex relationships and patterns in the input data, making them suitable for climate modeling. ANN models consist of interconnected artificial neurons that can learn from the data and make predictions. They are effective in capturing nonlinear relationships and have been widely used in various fields, including climate modeling. In this study, the ANN model was employed alongside the LSTM network to downscale large-scale data from GCMs to a finer scale suitable for local and regional applications. LSTM networks, on the other hand, are a type of recurrent neural network (RNN) that can process and predict sequences of data. LSTM networks have the ability to capture long-term dependencies in the input data and are particularly suited for time series analysis. In the context of climate modeling, LSTM networks can effectively capture temporal patterns and trends in temperature data. Both the ANN and LSTM models were calibrated and trained using historical temperature data in Calgary. Hyperparameters of downscaling models were obtained using the following setup: Relu activation function, Epochs = 50, batch size 32, 80% train and 20% test set. The LSTM model was configured with four hidden layers, while the ANN model had one hidden layer. The calibration process included training the models with the selected inputs and their performance. By utilizing both ANN and LSTM models in the statistical downscaling process, this study aimed to capture both the complex nonlinear relationships and the temporal patterns in the historical temperature data of Calgary.

2.4. Evaluation Criteria

To evaluate the accuracy of the utilized models, evaluation criteria are used: determination of error criteria (RMSE), determination of coefficient (DC), average error (Bias) and correlation coefficient (CC).
RMSE measures the overall error of the model by calculating the square root of the difference between the estimate and the target.
DC represents the best model in the observed explanation, so that it shows the highest fit.
The bias measures the difference between the estimate and the target and provides the systematic error or bias of the model.
CC evaluates the strength and linear relationship between model output and observed data.
Using these criteria, the performance of ANN and LSTM models has been observed against the data.
The mathematical equations behind evaluation criteria denote as:
R M S E = i = 1 N   R i Z i 2 N
C C = x x ¯ . ( y y ¯ ) x x ¯ 2 . ( y y ¯ ) 2
D C = 1 i = 1 N   Z i R i 2 i = 1 N   Z i Z 2
where Ri signifies the estimated value, Zi represents the target value, Z ¯ stands for the average value of the target observations, and N denotes the size of the sample. The RMSE maintains the same dimension as the observations, whereas the DC is unitless and falls within the range of (−∞, 1]. A higher DC value approaching 1 indicates a greater degree of accuracy in the regression analysis. Considering the CC criteria, the x indicates actual value input data, x ¯ denotes the mean value of input data, y refers to the observed value, and y ¯ indicates the mean value of observed data.

3. Results

This study aimed to use GCM data as input for a statistical downscaling method based on LSTM and ANN combined with MI screening which illustrated in Figure 3. The methodology for simulating historical temperatures involved subjecting GCM data to MI screening to identify ten climate parameters that show a significant non-linear relationship with temperature in the study area. These parameters are selected based on the non-linear and significant relationships they have with the temperature trend. Non-linear relationships based on MI show that the change in a parameter does not have a constant or proportional effect on the temperature trend. Instead, these relationships may show complex patterns and interactions, where the effect of a single parameter on temperature may vary across different ranges and conditions (See Table 1).
Based on the MI screening results, the selected climate parameters were used as input for LSTM and ANN models. Based on Table 2, the results show that the LSTM model has a better ability in recording the historical temperature trend compared to the ANN model. The superiority of the LSTM model can be attributed to its deep layers and frequent connections. Deep layers allow the model to learn complex patterns and relationships in the data, while frequent connections enable error correction and integration of information from previous layers. In contrast, the ANN model, with a hidden layer, is limited in its ability to capture complex relationships. Therefore, the LSTM-based network is a more effective method for simulating historical temperature trends. The findings of this study show the effectiveness of the LSTM model as a reference model for the purposes of simulation and impact assessment in the Calgary region. The LSTM model’s ability to accurately capture historical temperature trends can provide valuable insights for climate-related studies and decision-making processes. The graph of the LSTM model results is shown in Figure 4.

4. Conclusions

This study showed that the LSTM model, combined with data screening using MI, has better performance in extracting historical temperature trends compared to the ANN model. The LSTM model obtained higher accuracy compared to the ANN model resulting in higher DC, CC and lower RMSE and bias. The LSTM-based network shows potential as a simulation and impact assessment tool for the Calgary region. However, it is important to acknowledge the limitations and uncertainties associated with the models, data, and methods used in this research. These uncertainties must be effectively addressed to policymakers and stakeholders to allow them to make informed decisions about climate change adaptation and mitigation strategies.

4.1. Shortcomings, Uncertainties, and Policy Aspects

Despite the promising results obtained from the LSTM model, there are several shortcomings and uncertainties that should be considered. First, this study focused solely on temperature trends, and analysis of other climate parameters such as precipitation was not included. Future research could consider incorporating additional variables to provide a more comprehensive understanding of the effects of climate change. Second, this study assumes stationarity of climate data, which may not be true for future climate conditions. It is necessary to consider the non-stationarity of climate variables when making long-term predictions. Furthermore, the accuracy and reliability of downscaling models depend on the quality and resolution of input data from GCMs. Uncertainties associated with the ability of GCMs to capture regional climate patterns and processes must be acknowledged. Additionally, the downscaling models used in this study provide insights into historical trends but may have limitations when predicting future climate scenarios.
From a policy perspective, the findings of this study can help make informed decisions about climate change adaptation and mitigation strategies in the Calgary region. The improved accuracy of downscaling models can help policymakers and stakeholders understand local temperature trends and their potential implications for different sectors, such as agriculture, water resources, and infrastructure planning. However, policymakers should also consider the uncertainties and limitations associated with the models and data used in this study. Strong and comprehensive assessments, combining multiple information sources, and considering different climate scenarios are essential for effective policy formulation and implementation.

4.2. Future Research Directions

This study paves the way for further research on statistical downscaling of climate variables in the Calgary region. Future studies could investigate the downscaling of other climate parameters, such as precipitation, to provide a more comprehensive understanding of the local effects of climate change. In addition, incorporating more advanced machine learning techniques or ensemble modeling approaches can increase the accuracy and robustness of downscaling models. Addressing the non-stationarity of climate variables and considering the uncertainties associated with GCM forecasts should also be prioritized. In addition, expanding the temporal and spatial domain of analysis and incorporating a longer period of observational data can lead to more reliable and robust results.

Author Contributions

Conceptualization, M.A. and M.B.; methodology, M.A.; software, M.A. and M.B.; validation, M.A. and A.H.S.; formal analysis, A.H.S. and M.B.; investigation, M.A.; re-sources, M.A.; data curation, M.A.; writing—original draft preparation, M.A. and M.B.; writing—review and editing, M.B., M.A. and A.H.S.; visu-alization, A.H.S.; supervision, M.A.; project administration, M.B., M.A. and A.H.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are available on request.

Conflicts of Interest

The authors declare no conflict of interest. The authors have no relevant financial or non-financial interest to disclose.

References

  1. Hay, L.E.; Wilby, R.L.; Leavesley, G.H. A Comparison of Delta Change and Downscaled Gcm Scenarios for Three Mountainous Basins in the United States. JAWRA J. Am. Water Resour. Assoc. 2000, 36, 387–397. [Google Scholar] [CrossRef]
  2. Grotch, S.L.; MacCracken, M.C. The use of general circulation models to predict regional climatic change. J. Clim. 1991, 4, 286–303. [Google Scholar] [CrossRef]
  3. Wilby, R.; Dawson, C.; Barrow, E. sdsm—A decision support tool for the assessment of regional climate change impacts. Environ. Model. Softw. 2002, 17, 145–157. [Google Scholar] [CrossRef]
  4. Murphy, J. An Evaluation of Statistical and Dynamical Techniques for Downscaling Local Climate. J. Clim. 1999, 12, 2256–2284. [Google Scholar] [CrossRef]
  5. Zhu, B.; Xue, L.; Wei, G.; Zhang, L.; Chen, X. CMIP5 projected changes in temperature and precipitation in arid and humid basins. Theor. Appl. Clim. 2018, 136, 1133–1144. [Google Scholar] [CrossRef]
  6. Tripathi, S.; Srinivas, V.; Nanjundiah, R.S. Downscaling of precipitation for climate change scenarios: A support vector machine approach. J. Hydrol. 2006, 330, 621–640. [Google Scholar] [CrossRef]
  7. Fan, L.; Fu, C.; Chen, D. Long-term trend of temperature derived by statistical downscaling based on EOF analysis. J. Meteorol. Res. 2011, 25, 327–339. [Google Scholar] [CrossRef]
  8. Hanssen-Bauer, I.; Achberger, C.; Benestad, R.E.; Chen, D.; Førland, E.J. Statistical downscaling of climate scenarios over Scandinavia. Clim. Res. 2005, 29, 255–268. [Google Scholar] [CrossRef]
  9. Bing, Q.; Jiahu, J. Analysis of trents of future temperature in the Bosten Lake Basin based on a statistical downscaling model. J. Resour. Ecol. 2010, 1, 268–273. [Google Scholar]
  10. Baño-Medina, J.; Manzanas, R.; Gutiérrez, J.M. Configuration and intercomparison of deep learning neural models for statistical downscaling. Geosci. Model Dev. 2020, 13, 2109–2124. [Google Scholar] [CrossRef]
Figure 1. Study area and the surrounded grid points.
Figure 1. Study area and the surrounded grid points.
Engproc 56 00320 g001
Figure 2. Historical trend of temperature at the statistical period.
Figure 2. Historical trend of temperature at the statistical period.
Engproc 56 00320 g002
Figure 3. The flowchart of the proposed methodology.
Figure 3. The flowchart of the proposed methodology.
Engproc 56 00320 g003
Figure 4. Diagram of the LSTM model results.
Figure 4. Diagram of the LSTM model results.
Engproc 56 00320 g004
Table 1. MI-screened GCM data.
Table 1. MI-screened GCM data.
Temp(1)Temp(2)Temp(3)Temp(4)Shum(4)S850(4)Shum(3)S850(3)S850(2)Shum(2)
mean2.452.951.23−0.240.120.120.130.120.120.13
std9.4010.4210.669.380.070.070.080.070.060.06
min−19.20−22.32−25.23−24.850.020.020.010.010.020.02
25%−5.08−5.85−7.62−7.460.070.070.060.060.070.07
50%0.871.810.82−1.360.100.100.100.090.100.11
75%10.8312.5811.538.510.180.180.190.170.170.17
max20.1221.7718.7116.260.320.320.340.320.320.33
Table 2. Evaluation criteria results.
Table 2. Evaluation criteria results.
ModelCCRMSE (C°)DCBias
TrainValidationTrainValidationTrainValidationTrainValidation
LSTM0.750.690.010.030.930.891.892.2
ANN0.620.541.21.50.780.712.012.54
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Alipour, M.; Bejani, M.; Salehi, A.H. Statistical Downscaling of Global Climate Models for Temperature Trend Analysis in Calgary. Eng. Proc. 2023, 56, 320. https://doi.org/10.3390/ASEC2023-15976

AMA Style

Alipour M, Bejani M, Salehi AH. Statistical Downscaling of Global Climate Models for Temperature Trend Analysis in Calgary. Engineering Proceedings. 2023; 56(1):320. https://doi.org/10.3390/ASEC2023-15976

Chicago/Turabian Style

Alipour, Mahdi, Mohammad Bejani, and Arman Hosseinpour Salehi. 2023. "Statistical Downscaling of Global Climate Models for Temperature Trend Analysis in Calgary" Engineering Proceedings 56, no. 1: 320. https://doi.org/10.3390/ASEC2023-15976

Article Metrics

Back to TopTop