Next Article in Journal
Assessing the Impacts of Human Activities on Air Quality during the COVID-19 Pandemic through Case Analysis
Previous Article in Journal
Research on Pre-Seismic Feature Recognition of Spatial Electric Field Data Recorded by CSES
Previous Article in Special Issue
A Novel Multi-Input Multi-Output Recurrent Neural Network Based on Multimodal Fusion and Spatiotemporal Prediction for 0–4 Hour Precipitation Nowcasting
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Machine Learning in Weather Prediction and Climate Analyses—Applications and Perspectives

by
Bogdan Bochenek
1,* and
Zbigniew Ustrnul
1,2
1
Institute of Meteorology and Water Management—National Research Institute, 01-673 Warsaw, Poland
2
Department of Climatology, Jagiellonian University in Krakow, 31-007 Krakow, Poland
*
Author to whom correspondence should be addressed.
Atmosphere 2022, 13(2), 180; https://doi.org/10.3390/atmos13020180
Submission received: 23 December 2021 / Revised: 18 January 2022 / Accepted: 20 January 2022 / Published: 23 January 2022

Abstract

:
In this paper, we performed an analysis of the 500 most relevant scientific articles published since 2018, concerning machine learning methods in the field of climate and numerical weather prediction using the Google Scholar search engine. The most common topics of interest in the abstracts were identified, and some of them examined in detail: in numerical weather prediction research—photovoltaic and wind energy, atmospheric physics and processes; in climate research—parametrizations, extreme events, and climate change. With the created database, it was also possible to extract the most commonly examined meteorological fields (wind, precipitation, temperature, pressure, and radiation), methods (Deep Learning, Random Forest, Artificial Neural Networks, Support Vector Machine, and XGBoost), and countries (China, USA, Australia, India, and Germany) in these topics. Performing critical reviews of the literature, authors are trying to predict the future research direction of these fields, with the main conclusion being that machine learning methods will be a key feature in future weather forecasting.

1. Introduction

The beginning of the 21st century, with the advent of big data, efficient supercomputers with Graphics Processing Units (GPU), and scientific interest in emerging new methods, turned out to be crucial in the history of machine learning [1]. Although many methods are known from the 1960s and have been examined in detail in many studies since then, recent years, with unprecedented increases in data volume and computer power, are seen as the golden era for artificial intelligence and machine learning (https://www.forbes.com/sites/joemckendrick/2019/10/23/artificial-intelligence-enters-its-golden-age/?sh=75d495f0734e, accessed on 17 December 2021).
Detailed reviews of machine learning algorithms, as the most important subgroup of artificial intelligence methods (Figure 1) in atmospheric science, can be found in many thematic articles [2,3,4]. In these publications, one can find details about many methods and their classifications. For atmospheric scientists, the most interesting group of techniques was found to be supervised learning (Figure 1), the most dominant group in the recent publications in the field. In the case that some labelled data are available, one can use it as a training dataset from which to build a function that maps given inputs to outputs. That function can be used in a different dataset, named testing one, to evaluate the model, and if the results are satisfactory, it can be used in the classification or regression of any kind of application needed. In that group we find methods, such as Decision Trees, e.g., Random Forest (RF) [5] or XGBoost (XGB) [6], Artificial Neural Networks (ANN) [7], Deep Learning (DL) [8], and Support Vector Machine (SVM) [9]. The second group in machine learning is unsupervised learning (Figure 1), in which algorithms do not have labelled data to train from, and must decide upon other ways to divide a given dataset, or reduce the dimensions of it, for further analysis. In this group, the popular methods among atmospheric scientists are K-means Clustering (K-means) [10] and Principal Component Analysis (PCA) [11].
The main goal of this study is to present a review of the machine learning methods and applications within the main topics of meteorology, as well as in climate analyses. We show examples of the use of machine learning techniques as a new method that helps to solve important and complex issues in weather forecasting and in the study of climate change over different temporal- and spatial scales. We did not intend to present every aspect of each problem (e.g., in the field of climate change at least 13 particular aspects can be distinguished, https://vitalflux.com/machine-learning-use-cases-climate-change/, accessed on 17 December 2021), but to point out only the most important and attractive issues, which were obtained via text mining of scientific publications, and to prove that machine learning techniques can be used with success in meteorology and climatology. We also present possible future research directions for these fields.
This paper is organized as follows: Section 2 presents the materials and methods used for text mining of scientific papers. Section 3 consists of a summary of the results of the text mining of scientific papers, which reveals the most common topics; Section 3.1 and Section 3.2 presents a review of the well-known scientific papers related to the use of machine learning in Numerical Weather Prediction (NWP) and climate analysis. Section 4 provides a discussion of our results and conclusions about the use of machine learning at present and in the future.

2. Materials and Methods

This study was performed using a database with information for 500 scientific articles (published since 2018), obtained from the Google Scholar search engine (https://scholar.google.com/, accessed on 10 November 2021), which were related to the phrases “numerical weather prediction” and “machine learning”—250 papers, and “climate” and “machine learning”—250 papers. All search results were organized by relevancy; every item in the search results was checked in order to choose only research papers and to exclude unrelated articles. Thus, a database with 500 papers was created. Subsequently, every manuscript was saved onto Zotero software (https://www.zotero.org, accessed on 10 November 2021), which helps to organize data and extract text databases using important information, such as title, abstract, keywords, authors, journals, etc. All the prepared data are available in the supplementary comma separated (csv) files (Tables S1 and S2). Text mining, using the ‘tidytext’ [12] R package [13], was performed on our database to search for the most common phrases included in the abstracts, and mostly used meteorological fields, methods, and countries of analysis. Additional to text mining, we analyzed well known papers on the relevant weather forecasting and climate change issues.

3. Results

For the first group of research papers related to machine learning methods and NWP models, we first built a list of search items based on American Geophysical Union (AGU) index terms (https://www.agu.org/Publish-with-AGU/Publish/Author-Resources/Index-terms, accessed on 22 December 2021), then measured how frequently these search items occurred in the abstracts. The 10 most common phrases are presented in Figure 2. Since the topic of post-processing NWP results to improve forecasts concerning renewable energy is very common among scientists, the phrase “Wind Forecasting” had the highest count (Figure 2). The phrase with the second highest count turned out to be “Ensemble Forecasting”, due to the growing interest in improving probabilistic forecasts and in the methods required to interpret them correctly. Slightly fewer counts were recorded for phrases such as “Data Assimilation”, “Extreme Events”, “Remote Sensing”, and “Land Cover”. Less than 10 counts were found for the phrases “Tropical Cyclones”, “Coupled Models”, “Cloud Physics”, and “Boundary Layer”.
For the second group of research papers related to machine learning methods in climate, a similar histogram is presented in Figure 3. Unsurprisingly, the most common phrase in this group was “Climate Change”, with more than 140 counts. Almost three times more counts were recorded for the phrase “Global Climate Models” than for “Regional Climate Models”. Slightly less common phrases were “Climate Impact”, “Remote Sensing”, “Land Cover”, and “Extreme Events”, while phrases such as “Coupled Models”, “Convection”, and “Calibration” had less than 10 counts.
In addition to text mining of research topics and phrases, similar word counts in the abstracts of selected publications can also give an interesting insight into the most common topics of interest. Some of the most interesting results derived using this method are presented in the following section. Figure 4 presents the most commonly used meteorological fields in NWP studies. Scientists mentioned the term ‘wind’ more than 200 times, and this term is related to an important group of renewable energy and wind forecasting studies, as presented in Figure 2. The term ‘precipitation’ was used almost 150 times, usually with regards to applications for short-range prediction, and downscaling or post-processing. Several papers on bias correction of temperature and air pressure were present, as well as studies on radiation, both using photovoltaic application and emulating this scheme in NWP models.
To better understand the practices used by scientists for exploring machine learning techniques in NWP, the most commonly applied methods are presented in Figure 5. The most dominant algorithms were ANN and DL. Decision trees methods, such as RF, XGB, and SVM, are often used. Based on our experience, it seems that all of these methods can be successfully applied to NWP and climate analyses.
In the case of research related to climate studies with machine learning methods, the most common countries taken into consideration are presented in Figure 6. It must be noted that only 25% (62 articles out of 250) of the papers under consideration in Figure 6 had a specific geographical region included in the abstract. Very often, the abstracts were more focused on the methods and data used in the study. An example of how this effected our results is that 25 papers on climate-related aspects in China represented almost 40% of all the papers with specified regions included in the abstract (Figure 6). The most dominant group of papers related to studies about climate in China. Slightly fewer occurrences of the following countries, USA, Australia, India, and Germany, were found in selected abstracts. Figure 4, Figure 5 and Figure 6 show the results from our analysis designed to capture all possible occurrences of a given phrase (e.g., the phrase ‘USA’ was a sum of the occurrences of the words ‘U.S.’, ‘USA’, ‘United States’, etc.).
A more detailed insight into the selected fields of interest to scientists, generated using the text mining method in the form of co-occurrence networks from Figure 2 and Figure 3, is presented below. Section 3.1 and Section 3.2 consider NWP and climate research, respectively.

3.1. Numerical Weather Prediction

3.1.1. Photovoltaic and Wind Energy

Many countries all over the world are in the stages of moving away from fossil-fuel power plants and towards the implementation of cleaner technologies, such as harnessing energy from wind or solar radiation; however, this transition leads to new challenges, one of which being the stability of power grids. Energy from conventional power plants is more stable and it is relatively easy to alter the production power with changing demand from customers, whereas renewable energy production is highly dependent on weather conditions. Therefore, accurate predictions are required, not only of meteorological fields, but also for energy production.
The standard procedure for predicting energy production from wind farms is to use NWP models and power curves of installed wind turbines, although many applications also use machine learning techniques. The research in this field has, in recent years, focused on using new machine learning methods [14,15,16,17], different NWP models and configurations, such as ensemble forecasting [18,19], or different approaches, from forecasting wind power for every wind turbine with high-resolution NWP models [20] to wind power production over whole countries [21]. With the obvious limitations of accuracy of NWP models, authors are trying to build models using methods such as RF, XGB, ANN, and DL to increase the accuracy of very short-range forecasts up to few hours, most commonly examined day-ahead forecasts, and predictions up to several days in advance.
In terms of photovoltaic (PV) energy, similar to wind power, research has focused on examining the different architectures of machine learning models for improved post-processing of NWP forecasts, with the use of similar methods [22,23]. One example is the PVNet model, designed to predict spatially aggregated PV production in Germany [24]. This model, based on LRCN (Long-Term Recurrent Convolutional Network) architecture, was not only proven to predict PV energy with high accuracy, but also to provide valuable insight into the dependence on energy production from different meteorological fields with respect to geographical location. Machine learning can be also an important tool in planning future installations of power plants [25]. With the use of existing PV systems, NWP forecasts, and observational data, it is possible to build an accurate model that can be used to determine favorable locations for new PV installations, even if weather measurements are not available. Machine learning techniques can be also combined with more basic statistical methods to provide location independent, day-ahead PV production forecasts [26].

3.1.2. Atmospheric Physics and Processes

In recent years there has been a growing interest in machine learning methods in many aspects, from the post-processing and bias correction of model forecasts [27,28] to emulating full model physics [29]. There are three important aspects to be taken into account when planning work using machine learning methods in NWP models. The first is to speed up computations of very computationally expensive parts of the model, the second is to improve the performance of current algorithms, and the third is to emulate the existing code with machine learning models in order to easily allow a model to run on a computer cluster with GPU accelerators. It is worth mentioning here some of the events held by leading NWP centers that focused on sharing knowledge regarding the use of machine learning in their applications, with recordings of meetings and presentations available online: NOAA Workshop on Leveraging AI in Environment Sciences (https://2021noaaaiworkshop.sched.com/info, accessed on 10 November 2021), ESA-ECMWF Workshop 2021 (https://www.ml4esop.esa.int/, accessed on 10 November 2021), and EUMETNET 43rd EMGLAM 2021 (http://srnwp.met.hu/Annual_Meetings/2021/index.html, accessed on 10 November 2021).
One of the most challenging problems with very high-resolution NWP models is related to land-cover classifications. Currently used databases are usually available with very coarse resolution and consist of numerous errors. Convolutional Neural Networks (CNN) can be used to improve them with the use of Sentinel-2 satellite data, the CORINE land-cover, and the BigEarthNet database [30]. This method was not only able to produce a model land-cover database that outperformed the currently used model, but also allowed the updating of maps to any time of the year, which is important for regions with large seasonal variations.
Several papers were also recently published on the topic of emulating different parts of NWP models by machine learning [31,32,33,34]. Authors are using either benchmark solutions to provide reliable estimates of examined algorithms, or using observational data from special campaigns to train models on real and accurate data. Very promising results were also obtained with NWP modules prepared especially for GPU accelerators, with speedup reaching 120 times those of their versions on standard Central Processing Unit-based (CPU) computers, in the cases of both the Radiation Transfer Model and Aerosol Microphysics.
Another interesting aspect relates to the tuning of NWP model parameters. Currently, in every existing NWP model, there are several parameters that have to be tuned manually. Scientists, while running some long or short-range experiments, usually do it, and compare verification results over different configurations. A machine learning-based approach for this process has been proposed in the literature [35,36]. Various microphysics schemes, cumulus parameterizations, and shortwave and longwave radiation schemes were examined, and based on the relationship between the choice of physical processes and the resulting forecast errors, a machine learning model was built to assess WRF model uncertainty.

3.2. Climate

3.2.1. Parametrizations

One of the challenges in improving General Circulation Models (GCM) is related to the proper parametrization of several atmospheric processes, e.g., moist convection. One example of how to tackle this problem comes with the use of machine learning methods [37]. It was proposed that RF models be trained from the output of high-resolution atmospheric NWP models and incorporated into the GCM model. It was shown that, using this technique, GCMs can run stably and accurately capture even extremes in precipitation. The RF method was used to ensure, for example, energy conservation, but authors commented that it can also be achieved with other machine learning techniques with an adjustment of the field’s tendencies in the training process.
Interesting insight into parametrization performance was presented by Juval and Gorman [38]. Consistent with O’Gorman at al. [37], the RF method was used to learn from high-resolution, idealized atmospheric models, and it also led to stable forecasts in the coarse-grid model.
Different approaches to the problem of using machine learning with parametrizations can be divided into three groups [39]. The first relates to the use of machine learning with observed data to develop improved individual parameterizations of features not explicitly resolved by the dynamics of the models [40,41,42,43]. The second is similar to the first group, although the parametrization scheme is not improved here, but replaced completely by machine learning [31,37,38,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59]. The third group relates to when observed data are used to produce forecasts of key weather features at specific locations [60,61,62,63].

3.2.2. Extreme Events

Extreme meteorological events are often related to the occurrence of weather fronts. Several studies were compared in order to examine their climatology with the use of machine learning methods [64,65,66,67], which can help provide more objective tools, in contrast with manually drawn maps with fronts. Authors are using several databases with labelled weather fronts, meteorological reanalysis, and several other methods to provide accurate models that can be used for the climatological analysis of positions of weather fronts.
Precipitation is also often considered in studies using machine learning methods. Since there is a big difference in the level of accuracy of prediction of synoptic-scale climate features and precipitation field, a 2D Convolution Neural Network has been proposed to develop approximators of regional precipitation and discharge extremes based on synoptic-scale predictions from general circulation models [68]. With such a method, not only is it possible to find the most reliable fields in estimation of precipitation extremes, but also to identify important regional and seasonal differences. Machine learning methods can be also used to better predict future intensity–duration–frequency curves that are important in terms of extreme precipitation and flooding events [69], or to estimate the trends and seasonal components of rainfall and streamflow [70,71,72] with the use of Wavelet analysis.

3.2.3. Climate Change

The previous sections show that the role of machine learning in many areas of meteorology, especially operational meteorology working for weather forecasts, is significant. This role has grown in recent years. The question that now arises is how machine learning will contribute to the field of climate change, which is probably the most significant issue in Earth sciences in recent years. The answer is not unequivocal here, because due to hundreds of articles on this topic, covering both global aspects as well as regional and local, there is a dominance of works without reference to machine learning. This situation is slowly changing, and the number of works using machine learning in climate change analyses has recently grown. We present here the most important works, in our opinion, which are important from the methodological and cognitive point of view. From the outset, it is worth citing a fundamental publication by 22 authors entitled ‘Tackling Climate Change with Machine Learning’ [73], which includes a very wide spectrum of machine learning applications in various climate change issues. It is written by many researchers from renowned research centers, specializing in particular climatic issues. This publication contains over 800 references to different aspects of climate change. In three main parts, titled ‘mitigation’, ‘adaptation’ and ‘meta tools’, the authors provide a detailed review of the literature on specific issues of climate change and its interactions with the environment and human activities. Moreover, in the work one can find many recommendations for various recipients and decision makers. The more than 800 works cited in total provide an excellent source of numerous analyses and introduce the possibilities of machine learning applications in research and activities related to climate change.
An example of a somewhat similar work, but concerning the modeling of the climatic conditions and climate projections, is the study by Schneider et al. 2017 [74]. Although this work is not as recent, it gives a good look at climate modeling and the application of new data and tools with the special use of machine learning. A broad view of the problems of climate modeling and the application of machine learning is presented in the work of Rechstein et al. [75] and Huntingford et al. [76]. A narrower meaning, concerning only selected elements of the climate system, is represented by the works of O’Gorman and Dwyer [37] and Dijkstra et al. [77]. The latter presents the advantages of using ML in the prediction of the El Niño phenomenon.
Several scientific publications considered the use of machine learning techniques in tackling climate change, where specific applications were considered, having the potential to be successfully examined similar to teleconnection identification and climate connection to extreme weather. Climate change research is present in many aspects of everyday life, e.g., in predicting building energy use [78] or pavement condition [79], in the future climate. Studies such as these can be very important and beneficial to long-term policymaking.
Another important aspect in this field is related to agriculture. Crop [80] and wheat yields [81] were modelled with successful results that outperformed classical statistical methods. Increasing heat events were identified, with machine learning techniques in the study conducted in Australia, to be a major factor causing yield losses in the future.
Based on the findings in this section, we summarize that machine learning helps to improve analysis and find links between different predictors and climate conditions in different issues. Simultaneously, it can also be used to generate high-resolution data and to explore the drivers of climate change [82,83,84,85,86,87].

4. Discussion and Conclusions

In this article, we present a review of the studies that aimed to use machine learning and artificial intelligence methods in meteorology and climatology. First, we extracted relevant information about the current studies in the field using text mining methods. With the use of Google Scholar search engine, we collected 500 articles, published since 2018, related to the use of machine learning techniques in numerical weather prediction and climate analysis. Based on the created dataset, we identified the most relevant topics of currently published studies, as well as other characteristics, such as analyzed meteorological fields, used methods, and the most common countries mentioned in the abstracts of the papers. This method has several limitations that should be mentioned here. The search engine will favor publications with the “machine learning” phrase in the title or abstract, and can omit important papers that use this phrase only in the main text. In addition, since the articles were collected manually, their overall number was only 500, far less than the requirement of several thousand, as suggested in other text mining publications related to searching for patterns in scientific articles (in those publications, databases were already prepared for specific topics, such as COVID-19). On the other hand, every publication was checked by a specialist in the field, so unrelated papers were immediately excluded from further analysis.
In terms of the presented results, it is clear that there are wide possibilities for using the methods mentioned previously, which have recently become a very important part of atmospheric science due to their research and applicational potential. Applicability in terms of prognostic models is indisputable, therefore machine learning methods can be successfully used to analyze and determine important problems in meteorology and synoptic climatology, such as current circulation types (patterns), types of weather, weather fronts, and air masses.
In our opinion, machine learning may have a particularly significant application in synoptic meteorology and climatology. This is because in many circulation-related issues there are no unambiguous, quantitative definitions or criteria, which makes it difficult and sometimes impossible to conduct objective analyses. Only for weather types can those criteria be found, but for others there are usually no strict and precise definitions without quantitative criteria and indices, and even if they exist, they are only available to selected regions on a local scale [88,89,90,91]. Therefore, machine learning can be used to objectively determine those elements, both in a supervised way when labelled data are available, and in an unsupervised way when we need to divide different features based on common characteristics. For example, the k-means clustering method can be found in many publications in which the authors intended to determine specific types of circulation, weather types, or types of dependence between different characteristics of meteorological and environment variables [92,93,94,95,96,97,98].
It is worth mentioning here that in previous review papers from the 20th century and the beginning of the 21st century, machine learning was not often mentioned in the perspectives for future emerging developments [99,100,101,102]. However, looking at the progress in the field of numerical meteorological analysis in recent years, this is not surprising. At the beginning of the 21st century, access to computer clusters, specialized software, and professional databases was very limited. It is clearly visible also in terms of meteorological reanalysis, that is now freely available to the research community, in very high spatial- and temporal resolution [103]. Although the interest in using machine learning in atmospheric science is visible from the beginning of 1990s and earlier [104,105], they were much more limited than more recent versions [106,107,108].
Even throughout the history of development in meteorology and synoptic climatology in the 21st century, it is hard to find a perspective for machine learning and artificial intelligence [90,100,101,102,103,104,105], where greater importance is placed on downscaling and GIS methods. With that in mind, authors are trying to answer the question about the future of machine learning in atmospheric science, and it seems that, at least in the coming years, interest will grow. The increase in available computer power and emerging new technologies, the development and access to specialized software, and improved reanalysis will be key factors determining the use of machine learning in many studies. There are several limitations and problems that scientists can face when using machine learning techniques. One of the most obvious is related to knowledge of tools and methods. Fortunately, many institutions are now trying to organize workshops and seminars that are freely available online to help to tackle this problem. Proper use of machine learning methods also requires some level of interdisciplinary cooperation between scientists [109].
With fast growing interest in the use of machine learning methods in NWP and climate research, it is difficult to judge what the near future looks like. Some scientists are predicting that these methods will not play a significant role, while others see machine learning as a solution to almost every problem, and believe that in a few years it will suppress the standard way of working with models. We decided to look at the written plans belonging to world-leading NWP and climate consortia, such as the European Centre for Medium-Range Weather Forecasts (ECMWF) (https://www.ecmwf.int/node/19877, accessed on 10 November 2021), and agencies such as the National Oceanic and Atmospheric Administration (NOAA) (https://sciencecouncil.noaa.gov/Portals/0/Artificial%20Intelligence%20Strategic%20Plan_Final%20Signed.pdf?ver=2021-01-19-114254-380, accessed on 10 November 2021). Both institutions are highly involved in research related to the use of modern machine learning techniques, and have extensive plans for the near future that can be used as a proxy for what we can expect in the field.
Based on previously mentioned plans and the current progress in atmospheric science, there is a clear tendency to tackle many aspects of research and operational areas of work with machine learning techniques. Both NOAA and ECMWF have assembled groups of scientists that will be responsible for accelerating artificial intelligence across whole institutions, and working together with other researchers and computer companies. Several goals and milestones have been established. There is a plan to organize several workshops and conferences related to the progress in the use of machine learning, research applications with the use of machine learning are supposed to be transferred faster to operational mode, and artificial intelligence should be widely promoted.
Another sign that machine learning methods will be present in many components of modern NWP and climate models is related to trends in new computer clusters. What can be seen in recent years is a rapid growth of top speed clusters in the world with GPU accelerators (https://blogs.nvidia.com/blog/2020/06/22/top500-isc-supercomputing/, accessed on 10 November 2021, and https://www.lumi-supercomputer.eu/lumi-provides-new-opportunities-for-artificial-intelligence-research/, accessed on 10 November 2021). Computer codes and the overall design of NWP models, written in programming languages such as Fortran, were prepared for standard CPU machines, therefore research into the use of machine learning to emulate some parts of the model, or even the whole model, can be very beneficial for agencies and consortia in the future [110]. It should be mentioned here that there are also some other initiatives that promote the use of machine learning in NWP and climate models. The Destination Earth project from European Commission’s Green Deal and the Digital Strategy (https://digital-strategy.ec.europa.eu/en/policies/destination-earth, accessed on 10 November 2021) with it’s very challenging aims to provide a digital twin of the Earth with very high-resolution, will require speedup of current NWP models and fast post-processing of hundreds of terabytes of data every day. Most probably, in order to achieve this goal, state of the art machine learning methods will have to be implemented in future operational suites.
According to our knowledge and experience in this field, it is important first to properly understand the processes and relationships between meteorological and environment variables in analyzed problems, and to correctly implement any machine learning method and not to use it as a black box. With proper investigation taken into account, the aforementioned use of new technologies, and the cooperation between different fields, we believe that machine learning methods will be a key feature in future weather forecasting. Bias correction, ensemble forecasting interpretation, better data assimilation, and the emulation of computationally costly parametrizations can help us achieve accurate, high-resolution NWP model forecasts. It is worth mentioning that, although all the methods referenced in this paper can be used with success in many applications, some of them, for example RF, require less knowledge in the field of machine learning and are more suitable for beginners, while DL or CNN needs more experience to be used properly. We agree with [111] that artificial intelligence will be a very important technique that will help in the monitoring and forecasting of weather conditions. Independent of operational use, those methods can be also highly valuable in climate change research at spatial- and temporal scales [112], although it will depend strongly on data availability, which over recent years has been constantly improving.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/atmos13020180/s1, Table S1: AI_2021_CLIMATE; Table S2: AI_2021_NWP.

Author Contributions

Conceptualization, B.B. and Z.U.; methodology, B.B.; software, B.B.; validation, B.B. and Z.U.; formal analysis, B.B.; investigation, Z.U.; resources, Z.U.; data curation, B.B.; writing—original draft preparation, B.B. and Z.U.; writing—review and editing, B.B. and Z.U.; visualization, B.B.; supervision, Z.U.; project administration, Z.U.; funding acquisition, Z.U. All authors have read and agreed to the published version of the manuscript.

Funding

The study was performed as part of the research task “Development of forecasting methods to improve existing products and develop new application solutions, task 4. Implementation and development of methods of analysis and meteorological forecasting for the needs of renewable energy sources (S-6/2021)” and “Modern climate warming and its impact on environment (DS-1/2021)”, financed by the Ministry of Science and Higher Education (Poland), statutory activity of the Institute of Meteorology and Water Management-National Research Institute.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available in Supplementary Materials https://www.mdpi.com/article/10.3390/atmos13020180/s1.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

ANNArtificial Neural Networks
CPUCentral Processing Unit
DLDeep Learning
ECMWFEuropean Centre for Medium-Range Weather Forecasts
GCMGeneral Circulation Model
GPUGraphics Processing Units
K-meansK-means Clustering
NOAANational Oceanic and Atmospheric Administration
NWPNumerical Weather Prediction
PCAPrincipal Component Analysis
PVPhotovoltaic
RFRandom Forest
SVMSupport Vector Machine
XGBXGBoost

References

  1. Fradkov, A.L. Early History of Machine Learning. IFAC-PapersOnLine 2020, 53, 1385–1390. [Google Scholar] [CrossRef]
  2. Mahesh, B. Machine Learning Algorithms—A Review; International Journal of Science and Research: Raipur, India, 2019. [Google Scholar] [CrossRef]
  3. Dhall, D.; Kaur, R.; Juneja, M. Machine Learning: A Review of the Algorithms and Its Applications. In Lecture Notes in Electrical Engineering, Proceedings of the ICRIC, Jammu, India, 8–9 March 2019; Singh, P.K., Kar, A.K., Singh, Y., Kolekar, M.H., Tanwar, S., Eds.; Springer International Publishing: Cham, Switzerland, 2020; pp. 47–63. [Google Scholar] [CrossRef]
  4. Singh, A.; Thakur, N.; Sharma, A. A Review of Supervised Machine Learning Algorithms. In Proceedings of the 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom), New Delhi, India, 16–18 March 2016; pp. 1310–1315. [Google Scholar]
  5. Iverson, L.R.; Prasad, A.M.; Liaw, A. New Machine Learning Tools for Predictive Vegetation Mapping after Climate Change: Bagging and Random Forest Perform Better than Regression Tree Analysis. In Landscape Ecology of Trees and Forests, Proceedings of the 12th Annual IALE(UK) Conference, Cirencester, UK, 21–24 June 2004; Smithers, R., Ed.; International Association for Landscape Ecology: Piestany, Slovakia, 2004; pp. 317–320. [Google Scholar]
  6. Huang, L.; Kang, J.; Wan, M.; Fang, L.; Zhang, C.; Zeng, Z. Solar Radiation Prediction Using Different Machine Learning Algorithms and Implications for Extreme Climate Events. Front. Earth Sci. 2021, 9, 202. [Google Scholar] [CrossRef]
  7. Abbot, J.; Marohasy, J. The Application of Machine Learning for Evaluating Anthropogenic versus Natural Climate Change. GeoResJ 2017, 14, 36–46. [Google Scholar] [CrossRef]
  8. Davenport, F.V.; Diffenbaugh, N.S. Using Machine Learning to Analyze Physical Causes of Climate Change: A Case Study of U.S. Midwest Extreme Precipitation. Geophys. Res. Lett. 2021, 48, e2021GL093787. [Google Scholar] [CrossRef]
  9. Tripathi, S.; Srinivas, V.V.; Nanjundiah, R.S. Downscaling of Precipitation for Climate Change Scenarios: A Support Vector Machine Approach. J. Hydrol. 2006, 330, 621–640. [Google Scholar] [CrossRef]
  10. Fang, Y.; Chen, H.; Lin, Y.; Zhao, C.; Lin, Y.; Zhou, F. Classification of Northeast China Cold Vortex Activity Paths in Early Summer Based on K-Means Clustering and Their Climate Impact. Adv. Atmos. Sci. 2021, 38, 400–412. [Google Scholar] [CrossRef]
  11. Zong, L.; Yang, Y.; Gao, M.; Wang, H.; Wang, P.; Zhang, H.; Wang, L.; Ning, G.; Liu, C.; Li, Y.; et al. Large-Scale Synoptic Drivers of Co-Occurring Summertime Ozone and PM2.5 Pollution in Eastern China. Atmos. Chem. Phys. 2021, 21, 9105–9124. [Google Scholar] [CrossRef]
  12. Silge, J.; Robinson, D. tidytext: Text Mining and Analysis Using Tidy Data Principles in R. J. Open Sour. Softw. 2016, 1, 37. [Google Scholar] [CrossRef] [Green Version]
  13. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2017; Available online: https://www.R-project.org/ (accessed on 17 December 2021).
  14. Kosovic, B.; Haupt, S.E.; Adriaansen, D.; Alessandrini, S.; Wiener, G.; Delle Monache, L.; Liu, Y.; Linden, S.; Jensen, T.; Cheng, W.; et al. A Comprehensive Wind Power Forecasting System Integrating Artificial Intelligence and Numerical Weather Prediction. Energies 2020, 13, 1372. [Google Scholar] [CrossRef] [Green Version]
  15. Dong, W.; Yang, Q.; Fang, X. Multi-Step Ahead Wind Power Generation Prediction Based on Hybrid Machine Learning Techniques. Energies 2018, 11, 1975. [Google Scholar] [CrossRef] [Green Version]
  16. Barque, M.; Martin, S.; Vianin, J.E.N.; Genoud, D.; Wannier, D. Improving Wind Power Prediction with Retraining Machine Learning Algorithms. In Proceedings of the 2018 International Workshop on Big Data and Information Security (IWBIS), Jakarta, Indonesia, 12–13 May 2018; pp. 43–48. [Google Scholar] [CrossRef]
  17. Ma, Y.-J.; Zhai, M.-Y. A Dual-Step Integrated Machine Learning Model for 24h-Ahead Wind Energy Generation Prediction Based on Actual Measurement Data and Environmental Factors. Appl. Sci. 2019, 9, 2125. [Google Scholar] [CrossRef] [Green Version]
  18. Wu, Y.-K.; Su, P.-E.; Wu, T.-Y.; Hong, J.-S.; Hassan, M.Y. Probabilistic Wind Power Forecasting Using Weather Ensemble Models. IEEE Trans. Ind. Appl. 2018, 54, 5609–5620. [Google Scholar] [CrossRef]
  19. Kim, D.; Hur, J. Short-Term Probabilistic Forecasting of Wind Energy Resources Using the Enhanced Ensemble Method. Energy 2018, 157, 211–226. [Google Scholar] [CrossRef]
  20. Donadio, L.; Fang, J.; Porté-Agel, F. Numerical Weather Prediction and Artificial Neural Network Coupling for Wind Energy Forecast. Energies 2021, 14, 338. [Google Scholar] [CrossRef]
  21. Bochenek, B.; Jurasz, J.; Jaczewski, A.; Stachura, G.; Sekuła, P.; Strzyżewski, T.; Wdowikowski, M.; Figurski, M. Day-Ahead Wind Power Forecasting in Poland Based on Numerical Weather Prediction. Energies 2021, 14, 2164. [Google Scholar] [CrossRef]
  22. Theocharides, S.; Venizelou, V.; Makrides, G.; Georghiou, G.E. Day-Ahead Forecasting of Solar Power Output from Photovoltaic Systems Utilising Gradient Boosting Machines. In Proceedings of the 2018 IEEE 7th World Conference on Photovoltaic Energy Conversion (WCPEC) (A Joint Conference of 45th IEEE PVSC, 28th PVSEC 34th EU PVSEC), Waikoloa, HI, USA, 10–15 June 2018; pp. 2371–2375. [Google Scholar] [CrossRef]
  23. Mellit, A. An Overview on the Application of Machine Learning and Deep Learning for Photovoltaic Output Power Forecasting. In Lecture Notes in Electrical Engineering, Proceedings of the 2nd International Conference on Electronic Engineering and Renewable Energy Systems, Saidia, Morocco, 13–15 April 2020; Hajji, B., Mellit, A., Marco, T.G., Rabhi, A., Launay, J., Naimi, S.E., Eds.; Springer: Singapore, 2021; pp. 55–68. [Google Scholar] [CrossRef]
  24. Mathe, J.; Miolane, N.; Sebastien, N.; Lequeux, J. PVNet: A LRCN Architecture for Spatio-Temporal Photovoltaic PowerForecasting from Numerical Weather Prediction. arXiv 2020, arXiv:1902.01453. [Google Scholar]
  25. Gómez, J.L.; Martínez, A.O.; Pastoriza, F.T.; Garrido, L.F.; Álvarez, E.G.; García, J.A.O. Photovoltaic Power Prediction Using Artificial Neural Networks and Numerical Weather Data. Sustainability 2020, 12, 10295. [Google Scholar] [CrossRef]
  26. Theocharides, S.; Makrides, G.; Livera, A.; Theristis, M.; Kaimakis, P.; Georghiou, G.E. Day-Ahead Photovoltaic Power Production Forecasting Methodology Based on Machine Learning and Statistical Post-Processing. Appl. Energy 2020, 268, 115023. [Google Scholar] [CrossRef]
  27. Kirkwood, C.; Economou, T.; Odbert, H.; Pugeault, N. A Framework for Probabilistic Weather Forecast Post-Processing across Models and Lead Times Using Machine Learning. Philos. Trans. Royal Soc. A Math. Phys. Eng. Sci. 2021, 379, 20200099. [Google Scholar] [CrossRef]
  28. Cho, D.; Yoo, C.; Im, J.; Cha, D.-H. Comparative Assessment of Various Machine Learning-Based Bias Correction Methods for Numerical Weather Prediction Model Forecasts of Extreme Air Temperatures in Urban Areas. Earth Sp. Sci. 2020, 7, e2019EA000740. [Google Scholar] [CrossRef] [Green Version]
  29. Belochitski, A.; Krasnopolsky, V. Stable Emulation of an Entire Suite of Model Physics in a State-of-the-Art GCM Using a Neural Network. arXiv 2021, arXiv:2103.07028. [Google Scholar]
  30. Walsh, E.; Bessardon, G.; Gleeson, E.; Ulmas, P. Using Machine Learning to Produce a Very High Resolution Land-Cover Map for Ireland. Adv. Sci. Res. 2021, 18, 65–87. [Google Scholar] [CrossRef]
  31. Seifert, A.; Rasp, S. Potential and Limitations of Machine Learning for Modeling Warm-Rain Cloud Microphysical Processes. J. Adv. Model. Earth Syst. 2020, 12, e2020MS002301. [Google Scholar] [CrossRef]
  32. Wang, Y.; Zeng, X.; Decker, J. A GPU-Accelerated Radiation Transfer Model Using the Lattice Boltzmann Method. Atmosphere 2021, 12, 1316. [Google Scholar] [CrossRef]
  33. Bodini, N.; Lundquist, J.K.; Optis, M. Can Machine Learning Improve the Model Representation of Turbulent Kinetic Energy Dissipation Rate in the Boundary Layer for Complex Terrain? Geosci. Model Dev. 2020, 13, 4271–4285. [Google Scholar] [CrossRef]
  34. Harder, P.; Watson-Parris, D.; Strassel, D.; Gauger, N.; Stier, P.; Keuper, J. Emulating Aerosol Microphysics with Machine Learning. arXiv 2021, arXiv:2109.10593. [Google Scholar]
  35. Moosavi, A.; Rao, V.; Sandu, A. A Learning Based Approach for Uncertainty Analysis in Numerical Weather Prediction Models. arXiv 2018, arXiv:1802.08055. [Google Scholar]
  36. Moosavi, A.; Rao, V.; Sandu, A. Machine Learning Based Algorithms for Uncertainty Quantification in Numerical Weather Prediction Models. J. Comput. Sci. 2021, 50, 101295. [Google Scholar] [CrossRef]
  37. O’Gorman, P.A.; Dwyer, J.G. Using Machine Learning to Parameterize Moist Convection: Potential for Modeling of Climate, Climate Change, and Extreme Events. J. Adv. Model. Earth Syst. 2018, 10, 2548–2563. [Google Scholar] [CrossRef] [Green Version]
  38. Yuval, J.; O’Gorman, P.A. Stable Machine-Learning Parameterization of Subgrid Processes for Climate Modeling at a Range of Resolutions. Nat. Commun. 2020, 11, 3295. [Google Scholar] [CrossRef]
  39. Pielke, R.A. Chapter 8—New Parameterization Approaches. In Mesoscale Meteorological Modeling; International Geophysics; Pielke, R.A., Ed.; Academic Press: Cambridge, MA, USA, 2013; Volume 98, pp. 225–235. [Google Scholar] [CrossRef]
  40. Brajard, J.; Carrassi, A.; Bocquet, M.; Bertino, L. Combining Data Assimilation and Machine Learning to Infer Unresolved Scale Parametrization. Philos. Trans. Royal Soc. A Math. Phys. Eng. Sci. 2021, 379, 20200086. [Google Scholar] [CrossRef] [PubMed]
  41. Watt-Meyer, O.; Brenowitz, N.D.; Clark, S.K.; Henn, B.; Kwa, A.; McGibbon, J.; Perkins, W.A.; Bretherton, C.S. Correcting Weather and Climate Models by Machine Learning Nudged Historical Simulations. Geophys. Res. Lett. 2021, 48, e2021GL092555. [Google Scholar] [CrossRef]
  42. Couvreux, F.; Hourdin, F.; Williamson, D.; Roehrig, R.; Volodina, V.; Villefranque, N.; Rio, C.; Audouin, O.; Salter, J.; Bazile, E.; et al. Process-Based Climate Model Development Harnessing Machine Learning: I. A Calibration Tool for Parameterization Improvement. J. Adv. Model. Earth Syst. 2021, 13, e2020MS002217. [Google Scholar] [CrossRef]
  43. Nowack, P.; Braesicke, P.; Haigh, J.; Abraham, N.L.; Pyle, J.; Voulgarakis, A. Using Machine Learning to Build Temperature-Based Ozone Parameterizations for Climate Sensitivity Simulations. Environ. Res. Lett. 2018, 13, 104016. [Google Scholar] [CrossRef]
  44. Alfonso, L.; Zamora, J.M. A Two-Moment Machine Learning Parameterization of the Autoconversion Process. Atmos. Res. 2021, 249, 105269. [Google Scholar] [CrossRef]
  45. Ukkonen, P.; Pincus, R.; Hogan, R.J.; Pagh Nielsen, K.; Kaas, E. Accelerating Radiation Computations for Dynamical Models With Targeted Machine Learning and Code Optimization. J. Adv. Model. Earth Syst. 2020, 12, e2020MS002226. [Google Scholar] [CrossRef]
  46. Beucler, T.; Rasp, S.; Pritchard, M.; Gentine, P. Achieving Conservation of Energy in Neural Network Emulators for Climate Modeling. arXiv 2019, arXiv:1906.06622. [Google Scholar]
  47. Ramadhan, A.; Marshall, J.; Souza, A.; Wagner, G.L.; Ponnapati, M.; Rackauckas, C. Capturing Missing Physics in Climate Model Parameterizations Using Neural Differential Equations. arXiv 2020, arXiv:2010.12559. [Google Scholar]
  48. Beucler, T.; Pritchard, M.; Yuval, J.; Gupta, A.; Peng, L.; Rasp, S.; Ahmed, F.; O’Gorman, P.A.; Neelin, J.D.; Lutsko, N.J.; et al. Climate-Invariant Machine Learning. arXiv 2021, arXiv:2112.08440. [Google Scholar]
  49. Krasnopolsky, V.M.; Fox-Rabinovitz, M.S. Complex Hybrid Models Combining Deterministic and Machine Learning Components for Numerical Climate Modeling and Weather Prediction. Neural Netw. 2006, 19, 122–134. [Google Scholar] [CrossRef]
  50. Gentine, P.; Pritchard, M.; Rasp, S.; Reinaudi, G.; Yacalis, G. Could Machine Learning Break the Convection Parameterization Deadlock? Geophy. Res. Lett. 2018, 45, 5742–5751. [Google Scholar] [CrossRef]
  51. Rasp, S. Coupled Online Learning as a Way to Tackle Instabilities and Biases in Neural Network Parameterizations. arXiv 2020, arXiv:1907.01351. [Google Scholar] [CrossRef] [Green Version]
  52. Brenowitz, N.D.; Beucler, T.; Pritchard, M.; Bretherton, C.S. Interpreting and Stabilizing Machine-Learning Parametrizations of Convection. J. Atmos. Sci. 2020, 77, 4357–4375. [Google Scholar] [CrossRef]
  53. Brenowitz, N.D.; Henn, B.; McGibbon, J.; Clark, S.K.; Kwa, A.; Perkins, W.A.; Watt-Meyer, O.; Bretherton, C.S. Machine Learning Climate Model Dynamics: Offline versus Online Performance. arXiv 2020, arXiv:2011.03081. [Google Scholar]
  54. Gagne, D.J., II; Christensen, H.M.; Subramanian, A.C.; Monahan, A.H. Machine Learning for Stochastic Parameterization: Generative Adversarial Networks in the Lorenz ’96 Model. J. Adv. Model. Earth Syst. 2020, 12, e2019MS001896. [Google Scholar] [CrossRef] [Green Version]
  55. Gettelman, A.; Gagne, D.J.; Chen, C.-C.; Christensen, M.W.; Lebo, Z.J.; Morrison, H.; Gantos, G. Machine Learning the Warm Rain Process. J. Adv. Model. Earth Syst. 2021, 13, e2020MS002268. [Google Scholar] [CrossRef]
  56. Silva, S.J.; Ma, P.-L.; Hardin, J.C.; Rothenberg, D. Physically Regularized Machine Learning Emulators of Aerosol Activation. Geosci. Model Dev. 2021, 14, 3067–3077. [Google Scholar] [CrossRef]
  57. Pielke, R., Sr.; Stokowski, D.; Wang, J.-W.; Vukicevic, T.; Leoncini, G.; Matsui, T.; Castro, C.L.; Niyogi, D.; Kishtawal, C.M.; Biazar, A.; et al. Satellite-Based Model Parameterization of Diabatic Heating. Eos Trans. Am. Geophys. Un. 2007, 88, 96–97. [Google Scholar] [CrossRef] [Green Version]
  58. Wang, X.; Han, Y.; Xue, W.; Yang, G.; Zhang, G.J. Stable Climate Simulations Using a Realistic GCM with Neural Network Parameterizations for Atmospheric Moist Physics and Radiation Processes. Geosci. Model Dev. Discuss. 2021, 1–35. [Google Scholar] [CrossRef]
  59. Yuval, J.; O’Gorman, P.A.; Hill, C.N. Use of Neural Networks for Stable, Accurate and Physically Consistent Parameterization of Subgrid Atmospheric Processes With Good Performance at Reduced Precision. Geophys. Res. Lett. 2021, 48, e2020GL091363. [Google Scholar] [CrossRef]
  60. Anochi, J.A.; de Almeida, V.A.; de Campos Velho, H.F. Machine Learning for Climate Precipitation Prediction Modeling over South America. Remote Sens. 2021, 13, 2468. [Google Scholar] [CrossRef]
  61. Anderson, G.J.; Lucas, D.D. Machine Learning Predictions of a Multiresolution Climate Model Ensemble. Geophys. Res. Lett. 2018, 45, 4273–4280. [Google Scholar] [CrossRef]
  62. Mansfield, L.A.; Nowack, P.J.; Kasoar, M.; Everitt, R.G.; Collins, W.J.; Voulgarakis, A. Predicting Global Patterns of Long-Term Climate Change from Short-Term Simulations Using Machine Learning. NPJ Clim. Atmos. Sci. 2020, 3, 1–9. [Google Scholar] [CrossRef]
  63. Hourdin, F.; Williamson, D.; Rio, C.; Couvreux, F.; Roehrig, R.; Villefranque, N.; Musat, I.; Fairhead, L.; Diallo, F.B.; Volodina, V. Process-Based Climate Model Development Harnessing Machine Learning: II. Model Calibration From Single Column to Global. J. Adv. Model. Earth Syst. 2021, 13, e2020MS002225. [Google Scholar] [CrossRef]
  64. Bochenek, B.; Ustrnul, Z.; Wypych, A.; Kubacka, D. Machine Learning-Based Front Detection in Central Europe. Atmosphere 2021, 12, 1312. [Google Scholar] [CrossRef]
  65. Liu, Y.; Racah, E.; Prabhat; Correa, J.; Khosrowshahi, A.; Lavers, D.; Kunkel, K.; Wehner, M.; Collins, W. Application of Deep Convolutional Neural Networks for Detecting Extreme Weather in Climate Datasets. arXiv 2016, arXiv:1605.01156. [Google Scholar]
  66. Biard, J.C.; Kunkel, K.E. Automated Detection of Weather Fronts Using a Deep Learning Neural Network. Adv. Stat. Climatol. Meteorol. Oceanogr. 2019, 5, 147–160. [Google Scholar] [CrossRef] [Green Version]
  67. Niebler, S.; Miltenberger, A.; Schmidt, B.; Spichtinger, P. Automated Detection and Classification of Synoptic Scale Fronts from Atmospheric Data Grids. Weather. Clim. Dyn. Discuss. [Preprint] 2021, 1–28. [Google Scholar] [CrossRef]
  68. Knighton, J.; Pleiss, G.; Carter, E.; Lyon, S.; Walter, M.T.; Steinschneider, S. Potential Predictability of Regional Precipitation and Discharge Extremes Using Synoptic-Scale Climate Information via Machine Learning: An Evaluation for the Eastern Continental United States. J. Hydrometeorol. 2019, 20, 883–900. [Google Scholar] [CrossRef]
  69. Hu, H.; Ayyub, B.M. Machine Learning for Projecting Extreme Precipitation Intensity for Short Durations in a Changing Climate. Geosciences 2019, 9, 209. [Google Scholar] [CrossRef] [Green Version]
  70. Elsanabary, M.H.; Gan, T.Y. Wavelet Analysis of Seasonal Rainfall Variability of the Upper Blue Nile Basin, Its Teleconnection to Global Sea Surface Temperature, and Its Forecasting by an Artificial Neural Network. Mon. Weather Rev. 2014, 142, 1771–1791. [Google Scholar] [CrossRef]
  71. Ghaderpour, E.; Vujadinovic, T.; Hassan, Q.K. Application of the Least-Squares Wavelet Software in Hydrology: Athabasca River Basin. J. Hydrol. Reg. Stud. 2021, 36, 100847. [Google Scholar] [CrossRef]
  72. Coulibaly, P.; Burn, D.H. Wavelet Analysis of Variability in Annual Canadian Streamflows. Water Resour. Res. 2004, 40. [Google Scholar] [CrossRef]
  73. Rolnick, D.; Donti, P.L.; Kaack, L.H.; Kochanski, K.; Lacoste, A.; Sankaran, K.; Ross, A.S.; Milojevic-Dupont, N.; Jaques, N.; Waldman-Brown, A.; et al. Tackling Climate Change with Machine Learning. arXiv 2019, arXiv:1906.05433. [Google Scholar]
  74. Schneider, T.; Lan, S.; Stuart, A.; Teixeira, J. Earth system modeling 2.0: A blueprint for models that learn from observations and targeted high-resolution simulations. Geophys. Res. Lett. 2017, 44, 12396–12417. [Google Scholar] [CrossRef] [Green Version]
  75. Reichstein, M.; Camps-Valls, G.; Stevens, B.; Jung, M.; Denzler, J.; Carvalhais, N.; Prabhat. Deep learning and process understanding for data-driven Earth system science. Nature 2019, 566, 195–204. [Google Scholar] [CrossRef]
  76. Huntingford, C.; Jeffers, E.S.; Bonsall, M.B.; Christensen, H.M.; Lees, T.; Yang, H. Machine Learning and Artificial Intelligence to Aid Climate Change Research and Preparedness. Environ. Res. Lett. 2019, 14, 124007. [Google Scholar] [CrossRef] [Green Version]
  77. Dijkstra, H.A.; Petersik, P.; Hernández-García, E.; López, C. The Application of Machine Learning Techniques to Improve El Niño Prediction Skill. Front. Phys. 2019, 7, 153. [Google Scholar] [CrossRef] [Green Version]
  78. Mohammadiziazi, R.; Bilec, M.M. Application of Machine Learning for Predicting Building Energy Use at Different Temporal and Spatial Resolution under Climate Change in USA. Buildings 2020, 10, 139. [Google Scholar] [CrossRef]
  79. Piryonesi, S.M.; El-Diraby, T. Climate Change Impact on Infrastructure: A Machine Learning Solution for Predicting Pavement Condition Index. Constr. Build. Mater. 2021, 306, 124905. [Google Scholar] [CrossRef]
  80. Leng, G.; Hall, J.W. Predicting Spatial and Temporal Variability in Crop Yields: An Inter-Comparison of Machine Learning, Regression and Process-Based Models. Environ. Res. Lett. 2020, 15, 044027. [Google Scholar] [CrossRef] [PubMed]
  81. Feng, P.; Wang, B.; Liu, D.L.; Waters, C.; Yu, Q. Incorporating Machine Learning with Biophysical Model Can Improve the Evaluation of Climate Extremes Impacts on Wheat Yield in South-Eastern Australia. Agric. For. Meteorol. 2019, 275, 100–113. [Google Scholar] [CrossRef]
  82. Yu, T.; Zhang, Q.; Sun, R. Comparison of Machine Learning Methods to Up-Scale Gross Primary Production. Remote Sens. 2021, 13, 2448. [Google Scholar] [CrossRef]
  83. Duan, Z.; Yang, Y.; Zhou, S.; Gao, Z.; Zong, L.; Fan, S.; Yin, J. Estimating Gross Primary Productivity (GPP) over Rice–Wheat-Rotation Croplands by Using the Random Forest Model and Eddy Covariance Measurements: Upscaling and Comparison with the MODIS Product. Remote Sens. 2021, 13, 4229. [Google Scholar] [CrossRef]
  84. Zeng, J.; Matsunaga, T.; Tan, Z.-H.; Saigusa, N.; Shirai, T.; Tang, Y.; Peng, S.; Fukuda, Y. Global Terrestrial Carbon Fluxes of 1999–2019 Estimated by Upscaling Eddy Covariance Data with a Random Forest. Sci. Data 2020, 7, 313. [Google Scholar] [CrossRef]
  85. Zeng, Z.; Wang, Z.; Gui, K.; Yan, X.; Gao, M.; Luo, M.; Geng, H.; Liao, T.; Li, X.; An, J.; et al. Daily Global Solar Radiation in China Estimated From High-Density Meteorological Observations: A Random Forest Model Framework. Earth Space Sci. 2020, 7, e2019EA001058. [Google Scholar] [CrossRef] [Green Version]
  86. Yang, Y.; Zhang, M.; Li, Q.; Chen, B.; Gao, Z.; Ning, G.; Liu, C.; Li, Y.; Luo, M. Modulations of Surface Thermal Environment and Agricultural Activity on Intraseasonal Variations of Summer Diurnal Temperature Range in the Yangtze River Delta of China. Sci. Environ. 2020, 736, 139445. [Google Scholar] [CrossRef] [PubMed]
  87. Li, X.; Yang, Y.; Mi, J.; Bi, X.; Zhao, Y.; Huang, Z.; Liu, C.; Zong, L.; Li, W. Leveraging Machine Learning for Quantitative Precipitation Estimation from Fengyun-4 Geostationary Observations and Ground Meteorological Measurements. Atmos. Meas. Tech. 2021, 14, 7007–7023. [Google Scholar] [CrossRef]
  88. Ustrnul, Z. Zmienność Cyrkulacji Atmosfery na Półkuli Północnej w XX Wieku (Variability of the Atmospheric Circulation on the Northern Hemisphere in the 20th Century); Materiały Badawcze, seria Meteorologia; IMGW: Warszawa, Poland, 1997; Volume 27, p. 208, (In Polish, Eng. Summary). [Google Scholar]
  89. Tveito, O.E.; Ustrnul, Z. A Review of The Use of Large-Scale Atmospheric Circulation Classification in Spatial Climatology; KLIMA; DNMI: Oslo, Norway, 3 October 2003; p. 27. [Google Scholar]
  90. Huth, R.; Beck, C.; Philipp, A.; Demuzere, M.; Ustrnul, Z.; Cahynová, M.; Kyselý, J.; Tveito, O.E. Classifications of atmospheric circulation patterns: Recent advances and applications. Ann. N. Y. Acad. Sci. 2008, 1146, 105–152. [Google Scholar] [CrossRef]
  91. Ustrnul, Z.; Wypych, A.; Czekierda, D. Composite circulation index of weather extremes (the example for Poland). Meteorol. Z. 2013, 22, 551–559. [Google Scholar] [CrossRef]
  92. Gordon, A.D. Classification; Chapman & Hall: London, UK; New York, NY, USA; Washington, DC, USA, 1999. [Google Scholar]
  93. Everitt, B.S.; Landau, S.; Leese, M. Cluster Analysis; Arnold: London, UK; Oxford University Press: New York, NY, USA, 2001. [Google Scholar]
  94. Bednorz, E.; Kolendowicz, L.; Bielec-Bąkowska, Z.; Bokwa, A.; Żelazny, M.; Kicińska, B.; Lewik, P.; Nowosad, M.; Ustrnul, Z. Regionalizacje, typologie i wydzielanie sezonów klimatycznych z zastosowaniem analizy skupień (Regionalisations, typologies and climatic seasons with application of cluster analysis). Prz. Geofiz. 2003, 1–2, 11–32, (In Polish, Eng. Summary). [Google Scholar]
  95. Sheridan, S.C.; Lee, C.C. The self-organizing map in synoptic climatological research. Prog. Phys. Geogr. Earth Environ. 2011, 35, 109–119. [Google Scholar] [CrossRef] [Green Version]
  96. Lee, C.C.; Sheridan, S.C. A six-step approach to developing future synoptic classifications based on GCM output. Int. J. Climatol. 2012, 32, 1792–1802. [Google Scholar] [CrossRef]
  97. Davis, R.E.; Kalkstein, L.S. Development of an automated spatial synoptic classification. Int. J. Climatol. 1990, 10, 769–794. [Google Scholar] [CrossRef]
  98. Wilks, S.D. Statistical Methods in the Atmospheric Sciences, an Introduction; Academic Press: San Diego, CA, USA, 1995; p. 464. [Google Scholar]
  99. Oliver, J.E. The history, status, and future of climatic classifications. Phys. Geogr. 1991, 12, 231–251. [Google Scholar] [CrossRef]
  100. Yarnal, B. Synoptic Climatology in Environmental Analysis: A Primer; Belhaven Press: London, UK, 1993; pp. xv + 195. [Google Scholar]
  101. Barry, R.G.; Carleton, A.M. Synoptic and Dynamic Climatology; Routledge: London, UK, 2001; p. 664. [Google Scholar] [CrossRef]
  102. Yarnal, B.; Comrie, A.C.; Frakes, B.; Brown, D.P. Developments and prospects in synoptic climatology. Int. J. Climatol. 2001, 21, 1923–1950. [Google Scholar] [CrossRef]
  103. Palarz, A.; Luterbacher, J.; Ustrnul, Z.; Xoplaki, E.; Celiński-Myslaw, D. Representation of low-tropospheric temperature inversions in ECMWF reanalyses over Europe. Environ. Res. Lett. 2020, 15, 074043. [Google Scholar] [CrossRef]
  104. Hertz, J.; Krogh, A.; Palmer, R.G. Introduction to the Theory of Neural Computation; Addison-Wesley Publ. Company: Reading, MA, USA, 1991. [Google Scholar] [CrossRef] [Green Version]
  105. Pasini, A.; Pelino, V.; Potesta, S. A neural network model for visibility nowcasting from surface observations: Results and sensitivity to physical input variables. J. Geophys. Res. Atmos. 2001, 106, 14951–14960. [Google Scholar] [CrossRef]
  106. Domańska, D.; Wojtylak, M. Application of fuzzy time series models for forecasting pollution concentrations. Expert. Syst. Appl. 2012, 39, 7673–7679. [Google Scholar] [CrossRef]
  107. Czernecki, B.; Marosz, M.; Jędruszkiewicz, J. Assessment of Machine Learning Algorithms in Short-Term Forecasting of PM10 and PM2.5 Concentrations in Selected Polish Agglomerations. Aerosol Air Qual. Res. 2021, 21, 200586. [Google Scholar] [CrossRef]
  108. Zalewska, T.; Biernacik, D.; Marosz, M. Correlations between 7Be, 210Pb, Dust and PM10 Concentrations in Relation to Meteorological Conditions in Northern Poland in 1998–2018. J. Environ. Radioact. 2021, 228, 106526. [Google Scholar] [CrossRef] [PubMed]
  109. Dixon, P.G.; Allen, M.; Gosling, S.N.; Hondula, D.M.; Vijendra, I.; Lucas, R.; Vanos, J. Perspectives on the Synoptic Climate Classification and its Role in Interdisciplinary Research. Geogr. Compass 2016, 10, 147–164. [Google Scholar] [CrossRef] [Green Version]
  110. Bauer, P.; Dueben, P.D.; Hoefler, T.; Quintino, T.; Schulthess, T.C.; Wedi, N.P. The Digital Revolution of Earth-System Science. Nat. Comput. Sci. 2021, 1, 104–113. [Google Scholar] [CrossRef]
  111. Dewitte, S.; Cornelis, J.P.; Müller, R.; Munteanu, A. Artificial Intelligence Revolutionises Weather Forecast, Climate Monitoring and Decadal Prediction. Remote Sens. 2021, 13, 3209. [Google Scholar] [CrossRef]
  112. Mock, C.J.; Shinker, J.J. Paleoclimate—Modern Analog Approaches in Paleoclimatology. In Paleoclimate, Encyclopedia of Quaternary Science, 2nd ed.; Elias, S.A., Mock, C.J., Eds.; Elsevier: Amsterdam, The Netherlands, 2013; pp. 102–112. [Google Scholar]
Figure 1. Artificial intelligence, machine learning, and their classification. ANN—Artificial Neural Networks, DL—Deep Learning, RF—Random Forest, XGB—XGBoost, K-means—K-means Clustering, PCA—Principal Component Analysis.
Figure 1. Artificial intelligence, machine learning, and their classification. ANN—Artificial Neural Networks, DL—Deep Learning, RF—Random Forest, XGB—XGBoost, K-means—K-means Clustering, PCA—Principal Component Analysis.
Atmosphere 13 00180 g001
Figure 2. Most common phrases from articles related to numerical weather prediction and machine learning.
Figure 2. Most common phrases from articles related to numerical weather prediction and machine learning.
Atmosphere 13 00180 g002
Figure 3. Most common phrases from articles related to climate and machine learning.
Figure 3. Most common phrases from articles related to climate and machine learning.
Atmosphere 13 00180 g003
Figure 4. Most common meteorological fields in NWP-related articles.
Figure 4. Most common meteorological fields in NWP-related articles.
Atmosphere 13 00180 g004
Figure 5. Most common methods in NWP-related articles.
Figure 5. Most common methods in NWP-related articles.
Atmosphere 13 00180 g005
Figure 6. Most common countries examined in climate-related articles.
Figure 6. Most common countries examined in climate-related articles.
Atmosphere 13 00180 g006
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Bochenek, B.; Ustrnul, Z. Machine Learning in Weather Prediction and Climate Analyses—Applications and Perspectives. Atmosphere 2022, 13, 180. https://doi.org/10.3390/atmos13020180

AMA Style

Bochenek B, Ustrnul Z. Machine Learning in Weather Prediction and Climate Analyses—Applications and Perspectives. Atmosphere. 2022; 13(2):180. https://doi.org/10.3390/atmos13020180

Chicago/Turabian Style

Bochenek, Bogdan, and Zbigniew Ustrnul. 2022. "Machine Learning in Weather Prediction and Climate Analyses—Applications and Perspectives" Atmosphere 13, no. 2: 180. https://doi.org/10.3390/atmos13020180

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop