Next Article in Journal
Phytoextraction Potential of Sunn Hemp, Sunflower, and Marigold for Carbaryl Contamination: Hydroponic Experiment
Next Article in Special Issue
Indoor Exposure and Regional Inhaled Deposited Dose Rate during Smoking and Incense Stick Burning—The Jordanian Case as an Example for Eastern Mediterranean Conditions
Previous Article in Journal
The Ecotoxicology of Microbial Insecticides and Their Toxins in Genetically Modified Crops: An Overview
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Air Quality Modeling with the Use of Regression Neural Networks

Faculty of Infrastructure and Environment, Czestochowa University of Technology, 69 Dabrowskiego St., 42-200 Czestochowa, Poland
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2022, 19(24), 16494; https://doi.org/10.3390/ijerph192416494
Submission received: 2 November 2022 / Revised: 26 November 2022 / Accepted: 6 December 2022 / Published: 8 December 2022
(This article belongs to the Special Issue The 2nd Edition: Outdoor and Indoor Air Quality)

Abstract

:
Air quality is assessed on the basis of air monitoring data. Monitoring data are often not complete enough to carry out an air quality assessment. To fill the measurement gaps, predictive models can be used, which enable the approximation of missing data. Prediction models use historical data and relationships between measured variables, including air pollutant concentrations and meteorological factors. The known predictive air quality models are not accurate, so it is important to look for models that give a lower approximation error. The use of artificial neural networks reduces the prediction error compared to classical regression methods. In previous studies, a single regression model over the entire concentration range was used to approximate the concentrations of a selected pollutant. In this study, it was assumed that not a single model, but a group of models, could be used for the prediction. In this approach, each model from the group was dedicated to a different sub-range of the concentration of the modeled pollutant. The aim of the analysis was to check whether this approach would improve the quality of modeling. A long-term data set recorded at two air monitoring stations in Poland was used in the examination. Hourly data of basic air pollutants and meteorological parameters were used to create predictive regression models. The prediction errors for the sub-range models were compared with the corresponding errors calculated for one full-range regression model. It was found that the application of sub-range models reduced the modeling error of basic air pollutants.

1. Introduction

Air pollution is considered one of the main factors affecting the human population and the environment. The World Health Organization estimates that many millions of people die prematurely due to poor air quality [1]. Once pollutants are emitted into the air, it is impossible to stop them. If pollutants get into the atmosphere, they contribute to the deterioration of air quality in the vicinity of the emission, but by spreading they can have negative effects hundreds and thousands of kilometers from the point of emission. Therefore, air pollution is treated as a global threat and emission reduction strategies are implemented in many countries. It should be highlighted that, in Poland, the PM10 (particulate matter) concentration still exceeds the permissible limits and causes premature deaths of over 40,000 people every year [2].
Air pollution causes negative changes in the human respiratory and circulatory systems, even when pollutant concentrations do not exceed permissible levels [3,4,5,6,7,8]. It may trigger various reactions of organisms, including mental health disorders [9,10]. It was also found that air pollution can negatively affect the economy [11,12,13,14]. High concentrations of air pollutants have a negative effect on plants, which is manifested in the reduction of crop yields in agriculture [15,16].
The control and reduction of anthropogenic emissions are now recognized as the keys to good global air quality. An important element of the control system is the assessment of air quality. This task is performed using air quality monitoring. Air monitoring includes continuous measurements of air pollutants. The main air pollutants include O3, SO2, NOx, CO and PM10 [17]. Concentrations of the mentioned pollutants can be measured with automatic air monitoring stations. These types of stations are often equipped with devices that also continuously measure meteorological data such as temperature, wind direction, wind speed and solar radiation. Measurement results at automatic air monitoring stations are recorded in the form of 1 h averages, e.g., hourly concentrations, hourly temperature, etc. In the EU, average hourly concentrations are the basis for calculating averages over longer periods of time that are required for air quality standards, such as 8 h, 24 h and annual concentrations [18]. The system acquires and collects measurement data of the air pollutants’ concentration levels at many individual air monitoring stations, according to the standardized measurement methods [19,20,21,22,23]. The collected hourly concentrations in the air monitoring system constitute the basis for direct and indirect statistical evaluation of air quality in the zones represented with individual monitoring stations, in accordance with the procedures described in the relevant legal acts [18]. Correct assessment requires a high degree of completeness of the time series of concentrations obtained at the monitoring stations. Usually, the completeness should exceed 90% in an annual series of hourly measurements. Unfortunately, monitoring data are never 100% complete in annual terms, and often they do not even have the completeness required by the regulations. When there is a deficit of data in a series of measurements, then there is a need to complete the missing data.
Missing data can be supplemented by introducing modeled concentrations in the measurement gaps [24,25,26,27]. The time series data obtained with air monitoring have specific characteristics. All measurements, both of concentrations and meteorological parameters, are performed simultaneously and are recorded in similar time series. Therefore, methods based on auto-regression and regression can be used for modeling monitoring concentrations. If historical data from the selected air monitoring station are available, they can be used to explore the knowledge hidden in them. Autonomous models of this type can be accurate and have a very significant advantage: the approximation of concentrations does not require external data from outside the monitoring system [26,27]. In the first models, classical statistical regression techniques were used [28]. Classical methods are increasingly being replaced by methods that use machine learning artificial intelligence, including artificial neural networks (ANNs) [29,30,31,32,33,34,35,36,37]. ANN models enable a deeper exploration of knowledge hidden in historical data and, as a result, more accurate concentration predictions.
In regression modeling, it was found that the application of one neural network to the entire range of concentrations of the predicted pollutant resulted in different prediction accuracies in the concentration sub-ranges [38,39]. It was considered advisable to replace one neural network with several networks (sub-models), each of which would be adjusted to specific concentration sub-ranges [39]. The use of several sub-range models should improve the accuracy of the prediction. This paper presents an analysis that verifies the above thesis.
The main aim of the study was to improve the accuracy of prediction of air pollutant concentrations in neural regression models, using many predictive models created for various sub-ranges of air pollutant concentrations. The analyzed data came from two air monitoring stations in the Upper Silesian Region, Poland. The quality of prediction was assessed separately for the concentrations of six main air pollutants: O3, NO, NO2, SO2, PM10 and CO. Multi-layer perceptrons with an identical architecture were used to model all pollutants. The predicted concentrations were compared with the observed ones to estimate the prediction error. The prediction errors were calculated for various sub-models, and, based on them, the prediction error was estimated over the entire concentration range. This error was compared with the approximation error obtained from a single model covering the entire concentration range.

2. Materials and Methods

2.1. Air Monitoring Data

Hourly data from two air monitoring stations located in a highly industrialized region of Poland were used for the analysis. The data were recorded in Zabrze and Złoty Potok in the years 2011–2016. The location of selected monitoring stations is shown in Figure 1.
The city of Zabrze is one of the most polluted towns in Poland and throughout the EU. The station at Zabrze was an urban background monitoring site. Złoty Potok is a rural town, located outside the Upper Silesian Agglomeration. In Złoty Potok, there is a background monitoring station for the Upper Silesian Region. The data were provided by the Voivodeship Inspectorate of Environmental Protection in Katowice. Time series data, including 1 h average values of O3, NO, NO2, SO2, PM10 and CO concentrations, as well as meteorological data for temperature, wind speed, solar radiation and relative humidity, were recorded. The analyzed data sets are not publicly available. We received them on an individual request from the Voivodeship Inspectorate for Environmental Protection. Time series of concentrations are available on the website of the Chief Inspectorate of Environmental Protection: https://powietrze.gios.gov.pl/pjp/archives, accessed on 30 April 2022. This is only part of the data. Meteorological data are not available online. The data also include two variables describing the time: day and hour. These two variables were converted to numeric form following the procedure described in [39].
The following symbols were used to describe the time series (variables):
D—numeric day,
H—numeric hour,
O3—hourly average of O3 concentration (µg/m3),
NO—hourly average of NO concentration (µg/m3),
NO2—hourly average of NO2 concentration (µg/m3),
SO2—hourly average of SO2 concentration (µg/m3),
CO—hourly average of CO concentration (mg/m3),
PM10—hourly average of PM10 concentration (µg/m3),
WS—hourly average of wind speed (m/s),
T—hourly average of air temperature (°C),
I—hourly average of solar radiation intensity (W/m2),
RH—hourly average of relative humidity (%).
Descriptive statistics of the 6-year set of hourly concentrations of monitored air pollutants are presented in Table 1.

2.2. Regression Models

Multi-layer perceptrons with identical architecture were used to model all pollutants. The predicted concentrations were compared with the observed ones to estimate the prediction error. The prediction errors were calculated for various sub-models, and, based on them, the prediction error was estimated over the entire concentration range. This error was compared with the approximation error obtained from a single model covering the entire concentration range. For all air pollutants, similar perceptron models were created. The output of this was the concentration of the chosen air pollutant (explained variable), and the inputs were the date and hour, concentrations of other air pollutants and meteorological parameters (explanatory variables). Table 2 presents the predicted variables and predictors, with separate lines for individual models. For example, the following 11 input variables were used to model the ozone concentration in Złoty Potok: H, D, NO, NO2, SO2, CO, PM10, WS, T, I and RH.
For regression modeling, artificial neural networks in the form of multi-layer perceptrons were used. Each perceptron consisted of input neurons, with 10 neurons in one hidden layer and one output neuron. Figure 2 shows such a perceptron used to model O3 concentrations. The Broyden–Fletcher–Goldfarb-Shanno (BFGS) algorithm was used in the learning process. The Broyden–Fletcher–Goldfarb–Shanno algorithm is used in problems related to numerical optimization [40]. The BFGS algorithm was developed on the basis of solutions proposed in 1970 by the four mathematicians mentioned in its name [41,42,43,44]. The algorithm uses an iterative method of solving unlimited non-linear optimization problems.
The learning process was always limited to 300 epochs. An epoch is a specialized expression related to the learning process of a neural network. In the network learning process, one epoch means a single learning cycle. In the learning process, the network repeats the cycles many times in order to minimize the learning error. The function of activating hidden and output neurons was a logistic function. The scales were initiated randomly. The network initialization was random (Gaussian). Modeling was performed using the Statistica program, version 13.3. Each prediction was performed 5 times. Repeating the training of the neural network, while maintaining the same parameters of the learning process, is a routine procedure. Each model’s learning process is somewhat random and leads to a different network. The created networks have the same structure of neurons, but differ in terms of weights and the degree of activation of individual neurons. In general, they differ slightly in the modeling error. The most accurate of the 5 created models was selected for reporting. The other models were rejected. The sum of squares (SOS) was assumed as the error function. SOS is the sum of the squared distances of all predicted values from the actual values.

2.3. Preparation of Data for Modeling

The complete set of hourly air monitoring data from the 6-year measurement period should have included 52,608 cases (hourly observations). Prior to modeling, raw data from the air monitoring database were prepared by removing the cases where there were missing data. After removing the cases with missing data, a set of cases with complete data was obtained for further analysis. This set was called the full-range set. The full-range sets included 36,460 and 15,536 cases, for Zabrze and Złoty Potok stations, respectively. Before starting the learning process, each data set was divided into three subsets: the training subset consisted of 70% of the cases, the validation subset consisted of 15% of the cases and the test subset consisted of 15% of the cases.
Two approaches were adopted during modeling and, therefore, two groups of models were developed for each of the stations. In the first approach, all cases in the data set were sorted according to the real value of the predicted variable, from the lowest concentration to the highest. Then, a set of cases sorted in this way was divided into subsets. The entire set of cases was divided gradually into 2, then 4, and finally 8 subsets of the same size. The subsets prepared in this way differed in the ranges of the real concentration values of the predicted variable. For each of these subsets, regression models called RVS (Real Values Sorting) were created. As a result, for each set/subset, a separate predictive model was obtained, marked with the same symbol as the modeled set/subset. The name of the monitoring station (ZAB or ZP) was also added to the designations of sets and subsets. The full-range model was marked as RVS-1/1. As a result of the division of the full-range set, the following subsets and submodels were obtained:
  • − Two sub-models (RVS-1/2, RVS-2/2), after division into two subsets;
  • − Four sub-models (RVS-1/4, RVS-2/4, RVS-3/4, RVS-4/4), after division into four subsets;
  • − Eight sub-models (RVS-1/8, RVS-2/8, RVS-3/8, RVS-4/8, RVS-5/8, RVS-6/8, RVS-7/8, RVS-8/8), after division into eight subsets.
In a situation where the real concentrations of the pollutant to be modeled were not known, the RVS-type models could not be used because it was not possible to sort and classify the cases into the real concentration sub-ranges. Therefore, RVS models reflect only the potential, not practical, opportunities to improve the quality of modeling through segmentation of the prediction process. If the real concentrations are not known and there is a need to predict them, then the RVS sub-range models are not available. In such a situation, a different approach can be proposed, which also uses division into sub-ranges, but then with sectoral modeling in designated sub-ranges. The most important step in this approach is the initial modeling of the concentrations of the selected pollutant in the full-range set, understood as a set of all cases containing complete data of all explanatory variables (model inputs) for the modeled pollutant. After initial modeling of the entire range of cases, predictive concentrations of the dependent variable are obtained. Then, the entire set of cases is sorted according to the increasing predictive concentrations of the modeled pollutant. The next step is to divide the sorted full-range set of cases into a specified number of equal sub-ranges. In this way, you can generate a model called PVS (Predicted Values Sorting). As a result of the division of the full-range set, the following subsets and submodels were obtained:
  • − Two sub-models (PVS-1/2, PVS-2/2), after division into two subsets;
  • − Four sub-models (PVS-1/4, PVS-2/4, PVS-3/4, PVS-4/4), after division into four subsets;
  • − Eight sub-models (PVS-1/8, PVS-2/8, PVS-3/8, PVS-4/8, PVS-5/8, PVS-6/8, PVS-7/8, PVS-8/8), after division into eight subsets.
The scheme of dividing the full-range set into sub-ranges is shown in Figure 3 and Figure 4 for both monitoring stations.
Thanks to the divisions of the full-range set, it was possible to check how the modeling accuracy changed in the concentration sub-ranges, and whether the modeling carried out in the sub-ranges would improve the modeling quality in relation to the full-range modeling.

2.4. Assessment of the Approximation Error

To assess the accuracy of the obtained regression models, the MAE and RMSE values were used, which were calculated on the basis of the discrepancy between the actual and predicted values. The formulas for calculating individual errors are presented in Equations (1) and (2).
MAE—mean absolute error
M A E = 1 n i = 1 n | x i y i |
RMSE—root-mean-squared error
R M S E = i = 1 n ( x i y i ) 2 n
where n—number of cases, y—predicted concentrations, x—real concentrations, i—the case number.

3. Results

For each pollutant, modeling errors were calculated in relation to the real pollutant concentrations (O3, NO, NO2, SO2, CO and PM10 for the Zabrze station; and O3, NO, NO2, SO2 and PM10 for the Zloty Potok station). To assess the modeling accuracy, two error measures were calculated: MAE and RMSE.

3.1. The Results of the Modeling of O3 Concentrations

Table 3, Table 4, Table 5 and Table 6 show the results of O3 concentration predictions obtained with the full-range and sub-range models. The errors of the PVS models for the Zabrze monitoring station are presented in Table 3 and the Złoty Potok monitoring station in Table 4. Similar lists of errors for the RVS models are presented in Table 5 and Table 6. The presented results show that the sub-range modeling errors changed for individual sub-ranges. Regardless of the number of sub-range models, the modeling error usually increased with increasing concentration values in the sub-ranges. A good way to assess the quality of the prediction is to compare the overall prediction error over the entire range of concentrations of the modeled pollutant, and not in individual sub-ranges. Therefore, the tables include average values for the entire ranges: “overall MAE” and “overall RMSE”.
Division into the sub-ranges generally improved the accuracy of prediction, especially in the case of RVS models. The exceptions were the eight-sub-range PVS models for both monitoring stations. In the case of Zabrze, the same average value of MAE was recorded in the eight-sub-range models as in the four-sub-range models (8.02 µg/m3). In the case of Zloty Potok, even an increase in the overall values of MAE and RMSE errors was observed in the eight-sub-range models compared to the overall error of the four-sub-range models. In the case of RVS models, a decrease in the overall values of MAE and RMSE was always observed with an increase in the number of sub-ranges. The comparison showed that dividing the area into sub-ranges and separately modeling these sub-ranges improved the overall quality of the prediction, but that having too many sub-ranges and sub-models may be ineffective.
The values of “overall MAE” and “overall RMSE” depending on the number of sub-ranges are presented in Figure 5, Figure 6, Figure 7 and Figure 8.

3.2. The Results of the Modeling of NO Concentrations

Table 7, Table 8, Table 9 and Table 10 show the results of NO concentration predictions obtained with the full-range and sub-range models. The errors of the PVS models for the Zabrze monitoring station are presented in Table 7 and the Złoty Potok monitoring station in Table 8. The corresponding errors for the RVS models are presented in Table 9 and Table 10. The division into sub-ranges generally improved the accuracy of prediction, especially in the case of RVS models. In general, as the number of sub-models increased, the overall measures of modeling error decreased. The exceptions were the eight-sub-range PVS models for the monitoring station at Złoty Potok. At this station, the overall MAE value for the eight-sub-range sub-models (0.429 µg/m3) was comparable to the overall MAE value for the four-sub-range sub-models (0.428 µg/m3). In the case of RVS models, a decrease in the overall values of MAE and RMSE errors was always observed with an increase in the number of sub-ranges. In the Zloty Potok station, for the real concentrations in sub-ranges with a range of 0.0-0.0 µg/m3 and 1.0-1.0 µg/m3, RVS sub-models could not be created due to the lack of variability in these concentration ranges. Therefore, the overall error for this group of sub-models was not estimated. Figure 9, Figure 10, Figure 11 and Figure 12 show the overall values of MAE and RMSE graphically.

3.3. The Results of the Modeling of NO2 Concentrations

Table 11, Table 12, Table 13 and Table 14 show the results of NO2 concentration predictions obtained with the full-range and sub-range models. The errors of the PVS models for the Zabrze monitoring station are presented in Table 11 and the Złoty Potok monitoring station in Table 12. The corresponding errors for the RVS models are presented in Table 13 and Table 14. The division into sub-ranges generally improved the accuracy of prediction, especially in the case of RVS models. In general, as the number of sub-models increased, the overall measures of modeling error decreased. The exceptions were the eight-sub-range PVS models for both monitoring stations. The overall MAE and RMSE values for the eight-sub-range sub-models were higher than the overall MAE and RMSE values for the four-sub-range sub-models. In the case of RVS models, a decrease in the overall values of MAE and RMSE errors was always observed with an increase in the number of sub-ranges. In the case of PVS models, it appeared that having too many sub-ranges and sub-models could degrade the quality of the modeling. Figure 13, Figure 14, Figure 15 and Figure 16 show the overall values of MAE and RMSE graphically.

3.4. The Results of the Modeling of SO2 Concentrations

Table 15, Table 16, Table 17 and Table 18 show the results of SO2 concentration predictions obtained with the full-range and sub-range models. The errors of the PVS models for the Zabrze monitoring station are presented in Table 15 and the Złoty Potok monitoring station in Table 16. The corresponding errors for the RVS models are presented in Table 17 and Table 18. The division into sub-ranges generally improved the accuracy of prediction, especially in the case of RVS models. In general, as the number of sub-models increased, the overall measures of modeling error decreased. The exceptions were the four-sub-range and eight-sub-range PVS sub-models for the monitoring station at Zabrze.
At this station, the overall MAE values (5.18 µg/m3 for the eight-sub-range sub-models, and 5.17 µg/m3 for the four-sub-range sub-models) were comparable to the overall MAE value for the two-sub-range sub-models (5.17 µg/m3). In the case of RVS models, a decrease in the overall values of MAE and RMSE was always observed with an increase in the number of sub-ranges. In the case of PVS models, it appeared that having too many sub-ranges and sub-models could degrade the quality of the modeling. Figure 17, Figure 18, Figure 19 and Figure 20 show the overall values of MAE and RMSE graphically.

3.5. The Results of the Modeling of PM10 Concentrations

Table 19, Table 20, Table 21 and Table 22 show the results of PM10 concentration predictions obtained in the full-range and sub-range models. The errors of the PVS models for the Zabrze monitoring station are presented in Table 19 and the Złoty Potok monitoring station in Table 20. The corresponding errors for the RVS models are presented in Table 21 and Table 22. The division into sub-ranges generally improved the accuracy of prediction, especially in the case of RVS models. In general, as the number of sub-models increased, the overall measures of modeling error decreased. The exceptions were the eight-sub-range PVS models for the monitoring station at Złoty Potok. The overall MAE and RMSE values for the eight-sub-range models were higher than the overall MAE and RMSE values for the four-sub-range models and even the two-sub-range models. In the case of RVS models, a decrease in the overall values of MAE and RMSE was always observed with an increase in the number of sub-ranges. In the case of PVS models, it appeared that having too many sub-ranges and submodels could degrade the quality of the modeling. Figure 21, Figure 22, Figure 23 and Figure 24 show the overall values of MAE and RMSE graphically.

3.6. The Results of the Modeling of CO Concentrations

CO concentrations were not monitored at the Złoty Potok station, so the analysis was carried out only using monitoring data from Zabrze.
Table 23 and Table 24 show the results of PM10 concentration predictions obtained with the full-range and sub-range models. The errors in the PVS models for the Zabrze monitoring station are presented in Table 23. The errors in the RVS models are presented in Table 24. The division into sub-ranges improved the accuracy of prediction in the case of RVS models. In general, as the number of sub-models increased, the overall measures of modeling error decreased. The PVS models showed slight changes in accuracy. The MAE level did not change much. The overall MAE value for the eight-sub-range models was higher than the overall MAE value for the four-sub-range models and equal to the MAE value for the two-sub-range models. In the case of RVS models, a decrease in the overall values of MAE and RMSE was always observed with an increase in the number of sub-ranges. In the case of PVS models, it appeared that having too many sub-ranges and sub-models could degrade the quality of the modeling. Figure 25 and Figure 26 show the overall values of MAE and RMSE graphically.

4. Summary and Discussion

Table 25 shows the percentage changes in the overall values of MAE and RMSE obtained by modeling the concentrations in sub-ranges, calculated in relation to the error values of the corresponding full-range models.
In the case of RVS models, each division into narrower concentration sub-ranges and the development of appropriate sub-range models resulted in a significant reduction in the overall value of the modeling error. The division into sub-models always improved the accuracy of predictions in the case of RVS models. When the number of sub-models increased, the overall measures of modeling error decreased. A significant improvement in the quality of modeling was achieved at both air monitoring stations. Modeling errors could be reduced by more than 60% using eight sub-models. However, it should be emphasized that the RVS models are not of great practical importance because their use is related to knowledge of the real concentrations of the pollutants. Moreover, once the concentration values are known, there is no need to perform modeling. The importance of the RVS models was that they allowed us to assess the potential for improving the quality of the modeling.
The division into sub-models generally improved the accuracy of the PVS models; however, the decrease in modeling error was not as great as in the RVS models. Moreover, quite often, after splitting the full-range set into eight sub-ranges and running eight sub-models, the MAE and RMSE values could be higher than in the sub-models created after division into only four sub-ranges. Such an effect was found for O3, NO, NO2 and PM10 in Złoty Potok, and NO2, SO2 and CO in Zabrze. The probable cause of the deterioration in the quality of prediction in some eight sub-range PVS models was the error in the classification of cases into individual sub-ranges. The classification was made on the basis of the predicted concentration values obtained as a result of the preliminary prediction. With an increasing number of sub-ranges, the sub-ranges of concentrations became narrower and the number of misclassified cases also increased. The number of misclassified cases became so large that it increased the mean prediction error in the sub-range.
The PVS models always showed a lower accuracy than the RVS models. This is understandable, as the PVS models required a preliminary prediction, which introduced an additional error. In conclusion, having too many sub-ranges and sub-models can degrade the quality of modeling for the PVS models.
After division into sub-ranges (two sub-ranges, four sub-ranges, eight sub-ranges), the error in the models for the highest sub-ranges was always very large compared to the error in the models for the lower sub-ranges. There are two reasons for this effect. The first is that the width of the highest sub-range was always greater than that of the models of the lower sub-ranges. The second reason was the need to predict extremely high concentrations. The approximation of such unusual concentrations is always burdened with a higher error.

5. Conclusions

Monitoring data are often not complete enough to carry out an air quality assessment. To fill the measurement gaps, predictive models are used. Such models often use archival measurement data from air monitoring systems. This is the best source of knowledge about the relationships between measured variables (concentrations and meteorological parameters). There is a need to model the missing concentrations as accurately as possible. The use of artificial neural networks reduces the prediction error compared to classical regression methods. In previous studies, a single regression model over the entire concentration range was used to approximate the concentrations of a selected pollutant. In this study, it was assumed that not a single model, but a group of models, could be used for the prediction. In this approach, each model from the group is dedicated to a different sub-range of the concentration of a modeled pollutant. The aim of this analysis was to check whether this approach would improve the quality of modeling.
The aim of the analysis was not to create the most up-to-date models based on possible new data. Once trained, a model using historical data (e.g., from 2011 to 2016) should also be able to predict concentrations for current data. This feature of the model’s “generalization of acquired knowledge” was tested during the learning process on the cases from the test subset, and also on the cases from the validation subset after the network training was completed. The performed validation should show that the model can approximate the target value using data independent of the learning process.
Air monitoring data from the period 2011–2016 allowed us to verify the possibility of improving the accuracy of modeling by carrying out modeling in subsets. A similar analysis can be carried out using data from other monitoring stations, or data from Zabrze and Złoty Potok stations from a different period. However, the selected set of cases should cover a measurement period of several years, so that the recorded cases correspond to different meteorological situations and different ranges of concentrations of monitored pollutants.
The most important conclusions that resulted from the conducted analysis are as follows:
  • Modeling segmentation, consisting in prediction in the sub-ranges of concentrations of the modeled pollutant, allowed for a higher overall modeling accuracy.
  • For RVS models, segmentation of the modeling process guarantees a significant increase in modeling accuracy compared to a model based on the full-range of concentrations.
  • In the case of PVS models, segmentation of the modeling process allows to reduce overall prediction errors. However, the number of concentration sub-ranges cannot be too large. When predicting with 8 submodels, the modeling accuracy may be lower than when predicting with 4 submodels.
The authors are aware that the article has its shortcomings and limitations. The analysis was carried out using historical data from only two air monitoring stations. Stations from the same region of Poland were selected. We are convinced that a similar analysis of completely different data, e.g., from other regions of Poland or other countries/continents, would confirm the formulated conclusions. It can be assumed with high probability that the concept is valid and should yield similar results for other air monitoring data sets. The effect was confirmed for six air pollutants. Not all possible air pollutants, e.g., PM2.5, were included. The authors are convinced that a similar effect could be obtained for other monitored pollutants.

Author Contributions

Conceptualization, S.H.; methodology, S.H. and M.F.; software, M.F.; validation, M.F.; formal analysis, M.F.; investigation, S.H. and M.F.; resources, M.F.; data curation, M.F and R.J.; writing—original draft preparation, S.H. and M.F.; writing—review and editing, S.H; visualization, S.H. and M.F.; supervision, S.H. and R.J.; project administration, S.H.; funding acquisition, S.H. All authors have read and agreed to the published version of the manuscript.

Funding

The research was funded by the statute subvention of the Czestochowa University of Technology, Faculty of Infrastructure and Environment, BS/PB-400-301/22.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. World Health Organization. 7 Million Premature Deaths Annually Linked to Air Pollution. Available online: www.who.int/mediacentre/news/releases/2014/air-pollution/en (accessed on 29 April 2021).
  2. European Environment Agency. Air Quality in Europe-2020 Report. No. 12/2018; Publications Office of the European Union: Luxembourg, 2020. [Google Scholar]
  3. Widziewicz, Y.; Rogula-Kozlowska, W.; Loska, K.; Kociszewska, K.; Majewski, G. Health Risk Impacts of Exposure to Airborne Metals and Benzo(a)Pyrene during Episodes of High PM10 Concentrations in Poland. Biomed. Environ. Sci. 2018, 31, 23–36. [Google Scholar] [CrossRef] [PubMed]
  4. Trojanowska, M.; Świetlik, R. Heavy metals cadmium, nickel and arsenic environmental inhalation hazard of residents of Polish cities. Environ. Med. 2012, 15, 33–41. [Google Scholar]
  5. Gurjar, B.R.; Molina, L.T.; Ojha, C.S.P. Air Pollution: Health and Environmental Impacts; CRC Press: Boca Raton, FL, USA, 2010. [Google Scholar] [CrossRef]
  6. Brook, R.D.; Rajagopalan, S.; Pope, C.A., 3rd; Brook, J.R.; Bhatnagar, A.; Diez-Roux, A.V.; Holguin, F.; Hong, Y.; Luepker, R.V.; Mittleman, M.A.; et al. Particulate matter air pollution and cardiovascular disease: An update to the scientific statement from the American Heart Association. Circulation 2010, 121, 2331–2378. [Google Scholar] [CrossRef] [Green Version]
  7. Maesano, I. The Air of Europe: Where Are We Going? Eur. Respir. Rev. 2017, 26, 170024. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. Tiotiu, A.I.; Novakova, P.; Nedeva, D.; Chong-Neto, H.J.; Novakova, S.; Steiropoulos, P.; Kowal, K. Impact of Air Pollution on Asthma Outcomes. Int. J. Environ. Res. Public Health 2020, 17, 6212. [Google Scholar] [CrossRef]
  9. Peterson, B.S.; Rauh, V.A.; Bansal, R.; Hao, X.; Toth, Z.; Nati, G.; Walsh, K.; Miller, R.L.; Arias, F.; Semanek, D.; et al. Effects of Prenatal Exposure to Air Pollutants (Polycyclic Aromatic Hydrocarbons) on the Development of Brain White Matter, Cognition, and Behavior in Later Childhood. JAMA Psychiatry 2015, 72, 531–540. [Google Scholar] [CrossRef] [Green Version]
  10. Kim, Y.; Manley, J.; Radoias, V. Air Pollution and Long Term Mental Health. Atmosphere 2020, 11, 1355. [Google Scholar] [CrossRef]
  11. Chang, T.; Zivin, J.G.; Gross, T.; Neidell, M. Particulate Pollution and the Productivity of Pear Packers. Am. Econ. J. Econ. Policy 2016, 8, 141–169. [Google Scholar] [CrossRef] [Green Version]
  12. Graff-Zivin, J.; Neidell, M. The Impact of Pollution on Worker Productivity. Am. Econ. Rev. 2012, 102, 3652–3673. [Google Scholar] [CrossRef] [Green Version]
  13. Hanna, R.; Oliva, P. The Effect of Pollution on Labor Supply: Evidence from a Natural Experiment in Mexico City. J. Public Econ. 2015, 122, 68–79. [Google Scholar] [CrossRef]
  14. Aragon, F.; Miranda, J.; Oliva, P. Particulate Matter and Labor Supply: The Role of Caregiving and Non-linearities. J. Environ. Econ. Manag. 2017, 86, 295–309. [Google Scholar] [CrossRef] [Green Version]
  15. Pandya, S.; Gadekallu, T.R.; Maddikunta, P.K.R.; Sharma, R. A Study of the Impacts of Air Pollution on the Agricultural Community and Yield Crops (Indian Context). Sustainability 2022, 14, 13098. [Google Scholar] [CrossRef]
  16. Wei, W.; Wang, Z. Impact of Industrial Air Pollution on Agricultural Production. Atmosphere 2021, 12, 639. [Google Scholar] [CrossRef]
  17. Vallero, D.A. Fundamentals of Air Pollution; Academic Press: London, UK, 2014. [Google Scholar]
  18. EU. Directive 2008/50/EC of the European Parliament and of the Council of 21 May 2008 on Ambient Air Quality and Cleaner air for Europe (OJ L 152, 11.6.2008, pp. 1–44), Current Consolidated Version: 18 September 2015. Available online: https://eur-lex.europa.eu/eli/dir/2008/50/oj (accessed on 20 December 2021).
  19. EN 14625:2012; Ambient Air—Standard Method for the Measurement of the Concentration of Ozone by Ultraviolet Photometry. European Committee for Standardization: Brussels, Belgium, 2012.
  20. EN 14211:2012; Ambient Air—Standard Method for the Measurement of the Concentration of Nitrogen Dioxide and Nitrogen Monoxide by Chemiluminescence. European Committee for Standardization: Brussels, Belgium, 2012.
  21. EN 14212:2012; Ambient Air—Standard Method for the Measurement of the Concentration of Sulphur Dioxide by Ultraviolet Fluorescence. European Committee for Standardization: Brussels, Belgium, 2012.
  22. EN 14626:2012; Ambient Air—Standard Method for the Measurement of the Concentration of Carbon Monoxide by Non dispersive Infrared Spectroscopy. European Committee for Standardization: Brussels, Belgium, 2012.
  23. EN 12341:2014; Ambient Air—Standard Gravimetric Measurement Method for the Determination of the PM10 or PM2.5 Mass Concentration of Suspended Particulate Matter. European Committee for Standardization: Brussels, Belgium, 2014.
  24. Plaia, A.; Bondi, A.L. Single Imputation Method of Missing Values in Environmental Pollution Data Sets. Atmos. Environ. 2006, 40, 7316–7330. [Google Scholar] [CrossRef]
  25. Gentili, S.; Magnaterra, L.; Passerini, G. An introduction to the statistical filling of environmental data time series. In Handling Missing Data: Applications to Environmental Analysis; Latini, G., Passerini, G., Eds.; Wit Press: Southampton, UK, 2004. [Google Scholar]
  26. Hoffman, S. Environmental Engineering; Pawłowski, L., Dudzińska, M.R., Pawłowski, A., Eds.; Taylor & Francis Group: London, UK, 2007; pp. 349–353. [Google Scholar]
  27. Hoffman, S. Approximation of Imission Level at Air Monitoring Stations by Means of Autonomous Neural Models. Environ. Prot. Eng. 2012, 38, 109–119. [Google Scholar] [CrossRef]
  28. Milionis, A.E.; Davies, T.D. Regression and Stochastic Models for Air Pollution-I. Review, Comments and Suggestions. Atmos. Environ. 1994, 28, 2801–2810. [Google Scholar] [CrossRef]
  29. Gardner, M.W.; Dorling, S.R. Artificial Neural Networks (the Multilayer Perceptron)—A Review of Applications in the Atmospheric Sciences. Atmos. Environ. 1998, 32, 2627–2636. [Google Scholar] [CrossRef]
  30. Zhang, B.; Rong, Y.; Yong, R.; Qin, D.; Li, M.; Zou, G.; Pan, J. Deep learning for air pollutant concentration prediction: A review. Atmos. Environ. 2022, 290, 119347. [Google Scholar] [CrossRef]
  31. Dorling, S.R.; Gardner, M.W. Statistical Surface Ozone Models: An Improved Methodology to Account for Non-linear Behaviour. Atmos. Environ. 2000, 34, 21–34. [Google Scholar]
  32. Karppinen, A.; Kukkonen, J.; Elolähde, T.; Konttinen, M.; Koskentalo, T.; Rantakrans, E. A Modelling System for Predicting Urban Air Pollution: Comparison of Model Predictions with the Data of an Urban Measurement Network in Helsinki. Atmos. Environ. 2000, 34, 3735–3743. [Google Scholar] [CrossRef]
  33. Nagendra, S.S.M.; Khare, M. Modelling Urban Air Quality Using Artificial Neural Network. Clean. Technol. Environ. Policy 2005, 7, 116–126. [Google Scholar] [CrossRef]
  34. Maleki, H.; Sorooshian, A.; Goudarzi, G.; Baboli, Z.; Birgani, Y.T.; Rahmati, M. Air pollution prediction by using an artificial neural network model. Clean. Technol. Environ. Policy 2019, 21, 1341–1352. [Google Scholar] [CrossRef] [PubMed]
  35. Fallahizadeh, S.; Kermani, M.; Esrafili, A.; Asadgol, Z.; Gholami, M. The effects of meteorological parameters on PM10: Health impacts assessment using AirQ+ model and prediction by an artificial neural network (ANN). Urban Clim. 2021, 38, 100905. [Google Scholar] [CrossRef]
  36. Shams, S.R.; Jahani, A.; Kalantary, S.; Moeinaddini, M.; Khorasani, N. The evaluation on artificial neural networks (ANN) and multiple linear regressions (MLR) models for predicting SO2 concentration. Urban Clim. 2021, 37, 100837. [Google Scholar] [CrossRef]
  37. Lin, W.C.; Tsai, C.F. Missing value imputation: A review and analysis of the literature (2006–2017). Artif. Intell. Rev. 2020, 53, 1487–1509. [Google Scholar] [CrossRef]
  38. Hoffman, S. Assessment of Prediction Accuracy in Autonomous Air Quality Models. Desalination Water Treat. 2015, 57, 1322–1326. [Google Scholar] [CrossRef]
  39. Hoffman, S. Estimation of Prediction Error in Regression Air Quality Models. Energies 2021, 14, 7387. [Google Scholar] [CrossRef]
  40. Fletcher, R. Practical Methods of Optimization, 2nd ed.; John Wiley & Sons: New York, NY, USA, 2000. [Google Scholar]
  41. Broyden, C.G. The convergence of a class of double-rank minimization algorithms. J. Inst. Math. Its Appl. 1970, 6, 76–90. [Google Scholar] [CrossRef] [Green Version]
  42. Fletcher, R. A New Approach to Variable Metric Algorithms. Comput. J. 1970, 13, 317–322. [Google Scholar] [CrossRef] [Green Version]
  43. Goldfarb, D. A Family of Variable Metric Updates Derived by Variational Means. Math. Comput. 1970, 24, 23–26. [Google Scholar] [CrossRef]
  44. Shanno, D.F. Conditioning of quasi-Newton methods for function minimization. Math. Comput. 1970, 24, 647–656. [Google Scholar] [CrossRef]
Figure 1. Locations of the air monitoring stations Zabrze and Złoty Potok in Poland.
Figure 1. Locations of the air monitoring stations Zabrze and Złoty Potok in Poland.
Ijerph 19 16494 g001
Figure 2. An architecture diagram of the multi-layer perceptron with ten neurons in a single hidden layer.
Figure 2. An architecture diagram of the multi-layer perceptron with ten neurons in a single hidden layer.
Ijerph 19 16494 g002
Figure 3. The scheme of the division into sub-ranges for the Zabrze station.
Figure 3. The scheme of the division into sub-ranges for the Zabrze station.
Ijerph 19 16494 g003
Figure 4. The scheme of the division into sub-ranges for the Złoty Potok station.
Figure 4. The scheme of the division into sub-ranges for the Złoty Potok station.
Ijerph 19 16494 g004
Figure 5. Overall MAE and RMSE values for O3 concentration prediction in PVS models depending on the number of created sub-models, Zabrze.
Figure 5. Overall MAE and RMSE values for O3 concentration prediction in PVS models depending on the number of created sub-models, Zabrze.
Ijerph 19 16494 g005
Figure 6. Overall MAE and RMSE values for O3 concentration prediction in PVS models depending on the number of created sub-models, Złoty Potok.
Figure 6. Overall MAE and RMSE values for O3 concentration prediction in PVS models depending on the number of created sub-models, Złoty Potok.
Ijerph 19 16494 g006
Figure 7. Overall MAE and RMSE values for O3 concentration prediction in RVS models depending on the number of created sub-models, Zabrze.
Figure 7. Overall MAE and RMSE values for O3 concentration prediction in RVS models depending on the number of created sub-models, Zabrze.
Ijerph 19 16494 g007
Figure 8. Overall MAE and RMSE values for O3 concentration prediction in RVS models depending on the number of created sub-models, Złoty Potok.
Figure 8. Overall MAE and RMSE values for O3 concentration prediction in RVS models depending on the number of created sub-models, Złoty Potok.
Ijerph 19 16494 g008
Figure 9. Overall MAE and RMSE values for NO concentration prediction in PVS models depending on the number of created sub-models, Zabrze.
Figure 9. Overall MAE and RMSE values for NO concentration prediction in PVS models depending on the number of created sub-models, Zabrze.
Ijerph 19 16494 g009
Figure 10. Overall MAE and RMSE values for NO concentration prediction in PVS models depending on the number of created sub-models, Złoty Potok.
Figure 10. Overall MAE and RMSE values for NO concentration prediction in PVS models depending on the number of created sub-models, Złoty Potok.
Ijerph 19 16494 g010
Figure 11. Overall MAE and RMSE values for NO concentration prediction in RVS models depending on the number of created sub-models, Zabrze.
Figure 11. Overall MAE and RMSE values for NO concentration prediction in RVS models depending on the number of created sub-models, Zabrze.
Ijerph 19 16494 g011
Figure 12. Overall MAE and RMSE values for NO concentration prediction in RVS models depending on the number of created sub-models, Złoty Potok.
Figure 12. Overall MAE and RMSE values for NO concentration prediction in RVS models depending on the number of created sub-models, Złoty Potok.
Ijerph 19 16494 g012
Figure 13. Overall MAE and RMSE values for NO2 concentration prediction in PVS models depending on the number of created sub-models, Zabrze.
Figure 13. Overall MAE and RMSE values for NO2 concentration prediction in PVS models depending on the number of created sub-models, Zabrze.
Ijerph 19 16494 g013
Figure 14. Overall MAE and RMSE values for NO2 concentration prediction in PVS models depending on the number of created sub-models, Złoty Potok.
Figure 14. Overall MAE and RMSE values for NO2 concentration prediction in PVS models depending on the number of created sub-models, Złoty Potok.
Ijerph 19 16494 g014
Figure 15. Overall MAE and RMSE values for NO2 concentration prediction in RVS models depending on the number of created sub-models, Zabrze.
Figure 15. Overall MAE and RMSE values for NO2 concentration prediction in RVS models depending on the number of created sub-models, Zabrze.
Ijerph 19 16494 g015
Figure 16. Overall MAE and RMSE values for NO2 concentration prediction in RVS models depending on the number of created sub-models, Złoty Potok.
Figure 16. Overall MAE and RMSE values for NO2 concentration prediction in RVS models depending on the number of created sub-models, Złoty Potok.
Ijerph 19 16494 g016
Figure 17. Overall MAE and RMSE values for SO2 concentration prediction in PVS models depending on the number of created sub-models, Zabrze.
Figure 17. Overall MAE and RMSE values for SO2 concentration prediction in PVS models depending on the number of created sub-models, Zabrze.
Ijerph 19 16494 g017
Figure 18. Overall MAE and RMSE values for SO2 concentration prediction in PVS models depending on the number of created sub-models, Złoty Potok.
Figure 18. Overall MAE and RMSE values for SO2 concentration prediction in PVS models depending on the number of created sub-models, Złoty Potok.
Ijerph 19 16494 g018
Figure 19. Overall MAE and RMSE values for SO2 concentration prediction in RVS models depending on the number of created sub-models, Zabrze.
Figure 19. Overall MAE and RMSE values for SO2 concentration prediction in RVS models depending on the number of created sub-models, Zabrze.
Ijerph 19 16494 g019
Figure 20. Overall MAE and RMSE values for SO2 concentration prediction in RVS models depending on the number of created sub-models, Złoty Potok.
Figure 20. Overall MAE and RMSE values for SO2 concentration prediction in RVS models depending on the number of created sub-models, Złoty Potok.
Ijerph 19 16494 g020
Figure 21. Overall MAE and RMSE values for PM10 concentration prediction in PVS models depending on the number of created sub-models, Zabrze.
Figure 21. Overall MAE and RMSE values for PM10 concentration prediction in PVS models depending on the number of created sub-models, Zabrze.
Ijerph 19 16494 g021
Figure 22. Overall MAE and RMSE values for PM10 concentration prediction in PVS models depending on the number of created sub-models, Złoty Potok.
Figure 22. Overall MAE and RMSE values for PM10 concentration prediction in PVS models depending on the number of created sub-models, Złoty Potok.
Ijerph 19 16494 g022
Figure 23. Overall MAE and RMSE values for PM10 concentration prediction in RVS models depending on the number of created sub-models, Zabrze.
Figure 23. Overall MAE and RMSE values for PM10 concentration prediction in RVS models depending on the number of created sub-models, Zabrze.
Ijerph 19 16494 g023
Figure 24. Overall MAE and RMSE values for PM10 concentration prediction in RVS models depending on the number of created sub-models, Złoty Potok.
Figure 24. Overall MAE and RMSE values for PM10 concentration prediction in RVS models depending on the number of created sub-models, Złoty Potok.
Ijerph 19 16494 g024
Figure 25. Overall MAE and RMSE values for CO concentration prediction in PVS models depending on the number of created sub-models, Zabrze.
Figure 25. Overall MAE and RMSE values for CO concentration prediction in PVS models depending on the number of created sub-models, Zabrze.
Ijerph 19 16494 g025
Figure 26. Overall MAE and RMSE values for CO concentration prediction in RVS models depending on the number of created sub-models, Zabrze.
Figure 26. Overall MAE and RMSE values for CO concentration prediction in RVS models depending on the number of created sub-models, Zabrze.
Ijerph 19 16494 g026
Table 1. Descriptive statistics of the set of hourly concentrations of monitored air pollutants, 2011–2016.
Table 1. Descriptive statistics of the set of hourly concentrations of monitored air pollutants, 2011–2016.
ZabrzeZłoty Potok
Descriptive StatisticsO3
µg/m3
NO
µg/m3
NO2
µg/m3
SO2
µg/m3
PM10
µg/m3
CO
mg/m3
O3
µg/m3
NO
µg/m3
NO2
µg/m3
SO2
µg/m3
PM10
µg/m3
CO
mg/m3
Minimum0.70.01.00.10.10.071.10.00.00.01.0-
Maximum198.0709.0160.0362.01187.09.02212.650.082.0116.0298.0-
Mean42.310.224.416.851.00.6059.51.19.67.427.8-
Median37.03.020.010.033.80.4256.00.977.04.622.3-
Standard deviation32.424.216.819.461.10.5931.01.87.98.120.7-
Completeness94.278.894.591.080.294.292.387.187.188.595.7-
Table 2. Classification of predictors for individual models at both air monitoring stations. A + symbol means the variable was treated as an input in the model. A − symbol means no measurement.
Table 2. Classification of predictors for individual models at both air monitoring stations. A + symbol means the variable was treated as an input in the model. A − symbol means no measurement.
LocalityExplained VariableExplanatory Variables (Predictors)
HDO3NONO2SO2COPM10WSTIRH
ZabrzeO3++ ++++++++
NO+++ +++++++
NO2++++ ++++++
SO2+++++ +++++
CO++++++ ++++
PM10+++++++ +++
Złoty PotokO3++ +++++++++
NO+++ ++++++++
NO2++++ +++++++
SO2+++++ ++++++
CO
PM10+++++++ ++++
Table 3. Values of approximation errors of O3 concentrations in PVS models, Zabrze 2011–2016.
Table 3. Values of approximation errors of O3 concentrations in PVS models, Zabrze 2011–2016.
Number of Sub-RangesRegression ModelSub-Ranges of O3 Concentrations, µg/m3Number of ObservationsMAE, µg/m3Overall MAE, µg/m3RMSE, µg/m3Overall RMSE, µg/m3
1PVS-1/1-ZAB1.0–165.036,4608.48.411.311.3
2PVS-1/2-ZAB1.0–37.418,2306.18.258.510.94
PVS-2/2-ZAB37.4–165.018,23010.413.4
4PVS-1/4-ZAB1.0–17.791153.58.025.010.43
PVS-2/4-ZAB17.7–37.491158.4510.8
PVS-3/4-ZAB37.4–59.991159.8412.7
PVS-4/4-ZAB59.9–165.0911510.313.2
8PVS-1/8-ZAB1.0–8.645581.988.022.810.34
PVS-2/8-ZAB8.6–17.745584.816.3
PVS-3/8-ZAB17.7–27.545587.479.6
PVS-4/8-ZAB27.5–37.445589.411.9
PVS-5/8-ZAB37.4–47.645589.9112.7
PVS-6/8-ZAB47.6–59.945589.9812.9
PVS-7/8-ZAB59.9–79.545589.7812.7
PVS-8/8-ZAB79.5–165.0455810.8113.7
Table 4. Values of approximation errors of O3 concentrations in PVS models, Złoty Potok 2011–2016.
Table 4. Values of approximation errors of O3 concentrations in PVS models, Złoty Potok 2011–2016.
Number of Sub-RangesRegression ModelSub-Ranges of O3 Concentrations, µg/m3Number of ObservationsMAE, µg/m3Overall MAE, µg/m3RMSE, µg/m3Overall RMSE, µg/m3
1PVS-1/1-ZP1.6–147.515,5368.418.4110.710.7
2PVS-1/2-ZP1.6–50.377687.667.919.910.09
PVS-2/2-ZP50.3–147.577688.1610.3
4PVS-1/4-ZP1.6–34.838846.227.558.29.68
PVS-2/4-ZP34.8–50.338848.5210.9
PVS-3/4-ZP50.3–68.738847.8110.0
PVS-4/4-ZP68.7–147.538847.649.6
8PVS-1/8-ZP1.6–25.319425.467.677.29.84
PVS-2/8-ZP25.3–34.819428.2610.6
PVS-3/8-ZP34.8–42.919428.5710.9
PVS-4/8-ZP42.9–50.319428.6911.3
PVS-5/8-ZP50.3–58.419427.8610.1
PVS-6/8-ZP58.4–68.719427.579.6
PVS-7/8-ZP68.7–89.919427.609.7
PVS-8/8-ZP90.0–147.519427.369.3
Table 5. Values of approximation errors of O3 concentrations in RVS models, Zabrze 2011–2016.
Table 5. Values of approximation errors of O3 concentrations in RVS models, Zabrze 2011–2016.
Number of Sub-RangesRegression ModelSub-Ranges of O3 Concentrations, µg/m3Number of ObservationsMAE, µg/m3Overall MAE, µg/m3RMSE, µg/m3Overall RMSE, µg/m3
1RVS-1/1-ZAB0.7–198.036,4608.48.411.311.3
2RVS-1/2-ZAB0.7–37.018,2304.86.86.38.9
RVS-2/2-ZAB37–198.018,2308.811.4
4RVS-1/4-ZAB0.7–15.091152.25.12.86.4
RVS-2/4-ZAB15.0–37.091154.65.6
RVS-3/4-ZAB37.0–62.091155.16.1
RVS-4/4-ZAB62.0–198.091158.611.1
8RVS-1/8-ZAB0.7–6.045581.03.31.24.0
RVS-2/8-ZAB6.0–15.045581.82.2
RVS-3/8-ZAB15.0–26.045582.63.0
RVS-4/8-ZAB26.0–37.045582.83.2
RVS-5/8-ZAB37.0–49.045582.83.3
RVS-6/8-ZAB49.0–62.045583.13.6
RVS-7/8-ZAB62.0–82.045584.25.1
RVS-8/8-ZAB82.0–198.045588.310.8
Table 6. Values of approximation errors of O3 concentrations in RVS models, Złoty Potok 2011–2016.
Table 6. Values of approximation errors of O3 concentrations in RVS models, Złoty Potok 2011–2016.
Number of Sub-RangesRegression ModelSub-ranges of O3 Concentrations, µg/m3Number of ObservationsMAE, µg/m3Overall MAE, µg/m3RMSE, µg/m3Overall RMSE, µg/m3
1RVS-1/1-ZP1.1–16215,5368.388.3810.710.7
2RVS-1/2-ZP1.1–51.077685.976.517.68.25
RVS-2/2-ZP51.0–162.077687.058.9
4RVS-1/4-ZP1.1–32.338844.064.675.15.84
RVS-2/4-ZP32.3–51.038843.854.7
RVS-3/4-ZP51.0–72.038844.205.1
RVS-4/4-ZP72.0–162.038846.568.4
8RVS-1/8-ZP1.1–21.119422.983.063.63.74
RVS-2/8-ZP21.1–32.319422.483.0
RVS-3/8-ZP32.3–42.019422.272.7
RVS-4/8-ZP42.0–51.019422.172.6
RVS-5/8-ZP51.0–60.419422.282.6
RVS-6/8-ZP60.4–72.019422.563.1
RVS-7/8-ZP72.0–90.319423.684.5
RVS-8/8-ZP90.4–162.019426.067.9
Table 7. Values of approximation errors of NO concentrations in PVS models, Zabrze 2011–2016.
Table 7. Values of approximation errors of NO concentrations in PVS models, Zabrze 2011–2016.
Number of Sub-RangesRegression ModelSub-Ranges of NO Concentrations, µg/m3Number of ObservationsMAE, µg/m3Overall MAE, µg/m3RMSE, µg/m3Overall RMSE, µg/m3
1PVS-1/1-ZAB0.0–709.036,4603.743.748.268.26
2PVS-1/2-ZAB0.0–3.018,2300.523.370.666.18
PVS-2/2-ZAB3.0–709.018,2306.2211.70
4PVS-1/4-ZAB0.0–1.191150.232.760.324.32
PVS-2/4-ZAB1.1–3.091150.340.42
PVS-3/4-ZAB3.0–8.091150.941.17
PVS-4/4-ZAB8.0–709.091159.5515.37
8PVS-1/8-ZAB0.0–1.045580.232.220.323.14
PVS-2/8-ZAB1.0–1.145580.010.01
PVS-3/8-ZAB1.1–2.045580.150.21
PVS-4/8-ZAB2.0–3.045580.190.26
PVS-5/8-ZAB3.0–4.745580.390.47
PVS-6/8-ZAB4.7–8.045580.770.92
PVS-7/8-ZAB8.0–19.345582.162.64
PVS-8/8-ZAB19.4–709.0455813.8620.25
Table 8. Values of approximation errors of NO concentrations in PVS models, Złoty Potok 2011–2016.
Table 8. Values of approximation errors of NO concentrations in PVS models, Złoty Potok 2011–2016.
Number of Sub-RangesRegression ModelSub-Ranges of NO Concentrations, µg/m3Number of ObservationsMAE, µg/m3Overall MAE, µg/m3RMSE, µg/m3Overall RMSE, µg/m3
1PVS-1/1-ZP0.1–49.815,5360.4590.4590.7890.789
2PVS-1/2-ZP0.1–0.677680.3240.4300.4250.702
PVS-2/2-ZP0.6–49.877680.5350.979
4PVS-1/4-ZP0.1–0.438840.3010.4280.3730.663
PVS-2/4-ZP0.4–0.638840.3190.434
PVS-3/4-ZP0.6–1.138840.3310.501
PVS-4/4-ZP1.1–49.838840.7621.344
8PVS-1/8-ZP0.1–0.319420.2610.4290.3260.634
PVS-2/8-ZP0.3–0.419420.3020.389
PVS-3/8-ZP0.4–0.519420.3110.398
PVS-4/8-ZP0.5–0.619420.3570.497
PVS-5/8-ZP0.6–0.819420.3500.497
PVS-6/8-ZP0.8–1.119420.3160.517
PVS-7/8-ZP1.1–1.819420.3860.586
PVS-8/8-ZP1.8–49.819421.1501.859
Table 9. Values of approximation errors of NO concentrations in RVS models, Zabrze 2011–2016.
Table 9. Values of approximation errors of NO concentrations in RVS models, Zabrze 2011–2016.
Number of Sub-RangesRegression ModelSub-Ranges of NO Concentrations, µg/m3Number of ObservationsMAE, µg/m3Overall MAE, µg/m3RMSE, µg/m3Overall RMSE, µg/m3
1RVS-1/1-ZAB0.0–709.036,4603.743.748.268.26
2RVS-1/2-ZAB0.0–3.018,2300.523.370.666.18
RVS-2/2-ZAB3.0–709.018,2306.2211.70
4RVS-1/4-ZAB0.0–1.191150.232.760.324.32
RVS-2/4-ZAB1.1–3.091150.340.42
RVS-3/4-ZAB3.0–8.091150.941.17
RVS-4/4-ZAB8.0–709.091159.5515.37
8RVS-1/8-ZAB0.0–1.045580.232.220.323.14
RVS-2/8-ZAB1.0–1.145580.010.01
RVS-3/8-ZAB1.1–2.045580.150.21
RVS-4/8-ZAB2.0–3.045580.190.26
RVS-5/8-ZAB3.0–4.745580.390.47
RVS-6/8-ZAB4.7–8.045580.770.92
RVS-7/8-ZAB8.0–19.345582.162.64
RVS-8/8-ZAB19.4–709.0455813.8620.25
Table 10. Values of approximation errors of NO concentrations in RVS models, Złoty Potok 2011–2016.
Table 10. Values of approximation errors of NO concentrations in RVS models, Złoty Potok 2011–2016.
Number of Sub-RangesRegression ModelSub-Ranges of NO Concentrations, µg/m3Number of ObservationsMAE, µg/m3Overall MAE, µg/m3RMSE, µg/m3Overall RMSE, µg/m3
1RVS-1/1-ZP0.0–50.015,5360.4300.4300.7460.746
2RVS-1/2-ZP0.0–0.877680.1840.3100.2340.571
RVS-2/2-ZP0.8–50.077680.4360.908
4RVS-1/4-ZP0.0–0.338840.0540.2290.0880.408
RVS-2/4-ZP0.3–0.838840.0640.081
RVS-3/4-ZP0.8–1.038840.0370.053
RVS-4/4-ZP1.0–50.038840.7601.411
8RVS-1/8-ZP0.0–0.01942----
RVS-2/8-ZP0.0–0.319420.0690.097
RVS-3/8-ZP0.3–0.619420.0480.060
RVS-4/8-ZP0.6–0.819420.0410.049
RVS-5/8-ZP0.8–1.019420.0450.056
RVS-6/8-ZP1.0–1.01942--
RVS-7/8-ZP1.0–2.019420.1480.206
RVS-8/8-ZP2.0–50.019420.9911.752
Table 11. Values of approximation errors of NO2 concentrations in PVS models, Zabrze 2011–2016.
Table 11. Values of approximation errors of NO2 concentrations in PVS models, Zabrze 2011–2016.
Number of Sub-RangesRegression ModelSub-Ranges of NO2 Concentrations, µg/m3Number of ObservationsMAE, µg/m3Overall MAE, µg/m3RMSE, µg/m3Overall RMSE, µg/m3
1PVS-1/1-ZAB3.6–134.436,4605.35.37.47.4
2PVS-1/2-ZAB3.6–20.618,2303.135.214.26.89
PVS-2/2-ZAB20.6–134.418,2307.299.6
4PVS-1/4-ZAB3.6–12.291152.295.143.06.66
PVS-2/4-ZAB12.3–20.691153.855.0
PVS-3/4-ZAB20.6–32.891155.897.6
PVS-4/4-ZAB32.8–134.491158.5411.1
8PVS-1/8-ZAB3.6–9.245581.805.202.46.72
PVS-2/8-ZAB9.2–12.245582.883.7
PVS-3/8-ZAB12.3–1645583.434.4
PVS-4/8-ZAB16–20.645584.335.6
PVS-5/8-ZAB20.6–26.245585.717.3
PVS-6/8-ZAB26.2–32.845586.338.2
PVS-7/8-ZAB32.8–41.745587.659.9
PVS-8/8-ZAB41.7–134.445589.4712.3
Table 12. Values of approximation errors of NO2 concentrations in PVS models, Złoty Potok 2011–2016.
Table 12. Values of approximation errors of NO2 concentrations in PVS models, Złoty Potok 2011–2016.
Number of Sub-RangesRegression ModelSub-Ranges of NO2 Concentrations, µg/m3Number of ObservationsMAE, µg/m3Overall MAE, µg/m3RMSE, µg/m3Overall RMSE, µg/m3
1PVS-1/1-ZP1.6–52.915,5361.9191.9192.652.65
2PVS-1/2-ZP1.6–6.777681.2681.8521.692.463
PVS-2/2-ZP6.7–52.977682.4353.24
4PVS-1/4-ZP1.6–4.738840.9891.6861.332.207
PVS-2/4-ZP4.7–6.738840.9891.33
PVS-3/4-ZP6.7–10.838841.8522.39
PVS-4/4-ZP10.8–52.938842.9153.78
8PVS-1/8-ZP1.6–3.719420.8121.8111.082.342
PVS-2/8-ZP3.7–4.719421.2731.66
PVS-3/8-ZP4.7–5.619421.3661.73
PVS-4/8-ZP5.6–6.719421.6712.17
PVS-5/8-ZP6.7–8.219421.7862.26
PVS-6/8-ZP8.2–10.819421.9722.55
PVS-7/8-ZP10.8–16.019422.3613.06
PVS-8/8-ZP16.1–52.919423.2464.22
Table 13. Values of approximation errors of NO2 concentrations in RVS models, Zabrze 2011–2016.
Table 13. Values of approximation errors of NO2 concentrations in RVS models, Zabrze 2011–2016.
Number of Sub-RangesRegression ModelSub-Ranges of NO2 Concentrations, µg/m3Number of ObservationsMAE, µg/m3Overall MAE, µg/m3RMSE, µg/m3Overall RMSE, µg/m3
1RVS-1/1-ZAB1.2–14536,4605.315.317.497.49
2RVS-1/2-ZAB1.2–20.018,2302.374.422.975.91
RVS-2/2-ZAB20.0–14518,2306.478.84
4RVS-1/4-ZAB1.2–11.591151.563.181.914.10
RVS-2/4-ZAB11.5–20.091151.822.19
RVS-3/4-ZAB20.0–33.091152.733.27
RVS-4/4-ZAB33.0–145.091156.619.03
8RVS-1/8-ZAB1.2–8.045581.032.041.262.57
RVS-2/8-ZAB8.0–11.545580.861.00
RVS-3/8-ZAB11.5–15.245580.931.07
RVS-4/8-ZAB15.2–20.045581.121.32
RVS-5/8-ZAB20.0–26.045581.401.64
RVS-6/8-ZAB26.0–33.045581.681.97
RVS-7/8-ZAB33.0–43.145582.442.87
RVS-8/8-ZAB43.1–145.045586.849.44
Table 14. Values of approximation errors of NO2 concentrations in RVS models, Złoty Potok 2011–2016.
Table 14. Values of approximation errors of NO2 concentrations in RVS models, Złoty Potok 2011–2016.
Number of Sub-RangesRegression ModelSub-Ranges of NO2 Concentrations, µg/m3Number of ObservationsMAE, µg/m3Overall MAE, µg/m3RMSE, µg/m3Overall RMSE, µg/m3
1RVS-1/1-ZP0.4.0–60.315,5361.9191.9192.6722.672
2RVS-1/2-ZP0.4–7.077680.8941.5291.1102.047
RVS-2/2-ZP7.0–60.377682.1642.985
4RVS-1/4-ZP0.4–4.038840.5931.1500.7261.487
RVS-2/4-ZP4.0–7.038840.5860.706
RVS-3/4-ZP7.0–11.138840.8911.088
RVS-4/4-ZP11.1–60.338842.5293.426
8RVS-1/8-ZP0.4–3.019420.4270.7730.5210.968
RVS-2/8-ZP3.0–4.019420.3040.354
RVS-3/8-ZP4.0–5.419420.2830.349
RVS-4/8-ZP5.4–7.019420.3320.408
RVS-5/8-ZP7.0–8.719420.4380.501
RVS-6/8-ZP8.7–11.119420.6310.736
RVS-7/8-ZP11.1–16.619421.1071.332
RVS-8/8-ZP16.6–60.319422.6613.544
Table 15. Values of approximation errors of SO2 concentrations in PVS models, Zabrze 2011–2016.
Table 15. Values of approximation errors of SO2 concentrations in PVS models, Zabrze 2011–2016.
Number of Sub-RangesRegression ModelSub-Ranges of SO2 Concentrations, µg/m3Number of ObservationsMAE, µg/m3Overall MAE, µg/m3RMSE, µg/m3Overall RMSE, µg/m3
1PVS-1/1-ZAB1.3–321.936,4605.255.258.18.1
2PVS-1/2-ZAB1.3–10.618,2302.575.173.77.30
PVS-2/2-ZAB10.6–321.918,2307.7810.9
4PVS-1/4-ZAB1.3–6.491151.815.172.67.15
PVS-2/4-ZAB6.4–10.691153.304.6
PVS-3/4-ZAB10.6–21.391155.888.1
PVS-4/4-ZAB21.3–321.991159.6913.3
8PVS-1/8-ZAB1.3–4.945581.495.182.27.02
PVS-2/8-ZAB4.9–6.445582.193.0
PVS-3/8-ZAB6.4–8.245582.853.9
PVS-4/8-ZAB8.2–10.645583.855.3
PVS-5/8-ZAB10.6–14.445585.467.5
PVS-6/8-ZAB14.4–21.345586.699.1
PVS-7/8-ZAB21.3–34.045587.089.3
PVS-8/8-ZAB34.0–321.9455811.8215.9
Table 16. Values of approximation errors of SO2 concentrations in PVS models, Złoty Potok 2011–2016.
Table 16. Values of approximation errors of SO2 concentrations in PVS models, Złoty Potok 2011–2016.
Number of Sub-RangesRegression ModelSub-Ranges of SO2 Concentrations, µg/m3Number of ObservationsMAE, µg/m3Overall MAE, µg/m3RMSE, µg/m3Overall RMSE, µg/m3
1PVS-1/1-ZP0.8–74.215,5361.9201.9203.1333.133
2PVS-1/2-ZP0.8–4.077680.9351.9171.3022.800
PVS-2/2-ZP4.0–74.277682.8994.298
4PVS-1/4-ZP0.8–2.838840.8231.8761.0992.620
PVS-2/4-ZP2.8–4.038841.1351.597
PVS-3/4-ZP4.0–7.238841.7102.395
PVS-4/4-ZP7.2–74.238843.8365.389
8PVS-1/8-ZP0.8–2.219420.6991.7590.8992.413
PVS-2/8-ZP2.2–2.819420.8551.131
PVS-3/8-ZP2.8–3.319421.0081.379
PVS-4/8-ZP3.3–4.019421.2461.778
PVS-5/8-ZP4.0–5.119420.0570.080
PVS-6/8-ZP5.1–7.219422.0242.813
PVS-7/8-ZP7.2–11.819423.1084.545
PVS-8/8-ZP11.8–74.219425.0726.681
Table 17. Values of approximation errors of SO2 concentrations in RVS models, Zabrze 2011–2016.
Table 17. Values of approximation errors of SO2 concentrations in RVS models, Zabrze 2011–2016.
Number of Sub-RangesRegression ModelSub-Ranges of SO2 Concentrations, µg/m3Number of ObservationsMAE, µg/m3Overall MAE, µg/m3RMSE, µg/m3Overall RMSE, µg/m3
1RVS-1/1-ZAB0.1–36236,4605.265.268.158.15
2RVS-1/2-ZAB0.1–10.018,2301.584.331.956.14
RVS-2/2-ZAB10.0–362.018,2307.0810.33
4RVS-1/4-ZAB0.1–5.091150.853.181.034.30
RVS-2/4-ZAB5.0–1091151.111.32
RVS-3/4-ZAB10.0–22.391152.663.21
RVS-4/4-ZAB22.4–362.091158.1111.64
8RVS-1/8-ZAB0.1–3.245580.552.250.642.98
RVS-2/8-ZAB3.2–5.045580.410.48
RVS-3/8-ZAB5.0–7.045580.480.56
RVS-4/8-ZAB7.0–1045580.730.85
RVS-5/8-ZAB10.0–14.945581.161.36
RVS-6/8-ZAB14.9–22.345581.822.14
RVS-7/8-ZAB22.4–36.045583.023.58
RVS-8/8-ZAB36.0–362.045589.8414.21
Table 18. Values of approximation errors of SO2 concentrations in RVS models, Złoty Potok 2011–2016.
Table 18. Values of approximation errors of SO2 concentrations in RVS models, Złoty Potok 2011–2016.
Number of Sub-RangesRegression ModelSub-Ranges of SO2 Concentrations, µg/m3Number of ObservationsMAE, µg/m3Overall MAE, µg/m3RMSE, µg/m3Overall RMSE, µg/m3
1RVS-1/1-ZP0.0–85.015,5361.961.963.1863.186
2RVS-1/2-ZP0.0–4.077680.631.660.7802.45
RVS-2/2-ZP4.0–85.077682.684.116
4RVS-1/4-ZP0.0–2.138840.381.230.4551.71
RVS-2/4-ZP2.1–4.038840.360.461
RVS-3/4-ZP4.0–7.438840.730.891
RVS-4/4-ZP7.4–85.038843.455.048
8RVS-1/8-ZP0.0–1.819420.260.840.3331.14
RVS-2/8-ZP1.8–2.119420.040.063
RVS-3/8-ZP2.1–3.019420.190.245
RVS-4/8-ZP3.0–4.019420.230.289
RVS-5/8-ZP4.0–5.019420.340.385
RVS-6/8-ZP5.0–7.419420.580.683
RVS-7/8-ZP7.4–12.019421.011.216
RVS-8/8-ZP12.0–85.019424.055.909
Table 19. Values of approximation errors of PM10 concentrations in PVS models, Zabrze 2011–2016.
Table 19. Values of approximation errors of PM10 concentrations in PVS models, Zabrze 2011–2016.
Number of Sub-RangesRegression ModelSub-Ranges of PM10 Concentrations, µg/m3Number of ObservationsMAE, µg/m3Overall MAE, µg/m3RMSE, µg/m3Overall RMSE, µg/m3
1PVS-1/1-ZAB8.2–980.536,46011.8311.8318.718.7
2PVS-1/2-ZAB8.2–33.118,2306.5311.768.7917.00
PVS-2/2-ZAB33.1–980.518,23016.9925.22
4PVS-1/4-ZAB8.2–22.191155.5211.517.615.76
PVS-2/4-ZAB22.1–33.191157.429.7
PVS-3/4-ZAB33.1–55.6911510.0113.2
PVS-4/4-ZAB55.6–980.5911523.0732.6
8PVS-1/8-ZAB8.2–17.945584.4711.205.914.93
PVS-2/8-ZAB17.9–22.145586.398.8
PVS-3/8-ZAB22.1–26.945586.728.9
PVS-4/8-ZAB26.9–33.145588.2410.7
PVS-5/8-ZAB33.1–41.245589.1812.1
PVS-6/8-ZAB41.2–55.6455811.1114.4
PVS-7/8-ZAB55.6–90.1455814.6719.0
PVS-8/8-ZAB90.2–980.5455828.7939.7
Table 20. Values of approximation errors of PM10 concentrations in PVS models, Złoty Potok 2011–2016.
Table 20. Values of approximation errors of PM10 concentrations in PVS models, Złoty Potok 2011–2016.
Number of Sub-RangesRegression ModelSub-Ranges of PM10 Concentrations, µg/m3Number of ObservationsMAE, µg/m3Overall MAE, µg/m3RMSE, µg/m3Overall RMSE, µg/m3
1PVS-1/1-ZP5.1–120.415,5366.7266.7269.109.10
2PVS-1/2-ZP5.1–22.777684.6386.5896.118.646
PVS-2/2-ZP22.7–120.477688.54111.18
4PVS-1/4-ZP5.1–18.038843.9646.4425.128.285
PVS-2/4-ZP18.0–22.738845.2306.76
PVS-3/4-ZP22.7–30.938846.6028.57
PVS-4/4-ZP30.9–120.438849.97212.70
8PVS-1/8-ZP5.1–15.719423.4066.6394.398.519
PVS-2/8-ZP15.7–18.019424.6346.01
PVS-3/8-ZP18.0–20.219424.9636.43
PVS-4/8-ZP20.2–22.719425.6137.20
PVS-5/8-ZP22.7–25.919426.7518.74
PVS-6/8-ZP25.9–30.919427.2029.28
PVS-7/8-ZP30.9–40.519428.02710.23
PVS-8/8-ZP40.5–120.4194212.51615.88
Table 21. Values of approximation errors of PM10 concentrations in RVS models, Zabrze 2011–2016.
Table 21. Values of approximation errors of PM10 concentrations in RVS models, Zabrze 2011–2016.
Number of Sub-RangesRegression ModelSub-Ranges of PM10 Concentrations, µg/m3Number of ObservationsMAE, µg/m3Overall MAE, µg/m3RMSE, µg/m3Overall RMSE, µg/m3
1RVS-1/1-ZAB0.1–1145.036,46011.8411.8418.818.8
2RVS-1/2-ZAB0.1–33.018,2304.669.935.814.77
RVS-2/2-ZAB33.0–1145.018,23015.2023.8
4RVS-1/4-ZAB0.1–20.091153.097.833.810.96
RVS-2/4-ZAB20.0–33.091152.973.5
RVS-3/4-ZAB33.0–57.191155.026.1
RVS-4/4-ZAB57.1–1145.0911520.2330.4
8RVS-1/8-ZAB0.1–14.045582.265.762.87.68
RVS-2/8-ZAB14.0–20.045581.491.7
RVS-3/8-ZAB20.0–26.045581.511.8
RVS-4/8-ZAB26.0–33.045581.742.0
RVS-5/8-ZAB33.0–42.545582.302.7
RVS-6/8-ZAB42.5–57.145583.514.1
RVS-7/8-ZAB57.1–91.145586.998.4
RVS-8/8-ZAB91.1–1145.0455826.3238.0
Table 22. Values of approximation errors of PM10 concentrations in RVS models, Złoty Potok 2011–2016.
Table 22. Values of approximation errors of PM10 concentrations in RVS models, Złoty Potok 2011–2016.
Number of Sub-RangesRegression ModelSub-Ranges of PM10 Concentrations, µg/m3Number of ObservationsMAE, µg/m3Overall MAE, µg/m3RMSE, µg/m3Overall RMSE, µg/m3
1RVS-1/1-ZP3.0–138.015,5366.886.889.349.34
2RVS-1/2-ZP3.0–22.877682.995.053.646.70
RVS-2/2-ZP22.8–138.077687.119.75
4RVS-1/4-ZP3.0–16.038841.883.462.314.48
RVS-2/4-ZP16.0–22.838841.621.90
RVS-3/4-ZP22.8–32.938842.292.71
RVS-4/4-ZP32.9–138.038848.0711.00
8RVS-1/8-ZP3.0–12.119421.432.111.762.66
RVS-2/8-ZP12.1–16.019420.840.99
RVS-3/8-ZP16.0–19.019420.840.97
RVS-4/8-ZP19.0–22.819420.891.03
RVS-5/8-ZP22.8–27.019421.041.21
RVS-6/8-ZP27.0–32.919421.461.69
RVS-7/8-ZP32.9–43.019422.502.94
RVS-8/8-ZP43.0–138.019427.8810.71
Table 23. Values of approximation errors of CO concentrations in PVS models, Zabrze 2011–2016.
Table 23. Values of approximation errors of CO concentrations in PVS models, Zabrze 2011–2016.
Number of Sub-RangesRegression ModelSub-Ranges of CO Concentrations, µg/m3Number of ObservationsMAE, µg/m3Overall MAE, µg/m3RMSE, µg/m3Overall RMSE, µg/m3
1PVS-1/1-ZAB0.16–8.2836,4600.1190.1190.2020.202
2PVS-1/2-ZAB0.16–0.4218,2300.0710.1170.1550.193
PVS-2/2-ZAB0.42–8.2818,2300.1620.230
4PVS-1/4-ZAB0.16–0.3091150.0660.1150.1960.182
PVS-2/4-ZAB0.30–0.4291150.0740.098
PVS-3/4-ZAB0.42–0.6991150.1110.149
PVS-4/4-ZAB0.69–8.2891150.2100.285
8PVS-1/8-ZAB0.16–0.2645580.0720.1170.1980.178
PVS-2/8-ZAB0.26–0.3045580.0690.177
PVS-3/8-ZAB0.30–0.3545580.0650.085
PVS-4/8-ZAB0.35–0.4245580.0790.106
PVS-5/8-ZAB0.42–0.5245580.0990.132
PVS-6/8-ZAB0.52–0.6945580.1210.158
PVS-7/8-ZAB0.69–1.0345580.1530.203
PVS-8/8-ZAB1.03–8.2845580.2750.365
Table 24. Values of approximation errors of CO concentrations in RVS models, Zabrze 2011–2016.
Table 24. Values of approximation errors of CO concentrations in RVS models, Zabrze 2011–2016.
Number of Sub-RangesRegression ModelSub-Ranges of CO Concentrations, µg/m3Number of ObservationsMAE, µg/m3Overall MAE, µg/m3RMSE, µg/m3Overall RMSE, µg/m3
1RVS-1/1-ZAB0.1–9.036,4600.1180.1180.1980.198
2RVS-1/2-ZAB0.1–0.418,2300.0470.1040.0590.161
RVS-2/2-ZAB0.4–9.018,2300.1600.262
4RVS-1/4-ZAB0.1–0.391150.0340.0780.0420.104
RVS-2/4-ZAB0.3–0.491150.0300.035
RVS-3/4-ZAB0.4–0.791150.0550.066
RVS-4/4-ZAB0.7–9.091150.1960.274
8RVS-1/8-ZAB0.1–0.245580.0270.0550.0330.070
RVS-2/8-ZAB0.2–0.345580.0140.016
RVS-3/8-ZAB0.3–0.345580.0140.017
RVS-4/8-ZAB0.3–0.445580.0180.021
RVS-5/8-ZAB0.4–0.545580.0250.030
RVS-6/8-ZAB0.5–0.745580.0380.045
RVS-7/8-ZAB0.7–1.145580.0770.093
RVS-8/8-ZAB1.1–9.045580.2230.309
Table 25. The percentage changes in the overall values of MAE and RMSE obtained by modeling the concentrations in sub-ranges, in relation to the MAE and RMSE values of the corresponding full-range models.
Table 25. The percentage changes in the overall values of MAE and RMSE obtained by modeling the concentrations in sub-ranges, in relation to the MAE and RMSE values of the corresponding full-range models.
Air
Pollutant
Number of
Sub-Models
ZabrzeZloty Potok
PVS Sub-ModelsRVS Sub-ModelsPVS Sub-ModelsRVS Sub-Models
ΔMAE
%
ΔRMSE
%
ΔMAE
%
ΔRMSE
%
ΔMAE
%
ΔRMSE
%
ΔMAE
%
ΔRMSE
%
O32−2.1−2.9−19.2−21.3−5.9−5.6−22.3−23.0
4−4.9−7.4−39.5−43.4−10.2−9.5−44.3−45.5
8−4.9−8.2−60.7−64.1−8.7−8.0−63.5−65.1
NO2−4.6−22.9−9.9−25.2−6.4−11.0−27.9−23.5
4−7.7−36.9−26.1−47.7−6.7−16.0−46.8−45.3
8−8.3−41.9−40.6−62.0−6.5−19.7--
NO22−1.7−7.5−16.8−21.2−3.5−7.0−20.3−23.4
4−3.0−10.5−40.2−45.3−12.1−16.6−40.1−44.4
8−1.9−9.8−61.6−65.7−5.6−11.5−59.7−63.8
SO22−1.5−10.2−17.7−24.7−0.1−10.6−15.4−23.2
4−1.6−12.0−39.5−47.3−2.3−16.4−37.2−46.2
8−1.4−13.6−57.2−63.5−8.4−23.0−57.2−64.2
PM102−0.5−8.9−16.1−21.3−2.0−5.0−26.6−28.2
4−2.7−15.5−33.9−41.7−4.2−9.0−49.6−52.0
8−5.3−20.0−51.3−59.1−1.3−6.4−69.3−71.5
CO2−2.4−4.8−11.8−18.9----
4−3.5−10.1−33.4−47.3----
8−2.2−12.1−53.6−64.5----
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Hoffman, S.; Filak, M.; Jasiński, R. Air Quality Modeling with the Use of Regression Neural Networks. Int. J. Environ. Res. Public Health 2022, 19, 16494. https://doi.org/10.3390/ijerph192416494

AMA Style

Hoffman S, Filak M, Jasiński R. Air Quality Modeling with the Use of Regression Neural Networks. International Journal of Environmental Research and Public Health. 2022; 19(24):16494. https://doi.org/10.3390/ijerph192416494

Chicago/Turabian Style

Hoffman, Szymon, Mariusz Filak, and Rafał Jasiński. 2022. "Air Quality Modeling with the Use of Regression Neural Networks" International Journal of Environmental Research and Public Health 19, no. 24: 16494. https://doi.org/10.3390/ijerph192416494

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop