Next Article in Journal
Xanthone Derivatives in the Fight against Glioblastoma and Other Cancers
Next Article in Special Issue
Evaluation on Lateral Stability of Vehicle: Impacts of Pavement Rutting, Road Alignment, and Adverse Weather
Previous Article in Journal
Cattle Weight Estimation Using Fully and Weakly Supervised Segmentation from 2D Images
Previous Article in Special Issue
Chassis Design Target Setting for a High-Performance Car Using a Virtual Prototype
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Forecasting the Number of Road Accidents in Polish Provinces Using Trend Models

by
Piotr Gorzelańczyk
Transport Department Podchorazych 10 Street, Stanislaw Staszic State University of Applied Sciences in Pila, 64-920 Pila, Poland
Appl. Sci. 2023, 13(5), 2898; https://doi.org/10.3390/app13052898
Submission received: 16 January 2023 / Revised: 17 February 2023 / Accepted: 18 February 2023 / Published: 23 February 2023

Abstract

:
Many people die on the streets every year. The value is declining year by year, but there are still plenty of them. Although the COVID-19 pandemic reduced the number of traffic accidents, it is still very high. For this reason, in order to do everything possible to minimize the number of road accidents, it is important to know the federal states with the most road accidents and what the accident forecast is for the next few years. The purpose of this article is to predict the number of road accidents by state in Poland. The survey was divided into two parts. The first is an analysis of the annual data of police statistics on the number of road accidents in Poland for the period 2000–2021, upon the prediction of the number of traffic accidents from 2022 to 2031 was decided. The second part of the study looked at monthly data from 2000 to 2021. Again, the forecasts analyzed were determined for the period from January 2022 to December 2023. The results of this study indicate that a decrease in the number of accidents is also expected in the coming years, which becomes especially clear when analyzing the annual data. It is worth noting that the prevailing COVID-19 pandemic has distorted the results obtained. The study was performed in MS Excel using the selected propensity model.

1. Introduction

A traffic accident is an event that causes damage to property, as well as injury or death to traffic participants. According to the World Health Organization (WHO), about 1.3 million people die each year in road accidents. Those accidents account for about 3% of GDP in most countries around the world. Automobile accidents are the leading cause of death for minors and young people between the ages of 5 and 29 [1]. The United Nations General Assembly has set an ambitious goal of halving the number of traffic accident victims by 2030 [2,3].
Road accident severity is an attribute used to determine the severity of traffic collisions. Predicting accident severity is important for relevant authorities to develop road safety policies to prevent accidents and reduce injuries, fatalities and property damage [4,5,6,7,8]. Identifying key factors affecting accident severity is a prerequisite for taking measures to eliminate and reduce accident severity [9]. Yang et al. [10] proposed a DNN (deep neutral network) multicarbon structure for predicting various degrees of injury, death and severity of property loss. This allows for a comprehensive and accurate analysis of the severity of traffic accidents.
There are several sources of accident data. They are usually collected and evaluated by government agencies through responsible leadership agencies. Data are collected from police reports, insurance databases or hospital records. In some cases, traffic accident information is processed on a large scale for the transportation sector [11].
Intelligent transportation systems are currently the most important source of data for analyzing and predicting traffic accidents. These data can be processed using the vehicle’s GPS device [12]. In addition, vehicle information can be detected using a roadside vehicle detection system, which can continuously record vehicle information (speed, traffic volume, vehicle type, etc.) [13]. License plate recognition systems can also collect large amounts of traffic data during surveillance [14]. Another source of traffic and accident information is social media, but its accuracy can be insufficient due to the incompetence of reporters [15].
In this article, we chose a trend model to predict the number of traffic accidents by state. Exponential smoothing and neural networks for predicting the number of traffic accidents have been used by authors in other studies [16,17].

2. Literature Review

For accident data to be relevant, multiple data sources must be handled properly. Combining data from many sources, by merging various road accident data, increases the accuracy of analytical results [18].
A statistical study to estimate the severity of road accidents and clarify the relationship between the accidents and road users was conducted by Vilaca et al. [19]. The result of this research was suggestions for improving traffic safety standards and adopting other traffic safety measures.
Buck et al. [20] conducted a statistical study of traffic safety in selected regions of Poland based on the speed at which the number of road accidents and their causes were determined. The study used multivariate statistical analysis to examine the safety aspects of accident perpetrators.
The choice of the source of accident data for analysis depends on the type of traffic problem under consideration. Combining statistical models with other natural driving data or other data acquired by intelligent transportation systems can improve the accuracy of accident prediction and contribute to the elimination of accidents [21].
Various methods for predicting the number of accidents can be found in the literature. Time series methods are most commonly used to predict the number of traffic accidents [22,23]. The disadvantages are the inability to assess the quality of the prediction based on outdated forecasts and the frequent autocorrelation of the residual components of the accident rates [24]. Prochazka et al. [25] used multiple seasonality models for forecasting, and Sunny et al. [26] used Holt–Winters exponential smoothing. Its limitations include the inability to introduce exogenous variables into the model [27,28].
The vector autoregression model has also been used to predict the number of traffic accidents. The disadvantage is that it requires a large number of variable observations to properly estimate the parameters [29]. To analyze the number of deaths [30] and the Al-Madani curve-fitting regression model [31], they require only a simple linear relationship [32] and an order of autoregression (assuming the series is already stationary) [33].
Biswas et al. [34] used random forest regression to predict the number of traffic accidents. In this case, the data contain clusters of correlated features with similar validity to the original data, with small clusters favored over large clusters [35] and method instability and spike prediction [36]. Chudy-Laskowska and Pisula [37] applied autoregressive quadratic trend models, one-dimensional cyclical trend models and exponentially adjusted models to the forecasting problem at hand. Moving average models can also be used to predict the problem at hand. Its drawbacks are poor prediction accuracy, the resulting loss of data, and failure to account for trends and seasonal effects [38]. Prochozka and Camej [39] used the GARMA method. This method imposes some restrictions on the parameter space to ensure the stationarity of the process. ARMA models for steady-state processes and ARIMA or SARIMA models for transient processes are often used for forecasting [25,40,41]. Although these models lead to great flexibility of the models in question, they are also a disadvantage, as identifying good models requires more experience from the researcher than, for example, regression analysis [42]. Another disadvantage is the linearity of ARIMA models [43].
Chudy-Laskowska and Pisula used the ANOVA method in a study [44] to predict the number of traffic accidents. The disadvantage of this method is additional assumptions, especially the assumption of sphericity, the violation of which can lead to erroneous conclusions [45]. Neural network models are also used to predict the number of traffic accidents.
The disadvantages of SSNs are the need for experience in this area [44,46] and the dependence of the final solution on the initial conditions of the network and the lack of interpretability in the traditional way. As a black box, the model provides results as they come in, without any analytical knowledge [47].
A new method of prediction is the use of the Hadoop model by Kumar et al. [48]. The disadvantage of this method is that it does not support small data files [49]. Karlaftis and Vlahogianni [41] used the GARCH model for prediction. The disadvantage of this method is its complex formalism and complex model [50,51]. On the other hand, McIlroy and his team used the ADF test [52], which suffers from low power due to autocorrelation of random components [53]. Authors in other publications [54,55] have also used data mining techniques for prediction, but those usually suffer from a large number of general explanations [56]. Sevego et al. also found combinations of different models [57]. Parametric models have also been proposed in Bloomfield’s work [58].
Given the above literature review, a quick and simple method for determining the forecast of the number of traffic accidents offered by MS Excel, such as trend models, was used for the study. Despite many studies using trend models, they have not been used to forecast the number of traffic accidents in Poland. For this reason, the author addressed the subject under discussion. Despite some limitations—it does not take into account the influence of seasonality in road accidents—it can be used to forecast the number of road accidents.

3. Materials and Methods

More than 38 million people live in Poland (Figure 1). It occupies an area of 312,705 km2 and is divided into 16 provinces (Table 1, Figure 2). In the states studied, the average reduction in traffic accidents between 2001 and 2021 was more than 56%. Strongest in Kuyavsko–Pomorskie (70%) and Podraskiye (69%), lowest in Rubskiye (32%). The number of automobile accidents depends on the number of residents living in a particular state [59,60]. The number of accidents in Poland is still very high compared to the rest of the European Union [61]. Therefore, every effort should be made to reduce this value and identify the states with the most traffic accidents (Figure 3, Figure 4 and Figure 5). Based on Figure 1 and Figure 3, it can be concluded that the highest number of traffic accidents is in the Mazowieckie Province and the highest number of accidents per 10,000 inhabitants is in the Lodz Province.
In addition, since this study only used historical data on the number of traffic accidents and did not consider other factors, there is a relationship between the data used, and for this reason the method discussed can be used.
Considering Figure 3 and Figure 4, it can be seen that the number of accidents is decreasing, and accidents are seasonal. The highest number of accidents occurs in the summer months and the lowest in the winter months. In addition, we can see that the beginning of the COVID-19 pandemic, 2020, disrupted the number of traffic accidents, when there was a huge decrease. Between 2001 and 2021, the largest decrease in the number of traffic accidents, more than 300%, is seen in the following provinces: kujawsko–pomorskie, lubelskie, podlaskie. On the other hand, the smallest decrease, during the analyzed period, was seen in Lubuskie province and amounted to less than 50%.
The purpose of this article is to forecast the number of road accidents in Poland in each province. Statistical data of the Police from 2007 to 2021 were used as input data. The following trend models available in Excel software were used to forecast the number of road accidents in each province:
-
Exponential;
-
Linear;
-
Logarithmic;
-
2nd degree polynomial;
-
3rd degree polynomial;
-
Polynomial of the 4th degree;
-
Polynomial of the 5th degree;
-
Polynomial of the 6th degree;
-
Potentiometric.
Then, for the obtained forecasts, the errors of expired forecasts were determined based on Equations (1)–(5):
  • ME—mean error
M E = 1 n i = 1 n ( Y i Y p )
  • MAE—mean average error
M A E = 1 n i = 1 n | Y i Y p |
  • MPE—mean percentage error
M P E = 1 n i = 1 n Y i Y p Y i
  • MAPE—mean absolute percentage error
M A P E = 1 n i = 1 n | Y i Y p | Y i
  • MSE—mean square error
M S E = 1 n i = 1 n ( Y i Y p ) 2
where
n—the length of the forecast horizon,
Y—observed value of road accidents,
Yp—forecasted value of road accidents.

4. Results

For the analyzed trend models, in the first step, the formulas for the statistical data analyzed on an annual and monthly basis for each federal state were determined using Excel software. For annual (2001 to 2021) and monthly (January 2007 to December 2021) data, serial graphs were drawn, and trend lines and R-squared values were determined. As can be seen, the R-squared coefficient, a measure of the quality of the model fit, is mostly good or fair for annual data, and mostly poor or satisfactory for monthly data, regardless of the model used. This is mainly due to the seasonality of traffic accident numbers in each state, which is not well accounted for by the methods used. Table 2 and Table 3 and Appendix A shows trend model formulas with annual and monthly data for each state analyzed.
We then used the data in Table 2 and Table 3 and Appendix A to determine the number of traffic accident probabilities. For yearly data, this is the period from 2022 to 2031, and for monthly data, it is from January 2022 to December 2023. In this case, the prediction is based on data from police statistics. Prediction results using this method depend on the choice of model and its tuning.
A trend model with the lowest mean percentage error and mean absolute percentage error was chosen to predict the number of traffic accidents in the analyzed federal states. Based on this, we found that for annual statistical data, the best model is often the exponential model, which has the smallest analyzed error. In addition, polynomial, logarithmic and power models were also used for annual data. In this case, the average MAPE error was 0.52%. On the other hand, for monthly data, linear and exponential models produce the smallest MAPE errors depending on the states studied. However, the average for this error was 65%. This is a very large value and suggests that the propensity model should not be used to predict the number of traffic accidents. Table 4 and Table 5 summarize the errors between the lowest modeled annual and monthly data. Using these models, the predicted number of accidents for the next year was determined on a monthly and yearly basis (Figure 6 and Figure 7). Based on Figure 4 and Figure 5, the number of traffic accidents is expected to decline further in the next few years. Please note that the COVID-19 pandemic has changed our forecast significantly. As can be seen from Figure 6, the trend model does not account for the seasonality that occurs in traffic accidents and should not be used in the case under consideration.

5. Conclusions

Forecasts of the number of accidents in Poland for individual provinces were determined by selected trend models using Excel. The results show that we can still expect a decrease in the number of traffic accidents in the coming years. It should be noted that the COVID-19 pandemic has distorted the results obtained, and if it continues and traffic restrictions are introduced, the proposed model may not be adequate. The value of the average error of 0.52% for annual data can testify to the choice of an effective forecasting method. As we can see, trend models fail for forecasting the monthly number of traffic accidents in which there is seasonality. On the other hand, for annual data, the results are at a high level. The advantage of trend models is their speed of determining the forecast.
The forecast number of traffic accidents obtained in the article can be used, in the future, to formulate further measures to minimize the number of accidents in the analyzed country. These measures may include, for example, the introduction of higher fines for traffic offenses on Polish roads from 1 January 2022.
In his further research, the author plans to take into account more factors affecting the accident rate in Poland and use other methods to forecast the number of road accidents. These may include, but are not limited to traffic volume, day of the week or the age of the perpetrator of the accident and also conduct research using other methods for forecasting the number of road accidents in Poland, such as neural networks and exponential smoothing.

Funding

The article was financed by the university’s own funds.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The article was written on the basis of public data available on the pages of the Police Department.

Conflicts of Interest

The author declares no conflict of interest.

Appendix A

Table A1. Trend models for the Lower Silesian province.
Table A1. Trend models for the Lower Silesian province.
Data/ModelAnnual DataMonthly Data
Exponentialy = 3660.6e−0.032xy = 1E-08x2.4702
R2 = 0.7993R2 = 0.3518
Lineary = −79.014x + 3506.4y = 0.1826x − 4546.7
R2 = 0.8145R2 = 0.3689
Logarithmicy = −539.9ln(x) + 3804.1y = 7618.2ln(x) − 77969
R2 = 0.6604R2 = 0.3679
Polynomial of 2nd degreey = −2.662x2 − 20.451x + 3282y = 8E-06x2 − 0.4824x + 9334.2
R2 = 0.8415R2 = 0.3703
Polynomial of 3rd degreey = −0.3402x3 + 8.5652x2 − 121.56x + 3488.5y = −3E-08x3 + 0.0041x2 − 171.09x + 2E+06
R2 = 0.8537R2 = 0.4156
Polynomial of 4th degreey = 0.0557x4 − 2.789x3 + 43.763x2 − 303.3x + 3729.9y = −2E-11x4 + 3E-06x3 − 0.1807x2 + 4971.6x − 5E+07
R2 = 0.8625R2 = 0.4408
Polynomial of 5th degreey = −0.0019x5 + 0.1617x4 − 4.8892x3 + 61.752x2 − 365.45x + 3790.3y = 2E-14x5 − 3E-09x4 + 0.0003x3 − 12.199x2 + 255831x − 2E+09
R2 = 0.8628R2 = 0.4826
Polynomial of 6th degreey = 0.002x6 − 0.1329x5 + 3.4719x4 − 44.893x3 + 296.45x2 − 961.09x + 4248y = 2E-17x6 − 4E-12x5 + 4E-07x4 − 0.0233x3 + 727.3x2 − 1E+07x + 8E+10
R2 = 0.8703R2 = 0.5586
Potentiometricy = 4062.9x−0.21y = 1E-08x2.4702
R2 = 0.6058R2 = 0.3518
Table A2. Trend models for the Kuyavian–Pomeranian province.
Table A2. Trend models for the Kuyavian–Pomeranian province.
Data/ModelAnnual DataMonthly Data
Exponentialy = 2968.8e−0.066xy = 5563.1e−2E−05x
R2 = 0.9776R2 = 0.0624
Lineary = −100.67x + 2672.2y = −0.0491x + 4343.8
R2 = 0.9481R2 = 0.0651
Logarithmicy = −749.4ln(x) + 3184.3y = −2093ln(x) + 24565
R2 = 0.9122R2 = 0.068
Polynomial of 2nd degreey = 3.7309x2 − 182.75x + 2986.9y = 4E-05x2 − 3.7996x + 82631
R2 = 0.9861R2 = 0.174
Polynomial of 3rd degreey = 0.2451x3 − 4.357x2 − 109.91x + 2838.1y = −2E-09x3 + 0.0003x2 − 14.473x + 231109
R2 = 0.9906R2 = 0.1745
Polynomial of 4th degreey = −0.0404x4 + 2.0229x3 − 29.91x2 + 22.025x + 2662.8y = −3E-11x4 + 5E-06x3 − 0.313x2 + 8706.8x − 9E+07
R2 = 0.994R2 = 0.3524
Polynomial of 5th degreey = −0.0044x5 + 0.1989x4 − 2.7169x3 + 10.689x2 − 118.23x + 2799.1y = 1E-14x5 − 2E-09x4 + 0.0002x3 − 7.3524x2 + 155643x − 1E+09
R2 = 0.995R2 = 0.3874
Polynomial of 6th degreey = −0.0001x6 + 0.0026x5 + 0.0235x4 − 0.5973x3 − 1.7467x2 − 86.666x + 2774.9y = 1E-17x6 − 3E-12x5 + 3E-07x4 − 0.0159x3 + 494.62x2 − 8E+06x + 6E+10
R2 = 0.995R2 = 0.4732
Potentiometricy = 3938.4x−0.464y = 4E+07x−0.914
R2 = 0.8524R2 = 0.0652
Table A3. Trend models for Lublin province.
Table A3. Trend models for Lublin province.
Data/ModelAnnual DataMonthly Data
Exponentialy = 3305.9e−0.059xy = 3789.7e−2E−05x
R2 = 0.9677R2 = 0.0549
Lineary = −103.43x + 2975.2y = −0.0316x + 3035.6
R2 = 0.9615R2 = 0.0557
Logarithmicy = −733.2ln(x) + 3421.7y = −1340ln(x) + 15972
R2 = 0.8388R2 = 0.0576
Polynomial of 2nd degreey = 1.1254x2 − 128.19x + 3070.1y = 2E-05x2 − 1.8755x + 41525
R2 = 0.9649R2 = 0.1102
Polynomial of 3rd degreey = 0.2624x3 − 7.5334x2 − 50.211x + 2910.7y = 7E-10x3 − 7E-05x2 + 1.9618x − 11858
R2 = 0.9699R2 = 0.1103
Polynomial of 4th degreey = −0.0316x4 + 1.6547x3 − 27.546x2 + 53.118x + 2773.5y = −1E-11x4 + 2E-06x3 − 0.1304x2 + 3628.8x − 4E+07
R2 = 0.9718R2 = 0.174
Polynomial of 5th degreey = −0.0058x5 + 0.2879x4 − 4.6744x3 + 26.667x2 − 134.16x + 2955.5y = 7E-15x5 − 1E-09x4 + 0.0001x3 − 5.1059x2 + 107484x − 9E+08
R2 = 0.9736R2 = 0.2102
Polynomial of 6th degreey = −0.0012x6 + 0.0765x5 − 1.7937x4 + 20.481x3 − 120.92x2 + 240.4x + 2667.7y = 6E-18x6 − 1E-12x5 + 2E-07x4 − 0.0086x3 + 267.23x2 − 4E+06x + 3E+10
R2 = 0.9756R2 = 0.2624
Potentiometricy = 4104.7x−0.402y = 1E+07x−0.812
R2 = 0.7726R2 = 0.0567
Table A4. Trend models for Lubuskie province.
Table A4. Trend models for Lubuskie province.
Data/ModelAnnual DataMonthly Data
Exponentialy = 937.95e−0.019xy = 900.21e2E−06x
R2 = 0.6083R2 = 0.0005
Lineary = −13.829x + 920.97y = 0.0027x + 862.02
R2 = 0.5766R2 = 0.0013
Logarithmicy = −73.48ln(x) + 927.65y = 106.88ln(x) − 161.11
R2 = 0.2827R2 = 0.0011
Polynomial of 2nd degreey = −1.6013x2 + 21.401x + 785.93y = 7E-06x2 − 0.5564x + 12532
R2 = 0.8019R2 = 0.0163
Polynomial of 3rd degreey = 0.1435x3 − 6.3358x2 + 64.039x + 698.81y = −1E-08x3 + 0.0017x2 − 69.977x + 978278
R2 = 0.8521R2 = 0.1304
Polynomial of 4th degreey = 0.0061x4 − 0.1246x3 − 2.4829x2 + 44.145x + 725.24y = −5E-12x4 + 8E-07x3 − 0.0477x2 + 1303.7x − 1E+07
R2 = 0.8546R2 = 0.1579
Polynomial of 5th degreey = −0.0067x5 + 0.3758x4 − 7.4473x3 + 60.24x2 − 172.54x + 935.78y = 4E-15x5 − 9E-10x4 + 8E-05x3 − 3.3252x2 + 69716x − 6E+08
R2 = 0.9327R2 = 0.2052
Polynomial of 6th degreey = −0.0003x6 + 0.0147x5 − 0.1649x4 − 0.9136x3 + 21.908x2 − 75.253x + 861.02y = 4E-18x6 − 1E-12x5 + 1E-07x4 − 0.0056x3 + 174.52x2 − 3E+06x + 2E+10
R2 = 0.9373R2 = 0.2721
Potentiometricy = 949.27x−0.102y = 481.83x0.0656
R2 = 0.3059R2 = 0.0004
Table A5. Trend models for Łódź Province.
Table A5. Trend models for Łódź Province.
Data/ModelAnnual DataMonthly Data
Exponentialy = 5462.8e−0.027xy = 1286.6e2E−05x
R2 = 0.6749R2 = 0.0459
Lineary = −101x + 5231.5y = 0.0403x + 778.52
R2 = 0.7447R2 = 0.053
Logarithmicy = −624.7ln(x) + 5470.4y = 1658.5ln(x) − 15182
R2 = 0.4947R2 = 0.0515
Polynomial of 2nd degreey = −7.4751x2 + 63.449x + 4601.1y = 2E-05x2 − 2.0299x + 43991
R2 = 0.8636R2 = 0.0931
Polynomial of 3rd degreey = −0.5521x3 + 10.745x2 − 100.64x + 4936.3y = −1E-08x3 + 0.0017x2 − 73.911x + 1E+06
R2 = 0.8816R2 = 0.1168
Polynomial of 4th degreey = −0.1513x4 + 6.1071x3 − 84.97x2 + 393.56x + 4279.9y = −2E-11x4 + 3E-06x3 − 0.1873x2 + 5189.5x − 5E+07
R2 = 0.9181R2 = 0.195
Polynomial of 5th degreey = −0.0197x5 + 0.9343x4 − 15.398x3 + 99.229x2 − 242.77x + 4898.2y = 4E-15x5 − 9E-10x4 + 7E-05x3 − 3.1414x2 + 66850x − 6E+08
R2 = 0.9344R2 = 0.2024
Polynomial of 6th degreey = 0.0044x6 − 0.3069x5 + 8.1925x4 − 103.11x3 + 613.84x2 − 1548.8x + 5901.8y = 1E-17x6 − 3E-12x5 + 3E-07x4 − 0.0154x3 + 479.91x2 − 8E+06x + 6E+10
R2 = 0.9546R2 = 0.2982
Potentiometricy = 5778.5x−0.165y = 2.9338x0.6322
R2 = 0.428R2 = 0.0444
Table A6. Trend models for the Lesser Poland province.
Table A6. Trend models for the Lesser Poland province.
Data/ModelAnnual DataMonthly Data
Exponentialy = 5751.5e−0.034xy = 3818.4e−8E−06x
R2 = 0.7969R2 = 0.0093
Lineary = −125.74x + 5453.3y = −0.0187x + 3570.8
R2 = 0.8736R2 = 0.0077
Logarithmicy = −856.4ln(x) + 5920.7y = −816ln(x) + 11472
R2 = 0.7035R2 = 0.0085
Polynomial of 2nd degreey = −4.2552x2 − 32.128x + 5094.5y = 4E-05x2 − 3.0715x + 67295
R2 = 0.9027R2 = 0.0669
Polynomial of 3rd degreey = −0.9622x3 + 27.499x2 − 318.11x + 5678.7y = −4E-09x3 + 0.0005x2 − 24.164x + 360727
R2 = 0.9441R2 = 0.0683
Polynomial of 4th degreey = −0.0701x4 + 2.1228x3 − 16.844x2 − 89.152x + 5374.6y = −3E-11x4 + 5E-06x3 − 0.3065x2 + 8523.8x − 9E+07
R2 = 0.9501R2 = 0.2084
Polynomial of 5th degreey = −0.0057x5 + 0.2409x4 − 4.0384x3 + 35.93x2 − 271.46x + 5551.8y = 2E-14x5 − 4E-09x4 + 0.0003x3 − 12.543x2 + 263934x − 2E+09
R2 = 0.9511R2 = 0.2952
Polynomial of 6th degreey = 0.0014x6 − 0.1005x5 + 2.6393x4 − 33.023x3 + 205.98x2 − 703.03x + 5883.4y = 1E-17x6 − 3E-12x5 + 3E-07x4 − 0.0176x3 + 547.22x2 − 9E+06x + 6E+10
R2 = 0.9527R2 = 0.3826
Potentiometricy = 6386.4x−0.219y = 96700x−0.334
R2 = 0.5906R2 = 0.0101
Table A7. Trend models for the Masovia province.
Table A7. Trend models for the Masovia province.
Data/ModelAnnual DataMonthly Data
Exponentialy = 8271.5e−0.048xy = 121.05e9E−05x
R2 = 0.9283R2 = 0.4903
Lineary = −244.75x + 7786.2y = 0.4231x − 12672
R2 = 0.8992R2 = 0.483
Logarithmicy = −1744ln(x) + 8908.6y = 17743ln(x) − 183769
R2 = 0.8619R2 = 0.4869
Polynomial of 2nd degreey = 4.749x2 − 344.48x + 8151.9y = −8E-05x2 + 6.7035x − 143764
R2 = 0.9082R2 = 0.5134
Polynomial of 3rd degreey = −0.1364x3 + 9.0453x2 − 381.46x + 8224.4y = −2E-08x3 + 0.0026x2 − 104.17x + 1E+06
R2 = 0.9084R2 = 0.5181
Polynomial of 4th degreey = 0.0347x4 − 1.593x3 + 29.046x2 − 480.3x + 8350.7y = −4E-11x4 + 6E-06x3 − 0.3799x2 + 10544x − 1E+08
R2 = 0.9086R2 = 0.5446
Polynomial of 5th degreey = −0.0841x5 + 4.4507x4 − 85.146x3 + 713.52x2 − 2747.9x + 10479y = 8E-15x5 − 2E-09x4 + 0.0001x3 − 6.0186x2 + 128241x − 1E+09
R2 = 0.9493R2 = 0.5468
Polynomial of 6th degreey = 0.0162x6 − 1.1063x5 + 29.128x4 − 370.28x3 + 2315.8x2 − 6656.5x + 13390y = 4E-17x6 − 1E-11x5 + 1E-06x4 − 0.0603x3 + 1885.7x2 − 3E+07x + 2E+11
R2 = 0.984R2 = 0.6682
Potentiometricy = 9945.1x−0.324y = 3E-14x3.7203
R2 = 0.8059R2 = 0.4954
Table A8. Trend models for Opole province.
Table A8. Trend models for Opole province.
Data/ModelAnnual DataMonthly Data
Exponentialy = 1384.8e−0.044xy = 313.34e2E−05x
R2 = 0.9257R2 = 0.0597
Lineary = −37.996x + 1306.3y = 0.0201x − 25.195
R2 = 0.909R2 = 0.0728
Logarithmicy = −270ln(x) + 1471.8y = 826.22ln(x) − 7974.5
R2 = 0.797R2 = 0.0703
Polynomial of 2nd degreey = 0.5472x2 − 50.034x + 1352.4y = 1E-05x2 − 1.1854x + 25138
R2 = 0.9145R2 = 0.1476
Polynomial of 3rd degreey = 0.0029x3 + 0.451x2 − 49.168x + 1350.7y = −3E-09x3 + 0.0004x2 − 15.737x + 227570
R2 = 0.9145R2 = 0.153
Polynomial of 4th degreey = −0.0389x4 + 1.7163x3 − 24.176x2 + 77.988x + 1181.8y = −8E-12x4 + 1E-06x3 − 0.0787x2 + 2184.9x − 2E+07
R2 = 0.9353R2 = 0.2282
Polynomial of 5th degreey = 0.0044x5 − 0.281x4 + 6.5105x3 − 65.24x2 + 219.85x + 1043.9y = 4E-15x5 − 8E-10x4 + 7E-05x3 − 2.8588x2 + 60215x − 5E+08
R2 = 0.9423R2 = 0.2645
Polynomial of 6th degreey = −0.0006x6 + 0.0425x5 − 1.2443x4 + 18.152x3 − 133.54x2 + 393.19x + 910.74y = 4E-18x6 − 1E-12x5 + 1E-07x4 − 0.0064x3 + 199.59x2 − 3E+06x + 2E+10
R2 = 0.9454R2 = 0.3572
Potentiometricy = 1636x−0.3y = 0.0415x0.9283
R2 = 0.7549R2 = 0.0575
Table A9. Trend models for the Subcarpathia province.
Table A9. Trend models for the Subcarpathia province.
Data/ModelAnnual DataMonthly Data
Exponentialy = 2661.8e−0.032xy = 1690.1e−2E−06x
R2 = 0.849R2 = 0.0006
Lineary = −56.701x + 2542.6y = −0.0013x + 1628.5
R2 = 0.8799R2 = 0.0001
Logarithmicy = −359.2ln(x) + 2695y = −74.05ln(x) + 2360.1
R2 = 0.613R2 = 0.0002
Polynomial of 2nd degreey = −2.7746x2 + 4.3389x + 2308.6y = 2E-05x2 − 1.4932x + 32768
R2 = 0.9413R2 = 0.0423
Polynomial of 3rd degreey = 0.1297x3 − 7.0544x2 + 42.883x + 2229.9y = −2E-09x3 + 0.0002x2 − 9.9627x + 150593
R2 = 0.945R2 = 0.0429
Polynomial of 4th degreey = −0.022x4 + 1.0973x3 − 20.961x2 + 114.69x + 2134.5y = −2E-11x4 + 3E-06x3 − 0.1615x2 + 4490.4x − 5E+07
R2 = 0.9479R2 = 0.1587
Polynomial of 5th degreey = −0.0062x5 + 0.32x4 − 5.676x3 + 37.055x2 − 85.735x + 2329.3y = 8E-15x5 − 2E-09x4 + 0.0001x3 − 5.7557x2 + 121260x − 1E+09
R2 = 0.954R2 = 0.2129
Polynomial of 6th degreey = 0.0012x6 − 0.0856x5 + 2.3262x4 − 29.922x3 + 179.3x2 − 446.75x + 2606.7y = 8E-18x6 − 2E-12x5 + 2E-07x4 − 0.0116x3 + 360.56x2 − 6E+06x + 4E+10
R2 = 0.9598R2 = 0.3245
Potentiometricy = 2875.5x−0.197y = 4146.2x−0.092
R2 = 0.5705R2 = 0.0007
Table A10. Trend models for Podlaskie province.
Table A10. Trend models for Podlaskie province.
Data/ModelAnnual DataMonthly Data
Exponentialy = 1605.6e−0.057xy = 1308.3e−4E−06x
R2 = 0.9468R2 = 0.0019
Lineary = −49.516x + 1456.8y = −0.0061x + 1364.2
R2 = 0.9544R2 = 0.004
Logarithmicy = −359.1ln(x) + 1688.1y = −260.2ln(x) + 3878.5
R2 = 0.8715R2 = 0.0042
Polynomial of 2nd degreey = 0.8747x2 − 68.759x + 1530.5y = 6E-06x2 − 0.5033x + 11742
R2 = 0.963R2 = 0.0116
Polynomial of 3rd degreey = −0.0076x3 + 1.1243x2 − 71.007x + 1535.1y = 3E-09x3 − 0.0003x2 + 12.994x − 176022
R2 = 0.9631R2 = 0.0144
Polynomial of 4th degreey = −0.0326x4 + 1.4274x3 − 19.501x2 + 35.49x + 1393.7y = −1E-11x4 + 2E-06x3 − 0.1254x2 + 3495.8x − 4E+07
R2 = 0.9721R2 = 0.1273
Polynomial of 5th degreey = −0.0041x5 + 0.1948x4 − 3.0764x3 + 19.076x2 − 97.781x + 1523.2y = 3E-15x5 − 6E-10x4 + 5E-05x3 − 2.1244x2 + 45221x − 4E+08
R2 = 0.9759R2 = 0.1385
Polynomial of 6th degreey = 0.0003x6 − 0.0258x5 + 0.7421x4 − 9.6906x3 + 57.881x2 − 196.26x + 1598.8y = 6E-18x6 − 2E-12x5 + 2E-07x4 − 0.0091x3 + 282.88x2 − 5E+06x + 3E+10
R2 = 0.9765R2 = 0.2485
Potentiometricy = 2004.2x−0.392y = 7319.3x−0.178
R2 = 0.7814R2 = 0.002
Table A11. Trend models for Pomeranian province.
Table A11. Trend models for Pomeranian province.
Data/ModelAnnual DataMonthly Data
Exponentialy = 3663.6e−0.026xy = 84.959e8E−05x
R2 = 0.7854R2 = 0.417
Lineary = −69.164x + 3559.7y = 0.1573x − 4551.4
R2 = 0.8237R2 = 0.4341
Logarithmicy = −496.7ln(x) + 3872.3y = 6541.5ln(x) − 67579
R2 = 0.7377R2 = 0.4306
Polynomial of 2nd degreey = −1.0377x2 − 46.334x + 3472.2y = 2E-05x2 − 1.8844x + 38064
R2 = 0.8291R2 = 0.455
Polynomial of 3rd degreey = −0.8089x3 + 25.655x2 − 286.73x + 3963.3y = −3E-08x3 + 0.0033x2 − 139.08x + 2E+06
R2 = 0.9203R2 = 0.5015
Polynomial of 4th degreey = −0.0349x4 + 0.7259x3 + 3.5961x2 − 172.83x + 3812.1y = −2E-11x4 + 3E-06x3 − 0.1737x2 + 4788.7x − 5E+07
R2 = 0.9249R2 = 0.5384
Polynomial of 5th degreey = −0.002x5 + 0.0738x4 − 1.4261x3 + 22.029x2 − 236.51x + 3873.9y = 1E-14x5 − 2E-09x4 + 0.0002x3 − 7.3691x2 + 154981x − 1E+09
R2 = 0.9253R2 = 0.5622
Polynomial of 6th degreey = 1E-04x6 − 0.0085x5 + 0.2378x4 − 3.4082x3 + 33.657x2 − 266.02x + 3896.6y = 8E-18x6 − 2E-12x5 + 2E-07x4 − 0.0108x3 + 336.03x2 − 6E+06x + 4E+10
R2 = 0.9253R2 = 0.5882
Potentiometricy = 4053x−0.178y = 6E-12x3.1379
R2 = 0.6494R2 = 0.4138
Table A12. Trend models for the Silesian province.
Table A12. Trend models for the Silesian province.
Data/ModelAnnual DataMonthly Data
Exponentialy = 8278.3e−0.053xy = 1795.9e2E−05x
R2 = 0.9134R2 = 0.0739
Lineary = −234.11x + 7432y = 0.1003x + 182.21
R2 = 0.9536R2 = 0.0874
Logarithmicy = −1535ln(x) + 8173.7y = 4135.4ln(x) − 39626
R2 = 0.7117R2 = 0.0852
Polynomial of 2nd degreey = −6.7177x2 − 86.325x + 6865.4y = 5E-05x2 − 4.2277x + 90523
R2 = 0.9765R2 = 0.134
Polynomial of 3rd degreey = 0.6756x3 − 29.012x2 + 114.46x + 6455.2y = −2E-08x3 + 0.0029x2 − 124.35x + 2E+06
R2 = 0.9829R2 = 0.1517
Polynomial of 4th degreey = −0.1059x4 + 5.337x3 − 96.012x2 + 460.4x + 5995.8y = −4E-11x4 + 6E-06x3 − 0.3704x2 + 10268x − 1E+08
R2 = 0.9872R2 = 0.2328
Polynomial of 5th degreey = −0.0031x5 + 0.0633x4 + 1.9849x3 − 67.299x2 + 361.21x + 6092.1y = 2E-14x5 − 4E-09x4 + 0.0003x3 − 13.992x2 + 294603x − 2E+09
R2 = 0.9873R2 = 0.275
Polynomial of 6th degreey = 0.0019x6 − 0.1298x5 + 3.2675x4 − 36.739x3 + 159.89x2 − 215.37x + 6535.2y = 2E-17x6 − 5E-12x5 + 5E-07x4 − 0.0288x3 + 897.12x2 − 1E+07x + 1E+11
R2 = 0.9882R2 = 0.3657
Potentiometricy = 9554.3x−0.337y = 0.4145x0.87
R2 = 0.6368R2 = 0.0719
Table A13. Trend models for the Holy Cross province.
Table A13. Trend models for the Holy Cross province.
Data/ModelAnnual DataMonthly Data
Exponentialy = 2545.8e−0.047xy = 1906e−1E−05x
R2 = 0.9269R2 = 0.0241
Lineary = −70.049x + 2347.9y = −0.0142x + 1713.4
R2 = 0.9651R2 = 0.0243
Logarithmicy = −489.1ln(x) + 2634.3y = −610.5ln(x) + 7615.9
R2 = 0.8169R2 = 0.0258
Polynomial of 2nd degreey = −0.2957x2 − 63.544x + 2323y = 2E-05x2 − 1.5293x + 33338
R2 = 0.9656R2 = 0.1037
Polynomial of 3rd degreey = −0.0844x3 + 2.4883x2 − 88.617x + 2374.2y = −4E-10x3 + 7E-05x2 − 3.8804x + 66046
R2 = 0.9667R2 = 0.1037
Polynomial of 4th degreey = −0.0538x4 + 2.2813x3 − 31.514x2 + 86.947x + 2141y = −1E-11x4 + 2E-06x3 − 0.1485x2 + 4133.1x − 4E+07
R2 = 0.9791R2 = 0.2824
Polynomial of 5th degreey = −0.0025x5 + 0.083x4 − 0.4272x3 − 8.3145x2 + 6.8028x + 2218.9y = 4E-15x5 − 8E-10x4 + 7E-05x3 − 2.8272x2 + 60046x − 5E+08
R2 = 0.9798R2 = 0.3051
Polynomial of 6th degreey = 0.0006x6 − 0.0449x5 + 1.1552x4 − 13.385x3 + 67.706x2 − 186.13x + 2367.2y = 5E-18x6 − 1E-12x5 + 1E-07x4 − 0.0079x3 + 245.49x2 − 4E+06x + 3E+10
R2 = 0.981R2 = 0.3988
Potentiometricy = 2996.8x−0.316y = 412018x−0.556
R2 = 0.7194R2 = 0.0257
Table A14. Trend models for the Warmian–Masurian province.
Table A14. Trend models for the Warmian–Masurian province.
Data/ModelAnnual DataMonthly Data
Exponentialy = 2216.7e−0.029xy = 3353.1e−2E−05x
R2 = 0.6897R2 = 0.0613
Lineary = −43.531x + 2119.9y = −0.0281x + 2606.2
R2 = 0.699R2 = 0.0597
Logarithmicy = −256.3ln(x) + 2194.8y = −1178ln(x) + 13969
R2 = 0.4206R2 = 0.0602
Polynomial of 2nd degreey = −3.7986x2 + 40.037x + 1799.5y = 5E-06x2 − 0.4703x + 11837
R2 = 0.8541R2 = 0.0639
Polynomial of 3rd degreey = −0.0283x3 − 2.8652x2 + 31.631x + 1816.7y = −1E-09x3 + 0.0002x2 − 7.8062x + 113891
R2 = 0.8543R2 = 0.0645
Polynomial of 4th degreey = −0.033x4 + 1.4255x3 − 23.761x2 + 139.52x + 1673.4y = −1E-11x4 + 2E-06x3 − 0.1233x2 + 3430.8x − 4E+07
R2 = 0.8631R2 = 0.1418
Polynomial of 5th degreey = −0.0037x5 + 0.1709x4 − 2.6132x3 + 10.833x2 + 20.015x + 1789.5y = 7E-15x5 − 1E-09x4 + 0.0001x3 − 5.2879x2 + 111231x − 9E+08
R2 = 0.866R2 = 0.1945
Polynomial of 6th degreey = 0.0013x6 − 0.0879x5 + 2.2988x4 − 28.33x3 + 161.71x2 − 362.89x + 2083.8y = 7E-18x6 − 2E-12x5 + 2E-07x4 − 0.0095x3 + 297.52x2 − 5E+06x + 3E+10
R2 = 0.8748R2 = 0.2818
Potentiometricy = 2322.2x−0.17y = 1E+07x−0.862
R2 = 0.4065R2 = 0.0618
Table A15. Trend models for Greater Poland province.
Table A15. Trend models for Greater Poland province.
Data/ModelAnnual DataMonthly Data
Exponentialy = 5383.4e−0.039xy = 2269.7e6E−06x
R2 = 0.6153R2 = 0.0067
Lineary = −146.33x + 5263.9y = 0.0232x + 1999.8
R2 = 0.6566R2 = 0.0109
Logarithmicy = −1137ln(x) + 6111y = 924.51ln(x) − 6869.2
R2 = 0.6881R2 = 0.0099
Polynomial of 2nd degreey = 10.982x2 − 387.94x + 6190y = 4E-05x2 − 3.4456x + 74403
R2 = 0.7643R2 = 0.0807
Polynomial of 3rd degreey = 1.0563x3 − 23.874x2 − 74.018x + 5548.7y = −2E-08x3 + 0.0031x2 − 129.52x + 2E+06
R2 = 0.792R2 = 0.126
Polynomial of 4th degreey = −0.247x4 + 11.924x3 − 180.07x2 + 732.49x + 4477.5y = −2E-11x4 + 4E-06x3 − 0.2567x2 + 7101.8x − 7E+07
R2 = 0.8328R2 = 0.2176
Polynomial of 5th degreey = −0.0718x5 + 3.7046x4 − 66.349x3 + 490.37x2 − 1583.6x + 6728y = 8E-15x5 − 2E-09x4 + 0.0001x3 − 6.2467x2 + 132133x − 1E+09
R2 = 0.9236R2 = 0.2366
Polynomial of 6th degreey = −0.0021x6 + 0.0687x5 + 0.1521x4 − 23.418x3 + 238.5x2 − 944.4x + 6236.8y = 2E-17x6 − 4E-12x5 + 4E-07x4 − 0.0231x3 + 723.06x2 − 1E+07x + 8E+10
R2 = 0.9256R2 = 0.3722
Potentiometricy = 6695.8x−0.301y = 215.04x0.246
R2 = 0.6259R2 = 0.006
Table A16. Trend models for the West Pomeranian province.
Table A16. Trend models for the West Pomeranian province.
Data/ModelAnnual DataMonthly Data
Exponentialy = 2381.2e−0.04xy = 389.87e3E−05x
R2 = 0.939R2 = 0.0946
Lineary = −60.835x + 2250.1y = 0.0534x − 685.14
R2 = 0.9558R2 = 0.1096
Logarithmicy = −430ln(x) + 2510.2y = 2186.6ln(x) − 21717
R2 = 0.8291R2 = 0.1053
Polynomial of 2nd degreey = 0.022x2 − 61.319x + 2252y = 4E-05x2 − 3.6238x + 76071
R2 = 0.9558R2 = 0.2583
Polynomial of 3rd degreey = −0.0448x3 + 1.4998x2 − 74.628x + 2279.2y = −2E-08x3 + 0.0029x2 − 121.55x + 2E+06
R2 = 0.9562R2 = 0.3334
Polynomial of 4th degreey = −0.0181x4 + 0.7514x3 − 9.9445x2 − 15.538x + 2200.7y = −1E-11x4 + 2E-06x3 − 0.1535x2 + 4230.3x − 4E+07
R2 = 0.9581R2 = 0.3963
Polynomial of 5th degreey = −0.0034x5 + 0.1701x4 − 2.9764x3 + 21.986x2 − 125.85x + 2307.9y = 9E-15x5 − 2E-09x4 + 0.0002x3 − 6.3676x2 + 133940x − 1E+09
R2 = 0.9598R2 = 0.4351
Polynomial of 6th degreey = 0.0002x6 − 0.0185x5 + 0.5508x4 − 7.5768x3 + 48.976x2 − 194.34x + 2360.5y = 5E-18x6 − 1E-12x5 + 1E-07x4 − 0.0066x3 + 205.46x2 − 3E+06x + 2E+10
R2 = 0.96R2 = 0.4567
Potentiometricy = 2763.2x−0.272y = 0.001x1.3345
R2 = 0.7568R2 = 0.0905

References

  1. WHO. The Global Status on Road Safety; 2018; p. 403 p. Available online: https://www.who.int/publications/i/item/9789241565684 (accessed on 17 April 2022)ISBN 9789241565684.
  2. Raza, A.; Safdar, M.; Zhong, M.; Hunt, J.D. Analyzing Spatial Location Preference of Urban Activities with Mode-Dependent Accessibility Using Integrated Land Use–Transport Models. Land 2022, 11, 1139. [Google Scholar] [CrossRef]
  3. Raza, A.; Zhong, M.; Safdar, M. Evaluating Locational Preference of Urban Activities with the Time-Dependent Accessibility Using Integrated Spatial Economic Models. Int. J. Environ. Res. Public Health 2022, 19, 8317. [Google Scholar] [CrossRef]
  4. Tambouratzis, T.; Souliou, D.; Chalikias, M.; Gregoriades, A. Maximising accuracy and efficiency of traffic accident prediction combining information mining with computational intelligence approaches and decision trees. J. Artif. Intell. Soft Comput. Res. 2014, 4, 31–42. [Google Scholar] [CrossRef] [Green Version]
  5. Zhu, L.; Lu, L.; Zhang, W.; Zhao, Y.; Song, M. Analysis of accident severity for curved roadways based on bayesian networks. Sustainability 2019, 11, 2223. [Google Scholar] [CrossRef] [Green Version]
  6. Wang, Z.; Safdar, M.; Zhong, S.; Liu, J.; Xiao, F. Public Preferences of Shared Autonomous Vehicles in Developing Countries: A Cross-National Study of Pakistan and China. J. Adv. Transp. 2021, 2021, 5141798. [Google Scholar] [CrossRef]
  7. Safdar, M.; Jamal, A.; Al-Ahmadi, H.M.; Rahman, M.T.; Almoshaogeh, M. Analysis of the Influential Factors towards Adoption of Car-Sharing: A Case Study of a Megacity in a Developing Country. Sustainability 2022, 14, 2778. [Google Scholar] [CrossRef]
  8. Waiss, A.A.; Jianfeng, A.; Irfan, U.; Muhammad, A. Public Preferences Towards Car Sharing Service: The Case of Djibouti. Front. Environ. Sci. 2022, 10. [Google Scholar] [CrossRef]
  9. Arteaga, C.; Paz, A.; Park, J. Injury severity on traffic crashes: A text mining with an interpretable machine-learning approach. Saf. Sci. 2020, 132, 104988. [Google Scholar] [CrossRef]
  10. Yang, Z.; Zhang, W.; Feng, J. Predicting multiple types of traffic accident severity with explanations: A multi-task deep learning framework. Saf. Sci. 2022, 146, 105522. [Google Scholar] [CrossRef]
  11. Gorzelanczyk, P.; Pyszewska, D.; Kalina, T.; Jurkovic, M. Analysis of road traffic safety in the Pila poviat. Sci. J. Sil. Univ. Technol. Ser. Transp. 2020, 107, 33–52. [Google Scholar] [CrossRef]
  12. Chen, C. Analysis and forecast of traffic accident big data. In Proceedings of the 4th Annual International Conference on Information Technology and Applications (ITA 2017), online, 5 September 2017; Volume 12, p. 04029. [Google Scholar] [CrossRef]
  13. Khaliq, K.A.; Chughtai, O.; Shahwani, A.; Qayyum, A.; Pannek, J. Road accidents detection, data collection and data analysis using V2X communication and edge/cloud computing. Electronics 2019, 8, 896. [Google Scholar] [CrossRef] [Green Version]
  14. Rajput, H.; Som, T.; Kar. S. An automated vehicle license plate recognition system. Computer 2015, 48, 56–61. [Google Scholar] [CrossRef]
  15. Zheng, Z.; Wang, C.; Wang, P.; Xiong, Y.; Zhang, F.; Lv, Y. Framework for fusing traffic information from social and physical transportation data. PLoS ONE 2018, 13, e0201531. [Google Scholar] [CrossRef] [Green Version]
  16. Jurkovic, M.; Gorzelanczyk, P.; Kalina, T.; Jaros, J.; Mohanty, M. Impact of the COVID-19 pandemic on road traffic accident forecasting in Poland and Slovakia. Open Eng. 2022, 12, 578–589. [Google Scholar] [CrossRef]
  17. Gorzelańczyk, P.; Jurkovič, M.; Kalina, T.; Mohanty, M. Forecasting the road accident rate and the impact of the COVID 19 on its frequency in the polish provinces. Commun.—Sci. Lett. Univ. Zilina 2022, 24, A216–A231. [Google Scholar] [CrossRef]
  18. Abdullah, E.; Emam, A. Traffic accidents analyzer using big data. In Proceedings of the 2015 International Conference on Computational Science and Computational Intelligence (CSCI), Las Vegas, NV, USA, 7–9 December 2015; Institute of Electrical and Electronics Engineers Inc., 2016; pp. 392–397. [Google Scholar] [CrossRef]
  19. Vilaça, M.; Silva, N.; Coelho, M.C. Statistical analysis of the occurrence and severity of crashes involving vulnerable road users. Transp. Res. Procedia 2017, 27, 1113–1120. [Google Scholar] [CrossRef]
  20. Bąk, I.; Cheba, K.; Szczecińska, B. The statistical analysis of road traffic in cities of Poland. Transp. Res. Procedia 2019, 39, 14–23. [Google Scholar] [CrossRef]
  21. Chand, A.; Jayesh, S.; Bhasi, A.B. Road traffic accidents: An overview of data sources, analysis techniques and contributing factors. Mater. Today: Proc. 2021, 47, 5135–5141. [Google Scholar] [CrossRef]
  22. Helgason, A. Fractional integration methods and short Time series: Evidence from asimulation study. Polit. Anal. 2016, 24, 59–68. Available online: http://www.jstor.org/stable/24573204 (accessed on 17 April 2022). [CrossRef]
  23. Lavrenz, S.; Vlahogianni, E.; Gkritza, K.; Ke, Y. Time series modeling in traffic safetyresearch. Accid. Anal. Prev. 2018, 117, 368–380. [Google Scholar] [CrossRef]
  24. Forecasting Based on Time Series. Available online: http://pis.rezolwenta.eu.org/Materialy/PiS-W-5.pdf (accessed on 17 April 2022).
  25. Procházka, J.; Flimmel, S.; Čamaj, M.; Bašta, M. Modelling the Number of Road Accidents; Publishing house of the University of Economics in Wrocław: Wrocław, Poland, 2017. [Google Scholar] [CrossRef] [Green Version]
  26. 26. Sunny, C.M.; Nithya, S.; Sinshi, K.S.; Vinodini, V.M.D.; Lakshmi, A.K.G.; Anjana, S.; Manojkumar, T.K. Forecasting of Road Accident in Kerala: A Case Study. In Proceedings of the 2018 International Conference on Data Science and Engineering (ICDSE), Kochi, India, 7–9 August 2018. [CrossRef]
  27. Dudek, G. Forecasting Time Series with Multiple Seasonal Cycles Using Neural Networks with Local Learning. In Artificial Intelligence and Soft Computing. ICAISC 2013. Lecture Notes in Computer Science; Rutkowski, L., Korytkowski, M., Scherer, R., Tadeusiewicz, R., Zadeh, L.A., Zurada, J.M., Eds.; Springer: Berlin/Heidelberg, Germany, 2013; Volume 7894. [Google Scholar] [CrossRef]
  28. Szmuksta-Zawadzka, M.; Zawadzki, J. Forecasting on the Basis of Holt-Winters Models for Complete and Incomplete Data; Research Papers of the Wrocław University of Economics: Poland, 2009; Volume 38. [Google Scholar]
  29. Wójcik, A. Autoregressive Vector Models as a Response to the Critique of Multi-Equation Structural Econometric Models; Publishing house of the University of Economics in Katowice: Katowice, Poland, 2014; Volume 193. [Google Scholar]
  30. Monederoa, B.D.; Gil-Alanaa, L.A.; Martínezaa, M.C.V. Road accidents in Spain: Are they persistent? IATSS Res. 2021, 45, 317–325. [Google Scholar] [CrossRef]
  31. Al-Madani, H. Global road fatality trends’estimations based on country-wise microlevel data. Accid. Anal. Prev. 2018, 111, 297–310. [Google Scholar] [CrossRef]
  32. Mamczur, M. Machine Learning How Does Linear Regression Work? And is it WORTH Using? Available online: https://miroslawmamczur.pl/jak-dziala-regresja-liniowa-i-czy-warto-ja-stosowac/ (accessed on 17 April 2022).
  33. Piłatowska, M. The choice of the order of autoregression depending on the parameters of the generating model. Econometrics 2012, 4, 16–35. [Google Scholar]
  34. Biswas, A.A.; Mia, J.; Majumder, A. Forecasting the Number of Road Accidents and Casualties using Random Forest Regression in the Context of Bangladesh. In Proceedings of the 2019 10th International Conference on Computing, Communication and Networking Technologies (ICCCNT), Kanpur, India, 6–8 July 2019. [Google Scholar]
  35. Random Forest. Available online: https://pl.wikipedia.org/wiki/Las_losowy (accessed on 17 April 2022).
  36. Fijorek, K.; Mróz, K.; Niedziela, K.; Fijorek, D. Forecasting electricity prices on the day-ahead market using data mining methods. Energy Mark. 2010. [Google Scholar]
  37. Chudy-Laskowska, K.; Pisula, T. Forecast of the Number of Road Accidents in Poland. Logistics 2014, 6, 2710–2721. [Google Scholar]
  38. Kashpruk, N. Comparative Research of Statistical Models and Soft Computing for Identification of Time Series and Forecasting; Opole University of Technology: Opole, Poland, 2010. [Google Scholar]
  39. Prochazka, J.; Camaj, M. Modelling the number of road accidents of uninsured drivers and their severity. In Proceedings of the International Academic Conferences 5408040, International Institute of Social and Economic Sciences, Geneva, Switzerland, 27 June 2017. [Google Scholar]
  40. Dutta, B.; Barman, M.P.; Patowary, A.N. Application of Arima model for forecasting road accident deaths in India. Int. J. Agric. Stat. Sci. 2020, 16, 607–615. [Google Scholar]
  41. Karlaftis, M.; Vlahogianni, E. Memory properties and fractional integration in trans-portation time-series. Transp. Res. C 2009, 17, 444–453. [Google Scholar] [CrossRef]
  42. Łobejko, S. Time Series Analysis and Forecasting with SAS; Main business school in Warsaw: Warsaw, Poland, 2015. [Google Scholar]
  43. Dudek, G. Exponential smoothing models for short-term power system load forecasting. Energy Mark. 2013, 3, 14-–19. [Google Scholar]
  44. Chudy-Laskowska, K.; Pisula, T. Prognozowanie Liczby Wypadków Drogowych na Podkarpaciu; Logistics: 2015. Logistics 2015, 4, 2782–2796. [Google Scholar]
  45. Road Safety Assessment Handbook. Available online: https://www.iung.pl/PJA/wydane/11/PJA11_3.pdf (accessed on 17 April 2022).
  46. Wrobel, M.S. Application of Neural Fuzzy Systems in Chemistry. Ph.D. Thesis, University of Silesia, Katowice, Poland, 2017. [Google Scholar]
  47. Data Mining Techniques StatSoft. Available online: https://www.statsoft.pl/textbook/stathome_stat.html?https%3A%2F%2Fwww.statsoft.pl%2Ftextbook%2Fstdatmin.html (accessed on 17 April 2022).
  48. Kumar, S.; Viswanadham, V.; Bharathi, B. Analysis of road accident. IOP Conference Series Materials Science and Engineering. In Proceedings of the International Conference on Frontiers in Materials and Smart System Technologies, Tamil Nadu, India, 10 April 2019; Volume 590, p. 012029. [Google Scholar] [CrossRef]
  49. Top Advantages and Disadvantages of Hadoop 3 DataFlair. Available online: https://data-flair.training/blogs/advantages-and-disadvantages-of-hadoop/ (accessed on 17 April 2022).
  50. Perczak, G.; Fiszeder, P. GARCH model—Using additional information on minimum and maximum prices. Bank Credit. 2014, 2, 105–132. [Google Scholar]
  51. Fiszeder, P. GARCH Class Models in Empirical Financial Research; Scientific Publishers of the Nicolaus Copernicus University: Torun, Poland, 2009. [Google Scholar]
  52. McIlroy, R.C.; Plant, K.A.; Hoque, M.S.; Wu, J.; Kokwaro, G.O.; Nam, V.H.; Stanton, N.A. Who is responsible for global road safety? A cross-cultural comparison ofactor maps. Accid. Anal. Prev. 2019, 122, 8–18. [Google Scholar] [CrossRef]
  53. Muck, J. Econometrics. Modeling of Time Series. Stationary. Unit Root Tests. ARDL models. Co-integration. Available online: http://web.sgh.waw.pl/~jmuck/Ekonometria/EkonometriaPrezentacja5.pdf (accessed on 17 April 2022).
  54. Shetty, P.; Sachin, P.C.; Kashyap, V.K.; Madi, V. Analysis of road accidents using data mining techniques. Int. Res. J. Eng. Technol. 2017, 4, 1494–1496. [Google Scholar]
  55. Li, L.; Shrestha, S.; Hu, G. Analysis of road traffic fatal accidents using data mining techniques. In Proceedings of the 2017 IEEE 15th International Conference on Software Engineering Research, Management and Applications (SERA), London, UK, 7–9 June 2017; pp. 363–370. [Google Scholar] [CrossRef]
  56. Marcinkowska, J. Statistical Methods and Data Mining in Assessing the Occurrence of Syncope in the Group of Narrow-QRS Tachycardia (AVNRT and AVRT); Medical University of Karol Marcinkowski in Poznań: Poznań, Poland, 2015; Available online: http://www.wbc.poznan.pl/Content/373785/index.pdf (accessed on 17 April 2022).
  57. Sebego, M.; Naumann, R.B.; Rudd, R.A.; Voetsch, K. Dellinger, A.M., Ndlovu, C., The impact of alcohol and road traffic policies on crash rates in Botswana, 2004–2011: Atime-series analysis. Accid. Anal. Prev. 2008, 70, 33–39. [Google Scholar] [CrossRef] [Green Version]
  58. Bloomfield, P. An exponential model in the spectrum of a scalar time series. Biometrika 1973, 60, 217–226. Available online: https://www.jstor.org/stable/2334533 (accessed on 17 April 2022). [CrossRef]
  59. Central Statistical Office. Available online: www.gus.pl (accessed on 17 April 2022).
  60. Statistic Road Accident. Available online: https://statystyka.policja.pl/ (accessed on 17 April 2022).
  61. Eurostat Statistics. Available online: https://ec.europa.eu/eurostat/data/database (accessed on 17 April 2022).
  62. Province Names in English. Available online: https://www.facebook.com/Pogoda-Na-Przygod%C4%99-687347091288220/photos/a.1077421475614111.1073741838.687347091288220/1077421502280775 (accessed on 17 April 2022).
Figure 1. Population of Poland from 2001 to 2020 (thousand people) [62].
Figure 1. Population of Poland from 2001 to 2020 (thousand people) [62].
Applsci 13 02898 g001
Figure 2. Location of provinces in Poland [62]. Scale: 1:1,000,000.
Figure 2. Location of provinces in Poland [62]. Scale: 1:1,000,000.
Applsci 13 02898 g002
Figure 3. Accident number per 100,000 population in 2020 [62].
Figure 3. Accident number per 100,000 population in 2020 [62].
Applsci 13 02898 g003
Figure 4. Number of road accidents in Poland by province from 2001 to 2021 [60].
Figure 4. Number of road accidents in Poland by province from 2001 to 2021 [60].
Applsci 13 02898 g004
Figure 5. Number of road accidents in Poland by province from 2007 to 2021 [60].
Figure 5. Number of road accidents in Poland by province from 2007 to 2021 [60].
Applsci 13 02898 g005
Figure 6. Forecasting annual number of road accidents for 2022–2031.
Figure 6. Forecasting annual number of road accidents for 2022–2031.
Applsci 13 02898 g006
Figure 7. Forecasting monthly number of road accidents for 2022–2023.
Figure 7. Forecasting monthly number of road accidents for 2022–2023.
Applsci 13 02898 g007
Table 1. Area, population by province in Poland in 2020 [62].
Table 1. Area, population by province in Poland in 2020 [62].
ProvinceArea Population
in km2TotalPersons/km2
POLAND312,70538,265,013122
Lower Silesia 19,9472,891,321145
Kuyavia–Pomerania 17,9712,061,942115
Lublin Province25,1232,095,25883
Lubusz Province13,9881,007,14572
Łodz Province18,2192,437,970134
Lesser Poland 15,1833,410,441225
Masovia 35,5595,425,028153
Opole Province9412976,774104
Subcarpathia 17,8462,121,229119
Podlasie Province20,1871,173,28658
Pomerania 18,3232,346,671128
Silesia 12,3334,492,330364
Holy Cross 11,7101,224,626105
Warmia–Masuria24,1731,416,49559
Greater Poland 29,8263,496,450117
West Pomerania 22,9051,688,04774
Table 2. Best trend models for annual data.
Table 2. Best trend models for annual data.
ProvinceModelAnnual Data
Lower SilesiaExponentialy = 3660.6e−0.032x
R2 = 0.7993
Kuyavia–PomeraniaExponentialy = 2968.8e−0.066x
R2 = 0.9776
Lublin ProvincePolynomial of 2nd degreey = 1.1254x2 − 128.19x + 3070.1
R2 = 0.9649
Lubusz ProvinceExponentialy = 937.95e−0.019x
R2 = 0.6083
Łodz ProvinceExponentialy = 5462.8e−0.027x
R2 = 0.6749
Lesser PolandExponentialy = 5751.5e−0.034x
R2 = 0.7969
MasoviaLogarithmicy = −1744ln(x) + 8908.6
R2 = 0.8619
Opole ProvinceExponentialy = 1384.8e−0.044x
R2 = 0.9257
SubcarpathiaExponentialy = 2661.8e−0.032x
R2 = 0.849
Podlasie ProvinceExponentialy = 1605.6e−0.057x
R2 = 0.9468
PomeraniaExponentialy = 3663.6e−0.026x
R2 = 0.7854
SilesianPolynomial of 2nd degreey = −6.7177x2 − 86.325x + 6865.4
R2 = 0.9765
Warmia–MasuriaExponentialy = 2545.8e−0.047x
R2 = 0.9269
Holy CrossPolynomial of 2nd degreey = −0.2957x2 − 63.544x + 2323
R2 = 0.9656
Greater PolandPotentiometricy = 6695.8x−0.301
R2 = 0.6259
West PomeraniaExponentialy = 2381.2e−0.04x
R2 = 0.939
Table 3. Best trend models for monthly data.
Table 3. Best trend models for monthly data.
ProvinceModelMonthly Data
Lower SilesiaPolynomial of 2nd degreey = 8E-06x2 − 0.4824x + 9334.2
R2 = 0.3703
Kuyavia–PomeraniaLineary = −0.0491x + 4343.8
R2 = 0.0651
Lublin ProvinceLineary = −0.0316x + 3035.6
R2 = 0.0557
Lubusz ProvinceExponentialy = 900.21e2E−06x
R2 = 0.0005
Łodz ProvinceExponentialy = 1286.6e2E−05x
R2 = 0.0459
Lesser PolandLineary = −0.0187x + 3570.8
R2 = 0.0077
MasoviaExponentialy = 121.05e9E−05x
R2 = 0.4903
Opole ProvinceExponentialy = 313.34e2E−05x
R2 = 0.0597
SubcarpathiaLineary = −0.0013x + 1628.5
R2 = 0.0001
Podlasie ProvinceExponentialy = 1308.3e−4E−06x
R2 = 0.0019
PomeraniaExponentialy = 84.959e8E−05x
R2 = 0.417
SilesianExponentialy = 1795.9e2E−05x
R2 = 0.0739
Holy CrossLineary = −0.0142x + 1713.4
R2 = 0.0243
Warmia-MasuriaLineary = −0.0281x + 2606.2
R2 = 0.0597
Greater PolandExponentialy = 2269.7e6E−06x
R2 = 0.0067
West PomeraniaExponentialy = 389.87e3E−05x
R2 = 0.0946
Table 4. Summary of errors for annual data.
Table 4. Summary of errors for annual data.
Province/ModelModelMEMPEThe Sum of the SquaresMSEMAPE (%)MAE (%)
Lower SilesiaExponential14.26184.471,305,179.1862,151.390.267.22
Kuyavia–PomeraniaExponential10.9570.32149,986.767142.230.355.00
Lublin ProvincePolynomial of 2nd degree0.0793.93301,100.5114,338.120.325.09
Lubusz ProvinceExponential2.7655.03119,113.405672.070.476.93
Łodz ProvinceExponential6.86317.713,302,784.34157,275.441.018.72
Lesser PolandExponential26.74290.752,392,974.3611,3951.160.148.17
MasoviaLogarithmic176.78497.8110,533,523.06501,596.341.778.75
Opole ProvinceExponential4.2554.32107,325.175110.720.025.98
SubcarpathiaExponential11.57120.66495,205.4123,581.210.116.41
Podlasie ProvinceExponential2.3952.8281,724.623891.650.206.66
PomeraniaExponential12.34168.89853,023.8240,620.180.146.61
SilesiaPolynomial of 3rd degree0.02144.78755,411.7635,971.990.173.24
Holy CrossPolynomial of 2nd degree0.0172.91134,675.616413.120.415.26
Warmia–MasuriaExponential4.82150.77765,403.0236,447.761.009.60
Greater PolandExponential48.45563.329,570,955.89455,759.801.8116.12
West PomeraniaExponential1.9975.89166,539.047930.430.165.12
Table 5. Summary of errors for monthly data.
Table 5. Summary of errors for monthly data.
Province/ModelModelMEMPEThe Sum of the SquaresMSEMAPE (%)MAE (%)
Lower SilesiaPolynomial of 2nd degree6201.426201.426,965,812,277.6138,698,957.10208.40208.40
Kuyavia–PomeraniaLinear2046.272046.27770,270,675.154,279,281.5392.7592.75
Lublin ProvinceLinear1316.421316.42319,951,089.841,777,506.0579.6879.68
Lubusz ProvinceExponential75.75111.953,708,472.7820,602.636.2910.97
Łodz ProvinceExponential1175.171175.17262,350,703.451,457,503.9147.0147.01
Lesser PolandLinear778.52778.59129,389,638.72718,831.3329.9529.95
MasoviaExponential4894.724894.724,479,240,358.3724,884,668.6697.4797.47
Opole ProvinceExponential502.58502.5847,967,460.50266,485.8960.7260.72
SubcarpathiaLinear56.11156.757,374,357.2040,968.655.3110.83
Podlasie ProvinceExponential198.30206.6811,259,059.5562,550.3320.6621.26
PomeraniaExponential1937.331937.33701,204,745.033,895,581.9295.6395.63
SilesiaExponential2575.682575.681,245,839,964.636,921,333.1458.2558.25
Holy CrossLinear591.69591.6966,739,299.40370,773.8955.4755.47
Warmia–MasuriaLinear1171.741171.74253,066,000.671,405,922.2385.0085.00
Greater PolandExponential699.20707.79110,227,747.58612,376.3822.4422.90
West PomeraniaExponential1157.771157.77252,984,847.291,405,471.3774.0474.04
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Gorzelańczyk, P. Forecasting the Number of Road Accidents in Polish Provinces Using Trend Models. Appl. Sci. 2023, 13, 2898. https://doi.org/10.3390/app13052898

AMA Style

Gorzelańczyk P. Forecasting the Number of Road Accidents in Polish Provinces Using Trend Models. Applied Sciences. 2023; 13(5):2898. https://doi.org/10.3390/app13052898

Chicago/Turabian Style

Gorzelańczyk, Piotr. 2023. "Forecasting the Number of Road Accidents in Polish Provinces Using Trend Models" Applied Sciences 13, no. 5: 2898. https://doi.org/10.3390/app13052898

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop