Next Article in Journal
‘It’s like Taking a Sleeping Pill’: Student Experience of Autonomous Sensory Meridian Response (ASMR) to Promote Health and Mental Wellbeing
Previous Article in Journal
Implementation of Health Impact Assessment in the Healthcare System of the Republic of Kazakhstan
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Impact of Driver’s Age and Gender, Built Environment, and Road Conditions on Crash Severity: A Logit Modeling Approach

by
Dongkwan Lee
1,*,
Jean-Michel Guldmann
2 and
Burkhard von Rabenau
2
1
Gangwon Institute, Chuncheon 24265, Republic of Korea
2
Department of City and Regional Planning, The Ohio State University, Columbus, OH 43210, USA
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2023, 20(3), 2338; https://doi.org/10.3390/ijerph20032338
Submission received: 22 November 2022 / Revised: 25 January 2023 / Accepted: 26 January 2023 / Published: 28 January 2023

Abstract

:
The purpose of this research is (1) to investigate the relationship between crash severity and the age and gender of the at-fault driver, the socio-economic characteristics of the surrounding environment, and road conditions, and (2) to explain the probability of a bodily injury crash, including fatality, with the alternative being a property damage only crash. In contrast to earlier research that has focused on young and old drivers, age is considered here on its lifetime continuum. A logit model is adopted and the gender and age of the at-fault drivers are part of the independent explanatory variables. The unit of analysis is the individual crash. Since age is a continuous variable, this analysis shows more precisely how age impacts accident severity and identifies when age has little effect. According to the results, the type of vehicle, timing of the crash, type of road and intersection, road condition, regional and locational factors, and socio-economic characteristic have a significant impact on crashes. Regarding the effect of age, when an accident occurs the probability of bodily injury or fatality is 0.703 for female drivers, and 0.718 for male drivers at 15 years of age. These probabilities decline very slightly to 0.696 and 0.711, respectively, around 33 years of age, then very slightly increase to 0.697 and 0.712, respectively, around 47.5 years of age. The results show that age affects crash severity following a polynomial curve. While the overall pattern is one of a downward trend with age, this trend is weak until the senior years. The policy implications of the results are discussed.

1. Introduction

People tend to explain crashes with simple facts, such as a driver’s sex and age, and to relate crash severity to the built environment. However, the demographic characteristics of a driver in a car accident may have multiplier effects, for instance the fact that the driver is 20 years of age and male may lead to a higher-severity crash. Moreover, the surrounding environment near a crash site may have different effects on crashes. The environment represents primarily the built environment, infrastructure, or socio-economic characteristics of a neighborhood or community. For example, for older drivers with less ability to cope with high-density environments, complex situations may increase crash likelihood. On the other hand, young drivers with less experience and judgment, may produce greater crash incidence and severity in situations encountered more rarely, such as driving on a slippery iced road. Hence, it is necessary to investigate the interactions between a driver’s demographics and the surrounding environment in generating crashes of varying severity.
Regarding crash severity, crashes differ. Some are minor while others result in injury or death. Many factors impact the severity of a crash. This research identifies the factors that determine the severity of a crash once it has occurred, specifically the likelihood that it is a bodily injury or fatal (BI) crash, as opposed to a property damage only (PDO) crash. Unlike macro-level analyses that focus on the numbers of crashes in larger neighborhood units [1], micro-level analyses focus on an individual crash at a given location, such as an intersection. Recent research includes the creation of buffers around the crash point in order to investigate the direct surrounding environment.
It is not clear what variations in crash behavior would emerge if age appeared as a continuous variable in explaining crashes, or if age was analyzed for smaller age groups across the full age spectrum. As already noted, the disaggregate approach adds to the number of variables beyond those of the aggregate model, down to the precise vehicle type, driver condition including alcohol or drug use, and light, weather, and road conditions at the precise crash location. While the literature has analyzed how young and senior drivers differ from others, the classification of young and old drivers is somewhat arbitrary.
The literature, as summarized in Table 1, has looked at age as a factor in crash frequency, but has limited itself to the young (usually 24 years of age and younger) and the old (usually older than 65 or 70 years of age), and has contrasted their behavior with that of the remaining population. It is not clear that the particular age intervals chosen in the literature are appropriate, in the sense that behavior within the groups is homogenous and behavior between age groups is different. Moreover, in past research it is often not clear whether data pertain to all drivers involved in an accident or to drivers at fault in an accident. The latter is a subset of the former. The database available for this research provides information on drivers at fault. It is the characteristics of these drivers, the age, gender of those who cause the accident, that matter in explaining the accident.
The goal of this research is to better understand the interactions between a driver’s demographics and the surrounding environment in generating crashes of varying severity. For this, the age and sex of the guilty driver are taken as proxies of a driver’s characteristics. The focus is on the driver who is most responsible for a crash. Moreover, there are different built environments in urban and rural areas. Therefore, the overall environment for drivers and vehicles varies in these areas. The study area, in this research, provides enough variability in the built environment to uncover the influence of built-environment factors on crash severity.
The present approach is a micro analysis using logit modeling to estimate the likelihood that an accident is severe, i.e., results in injury or death. In a departure from earlier research, the focus is on the age and gender of the driver of the at-fault vehicle, and age is taken as a continuous independent variable, encompassing young, old, and middle-aged drivers. Other explanatory variables related to the crash, such as its location, vehicle attributes, road features, weather, and time, are also used as they are derived from the extensive crash reporting system. In addition, variables defined at the level of Traffic Analysis Zones (TAZs) are also considered to account for the impacts of the socio-economic characteristics of the neighborhood surrounding the crash location, including land uses. The remainder of the paper is organized as follows: Section 2 describes data sources and their processing and presents the modeling methodology. The results and their analysis are presented in Section 3. Section 4 includes discussions related to the results. Section 5 concludes and outlines areas for further research.

2. Materials and Methods

2.1. Data

The data characterize the Central Ohio Region (part of the Columbus Metropolitan Area (CMA)), which includes Delaware, Fairfield, Franklin, Madison, Licking, Pickaway, and Union counties, and the City of Columbus, the capital of Ohio and the 15th largest U.S. city. According to the 2010 Census, its total population was 1,654,374. The Traffic Analysis Zone (TAZ) is selected as the geographical unit. The seven counties consist of 2052 TAZs. However, crash data are not available for some peripheral areas, hence only the area in yellow on Figure 1 is selected for this analysis. It is made up of 1805 TAZs and covers 87% of the area of the seven counties.
About 284,611 crashes have occurred over 2006–2011 in the seven counties, 535,385 units (cars, motorcycles, etc.) were involved in these crashes, and 704,883 people were affected by them. The crash database was obtained from the Ohio Department of Public Safety (ODPS), and includes crash information, vehicle information, and people information. As the focus is on crashes where a vehicle and its drivers are at fault, such crashes are first extracted from the database, leading to the selection of 161,925 individual crashes. Figure 2 displays their spatial distribution. In addition to crash severity (fatalities, injuries, and property damages), a wide range of crash attributes are extracted from the ODPS database, such as at-fault driver’s sex and age, crash time, driving conditions, weather, and road conditions. The TAZ within which the crash took place is identified by overlaying the crash point locations with the TAZ map, using a Geographical Information System (GIS).
Socio-economic and built-environment data were obtained from several sources, are defined at the TAZ level, and are grouped into six categorizes as described below.
  • Regional and Locational Factors
These variables characterize the general crash location: urban versus rural, distance to the CMA center, and inclusion in the Mid-Ohio Regional Planning Commission (MORPC) transportation planning area. These data were obtained from MORPC. Infrastructure and surrounding driving conditions vary across these locations.
2.
Socio-Economic Factors
These variables characterize population, employment, income, school enrollment, and housing. These data are all derived from MORPC and the 2010 Census of Population and Housing.
3.
Land-Use Factors
Six land-use variables are considered: Agricultural, Commercial, Residential, Public Space, Industrial, and Undeveloped. These data were obtained from MORPC, which processed parcel-level data from County Auditors’ databases, are closely related to transportation flows generation and attraction.
4.
Circulation and Network Factors
Public transit variables include numbers of bus stops and railroad crossings. Traffic flow variables include traffic volumes estimated with data obtained from ODOT, the Central Ohio Transit Authority (COTA), and the Delaware Area Transit Authority (DATA). Measures of car use, such as vehicle miles traveled per household, were derived from the National Household Travel Survey (NHTS).
5.
Physical Environment Factors
These variables include number of bars, average age of buildings, area of historic districts, and size of shopping centers. These data were obtained from County Auditors’ databases and the National Register of Historic Places.

2.2. Modeling Methodology

Several types of models have been used in transportation choice research, such as the logit model [2,3,4,5,6], binary logit model [7,8] which uses a binary dependent variables (e.g., yes/no, pass/fail, win/ lose, etc.), ordered mixed logit model, which uses a discrete and ordinal dependent variable with more than two outcomes (e.g., customer satisfaction rating, etc.), mixed logit model [11,12,13,14,15,16,17], which assumes that the parameters vary from one individual to another, multinomial logit model [18,19,20,21,22,23,24,25,26], which uses a discrete and nominal dependent variable with more than two outcomes (e.g., red/blue/green, etc.), discrete choice model [48], which explain a choice from a set of two or more discrete alternatives. This research adopts the logit model to investigate how age impacts accident severity and to uncover when age has little effect, using age as a continuous variable.
The dependent variable Y is binary, set equal to 1 if the crash is BI-type, and to 0 if it is PDO-type. Fatal and injury crashes are regrouped as bodily injury (BI) crashes because of the small number of fatal crashes. The independent variables can be grouped into two groups of vectors: (1) variables specific to the individual crash, and (2) variables characterizing the TAZ where the crash took place.
The categories of crash-specific variables are:
  • X1: age and gender of the at-fault driver;
  • X2: vehicle type;
  • X3: timing of crash;
  • X4: crash location;
  • X5: road condition;
  • X6: other crash factors.
The categories of TAZ-related variables are:
  • Z1: regional and locational;
  • Z2: socio-economic;
  • Z3: land use;
  • Z4: circulation and network;
  • Z5: physical characteristics.
The general form of the probability model is then:
P (Y = 1) = F (X1, X2, X3, X4, X5, X6, Z1, Z2, Z3, Z4, Z5)
In order to assess the factors in the surrounding environment that can affect the probability of being injured or dead in a crash, a logit model is formulated:
Pr ( Y = 1 ) = e x p ( α + i = 1 6 β i + j = 1 5 γ j Z j )   1 + e x p ( α + i = 1 6 β i X i + j = 1 5 γ j Z j )  
where,
  • α : constant;
  • β i : coefficient for categories of crash-specific variables;
  • γ j : coefficient for categories of TAZ-related variables;
  • X i : categories of crash-specific variables;
  • Z j : categories of TAZ-related variables.

3. Results

3.1. Descripctive Statistics

Descriptive statistics for the crash-specific variables used in the logit model are presented in Table 2. Descriptive statistics for the TAZ-level selected variables are presented in Table 3.

3.2. Results of Logit Model

The relationship between each potential explanatory variable and the dependent variable was first investigated with one-variable logit models. Variables with at least a 10% significance level were retained. Next, these variables were combined into a unique logit model, and only those variables significant at the 10% level were retained. The estimated parameters for the selected logit model are presented in Table 4.
The following analyses focus on the model in Table 4, which includes 43 explanatory variables, all except four significant at the 0.01% level.

3.3. Age and Gender of the Culpable Drivers

Both the AG and MD variables were initially significant, with a negative and positive sign, respectively. As the AG variable is continuous, higher-order terms were introduced, and both the square (AG2) and cube (AG3) of AG also turned out to be very significant. However, cross-products between these variables and MD turned out insignificant, and were discarded. In order to better assess the effect of AG, the polynomial function F made of the three age-related variables in Table 4 is analyzed, with:
F= −0.0134AG + 0.000344AG2 − 0.00000285AG3
In order to assess the variations in F, its first-order derivative is computed:
d F d x = 0.0134 + 0.000688 AG 0.00000855 AG 2
The derivative has a value of zero at the following two values of AG:
AGlow = 33.06
AGhigh = 47.41.
The value AGlow corresponds to a minimum, and the value AGhigh to a maximum in the F curve, as illustrated in Figure 3. Equation (3) represents the “Female” curve. The “Male” curve is obtained by adding the coefficient 0.1269 to Equation (3).
For both male and female drivers, F declines slightly from AG = 15 to AG = 33.06, then increases very slightly to AG = 47.41. From that point on, F declines precipitously. However, while the variations in F are a reflection of the variations in the probability of a crash being BI-type, they are not the same, and it is necessary to compute these probabilities for the same AG range. To do so, it is necessary to select values for all the other variables appearing in Table 3. For the sake of simplicity, all the other independent variables are set at their mean values. The simplified probability functions are then:
(1).
Male Drivers
Pr ( Y = 1 | A G ) = e x p ( 0.0134 A G + 0.000344 A G 2 0.00000285 A G 3 + 1.0697 ) 1 + e x p ( 0.0134 A G + 0.000344 A G 2 0.00000285 A G 3 + 1.0697 )
(2).
Female Drivers
Pr ( Y = 1 | A G ) = e x p ( 0.0134 A G + 0.000344 A G 2 0.00000285 A G 3 + 0.9968 ) 1 + e x p ( 0.0134 A G + 0.000344 A G 2 0.00000285 A G 3 + 0.9968 )
The corresponding probability curves are presented in Figure 4.
At age 15, the probability of the crash BI is 0.703 for female drivers, and 0.718 for male drivers. These probabilities decline very slightly to 0.696 and 0.711, respectively, around age 33.06, then very slightly increase to 0.697 and 0.712, respectively, around age 47.41. From that point on, the two curves first decline very slightly up to age 70, after which the decline accelerates. At age 80, the female and male probabilities are 0.661 and 0.677, respectively, and at age 90 they are equal to 0.622 and 0.639, respectively.
The above results are somewhat surprising, as one would have expected more differentiated probabilities. While the BI crash probability is highest for the very young drivers (15–20), it is not very different from the probabilities for adulthood years. As expected, the probability declines significantly for senior drivers, but mostly in late years. In summary, except for these very senior drivers, age does not appear to strongly affect the probability for a crash to be of the BI-type.
The crash literature has heavily focused on driver age as influencing crashes, and in particular how age affects the likelihood of crashes among teenage and senior drivers. However, the present results differ in at least two ways: The analysis looks at all age groups in a continuous manner, and hence can identify the age at which crash incidence changes. In the literature, the cut-off points for the young- and old-age cohorts are assumed rather than being the result of an analysis. Second, the literature does not explicitly deal with drivers at fault, and hence it is difficult to identify the extent to which a particular age group is involved in accidents or causes accidents as faulty party. In a general way, the present results confirm some of the findings in the literature. They suggest that over the middle age range, crash probabilities do not change much and hence the literature is correct in focusing on the young and the old, for which severe crash probability differs. Further, the literature suggests that male drivers are at a greater risk of BI accidents and women are more involved in non-fatal crashes [49,50]. This is confirmed by this study.

3.4. Type of Vehicle

The benchmark is a mix of trucks, taxi, emergency, motorhome, equipment, and other vehicles, as well as trains. Accidents in which at-fault drivers drive compact, mid-size, or full-size vehicles are less likely to result in BI than those involving benchmark vehicles. Similar results hold for vans, SUVs, pickup trucks, and motorcycles. However, bus-type vehicles have a positive sign, which indicates that they are more likely than the benchmark vehicles to generate BI crashes. The effect of vehicle type on accident severity has not yet been much studied in the literature. An exception is [42], which considers the special case of pedestrian crashes and finds that emergency vehicles and motorcycles/mopeds have a positive impact, and trucks, vans, and buses a negative impact on such crashes. These results are somewhat counterintuitive. The present results have much greater generality as they do not focus only on pedestrian crashes.
The result for motorcycles is interesting. The coefficient (−2.97) is much larger (in absolute term) than those for all the other vehicle-type variables. Usually, riding a motorcycle is considered more dangerous than riding a car. Motorcyclists are not protected by the body shell of a car or other safety attributes, such as seatbelts or airbags. The driver’s body is directly exposed to the impact of a car or obstacle. The results suggest that if the motorcyclist is the culpable driver in a crash, body injuries are less likely than if the driver of a car were the at-fault driver. While this seems counterintuitive, there are possible explanations. First, BI crashes are not limited to accidents in which the at-fault driver suffers injuries. Much of the time it is others, passengers in the vehicle of the at-fault driver, pedestrians, or passengers in other vehicles, that suffer injury. Comparing the vulnerability of at-fault drivers is therefore besides the point. It is the individual vulnerability and the number of all participants in an accident that matters. Hence the result can be explained, all else being equal, if the number of participants in a crash caused by a motorcycle were smaller than the number caused by a car. The same is true, again all else being equal, if the vulnerability of participants other than the crash-causing driver is smaller in a crash caused by a motorcycle than by a car. The number of crash participants is likely to vary between the two types of accidents. Given the greater instability of 2-wheel vehicles over 4-wheel vehicles, motorcycles are more prone to accidents that involve only the motorcycle. These types of accidents would automatically fault the motorcyclist. However, quite a number of such accidents may not result in injury, as the driver anticipates the accident.

3.5. Timing of Crash

Seasonal variables are associated with the type of crash. If a crash occurs in summer, the odds of a BI crash are smaller than in spring or autumn (benchmark seasons). On the other hand, winter increases the odds of a BI crash. In summer, driving conditions may be better than in other seasons in the Central Ohio Region due to the summer break in schools and colleges. The summer break lessens traffic congestion and may calm traffic. Driving during the winter season increases the probability of a BI crash. Ice, slippery roads, and low temperatures may affect a driver’s physical conditions, resulting in worse vehicle maneuverability.
The results also show that driving on Friday may lead to more BI crashes. Friday is the end of the week, which may make people more relaxed, and drivers less attentive. People may participate in parties or other nightly events, which may also contribute to more BI crashes.
With respect to time of day, driving during morning peak hour (7:00–9:00 am), night peak hour (5:00–7:00 pm) and from 12:00 pm to 5:00 pm increases the likelihood of a crash to be BI-type. The morning and evening rush hours, and to some extent the lesser peak hours of the lunch break, are characterized by congestion. Drivers frequently change lanes, do not keep a minimum distance to other cars, and encounter difficult situations when entering and exiting highways or turning on multi-lane arterial roads. The literature indicates that these conditions raise accident rates, and it is not surprising that they raise the rate of BI crashes. Morning and evening rush hours also take place at least in part during twilight and dawn hours when visibility is reduced. See [51] for results on this effect.

3.6. Type of Road and Intersection

The benchmark cases include 5-point intersections, crossovers, railways, shared paths, not an intersection, and others. Compared with the benchmarks, 4-way intersections and driveways are less likely to lead to BI crashes, whereas roundabouts and ramps are more likely to do so.
At 4-way intersections, drivers generally must stop their car or reduce speed before turning or proceeding ahead, and this caution leads to less BI crashes. The higher likelihood of BI crashes at roundabouts may be due to less visibility of other cars entering the roundabout. The results imply that when a crash occurs, it is more likely to be BI-type than at other intersections. However, overall, the number of crashes may be much lower and hence the incidence of BI crashes at roundabouts is lower than at other intersections. The results indicate that ramps increase the likelihood that a crash is BI-type. Vehicles at high speed on the highway need to reduce their speed abruptly or need to change lane suddenly to enter the exit ramp. A ramp with a sharp curve and a 25 m/h speed limit or steep slope, may make it difficult to control the vehicle. Using a ramp to enter a highway may also be dangerous, particularly if high-speed vehicles on the highway do not provide some leeway for the entering cars. Crashes that occur at driveways are more likely to be PDO crashes due to the low speed of the vehicle entering or exiting the driveway.

3.7. Road Conditions

Road shape can be an influential factor regarding the type of crash. When the road contour is curved, whether level or grade, a BI crash is less likely, possibly because of the need to reduce the driving speed. The results also show that road conditions may be influential factors. A road covered by ice increases the odds of a BI crash. The benchmark includes all other conditions (dry, wet, snow, etc.). This result is likely related to the difficult maneuverability of cars under icy road conditions. Moreover, ice conditions may overlap with other conditions, such as rain or snow, which can reduce the visual field. Drivers often inadequately compensate for such difficult conditions, perhaps because these conditions are not always recognizable, because compensation is not feasible, or because of a lack of experience with these conditions. Snow in southern states will likely result in much greater accident severity than snow in Canada. Several studies confirm the positive effect of bad weather on crashes, for instance [42] for vehicle–pedestrian crashes and [52] for crashes resulting in injury or fatalities.
In contrast to expectations, poor light and visibility conditions do not necessarily contribute to crash severity. Darkness decreases the odds of bodily injury crashes. Apparently, this is yet another situation when drivers may compensate by driving more cautiously or at a reduced speed, limiting accident severity even if accident likelihood rises. The authors of [51] suggest that the smaller the daylight minutes during rush hours, the greater the number of crashes.

3.8. Alcohol, Drug, and Work Zone

The results indicate that alcohol- and drug-related crashes are more likely to be PDO than BI crashes. This is quite an unexpected result, but there might be a possible explanation: when drivers are influenced by alcohol or drugs, there are more chances that a crash will happen, but since they are mentally and physically exhausted under the influence, they may drive more slowly and in zigzag, hitting objects and properties. Moreover, surprisingly, pedestrian-related crashes are more PDO than BI-type. However, in such pedestrian-related crashes, the pedestrian is not the at-fault party, and it may be that the driver, trying to avoid a pedestrian, may swerve suddenly and hit property rather than the pedestrian.
Finally, accidents in work zone areas are more likely to result in BI. Workers operate under dangerous conditions, where they can be struck by cars. Inexperienced or incautious drivers may run over the work zone. Driving lanes are often narrow, and may change in unexpected ways, which makes it difficult for drivers to fully adjust their driving. There is also asymmetry of traffic participants, as workers must work in the immediate vicinity of traffic, yet lack protection.

3.9. Regional and Locational Factors

An increasing distance from downtown Columbus increases the likelihood of a crash being BI. The closer to downtown, the greater the level of congestion and the slower the traffic. While accidents may be more frequent closer to downtown, the likelihood that they result in injury declines. This holds despite the greater asymmetry in street participants, and in particular the greater presence of vulnerable pedestrians. More complex signage, more pedestrians, and more law enforcement officers may lead to more careful driving. The urban dummy variable has a positive and significant coefficient. In contrast to expectations, an urban environment raises the odds of a BI crash. In urban areas, there are many attractions which can capture drivers’ attention in contrast to rural areas, and this may lead to more serious crashes.

3.10. Socio-Economic Factors

Population density, retail service employment, household income, and the proportion of Whites, all increase the likelihood for a crash to be BI-type. Denser areas with a lot of commercial activities may be more difficult to drive in, increasing the likelihood of BI crashes. It is not clear why a large share of Whites and higher household income increase the likelihood of BI crashes. Unexpectedly, when a neighborhood has a high proportion of young (Age < 14) or older (Age > 50) residents, the likelihood that an accident is of BI-type declines. Both younger and older residents are more vulnerable than middle-age adults, particularly as pedestrians. The young are less familiar with traffic dangers, older residents react more slowly, and both young and old often act impulsively and without proper precaution, so the result is unexpected. There are a number of possible reasons, though all remain speculative without additional data. As elsewhere, drivers in potentially vulnerable neighborhoods may overcompensate, resulting if not in fewer accidents, at least in accidents that are less often BI-type. Neighborhoods with preponderance of young or senior populations may differ from others in ways not captured by other physical neighborhood characteristics, making them safer in terms of BI. Future research will have to identify these reasons.

3.11. Land-Use

The results show that crashes in predominantly agricultural and residential areas are less likely to be BI-type. There may be less traffic in agricultural areas, hence less opportunities for severe accidents. Residential areas require lower speeds and more stops, which reduces the likelihood of BI crashes.

3.12. Neighborhood Physical Factors

The results show that physical neighborhood characteristics are associated with the possibility of a crash being the BI-type. The average age of buildings has a negative sign. Older neighborhoods may require more careful driving, leading to less BI crashes. However, the Historic District variable has the opposite effect, for which there is no clear explanation. The size of shopping centers has a positive effect on the odds that an accident is BI-type. This can be attributed to the difficulty of driving in such areas.

4. Discussion

This research tried to identify the variables that determine whether a crash results in Bodily Injury (BI) or Property Damage Only (PDO). BI crashes are of interest because they are generally more serious and costly than PDO crashes. The analysis focused on driver attributes, physical neighborhood characteristics, neighborhood resident and employment populations, and other factors such as road and weather condition. It adds significant detail to driver and vehicle attributes, and crash-site and weather conditions. One finding is that age affects crash severity following a polynomial curve. While the overall pattern is one of a downward trend with age, this trend is weak until the senior years. Among vehicle types, bus-type vehicles significantly increase the probability of a crash being BI, while motorcycles have the opposite negative effect. The results indicate that seasonal, weekly, and daytime variables are associated with the probability of BI crashes. Crashes that occur in summer are less likely to be BI crashes, while those in winter are more likely to be so. In terms of day of the week, driving Friday contributes to a crash being BI-type. Regarding the time of day, driving during morning and night rush hour increases the probability of a crash being BI-type.
The type of road and road conditions are also important factors for crash severity. Crashes that occur at 4-way intersections and driveways are less likely to be BI-type. Ramps and roundabouts have the opposite effect. Driving on icy roads is more likely to result in crashes being BI-type. Unexpectedly, driving under poor light conditions has the opposite effect. Among other factors influencing the likelihood of a crash being BI-type, alcohol and drugs have a negative effect, whereas work zones have a positive one.
Regional and locational factors, such as the distance to the center of the city of Columbus and the urban character of a TAZ positively and significantly impact the likelihood of a crash being BI-type. Neighborhood socio-economic conditions also matter. The higher the proportion of residents under 14 and over 50, the lower the likelihood of a crash being BI-type. Higher population density and larger retail service employment have the opposite effect. Two land uses, agricultural and residential, are negatively associated with BI crashes. Finally, the likelihood of a BI crash increases at lower posted limits, particularly less than 45 mph.
The research has shown that TAZ socio-economic characteristic have a significant impact on crashes. Yet not much is known about the mechanisms by which they affect crashes. Three such mechanisms are suggested, the effects that TAZ socio-economic conditions have on (i) the local composition of drivers at fault, (ii) the density and type of street activities in its area, and (iii) the built environment. However, some of the variables, such as the share of younger or older residents or the share of the White population, can influence accidents through all three mechanisms, and it is not clear which is most important. Indeed, we do not know the extent to which TAZ residents are also TAZ drivers at fault, or the extent to which TAZ residents are involved in crashes in their own neighborhood. Additional data may permit to shed light on these questions.

5. Conclusions

The modeling approach explained crash severity by combining explanatory variables related to the neighborhood (TAZ) of the crash with variables related to individual driver behavior, vehicle characteristics, and crash environment, such as weather, time of day, visibility, and road curvature. One contribution of this approach was to identify the age effect on crash severity. In approximate terms, the likelihood that a crash is severe does not change much for at-fault drivers between the age of 25 and 55, suggesting that limiting the study of age effects to the young and old, plus a single group in between may be sufficient for most purposes. The model is used to explain the probability of a bodily injury crash, including fatality, with the alternative being a property damage only crash. Because the age of the at-fault driver is taken as a continuous independent variable, it is possible to assess the lifetime impact of age, for both male and female drivers. According to the results, when an accident occurs, the probability of a bodily injury or fatal crash is 0.703 for female drivers, and 0.718 for male drivers at 15 years of age. These probabilities decline very slightly to 0.696 and 0.711, respectively, around 33 years of age, then very slightly increase to 0.697 and 0.712, respectively, around age 47.5 years of age. The results also confirm suggestions in the literature that males are more aggressive drivers and take greater risks. Male drivers of all ages are more likely to produce a crash leading to injury or death than female drivers.
Many variables that impact crashes and crash severity are beyond the control of planners or transport engineers. Even these variables may generate policy implications. For example, crashes are more likely in urban than rural areas, and crashes diminish with distance from the city center. While beyond control, they suggest transporting planners that, given limited budgets, it is wise to invest scarce safety resources in urban areas and close to the city center rather than in more distant rural areas. They also suggest the need to better understand just exactly what lies behind the differences in crash incidence. The most obvious reason is the higher traffic volume and the greater population and employment densities in urban than rural areas. As these variables were controlled for, other urban attributes must be the reason and will have to be investigated.
The research conducted in this study suggests numerous avenues for follow-up. The database developed for this research has the potential for a wide range of follow-up studies. One aim of the research was to introduce the at-fault driver into the analysis of crashes, as opposed to other crash-involved drivers. While the research succeeded in this aim, future research should analyze how the two types of analysis differ. The database allows for an analysis of crash-involved drivers, crash at-fault drivers, and the residual drivers that are involved but not at-fault. Fault is rarely quite clear-cut. Rear-end crashes, for example, are always taken to be the fault of the rear-ending vehicle, though the behavior of the front vehicle often contributes to this type of crash. It would be of interest to see whether there is a significant difference between these sets of drivers, and how they differ. This would also help in interpreting existing studies that have mostly focused on crash-involved drivers.
This research has some limitations that need to be addressed in future research. First, the data cover the period 2006–2011. Changes in vehicle safety standards, improvements in road infrastructure (e.g., roundabouts), changes in local enforcement, and expanded driver and pedestrian education may have since modified the roles of the various factors considered in this study. In particular, the COVID-19 pandemic may have influenced the overall environment for drivers, pedestrians, and traffic conditions. In this change context, it is noticeable that road fatalities in the U.S. have increased from 30,000 in 2011 to 36,000 in 2020 “https://www-fars.nhtsa.dot.gov/Main/index.aspx (accessed on 24 January 2023)”. Second, more spatial attributes need to be considered by adopting more advanced spatial analysis techniques.

Author Contributions

Writing—original draft, D.L.; writing—review and editing, J.-M.G.; providing insights and suggestions, B.v.R. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Ministry of Education of the Republic of Korea and the National Research Foundation of Korea (NRF-2021S1A5B5A17049828).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data that support the findings of this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Lee, D.; Guldmann, J.M.; Von Rabenau, B. Interactions between the built and socio-economic environment and driver demographics: Spatial econometric models of car crashes in the Columbus Metropolitan Area. Int. J. Urban Sci. 2018, 22, 17–37. [Google Scholar] [CrossRef]
  2. Al-Ghamdi, A.S. Using logistic regression to estimate the influence of accident factors on accident severity. Accid. Anal. Prev. 2002, 34, 729–741. [Google Scholar] [CrossRef] [PubMed]
  3. Tay, R.; Rifaata, S.M.; Chin, H.C. A logistic model of the effects of roadway, environmental, vehicle, crash and driver characteristics on hit-and-run crashes. Accid. Anal. Prev. 2008, 40, 1330–1336. [Google Scholar] [CrossRef] [PubMed]
  4. Nowakowska, M. logistic models in crash severity classification based on road characteristics. Transp. Res. Rec. 2010, 2148, 16–26. [Google Scholar] [CrossRef]
  5. Liu, L.; Dissanayake, S. Factors affecting crash severity on gravel roads. J. Transp. Saf. Secur. 2009, 1, 254–267. [Google Scholar] [CrossRef]
  6. Sagar, S.; Stamatiadis, N.; Codden, R.; Benedetti, M.; Cook, L.; Zhu, M. Socioeconomic and Demographic Factors Effect in Association with Driver’s Medical Services after Crashes. Int. J. Environ. Res. Public Health 2022, 19, 9087. [Google Scholar] [CrossRef]
  7. Young, R.K.; Liesman, J. Estimating the relationship between measured wind speed and overturning truck crashes using a binary logit model. Accid. Anal. Prev. 2007, 39, 574–580. [Google Scholar] [CrossRef] [PubMed]
  8. Eboli, L.; Forciniti, C.; Mazzulla, G. Factors influencing accident severity: An analysis by road accident type. Transp. Res. Proc. 2020, 47, 449–456. [Google Scholar] [CrossRef]
  9. Hu, S.; Li, C.; Lee, C. Investigation of key factors for accident severity at railroad grade crossings by using a logit model. Saf. Sci. 2010, 8, 186–194. [Google Scholar] [CrossRef] [Green Version]
  10. Srinivasan, K.K. Injury severity analysis with variable and correlated thresholds: Ordered mixed logit formulation. Transp. Res. Rec. 2002, 1784, 132–142. [Google Scholar] [CrossRef]
  11. Gkritza, K.; Mannering, F.L. Mixed logit analysis of safety-belt use in single- and multi-occupant vehicles. Accid. Anal. Prev. 2008, 40, 443–451. [Google Scholar] [CrossRef]
  12. Milton, J.D.; Shankar, V.N.; Mannering, F.L. Highway accident severities and the mixed logit model: An exploratory empirical analysis. Accid. Anal. Prev. 2008, 40, 260–266. [Google Scholar] [CrossRef] [PubMed]
  13. Kim, J.; Ulfarsson, G.F.; Shankar, V.N.; Mannering, F.L. A note on modeling pedestrian-injury severity in motor-vehicle crashes with the mixed logit model. Accid. Anal. Prev. 2010, 42, 1751–1758. [Google Scholar] [CrossRef] [PubMed]
  14. Morgan, A.; Mannering, F.L. The effects of road-surface conditions, age, and gender on driver-injury severities. Accid. Anal. Prev. 2011, 43, 1852–1863. [Google Scholar] [CrossRef]
  15. Kim, J.; Ulfarsson, G.F.; Kim, S.; Shankar, V.N. Driver-injury severity in single-vehicle crashes in California: A mixed logit analysis of heterogeneity due to age and gender. Accid. Anal. Prev. 2013, 50, 1073–1081. [Google Scholar] [CrossRef]
  16. Haleem, K.; Gan, A. Effect of driver’s age and side of impact on crash severity along urban freeways: A mixed logit approach. J. Saf. Res. 2013, 46, 67–76. [Google Scholar] [CrossRef] [PubMed]
  17. Shaheed, M.S.; Gkritza, K.; Zhang, W.; Hans, Z. A mixed logit analysis of two-vehicle crash severities involving a motorcycle. Accid. Anal. Prev. 2013, 61, 119–128. [Google Scholar] [CrossRef]
  18. Shankar, V.; Mannering, F. An exploratory multinomial logit analysis of single-vehicle motorcycle accident severity. J. Saf. Res. 1996, 27, 183–194. [Google Scholar] [CrossRef]
  19. Islam, S.; Mannering, F. Driver aging and its effect on male and female single-vehicle accident injuries: Some additional evidence. J. Saf. Res. 2006, 37, 267–276. [Google Scholar] [CrossRef]
  20. Malyshkin, N.V.; Mannering, F.L. Markov switching multinomial logit model: An application to accident-injury severities. Accid. Anal. Prev. 2009, 41, 829–838. [Google Scholar] [CrossRef]
  21. Ulfarsson, G.F.; Kim, S.; Booth, K.M. Analyzing fault in pedestrian–motor vehicle crashes in North Carolina. Accid. Anal. Prev. 2010, 42, 1805–1813. [Google Scholar] [CrossRef] [PubMed]
  22. Geedipally, S.R.; Turner, P.A.; Patil, S. Analysis of motorcycle crashes in Texas with multinomial logit model. Transp. Res. Rec. 2011, 2265, 62–69. [Google Scholar] [CrossRef]
  23. Tay, R.; Choi, J.; Kattan, L.; Khan, A. A multinomial logit model of pedestrian–vehicle crash severity. Int. J. Sustain. Transp. 2011, 5, 233–249. [Google Scholar] [CrossRef]
  24. Shaheed, M.S.; Gkritza, K. A latent class analysis of single-vehicle motorcycle crash severity outcomes. Anal. Methods Accid. Res. 2014, 2, 30–38. [Google Scholar] [CrossRef]
  25. Kim, J.; Ulfarsson, G.F.; Shankar, V.N.; Kim, S. Age and pedestrian injury severity in motor-vehicle crashes: A heteroskedastic logit analysis. Accid. Anal. Prev. 2008, 40, 1695–1702. [Google Scholar] [CrossRef]
  26. Ohashi, H.; Kim, T.; Oum, T.; Yu, C. Choice of air cargo transshipment airport: An application to air cargo traffic to/from Northeast Asia. J. Air Transp. Manag. 2005, 11, 149–159. [Google Scholar] [CrossRef]
  27. Moore, D.N.; Schneider, W.H., IV; Savolainen, P.T.; Farzaneh, M. Mixed logit analysis of bicyclist injury severity resulting from motor vehicle crashes at intersection and non-intersection locations. Accid. Anal. Prev. 2011, 43, 621–630. [Google Scholar] [CrossRef]
  28. Xie, Y.; Zhao, K.; Huynh, N. Analysis of driver injury severity in rural single-vehicle crashes. Accid. Anal. Prev. 2012, 47, 36–44. [Google Scholar] [CrossRef]
  29. Kockelman, K.M.; Kweon, Y.J. Driver injury severity: An application of ordered probit models. Accid. Anal. Prev. 2002, 34, 313–321. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  30. Quddus, M.A.; Noland, R.B.; Chin, H.C. An analysis of motorcycle injury and vehicle damage severity using ordered probit models. J. Saf. Res. 2002, 33, 445–462. [Google Scholar] [CrossRef] [PubMed]
  31. Abdel-Aty, M. Analysis of driver injury severity levels at multiple locations using ordered probit models. J. Saf. Res. 2003, 34, 597–603. [Google Scholar] [CrossRef] [PubMed]
  32. Yamamoto, T.; Shankar, V.N. Bivariate ordered-response probit model of driver’s and passenger’s injury severities in collisions with fixed objects. Accid. Anal. Prev. 2004, 36, 869–876. [Google Scholar] [CrossRef] [PubMed]
  33. Chiou, Y.; Hwang, C.; Chang, C.; Fu, C. Reprint of “Modeling two-vehicle crash severity by a bivariate generalized ordered probit approach”. Accid. Anal. Prev. 2013, 61, 97–106. [Google Scholar] [CrossRef] [PubMed]
  34. Garrido, R.; Bastos, A.; Almeida, A.; Elvas, J.P. Prediction of road accident severity using the ordered probit model. Transp. Res. Procedia 2014, 3, 214–223. [Google Scholar] [CrossRef] [Green Version]
  35. Xie, Y.; Zhang, Y.; Liang, F. Crash injury severity analysis using bayesian ordered probit models. J. Transp. Eng. 2009, 135, 18–25. [Google Scholar] [CrossRef] [Green Version]
  36. Eustace, D.; Indupuru, V.K.; Hovey, P. Identification of risk factors associated with motorcycle-related fatalities in Ohio. J. Transp. Eng. 2011, 134, 474–480. [Google Scholar] [CrossRef]
  37. Ye, F.; Lord, D. Investigation of effects of underreporting crash data on three commonly used traffic crash severity models: Multinomial logit, ordered probit, and mixed logit. Transp. Res. Rec. 2011, 2241, 51–58. [Google Scholar] [CrossRef]
  38. Lemp, J.D.; Kockelman, K.M.; Unnikrishnan, A. Analysis of large truck crash severity using heteroskedastic ordered probit models. Accid. Anal. Prev. 2011, 43, 370–380. [Google Scholar] [CrossRef] [Green Version]
  39. Park, S.; Jang, K.; Park, S.H.; Kim, D.; Chon, K.S. Analysis of injury severity in traffic crashes: A case study of Korean expressways. KSCE J. Civ. Eng. 2012, 16, 1280–1288. [Google Scholar] [CrossRef]
  40. Schneider, R.J.; Ryznar, R.M.; Khattak, A.J. An accident waiting to happen: A spatial approach to proactive pedestrian planning. Accid. Anal. Prev. 2004, 36, 193–211. [Google Scholar] [CrossRef]
  41. Clifton, K.J.; Fults, K.K. An examination of the environmental attributes associated with pedestrian-vehicular crashes near public schools. Accid. Anal. Prev. 2007, 39, 708–715. [Google Scholar] [CrossRef] [PubMed]
  42. Clifton, K.J.; Burnier, C.V.; Akar, G. Severity of injury resulting from pedestrian–vehicle crashes: What can we learn from examining the built environment? Transp. Res. Part D 2009, 14, 425–436. [Google Scholar] [CrossRef]
  43. Pulugurtha, S.S.; Sambhara, V.R. Pedestrian crash estimation models for signalized intersections. Accid. Anal. Prev. 2011, 43, 439–446. [Google Scholar] [CrossRef] [PubMed]
  44. Hanson, C.S.; Noland, R.B.; Brown, C. The severity of pedestrian crashes: An analysis using Google Street View imagery. J. Transp. Geogr. 2013, 33, 42–53. [Google Scholar] [CrossRef]
  45. Hanumegowda, P.K.; Gnanasekaran, S. Prediction of Work-Related Risk Factors among Bus Drivers Using Machine Learning. Int. J. Environ. Res. Public Health 2022, 19, 15179. [Google Scholar] [CrossRef] [PubMed]
  46. Roh, H. Comparative study on the performance of four fundamental optimization algorithms applied for transportation mode choice modelling. Transp. Eng. 2021, 3, 100043. [Google Scholar] [CrossRef]
  47. Mannering, F.L.; Bhat, C.R. Analytic methods in accident research: Methodological frontier and future directions. Anal. Methods Accid. Res. 2014, 1, 1–22. [Google Scholar] [CrossRef]
  48. Ben-Akiva, M.; Lermans, S.R. Discrete Choice Analysis: Theory and Application to Travel Demand; The MIT Press: Cambridge, MA, USA; The MIT Press: London, UK, 1985. [Google Scholar]
  49. Rampinelli, A.; Calderón, J.F.; Blazquez, C.A.; Sauer-Brand, K.; Hamann, N.; Nazif-Munoz, J.I. Investigating the Risk Factors Associated with Injury Severity in Pedestrian Crashes in Santiago, Chile. Int. J. Environ. Res. Public Health 2022, 19, 11126. [Google Scholar] [CrossRef]
  50. Massie, D.L.; Green, P.E.; Campbell, K.L. Crash involvement rates by driver gender and the role of average annual mileage. Accid. Anal. Prev. 1997, 29, 675–685. [Google Scholar] [CrossRef]
  51. Graham, D.J.; Glaister, S. Spatial variation in road pedestrian casualties: The role of urban scale, density and land-use mix. Urban Stud. 2003, 40, 1591–1607. [Google Scholar] [CrossRef]
  52. Fridstrøm, L.; Ingebrigten, S. An aggregate accident model based on pooled, regional time-series data. Accid. Anal. Prev. 1991, 23, 363–378. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Central Ohio Region.
Figure 1. Central Ohio Region.
Ijerph 20 02338 g001
Figure 2. Individual crashes in the study area over 2006–2011.
Figure 2. Individual crashes in the study area over 2006–2011.
Ijerph 20 02338 g002
Figure 3. Age and the value of F.
Figure 3. Age and the value of F.
Ijerph 20 02338 g003
Figure 4. Age and probability of BI crash.
Figure 4. Age and probability of BI crash.
Ijerph 20 02338 g004
Table 1. Contents of the literature.
Table 1. Contents of the literature.
ModelsVariablesMain ResultsReference
Logistic modelAccident severity, namely, location, cause of accident, etc.Contributing factors to accident severity, with data derived from a sample of 560 crashes in Riyadh, Saudi Arabia. Location (intersection) is closely associated with crash severity.[2]
Hit-and-run crashes, driver characteristics, vehicle types, crash characteristics, roadway features and environmental characteristics.Identifies the associated factors of hit-and-run crashes in Singapore.
Male drivers, drivers aged between 45 and 69 years, two-wheel vehicles are more associated with these crashes.
[3]
Road design elements (vertical and horizontal curves) road access (bus stops, public and private access lanes) and land use (gas stations, parking places).Contributing road characteristics to crashes severity in Poland.
The type of shoulders on both sides of a roadway, area type, pedestrian sidewalks, and intersection significantly influence crash severity.
[4]
Driver characteristics, road condition, collision type, safety equipment usage, driver ejection, alcohol involvement, speed limit.Examines the relationship between crash severity and the characteristics of gravel roads, using data over 1996–2005 in Kansas. Safety equipment usage, alcohol involvement, speed limit, and driver-related factors, have significant influences on crash severity.[5]
Educational attainment, median housing value, gender and age, median housing value, rurality percentage at the zip code level.Investigates whether the socioeconomic characteristics of a driver-based residence zip code have any relationship with the likelihood of post-crash medical services.[6]
Binary logit modelWind data, overturning truck crash.Wind speed is a critical factor in overturning freight vehicle crashes in Wyoming.[7]
Road (location, street, paving stretch, surface, signposting), external environment (day of week, weather, hour, season), driver (age, gender, license), accident (crash type, vehicle, dead).It is found that male drivers are more likely to be involved in fatal crashes in an intersection crash than female drivers. Drivers aged 65 years and above are more likely to be involved in fatal intersection crashes than other age groups. Drivers aged below 45 years have lower probability to be involved in a front/side collision.[8]
Generalized logit modelRailway features, highway features, crossing features, traffic controls, and land use.Key factors for crash severity at a railroad grade crossing using logit analysis. The number of daily trains and the presence of a law enforcement camera can influence crash severity.[9]
Ordered mixed logit modelDriver, traffic, crash-related and vehicle characteristics.Certain factors including tripped rollovers can increase chances of fatal injury and injuries can be sustained by moped riders.[10]
Mixed logit modelRoadway characteristics, vehicle attributes, and driver behavior.The effect of the use of safety belts in single and multi-occupant vehicles in Indiana. Safety belt is associated with vehicle type, gender, time, but these effects vary across the population.[11]
Average daily traffic per lane, average daily truck traffic, truck percentage, interchanges per mile and weather effects such as snowfall, the number of horizontal curves, number of grade breaks per mile and pavement friction.Volume-related variables are good for random parameters, while roadway-related parameters are good fixed parameters for these crashes.[12]
Roadway characteristics, vehicle attributes, and driver behavior.The possible unobserved heterogeneity in pedestrian injury severity caused by motor vehicle crashes in North Carolina. Darkness, truck, freeway, and age of pedestrians can increase the possibility of fatal injury in pedestrian-related crashes.[13]
Roadway-surface conditions, driver’s age, gender.Drivers in different age and gender groups perceive and react to road-surface conditions in different ways, which may result in varying crash severity.[14]
Driver characteristics (age, gender.), vehicle attributes, and roadway characteristics (light condition,).Driver injury severity in single-vehicle crashes in California, focusing on the heterogeneous effects of age and gender. Male driver, drunk driving, unsafe speed, older driver, driving older vehicle and driving in darkness without streetlights can increase the probability of fatal crashes.[15]
Traffic volume, distance of the crash to the nearest ramp, and detailed driver’s age, vehicle types, and sides of impact, etc.At-fault driver’s influential factors on crash severity on urban freeways in Florida. Age, traffic volume, distance of the crash to the nearest ramp, vehicle type, side of impact, and percentage of trucks are important.[16]
Manner of collision, motorcycle rider and non-motorcycle driver and vehicle actions, roadway and environmental conditions, location and time, motorcycle rider and non-motorcycle driver and vehicle attributes, etc.Adopts mixed logit models to investigate the effects of crash factors on crash severity. Non-uniform effects of rear-end collisions, roadway speed limit, type of area, riding season, motorcyclist’s gender, light conditions, roadway surface conditions, helmet use influence on crash severity.[17]
Multinomial logit modelEnvironmental factors, roadway conditions, vehicle characteristics, and rider attributes.Investigates the influential factors on crash severity (five levels) in single-vehicle motorcycle accidents, addressing the need of multinomial logit formulation.[18]
Driver characteristics (age, gender.),
single-vehicle accidents involving passenger cars.
The effect of driver age and gender on crash injury severity in single-vehicle crashes. There are significant different behavioral issues between genders in different age groups[19]
Roadway characteristics, accident types, weather conditions, etc.Weather condition is frequently associated with roadway safety.[20]
Road characteristics, vehicle attributes, and driver behavior.Crash injury severity and influential factors (crosswalk spacing, presence of both horizontal and vertical curves), vehicle type, signal timings, alcohol, gender, light condition, weekend.[21,22,23,24]
Pedestrian age, male driver, intoxicated driver, traffic sign, commercial area, darkness with or without streetlights, sport-utility vehicle, truck, freeway, two-way divided roadway, speeding-involved, off roadway, motorist turning or backing, both driver and pedestrian at fault, and pedestrian only at fault, etc.Develops heteroskedastic logit analysis to investigate the influential factors associated with the injury severity of pedestrians in motor-vehicle crashes in North Carolina.
Pedestrian age can increase the probability of fatal injury and it grows more pronounced with increasing age past 65 years.
[25]
Monetary cost factors, time cost factors, throughputs, airports, etc.In terms of the methodology, it adopts a multinomial logit model. It shows that reducing air cargo connecting time at an airport via adequate investment in capacity is important in terms of saving time costs.[26]
Standard multinomial logit and mixed logit modelsRoadway characteristics (intersection and non-intersection), vehicle attributes, and driver characteristics (bicyclist, wearing a helmet, drugs, alcohol).Finds that (1) driving under the influence of drugs or alcohol, (2) striking the side of the bicycle, and (3) crashes involved with a heavy-duty truck can increase the likelihood of severe injuries from motor vehicle crashes at intersections and non-intersection.[27]
Multinomial logit model and latent class logit modelSingle-vehicle crashes on rural roads.Uses multinomial logit model and latent class model at the same time, finding that vehicle age and surface condition (such as dry, wet, or icy) do not significantly impact driver injury severity.[28]
Ordered probit modelRoad characteristics, vehicle attributes, driver behavior, and driver characteristics.Uses ordered probit models for crash injury severity analysis to examine the factors that affect the risk of different injury levels sustained under various types of crashes including two-vehicle crashes, single-vehicle crashes, motorcycle, pedestrian, etc.[29,30,31,32,33,34]
Bayesian ordered probit modelsDriver’s characteristics, vehicle type, and roadway conditions, etc.Introduces Bayesian ordered probit models and compares the results with those of ordered probit models. When the sample data size is small, the Bayesian ordered probit model can produce better prediction performance than the ordered probit model.[35]
Multinomial probit modelGender of the motorcyclist, speeding, use of alcohol and/or drugs, helmet use, being involved in a single-vehicle crash or at a non-intersection location, horizontal curves, graded segments, major roadway.Finds that being a female motorcyclist, excessive speeding, use of alcohol and/or drugs, and riding without a helmet significantly increases fatality and severe injury in terms of in a single-vehicle crash, at a non-intersection location, on horizontal curves or graded segments, and major roadways.[36]
Multinomial logit, ordered probit, and mixed logitEnvironmental factors, vehicle attributes, and driver characteristics.Uses three commonly used methods, multinomial logit, ordered probit, and mixed logit, to investigate the effects of under-reported crash data. Fatal crashes must be the baseline severity for the MNL and ML models to minimize the bias and the variability of a model. The rank for the crash severity must be from fatal to property damage only in a descending order for the ordered probit models.[37]
Ordered probit
model
Impact of vehicle, occupant, driver, and environmental characteristics.Investigates the influential factors of large truck crash severity using ordered probit models. Finds that the likelihood of fatalities and severe injury rises with the number of trailers, while it falls with the truck length and gross vehicle weight rating.[38]
Ordered probit, ordered logit, and
multinomial logit model
Time, emergency service arrival time, crash location, primary crash factors, weather, radius of curvature, vertical grade, type of vehicle at fault, driver age.The time between midnight and 6:00 a.m.; driving while drowsy; median violation; car versus car collision; car versus people collision; car only collision; two or more related vehicles involved; van are the factors increasing risk of severe accident. Time between 6:00 A.M. and noon; ramp; toll gate; vehicle defects; obstacles and poor road conditions; rainy or snowy weather are the factors decreasing risk.[39]
Spatial analysis and negative binomial regressionRoadway characteristics and spatial/land use, vehicle attributes, and driver characteristics.Incomplete sidewalks and high crosswalk density are associated with pedestrian crash risk. People perceive a lower risk near university libraries, stadiums, and academic buildings.[40]
Multivariate modelsSpatial/land use, transit access, commercial access, and population density, built environment and design characteristics.Examines both risk exposure and injuries sustained in child pedestrian-vehicular crashes in the vicinity of public schools. There is a significant association with several built-environment and design characteristics.[41]
Generalized ordered probit modelEnvironmental characteristics on the severity of injuries sustained in pedestrian–vehicle crashesIt is found that women pedestrians tend to be injured less frequently than male pedestrians; children have an increased likelihood of injuries; older persons are more likely to be fatally injured in pedestrian–vehicle crashes.[42]
Generalized linear model and
negative binomial model
Pedestrian crashes and demographic (population and household units) and socio-economic characteristics (mean income and total employment), land use), and accessibility to public transit systems, road network characteristics (the number of lanes, speed limit, presence of median, and pedestrian and vehicular volume).Population, transit stops, and pedestrians can increase pedestrian crashes. On the other hand, single family, urban residential commercial and neighborhood service can lower pedestrian crashes.
Demographic, socio-economic, land use, and road data are better predictors than traffic data. Buffers of 0.5 mile yield better estimates for all and low activity intersections, while 1 mile buffers yield better estimates for high activity intersections.
[43]
Visual inspection of Google Street View and logit modelPresence of sidewalks, buffers between the road and the sidewalk, street lighting, number of travel lanes and the presence of medians, traffic controls at intersections, and posted speed limits.Lack of sidewalks, buffers, high-speed roads, roads with six or more lanes, lack of traffic lighting, speed are associated with severity of pedestrian casualties. Age of pedestrians can cause more severe casualties.[44]
Machine learning techniquesSurvey dataIn terms of Work-related Musculoskeletal Disorders (WMSDs), it is found that several risk factors (involvement in physical activities, frequent posture change, exposure to vibration, egress/ingress, duty breaks, and seat adaptability issues) influence the frequency of pain of drivers.[45]
Multinomial logit model and
discrete choice model
Mode choice preference data collected from airport passengers (540 observations).Discrete choice model optimization algorithms using Excel is proved to be efficient in managing model tasks.
Maximum likelihood method is an optimal method for estimating the coefficients of the variables. Newton Raphson is one of the best algorithms, while the worst performed algorithm is the Steepest Ascent (SA) method.
[46]
Review Presents a complete review regarding analytical methodologies of crash injury severity models and approaches in highway accident research.[47]
Table 2. Descriptive statistics of the selected individual crash variables.
Table 2. Descriptive statistics of the selected individual crash variables.
CategoryVariableDescriptionMeanStd DevMinimumMaximum
Crash SeverityBodily
Damage
If the crash resulted in injury damage or fatality then = 1, else = 00.2960.45701
SexMDIf driver is male = 1, if not = 00.5740.49401
AgeAgeDriver’s age361616105
Age2Driver’s age21575146025611,025
Age3Driver’s age380,391112,9184096115,7625
Type of VehicleCompactIf type of unit is sub-compact, compact then = 1, else = 00.1720.37701
MidSizeIf type of unit is midsize then = 1, else = 00.3290.4701
FullSizeIf type of unit is full size then = 1, else = 00.1010.30101
VanIf type of unit is mini-van or van then = 1, else = 00.0810.27301
SUVIf type of unit is SUV then = 1, else = 00.1560.36301
PickupIf type of unit is pickup then = 1, else = 00.10.301
TruckIf type of unit is single unit truck, trailer, truck tractor, truck trailer with double short or long, converter dolly, semi-trailer, fifth wheel, tractor with triples then = 1, else = 00.0420.20101
MotorcycleIf type of unit is motorcycle then = 1, else = 00.0070.08301
BusIf type of unit is school bus, church bus, public bus, or other bus then = 1, else = 00.0060.07901
TimeAfternoonIf time is between 12 pm and 5 pm then = 1, else = 00.3620.48101
Peak Time1If time is between 7 am and 9 am then = 1, else = 00.1110.31501
Peak Time2If time is between 5 pm and 7 pm then = 1, else = 00.1640.3701
SeasonSummerIf date is June or July or August then = 1, else = 00.2450.4301
WinterIf date is December or January or February then = 1, else = 00.2610.43901
Day of WeekFridayIf day of week is Friday then = 1, else = 00.180.38401
Crash LocationIntersection_
4ty
If crash occurred on four-way intersection, T-intersection and Y-intersection then = 1, else = 0.0.4220.49401
Intersection_
roundabout
If crash occurred on roundabout then = 1, else =0.0.0030.05201
RampIf the crash occurred on the ramp then = 1, else = 00.0430.20301
DrivewayIf the crash occurred on the driveway then = 1, else =00.0370.18901
CurveIf road contour is curved then = 1, else = 00.0840.27701
Light ConditionIceIf road condition is ice or snow then = 1, else = 00.060.23801
DarkIf light condition is lighted roadway, roadway not lighted and unknown roadway lighting then = 1, else = 00.2290.4201
SpeedSpeed 45If the crash occurred on the road under posted speed 45 then = 1, else = 00.5270.49901
Other Crash FactorsWork zoneIf the crash occurred within a work zone then = 1, else = 00.0130.11401
AlcoholIf the driver was influenced by alcohol then = 1, else = 00.0490.21601
DrugIf the driver was influenced by drugs then = 1, else = 00.0110.10201
PedestrianIf pedestrian was involved in the crash then = 1, else = 00.0040.06401
Table 3. Descriptive statistics of the selected TAZ-level variables.
Table 3. Descriptive statistics of the selected TAZ-level variables.
VariableDescriptionMeanStd DevMinimumMaximum
UrbanIf the TAZ is urban = 1, if not = 00.850.35701
Mile_Distance to center of Columbus (mile)9.9788.250.07344.466
Columbus
PopdensityPopulation/TAZ area (acre)5.1815.788047.043
Hhinc10Household income 201052,079.2125,292.198785161,377
Empretsrv1Retail goods employment 2010113.915196.60902124
WHITE_P% of Whites in the TAZ0.7170.26901
Punder14% of population under 140.1840.07900.442
P5064% of population between 50 and 640.1770.0701
POver65% of population over 650.1160.101
Agriculture% of agricultural land use0.1370.27501
Residential% of residential land use0.3460.27100.962
Built_ageBuilt age of construction in the TAZ where the crash occurred44.30723.3715509.25
HistoricDIf the crash occurred in historic districts = 1, if not = 00.0210.14301
Shop_acreArea of Shopping Centers3.4329.085076.859
Table 4. Logit estimation results.
Table 4. Logit estimation results.
VariableDFEstimateStandard ErrorWald Chi-SquarePr > ChiSq
Intercept11.07660.094130.3<0.0001
Crash-Related Factors
AG1−0.01340.0065.260.0219
AG210.0003440.00016.640.01
AG31−0.000002850.000000929.460.0021
MD10.12690.012116.73<0.0001
Compact1−0.31520.031102.1<0.0001
MidSize1−0.3310.03124.17<0.0001
FullSize1−0.30330.03384.89<0.0001
Van1−0.32950.03495.49<0.0001
SUV1−0.390.031157.35<0.0001
Pickup1−0.26990.03368.94<0.0001
Motorcycle1−2.97430.0931032.12<0.0001
Bus10.39280.09218.15<0.0001
PeakTime110.06190.0218.910.0028
PeakTime210.08210.01722.38<0.0001
Afternoon10.0330.0154.60.032
Summer1−0.06120.01420.06<0.0001
Winter10.1530.014118.38<0.0001
Friday10.04220.0158.370.0038
Intersection_4ty1−0.21630.012306.18<0.0001
Intersection_roundabout11.14750.15256.7<0.0001
Ramp10.16640.02931.92<0.0001
Driveway1−0.05420.0313.070.0799
Curve1−0.19250.02185.77<0.0001
Ice10.22590.0432.04<0.0001
Dark1−0.05510.01711.180.0008
Speed4510.41730.0131040.73<0.0001
Work zone10.19440.0514.920.0001
Alcohol1−0.45810.027288.72<0.0001
Drug1−0.46780.05377.93<0.0001
Pedestrian1−3.30780.132630.66<0.0001
TAZ- Related Factors
Urban10.09360.02218.55<0.0001
Mile_Columbus10.007880.00151.49<0.0001
Popdensity10.01330.00179.92<0.0001
Hhinc1010.000000860.00000038.380.0038
Empretsrv110.0002580.0000447.4<0.0001
WHITE_P10.2330.02871.55<0.0001
Punder141−0.61210.08650.64<0.0001
P50641−0.57420.09238.95<0.0001
POver651−0.31840.0628.07<0.0001
Agriculture1−0.23580.03642.54<0.0001
Residential1−0.16550.0330.7<0.0001
Built_age1−0.000870.00039.290.0023
HistoricD10.20650.04520.93<0.0001
Shop_acre10.001450.0013.710.0539
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lee, D.; Guldmann, J.-M.; von Rabenau, B. Impact of Driver’s Age and Gender, Built Environment, and Road Conditions on Crash Severity: A Logit Modeling Approach. Int. J. Environ. Res. Public Health 2023, 20, 2338. https://doi.org/10.3390/ijerph20032338

AMA Style

Lee D, Guldmann J-M, von Rabenau B. Impact of Driver’s Age and Gender, Built Environment, and Road Conditions on Crash Severity: A Logit Modeling Approach. International Journal of Environmental Research and Public Health. 2023; 20(3):2338. https://doi.org/10.3390/ijerph20032338

Chicago/Turabian Style

Lee, Dongkwan, Jean-Michel Guldmann, and Burkhard von Rabenau. 2023. "Impact of Driver’s Age and Gender, Built Environment, and Road Conditions on Crash Severity: A Logit Modeling Approach" International Journal of Environmental Research and Public Health 20, no. 3: 2338. https://doi.org/10.3390/ijerph20032338

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop