Localized Space-Time Autoregressive Parameters Estimation for Traffic Flow Prediction in Urban Road Networks

Chen, Jianbin; Li, Demin; Zhang, Guanglin; Zhang, Xiaolu

doi:10.3390/app8020277

Open AccessArticle

Localized Space-Time Autoregressive Parameters Estimation for Traffic Flow Prediction in Urban Road Networks

by

Jianbin Chen

^1,2

,

Demin Li

^1,2,*,

Guanglin Zhang

^1,2

and

Xiaolu Zhang

^1,2

¹

College of Information Science & Technology, Donghua University, Shanghai 201620, China

²

Engineering Research Center of Digitized Textile & Fashion Technology, Ministry of Education, Donghua University, Shanghai 201620, China

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2018, 8(2), 277; https://doi.org/10.3390/app8020277

Submission received: 26 October 2017 / Revised: 18 January 2018 / Accepted: 9 February 2018 / Published: 12 February 2018

(This article belongs to the Special Issue Road Vehicles Surroundings Supervision: On-Board Sensors and Communications)

Download

Browse Figures

Versions Notes

Abstract

:

With the rapid increase of private vehicles, traffic congestion has become a worldwide problem. Various models have been proposed to undertake traffic prediction. Among them, autoregressive integrated moving average (ARIMA) models are quite popular for their good performance (simple, low complexity, etc.) in traffic prediction. Localized Space-Time ARIMA (LSTARIMA) improves ARIMA’s prediction accuracy by extending the widely used STARIMA with a dynamic weight matrix. In this paper, a localized space-time autoregressive (LSTAR) model was proposed and a new parameters estimation method was formulated based on the LSTARIMA model to reduce computational complexity for real-time prediction purposes. Moreover, two theorems are given and verified for parameter estimation of our proposed LSTAR model. The simulation results showed that LSTAR provided better prediction accuracy when compared to other time series models such as Shift, autoregressive (AR), seasonal moving average (Seasonal MA), and Space-Time AR (STAR). We found that the prediction accuracy of LSTAR was a bit lower than the LSTARIMA model in the simulation results. However, the computational complexity of the LSTAR model was also lower than the LSTARIMA model. Therefore, there exists a tradeoff between the prediction accuracy and the computational complexity for the two models.

Keywords:

LSTAR; STARIMA; parameters estimation; traffic flow prediction; urban road network

1. Introduction

In recent decades, the number of vehicles in urban areas has increased rapidly and the urban road network is becoming larger and more complex. Due to this, traffic congestion has become a major problem in big cities, which has led to more fuel consumption and environment pollution. Statistics show that the average annual traffic congestion cost in the US in 2014 was 1433 dollars per auto commuter, or over 5 billion dollars per city for very large urban areas [1]. In order to improve the efficiency of the urban road network and reduce traffic congestion, intelligent transportation systems (ITS) [2] have been developed by integrating information technology, automatic control technology, and geographic information systems (GIS). With the introduction of ITS, real-time traffic information is available to the vehicles in the road network for trip planning through vehicular navigation systems or dynamic route guidance systems. Unlike the computer network, which can transmit data package from source to destination in a very short time, vehicles in urban road networks need much longer times to travel to their destinations. Thus, trip planning should consider not only current traffic information, but also future traffic conditions. Therefore, short-term traffic flow prediction with real-time traffic information as prior knowledge is quite essential and has attracted increasing attention [3].

The earliest traffic prediction method was based on the macroscopic traffic simulation model proposed by Lighthill and Whitham [4] and Richards [5], known as the Lighthill-Whitham-Richards (LWR) model. In this model, vehicles in the highway are treated as “traffic flow” and their dynamics can be analyzed with the continuous fluid conservation equation in fluid mechanics. The microscopic traffic modeling method of cellular automaton (CA) [6] simulates the traffic flow dynamics by analysis of the interaction of individual vehicles. Traffic simulation models focus on traffic flow dynamics using only current traffic information; no historic information is needed. The weakness of the traffic simulation model is that it needs the origination-destination (OD) matrix of all vehicles to simulate the traffic dynamics, which is normally hard to collect.

The autoregressive moving average (ARMA) model or autoregressive integrated moving average (ARIMA) model [7], also called the Box-Jenkins model, is an important prediction model in economics and other areas. Furthermore, it is considered as the standard of time series prediction. ARIMA and its variations as seasonal ARIMA (SARIMA) [8], vector ARMA (VARMA) models [9], and so on have been widely used for traffic prediction. The space-time ARIMA (STARIMA) [10] model has a long historical background which is based on the ARMA with exogenous inputs (ARMAX) model. Since the 1980s, STARIMA has been applied to different areas such as river flow, spread of disease, spatial econometrics, and so on. In 2005 [11], the STARIMA methodology was first proposed for the spatiotemporal behavior of traffic flow. In the STARIMA model, traffic flow data is in the form of a spatial time series which is collected at specific locations at constant intervals of time to be used for the short-term forecasting of space-time stationary traffic-flow processes. Furthermore, the model can be used for assessing the impact of traffic-flow changes on other parts of the network through the use of weight matrices estimated on the basis of the distances among the various locations. Unlike the VARMA model, which is a generic model without any known information, the number of parameters to be estimated for STARIMA is much less, as the road network topology is considered. However, in the proposed STARIMA model [11], the weight matrices are assigned equally without considering the traffic condition differences between the directly connected first order spatial neighbors and the not directly connected higher order neighbors in the whole road network.

In the past 10 years, there have been many research studies on STARIMA. Min et al. [12] presented a dynamic form of the STARIMA that accounted for temporal dynamics. They replaced the traditional distance-weighted spatial weight matrix with a temporally dynamic matrix that reflected the current traffic turn ratios observed at each road intersection. The weight matrix can be updated in real time based on current conditions, but the method was limited to intersection-based flow data and was fixed spatially.

Tao Cheng et al. [13] extended the standard STARIMA model to a Localized STARIMA (LSTARIMA) model, which described the modeling of dynamic and heterogeneous autocorrelation in network data with improved traditional models. The constructed model provided an improvement over the traditional space-time series models. Their paper showed that the performance of prediction was improved when compared to standard STARIMA models. The LSTARIMA model captured the autocorrelation of traffic data locally and dynamically in the road network with a dynamic spatial weight matrix. The LSTARIMA model has also shown good performance in traffic prediction without the need for data pre-processing (e.g., a logarithmic transformation and differencing). Compared with other ARIMA variations, the LSTARIMA model has a simpler structure (because the LSTARIMA has smaller AR, and MA order values p and q). As the future traffic state of the current road depends not only on its prior states, but also on its neighbor roads, the weight matrix of roads is key to traffic flow prediction.

In this paper, our contributions are as follows:

An LSTAR model with lower computational complexity based on the LSTARIMA was proposed. In the LSTARIMA model of Cheng et al. [13], the same weight matrix W was used for AR and MA components of the whole road network. We used different matrices, W and U, for AR and MA components. And individual observation $z_{i} (t)$ was used instead of the N-dimension column vector $Z (t)$ to allow each road to have its own weight matrix W, U. Since the ARMA model can be properly approximated by a high-order AR model, we further developed the reconstructed LSTARIMA model into our proposed LSTAR model.
A more reasonable weight matrix and new traffic information collection with the Vehicular Ad hoc Networks (VANET) approach was proposed. As the number of vehicles output from upstream roads has more impact on the future traffic condition compared to speed difference, it was used to determine the dynamic spatial weights instead of the speed difference. To obtain the traffic information needed for weight matrix determination, the vehicles stopped at red lights were used to collect traffic information via VANET.
Two theorems were given and verified for parameter estimation of our proposed LSTAR model. When the distribution of traffic flow is stable, the weight matrix can be treated as time invariant. When the traffic flow distribution is not stable, the weight matrix is time variant. For these two different cases, we provided two theorems to determine the parameters.
Related simulations were performed. Through the simulation results, we observed that the prediction accuracy of LSTAR was a bit lower than the LSTARIMA model. However, the computational complexity of the LSTAR model was also lower than the LSTARIMA model. Therefore, there existed a tradeoff between the prediction accuracy and the computational complexity for the two models.

The rest of this paper is organized as follows. In Section 2, state-of-the-art traffic information collection, traffic prediction, and traffic applications are reviewed. In Section 3, we introduce the LSTAR model, the construction of the weight matrix of the LSTAR model, and a new traffic information collection method. In Section 4, parameters estimation methods of the LSTAR model are given and proven. In Section 5, the experimental evaluation is presented. Finally, Section 6 provides the conclusions and identifies future research directions.

2. State-of-the-Art and Related Topics

Traffic information collection provides the data input for traffic prediction, and there are many applications that use traffic prediction results to improve traffic conditions. In this section, the state-of-the-art traffic information collection, traffic prediction, and urban traffic applications are reviewed.

2.1. Traffic Information Collection

As traffic information is the base data of ITS, how to collect traffic information efficiently is an important area of research. Loop detectors are pressure, magnetic, and other sensors buried underground to detect if there are vehicles passing over them. They are widely deployed in urban areas to count the number of vehicles passing through fixed points of roads. Compared to loop detector technology, machine-vision-based traffic monitoring is a state-of-the-art approach with the advantages of easy maintenance, real-time visualization, and high flexibility [14]. With properly installed cameras, traffic information such as speed, volume, and even traffic accidents can be detected. However, it is expensive to establish these systems as well as to maintain a huge number of fixed devices on the road side, and they can only gather the traffic information of fixed points.

With the equipment of global position system (GPS) receivers on vehicles or mobile phones, vehicles can detect their own real-time location and speed. An alternative traffic information collection approach has been proposed to estimate the traffic state by checking the location and speed of some vehicles running on the road [15,16]. These probe vehicles are known as floating vehicles, and are normally buses and taxis. Due to the low cost of GPS receivers, the overall cost of the floating vehicles system is low. The shortcoming of floating vehicles is that their distribution in the urban traffic network is not even in space and time, which means that they may not be able to provide complete traffic states of the whole road network.

Vehicular Ad hoc Networks (VANET) [17,18,19] are an emerging technology developed for traffic security and data transformation [20]. In recent years, many research studies have used VANET to collect traffic information. The first type of system uses VANET to only estimate the traffic density of the road by detecting the number of vehicles in the VANET communication range [21]. The other type of system assumes that each vehicle is also equipped with a GPS receiver so more detailed traffic information can be collected [22,23]. As GPS receivers are more commonly equipped when compared with VANET and GPS can provide more information, more research has focused on the approach with GPS. With the communication capability of VANET, the collected traffic information can be easily shared and used by other ITS applications such as traffic prediction, route planning, and so on.

After traffic information is collected, it can be used as an input to other ITS applications as base data. Since VANET can obtain the traffic information of the whole network without infrastructure and can be easily integrated with other ITS systems, it was used to collect traffic information in our paper. Section 3.3 discussed our traffic information collection method via VANET in detail.

2.2. Traffic Prediction

Short-term traffic prediction is one of the most important topics in ITS research and practice. Aside from the ARIMA series prediction method, there have also been many other methodologies engaged for this purpose.

The Kalman filtering method [24,25], which is based on historical data and present data to predict a future state, has been widely used in the forecasting of traffic flow. However, its computational complexity is too high for complex urban traffic flow prediction. Neural Networks (NNs) models [26,27] have also been utilized to predict traffic flow for their high prediction accuracy. The weak point of NNs is the long model training time. Other research studies have referred to Support Vector Machines (SVM) [28]. However, the error of SVM is high under the circumstance of peak periods and blocking traffic accidents compared with the Bayesian network [29].

The K-NN method (K-nearest neighborhood) [30] performs well in short-term forecasting even when accidents have occurred. However, this algorithm has high complexity and needs a large amount of calculation when searching for class neighbors. Markov-based models [31] have also shown good performance on traffic flow prediction since the traffic condition at the next interval is closely related to the recent states. However, in these models, there are many states to consider. New technologies such as big data [32] and particle filtering [33,34] have also been used in traffic predication and other urban mobility applications.

Although these traffic prediction approaches have provided good prediction results in some scenarios, most of them are too complex or require a long training time. Thus, there have been respectable efforts put towards improving various ARIMA prediction models.

2.3. Urban Traffic Applications

Current traffic information and predicted traffic information are the base data of ITS, but they are meaningless without practical applications. Route planning and vehicle navigation systems are some of the most popular applications that use real-time and predicted traffic information [35]. The first-generation route planning system only considers the static features of the road network to obtain the shortest path. With the development of real-time traffic information collection technology, dynamic route planning has been proposed to re-calculate the new shortest path with the updated real-time data at each intersection. Such a system provides better travel planning when compared with the static route. However, using only current data may lead to frequent route changes in complex traffic conditions. The newest route planning systems use the predicted traffic information to arrive at the best and most stable route to the destination.

The other main applications using current and predicted traffic information are traffic management applications. The most common usage is to display current traffic states, the predicted traveling time to land mark locations, and traveling proposals on the traffic information board. With this information, drivers can re-plan their travel accordingly. Adaptive traffic signal control is the key technology of traffic management. The reactive traffic signaling control system adjusts the signal phase and cycle lengths according to current traffic data. The predictive traffic signal systems retime the traffic sign according to the predicted traffic information. With the predictive approach, the total waiting time of vehicles is reduced and the efficiency of the traffic network is improved [36].

Predicted traffic information is not only used by navigation systems and traffic management systems, but also by other urban applications such as parking management and so on. For the wide usage of traffic prediction, there are continuous research interests in this field.

3. Model and Preliminaries

3.1. LSTAR Model Construction

According to the STARIMA model defined in Reference [11], both space and time are considered.

Z_{t} = \sum_{k = 1}^{p} \sum_{l = 0}^{λ_{k}} ϕ_{k l} W_{l} z_{t - k} - \sum_{k = 1}^{q} \sum_{l = 0}^{m_{k}} θ_{k l} W_{l} ε_{t - k} + ε_{t} .

(1)

Z_{t}

is an N-dimensional column vector of road i, while i = 1, 2,…, N, and

W_{l}

is an

N \times N

matrix with element

w_{i j}^{l}

.

ε_{t}

is the residual vector.

ϕ_{k l}

,

θ_{k l}

are the AR and MA parameters, respectively.

Then, the observation of road

i

,

z_{i} (t)

can be described as:

z_{i} (t) = \sum_{k = 1}^{p} \sum_{l = 0}^{λ_{k}} \sum_{j = 1}^{N} ϕ_{k l} w_{i j}^{(l)} z_{j} (t - k) - \sum_{k = 1}^{q} \sum_{l = 0}^{m_{k}} \sum_{j = 1}^{N} θ_{k l} w_{i j}^{(l)} ε_{j} (t - k) + ε_{i} (t) .

(2)

In 2014, a new space-time model, the localized STARIMA (LSTARIMA) model, was proposed by Cheng et al. [13] to consider spatial heterogeneity and temporal non-stationarity. The model is described by the following form:

Z_{i} (t) = \sum_{k = 1}^{p_{i}} \sum_{h = 0}^{λ_{k} (t - k, i)} ϕ_{i, k h} W^{(h, t - k, i)} Z_{i} (t - k) - \sum_{l = 1}^{q_{i}} \sum_{h = 0}^{n_{l} (t - l, i)} θ_{i, l h} W^{(h, t - l, i)} ε_{i} (t - l) + ε_{i} (t) .

(3)

Z_{i} (t)

is an N-dimensional column vector of the observation value on link 1, …, N with tag

i

at time

t

, which can be any prediction variable of roads such as speed, traffic flow, density, and so on. The term

ε_{i} (t)

is a residual on link 1, …, N at time

t

. The first term in Equation (3) is the AR component, whereas the second term is the MA. The parameters

p_{i}

and

q_{i}

are the AR and MA orders, respectively.

h

is the spatial order that represents the order of spatial separation between two locations. The parameters

λ_{k} (t - k, i)

and

n_{l} (t - l, i)

are the dynamic spatial orders associated with the

k th

and

l th

temporally lagged terms in the AR and MA components, respectively. They specify the size of the spatial neighborhood that could influence the link of interest

i

within temporal lags

k

and

l

. The parameters

W^{(h, t - k, i)}

and

W^{(h, t - l, i)}

are the dynamic spatial weight matrices

W^{(h, t, i)}

pertaining to link

i

at temporal lags

k

and

l

.

ϕ_{i, k h}

and

θ_{i, l h}

are the AR and MA parameters for each link

i (i = 1, 2, \dots, N) .

Although spatial tag

i

was added to

Z_{i} (t)

in the LSTARIMA model construction, the spatial heterogeneity was not fully considered. As the

Z_{i} (t)

here is an N-dimension column vector which covers all of the roads (road 1, 2, …, N) in the network, all of the roads will share the same

ϕ_{i, k h}

and

θ_{i, l h}

. In this LSTARIMA model, the same matrix W is used for both AR and MA components. As the weight matrix of AR and MA components is not always the same, using only one weight matrix

W

is not proper.

In this paper, different weight matrices

W

,

U

were used for AR and MA components and individual road traffic flow observation

z_{i} (t)

, according to Equation (2), was defined to allow every road to have its own weight matrix

W_{i}

,

U_{i}

according to spatial location, but not sharing the same weight matrix among all roads, as in Equation (3).

Then, the LSTARIMA can be rewritten as follows:

z_{i} (t) = \sum_{k = 1}^{p_{i}} \sum_{h = 0}^{λ_{i, k}} \sum_{j = 1}^{N_{i}} ϕ_{i, k h} w_{i j}^{(h)} (t - k) z_{j} (t - k) - \sum_{k = 1}^{q_{i}} \sum_{h = 0}^{m_{i, k}} \sum_{j = 1}^{N_{i}} θ_{i, k h} u_{i j}^{(h)} (t - k) ε_{j} (t - k) + ε_{i} (t) .

(4)

The parameters

w_{i j}^{(h)} (t - k)

and

u_{i j}^{(h)} (t - l)

are the elements of dynamic spatial weight matrices

W_{i}^{(h)} (t - k)

and

U_{i}^{(h)} (t - l)

pertaining to link

i

at temporal lags

k

and

l

. Like the traditional STARIMA model, LSTARIMA makes use of spatial weight matrices

W

and

U

to model the influence of the spatiotemporal neighborhoods. However, it relaxes the globally fixed temporal dependence for all locations by using different AR and MA parameters according to location. Furthermore, it accounts for the temporal non-stationarity by allowing the matrix elements value and size of the spatial neighborhoods to vary with time.

According to Bo [37], the ARMA model can be properly approximated by the high-order AR model. As AR only has one type of parameter to be estimated, ARMA and ARIMA have two or three types of parameters to be estimated, and the parameters estimation of AR is easy even when the order is a little bit higher. There exist plenty of studies that have used a high-order AR model to approximate many processes of interest [37]. Furthermore, traffic flow has complex dynamics and may not exactly match an ARIMA model. In addition, many studies have removed MA and used only different AR models to conduct traffic prediction and obtain good results with limited AR order [38]. For reducing computational complexity and real-time prediction purpose, in this paper, we proposed the LSTAR model for traffic flow prediction.

With the MA component removed, the LSTARIMA prediction model (Equation (4)) is changed into the following LSTAR model:

z_{i} (t) = \sum_{k = 1}^{p_{i}} \sum_{h = 0}^{λ_{i, k}} \sum_{j = 1}^{N_{i}} ϕ_{i, k h} w_{i j}^{(h)} (t - k) z_{j} (t - k) + ε_{i} (t)

(5)

where

ε_{i} (t)

is white noise,

ϕ_{i, h k}

is the parameter for each link

i (i = 1, 2, \dots, N)

, and

w_{i j}^{(h)} (t - k)

are the elements of the dynamic spatial weight matrix

W_{i}^{(h)} (t - k)

pertaining to link

i

at temporal lag

k

.

3.2. Weight Matrix Construction

Weight matrix construction is an essential topic in STARIMA models. In the STARIMA model, the weight matrix is time invariant and equal for the same neighbor order. In the LSTARIMA model, a time variant weight matrix was introduced to improve traffic prediction accuracy with lower AR and MA orders. Furthermore, the speed difference was used to construct the weight matrix in the LSTARIMA model [13]. The speed of a road is an important character of traffic flow, but is not the essential one in terms of impact to surrounding roads. It is obvious that a road that outputs only one vehicle will not impact neighbor roads at the same level as saturated roads with the same speed. The traffic output amount to the neighbor roads in a time slot has more impact on the future traffic state of neighbor roads. Thus, the output vehicle number during a time slot was used in the weight matrix construction instead of speed in this paper.

For all pairwise road sections

(i, j)

with spatial lag

h

, the corresponding

w_{i j}^{(h)} (t)

is defined as follows:

w_{i j}^{(h)} (t) = {\begin{matrix} 1 & h = 0, i = j \\ \frac{Q_{i j} (t)}{\sum Q (t)} & h = 1, \sum Q (t) \neq 0 \\ \frac{Q^{(h)}_{i j} (t)}{\sum Q^{(h)} (t)} & h \neq 1, \sum Q^{(h)} (t) \neq 0 \\ 0 & \sum Q^{(h)} (t) = 0 \end{matrix}

(6)

where

Q_{i j} (t)

is the number of vehicles running from road

j

towards road

i

at time slot

t

and

\sum Q (t)

is the sum of vehicle numbers from directly connected roads to road

i

at time slot

t

.

Q^{(h)}_{i j} (t)

is the number of vehicles on the

h

order neighbor road

j

towards

h - 1

order neighbor of road

i

at time slot

t

, and

Q^{(h)}_{i j} (t)

= 0 if

j

is not the

h

order neighbor of

i

.

\sum Q^{(h)} (t)

is the sum of

Q^{(h)}_{i j} (t)

.

3.3. Traffic Information Collection

In this paper, we proposed a traffic information collection system via VANET for urban areas. In this system, each vehicle was assumed to have a GPS receiver and VANET equipped to report its location. This assumption is reasonable as currently more and more vehicles are equipped with such devices. Considering that there are always traffic lights at the intersection of urban roads, we used vehicles stopped at red lights to collect traffic information by checking the location of all vehicles inside their VANET communication range periodically.

As shown in Figure 1, when the traffic light turned red for the east-west direction, the first vehicle stopped at the west side was selected as the traffic information collector (TIC). If there were no vehicle stops at the west side, the first vehicle stopped at the east side would be the TIC. In the example of Figure 1, vehicle V11 is the TIC and collects the traffic information during the red light period.

Step 1.: When the traffic light turns to red at T0, V11 broadcasts traffic information collection request.
Step 2.: All of the vehicles in the communication range $R$ of V11 will report their locations to V11 after receiving the request from V11.
Step 3.: V11 catalogs the vehicles to four vehicle sets according to the location. They are marked as $V s e t (E, T 0)$ , $V s e t (S, T 0)$ , $V s e t (W, T 0)$ , and $V s e t (N, T 0)$ .
Step 4.: After time $τ < R / V_{m a x}$ , V11 collects the traffic information again according to Steps 1–3 and obtains $V s e t (E, T 0 + τ)$ , $V s e t (S, T 0 + τ)$ , $V s e t (W, T 0 + τ)$ , $V s e t (N, T 0 + τ)$ . $V_{m a x}$ is the maximum allowed velocity. Time $τ < R / V_{m a x}$ will let all vehicles running towards the intersection be detectable at time $T 0 + τ$ .
For example:
If R = 150 m and $V_{m a x} = 20 m / s$ , $τ$ = 5 s can be used as $5 < \frac{150}{20} = 7.5$ . The maximal length a vehicle can run during $τ$ is $\frac{20 m}{s} \times 5 s = 100 m < 150 m$ . Then, no vehicle entering the intersection at T0 can run outside the communication range of V11 and be detectable at $T 0 + τ$ .
Step 5.: The vehicles’ set run from road A to road B is calculated by formula: $V s e t (A \to B, T 0 + τ) = V s e t (A, T 0) \cap V s e t (B, T 0 + τ)$ .
For example:
As shown in Figure 1, the traffic output from S to E from T0 to $T 0 + τ$ is:

$V s e t (S \to E, T 0 + τ) = V s e t (S, T 0) \cap V s e t (E, T 0 + τ) = {V 1, V 2, V 3, V 4} \cap {V 1, V 8, V 9, V 10} = {V 1}$
Step 6.: The TIC calculates the traffic output of each road with time interval $τ$ until the traffic light for the east-west direction turns green. As the traffic light for the south-north direction turns red, the first vehicle stopped at the north or south side will be selected as the TIC and collect traffic information continuously.

As a part of the advance travel information system (ATIS), the TIC will send out the collected traffic information via VANET for applications such as traffic prediction. In normal urban traffic conditions, there should always be vehicles stopped at the red light to act as the TIC. If there is no vehicle stopped at the red light to be the TIC, the traffic information of the last time slot will be used. This is acceptable as it normally happens in very low traffic density cases and real-time traffic information is not important.

With the collected traffic information, the traffic output of each road section j can be calculated by:

Q^{(h)}_{i j} (t) = \sum_{n} \sum_{m} V s e t (j \to j_{m}, T + n τ),

where

T + n τ \in (t - 1, t]

,

j

is the

h

order neighbor of road

i

,

j_{m}

is the

h - 1

order neighbor of road

i

(

j_{m}

is

i

when

h = 1

),

m

is the total number of

j_{m}

which are downstream

j

.

4. Main Results

For the LSTAR model, in this section we discuss how to determine the parameter

ϕ_{i, k h}

with a given weight matrix

W_{i}^{(h)} (t)

. With the determined weight matrix

W_{i}^{(h)} (t)

and

ϕ_{i, k h}

, the future traffic observation

\hat{z} (t)

of road

i

can be predicted. Normally,

w_{i j}^{(h)} (t)

is time variant. When the traffic flow distribution is stable in the road network,

w_{i j}^{(h)} (t)

will be time invariant since

\frac{Q^{(h)}_{i j} (t)}{\sum Q^{(h)} (t)}

is constant. Considering the weight matrix differences of being time invariant or not, two theorems are discussed, and the LSTAR parameters estimation can be conducted accordingly.

Definitions 1.

r_{i j} (m)

is the correlation coefficient of road

i

and

j

,

r_{i} (m) = r_{i i} (m)

is the autocorrelation coefficient of road

i

. We define

Φ_{i} = {[ϕ_{i, 10}, ϕ_{i, 11}, \dots ϕ_{i, 1 λ_{i, 0}}, ϕ_{i, 20}, ϕ_{i, 21}, \dots, ϕ_{i, p_{i} λ_{i, k}}]}^{T}

, let

n = \sum_{k = 1}^{p_{i}} λ_{i, k}

to be the dimension of

Φ_{i}

.

R_{i} = {[r_{i} (0), r_{i} (1), \dots, r_{i} (n - 1)]}^{T}, Σ = {[σ_{i}^{2}, 0, \dots, 0]}^{T}, S (h, m) = \sum_{j = 1}^{N_{i}} w_{i j}^{(h)} r_{i j} (m)

,

A = [\begin{matrix} S (0, 1) & S (1, 1) & \dots & S (λ_{i, k}, p_{i}) \\ S (0, 0) & S (1, 0) & \dots & S (λ_{i, k}, p_{i} - 1) \\ S (0, 1) & S (1, 1) & \dots & S (λ_{i, k}, p_{i} - 2) \\ ⋮ & ⋮ & ⋮ & ⋮ \\ S (0, n - 1) & S (1, n - 1) & \dots & S (λ_{i, k}, n - 1 - p_{i}) \end{matrix}], \bar{A} = [\begin{matrix} A & R_{i} - Σ \end{matrix}] .

Theorem 1 follows:

Theorem 1.

If

W_{i}^{(h)} (t)

is time invariant and

R a n k (A) = R a n k (\bar{A}) = n

, the LSTAR model can be uniquely determined.

Proof.

As

W_{i}^{(h)} (t)

is time invariant, we can observe that

W_{i}^{(h)} (t) = W_{i}^{(h)} (t - k)

and it can be rewritten to

W_{i}^{(h)}

.

The model of Equation (5) will be a time invariant system. Since

z_{i} (t)

is a stationary random signal, Equation (5) will be:

z_{i} (t) = \sum_{k = 1}^{p_{i}} \sum_{h = 0}^{λ_{i, k}} \sum_{j = 1}^{N_{i}} ϕ_{i, k h} w_{i j}^{(h)} z_{j} (t - k) + ε_{i} (t) .

(7)

Pre-multiplying both sides of the model (Equation (7)) by

z_{i} (t - m)

:

z_{i} (t) z_{i} (t - m) = \sum_{k = 1}^{p_{i}} \sum_{h = 0}^{λ_{i, k}} \sum_{j = 1}^{N_{i}} ϕ_{i, k h} w_{i j}^{(h)} z_{j} (t - k) z_{i} (t - m) + ε_{i} (t) z_{i} (t - m) .

(8)

Taking the expected values in both sides, we obtain an equation similar to the Yule-Walker equation:

r_{i} (m) = \sum_{k = 1}^{p_{i}} \sum_{h = 0}^{λ_{i, k}} \sum_{j = 1}^{N_{i}} ϕ_{i, k h} w_{i j}^{(h)} r_{i j} (m - k) + σ_{i}^{2} δ (m)

(9)

where the expected value is:

E (ε_{i} (t) z_{i} (t - m)) = σ_{i}^{2} δ (m) .

(10)

As

ϕ_{i, h k}

does not have tag j, we obtain:

r_{i} (m) = \sum_{k = 1}^{p_{i}} \sum_{h = 0}^{λ_{i, k}} ϕ_{i, k h} \sum_{j = 1}^{N_{i}} w_{i j}^{(h)} r_{i j} (m - k) + σ_{i}^{2} δ (m) .

(11)

For Equation (11), rewrite

ϕ_{i, k h}

to a column vector as

Φ_{i} = {[ϕ_{i, 10}, ϕ_{i, 11}, \dots, ϕ_{i, 1 λ_{i, 0}}, ϕ_{i, 20}, ϕ_{i, 21}, \dots ϕ_{i, k λ_{i, k}}]}^{T}

. In addition,

n = {\sum_{k = 1}^{p_{i}} λ}_{i, k}

is the number of parameters

ϕ_{i, k h}

to be estimated.

As

S (h, m) = \sum_{j = 1}^{N_{i}} w_{i j}^{(h)} r_{i, j} (m)

, we obtain:

[\begin{matrix} r_{i} (0) \\ r_{i} (1) \\ \begin{matrix} r_{i} (2) \\ ⋮ \\ r_{i} (n - 1) \end{matrix} \end{matrix}] = [\begin{matrix} S (0, 1) & S (1, 1) & \dots & S (λ_{i, k}, p_{i}) \\ S (0, 0) & S (1, 0) & \dots & S (λ_{i, k}, p_{i} - 1) \\ S (0, 1) & S (1, 1) & \dots & S (λ_{i, k}, p_{i} - 2) \\ \begin{matrix} ⋮ \\ S (0, n - 1) \end{matrix} & \begin{matrix} ⋮ \\ S (1, n - 1) \end{matrix} & \begin{matrix} ⋮ \\ \dots \end{matrix} & \begin{matrix} ⋮ \\ S (λ_{i, k}, n - 1 - p_{i}) \end{matrix} \end{matrix}] [\begin{matrix} ϕ_{i, 10} \\ ϕ_{i, 11} \\ \begin{matrix} ϕ_{i, 12} \\ ⋮ \\ ϕ_{i, k λ_{i, k}} \end{matrix} \end{matrix}] + [\begin{matrix} σ_{i}^{2} \\ 0 \\ \begin{matrix} 0 \\ ⋮ \\ 0 \end{matrix} \end{matrix}] .

(12)

Let

A = [\begin{matrix} S (0, 1) & S (1, 1) & \dots & S (λ_{i, k}, p_{i}) \\ S (0, 0) & S (1, 0) & \dots & S (λ_{i, k}, p_{i} - 1) \\ S (0, 1) & S (1, 1) & \dots & S (λ_{i, k}, p_{i} - 2) \\ ⋮ & ⋮ & ⋮ & ⋮ \\ S (0, n - 1) & S (1, n - 1) & \dots & S (λ_{i, k}, n - 1 - p_{i}) \end{matrix}]

,

R_{i} = [\begin{matrix} r_{i} (0) \\ \begin{matrix} r_{i} (1) \\ r_{i} (2) \\ ⋮ \end{matrix} \\ r_{i} (n - 1) \end{matrix}]

,

Σ = [\begin{matrix} σ_{i}^{2} \\ \begin{matrix} 0 \\ 0 \\ ⋮ \\ 0 \end{matrix} \end{matrix}]

.

Then, the augmented matrix of Equation (12) is:

\bar{A} = [\begin{matrix} S (0, 1) & S (1, 1) & \dots & S (λ_{i, k}, p_{i}) & r_{i} (0) - σ_{i}^{2} \\ S (0, 0) & S (1, 0) & \dots & S (λ_{i, k}, p_{i} - 1) & r_{i} (1) \\ S (0, 1) & S (1, 1) & \dots & S (λ_{i, k}, p_{i} - 2) & r_{i} (2) \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \\ S (0, n - 1) & S (1, n - 1) & \dots & S (λ_{i, k}, n - 1 - p_{i}) & r_{i} (n - 1) \end{matrix}] = [\begin{matrix} A & R_{i} \end{matrix} - Σ] .

If

R a n k (A) = n

, then

R a n k (\bar{A}) = n

and we will have a unique solution of parameters

Φ_{i}

:

Φ_{i} = A^{- 1} (R_{i} - Σ)

(13)

Then, we can uniquely define the LSTAR model and predict traffic flow with it.

If

R (A) \neq R (\bar{A})

, there is no solution for Equation (12).

If

R a n k (A) = R a n k (\bar{A}) < n

, there are infinite solutions for

Φ_{i}

.☐

Remark 1.

In case the spatial weight matrix

W_{i}^{(h)} (t)

is time invariant, we can determine the LSTAR prediction model by the correlation of roads. We can uniquely define the LSTAR model when

R a n k (A) = R a n k (\bar{A}) = n

. When

R a n k (A) = R a n k (\bar{A}) < n

, there will be many solutions for parameter

Φ

. This means that we have defined more parameters than are actually needed. We can reduce

λ_{i, k}

and/or

p_{i}

to obtain a unique LSTAR model.

Definitions 2.

Let

r_{i}^{'} (m)

be the autocorrelation coefficient of

i t h

element of

U (t) = W (t) Z (t)

, where the matrix

W (t) = \sum_{h = 0}^{λ_{i, k}} W_{i}^{(h)} (t)

, which combines all spatial effect defined in

W_{i}^{(h)} (t)

to one matrix, and

Z (t) = {[z_{1} (t), z_{2} (t), \dots, z_{N_{i}} (t)]}^{T}

is the vector form of

z_{i} (t)

. We define

Φ_{i} = {[ϕ_{i, 1}, ϕ_{i, 2}, \dots, ϕ_{i, p_{i}}]}^{T}

,

{\bar{R}}_{i}^{'} = {[{\bar{r}}_{i}^{'} (0), {\bar{r}}_{i}^{'} (1), \dots, {\bar{r}}_{i}^{'} (p_{i} - 1)]}^{T}

,

Σ = {[σ_{i}^{2}, 0, \dots, 0]}^{T}

,

A^{'} = [\begin{matrix} r_{i}^{'} (1) & r_{i}^{'} (2) & \dots & r_{i}^{'} (p_{i}) \\ r_{i}^{'} (0) & r_{i}^{'} (1) & \dots & r_{i}^{'} (p_{i} - 1) \\ ⋮ & ⋮ & ⋮ & ⋮ \\ r_{i}^{'} (p_{i} - 2) & r_{i}^{'} (p_{i} - 3) & \dots & r_{i}^{'} (1) \end{matrix}], \bar{A^{'}} = [\begin{matrix} A^{'} & {\bar{R}}_{i}^{'} - Σ \end{matrix}] .

We then present Theorem 2.

Theorem 2.

If

W_{i}^{(h)} (t)

is time variant, the combined weight matrix

W (t)

is full ranked, and

R a n k (A^{'}) = R a n k (\bar{A^{'}}) = p_{i}

, the LSTAR model can be uniquely determined.

Proof.

According to the weight matrix

W_{i}^{(h)} (t)

construction in the LSTAR model, the element

w_{i j}^{(h)} (t)

will always be zero when

i

,

j

is not at spatial order

h

. We combine all of the

λ_{i, k}

weight matrix into one weight matrix,

W (t)

=

\sum_{h = 0}^{λ_{i, k}} W_{i}^{(h)} (t)

. This simplification is reasonable as (1) time variant

w_{i j} (t)

can somehow give an effect similar to spatial order

h

; and (2)

w_{i j} (t)

is always equal to the only non-zero

w_{i j}^{(h)} (t)

:

w_{i j} (t) = 0 + 0 + \dots + w_{i j}^{(h)} (t) + \dots + 0 = w_{i j}^{(h)} (t) number of zero is λ_{i, k} - 1

Considering the combined spatial weight matrix

W (t) = \sum_{h = 0}^{λ_{i, k}} W_{i}^{(h)} (t)

,

Z (t) = {[z_{1} (t), z_{2} (t), \dots, z_{N_{i}} (t)]}^{T}

is an

N_{i}

dimension column vector that includes all neighbor roads within the spatial order to be considered by road

i

, and

Σ = {[σ_{i}^{2}, 0, \dots, 0]}^{T}

, we can obtain a matrix from the LSTAR model according to Equation (5).

Z (t) = \sum_{k = 1}^{p_{i}} ϕ_{i, k} W (t - k) Z (t - k) + Σ (t)

(14)

when the rank of

W (t)

is

N_{i}

, Equation (14) can be rewritten as:

{[W (t)]}^{- 1} [W (t) Z (t)] = \sum_{k = 1}^{p_{i}} ϕ_{i, k} W (t - k) Z (t - k) + Σ (t) .

(15)

Let

U (t) = [W (t) Z (t)]

, we obtain:

{[W (t)]}^{- 1} U (t) = \sum_{k = 1}^{p_{i}} ϕ_{i, k} U (t - k) + Σ (t) .

(16)

{[W_{i}^{(h)} (t)]}^{- 1}

can be treated as an instantaneous window to

U (t)

, so

U (t)

is stationary in the short term. We have:

\bar{U} (t) = \sum_{k = 1}^{p_{i}} ϕ_{i, k} U (t - k) + Σ (t) .

(17)

Let

u_{i} (t)

be the element of

U (t)

, then:

{\bar{u}}_{i} (t) = \sum_{k = 1}^{p_{i}} ϕ_{i, k} u_{i} (t - k) + ε_{i} (t) .

(18)

Pre-multiplying both sides of Equation (18) by

u_{i} (t - m)

:

{\bar{u}}_{i} (t) u_{i} (t - m) = \sum_{k = 1}^{p_{i}} ϕ_{i, k} u_{i} (t - k) u_{i} (t - m) + ε_{i} (t) u_{i} (t - m) .

(19)

Taking expected values in both sides, we obtain:

{\bar{r}}_{i}^{'} (m) = \sum_{k = 1}^{p_{i}} ϕ_{i, k} r_{i}^{'} (m - k) + σ_{i}^{2} δ (m)

(20)

where the expected value

r_{i}^{'} (m) = E (u_{i} (t - m) u_{i} (t - k))

,

{\bar{r}}_{i}^{'} (m) = E ({[W_{i}^{(h)} (t)]}^{- 1} u_{i} (t) u_{i} (t - m))

,

E (ε_{i} (t) u_{i} (t - m)) = σ_{i}^{2} δ (m) .

We can then obtain:

[\begin{matrix} {\bar{r}}_{i}^{'} (0) \\ \begin{matrix} {\bar{r}}_{i}^{'} (1) \\ ⋮ \end{matrix} \\ {\bar{r}}_{i}^{'} (p_{i} - 1) \end{matrix}] = [\begin{matrix} r_{i}^{'} (1) & r_{i}^{'} (2) & \dots & r_{i}^{'} (p_{i}) \\ r_{i}^{'} (0) & r_{i}^{'} (1) & \dots & r_{i}^{'} (p_{i} - 1) \\ ⋮ & ⋮ & ⋮ & ⋮ \\ r_{i}^{'} (p_{i} - 2) & r_{i}^{'} (p_{i} - 3) & \dots & r_{i}^{'} (1) \end{matrix}] [\begin{matrix} ϕ_{i, 1} \\ ϕ_{i, 2} \\ \begin{matrix} ⋮ \\ ϕ_{i, p_{i}} \end{matrix} \end{matrix}] + [\begin{matrix} σ_{i}^{2} \\ \begin{matrix} 0 \\ ⋮ \\ 0 \end{matrix} \end{matrix}]

(21)

Let

Φ_{i} = [\begin{matrix} ϕ_{i, 1} \\ ϕ_{i, 2} \\ \begin{matrix} ⋮ \\ ϕ_{i, p_{i}} \end{matrix} \end{matrix}]

,

A^{'} = [\begin{matrix} r_{i}^{'} (1) & r_{i}^{'} (2) & \dots & r_{i}^{'} (p_{i}) \\ r_{i}^{'} (0) & r_{i}^{'} (1) & \dots & r_{i}^{'} (p_{i} - 1) \\ ⋮ & ⋮ & ⋮ & ⋮ \\ r_{i}^{'} (p_{i} - 2) & r_{i}^{'} (p_{i} - 3) & \dots & r_{i}^{'} (1) \end{matrix}]

,

{\bar{R}}_{i}^{'} = [\begin{matrix} {\bar{r}}_{i}^{'} (0) \\ \begin{matrix} {\bar{r}}_{i}^{'} (1) \\ ⋮ \end{matrix} \\ {\bar{r}}_{i}^{'} (p_{i} - 1) \end{matrix}]

,

Σ = [\begin{matrix} σ_{i}^{2} \\ \begin{matrix} 0 \\ ⋮ \\ 0 \end{matrix} \end{matrix}]

.

Then the augmented matrix of Equation (21) is:

\bar{A^{'}} = [\begin{matrix} r_{i}^{'} (1) & r_{i}^{'} (2) & \dots & r_{i}^{'} (p_{i}) & {\bar{r}}_{i}^{'} (0) - σ_{i}^{2} \\ r_{i}^{'} (0) & r_{i}^{'} (1) & \dots & r_{i}^{'} (p_{i} - 1) & {\bar{r}}_{i}^{'} (1) \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \\ r_{i}^{'} (p_{i} - 2) & r_{i}^{'} (p_{i} - 3) & \dots & r_{i}^{'} (1) & {\bar{r}}_{i}^{'} (p_{i} - 1) \end{matrix}] = [\begin{matrix} A^{'} & {\bar{R}}_{i}^{'} - Σ \end{matrix}] .

If

R a n k (A^{'}) = p_{i}

, then

R a n k (\bar{A^{'}}) = p_{i}

and we will have a unique solution of parameters

Φ

:

Φ_{i} = {[A^{'}]}^{- 1} ({\bar{R}}_{i}^{'} - Σ) .

(22)

If

R (A^{'}) \neq R (\bar{A^{'}})

, there is no solution for Equation (21).

If

R a n k (A^{'}) = R a n k (\bar{A^{'}}) < p_{i}

, there are infinite solutions for

Φ

. ☐

Remark 2.

Unlike

W_{i}^{(h)} (t)

with most of its elements being zero and normally not being full ranked, most elements of the combined weight matrix

W (t)

are not zero. So

W (t)

is normally a full rank matrix. For some special cases when

W (t)

is not a full rank matrix, we can reduce the size of

W (t)

to make it fully ranked. When

W (t)

is a full rank matrix, we can uniquely define the LSTAR model when

R a n k (A) = R a n k (\bar{A}) = n

. Similar to Remark 1, we can reduce

p_{i}

to obtain a unique LSTAR model if

R a n k (A) = R a n k (\bar{A}) < n

.

In this section, two theorems were given and proven according to the weight matrix determined. When the traffic flow distribution is stable,

W_{i}^{(h)} (t)

can be treated as time invariant and Theorem 1 can be used. When the traffic flow distribution is not stable,

W_{i}^{(h)} (t)

is time variant and Theorem 2 should be used. With the measured weight matrix

W_{i}^{(h)} (t)

and estimated

Φ_{i}

, future traffic state

{\hat{z}}_{i} (t + 1)

can be predicted according to the LSTAR model (Equation (5)) by one-time slot shifting.

5. Practical Example and Experimental Evaluation

5.1. Practical Example

In this paper, we provide a practical example on how to use our LSTAR model to predict future traffic flow of the Shanghai Century Park area. To evaluate the prediction approach of LSTAR, we adopted the widely used traffic simulation tools Simulation of Urban Mobility (SUMO) [39] and OpenStreetMap (OSM) [40], which are recognized as promising candidates for traffic simulations, and the simulation results are commonly accepted as a replacement of real data. Additionally, plenty of works exist that have adopted SUMO and OSM as tools to generate traffic data for research [41,42].

In this example, we demonstrate the model-building procedure for our proposed LSTAR model in the context of traffic flow prediction on a road network. First, we downloaded the OSM format road network map of the area near Shanghai Century Park, as shown in Figure 2. The OSM format map not only included the geography topology of the road network, but also the road type, lane number, speed limitation, traffic light duration, and so on, according to real-world information. Then, the SUMO NetConvert tool was used to convert the OSM format map to a SUMO format map. SUMO was then used to simulate the traffic flow of this area according to the road network information converted from the OSM map.

In the simulation, trip demands were generated randomly every two seconds according to the edge length. The “Fringe factor” was set to 4, which means that roads with no successor or no predecessor had four times the possibility of being selected as the start or end of a trip when compared to other roads. The speed limitations, traffic light durations of each road, and so on were obtained from the real-world data of the OSM map. The simulation duration was one week. The detailed simulation parameters are listed in Table 1.

After we obtained traffic flow data generated from SUMO, they were used to conduct traffic flow prediction with different prediction models. The prediction intervals were five minutes, 15 min, and 30 min, as normally a prediction interval over 30 min has less significance to real-time route planning or vehicle navigation.

The SUMO format map converted from the OSM map is shown in Figure 3. The roads were renamed as Rn-m for easy usage in the following discussion. In the following section, road R7-3 in a north-to-south direction was selected as the example road to demonstrate the LSTAR prediction procedure. Furthermore, we conducted traffic flow prediction for roads R7-2, R3-3, and R3-4 with the same procedure used for road R7-3.

Construction of a Dynamic Spatial Weight Matrix

Step 1. Build a spatial adjacency matrix.

The first step was to build a spatial adjacency matrix based on the topological structure of the network, which appears in Figure 1. In this paper, spatial adjacency matrices of spatial orders up to three were constructed as per Reference [13]. The spatial neighborhood information can be found in Table 2 with the first, second, and third order neighbors separated.

Step 2. Determine the dynamic spatial order and weights.

The second step was to determine the dynamical spatial order and weights for every road link with the method proposed in this paper (Section 3.2. Weight Matrix Construction). In this simulation, only road R7-3 in a north-to-south direction was selected to show how the weight matrix was determined. According to weight matrix definition, only upstream road sections of R7-3 in the north-to-south direction were considered. The dynamic spatial weights calculation results of road R7-3 in a north-to-south direction with a five-minute time step are shown in Table 2.

With the dynamic spatial weights estimated in Table 2, we can see that the weights are time variant in this case as the traffic flow was time variant. Then, we used Theorem 2 to conduct a parameters estimation and traffic flow prediction.

After the future traffic states are predicted, the information can be used to conduct route planning or predictive traffic signal control applications, and so on.

5.2. Experimental Evaluation

The traffic flow prediction accuracy results of the different prediction methods by means of Root Mean Square Error (RMSE) are shown in Figure 4. Figure 5 shows the average RMSE and Root Mean Square Percentage Error (RMSPE). The average of Figure 4 and Figure 5 is the average RMSE, RMSPE values of roads R7-3, R7-2, R3-3, and R3-4 per the prediction models. The definition of RMSE and RMSPE are shown below.

RMSE = \sqrt{\frac{\sum_{i = 1}^{n} {(x_{i} - {\hat{x}}_{i})}^{2}}{n}}

(23)

RMSPE = \frac{\sqrt{\frac{\sum_{i = 1}^{n} {(x_{i} - {\hat{x}}_{i})}^{2}}{n}}}{\frac{\sum_{i = 1}^{n} x_{i}}{n}} \times 100 %

(24)

where n is the prediction interval number,

x_{i}

is the actual value, and

{\hat{x}}_{i}

is the prediction value.

The average RMSE and RMSPE values of all road sections are shown in Figure 5.

From Figure 4, the results showed that on most roads, the prediction accuracies of the different prediction models were similar for all intervals. The predication accuracy from low to high was Shift, AR, Seasonal MA, STAR, LSTAR, and LSTARIMA, with some exceptions on R3-4 and R3-3. From Figure 5, we can see that the RMSE increased as the prediction intervals increased for all prediction methods, while the RMSPE decreased when the prediction intervals increased. This indicates that although the absolute error increased as the prediction intervals increased, the actual prediction accuracy increased with larger prediction intervals as the percentage form errors decreased. Figure 5 also shows that, regardless of the prediction interval, the average prediction accuracy of LSTAR was always better than Shift, AR, Seasonal MA, and STAR. Moreover, LSTARIMA always had a little higher accuracy when compared to LSTAR in all prediction intervals.

According to Diebold [43], only comparing values such as RMSE is not sufficient to declare that one prediction model is better than another without a statistics significance check. There are many hypothesis tests designed for prediction accuracy comparison and the Diebold-Mariano (DM) test [43] is the most popular one. To further evaluate the prediction performance of LSTAR, the DM test was used to check if LSTAR was better than other statistically significant prediction models. The forecast package of R [44] was used to conduct the DM test for the prediction results of road R7-3. As the DM test can only compare the prediction accuracy of two models, we did the DM test for LSTAR and the other models one by one. The DM test hypothesis was that LSTAR had better performance than all of the other methods subjected to the test. The p-values of each DM test are shown in Table 3.

The DM test results showed that the LSTAR prediction accuracy was significantly better than Shift at p-value < 1%, better than AR at p-value < 5%, and almost better than Seasonal MA and STAR at p-value < 10% (one p-value of STAR > 10%). LSTARIMA was not significantly better than LSTAR at p-value < 10%, as no p-value was greater than 90% with the hypothesis that LSTAR is better than LSTARIMA.

6. Conclusions

This paper discussed the application of a local space-time autoregressive (LSTAR) model for traffic flow prediction. In this paper, we showed the prediction process of the LSTAR model in detail. The LSTAR model appears to be the best model among the Shift, AR, Seasonal MA, and STAR models given its greater parameter flexibility (dynamic spatial neighborhood and dynamic spatial weight). According to the DM test results, the LSTAR prediction accuracy was significantly better than Shift and AR, and was better than seasonal MA and STAR, but not significantly. As LSTARIMA also considers the local spatial and time dynamics and still keeps the MA component, the prediction accuracy was always better than the LSTAR model in the simulation results. However, the decrease in LSTAR prediction accuracy was very minor when compared to LSTARIMA, and was not statically significant. Furthermore, the computational complexity of the LSTAR model was also lower than that of the LSTARIMA model. Therefore, there existed a tradeoff between the prediction accuracy and the computational complexity for the two models.

Future studies will be carried out to assess the performance of the LSTAR model with different real-world traffic data and the usage of prediction data for different urban traffic applications. We will also conduct the simulation and performance evaluation of the traffic information collection via the VANET approach.

Acknowledgments

This work is supported by the NSF of China under Grants No. 61772130, No. 61301118, No. 71171045; The International S&T Cooperation Program of Shanghai Science and Technology Commission under Grant No. 15220710600, and the Innovation Program of Shanghai Municipal Education Commission under Grant No. 14YZ130.

Author Contributions

Jianbin Chen and Demin Li conceived and designed the method. Jianbin Chen performed the experiments and analyzed the experimental data. Finally, Jianbin Chen and Xiaolu Zhang wrote the paper with the help of Demin Li and Guanglin Zhang.

Conflicts of Interest

The authors declare no conflict of interest.

References

United States Department of Transportation, National Transportation Statistics. Table 1-72: Annual Highway Congestion Cost. 2017. Available online: https://www.rita.dot.gov/bts/sites/rita.dot.gov.bts/files/NTS_Entire_2017Q2.pdf (accessed on 8 January 2018).
Alam, M.; Ferreira, J.; Fonseca, J. Introduction to Intelligent Transportation Systems; Springer: Cham, Switzerland, 2016; pp. 552–557. [Google Scholar] [CrossRef]
Kong, Q.J.; Xu, Y.; Lin, S.; Wen, D.; Zhu, F.; Liu, Y. UTN-Model-Based Traffic Flow Prediction for Parallel-Transportation Management Systems. IEEE Trans. Intell. Transp. Syst. 2013, 14, 1541–1547. [Google Scholar] [CrossRef]
Lighthill, M.J.; Whitham, G.B. On kinematic waves II. A therory of traffic flow on long crowded roads. Proc. R. Soc. A Math. Phys. Eng. Sci. 1955, 229, 317–345. [Google Scholar] [CrossRef]
Richards, P.I. Shock Waves on the Highway. Op. Res. 1956, 4, 42–51. [Google Scholar] [CrossRef]
Tian, J.F.; Li, G.Y.; Treiber, M.; Jiang, R.; Jia, N.; Ma, S.F. Cellular automaton model simulating spatiotemporal patterns, phase transitions and concave growth pattern of oscillations in traffic flow. Trans. Res. B Methodol. 2016, 93, 560–575. [Google Scholar] [CrossRef]
Box, G.E.; Jenkins, G.M. Time Series Analysis: Forecasting and Control; Holden-Day: Oakland, CA, USA, 1976; Volume 31, p. 303. [Google Scholar]
Williams, B.M.; Hoel, L.A. Modeling and Forecasting Vehicular Traffic Flow as a Seasonal ARIMA Process: Theoretical Basis and Empirical Results. J. Trans. Eng. 2003, 129, 664–672. [Google Scholar] [CrossRef]
Kamarianakis, Y.; Prastacos, P. Forecasting traffic flow conditions in an urban network—Comparison of multivariate and univariate approaches. Trans. Res. Rec. 2003, 74–84. [Google Scholar] [CrossRef]
Pfeifer, P.E.; Deutsch, S.J. A Three-Stage Iterative Procedure for Space-Time Modeling. Technometrics 1980, 22, 35–47. [Google Scholar] [CrossRef]
Kamarianakis, Y.; Prastacos, P. Space–time modeling of traffic flow. Comput. Geosci. 2005, 31, 119–133. [Google Scholar] [CrossRef]
Min, X.; Hu, J.; Chen, Q.; Zhang, T.; Zhang, Y. Short-term traffic flow forecasting of urban network based on dynamic STARIMA model. In Proceedings of the International IEEE Conference on Intelligent Transportation Systems, St. Louis, MO, USA, 4–7 October 2009; pp. 1–6. [Google Scholar] [CrossRef]
Cheng, T.; Wang, J.; Haworth, J.; Heydecker, B.; Chow, A. A Dynamic Spatial Weight Matrix and Localized Space—Time Autoregressive Integrated Moving Average for Network Modeling. Geogr. Anal. 2014, 46, 75–97. [Google Scholar] [CrossRef]
Wan, Y.; Huang, Y.; Buckles, B. Camera calibration and vehicle tracking: Highway traffic video analytics. Trans. Res. Part C 2014, 44, 202–213. [Google Scholar] [CrossRef]
Mehta, V.; Chana, I. Urban Traffic State Estimation Techniques Using Probe Vehicles: A Review. In Computing and Network Sustainability; Vishwakarma, H., Akashe, S., Eds.; Lecture Notes in Networks and Systems; Springer: Singapore, 2017; Volume 12, pp. 273–281. [Google Scholar] [CrossRef]
Lai, W.-K.; Kuo, T.-H.; Chen, C.-H. Vehicle Speed Estimation and Forecasting Methods Based on Cellular Floating Vehicle Data. Appl. Sci. 2016, 6, 47. [Google Scholar] [CrossRef]
Zhang, G.; Xu, Y.; Wang, X.; Tian, X.; Liu, J.; Gan, X.; Qian, L. Multicast capacity for VANETs with directional antenna and delay constraint. IEEE J. Sel. Areas Commun. 2012, 30, 818–833. [Google Scholar] [CrossRef]
Zhang, G.; Liu, J.; Ren, J. Multicast capacity of cache enabled content-centric wireless Ad Hoc networks. China Commun. 2017, 14, 1–9. [Google Scholar] [CrossRef]
Ren, J.; Zhang, G.; Li, D. Multicast capacity for VANETs with directional antenna and delay constraint under random walk mobility model. IEEE Access 2017, 5, 3958–3970. [Google Scholar] [CrossRef]
Guo, C.; Li, D.; Zhang, G.; Cui, Z. Data delivery delay reduction for VANETs on bi-directional roadway. IEEE Access 2017, 4, 8514–8524. [Google Scholar] [CrossRef]
Hussain, R.; Kim, S.; Oh, H. Traffic Information Dissemination System: Extending Cooperative Awareness among Smart Vehicles with Only Single-Hop Beacons in VANET. Wirel. Pers. Commun. 2016, 88, 151–172. [Google Scholar] [CrossRef]
Li, D.; Li, Q.; Wang, J. Traffic information collecting algorithms for road selection decision support in vehicle ad hoc networks. Int. J. Simul. Proc. Modell. 2012, 7, 50–56. [Google Scholar] [CrossRef]
Darwish, T.; Bakar, A.K. Traffic density estimation in vehicular ad hoc networks: A review. IEICE Trans. Inf. Syst. 2015, 24, 337–351. [Google Scholar] [CrossRef]
Guo, J.; Huang, W.; Williams, B.M. Adaptive Kalman filter approach for stochastic short-term traffic flow rate prediction and uncertainty quantification. Transp. Res. Part C Emerg. Technol. 2014, 43, 50–64. [Google Scholar] [CrossRef]
Abidin, A.F.; Kolberg, M. Towards improved vehicle arrival time prediction in public transportation: integrating SUMO and Kalman filter models. In Proceedings of the 2015 17th UKSim-AMSS International Conference on Modelling and Simulation (UKSim), Cambridge, UK, 25–27 March 2015; pp. 147–152. [Google Scholar] [CrossRef]
Çetiner, B.G.; Sari, M.; Borat, O. A Neural Network Based Traffic-Flow Prediction Model. Math. Comput. Appl. 2010, 15, 269–278. [Google Scholar] [CrossRef]
Tang, J.; Liu, F.; Zou, Y.; Zhang, W.; Wang, Y. An Improved Fuzzy Neural Network for Traffic Speed Prediction Considering Periodic Characteristic. IEEE Trans. Intell. Transp. Syst. 2017, 18, 2340–2350. [Google Scholar] [CrossRef]
Ma, Y.; Chowdhury, M.; Sadek, A.; Jeihani, M. Integrated Traffic and Communication Performance Evaluation of an Intelligent Vehicle Infrastructure Integration (VII) System for Online Travel-Time Prediction. IEEE Trans. Intell. Transp. Syst. 2012, 13, 1369–1382. [Google Scholar] [CrossRef]
Deng, L.; He, Z.; Zhong, R. The Bus Travel Time Prediction Based on Bayesian Networks. In Proceedings of the 2013 International Conference on Information Technology and Applications, Chengdu, China, 16–17 November 2013; pp. 282–285. [Google Scholar] [CrossRef]
Yu, B.; Song, X.L.; Guan, F.; Yang, Z.M.; Yao, B.Z. k-Nearest Neighbor Model for Multiple-Time-Step Prediction of Short-Term Traffic Condition. J. Transp. Eng. 2016, 142. [Google Scholar] [CrossRef]
Qi, Y.; Ishak, S. A Hidden Markov Model for short term prediction of traffic conditions on freeways. Transp. Res. Part C Emerg. Technol. 2014, 43, 95–111. [Google Scholar] [CrossRef]
Lv, Y.; Duan, Y.; Kang, W.; Li, Z.; Wang, F.Y. Traffic Flow Prediction With Big Data: A Deep Learning Approach. IEEE Trans. Intell. Transp. Syst. 2015, 16, 865–873. [Google Scholar] [CrossRef]
Dhivyabharathi, B.; Hima, E.S.; Vanajakshi, L. Stream travel time prediction using particle filtering approach. Transp. Lett. Int. J. Transp. Res. 2016, 1–8. [Google Scholar] [CrossRef]
Martino, L.; Read, J.; Elvira, V.; Louzada, F. Cooperative parallel particle filters for online model selection and applications to urban mobility. Digit. Signal Proc. 2017, 60, 172–185. [Google Scholar] [CrossRef]
Liebig, T.; Piatkowski, N.; Bockermann, C.; Morik, K. Dynamic route planning with real-time traffic predictions. Inf. Syst. 2017, 64, 258–265. [Google Scholar] [CrossRef]
Florin, R.; Olariu, S. A survey of vehicular communications for traffic signal optimization. Veh. Commun. 2015, 2, 70–79. [Google Scholar] [CrossRef]
Bo, W. Estimation of Autoregressive Moving-Average Models via High-Order Autoregressive Approximations. J. Time 2010, 10, 283–299. [Google Scholar] [CrossRef]
Griffith, D.A.; Heuvelink, G.B.M. Deriving Space-Time Variograms from Space-Time Autoregressive (STAR) Model Specifications. In Proceedings of the StatGIS09: Geo Informatics for Environmental Surveillance, Milos, Greece, 17–19 June 2009; Volume 38, pp. 285–303. [Google Scholar] [CrossRef]
Behrisch, M.; Bieker, L.; Erdmann, J.; Krajzewicz, D. SUMO—Simulation of Urban Mobility: An Overview; SIMUL: Barcelona, Spain, 2011; pp. 63–68. [Google Scholar]
Haklay, M.; Weber, P. OpenStreetMap: User-Generated Street Maps. IEEE Pervasive Comput. 2008, 7, 12–18. [Google Scholar] [CrossRef]
Wang, Y.; Jiang, J.; Mu, T. Context-Aware and Energy-Driven Route Optimization for Fully Electric Vehicles via Crowdsourcing. IEEE Trans. Intell. Transp. Syst. 2013, 14, 1331–1345. [Google Scholar] [CrossRef]
Griggs, W.M.; Ordóñez-Hurtado, R.H.; Crisostomi, E.; Häusler, F.; Massow, K.; Shorten, R.N. A Large-Scale SUMO-Based Emulation Platform. IEEE Trans. Intell. Transp. Syst. 2015, 16, 3050–3059. [Google Scholar] [CrossRef]
Diebold, F.X.; Mariano, R.S. Comparing Predictive Accuracy. J. Bus. Econ. Stat. 1995, 20, 134–144. [Google Scholar] [CrossRef]
Coreteam, R. R: A language and environment for statistical computing. Computing 2015, 1, 12–21. [Google Scholar]

Figure 1. Traffic information collection.

Figure 2. OpenStreetMap (OSM) map of Shanghai Century Park area.

Figure 3. Simulation of Urban Mobility (SUMO) road network.

Figure 4. Prediction accuracy comparison of different models with: (a) 5-min prediction interval; (b) 15-min prediction interval; and (c) 30-min prediction interval.

Figure 5. Average prediction accuracy comparison of different models: (a) Root Mean Square Error (RMSE); and (b) Root Mean Square Percentage Error (RMSPE).

Table 1. Simulation parameters.

Parameters	Value
Trip Generation Method	Random
Trip Possibility Weight	Edge Length
New Trip Start Interval	2 s
Fringe Factor	4
Max Vehicle Number	300
Traffic Light Duration	OSM Map data
Speed Limitation	OSM Map data
Simulation Duration	604,800 s (1 Week)

Table 2. The dynamic spatial weights.

Spatial	First			Second							Third
Temporal Order $k$	R7-2	R3-3	R3-4	R7-1	R2-3	R2-4	R3-2	R6-1	R3-5	R8-1	R3-1	R5-1	R2-2	R1-1	R1-2
5	0.74	0.11	0.16	0.48	0.22	0.13	0.13	0.00	0.04	0.00	0.17	0.09	0.22	0.26	0.26
10	0.69	0.31	0.00	0.29	0.21	0.11	0.25	0.07	0.04	0.04	0.43	0.07	0.13	0.33	0.03
15	0.61	0.35	0.04	0.59	0.07	0.07	0.21	0.03	0.00	0.03	0.29	0.13	0.10	0.42	0.06
20	0.65	0.23	0.13	0.49	0.19	0.08	0.16	0.03	0.03	0.03	0.26	0.12	0.06	0.44	0.12
25	0.72	0.07	0.21	0.63	0.13	0.08	0.13	0.00	0.04	0.00	0.43	0.00	0.09	0.35	0.13
30	0.36	0.27	0.36	0.32	0.09	0.09	0.32	0.09	0.05	0.05	0.17	0.08	0.00	0.58	0.17
35	0.38	0.38	0.25	0.59	0.29	0.06	0.00	0.00	0.00	0.06	0.23	0.03	0.16	0.48	0.10
40	0.78	0.11	0.11	0.48	0.14	0.07	0.21	0.03	0.07	0.00	0.36	0.04	0.04	0.48	0.08
45	0.74	0.19	0.07	0.50	0.23	0.08	0.19	0.00	0.00	0.00	0.26	0.07	0.04	0.52	0.11
50	0.71	0.14	0.14	0.49	0.16	0.08	0.19	0.05	0.00	0.03	0.19	0.11	0.08	0.56	0.06
55	0.53	0.33	0.13	0.46	0.17	0.13	0.13	0.04	0.08	0.00	0.11	0.05	0.21	0.58	0.05
60	0.47	0.27	0.27	0.29	0.24	0.05	0.24	0.10	0.10	0.00	0.19	0.24	0.29	0.29	0.00
65	0.55	0.27	0.18	0.39	0.18	0.09	0.12	0.12	0.00	0.09	0.25	0.16	0.19	0.38	0.03
70	0.48	0.33	0.19	0.41	0.19	0.15	0.15	0.00	0.07	0.04	0.26	0.06	0.13	0.52	0.03
75	0.56	0.25	0.19	0.50	0.17	0.08	0.13	0.08	0.04	0.00	0.33	0.06	0.11	0.50	0.00
80	0.71	0.29	0.00	0.43	0.19	0.00	0.33	0.05	0.00	0.00	0.24	0.16	0.12	0.36	0.12
85	0.76	0.10	0.14	0.59	0.09	0.05	0.18	0.05	0.05	0.00	0.29	0.04	0.04	0.54	0.08
90	0.67	0.13	0.20	0.46	0.14	0.07	0.25	0.04	0.04	0.00	0.33	0.00	0.21	0.42	0.04
95	0.23	0.69	0.08	0.40	0.20	0.05	0.15	0.00	0.10	0.10	0.22	0.13	0.04	0.39	0.22
100	0.79	0.10	0.10	0.24	0.19	0.10	0.38	0.05	0.05	0.00	0.13	0.13	0.08	0.65	0.03

Table 3. Diebold-Mariano (DM) test results; AR: autoregressive; Seasonal MA: seasonal moving average; STAR: Space-Time AR.

Prediction Model	5 min	15 min	30 min
Shift	0.0000	0.0000	0.0000
AR	0.0351	0.0225	0.0000
Seasonal MA	0.0611	0.0822	0.07814
STAR	0.1023	0.0884	0.0929
LSTARIMA	0.8985	0.7828	0.6797

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Chen, J.; Li, D.; Zhang, G.; Zhang, X. Localized Space-Time Autoregressive Parameters Estimation for Traffic Flow Prediction in Urban Road Networks. Appl. Sci. 2018, 8, 277. https://doi.org/10.3390/app8020277

AMA Style

Chen J, Li D, Zhang G, Zhang X. Localized Space-Time Autoregressive Parameters Estimation for Traffic Flow Prediction in Urban Road Networks. Applied Sciences. 2018; 8(2):277. https://doi.org/10.3390/app8020277

Chicago/Turabian Style

Chen, Jianbin, Demin Li, Guanglin Zhang, and Xiaolu Zhang. 2018. "Localized Space-Time Autoregressive Parameters Estimation for Traffic Flow Prediction in Urban Road Networks" Applied Sciences 8, no. 2: 277. https://doi.org/10.3390/app8020277

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Localized Space-Time Autoregressive Parameters Estimation for Traffic Flow Prediction in Urban Road Networks

Abstract

1. Introduction

2. State-of-the-Art and Related Topics

2.1. Traffic Information Collection

2.2. Traffic Prediction

2.3. Urban Traffic Applications

3. Model and Preliminaries

3.1. LSTAR Model Construction

3.2. Weight Matrix Construction

3.3. Traffic Information Collection

4. Main Results

5. Practical Example and Experimental Evaluation

5.1. Practical Example

Construction of a Dynamic Spatial Weight Matrix

5.2. Experimental Evaluation

6. Conclusions

Acknowledgments

Author Contributions

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI