Region-Level Traffic Prediction Based on Temporal Multi-Spatial Dependence Graph Convolutional Network from GPS Data

Yang, Haiqiang; Zhang, Xinming; Li, Zihan; Cui, Jianxun

doi:10.3390/rs14020303

Open AccessArticle

Region-Level Traffic Prediction Based on Temporal Multi-Spatial Dependence Graph Convolutional Network from GPS Data

¹

Institute for Future, School of Automation, Qingdao University, Qingdao 266071, China

²

Shandong Key Laboratory of Industial Control Technology, Qingdao 266071, China

³

Logistics and E-commerce School, Zhejiang Wanli University, Ningbo 315100, China

⁴

Institute for Future, College of Physics, Qingdao University, Qingdao 266071, China

⁵

School of Transportation Science and Engineering, Harbin Institute of Technology, Harbin 150090, China

^*

Author to whom correspondence should be addressed.

Remote Sens. 2022, 14(2), 303; https://0-doi-org.brum.beds.ac.uk/10.3390/rs14020303

Submission received: 13 December 2021 / Revised: 4 January 2022 / Accepted: 6 January 2022 / Published: 10 January 2022

(This article belongs to the Special Issue Applications of Remote Sensing to Inland Transportation Infrastructure Monitoring and Intelligent Transport System Planning)

Download

Browse Figures

Versions Notes

Abstract

:

Region-level traffic information can characterize dynamic changes of urban traffic at the macro level. Real-time region-level traffic prediction help city traffic managers with traffic demand analysis, traffic congestion control, and other activities, and it has become a research hotspot. As more vehicles are equipped with GPS devices, remote sensing data can be collected and used to conduct data-driven region-level-based traffic prediction. However, due to dynamism and randomness of urban traffic and the complexity of urban road networks, the study of such issues faces many challenges. This paper proposes a new deep learning model named TmS-GCN to predict region-level traffic information, which is composed of Graph Convolutional Network (GCN) and Gated Recurrent Unit (GRU). The GCN part captures spatial dependence among regions, while the GRU part captures the dynamic change of traffic within the region. Model verification and comparison are carried out using real taxi GPS data from Shenzhen. The experimental results show that the proposed model outperforms both the classic time series prediction model and the deep learning model at different scales.

Keywords:

region-level traffic; traffic prediction; temporal and spatial; graph convolutional network

1. Introduction

Region-level traffic prediction has received increasing attention in the field of Intelligent Transportation System (ITS). It is a key aspect of urban traffic management. With the widespread use of remote sensing equipment, such as the Global Positioning System (GPS), radar, and other sensors, remote sensing data-driven-based region-level traffic predictions has become popular [1,2,3]. Vehicles equipped with GPS devices that travel on urban roads can dynamically upload their latitude, longitude, velocity, and other data to a server. When the number of such vehicles reaches tens of thousands, traffic managers can obtain dynamic traffic information of urban roads, such as traffic flow and traffic speed, to support various research and applications in the field of Intelligent Transportation System. For instance, identifying areas of interest (AOIs) based on taxi GPS in New York City [4], estimating urban network-wide traffic speed estimation based on massive ride-sourcing GPS traces [5], assessing individual activity-related exposures to traffic congestion using GPS trajectory data [6], and so on. Therefore, using GPS data to perform region-level traffic prediction is reasonable and representative. Region-level traffic prediction, on the one hand, can support the development of real-time traffic management applications such as traffic control and traffic guidance, and, on the other hand, it can also help carry out real-time OD estimation [7]. For instance, using the average speed of a specific region deviates from the average value, and the traffic manager can reschedule the traffic signal in advance or send the police to divert traffic. With region-level dynamic traffic flow information, which represents travel demand, companies can dispatch vehicles based on this forecast information to obtain greater economic benefits.

However, due to the complexity and relevance of the traffic state in both the time and space dimensions, traffic prediction at the region level is of a great challenge. Firstly, traffic prediction is a time-series task that uses historical traffic information in the region to forecast future traffic information. As a result, many studies based on time series models, such as the Autoregressive Integrated Moving Average (ARRIMA) model [8], the Kalman filtering model [9], Bayesian network [10], Neural Network [11], Recurrent Neural Network [12], and so on, have been proposed. However, these models only consider temporal dependence and ignore spatial dependence of adjacent regions, which has an impact on prediction result. In recent years, some studies have proposed new hybrid models based on the time-series model and spatial feature mining model to comprehensively use the temporal dependence of the predicted region and spatial dependence from adjacent regions to forecast traffic information [13,14].

Furthermore, regular regions divided by square grids and hexagons cannot accurately describe the evolution of traffic conditions due to the complexity of the road network and randomness of traffic. Although irregularly divided regions based on natural boundaries, such as roads, can accurately describe the evolution of the traffic state, it is difficult to extract spatial dependence from non-Euclidean distance data (irregularly regions). With the application of Graph Convolutional Network, it is now possible to extract spatial features from non-Euclidean distance data [15,16]. Most studies based on GCN assumed that adjacent regions have the same effect on the predicted region, which is imprecise. There are two errors, as shown in Figure 1: (1) g1 and g2 are adjacent grids of g4, and the traffic flow from g2 to g4 is obviously greater than the one from g1 to g4; therefore, it is reasonable that the influence of g2 is greater than that of g1 when extracting spatial features of g4; (2) g3 and g5 are adjacent regions of g4, and

d_{34}

and

d_{45}

are the distances from the centroid of GPS points located in g3 to the centroid of GPS points located in g4 and from the centroid of GPS points located in g5 to the centroid of GPS points located in g4, respectively. Due to the fact that

d_{34}

is greater than

d_{45}

, it is reasonable that the influence of g5 is greater than that of g3 when extracting the spatial features of g4. However, most existing models cannot consider these spatial dependencies, resulting in poor prediction results.

In order to solve the above problems, we propose a new traffic state prediction method called the temporal multi-spatial dependence graph convolutional network (TmS-GCN) to forecast region-level traffic states. Our main contributions in this study are summarized as follows:

We proposed the TmS-GCN model, which integrates Gated Recurrent Units (GRU) and GCN. GRU is used to obtain temporal dependence based on historical traffic state data. GCN is used to capture spatial dependence based on the graph of irregular regions.
The TmS-GCN model fully takes into account multiple types of spatial dependencies. Compared with the classic GCN model, which assumed that adjacent regions have the same effect on the predicted region, our model also considers traffic flow propagation and spatial distance among regions.
We evaluate our method using real-world GPS data collected from Taxi vehicles in Shenzhen, China. The results show that our method outperforms baseline methods.

The rest of the paper is organized as follows. Section 2 reviews relevant research on traffic state prediction. Section 3 introduces the details of the method, including problem description, method overall framework, spatial dependence modeling, and temporal dependence modeling. In Section 4, we use real-world GPS dataset to evaluate the TmS-GCN model. Finally, we conclude the paper in Section 5.

2. Related Work

Region-level traffic states provide a macro view of urban traffic, which is useful for traffic control, traffic guidance, and other applications. Region-level traffic state prediction has become the hotspot in the Intelligent Transportation System, and it can be divided into model-driven research and data-driven research. The existing researches on traffic prediction are shown in Table 1. Model-driven research is typically based on a variety of assumptions and ideal conditions, with extremely high application scenario requirements. Typical model-driven research includes the following: car-following model [17], queuing theory [18], cell transmission model [19], three-phase traffic theory [20], etc. Data-driven research, on the other hand, considers both feasibility and accuracy in practical applications; thus, it has received more attention from researchers.

Many data-driven studies were carried out using time series models due to the periodicity and tendency of urban traffic flow in the time dimension. Parametric models and deep learning models are two types of prediction research based on time series features. The parametric model assumes that the regression function conforms to traffic flow distribution; then, it uses historical data to fit the function’s parameters. As early as 1970s, researchers used classic ARIMA [21], Kalman filter [22], Bayesian model [23], and Network Fundamental Diagram (NFD) [24] to conduct traffic prediction research, followed by studies using variants of these models. For example, Chen et al. proposed an Autoregressive Integrated Moving Average with Generalized Autoregressive Conditional Heteroscedasticity (ARIMA-GARCH) model for traffic flow prediction [25]; Azari et al. employed a machine learning and statistical learning time-series prediction method based on long short-term memory and ARIMA to predict cellular network traffic, with a fix-point algorithm embedded to update posterior estimations of maximum correntropy-deduced Kalman filter. Cai et al. achieved better traffic forecasting [9]. Gu et al. proposed an improved Bayesian combination model with deep learning (IBCM-DL) for traffic flow prediction and proved it outperforms other state-of-the-art methods in terms of accuracy and stability [10]. Deep learning models, such as Artificial Neural Network [11], Recurrent Neural Network (RNN) [26], Long Short-term Memory (LSTM) [27], etc., have also performed well in terms of prediction.

Due to the connectivity of the urban road network, the traffic state of a region can be influenced by traffic states of its neighboring regions. As a result, many studies forecast traffic states by extracting both temporal and spatial features. For example, Ma et al. proposed a convolutional neural network (CNN)-based method that learns traffic as images and predicts large-scale, network-wide traffic speed with high accuracy [28]. Zhang et al. proposed a short-term traffic-flow prediction model based on a Convolution Neural Network (CNN) deep learning framework [29]. Shi et al. proposed a novel Attention-based Periodic-Temporal neural Network (APTN), which is an end-to-end solution for traffic foresting that captures spatial, short-term, and long-term periodical dependencies [30]. However, region-level traffic state prediction still faces challenges: (1) The topological structure of the city’s complex road network is destroyed when regions are divided into squares or hexagons, and it is difficult to extract accurate spatial features of these regions; (2) irregular regions based on natural roads, administrative divisions, and other factors are typically non-Euclidean distance data. The classic CNN model is difficult to apply to this type of data.

With the rise of Graph Convolutional Network (GCN), we can capture exactly spatial features from irregular regions [15,16]. Researchers tried to employ GCN and time series model to forecast traffic state. For example, Zhao et al. proposed a temporal graph convolutional network (T-GCN) model, which is combined with the graph convolutional network (GCN) and the gated recurrent unit (GRU), for traffic prediction [14]. Yu et al. devised a novel graph-based neural network that expanded the existing GCN to predict road traffic speeds [31]. Zhang et al. proposed a novel end-to-end deep learning framework named Graph Attention Temporal Convolutional Network (GATCN) for traffic speed forecasting [13], etc. Most studies based on GCN assumed that adjacent regions have the same effect on the predicted region, which is imprecise. As discussed in Section 1, the features of traffic flow propagation and distance between regions should also be considered.

In this context, we propose a new deep learning method that can capture complex temporal and spatial features from remote sensing data in this research and can be used for traffic state forecasting based on irregular region graph.

3. Methodology

In order to capture not only the temporal features of traffic flow, but also the spatial dependencies of the irregular non-Euclidean distance graph structure, we propose the TmS-GCN model. Figure 2 shows the step-by-step diagram of the method. Firstly, the urban area is divided into regions, and a region graph is conducted, and traffic information (i.e., traffic speed, traffic demand, and etc.) in each region are obtained. Secondly, the graph convolutional neural network is used to capture the spatial features. Finally, the outputs of the GCN part are inputed into GRU part to forecast future traffic information.

3.1. Problem Definition

For region-level traffic state prediction, we first define several parameters as follows:

Definition 1.

region graph

G

.

G = (V, E)

, where

V

is the set of nodes. In our research, each region represents a node;

E

is the set of edges that defines the topology of

G

, which is described by adjacency matrix

A

.

Definition 2.

feature matrix

X \in R^{N \times P}

.

X

represents the feature matrix of all regions,

N

is the number of regions, and

P

is the number of historical time series.

X_{m} = [x_{m}^{t - P}, x_{m}^{t - P + 1}, \dots, x_{m}^{t}]

represents traffic feature information, such as traffic flow, traffic speed, and traffic density, from time series

t - P

to time series

t

in the

m t h

region.

Therefore, region-level traffic state prediction can be defined as follows:

[X_{t + 1}, X_{t + 2}, \dots, X_{t + T}] = f ([X_{t - P}, X_{t - P + 1}, \dots, X_{t}]; G)

(1)

which is learning the function

f

to mapping from historical traffic feature matrix

[X_{t - P}, X_{t - P + 1}, \dots, X_{t}]

to future traffic feature matrix

[X_{t + 1}, X_{t + 2}, \dots, X_{t + T}]

based on graph

G

.

3.2. Overall Framework

Figure 3 shows the framework of the proposed TmS-GCN model consisting of two parts: Grated Recurrent Units and Graph Convolutional Network. Firstly, the GCN part uses the historical traffic state data, i.e., feature matrix

[X_{t - P}, X_{t - P + 1}, \dots, X_{t}]

, and three types of convolutional filter to obtain the intermediate feature matrix

[{\tilde{X}}_{t - P}, {\tilde{X}}_{t - P + 1}, \dots, {\tilde{X}}_{t}]

. Secondly, intermediate feature matrix

[{\tilde{X}}_{t - P}, {\tilde{X}}_{t - P + 1}, \dots, {\tilde{X}}_{t}]

is input to GRU part to obtain future traffic feature matrix

[X_{t + 1}, X_{t + 2}, \dots, X_{t + T}]

.

3.3. Spatial Dependence Modeling

The datasets used in many deep learning studies are Euclidean distance datasets with regular shapes, such as images, videos, and audios. The classic Convolutional Neural Network (CNN) model can be used to extract effective features from these datasets [28,32]. However, classic CNN models cannot be deployed on irregular non-Euclidean distance datasets, such as social networks and road networks. In recent years, researchers have tried to use graph convolutional neural networks to capture features from irregular graph structure, such as the following: identifying disease-gene association [33], forecasting road traffic speeds [31], classifying node [34], etc. Given a graph

G = (V, E)

, an adjacency matrix

A

and feature matrix

X

can be obtained. The GCN model defines convolutional operation in the Fourier domain. The convolutional filter captures spatial features of each node from its first-order neighborhood and itself. A typical multi-layered GCN is shown in Figure 4, in which the relationship between two adjacency layers can be expressed as follows:

H^{(l + 1)} = σ ({\tilde{D}}^{- \frac{1}{2}} \hat{A} {\tilde{D}}^{- \frac{1}{2}} H^{(l)} θ^{(l)})

(2)

\hat{A} = A + I_{N}

(3)

where

A

represents adjacency matrix,

I

is the identity matrix,

\tilde{D}

is the degree matix,

H^{(l)}

and

H^{(l + 1)}

are the outputs of

l

and

l + 1

layer,

θ^{(l)}

represents all parameters of

l

layer, and

σ ()

is the activation function.

We stated in the first section that the spatial dependence of region-level traffic states cannot only rely on adjacency matrix, which result in poor prediction. Thus, here, we introduce two more special adjacency matrices

A^{P}

and

A^{d}

to capture richer spatial dependencies.

(1) Traffic flow propagating matrix

A^{P}

captures the features of traffic flow propagating between regions, which is defined as follows:

A_{i j}^{P} = {\begin{matrix} 0 & no vehicle travel from region i to region j \\ \frac{{\bar{Q}}_{i j}}{{\bar{Q}}_{j}} & otherwise \end{matrix}

(4)

where

{\bar{Q}}_{j}

is the average number of vehicles located in region

j

, and

{\bar{Q}}_{i j}

represents the average number of vehicles travelling from region

i

to region

j

. According to Equation (4), a greater value of

A_{i j}^{P}

indicates that region

i

provides more information than other regions for capturing spatial features of region

j

. As shown in Figure 1,

A_{24}^{P}

is greater than

A_{14}^{P}

.

(2) Centroid distance matrix

A^{d}

captures the feature of distance of GPS points’ centroid between regions, which is defined as follows:

A_{i j}^{d} = {\begin{matrix} 0 & if region j and region i is not adjacent \\ \frac{1}{φ (L a t^{i}, L a t^{j}, L n g^{i}, L n g^{j})} & otherwise \end{matrix}

(5)

where

φ ()

represents the function to calculate distance based on latitude and longitude coordinates,

L a t^{i}

and

L n g^{i}

are the latitude and longitude coordinates of the GPS points’ centroid in region

i

, and

L a t^{j}

and

L n g^{j}

are the latitude and longitude coordinates of the GPS points centroid in region

j

. A greater value of

A_{i j}^{d}

indicates that region

i

provides more information than other regions for capturing spatial features of region

j

. As shown in Figure 1,

A_{54}^{d}

is greater than

A_{34}^{d}

.

3.4. Temporal Dependence Modeling

Traffic state prediction is a typical time series task, and some classic time series model can be employed to this task, such as Moving Average (MA) model, Auto Regressive (AR) model, ARIMA, etc. With the rise of deep learning methods, Recurrent Neural Network (RNN) models with better prediction effects on time-series tasks have been proposed. However, the RNN model has been replaced by Long Short-Term Memory (LSTM) [35] and GRU [36] because it is prone to gradient disappearance and gradient explosion problems. The GRU model has a small number of parameters, which speeds up model convergence without sacrificing prediction accuracy. As a result, we employ the GRU model to extract the traffic state’s dynamic change characteristics from time series.

Figure 5 shows the structure of GRU, where

h_{(t - 1)}

and

h_{t}

are the latent state at

t - 1

and

t

,

X_{t}

represents the input of GRU at

t

, i.e., traffic state information,

{\hat{Y}}_{t}

is the output at

t

, which is the predicted traffic state information. Overall, the model is capable of predicting future traffic states by combining both traffic state information and hidden state information from previous time intervals. There are two important gate controls added to the model, including reset gate

r_{t}

and update gate

z_{t}

.

r_{t}

and

z_{t}

are defined in Formulas (7) and (8), respectively. With the two gates, GRU is able to control whether current traffic state information and previous hidden state information can be imported. In addition, the sigmoid activation function is added to the two gates to make sure that input size is controlled in the range of 0% to 100%.

r_{t} = σ (W_{r} [f (A, X_{t}), h_{(t - 1)}] + b_{r})

(6)

z_{t} = σ (W_{u} [f (A, X_{t}), h_{(t - 1)}] + b_{u})

(7)

{\tilde{h}}_{t} = t a n h (W_{\tilde{h}} [f (A, X_{t}), (r_{t} ⊙ h_{(t - 1)})] + b_{\tilde{h}})

(8)

h_{t} = z_{t} ⊙ h_{(t - 1)} + (1 - z_{t}) ⊙ {\tilde{h}}_{t}

(9)

4. Experiment

4.1. Data Description

The dataset for this paper was derived from taxi GPS data collected in Shenzhen, China, in January 2019. There are about 30,000 taxis in total and over 900 million positioning points. The sampling rate of GPS is 1–3 s, and the average penetration rate of GPS is above 5%. Our research is similar to other studies using GPS data from taxis, such as mining Urban Recurrent Congestion Evolution Patterns [1], forecasting Citywide Traffic Congestion [2], and identifying areas of interest [4], which all reflect urban traffic status from a macro level. Compared with other types of vehicles, taxis perform better in terms of overall number, sampling rate, penetration rate, and other indicators. Therefore, it is reasonable and representative to carry out verification of this paper based on taxi GPS data.

Before placing data into the model for training, some pre-processing work was performed: (1) Weekend data and holiday data were removed, leaving only 22 workdays; (2) incorrect and redundant data, such as one vehicle’s GPS data located in one region for an unreasonable amount of time, were deleted. (3) Average speed values of all vehicles, which represent region-level traffic features, were aggregated every 15 min in each region. (4) The 22-day dataset was divided into a training dataset and a test dataset at a ratio of 8:2. The test dataset includes data on 21–24 January 2019. Moreover, the rest part is the training dataset.

The urban area of Shenzhen is divided into 78 regions according to administrative zip code, as shown in Figure 6. Adjacent matrix

A

, traffic flow propagating matrix

A^{P}

, and Centroid distance matrix

A^{d}

were obtained based on the connectivity of graph generated by regions, Equation (4), and Equation (5), respectively.

4.2. Benchmark Model and Evaluation Measurement

We evaluate the performance of the TmS-GCN model with the following benchmark models:

Historical average (HA) [37] uses the average value of historical state traffic information as the prediction result.
Auto-Regressive Moving Average (ARIMA) [25] is a parameter model widely used in various traffic prediction research.
Multiple Layer Perceptron (MLP) [38] is a classic feedforward neural network model.
Graph Convolutional Network model (GCN) [39] is described in Section 3.3.
Gated Recurrent Unit model (GRU) [40] is described in Section 3.4.
Long Short-Term Memory (LSTM) [27] is similar to the GRU model and widely used in traffic prediction areas. The settings of LSTM are the same as GRU.
Temporal Graph Convolutional Network (T-GCN) [14] captures both temporal and spatial dependencies to forecast short-term traffic flow.

Three common indicators are used to compare the performance of the TmS-GCN model and benchmark models:

(1) Mean Absolute Error (MAE):

M A E = \frac{1}{N} \sum_{i = 1}^{N} | y_{i} - \hat{y_{i}} |

(10)

(2) Mean Absolute Percentage Error (MAPE):

M A P E = \frac{1}{N} \sum_{i = 1}^{N} \frac{| y_{i} - \hat{y_{i}} |}{y_{i}}

(11)

(3) Root Mean Square Error (RMSE):

R M S E = \sqrt{\frac{1}{N} \sum_{i = 1}^{N} (y_{i} - \hat{y_{i}})}

(12)

where

y_{i}

and

\hat{y_{i}}

represent the

i^{t h}

real value and predicted value. The smaller MAE, MAPE, and RMSE values are, the higher the accuracy of the model and the better prediction performance will be.

We use Pytorch to implement the TmS-GCN model and other benchmark models. Some parameters involved in the TmS-GCN model are as follows: the learning rate is set to 0.001, the batch size is set to 32, and the training Epoch is 600. The L2 loss function is used to calculate the difference between true and predicted values.

4.3. Results

Table 2 shows the prediction performance of the TmS-GCN model and other benchmark models for 5 min, 15 min, 30 min, 45 min, and 60 min. It can be observed that the proposed TmS-GCN model has the best prediction performance for all prediction horizons.

We can deduce the following information from Table 2: (1) For all prediction horizons, the GCN model, which simply considers spatial dependencies, is the worst. It shows that in the study of regional traffic state prediction, temporal dependencies have a greater impact than spatial dependencies. (2) The deep learning model outperforms the traditional time series model. The GRU model that ignores spatial dependencies outperforms HA and ARIMA models for all prediction horizons. (3) The T-GCN model and TmS-GCN models, considering both spatial and temporal dependencies, outperform not only the GCN model but also the GRU model. (4) The TmS-GCN model outperforms T-GCN model for most prediction horizons, which demonstrating the validity of our hypothesis that adding two more special adjacency matrices

A^{P}

and

A^{d}

to capture richer spatial dependencies improves forecasting performance.

4.4. Discussion

4.4.1. Results Analysis

We analyze the prediction results of the TmS-GCN model from the time dimension and space dimension.

In Figure 7, the true traffic feature value of Region 60 (i.e., Longhua subdistinct, the red region in Figure 6) is compared to the predicted values of 15 min, 30 min, 45 min, and 60 min, respectively. The following conclusions can be drawn: (1) The TmS-GCN model performs better in short-term prediction tasks than in long-term prediction tasks, which are determined by GRU features. (2) The model does not perform well when regional traffic speed increases or decreases sharply. This is due to the fact that our model ignores the random effect induced by a lack of taxi vehicles. In the future, other GPS data sources could be collected to improve it.

Figure 8 shows the comparison of prediction results for 78 regions. We can draw the following conclusions:

Figure 8a shows that the majority of the locations with good prediction results are in the city center, such as Nanshan District, Futian District, Bao’an District, and so on. However, suburban areas such as Guangming District in the northwest corner and Longgang District in the southeast corner have a poor prediction results. This is due to the fact that suburb regions have less adjacent regions, making it difficult to acquire effective spatial dependencies. Furthermore, the amount of taxi GPS data in the suburbs is tiny, resulting in randomness.
Figure 8 shows that the number of taxi GPS points has a significant impact on prediction effect. When there are a few taxi vehicles between 3:00 and 4:00 a.m. every day, the prediction results of practically all regions are smaller than in other time periods. The increase in the number of taxi GPS points between 12:00 and 13:00 improves forecasting results of all regions.

4.4.2. Analysis of Influencing Factors

Type and number of GPS points

There are nearly 30,000 taxis in Shenzhen cruising on roads at any time of the day. Each taxi vehicle uploads location information every 3 s, including latitude, longitude, and instantaneous speed. These data can comprehensively and accurately reflect travel demand, traffic flow speed, and other information of the divided regions. GPS data of other vehicles can also be applied to the TmS-GCN model, such as online car-hailing vehicles, bus, private car, etc.

Intuitively, the number of GPS points within the region has an impact on prediction results. Figure 9 shows the results of regional traffic prediction at various GPS points. We observe that when the average number of GPS points within a region in 15 min exceeds 600, the values of MAE, MAPE, and RMSE are obviously smaller and more stable, and the prediction effect is greatly improved. Therefore, in order to achieve better prediction, the average number of GPS points in each region needs to reach 600 every 15 min.

Time interval

In this paper, we set the time interval to 15 min. In terms of calculation, the smaller the time interval, the more calculations are required. The calculation amount for a 5-min scale forecast, for example, is three times that of a 15-min forecast. However, the 5-min forecast is more relevant than the 15-min forecast in terms of application. We compared prediction impacts over different time intervals, as shown in Figure 10. As the time interval becomes larger, the values of MAE, MAPE, and RMSE also increase correspondingly, which means that the prediction effect becomes worse. Compared with the prediction results on the 5-min scale, MAE, MAPE, and RMSE values on the 15-min scale only increased by 8.2%, 15%, and 12.6%, respectively. If calculation efficiency is not considered, it is recommended to perform a 5-min traffic forecast. However, 15-min traffic forecasting is the most appropriate if computing efficiency, practicability, and forecasting consequences are all taken into account.

4.4.3. Potential Application Direction

Since the average speed values of all regions can be obtained, the most direct application is to provide a macroscopic visual display of traffic operation status in the ITS system. If the average speed of a specific region deviates from the average value of all regions, the traffic manager can reschedule the traffic signal in advance or send the police to divert the traffic. Furthermore, if the speed of traffic in a certain location is. reduced, this information can be displayed on a public information platform. Furthermore, if the traffic speed in a specific region decreases, an individual can use this knowledge to choose the subway instead of a private car, or the individual can adjust travel time to avoid wasting time.

In this research study, we verified traffic speed prediction within the region. In fact, our model can also predict region-level traffic flow as long as traffic speed is replaced by traffic flow. Thus, the most direct application is to estimate the OD matrix at the regional level. Furthermore, if we know where and when a taxi picks up passengers, we can incorporate this type of location information into our model and forecast the demand for individuals to travel by taxi. Taxi companies can dispatch vehicles based on this forecast information to obtain greater economic benefits. These applications are also applicable to private cars, online car-hailing, shared bicycles, etc.

5. Conclusions

This paper proposes a deep learning model for predicting region-level traffic state called TmS-GCN, including the following two parts: GRU and GCN. In the GCN part, not only adjacency matrix information but also traffic propagation features between regions and GPS positioning points’ centroid distance feature are used to capture spatial dependencies of the region’s graph. In the GRU part, temporal dependencies are captured in order to predict region-level traffic state for prediction horizons of 15 min, 30 min, 45 min, and 60 min. Using real GPS data from Shenzhen taxis, the model is evaluated and compared to HA, ARIMA, MLP, GCN, GRU, and T-GCN. In most regions and prediction horizons, the model outperforms other benchmark models. Our model is expected to be used to analyze and capture spatio-temporal features at the regional level in other scenarios.

The contributions of this article are listed as follows:

We propose a complete region-level traffic prediction method named TmS-GCN composed of GCN and GRU. Based on GCN, TmS-GCN can capture multi-spatial correlation features of regions on non-Euclidean distance data composed of divided regions. In addition, based on GRU, TmS-GCN can capture temporal features of traffic parameters within the region.
We validate the TmS-GCN model by using real taxi GPS data from Shenzhen. The findings show that the model produces the desired impact and that the model’s effect is much better when traffic flow propagation information and centroid distance information are included. TmS-GCN outperforms baseline models of HA, ARIMA, MLP, GCN, GRU, and T-GCN on prediction scales of 15 min, 30 min, 45 min, and 60 min. The MAE, MAPE, and RMSE of the model’s prediction outcomes are 4.1558, 17.5257, and 7.7170, respectively, on a 15-min prediction scale.

This study can provide support for better understanding and carrying out region-level traffic prediction. The research’s findings, in particular, are extremely useful for decision making in Intelligent Transportation Systems. For example, using forecasted dynamic region-level traffic demand, managers can dispatch vehicles to balance traffic demand. Using information on abnormal changes in region-level traffic speeds, managers can implement traffic control or traffic guidance to relieve traffic congestion. Although the model in this paper has a good prediction effect in most regions and for most of the time period, it fails to respond well to the impact of severe weather and traffic accidents. Future research and improvement ideas include two types: one is to design an adaptive model for a specific shock event, and the other is to consider the impact of shock events by using the existing TmS-GCN model, such as time, location, and scale of shock events, as input features into the neural network.

Author Contributions

H.Y.: Conceptualization, methodology, software, writing—original draft, and writing—review and editing. X.Z.: Data curation and review and editing. Z.L.: Software, validation, and visualization. J.C.: Conceptualization, methodology, and writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Key Research and Development Project (Grant Number: 2020YFB1313604).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data, models, or code generated in our study are available at https://github.com/Joker-L0912/Tms-GCN-Py (accessed on 24 January 2019).

Conflicts of Interest

The authors declare no conflict of interest.

References

An, S.; Yang, H.; Wang, J.; Cui, N.; Cui, J. Mining Urban Recurrent Congestion Evolution Patterns from GPS-Equipped Vehicle Mobility Data. Inf. Sci. 2016, 373, 515–526. [Google Scholar] [CrossRef]
Guo, J.; Liu, Y.; Yang, Q.; Wang, Y.; Fang, S. GPS-Based Citywide Traffic Congestion Forecasting Using CNN-RNN and C3D Hybrid Model. Transp. A: Transp. Sci. 2021, 17, 190–211. [Google Scholar] [CrossRef]
Tang, J.; Liang, J.; Liu, F.; Hao, J.; Wang, Y. Multi-Community Passenger Demand Prediction at Region Level Based on Spatio-Temporal Graph Convolutional Network. Transp. Res. Part C Emerg. Technol. 2021, 124, 102951. [Google Scholar] [CrossRef]
Liu, Y.; Singleton, A.; Arribas-bel, D.; Chen, M. Identifying and Understanding Road-Constrained Areas of Interest (AOIs) through Spatiotemporal Taxi GPS Data: A Case Study in New York City. Comput. Environ. Urban Syst. 2021, 86, 101592. [Google Scholar] [CrossRef]
Yu, J.; Stettler, M.E.J.; Angeloudis, P.; Hu, S.; Chen, X. Urban Network-Wide Traffic Speed Estimation with Massive Ride-Sourcing GPS Traces. Transp. Res. Part C Emerg. Technol. 2020, 112, 136–152. [Google Scholar] [CrossRef]
Kan, Z.; Kwan, M.-P.; Liu, D.; Tang, L.; Chen, Y.; Fang, M. Assessing Individual Activity-Related Exposures to Traffic Congestion Using GPS Trajectory Data. J. Transp. Geogr. 2022, 98, 103240. [Google Scholar] [CrossRef]
Cascetta, E. Transportation Systems Analysis: Models and Applications; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2009; Volume 29. [Google Scholar]
Azari, A.; Papapetrou, P.; Denic, S.; Peters, G. Cellular Traffic Prediction and Classification: A Comparative Evaluation of LSTM and ARIMA. In Proceedings of the Discovery Science; Kralj Novak, P., Šmuc, T., Džeroski, S., Eds.; Springer International Publishing: Cham, Switzerland, 2019; pp. 129–144. [Google Scholar]
Cai, L.; Zhang, Z.; Yang, J.; Yu, Y.; Zhou, T.; Qin, J. A Noise-Immune Kalman Filter for Short-Term Traffic Flow Forecasting. Phys. A Stat. Mech. Its Appl. 2019, 536, 122601. [Google Scholar] [CrossRef]
Gu, Y.; Lu, W.; Xu, X.; Qin, L.; Shao, Z.; Zhang, H. An Improved Bayesian Combination Model for Short-Term Traffic Prediction With Deep Learning. IEEE Trans. Intell. Transp. Syst. 2020, 21, 1332–1342. [Google Scholar] [CrossRef]
Chen, X.; Lu, J.; Zhao, J.; Qu, Z.; Yang, Y.; Xian, J. Traffic Flow Prediction at Varied Time Scales via Ensemble Empirical Mode Decomposition and Artificial Neural Network. Sustainability 2020, 12, 3678. [Google Scholar] [CrossRef]
Azzouni, A.; Pujolle, G. A Long Short-Term Memory Recurrent Neural Network Framework for Network Traffic Matrix Prediction. arXiv 2017, arXiv:1705.05690. [Google Scholar]
Zhang, K.; He, F.; Zhang, Z.; Lin, X.; Li, M. Graph Attention Temporal Convolutional Network for Traffic Speed Forecasting on Road Networks. Transp. B Transp. Dyn. 2021, 9, 153–171. [Google Scholar] [CrossRef]
Zhao, L.; Song, Y.; Zhang, C.; Liu, Y.; Wang, P.; Lin, T.; Deng, M.; Li, H. T-GCN: A Temporal Graph Convolutional Network for Traffic Prediction. IEEE Trans. Intell. Transport. Syst. 2020, 21, 3848–3858. [Google Scholar] [CrossRef] [Green Version]
Kipf, T.N.; Welling, M. Semi-Supervised Classification with Graph Convolutional Networks. arXiv 2017, arXiv:1609.02907. [Google Scholar]
Zhang, S.; Tong, H.; Xu, J.; Maciejewski, R. Graph Convolutional Networks: A Comprehensive Review. Comput. Soc. Netw. 2019, 6, 11. [Google Scholar] [CrossRef] [Green Version]
Li, X.; Wang, X.; Ouyang, Y. Prediction and Field Validation of Traffic Oscillation Propagation under Nonlinear Car-Following Laws. Transp. Res. Part B: Methodol. 2012, 46, 409–423. [Google Scholar] [CrossRef]
Lartey, J.D. Predicting Traffic Congestion: A Queuing Perspective. Open J. Model. Simul. 2014, 2, 57. [Google Scholar] [CrossRef]
Hu, X.; Wang, W.; Sheng, H. Urban Traffic Flow Prediction with Variable Cell Transmission Model. J. Transp. Syst. Eng. Inf. Technol. 2010, 10, 73–78. [Google Scholar] [CrossRef]
Wegerle, D.; Kerner, B.S.; Schreckenberg, M.; Klenov, S.L. Prediction of Moving Bottleneck through the Use of Probe Vehicles: A Simulation Approach in the Framework of Three-Phase Traffic Theory. J. Intell. Transp. Syst. 2020, 24, 598–616. [Google Scholar] [CrossRef]
Kinney, W.R. ARIMA and Regression in Analytical Review: An Empirical Test. Account. Rev. 1978, 53, 48–60. [Google Scholar]
Ghosh, D.; Knapp, C.H. Estimation of Traffic Variables Using a Linear Model of Traffic Flow. Transp. Res. 1978, 12, 395–402. [Google Scholar] [CrossRef]
Levin, M.; Krause, G.M. Incident Detection: A Bayesian Approach. Transp. Res. Rec. 1978, 682, 52–58. [Google Scholar]
Alonso, B.; Pòrtilla, Á.I.; Musolino, G.; Rindone, C.; Vitetta, A. Network Fundamental Diagram (NFD) and Traffic Signal Control: First Empirical Evidences from the City of Santander. Transp. Res. Procedia 2017, 27, 27–34. [Google Scholar] [CrossRef]
Chen, C.; Hu, J.; Meng, Q.; Zhang, Y. Short-Time Traffic Flow Prediction with ARIMA-GARCH Model. In Proceedings of the 2011 IEEE Intelligent Vehicles Symposium (IV), Baden-Baden, Germany, 5–9 June 2011; pp. 607–612. [Google Scholar]
Zhene, Z.; Hao, P.; Lin, L.; Guixi, X.; Du, B.; Bhuiyan, M.Z.A.; Long, Y.; Li, D. Deep Convolutional Mesh RNN for Urban Traffic Passenger Flows Prediction. In Proceedings of the 2018 IEEE SmartWorld, Ubiquitous Intelligence Computing, Advanced Trusted Computing, Scalable Computing Communications, Cloud Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI), Guangzhou, China, 8–12 October 2018; pp. 1305–1310. [Google Scholar]
Zhao, Z.; Chen, W.; Wu, X.; Chen, P.C.Y.; Liu, J. LSTM Network: A Deep Learning Approach for Short-term Traffic Forecast. IET Intell. Transp. Syst. 2017, 11, 68–75. [Google Scholar] [CrossRef] [Green Version]
Ma, X.; Dai, Z.; He, Z.; Ma, J.; Wang, Y.; Wang, Y. Learning Traffic as Images: A Deep Convolutional Neural Network for Large-Scale Transportation Network Speed Prediction. Sensors 2017, 17, 818. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Zhang, W.; Yu, Y.; Qi, Y.; Shu, F.; Wang, Y. Short-Term Traffic Flow Prediction Based on Spatio-Temporal Analysis and CNN Deep Learning. Transp. A Transp. Sci. 2019, 15, 1688–1711. [Google Scholar] [CrossRef]
Shi, X.; Qi, H.; Shen, Y.; Wu, G.; Yin, B. A Spatial–Temporal Attention Approach for Traffic Prediction. IEEE Trans. Intell. Transp. Syst. 2021, 22, 4909–4918. [Google Scholar] [CrossRef]
Yu, B.; Lee, Y.; Sohn, K. Forecasting Road Traffic Speeds by Considering Area-Wide Spatio-Temporal Dependencies Based on a Graph Convolutional Neural Network (GCN). Transp. Res. Part C Emerg. Technol. 2020, 114, 189–204. [Google Scholar] [CrossRef]
Li, X.; Li, J.; Hu, X.; Yang, J. Line-CNN: End-to-End Traffic Line Detection With Line Proposal Unit. IEEE Trans. Intell. Transp. Syst. 2020, 21, 248–258. [Google Scholar] [CrossRef]
Han, P.; Yang, P.; Zhao, P.; Shang, S.; Liu, Y.; Zhou, J.; Gao, X.; Kalnis, P. GCN-MF: Disease-Gene Association Identification By Graph Convolutional Networks and Matrix Factorization. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Anchorage, AK, USA, 4–8 August 2019; pp. 705–713. [Google Scholar]
Abu-El-Haija, S.; Kapoor, A.; Perozzi, B.; Lee, J. N-GCN: Multi-Scale Graph Convolution for Semi-Supervised Node Classification. In Proceedings of the 35th Uncertainty in Artificial Intelligence Conference, Tel Aviv, Israel, 22–25 July 2019; pp. 841–851. [Google Scholar]
Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef]
Cho, K.; van Merrienboer, B.; Gulcehre, C.; Bahdanau, D.; Bougares, F.; Schwenk, H.; Bengio, Y. Learning Phrase Representations Using RNN Encoder-Decoder for Statistical Machine Translation. arXiv 2014, arXiv:1406.1078. [Google Scholar]
Kamarianakis, Y.; Prastacos, P. Forecasting Traffic Flow Conditions in an Urban Network: Comparison of Multivariate and Univariate Approaches. Transp. Res. Rec. 2003, 1857, 74–84. [Google Scholar] [CrossRef]
Chen, H.; Grant-Muller, S.; Mussone, L.; Montgomery, F. A Study of Hybrid Neural Network Approaches and the Effects of Missing Data on Traffic Forecasting. NCA 2001, 10, 277–286. [Google Scholar] [CrossRef]
Chen, Z.; Zhao, B.; Wang, Y.; Duan, Z.; Zhao, X. Multitask Learning and GCN-Based Taxi Demand Prediction for a Traffic Road Network. Sensors 2020, 20, 3776. [Google Scholar] [CrossRef]
Wang, S.; Zhao, J.; Shao, C.; Dong, C.; Yin, C. Truck Traffic Flow Prediction Based on LSTM and GRU Methods With Sampled GPS Data. IEEE Access 2020, 8, 208158–208169. [Google Scholar] [CrossRef]

Figure 1. Multi-spatial dependencies of region graph.

Figure 2. The step-by-step diagram of the method.

Figure 3. Overall framework of TmS-GCN.

Figure 4. The structure of multi layers GCN.

Figure 5. The structure of multi-layered GCN.

Figure 6. Visualization of 78 divided regions of Shenzhen.

Figure 7. Comparison of Region 60 ground truth and prediction results for 4 prediction horizons: (a) 15-min; (b) 30-min; (c) 45-min; (d) 60-min.

Figure 8. Comparison of the RMSE values of all 78 regions for various time periods.

Figure 9. The impact of the numbers of GPS points within region.

Figure 10. Comparison of prediction impacts over different time intervals.

Table 1. Existing research on traffic prediction.

Category		Models	Contribution	Shortcomings
Model-driven		car-following model [13], queuing theory [1,4], cell transmission model [15], three-phase traffic theory [16]	Establish a microscopic mathematical model for traffic forecasting; work better in micro scenes	based on a variety of assumptions and ideal conditions; with extremely high application scenario requirements
Data-driven	Parametric model	ARIMA [17], Kalman filter [18], Bayesian model [19]	treat traffic prediction as time-series task; work better in a single region scene	not consider the impact of spatial characteristics on traffic prediction
	Deep learning model	ARIMA-GARCH [20], LSTM-ARIMA [5], IBCM-DL [7], RNN [21], LSTM [22], CNN [23,24], APTN [25]	considering not only temporal characteristics but also spatial characteristics	only applicable to regular Euclidean datasets
	Deep learning model	GATCN [9], T-GCN [10], GCN [26]	applicable to irregular non-Euclidean datasets	assume that adjacent regions have the same effect on the predicted region

Table 2. Comparison of prediction performance.

Prediction Horizons	Indicator	Models
Prediction Horizons	Indicator	HA	ARIMA	GCN	GRU	T-GCN	TmS-GCN
15 min	MAE	4.9444	4.7388	6.4901	4.6163	4.3459	4.1558
	MAPE	22.1308	15.4636	30.3969	18.6844	17.8520	17.5257
	RMSE	9.4545	8.7343	10.4114	8.5333	7.9872	7.7170
30 min	MAE	4.9444	6.3588	6.5580	4.6815	4.2521	4.2626
	MAPE	22.1308	19.9847	30.6031	19.4643	18.6607	17.9527
	RMSE	9.4545	10.4867	10.5076	8.9123	7.7833	7.7935
45 min	MAE	4.9444	6.0853	6.5832	4.8029	4.4019	4.3449
	MAPE	22.1308	21.5859	30.7334	19.6285	18.9232	18.8494
	RMSE	9.4545	9.8143	10.5497	9.3611	7.9631	7.8322
60 min	MAE	4.9444	6.8783	6.5979	4.7871	4.7776	4.2945
	MAPE	22.1308	25.6076	30.8103	18.6647	21.3096	17.9971
	RMSE	9.4545	11.2398	10.5686	10.5902	8.2070	7.8740

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Yang, H.; Zhang, X.; Li, Z.; Cui, J. Region-Level Traffic Prediction Based on Temporal Multi-Spatial Dependence Graph Convolutional Network from GPS Data. Remote Sens. 2022, 14, 303. https://0-doi-org.brum.beds.ac.uk/10.3390/rs14020303

AMA Style

Yang H, Zhang X, Li Z, Cui J. Region-Level Traffic Prediction Based on Temporal Multi-Spatial Dependence Graph Convolutional Network from GPS Data. Remote Sensing. 2022; 14(2):303. https://0-doi-org.brum.beds.ac.uk/10.3390/rs14020303

Chicago/Turabian Style

Yang, Haiqiang, Xinming Zhang, Zihan Li, and Jianxun Cui. 2022. "Region-Level Traffic Prediction Based on Temporal Multi-Spatial Dependence Graph Convolutional Network from GPS Data" Remote Sensing 14, no. 2: 303. https://0-doi-org.brum.beds.ac.uk/10.3390/rs14020303

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Region-Level Traffic Prediction Based on Temporal Multi-Spatial Dependence Graph Convolutional Network from GPS Data

Abstract

1. Introduction

2. Related Work

3. Methodology

3.1. Problem Definition

3.2. Overall Framework

3.3. Spatial Dependence Modeling

3.4. Temporal Dependence Modeling

4. Experiment

4.1. Data Description

4.2. Benchmark Model and Evaluation Measurement

4.3. Results

4.4. Discussion

4.4.1. Results Analysis

4.4.2. Analysis of Influencing Factors

4.4.3. Potential Application Direction

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI