CN113408786B

CN113408786B - Traffic characteristic prediction method and system

Info

Publication number: CN113408786B
Application number: CN202110555841.9A
Authority: CN
Inventors: 戎丁丁; 季青原; 温晓岳; 徐甲; 何尚秋; 陈乾; 吴建平; 聂文涛; 林文霞; 吴占宁
Original assignee: Zhejiang Yinjiang Intelligent Transportation Engineering Technology Research Institute Co ltd
Current assignee: Zhejiang Yinjiang Intelligent Transportation Engineering Technology Research Institute Co ltd
Priority date: 2021-05-21
Filing date: 2021-05-21
Publication date: 2023-03-14
Anticipated expiration: 2041-05-21
Also published as: CN113408786A

Abstract

The invention relates to a traffic characteristic prediction method and system that introduce control information into traffic characteristic prediction. Traffic speed, road network structure and traffic control data are processed separately and fed into neural networks to obtain a spatial embedding, a time embedding and a control embedding, which are combined into a comprehensive embedding. The predicted traffic speed is then obtained through an encoder based on a space-time attention mechanism module, a data conversion module, and a decoder based on the space-time attention mechanism module. The invention improves the effectiveness of speed prediction for signal-controlled road networks, models dynamic spatial correlation and nonlinear temporal correlation more effectively, and avoids error accumulation so as to mitigate error propagation, thereby improving performance on long-term traffic prediction.

Description

Traffic characteristic prediction method and system

Technical Field

The invention relates to the technical field of intelligent traffic, in particular to a traffic characteristic prediction method and system.

Background

Traffic characteristic prediction is a classic time series prediction problem: future traffic conditions are predicted from observed historical traffic conditions. The accuracy of traffic characteristic prediction affects applications such as traffic signal control and traffic navigation. Accurate traffic prediction can make traffic decisions more effective and thereby reduce traffic congestion. However, existing traffic characteristic prediction methods still have the following defects: 1) they perform poorly on long-term traffic characteristic prediction, because traffic characteristics have complex space-time relationships and the error of a long-term prediction is amplified at each step; 2) the prior art relates the predicted traffic characteristics to historical traffic characteristics and to the traffic characteristics of nearby regions only from the perspective of temporal or spatial correlation, and ignores the influence of control information on the predicted values. To solve these problems effectively, a traffic characteristic prediction method and system need to be designed.

Disclosure of Invention

The invention aims to overcome the above defects and provides a traffic characteristic prediction method and system that introduce control information into traffic characteristic prediction. Traffic speed, road network structure and traffic control data are processed separately and fed into neural networks to obtain a spatial embedding, a time embedding and a control embedding, which are combined into a comprehensive embedding. The predicted traffic speed is then obtained through an encoder based on a space-time attention mechanism module, a data conversion module, and a decoder based on the space-time attention mechanism module. The invention improves the effectiveness of speed prediction for signal-controlled road networks, models dynamic spatial correlation and nonlinear temporal correlation more effectively, and avoids error accumulation so as to mitigate error propagation, thereby improving performance on long-term traffic prediction.

The invention achieves the aim through the following technical scheme: a traffic characteristic prediction method comprises the following steps:

(1) The data preprocessing module acquires a first traffic characteristic, a second traffic characteristic and a third traffic characteristic and processes them into the input of the space-time data embedding module; the first traffic characteristic is time-varying characteristic data generated by traffic operation, including but not limited to: traffic speed, flow, occupancy, queue length and congestion index; the second traffic characteristic is characteristic data of the environment in which traffic operates that is invariant over short periods, including but not limited to: road network structure, POI distribution and traffic infrastructure distribution; the third traffic characteristic is time-varying characteristic data generated by controlling traffic operation, including but not limited to: traffic control data, traffic guidance data and traffic restriction data;

(2) The space-time data embedding module feeds the first, second and third traffic characteristics processed in step (1) into neural networks to obtain the spatial embedding, the time embedding and the control embedding, and combines them into the comprehensive embedding;

(3) Processing the comprehensive embedding obtained in the step (2) by utilizing an encoder based on a space-time attention mechanism module;

(4) The data conversion module converts the coded comprehensive embedding into the input of a decoder by using a transfer attention mechanism;

(5) Processing the output of the data conversion module in step (4) by using a decoder based on the space-time attention mechanism module to obtain the finally predicted traffic characteristics.

Preferably, the step (1) specifically includes the steps of:

(1.1) constructing a connection graph from the second traffic characteristic data; taking road network structure data as an example, the topology of the urban road network is converted into a weighted directed connection graph $G = (V, E, A)$, where $V$ is the set of nodes, each representing a road segment in the actual road network; $V$ is a finite set with $|V| = N$, that is, the actual road network contains $N$ road segments; $E$ is the set of edges, representing the connectivity among the road segments, with the direction of traffic flow between road segments taken as the direction of each edge; $A \in \mathbb{R}^{N \times N}$ is the weighted adjacency matrix, in which $A_{v_i,v_j}$ represents the weight of the edge from node $v_i$ to node $v_j$; specifically, the weights of the adjacency matrix are calculated with a Gaussian weight model, and a threshold is used to control the density of the adjacency matrix:

$$A_{v_i,v_j} = \begin{cases} \exp\left(-\dfrac{d_{v_i,v_j}^{2}}{\sigma^{2}}\right), & \text{if } \exp\left(-\dfrac{d_{v_i,v_j}^{2}}{\sigma^{2}}\right) \ge \varepsilon \\ 0, & \text{otherwise} \end{cases}$$

wherein $d_{v_i,v_j}$ is the distance from node $v_i$ to node $v_j$, approximated by half of the sum of the lengths of the road segments represented by node $v_i$ and node $v_j$; $\sigma$ is the standard deviation of all distance values; and $\varepsilon$ is a threshold used to control the sparsity of the adjacency matrix, set to 0.1;

(1.2) normalizing the first traffic characteristic data; taking traffic speed data as an example, the specific processing flow is as follows: the original traffic data at time $t$ is represented as $X_t \in \mathbb{R}^{N \times C}$, where $N$ is the number of nodes and $C$ is the number of features per node; here $C = 1$, covering only the road segment speed; $C = 2$ would cover road segment speed and flow; $C = 3$ would cover road segment speed, flow and density; finally, $X_t$ is normalized using a method such as the Z-score method or the max-min method;

(1.3) matching the third traffic characteristic data to road segments, converting it into a period index and a green signal ratio index, discretizing these indices, and encoding them as one-hot codes; taking control data as an example, the control data refers to signal period (cycle) and green signal ratio (split) data, and matching the control data to a road segment is specifically as follows: in combination with the road network structure data, the period of the intersection downstream of the road segment is taken as the period of the road segment, and the green signal ratio of the phase that allows vehicles on the road segment to enter the intersection is taken as the green signal ratio of the road segment; then, the period index is calculated as the ratio of the road segment period to the historical maximum road segment period, and the green signal ratio index is calculated as the ratio of the road segment green signal ratio to the historical maximum green signal ratio; finally, the period index and the green signal ratio index are discretized according to a correspondence table.

Preferably, the step (2) is specifically as follows:

(2.1) generating the spatial embedding, specifically: learning vector representations of the vertices of the connection graph constructed in step (1.1) using any one of DeepWalk, node2vec and GraphSAGE; and feeding these vectors into a two-layer fully-connected neural network to obtain the spatial embedding, denoted $e^{S}_{v_i} \in \mathbb{R}^{D}$ for node $v_i$;

(2.2) generating the time embedding, specifically: encoding each timestamp of the traffic speed data in step (1.2) into a vector; the day of the week and the time of day are encoded into $\mathbb{R}^{7}$ and $\mathbb{R}^{T}$ respectively and concatenated into a vector in $\mathbb{R}^{7+T}$, where $T$ is the number of time steps in a day, for example 24 when counted in hours or 1440 when counted in minutes; the vector is fed into a neural network with two or more layers, such as a fully-connected neural network, a recurrent neural network or a deep belief network, and converted into a $D$-dimensional vector, namely the time embedding, denoted $e^{T}_{t_j} \in \mathbb{R}^{D}$ with $t_j = t_1, \ldots, t_P, \ldots, t_{P+Q}$;

wherein $P$ represents the number of historical input time steps and $Q$ represents the number of future time steps to be predicted;

(2.3) generating the control embedding, specifically: encoding the discretized period index and green signal ratio index obtained in step (1.3) into $\mathbb{R}^{10}$ each and concatenating them into a vector in $\mathbb{R}^{20}$; the vectors, one for each control data record, are fed into a neural network with two or more layers, such as a fully-connected neural network, a recurrent neural network or a deep belief network, to obtain the control embedding, denoted $e^{C}_{v_i,t_j} \in \mathbb{R}^{D}$;

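(2.4) synthesizing the comprehensive embedding, specifically: fusing the spatial embedding, the time embedding and the control embedding; for node $v_i$ at time step $t_j$, the comprehensive embedding is defined as

$$e_{v_i,t_j} = e^{S}_{v_i} + e^{T}_{t_j} + e^{C}_{v_i,t_j}$$

or

$$e_{v_i,t_j} = \alpha\, e^{S}_{v_i} + \beta\, e^{T}_{t_j} + \gamma\, e^{C}_{v_i,t_j}$$

where $e^{S}_{v_i}$, $e^{T}_{t_j}$ and $e^{C}_{v_i,t_j}$ are the spatial, time and control embeddings, and $\alpha$, $\beta$ and $\gamma$ are trainable weights; the comprehensive embedding of $N$ nodes over $P + Q$ time steps is thus denoted $E \in \mathbb{R}^{(P+Q) \times N \times D}$; the comprehensive embedding contains temporal, spatial and control information at the same time.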

Preferably, in step (3), before entering the encoder, the speed data $X \in \mathbb{R}^{P \times N \times C}$ normalized in step (1.2) is converted into $H^{(0)} \in \mathbb{R}^{P \times N \times D}$ through a fully-connected layer; $H^{(0)}$ then passes through an encoder consisting of $L$ layers of space-time attention mechanism modules to obtain the output $H^{(L)} \in \mathbb{R}^{P \times N \times D}$; the space-time attention mechanism module is formed by fusing a time attention mechanism and a spatial attention mechanism through a gated fuser; the input of the $l$-th layer space-time attention mechanism module is denoted $H^{(l-1)}$, in which the hidden state of node $v_i$ at time step $t_j$ is denoted $h^{(l-1)}_{v_i,t_j}$; the outputs of the spatial attention mechanism and the time attention mechanism in the $l$-th layer space-time attention mechanism module for node $v_i$ at time step $t_j$ are denoted $hs^{(l)}_{v_i,t_j}$ and $ht^{(l)}_{v_i,t_j}$ respectively.

Preferably, the spatial attention mechanism is used to adaptively capture the relationships among the traffic characteristics of different road segments in the road network; its core is to dynamically assign different weights to different nodes at different time steps; for node $v_i$ at time step $t_j$, the weighted sum over all nodes is calculated as:

$$hs^{(l)}_{v_i,t_j} = \sum_{v \in V} \alpha_{v_i,v} \cdot h^{(l-1)}_{v,t_j}$$

where $V$ represents the set of all nodes; $\alpha_{v_i,v}$ is the attention score representing the importance of node $v$ to node $v_i$, and the attention scores sum to 1, i.e. $\sum_{v \in V} \alpha_{v_i,v} = 1$; $h^{(l-1)}_{v,t_j}$ is the input of node $v$ at time step $t_j$ in the $l$-th layer space-time attention mechanism module; and $hs^{(l)}_{v_i,t_j}$ is the output of the spatial attention mechanism for node $v_i$ at time step $t_j$ in the $l$-th layer space-time attention mechanism module.

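Preferably, the attention score is calculated as follows: at a specific time step, the current traffic state and the road network structure both influence the relationship between sensors; the traffic characteristics, the graph structure and the control information are therefore considered together to learn the attention score; specifically, the hidden state is concatenated with the comprehensive embedding, and the scaled dot-product method is applied to calculate the correlation between node $v_i$ and node $v$:

$$s_{v_i,v} = \frac{\left\langle h^{(l-1)}_{v_i,t_j} \,\Vert\, e_{v_i,t_j},\; h^{(l-1)}_{v,t_j} \,\Vert\, e_{v,t_j} \right\rangle}{\sqrt{2D}}$$

wherein $e_{v_i,t_j}$ is the comprehensive embedding of node $v_i$ at time step $t_j$, $\Vert$ represents the concatenation operation, $\langle \cdot, \cdot \rangle$ represents the inner product, and $2D$ is the dimension of $h^{(l-1)}_{v_i,t_j} \,\Vert\, e_{v_i,t_j}$; $s_{v_i,v}$ is then normalized with the softmax function:

$$\alpha_{v_i,v} = \frac{\exp\left(s_{v_i,v}\right)}{\sum_{v_r \in V} \exp\left(s_{v_i,v_r}\right)}$$

in particular, to make the learning process more stable, the spatial attention mechanism is extended to a multi-head attention mechanism; that is, $K$ parallel attention mechanisms are set up with $K$ different sets of learnable equations:

$$s^{(k)}_{v_i,v} = \frac{\left\langle f^{(k)}_{s,1}\left(h^{(l-1)}_{v_i,t_j} \,\Vert\, e_{v_i,t_j}\right),\; f^{(k)}_{s,2}\left(h^{(l-1)}_{v,t_j} \,\Vert\, e_{v,t_j}\right) \right\rangle}{\sqrt{d}}$$

$$\alpha^{(k)}_{v_i,v} = \frac{\exp\left(s^{(k)}_{v_i,v}\right)}{\sum_{v_r \in V} \exp\left(s^{(k)}_{v_i,v_r}\right)}$$

$$hs^{(l)}_{v_i,t_j} = \Big\Vert_{k=1}^{K} \left\{ \sum_{v \in V} \alpha^{(k)}_{v_i,v} \cdot f^{(k)}_{s,3}\left(h^{(l-1)}_{v,t_j}\right) \right\}$$

wherein $f^{(k)}_{s,1}(\cdot)$, $f^{(k)}_{s,2}(\cdot)$ and $f^{(k)}_{s,3}(\cdot)$ represent three different nonlinear equations of the $k$-th spatial attention mechanism, each of which outputs a vector of dimension $d = D/K$; the nonlinear equations are of the form: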

f(x)＝ReLU(xW+b)

where W and b are trainable parameters, respectively, and ReLU is an activation function.
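The spatial attention described above can be illustrated with a minimal single-head sketch in plain Python. It concatenates each node's hidden state with its comprehensive embedding, scores node pairs by scaled dot product, normalizes with softmax, and returns the weighted sum of hidden states. The function names `softmax` and `spatial_attention` and the toy inputs are assumptions for illustration; the learnable projections f(x) = ReLU(xW + b) and the K-head split are omitted for brevity.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [x / total for x in exps]


def spatial_attention(hidden, embed):
    """hidden[i], embed[i]: D-dim hidden state and comprehensive embedding
    of node i at one fixed time step. Returns updated hidden states (N x D).
    Single head, no learned projections (illustrative simplification)."""
    # Concatenate h || e for every node; each joined vector has length 2D.
    joined = [h + e for h, e in zip(hidden, embed)]
    scale = math.sqrt(len(joined[0]))  # sqrt(2D)
    dim = len(hidden[0])
    output = []
    for query in joined:
        # Scaled dot-product score between this node and every node.
        scores = [sum(a * b for a, b in zip(query, key)) / scale for key in joined]
        weights = softmax(scores)  # attention scores, summing to 1
        # Weighted sum of all nodes' hidden states.
        output.append([sum(w * h[d] for w, h in zip(weights, hidden)) for d in range(dim)])
    return output


# Toy example: N = 3 nodes, D = 2 (values are made up).
hidden = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
embed = [[0.5, 0.5], [0.5, -0.5], [0.0, 1.0]]
updated = spatial_attention(hidden, embed)
print(updated)
```

Because the weights for each node sum to 1, every output vector is a convex combination of the input hidden states.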

Preferably, the time attention mechanism is used to adaptively model the nonlinear relationships between different time steps of the same node; the temporal correlation changes continuously between time steps and is influenced by factors such as the traffic state, the corresponding time and the control state; therefore, the comprehensive embedding, which contains all three kinds of information, is combined with the hidden state, and a multi-head attention mechanism is applied to calculate the time attention score; for node $v_i$, the correlation between time step $t_j$ and time step $t$ is defined as follows:

$$u^{(k)}_{t_j,t} = \frac{\left\langle f^{(k)}_{t,1}\left(h^{(l-1)}_{v_i,t_j} \,\Vert\, e_{v_i,t_j}\right),\; f^{(k)}_{t,2}\left(h^{(l-1)}_{v_i,t} \,\Vert\, e_{v_i,t}\right) \right\rangle}{\sqrt{d}}$$

$$\beta^{(k)}_{t_j,t} = \frac{\exp\left(u^{(k)}_{t_j,t}\right)}{\sum_{t_r \in \mathcal{N}_{t_j}} \exp\left(u^{(k)}_{t_j,t_r}\right)}$$

wherein $u^{(k)}_{t_j,t}$ represents the correlation between time step $t_j$ and time step $t$ in the $k$-th time attention mechanism; $\beta^{(k)}_{t_j,t}$ represents the attention score of the importance of time step $t$ to time step $t_j$ in the $k$-th attention mechanism; $f^{(k)}_{t,1}(\cdot)$ and $f^{(k)}_{t,2}(\cdot)$ represent two different learnable nonlinear equations in the $k$-th time attention mechanism, of the same form as in spatial attention; $\mathcal{N}_{t_j}$ represents the set of all time steps prior to time step $t_j$; once the attention score $\beta^{(k)}_{t_j,t}$ is obtained, the hidden state of vertex $v_i$ at time step $t_j$ may be updated according to:

$$ht^{(l)}_{v_i,t_j} = \Big\Vert_{k=1}^{K} \left\{ \sum_{t \in \mathcal{N}_{t_j}} \beta^{(k)}_{t_j,t} \cdot f^{(k)}_{t,3}\left(h^{(l-1)}_{v_i,t}\right) \right\}$$

wherein $f^{(k)}_{t,3}(\cdot)$ represents a nonlinear equation in the $k$-th time attention mechanism, of the same form as in spatial attention; the learnable parameters in the above three equations are shared among all nodes and time steps and are computed in parallel.
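A minimal single-head sketch of this time attention for one node is given below. Each time step attends only to time steps up to and including itself. Including the current step in the attended set is an assumption made here so that the first step has something to attend to; the text only speaks of prior time steps. The names `softmax` and `temporal_attention` are illustrative, and the learnable nonlinear equations and multi-head split are left out.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [x / total for x in exps]


def temporal_attention(hidden, embed):
    """hidden[j], embed[j]: D-dim hidden state and comprehensive embedding
    of ONE node at time step j. Returns updated hidden states per step.
    Single head, no learned projections (illustrative simplification)."""
    joined = [h + e for h, e in zip(hidden, embed)]  # h || e, length 2D
    scale = math.sqrt(len(joined[0]))
    dim = len(hidden[0])
    output = []
    for j, query in enumerate(joined):
        # Causal restriction: only steps 0..j are visible (assumption: j included).
        visible = range(j + 1)
        scores = [sum(a * b for a, b in zip(query, joined[t])) / scale for t in visible]
        weights = softmax(scores)
        output.append([sum(w * hidden[t][d] for t, w in zip(visible, weights))
                       for d in range(dim)])
    return output


# Toy example: 3 time steps, D = 2 (values are made up).
hidden = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0]]
embed = [[0.1, 0.2], [0.3, 0.1], [0.0, 0.5]]
result = temporal_attention(hidden, embed)
print(result)
```

The first time step sees only itself, so its output equals its input; later steps blend in earlier hidden states according to the attention scores.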

Preferably, the gated fuser adaptively fuses the temporal and spatial representations, or the temporal, spatial and control representations; in the $l$-th layer, the outputs of the spatial attention mechanism and the time attention mechanism are denoted $H_S^{(l)}$ and $H_T^{(l)}$ respectively, and they are fused as follows:

$$H^{(l)} = z \odot H_S^{(l)} + (1 - z) \odot H_T^{(l)}$$

$$z = \sigma\left(H_S^{(l)} W_{z,1} + H_T^{(l)} W_{z,2} + b_z\right)$$

wherein $W_{z,1} \in \mathbb{R}^{D \times D}$, $W_{z,2} \in \mathbb{R}^{D \times D}$ and $b_z \in \mathbb{R}^{D}$ are learnable parameters, $\odot$ denotes element-wise multiplication, $\sigma(\cdot)$ denotes the sigmoid activation function, and $z$ denotes the gate; $H^{(l)}$ is the output of the $l$-th layer space-time attention mechanism module; the gated fuser can adaptively control the weights of the spatial and temporal dependencies for each vertex and each time step.
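As a hedged illustration of the gated fuser, the sketch below fuses one spatial and one temporal representation vector: a sigmoid gate is computed from both inputs, and the output is the gate-weighted element-wise mix of the two. The helper names (`sigmoid`, `matvec`, `gated_fusion`) and the toy parameter values are assumptions; in a trained model W_z,1, W_z,2 and b_z would be learned.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(x, w):
    """Row vector x (length D) times matrix w (D x D)."""
    return [sum(x[i] * w[i][j] for i in range(len(x))) for j in range(len(w[0]))]


def gated_fusion(h_s, h_t, w1, w2, b):
    """Fuse spatial (h_s) and temporal (h_t) D-dim vectors with a learned gate."""
    a = matvec(h_s, w1)
    c = matvec(h_t, w2)
    # Gate z in (0, 1), one value per feature dimension.
    z = [sigmoid(ai + ci + bi) for ai, ci, bi in zip(a, c, b)]
    # Element-wise mix: z * h_s + (1 - z) * h_t.
    return [zi * s + (1.0 - zi) * t for zi, s, t in zip(z, h_s, h_t)]


zeros = [[0.0, 0.0], [0.0, 0.0]]
# With zero weights and bias the gate is 0.5, so the result is the plain average.
balanced = gated_fusion([2.0, 4.0], [0.0, 0.0], zeros, zeros, [0.0, 0.0])
# A large positive bias pushes the gate towards 1, favouring the spatial branch.
spatial_heavy = gated_fusion([2.0, 4.0], [0.0, 0.0], zeros, zeros, [50.0, 50.0])
print(balanced, spatial_heavy)
```

The gate is computed per feature, so the model can lean on the spatial branch for some dimensions and the temporal branch for others.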

Preferably, the data conversion module of step (4) models the direct relationship between each future time step and each historical time step, so as to convert the encoded traffic characteristics into a future representation that serves as the input of the decoder; specifically, the encoded features $H^{(L)} \in \mathbb{R}^{P \times N \times D}$ are converted into a future sequence representation $H^{(L+1)} \in \mathbb{R}^{Q \times N \times D}$; for each node $v_i$, the relationship between the predicted time step $t_j$ ($t_j = t_{P+1}, \ldots, t_{P+Q}$) and the historical time step $t$ ($t = t_1, \ldots, t_P$) is measured through the comprehensive embedding:

$$\lambda^{(k)}_{t_j,t} = \frac{\left\langle f^{(k)}_{tr,1}\left(e_{v_i,t_j}\right),\; f^{(k)}_{tr,2}\left(e_{v_i,t}\right) \right\rangle}{\sqrt{d}}$$

$$\gamma^{(k)}_{t_j,t} = \frac{\exp\left(\lambda^{(k)}_{t_j,t}\right)}{\sum_{t_r = t_1}^{t_P} \exp\left(\lambda^{(k)}_{t_j,t_r}\right)}$$

wherein $\lambda^{(k)}_{t_j,t}$ represents the correlation between the predicted time step $t_j$ and the historical time step $t$ in the $k$-th attention mechanism; $\gamma^{(k)}_{t_j,t}$ represents the attention score of the importance of the historical time step $t$ to the predicted time step $t_j$ in the $k$-th attention mechanism; $f^{(k)}_{tr,1}(\cdot)$ and $f^{(k)}_{tr,2}(\cdot)$ represent two different learnable nonlinear equations in the $k$-th attention head, of the same form as in spatial attention; the attention score $\gamma^{(k)}_{t_j,t}$ is then used to adaptively select the relevant features of the $P$ historical time steps and convert the encoded traffic characteristics into the input of the decoder:

$$h^{(l)}_{v_i,t_j} = \Big\Vert_{k=1}^{K} \left\{ \sum_{t = t_1}^{t_P} \gamma^{(k)}_{t_j,t} \cdot f^{(k)}_{tr,3}\left(h^{(l-1)}_{v_i,t}\right) \right\}$$

wherein $f^{(k)}_{tr,3}(\cdot)$ represents a nonlinear equation in the $k$-th attention mechanism, of the same form as in spatial attention; $h^{(l-1)}_{v_i,t}$ is the input of node $v_i$ at historical time step $t$ in layer $l$; and $h^{(l)}_{v_i,t_j}$ is the converted vector representation output for node $v_i$ at the predicted time step $t_j$; in the above three formulas, the trainable parameters are shared among all nodes and time steps and can be computed in parallel.
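The conversion from P encoded historical steps to Q future steps can be sketched, for a single node and a single head, as attention in which the queries are the comprehensive embeddings of the future steps, the keys are the comprehensive embeddings of the historical steps, and the values are the encoded hidden states. The function names and toy numbers below are assumptions, and the learnable nonlinear equations are omitted.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [x / total for x in exps]


def transform_attention(h_hist, e_hist, e_future):
    """h_hist: P x D encoded states of one node; e_hist: P x D embeddings of
    the historical steps; e_future: Q x D embeddings of the future steps.
    Returns a Q x D representation used as decoder input.
    Single head, no learned projections (illustrative simplification)."""
    scale = math.sqrt(len(e_hist[0]))
    dim = len(h_hist[0])
    output = []
    for query in e_future:
        # Relevance of every historical step to this future step.
        scores = [sum(a * b for a, b in zip(query, key)) / scale for key in e_hist]
        weights = softmax(scores)
        # Each future step is built directly from all P historical states.
        output.append([sum(w * h[d] for w, h in zip(weights, h_hist)) for d in range(dim)])
    return output


# Toy example: P = 3 historical steps, Q = 2 future steps, D = 2 (made-up values).
h_hist = [[0.0, 0.0], [3.0, 6.0], [6.0, 12.0]]
e_hist = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
e_future = [[0.0, 0.0], [2.0, 2.0]]
future = transform_attention(h_hist, e_hist, e_future)
print(future)
```

Because every future step is computed directly from the historical states rather than from the previous prediction, errors are not fed forward from one predicted step to the next.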

Preferably, in step (5), the output of the data conversion module, $H^{(L+1)} \in \mathbb{R}^{Q \times N \times D}$, passes through the decoder, which consists of $L$ layers of space-time attention mechanism modules and outputs $H^{(2L+1)} \in \mathbb{R}^{Q \times N \times D}$; finally, a fully-connected layer outputs the predicted values for the $Q$ future time steps, $\hat{Y} \in \mathbb{R}^{Q \times N \times C}$.

A traffic characteristic prediction system comprises a data preprocessing module, a space-time data embedding module, a space-time attention mechanism module and a data conversion module; the data preprocessing module is used for acquiring the first traffic characteristic, the second traffic characteristic and the third traffic characteristic and processing them into the input of the space-time data embedding module; the space-time data embedding module feeds the first, second and third traffic characteristics into neural networks to obtain the spatial embedding, the time embedding and the control embedding, and combines them into the comprehensive embedding; the data conversion module is used for converting the encoded comprehensive embedding into the input of the decoder; the space-time attention mechanism module comprises an encoder and a decoder, wherein the encoder is used for encoding the comprehensive embedding output by the space-time data embedding module, and the decoder is used for decoding the output of the data conversion module to obtain the finally predicted traffic characteristics.

The invention has the following beneficial effects: (1) the invention introduces traffic control information and, through specific processing, combines it with traffic speed and road network structure information, effectively capturing the influence of control information on traffic characteristics and improving the effectiveness of speed prediction for signal-controlled road networks; (2) the invention provides a space-time attention mechanism that better models dynamic spatial correlation and nonlinear temporal correlation, and designs a gated fuser that adaptively fuses the information extracted by the spatial and time attention mechanisms; (3) the invention designs a data conversion module that transfers historical traffic characteristics to a future representation and models the direct relationship between historical and future time steps, which avoids error accumulation, mitigates error propagation, and improves performance on long-term traffic prediction.

Drawings

FIG. 1 is a schematic of the system of the present invention;

FIG. 2 is a schematic flow diagram of the method of the present invention;

FIG. 3 is a schematic diagram of a spatial attention mechanism capturing relationships between nodes over time in accordance with an embodiment of the present invention;

FIG. 4 is a schematic diagram of a temporal attention mechanism capturing relationships between nodes as a function of time in accordance with an embodiment of the present invention.

Detailed Description

The invention will be further described with reference to specific examples, but the scope of the invention is not limited thereto:

Example: as shown in FIG. 1, a traffic characteristic prediction system comprises a data preprocessing module, a space-time data embedding module, a space-time attention mechanism module and a data conversion module; the data preprocessing module is used for acquiring the first traffic characteristic, the second traffic characteristic and the third traffic characteristic and processing them into the input of the space-time data embedding module; the space-time data embedding module feeds the first, second and third traffic characteristics into neural networks to obtain the spatial embedding, the time embedding and the control embedding, and combines them into the comprehensive embedding; the data conversion module is used for converting the encoded comprehensive embedding into the input of the decoder; the space-time attention mechanism module comprises an encoder and a decoder, wherein the encoder is used for encoding the comprehensive embedding output by the space-time data embedding module, and the decoder is used for decoding the output of the data conversion module to obtain the finally predicted traffic characteristics.

As shown in fig. 2, a traffic characteristic prediction method includes the following steps:

S1, data preprocessing module: acquiring the first traffic characteristic (time-varying characteristic data generated by traffic operation, including but not limited to traffic speed, flow, occupancy, queue length and congestion index), the second traffic characteristic (characteristic data of the traffic operating environment that is invariant over short periods, including but not limited to road network structure, POI distribution and traffic infrastructure distribution) and the third traffic characteristic (time-varying characteristic data generated by controlling traffic operation, including but not limited to traffic control data, traffic guidance data and traffic restriction data), and processing them into the input of the space-time data embedding module. This specifically comprises the following steps:

S1.1, constructing a connection graph from the second traffic characteristic data. The second traffic characteristic is described by taking the road network structure as an example: the road network structure data includes, but is not limited to, the connection relationships between intersections in the actual road network and the lengths of the road segments connecting the intersections, and can be obtained from map websites and government agencies. Data samples are shown in Table 1 below:

TABLE 1

Further, a connection graph is constructed from the road network structure data. The topology of the urban road network is converted into a weighted directed connection graph $G = (V, E, A)$, where $V$ is the set of nodes, each representing a road segment in the actual road network; $V$ is a finite set with $|V| = N$, that is, the actual road network contains $N$ road segments; $E$ is the set of edges, representing the connectivity between road segments in the actual road network, with the direction of traffic flow between road segments taken as the direction of each edge. $A \in \mathbb{R}^{N \times N}$ is the weighted adjacency matrix, in which $A_{v_i,v_j}$ represents the weight of the edge from node $v_i$ to node $v_j$. Specifically, the weights of the adjacency matrix are calculated with a Gaussian weight model, and a threshold is used to control the density of the adjacency matrix:

$$A_{v_i,v_j} = \begin{cases} \exp\left(-\dfrac{d_{v_i,v_j}^{2}}{\sigma^{2}}\right), & \text{if } \exp\left(-\dfrac{d_{v_i,v_j}^{2}}{\sigma^{2}}\right) \ge \varepsilon \\ 0, & \text{otherwise} \end{cases}$$

wherein $d_{v_i,v_j}$ is the distance from node $v_i$ to node $v_j$, approximated by half of the sum of the lengths of the road segments represented by node $v_i$ and node $v_j$. $\sigma$ is the standard deviation of all distance values, and $\varepsilon$ is a threshold used to control the sparsity of the adjacency matrix, set to 0.1.
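The graph construction in S1.1 can be sketched in plain Python as follows. The sketch assumes a thresholded Gaussian kernel of the form exp(-d²/σ²), computes distances only for connected segment pairs, and uses made-up segment lengths; the function name `gaussian_adjacency` and its arguments are illustrative and not taken from the original text.

```python
import math


def gaussian_adjacency(lengths, edges, eps=0.1):
    """Build a weighted directed adjacency matrix.

    lengths: length of each road segment (one node per segment).
    edges:   directed (i, j) pairs following the traffic flow direction.
    eps:     sparsity threshold; weights below it are set to 0.
    """
    n = len(lengths)
    # Distance between two segments: half the sum of their lengths.
    dist = {(i, j): 0.5 * (lengths[i] + lengths[j]) for i, j in edges}
    values = list(dist.values())
    mean = sum(values) / len(values)
    sigma = math.sqrt(sum((d - mean) ** 2 for d in values) / len(values))
    if sigma == 0.0:
        raise ValueError("all distances are equal; sigma is zero")
    adj = [[0.0] * n for _ in range(n)]
    for (i, j), d in dist.items():
        weight = math.exp(-(d ** 2) / (sigma ** 2))
        if weight >= eps:  # keep only sufficiently strong connections
            adj[i][j] = weight
    return adj


# Hypothetical network: 4 segments in a chain 0 -> 1 -> 2 -> 3 (lengths in metres).
lengths = [40.0, 60.0, 740.0, 860.0]
edges = [(0, 1), (1, 2), (2, 3)]
adj = gaussian_adjacency(lengths, edges)
for row in adj:
    print([round(w, 3) for w in row])
```

In this toy network the short pair (0, 1) receives a weight close to 1, the pair (1, 2) a small weight, and the long pair (2, 3) falls below the threshold and is dropped, which is how ε controls the sparsity of the matrix.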

S1.2, normalizing the first traffic characteristic data. Taking traffic speed as an example: the traffic speed data refers to road segment traffic speed data acquired with mobile detection technology or cross-section detection technology, and includes, but is not limited to, the road segment number, time and speed. Data samples are shown in Table 2 below:

Road segment number	Time	Speed
14L020980T0	2021-03-12 09:02:00	30
14L020979T0	2021-03-12 09:03:00	40
14L020978T0	2021-03-12 09:04:00	35
14L020977T0	2021-03-12 09:05:00	36

TABLE 2

The specific processing flow is as follows: the original traffic data at time $t$ is represented as $X_t \in \mathbb{R}^{N \times C}$, where $N$ is the number of nodes and $C$ is the number of features per node; here $C = 1$, covering only the road segment speed; $C = 2$ would cover road segment speed and flow; $C = 3$ would cover road segment speed, flow and density. Then $X_t$ is normalized using a method such as the Z-score method or the max-min method.
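The two normalization methods named above can be sketched as follows, using the four speed values from Table 2 purely as sample input. The function names are illustrative; in practice the mean, standard deviation, minimum and maximum would be computed on the training data and reused at prediction time, which is an assumption about usage rather than something the text specifies.

```python
import math


def z_score(values):
    """Z-score normalization: subtract the mean, divide by the standard deviation."""
    mean = sum(values) / len(values)
    std = math.sqrt(sum((v - mean) ** 2 for v in values) / len(values))
    return [(v - mean) / std for v in values]


def min_max(values):
    """Max-min normalization: rescale to the range [0, 1]."""
    low, high = min(values), max(values)
    return [(v - low) / (high - low) for v in values]


speeds = [30.0, 40.0, 35.0, 36.0]  # speed column of Table 2
z_speeds = z_score(speeds)
mm_speeds = min_max(speeds)
print(z_speeds)
print(mm_speeds)
```

Both functions assume the input is not constant; a constant series would give a zero denominator and needs separate handling.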

S1.3, matching the third traffic characteristic data to road segments, converting it into a period index and a green signal ratio index, and then discretizing these indices. Taking control data as an example: the control data refers to the signal period (cycle), green signal ratio (split) and other data of intersections. Data samples are shown in Table 3 below:

TABLE 3

Matching the control data to a road segment is specifically as follows: in combination with the road network structure data, the period of the intersection downstream of the road segment is taken as the period of the road segment, and the green signal ratio of the phase that allows vehicles on the road segment to enter the intersection is taken as the green signal ratio of the road segment. Then, the period index is calculated as the ratio of the road segment period to the historical maximum road segment period, and the green signal ratio index is calculated as the ratio of the road segment green signal ratio to the historical maximum green signal ratio. The data obtained are shown in Table 4 below:

TABLE 4

Then, discretizing the period index and the green signal ratio index according to a corresponding table shown in the following table 5:

TABLE 5

S2, space-time data embedding module: the first, second and third traffic characteristics processed in the previous step are fed into neural networks to obtain the spatial embedding, the time embedding and the control embedding, which are finally combined into the comprehensive embedding. This specifically comprises the following steps:

S2.1, generating the spatial embedding. Specifically: vector representations of the vertices of the connection graph constructed in S1.1 are learned using methods such as DeepWalk, node2vec and GraphSAGE. These vectors are then fed into a two-layer fully-connected neural network to obtain the spatial embedding, denoted $e^{S}_{v_i} \in \mathbb{R}^{D}$ for node $v_i$.

S2.2, generating the time embedding. Specifically: each timestamp of the traffic speed data in S1.2 is encoded into a vector. The day of the week and the time of day are one-hot encoded into $\mathbb{R}^{7}$ and $\mathbb{R}^{T}$ respectively ($T$ may be 24 when counted in hours or 1440 when counted in minutes) and concatenated into a vector in $\mathbb{R}^{7+T}$. For example, for a timestamp in the 09:00 hour of 2021-03-12 (a Friday), the day of the week is encoded into the 7-dimensional vector [0000100], the time of day is encoded on a 24-hour basis into the 24-dimensional vector [000000000100000000000000], and the two are concatenated into the 31-dimensional vector [0000100000000000100000000000000]. The vector is then fed into a neural network with two or more layers, such as a fully-connected neural network, a recurrent neural network or a deep belief network, and converted into a $D$-dimensional vector, namely the time embedding, denoted $e^{T}_{t_j} \in \mathbb{R}^{D}$ with $t_j = t_1, \ldots, t_P, \ldots, t_{P+Q}$,

where $P$ represents the number of historical input time steps and $Q$ represents the number of future time steps to be predicted.
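The time encoding in S2.2 can be reproduced with the Python standard library. The sketch below one-hot encodes the day of the week (Monday as position 0) and the time-of-day step, then concatenates the two. The function names and the minute value of the sample timestamp are assumptions chosen for illustration.

```python
from datetime import datetime


def one_hot(index, size):
    """Return a list of `size` zeros with a single 1 at `index`."""
    vec = [0] * size
    vec[index] = 1
    return vec


def encode_time(ts, steps_per_day=24):
    """Encode a timestamp as a (7 + T)-dimensional vector.

    steps_per_day is T: 24 for hourly steps, 1440 for one-minute steps.
    """
    day_vec = one_hot(ts.weekday(), 7)  # Monday = 0 ... Sunday = 6
    minutes = ts.hour * 60 + ts.minute
    step = minutes * steps_per_day // 1440  # index of the time-of-day step
    return day_vec + one_hot(step, steps_per_day)


# 2021-03-12 is a Friday; the minute (02) is an arbitrary illustrative choice.
vec = encode_time(datetime(2021, 3, 12, 9, 2))
print("".join(str(v) for v in vec))  # 0000100000000000100000000000000
```

Calling `encode_time` with `steps_per_day=1440` gives the 1447-dimensional variant for one-minute time steps.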

S2.3, generating the control embedding. Specifically: the discretized period index and green signal ratio index obtained in S1.3 are each one-hot encoded into $\mathbb{R}^{10}$ and concatenated into a vector in $\mathbb{R}^{20}$. For example, if the discretized period index takes the values (0.5, 1.5, 2.5, 3.5, 4.5, 5.5, 6.5, 7.5, 8.5, 9.5), the corresponding one-hot codes are ([0000000000], [1000000000], [0100000000], [0010000000], [0001000000], [0000100000], [0000010000], [0000001000], [0000000100], [0000000010]) respectively. If the one-hot code obtained from the period index is [1000000000] and the one-hot code obtained from the green signal ratio index is [0000100000], the concatenated vector is [10000000000000100000]. These vectors (one for each control data record) are then fed into a neural network with two or more layers, such as a fully-connected neural network, a recurrent neural network or a deep belief network, to obtain the control embedding, denoted $e^{C}_{v_i,t_j} \in \mathbb{R}^{D}$.
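A minimal sketch of the control encoding pipeline is shown below: compute the period index and green signal ratio index as ratios to their historical maxima, discretize each into one of ten levels, one-hot encode, and concatenate into a 20-dimensional vector. The correspondence table (Table 5) is not reproduced in this text, so the equal-width binning in `to_bin` is a hypothetical stand-in, and all names and sample values are assumptions.

```python
def one_hot(index, size=10):
    """Return a list of `size` zeros with a single 1 at `index`."""
    vec = [0] * size
    vec[index] = 1
    return vec


def to_bin(index_value, bins=10):
    """Hypothetical equal-width binning of an index in [0, 1].
    Stands in for the discretization table, which is not available here."""
    return min(int(index_value * bins), bins - 1)


def encode_control(period, max_period, green_ratio, max_green_ratio):
    """Encode one control record as a 20-dimensional vector."""
    period_index = period / max_period            # ratio to historical maximum period
    green_index = green_ratio / max_green_ratio   # ratio to historical maximum green ratio
    return one_hot(to_bin(period_index)) + one_hot(to_bin(green_index))


# Made-up record: 60 s period (historical max 180 s), green ratio 0.45 (historical max 0.6).
control_vec = encode_control(60.0, 180.0, 0.45, 0.6)
print("".join(str(v) for v in control_vec))
```

The first ten positions carry the period level and the last ten the green signal ratio level, matching the layout of the concatenated vector in the example above.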

S2.4 Synthesize the comprehensive embedding. Specifically, the spatial embedding, the time embedding and the control embedding are integrated into a comprehensive embedding. For node $v_i$ at time step $t_j$, the comprehensive embedding is defined as

Or

where α, β and γ are trainable weights. The comprehensive embedding of the N nodes over the P + Q time steps is therefore denoted $E \in \mathbb{R}^{(P+Q) \times N \times D}$. The comprehensive embedding contains temporal, spatial and control information.
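The shape $E \in \mathbb{R}^{(P+Q) \times N \times D}$ can be illustrated with NumPy broadcasting. This is only a sketch under stated assumptions: the defining formulas are not reproduced in this text, so the combination below assumes a weighted sum with weights α, β and γ, and it assumes the spatial embedding has one D-dimensional vector per node, the time embedding one per time step, and the control embedding one per (time step, node) pair. All array names and the fixed weight values are hypothetical stand-ins for trained quantities.

```python
import numpy as np

P, Q, N, D = 3, 2, 4, 8
rng = np.random.default_rng(0)

spatial = rng.normal(size=(N, D))         # one vector per node
temporal = rng.normal(size=(P + Q, D))    # one vector per time step
control = rng.normal(size=(P + Q, N, D))  # one vector per (time step, node)

# Stand-ins for the trainable weights alpha, beta, gamma.
alpha, beta, gamma = 0.5, 0.3, 0.2

# Broadcasting aligns the node axis and the time axis before the weighted sum.
E = alpha * spatial[None, :, :] + beta * temporal[:, None, :] + gamma * control
print(E.shape)  # (5, 4, 8)
```

Setting all three weights to 1 reduces the sketch to a plain sum of the three embeddings.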

S3, encoder based on the space-time attention mechanism module: the comprehensive embedding obtained in the previous step is processed and encoded by space-time attention mechanism modules. Specifically:

Before entering the encoder, the normalized speed data $X \in \mathbb{R}^{P \times N \times C}$ from S1.2 is converted into $H^{(0)} \in \mathbb{R}^{P \times N \times D}$ by a fully connected layer. $H^{(0)}$ then passes through an encoder consisting of L layers of space-time attention mechanism modules, which produces the output $H^{(L)} \in \mathbb{R}^{P \times N \times D}$.

The space-time attention mechanism module fuses a temporal attention mechanism and a spatial attention mechanism through a gated fusion unit. The input to the l-th layer space-time attention mechanism module is denoted $H^{(l-1)}$, in which the hidden state of node $v_i$ at time step $t_j$ is denoted

And

The spatial attention mechanism captures spatial correlation. Spatial correlation means that the traffic state of one road (or road section) is influenced by other roads (or road sections); these influences are highly dynamic and change over time. To model this, the invention designs a spatial attention mechanism that adaptively captures the relations between the traffic characteristics of different road sections in the road network. Its core is to dynamically assign different weights to different nodes at different time steps, as shown in FIG. 3.

For node $v_i$ at time step $t_j$, the weighted sum over all nodes is calculated as:

where V represents the set of all nodes,

is the output of the spatial attention mechanism for node $v_i$ at time step $t_j$ in the l-th layer space-time attention mechanism module.

Further, the attention score is calculated as follows. At a given time step, both the current traffic state and the road network structure affect the relationship between sensors. For example, congestion on one road may affect the traffic state of its neighboring roads. Motivated by this, the attention score is learned by jointly considering the traffic characteristics, the graph structure and the control information. Specifically, the comprehensive embedding is concatenated with the hidden state, and the scaled dot-product method is applied to calculate the relevance between node $v$ and node $v_i$:

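where $\|$ denotes the concatenation operation, $\langle \cdot, \cdot \rangle$ denotes the inner product, and $2D$ is the dimension of the concatenated vector. The softmax activation function is then used to normalize the relevance: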

In particular, to make the learning process more stable, the spatial attention mechanism is extended to a multi-head attention mechanism. That is, K parallel attention mechanisms are set up, with K different sets of learnable functions:

where

and

represent three different nonlinear functions of the k-th spatial attention head, each of which outputs a vector of dimension $d = D/K$. The nonlinear functions take the form:

$$f(x) = \mathrm{ReLU}(xW + b)$$
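The spatial attention described above can be sketched with NumPy for a single time step. This is an illustrative reading of the prose, not the patent's exact formulas, which are not reproduced in this text: it assumes that the queries and keys are obtained by applying f(x) = ReLU(xW + b) to the concatenation of hidden state and comprehensive embedding, that the values are obtained from the hidden state alone, and that the K head outputs are concatenated back to D dimensions. The function names and the random parameters are hypothetical.

```python
import numpy as np

def relu_layer(x, W, b):
    """Nonlinear map f(x) = ReLU(xW + b) quoted in the text."""
    return np.maximum(x @ W + b, 0.0)

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)  # subtract the max for numerical stability
    ex = np.exp(x)
    return ex / ex.sum(axis=axis, keepdims=True)

def spatial_attention(h, e, heads):
    """Multi-head spatial attention at one time step.

    h, e  : (N, D) hidden states and comprehensive embeddings of the N nodes.
    heads : list of K dicts with hypothetical (W, b) pairs under "f1", "f2", "f3".
    Returns the (N, D) output and the (K, N, N) attention weights.
    """
    x = np.concatenate([h, e], axis=-1)          # (N, 2D): hidden state || embedding
    outputs, weights = [], []
    for p in heads:
        q = relu_layer(x, *p["f1"])              # (N, d), one row per target node v_i
        k = relu_layer(x, *p["f2"])              # (N, d), one row per source node v
        v = relu_layer(h, *p["f3"])              # (N, d), values from hidden states
        scores = q @ k.T / np.sqrt(q.shape[-1])  # scaled dot product, (N, N)
        attn = softmax(scores, axis=-1)          # row i: weights over all nodes v
        outputs.append(attn @ v)                 # weighted sum over all nodes
        weights.append(attn)
    return np.concatenate(outputs, axis=-1), np.stack(weights)

def init_heads(D, K, rng):
    """Random stand-ins for trained parameters (illustration only)."""
    d = D // K
    return [{"f1": (rng.normal(size=(2 * D, d)), np.zeros(d)),
             "f2": (rng.normal(size=(2 * D, d)), np.zeros(d)),
             "f3": (rng.normal(size=(D, d)), np.zeros(d))} for _ in range(K)]

rng = np.random.default_rng(0)
N, D, K = 5, 8, 2
h, e = rng.normal(size=(N, D)), rng.normal(size=(N, D))
out, attn = spatial_attention(h, e, init_heads(D, K, rng))
print(out.shape, attn.shape)  # (5, 8) (2, 5, 5)
```

Each row of `attn[k]` sums to 1, matching the requirement that the attention scores over all nodes are normalized by softmax.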

Further, the temporal attention mechanism adaptively models the nonlinear relations between different time steps of the same node. The temporal dependency changes continuously across time steps and is influenced by the traffic state, the corresponding time and the control state. The invention therefore combines the hidden state with the comprehensive embedding, which contains these three kinds of information, and applies a multi-head attention mechanism to calculate the temporal attention score, as shown in FIG. 4.

For node $v_i$, the correlation between time step $t_j$ and time step $t$ is defined as follows:

where

denotes the correlation between time step $t_j$ and time step $t$ in the k-th temporal attention head,

denotes the attention score of the k-th attention head, indicating the importance of time step $t$ to time step $t_j$,

represent two different learnable nonlinear functions in the k-th temporal attention head, of the same form as in the spatial attention mechanism above, and

denotes the set of all time steps before time step $t_j$; that is, only information from time steps earlier than the target time step is considered, which preserves causality. Once the attention score

is obtained, the hidden state of node $v_i$ at time step $t_j$ can be updated according to:

where

represents a nonlinear function in the k-th temporal attention head, of the same form as in the spatial attention mechanism above. The learnable parameters in the three equations above are shared across all nodes and time steps and are computed in parallel.
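The causal restriction in the temporal attention can be sketched for one head and one node as follows. This is a sketch under stated assumptions rather than the patent's formulas, which are not reproduced in this text: the bias terms of the nonlinear functions are omitted for brevity, the parameter matrices are random stand-ins, and the mask lets each time step attend to itself as well as to earlier steps so that the first time step has at least one admissible entry. The function name is hypothetical.

```python
import numpy as np

def temporal_attention_head(h, e, W_q, W_k, W_v):
    """One temporal attention head for a single node.

    h, e : (T, D) hidden states and comprehensive embeddings over T time steps.
    Returns the (T, d) updated states and the (T, T) attention weights.
    """
    x = np.concatenate([h, e], axis=-1)           # (T, 2D): hidden state || embedding
    q = np.maximum(x @ W_q, 0.0)                  # ReLU projections, bias omitted
    k = np.maximum(x @ W_k, 0.0)
    v = np.maximum(h @ W_v, 0.0)
    scores = q @ k.T / np.sqrt(q.shape[-1])       # (T, T) scaled dot product
    T = h.shape[0]
    causal = np.tril(np.ones((T, T), dtype=bool)) # row j may see columns t <= j
    scores = np.where(causal, scores, -np.inf)    # later time steps get zero weight
    scores -= scores.max(axis=-1, keepdims=True)  # numerical stability
    w = np.exp(scores)
    w /= w.sum(axis=-1, keepdims=True)            # softmax over admissible steps
    return w @ v, w

rng = np.random.default_rng(0)
T, D, d = 6, 8, 4
h, e = rng.normal(size=(T, D)), rng.normal(size=(T, D))
out, w = temporal_attention_head(
    h, e,
    rng.normal(size=(2 * D, d)), rng.normal(size=(2 * D, d)), rng.normal(size=(D, d)),
)
print(out.shape, w.shape)  # (6, 4) (6, 6)
```

The strictly upper-triangular part of `w` is zero, so no time step receives information from a later one.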

Further, the gated fusion unit adaptively fuses the temporal and spatial representations, or the temporal, spatial and control representations. In the l-th layer space-time attention mechanism module, the outputs of the spatial attention mechanism and the temporal attention mechanism are denoted respectively as

and

The fusion is performed as follows:

where $W_{z,1} \in \mathbb{R}^{D \times D}$, $W_{z,2} \in \mathbb{R}^{D \times D}$ and $b_z \in \mathbb{R}^{D}$ are learnable parameters, $\odot$ denotes element-wise multiplication, $\sigma(\cdot)$ denotes the sigmoid activation function, and $z$ is the gate. $H^{(l)}$ is the output of the l-th layer space-time attention mechanism module. The gated fusion unit adaptively controls the weights of the spatial and temporal dependencies at each node and each time step.
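A gate of this kind can be sketched in NumPy as below. The fusion formula itself is not reproduced in this text, so the sketch assumes the common form in which the gate z is a sigmoid of a linear combination of the two inputs and the output is the convex combination z ⊙ H_S + (1 − z) ⊙ H_T. The names `H_s`, `H_t` and `gated_fusion` are hypothetical, and the parameters are random stand-ins for trained values.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gated_fusion(H_s, H_t, W1, W2, b):
    """Fuse spatial and temporal outputs of shape (P, N, D) with a learned gate.

    Assumed form: z = sigmoid(H_s W1 + H_t W2 + b), H = z * H_s + (1 - z) * H_t.
    """
    z = sigmoid(H_s @ W1 + H_t @ W2 + b)  # gate in (0, 1), same shape as the inputs
    return z * H_s + (1.0 - z) * H_t      # element-wise convex combination

rng = np.random.default_rng(0)
P, N, D = 3, 4, 8
H_s, H_t = rng.normal(size=(P, N, D)), rng.normal(size=(P, N, D))
H = gated_fusion(H_s, H_t, rng.normal(size=(D, D)), rng.normal(size=(D, D)), np.zeros(D))
print(H.shape)  # (3, 4, 8)
```

Because the gate is computed per node, per time step and per feature, the balance between the spatial and temporal representations can differ at every position.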

S4, data conversion module: the encoded comprehensive embedding is converted to serve as the input of the decoder. Specifically, a conversion attention mechanism converts the encoded comprehensive embedding into the input of the decoder. To reduce error propagation across prediction time steps in long-term prediction, the invention adds a data conversion module between the encoder and the decoder. It models the direct relationship between each future time step and each historical time step, converting the encoded traffic characteristics into a future representation that serves as the input of the decoder. Specifically, the encoded features $H^{(L)} \in \mathbb{R}^{P \times N \times D}$ are converted into a future sequence representation $H^{(L+1)} \in \mathbb{R}^{Q \times N \times D}$. For each node $v_i$, the relevance between the prediction time step $t_j$ ($t_j = t_{P+1}, \ldots, t_{P+Q}$) and the historical time step $t$ ($t = t_1, \ldots, t_P$) is measured through the comprehensive embedding:

where

denotes the attention score of the k-th attention head, indicating the importance of historical time step $t$ to prediction time step $t_j$,

represent two different learnable nonlinear functions in the k-th conversion attention head, of the same form as in the spatial attention mechanism above. The attention score is then used to convert the encoded features:

represents a nonlinear function in the k-th temporal attention mechanism, of the same form as in the spatial attention mechanism above,

is the input of node $v_i$ at historical time step $t$ in layer $l$, and

is the output for node $v_i$ at prediction time step $t_j$ after conversion. In the three formulas above, the trainable parameters are shared across all nodes and time steps and can be computed in parallel.
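The conversion from P historical steps to Q future steps can be sketched for one attention head as follows. The formulas are not reproduced in this text, so this sketch assumes that queries come from the comprehensive embeddings of the future time steps, keys from the comprehensive embeddings of the historical time steps, and values from the encoder output, with the softmax taken over the historical steps. Bias terms are omitted, and the function name and random parameters are hypothetical.

```python
import numpy as np

def conversion_attention_head(H_enc, E_hist, E_fut, W_q, W_k, W_v):
    """Map encoder output (P, N, D) to a future representation (Q, N, d).

    E_hist : (P, N, D) comprehensive embeddings of the historical time steps.
    E_fut  : (Q, N, D) comprehensive embeddings of the prediction time steps.
    """
    q = np.maximum(E_fut @ W_q, 0.0)    # (Q, N, d) queries from future embeddings
    k = np.maximum(E_hist @ W_k, 0.0)   # (P, N, d) keys from historical embeddings
    v = np.maximum(H_enc @ W_v, 0.0)    # (P, N, d) values from encoded features
    # scores[n, j, t]: relevance of historical step t to future step j for node n
    scores = np.einsum("qnd,pnd->nqp", q, k) / np.sqrt(q.shape[-1])
    scores -= scores.max(axis=-1, keepdims=True)
    w = np.exp(scores)
    w /= w.sum(axis=-1, keepdims=True)  # softmax over the P historical steps
    out = np.einsum("nqp,pnd->qnd", w, v)
    return out, w

rng = np.random.default_rng(0)
P, Q, N, D, d = 4, 3, 5, 8, 4
H_enc = rng.normal(size=(P, N, D))
E = rng.normal(size=(P + Q, N, D))      # comprehensive embedding over P + Q steps
out, w = conversion_attention_head(
    H_enc, E[:P], E[P:],
    rng.normal(size=(D, d)), rng.normal(size=(D, d)), rng.normal(size=(D, d)),
)
print(out.shape, w.shape)  # (3, 5, 4) (5, 3, 4)
```

Every future time step is computed directly from the historical steps, so an error at one prediction step is not fed into the next one.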

S5, decoder based on the space-time attention mechanism module: the output of the data conversion module from the previous step is processed and decoded by space-time attention mechanism modules to obtain the final predicted traffic characteristics. The output of the data conversion module is $H^{(L+1)} \in \mathbb{R}^{Q \times N \times D}$. The decoder comprises L layers of space-time attention mechanism modules and outputs $H^{(2L+1)} \in \mathbb{R}^{Q \times N \times D}$. Finally, a fully connected layer outputs the predicted values for the Q future time steps

In conclusion, the invention introduces control information into traffic characteristic prediction and processes the traffic speed, road network structure and traffic control data separately. The processed data are fed into neural networks to obtain the spatial embedding, the time embedding and the control embedding, which are combined into the comprehensive embedding. The predicted traffic speed values are finally obtained through the encoder based on the space-time attention mechanism module, the data conversion module and the decoder based on the space-time attention mechanism module. The main beneficial effects are as follows:

(I) Prior related studies generally model the relation between the predicted traffic characteristics, the historical traffic characteristics and the traffic characteristics of nearby areas from the perspective of temporal or spatial correlation, and ignore the influence of control information on the predicted values. The invention introduces traffic control information and, through specific processing, combines it with the traffic speed and road network structure information, which effectively captures the influence of control information on traffic characteristics and improves the effectiveness of speed prediction on signal-controlled road networks. In a road network, a signalized intersection controls the passage of vehicles through the intersection. Without signal control, vehicles pass through the intersection freely; under signal control, vehicles may be prevented from directly entering the next road section, and changes in green-light duration affect the correlation of speed, flow and other characteristics between upstream and downstream road sections. The control information is therefore introduced as an embedding used in computing the temporal and spatial attention scores, describing this influence as a supplement to the spatial and temporal information.

(II) A space-time attention mechanism is provided that better models dynamic spatial correlation and nonlinear temporal correlation, and a gated fusion unit is designed to adaptively fuse the information extracted by the spatial and temporal attention mechanisms.

(III) A data conversion module is designed that transfers historical traffic characteristics to future characteristics and models the direct relationship between historical and future time steps. This avoids error accumulation and alleviates the error propagation effect, thereby improving long-term traffic flow prediction performance and making the method suitable for long-term traffic prediction.

While the invention has been described in connection with specific embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.

Claims

1. A traffic characteristic prediction method is characterized by comprising the following steps:

(1) The data preprocessing module acquires a first traffic characteristic, a second traffic characteristic and a third traffic characteristic and processes them into the input of the space-time data embedding module; wherein the first traffic characteristic is time-varying characteristic data generated by traffic operation; the second traffic characteristic is characteristic data, invariant over short periods, of the environment in which traffic operates; the third traffic characteristic is time-varying characteristic data generated by controlling traffic operation; the method specifically comprises the following steps:

(1.1) constructing a connection graph from the second traffic characteristic data; wherein the second traffic characteristic refers to road network structure data, and the topology of the urban road network is converted into a weighted directed graph $G = (V, E, A)$, where $V$ is the set of nodes, representing the road sections in the actual road network, and is a finite set with $|V| = N$, that is, the number of road sections in the actual road network is N; $E$ is the set of edges, representing the connectivity between road sections in the actual road network, with the direction of traffic flow between road sections taken as the direction of the edges; $A \in \mathbb{R}^{N \times N}$ is the weighted adjacency matrix, in which

represents the weight from node $v_i$ to node $v_j$; specifically, the weights of the adjacency matrix are calculated using a Gaussian weight model, and the density of the adjacency matrix is controlled by a threshold;

is the distance from node $v_i$ to node $v_j$, approximated by half of the sum of the lengths of the road sections represented by node $v_i$ and node $v_j$; $\sigma$ is the standard deviation of all distance values, and $\varepsilon$ is a threshold used to control the sparsity of the adjacency matrix;

(1.2) normalizing the first traffic characteristic data; wherein the first traffic characteristic data refers to traffic speed data, and the specific processing flow is as follows: the original traffic data at time t is represented as $X_t \in \mathbb{R}^{N \times C}$, where N is the number of nodes and C is the number of node features; here C is taken as 1, including only the speed of the road section; when C is 2, the road section speed and road section flow are included; when C is 3, the road section speed, road section flow and road section density are included; finally, $X_t$ is normalized by the Z-score method or the max-min method;

(1.3) matching the third traffic characteristic data to road sections, converting it into a period index and a green ratio index, discretizing these indexes, and encoding them as one-hot codes; wherein the third traffic characteristic refers to control data, namely the signal period and green ratio data of the road, and matching the control data to a road section specifically comprises: in combination with the road network structure data, taking the period of the intersection downstream of the road section as the period of the road section, and taking the green ratio of the phase in which vehicles on the road section can enter the intersection as the green ratio of the road section; then calculating the period index as the ratio of the road section period to the historical maximum road section period, and calculating the green ratio index as the ratio of the road section green ratio to the historical maximum green ratio; and discretizing the period index and the green ratio index according to a correspondence table;

(2) The space-time data embedding module respectively sends the traffic first characteristic, the traffic second characteristic and the traffic third characteristic processed in the step (1) into a neural network to obtain space embedding, time embedding and control embedding, and the space embedding, the time embedding and the control embedding are combined into comprehensive embedding;

2. A traffic characteristic prediction method according to claim 1, characterized in that: the step (2) is specifically as follows:

$v_i \in V$;

(2.2) generating the time embedding, specifically: encoding each time stamp of the traffic speed data in step (1.2) as a vector; encoding the day of the week and the time step within the day into $\mathbb{R}^{7}$ and $\mathbb{R}^{T}$ respectively, and concatenating them into a vector in $\mathbb{R}^{7+T}$; feeding the vector into a neural network of two or more layers and converting it into a D-dimensional vector, which is the time embedding, expressed as

$t_j \in \{t_1, \ldots, t_P, \ldots, t_{P+Q}\}$, wherein P represents the number of historical time steps of the input and Q represents the number of time steps of the output to be predicted;

(2.3) generating the control embedding, specifically: encoding the discretized period index and the discretized green ratio index obtained in step (1.3) into $\mathbb{R}^{10}$ each, and concatenating them into a vector in $\mathbb{R}^{20}$; collecting the vector corresponding to each control data item and feeding it into a neural network of two or more layers to obtain the control embedding, expressed as

$v_i \in V$, $t_j \in \{t_1, \ldots, t_P, \ldots, t_{P+Q}\}$;

(2.4) synthesizing the comprehensive embedding, specifically: integrating the spatial embedding, the time embedding and the control embedding into the comprehensive embedding; for node $v_i$ at time step $t_j$, the comprehensive embedding is defined as

Or

wherein α, β and γ are trainable weights; the comprehensive embedding of the N nodes over the P + Q time steps is thus denoted $E \in \mathbb{R}^{(P+Q) \times N \times D}$; the comprehensive embedding contains temporal, spatial and control information.

3. A traffic characteristic prediction method according to claim 1, characterized in that: when step (3) processes the comprehensive embedding obtained in step (2) with the space-time attention mechanism module, before entering the encoder, the normalized speed data $X \in \mathbb{R}^{P \times N \times C}$ from step (1.2) is converted into $H^{(0)} \in \mathbb{R}^{P \times N \times D}$ by a fully connected layer; $H^{(0)}$ then passes through an encoder of L layers of space-time attention mechanism modules to obtain the output $H^{(L)} \in \mathbb{R}^{P \times N \times D}$; the space-time attention mechanism module is formed by fusing a temporal attention mechanism and a spatial attention mechanism through a gated fusion unit; the input to the l-th layer space-time attention mechanism module is denoted $H^{(l-1)}$, wherein the hidden state of node $v_i$ at time step $t_j$ is represented as

And

4. A traffic characteristic prediction method according to claim 3, characterized in that: the spatial attention mechanism adaptively captures the relations between the traffic characteristics of different road sections in the road network, and its core is to dynamically assign different weights to different nodes at different time steps; wherein, for node $v_i$ at time step $t_j$, the weighted sum over all nodes is calculated as:

where V represents the set of all nodes,

denotes the attention score indicating the importance of node $v$ to node $v_i$, the attention scores summing to 1, i.e.

is the input of node $v_i$ at time step $t_j$ in the l-th layer space-time attention mechanism module,

5. The traffic characteristic prediction method according to claim 4, characterized in that the attention score is calculated as follows: at a given time step, the current traffic state and the road network structure both affect the relationship between sensors; the attention score is learned by jointly considering the traffic characteristics, the graph structure and the control information; the comprehensive embedding is concatenated with the hidden state, and the scaled dot-product method is applied to calculate the relevance between node $v$ and node $v_i$:

wherein

denotes the comprehensive embedding of node $v_i$ at time step $t_j$, $\|$ denotes the concatenation operation, $\langle \cdot, \cdot \rangle$ denotes the inner product, and $2D$ is the dimension of

the softmax activation function is then used to normalize

as follows:

in order to make the learning process more stable, the spatial attention mechanism is extended to a multi-head attention mechanism; namely, K parallel attention mechanisms are set up, with K different sets of learnable functions:

wherein

and

represent three different nonlinear functions of the k-th spatial attention head, each of which finally outputs a vector of dimension $d = D/K$; the nonlinear functions take the form:

$$f(x) = \mathrm{ReLU}(xW + b)$$

6. A traffic characteristic prediction method according to claim 3, characterized in that: the temporal attention mechanism adaptively models the nonlinear relations between different time steps of the same node; these nonlinear relations change continuously across time steps and are influenced by factors such as the traffic state, the corresponding time and the control state; therefore, the comprehensive embedding containing these three kinds of information is combined with the hidden state, and a multi-head attention mechanism is applied to calculate the temporal attention score; wherein, for node $v_i$, the correlation between time step $t_j$ and time step $t$ is defined as follows:

wherein

represent two different learnable nonlinear functions in the k-th temporal attention head, of the same form as in the spatial attention mechanism;

the hidden state of node $v_i$ at time step $t_j$ can then be updated according to:

wherein

7. A traffic characteristic prediction method according to claim 3, characterized in that: the gated fusion unit adaptively fuses the temporal and spatial representations, or the temporal, spatial and control representations; in the l-th layer space-time attention mechanism module, the outputs of the spatial attention mechanism and the temporal attention mechanism are denoted respectively as

and

the fusion is performed as follows:

wherein $W_{z,1} \in \mathbb{R}^{D \times D}$, $W_{z,2} \in \mathbb{R}^{D \times D}$ and $b_z \in \mathbb{R}^{D}$ are learnable parameters, $\odot$ denotes element-wise multiplication, $\sigma(\cdot)$ denotes the sigmoid activation function, and $z$ is the gate; $H^{(l)}$ is the output of the l-th layer space-time attention mechanism module; the gated fusion unit adaptively controls the weights of the spatial and temporal dependencies at each node and each time step.

8. A traffic characteristic prediction method according to claim 1, characterized in that: the data conversion module of step (4) models the direct relationship between each future time step and each historical time step, converting the encoded traffic characteristics into a future representation that serves as the input of the decoder; specifically, the encoded features $H^{(L)} \in \mathbb{R}^{P \times N \times D}$ are converted into a future sequence representation $H^{(L+1)} \in \mathbb{R}^{Q \times N \times D}$; for each node $v_i$, the relevance between the prediction time step $t_j$ ($t_j = t_{P+1}, \ldots, t_{P+Q}$) and the historical time step $t$ ($t = t_1, \ldots, t_P$) is measured through the comprehensive embedding:

wherein

denotes the attention score of the k-th attention head, indicating the importance of historical time step $t$ to prediction time step $t_j$;

represent two different learnable nonlinear functions in the k-th conversion attention head, of the same form as in the spatial attention mechanism; the attention score is then used to convert the encoded features:

represents a nonlinear function in the k-th temporal attention mechanism, of the same form as in the spatial attention mechanism;

is the input of node $v_i$ at historical time step $t$ in layer $l$,

9. A traffic characteristic prediction method according to claim 1, characterized in that: the output of the data conversion module in step (5) is $H^{(L+1)} \in \mathbb{R}^{Q \times N \times D}$; the decoder comprises L layers of space-time attention mechanism modules and outputs $H^{(2L+1)} \in \mathbb{R}^{Q \times N \times D}$; finally, a fully connected layer outputs the predicted values for the Q future time steps

10. A traffic characteristic prediction system applying the method of claim 1, comprising a data preprocessing module, a space-time data embedding module, a space-time attention mechanism module and a data conversion module; the data preprocessing module is used for acquiring a first traffic characteristic, a second traffic characteristic and a third traffic characteristic and processing them into the input of the space-time data embedding module; the space-time data embedding module feeds the first traffic characteristic, the second traffic characteristic and the third traffic characteristic into neural networks to obtain the spatial embedding, the time embedding and the control embedding, which are combined into the comprehensive embedding; the data conversion module is used for converting the encoded comprehensive embedding to serve as the input of the decoder; the space-time attention mechanism module comprises an encoder and a decoder, wherein the encoder is used for processing and encoding the comprehensive embedding output by the space-time data embedding module, and the decoder is used for processing and decoding the output of the data conversion module to obtain the final predicted traffic characteristics.