CN115423048B

CN115423048B - Traffic flow anomaly detection method and system based on pattern similarity

Info

Publication number: CN115423048B
Application number: CN202211365058.7A
Authority: CN
Inventors: 张彩明; 马翔; 袁晨迅; 李雪梅
Original assignee: Shandong University
Current assignee: Shandong University
Priority date: 2022-11-03
Filing date: 2022-11-03
Publication date: 2023-04-25
Anticipated expiration: 2042-11-03
Also published as: CN115423048A

Abstract

The invention discloses a traffic flow anomaly detection method and a system based on pattern similarity, which relate to the technical field of traffic flow anomaly detection models and comprise the following steps: extracting time sequence characteristics from traffic flow data by adopting an improved long-short-term memory neural network; dividing and clustering traffic flow data by adopting a sliding window, and taking a short-term sequence corresponding to a clustering center as a mode characteristic; calculating time sequence similarity for time sequence features of different space positions; determining the mode characteristics closest to each mode characteristic, and weighting the nearest neighbor distances of the mode characteristic pairs to obtain the mode similarity of different spatial positions; determining sequence similarity according to the time sequence similarity and the mode similarity, and constructing traffic flow dynamic relation diagrams of different time and different space positions according to the sequence similarity; and detecting abnormal traffic flow states by adopting a traffic flow dynamic relation diagram and time sequence similarity so as to improve the accuracy of detecting abnormal traffic flow.

Description

Traffic flow anomaly detection method and system based on pattern similarity

Technical Field

The invention relates to the technical field of traffic flow anomaly detection models, in particular to a traffic flow anomaly detection method and system based on pattern similarity.

Background

Along with the related development of big data technology, the artificial intelligence technology is widely applied to traffic flow anomaly detection and traffic flow prediction, accurately detects the anomaly condition of traffic flow, not only can provide favorable decision reference for traffic management departments, but also can provide more proper route selection for pedestrians, and is favorable for relieving traffic pressure.

The change of the traffic flow at the intersection is affected by various aspects such as time, weather, traffic policy and the like, has obvious periodicity, and the existing traffic flow anomaly detection algorithm using the machine learning method has at least the following three problems:

(1) A single recurrent neural network model cannot more effectively extract information of the traffic flow history sequence.

(2) The existing traffic flow anomaly detection only considers the traffic condition of a single intersection, and does not consider the associated influence factors of other intersections.

(3) Calculation lacks an effective measure when calculating the similarity of traffic flows between different roads.

Disclosure of Invention

In order to solve the problems, the invention provides a traffic flow anomaly detection method and a system based on pattern similarity, which respectively extract time sequence characteristics and pattern characteristics from traffic flow data and construct a traffic flow dynamic relationship diagram, so as to judge traffic flow anomalies and improve the accuracy of traffic flow anomaly detection.

In order to achieve the above purpose, the present invention adopts the following technical scheme:

in a first aspect, the present invention provides a traffic flow anomaly detection method based on pattern similarity, including:

acquiring traffic flow data;

extracting time sequence characteristics from traffic flow data by adopting an improved long-short-term memory neural network; the improved long-short-term memory neural network obtains time sequence characteristics after weighting and summing hidden states obtained at different moments;

dividing traffic flow data by adopting a sliding window to obtain a short-term sequence set, clustering the short-term sequence set, and taking a short-term sequence corresponding to a clustering center of each category as a mode characteristic;

calculating time sequence similarity for time sequence features of different space positions;

determining the mode characteristics closest to each mode characteristic, and obtaining mode similarity of different spatial positions after weighting the nearest neighbor distances of the mode characteristic pairs by forming mode characteristic pairs;

determining sequence similarity according to the time sequence similarity and the mode similarity, and constructing traffic flow dynamic relation diagrams of different time and different space positions according to the sequence similarity;

and detecting abnormal traffic flow states by adopting a traffic flow dynamic relation diagram and time sequence similarity.

In the process of weighting and summing the hidden states obtained at different moments to obtain the time sequence characteristics, the weight is determined according to the correlation between the hidden states at different moments and the traffic flow data

The weight is as follows:

wherein ,

is the firsttThe traffic flow data of the day is used,

in order to be in a hidden state,

as a function of the correlation,

in order for the parameters to be learned,

is the number of days of the traffic flow data entered,

is a transpose operation.

Alternatively, the timing similarity is calculated for timing characteristics of different spatial locations

The process of (1) is as follows:

wherein ,

is the firsttSpace position of dayaIs used for the time sequence characteristics of the (a),

is the firsttSpace position of daybIs used for the time sequence characteristics of the (a),

is composed of weight matrix to be learned

And a network of an activation function tanh,

finger will

And

and (5) splicing.

In an alternative embodiment, in the process of weighting the nearest neighbor distance of the pattern feature pair, the weight is the number of elements included in the category of the pattern feature.

Alternatively, the sequence similarity is determined by summing weighted time-series similarity and pattern similarity.

As an alternative embodiment, the process of constructing the traffic flow dynamic relationship graph includes:

constructing a relationship diagram of different spatial positions at the same time according to the sequence similarity of traffic flow data of different spatial positions

；

Introducing a communication relation matrix between traffic flow data of different spatial positions, and constructing a traffic flow dynamic relation graph according to the relation graph and the communication relation matrix

；

wherein ,

in order for the parameters to be learned,

for a connected relation matrix, tanh is the activation function,

and

the current time and the time indicated by the a priori data respectively,

in order for the time difference to be a function of the time difference,

is a decreasing function.

Alternatively, the connectivity matrix is:

wherein ,X_a Is the space positionaTraffic flow data, X _b Is the space positionbIs used for determining the traffic flow data of the vehicle,

is X _a and X_b A connected relation matrix between the two.

In a second aspect, the present invention provides a traffic flow anomaly detection system based on pattern similarity, including:

the data acquisition module is configured to acquire traffic flow data;

a timing feature extraction module configured to extract timing features for traffic flow data using the modified long-short term memory neural network; the improved long-short-term memory neural network obtains time sequence characteristics after weighting and summing hidden states obtained at different moments;

the mode feature extraction module is configured to segment traffic flow data by adopting a sliding window to obtain a short-term sequence set, and after clustering the short-term sequence set, taking a short-term sequence corresponding to a clustering center of each category as a mode feature;

the time sequence similarity determining module is configured to calculate time sequence similarity for time sequence characteristics of different space positions;

the mode similarity determining module is configured to determine the mode feature closest to each mode feature so as to form a mode feature pair, and the nearest neighbor distances of the mode feature pair are weighted to obtain the mode similarity of different spatial positions;

the dynamic relation diagram construction module is configured to determine sequence similarity according to the time sequence similarity and the mode similarity, and construct traffic flow dynamic relation diagrams of different time and different space positions according to the sequence similarity;

the abnormal detection module is configured to detect abnormal states of the traffic flow by adopting the traffic flow dynamic relation diagram and the time sequence similarity.

In a third aspect, the invention provides an electronic device comprising a memory and a processor and computer instructions stored on the memory and running on the processor, which when executed by the processor, perform the method of the first aspect.

In a fourth aspect, the present invention provides a computer readable storage medium storing computer instructions which, when executed by a processor, perform the method of the first aspect.

Compared with the prior art, the invention has the beneficial effects that:

the invention provides a traffic flow anomaly detection method and a system based on pattern similarity, which adopt an improved long-short-term memory neural network to extract time sequence characteristics, meanwhile, the pattern characteristics are extracted to comprehensively consider the periodic characteristics of traffic flow data, and after similarity calculation is carried out on the extracted two parts of characteristics, a traffic flow dynamic relation graph is constructed, the influence of association relations among different spatial positions is considered in the traffic flow dynamic relation graph, the influence of different time on the current association relation is also considered, finally, the traffic flow anomaly condition is judged by utilizing a graph attention network, and the accuracy of traffic flow anomaly detection is improved.

Additional aspects of the invention will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the invention.

Drawings

The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the invention.

Fig. 1 is a flow chart of a traffic flow anomaly detection method based on pattern similarity provided in embodiment 1 of the present invention;

FIG. 2 is a schematic diagram of the dynamic relationship diagram provided in embodiment 1 of the present invention;

fig. 3 is a flowchart of anomaly determination provided in embodiment 1 of the present invention.

Detailed Description

The invention is further described below with reference to the drawings and examples.

It should be noted that the following detailed description is exemplary and is intended to provide further explanation of the invention. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.

It is noted that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of exemplary embodiments according to the present invention. As used herein, unless the context clearly indicates otherwise, the singular forms also are intended to include the plural forms, and furthermore, it is to be understood that the terms "comprises" and "comprising" and any variations thereof are intended to cover non-exclusive inclusions, such as, for example, processes, methods, systems, products or devices that comprise a series of steps or units, are not necessarily limited to those steps or units that are expressly listed, but may include other steps or units that are not expressly listed or inherent to such processes, methods, products or devices.

Embodiments of the invention and features of the embodiments may be combined with each other without conflict.

Example 1

The embodiment proposes a traffic flow anomaly detection method based on pattern similarity, as shown in fig. 1, including:

acquiring traffic flow data;

In the present embodiment, traffic flow data within T days is defined as

The method comprises the steps of carrying out a first treatment on the surface of the Wherein, the firsttThe traffic flow data of the day is

，

Is the firsttDay 3nMinute data, N is the length of the traffic flow data in the day.

Since traffic flow data is affected by various complex factors, in order to vector the timing characteristics thereof, the embodiment adopts a modified long-short-term memory neural network (Long Short Term Memory, LSTM) to model the acquired traffic flow data so as to extract the timing characteristics.

LSTM is an improved algorithm for Recurrent Neural Networks (RNNs) and is widely used in time series modeling, and the gating unit adopted by LSTM can suppress the gradient disappearance problem of RNNs to some extent. For each traffic flow datax ^t For LSTM constructionThe mode formula is shown as formula (1) -formula (6):

an input door:

（1）

forgetting the door:

（2）

（3）

output door:

（4）

long memory:

（5）

short memory:

（6）

wherein ,W _i 、W _f 、W _C andW _o are all parameters to be learned, and are used for learning,C ^t in the state of a cell, the cell is in a state of being,

is an intermediate quantity of the state of the cell,

for the Hadamard product,h ^t is in a hidden state. For convenience in explaining the specific improved algorithm, the above formula ignores biasAnd (5) transferring items.

In most existing algorithms, the last hidden state is typicallyh ^t As a result of the LSTM output, this tends to ignore features contained in the previous hidden state.

Therefore, in this embodiment, the hidden states at different moments are fused in a weighted summation manner to obtain a time sequence feature; wherein, the hidden states and the hidden states at different moments are usedx ^t The correlation of the hidden state is defined with a weight corresponding to a hidden state with large correlation, thereby improving the output result pairx ^t The expression ability of (a) is represented by the following formula (7) -formula (9):

（7）

（8）

（9）

wherein ,

is that

Is calculated from the correlation function

Determining;

is the parameter to be learned.

After the above processing is carried out on all traffic flow data, the traffic flow data is obtainedx ^t Vector representation, i.e. temporal featurev ^t The method comprises the steps of carrying out a first treatment on the surface of the The process is simplified to be represented by formula (10):

（10）

wherein ,

is the number of days data is entered.

Pattern features refer to a series of approximately short-term data that recurs over historical data. Thus, the present embodiment proposes a pattern feature extraction method based on segmentation and clustering to capture periodic features.

Firstly, dividing traffic flow data into a plurality of short-term sequences by adopting a sliding window; specifically:

adopt sliding window to make the firsttTraffic flow data for daysx ^t Dividing into M windows to construct the thtShort-term set of sequences for days

； wherein

For short-term sequences, L is the window length, m=n-l+1.

Then, clustering the short-term sequence sets according to the distance between the short-term sequences to capture repeated short-term sequences, namely pattern features;

specifically: integrating short-term sequences into collections

In by the pair of

All short-term sequences in (a)

Clustering to capture pattern features;

belonging to the same category

With approximate short-term sequences, taking the cluster center of each category

As the firsttPattern features of the traffic flow data of the day, where each element represents a cluster center of each category, g is the number of categories.

In this embodiment, similarity calculation is performed on the time sequence feature and the mode feature, and then, sequence similarity is determined according to the time sequence similarity and the mode similarity, and balance of the two similarities is controlled.

In this embodiment, the time sequence similarity between the time sequence features of the traffic flow data of different spatial positions (such as different traffic intersections) at the same time is calculated

：

（11）

（12）

wherein ,

is composed of weight matrix to be learned

And a network of an activation function tanh,

finger will

And

and (5) splicing.

In the present embodiment, the pattern features of traffic flow data of all spatial locations are obtained

After that, by calculating the space positionaTraffic flow data of (a)

Pattern features of (2)

And spatial positionbTraffic flow data of (a)

Pattern features of (2)

Distance between them to obtain

And

to determine pattern similarity.

Due to

Does not have a sequential relationship per se, and

and (3) with

The number of elements contained may vary, resulting in a computational process

And (3) with

The correspondence of the elements is not easily determined. In order to ensure the simplicity and robustness of the algorithm, the embodiment adopts the calculation of the nearest neighbor distance of each mode feature to solve the problem that trend mode features of different traffic flow data sequences have no one-to-one correspondence.

Nearest neighbor distance refers to the distance D between each pattern feature and its nearest pattern feature _1NN Expressed as:

wherein ,

is that

Is the first of (2)

The characteristics of the individual modes,

is that

Is the first of (2)

A personal pattern feature;

will be

When it is "1

And

the Euclidean distance between them is

Will be

All elements of (3) are relative to

Is represented as an array

；

Notably, when

Is that

Is used to determine the nearest-neighbor of the cell,

may not be

Is the nearest neighbor of (2);

therefore, it is necessary to use

And

respectively represent

All elements of (3) are relative to

Nearest neighbor sum of (2)

All elements of (3) are relative to

Is the nearest neighbor of (2);

in order to make

And

the distance measurement between the two is symmetrical

And

is combined into

, wherein

And

respectively is

And

pattern feature quantity of (2);

Included

and (3) with

The question is how to choose the most reasonable value to represent the nearest neighbor of all pattern features in (a)

And

the distance between, if chosen

Maximum value of (2), then at random

Noise peaks that occur in (a) will seriously affect the distance determination, whereas most if based on a minimum value

There is little distinction between them.

In order to consider the influence of all modes as much as possible, the present embodiment selects pairs

All the values in the pattern are weighted to obtain the pattern similarity

. Structure of the device

And (3) with

When each pattern feature is located in the category, the number of the elements is recorded

And

similar handle

And

is combined into

The weighting function is as shown in equation (13):

（13）

wherein ,

for sum operation.

In this embodiment, the sequence similarity is determined based on the time sequence similarity and the pattern similarity

As shown in formula (14):

（14）

wherein ,

is a weight parameter to be learned for controlling the balance of the two similarities.

In the embodiment, a relationship diagram of different spatial positions (traffic intersections) at the same time is constructed according to the sequence similarity, and then the relationship diagram at different times is processed by using LSTM (least squares) based on the dynamic diagram so as to construct a traffic flow dynamic relationship diagram containing time sequence characteristics;

specifically, according to the differencesSequence similarity of traffic flow data of spatial positions to construct a relationship diagram of different spatial positions at the same time (same day)

As shown in formula (15):

wherein ,

is the firsttA relationship diagram constructed by the days,

is the space positionaAnd spatial positionbSequence similarity between traffic flow data.

Relationship diagram

Can reflect the firsttThe association relation of the day, but the influence of other time on the current association relation is ignored. For this reason, by referring to the gating structure of LSTM, a Dynamic relationship graph construction method is designed, as shown in fig. 2, named as Dynamic-based LSTM (DGLSTM), where the part actually optimizes the input data, and the corresponding specific formula is shown in formula (16):

（16）

wherein ,

in order for the parameters to be learned,

for the connection relation matrix between different space positions, the method is used for guiding the construction of dynamic relation diagrams and definesFormula (17):

（17）

wherein ,

in order for the time difference to be a function of the time difference,

，

and

the time indicated by the current time and the prior data is respectively;

is a decreasing function for assigning a priori data

And means

The data in (c) is gradually forgotten with increasing time interval.

DGLSTM can be represented by the formula (18) -formula (23):

an input door:

（18）

forgetting the door:

（19）

（20）

output door:

（21）

long memory:

（22）

short memory:

（23）

after DGLSTM, a dynamic relation graph is obtained

The method comprises the steps of carrying out a first treatment on the surface of the Relative to

，

The method not only can reflect the relevance among the current time sequences, but also is influenced by other time relation diagrams in history. For convenience of description, the following will be made

The construction process of (2) is expressed as shown in formula (24):

（24）

in this embodiment, the graph attention network (Graph attention networks, GAT) is adopted to perform traffic flow anomaly judgment, and by aggregating the effects between approximate sequences, so as to capture implicit information in the dynamic relationship graph, compared with the traditional graph roll-up neural network (Graph Convolutional Networks, GCN) model, the GAT can selectively aggregate the effects of the approximate sequences, and the expression is shown in the formula (25):

（25）

wherein ,

is the firstkThe traffic flow data of each intersection is provided with a time sequence feature obtained through LSTM;

is an intersectionkAnd crossingpSequence similarity of (2) of the order of magnitude of a dynamic relationship graph

First, thekLine 1pThe value of the column;

the weight matrix to be learned; crossingkThe output at GAT is implicit information

；

Acquiring hidden information of all intersections through GAT

The process of (2) may be represented by formula (26):

（26）

defining a label when abnormality determination is performed

Adopts a single-layer full-connection layer (Fully connected layer, FC) as a prediction function pair

The prediction is performed as shown in fig. 3, and the prediction formula is shown in formula (27):

（27）

wherein ,

is the abnormal result of the judgment.

In this embodiment, the graph annotation force network is trained using cross entropy, as shown in equation (28):

（28）

wherein ,

and

respectively, are intersectionskIn the first placetThe true category and predicted value of the moment in time,Lis a loss function for minimizing the gap between the predicted value and the true class.

Since anomaly detection is a typical classification task, the present embodiment uses the Accuracy (ACC) and Ma Xiusi correlation coefficient (Matthews correlation coefficient, MCC) widely accepted in classification tasks to evaluate the predictive effect of a graph attention network.

The ACC can intuitively express the prediction effect of the model, and the formula is shown as formula (29):

（29）

wherein, true Positive (TP) is the result that both the predicted and the true value are normal; true Negative (TN) is the result that both the predicted and the true values are abnormal; false Positives (FP) are predicted as normal, actually abnormal results; false Negatives (FN) are predicted to be abnormal and actually normal.

MCC is an index that evaluates the performance of a model classification, and is actually a correlation coefficient that describes the relationship between the actual classification and the predicted classification. Its value is between-1 and +1, the coefficient +1 representing perfect prediction, 0 representing no better than random prediction, -1 representing complete inconsistency between prediction and observation. The MCC calculation formula is shown in formula (30):

（30）

after comparing the method of this embodiment with 3 conventional methods, namely, the graph annotation force network (Graph attention networks, GAT), the time convolution neural network (Temporal Convolutional Neural Network, TCN), and the gated loop unit neural network (Gated Recurrent Unit, GRU), the two indexes of the method of this embodiment are the highest, and the method of this embodiment is ranked first in the comparison method, so as to confirm the effectiveness of the method of this embodiment.

Example 2

The embodiment provides a traffic flow anomaly detection system based on pattern similarity, which comprises:

the data acquisition module is configured to acquire traffic flow data;

It should be noted that the above modules correspond to the steps described in embodiment 1, and the above modules are the same as examples and application scenarios implemented by the corresponding steps, but are not limited to those disclosed in embodiment 1. It should be noted that the modules described above may be implemented as part of a system in a computer system, such as a set of computer-executable instructions.

In further embodiments, there is also provided:

an electronic device comprising a memory and a processor and computer instructions stored on the memory and running on the processor, which when executed by the processor, perform the method described in embodiment 1. For brevity, the description is omitted here.

It should be understood that in this embodiment, the processor may be a central processing unit CPU, and the processor may also be other general purpose processors, digital signal processors DSP, application specific integrated circuits ASIC, off-the-shelf programmable gate array FPGA or other programmable logic device, discrete gate or transistor logic devices, discrete hardware components, or the like. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.

The memory may include read only memory and random access memory and provide instructions and data to the processor, and a portion of the memory may also include non-volatile random access memory. For example, the memory may also store information of the device type.

A computer readable storage medium storing computer instructions which, when executed by a processor, perform the method described in embodiment 1.

The method in embodiment 1 may be directly embodied as a hardware processor executing or executed with a combination of hardware and software modules in the processor. The software modules may be located in a random access memory, flash memory, read only memory, programmable read only memory, or electrically erasable programmable memory, registers, etc. as well known in the art. The storage medium is located in a memory, and the processor reads the information in the memory and, in combination with its hardware, performs the steps of the above method. To avoid repetition, a detailed description is not provided herein.

Those of ordinary skill in the art will appreciate that the elements of the various examples described in connection with the present embodiments, i.e., the algorithm steps, can be implemented as electronic hardware or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the solution. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present application.

While the foregoing description of the embodiments of the present invention has been presented in conjunction with the drawings, it should be understood that it is not intended to limit the scope of the invention, but rather, it is intended to cover all modifications or variations within the scope of the invention as defined by the claims of the present invention.

Claims

1. The traffic flow anomaly detection method based on the pattern similarity is characterized by comprising the following steps of:

acquiring traffic flow data;

detecting abnormal traffic flow states by adopting a traffic flow dynamic relation diagram and time sequence similarity;

The weight is as follows:

wherein ,x ^t is the firsttThe traffic flow data of the day is used,

in order to be in a hidden state,

as a function of the correlation,

in order for the parameters to be learned,

is the number of days of the traffic flow data entered,

is a transposition operation;

calculating time sequence similarity for time sequence characteristics of different space positions

The process of (1) is as follows:

wherein ,

is composed of weight matrix to be learned

And a network of an activation function tanh,

finger will

And

and (5) splicing.

2. The traffic flow anomaly detection method based on pattern similarity according to claim 1, wherein in the process of weighting nearest neighbor distances of pattern feature pairs, the weight is the number of elements contained in the category of the pattern feature.

3. The traffic flow anomaly detection method based on pattern similarity according to claim 1, wherein the sequence similarity is determined by summing up weighted time sequence similarity and pattern similarity.

4. The traffic flow anomaly detection method based on pattern similarity as claimed in claim 1, wherein the process of constructing the traffic flow dynamic relationship graph comprises:

；

；

wherein ,

in order for the parameters to be learned,

for a connected relation matrix, tanh is the activation function,

and

the current time and the time indicated by the a priori data respectively,

in order for the time difference to be a function of the time difference,

is a decreasing function.

5. The traffic flow anomaly detection method based on pattern similarity as claimed in claim 4, wherein the connectivity matrix is:

is X _a and X_b A connected relation matrix between the two.

6. A traffic flow anomaly detection system based on pattern similarity, comprising:

the data acquisition module is configured to acquire traffic flow data;

the abnormal detection module is configured to detect abnormal traffic flow states by adopting a traffic flow dynamic relation diagram and time sequence similarity;

The weight is as follows:

wherein ,x ^t is the firsttThe traffic flow data of the day is used,

in order to be in a hidden state,

as a function of the correlation,

in order for the parameters to be learned,

is the number of days of the traffic flow data entered,

is a transposition operation;

The process of (1) is as follows:

wherein ,

is composed of weight matrix to be learned

And a network of an activation function tanh,

finger will

And

and (5) splicing.

7. An electronic device comprising a memory and a processor and computer instructions stored on the memory and running on the processor, which when executed by the processor, perform the method of any one of claims 1-5.

8. A computer readable storage medium storing computer instructions which, when executed by a processor, perform the method of any of claims 1-5.