CN101982843B

CN101982843B - Method for selecting state vector in nonparametric regression short-time traffic flow prediction

Info

Publication number: CN101982843B
Application number: CN2010105141116A
Authority: CN
Inventors: 郑亮; 马寿峰; 贾宁; 朱宁; 王鹏飞
Original assignee: Tianjin University
Current assignee: Tianjin University
Priority date: 2010-10-21
Filing date: 2010-10-21
Publication date: 2012-05-09
Anticipated expiration: 2030-10-21
Also published as: CN101982843A

Abstract

The invention discloses a method for selecting the state vector in the nonparametric regression short-time traffic flow prediction, relating to the technical field of short-time traffic flow prediction. At four conditions comprising peak hours, even hours, low hours and all the day, by using the method provided by the invention, the forecast accuracy, the stability, the speed and the transportability are improved, and the operation time is shortened, thus verifying the effectiveness and the necessity of the method provided by the invention.

Description

Method for selecting state vector in nonparametric regression short-time traffic flow prediction

Technical Field

The invention relates to the technical field of short-term traffic flow prediction, in particular to a method for selecting a state vector in nonparametric regression short-term traffic flow prediction.

Background

At present, many researchers at home and abroad apply the non-parameter regression method to the short-time traffic flow prediction research, and the non-parameter regression method is necessarily improved according to the requirements of practical problems. In 1991, Davis and Nihan really apply a nonparametric regression method to traffic prediction, and although problems of model selection, parameter setting and the like are avoided, the method needs a huge representative historical database and consumes a long time for running. In 1995, Smith applies a non-parametric regression method to single-point short-term traffic flow prediction, and experimental results achieve better effects than historical average and neural networks, but the problem of too slow search speed also exists. Aiming at the problem of too low searching speed, Oswald et al sets up a fuzzy nearest neighbor method from a KD tree, thereby improving a historical data structure mode and a neighbor searching method in a nonparametric regression method and improving the operating efficiency of the method. Zhangxiaoli provides a K-neighborhood nonparametric short-time traffic flow prediction method based on a balanced binary tree, and a case database is established by adopting a clustering method and a balanced binary tree structure, so that the prediction precision is improved, and the real-time requirement is met. These are mainly improvements from the storage patterns of the history database and the neighbor search method.

However, the selection of the state vector describing the causal relationship between the flow rates of the upstream road segment and the road segment to be detected mainly includes a principal component analysis method, a correlation coefficient method, an autocorrelation coefficient and the like, which are all analyzed from the point of statistics, and the factors relatively related to the flow rate of the road segment to be detected are used as the components of the state vector, so that the study on whether the state vector is selected and the prediction effect is improved is lacked. It is noted that even if the operation time of the method is shortened by improving the storage mode of the historical database and the neighbor searching method, the final prediction effect is not satisfactory if the selection of the state vector is not enough to describe the flow causal relationship between the upstream road segment and the road segment to be detected.

Disclosure of Invention

In order to solve the problems, improve the prediction precision, shorten the running time and meet the requirements in practical application, the invention provides a method for selecting a state vector in non-parametric regression short-time traffic flow prediction, which comprises the following steps:

(1) judging whether an upstream road section related to the road section to be detected is in the upstream road section set according to a first preset criterion, and if so, executing the step (2); if not, the upstream road segment is not in the upstream road segment set;

(2) acquiring the average speed of the traffic flow in the range of a square circle L of a road section to be detected through preset data;

(3) obtaining historical retroactive maximum cycle number m according to the average speed and the prediction cycle;

(4) acquiring an initial state vector according to the upstream road section set and the historical retroactive maximum cycle number m;

(5) determining the encoding length of the particle according to the dimension M of the initial state vector;

(6) setting the number of particles as Z, and randomly generating Z particles;

(7) defining a fitness function, and acquiring the fitness of Z particles according to the fitness function;

(8) acquiring an individual extreme value and a global extreme value of the particles according to the fitness of the Z particles;

(9) respectively performing cross operation on the codes of the Z particles, the codes of the individual extremum and the global extremum, and performing mutation operation according to a preset probability to obtain global optimal particles;

(10) judging whether the preset times is reached, if so, outputting the global optimal particles; if not, re-executing the step (7);

(11) and performing dot product operation on the global optimal particles and the initial state vector to obtain a state vector.

The first preset criterion in the step (1) is specifically:

<math> <mrow> <munder> <mi>Σ</mi> <mrow> <mi>i</mi> <mo>,</mo> <mi>j</mi> </mrow> </munder> <mi>dis</mi> <mrow> <mo>(</mo> <msubsup> <mi>p</mi> <mi>i</mi> <mi>upstream</mi> </msubsup> <mo>,</mo> <msubsup> <mi>p</mi> <mi>j</mi> <mrow> <mi>inter</mi> <mi>sec</mi> <mi>tion</mi> </mrow> </msubsup> <mo>)</mo> </mrow> <mo>≤</mo> <mi>L</mi> </mrow> </math>

wherein,indicating the coordinate position of the point in the ith link in the upstream link,

a coordinate position indicating the j-th intersection center of the upstream link,

and represents the distance between the coordinate position of the midpoint of the ith road segment in the upstream road segment and the coordinate position of the jth intersection center of the upstream road segment.

The history tracing maximum cycle number m in the step (3) is specifically as follows:

c denotes a prediction period of the time period,

the average speed is indicated.

The dimension M of the initial state vector in step (5) is specifically:

m is (s +1) (M +1), and s represents the number of elements in the upstream segment number set.

The fitness function in the step (7) is specifically as follows:

F(VAR，ARE，PER，EC)＝λ₁EV+λ₂ARE+λ₃/PER+λ₄EC, EV represents the variance of the prediction error, ARE represents the average relative error, PER represents the prediction relative error in the interval [0, alpha ]]In percent between, EC represents the coefficient of equality, lambda₁Denotes the weight of EV, λ₂Denotes the weight of ARE, λ₃Represents the weight of PER, λ₄Denotes the weight of EC and α denotes the prediction relative error.

The step (7) of obtaining the fitness of the Z particles according to the fitness function specifically includes:

defining a current prediction period flow state mode;

performing dot product operation on the particle codes and the current prediction period flow state mode to obtain a current flow state mode;

performing dot product operation on the particle codes and the historical database flow state mode to obtain the current historical database flow state mode;

predicting the flow of the next cycle of the current prediction cycle through K neighbor matching and equal weight prediction according to the current flow state mode and the current historical database flow state mode to obtain a first prediction error, and taking the first prediction error as the fitness of the current particles.

The technical scheme provided by the invention has the beneficial effects that:

the embodiment of the invention provides a method for selecting state vectors in non-parametric regression short-time traffic flow prediction, which is adopted under four conditions of peak time, flat time, low time and all weather, so that the prediction precision, stability, speed and portability are improved, the running time is shortened, and the effectiveness and the necessity of the method are verified.

Drawings

FIG. 1 is a flow chart of non-parametric regression provided by the present invention;

FIG. 2 is a schematic diagram of a distance method according to the present invention;

FIG. 3 is a flow chart of a method for selecting a state vector in non-parametric regression short-term traffic flow prediction according to the present invention;

FIG. 4 is a diagram illustrating the fitness of Z particles obtained according to the fitness function F according to the present invention;

FIG. 5 is a graph comparing peak time prediction results provided by the present invention;

FIG. 6 is a comparison graph of peak-flattening period predictions provided by the present invention;

FIG. 7 is a comparison graph of the prediction results for the low peak periods provided by the present invention;

FIG. 8 is a comparison graph of all-weather prediction results provided by the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, embodiments of the present invention will be described in detail with reference to the accompanying drawings.

In order to solve the problems, improve the prediction precision, shorten the running time and meet the requirements in practical application, the embodiment of the invention provides a method for selecting a state vector in non-parametric regression short-time traffic flow prediction.

Referring to fig. 1, nonparametric regression is a data-driven heuristic prediction mechanism that predicts future values by searching a historical database for data that is similar to the current observed value. It can be generally divided into five components: selection of historical data, generation of a sample database, definition of data similarity, K neighbor matching and prediction methods. When nonparametric regression is adopted to predict short-term traffic flow, a historical database is firstly constructed, and if traffic flow data are stored in the database indiscriminately, problems that the size of the database cannot be borne, dimension disasters are generated when the data are matched and the like are caused. Therefore, the selected traffic flow data should be the flow or combination of flows with completeness and typicality that is closest to the flow of the section under test. Meanwhile, the sample database is the core of the non-parametric regression, and the structure (including a logical structure and a physical structure) and the space-time efficiency of a search data algorithm play a decisive role in the quality of the non-parametric regression performance. Therefore, the important point of the research in the embodiment of the present invention is how to reasonably select the organization mode (i.e., the state vector) of the sample database, so that the causal relationship between the flow rates of the upstream road segment and the road segment to be detected can be described, the storage space can be saved, and the search efficiency can be increased. After the sample database is generated, the definition of data similarity, K neighbor matching and prediction can be carried out. After relevant elements of the nonparametric regression model are set, neighbor matching with the current real-time observation data K can be found from the historical database, and finally, traffic flow prediction quantity at the next moment of the current moment can be obtained by prediction. When comparing the current traffic flow observation data with the traffic flow historical database, a comparison standard is needed, and the state vector is the description of the standard. Such as road occupancy, driving speed and weather conditions, affect the traffic at the next time of the road segment. Traffic flow data for adjacent road segments involves the problem of taking several time intervals and several road segments adjacent to each other, even at the closest adjacent time. Whether the state vector is reasonable or not is directly related to the prediction precision. At present, the selection of the state vector has no unified standard, and the accuracy of prediction cannot be improved by considering as many factors as possible in the state vector, but a longer running time is caused; however, if the selected state vector is not enough to describe the main cause and effect of the upstream and downstream road section flow, the good prediction effect is not achieved. The selection of the K value of the neighbor point in the K neighbor matching is very important, and the prediction accuracy is affected by too large or too small K value. If the selected K value is equal to the number of historical database patterns, then non-parametric regression is not accurate. However, the selection of the K value cannot be too small, and if the K value is too small, the component of the incidental factor is increased, which affects the accuracy of prediction. In addition, during some abnormal time, the flow rate of some road sections is obviously reduced, while the flow rate of other road sections is suddenly increased, and K can take a smaller value, and if the value is taken to be a larger value, the information is weakened, so that the prediction error is larger. Thus, in the case of an abnormally large or low flow rate section, the value of K may be set small or predicted with one prediction section. However, when the short-term traffic prediction is performed, a uniform paradigm is not provided for determining the accurate K value, so that the optimal K value needs to be selected by analyzing a curve graph of prediction errors and the K value according to different series of sample data participating in offline prediction inspection. Because the cause-and-effect relationship of the flow of the upstream road section and the road section to be detected at different time intervals in a day is different, when online rolling prediction is carried out, firstly, necessary offline detection analysis is carried out by using the method provided by the embodiment of the invention and historical data of the corresponding time interval to obtain a state vector describing the cause-and-effect relationship of the flow, and then, the corresponding time interval of the road section to be detected is subjected to real-time online prediction by using the state vector. Both during peak-flat periods and all weather conditions, the relative error of the predicted average correction is high due to the low flow, even 0, that occurs. In order to improve the practicability and operability of prediction, two different K values can be selected to construct a prediction interval, and as long as the predicted values are acceptable in the corresponding prediction interval, the average correction relative error of the non-parametric regression prediction result is improved to a certain extent. Suppose that the real-time traffic flow data mode is:

{v_i(t-m)，V_i(t-m+1)，L，V_i(t), i belongs to U + { f }, wherein U is a related upstream road section number set, the number of elements in the upstream road section number set U is set to be s, f is a road section number to be tested, { f } is a road section number set to be tested, m is the maximum cycle number of historical retrospection, t is prediction time, and i is the mark number of the upstream road section. The traffic flow data mode of the historical database is { V }_ih(t-m)，V_ih(t-m+1)，LV_ih(t) }, i ∈ U + { f }. The distance measures the matching degree of the real-time data and the sample data, however, different K neighbors can be searched by different distance measurement criteria, and the accuracy of the predicted value is further influenced. The distance measurement criterion adopted by the embodiment of the invention is as follows:

<math> <mrow> <mi>D</mi> <mo>=</mo> <munder> <mi>max</mi> <mrow> <mi>i</mi> <mo>&Element;</mo> <mi>U</mi> <mo>+</mo> <mo>{</mo> <mi>f</mi> <mo>}</mo> <mo>,</mo> <mi>l</mi> <mo>&Element;</mo> <mo>{</mo> <mn>0,1</mn> <mo>,</mo> <mi>L</mi> <mo>,</mo> <mi>m</mi> <mo>}</mo> </mrow> </munder> <mo>|</mo> <msub> <mi>V</mi> <mi>i</mi> </msub> <mrow> <mo>(</mo> <mi>t</mi> <mo>-</mo> <mi>m</mi> <mo>+</mo> <mi>l</mi> <mo>)</mo> </mrow> <mo>-</mo> <msub> <mi>V</mi> <mi>ih</mi> </msub> <mrow> <mo>(</mo> <mi>t</mi> <mo>-</mo> <mi>m</mi> <mo>+</mo> <mi>l</mi> <mo>)</mo> </mrow> <mo>|</mo> </mrow> </math>

wherein the distance metric criterion D is (s +1) (m +1) -dimensional, and the component distance with the largest distance is selected from the (s +1) (m +1) -dimensional components of the real-time data and the sample data as the matched metric criterion. The distance measurement criterion fully considers different distance information of (s +1) (m +1) dimensions, and is equivalent to a hypercube of (s +1) (m +1) dimensions, wherein the side length of the hypercube is the dimension with the largest distance, namely, the smaller the value of D, the higher the similarity of matching.

Selecting K nearest historical state vectors in the historical database through a distance measurement criterion, and supposing that K neighbors are found in the historical database, wherein the distances between the real-time data and the K neighbors are respectively D_k(K ═ 1, 2, L, K), these neighbors correspond to the neighbors to be treatedMeasuring the flow of the next prediction period of the road section as V_kh(t+1)。

The prediction method mainly comprises a weighted prediction method and an equal-weighted prediction method. Since traffic systems contain determinism and randomness: the certainty is reflected by the close proximity, namely when the state vectors are close, the predicted value and the true value are also close to have certain certainty; however, the system has randomness, so that no rule that the closer the state vectors are, the closer the predicted value and the true value are exists. According to the fact that two identical propositions do not exist in the world: when 95% of the characteristics of two leaves are almost identical, the possibility of obvious difference of the other 5% of the characteristics is rather high. Therefore, the prediction method in the embodiment of the present invention adopts an equal weight prediction method, and the expression is as follows:

referring to fig. 2, the number s of relevant upstream segments and the maximum number m of cycles for historical retrospection can be determined more coarsely by the distance method. The road section marked by the black rhombus is the road section to be detected, the road section marked by the black point is the central point of the intersection, and the road section marked by the black square frame is the upstream road section of the road section to be detected. Through empirical analysis, it is found that if the upstream road sections with the urban distance to the road section to be detected within the range of L have a significant influence on the flow of the road section to be detected, the upstream road sections form an upstream road section set of the state vector, wherein the values of the number s and the range of L of the upstream road sections are determined according to the specific application condition in practical application, and the embodiment of the invention does not limit the flow in the specific implementation. Obtaining a relatively rough state vector according to the road network characteristics and the distance method, and then optimizing the first prediction error by using a PSO (particle swarm Optimization) -GA (Genetic Algorithm) hybrid intelligent Algorithm and inspection data, so as to obtain the state vector capable of describing the flow causal relationship between the upstream road section and the road section to be detected. Finally, the found state vector can be used for real-time online flow prediction. Referring to fig. 3, a detailed implementation process of the embodiment of the present invention is described:

101: judging whether an upstream road section related to the road section to be detected is in the upstream road section set according to a first preset criterion, and if so, executing the step 102; if not, the upstream road section is not in the upstream road section set;

wherein the first preset criterion is

Indicating the coordinate position of the point in the ith link in the upstream link,

representing the distance between the coordinate position of the middle point of the ith road section in the upstream road section and the coordinate position of the center of the jth intersection of the upstream road section, wherein when a first preset criterion is met, the upstream road section belongs to the upstream road section set, namely i belongs to U; when the first preset criterion is not satisfied, the upstream road segment is not in the upstream road segment set, namely

102：Acquiring the average speed of traffic flow in the range of square circle L of the road section to be detected through preset data

The preset data is traffic flow of a certain road section in the time periods of early peak time, noon peak time, late peak time and the like, and the average speed of traffic flow in the range of the square circle L of the road section to be detected can be obtained by counting and analyzing the traffic flow

103: obtaining historical retroactive maximum cycle number m according to the average speed and the prediction cycle;

c denotes a prediction period of the time period,

the average speed is indicated.

The prediction period may be set to 5min, 10min, 15min, and the like according to a specific application condition in an actual application, and this is not limited in the embodiment of the present invention in specific implementation, and the embodiment of the present invention is described with 5min as an example.

104: acquiring an initial state vector according to the upstream road section set U and the maximum cycle number m of historical retrospection;

{V_i(t-m)，V_i(t-m+1)，L，V_i(t)}，i∈U+{f}

105: determining the encoding length of the particle according to the dimension M of the initial state vector;

the dimension M of the initial state vector is (s +1) (M +1), and s represents the number of elements in the upstream segment number set.

106: setting the number of particles as Z, and randomly generating Z particles;

where the dimension of each particle is the dimension M of the initial state vector.

107: defining a fitness function F, and acquiring the fitness of Z particles according to the fitness function F;

F(VAR，ARE，PER，EC)＝λ₁EV+λ₂ARE+λ₃/PER+λ₄/EC

wherein EV represents the variance of the prediction error and represents the robustness of the prediction algorithm; ARE represents the average relative error and represents the overall performance of the prediction algorithm; PER represents that the prediction relative error is in the interval 0, alpha]The percentage between, the individual performance of the predicted effect; EC represents the coefficient of equality of the coefficients,the quality of the overall prediction effect is shown; lambda [ alpha ]₁Represents weight, λ, of EV₂Weight, λ, representing ARE₃Weight, λ, representing PER₄Denotes the weight of EC and α denotes the prediction relative error. Lambda [ alpha ]₁、λ₂、λ₃And λ₄Is determined according to the situation in practical application by the value of lambda₁、λ₂、λ₃And λ₄The adjustment of (a) adjusts the proportion of EV, ARE, PER and EC in the fitness function F, and the value range of alpha is usually 20% or 30%.

The fitness function F represents the fitness of the particle, and the smaller the value of the fitness function F, the stronger the fitness of the particle, and the more likely the good gene is to be inherited to the next generation. In each iteration process, each particle adjusts itself by tracking two extreme values, so that the particle adapts to the living environment of the particle more and more, one is an individual extreme value which can be found by the particle itself, and the other is a global extreme value which can be found by the whole particle swarm at present. Referring to fig. 4, the steps specifically include the following steps, which are described in detail below:

1. defining a current prediction period flow state mode CM;

CM＝[V_s(t-m)V_s(t-m+1)LV_s(t)LV_f(t-m)V_f(t-m+1)LV_f(t)]1_×M

2. performing dot product operation on the particle codes and the flow state mode of the current prediction period to obtain a current flow state mode CM^*；

The embodiments of the present invention are described by taking binary coding as an example, and binary coding of a particle is [ 10L 1L 10L 0 ]]1_×MPerforming dot product operation on the particle and each state mode in the CM to obtain a current flow state mode: CM (compact message processor)^*＝[V_s(t-m)0LV_s(t)LV_f(t-m)0L0]_1×M。

3. Performing dot product operation on the particle codes and the flow state mode HM of the historical database to obtain the current flow state mode HM of the historical database^*；

HM＝{V_ih(t-m)，V_ih(t-m+1)，LV_ih(t)}_H×M，i∈U+{f}

Wherein, H is the number of the flow state mode of the historical database, H belongs to [1, H]Performing dot product operation on the particle and each state mode in the HM to obtain the current historical database flow state mode HM^*. The capacity of the constructed historical database is large enough and representative, namely the historical database contains various traffic state change trends and typical laws, and the currently acquired real-time data mode can find a similar historical data mode.

4. According to the current flow state mode CM^*And current historical database traffic status mode HM^*Predicting the flow of the next cycle of the current prediction cycle through K neighbor matching and equal weight prediction to obtain a first prediction error, and taking the first prediction error as the fitness of the current particles.

108: acquiring individual extreme values and global extreme values of the particles according to the fitness of the Z particles;

and taking the value with the minimum fitness of each particle as an individual extreme value of each particle, and taking the minimum value in the Z individual extreme values as a global extreme value.

109: respectively performing cross operation on the codes of the Z particles and the codes corresponding to the individual extreme values and the global extreme values, and performing mutation operation according to a preset probability to obtain global optimal particles;

wherein the steps are as follows: and defining a single-point crossover operator, and respectively performing crossover operation on the codes of the Z particles with the codes corresponding to the individual extreme values and the codes corresponding to the global extreme values according to the defined single-point crossover operator. The codes of the Z particles are respectively crossed with the particle codes corresponding to the individual extremum, so that each particle can inherit the own superior partial gene. The codes of the Z particles are respectively crossed with the particle codes corresponding to the global extremum, so that each particle can inherit the optimal partial gene of the particle swarm. Defining mutation operators, and the codes of 2 parents are recombined to have possible mutation of children, and the children are converted with preset probability. During specific implementation, firstly, one sub-individual is randomly selected from a group consisting of the sub-individuals, and the value of a certain bit of code in the sub-individual is randomly changed for the selected sub-individual with preset probability. As in the biological world, the probability of occurrence of the mutation in GA is very low, and the preset probability value is usually between 0.001 and 0.01, so that the mutation provides an opportunity for the generation of new children.

The particle coding mode is various, binary coding, real number coding and the like can be adopted, the binary coding is preferably adopted in the embodiment of the invention, and 0 represents the state vector component and has no obvious influence on the prediction result; and 1 represents that the state vector component has obvious influence on the prediction result, and the optimal binary coding individual is obtained after the iteration of a PSO-GA mixed intelligent algorithm. Wherein the number of digits of the parent individual is defined according to the dimension of the initial state vector, a cross point p of the parent individual is randomly generated, the range of the cross point p is [1, M-1], and the high p bits of the parent individual 1 and the high p bits of the parent individual 2 are exchanged at the cross point p. For example: dimension M of the initial state vector is equal to 9 and parent individual 1 can be defined as 101011101; the parent individual 2 is 010101010; the range of the intersection p is [1, 8], and when the intersection position p is 5, the 5-high bit of the parent individual 1 and the 5-high bit of the parent individual 2 are exchanged at the intersection 5, and two children are generated after intersection, which are: subjects 1: 010101101, respectively; subjects 2: 101011010. for binary coded children, mutation means that the value at a certain bit flips. For each sub-individual, the value change encoded on a particular bit is random, for example: the children before mutation were: 010101101, when the fourth bit is mutated, the mutated offspring are: 010001101. in order to prevent convergence to the local optimal solution, the preset probability needs to be linearly increased, a specific value of the preset probability is set according to a specific application condition in practical application, and the embodiment of the present invention is not limited in specific implementation.

110: judging whether the preset times are reached, and if so, outputting globally optimal particles; if not, step 107 is re-executed.

The preset number is specifically set according to the situation in practical application, and the embodiment of the present invention is not limited thereto, and the preset number is generally about 2000.

111: and performing dot multiplication on the global optimal particles and the initial state vector to obtain a state vector.

In summary, the embodiment of the invention provides a method for selecting a state vector in non-parametric regression short-time traffic flow prediction, and the method provided by the embodiment of the invention is adopted under four conditions of peak time, peak-off time, low-peak time and all weather, so that the prediction precision, stability, speed and portability are improved, and the effectiveness and necessity of the method provided by the embodiment of the invention are verified.

The feasibility of the method for selecting the state vector in the non-parametric regression short-time traffic flow prediction provided by the embodiment of the invention is verified by adopting a test, which is described in the following:

traffic data as used herein is from the University of MinnesotaDuluth (University of MinnesotaDuluth, http:// www.d.umn.edu/tdrl/traffic /). In order to verify the effectiveness and the necessity of searching for the optimal state vector in the off-line analysis process of the PSO-GA algorithm proposed herein, the predicted effect is compared with the predicted effect which is not subjected to the off-line analysis of the PSO-GA algorithm under different traffic conditions, the comparison predicted effect is shown in table 1, and the comparison predicted result is shown in fig. 5, 6, 7 and 8.

TABLE 1

From the test data in table 1, the feasibility of the embodiment of the present invention can be verified by analyzing the data of EV, ARE, PER, and EC. From the comparison among the method provided by the embodiment of the present invention, the direct prediction result obtained by the method in the prior art, and the actual value of the road section in fig. 5, fig. 6, fig. 7, and fig. 8, the feasibility of the method for selecting the state vector in the non-parametric regression short-time traffic flow prediction provided by the embodiment of the present invention can be obtained, the prediction accuracy is improved, a better prediction effect is obtained, and the requirements in practical application are met.

Those skilled in the art will appreciate that the drawings are only schematic illustrations of preferred embodiments, and the above-described embodiments of the present invention are merely provided for description and do not represent the merits of the embodiments.

The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.

Claims

1. A method for selecting a state vector in non-parametric regression short-time traffic flow prediction is characterized by comprising the following steps:

(6) setting the number of particles as Z, and randomly generating Z particles;

(11) performing dot product operation on the global optimal particles and the initial state vector to obtain a state vector;

wherein, the obtaining the fitness of the Z particles according to the fitness function in the step (7) specifically includes:

defining a current prediction period flow state mode;

2. The method for selecting the state vector in the nonparametric regression short-term traffic flow prediction according to claim 1, wherein the first preset criterion in the step (1) is specifically:

and the distance between the coordinate position of the middle point of the ith road section in the upstream road section and the coordinate position of the center of the jth intersection of the upstream road section is represented, and L represents the square circle of the road section to be measured.

3. The method for selecting the state vector in the nonparametric regression short-term traffic flow prediction according to claim 1, wherein the historical retroactive maximum cycle number m in the step (3) is specifically:

c denotes a prediction period of the time period,

the average speed is indicated.

4. The method for selecting the state vector in the non-parametric regression short-time traffic flow prediction according to claim 1, wherein the dimension M of the initial state vector in the step (5) is specifically:

5. The method for selecting the state vector in the nonparametric regression short-term traffic flow prediction according to claim 1, wherein the fitness function in the step (7) is specifically:

F(EV，ARE，PER，EC)＝λ₁EV+λ₂ARE+λ₃/PER+λ₄EC, EV represents the variance of the prediction error, ARE represents the average relative error, PER represents the prediction relative error in the interval [0, alpha ]]In percent between, EC represents the coefficient of equality, lambda₁Denotes the weight of EV, λ₂Denotes the weight of ARE, λ₃Represents the weight of PER, λ₄Denotes the weight of EC and α denotes the prediction relative error.