CN114399743A - Method for generating future track of obstacle - Google Patents


Info

Publication number
CN114399743A
Authority
CN
China
Prior art keywords
track, future, predicted, historical, tracks
Prior art date
Legal status (assumed, not a legal conclusion)
Granted
Application number
CN202111506962.0A
Other languages
Chinese (zh)
Other versions
CN114399743B (en)
Inventor
肖钟雯
王耀农
张震
陈啟煌
Current Assignee
Zhejiang Zero Run Technology Co Ltd
Original Assignee
Zhejiang Zero Run Technology Co Ltd
Priority date
Filing date
Publication date
Application filed by Zhejiang Zero Run Technology Co Ltd filed Critical Zhejiang Zero Run Technology Co Ltd
Priority to CN202111506962.0A priority Critical patent/CN114399743B/en
Publication of CN114399743A publication Critical patent/CN114399743A/en
Application granted granted Critical
Publication of CN114399743B publication Critical patent/CN114399743B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/24 Classification techniques
    • G06F18/241 Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2415 Classification techniques relating to the classification model, based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Molecular Biology (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Probability & Statistics with Applications (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Traffic Control Systems (AREA)

Abstract

The invention discloses a method for generating future trajectories of obstacles. To overcome the severe trajectory-prediction distortion caused by ignoring the interference of other traffic participants, the invention proposes a multi-trajectory generation method that combines a CVAE and a GAN. The encoding features of the historical trajectory are combined with the interaction features of surrounding traffic participants, future trajectories are generated by decoding in a generation model, and a discrimination model judges whether each generated trajectory is real or fake. At the same time, the generated future trajectories are constrained with drivable-area information, avoiding the generation of unreasonable trajectories. Multiple trajectories, each combined with the interaction features of surrounding traffic participants, are obtained through repeated random sampling; the influence of other traffic participants is taken into account; the network automatically learns the multi-intention behaviors of the predicted target in different environments, fully expressing the uncertainty of the future trajectory; and the drivable-area constraint allows the future trajectory of the predicted target to be predicted even in complex scenes.

Description

Method for generating future track of obstacle
Technical Field
The invention relates to the field of automatic control of intelligent automobiles, and in particular to a method for generating future trajectories of obstacles.
Background
An intelligent driving automobile senses the surrounding environment and obstacle information through multiple sensors, and the perception results are sent to a motion planning system to plan the vehicle's motion, realizing automatic control of the automobile. Behavior prediction means predicting the future motion trajectories of surrounding traffic participants. Accurately predicting these trajectories can effectively avoid collisions, ensure that the intelligent driving automobile runs safely in complex environments, and improve the stability and comfort of planning and control.
Existing research mainly falls into two categories: methods based on physical models and methods based on data driving. Physics-based methods predict the future trajectory from the historical trajectory according to kinematic principles, but they consider only the motion information of the predicted target and are accurate only over short horizons; long-term prediction is inaccurate. Data-driven methods, i.e., machine learning or deep learning methods, design a suitable model and perform prediction by training on data; by combining the target's historical information, environmental information, and interaction information of surrounding targets, they can make more accurate long-term predictions. However, predicting the trajectories of surrounding traffic participants is not a deterministic task: it is influenced by the environment, the driver's intention, and driving habits. How to accurately predict the future trajectories of surrounding traffic participants, so as to improve the driving safety and comfort of the intelligent driving vehicle, is therefore an urgent problem and difficulty.
For example, the vehicle intention and trajectory prediction method disclosed in publication No. CN112347567A simplifies the prediction of human driving behavior into predicting which region the vehicle will merge into: the region the vehicle finally enters is determined from information about the vehicle itself and its neighbors, the vehicle's intention is identified with a hidden Markov model, and the trajectory is predicted in combination with a vehicle dynamics model.
The vehicle trajectory prediction model with a semantic map and an LSTM disclosed in publication No. CN113128326A provides a trajectory prediction method that combines a semantic map: an LSTM model extracts the dynamic features of the target obstacle, and a CNN model learns the semantic map features of the driving environment from an image map. Trajectory prediction information is obtained through an MLP that combines the dynamic features of the target obstacle and the semantic map features of the driving environment, and a neural network iteratively predicts the positions of moving obstacles at consecutive time points of the prediction interval to complete the prediction of all time points.
The trajectory prediction method and apparatus for an obstacle disclosed in publication No. CN112364997B predicts, based on the environmental information around a target obstacle, the historical trajectory data of the target obstacle, and predetermined reference trajectories, the probability that the intention of the target obstacle belongs to each intention type together with the predicted trajectory of the target obstacle under that intention type. The reference trajectories are predetermined by dividing the real trajectories of multiple obstacles, acquired in advance together with the environmental information around each real trajectory, into several environment types; for each environment type, the real trajectories under that environment are clustered, and the clustered trajectories serve as the reference trajectories under that environment type, each reference trajectory corresponding to an intention type.
Disclosure of Invention
The invention mainly solves the problem in the prior art of severe trajectory-prediction distortion caused by ignoring the interference of other traffic participants, and provides a method for generating a future trajectory of an obstacle.
The technical problem of the invention is mainly solved by the following technical scheme:
The method comprises a training stage and a prediction stage. In the training stage, a trajectory generation model and a trajectory discrimination model run simultaneously: the trajectory generation model generates a series of constrained predicted future trajectories, and the trajectory discrimination model is responsible for judging whether the generated predicted future trajectories are real or fake. The training stage runs only during development, while the prediction stage is applied in actual scenes; that is, the model is trained during product development to improve prediction accuracy, and trajectory prediction is performed once the accuracy meets the requirement.
The training phase performs the following steps:
A1: Acquire the historical trajectories and real future trajectories of the predicted target and of the other obstacles around it, encode them to obtain the encoding features of the historical trajectories and of the real future trajectories, and obtain the historical-trajectory encoding features after distribution sampling; extract the interaction features of the obstacles around the predicted target, and generate a predicted future trajectory of the predicted target. The obstacles may be surrounding traffic participants of all types, including pedestrians, motor vehicles, and non-motor vehicles. The training stage uses data from a moment at which the prediction is made, i.e., both the historical trajectories of the predicted target and other obstacles and the future trajectory at that moment are known; the known future trajectory is called the real future trajectory, while the predicted future trajectory is the one generated by the generation model and may deviate from the real future trajectory. After the interaction features of the surrounding obstacles are extracted, the influence of the surrounding obstacles serves as one of the inputs for predicting the future trajectory, improving prediction accuracy.
A2: Acquire a bird's-eye view of the drivable area of the scene in which the predicted target is located, and constrain the predicted future trajectory of the predicted target. In an actual traffic scene, some places cannot be driven on, such as the opposing direction of a one-way road, a bus lane, or an intersection at a red light; only by constraining the predicted trajectory with these conditions can the predicted future trajectory conform to the actual situation.
A3: Alternately update the weights of the discrimination model and the generation model.
the prediction phase performs the following steps:
B1: Encode the historical trajectories of the predicted target and of the obstacles around it to obtain their historical-trajectory encoding features, and obtain the interaction features of the obstacles around the predicted target from the encoding features of those obstacles' historical trajectories. In the prediction stage, trajectory prediction is performed in a real application scene, so there is no real future trajectory. Obtain the historical-trajectory encoding features after distribution sampling, and obtain the predicted future trajectory of the predicted target from them.
B2: Project the predicted future trajectory of the predicted target into the drivable-area bird's-eye view to constrain the predicted future trajectory.
B3: Repeat step B1 until enough predicted future trajectories of the predicted target are generated; "enough" refers to K predicted future trajectories.
Preferably, if there are n obstacles in the scene and the i-th obstacle is taken as the prediction target, the coordinate expression of the historical trajectory of the prediction target at the t-th time is:

Hist_i = {(x_i^(t-τ+1), y_i^(t-τ+1)), ..., (x_i^t, y_i^t)}

where τ is the number of moments before the reference time t, and (x_i^t, y_i^t) is the coordinate of the predicted target at time t in the world coordinate system. The real future trajectory of the predicted target at time t is:

Future_i = {(x_i^(t+1), y_i^(t+1)), ..., (x_i^(t+τ'), y_i^(t+τ'))}

where τ' is the number of moments after the reference time t. The historical trajectory Hist_i and the real future trajectory Future_i are each a sequence of coordinate points of the predicted target in the world coordinate system at several moments; these coordinate points are called trajectory points.
Preferably, in the training stage the historical trajectory Hist_i is encoded to obtain the historical-trajectory encoding features, and the future trajectory Future_i is encoded to obtain the future-trajectory encoding features. After the historical-trajectory and future-trajectory encoding features are superposed, the mean μ and variance σ² are obtained through two fully connected layers, for learning the distribution Z(μ, σ²) ~ P(Z | Hist_i, Future_i), where Z is modeled as a Gaussian distribution and Z ~ N(0, I). The distance loss between Z(μ, σ²) and the Gaussian distribution N(0, I) is L1(i) = KLD(Z(μ, σ²) || N(0, I)). Z(μ, σ²) is randomly sampled to obtain a set of latent variables; the latent variables pass through a multilayer perceptron to obtain a feature vector with the same dimension as the historical-trajectory encoding features, which is then normalized and multiplied with the historical-trajectory encoding features to obtain the historical-trajectory encoding features after distribution sampling. The training stage uses the KL divergence to measure the distance loss between the learned distribution Z(μ, σ²) and the Gaussian distribution N(0, I).
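The distribution-learning step can be sketched numerically as follows. The fully connected layers are stand-in random matrices rather than trained weights, and the multilayer perceptron on the latent variable is elided: the softmax-normalized latent itself serves as the gate here.

```python
import numpy as np

rng = np.random.default_rng(1)
d = 16                                # encoding feature dimension (assumed)

hist_feat = rng.standard_normal(d)    # encoder-1 output (stand-in)
future_feat = rng.standard_normal(d)  # encoder-2 output (stand-in)
joint = np.concatenate([hist_feat, future_feat])

W_mu, W_logvar = rng.standard_normal((2, d, 2 * d)) * 0.1
mu = W_mu @ joint                     # fully connected layer -> mean
logvar = W_logvar @ joint             # fully connected layer -> log variance

# L1(i): KL divergence between N(mu, sigma^2) and the prior N(0, I).
kl = 0.5 * np.sum(np.exp(logvar) + mu**2 - 1.0 - logvar)

# Reparameterized sample of the latent variable.
z = mu + np.exp(0.5 * logvar) * rng.standard_normal(d)

# Softmax normalization of the latent, then elementwise product with the
# history encoding, gives the feature "after distribution sampling".
gate = np.exp(z) / np.exp(z).sum()
sampled_hist_feat = gate * hist_feat
```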
Preferably, the historical trajectories of all obstacles around the predicted target are encoded to obtain the encoding features of the surrounding obstacles, and these encoding features are averaged to obtain the interaction features of the obstacles around the predicted target. The historical-trajectory encoding features after distribution sampling are superposed with the interaction features of the obstacles around the predicted target to obtain the final encoding features; decoding the final encoding features yields the expression of the predicted future trajectory of the i-th predicted target:

Prediction_i = {(x̂_i^(t+1), ŷ_i^(t+1)), ..., (x̂_i^(t+τ'), ŷ_i^(t+τ'))}

where Prediction_i is the predicted future trajectory.
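Mean-pooling the neighbors' encodings and superposing them with the sampled history feature can be sketched as below; the dimensions, the additive superposition, and the linear stand-in decoder are all assumptions for illustration:

```python
import numpy as np

rng = np.random.default_rng(2)
d = 16            # feature dimension (assumed)
n_neighbors = 5

# Encoder-3 outputs for the surrounding obstacles (stand-in values).
neighbor_feats = rng.standard_normal((n_neighbors, d))

# Interaction feature: average of the surrounding obstacles' encodings.
interaction = neighbor_feats.mean(axis=0)

sampled_hist_feat = rng.standard_normal(d)    # from distribution sampling
final_feat = sampled_hist_feat + interaction  # superposition (assumed additive)

# Decoder stand-in: a linear map from the final feature to tau' (x, y) points.
tau_p = 6
W_dec = rng.standard_normal((tau_p * 2, d)) * 0.1
prediction_i = (W_dec @ final_feat).reshape(tau_p, 2)
```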
Preferably, the distribution Z(μ, σ²) is randomly sampled K times to generate K groups of predicted future trajectories of the predicted target, where the k-th group is

Prediction_ik = {(x̂_ik^(t+1), ŷ_ik^(t+1)), ..., (x̂_ik^(t+τ'), ŷ_ik^(t+τ'))},  k ∈ {1, ..., K}.

In the training stage, the mean square error loss is computed from the K groups of predicted future trajectories Prediction_ik of the i-th obstacle and the real future trajectory Future_i:

L2(i) = (1/(K·τ')) Σ_{k=1}^{K} Σ_{t'=1}^{τ'} || Prediction_ik^(t+t') − Future_i^(t+t') ||²
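With the K sampled trajectories stacked into one array, a mean square error against the real future trajectory can be computed as below; the 1/(K·τ') normalization is an assumption, since the patent shows the formula only as an image:

```python
import numpy as np

rng = np.random.default_rng(3)
K, tau_p = 4, 5

future_i = rng.standard_normal((tau_p, 2))  # real future trajectory
# K sampled predictions: ground truth plus small per-sample noise.
predictions = future_i + 0.1 * rng.standard_normal((K, tau_p, 2))

# L2(i): squared point-wise error averaged over samples and future steps.
l2 = np.sum((predictions - future_i) ** 2) / (K * tau_p)
```

Some multi-modal trajectory models instead keep only the minimum error over the K samples (a "variety" loss); the patent text states only that the loss is computed between the K groups and the real trajectory.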
preferably, a bird's eye view of a travelable area of a scene where the automatic driving vehicle is located is obtained; projecting points in the predicted future track into a driving-capable area aerial view, and constraining track points of the predicted future track according to the driving-capable area; the aerial view of the travelable area can be obtained by data identification obtained by a vision or laser sensor, unreasonable tracks in the predicted future tracks are removed through the constraint of the travelable area, and the predicted future tracks are optimized.
Preferably, the trajectory points in the predicted future trajectory are projected into the bird's-eye view in sequence, and the shortest distance dist_i^(t+t') from each trajectory point to the drivable area is calculated from the drivable-area bird's-eye view, each distance corresponding to a future moment t + t'; if the trajectory point is projected inside the drivable area, then dist_i^(t+t') = 0. The constraint loss is calculated as

L3(i) = α Σ_{t'=1}^{τ'} f(dist_i^(t+t'); β, γ) · g(t')

where α, β, and γ are hyper-parameter constants; f is a distance-related constraint value, with β and γ adjusting the constraint strength: the greater the distance, the further the point deviates from the drivable area and the greater the penalty, i.e., the loss; and g(t') is a time-related adjustment factor that grows as t' becomes smaller, so that the penalty is larger for points closer to the current moment, since trajectory points predicted nearer to the present are expected to be more accurate.
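The per-point shortest distance and the constraint loss can be sketched as follows. Because the exact functional forms appear only as images in the source, the distance term β·d^γ and the time factor 1/t' below are illustrative assumptions consistent with the stated monotonicity (larger distance means larger loss, smaller t' means larger loss):

```python
import numpy as np

drivable = np.zeros((8, 8), dtype=bool)
drivable[:, :4] = True                  # left half of the BEV is drivable
free_cells = np.argwhere(drivable).astype(float)

def shortest_dist(point, res=1.0):
    # Distance from a projected trajectory point to the nearest drivable
    # cell; zero if the point already falls inside the drivable area.
    cell = np.floor(point / res)
    if drivable[int(cell[0]), int(cell[1])]:
        return 0.0
    return np.linalg.norm(free_cells - cell, axis=1).min() * res

def constraint_loss(traj, alpha=1.0, beta=1.0, gamma=1.0):
    # L3(i) = alpha * sum_t' f(dist; beta, gamma) * g(t'), with the
    # assumed forms f(d) = beta * d**gamma and g(t') = 1 / t'.
    return alpha * sum(
        beta * shortest_dist(p) ** gamma / (tp + 1)
        for tp, p in enumerate(traj)
    )

inside = np.array([[1.0, 1.0], [2.0, 2.0]])   # stays in the drivable area
outside = np.array([[1.0, 6.0], [2.0, 7.0]])  # strays into the blocked half
```

A full implementation would precompute a distance map of the bird's-eye view once (e.g. a Euclidean distance transform) rather than scanning all free cells per point.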
Preferably, updating the weights of the discrimination model comprises: splicing the historical trajectory and the predicted future trajectory into a complete trajectory and sending it into the discrimination model; after the complete trajectory is encoded, a multilayer perceptron outputs a probability value pred_ik that the trajectory is real, and the cross-entropy loss is computed against the label gt_ik, where the corresponding label gt_ik is false. The historical trajectory and the real future trajectory are likewise spliced and sent into the discrimination model; after the complete trajectory is encoded, the multilayer perceptron outputs a probability value pred_i_real, and the cross-entropy loss is computed against the label gt_i_real, where the corresponding label gt_i_real is true. The discrimination loss L4(i) is calculated as

L4(i) = (1/K) Σ_{k=1}^{K} CE(pred_ik, gt_ik) + CE(pred_i_real, gt_i_real)

and the total discrimination model loss is Loss_D_i = L4(i). Updating the weights of the generation model comprises: splicing the historical trajectory and the generated trajectory into a complete trajectory and sending it into the discrimination model; after the complete trajectory is encoded, the multilayer perceptron outputs a probability value pred_ik, and the cross-entropy loss is computed against the label gt_ik, where at this point the corresponding label gt_ik is true. The discrimination loss at this point is expressed as

L4(i) = (1/K) Σ_{k=1}^{K} CE(pred_ik, gt_ik)

The total generation model loss is a weighted sum of the generation and discrimination losses: Loss_G_i = α1·L1(i) + α2·L2(i) + α3·L3(i) + α4·L4(i), where α1, α2, α3, α4 are the corresponding loss weights.
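The label flip between the two updates can be made concrete with a binary cross-entropy helper; the `pred` values below are stand-ins for the multilayer perceptron's outputs, and the component losses l1, l2, l3 are placeholders for the KL, mean-square-error, and drivable-area losses:

```python
import numpy as np

def bce(pred, label, eps=1e-7):
    # Binary cross-entropy between a predicted realness probability and
    # a 0/1 label (1 = real trajectory, 0 = generated trajectory).
    pred = np.clip(pred, eps, 1 - eps)
    return -(label * np.log(pred) + (1 - label) * np.log(1 - pred))

pred_fake = np.array([0.3, 0.2, 0.4])  # discriminator scores on K fakes
pred_real = 0.8                        # score on the real trajectory

# Discriminator update: generated trajectories labelled fake, real real.
loss_d = bce(pred_fake, 0).mean() + bce(pred_real, 1)

# Generator update: the same generated trajectories are labelled real,
# pushing the generator to fool the discriminator.
loss_g_adv = bce(pred_fake, 1).mean()

# Total generator loss as a weighted sum of the four components.
alphas = (1.0, 1.0, 1.0, 1.0)
l1, l2, l3 = 0.5, 0.2, 0.1             # placeholder component losses
loss_g = alphas[0] * l1 + alphas[1] * l2 + alphas[2] * l3 + alphas[3] * loss_g_adv
```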
Preferably, in the prediction stage the historical trajectories of the predicted target and of the obstacles around it are encoded to obtain their historical-trajectory encoding features, and the encoding features of the surrounding obstacles are averaged to obtain the interaction features of the obstacles around the predicted target. A latent variable obtained by randomly sampling the learned distribution Z(μ, σ²) yields a feature vector, which, after normalization, is multiplied with the historical-trajectory encoding features to obtain the historical-trajectory encoding features after distribution sampling. The final encoding features are obtained by superposing the distribution-sampled historical-trajectory encoding features and the interaction features of the obstacles around the predicted target; after decoding, the coordinate expression of the predicted future trajectory Prediction_i is obtained. The points of Prediction_i are projected into the drivable-area bird's-eye view, the predicted future trajectory is constrained, and trajectories that leave the drivable area are removed. Random sampling from the distribution Z(μ, σ²) is then repeated until K predicted future trajectories are generated, which serve as the possible future driving trajectories of the predicted target. The distribution used in the prediction stage is the same as that learned in the training stage.
The invention has the beneficial effects that:
1. The generation model fully considers the historical-trajectory features of the predicted target and the interaction features of surrounding vehicles, improving prediction accuracy;
2. Through the learning of the discrimination model, the network can produce more realistic trajectories;
3. By applying the drivable-area constraint to the generated trajectories, unreasonable trajectories are filtered out and more reasonable, effective trajectories are obtained;
4. The method is applicable to surrounding traffic participants of all types, including pedestrians, motor vehicles, and non-motor vehicles;
5. In the trajectory generation stage, the influence of other traffic participants is considered, the network automatically learns the multi-intention behaviors of the predicted target in different environments, and the uncertainty of the future trajectory is fully expressed.
Drawings
FIG. 1 is a schematic flow chart of a training phase of a method for generating a future trajectory of an obstacle according to the present invention;
fig. 2 is a schematic flow chart of a prediction stage of a method for generating a future obstacle trajectory according to the present invention.
Detailed Description
The technical scheme of the invention is further specifically described by the following embodiments and the accompanying drawings.
Embodiment:
The method for generating a future trajectory of an obstacle of this embodiment, shown in fig. 1 and fig. 2, is mainly applied to an automatic driving automobile and is a multi-trajectory generation method combining a CVAE and a GAN. It comprises a training stage and a prediction stage; the training stage runs only during product development, and after the product is put into use only the prediction stage is executed to predict future trajectories. In the training stage, a trajectory generation model and a trajectory discrimination model run simultaneously: the trajectory generation model learns a Gaussian distribution conditioned on the encoding features of the historical trajectory and of the future trajectory, samples from this Gaussian distribution, and combines the samples with the historical-trajectory encoding features and the interaction features of surrounding traffic vehicles to generate a series of constrained trajectories, while the discrimination model is responsible for judging whether the generated trajectories are real or fake. In the prediction stage, only the generation model runs: the Gaussian distribution learned in the training stage is sampled multiple times, combined with the historical-trajectory encoding features and the interaction features of surrounding traffic participants, and passed through a decoder to generate predicted future trajectories; the final multiple trajectories are obtained after the predicted future trajectories are projected and those outside the drivable area are removed.
To express the trajectory clearly, the predicted target can be regarded as traveling in a world coordinate system, i.e., the trajectory of the predicted target is formed by coordinate points in the world coordinate system.
Assuming the current time is the t-th time and there are n other traffic participants, regarded as obstacles, in the traffic scene of the automatically driven vehicle, the historical trajectory of the i-th obstacle at the t-th time is:

Hist_i = {(x_i^(t-τ+1), y_i^(t-τ+1)), ..., (x_i^t, y_i^t)}

where τ is the number of moments before time t and indicates the length of the historical trajectory; the future trajectory at time t is:

Future_i = {(x_i^(t+1), y_i^(t+1)), ..., (x_i^(t+τ'), y_i^(t+τ'))}

where τ' is the number of moments after the reference time t and indicates the length of the future trajectory.
In the training stage, sensors on the automatic driving vehicle are used to obtain the historical and future trajectories of the vehicle itself and of surrounding vehicles; these can be obtained by recognition from vision or laser sensor data, and the specific recognition method is not detailed in this embodiment. For the i-th prediction target, the historical trajectory Hist_i passes through encoder 1 to obtain the historical-trajectory encoding features, and the future trajectory Future_i passes through encoder 2 to obtain the future-trajectory encoding features. After the historical-trajectory and future-trajectory encoding features are superposed, the mean μ and variance σ² are obtained through two fully connected layers, for learning the distribution Z(μ, σ²) ~ P(Z | Hist_i, Future_i); Z is modeled as a Gaussian distribution, Z ~ N(0, I). The training stage uses the KL divergence to measure the distance loss between the learned distribution Z(μ, σ²) and the Gaussian distribution N(0, I): L1(i) = KLD(Z(μ, σ²) || N(0, I)). Then, according to the learned distribution Z(μ, σ²) ~ P(Z | Hist_i, Future_i), random sampling yields a set of latent variables; the latent variables pass through a multilayer perceptron to obtain a feature vector with the same dimension as the historical-trajectory encoding features, which is multiplied with the historical-trajectory encoding features after softmax normalization to obtain the historical-trajectory encoding features after distribution sampling.

Meanwhile, the interaction features of the vehicles around the i-th target are extracted as follows: the historical trajectories of all obstacles around the i-th prediction target each pass through encoder 3 to obtain the encoding features of their historical trajectories, and averaging these encoding features gives the interaction features of the obstacles around the i-th prediction target. The final encoding features are obtained by superposing the distribution-sampled historical-trajectory encoding features and the interaction features of the obstacles; decoding the final encoding features yields the coordinates of the predicted future trajectory:

Prediction_i = {(x̂_i^(t+1), ŷ_i^(t+1)), ..., (x̂_i^(t+τ'), ŷ_i^(t+τ'))}

The learned distribution is randomly sampled K times to generate K groups of trajectories:

Prediction_ik = {(x̂_ik^(t+1), ŷ_ik^(t+1)), ..., (x̂_ik^(t+τ'), ŷ_ik^(t+τ'))},  k ∈ {1, ..., K}.

In the training stage, the mean square error loss is computed between the K generated trajectories Prediction_ik and the real future trajectory Future_i:

L2(i) = (1/(K·τ')) Σ_{k=1}^{K} Σ_{t'=1}^{τ'} || Prediction_ik^(t+t') − Future_i^(t+t') ||²
A drivable-area bird's-eye view of the scene in which the automatic driving vehicle is located is acquired; it can be obtained by recognition from vision or laser sensor data, and the specific recognition method is not detailed in this embodiment. The points of the generated trajectories Prediction_ik are projected into the drivable-area bird's-eye view, and the trajectory points of the generated trajectories are constrained according to the drivable area, as follows: the trajectory points of the predicted future trajectories Prediction_ik are projected into the bird's-eye view in sequence, and the shortest distance dist_ik^(t+t') from each trajectory point to the drivable area is calculated from the drivable-area bird's-eye view, each distance corresponding to a future moment; if a trajectory point is projected inside the drivable area, then dist_ik^(t+t') = 0. From the shortest distance of each trajectory point to the drivable area and the corresponding future time point, the drivable-area constraint loss is computed:

L3(i) = α Σ_{k=1}^{K} Σ_{t'=1}^{τ'} f(dist_ik^(t+t'); β, γ) · g(t')

where α, β, and γ are hyper-parameter constants; f is a distance-related constraint value, with β and γ adjusting the constraint strength: the greater the distance, the further the point deviates from the drivable area and the greater the penalty, i.e., the loss; and g(t') is a time-related adjustment factor that grows as t' becomes smaller, so that the penalty is larger for points closer to the current moment, since trajectory points predicted nearer to the present are expected to be more accurate.
The discrimination model consists of encoder 4 and a multilayer perceptron; throughout the training process, the weights of the discrimination model and of the generation model are updated alternately.
The process of updating the discrimination model is as follows: the historical trajectory Hist_i and a predicted future trajectory Prediction_ik are spliced into a complete trajectory and sent into the discrimination model; after the complete trajectory is encoded by encoder 4, the multilayer perceptron outputs a probability value pred_ik that the trajectory is real, and the cross-entropy loss is computed against the label gt_ik, where the corresponding label gt_ik is false. Meanwhile, the historical trajectory Hist_i and the real future trajectory Future_i are spliced and sent into the discrimination model again; after the complete trajectory is encoded by encoder 4, the multilayer perceptron outputs a probability value pred_i_real, and the cross-entropy loss is computed against the label gt_i_real, where the corresponding label gt_i_real is true. The discrimination loss is then obtained as

L4(i) = (1/K) Σ_{k=1}^{K} CE(pred_ik, gt_ik) + CE(pred_i_real, gt_i_real)

and the total discrimination model loss Loss_D_i = L4(i) is used to update the weights of the discrimination model.
The process of updating the generation model is as follows: the historical track Hist_i and a generated trajectory Prediction_ik are spliced into a complete track and fed into the discrimination model; after the complete track is encoded by encoder 4, the multilayer perceptron outputs a probability value pred_ik that the track is real, and the cross-entropy loss between pred_ik and the label value gt_ik is computed, the corresponding label gt_ik at this time being true. The discrimination loss at this time is expressed as:
L4(i) = Σ_{k=1}^{K} CE(pred_ik, gt_ik).
The total loss of the generation model is the weighted sum of the generation-model and discrimination-model losses: Loss_G_i = α1·L1(i) + α2·L2(i) + α3·L3(i) + α4·L4(i), where α1, α2, α3, α4 are the weights of the corresponding losses; the total generation-model loss is used to update the weights of the generation model.
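The alternating loss computation described above can be sketched in plain Python; the helper names and the scalar binary cross-entropy below are illustrative stand-ins for the patent's networks, not the actual implementation.

```python
import math

def bce(p, label):
    """Binary cross-entropy between a predicted probability p and a 0/1 label."""
    eps = 1e-7
    p = min(max(p, eps), 1 - eps)
    return -(label * math.log(p) + (1 - label) * math.log(1 - p))

def discriminator_loss(pred_fake, pred_real):
    # Generated tracks are labelled false (0), the real track true (1).
    return sum(bce(p, 0) for p in pred_fake) + bce(pred_real, 1)

def generator_adversarial_loss(pred_fake):
    # When updating the generator, the generated tracks are labelled true (1).
    return sum(bce(p, 1) for p in pred_fake)

def generator_total_loss(l1, l2, l3, l4, weights=(1.0, 1.0, 1.0, 1.0)):
    # Loss_G = a1*L1 + a2*L2 + a3*L3 + a4*L4 (weighted sum as in the description).
    a1, a2, a3, a4 = weights
    return a1 * l1 + a2 * l2 + a3 * l3 + a4 * l4
```

A discriminator that confidently labels fakes near 0 and the real track near 1 incurs a small loss; a fooled discriminator incurs a large one, which is what drives the alternating updates.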
The encoders 1, 2, 3 and 4 mentioned above can each be a one-dimensional convolutional network or an RNN-family recurrent neural network such as an LSTM or GRU, and the decoder can be an RNN-family recurrent neural network (LSTM, GRU, etc.).
In the prediction stage, the autonomous vehicle predicts the trajectories of surrounding obstacles in a real driving scene; at this point only historical track information is available, with no future track information. The specific process of the prediction stage is as follows: for the ith prediction target, the historical track Hist_i is passed through encoder 1 to obtain the historical-track coding features; the historical tracks of the obstacles around the ith prediction target are passed through encoder 3 to obtain the surrounding-obstacle coding features, which are then averaged to obtain the interaction features of the obstacles around the ith prediction target.
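The averaging of neighbor encodings into an interaction feature can be sketched as follows; the toy `encode_track` function is a hypothetical stand-in for the patent's encoder 3 (a 1-D convolutional network or RNN), chosen only so the averaging step is concrete.

```python
import numpy as np

def encode_track(track):
    """Toy stand-in for an encoder: the last position concatenated with
    the mean step displacement of the historical track."""
    track = np.asarray(track, dtype=float)   # (T, 2) xy coordinates
    steps = np.diff(track, axis=0)
    return np.concatenate([track[-1], steps.mean(axis=0)])

def interaction_feature(neighbor_tracks):
    """Encode every surrounding obstacle's history and average the codes,
    as described for the interaction feature of surrounding obstacles."""
    codes = [encode_track(t) for t in neighbor_tracks]
    return np.mean(codes, axis=0)
```

Because the codes are averaged, the interaction feature has a fixed dimension regardless of how many obstacles surround the prediction target.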
A group of hidden variables is obtained by random sampling from the distribution Z(μ, σ²); the hidden variables are passed through a multilayer perceptron to obtain a feature vector of the same dimension as the historical-track coding features; after softmax normalization, the feature vector is multiplied with the historical-track coding features to obtain the distribution-sampled historical-track coding features. The final coding feature is obtained by superposing the distribution-sampled historical-track coding features and the interaction features of the surrounding obstacles, and the decoder then yields the coordinate expression of the predicted future trajectory:
Prediction_i = {(x̂_{t+1}^i, ŷ_{t+1}^i), …, (x̂_{t+τ'}^i, ŷ_{t+τ'}^i)}
The points of the predicted future trajectory Prediction_i are projected into the travelable-area bird's-eye view, the track points of the generated trajectory are constrained according to the travelable area, and trajectories whose track points fall outside the travelable area are removed. Random sampling from the distribution Z(μ, σ²) is then performed again, and the process is repeated until K trajectories are generated:
{Prediction_i1, Prediction_i2, …, Prediction_iK}
The value of K can be chosen according to actual needs; in this embodiment K typically takes the value 6, 8 or 10.
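The sample-decode-filter loop above can be sketched as follows. The `decode` and `in_drivable` callables are hypothetical stand-ins for the patent's decoder and bird's-eye-view check, and the softmax reweighting is a simplified substitute for the multilayer perceptron; this is only an illustration of the control flow, under those assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(v):
    e = np.exp(v - v.max())
    return e / e.sum()

def sample_tracks(mu, sigma, hist_code, decode, in_drivable, k=6, max_tries=100):
    """Repeat distribution sampling until k tracks survive the travelable-area
    filter, mirroring the repeated sampling of Z(mu, sigma^2)."""
    tracks = []
    tries = 0
    while len(tracks) < k and tries < max_tries:
        tries += 1
        z = rng.normal(mu, sigma, size=hist_code.shape)  # latent sample from Z(mu, sigma^2)
        attn = softmax(z)                                # MLP + softmax stand-in
        code = attn * hist_code                          # reweighted history code
        track = decode(code)                             # decoder stand-in
        if all(in_drivable(p) for p in track):           # drop tracks leaving the region
            tracks.append(track)
    return tracks
```

The `max_tries` guard is an added safeguard so the loop terminates even when the drivable-area filter rejects every sample.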
The future trajectories of the other obstacles around the current autonomous vehicle can be predicted and generated in the same way as for the ith prediction target.
In this embodiment, a multi-track generation method combining a CVAE and a GAN learns a generation model and a discrimination model in a data-driven manner. The distribution Z(μ, σ²) conditioned on the historical-track and future-track features is learned, and multiple random samplings of this learned distribution generate multiple trajectories. The generation model fully considers the historical track features of the prediction target and the interaction features of surrounding vehicles, while learning against the discrimination model drives the network toward more realistic trajectories; meanwhile, constraining the generated trajectories to the travelable region filters out unreasonable trajectories, yielding more reasonable and effective ones.
In addition, the method of this embodiment can be applied to all types of surrounding traffic participants, including pedestrians, motor vehicles and non-motor vehicles; it only requires the travelable region of the scene and the track information of the corresponding targets. If only the targets' track information is available, multiple trajectories can still be generated, merely without the travelable-area constraint.
It should be understood that the examples are for illustrative purposes only and are not intended to limit the scope of the present invention. Furthermore, after reading the teachings of the present invention, those skilled in the art may make various changes or modifications, and such equivalents fall within the scope of the invention as defined by the appended claims.

Claims (9)

1. The method for generating the future trajectory of the obstacle is characterized by comprising a training stage and a prediction stage, wherein the training stage simultaneously runs a trajectory generation model and a trajectory discrimination model, the trajectory generation model generates a series of predicted future trajectories with constraints, and the trajectory discrimination model is responsible for discriminating the truth of the generated predicted future trajectories;
the training phase performs the following steps:
a1: acquiring historical tracks and real future tracks of a predicted target and other obstacles around the predicted target, coding the historical tracks and the real future tracks to obtain coding features of the historical tracks and the real future tracks, and acquiring the coding features of the historical tracks subjected to distributed sampling; extracting interactive features of obstacles around the predicted target, and generating a predicted future trajectory of the predicted target;
a2: acquiring a navigable area aerial view of a scene where a predicted target is located, and constraining a predicted future track of the predicted target;
a3: alternately updating the weights of the discrimination model and the generation model;
the prediction phase performs the following steps:
b1: coding the historical tracks of the predicted target and other obstacles around the predicted target to obtain historical track coding features of the predicted target and other obstacles around the predicted target, and obtaining interaction features of the obstacles around the predicted target according to the historical track coding features of the other obstacles around the predicted target; acquiring the historical track coding features subjected to distributed sampling, and acquiring the predicted future track of the predicted target according to the historical track coding features subjected to distributed sampling;
b2: projecting the predicted future trajectory of the predicted target into the travelable area aerial view to constrain the predicted future trajectory of the predicted target;
b3: step B1 is performed until a predicted future trajectory is generated for enough of the plurality of predicted targets.
2. The method according to claim 1, wherein, if there are n obstacles in the scene and the ith obstacle is taken as the prediction target, the coordinate expression of the historical trajectory of the prediction target at time t is:
Hist_i = {(x_{t-τ}^i, y_{t-τ}^i), …, (x_{t-1}^i, y_{t-1}^i), (x_t^i, y_t^i)}, i ≤ n,
where τ denotes the τ-th time before time t and (x_t^i, y_t^i) is the coordinate of the prediction target at time t in the world coordinate system; the real future trajectory of the prediction target at time t is:
Future_i = {(x_{t+1}^i, y_{t+1}^i), …, (x_{t+τ'}^i, y_{t+τ'}^i)},
where τ' denotes the τ'-th time after time t.
3. The method for generating a future trajectory of an obstacle according to claim 2, wherein in the training phase the historical track Hist_i is encoded to obtain the historical-track coding features, and the future track Future_i is encoded to obtain the future-track coding features; the historical-track and future-track coding features are superposed and passed through two fully connected layers to obtain the mean μ and variance σ², which are used to learn the distribution Z(μ, σ²) ~ P(Z | Hist_i, Future_i), where Z is modeled as a Gaussian distribution and Z ~ N(0, I); the distance loss between Z(μ, σ²) and the Gaussian distribution N(0, I) is computed as L1(i) = KLD(Z(μ, σ²), N(0, I)); Z(μ, σ²) is randomly sampled to obtain a group of hidden variables, the hidden variables are passed through a multilayer perceptron to obtain a feature vector of the same dimension as the historical-track coding features, and after normalization the feature vector is multiplied with the historical-track coding features to obtain the distribution-sampled historical-track coding features.
4. The method according to claim 2, wherein the historical trajectories of all obstacles around the prediction target are encoded to obtain the surrounding-obstacle coding features, which are averaged to obtain the interaction features of the obstacles around the prediction target; the distribution-sampled historical-track coding features and the interaction features of the obstacles around the prediction target are superposed to obtain the final coding feature of the predicted future trajectory, and the final coding feature is decoded to obtain the expression of the predicted future trajectory of the ith prediction target:
Prediction_i = {(x̂_{t+1}^i, ŷ_{t+1}^i), …, (x̂_{t+τ'}^i, ŷ_{t+τ'}^i)},
where Prediction_i is the predicted future trajectory.
5. The method for generating a future trajectory of an obstacle according to claim 3, wherein the distribution Z(μ, σ²) is randomly sampled K times to generate K groups of predicted future trajectories of the prediction target, the expression of the kth group being
Prediction_ik = {(x̂_{t+1}^{ik}, ŷ_{t+1}^{ik}), …, (x̂_{t+τ'}^{ik}, ŷ_{t+τ'}^{ik})}, k ∈ K;
in the training stage, the mean-square-error loss between the K groups of predicted future trajectories Prediction_ik of the ith obstacle and the real future trajectory Future_i is computed as
L2(i) = (1/K) Σ_{k=1}^{K} MSE(Prediction_ik, Future_i).
6. The method for generating a future trajectory of an obstacle according to claim 1, wherein a travelable-area bird's-eye view of the scene where the autonomous vehicle is located is obtained; the points of the predicted future trajectory are projected into the travelable-area bird's-eye view, and the track points of the predicted future trajectory are constrained according to the travelable area.
7. The method according to claim 1, wherein the track points of the predicted future trajectory are projected into the bird's-eye view in sequence, and the shortest distance d_{t+τ'}^i from each track point to the travelable area is computed from the travelable-area bird's-eye view, where d_{t+τ'}^i corresponds to the future time t+τ'; if a track point projects inside the travelable area, then d_{t+τ'}^i = 0; the constraint loss L3(i) is then computed from these distances, where α, β and γ are hyper-parameter constants.
8. The method according to claim 1, wherein the weight updating of the discrimination model comprises: splicing the historical track and the predicted future trajectory into a complete track and feeding it into the discrimination model; after the complete track is encoded, the multilayer perceptron outputs a probability value pred_ik that the track is real, and the cross-entropy loss between pred_ik and the label value gt_ik is computed, the corresponding label gt_ik being false; splicing the historical track and the real future track and feeding them into the discrimination model again; after the complete track is encoded, the multilayer perceptron outputs a probability value pred_i_real that the track is real, and the cross-entropy loss between pred_i_real and the label value gt_i_real is computed, the corresponding label gt_i_real being true; the discrimination loss is computed as
L4(i) = Σ_{k=1}^{K} CE(pred_ik, gt_ik) + CE(pred_i_real, gt_i_real),
and the total discrimination-model loss is Loss_D_i = L4(i);
the weight updating of the generation model comprises: splicing the historical track and the generated trajectory into a complete track and feeding it into the discrimination model; after the complete track is encoded, the multilayer perceptron outputs a probability value pred_ik that the track is real, and the cross-entropy loss between pred_ik and the label value gt_ik is computed, the corresponding label gt_ik at this time being true; the discrimination loss at this time is expressed as
L4(i) = Σ_{k=1}^{K} CE(pred_ik, gt_ik);
the total generation-model loss is the weighted sum of the generation-model and discrimination-model losses: Loss_G_i = α1·L1(i) + α2·L2(i) + α3·L3(i) + α4·L4(i), where α1, α2, α3, α4 are the weights of the losses.
9. The method according to claim 2, wherein in the prediction stage the historical trajectories of the prediction target and of the obstacles around it are encoded to obtain their historical-track coding features, and the surrounding-obstacle coding features are averaged to obtain the interaction features of the obstacles around the prediction target; the learned distribution Z(μ, σ²) is randomly sampled K times to obtain hidden variables, which yield feature vectors; each feature vector is normalized and multiplied with the historical-track coding features to obtain the distribution-sampled historical-track coding features; the distribution-sampled historical-track coding features and the interaction features of the obstacles around the prediction target are superposed to obtain the final coding features, which are decoded into K predicted future trajectories
Prediction_ik = {(x̂_{t+1}^{ik}, ŷ_{t+1}^{ik}), …, (x̂_{t+τ'}^{ik}, ŷ_{t+τ'}^{ik})};
the points of each predicted future trajectory Prediction_ik are projected into the travelable-area bird's-eye view, the predicted future trajectory is constrained, and trajectories extending beyond the travelable area are removed.
CN202111506962.0A 2021-12-10 2021-12-10 Method for generating future track of obstacle Active CN114399743B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111506962.0A CN114399743B (en) 2021-12-10 2021-12-10 Method for generating future track of obstacle

Publications (2)

Publication Number Publication Date
CN114399743A true CN114399743A (en) 2022-04-26
CN114399743B CN114399743B (en) 2024-05-28

Family

ID=81227946

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111506962.0A Active CN114399743B (en) 2021-12-10 2021-12-10 Method for generating future track of obstacle

Country Status (1)

Country Link
CN (1) CN114399743B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115148025A (en) * 2022-06-28 2022-10-04 重庆长安汽车股份有限公司 Traffic target track prediction method and readable storage medium
CN115293297A (en) * 2022-10-08 2022-11-04 武汉理工大学 Method for predicting track of ship driven by intention
CN116152611A (en) * 2023-04-14 2023-05-23 山东省凯麟环保设备股份有限公司 Multistage multi-scale point cloud completion method, system, equipment and storage medium
CN116152782A (en) * 2023-04-18 2023-05-23 苏州魔视智能科技有限公司 Obstacle track prediction method, device, equipment and storage medium
CN116309689A (en) * 2023-05-17 2023-06-23 上海木蚁机器人科技有限公司 Obstacle track prediction method, device, equipment and medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2010228740A (en) * 2009-03-05 2010-10-14 Nissan Motor Co Ltd Travel path creating device, method of creating travel path, and operation supporting device
CN108875818A (en) * 2018-06-06 2018-11-23 西安交通大学 Based on variation from code machine and confrontation network integration zero sample image classification method
CN112651374A (en) * 2021-01-04 2021-04-13 东风汽车股份有限公司 Future trajectory prediction method based on social information and automatic driving system
CN112721950A (en) * 2021-01-14 2021-04-30 肖玉连 Autonomous vehicle control method and system based on deep learning

Also Published As

Publication number Publication date
CN114399743B (en) 2024-05-28

Similar Documents

Publication Publication Date Title
Lee et al. Convolution neural network-based lane change intention prediction of surrounding vehicles for ACC
CN112347567B (en) Vehicle intention and track prediction method
CN114399743B (en) Method for generating future track of obstacle
US11816901B2 (en) Multi-agent trajectory prediction
CN112734808B (en) Trajectory prediction method for vulnerable road users in vehicle driving environment
CN111930110A (en) Intent track prediction method for generating confrontation network by combining society
CN111222630A (en) Autonomous driving rule learning method based on deep reinforcement learning
Cho et al. Deep predictive autonomous driving using multi-agent joint trajectory prediction and traffic rules
Emuna et al. Deep reinforcement learning for human-like driving policies in collision avoidance tasks of self-driving cars
CN111661045B (en) Training generator unit and discriminator unit for collision-aware trajectory prediction
CN113076599A (en) Multimode vehicle trajectory prediction method based on long-time and short-time memory network
CN113942524B (en) Vehicle running control method, system and computer readable storage medium
CN114312830A (en) Intelligent vehicle coupling decision model and method considering dangerous driving conditions
Ye et al. GSAN: Graph self-attention network for learning spatial–temporal interaction representation in autonomous driving
CN117141517A (en) Method for constructing vehicle track prediction model by combining data driving and knowledge guiding
CN115547040A (en) Driving behavior prediction method based on inner neural network under safety potential field
CN117325865A (en) Intelligent vehicle lane change decision method and system for LSTM track prediction
CN114368387B (en) Attention mechanism-based driver intention recognition and vehicle track prediction method
Jeong Predictive lane change decision making using bidirectional long shot-term memory for autonomous driving on highways
Zhang et al. Predictive trajectory planning for autonomous vehicles at intersections using reinforcement learning
CN114179830B (en) Autonomous overtaking method and system for automatic driving vehicle
Masmoudi et al. Autonomous car-following approach based on real-time video frames processing
Hegde et al. Vehicle trajectory prediction using gan
Basile et al. Ddpg based end-to-end driving enhanced with safe anomaly detection functionality for autonomous vehicles
Yoon et al. Trajectory Prediction Using Graph-Based Deep Learning for Longitudinal Control of Autonomous Vehicles: A Proactive Approach for Autonomous Driving in Urban Dynamic Traffic Environments

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant