CN117075473A

CN117075473A - Multi-vehicle collaborative decision-making method in man-machine mixed driving environment

Info

Publication number: CN117075473A
Application number: CN202311030796.0A
Authority: CN
Inventors: 孙剑; 杭鹏; 崔一鸣
Original assignee: Tongji University
Current assignee: Tongji University
Priority date: 2023-08-15
Filing date: 2023-08-15
Publication date: 2023-11-17

Abstract

The application relates to a multi-vehicle collaborative decision-making method in a man-machine mixed driving environment, which comprises the steps of acquiring vehicle attributes and vehicle states of adjacent conflict areas, updating a game vehicle list based on a game range, and switching corresponding models according to the types and the numbers of vehicles in the list to carry out strategy solving; and deducing the action strategy of the HV through the CAV and HV interaction model, the CAV collaborative decision model and the man-machine mixed driving interaction and collaborative model coupled with the CAV and HV interaction model respectively, and finally solving the CAV optimal strategy. Compared with the prior art, the method considers the cooperation between CAVs and the interaction between CAVs and HV, adjusts and adapts to the uncertainty in the dynamic interaction process of the human driving vehicle in real time through the optimal parameters, and realizes the efficient and safe passing of vehicles in the mixed driving scene; in addition, the method can also adapt to the personalized driving requirement of CAV.

Description

Multi-vehicle collaborative decision-making method in man-machine mixed driving environment

Technical Field

The application relates to the field of automatic driving, in particular to a multi-vehicle collaborative decision-making method in a man-machine mixed driving environment.

Background

With the rapid development of autopilot vehicles, human-driving vehicles (HV) and internet autopilot vehicles (Connected and Automated Vehicle, CAV) will share road resources in the form of man-machine traffic streams for a long time in the future. How to ensure safety and improve efficiency in a man-machine mixed driving scene is an important issue of future automatic driving research. The intelligent automatic driving of the bicycle has environmental perception limitation in a complex interaction environment, and interaction safety is difficult to ensure; relatively conservative interaction behavior is often shown, and interaction efficiency is difficult to ensure; optimal decisions are difficult to make when facing complex scenes, and the realistic problems of high equipment cost and the like exist. Along with the improvement of CAV permeability, the driving conflict in the complex environment can be solved in a vehicle-vehicle cooperative mode, and safety and efficiency are improved.

However, at present, vehicle-vehicle coordination related researches are mostly researched under a pure CAV environment or interaction of CAV and HV is researched from a single vehicle intelligent view angle, and comprehensive consideration of interaction of CAV and HV in a mixed driving scene and cooperation between CAV is lacking. Most of the current researches only take HV as a dynamic obstacle or simply express the HV by using a following model, the dynamic interaction characteristics of HV cannot be effectively captured, and the interactivity of CAV and HV cannot be well reflected. In addition, the existing research does not consider individual demand differences among collaborative CAVs, and the obtained model is difficult to meet the individual driving demands of the CAVs.

Disclosure of Invention

The application aims to overcome the defects of the prior art, provides a multi-vehicle collaborative decision-making method in a man-machine mixed driving environment, and is a collaborative decision-making method capable of realizing cooperation between CAV workshops and interaction coupling between CAV and HV. In the specific model method design, CAV personalized driving preference is realized through a profit function, model dynamic adjustment is realized by adopting a twin game mode according to the uncertainty of HV interaction, and then model solving is carried out to obtain an optimal vehicle action decision result.

The aim of the application can be achieved by the following technical scheme:

a multi-vehicle collaborative decision-making method in a man-machine mixed driving environment generally relates to the construction of three decision-making models: a CAV and HV interaction model, a CAV collaborative decision model, a man-machine hybrid interaction and collaboration model; the general reference includes the following steps:

s1, updating vehicle attribute and state information: inputting attribute information of all vehicles in a scene, including vehicle types, kinematic parameters and the like; the vehicle state information includes a vehicle position, a vehicle speed, an acceleration, and the like.

S2, updating a game object list and switching a model: judging vehicles entering the near conflict area and the conflict area according to the vehicle state, and updating interaction and collaboration vehicles contained in the game object list; and determining the actually executed model according to the type and the number of the game objects.

S3, interaction modeling step of CAV and HV: and establishing a model of interaction of CAV and HV.

S4 CAV collaborative decision modeling step: and establishing a model of collaborative decision-making in the CAV vehicle.

S5, man-machine hybrid interaction collaborative modeling step: and (3) coupling the CAV and HV interaction model and the CAV collaborative decision model, carrying out a man-machine mixed driving interaction collaborative model, carrying out model solving, predicting the action of the HV in the game object list, and solving the optimal action strategy of the CAV, namely the acceleration.

S6, updating the vehicle state: the CAV realizes vehicle state update through a kinematic formula according to the output acceleration decision result, and solves the update speed and position; acceleration, speed, position of HV are re-acquired from the environment.

Compared with the prior art, the application has the following advantages:

(1) The application provides the method for simultaneously considering the cooperation between the networked automatic driving vehicles of the mixed driving scene and the interaction between the networked automatic driving vehicles and the human driving vehicles, which is more suitable for the real mixed driving scene and can effectively solve the safety efficiency problem of intelligent automatic driving of the single vehicle.

(2) The application fully considers the uncertainty in the dynamic interaction process of the human driving vehicle through the identification and dynamic adjustment of the weight related parameters of the human driving vehicle income function components.

(3) The cooperative game-based cooperative decision model of the online automatic driving vehicle can also adapt to the actual personalized driving demand of the online automatic driving vehicle.

Drawings

FIG. 1 is a flow chart of the method of the present application

FIG. 2 is a flowchart of the algorithm of the application S3

FIG. 3 is a flowchart of the algorithm of the application S4

FIG. 4 is a schematic diagram of the data flow relationship between key steps in the algorithm of the present application

FIG. 5 is a diagram of an embodiment scenario

Detailed Description

Definition of conflict area, near conflict area:

the conflict area is the area containing conflict points generated by vehicle track intersection, such as the inside of intersections in urban roads and highways;

the application defines the near conflict area as a distance before the vehicle reaches the conflict area, and the application calculates the range by taking the length of the parking line front parking sight as the near conflict area. The present application considers the near-conflict area and vehicles within the conflict area.

The stopping sight distance is a term, and refers to the shortest driving distance required by braking and stopping when a vehicle encounters a front obstacle during driving on the same lane.

The technical scheme of the application is suitable for urban scenes, expressway scenes and the like. The method of the application detects and collects the near conflict area and the vehicle state information in the conflict area through the road side equipment of the network connection, and performs policy calculation and decision information release. Technically feasible.

The technical scheme of the application is further described below with reference to the accompanying drawings and the embodiments.

Examples

Further, in S1, the step of updating the vehicle attribute and the state information specifically includes:

acquiring vehicle type information of each vehicle from the environment, namely, the vehicle is CAV or HV; kinematically related parameters such as maximum acceleration, maximum deceleration, maximum speed, etc.

Collecting the current acceleration, speed and position of HV in the updated environment in each time step; and the CAV updates the speed and the position of the vehicle according to the optimal acceleration output by the model and the kinematic formula. The vehicle state information is used as an input to the model.

Further, in S2, the steps of updating the game object list and switching the model are specifically:

all vehicles s= { S in the scene ₁ ,S ₂ ,…,S _k ,…,S _K I CAV, denoted m= { M }, is contained therein ₁ ,M ₂ ,…,M _i ,…,M _I And J vehicles HV, denoted as N= { N ₁ ,N ₂ ,…,N _j ,…,N _J }。

The game scope, i.e., the near conflict area and within the conflict area, is defined. Vehicles entering the game range are added to the game object list; the vehicle exits the game object list after passing through the conflict area. Vehicles in game object listWherein P CAV are contained, expressed as +.> Q vehicle HV, denoted->

Selecting and switching the model according to the types of vehicles and the number of various vehicles in the game object list,

when no CAV vehicle or only one vehicle exists in the game object list, the vehicle can freely pass through without considering interaction cooperation;

when the vehicles in the game object list are all CAV vehicles, only CAV collaborative decisions are carried out, and a CAV collaborative decision model is selected for carrying out vehicle optimal acceleration strategy solving;

when the vehicles in the game object list contain CAV and HV at the same time and only one CAV exists, only a CAV and HV interaction decision is needed, and a CAV and HV interaction model is selected for carrying out CAV optimal acceleration strategy solving;

when the vehicles in the game object list contain CAV and HV simultaneously and contain a plurality of CAV, the CAV collaborative decision model and the CAV and HV interactive decision model are required to be coupled to realize man-machine hybrid interactive collaboration.

S3, interaction modeling step of CAV and HV: and establishing a model of interaction of CAV and HV. The interaction modeling of HV and a plurality of CAVs can be realized by taking the CAVs as a whole. On the basis of solving the HV actions through model prediction, the overall benefit of the CAV is optimized.

Further, in S3, the interaction model in the step of modeling CAV and HV interaction is specifically:

based on game theory, a non-cooperative model is constructed, and CAV and HV are respectively divided into a leader and a follower.

Design of the CAV benefit function f taking into consideration the requirements of safety, efficiency and comfort in the running process of the vehicle _p And a benefit function f of HV _q . The two gain function forms are consistent, and the individual gain value of the vehicle is the weighted summation of the three.

In the method, in the process of the application,is the safety index of CAV, +.>Is an efficiency index of CAV, < >>Is a CAV comfort index.The weight values of the three indexes are respectively. Likewise, a +>Safety, efficiency, comfort index of HV, respectively,>are the respective weight values. The specific index calculation is shown in the following formula.

Wherein DeltaL _p Is the distance between the CAV vehicle p and the conflict point of the HV vehicle q, v _p (k+1) is the speed of the CAV vehicle p at time k+1, a _p (k)、a _p (k+1) is the acceleration of the CAV vehicle p at the k-th and k+1-th times, respectively. ΔL _q V is the distance between the q distance of the HV vehicle and the conflict point of the CAV vehicle p _q (k+1) is the speed of the HV vehicle q at time k+1, a _q (k)、a _q (k+1) is the acceleration of the HV vehicle q at the kth and k+1 times, respectively.

On the basis of the weight value of the previous moment, the safety and efficiency weight value of the HV is updated in real time according to the similarity degree of the predicted acceleration of the CAV to the actual acceleration of the CAV. In order to reduce the complexity of model parameter updating, the weight value corresponding to the safety and efficiency index is converted into the form of a function of the same parameter theta. The following formula is shown:

where A, B is a constant.

Establishing an interaction model, and predicting the possibility of HV under the condition that the CAV makes a certain acceleration action decision by itself according to the profit function of HVOptimum acceleration to be takenOn the basis, self-income is calculated, and the self-income is calculated in CAV self-acceleration strategy spaceOptimal solution for maximizing self-income by searching in->The following formula is shown:

s.t.

in the method, in the process of the application,CAV object list M for participating in gaming _g Is +.> Policy space for HV at this time->Middle->Income of corresponding HV, ++>To enable->A maximum optimal acceleration strategy; />To get +.>HV adopts optimal acceleration strategy +.>The corresponding CAV overall benefit, +.>A CAV optimal acceleration strategy to maximize the benefit. At the same time (I)>Is->Minimum and maximum of>Is->Minimum and maximum values of (2);is a safety constraint, wherein->For the time when the CAV vehicle p arrives at the conflict point according to the current state, the +.>The time for the HV vehicle q to reach the conflict point is set according to the current state. Optimal acceleration strategy will be achieved at CAV +.>HV optimal acceleration strategy determined on the basis of the model>As a model in->The prediction of HV is described as +.>For use in subsequent twinning games.

In order to achieve more accurate capturing of the HV motion real-time interaction uncertainty, updating of HV weight related parameters theta is achieved in a twin game mode.

At the initial moment, weight values corresponding to three CAV indexes are obtained based on natural driving data calibration And CAV fixed parameter value +.>And the corresponding HV initial weight value related parameter is theta ₀ . At theta ₀ R (r is set to be odd number) values are discretized again in the range of the definition domain to form an initial theta value space theta ₀ 。

In which the value space Θ ₀ The specific value taking method of each theta value is as follows: at the initial moment, in order to take values as comprehensively as possible, the range of a value taking space is determined as followsDividing the minimum value in the value space>Optimal parameter of initial moment->Maximum value->In addition, at [0, θ ₀ ]、/>Respectively equidistantly taking (r-3)/2 values, adding them into the value space, and forming value space theta with r number of elements ₀ 。

Twin game solving: at each subsequent time step, the value space theta is based on the last moment _k Each theta value in the model is solved on HV through a CAV and HV interaction model to obtain a series of HV optimal acceleration strategies corresponding to the theta values

Fitness calculation of optimal parametersAnd (3) determining: calculating Θ _k HV optimal acceleration strategy corresponding to each theta value in the systemIs +.>Is shown in the following formula.

The larger the alpha value, i.e. the smaller the difference between the calculated value and the actual value, the higher the fitness. The corresponding fitness alpha in all theta values is highest and is used as the optimal parameter at the next moment

Value list theta _k+1 Updating: according to the optimal parameters at the next momentAnd the corresponding fitness alpha ^* The value number r at the next time is checked. In this embodiment, the rule is formulated as shown in table 1.

TABLE 1 correspondence between the number of twin values and fitness

When the adaptability reaches a certain level, the method is optimalAnd (5) performing model calculation, and not performing twin game. Until the fitness exceeds a level that does not require twinning.

Setting the length of the value interval asMinimum value of value limit->Maximum value->The calculation method is shown in the following formula.

If calculatedMake->Correspondingly->

Finally, the value space theta of the value of the next moment theta _k+1 At the position ofThe nearby discrete values are obtained as shown in the following formula.

S4 CAV collaborative decision modeling step: and establishing a model of collaborative decision-making in the CAV vehicle. The method can be used as a vehicle collaborative decision model in a pure CAV scene, and can also be used as an inner layer model of man-machine hybrid driving interaction collaborative modeling to realize CAV individual acceleration solution.

Further, in S4, the collaborative decision model in the CAV collaborative decision modeling step is specifically:

and (3) designing a characteristic function of the CAV by considering the requirements of safety, efficiency and comfort in the running process of the vehicle, wherein the individual benefit value of the vehicle is the weighted summation of the three. And the calculation mode of each index is consistent with the CAV and HV interaction model.

Wherein p' is M in the game object list _g CAV vehicles other than CAV vehicle p, i.e.The security inside the CAV is calculated. Weight value +.>Can be according to the personalized preference of CAVAnd (5) adjusting.

According to the basic principle of the cooperative game, a characteristic function, an optimization target and constraint conditions are designed, benefits are distributed according to the shape principle, and the optimal action of the vehicle is planned. Shapley is calculated as follows:

in the psi- _p Is the Shapley value of CAV vehicle p. s is M _g Representing a collection of possible participant-cooperative gaming objects during the computing process. v(s) is the benefit generated by the set of participants for s. S and M _g I represents s, M respectively _g The number of vehicles involved.

v(s) is the benefit generated by the participant set s, calculated by:

and (5) taking the maximum total profit of the CAV vehicle as a target, and establishing a model. The following formula is shown:

s.t.

in the method, in the process of the application,is a safety constraint between CAVs.

Further, in S5, the interactive decision model in the man-machine hybrid interactive decision modeling step is specifically:

meanwhile, interaction of CAV and HV is considered, cooperation of the CAV is achieved, and a double-layer optimization model is built. The interaction of the outer layer of CAV as a whole with HV is shown in the following formula:

s.t.

in the method, in the process of the application,the overall benefit of CAV is the sum of the benefits of CAV contained in the game object list. Safety restraint examinationSafety between CAV and HV is a concern. The rest of the amounts have the same meaning as the CAV and HV interaction models.

The inner layer is CAV internal cooperation, and the optimal acceleration strategy of the CAV vehicle is obtained by solving according to a CAV cooperation decision modelHowever, in the model, all CAV and HV except for the safety index in the game object list are considered in calculation.

Further, in S6, after an appropriate model is selected according to the game object list to solve the optimal acceleration strategy, the vehicle state is updated.

The state of CAV is expressed and updated according to a kinematic formula:

in the method, in the process of the application,to participate in game CAV vehicle speed, acceleration, position.

The HV real state is used for carrying out information collection again from the environment to obtain the actual position of the HV real stateSpeed->Acceleration->

An example scenario is shown in fig. 5.

The above description is only illustrative of the preferred embodiments of the application and is not intended to limit the scope of the application in any way. Any alterations or modifications of the application, which are obvious to those skilled in the art based on the teachings disclosed above, are intended to be equally effective embodiments, and are intended to be within the scope of the appended claims.

Claims

1. A multi-vehicle collaborative decision-making method in a man-machine mixed driving environment is characterized by generally comprising the steps of constructing three decision-making models: a CAV and HV interaction model, a CAV collaborative decision model, a man-machine hybrid interaction and collaboration model; the general reference includes the following steps:

s1, updating vehicle attribute and state information: inputting attribute information of all vehicles in a scene, wherein the attribute information comprises vehicle types and kinematic parameters; vehicle state information including vehicle position, vehicle speed, acceleration;

s2, updating a game object list and switching a model: judging vehicles entering the near conflict area and the conflict area according to the vehicle state, and updating interaction and collaboration vehicles contained in the game object list; determining an actually executed model according to the types and the quantity of the game objects;

s3, interaction modeling step of CAV and HV: establishing a model of interaction between CAV and HV;

s4 CAV collaborative decision modeling step: establishing a model of collaborative decision in the CAV vehicle;

s5, man-machine hybrid interaction collaborative modeling step: coupling the CAV, the HV interaction model and the CAV collaborative decision model, carrying out a man-machine mixed driving interaction collaborative model, carrying out model solving, predicting the action of the HV in the game object list, and solving the optimal action strategy of the CAV, namely the acceleration;

2. The method according to claim 1, wherein in the step S1, the vehicle attribute and status information updating step specifically includes:

acquiring vehicle type information of each vehicle from the environment, namely, the vehicle is CAV or HV; the kinematics related parameters comprise maximum acceleration, maximum deceleration and maximum speed;

collecting the current acceleration, speed and position of HV in the updated environment in each time step; the CAV updates the speed and the position of the vehicle according to the optimal acceleration output by the model and the kinematic formula; the vehicle state information is used as an input to the model.

3. The method of claim 1, wherein in the step S2, the game object list updating and model switching steps are as follows:

all vehicles s= { S in the scene ₁ ，S ₂ ，…，S _k ，…，S _K I CAV, denoted m= { M }, is contained therein ₁ ，M ₂ ，…，M _i ，…，M _I And J vehicles HV, denoted as N= { N ₁ ，N ₂ ，…，N _j ，…，N _J }；

Defining a game range, namely a near conflict area and a conflict area; vehicles entering the game range are added to the game object list; the vehicle exits the game object list after passing through the conflict area; vehicles in game object listWherein P CAV are contained, expressed as +.> Q vehicle HV, denoted->

when the vehicles in the game object list contain CAV and HV simultaneously and contain a plurality of CAV, the CAV collaborative decision model and the CAV and HV interactive decision model are required to be coupled to realize collaborative decision in the man-machine mixed driving scene.

4. The method according to claim 1, wherein in step S3,

the interaction model in the CAV and HV interaction modeling step is specifically as follows:

based on game theory, constructing a non-cooperative model, and dividing CAV and HV into a leader and a follower respectively;

design of the CAV benefit function f taking into consideration the requirements of safety, efficiency and comfort in the running process of the vehicle _p And a benefit function f of HV _q The method comprises the steps of carrying out a first treatment on the surface of the The two gain function forms are consistent, and the individual gain value of the vehicle is the weighted summation of the three;

in the method, in the process of the application,is the safety index of CAV, +.>Is an efficiency index of CAV, < >>Is a CAV comfort index;the weight values of the three indexes are respectively; />Safety, efficiency, comfort index of HV, respectively,>is the weight value of each; the specific index calculation is shown as the following formula:

wherein DeltaL _p Is the distance between the CAV vehicle p and the conflict point of the HV vehicle q, v _p (k+1) is the speed of the CAV vehicle p at time k+1, a _p (k)、a _p (k+1) is the acceleration of the CAV vehicle p at the kth and k+1 times, respectively; ΔL _q V is the distance between the q distance of the HV vehicle and the conflict point of the CAV vehicle p _q (k+1) is the speed of the HV vehicle q at time k+1, a _q (k)、a _q (k+1) is the acceleration of the HV vehicle q at the kth and k+1 times, respectively;

on the basis of the weight value of the HV at the previous moment, the safety and efficiency weight value of the HV is updated in real time according to the similarity degree of the predicted acceleration of the CAV to the actual acceleration of the CAV; in order to reduce the complexity of model parameter updating, the weight value corresponding to the safety and efficiency index is converted into a function form of the same parameter theta; the following formula is shown:

wherein A, B is a constant;

establishing an interaction model, and predicting the optimal acceleration which the HV may take under the condition that the CAV makes a certain acceleration action decision by itself according to the profit function of the HVOn the basis, self-income is calculated, and in CAV self-acceleration strategy space +.>Optimal solution for maximizing self-income by searching in->The following formula is shown:

s.t.

in the method, in the process of the application,CAV object list M for participating in gaming _g Is +.>Policy space for HV at this time->Middle->The benefit of the corresponding HV,to enable->A maximum optimal acceleration strategy; />To get +.>HV adopts optimal acceleration strategy +.>The corresponding CAV overall benefit, +.>A CAV optimal acceleration strategy to maximize the benefit; at the same time (I)>Is->Minimum and maximum of>Is->Minimum and maximum values of (2);is a safety constraint, wherein->For the time when the CAV vehicle p arrives at the conflict point according to the current state, the +.>The time for the HV vehicle q to reach the conflict point according to the current state; optimal acceleration strategy will be achieved at CAV +.>HV optimal acceleration strategy determined on the basis of the model>As a model in->The prediction of HV is described as +.>For use in subsequent twinning games.

5. The method of claim 4, wherein,

updating the HV weight related parameter theta by adopting a twin game mode;

at the initial moment, weight values corresponding to three CAV indexes are obtained based on natural driving data calibration CAV fixed parameter value +.>The corresponding HV initial weight value related parameter is theta ₀ The method comprises the steps of carrying out a first treatment on the surface of the At theta ₀ R values are discretized again in the range of the definition domain to form an initial theta value space theta ₀ R is set to an odd number;

in which the value space Θ ₀ The value method of each theta value is as follows: at the initial moment, determining the range of the value space asDividing the minimum value in the value space>Optimal parameter of initial moment->Maximum value->In addition, at [0, θ ₀ ]、/>Respectively equidistantly taking (r-3)/2 values, adding them into the value space, and forming value space theta with r number of elements ₀ ；

Fitness calculation of optimal parametersAnd (3) determining: calculating Θ _k HV optimal acceleration strategy corresponding to each theta value in the systemIs +.>Is shown in the following formula:

alpha is defined as fitness, the larger the alpha value, i.e. the smaller the difference between the calculated value and the true value, the higher the fitness; the corresponding fitness alpha in all theta values is highest and is used as the optimal parameter at the next moment

Value list theta _k+1 Updating: according to the optimal parameters at the next momentAnd the corresponding fitness alpha ^* Confirming the value number r of the next moment; when the adaptability reaches a certain level, the optimal +.>Performing model calculation, and not performing twin game; until the fitness exceeds the level without twinning;

setting the length of the value interval asMinimum value of value limit->Maximum value->The calculation method is as follows:

if calculatedMake->Correspondingly->

If calculatedMake->Correspondingly->

Finally, the value space theta of the value of the next moment theta _k+1 At the position ofThe nearby discrete values are obtained as shown in the following formula:

6. the method of claim 5, wherein the number of twinning values corresponds to the fitness as shown in table 1:

TABLE 1 correspondence between the number of twin values and fitness

7. The method according to claim 1, wherein in the step S4, the collaborative decision model in the CAV collaborative decision modeling step is specifically:

designing a characteristic function of CAV (computer aided design) by considering the requirements of safety, efficiency and comfort in the running process of the vehicle, wherein the individual profit value of the vehicle is the weighted summation of the three; the calculation mode of each index is consistent with the CAV and HV interaction model;

wherein p is ^′ For M in a game object list _g CAV vehicles other than CAV vehicle p, i.e.Calculating the security inside the CAV; weight value +.>According to the personality of CAVThe chemical preference is adjusted;

according to the basic principle of the cooperative game, designing a characteristic function, an optimization target and constraint conditions, distributing benefits according to the shape principle, and planning the optimal action of the vehicle; shapley is calculated as follows:

in the psi- _p Is the shape of CAV vehicle p; s is M _g Representing a collection of possible participant-cooperative gaming objects during the computing process; v(s) is the benefit generated by the set of participants for s; s and M _g I represents s, M respectively _g The number of vehicles involved;

v(s) is the benefit generated by the participant set s, calculated by:

establishing a model by taking the maximum total income of the CAV vehicle as a target; the following formula is shown:

s.t.

8. The method of claim 1, wherein in the step S5, the interactive decision model in the man-machine hybrid interactive decision modeling step is specifically:

meanwhile, interaction between CAV and HV is considered, and cooperation inside the CAV is considered to construct a double-layer optimization model; the interaction of the outer layer of CAV as a whole with HV is shown in the following formula:

s.t.

in the method, in the process of the application,the overall income of the CAV is the sum of the income of the CAV contained in the game object list; safety constraints consider the safety between CAV and HV; the rest of the amounts have the same meaning as CAV and HV interaction models;

9. The method according to claim 1, wherein in the step S6, after selecting an appropriate model according to the game object list to solve the optimal acceleration strategy, the vehicle state is updated;

the state of CAV is expressed and updated according to a kinematic formula:

in the method, in the process of the application,the speed, acceleration and position of the CAV vehicle are participated in the game;