CN113299078B

CN113299078B - Multi-mode traffic trunk line signal coordination control method and device based on multi-agent cooperation

Info

Publication number: CN113299078B
Application number: CN202110331935.8A
Authority: CN
Inventors: 王昊; 王雷震; 董长印; 杨朝友
Original assignee: Yangzhou Fama Intelligent Equipment Co ltd; Southeast University
Current assignee: Yangzhou Fama Intelligent Equipment Co ltd; Southeast University
Priority date: 2021-03-29
Filing date: 2021-03-29
Publication date: 2022-04-08
Anticipated expiration: 2041-03-29
Also published as: CN113299078A

Abstract

The invention discloses a multi-mode traffic trunk line signal coordination control method and device based on multi-agent cooperation. The method comprises the following steps: calibrating a simulation of the multi-mode traffic trunk line and generating traffic flow; designing a signal-control agent for each intersection of the trunk line; constructing a cooperative value-decomposition multi-agent reinforcement learning framework; and training and outputting the agents of the intersections of the multi-mode traffic trunk line. The method treats the multi-mode traffic signal control of each intersection as an agent, takes the cooperation of all intersections of the trunk line into account, and trains the signal-control agents with the overall people throughput and delay of the trunk line as optimization targets. It thereby provides a control basis for road traffic managers, achieves the overall optimum of the traffic trunk line, and improves the level of service of urban road traffic.

Description

Multi-mode traffic trunk line signal coordination control method and device based on multi-agent cooperation

Technical Field

The invention relates to the field of urban traffic signal control, in particular to a multi-mode traffic trunk line signal coordination control method and device based on multi-agent cooperation.

Background

In recent years, the rapid growth of traffic demand has caused road congestion, air pollution and reduced transport efficiency, seriously affecting the economic development of cities and the daily life of citizens. To relieve these problems, coordinated control of trunk line signals is a preferred approach in urban traffic management and control; a reasonable trunk line control method can effectively improve vehicle speed and traffic efficiency and reduce fuel consumption and exhaust emissions.

Traditional coordinated control of trunk line signals mainly adopts a green wave model: a common cycle length is set for all intersections of the trunk line, and the phase sequence and phase difference (offset) of each intersection are calculated with the number of vehicle stops, the green wave bandwidth, the vehicle delay and the like as optimization indexes. However, such approaches greatly limit the efficiency of individual intersections, which is given up in favour of trunk line vehicles. Among existing Chinese patents, Chinese patent 202010793652.0 models a target trunk road section in a tidal traffic state and builds a bidirectional optimization model of the trunk line with weighted throughput as the optimization target, so as to minimize vehicle delay on the basis of maximizing system traffic capacity; similarly, Chinese patent 201910092239.9 establishes, according to the running trajectories of buses, a model that optimizes the cycle and the phase difference based on a bus priority policy, and realizes a trunk line green wave for social vehicles and buses. In general, existing research favours maximizing the benefit of trunk line vehicles and buses, and sacrifices the efficiency of branch roads and individual intersections in the model. It lacks comprehensive consideration of multi-mode traffic on the trunk line, such as buses, pedestrians and non-motor vehicles, and it lacks microscopic research in which the individual intersections of the trunk line cooperate, on the basis of multi-mode adaptive control, to achieve the overall optimum of the multi-mode trunk line.

Disclosure of Invention

Purpose of the invention: to overcome the defects of the prior art, the invention provides a multi-mode traffic trunk line signal coordination control method and device based on multi-agent cooperation, which perform multi-mode traffic simulation calibration and flow generation for a target trunk line; design a signal-control agent at each intersection of the trunk line; construct a cooperative value-decomposition multi-agent reinforcement learning framework; and train and output the agents of all intersections of the multi-mode traffic trunk line. On the basis of single-intersection multi-mode adaptive control, the cooperation of the intersections of the trunk line is taken into account so as to achieve the overall optimum of the traffic trunk line.

Technical scheme: to solve the above technical problems, the invention adopts the following technical scheme: a multi-mode traffic trunk line signal coordination control method based on multi-agent cooperation, comprising the following steps:

(1) Acquiring intersection information of the traffic trunk line and multi-mode traffic flow data, performing simulation calibration of the multi-mode traffic trunk line with simulation software according to the data, and reproducing the arrival rate of the multi-mode traffic flow;

(2) Generating a signal-control agent for each intersection of the trunk line, so that the n intersections of the traffic trunk line correspond to n agents. Agent i reads the state s_i^{t_k} of its intersection at time t_k, which comprises multi-mode traffic position, queue length and speed information. The state s_i^{t_k} is input to the neural network of agent i, whose parameters at time t_k are θ_i^{t_k}, and the network outputs the action phase of intersection agent i at time t_k:

a_i^{t_k} = argmax_{a_i ∈ A_i} Q_i(s_i^{t_k}, a_i; θ_i^{t_k})

where Q_i(s_i^{t_k}, a_i; θ_i^{t_k}) denotes the value function, i.e. the Q value, when the neural network parameters are θ_i^{t_k}, the selected action phase is a_i and the state is s_i^{t_k}; A_i denotes the set of action phases that intersection i can release; and a_i denotes one action phase in A_i;

(3) Initializing the neural network parameters and experience replay pools of all agents in the trunk line, and setting the number of training rounds N_episode;

(4) Initializing the arrival rate of the simulated multi-mode trunk line traffic flow, and setting the initial simulation time t₀ and the total simulation time T;

(5) Acquiring the multi-mode traffic state of each agent. Taking agent i as an example, the local multi-mode traffic observation state s_i^{t_k} of the corresponding intersection i at time t_k is acquired. It consists of the social vehicle state, the bus state, the pedestrian state and the non-motor vehicle state of intersection i at time t_k, each comprising position, queue length, speed and similar information, together with the phase states at time t_k of the intersections adjacent to intersection i;

(6) Inputting the local observation state of each agent into its neural network. For agent i, after s_i^{t_k} is input to the neural network, the action phase at time t_k is returned:

a_i^{t_k} = argmax_{a_i ∈ A_i} Q_i(s_i^{t_k}, a_i; θ_i^{t_k})

together with the Q value corresponding to that action phase, Q_i(s_i^{t_k}, a_i^{t_k}; θ_i^{t_k}), where A_i denotes the set of action phases that intersection i can release, a_i denotes one action phase in A_i, Q_i(·) denotes the neural network Q function of agent i, and θ_i^{t_k} denotes the parameters of the neural network of agent i at time t_k;

(7) Executing the action phase a_i^{t_k} returned by each agent at the signal of the corresponding intersection in the traffic trunk line simulation for Δt seconds, so that the time advances to t_{k+1} = t_k + Δt, and returning the team reward value r^{t_k} of the multi-mode traffic trunk line agents at time t_k. The team reward is formed from three terms weighted by k_d, k_f and k_l, which denote the balance coefficients of the per-capita delay change, the people throughput and the queue length change respectively. The per-capita delay change is computed from the per-capita delay of the trunk line at time t_k and at time t_{k+1}; the people throughput is the total number of people passing through the traffic trunk line during Δt; and the queue length change is computed from the number of people queuing on the traffic trunk line at time t_k and at time t_{k+1};

(8) Repeating step (5) to obtain the multi-mode traffic state s_i^{t_{k+1}} of each agent at time t_{k+1}, and saving the experience (s_all^{t_k}, u_all^{t_k}, r^{t_k}, s_all^{t_{k+1}}) to the experience replay pool, where r^{t_k} denotes the team reward value of the agents at time t_k; s_all^{t_k} = (s_1^{t_k}, ..., s_n^{t_k}) and s_all^{t_{k+1}} = (s_1^{t_{k+1}}, ..., s_n^{t_{k+1}}) are the global state lists at time t_k and time t_{k+1}, with s_n^{t_k} and s_n^{t_{k+1}} denoting the state of the n-th agent at time t_k and at time t_{k+1}; and u_all^{t_k} = (a_1^{t_k}, ..., a_n^{t_k}) is the list of actions selected by all agents at time t_k, with a_n^{t_k} denoting the action executed by the n-th agent at time t_k;

(9) Judging whether the preset simulation time is reached: if t_{k+1} ≥ T, proceeding to step (10); otherwise, returning to step (5) to iterate;

(10) Randomly sampling N experiences from the experience replay pool and updating the neural network parameters of each agent by gradient descent according to the loss function, where θ_all denotes the neural network parameters of all agents. The loss is built on the global reward function representing multi-agent cooperation, which combines the Q functions of the individual agents using the trade-off coefficient k_b of each intersection b over the n agents, with θ_b denoting the neural network parameters of agent b, and on the target reward value, in which γ denotes the attenuation coefficient and u_all denotes the list of actions of all agents;

(11) Judging whether the number of updates has reached the preset number of training rounds N_episode: if not, returning to step (4) to iterate; if so, outputting the agents of the intersections of the multi-mode traffic trunk line obtained by multi-agent cooperative training.
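The update in step (10) can be illustrated with a minimal Python sketch. It rests on assumptions that go beyond the text: the global function is taken to be the k_b-weighted sum of the per-agent Q values, the target is taken to be the team reward plus γ times the global value of the next state, and the loss is taken to be the mean squared error over the sampled batch. The function names `q_total`, `td_target` and `batch_loss` are hypothetical, and no gradient step is shown.

```python
from typing import Sequence


def q_total(q_values: Sequence[float], k: Sequence[float]) -> float:
    """Global value as the k_b-weighted sum of per-agent Q values (assumed form)."""
    if len(q_values) != len(k):
        raise ValueError("one coefficient k_b is needed per agent")
    return sum(k_b * q_b for k_b, q_b in zip(k, q_values))


def td_target(reward: float, next_best_q: Sequence[float],
              k: Sequence[float], gamma: float) -> float:
    """Assumed target: team reward plus discounted global value of the next state.

    With non-negative k_b, maximising the weighted sum over the joint action
    u_all equals summing each agent's own best next-state Q value.
    """
    return reward + gamma * q_total(next_best_q, k)


def batch_loss(targets: Sequence[float], predictions: Sequence[float]) -> float:
    """Mean squared error between targets and current global values (assumed loss)."""
    if not targets or len(targets) != len(predictions):
        raise ValueError("targets and predictions must be non-empty and equal in length")
    return sum((y - q) ** 2 for y, q in zip(targets, predictions)) / len(targets)


if __name__ == "__main__":
    k = [1.0, 1.0, 1.0, 1.0]                    # one coefficient per intersection
    q_now = q_total([1.0, 2.0, 0.5, 1.5], k)    # Q values of the phases actually chosen
    y = td_target(1.0, [2.0, 2.0, 1.0, 1.0], k, gamma=0.85)
    print(q_now, y, batch_loss([y], [q_now]))
```

In a real implementation the loss would be differentiated with respect to θ_all by an automatic differentiation library; the sketch only shows how the quantities named in step (10) could fit together.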

The invention also provides a multi-mode traffic trunk line signal coordination control device based on multi-agent cooperation, which comprises the following components:

the multi-mode traffic trunk line sensing module, which comprises a traffic trunk line data sensing unit and a traffic trunk line state sensing unit, wherein the data sensing unit is used for acquiring the channelization design, the number of approach lanes, the road section lengths, the bus stop positions, and the positions of the non-motor vehicle lanes and sidewalks of all intersections of the target trunk line, and the state sensing unit is used for acquiring the number of bus services and their routes, departure intervals and dwell times, the numbers of occupants and the speeds of social vehicles, pedestrians and non-motor vehicles, the queue lengths in front of the intersections, and the like;

the data storage module, which comprises a traffic trunk line intersection data unit and a traffic trunk line traffic flow data unit, used respectively for storing the data acquired by the traffic trunk line data sensing unit and by the traffic trunk line state sensing unit;

the cooperative multi-mode traffic trunk line signal coordination control agent calculation module, which comprises an agent calculation and storage unit used for calculating and storing the cooperative trunk line intersection agents during the iterative training of the above method, and for outputting and storing the agents of each intersection of the multi-mode traffic trunk line obtained by multi-agent cooperative training.

In addition, the invention also provides a computer device comprising a processor, a memory and a computer program stored in the memory and executable on the processor, wherein the computer program, when executed by the processor, implements the steps of the multi-mode traffic trunk line signal coordination control method based on multi-agent cooperation.

In addition, the invention also provides a computer-readable storage medium storing a computer program which, when executed by a processor, implements the steps of the multi-mode traffic trunk line signal coordination control method based on multi-agent cooperation.

Advantageous effects: compared with the prior art, the technical scheme of the invention has the following beneficial technical effects:

The invention provides a multi-mode traffic trunk line signal coordination control method and device based on multi-agent cooperation, in which the multi-mode traffic trunk line is simulated and calibrated and traffic flow is generated; a signal-control agent is designed for each intersection of the trunk line; a cooperative value-decomposition multi-agent reinforcement learning framework is constructed; and the agents of the intersections of the multi-mode traffic trunk line are trained and output. The invention designs the multi-mode traffic signal control of each intersection as an agent, takes the cooperation of all intersections of the trunk line into account, and trains the signal-control agents with the overall people throughput and delay of the trunk line as optimization targets, thereby providing a control basis for road traffic managers, achieving the overall optimum of the traffic trunk line and improving the level of service of urban road traffic.

Drawings

FIG. 1 is a flow chart of a method of an embodiment of the present invention;

FIG. 2 is a flow diagram of a multi-agent collaborative reinforcement learning framework of an embodiment of the present invention;

FIG. 3 is a schematic diagram of a multi-mode traffic trunk simulation of an embodiment of the present invention;

FIG. 4 is a schematic structural diagram of an apparatus according to an embodiment of the present invention.

Detailed Description

In order that the present disclosure may be more readily and clearly understood, reference is now made to the following detailed description taken in conjunction with the accompanying drawings and specific examples.

As shown in fig. 1, the multi-agent cooperation-based multi-mode traffic trunk signal coordination control method disclosed in the embodiment of the present invention includes the following steps:

(1) Acquiring intersection information of the traffic trunk line and multi-mode traffic flow data, performing simulation calibration of the multi-mode traffic trunk line with simulation software according to the data, and reproducing the arrival rate of the multi-mode traffic flow;

Specifically, the intersection information of the traffic trunk line and the multi-mode traffic flow data can be acquired by on-site sensing devices or collected on site, and the simulation software can be SUMO, VISSIM or the like;

(2) In this embodiment, a signal-control agent is generated for each intersection of the trunk line, so that the n intersections of the traffic trunk line correspond to n agents. Agent i reads the state s_i^{t_k} of its intersection at time t_k, which comprises multi-mode traffic position, queue length and speed information, and inputs it to its neural network, whose parameters at time t_k are θ_i^{t_k}; Q_i(s_i^{t_k}, a_i; θ_i^{t_k}) denotes the value function when the neural network parameters are θ_i^{t_k}, the selected action phase is a_i and the state is s_i^{t_k};

(3) In this embodiment, the neural network parameters and experience replay pools of all agents in the trunk line are initialized, and the number of training rounds N_episode is set;

(4) Specifically, the arrival rate of the simulated multi-mode trunk line traffic flow is initialized, and the initial simulation time t₀ and the total simulation time T are set;

(5) In this embodiment, the multi-mode traffic state of each agent is acquired. Taking agent i as an example, the local multi-mode traffic observation state s_i^{t_k} of the corresponding intersection i at time t_k is acquired;

(6) In this embodiment, the local observation state of each agent is input into its neural network. For agent i, after s_i^{t_k} is input to the neural network, the action phase a_i^{t_k} at time t_k is returned together with its corresponding Q value, where θ_i^{t_k} denotes the parameters of the neural network of agent i at time t_k;

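(7) In this embodiment, the action phase a_i^{t_k} returned by each agent is executed at the signal of the corresponding intersection, and the team reward value is returned. In the reward, one term represents the change in per-capita delay and another represents the change in queue length, each computed from the corresponding values at time t_k and at time t_{k+1};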

(8) In this embodiment, step (5) is repeated to obtain the multi-mode traffic state of each agent at time t_{k+1}, and the experience is saved to the experience replay pool. The experience contains the global state lists s_all^{t_k} and s_all^{t_{k+1}} at time t_k and time t_{k+1}, and the list u_all^{t_k} = (a_1^{t_k}, ..., a_n^{t_k}) of actions selected by all agents at time t_k, where a_n^{t_k} denotes the action executed by the n-th agent at time t_k;

(9) Specifically, it is judged whether the preset simulation time is reached: if t_{k+1} ≥ T, the method proceeds to step (10); otherwise, it returns to step (5) to iterate;

(10) In this embodiment, N experiences are randomly sampled from the experience replay pool, and the neural network parameters of each agent are updated according to the loss function, which is built on the global reward function representing multi-agent cooperation;

(11) In this embodiment, it is judged whether the number of updates has reached the preset number of training rounds N_episode: if not, the method returns to step (4) to iterate; if so, the agents of the intersections of the multi-mode traffic trunk line obtained by multi-agent cooperative training are output.
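Steps (6) and (7) can be sketched in Python as follows. This is an illustration under stated assumptions: the agent is taken to pick the releasable phase with the largest Q value, the team reward is taken to be a weighted sum of the three terms, and reductions in delay and queue length are assumed to count as positive changes. The names `select_phase` and `team_reward` and the phase labels are hypothetical.

```python
from typing import Dict, Tuple


def select_phase(q_by_phase: Dict[str, float]) -> Tuple[str, float]:
    """Return the phase in A_i with the largest Q value, and that Q value."""
    if not q_by_phase:
        raise ValueError("the set of releasable phases A_i must not be empty")
    phase = max(q_by_phase, key=q_by_phase.get)
    return phase, q_by_phase[phase]


def team_reward(delay_before: float, delay_after: float, throughput: float,
                queue_before: float, queue_after: float,
                k_d: float = 1.0, k_f: float = 1.0, k_l: float = 1.0) -> float:
    """Weighted sum of delay change, people throughput and queue change.

    Assumption: a decrease in per-capita delay or in queued people between
    t_k and t_{k+1} is rewarded, so both changes are taken as before - after.
    """
    delay_change = delay_before - delay_after
    queue_change = queue_before - queue_after
    return k_d * delay_change + k_f * throughput + k_l * queue_change


if __name__ == "__main__":
    # Hypothetical Q values for three releasable phases of one intersection.
    q_values = {"NS_through": 0.2, "EW_through": 0.7, "NS_left": 0.1}
    print(select_phase(q_values))
    print(team_reward(30.0, 25.0, 12.0, 40.0, 34.0))
```

The balance coefficients default to 1 here only for illustration; the embodiment does not state their values.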

The invention is further elucidated below on the basis of an example of a traffic trunk situation.

Traffic example: a traffic trunk line has 4 intersections, numbered intersection 1, intersection 2, intersection 3 and intersection 4 from west to east, with spacings of 160 m, 140 m and 180 m in sequence. Intersections 1 and 4 are trunk-trunk intersections, each approach being a two-way 8-lane road; intersections 2 and 3 are trunk-branch intersections, where the approaches in the trunk direction are two-way 8-lane roads and the branch approaches are two-way 2-lane roads. All motor vehicle roads are provided with sidewalks and non-motor vehicle lanes.
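The geometry of this example could be written down as a small Python configuration before building the simulation network. The class name `Intersection`, its field names and the helper `positions_m` are hypothetical; only the lane counts and spacings come from the example.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Intersection:
    name: str
    kind: str          # "trunk-trunk" or "trunk-branch"
    trunk_lanes: int   # two-way lane count on the trunk approaches
    cross_lanes: int   # two-way lane count on the crossing approaches


# Listed from west to east, as in the example.
INTERSECTIONS = [
    Intersection("intersection 1", "trunk-trunk", 8, 8),
    Intersection("intersection 2", "trunk-branch", 8, 2),
    Intersection("intersection 3", "trunk-branch", 8, 2),
    Intersection("intersection 4", "trunk-trunk", 8, 8),
]
SPACINGS_M = [160, 140, 180]  # distances between consecutive intersections


def positions_m(spacings: List[int]) -> List[int]:
    """Distance of each intersection from the westernmost one, in metres."""
    positions = [0]
    for gap in spacings:
        positions.append(positions[-1] + gap)
    return positions


if __name__ == "__main__":
    for node, pos in zip(INTERSECTIONS, positions_m(SPACINGS_M)):
        print(f"{node.name}: {pos} m, {node.kind}")
```

The cumulative positions give a corridor length of 480 m between intersection 1 and intersection 4.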

The invention provides a multi-mode traffic trunk line signal coordination control method based on multi-agent cooperation, which comprises the following steps:

(1) As shown in FIG. 3, acquiring intersection information of the traffic trunk line and multi-mode traffic flow data, performing simulation calibration of the multi-mode traffic trunk line with the simulation software SUMO according to the data, and reproducing the arrival rate of the multi-mode traffic flow;

(2) Generating a signal-control agent for each intersection of the trunk line, so that the 4 intersections of the trunk line correspond to 4 agents. Taking agent 2 as an example, agent 2 reads the state s₂^{t_k} of its intersection at time t_k, which comprises multi-mode traffic position, queue length and speed information. The state s₂^{t_k} is input to the neural network of agent 2, whose parameters at time t_k are θ₂^{t_k}, and the network outputs the action phase of intersection agent 2 at time t_k:

a₂^{t_k} = argmax_{a₂ ∈ A₂} Q₂(s₂^{t_k}, a₂; θ₂^{t_k})

where Q₂(s₂^{t_k}, a₂; θ₂^{t_k}) denotes the value function when the neural network parameters are θ₂^{t_k}, the selected action phase is a₂ and the state is s₂^{t_k}; A₂ denotes the set of action phases that intersection 2 can release; and a₂ denotes one action phase in A₂;

(3) Initializing the neural network parameters and experience replay pools of all agents in the trunk line, and setting the number of training rounds N_episode = 1000;

(4) Initializing the arrival rate of the simulated multi-mode trunk line traffic flow, and setting the initial simulation time t₀ = 0 and the total simulation time T = 10800;

(5) Acquiring the multi-mode traffic state of each agent. Taking agent 2 as an example, the local multi-mode traffic observation state s₂^{t₀} of the corresponding intersection 2 at time t₀ is acquired. It consists of the social vehicle state, the bus state, the pedestrian state and the non-motor vehicle state at time t₀, each comprising position, queue length, speed and similar information, together with the phase states at time t₀ of intersection 1 and intersection 3, which are adjacent to intersection 2;

(6) Inputting the local observation state of each agent into its neural network. Taking agent 2 as an example, after s₂^{t₀} is input to the neural network, the action phase at time t₀ is returned:

a₂^{t₀} = argmax_{a₂ ∈ A₂} Q₂(s₂^{t₀}, a₂; θ₂^{t₀})

together with the Q value corresponding to that action phase, Q₂(s₂^{t₀}, a₂^{t₀}; θ₂^{t₀}), where A₂ denotes the set of action phases that intersection 2 can release, a₂ denotes one action phase in A₂, Q₂(·) denotes the neural network Q function of agent 2, and θ₂^{t₀} denotes the parameters of the neural network of agent 2 at time t₀;

(7) Executing the action phase returned by each agent at the signal of the corresponding intersection in the traffic trunk line simulation for Δt = 5 seconds, so that the time advances to t₁ = t₀ + Δt = 5, and returning the team reward value r^{t₀} of the multi-mode traffic trunk line agents at time t₀. In the reward, the per-capita delay change is computed from the per-capita delay of the trunk line at time t₀ and at time t₁, and the queue length change is computed from the number of people queuing on the traffic trunk line at time t₀ and at time t₁;

(8) Repeating step (5) to obtain the multi-mode traffic state of each agent at time t₁, and saving the experience (s_all^{t₀}, u_all^{t₀}, r^{t₀}, s_all^{t₁}) to the experience replay pool, where r^{t₀} denotes the team reward value of the agents at time t₀; s_all^{t₀} and s_all^{t₁} are the global state lists at time t₀ and time t₁, and, taking s_all^{t₀} = (s₁^{t₀}, s₂^{t₀}, s₃^{t₀}, s₄^{t₀}) as an example, s₁^{t₀} denotes the state acquired by agent 1 at time t₀; u_all^{t₀} = (a₁^{t₀}, a₂^{t₀}, a₃^{t₀}, a₄^{t₀}) is the list of actions selected by all agents at time t₀, where a₁^{t₀} denotes the action executed by agent 1 at time t₀;

(9) Judging whether the preset simulation time is reached: since t₁ = 5 < T = 10800, returning to step (5) to iterate until t_{k+1} ≥ T is satisfied, and then proceeding to step (10);

(10) Randomly sampling N = 64 experiences from the experience replay pool and updating the neural network parameters according to the loss function, which is built on the global reward function representing the cooperation of the 4 agents, where k_b denotes the importance trade-off coefficient of intersection b, taken as 1 for all intersections in this example, and θ_b denotes the neural network parameters of agent b; in the target reward value, γ denotes the attenuation coefficient, taken as 0.85 in this example, and u_all denotes the list of actions of all agents;

(11) Each execution of step (10) represents 1 round of training. Judging whether the number of updates has reached the preset number of training rounds N_episode = 1000: if not, returning to step (4) to iterate; if so, outputting the agents of the 4 intersections of the multi-mode traffic trunk line obtained by multi-agent cooperative training.
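The bookkeeping of this example can be sketched in Python as follows. With T = 10800 and Δt = 5, both taken in seconds, each simulated round contains 10800 / 5 = 2160 decision steps, each producing one experience, after which 64 experiences are sampled. The class `ReplayPool`, its capacity and the constant names are assumptions made for illustration; the example does not state a pool size.

```python
import random
from collections import deque
from typing import Deque, List, Tuple

# (global states, joint actions, team reward, next global states)
Transition = Tuple[tuple, tuple, float, tuple]

T_TOTAL = 10800    # total simulation time T from the example
DELTA_T = 5        # execution time of each action phase, in seconds
BATCH_SIZE = 64    # N sampled experiences per update
N_EPISODE = 1000   # number of training rounds

STEPS_PER_EPISODE = T_TOTAL // DELTA_T  # decision steps in one simulated round


class ReplayPool:
    """Fixed-capacity experience pool; the oldest entries are dropped first."""

    def __init__(self, capacity: int) -> None:
        self._items: Deque[Transition] = deque(maxlen=capacity)

    def __len__(self) -> int:
        return len(self._items)

    def add(self, transition: Transition) -> None:
        self._items.append(transition)

    def sample(self, n: int, rng: random.Random) -> List[Transition]:
        """Draw up to n distinct experiences uniformly at random."""
        return rng.sample(list(self._items), min(n, len(self._items)))


if __name__ == "__main__":
    pool = ReplayPool(capacity=10_000)  # capacity is an assumption
    for step in range(STEPS_PER_EPISODE):
        # Placeholder experience; a real run would store simulator output.
        pool.add(((step,), (0, 0, 0, 0), 0.0, (step + 1,)))
    batch = pool.sample(BATCH_SIZE, random.Random(0))
    print(STEPS_PER_EPISODE, len(pool), len(batch))
```

Because step (10) is carried out once per round, the 1000 training rounds of the example correspond to 1000 parameter updates.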

As shown in FIG. 4, the multi-mode traffic trunk line signal coordination control device based on multi-agent cooperation disclosed in the embodiment of the present invention comprises a multi-mode traffic trunk line sensing module, a data storage module and a cooperative multi-mode traffic trunk line signal coordination control agent calculation module. The multi-mode traffic trunk line sensing module is used for acquiring the channelization design, the number of approach lanes, the road section lengths, the bus stop positions, and the positions of the non-motor vehicle lanes and sidewalks of all intersections of the target trunk line, and for acquiring the number of bus services on the trunk line and their routes, departure intervals and dwell times, the numbers of occupants and the speeds of social vehicles, pedestrians and non-motor vehicles, the queue lengths in front of the intersections, and the like. The data storage module is used for storing the data acquired by the traffic trunk line data sensing unit and the traffic trunk line state sensing unit. The cooperative multi-mode traffic trunk line signal coordination control agent calculation module is used for calculating and storing the cooperative trunk line intersection agents during the iterative training of the above method, and for outputting and storing the agents of each intersection of the multi-mode traffic trunk line obtained by multi-agent cooperative training.

The multi-mode traffic trunk line sensing module comprises a traffic trunk line data sensing unit and a traffic trunk line state sensing unit; the data storage module comprises a traffic trunk line intersection data unit and a traffic trunk line traffic flow data unit; and the cooperative multi-mode traffic trunk line signal coordination control agent calculation module comprises an agent calculation and storage unit.
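The relationship between the three modules can be sketched as plain Python classes. This is only an illustration of how the units might be wired together; every class, method and field name is hypothetical, and real sensing and training logic is omitted.

```python
from dataclasses import dataclass, field
from typing import Any, Dict


@dataclass
class DataStorageModule:
    # Traffic trunk line intersection data unit: static geometry per intersection.
    intersection_data: Dict[str, Any] = field(default_factory=dict)
    # Traffic trunk line traffic flow data unit: dynamic state per intersection.
    flow_data: Dict[str, Any] = field(default_factory=dict)


@dataclass
class PerceptionModule:
    storage: DataStorageModule

    def record_intersection(self, name: str, data: Any) -> None:
        """Data sensing unit: store lane counts, stop positions and similar."""
        self.storage.intersection_data[name] = data

    def record_state(self, name: str, state: Any) -> None:
        """State sensing unit: store occupants, speeds, queue lengths and similar."""
        self.storage.flow_data[name] = state


@dataclass
class AgentComputeModule:
    # Agent calculation and storage unit: one trained agent per intersection.
    agents: Dict[str, Any] = field(default_factory=dict)

    def store_agent(self, intersection: str, agent: Any) -> None:
        self.agents[intersection] = agent


if __name__ == "__main__":
    storage = DataStorageModule()
    perception = PerceptionModule(storage)
    perception.record_intersection("intersection 2", {"trunk_lanes": 8, "branch_lanes": 2})
    perception.record_state("intersection 2", {"queue_people": 12})
    compute = AgentComputeModule()
    compute.store_agent("intersection 2", "trained-agent-placeholder")
    print(storage, compute)
```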

The device embodiment and the method embodiment of the multi-mode traffic trunk line signal coordination control based on multi-agent cooperation disclosed above belong to the same concept; the specific implementation process is described in the method embodiment and is not repeated here.

It should be understood that the above examples are given only for clarity of illustration and do not limit the embodiments. Other variations and modifications will be apparent to persons skilled in the art in light of the above description. It is neither necessary nor possible to list all embodiments exhaustively here, and obvious variations or modifications derived therefrom remain within the scope of the invention.

Claims

1. A multi-mode traffic trunk signal coordination control method based on multi-agent cooperation is characterized by comprising the following steps:

the state s_i^{t_k} is input to the neural network of agent i, whose parameters at time t_k are θ_i^{t_k}, where Q_i(s_i^{t_k}, a_i; θ_i^{t_k}) denotes the value function, i.e. the Q value, when the neural network parameters are θ_i^{t_k}, the selected action phase is a_i and the state is s_i^{t_k}; A_i denotes the set of action phases that intersection i can release, and a_i denotes one action phase in A_i;

(4) initializing the arrival rate of the simulated multi-mode trunk line traffic flow, and setting the initial simulation time t₀ and the total simulation time T;

(5) acquiring the multi-mode traffic state of each agent in the traffic trunk line simulation, wherein, for agent i, the local multi-mode traffic observation state s_i^{t_k} of the corresponding i-th intersection at time t_k is acquired, which consists of the social vehicle state, the bus state, the pedestrian state and the non-motor vehicle state of the i-th intersection at time t_k, each comprising position, queue length and speed information, together with the phase states at time t_k of the intersections adjacent to the i-th intersection;

after the input to the neural network, the action phase a_i^{t_k} at time t_k is returned together with the Q value corresponding to that action phase, where θ_i^{t_k} denotes the parameters of the neural network of agent i at time t_k;

(7) executing the action phase a_i^{t_k} returned by each agent at the signal of the corresponding intersection in the traffic trunk line simulation for Δt seconds, so that the time becomes t_{k+1} = t_k + Δt, the simulation environment returning the team reward value of the multi-mode traffic trunk line agents at time t_k, in which one term represents the change in per-capita delay and another represents the change in queue length, each computed from the corresponding values at time t_k and at time t_{k+1};

the experience is saved to the experience replay pool, where s_all^{t_k} and s_all^{t_{k+1}} respectively denote the global state lists at time t_k and time t_{k+1}, and u_all^{t_k} = (a_1^{t_k}, ..., a_n^{t_k}) denotes the list of actions selected by all agents at time t_k, with a_n^{t_k} denoting the action executed by the n-th agent at time t_k;

(9) judging whether the preset simulation time is reached: if t_{k+1} ≥ T, proceeding to step (10); otherwise, returning to step (5) to iterate;

the global reward function represents multi-agent cooperation,

where γ denotes the attenuation coefficient and u_all denotes the set of actions of all agents;

2. A multi-mode traffic trunk line signal coordination control device based on multi-agent cooperation is characterized by comprising:

the multi-mode traffic trunk line sensing module, which comprises a traffic trunk line data sensing unit and a traffic trunk line state sensing unit, wherein the data sensing unit is used for acquiring the channelization design, the number of approach lanes, the road section lengths, the bus stop positions, and the positions of the non-motor vehicle lanes and sidewalks of all intersections of the target trunk line, and the state sensing unit is used for acquiring the number of bus services and their routes, departure intervals and dwell times, the numbers of occupants and the speeds of social vehicles, pedestrians and non-motor vehicles, the queue lengths in front of the intersections, and the current released phase of each intersection;

the cooperative multi-mode traffic trunk line signal coordination control agent calculation module, which comprises an agent calculation and storage unit used for calculating and storing the cooperative trunk line intersection agents during the iterative training of claim 1, and for outputting and storing the agents of each intersection of the multi-mode traffic trunk line obtained by multi-agent cooperative training.

3. A computer device comprising a processor, a memory, and a computer program stored in the memory and executable on the processor, characterized in that the computer program, when executed by the processor, implements the steps of the multi-mode traffic trunk line signal coordination control method based on multi-agent cooperation of claim 1.

4. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program which, when executed by a processor, implements the steps of the multi-mode traffic trunk line signal coordination control method based on multi-agent cooperation of claim 1.