CN115574826B - National park unmanned aerial vehicle patrol path optimization method based on reinforcement learning - Google Patents


Info

Publication number
CN115574826B
Authority
CN
China
Prior art keywords
path
unmanned aerial
aerial vehicle
energy consumption
task
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202211572414.2A
Other languages
Chinese (zh)
Other versions
CN115574826A (en)
Inventor
郭强辉
殷虹娇
张鹏
王永峰
宋尚源
刘兆泽
高琳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Deepiot Technology Co ltd
Nankai University
Original Assignee
Beijing Deepiot Technology Co ltd
Nankai University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Deepiot Technology Co ltd, Nankai University filed Critical Beijing Deepiot Technology Co ltd
Priority to CN202211572414.2A priority Critical patent/CN115574826B/en
Publication of CN115574826A publication Critical patent/CN115574826A/en
Application granted granted Critical
Publication of CN115574826B publication Critical patent/CN115574826B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01CMEASURING DISTANCES, LEVELS OR BEARINGS; SURVEYING; NAVIGATION; GYROSCOPIC INSTRUMENTS; PHOTOGRAMMETRY OR VIDEOGRAMMETRY
    • G01C21/00Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00
    • G01C21/20Instruments for performing navigational calculations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • G06Q10/047Optimisation of routes or paths, e.g. travelling salesman problem
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems


Abstract

The invention discloses a reinforcement-learning-based patrol route optimization method for national park unmanned aerial vehicles (UAVs). The method first takes the flight path of the UAV as the optimization target, adds constraint conditions on the traversal of path points, the electric quantity limitation of the UAV and the energy consumption of executing tasks at path points, and establishes a UAV path planning model with a self-service charging function. It then maps the UAV, the path points, the charging base station, the energy, the battery capacity, the flight path energy consumption and the path point task energy consumption of the model to the corresponding elements of a CVRP (capacitated vehicle routing problem) model. Using a feedforward weighting method, the UAV patrol path planning problem, which originally had to consider both edge energy consumption constraints and point energy consumption constraints, is reduced to a CVRP that takes the path length as the optimization target and the customer demand and vehicle cargo capacity as constraints. Finally, the reduced CVRP is solved with a multi-decoder attention model.

Description

National park unmanned aerial vehicle patrol path optimization method based on reinforcement learning
Technical Field
The invention belongs to the technical fields of intelligent computing and unmanned aerial vehicle flight control, and particularly relates to a reinforcement-learning-based patrol path optimization method for national park unmanned aerial vehicles.
Background
Field patrol monitoring is the most important means of ecological monitoring and daily supervision in national parks and nature reserves. Through patrol monitoring, rangers collect data on wild species populations, habitats, phenology and the like, can discover ecological problems and curb illegal activities in time, effectively protect national parks and nature reserves, and provide a decision basis for natural resource supervision. However, national parks and nature reserves cover large, wide-ranging areas with complex terrain; most regions are difficult for people and vehicles to reach, and the traditional manual patrol mode is inefficient, time-consuming and labor-intensive. Therefore, in recent years, unmanned aerial vehicles have been increasingly used in the patrol monitoring work of all kinds of nature reserves.
Unmanned aerial vehicle technology is a remote sensing technology that fuses aircraft technology, communication technology, GPS differential positioning technology and imaging technology; by carrying sensing equipment such as high-definition cameras and intelligent sensors and combining a wireless communication network, it realizes automatic acquisition and transmission of monitoring data. However, the UAVs currently used for patrol monitoring of national parks and nature reserves face challenges such as short endurance, high demands on flight control personnel, difficulty in aircraft storage and transportation, and high difficulty of application integration, and can hardly meet the application requirements of normalized monitoring.
An automatic UAV airport is a ground automation facility that assists the UAV through its whole operation flow and provides all-weather protection. Through structural designs for automatic opening and closing, lifting, and battery loading and unloading, take-off, landing, storage and battery management are all completed automatically, without human intervention. The UAV is stored in the automatic airport; when a flight demand arises, it takes off from the airport autonomously, lands back in the automatic airport after the task is finished, and is charged there in preparation for the next task, realizing fully automatic operation.
To realize the normalized use of UAVs in the ecological monitoring work of national parks and nature reserves, and to meet field patrol monitoring and management demands, this patent performs path planning, electric quantity state control and dispatch scheduling for the UAV based on the automatic UAV airport, greatly improving UAV patrol monitoring efficiency.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provides a national park unmanned aerial vehicle patrol route optimization method based on reinforcement learning.
The invention is realized by the following technical scheme:
A national park unmanned aerial vehicle patrol path optimization method based on reinforcement learning comprises the following steps:
Step 1: input three-dimensional terrain data to generate a bounded three-dimensional region, and set a path point set V = {v_1, v_2, ..., v_n} in the air above the region according to the performance of the unmanned aerial vehicle's onboard camera and the patrol requirements; the unmanned aerial vehicle is required to complete the visual coverage task after traversing all path points.
Step 2: taking the flight path of the unmanned aerial vehicle as the optimization target, add constraint conditions on the traversal of path points, the electric quantity limitation of the unmanned aerial vehicle and the energy consumption of task execution at the path points, and establish an unmanned aerial vehicle path planning model with a self-service charging function.
Step 3: map the unmanned aerial vehicle, the path points, the charging base station, the energy, the battery capacity, the flight path energy consumption and the path point task energy consumption in the established model to, respectively, the vehicle, the customers, the warehouse, the cargo, the maximum cargo capacity of the vehicle, the path length and the customer demands in the CVRP problem model; define a new path point task energy consumption using a feedforward weighting method, so that it comprises both the task energy consumption of a path point and the average edge energy consumption of reaching that path point; the new path point task energy consumption corresponds to the customer demand of the CVRP model, whereby the unmanned aerial vehicle patrol path planning problem is reduced to a CVRP that takes the path length as the optimization target and the customer demand and vehicle cargo capacity as constraints.
Step 4: solve the CVRP problem reduced in step 3 using a multi-decoder attention model.
In the above technical scheme, in step 2, the unmanned aerial vehicle path planning model with a self-service charging function is established as follows:
Step 2.1: define the flight path decision variable x_ij of the unmanned aerial vehicle:
x_ij = 1 means the unmanned aerial vehicle flies from path point i to path point j;
x_ij = 0 means the unmanned aerial vehicle does not fly from path point i to path point j.
Define the objective function:

$$\min \sum_{i}\sum_{j \neq i} c_{ij}\, x_{ij} \qquad (1)$$

wherein c_ij is the flight path energy consumption, representing the energy required for the unmanned aerial vehicle to fly between path point i and path point j.
The flight path decision variables must form a complete and feasible one-pass traversal path, with the constraints:

$$\sum_{i \neq j} x_{ij} = 1, \quad \forall j \in V \qquad (2)$$

$$\sum_{j \neq i} x_{ij} = 1, \quad \forall i \in V \qquad (3)$$

Step 2.2: for the self-service charging function of the unmanned aerial vehicle, the path planning is adjusted to include a charging base station. The energy consumption of the unmanned aerial vehicle is measured by the flight path; the maximum endurance of the unmanned aerial vehicle is recorded as Q, the energy variable E_ij is defined, and the charging base station, which is the starting point of the unmanned aerial vehicle, is recorded as v_0.
During the execution of the task, the remaining endurance of the unmanned aerial vehicle is non-negative and does not exceed the maximum endurance Q:

$$0 \le E_{ij} \le Q\, x_{ij} \qquad (4)$$

$$E_{ij} = \Big(\sum_{k \neq i} E_{ki} - c_{ij} - r_j\Big)\, x_{ij} \qquad (5)$$

wherein r_j is the task energy consumption of path point j, representing the energy required for the unmanned aerial vehicle to complete the patrol task of path point j; x_ki is the decision variable of the edge from a path point k (k ≠ i) to path point i; and E_ki represents the energy remaining after the unmanned aerial vehicle flies from path point k to path point i and executes the task.
When the unmanned aerial vehicle leaves the charging base station, its electric quantity is full:

$$E_{0i} = (Q - c_{0i} - r_i)\, x_{0i} \qquad (6)$$

wherein E_0i represents the energy remaining after the unmanned aerial vehicle leaves the charging base station and arrives at path point i, x_0i is the decision variable of the unmanned aerial vehicle flying from the charging base station to path point i, and r_i is the task energy consumption of path point i, representing the energy required for the unmanned aerial vehicle to complete the patrol task of path point i.
In the above technical scheme, in step 3, first, without considering the edge energy consumption constraints between path points, the CVRP problem corresponding to the unmanned aerial vehicle patrol path is solved independently multiple times using a deep reinforcement learning method; the number of solving times is recorded as N. At each solving, the neural network in the deep reinforcement learning model is retrained, and each trained network is used to predict the CVRP problem corresponding to the original unmanned aerial vehicle patrol problem; the N different solutions obtained from the N solvings form a solution set S_N, which contains N patrol path schemes.
On the basis of the known solution set, a new task point energy consumption variable r'_j is redefined:

$$r'_j = r_j + \frac{\sum_{i \neq j} N_{ij}\, c_{ij}}{\sum_{i \neq j} N_{ij}} \qquad (7)$$

wherein N_ij represents the number of occurrences of the edge from path point i to path point j in the solution set S_N; the added term is the weighted average of the edge energy consumption required to reach the path point, with the weights N_ij taken from the solution set S_N obtained by optimizing the total patrol task path length.
In the above technical solution, the solving process of step 4 comprises the following steps:
Step 4.1: first, according to the scale of the input information, several groups of data sets with the same number of path points are generated. Suppose there are K groups of data sets; the information in the i-th group includes a randomly generated starting point v_0, randomly generated path point positions {v_1, ..., v_n} and randomly generated path point task energy consumptions {r'_1, ..., r'_n}, wherein i = 1, 2, ..., K.
Step 4.2: the multi-decoder attention model is trained using the K generated groups of data sets, with the parameters of the encoder and decoders denoted θ. The model is trained by a policy gradient algorithm with baseline, and the parameters of the model are iteratively updated to obtain the trained multi-decoder attention model.
Step 4.3: after the training of the model parameters is finished, the data of the original unmanned aerial vehicle mission planning problem is input into the trained model as a reduced CVRP problem instance, and the output sequence of the model is taken as the path point visiting scheme of the unmanned aerial vehicle patrol problem.
In the above technical solution, in step 4.3, the data of the original unmanned aerial vehicle mission planning problem includes the starting point v_0, the n path points {v_1, ..., v_n} and the task energy consumption of each path point, wherein the path point task energy consumption refers to the new path point task energy consumption defined in step 3.
The invention has the advantages and beneficial effects that:
the base station is introduced to provide real-time charging service for the working unmanned aerial vehicle, and the unmanned aerial vehicle can access the base station to perform charging for multiple times when executing tasks. Under the system, a constraint formula is constructed by taking the optimized unmanned aerial vehicle task path length as a target, a multi-unmanned aerial vehicle path planning model is established, and the problem is converted into a combined optimization problem. A known combined optimization solver is utilized, a feedforward weighting method is designed to calculate the path energy consumption constraint, and the problem is further converted into a vehicle path problem (CVRP) with capacity limitation. In addition, the deep reinforcement learning method based on the multi-decoder attention model can stably output a high-quality solution of a visual coverage problem for a specific scene, has generalization capability on solving the reduced unmanned aerial vehicle path planning problem, has strong adaptability to a training data set, can guarantee an efficient training network for path planning under different scenes, and can obtain the high-quality solution. Based on a trained learning model, the result can be quickly obtained by only calling neural network prediction after the unmanned aerial vehicle path problem example is reduced, the solving speed is higher than the efficiency of the traditional search algorithm, and the decision requirement of the unmanned aerial vehicle quick scheduling planning can be met.
Drawings
FIG. 1 is a flow chart of the national park unmanned aerial vehicle patrol route optimization method based on reinforcement learning.
FIG. 2 is a flow diagram of solving a problem instance with the multi-decoder attention model.
For a person skilled in the art, without inventive effort, other relevant figures can be derived from the above figures.
Detailed Description
In order to make the technical solution of the present invention better understood, the technical solution of the present invention is further described below with reference to specific examples.
A national park unmanned aerial vehicle patrol path optimization method based on reinforcement learning is disclosed, referring to the attached figure 1, and comprises the following steps:
Step 1: input three-dimensional terrain data to generate a bounded three-dimensional region and, according to the performance of the unmanned aerial vehicle's onboard camera and the patrol requirements, set a path point set V = {v_1, v_2, ..., v_n} in the air above the region, obtaining the initial data of the problem; the unmanned aerial vehicle is required to complete the visual coverage task after traversing all path points.
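As a concrete illustration of step 1, the following is a minimal sketch (Python) of one way such a coverage path point set might be generated; the patent does not prescribe how V is constructed, so the regular grid, the fixed patrol altitude and the footprint/overlap parameters below are illustrative assumptions only.

```python
import numpy as np

def generate_waypoints(x_range, y_range, altitude, footprint, overlap=0.2):
    """Lay a regular coverage grid of waypoints over a rectangular region.

    The grid spacing is the camera footprint shrunk by the desired image
    overlap, so that pictures taken at neighbouring waypoints overlap and
    the traversal yields full visual coverage. Terrain following and
    no-fly zones are omitted in this sketch.
    """
    step = footprint * (1.0 - overlap)
    xs = np.arange(x_range[0], x_range[1] + step, step)
    ys = np.arange(y_range[0], y_range[1] + step, step)
    return [(float(x), float(y), altitude) for x in xs for y in ys]

V = generate_waypoints((0.0, 5000.0), (0.0, 3000.0),
                       altitude=120.0, footprint=400.0)
print(len(V), "waypoints, e.g.", V[0])
```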
Step 2: establish the constraint formulation. Taking the flight path of the unmanned aerial vehicle as the optimization target, add constraint conditions on the traversal of path points, the electric quantity limitation of the unmanned aerial vehicle and the energy consumption of task execution at the path points, and establish an unmanned aerial vehicle path planning model with a self-service charging function, leaving aside uncontrollable factors such as wind, visibility and faults of the unmanned aerial vehicle. The specific steps are as follows.
Step 2.1: define the flight path decision variable x_ij of the unmanned aerial vehicle:
x_ij = 1 means the unmanned aerial vehicle flies from path point i to path point j;
x_ij = 0 means the unmanned aerial vehicle does not fly from path point i to path point j.
Define the objective function:

$$\min \sum_{i}\sum_{j \neq i} c_{ij}\, x_{ij} \qquad (1)$$

wherein c_ij is the flight path energy consumption, representing the energy required between path point i and path point j of the unmanned aerial vehicle; the energy consumption between path points is proportional to the distance, and the aim of the task is to optimize the flight path of the unmanned aerial vehicle so as to minimize it on the premise of completing the task. Meanwhile, the flight path decision variables must form a complete and feasible one-pass traversal path, with the specific constraints:

$$\sum_{i \neq j} x_{ij} = 1, \quad \forall j \in V \qquad (2)$$

$$\sum_{j \neq i} x_{ij} = 1, \quad \forall i \in V \qquad (3)$$

Step 2.2: for the self-service charging function of the unmanned aerial vehicle, the path planning is adjusted to include a charging base station. The energy consumption of the unmanned aerial vehicle is measured by the flight path; the maximum endurance of the unmanned aerial vehicle is recorded as Q, the energy variable E_ij is defined, and the charging base station, i.e., the departure point of the unmanned aerial vehicle, is recorded as v_0.
First, the unmanned aerial vehicle consumes energy as it moves between path points, and its remaining endurance during the mission must be non-negative and must not exceed the maximum endurance Q:

$$0 \le E_{ij} \le Q\, x_{ij} \qquad (4)$$

$$E_{ij} = \Big(\sum_{k \neq i} E_{ki} - c_{ij} - r_j\Big)\, x_{ij} \qquad (5)$$

wherein r_j is the task energy consumption of path point j, representing the energy required for the unmanned aerial vehicle to complete the patrol task of path point j; x_ki is the decision variable of the edge from a path point k (k ≠ i) to path point i; and E_ki represents the energy (i.e., electric quantity) remaining after the unmanned aerial vehicle flies from path point k to path point i and executes the task.
Secondly, when the unmanned aerial vehicle leaves the charging base station, its electric quantity is full:

$$E_{0i} = (Q - c_{0i} - r_i)\, x_{0i} \qquad (6)$$

wherein E_0i represents the energy remaining after the unmanned aerial vehicle leaves the charging base station and arrives at path point i, x_0i is the decision variable of the unmanned aerial vehicle flying from the charging base station to path point i, and r_i is the task energy consumption of path point i, representing the energy required for the unmanned aerial vehicle to complete the patrol task of path point i.
In conclusion, an unmanned aerial vehicle path planning model with a self-service charging function is established, comprising the objective function (1) and the constraint formulas (2), (3), (4), (5) and (6). Solving the model is a combinatorial optimization problem, i.e., the unmanned aerial vehicle patrol path planning problem is converted into a combinatorial optimization problem.
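To make constraints (4)-(6) concrete, the sketch below checks whether a candidate traversal (a sequence of path point indices in which index 0 is the charging base station v_0) is energy-feasible. It is a minimal illustration assuming, as stated above, that the flight energy c_ij is proportional to the distance flown; the function names and data layout are not part of the patent.

```python
import math

def flight_energy(p, q, k=1.0):
    """c_ij: flight energy between two path points, proportional to distance."""
    return k * math.dist(p, q)

def route_is_feasible(route, pos, task_energy, Q):
    """Check the energy constraints (4)-(6) along one candidate tour.

    route       : path point indices, starting at the base station 0
    pos         : index -> coordinates of each path point
    task_energy : index -> r_i (task energy; 0 for the base station)
    Q           : maximum endurance (a full battery)
    """
    E = Q  # battery is full when leaving the charging base station, eq. (6)
    for i, j in zip(route, route[1:]):
        E -= flight_energy(pos[i], pos[j]) + task_energy[j]
        if E < 0:          # remaining energy must stay non-negative, eq. (4)
            return False
        if j == 0:         # back at the base station: recharge to full
            E = Q
    return True

pos = {0: (0, 0), 1: (3, 4), 2: (6, 8)}
r = {0: 0.0, 1: 2.0, 2: 1.5}
print(route_is_feasible([0, 1, 2, 0], pos, r, Q=30.0))  # True
```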
Step 3: referring to Table 1, the unmanned aerial vehicle, the path points, the charging base station, the energy (i.e., electric quantity), the battery capacity, the flight path energy consumption and the path point task energy consumption in the unmanned aerial vehicle path planning model with self-service charging function established above are mapped, respectively, to the vehicle, the customers, the warehouse, the cargo, the maximum cargo capacity of the vehicle, the path length and the customer demand in the CVRP (capacitated vehicle routing problem) model, and the unmanned aerial vehicle path planning model is thereby converted into a capacitated vehicle routing problem (CVRP).

Table 1: Correspondence between the unmanned aerial vehicle path planning model and the CVRP problem model

UAV path planning model              | CVRP problem model
unmanned aerial vehicle              | vehicle
path point                           | customer
charging base station                | warehouse
energy (electric quantity)           | cargo
battery capacity                     | maximum cargo capacity of the vehicle
flight path energy consumption       | path length
path point task energy consumption   | customer demand
The energy consumption of the unmanned aerial vehicle comprises the edge energy consumption of flying from path point to path point and the point energy consumption required to complete the patrol task at a path point. In the CVRP problem model, however, the edge cost serves only as the optimization target of the vehicle path, and only the point demand acts as a constraint condition on the vehicle route. The invention therefore uses a feedforward weighting method to let the point energy consumption stand in for "point plus edge energy consumption", folding the edge energy consumption into the constraint condition, so that the unmanned aerial vehicle patrol path planning problem, which originally had to consider both edge energy consumption constraints and point energy consumption constraints, is reduced to a CVRP problem that takes the path length as the optimization target and the customer demand and vehicle cargo capacity as constraints. The specific treatment method is as follows.
First, without considering the edge energy consumption constraint, the CVRP problem corresponding to the unmanned aerial vehicle patrol path is solved independently multiple times using a deep reinforcement learning method; the number of solving times is recorded as N. At each solving, the neural network in the deep reinforcement learning model is retrained (independently), and each trained neural network is used to predict the CVRP problem corresponding to the original unmanned aerial vehicle patrol problem. Because the generation and sampling of the training set are random, the N neural networks obtained in the N training runs differ, and so do their predictions of the problem; the N different solutions therefore form a solution set S_N containing N patrol path schemes.
On the basis of the known solution set, the new path point task energy consumption r'_j (i.e., the energy consumption attributed to path point j for completing its patrol task) is redefined:

$$r'_j = r_j + \frac{\sum_{i \neq j} N_{ij}\, c_{ij}}{\sum_{i \neq j} N_{ij}} \qquad (7)$$

wherein N_ij represents the number of occurrences of the edge from path point i to path point j in the solution set S_N; the added term is the weighted average of the edge energy consumption required to reach the path point, with the weights N_ij taken from the solution set S_N obtained by optimizing the total patrol task path length.
The new path point task energy consumption r'_j corresponds to the customer demand of the CVRP problem model. Since r'_j comprises both the task energy consumption of a path point and the average edge energy consumption of reaching it, the patrol path problem that originally had to consider both edge energy consumption constraints and point energy consumption constraints is reduced to a CVRP problem that takes the path length as the optimization target and the customer demand and vehicle cargo capacity as constraints.
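A minimal sketch of the feedforward weighting of formula (7) follows, assuming the solution set S_N is available as a list of tours (sequences of path point indices); the helper name and the data layout are illustrative, not part of the patent.

```python
from collections import Counter

def feedforward_weighted_demand(solutions, c, r):
    """Formula (7): fold the average incoming-edge energy into point energy.

    solutions : the solution set S_N, a list of tours (index sequences)
    c         : c[i][j], flight energy of the edge i -> j
    r         : {j: r_j}, task energy of each path point
    returns   : {j: r'_j} with r'_j = r_j + sum_i N_ij c_ij / sum_i N_ij
    """
    edge_count = Counter()                     # N_ij, occurrences of i -> j
    for tour in solutions:
        for i, j in zip(tour, tour[1:]):
            edge_count[(i, j)] += 1

    r_new = dict(r)
    for j in r:
        num = sum(n * c[i][jj] for (i, jj), n in edge_count.items() if jj == j)
        den = sum(n for (i, jj), n in edge_count.items() if jj == j)
        if den > 0:
            r_new[j] = r[j] + num / den        # weighted average edge energy
    return r_new

S_N = [[0, 1, 2, 0], [0, 2, 1, 0]]
c = [[0, 5, 10], [5, 0, 5], [10, 5, 0]]
print(feedforward_weighted_demand(S_N, c, {1: 2.0, 2: 1.5}))
# {1: 7.0, 2: 9.0}
```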
Step 4: the CVRP problem reduced in step 3 is solved using a multi-decoder attention model.
The data of the unmanned aerial vehicle path planning problem comprises the starting point v_0, the n path points {v_1, ..., v_n} and the task energy consumption of each path point (i.e., the new path point task energy consumption defined in step 3), which is reduced, following step 3, to the warehouse, customers and customer demands of the CVRP problem and serves as the input information of the model. The encoder of the model is based on the Transformer architecture; the decoder part uses several decoders with identical structure but independent parameters, and the diversity of the solutions constructed by the decoders is measured by the Kullback-Leibler divergence ("KL divergence") between the probability distributions computed by different decoders. In addition, each decoder masks visited nodes when computing attention weights, which enforces the task path constraints of the CVRP problem. The model is trained by a policy gradient algorithm with baseline on several randomly generated data sets of the same scale as the problem to be solved. Referring to fig. 2, the specific solving process is as follows.
Step 4.1: first, according to the scale of the input information, several groups of data sets with the same number of path points are generated. Suppose there are K groups in total; taking the i-th group of data sets as an example, the information in it includes a randomly generated starting point v_0, randomly generated path point positions {v_1, ..., v_n} and randomly generated path point task energy consumptions {r'_1, ..., r'_n}, wherein i = 1, 2, ..., K.
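A minimal sketch of the instance generation of step 4.1 is given below; the unit-square coordinates and the demand range are assumptions made for illustration, since the patent only requires randomly generated start points, path point positions and task energies of a fixed size n.

```python
import numpy as np

def make_dataset(K, n, demand_range=(1, 10), seed=0):
    """Generate K training instances that all contain n path points.

    Each instance holds a random start point v0, n random path point
    positions and n random task energies (the r' values that play the
    role of customer demands in the reduced CVRP).
    """
    rng = np.random.default_rng(seed)
    dataset = []
    for _ in range(K):
        dataset.append({
            "v0": rng.random(2),                      # charging base station
            "waypoints": rng.random((n, 2)),          # path point positions
            "demand": rng.integers(demand_range[0],   # task energies r'
                                   demand_range[1] + 1, size=n),
        })
    return dataset

train_set = make_dataset(K=512, n=50)
print(train_set[0]["waypoints"].shape)  # (50, 2)
```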
Step 4.2: the multi-decoder attention model is trained using the K generated groups of data sets, with the parameters of the encoder and decoders denoted θ. The model is trained by a policy gradient algorithm with baseline, and the parameters of the model are iteratively updated. The training target is to optimize the model parameters so that the path length of the customer visiting scheme is shortest while the decoders remain mutually diverse, the diversity being measured by the KL divergence between the decoder output distributions; denote by L(θ) the total task path length computed under the model parameters θ, and by D_KL(θ) the KL divergence of the decoder outputs under θ. Parameter training is performed by the following algorithm, yielding the trained multi-decoder attention model.
The reinforcement learning algorithm with baseline is as follows:
1. Input: the K-group data set, the significance level α, and the number of training epochs T.
2. Initialize the model parameters θ.
3. Record the baseline parameters θ_BL.
4. Set the current training epoch t = 1.
5. Combining the optimization targets, compute the task path length L(θ) and the KL divergence D_KL(θ) of the model output under the current K-group data set and parameters θ, and obtain the optimization direction ∇θ.
6. Update the parameters θ along ∇θ using the Adam optimizer.
7. Compare the parameters θ with the baseline parameters θ_BL using a t-test; if the significance is less than α, update θ_BL ← θ.
8. If t < T, set t ← t + 1 and return to step 5; otherwise, go to the next step.
9. Training ends; the result is the multi-decoder attention model with parameters θ.
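The sketch below illustrates the baseline algorithm above with a deliberately tiny policy network standing in for the multi-decoder attention model: it uses a single decoder, so the KL-divergence diversity term and the Transformer encoder of the real model are omitted for brevity. The greedy-rollout baseline replaced after a paired t-test follows steps 3-7; everything else (network shape, learning rate, data) is an assumption for illustration.

```python
import copy
import torch
import torch.nn as nn
from scipy.stats import ttest_rel

class TinyPolicy(nn.Module):
    """Stand-in for the multi-decoder attention model: it scores the
    unvisited path points and a tour is decoded node by node. The real
    model uses a Transformer encoder and several parallel decoders."""
    def __init__(self):
        super().__init__()
        self.score = nn.Sequential(nn.Linear(2, 32), nn.ReLU(),
                                   nn.Linear(32, 1))

    def rollout(self, coords, sample=True):
        """Return (log-probability, length) of one tour starting at node 0."""
        n = coords.size(0)
        visited = torch.zeros(n, dtype=torch.bool)
        visited[0] = True
        cur, logp, length = 0, torch.zeros(()), torch.zeros(())
        for _ in range(n - 1):
            logits = self.score(coords).squeeze(-1)
            logits = logits.masked_fill(visited, float("-inf"))  # mask visited
            dist = torch.distributions.Categorical(logits=logits)
            nxt = dist.sample() if sample else logits.argmax()
            logp = logp + dist.log_prob(nxt)
            length = length + torch.linalg.norm(coords[nxt] - coords[cur])
            visited[nxt] = True
            cur = nxt
        return logp, length + torch.linalg.norm(coords[cur] - coords[0])

def train(dataset, epochs=2, alpha=0.05):
    model = TinyPolicy()
    baseline = copy.deepcopy(model)            # step 3: record the baseline
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)   # step 6: Adam
    for _ in range(epochs):                    # steps 4 and 8: epoch loop
        cur_lens, base_lens = [], []
        for coords in dataset:                 # step 5: path length objective
            logp, length = model.rollout(coords, sample=True)
            with torch.no_grad():              # greedy baseline rollout
                _, b_len = baseline.rollout(coords, sample=False)
            loss = (length.detach() - b_len) * logp  # REINFORCE with baseline
            opt.zero_grad(); loss.backward(); opt.step()
            cur_lens.append(float(length)); base_lens.append(float(b_len))
        t, p = ttest_rel(cur_lens, base_lens)  # step 7: paired t-test
        if t < 0 and p / 2 < alpha:            # significantly better model:
            baseline = copy.deepcopy(model)    # replace the baseline
    return model

trained = train([torch.rand(11, 2) for _ in range(8)])
```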
Step 4.3: after the training of the model parameters is finished, the data of the original unmanned aerial vehicle mission planning problem (including the starting point v_0, the n path points {v_1, ..., v_n} and the task energy consumption information of each path point) is input into the trained model as a reduced CVRP problem instance, and the output sequence of the model is taken as the path point visiting scheme of the unmanned aerial vehicle patrol problem.
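Finally, to illustrate the form of the resulting path point visiting scheme, the sketch below decodes a reduced CVRP instance into trips that return to the charging base station whenever the accumulated demand r' would exceed the capacity Q. A plain nearest-neighbour rule stands in here for the output sequence of the trained model; it is not the patent's solution method.

```python
import math

def greedy_patrol_plan(v0, waypoints, demand, Q):
    """Split the visit of all path points into battery-feasible trips.

    A trip returns to the charging base station v0 whenever the next
    path point's demand r' would exceed the remaining capacity Q; a
    nearest-neighbour rule stands in for the trained model's output.
    """
    todo = set(range(len(waypoints)))
    plan, trip, load, cur = [], [], 0.0, v0
    while todo:
        nxt = min(todo, key=lambda j: math.dist(cur, waypoints[j]))
        if load + demand[nxt] > Q and trip:    # battery would run out:
            plan.append(trip)                  # close the trip, recharge
            trip, load, cur = [], 0.0, v0
            continue
        trip.append(nxt)
        load += demand[nxt]
        cur = waypoints[nxt]
        todo.remove(nxt)
    if trip:
        plan.append(trip)
    return plan  # a list of trips, each starting and ending at the base

plan = greedy_patrol_plan((0.0, 0.0), [(1, 0), (2, 0), (0, 2)],
                          demand=[4, 4, 4], Q=10)
print(plan)  # [[0, 1], [2]]
```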
The invention has been described in an illustrative manner, and it is to be understood that any simple variations, modifications or other equivalent changes which can be made by one skilled in the art without departing from the spirit of the invention fall within the scope of the invention.

Claims (3)

1. A national park Unmanned Aerial Vehicle (UAV) patrol path optimization method based on reinforcement learning is characterized by comprising the following steps of:
step 1: inputting three-dimensional terrain data to generate a bounded three-dimensional region, and setting a path point set V = {v_1, v_2, ..., v_n} in the air above the region according to the performance and patrol requirements of the unmanned aerial vehicle's onboard camera, the unmanned aerial vehicle being required to complete the visual coverage task after traversing all the path points;
and 2, step: taking the flight path of the unmanned aerial vehicle as an optimization target, adding constraint conditions of traversal path points of the unmanned aerial vehicle, electric quantity limitation of the unmanned aerial vehicle and energy consumption of task execution of the path points, and establishing an unmanned aerial vehicle path planning model with a self-service charging function; the method comprises the following specific steps:
step 2.1: defining a flight path decision variable x_ij for the unmanned aerial vehicle:
x_ij = 1, indicating that the drone flies from path point i to path point j;
x_ij = 0, indicating that the drone does not fly from path point i to path point j;
defining an objective function:

$$\min \sum_{i}\sum_{j \neq i} c_{ij}\, x_{ij} \qquad (1)$$

wherein c_ij is the flight path energy consumption, representing the energy consumption required between path point i and path point j of the unmanned aerial vehicle;
the flight path decision variables are to form a complete and feasible one-pass traversal path, with the constraints:

$$\sum_{i \neq j} x_{ij} = 1, \quad \forall j \in V \qquad (2)$$

$$\sum_{j \neq i} x_{ij} = 1, \quad \forall i \in V \qquad (3)$$
step 2.2: aiming at the self-service charging function of the unmanned aerial vehicle, the path planning with the charging base station is adjusted; the energy consumption of the unmanned aerial vehicle is measured by the flight path, the maximum endurance of the unmanned aerial vehicle is recorded as Q, an energy variable E_ij is defined, and the charging base station, which is the starting point of the unmanned aerial vehicle, is recorded as v_0;
the remaining endurance of the drone during the mission is non-negative and does not exceed the maximum endurance Q:

$$0 \le E_{ij} \le Q\, x_{ij} \qquad (4)$$

$$E_{ij} = \Big(\sum_{k \neq i} E_{ki} - c_{ij} - r_j\Big)\, x_{ij} \qquad (5)$$

wherein r_j is the task energy consumption of path point j, which represents the energy consumption required for the unmanned aerial vehicle to finish the patrol task of path point j; x_ki is the decision variable of the edge from a path point k (k ≠ i) to path point i; and E_ki represents the energy left after the unmanned aerial vehicle flies from path point k to path point i and executes the task;
when unmanned aerial vehicle leaves charging base station, the electric quantity is full, and the formula is as follows:
$$E_{0i} = (Q - c_{0i} - r_i)\, x_{0i} \qquad (6)$$

wherein E_0i represents the remaining energy of the unmanned aerial vehicle after leaving the charging base station and arriving at path point i, x_0i is the decision variable representing the flight of the unmanned aerial vehicle from the charging base station to path point i, and r_i is the task energy consumption of path point i, representing the energy consumption required for the unmanned aerial vehicle to finish the patrol task of path point i;
step 3: mapping the unmanned aerial vehicle, the path points, the charging base station, the energy, the battery capacity, the flight path energy consumption and the path point task energy consumption in the established unmanned aerial vehicle path planning model with self-service charging function to, respectively, the vehicle, the customers, the warehouse, the cargo, the maximum cargo capacity of the vehicle, the path length and the customer demands in the CVRP problem model; defining a new path point task energy consumption by a feedforward weighting method, so that the new path point task energy consumption comprises both the task energy consumption of a path point and the average edge energy consumption of reaching that path point; the new path point task energy consumption corresponds to the customer demand of the CVRP problem model, whereby the unmanned aerial vehicle patrol path planning problem is reduced to a CVRP problem that takes the path length as the optimization target and the customer demand and vehicle cargo capacity as constraints;
in step 3, firstly, without considering the edge energy consumption constraint between path points, a deep reinforcement learning method is used to solve the CVRP problem corresponding to the unmanned aerial vehicle patrol path independently multiple times; the number of solving times is recorded as N, the neural network in the deep reinforcement learning model is retrained at each solving, each trained neural network is used to predict the CVRP problem corresponding to the original unmanned aerial vehicle patrol problem, and the N groups of different solutions obtained through the N solvings form a solution set S_N, the solution set S_N containing N patrol path schemes;
redefining, on the basis of the known solution set, a new task point energy consumption variable r'_j:

$$r'_j = r_j + \frac{\sum_{i \neq j} N_{ij}\, c_{ij}}{\sum_{i \neq j} N_{ij}} \qquad (7)$$

wherein N_ij represents the number of occurrences of the edge from path point i to path point j in the solution set S_N; the added term is the weighted average of the edge energy consumption required to reach the path point, with the weights N_ij taken from the solution set S_N obtained by optimizing the total patrol task path length;
step 4: the CVRP problem reduced in step 3 is solved using a multi-decoder attention model.
2. The reinforcement learning-based national park unmanned aerial vehicle patrol route optimization method according to claim 1, wherein: the solving process of the step 4 comprises the following steps:
step 4.1: firstly, according to the scale of the input information, several groups of data sets with the same number of path points are generated; supposing that there are K groups of data sets, the information in the i-th group of data sets includes a randomly generated starting point v_0, randomly generated path point positions {v_1, ..., v_n} and randomly generated path point task energy consumptions {r'_1, ..., r'_n}, wherein i = 1, 2, ..., K;
step 4.2: training a multi-decoder attention model using the K generated groups of data sets, wherein the parameters of the encoder and decoders in the model are θ; the model is trained by a policy gradient algorithm with baseline, and the parameters of the model are continuously and cyclically updated to obtain the trained multi-decoder attention model;
step 4.3: after the training of the model parameters is finished, inputting the data of the task planning problem of the original unmanned aerial vehicle as a reduced CVRP problem example into the trained model, and taking the output sequence of the model at the moment as a path point access scheme of the unmanned aerial vehicle patrol problem.
3. The reinforcement learning-based national park unmanned aerial vehicle patrol route optimization method according to claim 2, wherein: in step 4.3, the data of the original unmanned aerial vehicle mission planning problem comprises the starting point v_0, the n path points {v_1, ..., v_n} and the task energy consumption of each path point, and the path point task energy consumption refers to the new path point task energy consumption defined in step 3.
CN202211572414.2A 2022-12-08 2022-12-08 National park unmanned aerial vehicle patrol path optimization method based on reinforcement learning Active CN115574826B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211572414.2A CN115574826B (en) 2022-12-08 2022-12-08 National park unmanned aerial vehicle patrol path optimization method based on reinforcement learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211572414.2A CN115574826B (en) 2022-12-08 2022-12-08 National park unmanned aerial vehicle patrol path optimization method based on reinforcement learning

Publications (2)

Publication Number Publication Date
CN115574826A (en) 2023-01-06
CN115574826B (en) 2023-04-07

Family

ID=84590469

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211572414.2A Active CN115574826B (en) 2022-12-08 2022-12-08 National park unmanned aerial vehicle patrol path optimization method based on reinforcement learning

Country Status (1)

Country Link
CN (1) CN115574826B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116894519B (en) * 2023-07-21 2024-06-28 江苏舟行时空智能科技股份有限公司 Position point optimization determination method meeting user service coverage requirement

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110470301A (en) * 2019-08-13 2019-11-19 上海交通大学 Unmanned plane paths planning method under more dynamic task target points

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11727812B2 (en) * 2017-07-27 2023-08-15 Beihang University Airplane flight path planning method and device based on the pigeon-inspired optimization
CN110263983B (en) * 2019-05-31 2021-09-07 中国人民解放军国防科技大学 Double-layer path planning method and system for logistics distribution of vehicles and unmanned aerial vehicles
CN110428111B (en) * 2019-08-08 2022-12-30 西安工业大学 UAV/UGV (unmanned aerial vehicle/user generated Union vector) cooperative long-time multitask operation trajectory planning method
CN111429052A (en) * 2020-03-16 2020-07-17 北京航空航天大学 Initial solution structure for vehicle path problem distributed by cooperating unmanned aerial vehicle
US10809080B2 (en) * 2020-03-23 2020-10-20 Alipay Labs (singapore) Pte. Ltd. System and method for determining routing by learned selective optimization
US20210325195A1 (en) * 2020-04-20 2021-10-21 Insurance Services Office, Inc. Systems and Methods for Automated Vehicle Routing Using Relaxed Dual Optimal Inequalities for Relaxed Columns
CN111536979B (en) * 2020-07-08 2020-10-30 浙江浙能天然气运行有限公司 Unmanned aerial vehicle routing inspection path planning method based on random optimization
CN112132312B (en) * 2020-08-14 2022-08-23 蓝海(福建)信息科技有限公司 Path planning method based on evolutionary multi-objective multi-task optimization
CN114422363B (en) * 2022-01-11 2023-04-21 北京科技大学 Capacity optimization method and device for unmanned aerial vehicle-mounted RIS auxiliary communication system
CN115065939A (en) * 2022-06-08 2022-09-16 电子科技大学长三角研究院(衢州) Auxiliary communication unmanned aerial vehicle trajectory planning and power control method capable of charging in flight
CN115185303B (en) * 2022-09-14 2023-03-24 南开大学 Unmanned aerial vehicle patrol path planning method for national parks and natural protected areas

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110470301A (en) * 2019-08-13 2019-11-19 上海交通大学 Unmanned plane paths planning method under more dynamic task target points

Also Published As

Publication number Publication date
CN115574826A (en) 2023-01-06

Similar Documents

Publication Publication Date Title
CN108229685B (en) Air-ground integrated unmanned intelligent decision-making method
CN113342046B (en) Power transmission line unmanned aerial vehicle routing inspection path optimization method based on ant colony algorithm
CN112016812A (en) Multi-unmanned aerial vehicle task scheduling method, system and storage medium
CN106959700B (en) A kind of unmanned aerial vehicle group collaboration patrol tracing path planing method based on upper limit confidence interval algorithm
CN110597286B (en) Method for realizing unmanned aerial vehicle autonomous inspection of power transmission line by using smart hangar
Liu et al. Application of unmanned aerial vehicle hangar in transmission tower inspection considering the risk probabilities of steel towers
CN114169066B (en) Space target characteristic measuring and reconnaissance method based on micro-nano constellation approaching reconnaissance
Xu et al. A brief review of the intelligent algorithm for traveling salesman problem in UAV route planning
CN115574826B (en) National park unmanned aerial vehicle patrol path optimization method based on reinforcement learning
CN115185303B (en) Unmanned aerial vehicle patrol path planning method for national parks and natural protected areas
CN115840468B (en) Autonomous line inspection method of power distribution network unmanned aerial vehicle applied to complex electromagnetic environment
Masadeh et al. Reinforcement learning-based security/safety uav system for intrusion detection under dynamic and uncertain target movement
CN114638155A (en) Unmanned aerial vehicle task allocation and path planning method based on intelligent airport
CN117726153B (en) Real-time re-planning method suitable for unmanned aerial vehicle cluster operation tasks
Wang et al. A novel hybrid algorithm based on improved particle swarm optimization algorithm and genetic algorithm for multi-UAV path planning with time windows
Zheng et al. Robustness of the planning algorithm for ocean observation tasks
CN117744994A (en) Patrol unmanned aerial vehicle-aircraft nest distribution scheduling method based on goblet sea squirt algorithm
Gaowei et al. Using multi-layer coding genetic algorithm to solve time-critical task assignment of heterogeneous UAV teaming
Zhou et al. Route planning for unmanned aircraft based on ant colony optimization and voronoi diagram
CN116578120A (en) Unmanned aerial vehicle scheduling method and device, unmanned aerial vehicle system and computer equipment
CN114371728B (en) Unmanned aerial vehicle resource scheduling method based on multi-agent collaborative optimization
Dai et al. A genetic algorithm-based research on drone trajectory planning strategy of cooperative inspection of transmission lines, substations and distribution lines
Yin et al. Multi UAV cooperative task allocation method for intensive corridors of transmission lines inspection
Li et al. Intelligent Early Warning Method Based on Drone Inspection
CN117472083B (en) Multi-unmanned aerial vehicle collaborative marine search path planning method

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant