CN111461403B

CN111461403B - Vehicle path planning method and device, computer readable storage medium and terminal

Info

Publication number: CN111461403B
Application number: CN202010151343.3A
Authority: CN
Inventors: 郑仁; 项党
Original assignee: SAIC Motor Corp Ltd; Shanghai Automotive Industry Corp Group
Current assignee: SAIC Motor Corp Ltd; Shanghai Automotive Industry Corp Group
Priority date: 2020-03-06
Filing date: 2020-03-06
Publication date: 2023-09-29
Anticipated expiration: 2040-03-06
Also published as: CN111461403A

Abstract

The application provides a vehicle path planning method and device, a computer readable storage medium and a terminal, wherein a path planning model is obtained by training a depth network of a pointer network type, a name sequence and a distance sequence of an access site to be planned, which are obtained by coding, are input into the path planning model, the name sequence is reordered by the path planning model based on the distance sequence, and a new name sequence which can enable the planned path distance of the access site to be shortest is output, so that a planned path corresponding to the new name sequence is obtained. Based on the application, the path planning strategy can be molded by the deep network, so that the optimization result of the vehicle path planning problem can be directly calculated without searching.

Description

Vehicle path planning method and device, computer readable storage medium and terminal

Technical Field

The present application relates to the field of logistics technologies, and in particular, to a vehicle path planning method and apparatus, a computer readable storage medium, and a terminal.

Background

The vehicle path planning problem (VRP) is a combination optimization problem that has proven to be an NP-hard problem, and the optimal solution to the problem cannot be determined unless all combinations are exhaustive. In common scenes such as actual logistics transportation, vehicle scheduling, route planning and the like, the solving requirement of the vehicle path planning problem is often met.

At present, common optimization methods for solving the problems are heuristic algorithms, such as genetic algorithm, ant colony algorithm, particle swarm algorithm, simulated annealing, tabu search and the like. When the calculation scale of the problem is not large, the optimal path can be solved by using a dynamic programming algorithm, the time complexity and the space complexity required by the dynamic programming for solving the problem are high, and the calculation speed is very slow when more than 20 sites are generally visited. However, there is generally no specific optimization rule for such a problem, and the heuristic algorithm is essentially to gradually find a solution with better performance in a random search manner, so that it is undoubtedly necessary to consume time to wait for the solution to be continuously searched randomly. That is, the more likely a search is, the closer to the optimal solution is possible.

However, considering that the solving time of the problem is limited in the actual scenario, how to balance the performance of the solution and the computation loss time becomes a problem to be solved.

Disclosure of Invention

In view of the above, the present application provides a vehicle path planning method and apparatus, a computer readable storage medium, and a terminal. The technical proposal is as follows:

a vehicle path planning method, the method comprising:

encoding a name sequence and a distance sequence of access sites to be planned, wherein the name sequence comprises a name encoding value of each site in the access sites, and the distance sequence comprises a distance value between every two sites in the access sites;

inputting the name sequence and the distance sequence into a pre-trained path planning model, so that the path planning model reorders the name sequence based on the distance sequence and outputs a new name sequence, wherein the new name sequence can enable the planned path distance of the access site to be the shortest, and the path planning model is obtained by training a depth network of a pointer network type in advance;

and acquiring a planning path corresponding to the new name sequence.

Preferably, the process of training the depth network of the pointer network type in advance to obtain the path planning model includes:

acquiring training samples, wherein the training samples comprise a plurality of groups of name sequence samples and distance sequence samples of a training access site, and one group of name sequence samples corresponds to one planned training path and one distance sequence sample;

extracting a group of name sequence samples for the current training from the training samples, inputting the group of name sequence samples and corresponding distance sequence samples into the depth network of the pointer network type, so that the depth network of the pointer network type outputs one target name sequence sample capable of converging the distance of the corresponding planned training path in the group of name sequence samples based on the input distance sequence samples, wherein the initial value of the network parameter of the depth network of the pointer network type is preset;

the network parameters are adjusted through calculating the distance of the planned training path corresponding to the target name sequence sample, so that the depth network of the pointer network type re-executes the target name sequence sample which can enable the corresponding planned training path distance to be converged in the group of name sequence samples based on the input distance sequence sample, the training is finished when the finishing condition of the training is met, and a pointer is set for the final target name sequence sample output by the depth network of the pointer network type when the training is finished, wherein the pointer is the basis or the basis for re-training the depth network of the pointer network type;

and taking the depth network of the pointer network type after multiple training as the path planning model.

Preferably, the adjusting the network parameter by calculating the distance of the training planned path corresponding to the target name sequence sample includes:

calculating the distance of the planned training path corresponding to the target name sequence sample;

calculating the reference distance of the group of name sequence samples for the training;

calculating the gradient of the network parameter according to the distance of the planned training path and the reference distance;

the network parameters are updated based on the gradient of the network parameters.

Preferably, the calculating the reference distance of the set of name sequence samples for the training includes:

under the condition that the training is not the first training, acquiring a historical reference distance of the last training;

and calculating the reference distance of the group of name sequence samples for the training by utilizing the distance of the planned training path and the historical reference distance.

Preferably, the calculating the reference distance of the set of name sequence samples for the training further includes:

under the condition that the training is the first training, determining an optimized path of the set of name sequence samples by using a heuristic algorithm, and taking the distance of the optimized path as the reference distance of the set of name sequence samples in the training.

A vehicle path planning apparatus, the apparatus comprising: the system comprises an encoding module, a path planning module and a path acquisition module, wherein the path planning module comprises a model training unit;

the model training unit is used for training the depth network of the pointer network type in advance to obtain a path planning model;

the coding module is used for coding a name sequence and a distance sequence of the access sites to be planned, wherein the name sequence comprises a name coding value of each site in the access sites, and the distance sequence comprises a distance value between every two sites in the access sites;

the path planning module is used for inputting the name sequence and the distance sequence into the path planning model so that the path planning model reorders the name sequence based on the distance sequence and outputs a new name sequence, and the new name sequence can enable the planned path distance for accessing the access site to be the shortest;

the path acquisition module is used for acquiring the planning path corresponding to the new name sequence.

Preferably, the model training unit is configured to train the depth network of the pointer network type in advance to obtain a path planning model, and is specifically configured to:

acquiring training samples, wherein the training samples comprise a plurality of groups of name sequence samples and distance sequence samples of a training access site, and one group of name sequence samples corresponds to one planned training path and one distance sequence sample; extracting a group of name sequence samples for the current training from the training samples, inputting the group of name sequence samples and corresponding distance sequence samples into the depth network of the pointer network type, so that the depth network of the pointer network type outputs one target name sequence sample capable of converging the distance of the corresponding planned training path in the group of name sequence samples based on the input distance sequence samples, wherein the initial value of the network parameter of the depth network of the pointer network type is preset; the network parameters are adjusted through calculating the distance of the planned training path corresponding to the target name sequence sample, so that the depth network of the pointer network type re-executes the target name sequence sample which can enable the corresponding planned training path distance to be converged in the group of name sequence samples based on the input distance sequence sample, the training is finished when the finishing condition of the training is met, and a pointer is set for the final target name sequence sample output by the depth network of the pointer network type when the training is finished, wherein the pointer is the basis or the basis for re-training the depth network of the pointer network type; and taking the depth network of the pointer network type after multiple training as the path planning model.

Preferably, the model training unit is configured to adjust the network parameter by calculating a distance of a planned training path corresponding to the target name sequence sample, and is specifically configured to:

calculating the distance of the planned training path corresponding to the target name sequence sample; calculating the reference distance of the group of name sequence samples for the training; calculating the gradient of the network parameter according to the distance of the planned training path and the reference distance; the network parameters are updated based on the gradient of the network parameters.

A computer readable storage medium having stored thereon computer instructions, which when run perform the steps of any of the vehicle path planning methods described herein.

A terminal comprising a memory and a processor, the memory having stored thereon computer instructions executable on the processor, wherein the processor, when executing the computer instructions, performs the steps of any of the vehicle path planning methods.

The vehicle path planning method, the vehicle path planning device, the computer-readable storage medium and the terminal provided by the application have the advantages that the path planning model is obtained by training the depth network of the pointer network type, the name sequence and the distance sequence of the access site to be planned, which are obtained by encoding, are input into the path planning model, the name sequence is reordered by the path planning model based on the distance sequence, and a new name sequence which can enable the planned path distance of the access site to be shortest is output, so that the planned path corresponding to the new name sequence is obtained. Based on the application, the path planning strategy can be molded by the deep network, so that the optimization result of the vehicle path planning problem can be directly calculated without searching.

Drawings

In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings that are required to be used in the embodiments or the description of the prior art will be briefly described below, and it is obvious that the drawings in the following description are only embodiments of the present application, and that other drawings can be obtained according to the provided drawings without inventive effort for a person skilled in the art.

FIG. 1 is a flow chart of a vehicle path planning method according to an embodiment of the present application;

fig. 2 is a network structure diagram of a pointer network type deep network according to an embodiment of the present application;

FIG. 3 is a partial method flow chart of a vehicle path planning method provided by an embodiment of the present application;

fig. 4 is a schematic structural diagram of a vehicle path planning apparatus according to an embodiment of the present application.

Detailed Description

The following description of the embodiments of the present application will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present application, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the application without making any inventive effort, are intended to be within the scope of the application.

In the present disclosure, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising one … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.

The embodiment of the application provides a vehicle path planning method, a flow chart of the method is shown in fig. 1, and the method comprises the following steps:

s10, encoding a name sequence and a distance sequence of the access sites to be planned, wherein the name sequence comprises a name code value of each site in the access sites, and the distance sequence comprises a distance value between every two sites in the access sites.

In the embodiment of the application, the name of each site in the access sites on the path to be planned can be subjected to integer coding, and a sequence containing the name coding value of each site is recorded as a name sequence, and the name of one site corresponds to a unique name coding value; and meanwhile, the distance between every two stations in the access stations is coded, and a sequence containing the distance value between every two stations is recorded as a distance sequence.

For ease of understanding, three access sites A, B, C on the path to be planned are described below as an example, where the distance from site a to site B is a, the distance from site a to site C is B, the distance from site B to site C is C, and b=a+c:

the method comprises the steps of (1) carrying out integer coding on the site names of A, B, C three access sites, wherein the coded name code value of the site A is 1, the coded name code value of the site B is 2, and the coded name code value of the site C is 3, and the name sequence X= [1 2 3];

the distance between every two stations of the A, B, C three access stations is coded, and the distance value from the station A to the station A after coding is (A, A): the distance between site A and site B is (A, B) a, the distance between site A and site C is (A, C) B, the distance between site B and site A is (B, A) a, the distance between site B and site B is (B, B) 0, the distance between site B and site C is (B, C) C, the distance between site C and site A is (C, A) B, the distance between site C and site B is (C, B) C, the distance between site C and site C is (C, C) 0, and the distance sequence Y = { (A, A) 0 (A, B) a (A, C) B (B, A) a (B, B) 0 (B, C) C (C, A) B (C, C) 0}.

S20, inputting the name sequence and the distance sequence into a pre-trained path planning model, so that the path planning model reorders the name sequence based on the distance sequence and outputs a new name sequence, the new name sequence can enable the planned path distance of the access site to be the shortest, and the path planning model is obtained by training a pointer network type deep network in advance.

See figure 2 for a network structure diagram of a pointer network type of deep network. The pointer network type of deep network is an improved version of the attention mechanism network, which has two special recurrent neural networks, one of which encodes an input sequence as an encoder and converts it into a vector, and the other of which re-decodes the encoded vector as a decoder into a sequence and outputs it. Compared with the attention mechanism network, the output sequence and the input sequence of the network have one-to-one pointer relation, so that the input sequence and the output sequence of the network are identical in length and content and only different in arrangement order.

In the embodiment of the application, the network can be trained in a reinforcement learning mode, and the path planning model obtained after a period of training can be automatically approximated to a real optimal path planning strategy. The path planning model may encode the input name sequence and convert it into a vector, and re-decode the encoded vector into a new name sequence based on the input distance sequence and output. The new name sequence and the name sequence have the same length and content and are only different in arrangement order, and the planned path distance indicated by the new name sequence is shortest, namely an optimized path is planned.

In the specific implementation process, the deep network of the pointer network type can be repeatedly trained and modified in a reinforcement learning mode, so that the path planning strategy can be adjusted and improved according to actual conditions. In step S20, the process of training the pointer network type deep network in advance to obtain the path planning model may adopt the following steps, where the method flowchart is shown in fig. 2:

s201, acquiring training samples, wherein the training samples comprise a plurality of groups of name sequence samples and distance sequence samples of a training access site, and one group of name sequence samples corresponds to one planned training path and one distance sequence sample.

In the embodiment of the application, when training samples are accumulated for training of the pointer network type deep network, K groups of training access sites are assumed to be optimized in the training samples, and one group of training access sites corresponds to one training planning path. For each of the K sets of training access sites, performing integer encoding on the name of each of the K sets of training access sites, and recording a sequence containing the name encoding value of each of the K sets of training access sites as a name sequence sample; and simultaneously encoding the distance between every two stations in the group of training access stations, and recording a sequence containing the distance value between every two stations as a distance sequence sample.

S202, extracting a group of name sequence samples for the current training from the training samples, inputting the group of name sequence samples and the corresponding distance sequence samples into a depth network of a pointer network type, so that the depth network of the pointer network type outputs one target name sequence sample which can enable the corresponding planned training path distance to be converged in the group of name sequence samples based on the input distance sequence samples, wherein the initial value of the network parameter of the depth network of the pointer network type is preset.

In the embodiment of the application, a small batch of samples are randomly extracted from training samples to train the depth network of the pointer network type, and the number of the samples is set to be a fixed value B. This training is performed using a set of name sequence samples, denoted s, of randomly extracted samples _i The target name sequence sample of the depth network solving output of the pointer network type is recorded as o _i Wherein i is more than or equal to 1 and less than or equal to B.

It should be noted that, the network parameter θ of the deep network of the pointer network type is initialized in advance, and the network parameter determines the output characteristics of the network.

S203, the network parameters are adjusted by calculating the distance of the planned training path corresponding to the target name sequence sample, so that the depth network of the pointer network type re-executes the output of one target name sequence sample which can enable the corresponding planned training path distance to be converged in the group of name sequence samples based on the input distance sequence sample, the current training is ended when the ending condition of the current training is met, and a pointer is set for the final target name sequence sample output by the depth network of the pointer network type when the current training is ended, wherein the pointer is the basis or the basis of the depth network of the pointer network type for the re-training.

In the embodiment of the application, according to the conditional probability p (o) of the mathematical relationship between the input sequence and the output sequence of the depth network representing the pointer network type _i |s _i ) To calculate the target name sequence sample o _i Distance Dis (o) of representative planned training path _i |s _i )。

Further, the network parameters are adjusted through a preset parameter adjustment strategy corresponding to the distance of the planned path for training, and the depth network of the pointer network type is enabled to output a target name sequence sample which can enable the corresponding planned path distance for training to be converged again until the end condition of the training is met, for example, the iteration number reaches the maximum value, and for example, the training is ended when the planned path distance for training meets the performance index.

In some other embodiments, the "adjusting the network parameter by calculating the distance of the planned training path corresponding to the target name sequence sample" in step S203 may include the following steps:

calculating the distance of a planned training path corresponding to the target name sequence sample; calculating the reference distance of the group of name sequence samples for the training; calculating the gradient of the network parameters according to the distance of the planned path for training and the reference distance; the network parameters are updated based on the gradient of the network parameters.

In the embodiment of the application, the probability p (o) _i |s _i ) To calculate the distance Dis (o) of the planned training path _i |s _i )。

In the process of calculating the reference distance, if the training is the first training, determining the set of name sequence samples s by using a heuristic algorithm _i And taking the distance of the optimized path as the reference distance b of the set of name sequence samples in the training _i . Note that the heuristic algorithm may be one of genetic algorithm, simulated annealing, particle swarm algorithm, ant colony algorithm, and tabu search, but is not limited to the above algorithm.

If the training is not the first training, the reference distance used in the last training, namely the historical reference distance b, can be obtained _i ' and calculate the reference distance b of the group of name sequence samples trained this time according to the following formula (1) _i ：

b _i ＝b _i ′+α(Dis(o _i |s _i )-b _i ′) (1)

Wherein alpha is a preset learning rate.

The embodiment of the application sets the reference distance b _i The purpose of (a) is to ensure that the final optimization effect of deep reinforcement learning is at least greater than benchmark b in subsequent training _i Good.

Further, it can be expressed as follows(2) Calculating the gradient g of the network parameter θ _θ ：

Finally, the network parameter theta is updated by using a gradient descent method, and the updating process can adopt a traditional gradient descent method or an improved gradient updating method such as Newton method, ADAM (advanced dynamic random access memory) method and the like.

S204, taking the depth network of the pointer network type after multiple training as a path planning model.

In the embodiment of the application, a depth network of a pointer network type trained by using a small batch of samples randomly extracted from training samples is used as a path planning model. The path planning model is trained to achieve the optimal effect, and only the access site to be planned is required to be input into the path planning model in a sequence format, the path planning model can output a sequence representing the access sequence of the actual site, and the output sequence is the solution of the planned path with the shortest required distance.

After the path planning model is trained, an optimized path planning result can be directly calculated according to the access site to be planned, a large number of searching and exploring processes are not needed, and the quality of the solution is not lower than that of a heuristic algorithm for providing a reference. In addition, the path planning model is trained based on the intensity learning mode, parameters can be updated in real time, the planning performance of the model is improved, the model gradually adapts to a real scene along with the growth of time and experience, and the planning result is further improved to be closer to a theoretical optimal solution. Finally, the path planning model can be suitable for most combination optimization problems, and has strong algorithm universality.

S30, acquiring a planning path corresponding to the new name sequence.

In the embodiment of the application, one name code value in the name sequence uniquely corresponds to the name of one access site to be planned, and the new name sequence is identical with the length and the content of the name sequence and is only different in arrangement order, so that the order of accessing the access site to be planned can be obtained based on the order of the name code values in the new name sequence, thereby determining the corresponding planned path, and the distance of the planned path is shortest.

According to the vehicle path planning method provided by the embodiment of the application, the path planning model is obtained by training the depth network of the pointer network type, the name sequence and the distance sequence of the access site to be planned, which are obtained by encoding, are input into the path planning model, the name sequence is reordered by the path planning model based on the distance sequence, and a new name sequence which can enable the planned path distance of the access site to be shortest is output, so that the planned path corresponding to the new name sequence is obtained. Based on the application, the path planning strategy can be molded by the deep network, so that the optimization result of the vehicle path planning problem can be directly calculated without searching.

Based on the vehicle path planning method provided by the foregoing embodiment, an embodiment of the present application provides an apparatus for executing the vehicle path planning method, where a schematic structural diagram of the apparatus is shown in fig. 4, and the apparatus includes: the device comprises an encoding module 10, a path planning module 20 and a path acquisition module 30, wherein the path planning module 20 comprises a model training unit 201;

the model training unit 201 is configured to train the depth network of the pointer network type in advance to obtain a path planning model;

the encoding module 10 is configured to encode a name sequence of the access site to be planned, where the name sequence includes a name encoding value of each site in the access site, and a distance sequence including a distance value between every two sites in the access site;

the path planning module 20 is configured to input the name sequence and the distance sequence into the path planning model, so that the path planning model reorders the name sequence based on the distance sequence and outputs a new name sequence, where the new name sequence can minimize a planned path distance for accessing the site;

the path obtaining module 30 is configured to obtain a planned path corresponding to the new name sequence.

Optionally, the model training unit 201 is configured to train the depth network of the pointer network type in advance to obtain a path planning model, and is specifically configured to:

acquiring training samples, wherein the training samples comprise a plurality of groups of name sequence samples and distance sequence samples of access sites for training, and one group of name sequence samples corresponds to one planned training path and one distance sequence sample; extracting a group of name sequence samples for the current training from the training samples, inputting the group of name sequence samples and the corresponding distance sequence samples into a depth network of a pointer network type, so that the depth network of the pointer network type outputs one target name sequence sample which can enable the corresponding planned training path distance to be converged in the group of name sequence samples based on the input distance sequence samples, wherein the initial value of the network parameter of the depth network of the pointer network type is preset; the method comprises the steps that network parameters are adjusted through calculating the distance of a planned training path corresponding to a target name sequence sample, so that a depth network of a pointer network type re-executes the output of one target name sequence sample which can enable the distance of the corresponding planned training path to be converged in the group of name sequence samples based on the input distance sequence sample, the training is finished when the finishing condition of the training is met, a pointer is set for a final target name sequence sample output by the depth network of the pointer network type when the training is finished, and the pointer is the basis or the basis of the depth network of the pointer network type for the re-training; and taking the depth network of the pointer network type after multiple training as a path planning model.

Optionally, the model training unit 201 is configured to adjust a network parameter by calculating a distance of a planned training path corresponding to the target name sequence sample, and is specifically configured to:

Optionally, the model training unit 201 is configured to calculate a reference distance for training the set of name sequence samples this time, and is specifically configured to:

under the condition that the training is not the first training, acquiring a historical reference distance of the last training; and calculating the reference distance of the group of name sequence samples for the training by utilizing the distance of the planned training path and the historical reference distance.

Optionally, the model training unit 201 configured to calculate the reference distance for training the set of name sequence samples this time is further configured to:

According to the vehicle path planning device provided by the embodiment of the application, the path planning model is obtained by training the depth network of the pointer network type, the name sequence and the distance sequence of the access site to be planned, which are obtained by encoding, are input into the path planning model, the name sequence is reordered by the path planning model based on the distance sequence, and a new name sequence which can enable the planned path distance of the access site to be shortest is output, so that the planned path corresponding to the new name sequence is obtained. Based on the application, the path planning strategy can be molded by the deep network, so that the optimization result of the vehicle path planning problem can be directly calculated without searching.

The embodiment of the application also provides a computer readable storage medium, on which computer instructions are stored, which when run execute the steps of the vehicle path planning method shown in the embodiment of the application. The computer readable storage medium may include, for example, a non-volatile memory (non-volatile) or a non-transitory memory (non-transitory) and may also include an optical disc, a mechanical hard disc, a solid state hard disc, and the like.

The embodiment of the application also provides a terminal, which comprises a memory and a processor, wherein the memory stores computer instructions capable of running on the processor, and the processor executes the steps of the vehicle path planning method shown in the embodiment of the application when running the computer instructions. The terminal comprises, but is not limited to, a mobile phone, a computer, a tablet personal computer and other terminal equipment.

In this specification, each embodiment is described in a progressive manner, and identical and similar parts of each embodiment are all referred to each other, and each embodiment mainly describes differences from other embodiments. In particular, for a system or system embodiment, since it is substantially similar to a method embodiment, the description is relatively simple, with reference to the description of the method embodiment being made in part. The systems and system embodiments described above are merely illustrative, wherein the elements illustrated as separate elements may or may not be physically separate, and the elements shown as elements may or may not be physical elements, may be located in one place, or may be distributed over a plurality of network elements. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of this embodiment. Those of ordinary skill in the art will understand and implement the present application without undue burden.

Those of skill would further appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both, and that the various illustrative elements and steps are described above generally in terms of functionality in order to clearly illustrate the interchangeability of hardware and software. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the solution. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present application.

The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present application. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the application. Thus, the present application is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

The foregoing is merely a preferred embodiment of the present application and it should be noted that modifications and adaptations to those skilled in the art may be made without departing from the principles of the present application, which are intended to be comprehended within the scope of the present application.

Claims

1. A vehicle path planning method, the method comprising:

acquiring a planning path corresponding to the new name sequence;

the process of training the depth network of the pointer network type in advance to obtain the path planning model comprises the following steps:

2. The method of claim 1, wherein said adjusting the network parameters by calculating distances of the training planned paths corresponding to the target name sequence samples comprises:

3. The method of claim 2, wherein calculating the reference distance for the set of name sequence samples for the present training comprises:

4. A method according to claim 3, wherein said calculating the reference distance for the set of name sequence samples for the present training further comprises:

5. A vehicle path planning apparatus, the apparatus comprising: the system comprises an encoding module, a path planning module and a path acquisition module, wherein the path planning module comprises a model training unit;

the path acquisition module is used for acquiring a planning path corresponding to the new name sequence;

the model training unit is used for training the depth network of the pointer network type in advance to obtain a path planning model, and is specifically used for:

6. The apparatus according to claim 5, wherein the model training unit for adjusting the network parameters by calculating the distances of the planned training paths corresponding to the target name sequence samples is specifically configured to:

7. A computer readable storage medium having stored thereon computer instructions, which when run perform the steps of the vehicle path planning method of any of claims 1 to 4.

8. A terminal comprising a memory and a processor, the memory having stored thereon computer instructions executable on the processor, wherein the processor, when executing the computer instructions, performs the steps of the vehicle path planning method of any of claims 1 to 4.