CN109556609A

CN109556609A - A kind of collision prevention method and device based on artificial intelligence

Info

Publication number: CN109556609A
Application number: CN201811358583.XA
Authority: CN
Inventors: 徐应年; 铁井华; 邹绍云; 罗永涛
Original assignee: WUHAN NANHUA INDUSTRIAL EQUIPMENT ENGINEERING Co Ltd
Current assignee: WUHAN NANHUA INDUSTRIAL EQUIPMENT ENGINEERING Co Ltd
Priority date: 2018-11-15
Filing date: 2018-11-15
Publication date: 2019-04-02
Anticipated expiration: 2038-11-15
Also published as: CN109556609B

Abstract

The collision prevention method and device based on artificial intelligence that the invention discloses a kind of.Wherein, this method comprises: establishing collision prevention Policy model；The collision prevention Policy model is sent by current motion data, obtains preferred solution space；Monte Carlo tree search algorithm is applied in the preferred solution space, adjusting parameter is calculated, the motion model of analytical form is not needed to establish not only, and the low defect with poor robustness of control precision that accurate manipulation mathematical model can not be established under the conditions of the large disturbances such as stormy waves stream is also overcomed, it realizes and improves intelligence, the technical effect of stability and applicability.

Description

A kind of collision prevention method and device based on artificial intelligence

Technical field

The present invention relates to artificial intelligence and collision prevention technical field more particularly to a kind of collision prevention method based on artificial intelligence and Device.

Background technique

Research of the country in terms of water surface unmanned boat control technology is started late, but from initial conceptual phase by Gradual transition is to the practice stage.Due to the complexity and non-intellectual of water surface unmanned boat working environment, need to continuously improve with it is complete It is apt to existing smart system structure, is promoted to following predictive ability, the independent learning ability of strengthen the system makes intelligence system more It is forward-looking.

There are many information content of required processing when ship collision prevention, and operation is considerably complicated, the research of relevant avoidance algorithm technology Need to be goed deep into.Since current avoidance algorithm has limitation, lead to ship collision prevention mode not intelligence, and most of algorithms Just for the collision prevention under broad sea area, the less environment of dynamic barrier.

Therefore, a kind of temporarily good without intelligence, stability is strong, suitable for the collision prevention method under Various Complex sea situation.

Summary of the invention

The present invention solves intelligent in the prior art by providing a kind of collision prevention method and device based on artificial intelligence It is low, stability is poor, can not be suitable for the technical issues of Various Complex sea situation, realize raising intelligence, stability and applicability Technical effect.

The collision prevention method based on artificial intelligence that the present invention provides a kind of, comprising:

Establish collision prevention Policy model；

The collision prevention Policy model is sent by current motion data, obtains preferred solution space；

Monte Carlo tree search algorithm is applied in the preferred solution space, adjusting parameter is calculated.

Further, described to send the collision prevention Policy model for current motion data, it obtains preferred solution space, wraps It includes:

The collision prevention Policy model is sent by the current motion data, obtain the preferred solution space and estimates movement Data；

Further include:

Establish evaluation network model relevant to automatic obstacle avoiding performance indicator；

The exercise data of estimating is sent in the evaluation network model, obtains assessed value；

It is described to apply to Monte Carlo tree search algorithm in the preferred solution space, adjusting parameter is calculated, comprising:

The Monte Carlo tree search algorithm is applied in the preferred solution space, the adjusting parameter is calculated；

The adjusting parameter is adjusted based on the assessed value, obtains optimal correction parameter.

It is further, described to establish evaluation network model relevant to automatic obstacle avoiding performance indicator, comprising:

Pass through formulaIt establishes and path length f₁Relevant evaluation network model；

Wherein, d_iFor the path point at i moment and the distance between the path point at i+1 moment, expression formula isIn formula, P_iFor the path point at i moment, P_iCoordinate be (x_i, y_i)；P_i+1For the path point at i+1 moment, P_i+1Coordinate be (x_i+1,y_i+1)。

Pass through formulaN > 0 is established and path smooth degree f₂Relevant evaluation network model；

Wherein, α_iFor the path point P at i moment_iThe turning at place, expression formula areIn formula, P_iFor the path point at i moment, P_iCoordinate be (x_i,y_i)；P_i+1For the path point at i+1 moment, P_i+1Coordinate be (x_i+1, y_i+1)；For from path point P_i-1To path point P_iVector, | P_i-1P_i| it is vectorLength；For from path Point P_iTo path point P_i+1Vector, | P_iP_i+1| it is vectorLength；K is the number at the turning more than or equal to pi/2.

Pass through formulaIt establishes and path safety f₃Relevant evaluation network model；

Wherein, λ is weight regulation coefficient；M is the second punishment parameter；D is all path points being averaged most apart from barrier Short distance, expression formula areIn formula, n is the path point number in path in addition to starting point and target point；d_iFor The shortest distance between path point and barrier, expression formula are WithRespectively For path point P_iThe two-end-point of place linkage lines.

The present invention also provides a kind of anticollision devices based on artificial intelligence, comprising:

Collision prevention Policy model establishes module, for establishing collision prevention Policy model；

Collision prevention Policy model computing module obtains excellent for sending the collision prevention Policy model for current motion data Select solution space；

Adjustment is calculated for applying to Monte Carlo tree search algorithm in the preferred solution space in computing module Parameter.

Further, the collision prevention Policy model computing module, comprising:

First model arithmetic unit obtains institute for sending the collision prevention Policy model for the current motion data State preferred solution space；

Second model arithmetic unit obtains pre- for sending the collision prevention Policy model for the current motion data Estimate exercise data；

Further include:

Evaluation network model establishes module, for establishing evaluation network model relevant to automatic obstacle avoiding performance indicator；

Network model computing module is evaluated, for sending the exercise data of estimating in the evaluation network model, is obtained Assessed value out；

The computing module, comprising:

Computing unit is calculated for applying to the Monte Carlo tree search algorithm in the preferred solution space The adjusting parameter；

Adjustment unit obtains optimal correction parameter for being adjusted based on the assessed value to the adjusting parameter.

Further, the evaluation network model establishes module, comprising:

First evaluation network model establishes unit, for passing through formulaIt establishes and path length f₁It is related Evaluation network model；Wherein, d_iFor the path point at i moment and the distance between the path point at i+1 moment, expression formula isIn formula, P_iFor the path point at i moment, P_iCoordinate be (x_i, y_i)；P_i+1For the path point at i+1 moment, P_i+1Coordinate be (x_i+1,y_i+1)。

Further, the evaluation network model establishes module, comprising:

Second evaluation network model establishes unit, for passing through formulaN > 0 is established and path Smoothness f₂Relevant evaluation network model；Wherein, α_iFor the path point P at i moment_iThe turning at place, expression formula areIn formula, P_iFor the path point at i moment, P_iCoordinate be (x_i,y_i)；P_i+1For the i+1 moment Path point, P_i+1Coordinate be (x_i+1,y_i+1)；For from path point P_i-1To path point P_iVector, | P_i-1P_i| for AmountLength；For from path point P_iTo path point P_i+1Vector, | P_iP_i+1| it is vectorLength Degree；K is the number at the turning more than or equal to pi/2.

Further, the evaluation network model establishes module, comprising:

Third evaluation network model establishes unit, for passing through formulaIt establishes and path safety f₃It is related Evaluation network model；Wherein, λ is weight regulation coefficient；M is the second punishment parameter；D is all path points apart from barrier Average shortest distance, expression formula areIn formula, n is the path point in path in addition to starting point and target point Number；d_iThe shortest distance between path point and barrier, expression formula are WithRespectively path point P_iThe two-end-point of place linkage lines.

One or more technical solution provided in the present invention, has at least the following technical effects or advantages:

The present invention establishes the collision prevention Policy model based on deep learning (convolutional neural networks), and by current motion data It is sent to collision prevention Policy model, obtains preferred solution space；Monte Carlo tree search algorithm is applied in preferred solution space again, is counted Calculation is adjusted parameter, does not need the motion model for establishing analytical form not only, but also also overcomes in large disturbances such as stormy waves streams Under the conditions of can not establish accurate manipulation mathematical model control precision is low and the defect of poor robustness, realize raising intelligence, The technical effect of stability and applicability.

Detailed description of the invention

Fig. 1 is the flow chart of the collision prevention method provided in an embodiment of the present invention based on artificial intelligence；

Fig. 2 is the schematic illustration of the collision prevention method provided in an embodiment of the present invention based on artificial intelligence；

Fig. 3 is the process that Monte Carlo tree is established in the collision prevention method provided in an embodiment of the present invention based on artificial intelligence Figure；

Fig. 4 is the turning schematic diagram in the collision prevention method provided in an embodiment of the present invention based on artificial intelligence；

Fig. 5 is the path point in the collision prevention method provided in an embodiment of the present invention based on artificial intelligence from barrier most short distance From schematic diagram；

Fig. 6 is the module map of the anticollision device provided in an embodiment of the present invention based on artificial intelligence.

Specific embodiment

The embodiment of the present invention solves in the prior art by providing a kind of collision prevention method and device based on artificial intelligence Intelligence it is low, stability is poor, can not be suitable for Various Complex sea situation the technical issues of, realize raising intelligence, stability and The technical effect of applicability.

Technical solution in the embodiment of the present invention is to solve the above problems, general thought is as follows:

The embodiment of the present invention establishes the collision prevention Policy model based on deep learning (convolutional neural networks), and will currently transport Dynamic data are sent to collision prevention Policy model, obtain preferred solution space；It is empty that Monte Carlo tree search algorithm is applied into preferred solution again Between in, adjusting parameter is calculated, does not need the motion model for establishing analytical form not only, but also also overcomes in stormy waves stream etc. The low defect with poor robustness of control precision that accurate manipulation mathematical model can not be established under the conditions of large disturbances, realizes raising intelligence It can property, the technical effect of stability and applicability.

Above-mentioned technical proposal in order to better understand, in conjunction with appended figures and specific embodiments to upper Technical solution is stated to be described in detail.

Referring to Fig. 1 and Fig. 2, the collision prevention method provided in an embodiment of the present invention based on artificial intelligence, comprising:

Step S110: collision prevention Policy model is established；

This step is illustrated:

Receive original aeronautical data；

Collision prevention Policy model is established based on original aeronautical data.

In the present embodiment, original aeronautical data be at least it is below any one:

The speed of a ship or plane of aircraft, the course of aircraft, wind speed, wind direction, the position of static obstruction, static obstruction and navigation The relative bearing of device, static obstruction and the relative velocity of aircraft, the position of dynamic obstruction, dynamic obstruction and aircraft Relative bearing, the quantity of the relative velocity of dynamic obstruction and aircraft, obstruction.

In order to improve the operation accuracy of collision prevention Policy model, the collision prevention Policy model of foundation is learnt.

Specifically learning process includes:

History collision prevention case data is input in collision prevention Policy model, with determine weight optimal inside prototype network and Excitation function.Collision prevention Policy model at this time is the network of a deep learning.

In the present embodiment, history collision prevention case data be at least it is below any one:

Course during the longitude and latitude of obstruction, the longitude and latitude of collision prevention route, collision prevention, the speed of a ship or plane during collision prevention.

What needs to be explained here is that expert's steering data, Collision Accidents of Ships case data, International Maritime can also be collected The data such as collision regulation construct data sample library, for exercising supervision study to collision prevention Policy model.

In order to realize the scalability of the embodiment of the present invention, in data sample library spare interface, other rule can be accessed at any time Then it is used as sample.

Collision prevention Policy model setting constraint in order to further increase the operation accuracy of collision prevention Policy model, to succeeding in school Condition.

In the present embodiment, constraint condition is the course of aircraft and/or the solution space of the speed of a ship or plane.

Step S120: collision prevention Policy model is sent by current motion data, obtains preferred solution space；

Specifically, collision prevention Policy model is sent by current motion data, autonomous learning is carried out based on constraint condition, is obtained It is preferred that solution space.

Step S130: Monte Carlo tree search algorithm is applied in preferred solution space, adjusting parameter is calculated.

This step is illustrated:

Referring to Fig. 3, rectangle indicates root node, and the foundation of tree is extended downwards by root node.The state of the node is usually Refer to that aircraft encounters obstruction and need to carry out collision prevention.Ellipse indicates that child node, child node are generating state transfers during collision prevention General node.The transfer between node is generated when aircraft selection movement, the node is in store to be taken for a period of time at certain Collision prevention strategy (i.e. solution space).Triangle indicate leaf node, represent search tree reach aircraft movement boundary or Uncertain environment, the state of the node are divided into collision prevention success and collision prevention two kinds of situations of failure.Monte Carlo tree search algorithm is general It is divided into 4 stages, i.e. choice phase, extension phase, dummy run phase and backtracking more new stage.Algorithm can be repeatedly carried out this 4 Stage, until meeting some specific condition of collision prevention.

In order to keep the adjusting parameter finally obtained optimal, step S120 is specifically included:

Collision prevention Policy model is sent by current motion data, obtain preferred solution space and estimates exercise data；

The embodiment of the invention also includes:

Exercise data will be estimated to send in evaluation network model, obtain assessed value；

Then step S130 is specifically included:

Monte Carlo tree search algorithm is applied in preferred solution space, adjusting parameter is calculated；

Adjusting parameter is adjusted based on assessed value, obtains optimal correction parameter.

It should be noted that evaluation network model can be established simultaneously with collision prevention Policy model, it can also be in collision prevention strategy It establishes, can also be established after collision prevention Policy model, the embodiment of the present invention is not particularly limited this before model.

In the present embodiment, automatic obstacle avoiding performance indicator includes at least: path length, path smooth degree and path safety Property.

Specifically, the detailed process of evaluation network model relevant to path length is established are as follows:

In the same circumstances, navigation path length is shorter, and hours underway is shorter, evaluates higher.

Establish the detailed process of evaluation network model relevant to path smooth degree are as follows:

Wherein, referring to fig. 4, α_iFor the path point P at i moment_iThe turning at place, expression formula are In formula, P_iFor the path point at i moment, P_iCoordinate be (x_i,y_i)；P_i+1For the path at i+1 moment Point, P_i+1Coordinate be (x_i+1,y_i+1)；For from path point P_i-1To path point P_iVector, | P_i-1P_i| it is vectorLength；For from path point P_iTo path point P_i+1Vector, | P_iP_i+1| it is vectorLength；k For the number at the turning more than or equal to pi/2, also referred to as the first punishment parameter, i.e., when a certain turning is greater than or equal to pi/2, Target value is punished；As n=0, path is line of the starting point to target point, path smooth degree f₂Value be 0.

Path smooth degree f₂It is indicated with turning average value, turning average value is smaller, turns more smooth, then path is more flat It is sliding, it evaluates higher.

Establish the detailed process of evaluation network model relevant to path safety are as follows:

Wherein, referring to Fig. 5, λ is weight regulation coefficient, for solving to seek average distance what the value is too small after inverse to ask Topic, λ random value in 0-1；M is the second punishment parameter；In the present embodiment, m is zero for the shortest distance to barrier The number of path point.D is average shortest distance of all path points apart from barrier, and expression formula isIn formula, n For the path point number in path in addition to starting point and target point；d_iThe shortest distance between path point and barrier, Expression formula isTakeWithSmaller value in two distances；WithRespectively path point P_iThe two-end-point of place linkage lines.

In the same circumstances, average shortest distance of all path points apart from barrier is longer, then path is safer, evaluation It is higher.

It should be noted that can will estimate exercise data be sent to evaluation network model relevant to path length, with These three evaluation network models of the relevant evaluation network model of path smooth degree, evaluation network model relevant to path safety In any one or any two or three in.Correspondingly, an assessed value or two assessed values or three are based on A assessed value is adjusted adjusting parameter, obtains optimal correction parameter.It follows that adjusted based on three assessed values Adjusting parameter is optimal.

Referring to Fig. 6, the anticollision device provided in an embodiment of the present invention based on artificial intelligence, comprising:

Collision prevention Policy model establishes module 100, for establishing collision prevention Policy model；

Specifically, collision prevention Policy model establishes module 100, comprising:

Data receipt unit, for receiving original aeronautical data；

Execution unit is established, for establishing collision prevention Policy model based on original aeronautical data.

In order to improve the operation accuracy of collision prevention Policy model, further includes:

Model learning module, for learning to the collision prevention Policy model of foundation.

In the present embodiment, model learning module, specifically for history collision prevention case data is input to collision prevention strategy mould In type, to determine weight and excitation function optimal inside prototype network.Collision prevention Policy model at this time is a deep learning Network.

Specifically, history collision prevention case data be at least it is below any one:

In order to exercise supervision study to collision prevention Policy model, further includes:

Data collection module, for collecting expert's steering data, Collision Accidents of Ships case data, International Maritime collision prevention rule The data such as then；

Sample database constructs module, for constructing data sample library.

In order to further increase the operation accuracy of collision prevention Policy model, further includes:

Constraint condition setting module, for setting constraint condition to the collision prevention Policy model succeeded in school.

Collision prevention Policy model computing module 200 obtains preferably for sending collision prevention Policy model for current motion data Solution space；

In the present embodiment, collision prevention Policy model computing module 200, specifically for sending collision prevention for current motion data Policy model carries out autonomous learning based on constraint condition, obtains preferred solution space.

Adjustment ginseng is calculated for applying to Monte Carlo tree search algorithm in preferred solution space in computing module 300 Number.

This step is illustrated:

In the present embodiment, collision prevention Policy model computing module 200, comprising:

First model arithmetic unit obtains preferred solution space for sending collision prevention Policy model for current motion data；

Second model arithmetic unit obtains for sending collision prevention Policy model for current motion data and estimates movement number According to；

In order to keep the adjusting parameter finally obtained optimal, the embodiment of the invention also includes:

Network model computing module is evaluated, is sent in evaluation network model for exercise data will to be estimated, obtains assessed value；

Then computing module 300, comprising:

Adjusting parameter is calculated for applying to Monte Carlo tree search algorithm in preferred solution space in computing unit；

Adjustment unit obtains optimal correction parameter for being adjusted based on assessed value to adjusting parameter.

Specifically, evaluation network model establishes module, comprising:

First evaluation network model establishes unit, for passing through formulaIt establishes and path length f₁It is related Evaluation network model；Wherein, d_iFor the path point at i moment and the distance between the path point at i+1 moment, expression formula isIn formula, P_iFor the path point at i moment, P_iCoordinate be (x_i,y_i)；P_i+1For the path point at i+1 moment, P_i+1Coordinate be (x_i+1, y_i+1).In the same circumstances, navigation path length is got over Short, hours underway is shorter, evaluates higher.

Second evaluation network model establishes unit, for passing through formulaN > 0 is established and path Smoothness f₂Relevant evaluation network model；Wherein, α_iFor the path point P at i moment_iThe turning at place, expression formula areIn formula, P_iFor the path point at i moment, P_iCoordinate be (x_i,y_i)；P_i+1For the i+1 moment Path point, P_i+1Coordinate be (x_i+1, y_i+1)；For from path point P_i-1To path point P_iVector, | P_i-1P_i| for AmountLength；For from path point P_iTo path point P_i+1Vector, | P_iP_i+1| it is vectorLength Degree；K be more than or equal to pi/2 turning number, also referred to as the first punishment parameter, i.e., when a certain turning be greater than or equal to π/ When 2, target value is punished；As n=0, path is line of the starting point to target point, path smooth degree f₂Value be 0. In the present embodiment, path smooth degree f₂It is indicated with turning average value, turning average value is smaller, turns more smooth, then path It is more smooth, it evaluates higher.

Third evaluation network model establishes unit, for passing through formulaIt establishes and path safety f₃It is related Evaluation network model；Wherein, λ is weight regulation coefficient, and for solving the problems, such as to seek average distance, the value is too small after inverse, λ random value in 0-1；M is the second punishment parameter；In the present embodiment, m is the path for being zero to the shortest distance of barrier The number of point.D is average shortest distance of all path points apart from barrier, and expression formula isIn formula, n is road Path point number in diameter in addition to starting point and target point；d_iThe shortest distance between path point and barrier, expression Formula isIt takesWithSmaller value in two distances；WithRespectively For path point P_iThe two-end-point of place linkage lines.In the same circumstances, average shortest distance of all path points apart from barrier Longer, then path is safer, evaluates higher.

It establishes unit it should be noted that can will estimate exercise data and be sent to the first evaluation network model, second comment Valence network model establish unit, third evaluation network model establish unit these three evaluation network models in any one or In any two or three.Correspondingly, adjustment is joined based on an assessed value or two assessed values or three assessed values Number is adjusted, and obtains optimal correction parameter.It follows that optimal based on the adjusting parameter that three assessed values adjust.

[technical effect]

1, the embodiment of the present invention establishes the collision prevention Policy model based on deep learning (convolutional neural networks), and will be current Exercise data is sent to collision prevention Policy model, obtains preferred solution space；Monte Carlo tree search algorithm is applied into preferred solution again In space, adjusting parameter is calculated, does not need the motion model for establishing analytical form not only, but also also overcomes in stormy waves stream The low defect with poor robustness of control precision that accurate manipulation mathematical model can not be established under the conditions of equal large disturbances, realizes raising Intelligence, the technical effect of stability and applicability.

2, the assessed value that Utilization assessment network model provides assesses the movement selection of aircraft, in conjunction with Monte Carlo Tree search algorithm updates the evaluation of estimate in entire path under the movement, and the course until searching and speed of a ship or plane value restrain, and as most Excellent adjusting parameter is given to executing agency, improves the validity of collision prevention.

3, expert's steering data (including real ship sea examination, historical experience etc.), accident case, International Regulations for Preventing Collisions at Sea are based on Etc. data establish training sample database, establish the collision prevention Policy model of basic depth enhancing study, not only sufficiently combine expert behaviour The relevant history information of rudder experience and accident case improves the feasibility of Decision of Collision Avoidance；And have also combined International Maritime Collision regulation is more in line with practical collision prevention situation, further improves the applicability of collision prevention.

4, in data sample library spare interface, other rules can be accessed at any time as sample, realize the embodiment of the present invention Scalability, further improve the applicability of the embodiment of the present invention.

The embodiment of the present invention is applicable not only to broad sea area, but also suitable for the complicated ocean such as high sea condition, narrow water Environment, the effective guarantee navigation safety of ship.

It should be understood by those skilled in the art that, the embodiment of the present invention can provide as method, system or computer program Product.Therefore, complete hardware embodiment, complete software embodiment or reality combining software and hardware aspects can be used in the present invention Apply the form of example.Moreover, it wherein includes the computer of computer usable program code that the present invention, which can be used in one or more, The computer program implemented in usable storage medium (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.) produces The form of product.

The present invention be referring to according to the method for the embodiment of the present invention, the process of equipment (system) and computer program product Figure and/or block diagram describe.It should be understood that every one stream in flowchart and/or the block diagram can be realized by computer program instructions The combination of process and/or box in journey and/or box and flowchart and/or the block diagram.It can provide these computer programs Instruct the processor of general purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices to produce A raw machine, so that being generated by the instruction that computer or the processor of other programmable data processing devices execute for real The device for the function of being specified in present one or more flows of the flowchart and/or one or more blocks of the block diagram.

These computer program instructions, which may also be stored in, is able to guide computer or other programmable data processing devices with spy Determine in the computer-readable memory that mode works, so that it includes referring to that instruction stored in the computer readable memory, which generates, Enable the manufacture of device, the command device realize in one box of one or more flows of the flowchart and/or block diagram or The function of being specified in multiple boxes.

These computer program instructions also can be loaded onto a computer or other programmable data processing device, so that counting Series of operation steps are executed on calculation machine or other programmable devices to generate computer implemented processing, thus in computer or The instruction executed on other programmable devices is provided for realizing in one or more flows of the flowchart and/or block diagram one The step of function of being specified in a box or multiple boxes.

Although preferred embodiments of the present invention have been described, it is created once a person skilled in the art knows basic Property concept, then additional changes and modifications may be made to these embodiments.So it includes excellent that the following claims are intended to be interpreted as It selects embodiment and falls into all change and modification of the scope of the invention.

Obviously, various changes and modifications can be made to the invention without departing from essence of the invention by those skilled in the art Mind and range.In this way, if these modifications and changes of the present invention belongs to the range of the claims in the present invention and its equivalent technologies Within, then the present invention is also intended to include these modifications and variations.

Claims

1. a kind of collision prevention method based on artificial intelligence characterized by comprising

Establish collision prevention Policy model；

2. the method as described in claim 1, which is characterized in that described to send the collision prevention strategy mould for current motion data Type obtains preferred solution space, comprising:

The collision prevention Policy model is sent by the current motion data, obtain the preferred solution space and estimates movement number According to；

Further include:

3. method according to claim 2, which is characterized in that described to establish evaluation net relevant to automatic obstacle avoiding performance indicator Network model, comprising:

Wherein, d_iFor the path point at i moment and the distance between the path point at i+1 moment, expression formula isIn formula, P_iFor the path point at i moment, P_iCoordinate be (x_i,y_i)；P_i+1For the path point at i+1 moment, P_i+1Coordinate be (x_i+1, y_i+1)。

4. method according to claim 2, which is characterized in that described to establish evaluation net relevant to automatic obstacle avoiding performance indicator Network model, comprising:

Wherein, α_iFor the path point P at i moment_iThe turning at place, expression formula areIn formula, P_iFor i The path point at moment, P_iCoordinate be (x_i,y_i)；P_i+1For the path point at i+1 moment, P_i+1Coordinate be (x_i+1, y_i+1)；For from path point P_i-1To path point P_iVector, | P_i-1P_i| it is vectorLength；For from path Point P_iTo path point P_i+1Vector, | P_iP_i+1| it is vectorLength；K is the number at the turning more than or equal to pi/2.

5. method according to claim 2, which is characterized in that described to establish evaluation net relevant to automatic obstacle avoiding performance indicator Network model, comprising:

Wherein, λ is weight regulation coefficient；M is the second punishment parameter；D is average most short distance of all path points apart from barrier From expression formula isIn formula, n is the path point number in path in addition to starting point and target point；d_iFor path The shortest distance between point and barrier, expression formula are WithRespectively road Diameter point P_iThe two-end-point of place linkage lines.

6. a kind of anticollision device based on artificial intelligence characterized by comprising

Collision prevention Policy model computing module obtains preferred solution for sending the collision prevention Policy model for current motion data Space；

Adjusting parameter is calculated for applying to Monte Carlo tree search algorithm in the preferred solution space in computing module.

7. device as claimed in claim 6, which is characterized in that the collision prevention Policy model computing module, comprising:

First model arithmetic unit obtains described excellent for sending the collision prevention Policy model for the current motion data Select solution space；

Second model arithmetic unit obtains for sending the collision prevention Policy model for the current motion data and estimates fortune Dynamic data；

Further include:

Network model computing module is evaluated, for sending the exercise data of estimating in the evaluation network model, obtains and comments Valuation；

The computing module, comprising:

Computing unit is calculated described for applying to the Monte Carlo tree search algorithm in the preferred solution space Adjusting parameter；

8. device as claimed in claim 7, which is characterized in that the evaluation network model establishes module, comprising:

First evaluation network model establishes unit, for passing through formulaIt establishes and path length f₁It is relevant to comment Valence network model；Wherein, d_iFor the path point at i moment and the distance between the path point at i+1 moment, expression formula isIn formula, P_iFor the path point at i moment, P_iCoordinate be (x_i,y_i)；P_i+1For the path point at i+1 moment, P_i+1Coordinate be (x_i+1,y_i+1)。

9. device as claimed in claim 7, which is characterized in that the evaluation network model establishes module, comprising:

Second evaluation network model establishes unit, for passing through formulaN > 0 is established and path smooth Spend f₂Relevant evaluation network model；Wherein, α_iFor the path point P at i moment_iThe turning at place, expression formula areIn formula, P_iFor the path point at i moment, P_iCoordinate be (x_i,y_i)；P_i+1For the i+1 moment Path point, P_i+1Coordinate be (x_i+1, y_i+1)；For from path point P_i-1To path point P_iVector, | P_i-1P_i| for AmountLength；For from path point P_iTo path point P_i+1Vector, | P_iP_i+1| it is vectorLength； K is the number at the turning more than or equal to pi/2.

10. device as claimed in claim 7, which is characterized in that the evaluation network model establishes module, comprising:

Third evaluation network model establishes unit, for passing through formulaIt establishes and path safety f₃Relevant evaluation Network model；Wherein, λ is weight regulation coefficient；M is the second punishment parameter；D is all path points being averaged most apart from barrier Short distance, expression formula areIn formula, n is the path point number in path in addition to starting point and target point；d_iFor The shortest distance between path point and barrier, expression formula are WithRespectively For path point P_iThe two-end-point of place linkage lines.