CN114442625A

CN114442625A - Environment map construction method and device based on multi-strategy joint control agent

Info

Publication number: CN114442625A
Application number: CN202210076715.XA
Authority: CN
Inventors: 王勇; 肖德虎; 王博; 陈珺
Original assignee: China University of Geosciences
Current assignee: China University of Geosciences
Priority date: 2022-01-24
Filing date: 2022-01-24
Publication date: 2022-05-06
Anticipated expiration: 2042-01-24
Also published as: CN114442625B

Abstract

The invention relates to an environment map construction method and device based on a multi-strategy joint control agent, which mainly comprises three stages of boundary detection, global detection and map construction. Firstly, detecting the boundary of an unknown environment by adopting an improved wall-following strategy; secondly, dividing the area surrounded by the boundary into grids, sequentially taking a randomly selected target point in each grid as an end point of the movement of the intelligent body by using a simplified spiral traversal strategy, and controlling the movement of the intelligent body by combining a D × Lite algorithm, so as to reduce the time for completing the global detection of the unknown environment; and finally, generating obstacle distribution and concentration field distribution of the unknown environment through a space occupying grid map algorithm and an RBF neural network respectively. The invention realizes the detection of the whole unknown environment by combining a plurality of strategy control agents, improves the calculation efficiency and can completely and accurately construct an environment map.

Description

Environment map construction method and device based on multi-strategy joint control agent

Technical Field

The invention belongs to the field of intersection of information science and environmental science, and particularly relates to an environmental map construction method and device based on a multi-strategy joint control agent.

Background

Environmental monitoring is of great importance to public safety and the sustainability of ecosystems. Pollution source localization is one of the most critical issues in environmental monitoring applications. By locating the pollution source, effective measures can be taken in time to prevent further expansion of pollution and reduce the risk of leakage of harmful substances. In recent years, the implementation of automatic search and localization of pollution sources based on agents has become a new direction of research. Problems such as obstacle avoidance and path planning are involved in the positioning process of the pollution source of the intelligent agent, and specific information of the monitored environment needs to be known in advance. In the practical application process, the monitoring environment is mostly unknown, so that the intelligent body needs to be controlled to move through an intelligent algorithm, and an environment map is quickly and accurately constructed.

The environment map construction is a process that an intelligent agent detects environment information by utilizing a sensor carried by the intelligent agent and establishes a recognizable model. At present, the environment map construction method can be roughly divided into three categories: a geometric map based method, a topological map based method and a grid map based method. The first two methods are difficult to meet the actual application requirements in the aspects of precision and real-time performance, and the method based on the grid map has no limitation, can be used for positioning and navigation of an intelligent agent, and is the most widely applied method at present. The grid map-based method mainly includes a rapid SLAM (Simultaneous Localization And Mapping), a micro SLAM, a Cartographer algorithm, And the like. However, the three methods all involve motion estimation and update matching, which cannot guarantee robustness and computational efficiency at the same time, and thus are difficult to meet the requirements of practical applications. In addition, the D × Lite algorithm is often used for dynamic path planning in an unknown environment, but the method can only detect a grid region near a path planned in advance, cannot completely detect the whole environment, and cannot solve the problem that a target point is unknown. In order to detect the environment as much as possible, multiple applications are often needed, so that part of the environment is repeatedly detected, and the calculation efficiency of the algorithm is reduced.

Disclosure of Invention

The invention aims to solve the technical problems that the existing method is low in calculation efficiency, incomplete in map construction and the like, and provides a new method capable of effectively considering both accuracy and timeliness. In order to achieve the technical purpose, the invention provides an environment map construction method and device based on a multi-strategy joint control agent.

According to one aspect of the invention, the invention provides an environment map construction method based on a multi-strategy joint control agent, which comprises the following steps:

step S1: based on an intelligent agent sensing model, an improved wall-following strategy is adopted to control the intelligent agent to move for a circle along the boundary of an unknown environment for boundary detection, and the boundary of the unknown environment is obtained according to the pose of the intelligent agent and the measurement data analysis of the laser radar in the boundary detection process;

step S2: dividing a region surrounded by a boundary into a plurality of grids, randomly selecting a target point in each grid, and sequentially taking each target point as an end point of the motion of the intelligent agent by adopting a simplified spiral traversal strategy;

step S3: dynamically planning a path from the intelligent agent to each terminal by adopting a D-Lite algorithm, and sequentially reaching target points by the intelligent agent according to the planned trajectory to carry out global detection;

step S4: generating an obstacle distribution map of an unknown environment by adopting an occupancy grid map algorithm based on the pose of the intelligent agent and the measurement data of the laser radar in the global detection process;

step S5: training the RBF neural network based on the self position information recorded in the intelligent body movement process and the collected pollutant concentration, and after the training is finished, fitting the trained RBF neural network to generate a concentration distribution map of an unknown environment;

step S6: and combining the barrier distribution map and the concentration distribution map to generate an environment map, and completing the map construction of the unknown environment.

Preferably, in step S1, the agent is a mobile carrier carrying two ultrasonic sensors, a lidar, a concentration sensor and a photoelectric encoder; the two ultrasonic sensors are respectively positioned in front of and on the right side of the intelligent agent and used for detecting whether obstacles exist in the front and on the right side; the concentration sensor, the laser radar and the photoelectric encoder are located at the central position of the intelligent body, the concentration sensor is used for collecting pollutant concentration, the laser radar is used for scanning the whole environment and calculating and detecting the distribution condition of the obstacles, and the photoelectric encoder is used for calculating the pose of the intelligent body and comprises coordinates and angles.

Preferably, in step S1, the improved wall-following strategy includes:

the intelligent body is placed at any boundary point, the front direction and the back direction are parallel to the boundary, the intelligent body is controlled to advance or turn according to the detection distance result of the ultrasonic sensor, and the intelligent body is guaranteed to be always on the left side of the boundary along the movement direction.

Preferably, in step S1, the improved wall-following strategy includes:

the zone bit of the ultrasonic sensor i is set as U_iThe values are:

where l is the distance threshold, d is the distance between the agent and the boundary, when U_iIf the value is 1, the ultrasonic sensor i faces to the boundary, otherwise, the boundary is absent;

when the intelligent object detects different boundary conditions, the zone bit U₁U₂There are four combinations, namely 00, 01, 10 and 11;

if the current movement direction of the intelligent agent is J, the value range is 0-3, and the current movement direction is corresponding to the lower direction, the right direction, the upper direction and the left direction respectively, then the formula for controlling the movement direction change of the intelligent agent according to the marker bit combination is as follows:

where Δ J is a motion direction change value, i.e., the difference between the new motion direction and the current motion direction.

Preferably, in step S2, the simplified spiral traversal strategy includes:

and selecting a clockwise direction, taking each target point as the end point of the movement of the intelligent agent in sequence, and ensuring that the target point in each grid can only be visited once.

Preferably, step S4 includes:

in the global detection process, the scanning angle range of the laser radar is set to be-23 degrees to 23 degrees, the stepping angle is set to be 2.86 degrees, namely the scanning range of the laser radar is divided into 16 different scanning angles;

measuring data in a laser beam coverage range are obtained through laser radar scanning, and the probability p (I | z) that each point in an unknown environment is occupied by an obstacle is calculated by adopting an occupancy grid algorithm_t) The concrete formula is as follows:

where I denotes the scanned point, z_tIs a set of the measurement data of the lidar and the pose of the agent at time t, r_maxIs the farthest distance, r, that the lidar can detect_IIs the distance from the center of the laser radar to I, phi_IIs the relative angle between the lidar and I, phi_kIs phi_IThe closest scan angle, k is the index of the scan angle, r_kIs phi_kDetection range of the laser radar in the direction, beta being the step angle of the scanning, p_minAnd p_maxIs a constant, satisfies 0<p_min<0.5<p_max<1；

Updating the confidence coefficient of the t moment I according to the probability that each point is occupied by the obstacle, and calculating the occupancy probability based on the confidence coefficient, wherein the specific formula is as follows:

wherein, C_t,IAnd C_t-1,IConfidence levels of the t moment and the t-1 moment I, and an initial confidence level C_0,I＝0，p_IIs the occupation probability of I, the more the occupation probability is close to 0, the more likely I is a free area; conversely, the closer the occupancy probability is to 1, the more likely I is to be an obstacle region;

and continuously updating the occupation probability of each point detected by the laser radar by the intelligent agent according to the movement of the planned path to generate an obstacle distribution map of the unknown environment.

Preferably, in step S5, the RBF neural network is a network structure having two inputs and one output and including 5 hidden layer nodes, where the input is the position coordinates of each point in the environment area, and the output is the concentration value; and (3) learning and training the RBF neural network through the position and concentration value sample pairs recorded in the boundary detection and global detection to obtain the trained RBF neural network, and completing the fitting of the concentration field distribution of the whole unknown environment.

According to a second aspect of the present invention, the present invention further provides an environment map building apparatus based on a multi-policy joint control agent, including the following modules:

the boundary detection module is used for controlling the intelligent body to move for a circle along the boundary of the unknown environment by adopting an improved wall-following strategy based on the intelligent body sensing model so as to carry out boundary detection, and analyzing the pose of the intelligent body and the measurement data of the laser radar in the boundary detection process so as to obtain the boundary of the unknown environment;

the spiral traversal module is used for dividing the area surrounded by the boundary into a plurality of grids, randomly selecting a target point in each grid, and sequentially taking each target point as an end point of the motion of the intelligent agent by adopting a simplified spiral traversal strategy;

the global detection module is used for dynamically planning a path from the intelligent agent to each terminal by adopting a D-Lite algorithm, and the intelligent agent sequentially reaches target points according to the planned path to perform global detection;

the obstacle distribution map generating module is used for generating an obstacle distribution map of an unknown environment by adopting an occupancy grid map algorithm based on the pose of the intelligent agent and the measurement data of the laser radar in the global detection process;

the concentration distribution map generation module is used for training the RBF neural network based on the self position information recorded in the intelligent agent movement process and the collected pollutant concentration, and after the training is finished, the trained RBF neural network is adopted to generate a concentration distribution map of an unknown environment in a fitting mode;

and the environment map generation module is used for generating an environment map by combining the barrier distribution map and the concentration distribution map to complete the map construction of the unknown environment.

The technical scheme provided by the invention has the following technical effects:

the method and the device for constructing the environment map of the multi-strategy combined control agent are mainly divided into three stages of boundary detection, global detection and map construction. Firstly, detecting the boundary of an unknown environment through an improved wall-following strategy; secondly, dividing the area surrounded by the boundary into grids, sequentially taking a randomly selected target point in each grid as an end point of the movement of the intelligent body by using a simplified spiral traversal strategy, and controlling the movement of the intelligent body by combining a D × Lite algorithm, so as to reduce the time for completing the global detection of the unknown environment; and finally, generating obstacle distribution and concentration field distribution of the unknown environment through a space occupying grid map algorithm and an RBF neural network respectively. The invention controls the movement of the intelligent body through multiple strategies, scans and monitors the environment to generate the obstacle area, and fits the concentration field distribution of the pollution source according to the concentration information acquired by the sensor and the position information of the intelligent body. The intelligent agent is controlled to move in the unknown monitoring environment through multi-strategy combination, the environment map can be constructed quickly and accurately, and the problems that the existing method is low in calculation efficiency and incomplete in map construction are effectively solved.

Drawings

The invention will be further described with reference to the accompanying drawings and examples, in which:

FIG. 1 is a schematic diagram of an environment mapping method based on a multi-strategy joint control agent according to the present invention;

FIG. 2 is a diagram of six unknown environments and their map construction, wherein FIG. 2(a) is a real environment map, and FIG. 2(b) is a constructed environment map;

FIG. 3 is a diagram of the movement trajectory of an agent in six environments in accordance with the present invention;

FIG. 4 is a comparison of the map construction results of three different methods of the present invention, wherein FIG. 4(a) is a real environment map, and FIGS. 4(B) to 4(d) are a method A and a method B, respectively, and an environment map constructed by the present invention;

fig. 5 is a concentration distribution graph fitted by the RBF neural network of the present invention, in which fig. 5(a) is a real concentration map and fig. 5(b) is a fitted concentration map.

Fig. 6 is a block diagram of an environment mapping apparatus for jointly controlling agents based on multiple policies according to the present invention.

Detailed Description

For a more clear understanding of the technical features, objects and effects of the present invention, embodiments of the present invention will now be described in detail with reference to the accompanying drawings.

In this embodiment, a Python is used to compile corresponding codes, and a Pygame module is used to perform a simulation experiment. In the present embodiment, six different environments (each having a length and a width of 150cm × 105cm) are designed, and different numbers and shapes of obstacles are distributed in each environment, so as to simulate a complex actual environment. A single Gaussian concentration distribution model is adopted to simulate the point pollution source in the environment, and the function of the point pollution source is defined as:

wherein (x)₀,y₀) Is the position of the pollution source, (x, y) is the coordinate of any point in the environment, c (x, y) is the concentration of the point, and the value range is (0, 1)]。

Referring to fig. 1, fig. 1 is a schematic diagram of an environment map construction method based on a multi-policy joint control agent according to the present invention, and the specific implementation steps are as follows:

step S1: based on the intelligent sensing model, an improved wall-following strategy is adopted to control the intelligent body to move for a circle along the boundary of the unknown environment for boundary detection, and the boundary of the unknown environment is obtained according to the pose of the intelligent body and the measurement data of the laser radar in the boundary detection process.

The intelligent agent is a mobile carrier carrying two ultrasonic sensors, a laser radar, a concentration sensor and a photoelectric encoder, and has the functions of detecting distance, scanning environment and detecting pollutant concentration. The two ultrasonic sensors are respectively positioned in front of and on the right side of the intelligent body and used for detecting whether obstacles exist in the front and on the right side; the concentration sensor, the laser radar and the photoelectric encoder are located in the center of the intelligent body, the concentration sensor is used for collecting pollutant concentration, the laser radar is used for scanning the whole environment and calculating and detecting the distribution situation of the obstacles, and the photoelectric encoder is used for calculating the pose of the intelligent body and comprises coordinates and angles.

The system for sensing the unknown environment around the intelligent agent by using the functions of detecting the distance, scanning the environment and detecting the concentration of pollutants is called an intelligent agent sensing model.

And in the process of boundary detection, the intelligent agent perception model obtains the boundary of the unknown environment according to the pose of the intelligent agent and the measurement data of the laser radar.

In step S1, the improved wall-following strategy specifically includes:

setting the mark bit of the ultrasonic sensor i as U_iThe values are:

In this embodiment, the distance threshold l of the ultrasonic sensor is set to 1cm, the moving direction of the agent is adjusted according to equation (3), and the agent is ensured to be always on the left side of the boundary along the moving direction, so that the boundary detection of the unknown environment is completed.

Step S2: and dividing the area surrounded by the boundary into a plurality of grids, randomly selecting a target point in each grid, and sequentially taking each target point as an end point of the motion of the intelligent agent by adopting a simplified spiral traversal strategy.

In this embodiment, the unknown area surrounded by the boundary is divided into a plurality of 9 grids, a target point is randomly selected from each large grid, and the target points are sequentially used as the end points of the movement of the agent in the clockwise direction.

in the global detection process, the intelligent agent only needs to sequentially reach the target point according to the planned path, and the whole environment area does not need to be fully covered.

Step S4: and generating an obstacle distribution map of an unknown environment by adopting an occupancy grid map algorithm based on the pose of the intelligent agent and the measurement data of the laser radar in the global detection process.

In this embodiment, step S4 specifically includes:

where I denotes the scanned point, z_tIs a set of the measurement data of the lidar and the pose of the agent at time t, r_maxIs the farthest distance, r, that the lidar can detect_IIs the distance from the center of the laser radar to I, phi_IIs the relative angle between the lidar and I, phi_kIs phi_IThe closest scan angle, k is the index of the scan angle, r_kIs phi_kDetection range of the laser radar in the direction, beta being the step angle of the scanning, p_minAnd p_maxIs constant and satisfies 0<p_min<0.5<p_max<1, empirical value p_min＝0.3、p_max0.7. The first condition in the formula (4) corresponds to that the detected point is an undetected area, the probability value is 0.5, and the undetermined area is represented as whether the detected point is occupied or not; the second case represents that the point is likely to be occupied, and a larger probability needs to be assigned; the third case indicates that the point is less likely to be occupied, giving a lower probability.

in this embodiment, a probability of being occupied by an obstacle is given to each point in the environment map according to equation (4), the confidence of the point is updated according to equation (5), the occupancy probability of the point is calculated based on the confidence, the occupancy probability of each point detected by the laser radar is continuously updated by the intelligent agent according to the movement of the planned path, and finally, an obstacle distribution map of the unknown environment is generated.

Step S5: training the RBF neural network based on the self position information recorded in the intelligent body motion process (including the boundary detection process and the global detection process) and the collected pollutant concentration, and after the training is finished, fitting the trained RBF neural network to generate a concentration distribution map of an unknown environment.

In this embodiment, the multi-policy joint control agent algorithm provided by the present invention is applied to map reconstruction in an unknown environment. In order to verify the effectiveness of the invention, six different environments are tested, and a constructed environment map and a motion track of the intelligent agent are drawn, specifically referring to fig. 2 and fig. 3. Comparing fig. 2(a) and fig. 2(b), the constructed maps of six unknown environments are accurate, and only a few edge regions are missing (the missing part is less than 1% of the full map). This is because the scanning range of the laser radar is a sector, and when the occupancy probability is calculated by dividing the sector area with the grid, a situation that a part of the grid area is overlapped or a part of the sector area is not divided by the grid occasionally occurs, so that the occupancy probability of the area is calculated incorrectly, and the finally generated obstacle map is missing. As can be seen from fig. 3, the agent moves around the boundary for one circle to detect the boundary of the environment, then moves to each target point in a spiral traversal manner, and finally stops at the last target point, thereby basically completing coverage detection of all unknown environments.

To further illustrate the advancement of the present invention, environment 2 and environment 3 are taken as examples to compare the present invention with two other methods: the method A does not adopt a wall-following strategy and a spiral traversal strategy, and randomly selected points are used as target points when the intelligent agent moves; the method B does not adopt a wall-following strategy to control the intelligent agent to move. The environment map constructed by the three methods is shown in fig. 4. As can be seen from fig. 4, the method a has the worst effect, many parts in the environment are not detected, and the generated map is far from the real environment; the method B finishes the detection of most environmental regions, and a small number of boundary regions are not detected; the invention completely explores the whole environment, and the result is closest to the real map.

Figure 5 shows the concentration profile fitted to the RBF neural network. Although the predicted concentration near the contamination source is slightly lower than the true value, the predicted concentration distribution trend and the position of the concentration peak are substantially consistent with the true condition.

In some embodiments, referring to fig. 6, there is further provided an environment mapping apparatus based on multi-policy joint control agent, including the following modules:

the boundary detection module 1 is used for controlling the intelligent body to move for a circle along the boundary of the unknown environment by adopting an improved wall-following strategy based on the intelligent body sensing model so as to carry out boundary detection, and analyzing the pose of the intelligent body and the measurement data of the laser radar in the boundary detection process so as to obtain the boundary of the unknown environment;

the spiral traversal module 2 is used for dividing the area surrounded by the boundary into a plurality of grids, randomly selecting a target point in each grid, and sequentially taking each target point as an end point of the motion of the intelligent agent by adopting a simplified spiral traversal strategy;

the global detection module 3 is used for dynamically planning a path from the intelligent agent to each terminal by adopting a D × Lite algorithm, and the intelligent agent sequentially reaches target points according to a planned track to perform global detection;

the obstacle distribution map generating module 4 is used for generating an obstacle distribution map of an unknown environment by adopting an occupancy grid map algorithm based on the pose of the intelligent agent and the measurement data of the laser radar in the global detection process;

the concentration distribution map generation module 5 is used for training the RBF neural network based on the self position information recorded in the intelligent agent movement process and the collected pollutant concentration, and generating a concentration distribution map of an unknown environment by adopting the trained RBF neural network in a fitting manner after the training is finished;

and the environment map generation module 6 is used for generating an environment map by combining the barrier distribution map and the concentration distribution map to complete the map construction of the unknown environment.

It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or system that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or system. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or system that comprises the element.

The above-mentioned serial numbers of the embodiments of the present invention are merely for description and do not represent the merits of the embodiments. In the unit claims enumerating several means, several of these means may be embodied by one and the same item of hardware. The use of the words first, second, third and the like do not denote any order, but rather the words first, second and the like may be interpreted as indicating any order.

The above description is only a preferred embodiment of the present invention, and is not intended to limit the scope of the present invention, and all equivalent structures or equivalent processes performed by the present invention or directly or indirectly applied to other related technical fields are also included in the scope of the present invention.

Claims

1. An environment map construction method based on a multi-strategy joint control agent is characterized by comprising the following steps:

step S1: based on the intelligent body sensing model, an improved wall-following strategy is adopted to control the intelligent body to move for a circle along the boundary of the unknown environment for boundary detection, and the boundary of the unknown environment is obtained according to the pose of the intelligent body and the measurement data analysis of the laser radar in the boundary detection process;

step S2: dividing a region surrounded by a boundary into a plurality of grids, randomly selecting a target point in each grid, and sequentially taking each target point as an end point of the movement of the intelligent body by adopting a simplified spiral traversal strategy;

2. The method for building an environment map based on multi-strategy joint control agent of claim 1, wherein in step S1, the agent is a mobile carrier carrying two ultrasonic sensors, a laser radar, a concentration sensor and a photoelectric encoder; the two ultrasonic sensors are respectively positioned in front of and on the right side of the intelligent agent and used for detecting whether obstacles exist in the front and on the right side; the concentration sensor, the laser radar and the photoelectric encoder are located at the central position of the intelligent body, the concentration sensor is used for collecting pollutant concentration, the laser radar is used for scanning the whole environment and calculating and detecting the distribution condition of the obstacles, and the photoelectric encoder is used for calculating the pose of the intelligent body and comprises coordinates and angles.

3. The method for building an environment map based on multi-strategy joint control agent according to claim 1, wherein in step S1, the improved wall-following strategy comprises:

4. The method for building an environment map based on multi-strategy joint control agent according to claim 1, wherein in step S1, the improved wall-following strategy comprises:

the zone bit of the ultrasonic sensor i is set as U_iThe values are:

where l is the distance threshold and d isDistance between agent and boundary, when U_iIf the value is 1, the ultrasonic sensor i faces to the boundary, otherwise, the boundary is absent;

5. The method for environment mapping based on multi-strategy joint control agent as claimed in claim 1, wherein in step S2, the simplified spiral traversal strategy comprises:

and selecting the clockwise direction, sequentially taking the target points as the end points of the movement of the intelligent agent, and ensuring that the target points in each grid can be accessed only once.

6. The multi-policy joint control agent-based environment mapping method according to claim 1, wherein step S4 includes:

where I denotes the scanned point, z_tIs a set of the measurement data of the lidar and the pose of the agent at time t, r_maxIs the farthest distance, r, that the lidar can detect_IIs the distance from the center of the laser radar to I, phi_IIs the relative angle between the lidar and I, phi_kIs phi_IClosest scan angle, k is the index number of the scan angle, r_kIs phi_kDetection range of the laser radar in the direction, beta being the step angle of the scanning, p_minAnd p_maxIs constant and satisfies 0<p_min<0.5<p_max<1；

7. The method for building an environment map based on multi-strategy cooperative control agent of claim 1, wherein in step S5, the RBF neural network is a network structure with two inputs and one output and comprises 5 hidden layer nodes, the input is the position coordinates of each point in the environment area, and the output is the concentration value; and (3) learning and training the RBF neural network through the position and concentration value sample pairs recorded in the boundary detection and global detection to obtain the trained RBF neural network, and completing the fitting of the concentration field distribution of the whole unknown environment.

8. An environment map construction device based on multi-strategy joint control agent is characterized by comprising the following modules:

the concentration distribution map generation module is used for training the RBF neural network based on the self position information recorded in the intelligent agent movement process and the collected pollutant concentration, and after the training is finished, the concentration distribution map of the unknown environment is generated by adopting the trained RBF neural network in a fitting manner;