CN113672362A

CN113672362A - Intelligent cooperative operation method and system for epidemic prevention machine group in complex and multi-environment

Info

Publication number: CN113672362A
Application number: CN202110820172.3A
Authority: CN
Inventors: 高洪波; 朱菊萍; 何希; 王源源
Original assignee: Institute of Advanced Technology University of Science and Technology of China
Current assignee: Institute of Advanced Technology University of Science and Technology of China
Priority date: 2021-07-20
Filing date: 2021-07-20
Publication date: 2021-11-19
Anticipated expiration: 2041-07-20
Also published as: CN113672362B

Abstract

The invention provides an intelligent cooperative operation method and system for an epidemic prevention machine group in a complex multi-environment, wherein the method comprises the following steps: step 1: dynamic configuration of multi-robot tasks and roles is carried out in a dynamic environment; step 2: constructing an intelligent cooperation strategy based on the LTL under the condition of weakened information according to the dynamic configuration result; and step 3: and constructing an intelligent cooperative strategy based on the LTL under the conflict resolution condition according to the dynamic configuration result. The LTL can express rich and complex task specifications and is close to the description of human natural language, so that the description of machine complex tasks is more direct and convenient, and the usability and usability of the system are enhanced. As a theoretical framework for multi-robot collaborative planning control, the method has good openness and expansibility, and is convenient to combine the existing control method with the latest scientific research achievement in the field of artificial intelligence.

Description

Intelligent cooperative operation method and system for epidemic prevention machine group in complex and multi-environment

Technical Field

The invention relates to the technical field of machine cooperative control, in particular to an intelligent cooperative operation method and system under complex and multi-environment of epidemic prevention machine groups.

Background

The epidemic prevention machine moving in the working environment can be regarded as a system with a plurality of moving states, the expression of time sequence logic is strong, the epidemic prevention machine is suitable for controlling a large number of epidemic prevention robots, the LTL theory is combined with artificial intelligence, a control theory, model detection and other methods, and effective and feasible new ideas can be provided for robot intelligent cooperation problems under multiple constraint limits such as weakening information, conflict resolution and the like through real-time perception, detection and interaction with the environment of the robot.

Disclosure of Invention

Aiming at the defects in the prior art, the invention aims to provide an intelligent cooperative operation method and system for an epidemic prevention machine group in a complex and multi-environment.

According to the intelligent cooperative operation method of the epidemic prevention machine group in the complex and multi-environment, the method comprises the following steps:

step 1: dynamic configuration of multi-robot tasks and roles is carried out in a dynamic environment;

step 2: constructing an intelligent cooperation strategy based on the LTL under the condition of weakened information according to the dynamic configuration result;

and step 3: and constructing an intelligent cooperative strategy based on the LTL under the conflict resolution condition according to the dynamic configuration result.

Preferably, the step 1 comprises:

step 1.1: learning an optimal strategy to maximize the return or achieve a specific objective problem through interaction of the robot and the environment by using reinforcement learning;

step 1.2: using game theory to realize dynamic balance among task selections;

step 1.3: the reinforcement learning and the game theory are combined, and intelligent task allocation under a dynamic unknown non-structural environment is achieved.

Preferably, said step 1 further comprises the step of dividing the robot role into leader and executor, building a hierarchical organization of leader-executor, the criterion of division being to actuate and control a large number of executor robots with a minimum number of leader robots.

Preferably, the step 2 includes constructing a robot information topology of the networked system, extracting topological features by using multi-dimensional interaction in the simple complex model characterization system based on the constructed information topology, and designing a multi-robot cooperative control strategy by fusing methods such as optimization control, nonlinear control, random control and the like.

Preferably, the step 3 comprises a task feasible network topology product automaton formed on the basis of the constructed migration system, the LTL task automaton and the Cartesian product; and searching a path meeting the system specification by applying methods such as a model inspection technology, artificial intelligence, decision and control and the like.

Preferably, the step 3 includes constructing a new method of intelligent coordination and motion planning of multiple robots based on a probability model, and constructing an automaton based on an uncertain automaton generated by a probability migration system and an LTL.

The invention also provides an intelligent cooperative operation system of the epidemic prevention machine group in a complex multi-environment, which comprises the following modules:

module M1: dynamic configuration of multi-robot tasks and roles is carried out in a dynamic environment;

module M2: constructing an intelligent cooperation strategy based on the LTL under the condition of weakened information according to the dynamic configuration result;

module M3: and constructing an intelligent cooperative strategy based on the LTL under the conflict resolution condition according to the dynamic configuration result.

Preferably, said module M1 comprises:

module M1.1: learning an optimal strategy to maximize the return or achieve a specific objective problem through interaction of the robot and the environment by using reinforcement learning;

module M1.2: using game theory to realize dynamic balance among task selections;

module M1.3: the reinforcement learning and the game theory are combined, and intelligent task allocation under a dynamic unknown non-structural environment is achieved.

Preferably, said module M1 further comprises the step of taking the robot role as a leader and an executor, building a hierarchical organization of leader-executor, the criterion of division being to actuate and control a large number of executor robots with a minimum number of leader robots.

Preferably, the module M2 includes a robot information topology for constructing a networked system, and based on the constructed information topology, the method of extracting topological features, fusing methods such as optimization control, nonlinear control, random control, and the like, and designing a multi-robot cooperative control strategy, by using multi-dimensional interaction in a simple complex model characterization system.

Compared with the prior art, the invention has the following beneficial effects:

1. the LTL can express rich and complex task specifications and is close to the description of a natural language of a human, so that the description of a complex task of a machine is more direct and convenient, and the usability of the system are enhanced;

2. as an automatic system verification method, a model-checking method has correct-by-design characteristics, and ensures that collaborative planning can meet various complex specifications such as environment, system and task (namely, the correctness of the collaborative planning is ensured);

3. as a theoretical framework for multi-robot collaborative planning control, the method has good openness and expansibility, and is convenient to combine the existing control method with the latest scientific research achievement in the field of artificial intelligence.

Drawings

Other features, objects and advantages of the invention will become more apparent upon reading of the detailed description of non-limiting embodiments with reference to the following drawings:

FIG. 1 is a general technical scheme diagram of multi-robot cooperative control;

FIG. 2 is a block diagram of an LTL-based intelligent planning control theory;

Detailed Description

The present invention will be described in detail with reference to specific examples. The following examples will assist those skilled in the art in further understanding the invention, but are not intended to limit the invention in any way. It should be noted that it would be obvious to those skilled in the art that various changes and modifications can be made without departing from the spirit of the invention. All falling within the scope of the present invention.

Referring to fig. 1 and 2, aiming at an epidemic prevention robot for an isolation area to undertake material distribution, a problem that an optimal strategy is learned to maximize return or achieve a specific target through interaction of the robot and a community environment by using reinforcement learning is adopted; using a game theory to realize dynamic balance among distribution task selections; the reinforcement learning and the game theory are combined, and intelligent task allocation under a dynamic unknown non-structural environment is achieved. The robot role is divided into leader and executor, and a leader-executor hierarchy is constructed, and the division is based on the principle that a large number of executor robots are driven and controlled by a minimum number of leader robots.

Considering that the behavior and the motion of the robot can be influenced by external factors, regarding the behavior and the motion of the robot as an uncertain event, extracting the influence factors in the environment, constructing a migration system based on a Markov decision process model, and depicting the behavior characteristics of the robot under the influence of the dynamic factors, the uncertain factors of robot hardware, information interaction and other factors in the environment through a probability model.

The LTL is made up of a set of atom topic sets, logical operators, and timing operators. The LTL is used as a formal language, is similar to the normal language habit of people, codes tasks needing to be completed by the robot into a linear time sequence formula, and can describe not only point-to-point simple tasks but also more complex tasks. The subject adopts the LTL to describe any specified and required behaviors of the robot in a complex and various way, and converts the actions into corresponding automata in a chart form according to the type of the LTL.

The distribution is carried out according to the geographic environment characteristics of the isolation area, and the isolation area is regarded as a weighted switching system with a finite state, and tasks which need to be executed by the intelligent epidemic prevention robot are described by a linear sequential logic language. And constructing a Product automaton fusing a system and a task. And obtaining the optimal path by using a Dijkstra algorithm and feeding the optimal path back to an actual system, thereby obtaining the path of the epidemic prevention robot in the actual environment.

In the multi-robot system, each independent robot is a node in the networked system, and the cooperative cooperation is realized through the real-time transmission and sharing of information. The topological structure characteristics of the communication network determine the information exchange and sharing mode among individuals and also determine whether the intelligent system can cooperatively complete complex tasks. Aiming at the situations of limited information exchange, time lag, packet loss and the like among robots by constructing a random network, a switching network, a time-varying network and the like under a dynamic non-structural environment and a complex task scene.

Based on a figure-of-graph theory and a control theory, the topological characteristics of the network are extracted by using continuous coherence, an internal rule is mined between the information topology of the multi-robot system and the information interaction and cooperative operation of the multi-stage machines, and theoretical support is provided for the cooperative operation under the condition of weakening information limitation.

Based on the topological characteristics extracted by a simple complex analysis method, optimization control, nonlinear control, random control and other methods are fused, and a multi-robot cooperative control strategy is designed, so that the robot can utilize local and limited information interaction to carry out effective intelligent cooperation, and intelligent cooperation and operation under the condition of weakening information limitation are realized.

Based on the constructed migration system, the LTL task automaton and a task feasible network topology product automaton formed by Cartesian products; searching an optimal path meeting a system specification by applying methods such as a model inspection technology, artificial intelligence, decision making, control and the like;

a novel method for establishing intelligent cooperation and motion planning of multiple robots based on a probability model is constructed, a loose product automaton is constructed on the basis of an uncertain Buchi automaton generated by a probability migration system and LTL, the robots are enabled not to strictly follow expected LTL constraint, theories such as reinforcement learning and game theory are combined, and the optimal motion planning for resolving conflicts is found in a limited time domain according to real-time sensed environment information.

The invention also provides an intelligent cooperative operation system of the epidemic prevention machine group in a complex multi-environment, which comprises: module M1: dynamic configuration of multi-robot tasks and roles is carried out in a dynamic environment; the module M1 includes: module M1.1: learning an optimal strategy to maximize the return or achieve a specific objective problem through interaction of the robot and the environment by using reinforcement learning; module M1.2: using game theory to realize dynamic balance among task selections; module M1.3: the reinforcement learning and the game theory are combined to realize intelligent task allocation under a dynamic unknown non-structural environment; module M1 also includes building a hierarchical organization of leader-executors by dividing the robot role into leader and executors, with the criterion of division being to actuate and control a large number of executors robots with a minimum number of leader robots.

Module M2: constructing an intelligent cooperation strategy based on the LTL under the condition of weakened information according to the dynamic configuration result; the module M2 comprises a robot information topology for constructing a networked system, and based on the constructed information topology, the method comprises the steps of utilizing a simple complex model to depict multi-dimensional interaction in the system, extracting topological characteristics, fusing methods such as optimization control, nonlinear control and random control, and designing a multi-robot cooperative control strategy.

Those skilled in the art will appreciate that, in addition to implementing the system and its various devices, modules, units provided by the present invention as pure computer readable program code, the system and its various devices, modules, units provided by the present invention can be fully implemented by logically programming method steps in the form of logic gates, switches, application specific integrated circuits, programmable logic controllers, embedded microcontrollers and the like. Therefore, the system and various devices, modules and units thereof provided by the invention can be regarded as a hardware component, and the devices, modules and units included in the system for realizing various functions can also be regarded as structures in the hardware component; means, modules, units for performing the various functions may also be regarded as structures within both software modules and hardware components for performing the method.

The foregoing description of specific embodiments of the present invention has been presented. It is to be understood that the present invention is not limited to the specific embodiments described above, and that various changes or modifications may be made by one skilled in the art within the scope of the appended claims without departing from the spirit of the invention. The embodiments and features of the embodiments of the present application may be combined with each other arbitrarily without conflict.

Claims

1. An intelligent cooperative operation method for an epidemic prevention machine group in a complex and multi-environment is characterized by comprising the following steps:

2. The intelligent cooperative operation method for the epidemic prevention machine group in the complex multi-environment according to claim 1, wherein the step 1 comprises:

step 1.2: using game theory to realize dynamic balance among task selections;

3. The intelligent cooperative work method for epidemic prevention robot group in complex multi-environment according to claim 1, wherein the step 1 further comprises dividing the robot role into leader and executor, and constructing a hierarchical organization of leader-executor, wherein the dividing is based on the minimum number of leader robots to drive and control a large number of executor robots.

4. The intelligent cooperative operation method under the complex multi-environment of the epidemic prevention machine group as claimed in claim 1, wherein the step 2 comprises constructing a robot information topology of a networked system, based on the constructed information topology, utilizing a simple complex model to depict multi-dimensional interaction in the system, extracting topological features, fusing methods such as optimization control, nonlinear control, random control and the like, and designing a multi-robot cooperative control strategy.

5. The intelligent cooperative operation method under the complex multi-environment of the epidemic prevention machine group as claimed in claim 1, wherein the step 3 comprises a task feasible network topology product automaton which is formed based on the constructed migration system, the LTL task automaton and the Cartesian product; and searching a path meeting the system specification by applying methods such as a model inspection technology, artificial intelligence, decision and control and the like.

6. The intelligent cooperative operation method for the epidemic prevention robot group in the complex multi-environment as claimed in claim 1, wherein the step 3 comprises constructing a new method for intelligent cooperation and motion planning of multiple robots based on a probability model, and constructing an automaton based on an uncertain automaton generated by a probability migration system and LTL.

7. The utility model provides an intelligent collaborative operation system under epidemic prevention machine crowd complicated multi-environment which characterized in that, the system includes following module:

8. The intelligent cooperative operating system in an epidemic prevention machine group complex multi-environment as claimed in claim 7, wherein the module M1 comprises:

9. The intelligent cooperative operation system in the complex multi-environment of epidemic prevention machine group as claimed in claim 7, wherein the module M1 further comprises dividing the robot role into leader and executor, and constructing a hierarchical organization of leader and executor, wherein the dividing criterion is to drive and control a large number of executor robots by a minimum number of leader robots.

10. The intelligent cooperative work method under the complex multi-environment of the epidemic prevention robot group as claimed in claim 7, wherein the module M2 comprises a robot information topology for constructing a networked system, extracting topology characteristics based on the constructed information topology by using multi-dimensional interaction in a simple complex model depicting system, and designing a multi-robot cooperative control strategy by fusing methods such as optimization control, nonlinear control, random control and the like.