CN110084295B

CN110084295B - Multi-agent system grouping surrounding control method and control system

Info

Publication number: CN110084295B
Application number: CN201910318251.7A
Authority: CN
Inventors: 任红卫; 李玉华
Original assignee: Guangdong University of Petrochemical Technology
Current assignee: Guangdong University of Petrochemical Technology
Priority date: 2019-04-19
Filing date: 2019-04-19
Publication date: 2023-05-05
Anticipated expiration: 2039-04-19
Also published as: CN110084295A

Abstract

The invention belongs to the technical field of computer application, and discloses a multi-agent system grouping surrounding control method and a control system, which are used for detecting through a sensor network with a known topological structure to determine the topological structure of the multi-agent system; combining relatively close and relatively stationary agents into a group to simplify the control model; the model adopts a centralized control method, and the intelligent agent group with the largest intelligent agent number in the intelligent agent group is used as a central control intelligent agent group of the control model; coding the grouped agent groups; an agent code in the agent group; and determining the intelligent agent group and the intelligent agent according to the codes, and performing corresponding control operation. The invention uses the sensor network with known topological structure to build and correct the model, which saves time and has high control speed; the method adopts a grouping mode, so that the whole distributed processing and the local centralized processing are realized; the real-time coding mode is adopted for control, so that the precision is high and the positioning is accurate.

Description

Multi-agent system grouping surrounding control method and control system

Technical Field

The invention belongs to the technical field of computer application, and particularly relates to a multi-agent system grouping surrounding control method and a control system.

Background

The multi-agent surrounding target control algorithm is a control algorithm for realizing a group of agents surrounding targets in a detection range through a distributed network and local information. The generation of intelligent concepts originates from earlier human research in the field of artificial intelligence, where human beings no longer need to deal with complex dangerous tasks personally, but where "agents" are used instead of humans to accomplish these tasks are an important reason that artificial intelligence theory has been proposed. A physical entity such as a sensor, an actuator, a robot, a biological individual, a drone, etc. may be considered as an agent. Along with the development of the intelligent agent theory, the multi-intelligent agent system theory is also proposed, namely, a group of intelligent agents with sensor, calculation, execution and communication capabilities are related to form a network system through communication and the like. For a single agent, it can only obtain limited information, so it is difficult to solve the distributed problem. For multi-agent systems, the ability to solve such problems far exceeds that of a single agent through the interaction between agents. Compared with a single intelligent agent system, the multi-intelligent agent system has more advantages, which are mainly represented by the following aspects: higher efficiency, better system performance, stronger robustness and fault tolerance, lower price, simple use and operation, etc.

To sum up, the prior art has the following problems:

in the existing network system of a single intelligent agent, limited information is acquired, and the distributed problem cannot be solved. The existing multi-agent control method is low in convergence speed, cannot realize rapid control, is low in control precision and inaccurate in positioning in actual application.

Disclosure of Invention

Aiming at the problems existing in the prior art, the invention provides a multi-agent system grouping surrounding control method, which belongs to the energy-saving efficient and technical improvement inventions.

The multi-agent system grouping and surrounding control method provided by the invention comprises the following steps:

step one, detecting through a sensor network with a known topological structure, and determining the topological structure of the multi-agent system;

combining the relatively static intelligent agents with relatively close positions into a group to simplify a control model;

thirdly, the model adopts a centralized control method, and the intelligent agent group with the largest intelligent agent number in the intelligent agent group is used as a central control intelligent agent group of the control model;

step four, the grouped agent groups are encoded;

fifthly, encoding the agents in the agent groups;

step six, determining the intelligent agent group and the intelligent agent according to the codes, and performing corresponding control operation.

In the fourth step, the coding mode is: and placing all the agent groups in a two-dimensional vector space, establishing a Laplace matrix, wherein the boundary of the aggregation space is the boundary of the matrix, and establishing a space rectangular coordinate system by using the lower left corner boundary, wherein the coordinates of the agent groups in the matrix are the agent group codes.

In the fifth step, the encoding priorities of the agents in the group are as follows: high, low, up, down, left and right, and adopts relative position coding by taking the intelligent agent with the strongest calculation force as the center.

Further, the second step combines relatively close and relatively stationary agents into a group to simplify the control model,

selecting an agent evaluation model, calculating a predictive value of each frame of a relatively static agent by comparing an agent with a relatively static agent at an original relatively near position, and marking the obtained frame fraction as a vector X and the total frame number of the agent as N;

the window length of the sliding window is winLen, the sliding window processing is carried out on the obtained frame level quality fraction, namely the frame level fraction of the n-th frame after processing is the average value of the frame level fractions of the [ n-winlen+1, n ] frames, and the frame level fraction after the sliding window processing is marked as a vector WX;

sequencing WX from small to large, marking the sequenced result as WX', taking the average value of the worst p% frames as the quality measurement value of the whole agent topological structure sequence, sequencing, and taking the minimum p% frame average value as the final measurement result;

and sequentially carrying out sliding window treatment on all frame fractions calculated by the agent evaluation model, namely:

wherein, winLen represents window length during sliding window filtering, which is a parameter to be adjusted, X (t) represents quality fraction of the t-th frame, and WX (n) represents quality fraction of the n-th frame after sliding window processing;

and fusing the predicted frame fraction by using a time domain information fusion method based on the inter-frame association and the worst time slot, and finally predicting the fraction:

wherein p% is a parameter to be adjusted, N is the total frame number of the topological structure of the intelligent agent, WX' (t) represents the t th frame fraction after sequencing from small to large, OM _winPooling The final evaluation result of the quality of the agent.

Further, the final evaluation result of the quality of the obtained intelligent agent is processed uniformly, and a uniform control model is constructed.

Further, the final evaluation result of the quality of the obtained intelligent agent is subjected to unified processing including data selection, data preprocessing and data transformation.

Further, the data transformation comprises data transformation dimension reduction, wherein the data transformation dimension reduction is a feature extracted from initial features of data through cluster analysis, and the data dimension is reduced.

Further, the process of constructing the unified control model includes: selecting reference alarm information parameter vector and establishing reference sequence X ₀ ，

X ₀ ＝{X ₀ (k)|k＝1,2,…,n}＝(X ₀ (1),X ₀ (2),…,X ₀ (n))

Wherein k represents time of day, X ₀ Representing alarm information, wherein n represents the feature dimension of the parameter vector of the alarm information;

secondly, assuming that m comparison fault alarm information data exist, a comparison sequence X is established _i

X _i ＝{X _i (k)|k＝1,2,…,n}＝(X _i (1),X _i (2),…,X _i (n))i＝1,2,…,m

Then, a comparison sequence X is established _i For reference number series X ₀ Correlation coefficient ζ at time k _i (k)

/>

Wherein w is ₁ The corresponding weight of each parameter is adjusted and decided according to the network attribute of the user; where ρ is the resolution factor and, ρ is e [0 ], ++ infinity a) is provided; the larger ρ isThe greater the resolution; the smaller the ρ, the smaller the resolution;

finally, calculate the comparison sequence X _i For reference number series X ₀ Is related to the degree of association of (2)

Further, establishing a unified control model further comprises establishing a mapping relation between the intelligent agent alarm information and the faults between networks through association analysis, establishing a fault positioning model, and performing network fault positioning through a BP neural network; the method comprises the following steps:

first, an m-dimensional alert vector Q is obtained _n ＝(s ₁ ,s ₂ ,s ₃ …s _m ) And n-dimensional fault vector O _n ＝(p ₁ ,p ₂ ,p ₃ …p _m ) The system is enabled to have a parallel structure and parallel processing capability by simultaneously inputting the system through a plurality of network nodes, and the input is dynamically processed in real time;

secondly, the BP network gives a value in a specified range to each connected weight, and simultaneously, a threshold value is specified to each neuron node;

thirdly, providing a group input alarm sample machine target result to a network, and calculating the connection weight, the threshold value and the input and output values of each hidden layer unit of the neural network node;

then, positive layer errors: calculating the error of the output layer unit by using the target vector and the network actual output value, correcting the connection weight and the threshold value by combining the output of each unit of the hidden layer, and carrying out reverse error propagation correction;

finally, training sample vectors and training the heterogeneous network system until the complete part of samples are trained, inputting network operation and maintenance fault alarm information into a trained BP network to locate network faults.

It is another object of the present invention to provide a multi-intelligent system packet enclosure control system implementing the multi-intelligent system packet enclosure control method.

In summary, the invention has the advantages and positive effects that:

the invention uses the sensor network with known topological structure to build and correct the model, which saves time and has high control speed; the invention adopts a grouping mode, thus realizing the overall distributed processing and the local centralized processing; the real-time coding mode is adopted for control, so that the precision is high and the positioning is accurate. The central control intelligent body is trained by adopting a neural network, so that the convergence rate is improved.

The method comprises the steps of combining relatively nearer and relatively stationary intelligent agents into a group to simplify a control model, selecting an intelligent agent evaluation model, calculating a predictive value of each frame of the relatively stationary intelligent agent by comparing the original nearer intelligent agent with the relatively stationary intelligent agent, marking the obtained frame fraction as a vector X, and marking the total frame number of the intelligent agent as N; the window length of the sliding window is winLen, the sliding window processing is carried out on the obtained frame level quality fraction, namely the frame level fraction of the n-th frame after processing is the average value of the frame level fractions of the [ n-winlen+1, n ] frames, and the frame level fraction after the sliding window processing is marked as a vector WX; sequencing WX from small to large, marking the sequenced result as WX', taking the average value of the worst p% frames as the quality measurement value of the whole agent topological structure sequence, sequencing, and taking the minimum p% frame average value as the final measurement result; and sequentially carrying out sliding window treatment on all frame fractions calculated by the agent evaluation model, namely:

wherein, winLen represents window length during sliding window filtering, which is a parameter to be adjusted, X (t) represents quality fraction of the t-th frame, and WX (n) represents quality fraction of the n-th frame after sliding window processing; and fusing the predicted frame fraction by using a time domain information fusion method based on the inter-frame association and the worst time slot, and finally predicting the fraction: />

Wherein p% is a parameter to be adjusted, N is the total frame number of the topological structure of the intelligent agent, and WX' (t) represents the first step after sequencing from small to larget frame fractions, OM _winPooling The final evaluation result of the quality of the agent.

The invention considers the connection between frames, and uses the sliding window mean value to process the data of each frame, so that the estimation accuracy is greatly improved.

By optimizing network parameters, the network self-healing is realized, and the intelligent operation and maintenance of the network are realized. The invention adopts the BP neural network method in the data mining to mine the network operation and maintenance data and the alarm information, combines the user perception information to intelligently locate the network faults, accurately judges the network faults, improves the network quality, reduces the operation cost, improves the system performance, and ensures the high efficiency, safety and stability of the network operation.

Drawings

Fig. 1 is a flowchart of a multi-agent system packet enclosure control method provided by the invention.

Detailed Description

For a further understanding of the invention, its features and advantages, reference is now made to the following examples, which are illustrated in the accompanying drawings.

The existing single intelligent agent acquires limited information, and the distributed problem cannot be solved. The existing multi-agent control method is low in convergence speed, cannot realize rapid control, is low in control precision and inaccurate in positioning in actual application.

In order to solve the above technical problems, the structure of the present invention will be described in detail with reference to the accompanying drawings.

As shown in fig. 1, the multi-agent system packet enclosure control method provided by the embodiment of the invention includes:

s101, detecting through a sensor network with a known topological structure, and determining the topological structure of the multi-agent system.

S102, combining relatively close-positioned and relatively static intelligent agents into a group to simplify a control model.

S103, the model adopts a centralized control method, takes the agent group with the largest number of agents in the agent group as a central control agent group of the control model, and trains the central control agent group by utilizing the BP neural network.

S104, coding the grouped agent groups.

S105, encoding the agents in the agent group.

S106, the trained central control agent group controls the corresponding agent group and agent through agent group and agent coding.

In step S103, the training method based on the BP neural network provided by the embodiment of the present invention specifically includes:

(1) A 3-layer neural network of one input layer, one hidden layer and one output layer is formed.

(2) The adjustment formulas of the weight and the threshold value of the basic BP neural network are as follows:

w _kj (t+1)＝w _kj (t)+αδ _k H _j

w _ji (t+1)＝w _ji (t)+αδ _j I _i

θ _k (t+1)＝θ _k (t)+βδ _k

θ _j (t+1)＝θ _j (t)+βδ _j

in the above formula, H _j Is the output of the hidden layer node. I _j A signal input for the input node i. w (w) _kj (t) and w _kj And (t+1) is the connection weight of the hidden node j and the output layer node k during the training of the front and the back two times respectively. w (w) _ji (t) and w _ji And (t+1) is the connection weight of the input node i and the hidden node j during the training of the front and back two times respectively. θ _k And theta _j The thresholds at output node k and hidden node j, respectively. Alpha and beta are learning parameters, respectively. Delta _k And delta _j The error signals of the output layer node k and the hidden node j are respectively calculated as follows:

δ _k ＝(T _k -O _k )O _k (1-O _k )

wherein T is _k For the target output value of the sample at the output node, O _k And H _j The actual output values of the samples at the output node and hidden node of the network are respectively.

Wherein the hidden layer output calculation formula is:

wherein M is the number of input nodes. f is an S-type activation function:

the output layer output uses linear weighted summation:

in the formula, s is the number of hidden nodes.

In step S104, the coding manner provided in the embodiment of the present invention specifically includes:

and placing all the agent groups in a two-dimensional vector space, establishing a Laplace matrix, wherein the boundary of the aggregation space is the boundary of the matrix, and establishing a space rectangular coordinate system by using the lower left corner boundary, wherein the coordinates of the agent groups in the matrix are the agent group codes.

In step S105, the agent code provided in the embodiment of the present invention specifically includes:

the coding priority of the agent in the group is as follows: high, low, up, down, left and right, and adopts relative position coding by taking the intelligent agent with the strongest calculation force as the center.

In the embodiment of the present invention, step S102 combines relatively close-located and relatively stationary agents into a group to simplify the control model,

selecting an agent evaluation model, calculating a predictive value of each frame of a relatively static agent by comparing an agent with a relatively static agent at a relatively near position, and marking the obtained frame fraction as a vector X and the total frame number of the agent as N.

And the window length of the sliding window is winLen, the sliding window processing is carried out on the obtained frame level quality fraction, namely the frame level fraction of the n-th frame after processing is the average value of the frame level fractions of the [ n-winlen+1, n ] frames, and the frame level fraction after the sliding window processing is marked as a vector WX.

And sequencing WX from small to large, marking the sequenced result as WX', taking the average value of the worst p% frames as the quality measurement value of the whole agent topological structure sequence, sequencing, and taking the minimum p% frame average value as the final measurement result.

/>

wherein, winLen represents window length during sliding window filtering, which is a parameter to be adjusted, X (t) represents quality fraction of the t-th frame, and WX (n) represents quality fraction of the n-th frame after sliding window processing.

And uniformly processing the obtained final evaluation result of the quality of the intelligent agent, and constructing a uniform control model.

And carrying out unified processing on the obtained final evaluation result of the quality of the intelligent agent, wherein the unified processing comprises data selection, data preprocessing and data transformation. The data transformation comprises data transformation dimension reduction, wherein the data transformation dimension reduction is to cut down the data dimension of the features extracted from the initial features of the data through cluster analysis.

The process for constructing the unified control model comprises the following steps: selecting reference alarm information parameter vector and establishing reference sequence X ₀ ，

X ₀ ＝{X ₀ (k)|k＝1,2,…,n}＝(X ₀ (1),X ₀ (2),…,X ₀ (n))

Wherein w is ₁ The corresponding weight of each parameter is adjusted and decided according to the network attribute of the user; where ρ is the resolution factor and, ρ is e [0 ], ++ infinity a) is provided; the larger the ρ, the greater the resolution; the smaller the ρ, the smaller the resolution;

Establishing a unified control model further comprises the steps of establishing a mapping relation between intelligent agent alarm information and faults between networks through association analysis, establishing a fault positioning model, and performing network fault positioning through a BP neural network; the method comprises the following steps:

first, an m-dimensional alert vector is obtainedQ _n ＝(s ₁ ,s ₂ ,s ₃ …s _m ) And n-dimensional fault vector O _n ＝(p ₁ ,p ₂ ,p ₃ …p _m ) The system is enabled to have a parallel structure and parallel processing capability by simultaneously inputting the system through a plurality of network nodes, and the input is dynamically processed in real time;

The foregoing description is only of the preferred embodiments of the present invention, and is not intended to limit the invention in any way, but any simple modification, equivalent variation and modification of the above embodiments according to the technical principles of the present invention are within the scope of the technical solutions of the present invention.

Claims

1. A multi-agent system packet enclosure control method, characterized in that the multi-agent system packet enclosure control method comprises:

step three, the control model adopts a centralized control method, and the intelligent agent group with the largest intelligent agent number in the intelligent agent group is used as the central control intelligent agent group of the control model;

step four, the grouped agent groups are encoded;

fifthly, performing agent coding in the agent grouping;

step six, determining the intelligent agent group and the intelligent agent according to the codes, and performing corresponding control operation;

in the fourth step, the coding mode is: placing all the intelligent agent groups in a two-dimensional vector space, establishing a Laplace matrix, wherein the boundary of the aggregation space is the boundary of the matrix, and establishing a space rectangular coordinate system by using the lower left corner boundary, wherein the coordinates of the intelligent agent groups in the matrix are the intelligent agent group codes;

2. The multi-agent system packet enclosure control method of claim 1, wherein step two combines relatively close-located and relatively stationary agents into a group to simplify the control model,

3. The multi-agent system packet enclosure control method of claim 2,

4. The multi-agent system packet enclosure control method of claim 3,

and carrying out unified processing on the obtained final evaluation result of the quality of the intelligent agent, wherein the unified processing comprises data selection, data preprocessing and data transformation.

5. The multi-agent system packet enclosure control method of claim 4,

the data transformation comprises data transformation dimension reduction, wherein the data transformation dimension reduction is to cut down the data dimension of the features extracted from the initial features of the data through cluster analysis.

6. The multi-agent system packet enclosure control method of claim 3,

X ₀ ＝{X ₀ (k)|k＝1，2，…，n}＝(X ₀ (1)，X ₀ (2)，…，X ₀ (n))

X _i ＝{X _i (k)|k＝1，2，…，n}＝(X _i (1)，X _i (2)，…，X _i (n))i＝1，2，…，m

Where ρ is the resolution factor and, ρ is e [0 ], ++ infinity a) is provided; the larger the ρ, the greater the resolution; the smaller the ρ, the smaller the resolution;

7. The multi-agent system grouping and surrounding control method of claim 3, wherein constructing a unified control model further comprises establishing a mapping relation between agent alarm information and faults between networks through association analysis, constructing a fault location model, and performing network fault location through a BP neural network; the method comprises the following steps:

first, an m-dimensional alert vector Q is obtained _n ＝(s ₁ ，s ₂ ，s ₃ …s _m ) And n-dimensional fault vector O _n ＝(p ₁ ，p ₂ ，p ₃ …p _m ) The system is enabled to have a parallel structure and parallel processing capability by simultaneously inputting the system through a plurality of network nodes, and the input is dynamically processed in real time;

8. A multi-agent system packet enclosure control system implementing the multi-agent system packet enclosure control method of claim 1.