CN110815225B

CN110815225B - Point-to-point iterative learning optimization control method of motor-driven single mechanical arm system

Info

Publication number: CN110815225B
Application number: CN201911119240.2A
Authority: CN
Inventors: 陶洪峰; 李健; 黄彦德
Original assignee: Jiangnan University
Current assignee: Zhongkexin Micro Intelligent Equipment Shenyang Co ltd
Priority date: 2019-11-15
Filing date: 2019-11-15
Publication date: 2020-12-25
Anticipated expiration: 2039-11-15
Also published as: CN110815225A

Abstract

The invention discloses a point-to-point iterative learning optimization control method of a motor-driven single mechanical arm system, and relates to the field of mechanical arm optimization control. The method comprises the steps of converting a single mechanical arm system which runs repeatedly into an input and output matrix model of a time sequence based on a lifting technology, then designing a point-to-point trajectory tracking robust iterative learning optimization method based on performance indexes, obtaining an optimized iterative learning control law by solving a quadratic optimal solution of a multi-target performance index function, proving the robust convergence of an algorithm when bounded uncertainty exists in the model according to a maximum singular value theory, and proving the convergence of the mechanical arm system under the condition that input is constrained.

Description

Point-to-point iterative learning optimization control method of motor-driven single mechanical arm system

Technical Field

The invention relates to the field of mechanical arm optimization control, in particular to a point-to-point iterative learning optimization control method of a motor-driven single mechanical arm system.

Background

The robot arm is an automatic operation device which can simulate some action functions of a human arm and is used for grabbing, carrying objects or operating tools according to a fixed program. It can replace the heavy labor of people to realize the mechanization and automation of production, and can be operated under harmful environment to protect the personal safety, thus being widely applied to the departments of mechanical manufacturing, metallurgy, electronics, light industry, atomic energy and the like.

A robot arm often does not need to track a complete desired trajectory when performing repetitive process tasks, but only needs to track a given desired value at certain specific points in time, such as "pick" and "put" operations of the robot arm, and only needs to focus on the outputs of the pick and place points, while the outputs at other points in time often do not need much consideration, and this type of control problem for a robot arm is called point-to-point tracking control.

There are two common solutions to the point-to-point tracking problem: firstly, an arbitrary track passing through a specific tracking point is designed, so that the problem is converted into general full-track tracking, but the method has limitations, enough prior knowledge is needed to determine the optimal fixed reference track, and the degree of freedom of a non-tracking time point is not fully utilized to design a controller; the other is a point-to-point control method based on reference track updating, although the convergence is faster than that of a fixed reference track method, the method is essentially a full-track tracking problem, the full-track tracking needs to track the output of the whole track, and some industrial processes such as mechanical arm operation only need to track the expected values of a plurality of points on the track to meet the control requirements.

Iterative Learning Control (ILC) is a high-performance control method widely used for executing repetitive tasks, and a point-to-point ILC method arises from the above point-to-point tracking problem.

Disclosure of Invention

The invention provides a point-to-point iterative learning optimization control method of a motor-driven single mechanical arm system aiming at the problems and the technical requirements, and the tracking error information at the appointed time point is used for updating the input, so that unnecessary non-key point tracking constraint is eliminated, and the degree of freedom at the non-key point increases the degree of freedom for algorithm design and simultaneously increases the promotion space of the overall performance of the system.

The technical scheme of the invention is as follows:

the point-to-point iterative learning optimization control method of the motor-driven single mechanical arm system comprises the following steps:

establishing a model of a motor-driven single mechanical arm control system; constructing a discrete state space equation of the motor-driven single mechanical arm control system; establishing a point-to-point trajectory tracking model; designing a point-to-point iterative learning trajectory tracking optimization algorithm; analyzing the convergence and robustness of the point-to-point iterative learning trajectory tracking optimization algorithm; analyzing the convergence and robustness of the point-to-point iterative learning trajectory tracking optimization algorithm under the input constraint condition; point-to-point trajectory tracking of the motor-driven single mechanical arm control system under the condition of input constraint is realized;

firstly, establishing a dynamic model of a motor-driven single mechanical arm control system:

the actual physical model of the motor-driven single mechanical arm is shown as the formula (1):

wherein N is_l＝m₂gl+m₁gl，

g is gravitational acceleration, theta is connecting rod angle, i is current, K_tIs torque, K_bIs the back electromotive force coefficient, B_cIs the viscous friction coefficient of the bearing, D_cIs the load factor, l is the link length, m₁For loading mass, m₂For connecting rod mass, R_rIs a resistor, u is a motor control voltage, xi is an actuator rotational inertia and is a reactor;

secondly, constructing a discrete state space equation of the motor-driven single mechanical arm system:

the method comprises the following steps of defining state variables by using a connecting rod angle and a current of a motor-driven single mechanical arm control system:

defining the motor control voltage u as an input variable, a motor-driven single robot control system shown in equation (1) can be described as:

obviously, the formula (2) is a continuous system model, so the formula (2) needs to be discretized, and a sampling period T meeting the Shannon sampling theorem is selected_sAnd taking the connecting rod angle theta as output, further obtaining a discrete state space model of the motor-driven single mechanical arm control system:

wherein T and k represent sampling time and batch respectively, and the time T epsilon [0, T of the batch process in a running period T]The range contains N sampling points; u. of_k(t)，y_k(t) and x_k(t) input, output and state vectors of the motor-driven single mechanical arm control system at the kth batch time t are respectively; a, B and C are discrete system parameter matrixes corresponding to the formula (2), and CB is not equal to 0; and assuming the initial state of system operation to be consistent, i.e. x_k(0)＝x₀；

Step three, establishing a point-to-point trajectory tracking model:

for a linear discrete system of the form of equation (3), its state space expression can be converted into a time-series input-output matrix model:

y_k＝Gu_k+d (4)

wherein:

d＝[CA CA² CA³ … CA^N]^Tx₀

u_k＝[u_k(0),u_k(1),...,u_k(N-1)]^T

y_k＝[y_k(1),y_k(2),...,y_k(N)]^T

g is an input and output transfer matrix on a time sequence, and d is the influence of the initial state of the system on the output;

in the design of the traditional iterative learning control algorithm, a reference track is usually fixed, and a track tracking target requires that the output of the system is gradually close to a set expected output value, namely y, at each moment along with the continuous running of a batch process_k(t)→y_d(t), where t ∈ {0,1,2, …, N }, y_k(t) actual output of the system at time t of the kth batch, y_d(t) is the system expected output at time t for the kth batch; for the point-to-point tracking problem, only the expected values of some specified key time points, namely y, need to be tracked_k(t_i)＝y_d(t_i) I is 1,2, …, M, and has a value of 0. ltoreq.t₁<t₂<…<t_MN is less than or equal to N; assume that the expected values for these M tracking points are:

y_dM＝[y_d(t₁),y_d(t₂),...,y_d(t_M)]^T (5)

defining a matrix psi as a tracking point selection matrix of M rows and N columns, where psi_ijFor the ith row in the matrix psij elements of column, when tracking time t at i_iWhen selected, all N elements of the ith row of the matrix are divided by j to t_iThe time is 1, and the rest are all 0, and M tracking time points are selected, so that psi y can be obtained_kActual output values of M tracking time points selected for N sampling time points of the entire trajectory are expressed in the following specific form:

the tracking error at a tracking point can thus be defined as:

e_kM＝y_dM-ψy_k (7)

wherein,

e_kM＝[e_k(t₁),e_k(t₂),...,e_k(t_M)]^T (8)

fourthly, designing a point-to-point iterative learning trajectory tracking optimization algorithm:

in practical engineering application, the system inputs u_k+1Generally reflecting the energy consumed in the control process, the smaller control energy and the control oscillation thereof are crucial to the loss of the controller and the final actuator thereof, so that the optimization condition of the control quantity needs to be considered on the basis of the error in the optimization process, and the following performance index function is considered:

wherein Q is a symmetric positive definite weight array, R and S are corresponding symmetric non-negative definite weight arrays, and

substituting the formula (4), the formula (7) and the formula (10) into the formula (9), and obtaining an optimized iterative learning control law by solving a quadratic optimal solution of the formula:

u_k+1＝L_uu_k+L_ee_kM (11)

wherein,

L_u＝[(ψG)^TQ(ψG)+R+S]^-1[(ψG)^TQ(ψG)+R]，L_uis the gain of the iterative learning control law controller input term,

L_e＝[(ψG)^TQ(ψG)+R+S]^-1(ψG)^TQ，L_eis the gain of the error term;

fifthly, analyzing the convergence and robustness of the point-to-point iterative learning trajectory tracking optimization algorithm:

if the optimized iterative learning control law (11) meets the condition eta | | | | L_u-L_e(ψG)||_i2<1，||·||_i2Representing the maximum singular value of the matrix, then when the iteration batch k → ∞ the tracking output error of the system at the finite expected tracking point converges;

the formula (11) can be rewritten from the formulae (4) and (7):

defining system steady state control inputs

According to formula (12):

u_∞-u_k+1＝(L_u-L_e(ψG))(u_∞-u_k)+ξ (13)

wherein xi ═ I- (L)_u-L_e(ψG)))u_∞-L_e(y_dMPhi d) is a zero matrix, and the norm of two sides of the formula (13) is taken to obtain phi u_∞-u_k+1||＝||(L_u-L_e(ψG))(u_∞-u_k)+ξ||≤η||u_∞-u_kIf the condition eta is satisfied<1, then the control input converges in the norm sense, further yielding:

where I is the identity matrix of the corresponding dimension, then:

when each weight matrix parameter is determined, the steady-state error norm can be obtained by the equation (15) to be converged to a constant value finally, if S is smaller, the point-to-point tracking steady-state error of the system under the action of the optimization iterative learning control law (11) is smaller, and particularly when S is taken as a zero matrix, the tracking output error norm at the tracking point of the system can be converged to zero;

in the actual process production process, a controlled system usually has certain modeling uncertainty; considering the uncertainty Δ of the original system G plus a variation in the time domain_GAnd further defining the expression of the uncertain controlled system as G_θ＝G(I+Δ_G) And define Δ_GThe bounded condition of the uncertainty factor is | | | Δ | | | luminance_i2≤1，

The conditions for the system robust convergence are:

||L_u-L_e(ψG_θ)||_i2<1

if the symmetric non-negative constant weight matrix R is taken as a zero matrix, the following formula is provided:

wherein W is an uncertain weight matrix;

it can be obtained if the parameters satisfy the condition:

||[(ψG)^TQ(ψG)+S]^-1(ψG)^TQ(ψG)W||_i2<1 (16)

the system robustness converges; if the symmetric non-negative fixed weight array R is equal to rI, R is more than or equal to 0, if the condition (16) is met, robust convergence of the linear discrete system (3) under the action of the optimized iterative learning control law (11) can still be ensured; because (psi G)^TQ (psi G) + S is a symmetric positive definite matrix, so that it can be decomposed into singular values by (psi G)^TQ(ψG)+S＝U∑U^TWhere U is unitary matrix and Σ is element σ_iThe full-rank diagonal matrix of (1) is expressed as Σ ═ diag { σ }_iJ, for convenience of representation, let

And conditions of

Satisfy the requirement of obtaining | | H | non-conducting phosphor_i2＝α<1。

Then one can get:

if the condition (16) is satisfied, the system robustness is converged, that is, the design of the parameter R ═ rI does not affect the robustness of the system, and on the other hand, the design of the parameter S needs to satisfy the condition (16);

sixthly, analyzing the convergence and robustness of the point-to-point iterative learning trajectory tracking optimization algorithm under the input constraint condition:

in many industrial process control applications, in order to ensure that an industrial process operates safely and smoothly, certain constraints need to be imposed on input variables, the input signal constraints include constraints on input amplitude, constraints on input amplitude with respect to time direction and batch direction, and the constraints are often expressed in inequalities;

saturation constraint of controller input:

u^low≤u_k+1≤u^hi (18)

wherein u is^low，u^hiAre respectively controller input u_k+1The lower and upper bound values of (1);

input variation constraint between two adjacent sampling times:

u^low≤u_k+1≤u^hi (19)

wherein u is^low、u^hiAre respectively an input variable u_k+1Lower and upper bound values, u, varying along the time axis_k+1(t)＝u_k+1(t-1)-u_k+1(t-2)；

Input variation constraints between two adjacent batches:

Δu^low≤Δu_k+1≤Δu^hi (20)

wherein Δ u^low、Δu^hiAre respectively input variables Δ u_k+1Lower and upper bound values, Δ u, that vary along the axis of iteration_k+1＝u_k+1-u_k；

All of the above constraint equations can be converted to Δ u_k+1First, the saturation constraint (18) of the controller input can be translated into:

u^low-u_k≤Δu_k+1≤u^hi-u_k (21)

let u (0) be u₀Then u is_k+1Can be expressed as:

u_k+1＝μu_k+1 (22)

wherein:

the input variation constraint (19) between two adjacent sampling times can therefore be translated into:

u^low-μu_k≤μΔu_k+1≤u^hi-μu_k (24)

the constraints described above can be combined into the following linear inequalities:

ζ^uΔu_k+1≥ζ_k+1 (25)

wherein:

theoretically, the input described by equation (25)Income Delta u_k+1The constraint of (1) is a convex set, here denoted by Ω; the error equation for model uncertainty can be expressed as:

e_(k+1)M＝e_kM-ψG_θΔu_k+1 (26)

wherein G is_θIs an input-output transfer matrix with uncertainty on a time sequence, corresponding to a deterministic system G;

substituting equation (26) into the performance indicator function (9) yields:

and (3) convergence and robustness analysis:

under the constraint, assume that condition 1) there is one feasible expected input u^∞And corresponding expected error e _∞0, and satisfies u^low≤u^∞≤u^hi，u^low≤u^∞≤u^hiAnd Δ u^low<0，Δu^hi>0; assuming the condition 2) Q is a symmetrical positive definite weight array, and R and S are corresponding symmetrical non-negative definite weight arrays; the system robustness converges and the system steady state error converges to 0 when S takes the zero matrix, i.e., k → ∞ Δ u_k→0，e_kM→0；

Defining the optimal performance indicator function of the (k + 1) th batch as:

first consider Δ u_k+1Case 0, when u_k+1＝u_k，e_(k+1)M＝e_kM,u_k+1＝u_kSo that (e) can be derived_kM,0,u_k) E Ω, because the performance indicator function is at point (e)_kM,0,u_k) The value of (b) is always greater than or equal to the optimum value, so the following relationship can be obtained:

by analogy, the following results are obtained:

as is clear from the formula (30), Δ u when k → ∞ is satisfied_k→ 0 because Δ u when k is sufficiently large_k→ 0, and satisfies the condition Δ u^low<0，Δu^hi>0, so the constraint Δ u^low≤Δu_k+1≤Δu^hiThe same is satisfied; from the assumption condition 1) that u is known_k+1＝u^∞Time e_(k+1)MWhen the value is equal to 0, then

Is also a feasible point within a restricted area, wherein

Is obvious (e)_(k+1)M,Δu_k+1,u_k+1) Is also a feasible point in a constrained region, because Ω is a convex set, it can be deduced that a point between any two feasible points in Ω is a feasible point;

meanwhile, the performance index function J is solved_k+1From (e)_(k+1)M,Δu_k+1,u_k+1) To

The directional derivatives of (a):

from equation (31):

since Δ u when k → ∞ is_k→ 0, so when k → ∞ time formula(32) Can be simplified as follows:

as can be seen from equation (33) and assumption condition 2), when k → ∞ is reached, the steady-state output error of the system converges to a certain finite value, and particularly when S takes a zero matrix, the steady-state output error reaches a minimum value and becomes 0; the result shows that when the assumed conditions are satisfied, the input constraint model uncertain system can still be converged under the action of the optimization iterative learning control law (11); in particular, when the system is a nominal system, i.e. G_θThe same holds true when G;

and seventhly, determining the input vector of each iteration batch of the motor-driven single mechanical arm system according to a robust optimization iterative learning control law, inputting the obtained input vector into the motor-driven single mechanical arm system to perform point-to-point trajectory tracking control, and tracking the expected output at the specified tracking point by the motor-driven single mechanical arm system under the control action of the input vector.

The beneficial technical effects of the invention are as follows:

the application discloses a method for designing a point-to-point trajectory tracking iterative control algorithm aiming at a linear system with repetitive motion characteristics, such as a motor-driven single mechanical arm control system, wherein the motor-driven single mechanical arm control system is used as a controlled object, the control algorithm can update control input by utilizing tracking error information at a limited time point of trajectory tracking, so that unnecessary non-key point tracking constraint is eliminated, and the degree of freedom at the non-key point increases the degree of freedom for ILC algorithm design and simultaneously increases the promotion space of the overall performance of the system. The method is based on a norm optimal iterative learning framework, a multi-point multi-target performance function under the condition of input constraint is introduced, an iterative learning control law is obtained by solving a quadratic optimal solution of a target function, meanwhile, the convergence condition of a robust iterative learning algorithm is analyzed under the condition that a system model has uncertainty, and bounded convergence of a system steady-state tracking output error at a limited expected time point is guaranteed.

Drawings

Fig. 1 is a model block diagram of a motor-driven single robot control system disclosed in the present application.

FIG. 2 is a graph of actual output versus reference point tracking for the motor driven single robot control system of the present application.

FIG. 3 is a graph comparing nominal and uncertainty system root mean square error under no input constraints in the present application.

FIG. 4 is a graph comparing nominal and uncertainty system performance indicators without input constraints in the present application.

Fig. 5 is a comparative plot of nominal system root mean square error with and without input constraints in the present application.

FIG. 6 is a graph comparing nominal system performance indicators with and without input constraints in the present application.

FIG. 7 is a graph comparing nominal and uncertainty system root mean square error under the input constraints of the present application.

FIG. 8 is a graph comparing nominal and uncertainty system performance indicators under the input constraints of the present application.

Detailed Description

The following further describes the embodiments of the present invention with reference to the drawings.

Referring to fig. 1, referring to fig. 8, a model block diagram of a motor-driven single robot control system disclosed in the present application is shown. Controller input for the k-th batch is u_kActing on the robot to obtain the actual output y of the k-th lot of the system_kThe actual output at the specified tracking point is obtained via the tracking point selector, and the error from the set desired value stored in the desired trajectory memory is transmitted to e_kMComparing the error with a set precision value, and if the error does not reach the set precision, determining the error e_kMWith current controller input u_kController input u passed to the optimized iterative learning controller to generate the next batch_k+1And the operation is circularly performed until the error between the actual output and the expected value of the system reaches the precision requirement, and the operation is stopped, wherein the input of the controller at the moment is the optimal control input.

For the actual physical model of the motor-driven single mechanical arm shown in the formula (1), variable parameters are respectively set as:

K_t＝1N·m,K_b＝0.085V·s/rad,

R_r＝0.075Ω,B_c＝0.015kg·m²/s,D_c＝0.05,

l＝0.6m,m₁＝0.05kg,m₂＝0.01kg,

Ξ＝0.05kg·m²,＝0.0008Ω,g＝9.8m/s²。

the system simulation time is set to be T-2 s, and the sampling time is set to be T_s0.1s, the parameter matrices of the discrete state space expression of the system are respectively:

C＝[1 0 0]

without loss of generality, 5 points (M is 5) in a sampling interval are taken to carry out point-to-point tracking control on the system, and the selected sampling point is t_iE { 26101418 }, i.e., tracking times of 0.2s, 0.6s, 1s, 1.4s, 1.8s, respectively, the expected values at the tracking points are set to:

y_dM＝[1.5 0.3 0 -0.3 1.5]^T

in units of rad, while setting the initial state to x_k(0)＝[-0.01 0.02 -0.03]^T. The symmetric positive definite matrix Q and the symmetric nonnegative definite matrix R and S are respectively selected as Q being 10I, R being 0.5I and S being 0.001I. Optimizing L in an iterative learning control law (11) when a symmetric positive definite matrix Q, a symmetric non-negative definite matrix R, S and a sampling point are determined_u，L_eAs determined accordingly. The optimized iterative learning controller is realized on a STM32F103RCT6 chip, and the input of the chip is motor control voltage u which is acquired by a voltage sensor. The input signal enters an STM32F103RCT6 chip through a conditioning circuit to be stored and calculated, an iterative learning updating law is constructed, and the signal obtained after CPU calculation is an optimal control input signal u_k+1The input signal acts on the motor through the RS232 communication moduleAnd driving a single mechanical arm control system, and continuously correcting the tracking track of the mechanical arm until the expected track at the specified point on the tracking. When the dynamic model (1) of the motor-driven single-robot control system operates, please refer to fig. 2, which shows that when the motor-driven single-robot system is accurately modeled, i.e., is a nominal system, the system applies a point-to-point trajectory tracking effect diagram of an optimized iterative learning control law (11), and after a certain batch k, the output value of the system at a specified tracking point can be accurately tracked to a desired value, and fig. 3 and 4 show that the system can be converged after a certain iterative batch. When each physical parameter of the motor-driven single mechanical arm system model has a certain deviation, a parameter matrix after the system is dispersed has a certain uncertainty, and then the uncertainty delta exists in a system matrix G, wherein delta is a bounded lower triangular random number matrix. When the physical parameters of the system are changed, the uncertainty delta is between-0.8 and 0.2]At this time, fig. 3 and fig. 4 respectively show comparison graphs of the root mean square error and the performance index of the nominal system and the uncertain system respectively without input constraint, which shows that when parameters of the motor-driven single mechanical arm system are uncertain, the optimized iterative learning control law can still ensure system convergence, the rate of convergence of the root mean square error and the performance index of the nominal system is faster than that of the uncertain system, and the rationality and the effectiveness of the algorithm are verified. The input of the motor-driven single mechanical arm system can be restrained in the actual control process, and the input amplitude restraint is considered to be applied to the system, namely the range of the motor control voltage is [ -0.9V]Fig. 5 and fig. 6 are comparison graphs of the root mean square error and the performance index of the nominal system with or without input constraint, and fig. 7 and fig. 8 are comparison graphs of the root mean square error and the performance index of the nominal system with or without input constraint, respectively, which indicate that the input signal of the system can still complete the point-to-point tracking task after being subjected to certain constraint conditions, but the convergence speed of the root mean square error curve of the system is slower than that of the system without constraint, but is finally convergent, thereby further verifying the rationality and the effectiveness of the algorithm.

According to the method, the point-to-point tracking problem of the repeated process of the motor-driven mechanical arm system is solved by directly obtaining the optimal control input signal through the optimization algorithm without introducing a reference track, and the iterative learning algorithm and the optimization algorithm are combined to construct the optimization controller. When uncertainty of certain conditions exists in system modeling, an optimization algorithm based on performance indexes has robustness on the uncertainty of the model, and in addition, when a motor drives a mechanical arm system to execute repeated process tasks, input signals of a controller can be restrained. The point-to-point robust iterative learning optimization method can ensure that the high-precision tracking of the expected track can be realized no matter whether the input of the motor-driven single mechanical arm system is constrained or not and under the condition that certain uncertainty exists in modeling.

What has been described above is only a preferred embodiment of the present application, and the present invention is not limited to the above embodiment. It is to be understood that other modifications and variations directly derivable or suggested by those skilled in the art without departing from the spirit and concept of the present invention are to be considered as included within the scope of the present invention.

Claims

1. The point-to-point iterative learning optimization control method of the motor-driven single mechanical arm system is characterized by comprising the following steps of: establishing a physical model of the motor-driven single mechanical arm system; constructing a discrete state space model of the motor-driven single mechanical arm system; establishing a point-to-point trajectory tracking model; designing a point-to-point iterative learning trajectory tracking optimization algorithm; analyzing the convergence and robustness of the point-to-point iterative learning trajectory tracking optimization algorithm; analyzing the convergence and robustness of the point-to-point iterative learning trajectory tracking optimization algorithm under the input constraint condition; point-to-point trajectory tracking of the motor-driven single mechanical arm system under the condition of input constraint is realized;

firstly, establishing a physical model of the motor-driven single mechanical arm system:

the physical model of the motor-driven single mechanical arm system is shown as the formula (1):

wherein N is_l＝m₂gl+m₁gl，

secondly, constructing a discrete state space model of the motor-driven single mechanical arm system:

defining a state variable by using a link angle and a current of the motor-driven single-robot arm system:

defining the motor control voltage u as an input variable, a motor-driven single robot system represented by equation (1) can be described as:

obviously, the formula (2) is a continuous system model, so the formula (2) needs to be discretized, and a sampling period T meeting the Shannon sampling theorem is selected_sAnd taking the connecting rod angle theta as output, further obtaining a discrete state space model of the motor-driven single mechanical arm system:

in the formula, T and k respectively represent sampling time and batches, and the batch process contains N sampling points within a time range in one operation period T, namely T belongs to {0,1,2, …, N }; u. of_k(t)，y_k(t) and x_k(t) the kth lot of the motor-driven single robot systemInputting, outputting and state vectors at the time t; a, B and C are discrete system parameter matrixes corresponding to the formula (2), and CB is not equal to 0; and assuming the initial state of system operation to be consistent, i.e. x_k(0)＝x₀；

Thirdly, establishing the point-to-point track tracking model:

for a discrete state space model of the form of equation (3), its state space expression can be converted into a time-series input-output matrix model:

y_k＝Gu_k+d (4)

wherein:

d＝[CA CA² CA³…CA^N]^Tx₀

u_k＝[u_k(0),u_k(1),...,u_k(N-1)]^T

y_k＝[y_k(1),y_k(2),...,y_k(N)]^T

in the design of the traditional iterative learning control algorithm, a reference track is usually fixed, and a track tracking target requires that the output of the system is gradually close to a set expected output value, namely y, at each moment along with the continuous running of a batch process_k(t)→y_d(t) wherein y_k(t) actual output of the system at time t of the kth batch, y_d(t) is the system expected output at time t for the kth batch; for the point-to-point tracking problem, only the expected values of some specified key time points, namely y, need to be tracked_k(t_i)＝y_d(t_i) I is 1,2, …, M, and has a value of 0. ltoreq.t₁<t₂<…<t_MN is less than or equal to N; assume that the expected values for these M tracking points are:

y_dM＝[y_d(t₁),y_d(t₂),...,y_d(t_M)]^T (5)

defining a matrix psi as a tracking point selection matrix of M rows and N columns, where psi_ijFor the ith row and jth column element in the matrix psi, when tracking the time point t at the ith_iWhen selected, all N elements of the ith row of the matrix are divided by j to t_iThe time is 1, and the rest are all 0, and M tracking time points are selected, so that psi y can be obtained_kActual output values of M tracking time points selected for N sampling time points of the entire trajectory are expressed in the following specific form:

the tracking error at a tracking point can thus be defined as:

e_kM＝y_dM-ψy_k (7)

wherein,

e_kM＝[e_k(t₁),e_k(t₂),...,e_k(t_M)]^T (8)

fourthly, designing the point-to-point iterative learning trajectory tracking optimization algorithm:

in practical applications, the tracking error e_(k+1)MAnd system input u_k+1The convergence performance and energy consumption of each running batch of the system are usually reflected, and the smaller tracking error, control energy and control oscillation thereof are important to the overall performance of the system and the loss of an actual actuator, so that the error and the control quantity need to be considered and optimized in the iterative learning control optimization process; the following performance indicator functions are defined:

u_k+1＝L_uu_k+L_ee_kM (11)

wherein,

L_e＝[(ψG)^TQ(ψG)+R+S]^-1(ψG)^TQ，L_eis the gain of the error term;

the formula (11) can be rewritten from the formulae (4) and (7):

defining system steady state control inputs

According to formula (12):

u_∞-u_k+1＝(L_u-L_e(ψG))(u_∞-u_k)+ξ (13)

wherein xi ═ I- (L)_u-L_e(ψG)))u_∞-L_e(y_dMψ d) is a zero matrix, and two sides of the formula (13)Taking norm can obtain | | | u_∞-u_k+1||＝||(L_u-L_e(ψG))(u_∞-u_k)+ξ||≤η||u_∞-u_kIf the condition eta is satisfied<1, then the control input converges in the norm sense, further yielding:

where I is the identity matrix of the corresponding dimension, then:

when each weight matrix parameter is determined, the steady state error norm can be obtained by the equation (15) to be converged to a constant value finally, if S is smaller, the point-to-point tracking steady state error of the system under the action of the optimization iterative learning control law (11) is smaller, and when S is taken as a zero matrix, the tracking output error norm at the tracking point of the system can be converged to zero;

in the actual process production process, a controlled system usually has certain modeling uncertainty; considering the uncertainty Δ of the input/output transfer matrix G in time series plus a variation in the time domain_GAnd further defining the expression of the input-output transfer matrix with uncertainty in the time series as G_θ＝G(I+Δ_G) And define Δ_GThe bounded condition of the uncertainty factor is | | | Δ | | | luminance_i2≤1，

The conditions for the system robust convergence are:

||L_u-L_e(ψG_θ)||_i2<1

wherein W is an uncertain weight matrix;

it can be obtained if the parameters satisfy the condition:

||[(ψG)^TQ(ψG)+S]^-1(ψG)^TQ(ψG)W||_i2<1 (16)

the system robustness converges; if the symmetric non-negative fixed weight array R is equal to rI, R is more than or equal to 0, if the condition (16) is met, robust convergence of the discrete state space model (3) under the action of the optimized iterative learning control law (11) can still be ensured; because (psi G)^TQ (psi G) + S is a symmetric positive definite matrix, so that it can be decomposed into singular values by (psi G)^TQ(ψG)+S＝U∑U^TWhere U is unitary matrix and Σ is element σ_iThe full-rank diagonal matrix of (1) is expressed as Σ ═ diag { σ }_iJ, for convenience of representation, let

And conditions of

Satisfy the requirement of obtaining | | H | non-conducting phosphor_i2＝α<1，

Then one can get:

sixthly, analyzing the convergence and the robustness of the point-to-point iterative learning trajectory tracking optimization algorithm under the input constraint condition:

saturation constraint of controller input:

u^low≤u_k+1≤u^hi (18)

input variation constraint between two adjacent sampling times:

u^low≤u_k+1≤u^hi (19)

Input variation constraints between two adjacent batches:

Δu^low≤Δu_k+1≤Δu^hi (20)

All of the above constraint equations can be converted to Δ u_k+1First, the saturation constraint (18) of the controller input may translate into:

u^low-u_k≤Δu_k+1≤u^hi-u_k (21)

let u (0) be u₀Then u is_k+1Can be expressed as:

u_k+1＝μu_k+1 (22)

wherein:

u^low-μu_k≤μΔu_k+1≤u^hi-μu_k (24)

ζ^uΔu_k+1≥ζ_k+1 (25)

wherein:

theoretically, the input Δ u described by equation (25)_k+1The constraint of (1) is a convex set, here denoted by Ω; the error equation for model uncertainty can be expressed as:

e_(k+1)M＝e_kM-ψG_θΔu_k+1 (26)

wherein G is_θThe input and output transfer matrix on the time sequence with uncertainty corresponds to the input and output transfer matrix G on the time sequence;

substituting equation (26) into the performance indicator function (9) yields:

and (3) convergence and robustness analysis:

under the constraint, assume that condition 1) there is one feasible expected input u^∞And corresponding expected error e_∞0, and satisfies u^low≤u^∞≤u^hi，u^low≤u^∞≤u^hiAnd Δ u^low<0，Δu^hi>0; assuming the condition 2) Q is a symmetrical positive definite weight array, and R and S are corresponding symmetrical non-negative definite weight arrays; the system robustness converges and the system steady state error converges to 0 when S takes the zero matrix, i.e., k → ∞ Δ u_k→0，e_kM→0；

Defining the optimal performance indicator function of the (k + 1) th batch as:

by analogy, the following results are obtained:

Is also a feasible point within a restricted area, wherein

The directional derivatives of (a):

from equation (31):

since Δ u when k → ∞ is_k→ 0, so when k → ∞ time equation (32) can be simplified to:

as can be seen from equation (33) and assumption condition 2), when k → ∞ is reached, the steady-state output error of the system converges to a certain finite value, and when S takes a zero matrix, the steady-state output error reaches a minimum value of 0; the result shows that when the assumed conditions are satisfied, the input constraint model uncertain system can still be converged under the action of the optimization iterative learning control law (11); when the system is a nominal system, i.e. G_θThe same holds true when G;

and seventhly, determining an input vector of each iteration batch of the motor-driven single mechanical arm system according to a robust optimization iterative learning control law, inputting the obtained input vector into the motor-driven single mechanical arm system to perform point-to-point trajectory tracking control, and tracking the expected output at the specified tracking point by the motor-driven single mechanical arm system under the control action of the input vector.