CN112114521B

CN112114521B - Intelligent prediction control entry guidance method for spacecraft

Info

Publication number: CN112114521B
Application number: CN202010751626.1A
Authority: CN
Inventors: 胥彪; 李翔; 冯建鑫; 李爽
Original assignee: Nanjing University of Aeronautics and Astronautics
Current assignee: Nanjing University of Aeronautics and Astronautics
Priority date: 2020-07-30
Filing date: 2020-07-30
Publication date: 2022-03-01
Anticipated expiration: 2040-07-30
Also published as: CN112114521A

Abstract

The invention discloses an intelligent prediction control entry guidance method for a spacecraft. Firstly, constraint conditions of spacecraft atmosphere entrance are considered, a guidance system is designed by adopting a nonlinear model prediction control method, an attenuation memory filter is introduced, a prediction model correction method based on error information estimation is provided, and robustness of the control system to model errors is enhanced by combining a variable prediction time domain strategy. And then, taking the guidance system as a guidance template, generating a sample data set, performing off-line training of a multilayer neural network, and approaching to the mapping relation between the real-time flight state of the spacecraft and the guidance instruction. And finally, in the guidance process, the trained neural network is used for replacing the process of solving the complex optimization problem and integral prediction, so that the time for solving the guidance instruction on line is reduced. Simulation results show that the guidance precision and the real-time performance of the guidance method provided by the invention can meet task requirements under the condition of larger errors such as model uncertainty.

Description

Intelligent prediction control entry guidance method for spacecraft

Technical Field

The invention relates to the technical field of spacecraft guidance, in particular to a spacecraft intelligent prediction control entry guidance method.

Background

The guidance of atmospheric admission is the precondition for the spacecraft to land the planet successfully and carry out various scientific researches and experiments. As the planet in the solar system, which is most similar to the earth, mars have become important target celestial bodies for deep space exploration in humans. The mars atmosphere is very thin, has great uncertainty, often appears weather such as wild wind, sand and dust. In the process of entering, descending and landing of the spacecraft, the atmospheric entering section lasts for the longest time, the working condition is the worst, the state of the spacecraft changes quickly, and the requirement on the deceleration performance is also high. As the Mars 'curious number' Mars vehicle adopts the closed-loop guidance method to successfully realize the Mars surface soft landing in the Mars atmosphere entrance section, the guidance method is adopted to realize the high-precision safe landing of the Mars landing task, and the Mars landing task in the future is inevitably selected.

At present, spacecraft atmosphere entry guidance methods are generally divided into reference trajectory guidance and prediction correction guidance. The reference track guidance method is to design a reference profile (such as a resistance acceleration profile) in an off-line manner according to requirements in advance, store the reference profile in a spacecraft computer, and design a guidance law according to real-time tracking errors to track the track on line. For example, the PID control law has been successfully applied to reentry guidance of Apollo spacecraft and space shuttle, but the gain is relatively complicated to calculate and is obtained under certain assumed conditions. The guidance law is designed by using the idea of feedback linearization, and the nonlinearity of the resistance dynamics can be counteracted by using a state feedback term. However, when there is a large model error and control saturation, the performance of trajectory tracking is not ideal. On the basis, some scholars propose that the control precision is improved by estimating the model error by using the sliding mode state observer, but due to the characteristics of the sliding mode, the obtained controlled variable has jitter, and great challenges are brought to a roll angle executing mechanism. Higher tracking accuracy can be obtained by using the active disturbance rejection control method to perform resistance tracking. However, none of the above methods takes into account the control quantity constraint. The prediction correction guidance method does not depend on a standard track, continuously predicts the terminal state in the flight process, then corrects the control quantity according to the deviation from the expected terminal state, has higher accuracy of the falling point and is insensitive to the entering initial condition. However, the guidance precision of the method depends on the precision of the online model to a great extent, the method has high calculation complexity and needs to rely on strong computer performance to improve the calculation speed, and the real-time performance of online guidance is difficult to ensure.

The model prediction control method capable of processing constraint on line is applied to the problem of guidance of atmosphere entry in recent years, and can well solve the problem of control saturation existing in the process of tracking a reference track. For a complex nonlinear model of an atmospheric inlet section, most scholars design a guidance system by utilizing a linear prediction control method after linearizing the model, and the landing precision of the guidance system is influenced to a certain extent by the strategy. For the prediction control of the nonlinear model, a nonlinear prediction guidance law of a Mars atmosphere entry section based on resistance tracking is provided in the 2 nd, 137 nd and 143 nd page & ltnonlinear prediction guidance law design of Mars atmosphere entry section based on resistance tracking & gt volume 2 in 2015 of journal, the nonlinear prediction control algorithm with the analytic solution can realize safe landing under the condition of meeting the control constraint, but the Taylor series expansion is adopted to approximately predict future output, the prediction precision is not high under the condition of large model uncertainty, and the guidance precision can be reduced. The Nonlinear Model Predictive Control (NMPC) method for solving the corresponding Nonlinear programming problem on line by using a numerical optimization algorithm can avoid the influence of Model mismatch caused by adopting a linear Model, has higher precision, but greatly increases the complexity of calculation and can cause instruction delay of a Control system.

The artificial neural network is an algorithmic mathematical model simulating animal neural network behavior characteristics and performing distributed parallel information processing. The network achieves the purpose of processing information by adjusting the mutual connection relationship among a large number of internal nodes depending on the complexity of the system, and has self-learning and self-adapting capabilities. And the artificial neural network designed aiming at specific complex problems can exert the high-speed computing capability of the computer and has the capability of searching the optimal solution at high speed. In recent years, the design idea of applying the advantages of the neural network to the guidance method has also become a research hotspot.

Disclosure of Invention

The invention aims to provide a spacecraft intelligent prediction control atmosphere entering guidance method, which solves the problem that high-precision guidance and rapid command calculation in the current guidance technology are difficult to meet simultaneously. A spacecraft atmosphere entry guidance system is designed based on a nonlinear model prediction control method capable of processing various constraints on line, a prediction model correction method based on error information estimation is provided in consideration of the influence of larger model uncertainty on the performance of an NMPC system, and the robustness of the NMPC system on model errors is enhanced by combining a variable prediction time domain strategy. Aiming at the defects of high complexity and long time consumption of an NMPC method for online optimization and solving of a guidance instruction, a Back Propagation (BP) neural network is trained offline by using sample data generated by an NMPC system. The neural network is used for replacing the process of solving the complex optimization problem and integral prediction, the control quantity meeting various constraints is quickly solved on line, the speed of instruction resolving is increased, the real-time performance of the guidance system is improved, and the influence of guidance instruction delay on guidance precision is reduced.

The invention adopts the following technical scheme for solving the technical problems:

an intelligent prediction control entry guidance method for a spacecraft specifically comprises the following steps:

the method comprises the following steps: according to the state quantity of the initial moment of the atmospheric entering section of the spacecraft and the target constraint condition of the terminal of the entering section, a reference track is generated in an optimized mode, a corresponding rolling angle change curve is obtained, the change relation of the resistance acceleration and the energy is extracted, and the relation of the change of the derivative of the resistance acceleration along with the time is obtained through deduction;

step two: establishing a spacecraft entry guidance model, and designing an NMPC guidance system by adopting a nonlinear model predictive control NMPC method;

step three: considering errors existing in the process of entering the atmosphere, setting the distribution range and the form of random errors of each state quantity, performing offline tracking on a reference track for multiple times based on the NMPC guidance system designed in the step two, and storing the optimal control quantity at each moment and each corresponding state quantity obtained by performing rolling solution by using a hybrid optimization algorithm when performing tracking each time to generate a sample data set taking the NMPC guidance system as a guidance template;

step four: taking the state quantities in the three generated sample data sets as input and the corresponding optimal control quantity as output, carrying out back propagation BP neural network training, and finishing the training of the BP neural network when the value of the loss function converges to a set error range or reaches the maximum iteration number of the training to obtain the neural network based on the NMPC guidance system;

step five: and (3) considering errors existing in the actual guidance process, adopting a neural network based on an NMPC guidance system in the fourth step to track the reference track generated in the first step on line: and inputting the state quantity obtained by real-time measurement of the spacecraft navigation system into a neural network based on an NMPC guidance system to obtain the corresponding optimal control quantity, thereby realizing intelligent guidance of the spacecraft.

Further, the step one is specifically as follows: according to the speed, longitude and latitude, height, course angle, track angle of an initial moment of an atmospheric entering section of the spacecraft and target constraint conditions of a terminal of the entering section, under the condition that path constraint including overload, dynamic pressure and heat flow rate and control quantity constraint are met, a reference track is generated in an optimized mode, a corresponding rolling angle change curve is obtained, the change relation of the resistance acceleration and the energy is extracted, and the relation of the change of the derivative of the resistance acceleration along with the time is obtained through deduction.

Further, in the second step, the kinetic equation of the spacecraft entering the guidance model is as follows:

wherein, the spacecraft state vector at the time t

r is the distance from the center of mass of the spacecraft to the target, V is the speed of the spacecraft,

is latitude, theta is longitude, gamma is track angle, psi is course angle, u (t) is control input vector at time t; y is_c(t) is the output value at time t, D (t) and

the resistance acceleration and the first derivative thereof at the time t; y is_b(t) is the value of the path constraint at time t,

is the heat flow rate at time t, q (t) is the incoming flow pressure at time t, n_LAnd (t) is the overload of the spacecraft at the time t.

Further, in the second step, an NMPC guidance system is designed by adopting a nonlinear model predictive control NMPC method, which specifically comprises the following steps:

1) at time t, a constrained nonlinear programming problem of the form:

the following constraints are satisfied:

wherein x (t) is the state of the spacecraft at time t; t is_cAnd T_pRespectively a control time domain and a prediction time domain, and satisfies T_c≤T_p；σ_minFor minimum value of allowable roll angle, σ_maxIn order to allow for a maximum value of roll angle,

to maximize the allowable roll angle rate, δ is the sampling time, T_c＝N_cδ,T_p＝N_pδ，N_cFor controlling the number of steps, N_pPredicting the step number;

for the upper limit of the allowable heat flow rate, q_maxFor an allowable upper limit of dynamic pressure, n_LmaxIs the allowable overload upper limit; the objective function J at time t is specifically defined as

And

the predicted value of the resistance acceleration and the first derivative thereof at the t moment to the tau moment is represented;

D_r(τ) and

representing the desired output at time τ, i.e. the resistive acceleration of the reference track and its first derivative, u_r(τ) with y_r(τ) a corresponding control input, namely roll angle;

control input value representing the time T to be solved at time T for any time T e [ T, T + T ∈_c]，

i-int (τ -t/δ), int (·) being a downward rounding function;

to-be-solved optimization sequence U for forming nonlinear programming problem with constraint at time t_tI.e. by

A predicted value representing the path constraint output from the time t to the time tau; q is an output error weighting matrix, and R is a control quantity weighting matrix

And

calculated by the following prediction model:

wherein

For the predicted value of the state quantity at time t to time t,

the values of r, V, θ,

predicted values of γ, ψ;

2) correcting the prediction model in 1) by adopting a prediction model correction method based on error information estimation:

first, a first-order attenuation memory filter is designed to obtain error estimation information, wherein the first-order attenuation memory filter is Z (t) -Z (t-delta) + (1-epsilon) (Z)^*(t) -Z (t- δ)), the state quantity at time t of the first-order attenuation memory filter is

Z (t-delta) represents the state quantity of the filter at the t-delta moment, Z (0) is 1, 0 & ltepsilon & lt 1 is a gain coefficient, L (t) represents the actual lift acceleration at the t moment,

and

respectively representing the predicted values of the prediction model at the time t to the resistance acceleration and the lift acceleration;

secondly, the output Z (t) of the first-order attenuation memory filter is used as a correction factor of the prediction model, and

and

make real-time corrections, i.e.

And

respectively predicting values of the corrected resistance acceleration and the corrected lift acceleration;

finally, the prediction model is modified as follows:

wherein

Correcting the predicted value of the prediction model to the state quantity at the time tau for the time t,

represents the predicted value output by the tau time correction prediction model to the tau time,

the predicted value of the path constraint output at the time t of the correction prediction model is represented;

3) and (3) adding a correction link for prediction errors at each optimization moment of the constrained nonlinear programming problem:

first, the error between the predicted output and the actual output at time t is defined:

wherein

The predicted output value of the corrected prediction model at the time t, and y (t) is the actual output value at the time t;

secondly, correcting the predicted value output by the modified prediction model at the time t to the time tau in a manner of weighting e (t):

wherein the content of the first and second substances,

for the purpose of the corrected predicted output value,

and

predicted values of the corrected resistance acceleration and its first derivative, eta ═ eta₁,η₂]Is a correction vector, and η₁＞0,η₂＞0；

4) After the correction of the prediction model and the feedback correction of the prediction error, the objective function J at the time t is redefined as:

further, in the third step, a mixed optimization algorithm combining the sequence quadratic programming SQP and the particle swarm optimization PSO algorithm is adopted to solve the nonlinear programming problem with the constraint: firstly, solving a nonlinear programming problem with constraint by using a PSO algorithm, and then searching the solving result of the PSO algorithm as an optimized initial value near the position by using an SQP algorithm so as to converge to a global optimal solution.

Compared with the prior art, the invention adopting the technical scheme has the following technical effects:

the invention designs a spacecraft atmosphere entering guidance system based on a nonlinear model prediction control method capable of processing constraint on line, takes the influence of larger model uncertainty on the system performance into consideration, improves an NMPC (non-uniform matrix computer) method, and provides a prediction model correction method and a variable prediction time domain strategy based on error information estimation. The method has the advantages that the traditional model prediction control method can process the control quantity and state quantity constraints on line, the robustness of a control system to model errors is enhanced, and the precision of track tracking is improved. By utilizing the strong self-learning capability of the neural network, the mapping relation between the real-time flight state and the guidance instruction in the NMPC guidance system is approximated, and the optimal control quantity can be rapidly calculated on line according to the current state quantity. Compared with an NMPC method, the neural network is used for replacing the process of solving a complex NLP problem and integral solving prediction output by a numerical optimization method in each guidance period, the defects of high solving complexity and long time consumption of an NMPC guidance system are overcome to a great extent, the solving time of guidance instructions is shortened, and the influence of instruction delay on guidance precision is reduced. The real-time requirement of a control system can be well met while high-precision guidance is ensured, intelligent guidance of an atmospheric entry section of the spacecraft is realized, and the method has important engineering application value.

Drawings

FIG. 1 is a roll angle profile of a reference trajectory according to the present invention.

Fig. 2 is a resistance acceleration curve of a reference trajectory according to the present invention.

FIG. 3 is a flow chart of a nonlinear model predictive control method in accordance with the present invention.

Fig. 4 is a graph of the NMPC method height variation according to the present invention.

Fig. 5 is a speed profile of the NMPC method according to the present invention.

Fig. 6 is a roll angle profile for the NMPC method of the present invention.

Fig. 7 is a resistance acceleration curve of the NMPC method according to the present invention.

Fig. 8 is a change curve of longitude and latitude of the NMPC method according to the present invention.

FIG. 9 is a flow chart of the spacecraft intelligent predictive entry guidance of the present invention.

FIG. 10 is a simulation result of the intelligent predictive control entry guidance method of the present invention.

Detailed Description

The technical scheme of the invention is further explained in detail by combining the attached drawings:

the invention provides an intelligent prediction control guidance entering method for a spacecraft, which comprises the following steps:

the method comprises the following steps: optimizing and generating a reference track according to the initial flight state and the constraint conditions of the atmospheric entering section of the spacecraft; according to the state quantity (speed, longitude and latitude, height, course angle and track angle) of the spacecraft at the initial atmospheric entering time and the target entering terminal constraint conditions, under the condition that path constraint (overload, dynamic pressure and heat flow rate) and control quantity constraint are met, a reference track is generated in an optimized mode, a corresponding rolling angle change curve is obtained, and the change relation between the resistance acceleration and the energy is extracted. The derivative of the resistive acceleration is then derived as a function of time.

In the embodiment, Mars detection is used as a background, and a spacecraft atmosphere entry section three-degree-of-freedom kinetic equation can be expressed in a nonlinear form as follows;

wherein r is the distance from the center of mass of the spacecraft to the fire center, namely the sum of the height h of the spacecraft and the average radius Re of the Mars, V is the speed of the spacecraft,

is latitude and theta is longitude. Gamma is the track angle, i.e. the angle between the speed and the local horizontal plane. ψ is the heading angle, i.e. the angle between the projection of the velocity on the horizontal plane and the east direction. σ is the roll angle, i.e. the rotation angle of the spacecraft with respect to the velocity vector. g is the acceleration of gravity.

Wherein m is_vThe mass of the spacecraft is shown, S is the aerodynamic reference area of the spacecraft, rho is the atmospheric density, D is the resistance acceleration, L is the lift acceleration, C is the lift acceleration_DIs the coefficient of aerodynamic drag, C_LIs the aerodynamic lift coefficient.

The reference track guidance method firstly reasonably plans a reference track in a given initial state, and meets path constraint, terminal target constraint and control quantity constraint.

Entering an initial state:

wherein, r (t)₀),θ(t₀),

V(t₀),ψ(t₀),γ(t₀) The state quantity of the spacecraft at the initial moment of the entering section is obtained.

And (3) path constraint:

(1) the heat flow rate constraint is:

wherein the content of the first and second substances,

as heat flow rate, coefficient of heat flow k of spacecraft_q＝1.898e-4，

The upper limit of the allowable heat flow rate.

(2) The dynamic pressure constraint is:

wherein q is the incoming flow pressure, q is_maxIs the upper limit of allowable dynamic pressure.

(3) The overload constraint is:

wherein n is_LFor spacecraft overload, n_LmaxIs the upper limit of allowable overload.

And (3) entering a segment terminal target constraint:

(1) entry segment terminal conditions, including altitude and speed limits.

h(t_f)≥h_f,V(t_f)≤V_f (7)；

Wherein, h (t)_f),V(t_f) Respectively the altitude and the speed, h, of the spacecraft at the end of the entry section_f,V_fAre respectively sailingAltitude and speed limits at the end of the antenna entry segment.

(2) Location (latitude and longitude) constraints.

Wherein the content of the first and second substances,

respectively the longitude and latitude of the end time of the spacecraft entering segment,

respectively, the longitude and latitude of the terminal target point.

And (3) controlling quantity constraint:

wherein σ_minFor minimum value of allowable roll angle, σ_maxIs the maximum allowable roll angle. Setting taking into account saturation of actuator response speed

Is the maximum allowable roll angle rate.

The specific values entering the initial state and various constraints are as follows:

entering an initial state:

and (3) path constraint:

q_max＝10kpa/m²,n_Lmax＝30；

and (3) entering a segment terminal target constraint: h is_f＝8km,V_f＝400m/s，θ_f＝-74.395°,

And (3) controlling quantity constraint: sigma_min＝10°,σ_max＝80°,

Model parameters included in the spacecraft atmosphere entry phase: mars attraction coefficient mu is 4.283 multiplied by 10¹³m³/s²The average Mars radius Re is 3396200m, and the reference area S of the spacecraft is 15.9m²Coefficient of aerodynamic lift C_L0.426, coefficient of aerodynamic drag C_D1.723 and spacecraft mass m_v2804 kg. The Mars atmospheric density model adopts an approximate exponential model, i.e.

Wherein the atmospheric density ρ of the Mars surface₀＝0.0158kg/m³Height h of equivalent density of atmosphere_s＝9354m。

Because the spacecraft is not acted by the external force exerted by people in the process of entering the atmosphere, the energy conservation law is satisfied, the energy of the spacecraft can be expressed by the following formula,

therefore, when the terminal energy position is reached, if the spacecraft can reach the terminal target height, the terminal speed of the spacecraft can also reach the requirement. Energy is therefore used as an argument in the reference trajectory planning.

The optimal reference trajectory satisfying various constraints is obtained through Gaussian pseudo-spectrum optimization, and the change curve of the roll angle is shown in figure 1. The variation of the resistance acceleration and the energy is extracted, and the curve is shown in figure 2.

The resistance acceleration is obtained by derivation of time,

the first derivative of the resistance acceleration with time is obtained from equation (11).

Step two: (1) and establishing a spacecraft entry guidance model.

The final range error of the spacecraft largely determines the guidance precision, and the range is only related to the magnitude of the resistance acceleration through the kinematic analysis. Therefore, as long as the tracking of the resistance acceleration is realized, the upper reference track can be tracked. The resistance acceleration is used as a tracking target.

Note the book

If u (t) is a control input vector and u (t) is a state vector at time t, a kinetic equation entering the guidance model is obtained according to equation (1),

wherein y is_c(t) is the output value at time t, including the resistive acceleration and its first derivative. y is_b(t) is a value of the path constraint at time t, and is expressed by equations (4) to (6).

(2) And designing a guidance system by using a nonlinear model predictive control method.

The model predictive control is essentially a model-based closed-loop optimization control strategy and comprises three parts, namely a predictive model, rolling optimization and feedback control. The method has the main idea that the future state quantity and the output value of a prediction model prediction system are utilized, and based on the thought of finite time domain online repeated optimization, a quadratic programming or nonlinear programming problem with constraints is solved online at each sampling moment, namely, the method is equivalent to solving an open-loop optimal control problem. Each solution results in a control sequence, but only the first control quantity is applied to the system. The above process is repeated until the next sampling moment, and the process is sequentially performed forward in a rolling manner, so that the closed-loop optimal control is realized, and the model predictive control is also called as rolling time domain control. In addition, the model predictive control can process the control quantity and state quantity constraint on line, and can avoid the phenomenon of roll angle control saturation when applied to the guidance entering process, and meanwhile, the constraint on the flight path is realized. A method flow diagram is shown in fig. 3.

At the current time t, there is a measured state value x (t), a constrained nonlinear programming problem of the following form is constructed, as shown in equations (13) and (14),

the following constraint conditions are satisfied,

in the above problem, x (t) is the state of the spacecraft at the current time, U_tAn optimized sequence for the solution required; t is_cAnd T_pRespectively a control time domain and a prediction time domain, and satisfies T_c≤T_p。

The objective function J at time t in equation (13) is specifically defined as,

wherein

And

control input value representing the time T to be solved at time T for any time T e [ T, T + T ∈_c]The definition is that,

where int (·) is a floor function and δ is the sampling time, satisfies T_c＝N_cδ,T_p＝N_pδ，N_cFor controlling the number of steps, N_pTo predict the number of steps. Therefore, the temperature of the molten metal is controlled,

the optimization sequence to be solved constituting the constrained nonlinear programming problem at time t, i.e. U in equation (13)_t，

D_r(τ) and

representing the desired output at time τ, i.e. the resistive acceleration of the reference track and its first derivative, u_r(τ) with y_r(τ) a corresponding control input, namely roll angle; q is an output error weighting matrix, and R is a control quantity weighting matrix. The first term of the objective function has the effect of minimizing the tracking error of the resistance acceleration, and the first derivative of the resistance acceleration is also used as a tracking variable, so that better tracking effect can be achieved. The second term of the objective function can make the optimization algorithm easier to solve the optimal control quantity by introducing the control quantity error.

In the formula (14), σ_min10 ° is the minimum allowable roll angle, σ _max80 is the maximum allowable roll angle,

is the maximum value of the allowable roll angle rate of change;

a predicted value representing a constraint of the path,

indicating the allowed path constraint maximum.

And

can be calculated by the following prediction model,

wherein x (t) is the initial condition of the prediction model;

the predicted value of the state quantity at the time t to the time tau is obtained.

Because large pneumatic coefficient and atmospheric density model errors exist in the actual atmospheric entering process, and the prediction model (18) does not consider the errors, the calculated prediction output value has large errors, and the robustness of the NMPC method can be reduced. Therefore, the invention provides a prediction model correction method based on error information estimation. Error estimation information is first obtained by designing a first order attenuated memory filter. As can be seen from equations (1), (2) and (11), the aerodynamic coefficient and the atmospheric density error affect the output of the system and the path constraint only by changing the magnitudes of the drag acceleration and the lift acceleration, and therefore the ratio of the actual drag acceleration and lift acceleration to the corresponding quantity calculated by the prediction model (18) is used as the state of the filter, that is, the state of the filter is the ratio

Where l (t) denotes the actual lift acceleration at time t,

and

respectively represents the predicted values of the prediction model (18) at the time t on the resistance acceleration and the lift acceleration, Z^*(t) is the state quantity at time t of the filter. The first order decay memory filter is shown as equation (20) below,

Z(t)＝Z(t-δ)+(1-ε)(Z^*(t)-Z(t-δ)) (20)；

wherein Z (t-delta) represents the state quantity of the filter at the time of t-delta, and the gain coefficient is 0 < epsilon < 1. In order to reduce the influence of model uncertainty, the gain factor may be appropriately set to a larger value to enhance the correction effect of the past state quantity of the filter on the current output value z (t), where ∈ is 0.9. The initial filter value Z (0) is taken to be 1. The output Z (t) of the filter is used as a correction factor of the prediction model (18) to correct the values of the resistance acceleration and the lift acceleration in real time, namely

Wherein

And

the corrected values of the resistance acceleration and the lift acceleration are respectively. The modified prediction model (18) may be represented as a modified prediction model (22),

wherein

represents the predicted value output by the modified prediction model at the time t to the time tau,

and the predicted value of the path constraint output at the time t by the correction prediction model at the time t is shown.

In order to further improve the capability of the control system to overcome uncertainty, a correction link for prediction errors is added at each optimization moment. Defining the error between the predicted output and the actual output at time t,

wherein

The predicted output value of the modified prediction model at time t, and y (t) the actual output value at time t. The predicted value output by the modified prediction model at the time t to the time tau is corrected by weighting e (t),

wherein the content of the first and second substances,

for the purpose of the corrected predicted output value,

and

predicted values of the corrected resistance acceleration and its first derivative, eta ═ eta₁,η₂]Is a correction vector, and η₁＞0,η₂＞0。

After the correction of the prediction model and the feedback correction of the prediction error, the objective function (15) at time t needs to be redefined as,

predicting time domain T when designing model predictive control systems_pOnce determined, is generally fixed and constant throughout the control process, while the parameter T_pThe impact on the control system performance is large. If T_pToo large, it will increase the computational burden; if T_pToo small, it will make the system less robust.

The analysis of the guidance problem of the atmospheric air entering shows that the density of the atmospheric air entering the initial section is very small, and the speed of the spacecraft at the final section is relatively low. As can be seen from the formula (2), the resistance acceleration in the two stages is small, and uncertain factors such as model errors have little influence on the tracking effect of the resistance acceleration, so that the T entering the initial stage and the end stage_pSelecting a smaller value, and reducing the calculation amount for predicting the output; the flight state of the spacecraft in the middle section changes rapidly, the resistance acceleration is large, so that the change of the tracking error is severe, and a large T is selected_pTo improve the robustness of the system. The invention provides a variable prediction time domain strategy, which takes the magnitude of the error change rate between the prediction output and the expected output as the judgment basis, reasonably selects the magnitudes of the prediction time domains at different flight phases and improves the performance of the NMPC guidance system.

The error between the predicted output and the desired output at time t is first defined,

the rate of change of the calculated error is as follows,

wherein k is₁(t) and k₂(t) the rate of change of the error in the resistive acceleration and its first derivative, respectively. The predicted time domain T at time T_pI.e. the predicted number of steps N_p(t) is determined by the following formula,

wherein λ₁And λ₂Predict the number of steps N for an appropriate rate of change threshold₁<N₂。

Finally, the invention adopts a hybrid algorithm combining the sequential quadratic programming and the particle swarm optimization algorithm to process the nonlinear programming problem with constraints formed by the formulas (13) and (14). The PSO algorithm has strong global optimization capability in the initial iteration stage, can quickly converge to a position close to a global optimal solution in the constraint range of the roll angle control quantity in the formula (14), and then the SQP algorithm takes the position as an initial value of optimization and searches nearby the position so as to converge to the global optimal solution. The hybrid optimization algorithm avoids the problem that the SQP algorithm is easy to generate a local optimal solution to a great extent, and the optimal solution can be found more quickly and accurately.

Recording optimized solution vectors obtained by solving NLP problem through hybrid optimization algorithm as

Only the first optimal solution of the obtained optimized sequence is acted on the actual system, namely the roll angle control quantity at the time t is,

and at the next moment, predicting future output according to the actual flight state of the spacecraft, repeatedly solving the NLP problem, optimizing forward in a rolling mode, and realizing closed-loop optimal control.

The distribution range and form of various random errors shown in table 1 are set, including aerodynamic coefficient, atmospheric density error, and initial state error. Under the condition that the random error exists, a designed guidance system based on an NMPC method is used for tracking the reference track generated in the step one, and values of all parameters of a control system are shown in a table 2. The results of the single simulation are shown in fig. 4-8. Fig. 4 is a height variation curve, fig. 5 is a speed variation curve, fig. 6 is a roll angle variation curve, and fig. 7 is a resistance acceleration variation curve. Fig. 8 is a longitude and latitude variation curve. According to the result, the NMPC guidance system can well track the reference track, and the guidance precision can meet the requirement.

TABLE 1 simulation error term parameters

Error of the measurement	Error value	Type (B)
			Coefficient of aerodynamics C_L,C_D	±30％	Gaussian distribution
Atmospheric density ρ	±30％	Is uniformly distributed
			Height (Km)	±0.2	Gaussian distribution
Longitude (deg)	±0.02	Gaussian distribution
			Latitude (deg)	±0.02	Gaussian distribution
Speed (m/s)	±2	Gaussian distribution
			Course angle (deg)	±0.05	Gaussian distribution
Track angle (deg)	±0.03	Gaussian distribution

TABLE 2 control of system parameters

Parameter(s)	Numerical value
		N_c	3
Q	[100,0；0,500]
		R	10
δ	0.01
		η	[1,1]
[N₁,N₂]	[5,10]
		[λ₁,λ₂]	[0.05,0.01]

Although the designed NMPC guidance system can realize high-precision guidance, the design method still has some defects: solving a complex NLP problem in each guidance period, and consuming a long time to obtain an optimal solution in an iteration mode; because numerical integration is performed on the kinetic equation when the prediction output is calculated by using the prediction model, a large calculation load is brought when the prediction time domain is large. In order to overcome the defects, an intelligent predictive control entry guidance method is provided by combining the respective advantages of an NMPC method and a neural network.

The flow chart for realizing the intelligent prediction of the spacecraft to enter the guidance is shown in fig. 9, and can be summarized into sample data generation, network off-line training and on-line intelligent guidance, namely steps three to five.

Step three: and generating sample data by taking the NMPC guidance system designed in the step two as a guidance template. The process of data generation includes the steps of,

(1) setting a nominal initial condition I ═ x₀,C_L,C_D,ρ]^TIncluding an initial state x₀(altitude, speed, latitude and longitude, track angle and course angle), aerodynamic coefficient C_LAnd C_DAnd the atmospheric density ρ.

(2) Introducing an error term p_l＝[Δx₀,ΔC_L,ΔC_D,Δρ]^TThe distribution range and form of each error are shown in the table1, where l is 1, …, and m is the number of tracking tracks. The actual initial condition of the system is d_l＝I+p_l。

(3) With d_lAs an actual initial condition, the reference trajectory is tracked using an NMPC guidance system. According to the state quantity of the current time

And (4) optimizing and solving the corresponding NLP problem to obtain an optimal control quantity u, forming a pair of arrays (x, u) and storing. Then rolling forward to the next optimization moment, and repeating the process until the actual condition d is completed_lAnd entering a guidance process under the condition, wherein all the obtained array pairs are the sample data generated by the tracking.

(4) Generating a new error term p_l+1Obtaining corresponding actual initial conditions d_l+1And repeating the generation process of the sample data in the step (3) until l is m. Finally, obtaining a sample data set for off-line training of the neural network

s-1, …, mi, where i is the number of times the optimization problem is rolled out each time the reference trajectory is tracked. To ensure training accuracy, m is 1000, so the data set Q contains approximately 2 × 10⁷And (4) array pairs of state quantity-optimal control quantity.

Step four: and training a neural network algorithm by using the data set to obtain the neural network based on the NMPC guidance system.

The BP neural network is a multilayer forward feedback neural network trained according to an error back propagation algorithm, has strong self-learning capability, can fully mine the relation between data, approaches a nonlinear complex system to the maximum extent, and is one of the most widely applied neural network models at present. Thus, embodiments of the present invention use the flight state quantities in data set Q

As the input of BP neural network, the roll angle corresponding to the current state quantity

And as the output of the BP neural network, training and testing the neural network to approximately obtain the relationship between the state quantity of the spacecraft and the corresponding roll angle. Randomly partitioning a data set Q into training sets Q₁And test set Q₂The ratios are set at 95% and 5%.

The BP neural network is composed of an input layer, an output layer and one or more hidden layers. As shown in fig. 9, the present embodiment adopts a four-layer neural network structure, that is, includes two hidden layers, which can improve the training precision of the BP neural network.

An input layer:

representing an input matrix in which each input vector

s is 1, …, mi, which contains six flight state variables.

Hidden layer: and determining the appropriate number of the hidden layer nodes through experience and repeated experiments, wherein the number of the first hidden layer node is 6, and the number of the second hidden layer node is 4. The transfer function is selected from a hyperbolic tangent sigmoid function tansig with a faster convergence rate and a wider output range.

An output layer:

the output matrix is represented by a matrix of values,

representing a desired output matrix, wherein each output variable is

s is 1, …, mi. The output layer transfer function is a linear transfer function purelin.

Before training, the training set Q is firstly compared₁Input matrix and output matrix of (1)Normalization processing is performed to normalize the value to [ -1, 1]And the training efficiency of the network is improved.

Supervised learning of the neural network is a process of adjusting weights w and thresholds B between nodes of each layer to reduce network output errors.

And

representing the weight and threshold from the input layer to the first hidden layer;

and

representing weights and thresholds between two hidden layers;

and

representing the weight and threshold from the hidden layer to the output layer. Input vector

The state quantities are summed according to the corresponding weight and threshold, the output of each neuron is obtained through the calculation of a transfer function, and the information is transferred to an output layer through two hidden layers in sequence

The output values of the hidden layer and the output layer are shown as follows,

wherein f is_1aAnd f_2bRepresenting the output values of the first and second hidden layers respectively,

representing the output value of the output layer.

To prevent overfitting, improve the accuracy of the network approximation, estimate the loss function of the network performance using the mean square error,

wherein C represents the size of the error,

which represents the input value of the sample,

represents the sample output value and mi represents the total number of samples. And (4) reversely transmitting the obtained network output error C, and continuously adjusting the weight and the threshold between nodes of each layer so as to reduce the output error.

The network training algorithm adopts Levenberg-Marquardt algorithm. Compared with the traditional BP neural network training algorithm, the L-M algorithm can achieve the advantages of combining the Gauss-Newton algorithm and the gradient descent method by adaptively adjusting the damping factor, and the training speed of the neural network is accelerated.

Updating the weight value and the threshold value of each neuron by adopting a momentum gradient descent algorithm, wherein the formula is as follows,

wherein

Is the current loss function pair weight w_nPartial derivatives (gradients) of (a), using exponentially weighted averages of the preceding

In connection with this, a momentum gradient is obtained

The updating formula of the threshold is the same as the weight. Beta is a momentum factor, which affects the exponentially weighted average, usually 0.9; α is the learning rate, i.e., the magnitude of each parameter update, affecting the convergence rate, here taken to be 0.05.

Based on training set Q₁Training the determined BP network structure, wherein the target error of the network training is set to be 1 x 10^-7The maximum number of iterations is set to 5000. And finishing the training of the BP neural network when the value of the loss function is converged to a set error range or reaches the maximum iteration number of the training. Using test set Q₂The data in the step (1) tests the network performance, and the neural network with better training effect and the corresponding weight and threshold are finally obtained through multiple experiments.

Step five: and (4) taking the error of the actual guidance process into consideration, replacing the NMPC guidance system with the BP neural network trained in the fourth step, and carrying out online tracking on the reference track generated in the first step.

When the spacecraft enters the guidance closed loop optimal control, the problem of solving the nonlinear programming with the constraint and the integral prediction process are not required to be solved repeatedly in each guidance period, and only the state quantity of the spacecraft at the current t moment obtained by real-time measurement of the navigation system is required

Inputting the data into an offline trained BP neural network, performing weighting operation according to a predetermined weight and a predetermined threshold value, and calculating and outputting an optimal roll angle control quantity U required by guidance entering at the current time t_t＝σ_tThe value of (c). And repeating the process of generating the control quantity at the previous moment to the next moment, so that the guidance instruction can be quickly solved on line, and the intelligent guidance of the spacecraft is realized.

Assuming that random errors in the actual guidance process are shown in table 1, 100 monte carlo targeting simulations were performed using the trained neural network, and the results are shown in fig. 10. Statistical analysis is performed on the actual terminal position and the distance from the target point position, and the calculation result is shown in table 3. Therefore, the guidance precision of the method designed by the invention can well meet the requirement on the drop point precision. In addition, compared with the average consumed time of about 60ms for solving the optimization problem each time by the NMPC guidance system, the trained neural network can generate the guidance instruction only by carrying out simple weighted operation, so that the instruction calculation average time at each moment is only 9ms (the simulation operation environment is Matlab R2019a programming language under a Windows 10 operating system, and the processor is Intel Core i7-9750H 2.60Ghz), the problem that the solution time of the NMPC method is long is solved, and guidance real-time performance is guaranteed.

TABLE 3 simulation results

Mean distance error	Probability of 5 km	3 km probability	Probability of 2 km
				1.59km	99％	89％	73％

The spacecraft intelligent predictive control entry guidance method firstly designs a non-linear model predictive control guidance system for spacecraft atmospheric entry according to a proposed prediction model correction method based on error information estimation and a variable prediction time domain strategy, and solves a corresponding NLP problem in a rolling mode by using a hybrid optimization algorithm to obtain an optimal control quantity meeting control constraints and path constraints. And then, by combining the strong self-learning capability of the neural network, based on sample data generated by the NMPC guidance template, the mapping relation between the real-time flight state and the guidance instruction is approximated by adopting the multilayer BP neural network, so that the speed of solving the guidance instruction on line is increased, and the intelligent and accurate guidance of the spacecraft atmosphere entering section can be realized.

The above description is only an embodiment of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can understand that the modifications or substitutions within the technical scope of the present invention are included in the scope of the present invention, and therefore, the scope of the present invention should be subject to the protection scope of the claims.

Claims

1. The spacecraft intelligent prediction control entry guidance method is characterized by comprising the following steps:

the kinetic equation of the spacecraft entering the guidance model is as follows:

wherein, the spacecraft state vector at the time t

the latitude is, theta is longitude, gamma is track angle, psi is heading angle, u (t) is control input at the time t, namely roll angle sigma (t); y is_c(t) is the output value at time t, D (t) and

is the heat flow rate at time t, q (t) is the incoming flow pressure at time t, n_L(t) spacecraft overload at time t;

the NMPC guidance system is designed by adopting a nonlinear model predictive control NMPC method, and the method specifically comprises the following steps:

1) at time t, a constrained nonlinear programming problem of the form:

the following constraints are satisfied:

to maximize the allowable roll angle rate, δ is the sampling time, T_c＝N_cδ，T_p＝N_pδ，N_cFor controlling the number of steps, N_pPredicting the step number;

And

D_r(τ) and

i-int (τ -t/δ), int (·) being a downward rounding function;

A predicted value representing the path constraint output from the time t to the time tau; q is an output error weighting matrix, and R is a control quantity weighting matrix;

and

calculated by the following prediction model:

wherein

For the predicted value of the state quantity at time t to time t,

the values of r, V, θ,

predicted values of γ, ψ;

and

and

make real-time corrections, i.e.

And

finally, the prediction model is modified as follows:

wherein

wherein

At time tCorrecting the predicted output value of the prediction model, wherein y (t) is the actual output value at the time t;

wherein the content of the first and second substances,

for the purpose of the corrected predicted output value,

and

predicted values of the corrected resistance acceleration and its first derivative, eta ═ eta₁，η₂]Is a correction vector, and η₁＞0，η₂＞0；

step four: taking the state quantity of the sample data set generated in the third step as input, taking the corresponding optimal control quantity as output, carrying out back propagation BP neural network training, finishing the training of the BP neural network when the value of the loss function converges to a set error range or reaches the maximum iteration number of the training, and obtaining the neural network based on the NMPC guidance system;

2. The spacecraft intelligent predictive control entry guidance method of claim 1, wherein step one is specifically: according to the speed, longitude and latitude, height, course angle, track angle of an initial moment of an atmospheric entering section of the spacecraft and target constraint conditions of a terminal of the entering section, under the condition that path constraint including overload, dynamic pressure and heat flow rate and control quantity constraint are met, a reference track is generated in an optimized mode, a corresponding rolling angle change curve is obtained, the change relation of the resistance acceleration and the energy is extracted, and the relation of the change of the derivative of the resistance acceleration along with the time is obtained through deduction.

3. The spacecraft intelligent predictive control entry guidance method of claim 1, wherein in step three, a hybrid optimization algorithm combining a Sequence Quadratic Programming (SQP) algorithm and a Particle Swarm Optimization (PSO) algorithm is adopted to solve the nonlinear programming problem with constraints: firstly, solving a nonlinear programming problem with constraint by using a PSO algorithm, and then searching the solving result of the PSO algorithm as an optimized initial value near the position by using an SQP algorithm so as to converge to a global optimal solution.