JP2022181937A

JP2022181937A - Information processing device

Info

Publication number: JP2022181937A
Application number: JP2021089186A
Authority: JP
Inventors: 修一矢作; Shuichi Yahagi
Original assignee: Isuzu Motors Ltd
Current assignee: Isuzu Motors Ltd
Priority date: 2021-05-27
Filing date: 2021-05-27
Publication date: 2022-12-08

Abstract

To provide an information processing device that allows a control parameter to change according to the state of a controlled object. SOLUTION: In a feedback control system, an information processing device that calculates a control parameter of a controller includes: a lookup table in which one or a plurality of states of a controlled object or external input are set as axes and a plurality of parameter values used for calculating the control parameter are arranged in a space defined by the axes; a state acquisition unit that obtains the state of the controlled object; a parameter generation unit that generates a control parameter based on the plurality of parameter values by referring to the lookup table, based on the state obtained by the state acquisition unit; and a table change unit that changes the plurality of parameter values arranged in the lookup table, based on a result of Ridge regression for an evaluation function of errors between an output of the controlled object and an output of a reference model. SELECTED DRAWING: Figure 7

Description

本開示は、情報処理装置に関する。 The present disclosure relates to an information processing device.

産業システムの閉ループ制御の大半は直感的に理解できるＰＩＤ制御が用いられている。線形性の強い制御対象であれば所望の制御性能が得られるが、非線形システムの場合、ゲインが固定されたＰＩＤ制御で十分な制御性能を得ることは難しい。非線形制御理論やモデルベース制御の適用も考えられるが、コントローラ性能に制約があることや理論が複雑かつ計算負荷が大きいため非線形制御適用のハードルが高い。また，産業システムは複雑であり精度の高いモデルを得ることが難しい場合が多く、モデルベース制御の効果が十分に発揮できない場合がある。このような背景から、産業界ではルックアップテーブル（Look-up table:以下「ＬＵＴ」と記載する)を用いたゲインスケジュールＰＩＤ制御がよく用いられる（例えば、特許文献１を参照）。 The majority of closed-loop controls in industrial systems use intuitive PID control. Desired control performance can be obtained if the controlled object is strongly linear, but in the case of a nonlinear system, it is difficult to obtain sufficient control performance with PID control with a fixed gain. Nonlinear control theory and model-based control can be applied, but the hurdles for applying nonlinear control are high due to the constraints on controller performance, the complexity of the theory, and the large computational load. In addition, industrial systems are often complicated and it is often difficult to obtain highly accurate models, and the effect of model-based control may not be fully exhibited. Against this background, gain schedule PID control using a look-up table (hereinafter referred to as "LUT") is often used in the industrial world (see Patent Document 1, for example).

ゲインスケジュール制御は制御対象の状態に応じて制御器パラメータを変更し、所望の制御性能を実現する手法である。このような考え方は直観的に理解しやすく産業界では受け入れやすい。その反面，所望の制御性能を得るために大量の制御パラメータを事前に調整する必要がある。固定ＰＩＤ制御であれば調整パラメータは比例ゲイン、積分ゲイン、微分ゲインの３つであるが、ＬＵＴを用いたゲインスケジュール制御の場合、はるかに大きな数のパラメータを調整しなくてはならず、パラメータ調整に多くの時間を要している。近年，システム同定を介することなく制御対象モデルを用いない制御系設計法が注目されている．オフラインで取得したデータから最適制御パラメータを求める手法として、ＶＲＦＴ（Virtual Reference Feedback Tuning）（例えば、特許文献２を参照）やＦＲＩＴ（Fictitious Reference Feedback Tuning）が提案されている。 Gain schedule control is a method of changing controller parameters according to the state of a controlled object to achieve desired control performance. This approach is intuitive and readily accepted in industry. On the other hand, a large number of control parameters must be adjusted in advance to obtain the desired control performance. Fixed PID control has only three adjustment parameters (proportional gain, integral gain, and derivative gain), whereas gain schedule control using an LUT requires a far larger number of parameters to be adjusted, so parameter adjustment takes a great deal of time. In recent years, control system design methods that bypass system identification and do not use a model of the controlled object have attracted attention. VRFT (Virtual Reference Feedback Tuning) (see, for example, Patent Document 2) and FRIT (Fictitious Reference Feedback Tuning) have been proposed as techniques for obtaining optimal control parameters from data acquired offline.

ＶＲＦＴやＦＲＩＴといったデータ駆動制御は、繰り返し実験を行うことなく一組の入出力データからオフラインで制御器パラメータが得られるという点で注目されている。以上のような制御対象のモデルを用いない制御手法は、プロセス系や自動車システム等の産業システム，振動制御問題への適用も進められている。これまで、非線形システムに対するデータ駆動制御手法の検討もされているが、ＬＵＴパラメータの自動調整法に対しての検討はされていない。 Data-driven controls such as VRFT and FRIT are of interest in that controller parameters can be obtained off-line from a set of input-output data without repeated experiments. Control methods that do not use a model of the controlled object as described above are being applied to process systems, industrial systems such as automobile systems, and vibration control problems. So far, data-driven control methods for nonlinear systems have been studied, but no studies have been made for automatic adjustment of LUT parameters.

特開２０１２－１１３６７６号公報 Patent Document 1: JP 2012-113676 A
特開２０２１－４３５７３号公報 Patent Document 2: JP 2021-43573 A

ところで、固定ＰＩＤゲインを求める一般的なＶＲＦＴでは、制御対象の状態に応じて制御パラメータが変化しないため、ＰＩＤゲインが目標応答に十分に追従できないという問題もある。 By the way, in a general VRFT that obtains a fixed PID gain, since the control parameters do not change according to the state of the controlled object, there is also the problem that the PID gain cannot sufficiently follow the target response.

本開示はこれらの点に鑑みてなされたものであり、制御対象の状態に応じて制御パラメータを変化させることが可能な情報処理装置を提供することを目的とする。 The present disclosure has been made in view of these points, and aims to provide an information processing apparatus capable of changing control parameters according to the state of a controlled object.

上記の目的を達成するため、本開示における情報処理装置は、制御器と、前記制御器の出力を入力とする制御対象と、前記制御対象に入力する入力信号を入力として前記制御対象の出力をモデル化する参照モデルとを備え、前記制御対象の出力が前記制御器の入力にフィードバックされる制御システムにおいて、前記制御器の制御パラメータを算出する情報処理装置であって、前記制御対象又は外部入力に関する１又は複数の状態を軸とし、前記軸によって定義される空間に前記制御パラメータを算出するために用いられる複数のパラメータ値が配置されたルックアップテーブルと、前記制御対象の状態を取得する状態取得部と、前記状態取得部が取得した状態に基づいて、前記ルックアップテーブルを参照して前記複数のパラメータ値に基づいて前記制御パラメータを生成するパラメータ生成部と、前記制御対象の出力と、前記参照モデルの出力との誤差に関する評価関数についてのＲｉｄｇｅ回帰の結果に基づいて、前記ルックアップテーブルに配置された前記複数のパラメータ値を変更するテーブル変更部と、を備える。 In order to achieve the above object, an information processing device according to the present disclosure calculates control parameters of a controller in a control system that includes the controller, a controlled object that receives the output of the controller as its input, and a reference model that models the output of the controlled object using, as its input, the input signal to the controlled object, and in which the output of the controlled object is fed back to the input of the controller. The information processing device includes: a lookup table in which one or more states relating to the controlled object or an external input serve as axes, and in which a plurality of parameter values used for calculating the control parameters are arranged in the space defined by the axes; a state acquisition unit that acquires the state of the controlled object; a parameter generation unit that, based on the state acquired by the state acquisition unit, refers to the lookup table and generates the control parameters from the plurality of parameter values; and a table changing unit that changes the plurality of parameter values arranged in the lookup table based on the result of Ridge regression on an evaluation function relating to the error between the output of the controlled object and the output of the reference model.

前記パラメータ生成部は、前記複数のパラメータ値のそれぞれに、前記複数のパラメータ値のそれぞれに対応する重みを乗じて生成した複数の乗算値を加算することにより前記制御パラメータを生成してもよい。 The parameter generator may generate the control parameter by adding a plurality of multiplied values generated by multiplying each of the plurality of parameter values by a weight corresponding to each of the plurality of parameter values.

前記パラメータ生成部は、前記状態取得部が取得した前記制御対象の状態と前記複数のパラメータ値が対応する状態との距離が近ければ近いほど大きな値の前記重みを前記複数のパラメータ値に乗じてもよい。 The parameter generation unit may multiply each of the plurality of parameter values by a weight that becomes larger as the distance between the state of the controlled object acquired by the state acquisition unit and the state to which that parameter value corresponds becomes smaller.

前記テーブル変更部は、前記評価関数の評価値が所定の範囲から外れた場合に、前記評価値が前記範囲に含まれるように前記制御パラメータを変更してもよい。 When the evaluation value of the evaluation function is out of a predetermined range, the table changing section may change the control parameter so that the evaluation value is included in the range.

前記制御対象の出力と前記参照モデルの出力との誤差は、前記ルックアップテーブルに配置された前記複数のパラメータ値に基づく前記制御パラメータを要素とするパラメータベクトルに対して線形となるように定義されており、前記テーブル変更部は、前記誤差が小さくなるように前記パラメータベクトルを変更することにより、前記制御パラメータを変更してもよい。 An error between the output of the controlled object and the output of the reference model is defined to be linear with respect to a parameter vector whose elements are the control parameters based on the plurality of parameter values arranged in the lookup table. and the table changing unit may change the control parameter by changing the parameter vector so as to reduce the error.

前記評価関数は、（１）前記空間における前記複数のパラメータ値に対応する位置座標、前記参照モデル、及び前記制御対象の出力から算出されるベクトルと前記パラメータベクトルとの内積と、（２）前記参照モデル及び前記制御対象への入力信号の積と、の差の二乗で定義されており、前記テーブル変更部は、Ｒｉｄｇｅ回帰を用いて前記評価関数の評価値が小さくなるように前記パラメータベクトルを変更してもよい。 The evaluation function may be defined as the square of the difference between (1) the inner product of the parameter vector and a vector calculated from the position coordinates corresponding to the plurality of parameter values in the space, the reference model, and the output of the controlled object, and (2) the product of the reference model and the input signal to the controlled object. The table changing unit may use Ridge regression to change the parameter vector so that the evaluation value of the evaluation function becomes smaller.

本開示によれば、制御対象の状態に応じて制御パラメータを変化させることができる。 According to the present disclosure, it is possible to change the control parameter according to the state of the controlled object.

図１は、参照モデルを備える閉ループ系を模式的に示す図である。FIG. 1 schematically shows a closed-loop system with a reference model. 図２は、ゲインスケジュールＰＩＤ制御のシステム構成を模式的に示す図である。FIG. 2 is a diagram schematically showing the system configuration of gain schedule PID control. 図３は、二次元で構成されるＬＵＴを示す図である。FIG. 3 is a diagram showing a two-dimensional LUT. 図４は、ゲインスケジューラの構成を示す図である。FIG. 4 is a diagram showing the configuration of the gain scheduler. 図５は、ゲインスケジュール制御によるモデル参照制御のブロック線図である。FIG. 5 is a block diagram of model reference control by gain schedule control. 図６は、実施の形態に係る閉ループ系を模式的に示す図である。FIG. 6 is a diagram schematically showing a closed loop system according to the embodiment. 図７は、実施の形態に係る情報処理装置の機能構成を模式的に示す図である。FIG. 7 is a schematic diagram of the functional configuration of the information processing apparatus according to the embodiment. 図８は、実施の形態に係る情報処理装置が実行する情報処理の流れを説明するためのフローチャートである。FIG. 8 is a flowchart for explaining the flow of information processing executed by the information processing apparatus according to the embodiment. 図９は、取得した入出力データから固定ＰＩＤゲインを求めたときの時系列データを示す図である。FIG. 9 is a diagram showing time-series data when fixed PID gains are obtained from acquired input/output data. 図１０は、本開示の手法でＰＩＤゲインを求めたときの時系列データを示す図である。FIG. 10 is a diagram showing time-series data when PID gains are obtained by the method of the present disclosure. 図１１Ａは、制御対象の状態１および状態２のそれぞれに応じて変化する比例ゲインを示す図である。FIG. 11A is a diagram showing a proportional gain that changes according to state 1 and state 2 of the controlled object. 図１１Ｂは、制御対象の状態１および状態２のそれぞれに応じて変化する積分ゲインを示す図である。FIG. 11B is a diagram showing integral gains that change depending on states 1 and 2 of the controlled object. 図１１Ｃは、制御対象の状態１および状態２のそれぞれに応じて変化する微分ゲインを示す図である。FIG. 11C is a diagram showing differential gains that change according to state 1 and state 2 of the controlled object.

＜実施の形態の前提となる技術＞
本開示の実施の形態を説明する前に、まず、本開示の実施の形態が前提とする前提技術について説明する。

<Technology underlying the embodiment>
Before describing the embodiments of the present disclosure, the underlying technology on which they are based is described first.

［緒言］
現在、産業界において、閉ループ制御の大半はＰＩＤ制御が用いられている。これは、ＰＩＤ制御は直感的に理解でき、計算負荷が小さいからである。一方で、産業界における閉ループ制御の制御対象のほとんどは非線形システムである。非線形制御理論を適用することも考えられるものの、コントローラ性能に制約があることや理論が複雑かつ計算負荷が大きく、適用のハードルが高いため用いられることは少ないのが現状である。このため、産業界では、非線形システムの制御は、計算負荷が小さく理解が容易なゲインスケジュール制御で対応することが多い。 [Introduction]
Currently, PID control is used for most closed loop controls in the industrial world. This is because PID control is intuitive and has a low computational load. On the other hand, most of the controlled objects of closed-loop control in the industrial world are nonlinear systems. Although it is possible to apply nonlinear control theory, it is rarely used because of limitations on controller performance, the theory is complicated, the calculation load is large, and the hurdles to application are high. For this reason, in the industrial world, control of nonlinear systems is often handled by gain schedule control, which has a small computational load and is easy to understand.

ＬＵＴを用いたゲインスケジュール制御は、あらかじめ制御対象の状態と制御器の制御パラメータの生成に用いられるパラメータ値とを紐づけて格納するＬＵＴを参照することにより制御対象の状態に応じて制御器の制御パラメータを変更し、所望の制御性能を実現する手法である。このため、ＬＵＴを用いたゲインスケジュール制御はマップベース制御と呼ばれることもある。 Gain schedule control using an LUT refers to an LUT that stores in advance the state of the controlled object and the parameter values used to generate the control parameters of the controller in association with each other. This is a method of changing control parameters to achieve desired control performance. For this reason, gain schedule control using an LUT is sometimes called map-based control.

ゲインスケジュール制御を実施するには、所望の制御性能が得られるようにＬＵＴを事前に調整する必要がある。通常のＰＩＤ制御であれば調整パラメータは比例ゲイン、積分ゲイン、及び微分ゲインの３つである。一方、ゲインスケジュール制御はＬＵＴを用意する必要があるため、通常のＰＩＤ制御と比べてはるかに大きな数のパラメータを調整する必要がある。したがって、ゲインスケジュール制御のパラメータ調整には多くの時間を要する。さらに、ゲインスケジュール制御では、経年劣化等の制御対象の特性変動に対応できていないという問題もある。 To implement gain schedule control, it is necessary to adjust the LUT in advance so as to obtain desired control performance. For normal PID control, there are three adjustment parameters: proportional gain, integral gain, and derivative gain. On the other hand, since gain schedule control requires LUT preparation, it is necessary to adjust a much larger number of parameters than normal PID control. Therefore, it takes a lot of time to adjust parameters for gain schedule control. Furthermore, the gain schedule control also has a problem that it cannot cope with characteristic fluctuations of the controlled object such as deterioration over time.

そこで、本開示では、ゲインスケジュールＰＩＤ制御のスケジューリング関数（スケジューラ）であるＬＵＴパラメータを自動調整する方法を提案する。以下に、スケジューリング関数をＬＵＴで表す。次に、ゲインスケジュールＰＩＤ制御器を定義する。次に、ゲインスケジュールＰＩＤ制御器を対象としたＶＲＦＴの評価関数を導出する。次に、評価関数が最小となるようにＲｉｄｇｅ回帰により最適制御パラメータを求める。これにより、試行錯誤的なパラメータ調整や、システム同定が不要となる。 Therefore, the present disclosure proposes a method of automatically adjusting LUT parameters, which are a scheduling function (scheduler) of gain schedule PID control. In the following, the scheduling functions are represented by LUTs. Next, we define a gain-scheduled PID controller. Next, a VRFT evaluation function for a gain-scheduled PID controller is derived. Next, optimum control parameters are obtained by Ridge regression so that the evaluation function is minimized. This eliminates the need for trial-and-error parameter adjustments and system identification.

以下の説明においては、先ず、準備としてＶＲＦＴとゲインスケジュールＰＩＤ制御について説明する。次に、問題設定について説明する。次に、ＶＲＦＴを利用したゲインスケジュール制御パラメータの自動調整について説明する。次に、シミュレーション検証について説明する。 In the following description, VRFT and gain schedule PID control will be described first as a preparation. Next, problem setting will be explained. Next, automatic adjustment of gain schedule control parameters using VRFT will be described. Next, simulation verification will be described.

［ＶＲＦＴ］
ＶＲＦＴは、開ループの入出力データからシステム同定を介さず、直接的に、制御パラメータを求める手法である。最適制御パラメータは参照モデルと閉ループ系が同じ特性となるように調整される。図１にＶＲＦＴの構造について示す。図１は、参照モデルを備える閉ループ系を模式的に示す図である。Ｃは制御器、Ｍは参照モデル、Ｐは制御対象（プラント）であり、ｕは入力、ｙは出力である。また、ρは制御器パラメータ、ｚはシフトオペレータである。ｒバー（ｔ）はＶＲＦＴで提案されている仮想参照入力、ｅバー（ｔ）は、ＶＲＦＴで提案されている仮想誤差である。 [VRFT]
VRFT is a technique for obtaining control parameters directly from open-loop input/output data without system identification. The control parameters are tuned so that the closed-loop system has the same characteristics as the reference model. FIG. 1 shows the structure of VRFT and schematically shows a closed-loop system with a reference model. C is the controller, M is the reference model, P is the controlled object (plant), u is the input, and y is the output. ρ is the controller parameter and z is the shift operator. r̄(t) (r-bar) is the virtual reference input proposed in VRFT, and ē(t) (e-bar) is the virtual error proposed in VRFT.

ＶＲＦＴの手順について簡単に説明する。
ステップ１：希望する閉ループの参照モデルをＭｄとする。また、制御対象の入出力データをｕ（ｔ），ｙ（ｔ），ｔ＝１，．．．，Ｎとする。
ステップ２：ｙ（ｔ）を参照モデルの出力とみなすと、ｙ（ｔ）を発生する仮想の参照入力は次式のように決められる。なお、参照入力を「参照信号」又は「疑似参照信号」ともいう。

ステップ３：この参照信号ｒバー（ｔ）を図１の閉ループの参照入力と考える。このとき制御器で作られる操作量は、これを仮想操作量とすると、次式になる。

ステップ４：この仮想操作量ｕバー（ｔ）と操作量ｕ（ｔ）のデータが近づけば、コントローラをもつ閉ループは参照モデルに近いとみなせる。すなわち、最小化する評価関数は次式になる。

式（１），（２）により、上式は次のようになる。

ステップ５：プレフィルタＬの導入
式（４）の項には、参照モデルＭｄの逆行列がありノンプロパーである。式（３４）に示すプレフィルタの追加によりノンプロパーを回避する。このプレフィルタを式（４）に追加すると式（５）が得られる。

ここで、

The VRFT procedure will be briefly described.
Step 1: Let the desired closed-loop reference model be Md. Input/output data to be controlled are u(t), y(t), t=1, . . . , N.
Step 2: Considering y(t) as the output of the reference model, the hypothetical reference input that generates y(t) is determined as follows. Note that the reference input is also called a "reference signal" or a "pseudo reference signal".

Step 3: Regard this reference signal r̄(t) as the reference input of the closed loop in FIG. 1. The manipulated variable produced by the controller in this case, called the virtual manipulated variable, is given by the following equation.

Step 4: If the virtual manipulated variable ū(t) is close to the measured manipulated variable u(t), the closed loop with this controller can be regarded as close to the reference model. That is, the evaluation function to be minimized is given by the following equation.

From equations (1) and (2), the above equation is as follows.

Step 5: Introduction of pre-filter L
The term in equation (4) contains the inverse of the reference model Md and is therefore non-proper. Adding the prefilter shown in equation (34) avoids this. Applying the prefilter to equation (4) yields equation (5).
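The equations referred to in steps 2 to 5 are not reproduced in this text. As a hedged reconstruction based on the standard VRFT formulation (not the patent's own typesetting), they would plausibly read as follows. The 1/N normalization and the exact placement of the prefilter L are assumptions.

```latex
\begin{align}
\bar{r}(t) &= M_d^{-1}\, y(t) \tag{1}\\
\bar{u}(t) &= C(z,\rho)\,\bigl(\bar{r}(t) - y(t)\bigr) = C(z,\rho)\,\bar{e}(t) \tag{2}\\
J(\rho) &= \frac{1}{N}\sum_{t=1}^{N} \bigl(u(t) - \bar{u}(t)\bigr)^{2} \tag{3}\\
J(\rho) &= \frac{1}{N}\sum_{t=1}^{N} \Bigl(u(t) - C(z,\rho)\,\bigl(M_d^{-1} - 1\bigr)\, y(t)\Bigr)^{2} \tag{4}\\
J(\rho) &= \frac{1}{N}\sum_{t=1}^{N} \Bigl(L\,u(t) - C(z,\rho)\,L\,\bigl(M_d^{-1} - 1\bigr)\, y(t)\Bigr)^{2} \tag{5}
\end{align}
```

Equation (4) follows by substituting (1) and (2) into (3). In (5), a prefilter L whose relative degree is at least that of Md makes the product L(Md⁻¹ − 1) proper, which is the purpose stated in step 5.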

here,

［ゲインスケジュールＰＩＤ制御］
図２は、ゲインスケジュールＰＩＤ制御のシステムを示す図である。図２に示すように、ゲインスケジュールＰＩＤ制御のシステムは、制御対象Ｐと、制御器Ｃ（ｚ，ρ）、可変な制御器パラメータρ、有理関数ベクトルψ（ｚ）、スケジューリング関数ｆ（ｘ）により構成される。
図２に示す制御器Ｃ（ｚ，ρ）は、次式で記述される。

ここで、

制御パラメータρは、スケジューリング関数ｆ（ｘ）に応じてゲインスケジュールされる。ｗ_iは、ｉ番目のスケジューリング関数のパラメータベクトルである。 [Gain schedule PID control]
FIG. 2 is a diagram showing a gain schedule PID control system. As shown in FIG. 2, the system consists of a controlled object P, a controller C(z, ρ), a variable controller parameter ρ, a rational function vector ψ(z), and a scheduling function f(x).
The controller C(z, ρ) shown in FIG. 2 is described by the following equation.

here,

The control parameter ρ is gain-scheduled according to the scheduling function f(x). w_i is the parameter vector of the i-th scheduling function.

［ルックアップテーブルの構造］
図３に、２次元の場合のグリッドベースルックアップテーブル（Grid-Based Look-up Table）の構成を示す。図３に示すｘ１，ｘ２は、ＬＵＴへの入力、Ｃ_ｌ，ｋ，Ｃ_{１，ｋ＋１}，Ｃ_２，１、Ｃ_{２，ｌ＋１}は、補間ノード、θ_ｋ，ｌ，θ_{ｋ＋１，ｌ}，θ_{ｋ、ｌ＋１}，θ_{ｋ＋１，ｌ＋１}は、テーブルデータ（ＬＵＴのパラメータ値）である。このとき、ＬＵＴへ入力が与えられときのＬＵＴからの出力ｗ_ＬＵは次式となる。

ここで、

[Structure of lookup table]
FIG. 3 shows the configuration of a grid-based look-up table for the two-dimensional case. In FIG. 3, x1 and x2 are the inputs to the LUT; C_{1,k}, C_{1,k+1}, C_{2,l}, and C_{2,l+1} are interpolation nodes; and θ_{k,l}, θ_{k+1,l}, θ_{k,l+1}, and θ_{k+1,l+1} are table data (LUT parameter values). The output w_LU of the LUT for a given input is expressed by the following equation.

here,

図４は、ゲインスケジューラの構成を示す図である。図４に示すように、ＬＵＴから出力される複数のパラメータが重み付け加算された値が制御パラメータとなる。 FIG. 4 is a diagram showing the configuration of the gain scheduler. As shown in FIG. 4, a value obtained by weighting and adding a plurality of parameters output from the LUT becomes a control parameter.

重みは、状態量に近いテーブルデータの影響が大きく、状態量から遠いテーブルデータの影響を小さくするように、状態量と各ノードの面積Ａが利用される。 For the weight, the state quantity and the area A of each node are used so that the influence of the table data close to the state quantity is large and the influence of the table data far from the state quantity is reduced.
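The area-based weighting described above corresponds to ordinary bilinear interpolation on the grid. The following is a minimal sketch under that assumption; the function name `lut_lookup` and its arguments are hypothetical and are not taken from the patent.

```python
import numpy as np

def lut_lookup(x1, x2, nodes1, nodes2, table):
    """Area-weighted (bilinear) lookup in a 2-D grid LUT.

    nodes1, nodes2: increasing interpolation nodes along each axis.
    table[k, l]: table data stored at (nodes1[k], nodes2[l]).
    """
    nodes1 = np.asarray(nodes1, dtype=float)
    nodes2 = np.asarray(nodes2, dtype=float)
    table = np.asarray(table, dtype=float)
    # Clamp the query to the grid so that all weights stay in [0, 1].
    x1 = float(np.clip(x1, nodes1[0], nodes1[-1]))
    x2 = float(np.clip(x2, nodes2[0], nodes2[-1]))
    # Index of the lower node of the enclosing cell on each axis.
    k = int(np.clip(np.searchsorted(nodes1, x1, side="right") - 1, 0, len(nodes1) - 2))
    l = int(np.clip(np.searchsorted(nodes2, x2, side="right") - 1, 0, len(nodes2) - 2))
    # Cell area and the partial areas opposite each corner:
    # the closer the query is to a corner, the larger that corner's area.
    area = (nodes1[k + 1] - nodes1[k]) * (nodes2[l + 1] - nodes2[l])
    a_kl = (nodes1[k + 1] - x1) * (nodes2[l + 1] - x2)
    a_k1l = (x1 - nodes1[k]) * (nodes2[l + 1] - x2)
    a_kl1 = (nodes1[k + 1] - x1) * (x2 - nodes2[l])
    a_k1l1 = (x1 - nodes1[k]) * (x2 - nodes2[l])
    return (a_kl * table[k, l] + a_k1l * table[k + 1, l]
            + a_kl1 * table[k, l + 1] + a_k1l1 * table[k + 1, l + 1]) / area
```

Because the four weights sum to one, a query that falls exactly on a node returns that node's table value, and values in between vary continuously.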

式（１０）に一般化基底関数φを導入するとサイスＭ_１×Ｍ_２のマップの出力は次式で表せる。なお、基底関数は、公知のルックアップテーブルで示されるか、簡単な計算により求められる。

Introducing the generalized basis function φ into equation (10), the output of a map of size M_1 × M_2 can be expressed by the following equation. The basis functions are given by a known lookup table or obtained by simple calculation.
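One way to read the basis-function form is that the LUT output becomes an inner product between a sparse vector of interpolation weights and the flattened table data, which is what makes the output linear in the table data. The sketch below assumes bilinear weights and row-major flattening; `basis_vector` is a hypothetical name.

```python
import numpy as np

def basis_vector(x1, x2, nodes1, nodes2):
    """Return phi(x) of length M1*M2 such that LUT output = phi @ theta.ravel()."""
    nodes1 = np.asarray(nodes1, dtype=float)
    nodes2 = np.asarray(nodes2, dtype=float)
    m1, m2 = len(nodes1), len(nodes2)
    x1 = float(np.clip(x1, nodes1[0], nodes1[-1]))
    x2 = float(np.clip(x2, nodes2[0], nodes2[-1]))
    k = int(np.clip(np.searchsorted(nodes1, x1, side="right") - 1, 0, m1 - 2))
    l = int(np.clip(np.searchsorted(nodes2, x2, side="right") - 1, 0, m2 - 2))
    # Normalised position of the query inside its cell along each axis.
    s = (x1 - nodes1[k]) / (nodes1[k + 1] - nodes1[k])
    t = (x2 - nodes2[l]) / (nodes2[l + 1] - nodes2[l])
    phi = np.zeros((m1, m2))
    # Only the four corners of the enclosing cell get a nonzero weight.
    phi[k, l] = (1 - s) * (1 - t)
    phi[k + 1, l] = s * (1 - t)
    phi[k, l + 1] = (1 - s) * t
    phi[k + 1, l + 1] = s * t
    return phi.ravel()
```

With this representation, a scheduled gain at state x is `basis_vector(...) @ theta.ravel()`, so the table data can be estimated by linear regression.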

なお、図３に示すＬＵＴは、状態１に対応する軸と状態２に対応する軸とを有する二次元のＬＵＴであるが、ＬＵＴは１つの状態に対応する１つの軸のみを有する一次元のＬＵＴであってもよく、３つ以上の状態に対応する三次元以上のＬＵＴであってもよい。 Note that the LUT shown in FIG. 3 is a two-dimensional LUT having an axis corresponding to state 1 and an axis corresponding to state 2, but the LUT is a one-dimensional LUT having only one axis corresponding to one state. It may be a LUT, or a three or more dimensional LUT corresponding to three or more states.

［モデル参照制御］
図５は、ゲインスケジュール制御によるモデル参照制御のブロック線図である。図５に示すように、目標値ｒから出力ｙまでの入出力特性が、予め設定された参照モデルＭｄに一致するようなゲインスケジューラを構成するｎ個のパラメータベクトルｗを自動で調整する場合に、次の評価関数Ｊ_ＭＲを最小にするゲインスケジュールの最適パラメータを得ることを目的とする。

ここで、ｗはスケジューリング関数ｆ（ｘ）を構成するパラメータベクトルである。 [Model reference control]
FIG. 5 is a block diagram of model reference control by gain schedule control. As shown in FIG. 5, the n parameter vectors w constituting the gain scheduler are adjusted automatically so that the input/output characteristic from the target value r to the output y matches a preset reference model Md. The aim is to obtain the optimal gain schedule parameters that minimize the following evaluation function J_MR.

where w is a parameter vector that constitutes the scheduling function f(x).

［ゲインスケジュール制御パラメータの自動調整］
ここで、制御器は、例えば、速度型ＰＩＤ制御器とする。２次元ＬＵＴをゲインスケジューラとして用いると、ゲインスケジュールＰＩＤ制御は次式で表される。なお、ＰＩＤ制御器を速度型ＰＩＤ制御器とするが、本開示は速度型ＰＩＤ制御器に限定されない。

ここで、

である。ρは制御器の制御パラメータ、Ｋｐは比例ゲイン、Ｋｉは積分ゲイン、Ｋｄは微分ゲインである。また、ｆ_ｐは比例ゲインに関するＬＵＴ、ｆ_ｉは積分ゲインに関するＬＵＴ、ｆ_ｄは微分ゲインに関するＬＵＴである。ｗ^Ｋｐ、ｗ^Ｋｉ、ｗ^Ｋｄは、それぞれ比例ゲイン、積分ゲイン、微分ゲインに関するゲインスケジューラのパラメータベクトルである。φ（チルダ）は、式（１２）に示した一般化基底関数を要素とする既知ベクトルであり、ＬＵＴにおいて各テーブルデータが格納されている場所の位置座標を用いて計算される。また、式中「Ｔ」はベクトルの転置を表す。 [Automatic adjustment of gain schedule control parameters]
Here, the controller is taken to be, for example, a velocity-form PID controller. When a two-dimensional LUT is used as the gain scheduler, gain schedule PID control is expressed by the following equation. Although a velocity-form PID controller is used here, the present disclosure is not limited to it.

here,

ρ is the control parameter of the controller, Kp is the proportional gain, Ki is the integral gain, and Kd is the derivative gain. f_p is the LUT for the proportional gain, f_i is the LUT for the integral gain, and f_d is the LUT for the derivative gain. w^Kp, w^Ki, and w^Kd are the gain scheduler parameter vectors for the proportional, integral, and derivative gains, respectively. φ̃ (φ with a tilde) is a known vector whose elements are the generalized basis functions shown in Equation (12), and is calculated using the position coordinates at which each item of table data is stored in the LUT. "T" in the formula denotes the transpose of a vector.
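As an illustration of how scheduled gains enter a velocity-form PID law, the sketch below uses the textbook incremental form Δu(t) = Kp·(e(t) − e(t−1)) + Ki·e(t) + Kd·(e(t) − 2e(t−1) + e(t−2)). This is an assumption, since equation (13) is not reproduced here. For brevity the gains come from one-dimensional tables via `np.interp` rather than the two-dimensional LUT of FIG. 3, and the class name is hypothetical.

```python
import numpy as np

class GainScheduledPID:
    """Velocity-form PID whose gains are looked up from 1-D tables."""

    def __init__(self, nodes, kp_table, ki_table, kd_table):
        self.nodes = np.asarray(nodes, dtype=float)
        self.kp = np.asarray(kp_table, dtype=float)
        self.ki = np.asarray(ki_table, dtype=float)
        self.kd = np.asarray(kd_table, dtype=float)
        self.e1 = 0.0  # e(t-1)
        self.e2 = 0.0  # e(t-2)
        self.u = 0.0   # accumulated controller output

    def gains(self, x):
        """Interpolate (Kp, Ki, Kd) at scheduling state x."""
        return tuple(float(np.interp(x, self.nodes, tbl))
                     for tbl in (self.kp, self.ki, self.kd))

    def step(self, r, y, x):
        """Advance one sample and return the manipulated variable u(t)."""
        e = r - y
        kp, ki, kd = self.gains(x)
        du = kp * (e - self.e1) + ki * e + kd * (e - 2 * self.e1 + self.e2)
        self.u += du
        self.e2, self.e1 = self.e1, e
        return self.u
```

The incremental form accumulates Δu, so a change in the scheduled gains affects only future increments; this is one reason the velocity form is often paired with gain scheduling.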

式（５）に式（１３）を代入し，式を整理すると、評価関数は次式が得られる。

ここで、

ここで、パラメータベクトルｗ、ベクトルξ，及び、ベクトルｄのそれぞれは、以下の式（２７）、式（２８）及び式（２９）で表される。

By substituting equation (13) into equation (5) and arranging the equations, the following equation is obtained as the evaluation function.

here,

Here, the parameter vector w, the vector ξ, and the vector d are respectively represented by the following equations (27), (28) and (29).

式（２６）～式（２９）に示すように、フィードバック制御の応答と参照モデルと擬似参照信号から得られる目標応答との誤差ｅ_ＬＵＴ（ｔ）は、ＬＵＴのテーブルデータから構成されるパラメータベクトルｗに関して線形で表せる。 As shown in equations (26) to (29), the error e_LUT(t) between the feedback-control response and the target response obtained from the reference model and the pseudo reference signal can be expressed linearly in the parameter vector w composed of the LUT table data.

式（２９）より、ξ（ｔ）は、ＬＵＴにおいて各テーブルデータが格納されている場所の位置座標、参照モデルＭｄ、及び、制御対象Ｐの出力ｙ（ｔ）を用いて算出できる。また、式（２７）より、ｄ（ｔ）は、制御対象Ｐの入力ｕ（ｔ）（すなわち制御器Ｃの出力）を用いて算出できる。 From Equation (29), ξ(t) can be calculated using the position coordinates of the location where each table data is stored in the LUT, the reference model Md, and the output y(t) of the controlled object P. Further, d(t) can be calculated using the input u(t) of the controlled object P (that is, the output of the controller C) from Equation (27).

式（２５）および式（２６）から、評価関数Ｊは、（１）ＬＵＴにおいてテーブルデータを配置すべき位置座標、参照モデルＭｄ、及び制御対象Ｐの出力ｙから算出されるベクトルξと、ＬＵＴのテーブルデータを要素とするパラメータベクトルｗとの内積と、（２）制御対象Ｐの入力信号ｕを用いて算出された値の積との差の二乗で定義される。 From equations (25) and (26), the evaluation function J is defined as the square of the difference between (1) the inner product of the vector ξ, calculated from the position coordinates at which the table data are to be placed in the LUT, the reference model Md, and the output y of the controlled object P, with the parameter vector w whose elements are the LUT table data, and (2) the product of values calculated using the input signal u of the controlled object P.

［最適化手法］
上述したように、評価関数はパラメータベクトルｗに関して線形であるため、最小二乗法を用いると、最適解は次式で求められる。

ここで、

[Optimization method]
As described above, since the evaluation function is linear with respect to the parameter vector w, using the least squares method, the optimum solution is obtained by the following equation.

here,

最小二乗法を適用した場合、非常に局所的にＬＵＴのパラメータを求めることになり、そのパラメータと隣り合うパラメータとの差が極端に大きくなる場合がある。これは、ＰＩＤゲインが急変し、閉ループ系が不安定になることにつながる。本開示では、一般化Ｒｉｄｇｅを導入することで、互いに隣り合うパラメータの差の大きさに対しＬ^１ノルムを導入する。
一般化Ｒｉｄｇｅを取り入れた評価関数は次式で表される。

λは設計パラメータであって、クロスバリデーション（Cross-validation）等の公知の手法により適切な値が求められる。また、クロスバリデーションにより、制御対象の入出力データ数が少ない場合や、制御対象の状態が急激に変化した場合でも、過学習が防止されるため、パラメータの局所的な最適化を防止できる。 When the least squares method is applied, the LUT parameters are determined very locally, and the difference between a parameter and its neighbours can become extremely large. This causes the PID gains to change abruptly and can destabilize the closed-loop system. In the present disclosure, generalized Ridge regression is introduced, which applies an L^1 norm to the magnitude of the difference between adjacent parameters.
An evaluation function incorporating generalized Ridge is expressed by the following equation.

λ is a design parameter, and an appropriate value is obtained by a known method such as cross-validation. Cross-validation also prevents overfitting even when the controlled object provides few input/output data or its state changes abruptly, which in turn prevents the parameters from being optimized only locally.
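A minimal sketch of a difference-penalized regression of this kind is given below. It assumes the usual generalized Ridge form, in which the squared differences between adjacent table entries are penalized and the solution is w = (ΞᵀΞ + λDᵀD)⁻¹Ξᵀd; the exact penalty used in the patent's equation is not reproduced here, so treat this form as an assumption. The function names are hypothetical, and a one-dimensional table is used for brevity.

```python
import numpy as np

def difference_matrix(m):
    """First-difference operator D with (D @ w)[i] = w[i+1] - w[i]."""
    return np.diff(np.eye(m), axis=0)

def generalized_ridge(xi, d, lam, penalty):
    """Minimise ||d - xi @ w||^2 + lam * ||penalty @ w||^2 over w."""
    xi = np.asarray(xi, dtype=float)
    d = np.asarray(d, dtype=float)
    lhs = xi.T @ xi + lam * (penalty.T @ penalty)
    rhs = xi.T @ d
    # lstsq also copes with a singular lhs, e.g. unvisited cells when lam = 0.
    return np.linalg.lstsq(lhs, rhs, rcond=None)[0]
```

For a two-dimensional LUT, the penalty matrix would stack the difference operators along both axes. The penalty lets table cells that the data never visited inherit values from their neighbours instead of being left undetermined.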

［アルゴリズム］
ＶＲＦＴを用いたＰＩＤゲインのスケジューリング関数の重み係数（制御パラメータ）の自動調整アルゴリズムについて説明する。
ステップ１：開ループ系で入出力データを計測する。
ステップ２：参照モデルを設定する。
ステップ３：スケジューリングパラメータの候補を決定し、ＰＩＤゲインそれぞれのスケジューリング関数を設計する。
ステップ４：プレフィルタを設計する。
ステップ５：評価関数を最小にするスケジューリング関数の重み係数（制御パラメータ）をＲｉｄｇｅ回帰で求める。
なお、ステップ４において、たとえば、シミュレーションおよび実験にて実績のある次式のプレフィルタを用いる。ただし、プレフィルタはこれに限らない。

[Algorithm]
An automatic adjustment algorithm for the weighting coefficient (control parameter) of the PID gain scheduling function using VRFT will be described.
Step 1: Measure input/output data in an open loop system.
Step 2: Set the reference model.
Step 3: Determine candidate scheduling parameters and design a scheduling function for each PID gain.
Step 4: Design a pre-filter.
Step 5: Find the weighting coefficient (control parameter) of the scheduling function that minimizes the evaluation function by Ridge regression.
In step 4, for example, a pre-filter of the following formula, which has a proven track record in simulations and experiments, is used. However, the prefilter is not limited to this.
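The steps above can be sketched as the construction of a linear regression problem. The sketch assumes a first-order reference model Md(z) = (1 − a)z⁻¹ / (1 − a z⁻¹), the prefilter L = Md (so that L(Md⁻¹ − 1)y = (1 − Md)y), and the velocity-form PID regressors; all of these are assumptions, since the patent's own reference model and prefilter equations are not reproduced here. It also treats the filter as commuting with the scheduled gains, which is an approximation. The function names are hypothetical.

```python
import numpy as np

def first_order_filter(signal, a):
    """Apply Md(z) = (1 - a) z^-1 / (1 - a z^-1) with zero initial state."""
    signal = np.asarray(signal, dtype=float)
    out = np.zeros(len(signal))
    for t in range(1, len(signal)):
        out[t] = a * out[t - 1] + (1 - a) * signal[t - 1]
    return out

def build_regression(u, y, phi, a):
    """Return (Xi, d) such that the cost is mean((d - Xi @ w) ** 2).

    phi: N x m matrix whose row t is the basis vector at the state x(t).
    w is ordered as [w_Kp (m entries), w_Ki (m), w_Kd (m)].
    """
    u = np.asarray(u, dtype=float)
    y = np.asarray(y, dtype=float)
    phi = np.asarray(phi, dtype=float)
    # Prefiltered virtual error: L (Md^-1 - 1) y = (1 - Md) y for L = Md.
    e = y - first_order_filter(y, a)
    e1 = np.concatenate(([0.0], e[:-1]))   # e(t-1)
    e2 = np.concatenate(([0.0], e1[:-1]))  # e(t-2)
    # Velocity-form regressors for the P, I and D terms (N x 3).
    psi = np.stack([e - e1, e, e - 2 * e1 + e2], axis=1)
    xi = np.hstack([phi * psi[:, [j]] for j in range(3)])
    # Target: increment of the prefiltered measured input.
    lu = first_order_filter(u, a)
    d = lu - np.concatenate(([0.0], lu[:-1]))
    return xi, d
```

The resulting `xi` and `d` can be passed to an ordinary or difference-penalized least squares solver to obtain the table data for all three gains at once.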

［シミュレーション検証］
非線形システムを制御対象とした。ここでは、非線形システムを記述するモデルとして広く用いられているHammerstein モデルを制御対象とした。
制御器を含めたシミュレーションのサンプリング周期は１（ｓｅｃ）とし、次式に示すようなHammerstein モデルを制御対象とした。

ここで、ｖは分散１×１０^－３の白色雑音とする。各時刻の目標値ｒを以下のように設定した。

参照モデルは次式を用いた。

スケジューリングパラメータを次式とした。

ゲインスケジューラは、式（９）を用いた。 [Simulation verification]
A nonlinear system was used as the controlled object. Here, the Hammerstein model, which is widely used to describe nonlinear systems, was chosen.
The sampling period of the simulation, including the controller, was set to 1 s, and the Hammerstein model shown in the following equation was used as the controlled object.

Here, v is white noise with a variance of 1×10⁻³. The target value r at each time was set as follows.

The following formula was used as the reference model.

The scheduling parameters are given by the following equations.

Equation (9) is used for the gain scheduler.
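For readers unfamiliar with the plant structure, a Hammerstein model is a static nonlinearity followed by linear dynamics. The sketch below illustrates that structure only: the nonlinearity and the coefficients `a` and `b` are hypothetical placeholders, not the values used in the patent's simulation, whose equations are not reproduced here.

```python
import numpy as np

def simulate_hammerstein(u, a=0.6, b=0.4, noise_var=0.0, seed=0):
    """Static nonlinearity f(u) followed by first-order linear dynamics."""
    rng = np.random.default_rng(seed)
    u = np.asarray(u, dtype=float)
    x = u + 0.5 * u ** 2  # hypothetical static nonlinearity f(u)
    v = rng.normal(0.0, np.sqrt(noise_var), size=len(u))  # white noise
    y = np.zeros(len(u))
    for t in range(1, len(u)):
        y[t] = a * y[t - 1] + b * x[t - 1] + v[t]
    return y
```

Because the static gain of such a plant depends on the operating point, a single set of fixed PID gains tends to fit only part of the operating range, which is the situation gain scheduling is meant to address.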

［検証結果］
入力はチャープサイン信号を印加し、入出力データを計測した。
従来の手法である標準ＶＲＦＴ（固定ＰＩＤゲイン）を用いたときの時系列データと、本開示の手法であるＬＵＴ－ＶＲＦＴ－Ｒｉｄｇｅを用いたとときの時系列データとを比較した。
従来の手法で求められた固定ＰＩＤゲインは、Ｋ_ｐ＝０．０３８９、Ｋ_ｉ＝０．２２３２、Ｋ_ｄ＝０．００００である。取得した入出力データから固定ＰＩＤゲインを求めたときの時系列データを図９に示す。また、本開示の手法でＰＩＤゲインを求めたときの時系列データを図１０に示す。なお、図９および図１０のそれぞれにおいて、上段から、出力、入力、比例ゲイン、積分ゲイン、微分ゲインである。
図９および図１０を参照して、従来の手法と本開示の手法とを比較すると、本開示の手法は、ＰＩＤゲインが制御対象の状態に応じて変化しており、ＰＩＤゲインが目標応答に追従していることが確認できた。
[Verification results]
A chirp sine signal was applied as an input, and input and output data were measured.
Time-series data obtained with the conventional technique, standard VRFT (fixed PID gains), were compared with time-series data obtained with the technique of the present disclosure, LUT-VRFT-Ridge.
The fixed PID gains determined by the conventional method are K_p = 0.0389, K_i = 0.2232, and K_d = 0.0000. FIG. 9 shows the time-series data when the fixed PID gains are obtained from the acquired input/output data. FIG. 10 shows the time-series data when the PID gains are obtained by the method of the present disclosure. In each of FIGS. 9 and 10, the panels show, from the top, the output, the input, the proportional gain, the integral gain, and the derivative gain.
Comparing the conventional method with the method of the present disclosure with reference to FIGS. 9 and 10, it was confirmed that, with the method of the present disclosure, the PID gains change according to the state of the controlled object and the PID gains follow the target response.

また、本開示の手法における、制御対象の状態と比例ゲイン、積分ゲイン、および、微分ゲインのそれぞれとの関係を図１１Ａ－図１１Ｃに示す。なお、図１１Ａ－図１１Ｃのそれぞれにおいて、ｓｔａｔｅ１軸は状態１の数量を表し、ｓｔａｔｅ２軸は状態２の数量を表す。縦軸は比例ゲイン、積分ゲイン、および、微分ゲインのそれぞれを表す。
図１１Ａ－図１１Ｃを参照すると、制御対象の状態１および状態２のそれぞれに応じて比例ゲイン、積分ゲイン、および、微分ゲインのそれぞれが変化していることが確認できた。 FIGS. 11A to 11C show the relationship between the state of the controlled object and each of the proportional gain, the integral gain, and the derivative gain in the method of the present disclosure. In each of FIGS. 11A to 11C, the state1 axis represents the quantity of state 1, and the state2 axis represents the quantity of state 2. The vertical axis represents the proportional gain, the integral gain, or the derivative gain, respectively.
Referring to FIGS. 11A to 11C, it was confirmed that the proportional gain, the integral gain, and the derivative gain each change according to state 1 and state 2 of the controlled object.

＜実施の形態に係る情報処理装置１の機能構成＞
以上の技術を前提として、実施の形態に係る情報処理装置１を説明する。 <Functional Configuration of Information Processing Apparatus 1 According to Embodiment>
Based on the above technology, the information processing apparatus 1 according to the embodiment will be described.

図６は、実施の形態に係る閉ループ系を模式的に示す図であり、制御モデルを用いずに自動調整則を有するゲインスケジュールＰＩＤ制御を実現するための構成を示す図である。図６に示すように、実施の形態に係る情報処理装置１は、制御対象Ｐへの入力ｕ、制御対象の出力ｙ、制御対象Ｐの状態Ｘ、参照モデルＭの出力Ｍ（ｚ^－１）ｒを取得して、ゲインスケジュール制御を実現するためのＬＵＴを変更するとともに、制御器Ｃの制御パラメータρを出力する。図６から明らかなように、実施の形態に係る閉ループ系は、図１に示した参照モデルを備える一般的な閉ループ系に情報処理装置１が追加されて構成されている。 FIG. 6 is a diagram schematically showing a closed-loop system according to the embodiment, and shows a configuration for realizing gain schedule PID control with an automatic adjustment rule without using a control model. As shown in FIG. 6, the information processing device 1 according to the embodiment acquires the input u to the controlled object P, the output y of the controlled object, the state X of the controlled object P, and the output M(z⁻¹)r of the reference model M, changes the LUT used for gain schedule control, and outputs the control parameter ρ of the controller C. As is clear from FIG. 6, the closed-loop system according to the embodiment is configured by adding the information processing device 1 to the general closed-loop system with a reference model shown in FIG. 1.

FIG. 7 is a diagram schematically showing the functional configuration of the information processing device 1 according to the embodiment. The information processing device 1 includes a storage unit 2 and a control unit 3. In FIG. 7, the arrows indicate the main data flows, and there may be data flows that are not shown in FIG. 7. Each functional block in FIG. 7 represents a unit of function, not a unit of hardware (device). The functional blocks shown in FIG. 7 may therefore be implemented within a single device or distributed across a plurality of devices. Data may be exchanged between functional blocks via any means, such as a data bus, a network, or a portable storage medium.

The storage unit 2 comprises a ROM (Read Only Memory) that stores the BIOS (Basic Input Output System) and the like of the computer implementing the information processing device 1, a RAM (Random Access Memory) that serves as the work area of the information processing device 1, and a mass storage device such as an HDD (Hard Disk Drive) or SSD (Solid State Drive) that stores the OS (Operating System), application programs, various information referred to when the application programs are executed, and the LUT that stores the state of the controlled object P in association with the control parameter ρ of the controller C.

The control unit 3 is a processor of the information processing device 1, such as a CPU (Central Processing Unit) or GPU (Graphics Processing Unit), and functions as a state acquisition unit 30, a parameter generation unit 31, and a table changing unit 32 by executing a program stored in the storage unit 2.

Note that FIG. 7 shows an example in which the information processing device 1 is a single device. However, the information processing device 1 may instead be realized by computational resources such as a plurality of processors and memories, as in a cloud computing system. In that case, each unit of the control unit 3 is realized by at least one of the plurality of different processors executing a program.

As described in the [Gain-Scheduled PID Control] section, the information processing device 1 according to the embodiment calculates the control parameter ρ of the controller C in FIG. 6. That is, in a control system that includes the controller C, the controlled object P that takes the output of the controller C as input, and the reference model M that takes as input the input signal input to the controlled object P and models the output of the controlled object P, and in which the output of the controlled object P is fed back to the input of the controller C, the information processing device 1 calculates the control parameter ρ of the controller C. The information processing device 1 also changes the parameter values, that is, the table data of the LUT used to calculate the control parameter ρ of the controller C.

As described in the [Structure of Lookup Table] section, the LUT stored in the storage unit 2 has one or more states relating to the controlled object P as its axes, and the control parameters ρ of the controller C are arranged and stored in the space defined by those axes.

The state acquisition unit 30 acquires the state X of the controlled object P. The state X of the controlled object P is, for example, the temperature or pressure of the controlled object P, its speed or momentum when the controlled object P is in motion, or a parameter of the environment in which the controlled object P exists (temperature, humidity, and so on). The data acquired by the state acquisition unit 30 is accumulated in the storage unit 2, and the accumulated data is used by the parameter generation unit 31. The parameters of the LUT are thereby adjusted offline by the table changing unit 32. Alternatively, the data acquired by the state acquisition unit 30 may be used in real time so that the parameters of the LUT are adjusted online by the table changing unit 32.

Based on the state X of the controlled object P acquired by the state acquisition unit 30, the parameter generation unit 31 refers to the LUT and generates the control parameter ρ from the plurality of parameter values θi,j. Specifically, based on Equation (8) or Equation (13), the parameter generation unit 31 multiplies each of the parameter values θi,j by its corresponding weight and adds the resulting products to generate the control parameter ρ. The closer the distance d between the state X of the controlled object P acquired by the state acquisition unit 30 and the state to which a parameter value θi,j corresponds, the larger the weight by which the parameter generation unit 31 multiplies that parameter value θi,j. The information processing device 1 can thereby generate the control parameter ρ in response to fast changes in the characteristics of the controlled object P.
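
The weighted sum described above can be sketched as follows. This is only a minimal illustration, assuming a two-axis LUT and bilinear interpolation weights, which is one weighting in which grid points closer to the acquired state receive larger weights; Equations (8) and (13) are not reproduced in this section, so the weights actually used in the disclosure may differ. The function names, `kp_table`, and the axis values are hypothetical.

```python
from bisect import bisect_right

def interpolation_weights(axis, x):
    """Return (lower index, weight of lower point, weight of upper point) on a sorted axis."""
    x = min(max(x, axis[0]), axis[-1])                 # clamp to the table range
    i = min(bisect_right(axis, x) - 1, len(axis) - 2)  # cell containing x
    t = (x - axis[i]) / (axis[i + 1] - axis[i])        # normalised distance from lower point
    return i, 1.0 - t, t

def generate_control_parameter(theta, axis1, axis2, x1, x2):
    """Weighted sum of the four parameter values theta[i][j] surrounding state (x1, x2)."""
    i, wi0, wi1 = interpolation_weights(axis1, x1)
    j, wj0, wj1 = interpolation_weights(axis2, x2)
    return (wi0 * wj0 * theta[i][j] + wi0 * wj1 * theta[i][j + 1]
            + wi1 * wj0 * theta[i + 1][j] + wi1 * wj1 * theta[i + 1][j + 1])

# Hypothetical table of proportional gains over two states (illustrative values only).
state1_axis = [0.0, 10.0]
state2_axis = [0.0, 1.0]
kp_table = [[1.0, 2.0],
            [3.0, 4.0]]

kp = generate_control_parameter(kp_table, state1_axis, state2_axis, 5.0, 0.5)
```

Because the weights depend only on the acquired state, the generated parameter is a linear function of the table entries, which is the property the later tuning step relies on.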

The table changing unit 32 changes the plurality of parameter values θi,j, which are the table data arranged in the LUT, based on an evaluation function J relating to the error between the output of the controlled object P and the output of the reference model M. Specifically, the table changing unit 32 changes the table data arranged in the LUT so that the evaluation function J shown in Equation (13) becomes smaller. Thus, even if the characteristics of the controlled object P change, for example because of aging, the information processing device 1 can automatically adjust the parameter values θi,j to be stored in the LUT and thereby tune the control parameters to optimal values.

Here, the table data arranged in the LUT is changed in response to the condition of the controlled object P. For example, the table data is changed in response to aging of the controlled object P.

Accordingly, when the evaluation value of the evaluation function J shown in Equation (13) falls outside a predetermined range, the table changing unit 32 changes the plurality of parameter values θi,j, which are the table data arranged in the LUT, so that the evaluation value of the evaluation function J falls within the predetermined range.

Here, the "predetermined range" is the "LUT change determination range" that the table changing unit 32 refers to in order to decide whether the LUT needs to be changed. Specifically, because a smaller evaluation value of the evaluation function J shown in Equation (13) indicates a better result, the predetermined range runs from 0 to an upper limit. That is, when the evaluation value of the evaluation function J exceeds the upper limit, the table changing unit 32 changes the table data arranged in the LUT so that the evaluation value falls below the upper limit. The specific value of this upper limit may be determined experimentally, taking into account prior knowledge of the controlled object P, the accuracy required of the control, and so on. The information processing device 1 can thereby avoid allocating computational resources to unnecessary LUT changes while maintaining the stability of the control of the controlled object P by the controller C.

As shown in Equation (26), the error e between the output of the controlled object P and the output of the reference model M is defined so as to be linear in the parameter vector w (Equation (28)), whose elements are the control parameters ρ based on the plurality of parameter values θi,j, which are the table data to be arranged in the LUT.

Accordingly, the table changing unit 32 changes the plurality of parameter values θi,j to be arranged in the LUT by changing the parameter vector w shown in Equation (28) so that the error e between the output of the controlled object P and the output of the reference model M becomes smaller.

More specifically, the table changing unit 32 changes the parameter vector w based on the evaluation function shown in Equation (33). Equation (33) is an evaluation function that incorporates Ridge regression. In general, compared with the ordinary recursive least squares method, Ridge regression has the advantage of preventing overfitting and thereby preventing the parameters from being optimized only locally.

In this way, the table changing unit 32 uses Ridge regression to change the parameter vector w so that the evaluation value of the evaluation function J becomes smaller, and the information processing device 1 can thereby maintain the stability of the control of the controlled object P by the controller C.
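
For illustration only, the following sketch shows the generic closed-form Ridge solution for an error that is linear in a parameter vector w, that is, the minimizer of Σₖ(φₖ·w − yₖ)² + λ‖w‖². Equation (33) is not reproduced in this section, so the regressor rows `phi`, the targets `y`, and the regularization weight `lam` are assumptions standing in for the quantities defined there, and the helper names are hypothetical. It uses only the standard library and is meant for small problems.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[k]] for k, row in enumerate(a)]   # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in reversed(range(n)):                       # back substitution
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x

def ridge_fit(phi, y, lam):
    """Return w minimising sum_k (phi_k . w - y_k)^2 + lam * ||w||^2."""
    n = len(phi[0])
    # Normal equations: (Phi^T Phi + lam I) w = Phi^T y
    gram = [[sum(row[i] * row[j] for row in phi) + (lam if i == j else 0.0)
             for j in range(n)] for i in range(n)]
    rhs = [sum(row[i] * yk for row, yk in zip(phi, y)) for i in range(n)]
    return solve_linear_system(gram, rhs)

# Illustrative data: three samples, two table parameters.
phi = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
y = [1.0, 2.0, 3.0]
w_plain = ridge_fit(phi, y, 0.0)   # ordinary least squares
w_ridge = ridge_fit(phi, y, 1.0)   # regularised: entries shrink toward zero
```

The penalty term keeps the normal-equation matrix well conditioned even when some LUT cells are rarely visited, at the cost of shrinking the estimates toward zero.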

The parameter generation unit 31 calculates the control parameter ρ as a weighted sum of the plurality of parameter values θi,j, with weights based on their distance from the position coordinates in the LUT that correspond to the state X of the controlled object P acquired by the state acquisition unit 30. Because the parameter generation unit 31 thus calculates the control parameter ρ from the LUT table data by a linear operation, the error e between the output of the controlled object P and the output of the reference model M can ultimately be made linear in the LUT table data, as shown in Equation (26).

<Processing Flow of Information Processing Executed by Information Processing Device 1>
FIG. 8 is a flowchart explaining the flow of information processing executed by the information processing device 1 according to the embodiment. The processing in this flowchart starts, for example, when the information processing device 1 is started up.

The state acquisition unit 30 acquires the state X of the controlled object P (S1). Based on the state X of the controlled object P acquired by the state acquisition unit 30, the parameter generation unit 31 refers to the LUT and generates the control parameter ρ of the controller C (S2).

The table changing unit 32 calculates and acquires the evaluation value of the evaluation function J (S3). If the evaluation value is outside the predetermined range (S4: NO), the table changing unit 32 changes the control parameter ρ stored in the LUT so that the evaluation value of the evaluation function J falls within the predetermined range (S5), and the processing in this flowchart then ends. If the evaluation value is within the predetermined range (S4: YES), the processing in this flowchart ends.
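
Steps S1 to S5 can be sketched as a single pass, assuming the predetermined range is the interval from 0 to an upper limit as described earlier. The four callables and the name `process_once` are hypothetical stand-ins for the state acquisition unit 30, the parameter generation unit 31, and the table changing unit 32; they are not taken from the disclosure.

```python
def process_once(acquire_state, generate_parameter, evaluate, update_table, upper_limit):
    """Run one pass of the flow S1-S5 and report whether the LUT was changed."""
    state = acquire_state()                        # S1: acquire state X
    rho = generate_parameter(state)                # S2: generate control parameter from the LUT
    j_value = evaluate()                           # S3: evaluation value of J
    within_range = 0.0 <= j_value <= upper_limit   # S4: inside the predetermined range?
    if not within_range:
        update_table()                             # S5: change the LUT table data
    return rho, j_value, not within_range

# Illustrative stubs: the evaluation value 0.8 exceeds the limit 0.5, so the table is updated.
update_log = []
rho, j_value, changed = process_once(
    acquire_state=lambda: (5.0, 0.5),
    generate_parameter=lambda state: 2.5,
    evaluate=lambda: 0.8,
    update_table=lambda: update_log.append("updated"),
    upper_limit=0.5,
)
```

Gating the update on the evaluation value is what lets the device skip the comparatively expensive table adjustment while tracking of the reference model remains acceptable.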

<Effects of Information Processing Device 1 According to Embodiment>
As described above, the information processing device 1 according to the embodiment is configured to change the control parameters according to the state of the controlled object, so that the PID gains can sufficiently follow the target response.

The present disclosure has been described above using an embodiment, but the technical scope of the present disclosure is not limited to the scope described in that embodiment, and various modifications and changes are possible within the scope of its gist. For example, the specific form in which devices are distributed or integrated is not limited to the embodiment above, and all or part of the device can be functionally or physically distributed or integrated in arbitrary units. New embodiments produced by any combination of a plurality of embodiments are also included in the embodiments of the present disclosure. The effects of a new embodiment produced by such a combination include the effects of the original embodiments.

In addition, each of the embodiments above is merely one concrete example of implementing the present disclosure, and the technical scope of the present disclosure must not be construed restrictively on that basis. That is, the present disclosure can be implemented in various forms without departing from its gist or its main features.

Industrial Applicability
The present disclosure is suitable for industrial systems equipped with an information processing device that is required to change parameters according to the state of a controlled object, and in particular for strongly nonlinear industrial systems such as engines and automatic transmissions.

1 information processing device
2 storage unit
3 control unit
30 state acquisition unit
31 parameter generation unit
32 table changing unit
C controller
P controlled object
M reference model

To achieve the above object, an information processing device according to the present disclosure calculates a control parameter of a controller in a control system that includes the controller, a controlled object that takes the output of the controller as input, and a reference model that takes as input an input signal input to the controller and models the output of the controlled object, the output of the controlled object being fed back to the input of the controller. The information processing device includes: a lookup table having, as axes, one or more states relating to the controlled object or an external input, in which a plurality of parameter values used to calculate the control parameter are arranged in a space defined by the axes; a state acquisition unit that acquires the state of the controlled object; a storage unit that accumulates the acquired states of the controlled object; a parameter generation unit that, based on the state acquired by the state acquisition unit, refers to the lookup table and generates the control parameter from the plurality of parameter values; and a table changing unit that, offline and using the accumulated states of the controlled object, changes the plurality of parameter values arranged in the lookup table based on the result of Ridge regression for an evaluation function relating to the error between the output of the controlled object and the output of the reference model.

Claims

An information processing device that calculates a control parameter of a controller in a control system comprising the controller, a controlled object that takes the output of the controller as input, and a reference model that takes as input an input signal input to the controlled object and models the output of the controlled object, the output of the controlled object being fed back to the input of the controller, the information processing device comprising:
a lookup table having, as axes, one or more states relating to the controlled object or an external input, in which a plurality of parameter values used to calculate the control parameter are arranged in a space defined by the axes;
a state acquisition unit that acquires the state of the controlled object;
a parameter generation unit that, based on the state acquired by the state acquisition unit, refers to the lookup table and generates the control parameter based on the plurality of parameter values; and
a table changing unit that changes the plurality of parameter values arranged in the lookup table based on the result of Ridge regression for an evaluation function relating to the error between the output of the controlled object and the output of the reference model.

The information processing device according to claim 1, wherein the parameter generation unit generates the control parameter by adding a plurality of products generated by multiplying each of the plurality of parameter values by a weight corresponding to that parameter value.

The information processing device according to claim 2, wherein the parameter generation unit multiplies the plurality of parameter values by weights that are larger the closer the distance between the state of the controlled object acquired by the state acquisition unit and the state to which each of the plurality of parameter values corresponds.

The information processing device according to any one of claims 1 to 3, wherein, when the evaluation value of the evaluation function falls outside a predetermined range, the table changing unit changes the control parameter so that the evaluation value falls within the range.

The information processing device according to any one of claims 1 to 4, wherein
the error between the output of the controlled object and the output of the reference model is defined so as to be linear in a parameter vector whose elements are the control parameters based on the plurality of parameter values arranged in the lookup table, and
the table changing unit changes the control parameter by changing the parameter vector so that the error becomes smaller.

The information processing device according to claim 5, wherein
the evaluation function is defined as the square of the difference between (1) the inner product of the parameter vector and a vector calculated from the position coordinates corresponding to the plurality of parameter values in the space, the reference model, and the output of the controlled object, and (2) the product of the reference model and the input signal to the controlled object, and
the table changing unit uses the Ridge regression to change the parameter vector so that the evaluation value of the evaluation function becomes smaller.