JP2022028363A

JP2022028363A - System identification device and system identification method

Info

Publication number: JP2022028363A
Application number: JP2020131714A
Authority: JP
Inventors: 崇山田; Takashi Yamada
Original assignee: Kobe Steel Ltd
Current assignee: Kobe Steel Ltd
Priority date: 2020-08-03
Filing date: 2020-08-03
Publication date: 2022-02-16

Abstract

To provide a new system identification device and a method with which it is possible to achieve system identification for nonlinearly acting parameters while taking directly into account errors that accumulate over multiple steps.SOLUTION: A system identification device D pertaining to the present invention does by a computer, system identification of a model of prescribed system that performs a prescribed operation. The system identification device D comprises: a result data acquisition unit 42 which, when a prescribed input value is inputted to the system, acquires a plurality of pieces of result data which is obtained from the system for each of multiple points of time arranged in sequence of time; an estimate data acquisition unit 43 which, when the prescribed input value is inputted to the model, acquires a plurality of pieces of estimate data which is estimated by the model for each of multiple points of time; and a parameter processing unit 44 for minimizing an evaluation function of the model based on difference between the result data and estimate data obtained for each of multiple points of time and thereby finding the parameter.SELECTED DRAWING: Figure 1

Description

本発明は、所定の動作を行う所定のシステムのモデルをシステム同定するシステム同定装置およびシステム同定方法に関する。 The present invention relates to a system identification device and a system identification method for system identification of a model of a predetermined system that performs a predetermined operation.

所定の動作を行う所定のシステムに対するシミュレーション（数値実験）では、前記システムのモデルが必要であり、また、前記システムの制御において、前記システムのモデルが利用される場合がある。このシミュレーションや制御では、その精度は、前記モデルの精度によって左右されるため、前記モデルを規定するパラメータを精度よく同定することが望まれる。このパラメータの同定方法は、例えば、特許文献１に開示されている。 A model of the system is required for a simulation (numerical experiment) for a predetermined system that performs a predetermined operation, and the model of the system may be used in controlling the system. In this simulation and control, the accuracy depends on the accuracy of the model, so it is desired to accurately identify the parameters that define the model. A method for identifying this parameter is disclosed in, for example, Patent Document 1.

前記特許文献１に開示されたパラメータ同定法は、鉛直多関節油圧マニピュレータの非線形モデルを表す状態空間方程式に基づき、前記鉛直多関節油圧マニピュレータの体積弾性率および流量係数を含む未知のパラメータを同定する。 The parameter identification method disclosed in Patent Document 1 identifies unknown parameters including the bulk modulus and flow coefficient of the vertical articulated hydraulic manipulator based on a state-space equation representing a nonlinear model of the vertical articulated hydraulic manipulator. ..

特開２０１５－７７６４３号公報JP-A-2015-77643

前記特許文献１に開示されたパラメータ同定法は、非線形モデルを同定できるが、本発明は、非線形に作用するパラメータを、複数ステップにわたり蓄積していく誤差を直接考慮しながらシステム同定できる新たなシステム同定装置およびシステム同定方法を提供することである。 The parameter identification method disclosed in Patent Document 1 can identify a non-linear model, but the present invention is a new system that can identify a system that directly considers an error accumulating parameters that act non-linear over a plurality of steps. It is to provide an identification device and a system identification method.

本発明者は、種々検討した結果、上記目的は、以下の本発明により達成されることを見出した。すなわち、本発明の一態様にかかるシステム同定装置は、所定の動作を行う所定のシステムのモデルをコンピュータによってシステム同定する装置であって、所定の入力値を前記システムに入力した場合に、時系列に並ぶ複数の時点ごとに前記システムから得られる複数の実績データを取得する実績データ取得部と、前記所定の入力値を前記モデルに入力した場合に、前記複数の時点ごとに前記モデルによって推定される複数の推定データを取得する推定データ取得部と、前記複数の時点ごとに求められる前記実績データと前記推定データとの各差に基づく前記モデルの評価関数を、最小化することによって、前記パラメータを求めるパラメータ処理部とを備える。 As a result of various studies, the present inventor has found that the above object can be achieved by the following invention. That is, the system identification device according to one aspect of the present invention is a device that system-identifies a model of a predetermined system that performs a predetermined operation by a computer, and when a predetermined input value is input to the system, it is time-series. A performance data acquisition unit that acquires a plurality of performance data obtained from the system at each of a plurality of time points arranged in the line, and when the predetermined input value is input to the model, it is estimated by the model at each of the plurality of time points. The parameter is obtained by minimizing the estimation data acquisition unit that acquires a plurality of estimation data and the evaluation function of the model based on the difference between the actual data and the estimation data obtained at each of the plurality of time points. It is provided with a parameter processing unit for obtaining.

このようなシステム同定装置は、複数の時点ごとに求められる実績データと推定データとの各差に基づく前記モデルの評価関数を用いるので、非線形に作用するパラメータを、複数ステップにわたり蓄積していく誤差を直接考慮しながらシステム同定できる。 Since such a system identification device uses the evaluation function of the model based on the difference between the actual data and the estimated data obtained at each of a plurality of time points, an error of accumulating parameters acting non-linearly over a plurality of steps. The system can be identified while directly considering.

他の一態様では、上述のシステム同定装置において、前記パラメータ処理部は、前記評価関数を、前記モデルのパラメータがとり得る範囲を規定する拘束条件の下に、最小化することによって、前記パラメータを求める。 In another aspect, in the system identification apparatus described above, the parameter processing unit minimizes the evaluation function under a constraint condition that defines a range that the parameters of the model can take. Ask.

このようなシステム同定装置は、評価関数を、パラメータがとり得る範囲を規定する拘束条件の下に、最小化することによって、前記パラメータを求めるので、パラメータを、所定の条件を満たすように同定できる。 Such a system identification device obtains the parameter by minimizing the evaluation function under the constraint condition that defines the range that the parameter can take, so that the parameter can be identified so as to satisfy a predetermined condition. ..

他の一態様では、これら上述のシステム同定装置において、前記パラメータ処理部は、前記評価関数の勾配情報を用いた繰り返し計算によって前記パラメータを求める。 In another aspect, in these system identification devices described above, the parameter processing unit obtains the parameter by iterative calculation using the gradient information of the evaluation function.

このようなシステム同定装置は、評価関数の勾配情報を用いた繰り返し計算するので、パラメータを最適に収束できる。 Since such a system identification device repeatedly calculates using the gradient information of the evaluation function, the parameters can be optimally converged.

他の一態様では、これら上述のシステム同定装置において、前記所定のシステムは、油圧を生成する油圧ポンプと、前記油圧ポンプで生成された油圧を機械動力に変換する油圧アクチュエータと、前記油圧を制御する油圧制御弁とを備える油圧システムであり、前記所定の動作は、前記油圧アクチュエータで所定の対象物を動かす動作である。 In another aspect, in these system identification devices described above, the predetermined system controls a hydraulic pressure pump, a hydraulic pressure actuator that converts the hydraulic pressure generated by the hydraulic pressure pump into mechanical power, and the hydraulic pressure. It is a hydraulic system including a hydraulic control valve, and the predetermined operation is an operation of moving a predetermined object by the hydraulic actuator.

油圧システムは、通常、非線形に作用するパラメータを多数含むが、これによれば、油圧システムのモデルを精度よくシステム同定できるシステム同定装置が提供できる。 A hydraulic system usually contains a large number of parameters that act non-linearly, which can provide a system identification device capable of accurately system-identifying a model of a hydraulic system.

本発明の一態様にかかるシステム同定方法は、所定の動作を行う所定のシステムのモデルをコンピュータによってシステム同定する方法であって、所定の入力値を前記システムに入力した場合に、時系列に並ぶ複数の時点ごとに前記システムから得られる複数の実績データを取得する実績データ取得工程と、前記所定の入力値を前記モデルに入力した場合に、前記複数の時点ごとに前記モデルによって推定される複数の推定データを取得する推定データ取得工程と、前記複数の時点ごとに求められる前記実績データと前記推定データとの各差に基づく前記モデルの評価関数を、最小化することによって、前記パラメータを求めるパラメータ処理工程とを備える。 The system identification method according to one aspect of the present invention is a method of system-identifying a model of a predetermined system performing a predetermined operation by a computer, and is arranged in time series when a predetermined input value is input to the system. A performance data acquisition process for acquiring a plurality of performance data obtained from the system at each of a plurality of time points, and a plurality of estimations made by the model at each of the plurality of time points when the predetermined input value is input to the model. The parameter is obtained by minimizing the estimation data acquisition process for acquiring the estimated data of the model and the evaluation function of the model based on the difference between the actual data and the estimated data obtained at each of the plurality of time points. It is equipped with a parameter processing process.

このようなシステム同定方法は、複数の時点ごとに求められる実績データと推定データとの各差に基づく前記モデルの評価関数を用いるので、非線形に作用するパラメータを、複数ステップにわたり蓄積していく誤差を直接考慮しながらシステム同定できる。 Since such a system identification method uses the evaluation function of the model based on the difference between the actual data and the estimated data obtained at each of a plurality of time points, an error of accumulating parameters that act non-linearly over a plurality of steps. The system can be identified while directly considering.

本発明にかかるシステム同定装置およびシステム同定方法は、非線形に作用するパラメータを、複数ステップにわたり蓄積していく誤差を直接考慮しながらシステム同定できる。 The system identification apparatus and system identification method according to the present invention can identify a system by directly considering an error accumulating over a plurality of steps of parameters acting non-linearly.

実施形態におけるシステム同定装置の構成を示すブロック図である。It is a block diagram which shows the structure of the system identification apparatus in an embodiment. 前記システム同定装置の動作を示すフローチャートである。It is a flowchart which shows the operation of the system identification apparatus. 実施例１の油圧システムの構成を示す概略図である。It is a schematic diagram which shows the structure of the hydraulic system of Example 1. FIG. 実績値と実施例１の推定値と比較例の推定値との各比較結果を説明するための図である。It is a figure for demonstrating each comparison result of the actual value, the estimated value of Example 1, and the estimated value of a comparative example. 実施例２の機械システムの構成を示す概略図である。It is a schematic diagram which shows the structure of the mechanical system of Example 2.

以下、図面を参照して、本発明の１または複数の実施形態が説明される。しかしながら、発明の範囲は、開示された実施形態に限定されない。なお、各図において同一の符号を付した構成は、同一の構成であることを示し、適宜、その説明を省略する。本明細書において、総称する場合には添え字を省略した参照符号で示し、個別の構成を指す場合には添え字を付した参照符号で示す。 Hereinafter, one or more embodiments of the present invention will be described with reference to the drawings. However, the scope of the invention is not limited to the disclosed embodiments. It should be noted that the configurations with the same reference numerals in the respective drawings indicate the same configurations, and the description thereof will be omitted as appropriate. In the present specification, when they are generically referred to, they are indicated by reference numerals without subscripts, and when they refer to individual configurations, they are indicated by reference numerals with subscripts.

（実施形態）
実施形態におけるシステム同定装置は、所定の動作を行う所定のシステムのモデルをコンピュータによってシステム同定する装置である。このシステム同定装置は、所定の入力値を前記システムに入力した場合に、時系列に並ぶ複数の時点ごとに前記システムから得られる複数の実績データを取得する実績データ取得部と、前記所定の入力値を前記モデルに入力した場合に、前記複数の時点ごとに前記モデルによって推定される複数の推定データを取得する推定データ取得部と、前記複数の時点ごとに求められる前記実績データと前記推定データとの各差に基づく前記モデルの評価関数を、最小化することによって、前記パラメータを求めるパラメータ処理部とを備える。このようなシステム同定装置について、以下、より具体的に説明する。 (Embodiment)
The system identification device in the embodiment is a device that system-identifies a model of a predetermined system that performs a predetermined operation by a computer. This system identification device includes a performance data acquisition unit that acquires a plurality of performance data obtained from the system at each of a plurality of time points arranged in a time series when a predetermined input value is input to the system, and the predetermined input. When a value is input to the model, an estimation data acquisition unit that acquires a plurality of estimation data estimated by the model at each of the plurality of time points, and the actual data and the estimation data obtained at each of the plurality of time points. It is provided with a parameter processing unit for obtaining the parameters by minimizing the evaluation function of the model based on each difference from the above. Such a system identification device will be described in more detail below.

図１は、実施形態におけるシステム同定装置の構成を示すブロック図である。実施形態におけるシステム同定装置Ｄは、例えば、図１に示すように、入力部１と、出力部２と、インターフェース部（ＩＦ部）３と、制御処理部４と、記憶部５とを備える。 FIG. 1 is a block diagram showing a configuration of a system identification device according to an embodiment. As shown in FIG. 1, the system identification device D in the embodiment includes, for example, an input unit 1, an output unit 2, an interface unit (IF unit) 3, a control processing unit 4, and a storage unit 5.

入力部１は、制御処理部４に接続され、例えば、システム同定の開始を指示するコマンド等の各種コマンド、および、システム同定の対象となる所定のシステムＳの名称や、前記システムＳに入力される入力値等の、システム同定装置Ｄを動作させる上で必要な各種データをシステム同定装置Ｄに入力する機器であり、例えば、キーボードやマウス等である。出力部２は、制御処理部４に接続され、制御処理部４の制御に従って、入力部１から入力されたコマンドやデータ、およびシステム同定装置Ｄで求めたモデルのパラメータ等を出力する機器であり、例えばＣＲＴディスプレイ、液晶ディスプレイおよび有機ＥＬディスプレイ等の表示装置やプリンタ等の印刷装置等である。 The input unit 1 is connected to the control processing unit 4, and is input to, for example, various commands such as a command for instructing the start of system identification, the name of a predetermined system S to be the target of system identification, and the system S. It is a device for inputting various data necessary for operating the system identification device D, such as an input value, into the system identification device D, for example, a keyboard, a mouse, or the like. The output unit 2 is a device connected to the control processing unit 4 and outputs commands and data input from the input unit 1 and model parameters obtained by the system identification device D according to the control of the control processing unit 4. For example, a display device such as a CRT display, a liquid crystal display and an organic EL display, a printing device such as a printer, and the like.

なお、入力部１および出力部２からいわゆるタッチパネルが構成されてもよい。このタッチパネルを構成する場合において、入力部１は、例えば抵抗膜方式や静電容量方式等の操作位置を検出して入力する位置入力装置であり、出力部２は、表示装置である。このタッチパネルでは、前記表示装置の表示面上に前記位置入力装置が設けられ、前記表示装置に入力可能な１または複数の入力内容の候補が表示され、ユーザが、入力したい入力内容を表示した表示位置を触れると、前記位置入力装置によってその位置が検出され、検出された位置に表示された表示内容がユーザの操作入力内容としてシステム同定装置Ｄに入力される。このようなタッチパネルでは、ユーザは、入力操作を直感的に理解し易いので、ユーザにとって取り扱い易いシステム同定装置Ｄが提供される。 A so-called touch panel may be configured from the input unit 1 and the output unit 2. In the case of configuring this touch panel, the input unit 1 is a position input device that detects and inputs an operation position such as a resistance film method or a capacitance method, and the output unit 2 is a display device. In this touch panel, the position input device is provided on the display surface of the display device, candidates for one or a plurality of input contents that can be input to the display device are displayed, and the user displays the input contents that he / she wants to input. When the position is touched, the position is detected by the position input device, and the display content displayed at the detected position is input to the system identification device D as the operation input content of the user. With such a touch panel, since the user can intuitively understand the input operation, the system identification device D that is easy for the user to handle is provided.

ＩＦ部３は、制御処理部４に接続され、制御処理部４の制御に従って、外部機器との間でデータの入出力を行う回路であり、例えば、シリアル通信方式であるＲＳ－２３２ＣやＲＳ－４８５のインターフェース回路、Ｂｌｕｅｔｏｏｔｈ（登録商標）規格を用いたインターフェース回路、ＩｒＤＡ（ＩｎｆｒａｒｅｄＤａｔａＡｓｓｃｏｉａｔｉｏｎ）規格等の赤外線通信を行うインターフェース回路、および、ＵＳＢ（ＵｎｉｖｅｒｓａｌＳｅｒｉａｌＢｕｓ）規格を用いたインターフェース回路等である。また、ＩＦ部３は、外部機器との間で通信を行う回路であり、例えば、データ通信カードや、ＩＥＥＥ８０２．１１規格等に従った通信インターフェース回路等であってもよい。 The IF unit 3 is a circuit that is connected to the control processing unit 4 and inputs / outputs data to / from an external device according to the control of the control processing unit 4, for example, RS-232C or RS-, which is a serial communication method. An interface circuit of 485, an interface circuit using the Bluetooth (registered trademark) standard, an interface circuit for infrared communication such as the IrDA (Infrared Data Association) standard, and an interface circuit using the USB (Universal Serial Bus) standard. .. Further, the IF unit 3 is a circuit that communicates with an external device, and may be, for example, a data communication card, a communication interface circuit according to the IEEE802.11 standard, or the like.

記憶部５は、制御処理部４に接続され、制御処理部４の制御に従って、各種の所定のプログラムおよび各種の所定のデータを記憶する回路である。前記各種の所定のプログラムには、例えば、制御処理プログラムが含まれ、前記制御処理プログラムには、システム同定装置Ｄの各部１～３、５を当該各部の機能に応じてそれぞれ制御する制御プログラムや、所定の入力値をシステムＳに入力した場合に、時系列に並ぶ複数の時点ごとに前記システムＳから得られる複数の実績データを取得する実績データ取得プログラムや、前記所定の入力値を前記システムＳのモデルに入力した場合に、前記複数の時点ごとに前記モデルによって推定される複数の推定データを取得する推定データ取得プログラムや、前記複数の時点ごとに求められる前記実績データと前記推定データとの各差に基づく前記モデルの評価関数を、前記モデルのパラメータがとり得る範囲を規定する拘束条件の下に、最小化することによって、前記パラメータを求めるパラメータ処理プログラム等が含まれる。前記各種の所定のデータには、これら各プログラムを実行する上で必要なデータが含まれる。このような記憶部５は、例えば不揮発性の記憶素子であるＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）や書き換え可能な不揮発性の記憶素子であるＥＥＰＲＯＭ（ＥｌｅｃｔｒｉｃａｌｌｙＥｒａｓａｂｌｅＰｒｏｇｒａｍｍａｂｌｅＲｅａｄＯｎｌｙＭｅｍｏｒｙ）等を備える。そして、記憶部５は、前記所定のプログラムの実行中に生じるデータ等を記憶するいわゆる制御処理部４のワーキングメモリとなるＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）等を含む。なお、記憶部５は、比較的大容量となる学習データを記憶するために、大容量を記憶可能なハードディスク装置を備えてもよい。 The storage unit 5 is a circuit connected to the control processing unit 4 and stores various predetermined programs and various predetermined data according to the control of the control processing unit 4. The various predetermined programs include, for example, a control processing program, and the control processing program includes a control program that controls each part 1 to 3 and 5 of the system identification device D according to the function of each part. , A performance data acquisition program that acquires a plurality of performance data obtained from the system S at each of a plurality of time points arranged in a time series when a predetermined input value is input to the system S, or the system that inputs the predetermined input value. An estimation data acquisition program that acquires a plurality of estimated data estimated by the model at each of the plurality of time points when input to the model of S, and the actual data and the estimated data obtained at each of the plurality of time points. A parameter processing program or the like that obtains the parameters by minimizing the evaluation function of the model based on each difference under the constraint condition that defines the range that the parameters of the model can take is included. The various predetermined data include data necessary for executing each of these programs. Such a storage unit 5 includes, for example, a ROM (Read Only Memory) which is a non-volatile storage element, an EEPROM (Electrically Erasable Programmable Read Only Memory) which is a rewritable non-volatile storage element, and the like. The storage unit 5 includes a RAM (Random Access Memory) or the like that serves as a working memory of the so-called control processing unit 4 that stores data or the like generated during the execution of the predetermined program. The storage unit 5 may be provided with a hard disk device capable of storing a large capacity in order to store learning data having a relatively large capacity.

制御処理部４は、システム同定装置Ｄの各部１～３、５を当該各部の機能に応じてそれぞれ制御し、所定の動作を行う所定のシステムＳのモデルをシステム同定するための回路である。制御処理部４は、例えば、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）およびその周辺回路を備えて構成される。制御処理部４には、前記制御処理プログラムが実行されることによって、制御部４１、実績データ取得部４２、推定データ取得部４３およびパラメータ処理部４４が機能的に構成される。 The control processing unit 4 is a circuit for system-identifying a model of a predetermined system S that controls each unit 1 to 3 and 5 of the system identification device D according to the function of each unit and performs a predetermined operation. The control processing unit 4 includes, for example, a CPU (Central Processing Unit) and its peripheral circuits. The control processing unit 4 is functionally configured with the control unit 41, the actual data acquisition unit 42, the estimation data acquisition unit 43, and the parameter processing unit 44 by executing the control processing program.

制御部４１は、システム同定装置Ｄの各部１～３、５を当該各部の機能に応じてそれぞれ制御し、システム同定装置Ｄ全体の制御を司るものである。 The control unit 41 controls each of the units 1 to 3 and 5 of the system identification device D according to the functions of the respective units, and controls the entire system identification device D.

実績データ取得部４２は、所定の入力値をシステムＳに入力した場合に、時系列に並ぶ複数の時点ごとに前記システムＳから得られる複数の実績データを取得するものである。実績データ取得部４２は、例えば、図１に示すように、システムＳに入力値を入力し、システムＳから、所定のサンプリング間隔で、ＩＦ３を介して実績データを取り込むことで、時系列に並ぶ複数の時点ごとに前記システムＳから得られる複数の実績データを取得する。この場合では、オンラインでパラメータが同定される。あるいは、例えば、所定の入力値をシステムＳに入力した場合に、時系列に並ぶ複数の時点ごとに前記システムＳから得られる複数の実績データが入力部１を介して、破線で示すように、記憶部５に機能的に備えられる実績データ記憶部５１に予め記憶され、実績データ取得部４２は、実績データ記憶部５１から、所定の入力値をシステムＳに入力した場合に、時系列に並ぶ複数の時点ごとに前記システムＳから得られる複数の実績データを取得する。この場合では、オフラインでパラメータが同定される。 When a predetermined input value is input to the system S, the actual data acquisition unit 42 acquires a plurality of actual data obtained from the system S at each of a plurality of time points arranged in a time series. For example, as shown in FIG. 1, the actual data acquisition unit 42 inputs an input value to the system S and takes in the actual data from the system S via IF3 at a predetermined sampling interval, thereby arranging in a time series. A plurality of actual data obtained from the system S are acquired at each of a plurality of time points. In this case, the parameters are identified online. Alternatively, for example, when a predetermined input value is input to the system S, a plurality of actual data obtained from the system S at each of a plurality of time points arranged in a time series are indicated by a broken line via the input unit 1. It is stored in advance in the actual data storage unit 51 functionally provided in the storage unit 5, and the actual data acquisition unit 42 is arranged in chronological order when a predetermined input value is input to the system S from the actual data storage unit 51. A plurality of actual data obtained from the system S are acquired at each of a plurality of time points. In this case, the parameters are identified offline.

なお、実績データ記憶部５１に予め記憶される前記複数の実績データは、例えば、前記複数の実績データを管理するサーバ装置からネットワークおよびＩＦ部３を介してダウンロードされてもよい。あるいは、例えば、実績データ記憶部５１に予め記憶される前記複数の実績データは、前記複数の実績データを記憶する、ＵＳＢメモリやＳＤカード（登録商標）等のメモリーカードからＩＦ部３を介してダウンロードされてもよい。あるいは、例えば、ＣＤドライブ装置やＤＶＤドライブ装置がシステム同定装置Ｄにさらに備えられ、実績データ記憶部５１に予め記憶される前記複数の実績データは、前記複数の実績データを記憶する、ＣＤ－Ｒ（ＣｏｍｐａｃｔＤｉｓｃＲｅｃｏｒｄａｂｌｅ）やＤＶＤ－Ｒ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｃＲｅｃｏｒｄａｂｌｅ）等の、コンピュータに読み取り可能な非一時的な記録媒体（Ｎｏｎ－ｔｒａｎｓｉｔｏｒｙｃｏｍｐｕｔｅｒｒｅａｄａｂｌｅｍｅｄｉｕｍ）から、ダウンロードされてもよい。 The plurality of actual data stored in advance in the actual data storage unit 51 may be downloaded from, for example, a server device that manages the plurality of actual data via the network and the IF unit 3. Alternatively, for example, the plurality of actual data stored in advance in the actual data storage unit 51 is stored in the plurality of actual data from a memory card such as a USB memory or an SD card (registered trademark) via the IF unit 3. It may be downloaded. Alternatively, for example, a CD drive device or a DVD drive device is further provided in the system identification device D, and the plurality of actual data stored in advance in the actual data storage unit 51 is a CD-R that stores the plurality of actual data. It may be downloaded from a non-transitory recording medium (Non-transistor computer readable medium) that can be read by a computer, such as (Compact Disk Record) or DVD-R (Digital Versaille Disk Record).

推定データ取得部４３は、前記所定の入力値を前記モデルに入力した場合に、前記複数の時点ごとにシステムＳのモデルによって推定される複数の推定データを取得するものである。 The estimation data acquisition unit 43 acquires a plurality of estimation data estimated by the model of the system S at each of the plurality of time points when the predetermined input value is input to the model.

パラメータ処理部４４は、前記複数の時点ごとに求められる前記実績データと前記推定データとの各差（各予測誤差）に基づく前記モデルの評価関数を、前記モデルのパラメータがとり得る範囲を規定する拘束条件の下に、最小化することによって、前記パラメータを求めるものである。 The parameter processing unit 44 defines a range in which the parameters of the model can take an evaluation function of the model based on each difference (each prediction error) between the actual data and the estimated data obtained at each of the plurality of time points. The above parameters are obtained by minimizing under the constraint condition.

このような推定データ取得部４３およびパラメータ処理部４４について、以下、より具体的に説明する。 Such an estimation data acquisition unit 43 and a parameter processing unit 44 will be described in more detail below.

推定データ取得部４３およびパラメータ処理部４４の処理は、次式（１）ないし式（５）の拘束条件付き最適化問題として表される。

ｓｕｂｊｅｃｔｔｏ

ここで、θ∈Ｒ^ｌは、推定パラメータを含むベクトルである。ｘ_ｋ∈Ｒ^ｎは、状態方程式（４）によって予測された離散時刻ｋにおけるシステムの状態、すなわち、複数の時点ｋごとに求められる推定データである（ｋ＝０～Ｎ）。上付きバーｘ_ｋ∈Ｒ^ｎは、離散時刻ｋにおいて実際に計測されたシステムの状態、すなわち、複数の時点ｋごとに求められる実績データであり、次式（６）によって表される。なお、変数を説明する文章において、記載の都合上、文字Ａの上部に“－”を付けたＡを「上付きバーＡ」と記載する。上付きバーｕ_ｋ∈Ｒ^ｎは、離散時刻ｋにおける制御入力、すなわち、所定の入力値であり、次式（７）によって表される。Ｊは、評価関数である。φ：Ｒ^ｎ→Ｒは、終端コストである。Ｌ：Ｒ^ｎ×Ｒ→Ｒは、ランニングコストである。ｆ：Ｒ^ｎ×Ｒ^ｍ→Ｒ^ｎは、離散化されたシステムの状態方程式である。θ_ｍｉｎ∈Ｒ^ｌおよびθ_ｍａｘ∈Ｒ^ｌは、パラメータθの上下限、すなわち、拘束条件である。式（５）は、ベクトル要素ごとの不等式である。

The processing of the estimation data acquisition unit 43 and the parameter processing unit 44 is expressed as a constrained optimization problem of the following equations (1) to (5).

subject to

Where θ ∈ R ^l is a vector containing the estimation parameters. x _k ∈ R ⁿ is the state of the system at the discrete time k predicted by the equation of state (4), that is, the estimated data obtained at each of a plurality of time points k (k = 0 to N). The superscript bar x _k ∈ R ⁿ is the state of the system actually measured at the discrete time k, that is, the actual data obtained at each of a plurality of time points k, and is expressed by the following equation (6). In the text explaining the variable, for convenience of description, A with "-" at the top of the letter A is described as "superscript bar A". The superscript bar u _k ∈ R ⁿ is a control input at the discrete time k, that is, a predetermined input value, and is expressed by the following equation (7). J is an evaluation function. φ: R ⁿ → R is the termination cost. L: R ⁿ × R → R is a running cost. f: R ⁿ × R ^m → R ⁿ is the equation of state of the discretized system. θ _min ∈ R ^l and θ _max ∈ R ^l are the upper and lower limits of the parameter θ, that is, the constraints. Equation (5) is an inequality for each vector element.

これら式（１）ないし式（５）の拘束条件付き最適化問題は、次の手法によって解かれる。 The constrained conditional optimization problems of Eqs. (1) to (5) are solved by the following method.

従来の最小二乗法とは異なり、複数ステップにわたる二乗誤差を考慮するために、評価関数内の終端コストとランニングコストは、次式（８）および（９）のように設定される。

ここで、Ｑ_φ∈Ｒ^ｎ×ｎとＱ_L∈Ｒ^ｎ×ｎは、定数重み行列である。 Unlike the conventional least squares method, the termination cost and the running cost in the evaluation function are set as in the following equations (8) and (9) in order to consider the square error over a plurality of steps.

Here, Q _φ ∈ R ^{n × n} and Q _L ∈ R ^{n × n} are constant weight matrices.

拘束条件の式（３）ないし式（５）を考慮して式（１）および式（２）を解くために、次式（１０）のラグランジアンが導入される。

ここで、λ_ｋ＋１∈Ｒ^ｎは、等式拘束条件（４）に対するラグランジュ乗数である。Ｈ：Ｒ^ｎ×Ｒ^ｎ×Ｒ^ｌ×Ｒ→Ｒは、次式（１１）で定義されるハミルトニアン関数である。

In order to solve the equations (1) and (2) in consideration of the constraints (3) to (5), the Lagrangian of the following equation (10) is introduced.

Here, λ _{k + 1} ∈ R ⁿ is a Lagrange multiplier for the equation constraint condition (4). H: R ⁿ × R ⁿ × R ^l × R → R is a Hamiltonian function defined by the following equation (11).

変分計算により、ラグランジアンの変分は、次式（１２）のように得られる。

ここで、δｘ_ｋ∈Ｒ^ｎは、ｘ_ｋの変分である。δ（上付きバーＪ）＝０であるから、最適性の必要条件は、次式（１３）ないし（１５）のように得られる。

By variational calculation, the variation of Lagrangian is obtained by the following equation (12).

Here, δ x _k ∈ R ⁿ is a variation of x _k . Since δ (superscript bar J) = 0, the requirement for optimality can be obtained as in the following equations (13) to (15).

これらの必要条件に基づき、最適なパラメータθは、評価関数Ｊの勾配情報を用いた繰り返し計算により数値的に求められる。例えば、最適なパラメータθは、次式（１６）のような更新則を繰り返し適用することで求められる。

ここで、「←」は、代入演算子である。η∈Ｒは、更新係数である。この更新計数ηは、評価関数の勾配（∂Ｊ／∂θ）が１回の繰り返し計算でパラメータθに与える影響度を設定する値であり、また、収束の速度を設定する値であり、システムＳ等に応じて予め適宜に設定される。評価関数の勾配（∂Ｊ／∂θ）は、次のように導出される。 Based on these requirements, the optimum parameter θ is numerically obtained by iterative calculation using the gradient information of the evaluation function J. For example, the optimum parameter θ can be obtained by repeatedly applying an update rule such as the following equation (16).

Here, "←" is an assignment operator. η ∈ R is the update factor. This update count η is a value that sets the degree of influence that the gradient of the evaluation function (∂J / ∂θ) has on the parameter θ in one iterative calculation, and is a value that sets the speed of convergence, and is a system. It is appropriately set in advance according to S and the like. The gradient of the evaluation function (∂J / ∂θ) is derived as follows.

まず、各時刻における状態の予測値ｘ_０、ｘ_１、・・・、ｘ_Nが式（３）、式（４）および現在の推定値θを用いて導出される。次に、ｘ_Nを式（１３）に代入することでλ_N が導出される。すると、各時刻におけるラグランジュ乗数λ_１、λ_２、・・・、λ_N が式（１４）を逆方向に解くことで導出される。これによって導出されたｘ_０、ｘ_１、・・・、ｘおよびλ_１、λ_２、・・・、λ_Nは、明らかに式（４）、式（１３）および式（１４）を満たすため、これらを用いた計算されたラグランジアンおよびその変分は、次式（１７）および（１８）を満たす。

First, the predicted values x ₀ , x ₁ , ..., X _N of the state at each time are derived using the equations (3), (4) and the current estimated value θ. Next, λ _N is derived by substituting x _N into equation (13). Then, the Lagrange multipliers λ ₁ , λ ₂ , ..., λ _N at each time are derived by solving Eq. (14) in the opposite direction. Since x ₀ , x ₁ , ..., X and λ ₁ , λ ₂ , ..., λ _N derived by this clearly satisfy the equations (4), (13) and (14). , The Lagrangian calculated using these and its variation satisfy the following equations (17) and (18).

式（１８）は、dθの項のみをもつため、次式（１９）のように書き換えられる。

Since the equation (18) has only the term of dθ, it can be rewritten as the following equation (19).

十分に小さいηと式（１６）および式（１９）を用いて更新されたパラメータθは、常に評価関数Ｊを減少させる。よって、評価関数Ｊの局所最適値に到達するまで、すなわち、（∂Ｊ／∂θ）が十分小さくなる（＝必要条件（１５）が満たされる）までパラメータθを更新すると、パラメータθは、局所最適な値に収束する。なお、パラメータθを更新する際に、各パラメータが式（５）を満たすように更新範囲を制限する。 A sufficiently small η and the parameter θ updated with Eq. (16) and Eq. (19) always decrease the merit function J. Therefore, when the parameter θ is updated until the local optimum value of the evaluation function J is reached, that is, until (∂J / ∂θ) becomes sufficiently small (= the necessary condition (15) is satisfied), the parameter θ becomes local. Converges to the optimum value. When updating the parameter θ, the update range is limited so that each parameter satisfies the equation (5).

このように式（１）ないし式（５）の拘束条件付き最適化問題は、解かれる。 In this way, the constrained conditional optimization problem of Eqs. (1) to (5) is solved.

このようなシステム同定装置Ｄにおける入力部１、出力部２、ＩＦ部３、制御処理部４および記憶部５は、例えば、デスクトップ型やノート型等のコンピュータによって構成可能である。 The input unit 1, the output unit 2, the IF unit 3, the control processing unit 4, and the storage unit 5 in the system identification device D can be configured by, for example, a computer such as a desktop type or a notebook type.

次に、本実施形態の動作について説明する。図２は、前記システム同定装置の動作を示すフローチャートである。 Next, the operation of this embodiment will be described. FIG. 2 is a flowchart showing the operation of the system identification device.

このような構成のシステム同定装置Ｄは、その電源が投入されると、必要な各部の初期化を実行し、その稼働を始める。制御処理部４には、その制御処理プログラムの実行によって、制御部４１、実績データ取得部４２、推定データ取得部４３およびパラメータ処理部４４が機能的に構成される。モデルを求めるシステムＳにおける式（１）ないし式（５）は、システム同定装置Ｄに予め入力され、記憶されているものとする。 When the power of the system identification device D having such a configuration is turned on, the necessary initialization of each part is executed and the system identification device D starts its operation. The control processing unit 4 is functionally configured with the control unit 41, the actual data acquisition unit 42, the estimation data acquisition unit 43, and the parameter processing unit 44 by executing the control processing program. It is assumed that the equations (1) to (5) in the system S for obtaining the model are input and stored in advance in the system identification device D.

システム同定が開始されると、まず、システム同定装置Ｄは、制御処理部４のパラメータ処理部４４によって、パラメータθを適用な値に初期化する（Ｓ１）。パラメータθの初期値は、繰り返し計算の初期値であるので、任意の値であってよい。 When the system identification is started, first, the system identification device D initializes the parameter θ to an applicable value by the parameter processing unit 44 of the control processing unit 4 (S1). Since the initial value of the parameter θ is the initial value of the iterative calculation, it may be any value.

次に、システム同定装置Ｄは、制御処理部４の実績データ取得部４２によって、モデルを求めるシステムＳにおける、式（６）および式（７）の実績データを取得する（Ｓ２）。 Next, the system identification device D acquires the actual data of the equations (6) and (7) in the system S for which the model is obtained by the actual data acquisition unit 42 of the control processing unit 4 (S2).

次に、システム同定装置Ｄは、制御処理部４の推定データ取得部４３によって、各時刻におけるシステムＳの状態の予測値（各時刻におけるシステムＳの推定データ）ｘ_０、ｘ_１、・・・、ｘ_Nを式（３）、式（４）および現在のパラメータθ（前回の繰り返し計算で推定されたパラメータθ）を用いて求める（Ｓ３）。 Next, the system identification device D uses the estimation data acquisition unit 43 of the control processing unit 4 to predict the state of the system S at each time (estimated data of the system S at each time) x ₀ , x ₁ , ... , X _N are obtained using the equations (3), (4) and the current parameter θ (parameter θ estimated in the previous iterative calculation) (S3).

次に、システム同定装置Ｄは、パラメータ処理部４４によって、ｘ_Nを式（１３）に代入することでλ_N を求める（Ｓ４）。 Next, the system identification device D obtains λ _N by substituting x _N into the equation (13) by the parameter processing unit 44 (S4).

次に、システム同定装置Ｄは、パラメータ処理部４４によって、式（１６）および式（１９）を用いてパラメータθを求めて更新する（Ｓ５）。 Next, the system identification device D obtains and updates the parameter θ by the parameter processing unit 44 using the equations (16) and (19) (S5).

次に、システム同定装置Ｄは、パラメータ処理部４４によって、（∂Ｊ／∂θ）が十分小さいか否かを判定する（Ｓ６）。より具体的には、パラメータ処理部４４は、（∂Ｊ／∂θ）が、予め設定した、小さな所定の閾値以下であるか否かを判定する。この判定の結果、（∂Ｊ／∂θ）が、前記閾値以下で、十分小さい場合には、システム同定装置Ｄは、次に、処理Ｓ７を実行し、一方、前記判定の結果、（∂Ｊ／∂θ）が、前記閾値より大きく、十分小さくない場合には、システム同定装置Ｄは、処理を処理Ｓ２に戻す。これによって繰り返し計算が実行される。 Next, the system identification device D determines whether or not (∂J / ∂θ) is sufficiently small by the parameter processing unit 44 (S6). More specifically, the parameter processing unit 44 determines whether or not (∂J / ∂θ) is equal to or less than a preset small predetermined threshold value. As a result of this determination, if (∂J / ∂θ) is equal to or less than the threshold value and sufficiently small, the system identification device D then executes the process S7, while the result of the determination, (∂J). If / ∂θ) is larger than the threshold value and not sufficiently smaller, the system identification device D returns the process to the process S2. This causes the iterative calculation to be performed.

この処理Ｓ７では、システム同定装置Ｄは、制御処理部４によって、この求めたパラメータθを出力部２に出力し、本処理を終了する。なお、必要に応じて、システム同定装置Ｄは、制御処理部４によって、この求めたパラメータθをＩＦ部３を介して外部機器へ出力してもよい。 In this process S7, the system identification device D outputs the obtained parameter θ to the output unit 2 by the control process unit 4, and ends this process. If necessary, the system identification device D may output the obtained parameter θ to the external device via the IF unit 3 by the control processing unit 4.

以上説明したように、実施形態におけるシステム同定装置Ｄおよびこれに実装されたシステム同定方法は、複数の時点ｋごとに求められる実績データ；上付きバーｘ_ｋと推定データ；ｘ_ｋとの各差に基づくシステムＳのモデルの評価関数Ｊを用いるので、非線形に作用するパラメータθを、複数ステップにわたり蓄積していく誤差を直接考慮しながらシステム同定できる。１ステップ分の誤差のみを考慮してシステム同定する従来法に比べ、上記システム同定装置Ｄおよびシステム同定方法は、推定精度に優れている。 As described above, the system identification device D and the system identification method implemented therein are the actual data obtained at a plurality of time points k; the difference between the superposition bar x _k and the estimated data; x _k . Since the evaluation function J of the model of the system S based on the above is used, the parameter θ acting in a non-linear manner can be system-identified while directly considering the error accumulated over a plurality of steps. The system identification device D and the system identification method are superior in estimation accuracy as compared with the conventional method of system identification considering only an error of one step.

上記システム同定装置Ｄおよびシステム同定方法は、この評価関数Ｊを、パラメータθがとり得る範囲を規定する拘束条件θ_ｍｉｎ～θ_ｍａｘの下に、最小化することによって、前記パラメータθを求めるので、パラメータθを、所定の条件を満たすように同定できる。 Since the system identification device D and the system identification method obtain the parameter θ by minimizing the evaluation function J under the constraint conditions θ _min to θ _max that define the range that the parameter θ can take. The parameter θ can be identified so as to satisfy a predetermined condition.

上記システム同定装置Ｄおよびシステム同定方法は、評価関数Ｊの勾配情報（∂Ｊ／∂θ）を用いた繰り返し計算するので（θ←θ－η（∂Ｊ／∂θ））、パラメータθを最適に収束できる。 Since the system identification device D and the system identification method are repeatedly calculated using the gradient information (∂J / ∂θ) of the evaluation function J (θ ← θ−η (∂J / ∂θ)), the parameter θ is optimal. Can converge to.

次に、油圧システムおよび機械システムの各モデルをシステム同定装置Ｄで求める実施例１および実施例２について説明する。 Next, Example 1 and Example 2 in which each model of the hydraulic system and the mechanical system is obtained by the system identification device D will be described.

（実施例１）
図３は、実施例１の油圧システムの構成を示す概略図である。図４は、実績値と実施例１の推定値と比較例の推定値との各比較結果を説明するための図である。図４Ａは、油圧ポンプ６１における吐出圧Ｐ_ｐの時間変化を示し、図４Ｂは、油圧アクチュエータ６３にける第１ポート６３３の入出力圧Ｐ_ｈの時間変化を示し、図４Ｃは、油圧アクチュエータ６３における第２ポート６３４の入出力圧Ｐ_ｒの時間変化を示し、これら各横軸は、経過時間であり、これら各縦軸は、圧力である。図４Ｄは、油圧アクチュエータ６３におけるピストン位置ｘ_ｃの時間変化を示し、その横軸は、経過時間であり、その縦軸は、位置である。図４Ｅは、油圧アクチュエータ６３におけるピストン速度の時間変化を示し、その横軸は、経過時間であり、その縦軸は、速度である。 (Example 1)
FIG. 3 is a schematic view showing the configuration of the hydraulic system of the first embodiment. FIG. 4 is a diagram for explaining each comparison result between the actual value, the estimated value of Example 1, and the estimated value of the comparative example. FIG. 4A shows the time change of the discharge pressure P _p in the hydraulic pump 61, FIG. _4B shows the time change of the input / output pressure Ph of the first port 633 in the hydraulic actuator 63, and FIG. 4C shows the time change of the hydraulic actuator 63. The time change of the input / output pressure _Pr of the second port 634 in the above is shown, and each of these horizontal axes is the elapsed time, and each of these vertical axes is the pressure. FIG. 4D shows the time change of the piston position x _c in the hydraulic actuator 63, the horizontal axis thereof is the elapsed time, and the vertical axis thereof is the position. FIG. 4E shows the time change of the piston speed in the hydraulic actuator 63, the horizontal axis thereof is the elapsed time, and the vertical axis thereof is the speed.

実施例１は、モデルを求める、所定の動作を行う所定のシステムＳが油圧システムであるケースである。この実施例１における油圧システム６は、例えば、図３に示すように、油圧を生成する油圧ポンプ６１と、前記油圧ポンプ６１で生成された油圧を機械動力に変換する油圧アクチュエータ６３と、前記油圧を制御する油圧制御弁６２とを備える。この油圧システム６では、前記所定の動作は、前記油圧アクチュエータ６３で所定の対象物Ｏｂを動かす動作である。油圧アクチュエータ６３は、シリンダ６３１とピストン６３２とを備え、シリンダ６３１は、その一方端に、油を入出力する第１ポート６３３と、その他方端に、油を入出力する第２ポート６３４とを備える。油圧ポンプ６１は、油圧制御弁６２に連結され、油圧制御弁６２は、第１および第２ポート６３３、６３４それぞれに連結される。このような油圧システム６では、油圧ポンプ６１で生成された油圧を、油圧制御弁６２が第１ポート６３３から油を入力することで油圧アクチュエータ６３に作用させ、ピストン６３２が前記一方端から前記他方端へ（紙面左側から右側へ）移動し、第２ポート６３４から油が出力されて油圧制御弁６２に戻る。このピストン６３２の移動によって対象物Ｏｂが紙面左側から右側へ移動する。一方、油圧ポンプ６１で生成された油圧を、油圧制御弁６２が第２ポート６３４から油を入力することで油圧アクチュエータ６３に作用させ、ピストン６３２が前記他方端から前記一方端へ（紙面右側から左側へ）移動し、第１ポート６３３から油が出力されて油圧制御弁６２に戻る。このピストン６３２の移動によって対象物Ｏｂが紙面右側から左側へ移動する。 The first embodiment is a case where a predetermined system S for obtaining a model and performing a predetermined operation is a hydraulic system. The hydraulic system 6 in the first embodiment has, for example, as shown in FIG. 3, a hydraulic pump 61 that generates hydraulic pressure, a hydraulic actuator 63 that converts the hydraulic pressure generated by the hydraulic pump 61 into mechanical power, and the hydraulic pressure. It is provided with a hydraulic control valve 62 for controlling the above. In the hydraulic system 6, the predetermined operation is an operation of moving a predetermined object Ob with the hydraulic actuator 63. The hydraulic actuator 63 includes a cylinder 631 and a piston 632, and the cylinder 631 has a first port 633 for inputting / outputting oil at one end thereof and a second port 634 for inputting / outputting oil at the other end. Be prepared. The hydraulic pump 61 is connected to the hydraulic control valve 62, and the hydraulic control valve 62 is connected to the first and second ports 633 and 634, respectively. In such a hydraulic system 6, the hydraulic pressure generated by the hydraulic pump 61 is applied to the hydraulic actuator 63 by the hydraulic control valve 62 inputting oil from the first port 633, and the piston 632 acts from the one end to the other. It moves to the end (from the left side to the right side of the paper), oil is output from the second port 634, and the oil is returned to the hydraulic control valve 62. The movement of the piston 632 causes the object Ob to move from the left side to the right side of the paper. On the other hand, the hydraulic pressure generated by the hydraulic pump 61 is applied to the hydraulic actuator 63 by the hydraulic control valve 62 inputting oil from the second port 634, and the piston 632 moves from the other end to the one end (from the right side of the paper). (To the left), oil is output from the first port 633 and returns to the hydraulic control valve 62. The movement of the piston 632 causes the object Ob to move from the right side to the left side of the paper.

このような油圧システム６の状態および状態方程式は、次式（２０）および式（２１）で表される。

ここで、Ｐ_ｐは、油圧ポンプ６１における吐出圧である。Ｐ_ｈは、油圧アクチュエータ６３にける第１ポート６３３の入出力圧である。Ｐ_ｒは、油圧アクチュエータ６３における第２ポート６３４の入出力圧である。Ｐ_ｔは、タンクにおける圧力である。ｘ_ｃは、ピストン位置である。ｋは、体積弾性係数である。Ｖ_ｐ、Ｖ_ｈ、Ｖ_ｒは、それぞれ、Ｐ_ｐ、Ｐ_ｈ、Ｐ_ｒが生じる場所の容積である。Ａ_ｈ、Ａ_ｒは、それぞれ、ピストンの断面積である。ｌ_ｃは、シリンダ長さである。ｍは、対象物Ｏｂによる負荷質量である。Ｆは、ピストン速度に依存する摩擦力である。ａ_Ｆ、ｂ_Ｆは、定数である。Ｑは、それぞれの場所における油の流量である。Ｃ_ｉｎ、Ｃ_ｏｕｔは、流量係数である。ｕは、油圧制御弁６２の制御弁位置（本油圧システム６の制御入力）である。Ｖは、油圧制御弁６２における制御弁の開口特性を表す関数である。Ｇは、圧力特性を表す関数である。 The state and the equation of state of such a hydraulic system 6 are expressed by the following equations (20) and (21).

Here, P _p is the discharge pressure in the hydraulic pump 61. _Ph is the input / output pressure of the first port 633 in the hydraulic actuator 63. _Pr is the input / output pressure of the second port 634 in the hydraulic actuator 63. _Pt is the pressure in the tank. x _c is the piston position. k is a bulk modulus. V _p , V _h , and V _r are the volumes of the places where P _p , _Ph , and _Pr occur, respectively. A _h and _Ar are the cross-sectional areas of the piston, respectively. l _c is the cylinder length. m is the load mass due to the object Ob. F is a frictional force that depends on the piston speed. a _F and b _F are constants. Q is the flow rate of oil at each location. C _in and C _out are flow coefficients. u is the control valve position of the hydraulic control valve 62 (control input of the hydraulic system 6). V is a function representing the opening characteristic of the control valve in the hydraulic control valve 62. G is a function representing the pressure characteristic.

なお、実施例１では、上記状態方程式を式（４）として適用するために、ルンゲ・クッタ法を用いて、微分方程式形式の状態方程式は、離散方程式に変換される。 In the first embodiment, in order to apply the equation of state as the equation (4), the equation of state in the form of a differential equation is converted into a discrete equation by using the Runge-Kutta method.

実施例１では、上述の実施形態におけるシステム同定装置Ｄによって、次式（２２）のパラメータθがシステム同定された。ここで、パラメータθのＶ_ｐ、Ｖ_ｈ、Ｖ_ｒ、ｂ_Ｆは、非線形に作用する。これらパラメータθは、その物理的な意味から正であるので、拘束条件として、θ_ｍｉｎは、０ベクトルに設定され、θ_ｍａｘは、各要素が無限大であるベクトルに設定された。

In Example 1, the parameter θ of the following equation (22) was system-identified by the system identification device D in the above-described embodiment. Here, the parameters V _p , V _h , V _r , and b _F of the parameters θ act non-linearly. Since these parameters θ are positive in their physical sense, θ _min is set to 0 vector and θ _max is set to the vector in which each element is infinite as a constraint condition.

上述の実施形態におけるシステム同定装置Ｄで式（２２）のパラメータθをシステム同定することによって得られた油圧システム６のモデルにおけるシミュレーション結果が実線で図４に示されている。図４には、実績データが点線で示され、比較例のシミュレーション結果が一点鎖線で示されている。前記比較例は、油圧システム６を最小二乗法で近似したモデルである。 The simulation result in the model of the hydraulic system 6 obtained by system-identifying the parameter θ of the equation (22) by the system identification device D in the above-described embodiment is shown by a solid line in FIG. In FIG. 4, the actual data is shown by the dotted line, and the simulation result of the comparative example is shown by the alternate long and short dash line. The comparative example is a model in which the hydraulic system 6 is approximated by the method of least squares.

実施形態におけるシステム同定装置Ｄは、比較例と比べ、非線形に作用するパラメータＶ_ｐ、Ｖ_ｈ、Ｖ_ｒ、ｂ_Ｆを推定可能で、複数ステップ分の予測誤差を含めた推定が可能であるため、図４から、比較例より本実施例１の方が実績データを近似しており、システム同定したパラメータθを用いたシミュレーションの精度に優れることが見て取れる。 Compared to the comparative example, the system identification device D in the embodiment can estimate the parameters V _p , V _h , V _r , and b _F that act non-linearly, and can estimate including the prediction error for a plurality of steps. From FIG. 4, it can be seen that the actual data is closer to the actual data in the first embodiment than in the comparative example, and the accuracy of the simulation using the system-identified parameter θ is excellent.

（実施例２）
図５は、実施例２の機械システムの構成を示す概略図である。実施例２は、モデルを求める、所定の動作を行う所定のシステムＳが機械システムであるケースである。この実施例２における機械システム７は、例えば、図５に示すように、バネ７１を備えるダンパーである。この機械システム７では、前記所定の動作は、前記バネ７１で所定の対象物Ｏｂを支持する動作である。 (Example 2)
FIG. 5 is a schematic view showing the configuration of the mechanical system of the second embodiment. The second embodiment is a case where a predetermined system S for obtaining a model and performing a predetermined operation is a mechanical system. The mechanical system 7 in the second embodiment is, for example, a damper provided with a spring 71, as shown in FIG. In this mechanical system 7, the predetermined operation is an operation of supporting a predetermined object Ob with the spring 71.

このような機械システム７の状態および状態方程式は、次式（２３）および式（２４）で表される。

ここで、ｋは、ばね定数である。ｃは、粘性係数である。ｍは、対象物Ｏｂによる負荷質量である。ｆは、力である。ｐは、質点の位置である。 The state and the equation of state of such a mechanical system 7 are expressed by the following equations (23) and (24).

Here, k is a spring constant. c is a viscosity coefficient. m is the load mass due to the object Ob. f is a force. p is the position of the mass point.

上記状態方程式を式（４）として適用するために、上記状態方程式は、例えばオイラー法を用いて次式（２５）のように離散方程式に変換される。

In order to apply the equation of state as equation (4), the equation of state is converted into a discrete equation as in the following equation (25) using, for example, the Euler method.

ここで、ｔ_ｋは、離散時間ｋに対応する時刻である。上付きドットｘ（ｔ_ｋ）は、時刻ｔ_ｋにおける、上付きドットｘである。Δtは、時刻の離散化周期である。 Here, tk is a time corresponding to the discrete time _k . The superscript dot x ( _tk ) is the superscript dot x at time _tk . Δt is the discretization period of time.

このような機械システム７も、上述の実施形態におけるシステム同定装置Ｄでシステム同定することによって、機械システム７のモデルが生成できる。 Such a mechanical system 7 can also generate a model of the mechanical system 7 by system-identifying with the system identification device D in the above-described embodiment.

なお、上述の実施形態ならびに実施例１および実施例２では、例えば、式（６）および式（７）に示す１組の、時系列に並ぶ複数の時点ごとに前記システムから得られる複数の実績データが用いられたが、複数組の前記複数の実績データが用いられてもよい。これにより、パラメータθを更新するたびに使用する実績データを入れ替えることで、全ての実績データとの誤差を等しく反映したパラメータθが得られる。 In addition, in the above-mentioned embodiment and Example 1 and Example 2, for example, a set of a set shown in the formula (6) and the formula (7), a plurality of achievements obtained from the system at each of a plurality of time points arranged in a time series. Although the data was used, a plurality of sets of the plurality of actual data may be used. As a result, by exchanging the actual data to be used every time the parameter θ is updated, the parameter θ that equally reflects the error with all the actual data can be obtained.

また、上述の実施形態ならびに実施例１および実施例２では、拘束条件θ_ｍｉｎ～θ_ｍａｘの下に、評価関数Ｊを最小化することによって、パラメータθが求められたが、拘束条件を考慮する必要が無いシステムＳの場合には、拘束条件無しに、評価関数Ｊを最小化することによって、パラメータθが求められてもよい。 Further, in the above-described embodiment and in the first and second embodiments, the parameter θ is obtained by minimizing the evaluation function J under the constraint conditions θ _min to θ _max , but the constraint condition is taken into consideration. In the case of the system S, which is not necessary, the parameter θ may be obtained by minimizing the evaluation function J without any constraint condition.

また、上述の実施形態ならびに実施例１および実施例２において、評価関数Ｊにおける終端コストおよびランニングコストは、式（８）および式（９）に限定されるものではなく、モデルによって予測されるシステムの挙動と実際のシステムの挙動との差（予測誤差）に応じて値が増加する関数であれば、よい。 Further, in the above-described embodiment and in the first and second embodiments, the termination cost and the running cost in the evaluation function J are not limited to the equations (8) and (9), but are predicted by the model. Any function may be used as long as it is a function whose value increases according to the difference (prediction error) between the behavior of the system and the behavior of the actual system.

また、上述の実施形態ならびに実施例１および実施例２において、パラメータθの更新則は、古典的な勾配法の更新則である式（１６）に限定されるものではなく、評価関数Ｊの勾配情報に基づく種々の勾配法が用いられてよい。例えば、共役勾配法や、慣性付き勾配法が用いられてもよい。 Further, in the above-described embodiment and in the first and second embodiments, the update rule of the parameter θ is not limited to the equation (16) which is the update rule of the classical gradient method, and the gradient of the evaluation function J is not limited to the equation (16). Various informed gradient methods may be used. For example, the conjugate gradient method or the gradient method with inertia may be used.

また、上述では、実施例１および実施例２によって、システム同定装置Ｄが油圧システム６および機械システム７に適用される例が示されたが、これらに限定されるものではなく、所定の動作を行う所定のシステムに広く適用可能である。 Further, in the above, Examples 1 and 2 show an example in which the system identification device D is applied to the hydraulic system 6 and the mechanical system 7, but the present invention is not limited thereto, and a predetermined operation is performed. It is widely applicable to the given system to be performed.

本発明を表現するために、上述において図面を参照しながら実施形態を通して本発明を適切且つ十分に説明したが、当業者であれば上述の実施形態を変更および／または改良することは容易に為し得ることであると認識すべきである。したがって、当業者が実施する変更形態または改良形態が、請求の範囲に記載された請求項の権利範囲を離脱するレベルのものでない限り、当該変更形態または当該改良形態は、当該請求項の権利範囲に包括されると解釈される。 In order to express the present invention, the present invention has been appropriately and sufficiently described through embodiments with reference to the drawings above, but those skilled in the art can easily modify and / or improve the above embodiments. It should be recognized that it is possible. Therefore, unless the modified or improved form implemented by a person skilled in the art is at a level that deviates from the scope of rights of the claims stated in the claims, the modified form or the improved form is the scope of rights of the claims. It is interpreted to be included in.

Ｄシステム同定装置
１入力部
２出力部
３インターフェース部（ＩＦ部）
４制御処理部
５記憶部
４１制御部
４２実績データ取得部
４３推定データ取得部
４４パラメータ処理部
５１実績データ記憶部 D System identification device 1 Input unit 2 Output unit 3 Interface unit (IF unit)
4 Control processing unit 5 Storage unit 41 Control unit 42 Actual data acquisition unit 43 Estimated data acquisition unit 44 Parameter processing unit 51 Actual data storage unit

Claims

所定の動作を行う所定のシステムのモデルをコンピュータによってシステム同定するシステム同定装置であって、
所定の入力値を前記システムに入力した場合に、時系列に並ぶ複数の時点ごとに前記システムから得られる複数の実績データを取得する実績データ取得部と、
前記所定の入力値を前記モデルに入力した場合に、前記複数の時点ごとに前記モデルによって推定される複数の推定データを取得する推定データ取得部と、
前記複数の時点ごとに求められる前記実績データと前記推定データとの各差に基づく前記モデルの評価関数を、最小化することによって、前記パラメータを求めるパラメータ処理部とを備える、
システム同定装置。 A system identification device that uses a computer to identify a model of a predetermined system that performs a predetermined operation.
A performance data acquisition unit that acquires a plurality of performance data obtained from the system at each of a plurality of time points arranged in a time series when a predetermined input value is input to the system.
An estimation data acquisition unit that acquires a plurality of estimation data estimated by the model at each of the plurality of time points when the predetermined input value is input to the model.
It is provided with a parameter processing unit for obtaining the parameters by minimizing the evaluation function of the model based on the difference between the actual data and the estimated data obtained at each of the plurality of time points.
System identification device.

前記パラメータ処理部は、前記評価関数を、前記モデルのパラメータがとり得る範囲を規定する拘束条件の下に、最小化することによって、前記パラメータを求める、
請求項１に記載のシステム同定装置。 The parameter processing unit obtains the parameter by minimizing the evaluation function under a constraint condition that defines a range that the parameter of the model can take.
The system identification apparatus according to claim 1.

前記パラメータ処理部は、前記評価関数の勾配情報を用いた繰り返し計算によって前記パラメータを求める、
請求項１または請求項２に記載のシステム同定装置。 The parameter processing unit obtains the parameter by iterative calculation using the gradient information of the evaluation function.
The system identification apparatus according to claim 1 or 2.

前記所定のシステムは、油圧を生成する油圧ポンプと、前記油圧ポンプで生成された油圧を機械動力に変換する油圧アクチュエータと、前記油圧を制御する油圧制御弁とを備える油圧システムであり、前記所定の動作は、前記油圧アクチュエータで所定の対象物を動かす動作である、
請求項１ないし請求項３のいずれか１項に記載のシステム同定装置。 The predetermined system is a hydraulic system including a hydraulic pressure pump that generates hydraulic pressure, a hydraulic actuator that converts the hydraulic pressure generated by the hydraulic pressure pump into mechanical power, and a hydraulic pressure control valve that controls the hydraulic pressure. Is an operation of moving a predetermined object with the hydraulic actuator.
The system identification apparatus according to any one of claims 1 to 3.

所定の動作を行う所定のシステムのモデルをコンピュータによってシステム同定するシステム同定方法であって、
所定の入力値を前記システムに入力した場合に、時系列に並ぶ複数の時点ごとに前記システムから得られる複数の実績データを取得する実績データ取得工程と、
前記所定の入力値を前記モデルに入力した場合に、前記複数の時点ごとに前記モデルによって推定される複数の推定データを取得する推定データ取得工程と、
前記複数の時点ごとに求められる前記実績データと前記推定データとの各差に基づく前記モデルの評価関数を、最小化することによって、前記パラメータを求めるパラメータ処理工程とを備える、
システム同定方法。 It is a system identification method that identifies a model of a predetermined system that performs a predetermined operation by a computer.
A performance data acquisition process for acquiring a plurality of performance data obtained from the system at each of a plurality of time points arranged in a time series when a predetermined input value is input to the system.
An estimation data acquisition step of acquiring a plurality of estimation data estimated by the model at each of the plurality of time points when the predetermined input value is input to the model.
A parameter processing step for obtaining the parameters by minimizing the evaluation function of the model based on the difference between the actual data and the estimated data obtained at each of the plurality of time points is provided.
System identification method.