JP7305113B2

JP7305113B2 - Motor control device, motor device and machine learning device

Info

Publication number: JP7305113B2
Application number: JP2019101737A
Authority: JP
Inventors: 潔大石; 勇希横倉; パラガホアンビセンテパドロン; 尊英佐々木
Original assignee: Nagaoka University of Technology
Current assignee: Nagaoka University of Technology
Priority date: 2019-05-30
Filing date: 2019-05-30
Publication date: 2023-07-10
Anticipated expiration: 2039-05-30
Also published as: JP2020198657A

Description

特許法第３０条第２項適用ｈｔｔｐ：／／ｗｗｗ．ｓｙｍｂｉｏｎ．ｃｏ．ｊｐ／ａｒｅｎａ／ｒｓｊ２０１８／ｉｎｄｅｘ．ｈｔｍｌ平成３０年９月５日第３６回日本ロボット学会学術講演会中部大学春日井キャンパス（愛知県春日井市松本町１２００）平成３０年９月６日（開催期間：平成３０年９月５日～平成３０年９月７日）メカトロニクス制御研究会予稿集発行日平成３０年９月２６日メカトロニクス制御研究会自動車会館２階小会議室（東京都千代田区九段南４－８－１３）平成３０年９月２６日ＳＡＭＣＯＮ２０１９（Ｔｈｅ５ｔｈＩＥＥＪＩｎｔｅｒｎａｔｉｏｎａｌＷｏｒｋｓｈｏｐｏｎＳｅｎｓｉｎｇ，Ａｃｔｕａｔｉｏｎ，ＭｏｔｉｏｎＣｏｎｔｒｏｌ，ａｎｄＯｐｔｉｍｉｚａｔｉｏｎ）予稿集ＵＳＢ発行日平成３１年３月４日ＳＡＭＣＯＮ２０１９（Ｔｈｅ５ｔｈＩＥＥＪＩｎｔｅｒｎａｔｉｏｎａｌＷｏｒｋｓｈｏｐｏｎＳｅｎｓｉｎｇ，Ａｃｔｕａｔｉｏｎ，ＭｏｔｉｏｎＣｏｎｔｒｏｌ，ａｎｄＯｐｔｉｍｉｚａｔｉｏｎ）千葉大学西千葉キャンパス（千葉県千葉市稲毛区弥生町１－３３）平成３１年３月６日（開催期間：平成３１年３月４日～平成３１年３月６日）Application of Article 30, Paragraph 2 of the Patent Act http://www. symbol. co. jp/arena/rsj2018/index. html September 5, 2018 The 36th Annual Meeting of the Robotics Society of Japan Chubu University Kasugai Campus (1200 Matsumoto-cho, Kasugai City, Aichi Prefecture) September 6, 2018 (Holding period: September 5, 2018 to Heisei September 7, 2018) Mechatronics Control Study Group Proceedings Publication date September 26, 2018 Mechatronics Control Study Group Automobile Hall 2nd floor small meeting room (4-8-13 Kudanminami, Chiyoda-ku, Tokyo) September 2018 26th March SAMCON2019 (The 5th IEEJ International Workshop on Sensing, Actuation, Motion Control, and Optimization) Proceedings USB Publication date March 4, 2019 SAMCON2019 (The 5th IEEJ International) Functional Workshop on Sensing, Actuation, Motion Control, and Optimization ) Chiba University Nishi-Chiba Campus (1-33 Yayoi-cho, Inage-ku, Chiba-shi, Chiba) March 6, 2019 (held from March 4, 2019 to March 6, 2019)

本発明は、モータ制御装置、モータ装置および機械学習装置に関する。 The present invention relates to a motor control device, a motor device and a machine learning device.

従来のモータ制御装置では、出力軸に摩擦力が作用して起きる外乱を推定して抑制するために外乱オブザーバが用いられてきたが、摩擦による外乱の補償は多くの場合不完全である。例えば、モータ出力軸が静止摩擦につかまって極低速状態となり、さらに力を加えてゆくと急激に回転が速くなることで、回転速度が滑らかでなくなる（スティック－スリップ現象）ことがある。言い換えると、出力軸に作用するクーロン摩擦力が大きくなると回転が極低速状態となり、その後、最大摩擦力を超える力が加わると負の勾配特性を有する粘性摩擦力により急激に回転が速くなる。 In conventional motor control devices, a disturbance observer has been used to estimate and suppress disturbance caused by frictional force acting on the output shaft, but compensation for disturbance due to friction is incomplete in many cases. For example, the motor output shaft is gripped by static friction and becomes extremely low speed, and when further force is applied, the rotation speed suddenly increases, resulting in a non-smooth rotation speed (stick-slip phenomenon). In other words, when the Coulomb frictional force acting on the output shaft increases, the rotation becomes extremely low speed, and when a force exceeding the maximum frictional force is applied, the rotation speeds up rapidly due to the viscous frictional force having a negative gradient characteristic.

スティック時にクーロン摩擦特性が変化することで生じる過渡状態では、摩擦特性の推定誤差や外乱の非線形特性により外乱オブザーバによる補償が不完全となり、負の勾配特性を有する粘性摩擦力によるスリップ時には、回転が急激に速くなることで外乱オブザーバの推定に遅れが生じる。その結果、モータにより関節駆動されるロボット・アームがガタガタ振動し、アーム先端の動きが滑らかでなくなる。 In the transient state caused by the change of the Coulomb friction characteristics when sticking, the compensation by the disturbance observer becomes incomplete due to the estimation error of the friction characteristics and the non-linear characteristics of the disturbance. A sudden increase in speed causes a delay in the estimation of the disturbance observer. As a result, the robot arm, which is articulated by the motor, vibrates jerkily, and the movement of the tip of the arm is not smooth.

この問題に対処するために、特許文献１に記載された制御装置では、まず、減速機の動摩擦トルク同定を行い、この動摩擦トルクを補償するフィードフォワード制御を行うことで、残留摩擦を補償するために必要なトルクセンサの要求バンド幅を低減する。その上で、静止摩擦トルク等の補償には、要求バンド幅の小さい安価なトルクセンサと外乱オブザーバを用いる。また、特許文献２記載のモータ制御装置では、摩擦損失補償を行う仕組みとして、同様の補償制御を行っており、特許文献３では、ＡＣサーボモータの制御系において、制御対象が静止状態から動き出したときにスティック・スリップ現象が発生することのない高精度な位置制御方法を開示している。 In order to deal with this problem, the control device described in Patent Document 1 first identifies the dynamic friction torque of the speed reducer and performs feedforward control to compensate for this dynamic friction torque, thereby compensating for the residual friction. reduce the required bandwidth of the torque sensor required for In addition, an inexpensive torque sensor with a small required bandwidth and a disturbance observer are used for compensation of static friction torque and the like. Further, in the motor control device described in Patent Document 2, similar compensation control is performed as a mechanism for compensating for friction loss. A high-precision position control method is disclosed that does not sometimes cause a stick-slip phenomenon.

特開２０１６－０３９７３７号公報JP 2016-039737 A 特開２０１２－１３０１６０号公報Japanese Unexamined Patent Application Publication No. 2012-130160 特開２００１－２３１２８０号公報Japanese Patent Application Laid-Open No. 2001-231280

しかしながら、上記特許文献１乃至３に記載された手法では、クーロン摩擦や粘性摩擦に起因する外乱を適切に補償できるか否かは、摩擦特性（摩擦の外乱モデル）や摩擦トルクを記述する物理量の推定精度および関連する制御係数の設定値の好適性に依存している。摩擦特性や摩擦トルクを記述する物理量を高精度に推定できない場合には、クーロン摩擦および粘性摩擦に起因する外乱の補償も不完全となる。 However, in the methods described in Patent Documents 1 to 3, whether or not disturbances caused by Coulomb friction and viscous friction can be appropriately compensated depends on physical quantities describing friction characteristics (disturbance model of friction) and friction torque. It depends on the accuracy of the estimation and the suitability of the associated control factor settings. If physical quantities that describe frictional characteristics and frictional torque cannot be estimated with high accuracy, compensation for disturbances caused by Coulomb friction and viscous friction will also be incomplete.

本発明に係る幾つかの実施形態では、上述した問題点に鑑み、摩擦特性の高精度なモデリングおよび関連する物理量の高精度な同定を必要とせずに、モータの出力軸に作用する摩擦力の推定とこれによる外乱の補償を適切に行うことができるモータ制御装置、モータ装置および機械学習装置を提供することを目的とする。 In view of the above-mentioned problems, some embodiments of the present invention provide methods for determining the frictional force acting on the output shaft of a motor without the need for highly accurate modeling of frictional characteristics and accurate identification of related physical quantities. An object of the present invention is to provide a motor control device, a motor device, and a machine learning device that can appropriately perform estimation and compensation for disturbance caused by the estimation.

本発明のモータ制御装置は、制御対象となるモータへの電流指令値、前記モータの回転速度および前記モータに減速機構を介して接続された出力軸のねじりトルクの測定値を制御入力として受け取り、前記モータの出力軸に作用する摩擦トルクによる外乱を抑制する外乱オブザーバ装置を備え、前記外乱オブザーバ装置は、前記制御入力に基づいて前記摩擦トルクを補償するトルク補償値を算出する補償値算出部を有し、前記補償値算出部の特性を表す感度関数が前記モータの前記回転速度に基づいて設定される。 The motor control device of the present invention receives, as control inputs, a current command value for a motor to be controlled, a rotational speed of the motor, and a measured torsional torque of an output shaft connected to the motor via a speed reduction mechanism, A disturbance observer device that suppresses disturbance due to frictional torque acting on the output shaft of the motor is provided, and the disturbance observer device includes a compensation value calculation unit that calculates a torque compensation value that compensates for the frictional torque based on the control input. A sensitivity function representing characteristics of the compensation value calculator is set based on the rotational speed of the motor.

本発明のモータ装置は、モータと、前記モータに減速機構を介して接続された出力軸と、前記出力軸に生じるねじりトルクを測定するトルクセンサと、前記モータの角度位置を検出するロータリ・エンコーダと、上記のモータ制御装置とを備える。 A motor device according to the present invention comprises a motor, an output shaft connected to the motor via a speed reduction mechanism, a torque sensor for measuring torsional torque generated in the output shaft, and a rotary encoder for detecting the angular position of the motor. and the motor control device described above.

本発明の機械学習装置は、上記のモータ制御装置に対する機械学習装置であって、前記モータの状態を、環境の現在状態を表す回転速度、トルク値、モータ電流およびモータ温度のうちの少なくとも一つを含んだ状態変数として観測する状態観測部と、前記モータの回転状態を示す速度波形データに基づいた判定データを取得する判定データ生成部と、前記状態変数と前記判定データとを用いて、前記モータの回転状態と、前記モータへの外乱補償特性を記述する特性係数、スイッチ切り替え閾値および高周波成分算出要素のフィルタ特性係数のうちの少なくとも一つを含む制御パラメータとを関連付けて学習する学習部と、前記学習部による学習結果に基づき、前記制御パラメータを設定する意思決定部とを有する。 A machine learning device according to the present invention is a machine learning device for the motor control device described above, wherein the state of the motor is at least one of a rotation speed, a torque value, a motor current, and a motor temperature, which represent the current state of the environment. a state observation unit that observes as a state variable including a state variable, a determination data generation unit that acquires determination data based on speed waveform data that indicates the rotation state of the motor, and the state variable and the determination data, the a learning unit that associates and learns a rotation state of the motor and a control parameter including at least one of a characteristic coefficient describing a disturbance compensation characteristic to the motor, a switch switching threshold value, and a filter characteristic coefficient of a high-frequency component calculation element; and a decision making section for setting the control parameters based on the learning result of the learning section.

以上より、本発明によれば、摩擦特性の高精度なモデリングおよび関連する物理量の高精度な同定を必要とせずに、モータの出力軸に作用する摩擦力の推定とこれによる外乱の補償を適切に行うことが可能なモータ制御装置、モータ制御方法および機械学習装置を実現することができる。 As described above, according to the present invention, it is possible to appropriately estimate the frictional force acting on the output shaft of the motor and compensate for the disturbance caused by this, without requiring high-precision modeling of the friction characteristics and high-precision identification of related physical quantities. It is possible to realize a motor control device, a motor control method, and a machine learning device capable of performing

本発明の第１実施形態のモータ制御装置と制御対象となるモータの構成図である。1 is a configuration diagram of a motor control device and a motor to be controlled according to a first embodiment of the present invention; FIG. 本発明の第１実施形態の減速装置の断面を示す概略図である。BRIEF DESCRIPTION OF THE DRAWINGS It is the schematic which shows the cross section of the reduction gear transmission of 1st Embodiment of this invention. 本発明の実施形態に係る外乱オブザーバ・プログラムを実行可能なプロセッサを含む制御装置の内部構成図である。1 is an internal configuration diagram of a control device including a processor capable of executing a disturbance observer program according to an embodiment of the present invention; FIG. モータと負荷の運動学モデルに本発明の第１実施形態に係る外乱オブザーバ装置が接続された状態のブロック線図である。1 is a block diagram of a state in which a disturbance observer device according to a first embodiment of the present invention is connected to a kinematic model of a motor and a load; FIG. スティック－スリップ状態におけるモータ回転数の非線形な変化を示すプロット図である。FIG. 5 is a plot showing non-linear variation of motor speed in stick-slip conditions; 感度関数の設定に用いるスイッチングパターンを示す図である。FIG. 4 is a diagram showing switching patterns used for setting a sensitivity function; 本発明の第１の実施形態に係る外乱オブザーバ装置による外乱抑制効果の評価結果を曲線グラフで示すプロット図である。FIG. 5 is a plot diagram showing, in a curve graph, an evaluation result of the disturbance suppression effect by the disturbance observer device according to the first embodiment of the present invention; 本発明の第２実施形態に従い、モータと負荷の運動学モデルに外乱オブザーバ装置と高周波ダンピング要素の両者が接続された状態のブロック線図である。FIG. 4 is a block diagram of both a disturbance observer device and a high frequency damping element connected to a motor and load kinematic model in accordance with a second embodiment of the present invention; 外乱オブザーバ装置と高周波ダンピング要素の組み合わせにより、スティック－スリップ現象によるモータ回転数の不連続変化が滑らかにされている評価結果を示す図である。FIG. 10 is a diagram showing evaluation results in which a discontinuous change in motor rotation speed due to a stick-slip phenomenon is smoothed by a combination of a disturbance observer device and a high-frequency damping element; 摩擦トルクにより生じる系の振動周波数とモータの回転加速度との関係を、外乱オブザーバの構成方式毎に対比した図である。FIG. 5 is a diagram comparing the relationship between the vibration frequency of the system caused by friction torque and the rotational acceleration of the motor for each configuration method of the disturbance observer; 高調波成分の各次数について、外乱オブザーバの構成方式毎に摩擦トルクの抑制効果を対比した図である。FIG. 5 is a diagram comparing the effect of suppressing friction torque for each order of harmonic components for each configuration method of the disturbance observer. 本発明の第３の実施形態の外乱オブザーバ装置の全体構成を示すブロック線図である。It is a block diagram which shows the whole structure of the disturbance observer apparatus of the 3rd Embodiment of this invention. 制御系内部のパラメータを機械学習の結果に従って適切に調整するための機械学習装置の構成を示す図である。FIG. 3 is a diagram showing the configuration of a machine learning device for appropriately adjusting parameters inside a control system according to machine learning results;

＜１＞第１の実施形態
（１－１）本発明の第１の実施形態に係る制御対象モータおよびモータ制御装置の構成
以下、図面を参照しながら、本発明の第１の実施形態に係るモータ制御装置について説明する。図１に示すように、第１実施形態のモータ装置１は、モータ２と、モータ２の出力軸（図示しない）の一端に接続された減速装置３と、モータ２の出力軸の他端に接続されたロータリ・エンコーダ４と、モータ２、減速装置３及びロータリ・エンコーダ４とそれぞれ配線７ａ、７ｂ、７ｃを介して接続されたモータ制御装置５を備えている。 <1> First Embodiment (1-1) Configurations of Controlled Motor and Motor Control Device According to First Embodiment of the Present Invention Hereinafter, the first embodiment of the present invention will be described with reference to the drawings. A motor control device will be described. As shown in FIG. 1, the motor device 1 of the first embodiment includes a motor 2, a speed reducer 3 connected to one end of an output shaft (not shown) of the motor 2, and a It comprises a connected rotary encoder 4, and a motor control device 5 connected to the motor 2, speed reducer 3 and rotary encoder 4 via wires 7a, 7b and 7c, respectively.

モータ２は、配線７ａを介してモータ制御装置５に接続されている。モータ制御装置５から配線７ａを介してモータ２に電流が供給されると、供給された電流の大きさに応じてモータ２は出力軸を回転させ、ねじりトルクを出力軸に発生させ、これによって減速装置３が駆動される。減速装置３は、出力軸６とトルクセンサ８と減速機構１０とを備えている。減速機構１０は、一端がモータ２の出力軸に接続され、他端が出力軸６に接続され、モータ２の出力軸の回転速度とねじりトルクの大きさとを減速比に応じて変換する。減速機構１０がモータ２によって駆動されると、出力軸６が回転し、出力軸６にねじりトルクが発生する。出力軸６の先端には、ロボットのアームのような負荷が機械的に接続され、出力軸６に生じたねじりトルクによって負荷が駆動される。 The motor 2 is connected to the motor control device 5 via wiring 7a. When a current is supplied from the motor control device 5 to the motor 2 through the wiring 7a, the motor 2 rotates the output shaft according to the magnitude of the supplied current to generate a torsional torque on the output shaft. The reduction gear 3 is driven. The reduction gear 3 includes an output shaft 6 , a torque sensor 8 and a reduction mechanism 10 . The reduction mechanism 10 has one end connected to the output shaft of the motor 2 and the other end connected to the output shaft 6, and converts the rotational speed of the output shaft of the motor 2 and the magnitude of torsional torque according to the reduction ratio. When the speed reduction mechanism 10 is driven by the motor 2 , the output shaft 6 rotates and torsional torque is generated in the output shaft 6 . A load such as an arm of a robot is mechanically connected to the tip of the output shaft 6 , and the load is driven by torsional torque generated in the output shaft 6 .

トルクセンサ８は、出力軸６に設置されている。トルクセンサ８は、出力軸６に生じたねじりトルクを測定する。トルクセンサ８は、ねじりトルクの測定値を出力値として出力する。ロータリ・エンコーダ４は、モータ２の出力軸の位置、すなわち、所定の基準点からの出力軸の角度位置を検出し、電気信号に変換して出力する。モータ制御装置５は、配線７ａを介してモータ２に電流を供給する。モータ制御装置５は、トルクセンサ８の出力値を、配線７ｂを介して受け取る。モータ制御装置５は、配線７ｃを介してロータリ・エンコーダ４から出力されたモータ２の角度位置を受け取る。 A torque sensor 8 is installed on the output shaft 6 . A torque sensor 8 measures the torsional torque generated on the output shaft 6 . The torque sensor 8 outputs a measured value of torsional torque as an output value. A rotary encoder 4 detects the position of the output shaft of the motor 2, that is, the angular position of the output shaft from a predetermined reference point, converts it into an electrical signal, and outputs it. The motor control device 5 supplies current to the motor 2 via the wiring 7a. The motor control device 5 receives the output value of the torque sensor 8 via the wiring 7b. Motor controller 5 receives the angular position of motor 2 output from rotary encoder 4 via line 7c.

モータ制御装置５は、トルク指令値や速度指令値などの種々の指令値に基づいて、モータ２が指令値に追従するように制御する。本実施形態では、モータ制御装置５はモータ２のねじりトルクを制御する場合を例として説明する。モータ制御装置５は、トルク指令値が入力されると、減速装置３の出力軸６に発生するねじりトルクがトルク指令値に追従するように、モータ２を制御する。すなわち、モータ制御装置５は、トルク指令値とトルクセンサ８の出力値とに基づいて算出した電流指令値をモータ２に出力してモータ２を駆動して、ねじりトルクの大きさを制御する。ここでは、電流指令値をモータ２に出力するということは、電流指令値に応じた直流電流をモータ２に供給することを意味している。 The motor control device 5 controls the motor 2 to follow the command values based on various command values such as a torque command value and a speed command value. In this embodiment, the case where the motor control device 5 controls the torsional torque of the motor 2 will be described as an example. When the torque command value is input, the motor control device 5 controls the motor 2 such that the torsional torque generated in the output shaft 6 of the speed reducer 3 follows the torque command value. That is, the motor control device 5 outputs to the motor 2 a current command value calculated based on the torque command value and the output value of the torque sensor 8 to drive the motor 2 and control the magnitude of the torsional torque. Here, outputting the current command value to the motor 2 means supplying the motor 2 with a DC current corresponding to the current command value.

ここで、減速装置３の構造についてさらに説明する。図２に示すように、減速装置３は、出力軸６と、トルクセンサ８と、減速機構１０と、出力軸６の一部及び減速機構１０を収容する筐体１２と、出力軸６を筐体１２に対して回転自在に支持するベアリング１３と、ベアリング１３上に設けられた軸カバー１４と、を備えている。 Here, the structure of the reduction gear 3 will be further described. As shown in FIG. 2, the speed reducer 3 includes an output shaft 6, a torque sensor 8, a speed reduction mechanism 10, a housing 12 that houses a portion of the output shaft 6 and the speed reduction mechanism 10, and a housing that houses the output shaft 6. A bearing 13 rotatably supporting the body 12 and a shaft cover 14 provided on the bearing 13 are provided.

減速機構１０は、波動歯車機構であり、モータ２の出力軸２ａの先端に接続されたウエーブジェネレータ１０ａと、薄肉の金属でカップ形状に形成されて弾性を有し、当該カップの開口部１０ｄの外側面にギア歯（図示せず）が設けられたフレクスプライン１０ｂと、当該フレクスプライン１０ｂのギア歯と噛み合うギア歯（図示せず）が内側面に設けられたサーキュラスプライン１０ｃと、を備えている。ウエーブジェネレータ１０ａは、モータ２の筐体（図示せず）に固定されたベアリング１６に支持された出力軸２ａの先端に、ねじ１１ｄによって固定されている。ウエーブジェネレータ１０ａはフレクスプライン１０ｂに挿入されている。 The speed reduction mechanism 10 is a strain wave gear mechanism, and includes a wave generator 10a connected to the tip of the output shaft 2a of the motor 2, and an elastic cup-shaped thin-walled metal having an opening 10d of the cup. A flexspline 10b provided with gear teeth (not shown) on the outer surface and a circular spline 10c provided with gear teeth (not shown) on the inner surface that mesh with the gear teeth of the flexspline 10b there is The wave generator 10a is fixed by a screw 11d to the tip of an output shaft 2a supported by a bearing 16 fixed to a housing (not shown) of the motor 2. As shown in FIG. Wave generator 10a is inserted in flexspline 10b.

フレクスプライン１０ｂの底部１５近傍にある出力軸６と筐体１２との間の空間には、オイルシール２５が設けられている。オイルシール２５は、筐体１２に固定されていると共に、出力軸６に接触しており、出力軸６と筐体１２との間の空間をシールし、減速機構１０側のオイルがトルクセンサ８側に飛散するのを防いでいる。出力軸６は、円柱形状をしており、減速機構１０に接続された一端が筐体１２に収容されており、他端が負荷を取り付けられるように筐体１２から突出している。 An oil seal 25 is provided in the space between the output shaft 6 and the housing 12 near the bottom 15 of the flexspline 10b. The oil seal 25 is fixed to the housing 12 and is in contact with the output shaft 6 to seal the space between the output shaft 6 and the housing 12 . It prevents it from scattering to the side. The output shaft 6 has a columnar shape, one end connected to the speed reduction mechanism 10 is accommodated in the housing 12, and the other end protrudes from the housing 12 so that a load can be attached.

出力軸６は、径が他の部分よりも小さく形成された起歪部６ａと、起歪部６ａよりもモータ２側に形成され、出力軸６から鍔状にせりだした円板形状の鍔部６ｂとを有している。起歪部６ａには、歪ゲージ１８が貼着されている。鍔部６ｂには、基板１９が基板固定支柱３０によって固定されている。基板１９はリング状の円板である。基板１９は、第１基板１９ａと第２基板１９ｂとでなり、第１基板１９ａの表面には歪ゲージ１８に結線されたトルク検出回路２９が設けられている。 The output shaft 6 has a strain-generating portion 6a formed to have a diameter smaller than that of other portions, and a disk-shaped flange formed closer to the motor 2 than the strain-generating portion 6a and protruding from the output shaft 6. and a portion 6b. A strain gauge 18 is attached to the strain generating portion 6a. A substrate 19 is fixed to the flange portion 6b by a substrate fixing support 30. As shown in FIG. The substrate 19 is a ring-shaped disc. The substrate 19 consists of a first substrate 19a and a second substrate 19b, and a torque detection circuit 29 connected to the strain gauge 18 is provided on the surface of the first substrate 19a.

トルク検出回路２９は、出力軸６に配置された歪ゲージ１８の抵抗変化を検出する抵抗変化検出回路（図２には不図示）を備える。さらに、トルク検出回路２９は、抵抗変化検出回路の出力信号に基づいてねじりトルクの測定値を算出するＣＰＵと、整流回路と、安定化回路と（いずれも図２には不図示）を備え、歪ゲージ１８に生じた抵抗変化から出力軸６に生じるねじりトルクを算出し送信部２７へ送出する。第２基板１９ｂの表面には、トルク検出回路２９に結線され、トルク検出回路２９から送出されたねじりトルク測定値の信号を無線で送信する送信部２７が設けられている。 The torque detection circuit 29 has a resistance change detection circuit (not shown in FIG. 2) that detects a resistance change of the strain gauge 18 arranged on the output shaft 6 . Furthermore, the torque detection circuit 29 includes a CPU for calculating the measured value of torsional torque based on the output signal of the resistance change detection circuit, a rectifier circuit, and a stabilization circuit (none of which is shown in FIG. 2), The torsional torque generated in the output shaft 6 is calculated from the resistance change generated in the strain gauge 18 and sent to the transmitter 27 . On the surface of the second substrate 19b, there is provided a transmission section 27 which is connected to the torque detection circuit 29 and which wirelessly transmits the signal of the torsional torque measurement value sent from the torque detection circuit 29. FIG.

筐体１２に固定された筐体基板２４には、送信部２７から送信された光信号を受信する受信部２８が送信部２７と対向する位置に設けられており、送信部２７及び受信部２８間で赤外線通信などの無線通信ができるようになされている。受信部２８の出力は配線７ｂ（図２には不図示）に接続されており、モータ制御装置５にねじりトルクの測定値に対応した信号を出力値として送出する。さらに、出力軸６には、例えばフェライトシートなどの磁性体シートでなり、出力軸６の側面を覆う２次側コア２１ａと、２次側コア２１ａの表面に例えば銅線などの導電性の線材を巻回して形成された２次コイル２１ｂと、を備える受電部２１が設けられている。 A housing substrate 24 fixed to the housing 12 is provided with a receiving section 28 for receiving an optical signal transmitted from the transmitting section 27 at a position facing the transmitting section 27 . Wireless communication such as infrared communication can be performed between them. The output of the receiver 28 is connected to the wiring 7b (not shown in FIG. 2), and sends a signal corresponding to the measured value of the torsional torque to the motor controller 5 as an output value. Further, the output shaft 6 has a secondary core 21a made of a magnetic sheet such as a ferrite sheet, covering the side surface of the output shaft 6, and a conductive wire such as a copper wire on the surface of the secondary core 21a. and a secondary coil 21b formed by winding a power receiving unit 21 is provided.

筐体基板２４には、受電部２１と対向する位置にコアホルダ２３が固定されている。コアホルダ２３は、送電部２２を保持している。送電部２２は、直方体形状の部材と、当該部材の長軸方向の両端で直方体の表面に垂直に同じ方向に突出した突部とを有する形状（断面形状がコ字型）をしており、例えばフェライトなどの磁性体で作られている１次側コア２２ａと、１次側コア２２ａの２つの突部間に例えば銅線などの導電性の線材を直方体形状の部材に巻回して形成された１次コイル２２ｂと、を備えている。 A core holder 23 is fixed to the housing substrate 24 at a position facing the power receiving unit 21 . Core holder 23 holds power transmission section 22 . The power transmission unit 22 has a rectangular parallelepiped member and protrusions (having a U-shaped cross section) that protrude in the same direction perpendicular to the surface of the rectangular parallelepiped at both ends of the member in the longitudinal direction, For example, the primary core 22a is made of a magnetic material such as ferrite, and a conductive wire such as a copper wire is wound around a rectangular parallelepiped member between the two projections of the primary core 22a. and a primary coil 22b.

送電部２２は、モータ制御装置５から供給された交流電流を１次コイル２２ｂに流し、１次コイル２２ｂに交流磁界を発生させ、受電部２１の２次コイル２１ｂに電流を誘起する。よって、２次コイル２１ｂが１次コイル２２ｂから非接触で電力を受電できる。受電部２１は、２次コイル２１ｂに誘起された交流電流をトルク検出回路２９に供給する。トルク検出回路２９は、供給された交流電圧を整流回路と安定化回路とによって直流電圧へと変換し、抵抗変化検出回路などに供給する。 The power transmission unit 22 causes the primary coil 22b to generate an AC magnetic field by supplying an alternating current supplied from the motor control device 5 to the primary coil 22b, thereby inducing a current in the secondary coil 21b of the power receiving unit 21. Therefore, the secondary coil 21b can receive power from the primary coil 22b in a non-contact manner. The power receiving unit 21 supplies the torque detection circuit 29 with the alternating current induced in the secondary coil 21b. The torque detection circuit 29 converts the supplied AC voltage into a DC voltage by means of a rectifying circuit and a stabilizing circuit, and supplies the DC voltage to a resistance change detection circuit or the like.

出力軸６を筐体１２に対して回転自在に支持するベアリング１３は、接続部２６を介して筐体１２に設けられている。接続部２６は、中心に穴が形成されており、当該穴内に出力軸６の鍔部６ｂが配置され、接続部２６の穴の内側面２６ａと鍔部６ｂとが所定の間隔を空けて対向するように、筐体１２に固定されている。ベアリング１３は、クロスローラベアリングであり、外輪１３ａと、内輪１３ｂと、円筒形状のコロ１３ｃとを備え、外輪１３ａが接続部２６に固定され、内輪１３ｂがねじ１１ｃによって鍔部６ｂに固定されることで、出力軸６が筐体１２に対して自在に回転できるように出力軸６を支持している。 A bearing 13 that rotatably supports the output shaft 6 with respect to the housing 12 is provided in the housing 12 via a connecting portion 26 . A hole is formed in the center of the connection portion 26, and the flange portion 6b of the output shaft 6 is arranged in the hole, and the inner side surface 26a of the hole of the connection portion 26 and the flange portion 6b face each other with a predetermined gap. It is fixed to the housing 12 so as to do so. The bearing 13 is a cross roller bearing, and includes an outer ring 13a, an inner ring 13b, and cylindrical rollers 13c. Thus, the output shaft 6 is supported so that the output shaft 6 can freely rotate with respect to the housing 12 .

軸カバー１４は、ねじ１１ｅによってベアリング１３の外輪１３ａに固定されており、中心に穴が形成されている。当該穴は、軸カバー１４をベアリング１３に固定したとき、穴の内側面と出力軸６とが接触しない程度の大きさに形成されている。トルクセンサ８は、上述の歪ゲージ１８とトルク検出回路２９と送信部２７と受信部２８と復調回路と受電部２１と送電部２２とで構成されている。トルクセンサ８の構成は、出力軸６に生じたねじりトルクτｓを測定できれば、特に限定されない。 The shaft cover 14 is fixed to the outer ring 13a of the bearing 13 with screws 11e and has a hole formed in the center. The hole is formed in such a size that the inner surface of the hole and the output shaft 6 do not come into contact when the shaft cover 14 is fixed to the bearing 13 . The torque sensor 8 is composed of the above strain gauge 18 , torque detection circuit 29 , transmission section 27 , reception section 28 , demodulation circuit, power reception section 21 and power transmission section 22 . The configuration of the torque sensor 8 is not particularly limited as long as it can measure the torsional torque τs generated on the output shaft 6 .

次に、図３および図４を参照しながら、第１の実施形態に係るモータ制御装置５の構成について説明する。以下で説明するモータ制御装置５の構成はあくまで一例であり、適宜変更してもよい。図３に示すモータ制御装置５は、内部の構成モジュールとして、プロセッサ５１０、記憶部５２０、加減算器５３０、電流指令出力インターフェース５４０、第１インターフェース回路５６１および第２インターフェース回路５６２を含んで構成され、これらは、バス５６０によって通信可能に相互接続されている。 Next, the configuration of the motor control device 5 according to the first embodiment will be described with reference to FIGS. 3 and 4. FIG. The configuration of the motor control device 5 described below is merely an example, and may be changed as appropriate. The motor control device 5 shown in FIG. 3 includes, as internal configuration modules, a processor 510, a storage unit 520, an adder/subtractor 530, a current command output interface 540, a first interface circuit 561 and a second interface circuit 562. They are communicatively interconnected by bus 560 .

本実施形態では、上述のように、モータ制御装置５は、モータ２に接続された減速機構１０の出力軸６に発生させるねじりトルクτｓを制御するように構成されている。モータ制御装置５は、外乱オブザーバ装置５０（図４を参照）として動作するようにプログラミングされたプロセッサ５１０と、摩擦トルクによる外乱の推定値と制御目標値との和からモータ２への電流指令値ｉ_ｑ ^ｒｅｆ（図４）を算出してプロセッサ５１０（外乱オブザーバ装置５０）に入力する加減算器５３０と、プロセッサ（外乱オブザーバ装置５０）とデータ通信可能に接続され、複数の異なる制御パラメータγ_ｋ（１≦ｋ≦Ｋ）を記憶する記憶部５２０とを備えている。電流指令値ｉ_ｑ ^ｒｅｆは、モータ駆動電流としてモータ２に供給される電力である。 In this embodiment, as described above, the motor control device 5 is configured to control the torsional torque τs generated in the output shaft 6 of the speed reduction mechanism 10 connected to the motor 2 . The motor control device 5 includes a processor 510 programmed to operate as a disturbance observer device 50 (see FIG. 4), and a current command value to the motor 2 from the sum of the estimated value of the disturbance due to friction torque and the control target value. An adder/subtractor 530 for calculating i _q ^ref (FIG. 4) and inputting it to the processor 510 (disturbance observer device 50) is connected to the processor (disturbance observer device 50) for data communication, and calculates a plurality of different control parameters γ _k ( 1≦k≦K). The current command value i _q ^ref is electric power supplied to the motor 2 as motor drive current.

さらに、モータ制御装置５は、第１インターフェース回路５６１および第２インターフェース回路５６２を有し、バス５６０を介して上記の各構成要素と接続されている。第１インターフェース回路５６１は、配線７ｂを介して図１に示すトルクセンサ８からトルク測定値のデータを含む信号を受信する。第２インターフェース回路５６２は、配線７ｃを介して図１に示すロータリ・エンコーダ４からモータ２の出力軸２ａの角度位置を表すデータを信号として受信する。 Furthermore, the motor control device 5 has a first interface circuit 561 and a second interface circuit 562 and is connected to each of the components described above via a bus 560 . The first interface circuit 561 receives a signal containing torque measurement data from the torque sensor 8 shown in FIG. 1 via line 7b. The second interface circuit 562 receives data representing the angular position of the output shaft 2a of the motor 2 as a signal from the rotary encoder 4 shown in FIG. 1 via the wiring 7c.

プロセッサ５１０は、トルクセンサ８からトルク測定値のデータを受信し、ロータリ・エンコーダ４からモータ２の出力軸２ａの角度位置を表すデータを受信する。その上で、プロセッサ５１０は、受信したこれらのデータを入力として用いて、モータ２に出力する電流指令の値を計算し、加減算器５３０を介して電流指令出力インターフェース５４０に出力する。また、プロセッサ５１０は、モータ２の制御に必要なその他の演算を実行し、図示しない他の出力インターフェースを介して制御出力として出力するようにしてもよい。 The processor 510 receives torque measurement data from the torque sensor 8 and data representing the angular position of the output shaft 2 a of the motor 2 from the rotary encoder 4 . Processor 510 then uses these received data as inputs to calculate the value of the current command to be output to motor 2 and outputs it to current command output interface 540 via adder/subtractor 530 . The processor 510 may also perform other calculations necessary for controlling the motor 2 and output them as control outputs via other output interfaces (not shown).

本実施形態では、記憶部５２０から読み出した外乱オブザーバ・プログラム５２１をプロセッサ５１０が読み込んで実行することで、外乱オブザーバ装置５０が実現されている。なお、外乱オブザーバ装置５０は、専用に設計されたプロセッサによって実現されてもよく、複数のプロセッサや回路の集合体として構成されてもよい。 In this embodiment, the disturbance observer device 50 is implemented by the processor 510 reading and executing the disturbance observer program 521 read from the storage unit 520 . The disturbance observer device 50 may be implemented by a specially designed processor, or may be configured as a collection of multiple processors and circuits.

また、本実施形態では、外乱オブザーバ・プログラム５２１を実行中のプロセッサ５１０は、記憶部５２０に記憶された外乱補償制御の演算に必要なデータを読み出し、摩擦トルクによる外乱を推定して抑制するための補償制御を行うように構成されている。 Further, in the present embodiment, the processor 510 executing the disturbance observer program 521 reads the data necessary for calculation of the disturbance compensation control stored in the storage unit 520, and estimates and suppresses the disturbance due to the friction torque. compensation control.

本実施形態では、演算に用いるデータとして、モータ２の回転速度ω_ｍからモータ２の出力トルクτ_ｍを推定するのに用いる疑似的な慣性モーメントＪ_ｍ（データ５２２ａ）と、モータ２と出力軸６の間に介装された減速機構１０の減速比Ｒｇ（データ５２２ｂ）と、負荷Ｌおよびモータ２の運動学モデルを記述する動力学特性パラメータ（５２２ｃ、５２２ｆ）と、外乱オブザーバ装置５０の周波数応答を所望の応答特性に近づけるように設定されるフィルタ係数および利得係数（データ５２２ｄ）と、摩擦トルクに起因する非線形制御特性をモデル化する非線形特性パラメータ（データ５２２ｅ）が記憶部５２０に記憶されている。 In this embodiment, the data used for calculation are the pseudo moment of inertia J _m (data 522a) used for estimating the output torque τ _m of the motor 2 from the rotational speed ω _m of the motor 2, 6, the dynamic characteristic parameters (522c, 522f) describing the kinematic model of the load L and the motor 2, and the frequency of the disturbance observer device 50 A storage unit 520 stores a filter coefficient and a gain coefficient (data 522d) that are set to bring the response closer to a desired response characteristic, and a nonlinear characteristic parameter (data 522e) that models the nonlinear control characteristic caused by friction torque. ing.

本発明の幾つかの実施形態では、図４に示すモータ制御装置５は、図７を用いて後述するスティック－スリップ現象により摩擦トルクがモータ２の出力軸２ａに作用し、これに起因する外乱を外乱オブザーバ装置５０が抑制するように構成される。 In some embodiments of the present invention, the motor control device 5 shown in FIG. 4 has a friction torque acting on the output shaft 2a of the motor 2 due to a stick-slip phenomenon, which will be described later with reference to FIG. is configured to be suppressed by the disturbance observer device 50 .

そこで、以下、図５を参照しながらスティック－スリップ現象によりモータ２の出力軸２ａに作用する摩擦トルクについて詳しく述べる。スティック－スリップ現象とは、強い非線形性を持つ摩擦外乱によって引き起こされる現象である。スティック－スリップ現象は、摩擦係数が急激かつ非常に大きくなりモータ２の出力軸２ａが静止摩擦につかまって極低速状態となるスティック現象と、さらに力を加えてゆくと正の値の摩擦係数が急に負の値に符号反転し、急激に回転が速くなるスリップ現象とが組み合わさって生じる現象である。スティック－スリップ現象が生じると回転速度及び回転加速度が滑らかでなくなる。 Therefore, the friction torque acting on the output shaft 2a of the motor 2 due to the stick-slip phenomenon will be described in detail below with reference to FIG. A stick-slip phenomenon is a phenomenon caused by a frictional disturbance with strong nonlinearity. The stick-slip phenomenon consists of a stick phenomenon in which the coefficient of friction abruptly and extremely increases, and the output shaft 2a of the motor 2 is caught by static friction, resulting in an extremely low speed state, and a positive coefficient of friction increases as further force is applied. This is a phenomenon that occurs in combination with a slip phenomenon in which the sign suddenly reverses to a negative value and the rotation speeds up abruptly. The stick-slip phenomenon results in non-smooth rotational speed and rotational acceleration.

以下、上述したスティック現象とスリップ現象について図５（Ａ）および図５（Ｂ）に示す具体例を用いて説明する。図５（Ａ）に示す曲線グラフは、モータ２の回転速度ω_ｍを横軸とし、モータ２の出力軸２ａに作用する摩擦トルクの大きさを縦軸とし、回転速度ω_ｍの変化に応じた摩擦トルクの変化をプロットしたものである。また、図５（Ｂ）に示す曲線グラフは、経過時間ｔを横軸とし、モータ２の回転速度ω_ｍを縦軸とし、時間ｔの経過に伴って回転速度ω_ｍがどのように変化するかをプロットしたものである。 The above-described stick phenomenon and slip phenomenon will be described below using specific examples shown in FIGS. 5(A) and 5(B). In the curve graph shown in FIG. 5A, the horizontal axis represents the rotation speed _ωm _of the motor 2, and the vertical axis represents the magnitude of the friction torque acting on the output shaft 2a of the motor 2. It is a plot of the change in friction torque. In the curve graph shown in FIG. 5B, the horizontal axis represents the elapsed time t, and the vertical axis represents the rotation speed _ωm of the motor 2. How the rotation speed _ωm changes with the passage of time t. is plotted.

図５（Ａ）に示す回転速度領域ＶＲ（２）は、図５（Ｂ）に示す時間区間Ｐｈ（１）に対応し、時間区間Ｐｈ（１）内では、モータ２の回転速度ω_ｍは領域ＶＲ（２）内にある。図５（Ａ）および図５（Ｂ）に示すように、モータ２の回転速度ω_ｍが領域ＶＲ（２）内にある間は、スティック現象が生じ、モータ２の出力軸２ａが静止摩擦につかまって極低速状態となっており、出力軸６に作用している摩擦トルクにおいてクーロン摩擦成分が支配的となっている。 The rotational speed region VR(2) shown in FIG. 5(A) corresponds to the time interval Ph(1) shown in FIG. 5(B) _. It is in region VR(2). As shown in FIGS. 5A and 5B, while the rotation speed _ωm of the motor 2 is within the region VR(2), the stick phenomenon occurs, and the output shaft 2a of the motor 2 is affected by static friction. It is caught and is in an extremely low speed state, and the Coulomb friction component is dominant in the friction torque acting on the output shaft 6 .

これに対し、図５（Ａ）に示す回転速度領域ＶＲ（１）は、図５（Ｂ）に示す時間区間Ｐｈ（２）に対応し、時間区間Ｐｈ（２）内では、モータ２の回転速度ω_ｍは領域ＶＲ（１）内にある。図５（Ａ）および図５（Ｂ）に示すように、モータ２の回転速度ω_ｍが回転速度ω_ｓｌｉｐを境にして回転速度ω_ｍが急激に上昇しており、スリップ現象が生じていることがわかる。この急激な回転速度ω_ｍの上昇は、回転トルクが増加していって領域ＶＲ（２）で作用していた静止摩擦を乗り越えたことで、出力軸６に作用している摩擦トルクにおいてクーロン摩擦成分に代わって粘性摩擦成分が支配的となったことを示している。 On the other hand, the rotation speed region VR(1) shown in FIG. 5A corresponds to the time interval Ph(2) shown in FIG. Velocity ω _m is within region VR(1). As shown in FIGS. 5(A) and 5(B), the rotation speed _ωm of the motor 2 sharply _increases beyond the rotation speed _ωslip , and a slip phenomenon occurs. I understand. This rapid increase in rotational speed ω _m is caused by the fact that the rotational torque increases and overcomes the static friction acting in the region VR(2). This indicates that the viscous friction component has become dominant instead of the component.

この急激な変化は、例えば、図２に示すオイルシール２５とモータ２の出力軸２ａとの間の摩擦によって引き起こされた非線形摩擦効果が増大することで生じる。スティック－スリップ現象により生じた摩擦トルクがモータ２の出力軸２ａに外乱として作用している期間中は、モータ２の回転速度ω_ｍおよび回転加速度ω_ｍ’は滑らかに変化せず、不規則に歪んだ波形を示す。このようなスティック－スリップ現象が発生すると、回転が滑らかではなく不規則に変化する状態にあるモータ２から出力軸６に取り付けられた負荷に出力軸６を介してトルクが伝わる。その結果、負荷Ｌにおいても高周波成分を含む振動が発生し好ましくない。 This rapid change is caused by, for example, an increase in the nonlinear frictional effect caused by the friction between the oil seal 25 and the output shaft 2a of the motor 2 shown in FIG. During the period when the friction torque generated by the stick-slip phenomenon acts as a disturbance on the output shaft 2a of the motor 2, the rotation speed ω _m and the rotation acceleration ω _m ′ of the motor 2 do not change smoothly, but irregularly. Shows a distorted waveform. When such a stick-slip phenomenon occurs, torque is transmitted through the output shaft 6 to the load attached to the output shaft 6 from the motor 2 whose rotation is not smooth but varies irregularly. As a result, even at the load L, vibration containing high frequency components is generated, which is not preferable.

（１－２）本発明の第１の実施形態に係る外乱オブザーバ装置の構成と動作
本実施形態では、ＳＶＭＮＣ（Sensitivity-variable Motor-side Normalization Compensator）を外乱オブザーバ装置に用いることで、スティック－スリップ現象の影響を効果的に低減させるようにしている。ＳＶＮＭＣは、モータの出力軸が受けるねじりトルクとモータの電流指令値とモータの回転速度とに基づいて外乱を補償するトルク補償値を算出する装置であり、ＳＶＭＮＣ５５ａの特性を表す後述の感度関数がモータ２の回転速度ω_ｍに基づいて設定されるように構成されている。そして、図４に示す本実施形態の外乱オブザーバ装置５０では、補償値演算部としてのＳＶＭＮＣ５５ａが上記の制御入力に基づいて摩擦トルクτ_ｄｍを補償するトルク補償値を算出し、トルク補償値が制御目標値である電流ｉ_ｃｔｒｌにフィードバックされるように構成している。本実施形態では、このように外乱オブザーバ装置５０を構成することで、スティック－スリップ現象を抑制するようにしている。 (1-2) Configuration and operation of the disturbance observer device according to the first embodiment of the present invention In this embodiment, by using a SVMNC (Sensitivity-variable Motor-side Normalization Compensator) in the disturbance observer device, stick-slip The effect of the phenomenon is effectively reduced. The SVNMC is a device that calculates a torque compensation value for compensating for disturbance based on the torsional torque received by the motor output shaft, the motor current command value, and the motor rotation speed. It is configured to be set based on the rotation speed ω _m of the motor 2 . In the disturbance observer device 50 of the present embodiment shown in FIG. 4, the SVMNC 55a as the compensation value calculation unit calculates the torque compensation value for compensating the friction torque _τdm based on the above control input, and the torque compensation value is controlled. It is configured to be fed back to the current i _ctrl which is the target value. In this embodiment, the stick-slip phenomenon is suppressed by configuring the disturbance observer device 50 in this way.

以下では、図４に示すブロック線図を用いて、外乱オブザーバ装置５０の構成をより詳細に説明する。図４のブロック線図は、制御系全体を示す図であり、モータ２を運動学モデル２０、減速装置３を運動学モデル９０として表している。また、外乱オブザーバ装置５０も同様にモデル化されて表されている。なお、モータ２及び減速装置３は、出力軸６が弾性を有していたり、減速機構１０のギア歯が弾性結合していたりするなどのために、所定の共振周波数で振動する機械共振系である。そのため、図４では、モータ２及び減速装置３を二慣性共振系の近似化モデルを用いて表している。 The configuration of the disturbance observer device 50 will be described in more detail below using the block diagram shown in FIG. The block diagram of FIG. 4 shows the entire control system, and represents the motor 2 as a kinematics model 20 and the speed reducer 3 as a kinematics model 90 . Also, the disturbance observer device 50 is similarly modeled and represented. The motor 2 and the reduction gear 3 are mechanical resonance systems that vibrate at a predetermined resonance frequency because the output shaft 6 has elasticity and the gear teeth of the reduction mechanism 10 are elastically coupled. be. Therefore, in FIG. 4, the motor 2 and the reduction gear 3 are represented using an approximation model of a two-inertia resonance system.

まずは、モータ２をモデル化した運動学モデル２０について説明する。モータ２は、Ｋｔをゲインに有する乗算器２０ｄと、１／Ｊｍをゲインに有する乗算器２０ｅと、非線形特性を有する摩擦トルクをモデル化した外乱要素２０ｇと、減算器２０ｃと、加算器２０ｆと、２つの積分器２０ａ、２０ｂとで構成される。図中の、Ｋｔはトルク定数であり、Ｊｍはモータ２の慣性モーメントであり、ｓはラプラス演算子である。 First, the kinematics model 20 that models the motor 2 will be described. The motor 2 includes a multiplier 20d having a gain of Kt, a multiplier 20e having a gain of 1/Jm, a disturbance element 20g modeling friction torque having nonlinear characteristics, a subtractor 20c, and an adder 20f. , and two integrators 20a, 20b. In the figure, Kt is the torque constant, Jm is the moment of inertia of the motor 2, and s is the Laplace operator.

運動学モデル２０では、電流指令値ｉ_ｑ ^ｒｅｆと、減速装置３からモータ２の出力軸２ａ（図２）が受けるトルクの値であるトルク応答値とがモータ２に入力され、出力軸２ａの回転速度ω_ｍと出力軸２ａの回転角度を表す角度位置θ_ｍとがモータ２から出力されるように表されている。 In the kinematics model 20, the current command value i _q ^ref and the torque response value, which is the value of the torque received by the output shaft 2a (FIG. 2) of the motor 2 from the reduction gear 3, are input to the motor 2, and the output shaft 2a The rotation speed ω _m and the angular position θ _m representing the rotation angle of the output shaft 2 a are shown to be output from the motor 2 .

モータ２に入力された電流指令値ｉ_ｑ ^ｒｅｆは、乗算器２０ｄにおいてトルク定数Ｋｔを乗算され、電流指令値ｉ_ｑ ^ｒｅｆに応じたトルク値τ_ｒｅｆに変換される。当該トルク値τ_ｒｅｆは、減算器２０ｃにおいて、後述するモータ２の外乱トルクτ_ｄｉｓを減算される。このようにして減算器２０ｃでは、出力軸２ａに生じる出力トルク値τ_ｒｅｆが等価的に算出される。 A current command value i _q ^ref input to the motor 2 is multiplied by a torque constant Kt in a multiplier 20d and converted into a torque value τ _ref corresponding to the current command value i _q ^ref . A disturbance torque τ _dis of the motor 2, which will be described later, is subtracted from the torque value τ _ref in a subtractor 20c. In this manner, the subtractor 20c equivalently calculates the output torque value τ _ref produced on the output shaft 2a.

出力トルク値は、乗算器２０ｅに入力され、モータ２の出力軸２ａの慣性モーメントＪｍの逆数を乗算される。乗算器２０ｅでは、出力軸２ａに生じる加速度を表す回転加速度

が等価的に算出され、算出された出力軸２ａの回転加速度は、積分器２０ｂに入力され、積分される。積分器２０ｂでは、回転速度ω_ｍが等価的に算出される。 The output torque value is input to the multiplier 20e and multiplied by the reciprocal of the moment of inertia Jm of the output shaft 2a of the motor 2. FIG. In the multiplier 20e, the rotation acceleration representing the acceleration generated on the output shaft 2a

is equivalently calculated, and the calculated rotational acceleration of the output shaft 2a is input to the integrator 20b and integrated. The integrator 20b equivalently calculates the rotational speed _ωm .

回転速度ω_ｍは積分器２０ａに入力されて積分される。積分器２０ａでは、角度位置θ_ｍが等価的に算出される。一方で、回転速度ω_ｍには非線形特性を持った外乱要素２０ｇが適用され、この外乱は、スティック－スリップ現象によりモータ２の出力軸２ａに作用する摩擦力に対応する。回転速度ω_ｍに非線形特性の外乱要素２０ｇが適用されて得られた摩擦トルクτ_ｄｍは、加算器２０ｆに入力され、前述のトルク応答値と加算される。加算器２０ｆでは、モータ２に生じる外乱トルクτ_ｄｉｓが等価的に算出される。 The rotation speed ω _m is input to the integrator 20a and integrated. The integrator 20a equivalently calculates the angular position _θm . On the other hand, a disturbance element 20g having non-linear characteristics is applied to the rotation speed ω _m , and this disturbance corresponds to the frictional force acting on the output shaft 2a of the motor 2 due to the stick-slip phenomenon. The friction torque _τdm obtained by applying the disturbance element 20g with nonlinear characteristics to the rotation speed _ωm is input to the adder 20f and added to the torque response value described above. The adder 20f equivalently calculates the disturbance torque τ _dis generated in the motor 2 .

以上のように、電流指令値ｉ_ｑ ^ｒｅｆと、減速装置３からのトルク応答値とに基づいて回転速度ω_ｍと角度位置θ_ｍとが算出され、モータ２から出力される。モータ２から出力された回転速度ω_ｍは、図４に示す外乱オブザーバ装置５０の中のＳＶＭＮＣ５５ａに入力される。モータ２から出力された回転速度ω_ｍを入力として受け取ったＳＶＭＮＣ５５ａを含む外乱オブザーバ装置５０では、スティック－スリップ現象による非線形な摩擦トルクτ_ｄｍを抑制可能な外乱補償演算を行う。 As described above, the rotation speed ω _m and the angular position θ _m are calculated based on the current command value i _q ^ref and the torque response value from the reduction gear 3 and output from the motor 2 . The rotation speed ω _m output from the motor 2 is input to the SVMNC 55a in the disturbance observer device 50 shown in FIG. The disturbance observer device 50 including the SVMNC 55a that receives the rotation speed ω _m output from the motor 2 as an input performs disturbance compensation calculation capable of suppressing the nonlinear friction torque τ _dm due to the stick-slip phenomenon.

次に、減速装置３をモデル化した運動学モデル９０について説明する。運動学モデル９０では、出力軸６と減速機構１０とがモデル化されており、紙面上段の点線で囲まれた領域が出力軸６に対応する運動学モデルであり、紙面下段の点線で囲まれた領域が減速機構１０に対応する運動学モデルである。減速装置３の減速機構１０に対応する運動学モデルは、減速機構１０の減速比Ｒ_ｇの逆数をゲインに有する乗算器９０ｄ、９０ｅと、減算器９０ｈと、ばね定数Ｋｓをゲインとして有する乗算器９０ｇと積分器９０ｆとで構成される。一方、出力軸６に対応する運動学モデルは、減算器９０ｉと、出力軸６の慣性モーメントＪｌの逆数をゲインとして有する乗算器９０ａと、積分器９０ｂ、９０ｃとで構成される。二慣性共振系の近似化モデルでは、減速装置３の出力軸６に生じるねじりトルクτｓが、出力軸２ａ及び出力軸６の速度差により生じるねじり角と、モータ２及び減速装置３間の機械共振振動に依存して定まるばね定数Ｋｓとの積としてモデル化される。そのため、減速装置３の運動学モデル９０では、ねじりトルクを等価的に算出するため、出力軸２ａの回転速度ω_ｍと減速装置３の出力軸６の回転速度ω_ｌとが減速機構１０に入力されるように表されている。 Next, a kinematic model 90 that models the reduction gear 3 will be described. In the kinematics model 90, the output shaft 6 and the speed reduction mechanism 10 are modeled. is a kinematic model corresponding to the speed reduction mechanism 10 . A kinematic model corresponding to the speed reduction mechanism 10 of the speed reduction gear 3 includes multipliers 90d and 90e having the reciprocal of the speed reduction ratio _Rg of the speed reduction mechanism 10 as gains, a subtractor 90h, and a multiplier having the spring constant Ks as the gain. 90g and an integrator 90f. On the other hand, the kinematic model corresponding to the output shaft 6 is composed of a subtractor 90i, a multiplier 90a having the reciprocal of the moment of inertia Jl of the output shaft 6 as a gain, and integrators 90b and 90c. In the approximation model of the two-inertia resonance system, the torsional torque τs generated in the output shaft 6 of the speed reducer 3 is the torsion angle generated by the speed difference between the output shafts 2a and 6 and the mechanical resonance between the motor 2 and the speed reducer 3. It is modeled as a product with a spring constant Ks determined depending on vibration. Therefore, in the kinematic model 90 of the speed reducer 3, the rotational speed _ωm of the output shaft 2a and the rotational speed _ωl of the output shaft 6 of the speed reducer 3 are input to the speed reducer 10 in order to equivalently calculate the torsional torque. are represented as

また、減速装置３の運動学モデル９０では、ねじりトルクτｓが、出力軸６側と、帯域がＬｓ（ｓ）であるトルクセンサ８とに出力され、前述のトルク応答値がモータ２に出力されるように表されている。減速機構１０へ入力された回転速度ω_ｍは、乗算器９０ｅで減速比Ｒｇの逆数を乗算される。これは出力軸２ａの回転が減速機構１０で減速されることを表している。乗算器９０ｅでは、回転速度ω_ｍが減速機構１０で減速後の回転速度、すなわち、出力軸６側での回転速度に変換される。 Further, in the kinematic model 90 of the speed reducer 3, the torsional torque τs is output to the output shaft 6 side and the torque sensor 8 whose band is Ls(s), and the torque response value described above is output to the motor 2. is represented as The rotation speed _ωm input to the reduction mechanism 10 is multiplied by the reciprocal of the reduction ratio Rg in the multiplier 90e. This indicates that the rotation of the output shaft 2a is decelerated by the deceleration mechanism 10. FIG. In the multiplier 90e, the rotation speed _ωm is converted into the rotation speed after deceleration by the speed reduction mechanism 10, that is, the rotation speed on the output shaft 6 side.

減算器９０ｈでは、回転速度ω_ｍから乗算器９０ｅにて減速比Ｒｇの逆数を乗算されて減速後の値に変換されたものから回転速度ω_ｌが減算され、出力軸２ａと出力軸６との速度差が算出される。減算器９０ｈの出力は、積分器９０ｆに入力され、積分されるのに続いて乗算器９０ｇによりばね定数Ｋｓを乗算される。積分器９０ｆでは、出力軸２ａと出力軸６との速度差を積分することで出力軸２ａと出力軸６とのねじり角θ_ｓが算出され、当該積分結果にばね定数Ｋｓが乗算されてねじりトルクτｓが等価的に算出される。 The subtractor 90h subtracts the rotational speed _ωl from the rotational speed _ωm multiplied by the reciprocal of the reduction ratio Rg in the multiplier 90e and converted into a decelerated value. is calculated. The output of the subtractor 90h is input to the integrator 90f, integrated, and then multiplied by the spring constant Ks by the multiplier 90g. The integrator 90f integrates the speed difference between the output shafts 2a and 6 to calculate the torsion angle _θs between the output shafts 2a and 6, and multiplies the integration result by the spring constant Ks to obtain the torsion. Torque τs is equivalently calculated.

算出されたねじりトルクτｓは、減速機構１０から、トルクセンサ８と出力軸６側とに出力される。またねじりトルクτｓは乗算器９０ｄに入力されて減速比Ｒｇの逆数を乗算される。乗算器９０ｄは、ねじりトルクτｓをモータ２側の値に変換し、トルク応答値を算出する。トルク応答値は、減速機構１０からモータ２へ入力される。ねじりトルクτｓを入力されたトルクセンサ８は、ねじりトルクτｓの測定値として、出力値τｓを外乱オブザーバ装置５０に出力する。なお、運動学モデル９０では、減速装置３からトルクセンサ８へねじりトルクτｓが出力されるように表されているが、実際には、出力軸６に設けられたトルクセンサ８で出力軸６に発生したねじりトルクτｓを検出している。 The calculated torsional torque τs is output from the speed reduction mechanism 10 to the torque sensor 8 and the output shaft 6 side. Also, the torsional torque τs is input to the multiplier 90d and multiplied by the reciprocal of the reduction ratio Rg. The multiplier 90d converts the torsional torque τs into a value on the motor 2 side and calculates a torque response value. The torque response value is input from the speed reduction mechanism 10 to the motor 2 . The torque sensor 8 to which the torsional torque τs is input outputs an output value τs to the disturbance observer device 50 as a measured value of the torsional torque τs. In the kinematics model 90, the torsional torque τs is represented as being output from the reduction gear 3 to the torque sensor 8, but in reality, the torque sensor 8 provided on the output shaft 6 outputs the torsion torque τs. The generated torsional torque τs is detected.

出力軸６は、減算器９０ｉと、出力軸６の慣性モーメントＪｌの逆数をゲインとして有する乗算器９０ａと、積分器９０ｂ、９０ｃとを有し、ねじりトルクτｓと、出力軸６の外乱トルクτ_ｌ ^ｅｘｔが出力軸６に入力され、出力軸６の回転速度を表す回転速度ω_ｌと出力軸６の回転角度を表す角度位置θ_ｌとが出力軸６から出力されるように表されている。出力軸６に入力されたねじりトルクτｓは、減算器９０ｉで出力軸６に入力された外乱トルクτ_ｌ ^ｅｘｔを減算される。減算器９０ｉでは、ねじりトルクτｓから外乱トルクτ_ｌ ^ｅｘｔ成分が除かれ、出力軸６に生じる出力トルク値が算出される。 The output shaft 6 has a subtractor 90i, a multiplier 90a having the reciprocal of the moment of inertia Jl of the output shaft 6 as a gain, and integrators 90b and 90c. _l ^ext is input to the output shaft 6, and the rotation speed _ωl representing the rotation speed of the output shaft 6 and the angular position _θl representing the rotation angle of the output shaft 6 are output from the output shaft 6. . The torsional torque τs input to the output shaft 6 is subtracted from the disturbance torque τ _l ^ext input to the output shaft 6 by the subtractor 90i. The subtractor 90i removes the disturbance torque τ _l ^ext component from the torsional torque τs to calculate the output torque value generated on the output shaft 6 .

出力トルク値は、乗算器９０ａに入力され、出力軸６の慣性モーメントＪｌの逆数を乗算される。乗算器９０ａでは、出力軸６に生じる加速度を表す回転加速度

が等価的に算出され、算出された出力軸６の回転加速度は、積分器９０ｂに入力され、積分される。積分器９０ｂでは、回転速度ω_ｌが等価的に算出される。回転速度ω_ｌは出力軸６から減速機構１０に出力される。一方で、回転速度ω_ｌは積分器９０ｃにも入力されて積分される。積分器９０ｃでは、角度位置θ_ｌが等価的に算出される。角度位置θ_ｌは出力軸６から出力される。 The output torque value is input to the multiplier 90a and multiplied by the reciprocal of the moment of inertia Jl of the output shaft 6. FIG. In the multiplier 90a, the rotation acceleration representing the acceleration occurring in the output shaft 6 is

is equivalently calculated, and the calculated rotational acceleration of the output shaft 6 is input to the integrator 90b and integrated. The integrator 90b equivalently calculates the rotational speed _ωl . The rotation speed _ωl is output from the output shaft 6 to the reduction mechanism 10 . On the other hand, the rotation speed _ωl is also input to the integrator 90c and integrated. The integrator 90c equivalently calculates the angular position _θl . The angular position θ _l is output from the output shaft 6 .

次に、外乱オブザーバ装置５０の構成と動作について説明する。まず、外乱オブザーバ装置５０の概要について説明する。外乱オブザーバ装置５０は、トルクセンサ８からねじりトルクτｓの測定値を制御入力として受信する。また、外乱オブザーバ装置５０は、モータ制御装置５から回転速度ω_ｍを制御入力として受信する。さらに、外乱オブザーバ装置５０は、モータ２の電流指令値ｉ_ｑ ^ｒｅｆを制御入力として受け取る。なお、回転速度ω_ｍは、モータ制御装置５が、ロータリ・エンコーダ４からモータ２の出力軸２ａの角度位置θ_ｍを表すデータを周期的に受信し、時間あたりの角度位置θ_ｍの変化率を求めることで得られる。また、ロータリ・エンコーダ４が角度位置θ_ｍを表すデータをアナログ信号として出力し、モータ制御装置５が当該アナログ信号を例えば微分器などを用いて微分することで、回転速度ω_ｍを算出するようにしてもよい。図４では、便宜的に、モータ２の回転速度ω_ｍがモータ２から外乱オブザーバ装置５０のＳＶＭＮＣ５５ａに直接入力されるように示されている。 Next, the configuration and operation of the disturbance observer device 50 will be described. First, an outline of the disturbance observer device 50 will be described. A disturbance observer device 50 receives the measured value of the torsional torque τs from the torque sensor 8 as a control input. The disturbance observer device 50 also receives the rotation speed ω _m from the motor control device 5 as a control input. Furthermore, the disturbance observer device 50 receives the current command value i _q ^ref of the motor 2 as a control input. The rotational speed ω _m is determined by the motor control device 5 periodically receiving data representing the angular position θ _m of the output shaft 2a of the motor 2 from the rotary encoder 4, and the rate of change of the angular position θ _m per time. obtained by asking for Further, the rotary encoder 4 outputs data representing the angular position _θm as an analog signal, and the motor control device 5 differentiates the analog signal using, for example, a differentiator to calculate the rotational speed _ωm . can be 4, for the sake of convenience, the rotational speed ω _m of the motor 2 is shown to be directly input from the motor 2 to the SVMNC 55a of the disturbance observer device 50. In FIG.

外乱オブザーバ装置５０は、ねじりトルクτｓと回転速度ω_ｍと電流指令値ｉ_ｑ ^ｒｅｆとに基づいて、摩擦トルクτ_ｄｍによる外乱を補償する補償トルク（トルク補償値ともいう）を算出する。外乱オブザーバ装置５０は、算出したトルク補償値を電流値に変換して加減算器５３０に出力し、制御目標値である電流ｉ_ｃｔｒｌにフィードバックする。本実施形態では、電流ｉ_ｃｔｒｌに外乱オブザーバ装置５０の出力がフィードバックされて電流指令値ｉ_ｑ ^ｒｅｆが算出される。これにより、モータ２の出力軸２ａに生じた摩擦トルクτ_ｄｍが補償され、外乱が抑制される。 The disturbance observer device 50 calculates a compensation torque (also referred to as a torque compensation value) that compensates for the disturbance due to the friction torque _τdm based on the torsional torque τs, the ^rotation speed _ωm , and the current command value _iqref . The disturbance observer device 50 converts the calculated torque compensation value into a current value, outputs it to the adder/subtractor 530, and feeds it back to the current i _ctrl which is the control target value. In this embodiment, the output of the disturbance observer device 50 is fed back to the current i _ctrl to calculate the current command value i _q ^ref . As a result, the friction torque _τdm generated in the output shaft 2a of the motor 2 is compensated, and disturbance is suppressed.

次いで、外乱オブザーバ装置５０の構成について説明する。外乱オブザーバ装置５０は、摩擦トルクτ_ｄｍによる外乱を抑制するための制御計算を実行するＳＶＭＮＣ５５ａ、加算器５０ａ、利得要素５０ｂ、５０ｃ、５０ｄおよび加算器５０ｅを備える。利得要素５０ｃは、ねじりトルクτｓの測定値を受信し、減速機構１０の減速比Ｒ_ｇの逆数に等しい利得を乗じてモータ２側のトルク値に変換してＳＶＭＮＣ５５ａに制御入力として出力する。利得要素５０ｄは、モータ２から入力された電流指令値ｉ_ｑ ^ｒｅｆにトルク定数に相当する利得係数Ｋ_ｍを乗じてトルク値に変換し、ＳＶＭＮＣ５５ａに制御入力として出力する。 Next, the configuration of the disturbance observer device 50 will be described. The disturbance observer device 50 includes an SVMNC 55a, an adder 50a, gain elements 50b, 50c, 50d, and an adder 50e that perform control calculations for suppressing disturbances due to the frictional torque _τdm . The gain element 50c receives the measured value of the torsional torque τs, multiplies it by a gain equal to the reciprocal of the speed reduction ratio _Rg of the speed reduction mechanism 10, converts it to a torque value on the side of the motor 2, and outputs it to the SVMNC 55a as a control input. The gain element 50d multiplies the current command value i _q ^ref input from the motor 2 by a gain coefficient K _m corresponding to a torque constant, converts it into a torque value, and outputs it to the SVMNC 55a as a control input.

ＳＶＭＮＣ５５ａは、ねじりトルクτｓをモータ２側のトルク値に変換した値と、電流指令値ｉ_ｑ ^ｒｅｆをトルク値に変換した値とモータ２の回転速度ω_ｍとに基づいて、摩擦トルクτ_ｄｍを補償するトルク補償値を定常成分と過渡成分とに分けて算出し、算出結果を別々に出力するように構成される。特に、トルク補償値の過渡成分に関しては、対応する高次の応答特性関数を表す多項式の各々の次数について、トルク補償値の１次成分、２次成分、・・・、Ｎ次成分を別々に出力するように構成される。 The SVMNC 55a calculates the friction torque τdm based on the value obtained by converting the torsional torque τs into a torque value on the motor 2 side, the ^{value obtained by converting the current command value iqref} _into _a torque value, and the rotation speed _ωm of the motor 2. A torque compensation value to be compensated is calculated separately for a steady component and a transient component, and the calculated results are output separately. In particular, regarding the transient component of the torque compensation value, for each degree of the polynomial representing the corresponding high-order response characteristic function, the first-order component, second-order component, . configured to output

ＳＶＭＮＣ５５ａでは、まず、モータ２側の力学モデルに基づいて摩擦トルクτ_ｄｍの理論値が次式によって算出される。
τ_ｄｍ＝Ｋ_ｍ・ｉ_ｑ ^ｒｅｆ－τｓ／Ｒ_ｇ＋Ｊ_ｍｎ・ω_ｍ・ｓ
ここで、Ｊ_ｍｎはモータ２のイナーシャである。三項目はモータ２のイナーシャＪ_ｍｎにモータ２の回転速度ω_ｍの微分値、すなわち回転加速度ω_ｍ’をかけた値に相当する項であり、モータ２の出力軸２ａに生じたトルクに相当する値である。 In the SVMNC 55a, first, the theoretical value of the friction torque _τdm is calculated by the following equation based on the dynamic model of the motor 2 side.
τ _dm = K _m ·i _q ^ref −τs/R _g +J _mn ·ω _m ·s
where J _mn is the inertia of motor 2; The third term corresponds to the value obtained by multiplying the inertia J _mn of the motor 2 by the differential value of the rotation speed ω _m of the motor 2, that is, the rotation acceleration ω _m ', and corresponds to the torque generated in the output shaft 2a of the motor 2. is the value to

その後、ＳＶＭＮＣ５５ａでは、算出した摩擦トルクτ_ｄｍの理論値が高次のローパスフィルタに通されることで、トルク補償値が算出される。高次のローパスフィルタは、その特性がモータ２の回転速度ω_ｍに応じて可変となるように設計されている。このように、補償値算出部としてのＳＶＭＮＣ５５ａは、上記の３つの制御入力に基づいて摩擦トルクτ_ｄｍの理論値を算出し、算出された摩擦トルクτ_ｄｍの理論値をフィルタするフィルタとして構成されている。図４に示すモデルでは、ＳＶＭＮＣ５５ａの出力は、摩擦トルクτ_ｄｍと高次のローパスフィルタの伝達関数の積となる。フィルタの特性は感度関数として表される。本実施形態では、このローパスフィルタが３次のローパスフィルタであり、感度関数が下記の式に設計されている。

感度関数は、高次多項式として記述され、ｇ_０、ｇ_１、ｇ_２、ｇ_３は、極に対応する。本実施形態の場合、フィルタは、極が異なる値を有するように、すなわち、設計極が単根であるように設計されている。なお、上記の式におけるパラメータφ_０、φ_１、φ_２は以下の数式で表されるようにモータ２の回転速度ω_ｍに依存して増減する利得係数であり、以下の式で表される。

After that, the SVMNC 55a calculates a torque compensation value by passing the calculated theoretical value of the friction torque _τdm through a high-order low-pass filter. The high-order low-pass filter is designed so that its characteristics are variable according to the rotation speed ω _m of the motor 2 . In this way, the SVMNC 55a as the compensation value calculator is configured as a filter that calculates the theoretical value of the friction torque _τdm based on the above three control inputs, and filters the calculated theoretical value of the friction torque _τdm . ing. In the model shown in FIG. 4, the output of SVMNC 55a is the product of the frictional torque τ _dm and the transfer function of a high-order low-pass filter. The filter characteristics are expressed as a sensitivity function. In this embodiment, this low-pass filter is a third-order low-pass filter, and the sensitivity function is designed according to the following equation.

The sensitivity function is written as a higher order polynomial, with g ₀ , g ₁ , g ₂ , g ₃ corresponding to the poles. For this embodiment, the filter is designed such that the poles have different values, ie the design pole is a single root. The parameters φ ₀ , φ ₁ , and φ ₂ in the above equation are gain coefficients that increase or decrease depending on the rotation speed ω _m of the motor 2 as expressed by the following equations, and are expressed by the following equations. .

ここで、α_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）はモータ２の回転速度ω_ｍに依存する係数であり、図６に示すスイッチングパターンによって定められる値である。図６は、横軸がモータ２の回転速度ω_ｍであり、縦軸がα_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）（図６ではα_１、α_２、α_３と表記）の値であり、回転速度ω_ｍに応じたα_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）の値を示すグラフである。スイッチングパターンでは、α_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）の値がモータ２の回転速度ω_ｍに応じて定められるようになっている。このスイッチングパターンは、事前に観測したスリップ－スティック現象などに基づいて適宜設定できる。 Here, α ₁ (ω _m ), α ₂ (ω _m ), and α ₃ (ω _m ) are coefficients dependent on the rotation speed ω _m of the motor 2, and are values determined by the switching pattern shown in FIG. . In FIG. 6, the horizontal axis is the rotational speed ω _m of the motor 2, and the vertical axis is α ₁ (ω _m ), α ₂ (ω _m ), α ₃ (ω _m ) (α ₁ , α ₂ , ω m ) in FIG. 1 (denoted as α ₃ )), and is a graph showing the values of α ₁ (ω _m ), α ₂ (ω _m ), and α ₃ (ω _m ) according to the rotational speed ω _m . In the switching pattern, the values of α ₁ (ω _m ), α ₂ (ω _m ), and α ₃ (ω _m ) are determined according to the rotational speed ω _m of the motor 2 . This switching pattern can be appropriately set based on the slip-stick phenomenon observed in advance.

回転速度ω_ｍに応じてα_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）の値が変わり、パラメータφ_０、φ_１、φ_２の値も変わるので、感度関数も変わる。このように、回転速度ω_ｍに応じてローパスフィルタの感度（特性）を変えることができる。なお、上記の感度関数を有するローパスフィルタの伝達関数は、例えば、α_１（ω_ｍ）＝α_２（ω_ｍ）＝α_３（ω_ｍ）＝１の場合、下記式で表される。

上記式の第１項目が０次成分、第２項目が１次成分、第３項目が２次成分、第４項目が３次成分に対応する。このように、伝達関数の各項に含まれる極ｇ_０、ｇ_１、ｇ_２、ｇ_３が異なるので、上記の感度関数において極ｇ_０、ｇ_１、ｇ_２、ｇ_３の値を適宜設定することで、伝達関数の定常成分（第１項に相当）の特性と過渡成分（第２項から第４項に相当）の特性を分けて設計できる。ＳＶＭＮＣ５５ａは、出力が摩擦トルクτ_ｄｍとこの伝達関数との積であり、伝達関数が次数毎に分離可能であるので、トルク補償値を次数毎に分離して算出できる。 The values of α ₁ (ω _m ), α ₂ (ω _m ), and α ₃ (ω _m ) change according to the rotation speed ω _m , and the values of the parameters φ ₀ , φ ₁ , and φ ₂ also change, so the sensitivity function is also change. In this way, the sensitivity (characteristics) of the low-pass filter can be changed according to the rotational speed _ωm . The transfer function of the low-pass filter having the above sensitivity function is expressed by the following equation, for example, when α ₁ (ω _m )=α ₂ (ω _m )=α ₃ (ω _m )=1.

In the above equation, the first term corresponds to the 0th order component, the second term corresponds to the first order component, the third term corresponds to the second order component, and the fourth term corresponds to the third order component. In this way, since the poles g ₀ , g ₁ , g ₂ , and g ₃ included in each term of the transfer function are different, the values of the poles g ₀ , g ₁ , g ₂ , and g ₃ in the above sensitivity function are appropriately set. By doing so, the characteristics of the steady component (corresponding to the first term) and the characteristics of the transient components (corresponding to the second to fourth terms) of the transfer function can be designed separately. The output of the SVMNC 55a is the product of the friction torque τ _dm and this transfer function, and since the transfer function can be separated for each order, the torque compensation value can be calculated separately for each order.

外乱オブザーバ装置５０は、可変利得要素５９（ａ）、５９（ｂ）、５９（ｃ）をさらに備えている。可変利得要素５９（ａ）は可変利得α_１（ω_ｍ）を有し、可変利得要素５９（ｂ）は可変利得α_２（ω_ｍ）を有し、可変利得要素５９（ｃ）は可変利得α_３（ω_ｍ）を有している。可変利得α_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）の値は、モータ２の回転速度ω_ｍに基づいて変えることができる。可変利得要素５９（ａ）は、ＳＶＭＮＣ５５ａで算出された摩擦トルクτ_ｄｍのトルク補償値の１次成分が入力され、可変利得要素５９（ｂ）は、トルク補償値の２次成分が入力され、可変利得要素５９（ｃ）は、トルク補償値の３次成分が入力される。可変利得要素５９（ａ）、５９（ｂ）、５９（ｃ）は、入力されたトルク補償値（トルク補償値の１次成分、２次成分および３次成分）に、各々が有する可変利得α_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）を乗算する。可変利得要素５９（ａ）、５９（ｂ）、５９（ｃ）は、利得を乗算された摩擦トルクτ_ｄｍの過渡成分のトルク補償値を加算器５０ｅにそれぞれ出力する。なお、可変利得要素５９（ａ）、５９（ｂ）、５９（ｃ）の可変利得α_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）は、図６に示すスイッチングパターンを用い、回転速度ω_ｍに基づいて設定される。 The disturbance observer device 50 further comprises variable gain elements 59(a), 59(b), 59(c). Variable gain element 59(a) has a variable gain α ₁ (ω _m ), variable gain element 59(b) has a variable gain α ₂ (ω _m ), and variable gain element 59(c) has a variable gain α ₃ (ω _m ). The values of the variable gains α ₁ (ω _m ), α ₂ (ω _m ), α ₃ (ω _m ) can be changed based on the rotation speed ω _m of the motor 2 . The variable gain element 59(a) receives the primary component of the torque compensation value of the friction torque τ _dm calculated by the SVMNC 55a, and the variable gain element 59(b) receives the secondary component of the torque compensation value, Variable gain element 59(c) receives the cubic component of the torque compensation value. Variable gain elements 59(a), 59(b), and 59(c) apply variable gain α ₁ (ω _m ), α ₂ (ω _m ), α ₃ (ω _m ). The variable gain elements 59(a), 59(b), 59(c) each output a torque compensation value of the transient component of the friction torque τ _dm multiplied by the gain to the adder 50e. Note that the variable gains α ₁ (ω _m ), α ₂ (ω _m ), and α ₃ (ω _m ) of the variable gain elements 59(a), 59(b), and 59(c) have switching patterns shown in FIG. is set based on the rotational speed ω _m .

加算器５０ｅは、３つの摩擦トルクτ_ｄｍの過渡成分を加算し、合計値を加算器５０ａに出力する。加算器５０ａにはＳＶＭＮＣ５５ａから摩擦トルクτ_ｄｍの定常成分も入力されており、加算器５０ａは、摩擦トルクτ_ｄｍの定常成分（０次成分）と、摩擦トルクτ_ｄｍの過渡成分の合計値とを加算し、利得要素５０ｂに出力する。利得要素５０ｂは、加算器５０ａから受信した出力に、トルク定数に相当する利得係数Ｋ_ｍの逆数に等しい利得を乗じてトルク値を電流値に変換し、トルク補償指令信号として加減算器５３０に出力する。加減算器５３０は、制御目標値である電流ｉ_ｃｔｒｌにトルク補償指令信号を加算する。このように、トルク補償指令信号が制御目標値である電流ｉ_ｃｔｒｌに加算されることで、外乱が抑えられ、スティック－スリップ現象が抑制される。 The adder 50e adds the transient components of the three friction torques _τdm and outputs the sum to the adder 50a. The steady component of the friction torque τ _dm is also input to the adder 50a from the SVMNC 55a, and the adder 50a calculates the total value of the steady component (zero-order component) of the friction torque τ _dm and the transient component of the friction torque τ _dm . are added and output to gain element 50b. Gain element 50b multiplies the output received from adder 50a by a gain equal to the reciprocal of gain coefficient _Km corresponding to the torque constant, converts the torque value to a current value, and outputs it to adder/subtractor 530 as a torque compensation command signal. do. The adder/subtractor 530 adds the torque compensation command signal to the current i _ctrl which is the control target value. In this way, by adding the torque compensation command signal to the current i _ctrl which is the control target value, the disturbance is suppressed and the stick-slip phenomenon is suppressed.

なお、α_１（ω_ｍ）＝α_２（ω_ｍ）＝α_３（ω_ｍ）＝０の速度領域では、過渡成分に乗算されるすべてのゲインが０であるので、定常成分のみからトルク補償指令信号が算出される。α_１（ω_ｍ）＝１、α_２（ω_ｍ）＝α_３（ω_ｍ）＝０の速度領域では、定常成分と１次成分との和からトルク補償指令信号が算出される。α_１（ω_ｍ）＝α_２（ω_ｍ）＝１、α_３（ω_ｍ）＝０の速度領域では、定常成分と１次成分と２次成分との和からトルク補償指令信号が算出される。α_１（ω_ｍ）＝α_２（ω_ｍ）＝α_３（ω_ｍ）＝１の速度領域では、定常成分と１次成分と２次成分と３次成分との和からトルク補償指令信号が算出される。 In the speed region of α ₁ (ω _m )=α ₂ (ω _m )=α ₃ (ω _m )=0, all the gains multiplied by the transient component are 0. A command signal is calculated. In the speed region where α ₁ (ω _m )=1 and α ₂ (ω _m )=α ₃ (ω _m )=0, the torque compensation command signal is calculated from the sum of the steady component and the primary component. In the speed region of α ₁ (ω _m )=α ₂ (ω _m )=1 and α ₃ (ω _m )=0, the torque compensation command signal is calculated from the sum of the steady component, primary component and secondary component. be. In the speed region of α ₁ (ω _m )=α ₂ (ω _m )=α ₃ (ω _m )=1, the torque compensation command signal is obtained from the sum of the stationary component, the primary component, the secondary component, and the tertiary component. Calculated.

以上より、第１の実施形態に係る外乱オブザーバ装置５０によれば、モータ２の回転速度ω_ｍに応じて感度関数の特性を変えて摩擦トルクτ_ｄｍのトルク補償値を算出するので、クーロン摩擦や粘性摩擦がモータ２の出力軸２ａに作用することにより生じた過渡的な外乱（スティック－スリップ現象）を動的に抑制できる。よって、摩擦特性の高精度なモデリングおよび関連する物理量の高精度な同定を必要とせずに、モータ２の出力軸２ａに作用する摩擦力の推定とこれによる外乱の補償を適切に行うことができるモータ制御装置５を実現することができる。 As described above, according to the disturbance observer device 50 according to the first embodiment, since the torque compensation value of the friction torque τ _dm is calculated by changing the characteristics of the sensitivity function according to the rotation speed ω _m of the motor 2, Coulomb friction A transitional disturbance (stick-slip phenomenon) caused by viscous friction acting on the output shaft 2a of the motor 2 can be dynamically suppressed. Therefore, it is possible to appropriately estimate the frictional force acting on the output shaft 2a of the motor 2 and compensate for the disturbance caused by this, without requiring high-precision modeling of frictional characteristics and high-precision identification of related physical quantities. A motor control device 5 can be realized.

（１－３）第１の実施形態に係る外乱オブザーバによる外乱抑制効果の評価
次に、第１の実施形態に係る外乱オブザーバ装置５０による外乱抑制効果について検討する。以下では、従来の外乱抑制方式であるＺＯＭＮＣ（Zero-order MNC)を用いた外乱抑制方式を比較対象として、外乱オブザーバ装置５０を評価する。ＺＯＭＮＣは、摩擦外乱をステップ関数として近似して設計したＭＮＣであり、摩擦トルクτ_ｄｍの定常成分（ゼロ次成分）からトルク補償値を算出する。ここでは、図４に示すモデルをＳＶＭＮＣ及びＺＯＭＮＣの両方でシミュレーションし、シミュレーションにより得られたモータ２の回転速度ω_ｍと回転加速度ω_ｍ’の過渡特性を比較することで、外乱抑制効果を評価した。図７にシミュレーション結果を示す。図７の紙面上部のグラフが回転加速度ω_ｍ’の結果であり、横軸が時間、縦軸が回転加速度ω_ｍ’を表す。図７の紙面下部のグラフが回転速度ω_ｍの結果であり、横軸が時間、縦軸が回転速度ω_ｍを示す。図７のグラフでは、点線がＳＶＭＮＣの結果であり、実線がＺＯＭＮＣの結果である。 (1-3) Evaluation of disturbance suppression effect by disturbance observer according to first embodiment Next, the disturbance suppression effect by the disturbance observer device 50 according to the first embodiment will be examined. In the following, the disturbance observer device 50 is evaluated with a disturbance suppression method using ZOMNC (Zero-order MNC), which is a conventional disturbance suppression method, as a comparison target. ZOMNC is an MNC designed by approximating a frictional disturbance as a step function, and calculates a torque compensation value from a stationary component (zero-order component) of the frictional torque _τdm . Here, the model shown in FIG. 4 is simulated by both SVMNC and ZOMNC, and the disturbance suppression effect is evaluated by comparing the transient characteristics of the rotation speed ω _m and rotation acceleration ω _m ′ of the motor 2 obtained by the simulation. bottom. FIG. 7 shows simulation results. The graph at the top of the page of FIG. 7 shows the results of the rotational acceleration ω _m ', where the horizontal axis represents time and the vertical axis represents the rotational acceleration ω _m '. The graph at the bottom of FIG. 7 shows the results of the rotational speed _ωm , where the horizontal axis indicates time and the vertical axis indicates the rotational speed _ωm . In the graph of FIG. 7, the dotted line is the result of SVMNC, and the solid line is the result of ZOMNC.

ＺＯＭＮＣでは、モータ２の回転速度ω_ｍの波形は、図のＳＶ（Ａ）およびＳＶ（Ｂ）の領域に示すような不規則に歪んだ形であった。図７のＳＶ（Ａ）およびＳＶ（Ｂ）の領域に対応するＳＶ（１）およびＳＶ（２）の領域にみられるように、モータ２の回転速度ω_ｍがほぼゼロとなった後に急激に変化しており、スティック－スリップ現象が生じていることがわかる。また、回転加速度ω_ｍ’についても、図７のＰＫ（Ａ）およびＰＫ（Ｂ）の領域に示すように、モータ２の回転速度ω_ｍの急激な変化に対応するピークがみられ、スティック－スリップ現象が生じていることが確認できる。 In ZOMNC, the waveform of the rotational speed ω _m of the motor 2 was irregularly distorted as shown in the regions SV(A) and SV(B) in the figure. As can be seen in the regions SV(1) and SV(2) corresponding to the regions SV(A) and SV(B) in _FIG . It can be seen that the stick-slip phenomenon has occurred. In addition, as shown in the regions PK(A) and PK(B) in FIG. 7, the rotational acceleration ω _m ' also has peaks corresponding to rapid changes in the rotational speed ω _m of the motor 2. It can be confirmed that a slip phenomenon has occurred.

一方で、ＳＶＭＮＣでは、モータ２の回転速度ω_ｍの波形はＺＯＭＮＣの波形より滑らかになっており、ＺＯＭＮＣにおいてスティック－スリップ現象が見られた図７のＳＶ（１）およびＳＶ（２）の領域においても、スティック－スリップ現象が生じていないことが確認できる。このように、第１の実施形態の外乱オブザーバはスティック－スリップ現象を抑制できることが確認できた。 On the other hand, in the SVMNC, the waveform of the rotational speed ω _m of the motor 2 is smoother than that of the ZOMNC. Also, it can be confirmed that the stick-slip phenomenon does not occur. Thus, it was confirmed that the disturbance observer of the first embodiment can suppress the stick-slip phenomenon.

＜２＞第２の実施形態
以下、図４と同じ構成には同じ番号を付した図８を参照しながら、本発明の第２の実施形態に係るモータ制御装置について説明する。第２の実施形態のモータ制御装置は、第１の実施形態のモータ制御装置に対して、モータ２の回転速度ω_ｍに基づいて算出した正の高周波ダンピング項を制御目標値である電流ｉ_ｃｔｒｌにフィードバックすることで、モータ２の回転速度ω_ｍ及び回転加速度ω_ｍ’の高周波成分を除去し、スティック－スリップ現象をより抑制できるようにしたものである。第２の実施形態のモータ制御装置は、高周波ダンピング項演算部５７と、高周波ダンピング項演算部５７の出力を制御目標値である電流ｉ_ｃｔｒｌにフィードバックするか否かを切り替える切り替えスイッチＳＷ（１）とを有している点で、第１の実施形態のモータ制御装置と異なる。他の構成は第１の実施形態のモータ制御装置と同じであるので説明を省略する。 <2> Second Embodiment Hereinafter, a motor control device according to a second embodiment of the present invention will be described with reference to FIG. 8 in which the same numbers are assigned to the same configurations as in FIG. Unlike the motor control device of the first embodiment, the motor control device of the second embodiment adds a positive high-frequency damping term calculated based on the rotation speed ω _m of the motor 2 to the current i _ctrl that is the control target value. , the high-frequency components of the rotation speed ω _m and the rotation acceleration ω _{m ′} of the motor 2 are removed, thereby further suppressing the stick-slip phenomenon. The motor control device of the second embodiment includes a high-frequency damping term calculation unit 57 and a changeover switch SW(1) for switching whether or not to feed back the output of the high-frequency damping term calculation unit 57 to the current i _ctrl that is the control target value. and is different from the motor control device of the first embodiment. Since other configurations are the same as those of the motor control device of the first embodiment, description thereof is omitted.

高周波ダンピング項演算部５７は、ゲインがＫ_νの利得要素５７ａと、高周波成分算出要素５７ｂとを備えている。高周波成分算出要素５７ｂは、モータ２の回転速度ω_ｍが入力され、当該回転速度ω_ｍに基づいて、モータ２の回転速度及び回転加速度の高周波成分を除去するために、高周波ダンピング項として制御目標値にフィードバックする高周波成分を算出する。本実施形態では、高周波成分算出要素５７ｂは、伝達関数がｓ／（ｓ＋ｇ_ｄｍｐ）のハイパスフィルタとして構成されており、回転速度ω_ｍをハイパスフィルタに通して利得要素５７ａに出力する。図８に示すモデル上では、モータ２の回転速度ω_ｍとハイパスフィルタの伝達関数ｓ／（ｓ＋ｇ_ｄｍｐ）との積が算出される。なお、ｇ_ｄｍｐは、ハイパスフィルタの帯域である。利得要素５７ａは、高周波成分算出要素５７ｂの出力を、ゲインＫ_ν倍して高周波ダンピング項を算出する。このようにして、高周波ダンピング項演算部５７は、高周波ダンピング項τ_ＨＦＤ＝Ｋ_ν・ｓ／（ｓ＋ｇ_ｄｍｐ）・ω_ｍを算出する。算出された高周波ダンピング項τ_ＨＦＤは、利得要素５７ａから切り替えスイッチＳＷ（１）に出力される。 The high-frequency damping term calculator 57 includes a gain element 57a with a gain of _Kv and a high-frequency component calculation element 57b. The high-frequency component calculation element 57b receives the rotational speed _ωm of the motor 2, and calculates a control target as a high-frequency damping term in order to remove high-frequency components of the rotational speed and rotational acceleration of the motor 2 based on the rotational speed ωm _. Calculates the high-frequency component to feed back into the value. In this embodiment, the high-frequency component calculation element 57b is configured as a high-pass filter with a transfer function of s/(s+g _dmp ), and outputs the rotation speed ω _m through the high-pass filter to the gain element 57a. On the model shown in FIG. 8, the product of the rotation speed ω _m of the motor 2 and the transfer function s/(s+g _dmp ) of the high-pass filter is calculated. Note that g _dmp is the band of the high-pass filter. The gain element 57a multiplies the output of the high frequency component calculation element 57b by a gain _Kv to calculate a high frequency damping term. In this manner, the high frequency damping term calculator 57 calculates the high frequency damping term τ _HFD =K _ν ·s/(s+g _dmp )·ω _m . The calculated high frequency damping term τ _HFD is output from the gain element 57a to the switch SW(1).

切り替えスイッチＳＷ（１）は、モータ２の回転速度ω_ｍに基づいて高周波ダンピング項τ_ＨＦＤを加減算器５３０（２）に出力するか否か切り替える。切り替えスイッチＳＷ（１）は、モータ２の回転速度の絶対値｜ω_ｍ｜が所定のスイッチ切り替え閾値ω_ｔｈｌｄ以上の場合、オン状態にされて加減算器５３０（２）に接続され、高周波ダンピング項τ_ＨＦＤを加減算器５３０（２）に出力する。一方、切り替えスイッチＳＷ（１）は、モータ２の回転速度の絶対値｜ω_ｍ｜が所定のスイッチ切り替え閾値ω_ｔｈｌｄより小さい場合、オフ状態にされて加減算器５３０（２）との接続が切断され、高周波ダンピング項τ_ＨＦＤを加減算器５３０（２）に出力できなくされる。スイッチ切り替え閾値ω_ｔｈｌｄは、スティック－スリップ現象の観察から実験的に適宜決められる値である。加減算器５３０（２）は、制御目標値である電流ｉ_ｃｔｒｌから高周波ダンピング項τ_ＨＦＤを減算して制御目標値を補正し、減算結果（補正された制御目標値）を加減算器５３０（１）に出力する。加減算器５３０（１）では、加減算器５３０（２）から入力された電流（補正された制御目標値）と外乱オブザーバ装置５０で算出されたトルク補償指令信号とが加算されて電流指令値Ｉ_ｑ ^ｒｅｆが算出される。 A switch SW(1) switches whether or not to output the high-frequency damping term τ _HFD to the adder/subtractor 530(2) based on the rotational speed ω _m of the motor 2 . _The change-over switch SW(1) is turned on and connected to the adder/subtractor 530(2) when the absolute value of the rotation speed |ω _m | τ _HFD is output to adder/subtractor 530(2). On the other hand, when the absolute value |ω _m | of the rotational speed of the motor 2 is smaller than the predetermined switch switching threshold ω _thld , the switch SW(1) is turned off and disconnected from the adder/subtractor 530(2). and disables the high frequency damping term τ _HFD from being output to adder/subtractor 530(2). The switch switching threshold ω _thld is a value appropriately determined experimentally from the observation of the stick-slip phenomenon. The adder/subtractor 530(2) subtracts the high frequency damping term τ _HFD from the current i _ctrl which is the control target value to correct the control target value, and the subtraction result (corrected control target value) is sent to the adder/subtractor 530(1). output to Adder/subtractor 530(1) adds the current (corrected control target value) input from adder/subtractor 530(2) and the torque compensation command signal calculated by disturbance observer device 50 to obtain current command value _Iq. ^ref is calculated.

以下、図９を参照しながら、第２の実施形態に係る外乱抑制方式による外乱抑制効果をシミュレーションおよび実験により評価した結果について検討する。以下、第２の実施形態に従って実施される外乱抑制方式を「ＳＶＭＮＣ＋ＨＦｄａｍｐｉｎｇ」と略記する。ここでは、第１の実施形態で行ったシミュレーションのモデルに、高周波ダンピング項演算部５７を追加したモデルを用いてシミュレーションし、得られた結果を、第１実施形態のＳＶＭＮＣのシミュレーション結果と比較した。その結果を図９に示す。図９では。ＳＶＭＮＣ＋ＨＦｄａｍｐｉｎｇは一点鎖線で表されている。 The results of evaluating the disturbance suppression effect of the disturbance suppression method according to the second embodiment through simulations and experiments will be discussed below with reference to FIG. 9 . Hereinafter, the disturbance suppression method implemented according to the second embodiment is abbreviated as "SVMNC+HF damping". Here, a simulation is performed using a model obtained by adding a high-frequency damping term calculation unit 57 to the simulation model performed in the first embodiment, and the obtained results are compared with the simulation results of the SVMNC of the first embodiment. . The results are shown in FIG. In FIG. SVMNC+HF damping is represented by a dashed line.

図９に示すように、ＳＶＭＮＣ＋ＨＦｄａｍｐｉｎｇの波形がＳＶＭＮＣの波形と比較してなだらかになっており、高周波成分が除去されて、よりスティック－スリップ現象が抑制されていることが確認できる。 As shown in FIG. 9, the SVMNC+HF damping waveform is smoother than the SVMNC waveform, and it can be confirmed that the high frequency components are removed and the stick-slip phenomenon is further suppressed.

高周波成分の除去効果をより検証するために、上記のシミュレーション結果を用いてモータ２の回転加速度ω_ｍ’応答の周波数解析を行った。周波数解析は７～８ｓｅｃの加速度応答１サイクルのデータを高速フーリエ変換することで行った。その結果を図１０と図１１に示す。図１０は横軸を周波数とし、図１１では横軸を高調波の次数とし、縦軸は両図とも回転加速度ω_ｍ’として示している。図１０中の実線の内、線が太く５Ｈｚ付近に大きなピークがある方がＺＯＭＮＣの結果である。図１０において、ＳＶＭＮＣとＳＶＭＮＣ＋ＨＦｄａｍｐｉｎｇを比較すると、特に周波数が高い領域（１０Ｈｚから１５Ｈｚの領域）でＳＶＭＮＣ＋ＨＦｄａｍｐｉｎｇの方が、この領域の周波数成分の回転加速度が低く、高周波成分が抑制されていることがわかる。また、図１１を見ると、ほぼすべての高調波で、ＳＶＭＮＣよりもＳＶＭＮＣ＋ＨＦｄａｍｐｉｎｇの方が回転加速度が低く、高周波成分が抑制されていることがわかる。 In order to further verify the effect of removing high-frequency components, frequency analysis of the rotational acceleration ω _m ′ response of the motor 2 was performed using the above simulation results. Frequency analysis was performed by fast Fourier transforming the data of one cycle of acceleration response of 7 to 8 seconds. The results are shown in FIGS. 10 and 11. FIG. In FIG. 10, the horizontal axis represents the frequency, in FIG. 11 the horizontal axis represents the harmonic order, and in both figures the vertical axis represents the rotational acceleration ω _m '. Of the solid lines in FIG. 10, the thick line with a large peak near 5 Hz is the result of ZOMNC. In FIG. 10, when SVMNC and SVMNC+HF damping are compared, it can be seen that SVMNC+HF damping has lower rotational acceleration of frequency components in this region, particularly in the high frequency region (10 Hz to 15 Hz region), and high frequency components are suppressed. . Also, from FIG. 11, it can be seen that the rotation acceleration is lower with SVMNC+HF damping than with SVMNC, and high-frequency components are suppressed for almost all harmonics.

このことは、ＴＨＤ（Total Harmonic Distortion：全高調波歪）の計算結果からもわかる。ＴＨＤの計算結果を表１に示す。表１に示すように、ＳＶＭＮＣよりもＳＶＭＮＣ＋ＨＦｄａｍｐｉｎｇの方がＴＨＤが低くて非線形性が低く、より高周波成分が除去されていることがわかる。

This can also be seen from the calculation results of THD (Total Harmonic Distortion). Table 1 shows the calculation results of THD. As shown in Table 1, it can be seen that SVMNC+HF damping has lower THD and lower nonlinearity than SVMNC, and removes more high-frequency components.

＜３＞第３の実施形態
以下、図４と同じ構成には同じ番号を付した図１２を参照しながら、本発明の第３の実施形態に係るモータ制御装置について説明する。第３の実施形態のモータ制御装置は、第１の実施形態のモータ制御装置とは、補償値演算部としてのＳＴＭＮＣ５５ｂを備える点で異なる。第３の実施形態のモータ制御装置の外乱オブザーバ装置６０のＳＴＭＮＣ５５ｂは、２次系として構成され、モータ２の回転速度ω_ｍに基づいて感度関数を設定される。他の構成は第１の実施形態のモータ制御装置と同じなので、説明を省略する。 <3> Third Embodiment Hereinafter, a motor control device according to a third embodiment of the present invention will be described with reference to FIG. 12 in which the same numbers are assigned to the same configurations as in FIG. The motor control device of the third embodiment differs from the motor control device of the first embodiment in that it includes an STMNC 55b as a compensation value calculator. The STMNC 55b of the disturbance observer device 60 of the motor control device of the third embodiment is configured as a second-order system and has a sensitivity function set based on the rotation speed ω _m of the motor 2 . Since other configurations are the same as those of the motor control device of the first embodiment, description thereof is omitted.

ＳＴＭＮＣ５５ｂは、第１の実施形態と同様に、ねじりトルクτｓをモータ２側のトルク値に変換した値と、電流指令値ｉ_ｑ ^ｒｅｆをトルク値に変換した値と、モータ２の回転速度ω_ｍとに基づいて、摩擦トルクτ_ｄｍを算出する。ＳＴＭＮＣ５５ｂは、算出した摩擦トルクτ_ｄｍを下記の式で表される感度関数を有する２次のローパスフィルタに通すことで、摩擦トルクτ_ｄｍの補償値を算出する。

ここで、α_１、α_２は、モータ２の回転速度ω_ｍに依存する値であり、上述の図６に示すスイッチングパターンにより定められる値である。この実施形態では、フィルタは設計極が重根に設計されており、極であるｇ_ｄｍはローパスフィルタの帯域であり、適宜設定できる。 As in the first embodiment, the STMNC 55b outputs a value obtained by converting the torsional torque τs into a torque value on the motor 2 side, a value obtained by converting the current command value i _q ^ref into a torque value, and the rotation speed ω _m Friction torque τ _dm is calculated based on and. The STMNC 55b passes the calculated friction torque τ _dm through a secondary low-pass filter having a sensitivity function represented by the following formula to calculate a compensation value for the friction torque τ _dm .

Here, α ₁ and α ₂ are values that depend on the rotation speed ω _m of the motor 2 and are determined by the switching pattern shown in FIG. In this embodiment, the filter is designed with multiple design poles, and the pole _gdm is the band of the low-pass filter and can be set as appropriate.

この感度関数は、図６のスイッチングパターンにおいて、α_１＝α_２＝０となるモータ２の回転速度ω_ｍの速度領域では、

となり、０次の感度関数となる。また、感度関数は、α_１＝１、α_２＝０となるモータ２の回転速度ω_ｍの速度領域では、

となり、１次の感度関数とる。感度関数はα_１＝α_２＝１となるモータ２の回転速度ω_ｍの速度領域では、

となり、２次の感度関数となる。このように、モータ２の回転速度ω_ｍが極低速領域のときに高次の感度関数が使われて摩擦トルクτ_ｄｍの補償値の過渡成分が算出され、それ以外では０次の感度関数が使われて摩擦トルクτ_ｄｍの補償値の定常成分が算出される。 In the switching pattern of FIG _. 6, this _sensitivity _function is expressed as:

and becomes the 0th-order sensitivity function. In addition, _the _sensitivity function is expressed _as

and takes the first-order sensitivity function. In the speed region of the rotation speed ω _m of the motor 2 where the sensitivity function is α ₁ =α ₂ =1,

and becomes a second-order sensitivity function. In this way, when the rotation speed _ωm of the motor 2 is in the extremely low speed region, the high-order sensitivity function is used to calculate the transient component of the compensation value of the friction torque _τdm . is used to calculate the stationary component of the compensation value of the friction torque τ _dm .

このように、第３の実施形態のＳＴＭＮＣ５５ｂは、モータ２の回転速度ω_ｍに基づいて感度関数を切り替えることができる。ＳＴＭＮＣ５５ｂは、算出したトルク補償値の０次成分と１次成分と２次成分とをそれぞれ別々に出力する。０次成分は直接、１次成分は利得要素６１で可変利得α_１（ω_ｍ）を乗算されて、２次成分は利得要素６２で可変利得α_２（ω_ｍ）を乗算されて加算器５０ｆに入力される。α_１（ω_ｍ）、α_２（ω_ｍ）は、図６のスイッチングパターンによってモータ２の回転速度ω_ｍに基づいて定められる値である。 Thus, the STMNC 55b of the third embodiment can switch the sensitivity function based on the rotational speed _ωm of the motor 2. FIG. The STMNC 55b separately outputs the 0th-order component, the 1st-order component, and the 2nd-order component of the calculated torque compensation value. The 0th order component is directly multiplied by variable gain α ₁ (ω _m ) in gain element 61 for the 1st order component, and the 2nd order component is multiplied by variable gain α ₂ (ω _m ) in gain element 62 to adder 50f. is entered in α ₁ (ω _m ) and α ₂ (ω _m ) are values determined based on the rotation speed ω _m of the motor 2 by the switching pattern of FIG.

以上から、第３の実施形態のモータ制御装置は、第１実施形態のモータ制御装置と同様に、制御対象となるモータ２への電流指令値ｉ_ｑ ^ｒｅｆ、モータ２の回転速度ω_ｍおよび２モータに減速機構１０を介して接続された出力軸６のねじりトルクτｓの測定値を制御入力として受け取り、モータ２の出力軸２ａに作用する摩擦トルクτ_ｄｍによる外乱を抑制する外乱オブザーバ装置６０を備え、外乱オブザーバ装置６０が、上記制御入力に基づいて摩擦トルクτ_ｄｍを補償するトルク補償値を算出する補償値算出部（ＳＴＭＮＣ５５ｂ）を有し、ＳＴＭＮＣ５５ｂの特性を表す感度関数がモータ２の回転速度ω_ｍに基づいて設定されるので、第１の実施形態と同様の効果を奏する。さらに、第３の実施形態のモータ制御装置は、第１の実施形態のモータ制御装置に対して外乱オブザーバ装置６０の感度関数が簡便であるというメリットを有する。よって、第３の実施形態のモータ制御装置は、外乱オブザーバ装置６０での演算が第１の実施形態の外乱オブザーバ装置５０よりも比較的容易であり、計算負荷が軽い。そのため、第３の実施形態のモータ制御装置を、高性能の演算装置を用いなくても実装でき、安価に構成できる。さらに、第３の実施形態のモータ制御装置は、極やスイッチングパターンなどの設計すべきパラメータが少なく、感度関数を容易に設計できる。特に、モータ制御装置の設計において経験等によって設定した実際のモータ２や減速装置３のパラメータが、上述の運動学モデル２０、９０におけるモータ２や減速装置３のパラメータと当初より適合していた場合には、設計の試行回数を少なくできる。加えて、後述の強化学習により感度関数の制御パラメータの設計をする場合も第１実施形態の外乱オブザーバ装置５０と比較して計算負荷が軽く、安価な構成で強化学習を行うことができる。なお、第２の実施形態で説明した高周波ダンピング項演算部を、第３の実施形態のモータ制御装置に組み込むこともできる。 As described above, the motor control device of the third embodiment, like the motor control device of the first embodiment, provides the current command value i _q ^ref to the motor 2 to be controlled, the rotational speeds ω _m and 2 A disturbance observer device 60 that receives as a control input the measured value of the torsional torque τs of the output shaft 6 connected to the motor via the speed reduction mechanism 10 and suppresses the disturbance due to the friction torque _τdm acting on the output shaft 2a of the motor 2. The disturbance observer device 60 has a compensation value calculation unit (STMNC 55b) that calculates a torque compensation value for compensating the friction torque _τdm based on the control input, and the sensitivity function representing the characteristics of the STMNC 55b is the rotation of the motor 2. Since it is set based on the velocity _ωm , the same effect as in the first embodiment can be obtained. Furthermore, the motor control device of the third embodiment has the advantage that the sensitivity function of the disturbance observer device 60 is simpler than the motor control device of the first embodiment. Therefore, in the motor control device of the third embodiment, the calculations in the disturbance observer device 60 are relatively easier than those of the disturbance observer device 50 of the first embodiment, and the calculation load is light. Therefore, the motor control device of the third embodiment can be implemented without using a high-performance arithmetic device, and can be configured at low cost. Furthermore, the motor control device of the third embodiment has few parameters such as poles and switching patterns to be designed, and the sensitivity function can be easily designed. In particular, when the actual parameters of the motor 2 and speed reducer 3 set by experience in designing the motor control device match the parameters of the motor 2 and speed reducer 3 in the above kinematic models 20 and 90 from the beginning. can reduce the number of design iterations. In addition, when designing the control parameters of the sensitivity function by reinforcement learning, which will be described later, the calculation load is lighter than that of the disturbance observer device 50 of the first embodiment, and reinforcement learning can be performed with an inexpensive configuration. Note that the high-frequency damping term calculator described in the second embodiment can also be incorporated into the motor control device of the third embodiment.

＜４＞第４の実施形態
第４の実施形態のモータ制御装置は、制御パラメータとしてのスイッチ切り替え閾値ω_ｔｈｉｄ、モータ２への外乱補償特性を記述する特性係数、高周波成分算出要素５７ｂのフィルタ特性係数、および利得要素５７ａのゲインＫ_νなどの内の少なくとも一つ以上を機械学習により設定する機械学習装置を備えている点で、第１～第３の実施形態と異なる。モータへの外乱補償特性を記述する特性係数としては、例えば、極ｇ_０、ｇ_１、ｇ_２、ｇ_３、帯域（極）ｇ_ｄｍ、可変利得α_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）などがあげられ、高周波成分算出要素のフィルタ特性係数としては、帯域ｇ_ｄｍｐなどがあげられる。以下では、機械学習装置を中心に説明する。また、外乱オブザーバ装置５０の可変利得α_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）を機械学習する場合を例として説明する。なお、可変利得α_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）はモータ２の回転速度ω_ｍに依存して決まる値であるので、機械学習により設定するのは、可変利得α_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）の回転速度ω_m依存性（回転速度ω_ｍに基づいて可変利得の値を算出すための関数）の特性や上述のスイッチングパターンの波形の形状などである。 <4> Fourth Embodiment A motor control device according to a fourth embodiment includes a switch switching threshold ω _thid as a control parameter, a characteristic coefficient describing a disturbance compensation characteristic to the motor 2, a filter characteristic of a high-frequency component calculation element 57b, It differs from the first to third embodiments in that it includes a machine learning device that sets at least one of the coefficient and the gain K _ν of the gain element 57a by machine learning. Characteristic coefficients describing the disturbance compensation characteristics to the motor include, for example, poles g ₀ , g ₁ , g ₂ , g ₃ , band (pole) g _dm , variable gains α ₁ (ω _m ), α ₂ (ω _m ), α ₃ (ω _m ), and the like, and the filter characteristic coefficient of the high-frequency component calculation element includes the band g _dmp and the like. The machine learning device will be mainly described below. Also, a case of machine learning the variable gains α ₁ (ω _m ), α ₂ (ω _m ), and α ₃ (ω _m ) of the disturbance observer device 50 will be described as an example. Since the variable gains α ₁ (ω _m ), α ₂ (ω _m ), and α ₃ (ω _m ) are values determined depending on the rotation speed ω _m of the motor 2, they are set by machine learning as follows: Characteristic of rotation speed ω _m dependence of variable gains α ₁ (ω _m ), α ₂ (ω _m ), α ₃ (ω _m ) (function for calculating variable gain value based on rotation speed ω _m ) and the shape of the waveform of the switching pattern described above.

図１３は、第４の実施形態による機械学習装置１４０を備えるモータ制御装置５の概略的な機能ブロック図である。機械学習装置１４０は、状態観測部１８１と、判定データ生成部１８２と、学習部１４１と、意思決定部と１４８とを備える。状態観測部１８１は、モータ２の状態を、環境の現在状態を表す状態変数Ｓとして観測する。状態観測部１８１は、状態変数Ｓとして、モータ２の回転速度ω_ｍ（Ｓ１）、トルク値（Ｓ２）、モータ電流（Ｓ３）およびモータ温度（Ｓ４）などを観測する。状態観測部１８１は、これらすべての状態変数Ｓを観測してもよく、これらの状態変数Ｓの内の少なくとも一つを観測してもよい。また、状態観測部１８１は、上記以外のモータ２に関連する物理量を状態変数Ｓとして観測してもよい。 FIG. 13 is a schematic functional block diagram of a motor control device 5 having a machine learning device 140 according to the fourth embodiment. Machine learning device 140 includes state observation unit 181 , determination data generation unit 182 , learning unit 141 , and decision making unit 148 . The state observation unit 181 observes the state of the motor 2 as a state variable S representing the current state of the environment. The state observation unit 181 observes, as state variables S, the rotation speed ω _m (S1) of the motor 2, the torque value (S2), the motor current (S3), the motor temperature (S4), and the like. The state observing section 181 may observe all of these state variables S, or may observe at least one of these state variables S. Also, the state observation unit 181 may observe physical quantities related to the motor 2 other than those described above as the state variable S.

本実施形態では、状態観測部１８１は、上記の状態変数Ｓ１、Ｓ２、Ｓ３を外乱オブザーバ装置５０から取得し、モータ温度（Ｓ４）をモータ２の筐体に設けた図示しない温度検出器から取得している。ここで、モータ２の回転速度ω_ｍ（Ｓ１）は_、ロータリ・エンコーダ４から出力された角度位置に基づいてモータ制御装置５で算出され、外乱オブザーバ装置５０に入力される値である。そのため、状態観測部１８１は、モータ２の回転速度ω_ｍをモータ制御装置５から取得してもよい。さらに、状態観測部１８１は、ロータリ・エンコーダ４から角度位置を取得し、モータ２の回転速度ω_ｍを算出してもよい。また、トルク値（Ｓ２）は、トルクセンサ８で測定したねじりトルクτｓである。そのため、状態観測部１８１は、トルクセンサ８からトルク値としてねじりトルクτｓを取得してもよい。なお、トルク値は、ねじりトルクτｓに減速比の逆数をかけてモータ２側のトルク値に変換した値であってもよい。モータ電流（Ｓ３）は、モータ２の電流指令値ｉ_ｑ ^ｒｅｆである。電流指令値ｉ_ｑ ^ｒｅｆはモータ制御装置５で算出されるので、状態観測部１８１は、モータ制御装置５からモータ電流として電流指令値ｉ_ｑ ^ｒｅｆを取得してもよい。また、状態変数Ｓ１、Ｓ２、Ｓ３、Ｓ４は、後述の速度波形データと同じ期間の時系列データであってもよい。 In this embodiment, the state observation unit 181 acquires the state variables S1, S2, and S3 from the disturbance observer device 50, and acquires the motor temperature (S4) from a temperature detector (not shown) provided on the housing of the motor 2. are doing. Here, the rotation speed ω _m (S1) of the motor 2 is a value calculated by the motor control device 5 based on the angular position output _from the rotary encoder 4 and input to the disturbance observer device 50 . Therefore, the state observation unit 181 may acquire the rotation speed ω _m of the motor 2 from the motor control device 5 . Furthermore, the state observation unit 181 may acquire the angular position from the rotary encoder 4 and calculate the rotation speed ω _m of the motor 2 . The torque value (S2) is the torsional torque τs measured by the torque sensor 8. FIG. Therefore, the state observing section 181 may acquire the torsional torque τs from the torque sensor 8 as the torque value. Note that the torque value may be a value obtained by multiplying the torsional torque τs by the reciprocal of the speed reduction ratio to convert it into a torque value on the motor 2 side. A motor current (S3) is a current command value i _q ^ref for the motor 2 . Since the current command value i _q ^ref is calculated by the motor control device 5 , the state observation unit 181 may acquire the current command value i _q ^ref as the motor current from the motor control device 5 . Also, the state variables S1, S2, S3, and S4 may be time-series data of the same period as velocity waveform data, which will be described later.

判定データ生成部１８２は、モータ２の回転状態を示す速度波形データに基づく判定データＤを生成する。判定データＤは、状態変数Ｓの下で制御パラメータ（本実施形態の場合は、可変利得α_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）の少なくとも一つ）を変えた場合の結果を表す指標である。判定データ生成部１８２は、外乱オブザーバ装置５０からモータ２の回転速度ω_ｍを取得し、回転速度ω_ｍの時系列データである速度波形データを生成する。なお、速度波形データの期間（時系列データの収集期間）は、速度波形データ中にスティック－スリップ現象が少なくとも１回は現れる期間とするのが好ましい。例えば、モータ２の速度波形が正弦波状に周期的に変動している場合は、速度波形データの期間を当該正弦波の１周期の半分の期間とするのが好ましい。 The determination data generator 182 generates determination data D based on speed waveform data indicating the rotation state of the motor 2 . The determination data D is obtained by changing the control parameter (in this embodiment, at least one of the variable gains α ₁ (ω _m ), α ₂ (ω _m ), and α ₃ (ω _m )) under the state variable S. It is an index that expresses the results when The determination data generator 182 acquires the rotational speed _ωm of the motor 2 from the disturbance observer device 50 and generates velocity waveform data, which is time-series data of the rotational speed _ωm . It should be noted that the period of the speed waveform data (time series data collection period) is preferably a period during which the stick-slip phenomenon appears at least once in the speed waveform data. For example, when the speed waveform of the motor 2 periodically fluctuates like a sine wave, it is preferable to set the period of the speed waveform data to half the period of one cycle of the sine wave.

判定データ生成部１８２は、判定データＤとして、可変利得α_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）の少なくとも一つを変えた際の、モータ２の回転状態の適否、すなわち、速度波形データにおいて摩擦トルクτ_ｄｍによる影響（スティック－スリップ現象）を抑制できているか否かに対する適否判定値Ｄ１を用いることができる。例えば、適否判定値Ｄ１は、判定データ生成部１８２が生成した速度波形データと、スティック－スリップ現象が生じていないときの速度波形データとの差分を算出し、差分の絶対値の総和が所定閾値以上のときを不適とし、所定閾値より小さいときを適するとすることで算出できる。なお、本実施形態では、判定データ生成部１８２が速度波形データをそのまま用いて判定データＤを生成しているが、判定データ生成部１８２は、速度波形データを加工して判定データＤを生成してもよい。例えば、回転速度ω_ｍを微分して回転加速度を算出し、速度波形データを加速度波形データに加工し、加速度波形データから判定データＤを生成するようにしてもよい。また、速度波形データからＴＨＤを計算し、算出したＴＨＤに基づいてモータ２の回転状態の適否を判定するようにしてもよい。 The determination data generation unit 182 generates, as determination data D, the rotation state of the motor 2 when at least one of the variable gains α ₁ (ω _m ), α ₂ (ω _m ), and α ₃ (ω _m ) is changed. Adequacy, that is, the adequacy judgment value D1 can be used to determine whether or not the influence (stick-slip phenomenon) of the friction torque τ _dm can be suppressed in the velocity waveform data. For example, the propriety judgment value D1 is obtained by calculating the difference between the speed waveform data generated by the judgment data generation unit 182 and the speed waveform data when the stick-slip phenomenon does not occur, and the sum of the absolute values of the differences is the predetermined threshold value. It can be calculated by determining that the above is unsuitable and the time smaller than the predetermined threshold is suitable. In this embodiment, the determination data generator 182 uses the speed waveform data as it is to generate the determination data D, but the determination data generator 182 processes the speed waveform data to generate the determination data D. may For example, the rotational acceleration may be calculated by differentiating the rotational speed _ωm , the speed waveform data may be processed into acceleration waveform data, and the judgment data D may be generated from the acceleration waveform data. Alternatively, the THD may be calculated from the speed waveform data, and whether or not the rotation state of the motor 2 is appropriate may be determined based on the calculated THD.

学習部に１４１対して判定データＤと同時に入力される状態変数Ｓは、学習部１４１による学習周期で考えた場合、判定データＤが取得された１学習周期前のデータに基づくものとなる。このように、機械学習装置１４０が学習を進める間、環境においては、状態変数Ｓの取得、後述の意思決定部１４８による可変利得α_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）の再設定（変更）、変更後の可変利得α_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）を用いて算出したトルク補償値による摩擦トルクτ_ｄｍの制御、変更後の速度波形データ生成、判定データＤの生成が繰り返し実施される。 The state variable S input to the learning unit 141 at the same time as the determination data D is based on the data one learning cycle before the determination data D is acquired, when considering the learning cycle of the learning unit 141 . In this way, while the machine learning device 140 is learning, in the environment, the acquisition of the state variable S, the variable gains α ₁ (ω _m ), α ₂ (ω _m ), α ₃ ( ω _m ) reset (changed), control of the friction torque τ _dm by the torque compensation value calculated using the changed variable gains α ₁ (ω _m ), α ₂ (ω _m ), and α ₃ (ω _m ) , generation of velocity waveform data after change, and generation of judgment data D are repeated.

学習部１４１は、状態変数Ｓと判定データＤとを用いて、モータ２の回転状態と、制御パラメータとしての可変利得α_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）の少なくとも一つとを関連付けて学習する。学習部１４１は、機械学習と総称される任意の学習アルゴリズムにしたがって学習する。学習アルゴリズムの例については後述する。学習部１４１は、前述した状態変数Ｓと判定データＤとを含むデータ集合に基づく学習を反復実行することができる。上記学習中、状態変数Ｓは、上記したように１学習周期前に取得されたモータ２の回転速度ω_ｍ（Ｓ１）、トルク値（Ｓ２）、モータ電流（Ｓ３）およびモータ温度（Ｓ４）とし、判定データＤは、制御パラメータの変更が為された状態での今回の学習周期における速度波形データに基づく適否判定結果とする。 The learning unit 141 uses the state variable S and the determination data D to determine the rotation state of the motor 2 and variable gains α ₁ (ω _m ), α ₂ (ω _m ), and α ₃ (ω _m ) as control parameters. learn by associating with at least one of The learning unit 141 learns according to arbitrary learning algorithms collectively called machine learning. Examples of learning algorithms are described below. The learning unit 141 can repeatedly perform learning based on a data set including the state variable S and the determination data D described above. During the learning, the state variables S are assumed to be the rotational speed ω _m (S1), the torque value (S2), the motor current (S3), and the motor temperature (S4) of the motor 2 obtained one learning period before as described above. , and the determination data D is the propriety determination result based on the speed waveform data in the current learning cycle in the state where the control parameter is changed.

このような学習サイクルを繰り返すことにより、可変利得α_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）の少なくとも一つとモータ２の回転状態との相関性を暗示する特徴が次第に明らかになっていく。学習アルゴリズムの開始時には可変利得α_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）の少なくとも一つとモータ２の回転状態との相関性は実質的に未知であるが、学習部１４１は、学習を進めるに従い徐々に相関性についての特徴を見出し、相関性を解釈していく。可変利得α_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）の少なくとも一つとモータ２の回転状態との相関性がある程度信頼できる水準まで解釈されると、学習部１４１が反復出力する学習結果は、現在状態（つまり摩擦トルクτ_ｄｍによるスリップ－スティック現象が生じている状態）に対して、可変利得α_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）の少なくとも一つをどのように設定すべきかという行動（つまり意思決定）を行うために使用できるものとなる。つまり学習部１４１は、学習アルゴリズムの進行に伴い、上記相関性を最適解に徐々に近づけることができる。 By repeating such a learning cycle, at least one of the variable gains α ₁ (ω _m ), α ₂ (ω _m ), α ₃ (ω _m ) and the rotation state of the motor 2 have characteristics that imply a correlation. It becomes clear gradually. At the start of the learning algorithm, the correlation between at least one of the variable gains α ₁ (ω _m ), α ₂ (ω _m ), α ₃ (ω _m ) and the rotational state of the motor 2 is substantially unknown, but the learning The unit 141 gradually finds the characteristics of the correlation and interprets the correlation as the learning progresses. When the correlation between at least one of the variable gains α ₁ (ω _m ), α ₂ (ω _m ), and α ₃ (ω _m ) and the rotation state of the motor 2 is interpreted to a certain level of reliability, the learning unit 141 The learning results that are repeatedly output are variable gains α ₁ (ω _m ) _{, α 2} ₍ ω _m ), α ₃ (ω _m ) can be used to take action (ie, make a decision) on how to set at least one of them. That is, the learning unit 141 can gradually bring the correlation closer to the optimum solution as the learning algorithm progresses.

意思決定部１４８は、外乱オブザーバ装置５０に設定されている制御パラメータ（本実施形態では、可変利得α_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）の少なくとも一つ）を、学習部１４１による学習結果に基づいて算出された制御パラメータに設定する。また、意思決定部１４８は、学習時に、制御パラメータとモータ２の回転状態とを関連付ける学習をするために、外乱オブザーバ装置５０の制御パラメータを適宜設定する。このような機械学習装置１４０は、例えば図３に示したプロセッサ５１０の一機能として構成してもよく、図３に示す記憶部５２０に記憶されたプロセッサ５１０を機能させるためのソフトウェアとして構成してもよい。また機械学習装置１４０は、モータ制御装置５と一体に設けてもよく、モータ制御装置５とは別の筐体に設けてもよい。 The decision-making unit 148 uses the control parameters set in the disturbance observer device 50 (in this embodiment, at least one of the variable gains α ₁ (ω _m ), α ₂ (ω _m ), and α ₃ (ω _m )). is set as the control parameter calculated based on the learning result by the learning unit 141 . In addition, the decision making unit 148 appropriately sets the control parameters of the disturbance observer device 50 in order to learn to associate the control parameters with the rotation state of the motor 2 during learning. Such a machine learning device 140 may be configured, for example, as one function of the processor 510 shown in FIG. good too. Further, the machine learning device 140 may be provided integrally with the motor control device 5 or may be provided in a housing separate from the motor control device 5 .

機械学習装置１４０は、モータ制御装置５に対する機械学習装置であって、モータ２の状態を、環境の現在状態を表す回転速度（Ｓ１）、トルク値（Ｓ２）、モータ電流（Ｓ３）およびモータ温度（Ｓ４）のうちの少なくとも一つを含んだ状態変数Ｓとして観測する状態観測部１８１と、モータ２の回転状態を示す速度波形データに基づいた判定データＤを取得する判定データ生成部１８２と、状態変数Ｓと判定データＤとを用いて、モータ２の回転状態と、可変利得α_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）（制御パラメータ）の少なくとも一つとを関連付けて学習する学習部１４１と、学習部１４１による学習結果に基づき、可変利得α_１（ω_ｍ）、α_２（ω_ｍ）、α_３（ω_ｍ）を設定する意思決定部１４８とを有している。 The machine learning device 140 is a machine learning device for the motor control device 5. The state of the motor 2 is represented by the rotation speed (S1), the torque value (S2), the motor current (S3), and the motor temperature, which represent the current state of the environment. a state observation unit 181 that observes as a state variable S including at least one of (S4); a determination data generation unit 182 that acquires determination data D based on speed waveform data indicating the rotation state of the motor 2; Using the state variable S and the determination data D, the rotation state of the motor 2 and at least one of the variable gains α ₁ (ω _m ), α ₂ (ω _m ), α ₃ (ω _m ) (control parameters) are determined. A learning unit 141 that learns in association with each other, and a decision making unit 148 that sets variable gains α ₁ (ω _m ), α ₂ (ω _m ), and α ₃ (ω _m ) based on the learning result of the learning unit 141 . are doing.

よって機械学習装置１４０は、学習部１４１の学習結果を用いることで、モータ２の出力軸２ａに生じた摩擦トルクτ_ｄｍ（スティック－スリップ現象）の大きさに応じた、制御パラメータを、演算や目算によらずに自動的に、しかも正確に求めることができるようになる。そして、制御パラメータを、演算や目算によらずに自動的に求めることができれば、制御パラメータを迅速かつ適切に決定することができる。したがって、制御パラメータの設定を効率よく行うことができる。さらに、機械学習装置１４０は、モータ２の使用によりオイルの量やオイルの粘性が変化するなどして摩擦トルクτ_ｄｍが変化し、モータ２に働く摩擦トルクτ_ｄｍの特性が変わった場合も、機械学習装置１４０により自動で適切にパラメータを再設定でき、自動でスティック－スリップ現象を抑制できるようにできるので有用である。 Therefore, the machine learning device 140 uses the learning result of the learning unit 141 to calculate or calculate the control parameter according to the magnitude of the friction torque τ _dm (stick-slip phenomenon) generated in the output shaft 2a of the motor 2. It becomes possible to automatically and accurately obtain the value without relying on eye calculation. If the control parameters can be automatically determined without calculation or eye calculation, the control parameters can be quickly and appropriately determined. Therefore, control parameters can be efficiently set. Furthermore, the machine learning device 140 can be used even when the friction torque τ _dm changes due to changes in the amount of oil or the viscosity of the oil due to the use of the motor 2, and the characteristics of the friction torque τ _dm acting on the motor 2 change. It is useful because the machine learning device 140 can automatically reset the parameters appropriately and automatically suppress the stick-slip phenomenon.

上記構成を有する機械学習装置１４０では、学習部１４１が実行する学習アルゴリズムは特に限定されず、機械学習として公知の学習アルゴリズムを採用できる。ここで、学習アルゴリズムの一例として強化学習を説明する。強化学習は、学習対象が存在する環境の現在状態（つまり入力）を観測するとともに現在状態で所定の行動（つまり出力）を実行し、その行動に対し何らかの報酬を与えるというサイクルを試行錯誤的に反復して、報酬の総計が最大化されるような方策（機械学習装置１４０では制御パラメータの変更量）を最適解として学習する手法である。 In the machine learning device 140 having the above configuration, the learning algorithm executed by the learning unit 141 is not particularly limited, and a learning algorithm known as machine learning can be adopted. Reinforcement learning will now be described as an example of a learning algorithm. Reinforcement learning observes the current state of the environment in which the learning target exists (i.e., input), executes a predetermined action (i.e., output) in the current state, and gives some kind of reward for that action through trial and error. This is a method of repeatedly learning a policy that maximizes the sum of rewards (amount of change in control parameters in the machine learning device 140) as an optimal solution.

機械学習装置１４０において強化学習を行う場合、図１３に示すように、学習部１４１は報酬演算部１４１ａと行動価値関数更新部１４１ｂとをさらに備えるように構成される。報酬演算部１４１ａは、制御パラメータを変更した結果に対する報酬Ｒを演算する。報酬Ｒは、制御パラメータ変更後のモータ２の回転状態の適否判定結果（状態変数Ｓが取得された次の学習周期で用いられる判定データＤに相当）に関連する報酬である。行動価値関数更新部１４１ｂは、算出された報酬Ｒに基づいて、行動価値関数Ｑを更新する。行動価値関数Ｑは、制御パラメータの変更の価値を表す関数である。学習部１４１は、行動価値関数更新部１４１ｂによって行動価値関数Ｑの更新を繰り返して、報酬Ｒが最も多く得られる制御パラメータを学習する。 When reinforcement learning is performed in the machine learning device 140, as shown in FIG. 13, the learning unit 141 is configured to further include a reward calculation unit 141a and an action-value function update unit 141b. The reward calculator 141a calculates a reward R for the result of changing the control parameter. The reward R is a reward related to the adequacy determination result of the rotation state of the motor 2 after the control parameter change (corresponding to the determination data D used in the next learning cycle in which the state variable S is obtained). The action-value function updating unit 141b updates the action-value function Q based on the calculated reward R. The action value function Q is a function representing the value of changing the control parameters. The learning unit 141 repeats updating of the action-value function Q by the action-value function updating unit 141b, and learns the control parameter with which the largest reward R can be obtained.

学習部１４１が実行する強化学習のアルゴリズムの一例を説明する。この例によるアルゴリズムは、Ｑ学習（Ｑ－ｌｅａｒｎｉｎｇ）として知られるものであって、行動主体の状態ｓと、その状態ｓで行動主体が選択し得る行動ａとを独立変数として、状態ｓで行動ａを選択した場合の行動の価値を表す行動価値関数Ｑ（ｓ，ａ）を学習する手法である。状態ｓで行動価値関数Ｑが最も高くなる行動ａを選択することが最適解となる。状態ｓと行動ａとの相関性が未知の状態でＱ学習を開始し、任意の状態ｓで種々の行動ａを選択する試行錯誤を繰り返すことで、行動価値関数Ｑを反復して更新し、最適解に近付ける。ここで、状態ｓで行動ａを選択した結果として環境（つまり状態ｓ）が変化したときに、その変化に応じた報酬（つまり行動ａの重み付け）ｒが得られるように構成し、より高い報酬ｒが得られる行動ａを選択するように学習を誘導することで、行動価値関数Ｑを比較的短時間で最適解に近付けることができる。行動価値関数Ｑの更新式は、一般に下記の式のように表すことができる。 An example of a reinforcement learning algorithm executed by the learning unit 141 will be described. The algorithm according to this example, known as Q-learning, takes the agent's state s and the actions a that the agent can choose in that state s as independent variables, This is a method of learning an action value function Q(s, a) representing the action value when a is selected. The optimum solution is to select the action a that maximizes the action-value function Q in the state s. Starting Q-learning in a state where the correlation between state s and action a is unknown, and repeating trial and error to select various actions a in an arbitrary state s, repeatedly updating the action-value function Q, Get closer to the optimal solution. Here, when the environment (that is, state s) changes as a result of selecting action a in state s, it is configured so that a reward (that is, weighting of action a) r corresponding to the change is obtained, and a higher reward By inducing the learning to select the action a that yields r, the action-value function Q can be brought closer to the optimum solution in a relatively short time. An update formula for the action-value function Q can generally be expressed as the following formula.

ここで、ｓt及びａtはそれぞれ時刻ｔにおける状態及び行動であり、行動ａtにより状態はｓt+1に変化する。ｒt+1は、状態がｓtからｓt+1に変化したことで得られる報酬である。ｍａｘＱの項は、時刻ｔ＋１で最大の行動価値関数Ｑになる（と時刻ｔで考えられている）行動ａを行ったときのＱを意味する。α及びγはそれぞれ学習係数及び割引率であり、０＜α≦１、０＜γ≦１で任意設定される。 Here, st and at are the state and action at time t, respectively, and the state changes to st+1 by the action at. rt+1 is the reward obtained by changing the state from st to st+1. The term maxQ means Q when the action a that becomes the maximum action-value function Q at time t+1 (considered at time t) is performed. α and γ are a learning coefficient and a discount rate, respectively, and are arbitrarily set within 0<α≦1 and 0<γ≦1.

学習部１４１がＱ学習を実行する場合、状態観測部１８１が観測した状態変数Ｓ及び判定データ生成部１８２が生成した判定データＤは、更新式の状態ｓに該当し、現在状態（つまり現在のモータ２の回転状態）に対して制御パラメータをどのように設定するべきかという行動は、更新式の行動ａに該当し、報酬演算部１４１ａが求める報酬Ｒは、更新式の報酬ｒに該当する。よって行動価値関数更新部１４１ｂは、現在状態に対する制御パラメータの変更の価値を表す行動価値関数Ｑを、報酬Ｒを用いたＱ学習により繰り返し更新する。 When the learning unit 141 performs Q-learning, the state variable S observed by the state observation unit 181 and the determination data D generated by the determination data generation unit 182 correspond to the state s of the update formula, and the current state (that is, the current The action of how to set the control parameters for the rotational state of the motor 2) corresponds to the action a in the update formula, and the reward R calculated by the reward calculation unit 141a corresponds to the reward r in the update formula. . Therefore, the action-value function updating unit 141b repeatedly updates the action-value function Q representing the value of changing the control parameter for the current state by Q-learning using the reward R.

報酬演算部１４１ａは、例えば、新たに制御パラメータを設定した後に、設定した制御パラメータによりトルク補償値を算出してモータ２の出力軸２ａに生じた摩擦トルクτ_ｄｍを抑制する制御を行ったときに、報酬Ｒを算出する。具体的には、報酬演算部１４１ａは、モータ２の回転状態の適否判定結果が「適」と判定された場合に正（プラス）の報酬Ｒを算出し、モータ２の回転状態の適否判定結果が「否」と判定された場合に負（マイナス）の報酬Ｒを算出する。正負の報酬Ｒの絶対値は、互いに同一であってもよいし異なっていてもよい。また、判定の条件として、判定データＤに含まれる複数の値を組み合わせて判定するようにしても良い。 For example, after setting a new control parameter, the reward calculation unit 141a calculates a torque compensation value using the set control parameter, and performs control to suppress the friction torque τ _dm generated in the output shaft 2a of the motor 2. , the reward R is calculated. Specifically, the remuneration calculation unit 141a calculates a positive (plus) remuneration R when the determination result of the propriety of the rotation state of the motor 2 is determined to be "appropriate". is determined to be "no", a negative (minus) reward R is calculated. The absolute values of the positive and negative rewards R may be the same or different. Further, as a condition for determination, a plurality of values included in the determination data D may be combined for determination.

また、モータ２の回転状態の適否判定結果を、「適」及び「否」の二通りだけでなく複数段階に設定することができる。例えば、モータ２の回転状態の適否判定に用いた上述の「差分の総和」をＶとし、モータ２の回転状態の許容範囲の最大値をＶｍａｘとした場合、差分の総和Ｖが、０≦Ｖ＜Ｖｍａｘ／５のときは報酬Ｒ＝５を与え、Ｖｍａｘ／５≦Ｖ＜Ｖｍａｘ／２のときは報酬Ｒ＝２を与え、Ｖｍａｘ／２≦Ｖ≦Ｖｍａｘのときは報酬Ｒ＝１を与えるような構成とすることができる。さらに、学習の初期段階はＶｍａｘを比較的大きく設定し、学習が進行するにつれてＶｍａｘを縮小する構成とすることもできる。 Moreover, the propriety determination result of the rotation state of the motor 2 can be set in a plurality of stages in addition to the two types of "adequate" and "improper". For example, if the above-mentioned “sum of differences” used to determine the propriety of the rotation state of the motor 2 is V, and the maximum value of the allowable range of the rotation state of the motor 2 is Vmax, then the sum of differences V is 0≦V. Reward R=5 is given when <Vmax/5, reward R=2 is given when Vmax/5≦V<Vmax/2, and reward R=1 is given when Vmax/2≦V≦Vmax. can be configured. Furthermore, it is also possible to set Vmax relatively large in the initial stage of learning, and reduce Vmax as learning progresses.

行動価値関数更新部１４１ｂは、状態変数Ｓと判定データＤと制御パラメータと報酬Ｒとを、行動価値関数Ｑで表される行動価値（例えば数値で表される）と関連付けて整理した行動価値テーブルを持つことができる。この場合、行動価値関数更新部１４１ｂが行動価値関数Ｑを更新するという行為は、行動価値関数更新部１４１ｂが行動価値テーブルを更新するという行為と同義である。Ｑ学習の開始時には環境の現在状態（モータ２の回転状態）と制御パラメータの相関性は未知であるから、行動価値テーブルにおいては、種々の制御パラメータと状態変数Ｓと判定データＤと報酬Ｒとが、無作為に定めた行動価値の値（行動価値関数Ｑ）と関連付けた形態で用意されている。なお報酬演算部１４１ａは、判定データＤが分かればこれに対応する報酬Ｒを直ちに算出でき、算出した報酬Ｒの値が行動価値テーブルに書き込まれる。 The action value function updating unit 141b creates an action value table in which the state variable S, the determination data D, the control parameter, and the reward R are arranged in association with the action value (for example, represented by a numerical value) represented by the action value function Q. can have In this case, the act of updating the action-value function Q by the action-value function updating unit 141b is synonymous with the act of updating the action-value table by the action-value function updating unit 141b. At the start of Q-learning, the correlation between the current state of the environment (rotating state of the motor 2) and the control parameters is unknown. is prepared in a form associated with a randomly determined action value (action value function Q). Note that if the determination data D is known, the reward calculation unit 141a can immediately calculate the reward R corresponding to the determination data D, and the value of the calculated reward R is written in the action value table.

モータ２の回転状態の適否判定結果に応じた報酬Ｒを用いてＱ学習を進めると、より高い報酬Ｒが得られる行動を選択する方向へ学習が誘導され、選択した行動を現在状態で実行した結果として変化する環境の状態（つまり判定データＤ）に応じて、現在状態で行う行動についての行動価値の値（行動価値関数Ｑ）が書き換えられて行動価値テーブルが更新される。この更新を繰り返すことにより、行動価値テーブルに表示される行動価値の値（行動価値関数Ｑ）は、適正な行動ほど大きな値となるように書き換えられる。このようにして、未知であった環境の現在状態（モータ２の回転状態）とそれに対する行動（制御パラメータの設定）との相関性が徐々に明らかになる。つまり行動価値テーブルの更新により、モータ２の回転状態と、制御パラメータとの関係が最適解に徐々に近づけられる。 When the Q-learning is advanced using the reward R according to the determination result of the propriety of the rotation state of the motor 2, the learning is induced to select an action that can obtain a higher reward R, and the selected action is executed in the current state. As a result, the action value table (action value function Q) is rewritten to update the action value table according to the changing environmental state (that is, determination data D). By repeating this update, the value of the action value (action value function Q) displayed in the action value table is rewritten so that the more appropriate the action, the larger the value. In this way, the correlation between the unknown current state of the environment (rotating state of the motor 2) and the action (setting of the control parameter) is gradually clarified. That is, by updating the action value table, the relationship between the rotation state of the motor 2 and the control parameters is gradually brought closer to the optimum solution.

ここで、Ｑ学習では、すべての状態行動ペア（ｓ，ａ）についてのＱ（ｓ，ａ）のテーブルを作成して、学習を行う方法がある。しかし、すべての状態行動ペアのＱ（ｓ，ａ）の値を求めるには状態数が多すぎて、Ｑ学習が収束するのに多くの時間を要してしまう場合がある。そこで、ニューラル・ネットワークを利用して強化学習するようにしてもよい。具体的には、行動価値関数Ｑを適当なニューラル・ネットワークを用いて構成し、ニューラル・ネットワークのパラメータを調整することにより行動価値関数Ｑ（ｓ，ａ）の値を算出するようにする。ニューラル・ネットワークを利用することにより、Ｑ学習が収束するのに要する時間を短くすることが可能となる。 Here, in Q-learning, there is a method of learning by creating a table of Q(s, a) for all state-action pairs (s, a). However, the number of states is too large to obtain the values of Q(s, a) for all state-action pairs, and it may take a long time for Q-learning to converge. Therefore, a neural network may be used for reinforcement learning. Specifically, the action-value function Q is configured using an appropriate neural network, and the value of the action-value function Q(s, a) is calculated by adjusting the parameters of the neural network. By using a neural network, it is possible to shorten the time required for Q-learning to converge.

また、機械学習装置１４０が教師なしで機械学習を行う場合について説明してきたが、機械学習装置１４０が、教師有りの機械学習を行うこともできる。教師あり学習では、入力とそれに対応する出力との既知のデータセット、いわゆる教師データが予め大量に機械学習装置１４０に与えられ、機械学習装置１４０がこの教師データから入力と出力との相関性を暗示する特徴を識別して、新たな入力に対する出力を推定するための相関性モデル、例えばここではモータ２の回転状態と制御パラメータの関連性を学習する。 Moreover, although the case where the machine learning device 140 performs machine learning without a teacher has been described, the machine learning device 140 can also perform supervised machine learning. In supervised learning, a large amount of known data sets of inputs and corresponding outputs, so-called teacher data, are given in advance to the machine learning device 140, and the machine learning device 140 calculates the correlation between the input and the output from this teacher data. A correlation model is learned to identify the implied features and estimate the output for the new input, for example here the relation between the rotational state of the motor 2 and the control parameters.

より具体的には、機械学習装置１４０の学習部１４１は、予め与えられた大量の教師データから、制御パラメータおよび状態変数Ｓとモータ２の回転状態（判定データＤ）との相関性特徴を識別する演算を行う。次に、学習部１４１は、予め識別しておいた相関性特徴と、制御パラメータ、状態変数Ｓ及び判定データＤから最適なモータ２の回転状態を導く相関性モデルとの誤差を計算する。その後、学習部１４１は、この誤差を縮小するように相関性モデルを更新する。そして学習部１４１は、相関性モデルの更新を繰り返すことによって状態変数Ｓにおいて最適なモータ２の回転状態を導く最適な制御パラメータを学習する。そして、学習により導出した制御パラメータが設定される。 More specifically, the learning unit 141 of the machine learning device 140 identifies correlation features between the control parameters and state variables S and the rotation state (determination data D) of the motor 2 from a large amount of training data given in advance. perform calculations to Next, the learning unit 141 calculates an error between the previously identified correlation feature and a correlation model that derives the optimum rotation state of the motor 2 from the control parameters, the state variable S, and the determination data D. After that, the learning unit 141 updates the correlation model so as to reduce this error. Then, the learning unit 141 learns the optimum control parameters leading to the optimum rotational state of the motor 2 in the state variable S by repeating the updating of the correlation model. Then, control parameters derived by learning are set.

相関性モデルの初期値は、例えば、制御パラメータ及び状態変数Ｓと判定データＤとの相関性を単純化して表現したものであり、教師あり学習の開始前に学習部１４１に与えられる。教師データは、例えば、過去に上述の強化学習や人手によって試行錯誤的に行う制御パラメータの設定、シミュレーションを用いた制御パラメータの設定などにより得た状態変数Ｓ、制御パラメータ及び判定データＤのデータセットによって構成される。学習部１４１は、状態変数Ｓ、制御パラメータと、判定データＤとの相関性を暗示する相関性特徴を識別し、この相関性特徴と、現在の状態における制御パラメータと状態変数Ｓ及び判定データＤに対応する相関性モデルとの誤差を求める。さらに学習部１４１は、例えば予め定めた更新ルールにしたがい、誤差が小さくなる方向へ相関性モデルを更新する。 The initial value of the correlation model is, for example, a simplified expression of the correlation between the control parameter/state variable S and the judgment data D, and is given to the learning unit 141 before starting supervised learning. The teacher data is, for example, a data set of the state variables S, the control parameters, and the determination data D obtained in the past by the above-mentioned reinforcement learning, the setting of the control parameters manually by trial and error, the setting of the control parameters using simulation, etc. Consists of The learning unit 141 identifies a correlation feature that implies a correlation between the state variable S, the control parameter, and the determination data D, and learns the correlation feature, the control parameter, the state variable S, and the determination data D in the current state. Find the error with the correlation model corresponding to . Furthermore, the learning unit 141 updates the correlation model in the direction of reducing the error, for example, according to a predetermined update rule.

相関性モデルの初期値が与えられた後、更新後の相関性モデルにしたがってトルク補償値を算出して摩擦トルクτ_ｄｍを抑制する制御が行われる。学習部１４１は、この制御により変化した状態変数Ｓ及び判定データＤを用いて、変更した制御パラメータ、変化した状態変数Ｓ及び判定データＤに対応する相関性モデルと、教師データから求めた相関性特徴との誤差を求める。そして、学習部１４１は、この誤差に基づき、再び相関性モデルを更新する。これを繰り返して、未知であったモータ２の回転状態とそれに対する適切な制御パラメータとの相関性が徐々に明らかになる。相関性モデルの更新により、モータ２の回転状態と制御パラメータとの関係が最適解に徐々に近づく。教師あり学習を進める際に、例えばニューラル・ネットワークを用いても良い。 After the initial value of the correlation model is given, the torque compensation value is calculated according to the updated correlation model to control the friction torque _τdm . The learning unit 141 uses the state variable S and the determination data D changed by this control to create a correlation model corresponding to the changed control parameter, the changed state variable S and the determination data D, and the correlation obtained from the teacher data. Find the error with the feature. Then, the learning unit 141 updates the correlation model again based on this error. By repeating this process, the correlation between the previously unknown rotational state of the motor 2 and appropriate control parameters for it becomes gradually clear. By updating the correlation model, the relationship between the rotational state of the motor 2 and the control parameters gradually approaches the optimum solution. A neural network, for example, may be used when proceeding with supervised learning.

２モータ
３減速装置
４ロータリ・エンコーダ
５モータ制御装置
６出力軸
８トルクセンサ
１０減速機構
２０、９０運動学モデル
２０ａ積分器
２０ｂ積分器
２０ｃ減算器
２０ｄ乗算器
２０ｅ乗算器
２０ｆ加算器
２０ｇ外乱要素
５０、６０外乱オブザーバ装置
５０ａ加算器
５０ｂ利得要素
５０ｃ利得要素
５０ｄ利得要素
５０ｅ加算器
５５ａＳＶＭＮＣ（補償値算出部）
５５ｂＳＴＭＮＣ（補償値算出部）
５９（ａ）、５９（ｂ）、５９（ｃ）可変利得要素

2 motor 3 speed reducer 4 rotary encoder 5 motor controller 6 output shaft 8 torque sensor 10 speed reducer 20, 90 kinematics model 20a integrator 20b integrator 20c subtractor 20d multiplier 20e multiplier 20f adder 20g disturbance element 50 , 60 disturbance observer device 50a adder 50b gain element 50c gain element 50d gain element 50e adder 55a SVMNC (compensation value calculator)
55b STMNC (compensation value calculation unit)
59(a), 59(b), 59(c) variable gain elements

Claims

制御対象となるモータへの電流指令値、前記モータの回転速度および前記モータに減速機構を介して接続された出力軸のねじりトルクの測定値を制御入力として受け取り、前記モータの出力軸に作用する摩擦トルクによる外乱を抑制する外乱オブザーバ装置を備え、
前記外乱オブザーバ装置は、前記制御入力に基づいて前記摩擦トルクを補償するトルク補償値を算出する補償値算出部を有し、
前記補償値算出部の特性を表す感度関数が前記モータの前記回転速度に基づいて設定され、
前記モータの前記回転速度に基づいて、前記感度関数が、０次の感度関数、１次の感度関数又は２次の感度関数のいずれかに切り替わる
モータ制御装置。 A current command value to a motor to be controlled, a rotation speed of the motor, and a measured value of torsional torque of an output shaft connected to the motor via a speed reduction mechanism are received as control inputs, and act on the output shaft of the motor. Equipped with a disturbance observer device that suppresses disturbance due to friction torque,
The disturbance observer device has a compensation value calculation unit that calculates a torque compensation value for compensating the friction torque based on the control input,
a sensitivity function representing characteristics of the compensation value calculation unit is set based on the rotational speed of the motor ;
The sensitivity function switches to either a zero-order sensitivity function, a first-order sensitivity function, or a second-order sensitivity function based on the rotational speed of the motor.
motor controller.

制御対象となるモータへの電流指令値、前記モータの回転速度および前記モータに減速機構を介して接続された出力軸のねじりトルクの測定値を制御入力として受け取り、前記モータの出力軸に作用する摩擦トルクによる外乱を抑制する外乱オブザーバ装置を備え、
前記外乱オブザーバ装置は、前記制御入力に基づいて前記摩擦トルクを補償するトルク補償値を算出する補償値算出部を有し、
前記補償値算出部の特性を表す感度関数が前記モータの前記回転速度に基づいて設定され、
前記モータの前記回転速度に基づいて、前記感度関数が、０次の感度関数、１次の感度関数又は高次の感度関数のいずれかに切り替わる
モータ制御装置。 A current command value for a motor to be controlled, a rotation speed of the motor, and a measured value of torsional torque of an output shaft connected to the motor via a reduction mechanism are received as control inputs, and act on the output shaft of the motor. Equipped with a disturbance observer device that suppresses disturbance due to friction torque,
The disturbance observer device has a compensation value calculation unit that calculates a torque compensation value for compensating the friction torque based on the control input,
a sensitivity function representing a characteristic of the compensation value calculator is set based on the rotational speed of the motor;
The sensitivity function switches to either a zero-order sensitivity function, a first-order sensitivity function, or a higher-order sensitivity function based on the rotational speed of the motor.
motor controller.

前記補償値算出部は、前記制御入力に基づいて算出された前記摩擦トルクの理論値をフィルタするフィルタとして構成され、
前記感度関数が前記フィルタの感度関数である
請求項１又は２に記載のモータ制御装置。 The compensation value calculation unit is configured as a filter for filtering the theoretical value of the friction torque calculated based on the control input,
The motor control device according to claim 1 or 2, wherein the sensitivity function is the sensitivity function of the filter.

前記感度関数は、設計極が単根である
請求項１～３のいずれか１項に記載のモータ制御装置。 The motor control device according to any one of claims 1 to 3 , wherein the sensitivity function has a single root design pole.

前記外乱オブザーバ装置は、前記モータの前記回転速度に基づいて、前記モータに生じた高周波成分を除去するための高周波ダンピング項を算出する高周波ダンピング項演算部を備える
請求項１～４のいずれか１項に記載のモータ制御装置。 5. Any one of claims 1 to 4, wherein the disturbance observer device comprises a high-frequency damping term calculation unit that calculates a high-frequency damping term for removing high-frequency components generated in the motor based on the rotation speed of the motor. A motor controller according to any one of the preceding claims.

モータと、
前記モータに減速機構を介して接続された出力軸と、
前記出力軸に生じるねじりトルクを測定するトルクセンサと、
前記モータの角度位置を検出するロータリ・エンコーダと、
請求項１～５のいずれか１項に記載のモータ制御装置と
を備えるモータ装置。 a motor;
an output shaft connected to the motor via a reduction mechanism;
a torque sensor for measuring torsional torque generated on the output shaft;
a rotary encoder for detecting the angular position of the motor;
A motor device comprising the motor control device according to any one of claims 1 to 5.

制御対象となるモータへの電流指令値、前記モータの回転速度および前記モータに減速機構を介して接続された出力軸のねじりトルクの測定値を制御入力として受け取り、前記モータの出力軸に作用する摩擦トルクによる外乱を抑制する外乱オブザーバ装置を備え、前記外乱オブザーバ装置は、前記制御入力に基づいて前記摩擦トルクを補償するトルク補償値を算出する補償値算出部を有し、前記補償値算出部の特性を表す感度関数が前記モータの前記回転速度に基づいて設定されるモータ制御装置に対する機械学習装置であって、
前記モータの状態を、環境の現在状態を表す前記回転速度、トルク値、モータ電流およびモータ温度のうちの少なくとも一つを含んだ状態変数として観測する状態観測部と、
前記モータの回転状態を示す速度波形データに基づいた判定データを取得する判定データ生成部と、
前記状態変数と前記判定データとを用いて、前記モータの回転状態と、前記モータへの外乱補償特性を記述する特性係数、スイッチ切り替え閾値および高周波成分算出要素のフィルタ特性係数のうちの少なくとも一つを含む制御パラメータとを関連付けて学習する学習部と、
前記学習部による学習結果に基づき、前記制御パラメータを設定する意思決定部とを有する
機械学習装置。 A current command value to a motor to be controlled, a rotation speed of the motor, and a measured value of torsional torque of an output shaft connected to the motor via a speed reduction mechanism are received as control inputs, and act on the output shaft of the motor. A disturbance observer device for suppressing disturbance due to frictional torque is provided, the disturbance observer device has a compensation value calculation unit for calculating a torque compensation value for compensating the frictional torque based on the control input, and the compensation value calculation unit A machine learning device for a motor control device in which a sensitivity function representing the characteristics of is set based on the rotational speed of the motor ,
a state observation unit that observes the state of the motor as a state variable including at least one of the rotational speed, torque value, motor current, and motor temperature representing the current state of the environment;
a determination data generation unit that acquires determination data based on speed waveform data that indicates the rotation state of the motor;
At least one of a characteristic coefficient describing a rotation state of the motor and a disturbance compensation characteristic to the motor, a switch switching threshold value, and a filter characteristic coefficient of a high-frequency component calculation element, using the state variable and the determination data. a learning unit that learns in association with control parameters including
A machine learning device, comprising: a decision making section that sets the control parameter based on a learning result of the learning section.

前記学習部は、
前記制御パラメータを変更した結果に対する報酬を演算する報酬演算部と、
前記報酬に基づいて、行動価値関数を更新する行動価値関数更新部と、を含み、
前記行動価値関数更新部によって前記行動価値関数の更新を繰り返して、前記報酬が最も多く得られる前記制御パラメータを学習する
請求項７に記載の機械学習装置。 The learning unit
a reward calculation unit that calculates a reward for the result of changing the control parameter;
an action-value function updating unit that updates the action-value function based on the reward;
8. The machine learning device according to claim 7, wherein the action-value function updating unit repeats updating of the action-value function to learn the control parameter that provides the largest reward.

前記学習部は、
前記制御パラメータと前記状態変数と前記判定データとから前記モータの前記回転状態を導く相関性モデルと予め用意された教師データから識別される相関性特徴との誤差を計算し、
前記誤差を小さくするように前記相関性モデルを更新する
請求項７に記載の機械学習装置。 The learning unit
calculating an error between a correlation model that derives the rotation state of the motor from the control parameters, the state variables, and the determination data and a correlation feature identified from teacher data prepared in advance;
The machine learning device according to claim 7, wherein said correlation model is updated so as to reduce said error.