WO2022201377A1

WO2022201377A1 - Robot control device and robot control method

Info

Publication number: WO2022201377A1
Application number: PCT/JP2021/012339
Authority: WO
Inventors: 暁生斎藤
Original assignee: 三菱電機株式会社
Priority date: 2021-03-24
Filing date: 2021-03-24
Publication date: 2022-09-29
Also published as: CN116940906A; DE112021007371T5; JP7378640B2; JPWO2022201377A1

Abstract

This robot control device (100) comprises: a speed calculation unit (4) that calculates the speed profile of a pre-set manipulator (1) on the basis of the command trajectory of the manipulator (1), a constraint condition relating to the manipulator (1), and an evaluation index that is based on the operation time of the manipulator (1); a gradient calculation unit (5) that calculates a gradient relating to the command trajectory in the operation time, and that employs the resultant gradient as gradient information, on the basis of the speed profile; a command trajectory correction unit (6) that corrects the command trajectory, and that employs the resultant command trajectory as a corrected command trajectory, on the basis of the gradient information; and a control unit (8) that performs a control such that the manipulator (1) follows the corrected command trajectory.

Description

ロボット制御装置およびロボット制御方法ROBOT CONTROL DEVICE AND ROBOT CONTROL METHOD

　本開示は、マニピュレータの制御を行うロボット制御装置およびロボット制御方法に関する。 The present disclosure relates to a robot control device and a robot control method for controlling a manipulator.

　工場の製造ラインなどにおいて、部品および製品（以下、「ワーク」と称する）をマニピュレータで把持して搬送するピックアンドプレースの工程が存在する。ピックアンドプレースにおいて、ロボットの動作速度および加速度が適切でない場合に、ワークおよびマニピュレータの把持部に対し過剰な慣性力とモーメントとが発生し、ワークが落下するといった課題がある。これを解決するために、ワークに発生する力およびモーメントを考慮して適切な動作条件を決定する技術が提案されている。 In factory production lines, there is a pick-and-place process in which parts and products (hereinafter referred to as "workpieces") are gripped and transported by manipulators. In pick-and-place, if the operating speed and acceleration of the robot are not appropriate, excessive inertial force and moment are generated on the workpiece and the gripping portion of the manipulator, causing the workpiece to drop. In order to solve this problem, techniques have been proposed to determine appropriate operating conditions in consideration of the forces and moments generated in the workpiece.

　特許文献１には、弾性を有する保持体を含むロボットのシミュレーションモデルを用いてシミュレーションを実施し、保持体に発生する負荷モーメントが閾値よりも大きい場合に、負荷モーメントが閾値以下となるようシミュレーションの実行条件を変更するシミュレーション装置について開示されている。 In Patent Document 1, a simulation is performed using a simulation model of a robot including an elastic holding body, and when the load moment generated in the holding body is greater than a threshold, the simulation is performed so that the load moment is equal to or less than the threshold. A simulation device for changing execution conditions is disclosed.

特開２０１９－１４１９３９号公報JP 2019-141939 A

　特許文献１では、負荷モーメントが閾値以下になるまでシミュレーションの実行条件を繰り返し変更する必要があるため、ワークに発生する力およびモーメントを考慮して適切な実行条件を求めるのに時間を要するという問題がある。また、自動的に調整可能なシミュレーションの実行条件は最大加速度のみであり、ロボットの動作軌跡と速度プロファイルといった自由度の高い動作条件を調整するのは困難なため、マニピュレータが動作開始してからワークを把持するまでの動作時間を短くできないという問題がある。 In Patent Document 1, it is necessary to repeatedly change the execution conditions of the simulation until the load moment becomes equal to or less than the threshold. Therefore, it takes time to determine the appropriate execution conditions in consideration of the force and moment generated in the workpiece. There is In addition, the only simulation execution condition that can be automatically adjusted is the maximum acceleration. There is a problem that the operation time until gripping cannot be shortened.

　本開示は、上述の課題を解決するためになされたもので、ワークに発生する力およびモーメントなどの制約条件を満たし、動作時間が短くなるようなマニピュレータの指令軌跡と速度プロファイルを高速に求めることができるロボット制御装置およびロボット制御方法を提供することを目的とする。 The present disclosure has been made in order to solve the above-mentioned problems, and is to quickly obtain a manipulator command trajectory and velocity profile that satisfy constraints such as forces and moments generated in a workpiece and shorten operation time. It is an object of the present invention to provide a robot control device and a robot control method capable of

　本開示に係るロボット制御装置は、予め設定されたマニピュレータへの指令軌跡と、前記マニピュレータに関する制約条件と、前記マニピュレータの動作時間に基づく評価指標とに基づいて、前記マニピュレータの速度プロファイルを計算する速度計算部と、前記速度プロファイルに基づいて、前記動作時間の前記指令軌跡に関する勾配を計算して勾配情報とする勾配計算部と、前記勾配情報に基づいて前記指令軌跡を補正して補正指令軌跡とする指令軌跡補正部と、前記補正指令軌跡に対し前記マニピュレータが追従するよう制御する制御部と、を備える。 The robot control device according to the present disclosure calculates a velocity profile of the manipulator based on a preset command trajectory to the manipulator, constraints on the manipulator, and an evaluation index based on the operation time of the manipulator. a calculation unit, a gradient calculation unit that calculates a gradient related to the command trajectory of the operation time based on the speed profile to obtain gradient information, and a corrected command trajectory that corrects the command trajectory based on the gradient information. and a control unit for controlling the manipulator to follow the corrected command trajectory.

　また、本開示に係るロボット制御方法は、予め設定されたマニピュレータへの指令軌跡と、前記マニピュレータに関する制約条件と、前記マニピュレータの動作時間に基づく評価指標とに基づいて、前記マニピュレータの速度プロファイルを計算する工程と、前記速度プロファイルに基づいて、前記動作時間の前記指令軌跡に関する勾配を計算して勾配情報とする工程と、前記勾配情報に基づいて前記指令軌跡を補正して補正指令軌跡とする工程と、前記補正指令軌跡に対し前記マニピュレータが追従するよう制御する工程と、を備える。 Further, the robot control method according to the present disclosure calculates a velocity profile of the manipulator based on a preset command trajectory to the manipulator, constraints on the manipulator, and an evaluation index based on the operation time of the manipulator. calculating a slope of the command trajectory of the operation time based on the speed profile to obtain slope information; and correcting the command trajectory based on the slope information to obtain a corrected command trajectory. and controlling the manipulator to follow the correction command locus.

　本開示によれば、ロボット制御装置およびロボット制御方法は、マニピュレータに関する制約条件とマニピュレータの動作時間とに基づく評価指標に基づいて速度プロファイルを計算し、動作時間の指令軌跡に関する勾配に基づいて指令軌跡を補正するため、制約条件を満たし、動作時間が短くなるようなマニピュレータの補正指令軌跡と速度プロファイルとを高速に求めることができる。 According to the present disclosure, a robot control device and a robot control method calculate a velocity profile based on an evaluation index based on manipulator constraints and manipulator operation time, and calculate a command trajectory based on a slope of the command trajectory of the operation time. is corrected, it is possible to quickly obtain a correction command trajectory and velocity profile of the manipulator that satisfy the constraint conditions and shorten the operation time.

実施の形態１におけるロボット制御装置の一例を示すブロック図である。1 is a block diagram showing an example of a robot control device according to Embodiment 1; FIG. 実施の形態１から４におけるロボット制御装置を含む構成の一例を示す図である。1 is a diagram showing an example of a configuration including a robot control device according to Embodiments 1 to 4; FIG. 実施の形態１から４における速度計算部の一例を示すブロック図である。FIG. 4 is a block diagram showing an example of a velocity calculator in Embodiments 1 to 4; FIG. 実施の形態１から４における勾配計算部の一例を示すブロック図である。FIG. 4 is a block diagram showing an example of a gradient calculator in Embodiments 1 to 4; FIG. 実施の形態１から４における制御部の一例を示すブロック図である。FIG. 3 is a block diagram showing an example of a control unit in Embodiments 1 to 4; FIG. 実施の形態１におけるロボット制御装置の動作の一例を示すフローチャートである。4 is a flow chart showing an example of the operation of the robot control device according to Embodiment 1; 実施の形態２におけるロボット制御装置の一例を示すブロック図である。FIG. 10 is a block diagram showing an example of a robot control device according to Embodiment 2; FIG. 実施の形態２における指令軌跡補正部の一例を示すブロック図である。FIG. 11 is a block diagram showing an example of a commanded trajectory correction unit according to Embodiment 2; FIG. 実施の形態２におけるロボット制御装置の動作の一例を示すフローチャートである。9 is a flow chart showing an example of the operation of the robot control device according to Embodiment 2; 実施の形態３におけるロボット制御装置の一例を示すブロック図である。FIG. 11 is a block diagram showing an example of a robot control device according to Embodiment 3; 実施の形態３におけるロボット制御装置の動作の一例を示すフローチャートである。14 is a flow chart showing an example of the operation of the robot control device according to Embodiment 3; 実施の形態４におけるロボット制御装置の一例を示すブロック図である。FIG. 12 is a block diagram showing an example of a robot control device according to Embodiment 4; 実施の形態４におけるロボット制御装置の動作の一例を示すフローチャートである。14 is a flow chart showing an example of the operation of the robot control device according to Embodiment 4; 実施の形態１から４におけるロボット制御装置のハードウェア構成を示す図である。2 is a diagram showing a hardware configuration of a robot control device according to Embodiments 1 to 4; FIG.

実施の形態１．
　図１は、実施の形態１におけるロボット制御装置１００の一例を示すブロック図である。図１は、ロボット制御装置１００と、アクチュエータ１１０と、マニピュレータ１と、ワーク１１２とにより構成されるブロック図である。また、図２は、実施の形態１におけるロボット制御装置１００を含む構成の一例を示す図である。ロボット制御装置１００は、垂直多関節ロボットであるマニピュレータ１の各関節に設置されるアクチュエータ１１０を制御することで、マニピュレータ１の先端に設置される把持部１１１で把持対象となるワーク１１２のピックアンドプレースの動作を実現する。周辺環境１１３は、例えばカメラなどであり、例えばマニピュレータ１がピックアンドプレースの動作を行う際の映像を図示しない表示装置に出力する。 Embodiment 1.
FIG. 1 is a block diagram showing an example of a robot control device 100 according to Embodiment 1. As shown in FIG. FIG. 1 is a block diagram composed of a robot control device 100, an actuator 110, a manipulator 1, and a workpiece 112. As shown in FIG. FIG. 2 is a diagram showing an example of a configuration including the robot control device 100 according to Embodiment 1. As shown in FIG. The robot control device 100 controls the actuators 110 installed at the joints of the manipulator 1, which is a vertical articulated robot, so that the gripping unit 111 installed at the tip of the manipulator 1 can pick and hold a work 112 to be gripped. Realize the behavior of a place. The surrounding environment 113 is, for example, a camera, and outputs an image of the manipulator 1 performing pick-and-place operations to a display device (not shown).

　ロボット制御装置１００は、制約条件記憶部２と、指令軌跡記憶部３と、速度計算部４と、勾配計算部５と、指令軌跡補正部６と、指令点列計算部７と、制御部８とを備える。 The robot control device 100 includes a constraint storage unit 2, a command trajectory storage unit 3, a speed calculation unit 4, a gradient calculation unit 5, a command trajectory correction unit 6, a command point sequence calculation unit 7, and a control unit 8. and

　制約条件記憶部２は、マニピュレータ１に関して予め設定された制約条件のパラメータを記憶する。制約条件は、マニピュレータ１の関節角速度、関節角加速度、関節トルク、手先速度、手先加速度、マニピュレータ１が把持する対象（ワーク１１２）で発生する力、および把持する対象で発生するモーメントのうち少なくとも１つに関する条件である。一例として、制約条件が手先速度ｖ_ｈに関する条件の場合、制約条件とは「ｖ_ｍｉｎ≦ｖ_ｈ≦ｖ_ｍａｘ」のことである。ここで、ｖ_ｍｉｎは手先速度ｖ_ｈの下限値、ｖ_ｍａｘは手先速度ｖ_ｈの上限値である。そして、この場合の制約条件のパラメータとは、ｖ_ｍｉｎおよびｖ_ｍａｘのことである。制約条件および制約条件のパラメータは、これに限定されない。例えば、制約条件は「｜ｖ_ｈ｜≦ｖ_ｔｈ」であり、制約条件のパラメータはｖ_ｔｈであってもよい。ここで、｜ｖ_ｈ｜は手先速度ｖ_ｈの絶対値、ｖ_ｔｈは手先速度ｖ_ｈの閾値である。 The constraint storage unit 2 stores parameters of constraints preset for the manipulator 1 . The constraints are at least one of the joint angular velocity, joint angular acceleration, joint torque, hand speed, hand acceleration of the manipulator 1, the force generated in the object (workpiece 112) gripped by the manipulator 1, and the moment generated in the gripped object. It is a condition about one. As an example, if the constraint is related to the hand speed _vh , the constraint is "v _min ≤ v _h ≤ v _max ". Here, _vmin is the lower limit of the tip speed _{vh, and vmax} _is the upper limit of the tip speed _vh . In this case, the constraint parameters are v _min and v _max . Constraints and parameters of constraints are not limited to these. For example, the constraint may be “|v _h |≦v _th ” and the parameter of the constraint may be v _th . is the absolute value of the tip speed _vh , and _vth is the threshold value of the _tip speed _vh .

　なお、制約条件のパラメータが予め設定される場合について説明したが、ロボット制御装置１００が図示しない把持制約学習部を備えてもよい。把持制約学習部は、ワーク１１２で発生する力およびモーメントのうち少なくとも１つについて、機械学習を用いて制約条件のパラメータを学習し、得られた制約条件のパラメータを制約条件記憶部２に記憶してもよい。具体的には、把持制約学習部は、マニピュレータ１がワーク１１２を把持する際に、ワーク１１２で発生する力およびモーメントを図示しないセンサによって取得し、取得した値に基づいて、制約条件のパラメータとして学習する。 Although the case where the parameter of the constraint condition is set in advance has been described, the robot control device 100 may include a gripping constraint learning unit (not shown). The gripping constraint learning unit uses machine learning to learn constraint parameters for at least one of the force and moment generated on the workpiece 112 , and stores the obtained constraint parameters in the constraint storage unit 2 . may Specifically, when the manipulator 1 grips the workpiece 112, the gripping constraint learning unit acquires the force and moment generated in the workpiece 112 by a sensor (not shown), and uses the acquired values as parameters of the constraint conditions. learn.

　指令軌跡記憶部３は、予め設定されたマニピュレータ１への指令軌跡を記憶する。一例として、指令軌跡がスプライン曲線として与えられていた場合、指令軌跡記憶部３は、スプライン曲線上の経由点の位置とその位置における曲線の媒介変数の値（０から１の値）とのペアで記憶する。あるいは、指令軌跡記憶部３は、スプライン曲線上の経由点の位置のみを記憶してもよい。この場合、媒介変数の値は、スプライン曲線上の経由点間の距離などから計算される。また、指令軌跡はスプライン曲線以外にも、Ｂ－スプライン曲線、あるいはベジェ曲線などとして与えられてもよい。指令軌跡記憶部３は、後に説明する指令軌跡補正部６からの補正指令軌跡を記憶する。この際、予め設定された指令軌跡は破棄され、補正指令軌跡が新たに記憶される。 The command trajectory storage unit 3 stores a preset command trajectory to the manipulator 1 . As an example, when the command trajectory is given as a spline curve, the command trajectory storage unit 3 stores a pair of the position of a waypoint on the spline curve and the value of the parameter of the curve at that position (value from 0 to 1). memorize with Alternatively, the command trajectory storage unit 3 may store only the positions of waypoints on the spline curve. In this case, the value of the parameter is calculated from the distance between waypoints on the spline curve. Also, the command trajectory may be given as a B-spline curve, a Bezier curve, or the like, other than the spline curve. The command trajectory storage unit 3 stores a corrected command trajectory from the command trajectory correction unit 6, which will be described later. At this time, the preset command trajectory is discarded and the corrected command trajectory is newly stored.

　速度計算部４は、予め設定されたマニピュレータ１への指令軌跡と、マニピュレータ１に関する制約条件と、マニピュレータ１の動作時間に基づく評価指標とに基づいて、マニピュレータ１の速度プロファイルを計算する。すなわち、速度計算部４は、指令軌跡記憶部３からの指令軌跡と、制約条件記憶部２からの制約条件のパラメータとに基づいて、制約条件の範囲内でマニピュレータ１の動作時間を短くする指令軌跡上の速度プロファイルを計算する。あるいは、ロボット制御装置１００が把持制約学習部を備える場合、速度計算部４は、指令軌跡と、把持制約学習部で学習した制約条件のパラメータとに基づいて、速度プロファイルを計算する。ここで、マニピュレータ１の動作時間に基づく評価指標とは、後に説明するプロファイル計算部４４が用いる評価関数のことである。この評価関数は、媒介変数を介して、マニピュレータ１の加速度および速度を変数として数式化した関数である。また、速度プロファイルとは、マニピュレータ１が指令軌跡上を動作する際の各関節の速度の時間変化を表す。速度計算部４は、評価関数で表された動作時間を最小化する最適化問題によって、速度プロファイルを計算する。 The velocity calculation unit 4 calculates the velocity profile of the manipulator 1 based on a preset command trajectory to the manipulator 1, constraints on the manipulator 1, and an evaluation index based on the operation time of the manipulator 1. That is, based on the command trajectory from the command trajectory storage unit 3 and the parameters of the constraint conditions from the constraint condition storage unit 2, the velocity calculation unit 4 issues a command to shorten the operation time of the manipulator 1 within the range of the constraint conditions. Calculate the velocity profile on the trajectory. Alternatively, if the robot control device 100 includes a gripping constraint learning unit, the velocity calculating unit 4 calculates a velocity profile based on the command trajectory and the parameters of the constraints learned by the gripping constraint learning unit. Here, the evaluation index based on the operating time of the manipulator 1 is an evaluation function used by the profile calculator 44, which will be described later. This evaluation function is a function obtained by formulating the acceleration and velocity of the manipulator 1 as variables via parameters. Also, the velocity profile represents the change over time of the velocity of each joint when the manipulator 1 moves along the commanded trajectory. A speed calculator 4 calculates a speed profile by an optimization problem that minimizes the operating time represented by the evaluation function.

　図３は、実施の形態１における速度計算部４の一例を示すブロック図である。速度計算部４は、指令補間計算部４１と、動力学計算部４２と、制約条件係数計算部４３と、プロファイル計算部４４とを備える。 FIG. 3 is a block diagram showing an example of the speed calculation unit 4 according to Embodiment 1. FIG. The velocity calculator 4 includes a command interpolation calculator 41 , a dynamics calculator 42 , a constraint coefficient calculator 43 , and a profile calculator 44 .

　指令補間計算部４１は、予め設定された自然数Ｎを用いて、指令軌跡記憶部３からの指令軌跡の曲線をＮ点で補間し、指令軌跡上の補間点における位置と指令軌跡の媒介変数に関する一階微分および二階微分とを計算する。一例として、指令軌跡がスプライン曲線として与えられていた場合、マニピュレータ１の各軸の関節位置（あるいは関節角度）が媒介変数に関する３次の区分多項式として表現されるので、多項式の微分を用いることで、補間点の位置（あるいは角度）だけでなく、媒介変数に関する一階微分および二階微分を計算することができる。 The command interpolation calculation unit 41 uses a preset natural number N to interpolate the curve of the command trajectory from the command trajectory storage unit 3 at N points, and calculates the position of the interpolation point on the command trajectory and the parameter of the command trajectory. Compute first and second derivatives. As an example, when the command trajectory is given as a spline curve, the joint position (or joint angle) of each axis of the manipulator 1 is expressed as a cubic piecewise polynomial with respect to parameters. , the positions (or angles) of the interpolated points as well as the first and second derivatives with respect to the parameters can be computed.

　動力学計算部４２は、指令補間計算部４１からの補間点における位置と一階微分と二階微分とを用いて、マニピュレータ１の運動学計算と動力学計算とを行い、運動学計算結果と動力学計算結果とを出力する。運動学計算とは、マニピュレータ１の各関節の速度、加速度、角速度、および角加速度から、マニピュレータ１の各関節、各リンク、および把持部１１１などの速度、加速度、角速度、および角加速度を計算することである。また、動力学計算とは、マニピュレータ１の各関節、各リンク、および把持部１１１などの速度、加速度、角速度、および角加速度から、マニピュレータ１の各関節に発生するトルク、および把持部１１１に発生する力とモーメントとを計算することである。 The dynamics calculation unit 42 performs kinematics calculation and dynamics calculation of the manipulator 1 using the positions at the interpolation points from the command interpolation calculation unit 41 and the first-order differential and the second-order differential. output the results of academic calculations. The kinematics calculation is to calculate the velocity, acceleration, angular velocity, and angular acceleration of each joint, each link, and the grasping part 111 of the manipulator 1 from the velocity, acceleration, angular velocity, and angular acceleration of each joint of the manipulator 1. That is. The dynamics calculation is performed by calculating the torque generated at each joint of the manipulator 1 and the torque generated at the gripping portion 111 from the velocity, acceleration, angular velocity, and angular acceleration of each joint, each link, and the gripping portion 111 of the manipulator 1. is to calculate the forces and moments acting on

　制約条件係数計算部４３は、動力学計算部４２からの運動学計算結果および動力学計算結果と、制約条件記憶部２からの制約条件のパラメータとに基づいて、制約条件の係数を計算する。一例として、把持部１１１に発生する力に関する制約条件が、以下の数式（１）で表される場合、制約条件の係数とは、ａ、ｂおよびｃのことである。すなわち、制約条件の係数とは、複数の変数を含む関係式において、それぞれの項に含まれる係数のことである。ここで、変数とは、数式（１）における補間点での加速度ｕおよび速度ｘなどである。 The constraint coefficient calculation unit 43 calculates constraint coefficients based on the kinematics calculation results and dynamics calculation results from the dynamics calculation unit 42 and the constraint parameters from the constraint storage unit 2 . As an example, when the constraint on the force generated in the grip part 111 is represented by the following formula (1), the coefficients of the constraint are a, b, and c. That is, the constraint coefficient is a coefficient included in each term in a relational expression including a plurality of variables. Here, the variables are the acceleration u and the velocity x at the interpolation points in Equation (1).

　プロファイル計算部４４は、制約条件係数計算部４３からの制約条件の係数に基づいて、速度プロファイルを計算する。すなわち、プロファイル計算部４４は、マニピュレータ１の動作時間に基づく評価関数と、制約条件の係数とに基づく最適化計算により、マニピュレータ１の指令軌跡上の速度プロファイルと、マニピュレータ１の動作時間とを計算する。 The profile calculator 44 calculates a velocity profile based on the constraint coefficients from the constraint coefficient calculator 43 . That is, the profile calculator 44 calculates the velocity profile on the commanded trajectory of the manipulator 1 and the operating time of the manipulator 1 by optimization calculation based on the evaluation function based on the operating time of the manipulator 1 and the coefficients of the constraint conditions. do.

　なお、制約条件係数計算部４３が制約条件の係数を計算する方法、およびプロファイル計算部４４が加速度プロファイルを計算する方法は、非特許文献１に示されている。 The method by which the constraint coefficient calculator 43 calculates the constraint coefficients and the method by which the profile calculator 44 calculates the acceleration profile are shown in Non-Patent Document 1.

　以上のように、速度計算部４は、制約条件記憶部２で設定された制約条件の範囲内で、動作時間が短くなる速度プロファイルを計算することができる。また、ワーク１１２で発生する力およびモーメントを制約条件に加えることで、ワーク１１２のピックアンドプレースにおいて、ワーク１１２が落下する、あるいはワーク１１２に過剰な力が発生するなどの不具合を抑制することができる。加えて、非特許文献１に記載されている方法は、大域的最適解が容易に得られる凸最適化と呼ばれる方法であるため、速度プロファイルを高速に計算することができる。 As described above, the speed calculation unit 4 can calculate a speed profile that shortens the operation time within the range of the constraint conditions set in the constraint storage unit 2. In addition, by adding the force and moment generated by the workpiece 112 to the constraint conditions, problems such as the workpiece 112 dropping or excessive force being generated in the workpiece 112 during pick-and-place of the workpiece 112 can be suppressed. can. In addition, the method described in Non-Patent Document 1 is a method called convex optimization that can easily obtain a global optimum solution, so it is possible to calculate the velocity profile at high speed.

　図１に戻り、勾配計算部５は、速度計算部４からの速度プロファイルに基づいて、マニピュレータ１の動作時間の指令軌跡に関する勾配を計算し、勾配情報として出力する。具体的には、勾配計算部５は、自動微分を用いて、プロファイル計算部４４からのマニピュレータ１の動作時間を指令軌跡について微分することで、動作時間の指令軌跡に関する勾配を計算し、勾配情報として出力する。 Returning to FIG. 1, the gradient calculator 5 calculates the gradient of the commanded trajectory of the manipulator 1 based on the velocity profile from the velocity calculator 4, and outputs it as gradient information. Specifically, the gradient calculator 5 differentiates the motion time of the manipulator 1 from the profile calculator 44 with respect to the commanded trajectory using automatic differentiation, thereby calculating the gradient of the motion time with respect to the commanded trajectory. output as

　図４は、実施の形態１における勾配計算部５の一例を示すブロック図である。勾配計算部５は、指令補間勾配計算部５１と、動力学勾配計算部５２と、制約条件係数勾配計算部５３と、プロファイル勾配計算部５４とを備える。指令補間勾配計算部５１、動力学勾配計算部５２、制約条件係数勾配計算部５３、およびプロファイル勾配計算部５４は、それぞれ、指令補間計算部４１、動力学計算部４２、制約条件係数計算部４３、およびプロファイル計算部４４が行った計算結果の勾配を計算する。 FIG. 4 is a block diagram showing an example of the gradient calculator 5 according to the first embodiment. The gradient calculator 5 includes a command interpolation gradient calculator 51 , a dynamic gradient calculator 52 , a constraint coefficient gradient calculator 53 , and a profile gradient calculator 54 . The command interpolation gradient calculator 51, the dynamic gradient calculator 52, the constraint coefficient gradient calculator 53, and the profile gradient calculator 54 are the command interpolation calculator 41, the dynamic calculator 42, and the constraint coefficient calculator 43, respectively. , and the slope of the calculation result performed by the profile calculator 44 is calculated.

　プロファイル勾配計算部５４は、一例として非特許文献２で示されている方法と同様の方法に基づいて、最適化問題の係数に関する勾配を計算することで、動作時間の制約条件の係数に関する勾配を計算する。 The profile gradient calculation unit 54 calculates the gradients of the coefficients of the operating time constraints by calculating the gradients of the coefficients of the optimization problem based on a method similar to the method disclosed in Non-Patent Document 2 as an example. calculate.

　制約条件係数勾配計算部５３は、プロファイル勾配計算部５４からの制約条件の係数に関する勾配を入力とし、制約条件係数計算部４３の計算手順の微分を連鎖律に基づいて計算することで、動作時間の運動学計算結果に関する勾配、および動作時間の動力学計算結果に関する勾配を計算する。 The constraint coefficient gradient calculator 53 receives the gradient related to the coefficient of the constraint from the profile gradient calculator 54 as an input, and calculates differentiation of the calculation procedure of the constraint coefficient calculator 43 based on the chain rule to obtain the operation time , and the gradient for the motion time dynamics calculation results.

　動力学勾配計算部５２は、制約条件係数勾配計算部５３からの動作時間の運動学計算結果に関する勾配、および動作時間の動力学計算結果に関する勾配を入力とし、動力学計算部４２の計算手順の微分を連鎖律に基づいて計算することで、動作時間の指令軌跡上の補間点における位置に関する勾配、動作時間の指令軌跡の媒介変数に関する一階微分に関する勾配、および動作時間の指令軌跡の媒介変数に関する二階微分に関する勾配を計算する。 The dynamics gradient calculator 52 inputs the gradients related to the kinematics calculation result of the action time from the constraint coefficient gradient calculator 53 and the gradients related to the dynamics calculation result of the action time. By calculating the derivative based on the chain rule, the slope of the position at the interpolation point on the commanded trajectory of the action time, the slope of the first derivative of the parameter of the commanded trajectory of the action time, and the parameter of the commanded trajectory of the action time Computes the gradient with respect to the second derivative with respect to .

　指令補間勾配計算部５１は、動力学勾配計算部５２からの指令軌跡上の補間点における位置に関する勾配、指令軌跡の媒介変数に関する一階微分に関する勾配、および指令軌跡の媒介変数に関する二階微分に関する勾配を入力とし、指令補間計算部４１の計算手順の微分を連鎖律に基づいて計算することで、動作時間の指令軌跡に関する勾配を計算する。指令補間勾配計算部５１からは、動作時間の指令軌跡に関する勾配を勾配情報として出力する。 The command interpolation gradient calculation unit 51 calculates the gradient regarding the position of the interpolation point on the command trajectory from the dynamic gradient calculation unit 52, the gradient regarding the first derivative regarding the parameter of the command trajectory, and the gradient regarding the second derivative regarding the parameter of the command trajectory. is input, and the differential of the calculation procedure of the command interpolation calculation unit 41 is calculated based on the chain rule, thereby calculating the gradient of the commanded trajectory of the operation time. The command interpolating gradient calculator 51 outputs the gradient related to the commanded trajectory of the operation time as gradient information.

　図１に戻り、指令軌跡補正部６は、勾配計算部５からの勾配情報に基づいて、マニピュレータ１の動作時間が減少するように指令軌跡を補正し、補正指令軌跡として出力する。指令軌跡補正部６は、勾配情報に基づく勾配降下法、共役勾配法または準ニュートン法のうちいずれか１つを用いることで、指令軌跡を補正する。勾配降下法の一例として、最急降下法、モーメンタム法、または加速勾配法などがある。補正指令軌跡は、指令軌跡記憶部３で記憶される。 Returning to FIG. 1, the command trajectory correction unit 6 corrects the command trajectory based on the gradient information from the gradient calculation unit 5 so as to reduce the operation time of the manipulator 1, and outputs it as a corrected command trajectory. The commanded trajectory correction unit 6 corrects the commanded trajectory by using any one of the gradient descent method, the conjugate gradient method, and the quasi-Newton method based on the gradient information. Examples of gradient descent include steepest descent, momentum, or accelerating gradients. The corrected command trajectory is stored in the command trajectory storage unit 3 .

　勾配情報に基づいて、指令軌跡補正部６がマニピュレータ１の動作時間が少なくなるよう指令軌跡を補正することで、マニピュレータ１の動作時間を短縮することができる。また、指令軌跡補正部６が勾配情報を用いるため、効率よく高速に指令軌跡を補正することができる。 The operation time of the manipulator 1 can be shortened by correcting the instruction trajectory so that the operation time of the manipulator 1 is reduced by the command trajectory correction unit 6 based on the gradient information. Further, since the commanded trajectory correction unit 6 uses the gradient information, the commanded trajectory can be corrected efficiently and at high speed.

　指令点列計算部７は、指令軌跡記憶部３で記憶された補正指令軌跡と、速度計算部４からの速度プロファイルとに基づいて、所定のサンプリング周期毎の指令点列を計算する。このサンプリング周期は、後に説明する制御部８がアクチュエータ１１０への電流値を計算する際の周期のことである。一例として、補正指令軌跡がスプライン曲線として与えられていた場合、指令点列計算部７は、速度プロファイルに基づいて、時刻と補正指令軌跡の曲線の媒介変数との間の変換を行い、媒介変数を入力とした補正指令軌跡のスプライン曲線の区分多項式と組み合わせることにより、各サンプリングでマニピュレータ１への位置指令の点列を計算する。 The command point sequence calculation unit 7 calculates command point sequences for each predetermined sampling period based on the corrected command trajectory stored in the command trajectory storage unit 3 and the speed profile from the speed calculation unit 4 . This sampling period is the period when the controller 8, which will be described later, calculates the current value to the actuator 110. FIG. As an example, when the correction command trajectory is given as a spline curve, the command point sequence calculation unit 7 performs conversion between the time and the parameter of the curve of the correction command trajectory based on the velocity profile. is combined with the piecewise polynomial of the spline curve of the corrected command trajectory, which is input, to calculate the point sequence of the position command to the manipulator 1 at each sampling.

　制御部８は、補正指令軌跡に対しマニピュレータ１が追従するよう制御する。すなわち、制御部８は、指令点列計算部７からの指令点列に対し、マニピュレータ１が追従するよう制御する。 The control unit 8 controls the manipulator 1 to follow the correction command trajectory. That is, the controller 8 controls the manipulator 1 to follow the command point sequence from the command point sequence calculator 7 .

　図５は、実施の形態１における制御部８の一例を示すブロック図である。制御部８は、フィードフォワード制御部８１と、フィードバック制御部８２と、電流値計算部８３とを備える。 FIG. 5 is a block diagram showing an example of the control section 8 according to the first embodiment. The controller 8 includes a feedforward controller 81 , a feedback controller 82 and a current value calculator 83 .

　フィードフォワード制御部８１は、指令点列計算部７からの指令点列に対し、例えば平滑化などのフィルタ処理を行い、平滑化後の指令点列として出力する。フィードフォワード制御部８１は、指令点列計算部７からの指令点列に対し、モデル化された逆伝達関数を適用することで、アクチュエータ１１０に入力する電流のフィードフォワード値を計算して出力する。逆伝達関数とは、制御対象であるアクチュエータ１１０の伝達関数に対する逆関数である。センサによってアクチュエータ１１０で発生する外乱を検知できる場合、フィードフォワード制御部８１は外乱信号に対し、モデル化された逆伝達関数を適用してもよい。この場合、フィードフォワード制御部８１は、外乱信号に対し逆伝達関数を適用したものを上記の電流のフィードフォワード値に含めて出力する。外乱の一例として、作業者がマニピュレータ１に接触することによる振動が挙げられる。 The feedforward control unit 81 performs filter processing such as smoothing on the command point sequence from the command point sequence calculation unit 7, and outputs it as a smoothed command point sequence. The feedforward control unit 81 applies a modeled inverse transfer function to the command point sequence from the command point sequence calculation unit 7 to calculate and output the feedforward value of the current input to the actuator 110. . The inverse transfer function is an inverse function of the transfer function of actuator 110, which is the object to be controlled. If the sensor can detect the disturbance generated by the actuator 110, the feedforward control unit 81 may apply a modeled inverse transfer function to the disturbance signal. In this case, the feedforward control unit 81 outputs the current feedforward value including the result obtained by applying the inverse transfer function to the disturbance signal. An example of the disturbance is vibration caused by contact with the manipulator 1 by the operator.

　フィードバック制御部８２は、フィードフォワード制御部８１からの平滑化後の指令点列にアクチュエータ１１０が追従するようフィードバック制御を行い、アクチュエータ１１０に入力する電流のフィードバック値を計算して出力する。 The feedback control unit 82 performs feedback control so that the actuator 110 follows the smoothed command point sequence from the feedforward control unit 81, and calculates and outputs the feedback value of the current input to the actuator 110.

　電流値計算部８３は、フィードフォワード制御部８１からの電流のフィードフォワード値と、フィードバック制御部８２からの電流のフィードバック値とに基づいて、アクチュエータ１１０へ入力する電流値を計算する。 The current value calculator 83 calculates the current value to be input to the actuator 110 based on the current feedforward value from the feedforward controller 81 and the current feedback value from the feedback controller 82 .

　図６は、実施の形態１におけるロボット制御装置１００の動作の一例を示すフローチャートである。すなわち、図６は、実施の形態１におけるロボット制御方法の一例を示すフローチャートである。 FIG. 6 is a flow chart showing an example of the operation of the robot control device 100 according to the first embodiment. That is, FIG. 6 is a flow chart showing an example of the robot control method according to the first embodiment.

　図６に示すように、図示しない手段によりロボット制御が開始されると、速度計算部４は、予め設定されたマニピュレータ１への指令軌跡と、マニピュレータ１に関する制約条件と、マニピュレータ１の動作時間に基づく評価指標とに基づいて、マニピュレータ１の速度プロファイルを計算する（ステップＳＴ１）。 As shown in FIG. 6, when the robot control is started by a means (not shown), the velocity calculation unit 4 calculates a preset command trajectory to the manipulator 1, constraints on the manipulator 1, and operation time of the manipulator 1. A velocity profile of the manipulator 1 is calculated (step ST1).

　勾配計算部５は、マニピュレータ１の動作時間の指令軌跡に関する勾配を計算して勾配情報として出力する（ステップＳＴ２）。 The gradient calculation unit 5 calculates the gradient of the commanded trajectory of the manipulator 1 and outputs it as gradient information (step ST2).

　指令軌跡補正部６は、勾配情報に基づいて指令軌跡を補正して補正指令軌跡として出力する（ステップＳＴ３）。 The command trajectory correction unit 6 corrects the command trajectory based on the gradient information and outputs it as a corrected command trajectory (step ST3).

　指令軌跡記憶部３は、補正指令軌跡を記憶する（ステップＳＴ４）。 The command trajectory storage unit 3 stores the corrected command trajectory (step ST4).

　指令点列計算部７は、補正指令軌跡と速度プロファイルとに基づいて、指令点列を計算する（ステップＳＴ５）。 The command point sequence calculation unit 7 calculates a command point sequence based on the corrected command trajectory and speed profile (step ST5).

　制御部８は、補正指令軌跡に対しマニピュレータ１が追従するようアクチュエータ１１０を制御する（ステップＳＴ６）。 The control unit 8 controls the actuator 110 so that the manipulator 1 follows the corrected command locus (step ST6).

　図示しない手段により、ロボットの制御を継続するか否かが判定される（ステップＳＴ７）。 A means (not shown) determines whether or not to continue controlling the robot (step ST7).

　ステップＳＴ７の判定が「Ｙｅｓ」の場合は、処理はステップＳＴ６に戻り、ロボットの制御が継続される。ステップＳＴ７の判定が「Ｎｏ」の場合は、ロボットの制御が終了する。ロボットの制御が終了するのは、例えばマニピュレータ１が補正指令軌跡の終点にたどり着いた場合である。あるいは、マニピュレータ１が異常動作したと判定された場合である。この判定は、図示しない手段により行われる。 If the determination in step ST7 is "Yes", the process returns to step ST6 to continue controlling the robot. If the determination in step ST7 is "No", control of the robot ends. Robot control ends when, for example, the manipulator 1 reaches the end point of the correction command trajectory. Alternatively, it is a case where it is determined that the manipulator 1 has operated abnormally. This determination is made by means not shown.

　以上で説明した実施の形態１によれば、マニピュレータ１に関する制約条件とマニピュレータ１の動作時間とに基づく評価指標に基づいて速度プロファイルを計算し、動作時間の指令軌跡に関する勾配に基づいて指令軌跡を補正するため、制約条件を満たし、動作時間が短くなるようなマニピュレータ１の補正指令軌跡と速度プロファイルとを高速に求めることができる。 According to the first embodiment described above, the velocity profile is calculated based on the evaluation index based on the constraints on the manipulator 1 and the operating time of the manipulator 1, and the commanded trajectory is calculated based on the gradient of the commanded trajectory of the operating time. Since the correction is performed, it is possible to quickly obtain the correction command trajectory and velocity profile of the manipulator 1 that satisfy the constraint conditions and shorten the operation time.

実施の形態２．
　実施の形態２では、周辺環境１１３で予め取得されるマニピュレータ１周辺の周辺情報に基づいて、指令軌跡を生成する。 Embodiment 2.
In the second embodiment, the command trajectory is generated based on peripheral information around the manipulator 1 acquired in advance from the peripheral environment 113 .

　図７は、実施の形態２におけるロボット制御装置１００ａの一例を示すブロック図である。図７は、ロボット制御装置１００ａがロボット制御装置１００の構成要素に加え、周辺環境情報記憶部９と、指令軌跡生成部１０と、速度プロファイル記憶部１１とを備える点で、図１とは異なる。また、図７は、指令軌跡補正部６の代わりに指令軌跡補正部６ａを備える点で、図１とは異なる。周辺環境情報記憶部９、指令軌跡生成部１０、速度プロファイル記憶部１１および指令軌跡補正部６ａ以外は、図１に示すものと同じであるため、説明を省略する。 FIG. 7 is a block diagram showing an example of the robot control device 100a according to the second embodiment. 7 differs from FIG. 1 in that the robot control device 100a includes a peripheral environment information storage unit 9, a command trajectory generation unit 10, and a speed profile storage unit 11 in addition to the components of the robot control device 100. . 7 differs from FIG. 1 in that a command trajectory correction unit 6a is provided instead of the command trajectory correction unit 6. FIG. 1 except for the surrounding environment information storage unit 9, the command trajectory generation unit 10, the speed profile storage unit 11, and the command trajectory correction unit 6a, description thereof will be omitted.

　周辺環境情報記憶部９は、マニピュレータ１周辺の周辺情報を記憶する。具体的には、周辺環境情報記憶部９は、周辺環境１１３によって取得されたマニピュレータ１周辺の障害物の位置および形状などの情報を周辺情報として記憶する。周辺情報を記憶するためのデータ構造としては、例えば点群、ボクセル、ポリゴンメッシュ、および直方体などの基本形状でもよいし、複数の基本形状を組み合わせたものでもよい。また、データ構造として、後に説明する距離関数計算部６１が行う距離計算などの高速化が図れるバウンディングボリューム階層であってもよい。 The peripheral environment information storage unit 9 stores peripheral information around the manipulator 1 . Specifically, the peripheral environment information storage unit 9 stores information such as the position and shape of obstacles around the manipulator 1 acquired from the peripheral environment 113 as peripheral information. The data structure for storing peripheral information may be, for example, basic shapes such as point groups, voxels, polygon meshes, and rectangular parallelepipeds, or a combination of a plurality of basic shapes. Alternatively, the data structure may be a bounding volume layer that can speed up the distance calculation performed by the distance function calculator 61, which will be described later.

　指令軌跡生成部１０は、周辺環境情報記憶部９で記憶された周辺情報に基づいて、指令軌跡を生成する。具体的には、指令軌跡生成部１０は、周辺情報に基づいて、マニピュレータ１が周辺の障害物と環境しないような指令軌跡を生成し、指令軌跡記憶部３で記憶されている指令軌跡の更新を行う。指令軌跡を生成するアルゴリズムとして、例えばＲＲＴ（Ｒａｐｉｄｌｙ－ｅｘｐｌｏｒｉｎｇ　Ｒａｎｄｏｍ　Ｔｒｅｅ）およびＰＲＭ（Ｐｒｏｂａｂｉｌｉｓｔｉｃ　ＲｏａｄＭａｐ）などを用いてもよい。 The command trajectory generation unit 10 generates a command trajectory based on the surrounding information stored in the surrounding environment information storage unit 9. Specifically, the command trajectory generation unit 10 generates a command trajectory based on the peripheral information so that the manipulator 1 does not surround the manipulator 1 with surrounding obstacles, and updates the command trajectory stored in the command trajectory storage unit 3. I do. For example, RRT (Rapidly-exploring Random Tree) and PRM (Probabilistic Road Map) may be used as algorithms for generating the command trajectory.

　指令軌跡生成部１０で生成される指令軌跡は、予め設定された初期軌道ではなく、マニピュレータ１周辺の障害物との干渉を避けるような指令軌跡であり、自律的に生成されるものである。このため、指令軌跡生成部１０は、例えば指令軌跡の始点と終点とを入力するだけで、指令軌跡を自動的に生成することができる。 The command trajectory generated by the command trajectory generator 10 is not a preset initial trajectory, but a command trajectory that avoids interference with obstacles around the manipulator 1, and is generated autonomously. Therefore, the command trajectory generator 10 can automatically generate the command trajectory only by inputting, for example, the start point and the end point of the command trajectory.

　指令軌跡補正部６ａは、指令軌跡記憶部３で記憶された指令軌跡と、周辺環境情報記憶部９で記憶された周辺情報と、勾配計算部５からの勾配情報とに基づいて、指令軌跡を補正し、補正指令軌跡として出力する。補正指令軌跡は、指令軌跡記憶部３で記憶される。 The command trajectory correction unit 6a corrects the command trajectory based on the command trajectory stored in the command trajectory storage unit 3, the surrounding information stored in the surrounding environment information storage unit 9, and the gradient information from the gradient calculation unit 5. It is corrected and output as a correction command trajectory. The corrected command trajectory is stored in the command trajectory storage unit 3 .

　図８は、実施の形態２における指令軌跡補正部６ａの一例を示すブロック図である。指令軌跡補正部６ａは、距離関数計算部６１と、バリア関数計算部６２と、バリア関数勾配計算部６３と、指令軌跡補正値計算部６４とを備える。 FIG. 8 is a block diagram showing an example of the command locus correction unit 6a according to the second embodiment. The commanded trajectory correction unit 6 a includes a distance function calculation unit 61 , a barrier function calculation unit 62 , a barrier function gradient calculation unit 63 , and a command trajectory correction value calculation unit 64 .

　距離関数計算部６１は、指令軌跡記憶部３で記憶された指令軌跡と、周辺環境情報記憶部９で記憶された周辺情報とに基づいて、指令軌跡と周辺の障害物との距離関数の値を計算する。 Based on the commanded trajectory stored in the commanded trajectory storage unit 3 and the surrounding information stored in the surrounding environment information storage unit 9, the distance function calculation unit 61 calculates the value of the distance function between the commanded trajectory and surrounding obstacles. to calculate

　バリア関数計算部６２は、距離関数の値がある値以下になると関数の値が発散するようなバリア関数を構成し、距離関数計算部６１からの距離関数の値に基づいて、バリア関数の値を計算する。 The barrier function calculator 62 constructs a barrier function such that the value of the function diverges when the value of the distance function becomes a certain value or less. to calculate

　バリア関数勾配計算部６３は、バリア関数計算部６２からのバリア関数の値の指令軌跡に関する勾配を計算する。バリア関数勾配計算部６３は、例えば自動微分を用いて勾配を計算する。 The barrier function gradient calculator 63 calculates the gradient of the barrier function value from the barrier function calculator 62 with respect to the commanded trajectory. The barrier function gradient calculator 63 calculates gradients using, for example, automatic differentiation.

　指令軌跡補正値計算部６４は、勾配計算部５からのマニピュレータ１の動作時間の指令軌跡に関する勾配と、バリア関数勾配計算部６３からのバリア関数の値の指令軌跡に関する勾配とを合わせて勾配情報とし、この勾配情報に基づいて、動作時間とバリア関数の値との和が小さくなるように、指令軌跡の補正値を計算する。 The command trajectory correction value calculation unit 64 combines the gradient regarding the command trajectory of the operating time of the manipulator 1 from the gradient calculation unit 5 and the gradient regarding the command trajectory of the value of the barrier function from the barrier function gradient calculation unit 63 to obtain gradient information. Then, based on this gradient information, a correction value for the commanded trajectory is calculated so that the sum of the operating time and the value of the barrier function becomes small.

　図７に戻り、速度プロファイル記憶部１１は、速度計算部４からの速度プロファイルを記憶する。ロボット制御装置１００ａが速度プロファイル記憶部１１を備えることにより、速度計算部４が速度プロファイルを計算するタイミングを任意に設定することができる。すなわち、制御部８がアクチュエータ１１０を制御する直前だけでなく、例えばマニピュレータ１が動作していない間、あるいはマニピュレータ１が他の動作を行っている間などに速度プロファイルを計算することができる。 Returning to FIG. 7, the speed profile storage unit 11 stores the speed profile from the speed calculation unit 4. By providing the speed profile storage unit 11 in the robot control device 100a, the timing at which the speed calculation unit 4 calculates the speed profile can be arbitrarily set. That is, the velocity profile can be calculated not only immediately before the control unit 8 controls the actuator 110 but also, for example, while the manipulator 1 is not operating or while the manipulator 1 is performing other operations.

　指令点列計算部７は、指令軌跡記憶部３で記憶された補正指令軌跡と、速度プロファイル記憶部１１で記憶された速度プロファイルとに基づいて、所定のサンプリング周期毎の指令点列を計算する。 The command point sequence calculation unit 7 calculates a command point sequence for each predetermined sampling period based on the corrected command trajectory stored in the command trajectory storage unit 3 and the speed profile stored in the speed profile storage unit 11. .

　図９は、実施の形態２におけるロボット制御装置１００ａの動作の一例を示すフローチャートである。すなわち、図９は、実施の形態２におけるロボット制御方法の一例を示すフローチャートである。図９のステップＳＴ１からステップＳＴ７は、図６のステップＳＴ１からステップＳＴ７と同じであるため、ここでは詳細説明を省略する。 FIG. 9 is a flow chart showing an example of the operation of the robot control device 100a according to the second embodiment. That is, FIG. 9 is a flow chart showing an example of the robot control method according to the second embodiment. Since steps ST1 to ST7 in FIG. 9 are the same as steps ST1 to ST7 in FIG. 6, detailed description thereof is omitted here.

　図９に示すように、図示しない手段によりロボット制御が開始されると、指令軌跡生成部１０は、周辺環境情報記憶部９で記憶された周辺情報に基づいて、指令軌跡を生成する。（ステップＳＴ８）。
As shown in FIG. 9 , when robot control is started by means (not shown), the command trajectory generator 10 generates a command trajectory based on the surrounding information stored in the surrounding environment information storage unit 9 . (Step ST8).

　速度計算部４は、マニピュレータ１への指令軌跡と、マニピュレータ１に関する制約条件と、マニピュレータ１の動作時間に基づく評価指標とに基づいて、マニピュレータ１の速度プロファイルを計算する（ステップＳＴ１）。 The velocity calculation unit 4 calculates the velocity profile of the manipulator 1 based on the command trajectory to the manipulator 1, the constraint on the manipulator 1, and the evaluation index based on the operation time of the manipulator 1 (step ST1).

　速度プロファイル記憶部１１は、速度プロファイルを記憶する（ステップＳＴ９）。 The speed profile storage unit 11 stores the speed profile (step ST9).

　指令軌跡補正部６ａは、バリア関数の値の指令軌跡に関する勾配を計算し、ステップＳＴ２により計算された勾配と合わせたものを勾配情報として出力する（ステップＳＴ１０）。 The commanded trajectory correction unit 6a calculates the gradient of the value of the barrier function with respect to the commanded trajectory, and outputs the gradient information combined with the gradient calculated in step ST2 (step ST10).

　指令軌跡補正部６ａは、勾配情報に基づいて指令軌跡を補正して補正指令軌跡として出力する（ステップＳＴ３）。 The command trajectory correction unit 6a corrects the command trajectory based on the gradient information and outputs it as a corrected command trajectory (step ST3).

　ステップＳＴ７の判定が「Ｙｅｓ」の場合は、処理はステップＳＴ６に戻り、ロボットの制御が継続される。ステップＳＴ７の判定が「Ｎｏ」の場合は、ロボットの制御が終了する。 If the determination in step ST7 is "Yes", the process returns to step ST6 to continue controlling the robot. If the determination in step ST7 is "No", control of the robot ends.

　以上で説明した実施の形態２によれば、指令軌跡生成部１０が周辺環境１１３からの周辺情報に基づいて指令軌跡を生成するため、障害物との干渉を避けつつ動作時間が短くなるようなマニピュレータ１の補正指令軌跡と速度プロファイルとを高速に求めることができる。 According to the second embodiment described above, since the command trajectory generation unit 10 generates the command trajectory based on the surrounding information from the surrounding environment 113, the operation time is shortened while avoiding interference with obstacles. The correction command trajectory and velocity profile of the manipulator 1 can be obtained at high speed.

実施の形態３．
　実施の形態３では、タブレットなどの入出力装置１２を用いて、マニピュレータ１の制御を行う。 Embodiment 3.
In Embodiment 3, the manipulator 1 is controlled using an input/output device 12 such as a tablet.

　図１０は、実施の形態３におけるロボット制御装置１００ｂの一例を示すブロック図である。図１０は、ロボット制御装置１００ｂが入出力装置１２と図示しない手段により接続されている点で、図１とは異なる。なお、ロボット制御装置１００ｂは、実施の形態１におけるロボット制御装置１００と同じ構成であるため、説明を省略する。なお、ロボット制御装置１００ｂは、実施の形態２におけるロボット制御装置１００ａと同じ構成であってもよい。 FIG. 10 is a block diagram showing an example of a robot control device 100b according to the third embodiment. FIG. 10 differs from FIG. 1 in that the robot control device 100b is connected to the input/output device 12 by a means (not shown). Since the robot control device 100b has the same configuration as the robot control device 100 in the first embodiment, the description thereof is omitted. The robot control device 100b may have the same configuration as the robot control device 100a in the second embodiment.

　入出力装置１２は、マニピュレータ１の動作情報と周辺環境１１３から取得される周辺情報とを画面に表示する。動作情報は、例えばマニピュレータ１が動作している映像のことである。入出力装置１２は、作業者が入力した動作情報をロボット制御装置１００ｂに出力する。例えば、入出力装置１２は、画面上のタッチパネルあるいは音声インターフェースを通じて、作業者にマニピュレータ１の動作の始点と終点とを入力させてもよいし、作業者がタブレットの画面を指でトレースすることで、マニピュレータ１の把持部１１１の大まかな指令軌跡を入力させてもよい。この指令軌跡は、指令軌跡記憶部３で記憶される。なお、作業者は、マニピュレータ１が動作中に入出力装置１２を用いて停止させることもできる。この場合、入出力装置１２は、画面上に「動作停止」の表示をさせ、作業者にその表示をタッチさせることで、作業停止の命令をロボット制御装置１００ｂへ送信する。これにより、ロボット制御装置１００ｂは、マニピュレータ１の動作を停止させる。 The input/output device 12 displays operation information of the manipulator 1 and peripheral information acquired from the peripheral environment 113 on the screen. The motion information is, for example, an image of the manipulator 1 in motion. The input/output device 12 outputs motion information input by the operator to the robot control device 100b. For example, the input/output device 12 may allow the operator to input the start point and end point of the operation of the manipulator 1 through a touch panel on the screen or a voice interface, or allow the operator to trace the screen of the tablet with a finger. , a rough command trajectory of the gripping portion 111 of the manipulator 1 may be input. This commanded trajectory is stored in the commanded trajectory storage unit 3 . The operator can also use the input/output device 12 to stop the manipulator 1 during operation. In this case, the input/output device 12 displays "operation stop" on the screen and causes the operator to touch the display, thereby transmitting a work stop command to the robot control device 100b. As a result, the robot control device 100b stops the manipulator 1 from operating.

　図１１は、実施の形態３におけるロボット制御装置１００ｂの動作の一例を示すフローチャートである。すなわち、図１１は、実施の形態３におけるロボット制御方法の一例を示すフローチャートである。図１１のステップＳＴ１からステップＳＴ６は、図６のステップＳＴ１からステップＳＴ６と同じであるため、ここでは詳細説明を省略する。 FIG. 11 is a flow chart showing an example of the operation of the robot control device 100b according to the third embodiment. That is, FIG. 11 is a flow chart showing an example of the robot control method according to the third embodiment. Since steps ST1 to ST6 in FIG. 11 are the same as steps ST1 to ST6 in FIG. 6, detailed description thereof is omitted here.

　図１１に示すように、図示しない手段によりロボット制御が開始されると、速度計算部４は、予め設定されたマニピュレータ１への指令軌跡と、マニピュレータ１に関する制約条件と、マニピュレータ１の動作時間に基づく評価指標とに基づいて、マニピュレータ１の速度プロファイルを計算する（ステップＳＴ１）。 As shown in FIG. 11, when the robot control is started by a means (not shown), the velocity calculation unit 4 calculates a preset command trajectory to the manipulator 1, constraints on the manipulator 1, and operation time of the manipulator 1. A velocity profile of the manipulator 1 is calculated (step ST1).

　図示しない手段により、ロボットの制御を継続するか否かが判定される（ステップＳＴ１１）。 A means (not shown) determines whether or not to continue controlling the robot (step ST11).

　ステップＳＴ１１の判定が「Ｙｅｓ」の場合は、処理はステップＳＴ６に戻り、マニピュレータ１の制御が継続される。ステップＳＴ１１の判定が「Ｎｏ」の場合は、ロボットの制御が終了する。ロボットの制御が終了するのは、例えばマニピュレータ１が補正指令軌跡の終点にたどり着いた場合である。あるいは、マニピュレータ１が異常動作したと判定された場合である。あるいは、入出力装置１２から「動作停止」の命令がロボット制御装置１００ｂへ送信された場合である。この判定は、図示しない手段により行われる。 If the determination in step ST11 is "Yes", the process returns to step ST6 and control of the manipulator 1 is continued. If the determination in step ST11 is "No", control of the robot ends. Robot control ends when, for example, the manipulator 1 reaches the end point of the correction command trajectory. Alternatively, it is a case where it is determined that the manipulator 1 has operated abnormally. Alternatively, the I/O device 12 sends a command to "stop operation" to the robot control device 100b. This determination is made by means not shown.

　以上で説明した実施の形態３によれば、入出力装置１２を用いてマニピュレータ１の制御を行うことで、マニピュレータ１の動作を可視化することができる。 According to the third embodiment described above, the operation of the manipulator 1 can be visualized by controlling the manipulator 1 using the input/output device 12 .

実施の形態４．
　実施の形態４では、マニピュレータ１が移動架台１３の上に設置された状態で、マニピュレータ１の制御を行う。 Embodiment 4.
In Embodiment 4, the manipulator 1 is controlled while the manipulator 1 is installed on the movable platform 13 .

　図１２は、実施の形態４におけるロボット制御装置１００ｃの一例を示すブロック図である。図１２は、マニピュレータ１が移動架台１３の上に設置される点で、図１とは異なる。また、図１２は、ロボット制御装置１００ｃが制約条件記憶部２の代わりに制約条件記憶部２ｃを備える点、速度計算部４の代わりに速度計算部４ｃを備える点、および勾配計算部５の代わりに勾配計算部５ｃを備える点で、図１とは異なる。制約条件記憶部２ｃ、速度計算部４ｃおよび勾配計算部５ｃ以外は、図１に示すものと同じであるため、説明を省略する。なお、ロボット制御装置１００ｃは、実施の形態１におけるロボット制御装置１００をベースとしているが、実施の形態２におけるロボット制御装置１００ａあるいは実施の形態３におけるロボット制御装置１００ｂをベースとしてもよい。 FIG. 12 is a block diagram showing an example of the robot control device 100c according to the fourth embodiment. FIG. 12 is different from FIG. 1 in that the manipulator 1 is installed on a moving platform 13. FIG. 12, the robot control device 100c includes a constraint storage unit 2c instead of the constraint storage unit 2, a speed calculation unit 4c instead of the speed calculation unit 4, and a slope calculation unit 5 instead of the slope calculation unit 5. differs from FIG. 1 in that it includes a gradient calculator 5c. Except for the constraint storage unit 2c, the velocity calculation unit 4c, and the gradient calculation unit 5c, they are the same as those shown in FIG. The robot control device 100c is based on the robot control device 100 in the first embodiment, but may be based on the robot control device 100a in the second embodiment or the robot control device 100b in the third embodiment.

　制約条件記憶部２ｃは、マニピュレータ１に関して予め設定された制約条件のパラメータを記憶する。制約条件は、マニピュレータ１の関節角速度、関節角加速度、関節トルク、手先速度、手先加速度、前記マニピュレータ１が把持する対象で発生する力、および前記対象で発生するモーメントに加え、マニピュレータ１が設置される移動架台１３に与える反力、および反力のトルク成分のうち少なくとも１つに関する条件を含む。 The constraint storage unit 2c stores parameters of constraints set in advance for the manipulator 1. Constraints are the joint angular velocity, joint angular acceleration, joint torque, hand velocity, hand acceleration of the manipulator 1, the force generated by the object gripped by the manipulator 1, and the moment generated by the object, and the manipulator 1 is installed. It includes conditions related to at least one of the reaction force applied to the movable platform 13 and the torque component of the reaction force.

　なお、制約条件のパラメータが予め設定される場合について説明したが、ロボット制御装置１００ｃが図示しない反力制約学習部を備えてもよい。反力制約学習部は、マニピュレータ１が設置される移動架台１３に与える反力および反力のトルク成分のうち少なくとも１つについて、機械学習を用いて制約条件のパラメータを学習し、得られた制約条件のパラメータを制約条件記憶部２ｃに記憶してもよい。具体的には、反力制約学習部は、マニピュレータ１の動作中に、移動架台１３に与える反力および反力のトルク成分を図示しないセンサによって取得し、取得した値に基づいて、制約条件のパラメータとして学習する。 Although the case where the parameters of the constraint conditions are set in advance has been described, the robot control device 100c may be provided with a reaction force constraint learning section (not shown). The reaction force constraint learning unit uses machine learning to learn the parameters of the constraint conditions for at least one of the reaction force applied to the movable platform 13 on which the manipulator 1 is installed and the torque component of the reaction force. Parameters of conditions may be stored in the constraint condition storage unit 2c. Specifically, the reaction force constraint learning unit acquires the reaction force applied to the movable platform 13 and the torque component of the reaction force by a sensor (not shown) during the operation of the manipulator 1, and determines the constraint conditions based on the acquired values. Learn as parameters.

　速度計算部４ｃは、指令軌跡記憶部３からの指令軌跡と、制約条件記憶部２ｃからの制約条件のパラメータとに基づいて、制約条件の範囲内でマニピュレータ１の動作時間を短くする指令軌跡上の速度プロファイルを計算する。 Based on the command trajectory from the command trajectory storage unit 3 and the parameters of the constraint conditions from the constraint condition storage unit 2c, the speed calculation unit 4c calculates a command trajectory that shortens the operation time of the manipulator 1 within the range of the constraint conditions. Calculate the velocity profile of

　速度計算部４ｃの構成は、図３に示す速度計算部４の構成と同じであるが、動力学計算部４２の動力学計算に、マニピュレータ１の動作時に移動架台１３に与える反力および反力のトルク成分の計算を含める点、および制約条件係数計算部４３が反力および反力のトルク成分に関する制約条件も含めて制約条件の係数を計算する点が、速度計算部４とは異なる。なお、ロボット制御装置１００ｃが反力制約学習部を備える場合、速度計算部４ｃは、指令軌跡と、反力制約学習部で学習した制約条件のパラメータとに基づいて、速度プロファイルを計算する。 The configuration of the speed calculation unit 4c is the same as the configuration of the speed calculation unit 4 shown in FIG. and the constraint coefficient calculator 43 calculates the coefficient of the constraint including the reaction force and the torque component of the reaction force. If the robot control device 100c includes a reaction force constraint learning unit, the speed calculation unit 4c calculates a speed profile based on the command trajectory and the constraint parameters learned by the reaction force constraint learning unit.

　勾配計算部５ｃは、マニピュレータ１の動作時間の指令軌跡に関する勾配を計算し、勾配情報として出力する。 The gradient calculation unit 5c calculates the gradient regarding the commanded trajectory of the manipulator 1 operation time and outputs it as gradient information.

　勾配計算部５ｃの構成は、図４に示す勾配計算部５の構成と同じであるが、制約条件係数勾配計算部５３が、マニピュレータ１の動作時に移動架台１３に与える反力および反力のトルク成分も考慮して、動作時間の運動学計算結果に関する勾配、および動作時間の動力学計算結果に関する勾配を計算する点が、勾配計算部５とは異なる。また、プロファイル勾配計算部５４が、マニピュレータ１の動作時に移動架台１３に与える反力および反力のトルク成分も考慮して勾配を計算する点が、勾配計算部５とは異なる。 The configuration of the gradient calculator 5c is the same as the configuration of the gradient calculator 5 shown in FIG. It is different from the gradient calculator 5 in that it calculates the gradient for the kinematic calculation result of the action time and the gradient for the dynamic calculation result of the action time, taking into consideration the components as well. Further, the profile gradient calculator 54 differs from the gradient calculator 5 in that the gradient is calculated in consideration of the reaction force applied to the movable platform 13 during the operation of the manipulator 1 and the torque component of the reaction force.

　以上のように、ロボット制御装置１００ｃが制約条件記憶部２ｃ、速度計算部４ｃおよび勾配計算部５ｃを備えることで、移動架台１３への反力に起因する振動を抑制することができ、マニピュレータ１の動作時間を短くすることができる。 As described above, the robot control device 100c includes the constraint storage unit 2c, the speed calculation unit 4c, and the gradient calculation unit 5c. operation time can be shortened.

　図１３は、実施の形態４におけるロボット制御装置１００ｃの動作の一例を示すフローチャートである。すなわち、図１３は、実施の形態４におけるロボット制御方法の一例を示すフローチャートである。図１３のステップＳＴ１からステップＳＴ７は、図６のステップＳＴ１からステップＳＴ７と同じであるため、ここでは詳細説明を省略する。 FIG. 13 is a flow chart showing an example of the operation of the robot control device 100c according to the fourth embodiment. That is, FIG. 13 is a flow chart showing an example of the robot control method according to the fourth embodiment. Since steps ST1 to ST7 in FIG. 13 are the same as steps ST1 to ST7 in FIG. 6, detailed description thereof is omitted here.

　図１３に示すように、図示しない手段によりロボット制御が開始されると、速度計算部４ｃは、予め設定されたマニピュレータ１への指令軌跡と、マニピュレータ１に関する制約条件と、マニピュレータ１の動作時間に基づく評価指標とに基づいて、マニピュレータ１の速度プロファイルを計算する（ステップＳＴ１）。速度計算部４ｃ内の動力学計算部４２は、マニピュレータ１の動作時に移動架台１３に与える反力および反力のトルク成分の計算を含めて運動学計算と動力学計算とを行う。また、速度計算部４ｃ内の制約条件係数計算部４３は、移動架台１３に与える反力および反力のトルク成分に関する制約条件も含めて、制約条件の係数を計算する。 As shown in FIG. 13, when the robot control is started by a means (not shown), the speed calculation unit 4c calculates a preset command trajectory to the manipulator 1, constraints on the manipulator 1, and operation time of the manipulator 1. A velocity profile of the manipulator 1 is calculated (step ST1). A dynamics calculation unit 42 in the speed calculation unit 4c performs kinematics calculation and dynamics calculation including calculation of the reaction force applied to the movable platform 13 during operation of the manipulator 1 and the torque component of the reaction force. Further, the constraint coefficient calculation unit 43 in the speed calculation unit 4c calculates coefficients of constraint conditions, including the constraint conditions related to the reaction force applied to the movable platform 13 and the torque component of the reaction force.

　勾配計算部５ｃは、マニピュレータ１の動作時間の指令軌跡に関する勾配を計算して勾配情報として出力する（ステップＳＴ２）。勾配計算部５ｃ内の制約条件係数勾配計算部５３は、マニピュレータ１の動作時に移動架台１３に与える反力および反力のトルク成分も考慮して、動作時間の運動学計算結果に関する勾配、および動作時間の動力学計算結果に関する勾配を計算する。また、勾配計算部５ｃ内のプロファイル勾配計算部５４は、移動架台１３に与える反力および反力のトルク成分も考慮して、動作時間の制約条件の係数に関する勾配を計算する。 The gradient calculation unit 5c calculates the gradient of the commanded trajectory of the manipulator 1 during operation and outputs it as gradient information (step ST2). The constraint coefficient gradient calculation unit 53 in the gradient calculation unit 5c calculates the gradient related to the kinematic calculation result of the operation time and the operation time taking into account the reaction force applied to the movable platform 13 during the operation of the manipulator 1 and the torque component of the reaction force. Calculate the gradient for the time dynamics calculation results. In addition, the profile gradient calculator 54 in the gradient calculator 5c calculates the gradient related to the coefficient of the operating time constraint, taking into account the reaction force applied to the movable platform 13 and the torque component of the reaction force.

　ステップＳＴ１１の判定が「Ｙｅｓ」の場合は、処理はステップＳＴ６に戻り、マニピュレータ１の制御が継続される。ステップＳＴ１１の判定が「Ｎｏ」の場合は、ロボットの制御が終了する。 If the determination in step ST11 is "Yes", the process returns to step ST6 and control of the manipulator 1 is continued. If the determination in step ST11 is "No", control of the robot ends.

　以上で説明した実施の形態４によれば、マニピュレータ１の動作中に移動架台１３に与える反力および反力のトルク成分を考慮することで、移動架台１３の振動を抑制することができ、マニピュレータ１の動作時間を短くすることができる。 According to the fourth embodiment described above, it is possible to suppress the vibration of the movable platform 13 by considering the reaction force applied to the movable platform 13 during the operation of the manipulator 1 and the torque component of the reaction force. 1 can be shortened.

　なお、実施の形態４において、ロボット制御装置１００ｃが行うマニピュレータ１の制御は、移動架台１３に対しても適用できる。また、マニピュレータ１が移動架台１３に設置される場合に限定されず、マニピュレータ１が図示しない固定架台に設置される場合にも適用できる。 It should be noted that the control of the manipulator 1 performed by the robot control device 100c in Embodiment 4 can also be applied to the movable platform 13. Further, the present invention is not limited to the case where the manipulator 1 is installed on the movable pedestal 13, but can be applied to the case where the manipulator 1 is installed on a fixed pedestal (not shown).

　実施の形態１から４におけるロボット制御装置１００，１００ａ，１００ｂおよび１００ｃおよびロボット制御方法は、垂直多関節ロボットであるマニピュレータ１以外にも適用できる。例えば、水平多関節ロボットなどの任意の軸構成のマニピュレータ１にも適用できる。また、マニピュレータ１以外の産業装置にも適用できる。 The

robot control devices

100, 100a, 100b, and 100c and the robot control method in Embodiments 1 to 4 can be applied to other than the manipulator 1, which is a vertical articulated robot. For example, it can be applied to a manipulator 1 having an arbitrary axis configuration such as a horizontal articulated robot. Moreover, it can also be applied to industrial devices other than the manipulator 1 .

　ここで、実施の形態１から４におけるロボット制御装置１００，１００ａ，１００ｂおよび１００ｃのハードウェア構成について説明する。ロボット制御装置１００，１００ａ，１００ｂおよび１００ｃの各機能は、処理回路によって実現し得る。処理回路は、少なくとも１つのプロセッサと少なくとも１つのメモリとを備える。 Here, the hardware configuration of the

robot control devices

100, 100a, 100b and 100c in Embodiments 1 to 4 will be described. Each function of the

robot controllers

100, 100a, 100b and 100c can be implemented by processing circuitry. The processing circuitry comprises at least one processor and at least one memory.

　図１４は、実施の形態１から４におけるロボット制御装置１００，１００ａ，１００ｂおよび１００ｃのハードウェア構成を示す図である。ロボット制御装置１００，１００ａ，１００ｂおよび１００ｃは、図１４（ａ）に示すプロセッサ２００およびメモリ２０１によって実現することができる。プロセッサ２００は、例えばＣＰＵ（Ｃｅｎｔｒａｌ　Ｐｒｏｃｅｓｓｉｎｇ　Ｕｎｉｔ、中央処理装置、処理装置、演算装置、マイクロプロセッサ、マイクロコンピュータ、プロセッサ、ＤＳＰ（Ｄｉｇｉｔａｌ　Ｓｉｇｎａｌ　Ｐｒｏｃｅｓｓｏｒ）ともいう）またはシステムＬＳＩ（Ｌａｒｇｅ　Ｓｃａｌｅ　Ｉｎｔｅｇｒａｔｉｏｎ）である。 FIG. 14 is a diagram showing the hardware configuration of the

robot control devices

100, 100a, 100b and 100c according to Embodiments 1-4.

Robot controllers

100, 100a, 100b and 100c can be implemented by processor 200 and memory 201 shown in FIG. 14(a). The processor 200 is, for example, a CPU (Central Processing Unit, central processing unit, processing unit, arithmetic unit, microprocessor, microcomputer, processor, DSP (Digital Signal Processor)) or system LSI (Large Scale Integration).

　メモリ２０１は、例えばＲＡＭ（Ｒａｎｄｏｍ　Ａｃｃｅｓｓ　Ｍｅｍｏｒｙ）、ＲＯＭ（Ｒｅａｄ　Ｏｎｌｙ　Ｍｅｍｏｒｙ）、フラッシュメモリ、ＥＰＲＯＭ（Ｅｒａｓａｂｌｅ　Ｐｒｏｇｒａｍｍａｂｌｅ　Ｒｅａｄ　Ｏｎｌｙ　Ｍｅｍｏｒｙ）、ＥＥＰＲＯＭ（登録商標）（Ｅｌｅｃｔｒｉｃａｌｌｙ　Ｅｒａｓａｂｌｅ　Ｐｒｏｇｒａｍｍａｂｌｅ　Ｒｅａｄ－Ｏｎｌｙ　Ｍｅｍｏｒｙ）などの不揮発性または揮発性の半導体メモリ、ＨＤＤ（Ｈａｒｄ　Ｄｉｓｋ　Ｄｒｉｖｅ）、磁気ディスク、フレキシブルディスク、光ディスク、コンパクトディスク、ミニディスク、またはＤＶＤ（Ｄｉｇｉｔａｌ　Ｖｅｒｓａｔｉｌｅ　Ｄｉｓｋ）などである。 The memory 201 is, for example, a RAM (Random Access Memory), a ROM (Read Only Memory), a flash memory, an EPROM (Erasable Programmable Read Only Memory), an EEPROM (registered trademark) (Electrically Erasable Programmable Read Only Memory) or a non-volatile memory such as a Volatile semiconductor memory, HDD (Hard Disk Drive), magnetic disk, flexible disk, optical disk, compact disk, mini disk, DVD (Digital Versatile Disk), and the like.

　ロボット制御装置１００，１００ａ，１００ｂおよび１００ｃの各部の機能は、ソフトウェアなど（ソフトウェア、ファームウェア、またはソフトウェアとファームウェア）により実現される。ソフトウェアなどはプログラムとして記述され、メモリ２０１に格納される。プロセッサ２００は、メモリ２０１で記憶されているプログラムを読み出して実行することにより、各部の機能を実現する。すなわち、このプログラムは、ロボット制御装置１００，１００ａ，１００ｂおよび１００ｃの手順または方法をコンピュータに実行させるものであると言える。 The functions of each part of the

robot control devices

100, 100a, 100b, and 100c are realized by software (software, firmware, or software and firmware). Software or the like is written as a program and stored in the memory 201 . The processor 200 reads out and executes programs stored in the memory 201 to achieve the functions of each unit. That is, it can be said that this program causes a computer to execute the procedures or methods of the

robot control devices

100, 100a, 100b and 100c.

　プロセッサ２００が実行するプログラムは、インストール可能な形式または実行可能な形式のファイルで、コンピュータが読み取り可能な記憶媒体に記憶されてコンピュータプログラムプロダクトとして提供されてもよい。また、プロセッサ２００が実行するプログラムは、インターネットなどのネットワーク経由でロボット制御装置１００，１００ａ，１００ｂおよび１００ｃに提供されてもよい。 The program executed by the processor 200 may be stored in a computer-readable storage medium in an installable or executable format and provided as a computer program product. Also, the program executed by the processor 200 may be provided to the

robot control devices

100, 100a, 100b and 100c via a network such as the Internet.

　また、ロボット制御装置１００，１００ａ，１００ｂおよび１００ｃは、図１４（ｂ）に示す専用の処理回路２０２によって実現してもよい。処理回路２０２が専用のハードウェアである場合、処理回路２０２は、例えば単一回路、複合回路、プログラム化したプロセッサ、並列プログラム化したプロセッサ、ＡＳＩＣ（Ａｐｐｌｉｃａｔｉｏｎ　Ｓｐｅｃｉｆｉｃ　Ｉｎｔｅｇｒａｔｅｄ　Ｃｉｒｃｕｉｔ）、ＦＰＧＡ（Ｆｉｅｌｄ－Ｐｒｏｇｒａｍｍａｂｌｅ　Ｇａｔｅ　Ａｒｒａｙ）、またはこれらを組み合わせたものなどが該当する。 Also, the

robot control devices

100, 100a, 100b, and 100c may be realized by a dedicated processing circuit 202 shown in FIG. 14(b). If the processing circuit 202 is dedicated hardware, the processing circuit 202 may be, for example, a single circuit, a composite circuit, a programmed processor, a parallel programmed processor, an ASIC (Application Specific Integrated Circuit), or an FPGA (Field-Programmable Gate). Array), or a combination thereof.

　以上、ロボット制御装置１００，１００ａ，１００ｂおよび１００ｃの各構成要素の機能が、ソフトウェアなど、またはハードウェアのいずれか一方で実現される構成について説明した。しかしこれに限ったものではなく、ロボット制御装置１００，１００ａ，１００ｂおよび１００ｃの一部の構成要素をソフトウェアなどで実現し、別の一部を専用のハードウェアで実現する構成であってもよい。 The configuration in which the functions of the constituent elements of the

robot control devices

100, 100a, 100b, and 100c are realized by either software or hardware has been described above. However, the configuration is not limited to this, and may be a configuration in which a part of the constituent elements of the

robot control devices

100, 100a, 100b, and 100c are realized by software, and another part is realized by dedicated hardware. .

　１　マニピュレータ、　２，２ｃ　制約条件記憶部、　３　指令軌跡記憶部、　４，４ｃ　速度計算部、　４１　指令補間計算部、　４２　動力学計算部、　４３　制約条件係数計算部、　４４　プロファイル計算部、　５，５ｃ　勾配計算部、　５１　指令補間勾配計算部、　５２　動力学勾配計算部、　５３　制約条件係数勾配計算部、　５４　プロファイル勾配計算部、　６，６ａ　指令軌跡補正部、　６１　距離関数計算部、　６２　バリア関数計算部、　６３　バリア関数勾配計算部、　６４　指令軌跡補正値計算部、　７　指令点列計算部、　８　制御部、　８１　フィードフォワード制御部、　８２　フィードバック制御部、　８３　電流値計算部、　９　周辺環境情報記憶部、　１０　指令軌跡生成部、　１１０　アクチュエータ、　１１１　把持部、　１１２　ワーク、　１１３　周辺環境、　１３　移動架台、　２００　プロセッサ、　２０１　メモリ、　２０２　処理回路。 1 manipulator, 2, 2c constraint storage unit, 3 command trajectory storage unit, 4, 4c speed calculation unit, 41 command interpolation calculation unit, 42 dynamics calculation unit, 43 constraint coefficient calculation unit, 44 profile calculation unit, 5, 5c gradient calculator, 51 command interpolation gradient calculator, 52 dynamic gradient calculator, 53 constraint coefficient gradient calculator, 54 profile gradient calculator, 6, 6a command trajectory corrector, 61 distance function calculator, 62 barrier function Calculation unit 63 Barrier function gradient calculation unit 64 Command trajectory correction value calculation unit 7 Command point sequence calculation unit 8 Control unit 81 Feedforward control unit 82 Feedback control unit 83 Current value calculation unit 9 Surrounding environment information Storage unit 10 Command trajectory generation unit 110 Actuator 111 Grasping unit 112 Work 113 Surrounding environment 13 Moving platform 200 Processor 201 Memory 202 Processing circuit.

Claims

　予め設定されたマニピュレータへの指令軌跡と、前記マニピュレータに関する制約条件と、前記マニピュレータの動作時間に基づく評価指標とに基づいて、前記マニピュレータの速度プロファイルを計算する速度計算部と、
　前記速度プロファイルに基づいて、前記動作時間の前記指令軌跡に関する勾配を計算して勾配情報とする勾配計算部と、
　前記勾配情報に基づいて前記指令軌跡を補正して補正指令軌跡とする指令軌跡補正部と、
　前記補正指令軌跡に対し前記マニピュレータが追従するよう制御する制御部と、
　を備えるロボット制御装置。 a speed calculation unit that calculates a speed profile of the manipulator based on a preset command trajectory to the manipulator, constraints on the manipulator, and an evaluation index based on the operation time of the manipulator;
a gradient calculation unit that calculates a gradient related to the commanded trajectory of the operation time based on the velocity profile and uses the gradient information as gradient information;
a command trajectory correction unit that corrects the command trajectory based on the gradient information to obtain a corrected command trajectory;
a control unit that controls the manipulator to follow the correction command trajectory;
A robot controller comprising:
　前記マニピュレータ周辺の周辺情報に基づいて、前記指令軌跡を生成する指令軌跡生成部を更に備える請求項１に記載のロボット制御装置。 The robot control device according to claim 1, further comprising a command trajectory generation unit that generates the command trajectory based on peripheral information around the manipulator.
　前記制約条件は、前記マニピュレータの関節角速度、関節角加速度、関節トルク、手先速度、手先加速度、前記マニピュレータが把持する対象で発生する力、および前記対象で発生するモーメントのうち少なくとも１つに関する条件である請求項１または２に記載のロボット制御装置。 The constraint conditions relate to at least one of joint angular velocity, joint angular acceleration, joint torque, hand velocity, hand acceleration, force generated in an object grasped by the manipulator, and moment generated in the object. A robot controller according to claim 1 or 2.
　前記力および前記モーメントのうち少なくとも１つについて、機械学習を用いて前記制約条件のパラメータを学習する把持制約学習部を更に備え、
　前記速度計算部は、前記把持制約学習部で学習した前記パラメータに基づいて、前記加速度プロファイルを計算する請求項３に記載のロボット制御装置。 further comprising a gripping constraint learning unit that learns parameters of the constraint using machine learning for at least one of the force and the moment;
4. The robot control device according to claim 3, wherein the velocity calculation section calculates the acceleration profile based on the parameters learned by the grasping constraint learning section.
　前記制約条件は、前記マニピュレータが設置される架台に与える反力、および前記反力のトルク成分のうち少なくとも１つに関する条件を含む請求項１から４のいずれか１項に記載のロボット制御装置。 The robot control device according to any one of claims 1 to 4, wherein the constraint conditions include conditions relating to at least one of a reaction force applied to the pedestal on which the manipulator is installed and a torque component of the reaction force.
　前記反力および前記反力のトルク成分のうち少なくとも１つについて、機械学習を用いて前記制約条件のパラメータを学習する反力制約学習部を更に備え、
　前記速度計算部は、前記反力制約学習部で学習した前記パラメータに基づいて、前記速度プロファイルを計算する請求項５に記載のロボット制御装置。 Further comprising a reaction force constraint learning unit that learns parameters of the constraint using machine learning for at least one of the reaction force and the torque component of the reaction force,
6. The robot control device according to claim 5, wherein the velocity calculation section calculates the velocity profile based on the parameters learned by the reaction force constraint learning section.
　前記速度計算部は、前記指令軌跡上の補間点における位置と前記指令軌跡の媒介変数に関する一階微分および二階微分とを計算する指令補間計算部と、
　前記位置と前記一階微分と前記二階微分とを用いて前記マニピュレータの運動学計算と動力学計算とを行い、運動学計算結果と動力学計算結果とを出力する動力学計算部と、
　前記運動学計算結果と前記動力学計算結果と前記制約条件とに基づいて、前記制約条件の係数を計算する制約条件係数計算部と、
　前記制約条件の係数に基づいて、前記速度プロファイルを計算するプロファイル計算部と、
　を備える請求項１から６のいずれか１項に記載のロボット制御装置。 The speed calculation unit includes a command interpolation calculation unit that calculates a position at an interpolation point on the command trajectory and a first-order differential and a second-order differential with respect to a parameter of the command trajectory;
a dynamics calculation unit that performs kinematics calculation and dynamics calculation of the manipulator using the position, the first-order differential, and the second-order differential, and outputs a kinematics calculation result and a dynamics calculation result;
a constraint coefficient calculator that calculates the coefficient of the constraint based on the kinematics calculation result, the dynamics calculation result, and the constraint;
a profile calculator that calculates the velocity profile based on the coefficients of the constraint;
The robot control device according to any one of claims 1 to 6, comprising:
　前記勾配計算部は、自動微分を用いて前記動作時間を前記指令軌跡について微分することで、前記動作時間の前記指令軌跡に関する勾配を計算して前記勾配情報とする請求項１から７のいずれか１項に記載のロボット制御装置。 8. The gradient calculator according to any one of claims 1 to 7, wherein the gradient calculation unit differentiates the movement time with respect to the command trajectory using automatic differentiation, thereby calculating a gradient of the movement time with respect to the command trajectory and using it as the gradient information. The robot control device according to item 1.
　前記指令軌跡補正部は、前記勾配情報に基づく勾配降下法、共役勾配法または準ニュートン法のうちいずれか１つを用いることで、前記指令軌跡を補正する請求項１から８のいずれか１項に記載のロボット制御装置。 9. The command trajectory correction unit corrects the command trajectory by using any one of a gradient descent method, a conjugate gradient method, and a quasi-Newton method based on the gradient information. The robot control device according to .
　予め設定されたマニピュレータへの指令軌跡と、前記マニピュレータに関する制約条件と、前記マニピュレータの動作時間に基づく評価指標とに基づいて、前記マニピュレータの速度プロファイルを計算する工程と、
　前記速度プロファイルに基づいて、前記動作時間の前記指令軌跡に関する勾配を計算して勾配情報とする工程と、
　前記勾配情報に基づいて前記指令軌跡を補正して補正指令軌跡とする工程と、
　前記補正指令軌跡に対し前記マニピュレータが追従するよう制御する工程と、
　を備えるロボット制御方法。 calculating a velocity profile of the manipulator based on a preset command trajectory to the manipulator, constraints on the manipulator, and an evaluation index based on the operating time of the manipulator;
calculating a slope of the commanded trajectory of the operating time based on the velocity profile as slope information;
a step of correcting the command trajectory based on the gradient information to obtain a corrected command trajectory;
a step of controlling the manipulator to follow the correction command trajectory;
A robot control method comprising: