JP2607351B2

JP2607351B2 - Error Signal Generation Method for Efficient Learning of Multilayer Perceptron Neural Network

Info

Publication number: JP2607351B2
Application number: JP6313861A
Authority: JP
Inventors: 相▲勲▼ ▲呉▼
Original assignee: 財団法人韓国電子通信研究所
Priority date: 1994-09-30
Filing date: 1994-12-16
Publication date: 1997-05-07
Anticipated expiration: 2012-05-07
Also published as: JPH08115310A; KR960012131A; KR0141341B1

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明はパターン認識問題の学習
に広く使用される多層パーセプトロンの神経回路網（mu
lti-layer perceptron neural networks）モデルの効率
的な学習方法に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a multilayer perceptron neural network (mu) widely used for learning a pattern recognition problem.
lti-layer perceptron neural networks).

【０００２】[0002]

【従来の技術】多層パーセプトロンを学習させるとき、
学習時間が長時間の間所要されるが、幾つかのパターン
に対しては全く学習されない現象が発生する場合もあ
る。2. Description of the Related Art When learning a multilayer perceptron,
Although the learning time is required for a long time, a phenomenon that some patterns are not learned at all may occur.

【０００３】[0003]

【発明が解決しようとする課題】本発明は上記のような
問題点を解消するためになされたもので、多層パーセプ
トロンを利用したパターン認識問題の学習時間を短縮し
て迅速化を図り、学習パターンに対してもありのまま学
習できることを目的とする。SUMMARY OF THE INVENTION The present invention has been made in order to solve the above-mentioned problems, and aims at shortening the learning time of a pattern recognition problem using a multi-layered perceptron to speed up the problem. The purpose is to be able to learn as it is.

【０００４】[0004]

【課題を解決するための手段】本発明に係る請求項１記
載の多層パーセプトロンの神経回路網の効率的な学習の
ための誤差信号の発生方法は、生命体の情報処理を模倣
した神経回路網モデルの一つとして、神経細胞（neuro
n）を意味するノードと、各ノードを連結する連接部加
重値が階層的に構成されている多層パーセプトロンから
誤差信号を発生させる方法において、According to the present invention, there is provided a method for generating an error signal for efficient learning of a neural network of a multilayer perceptron according to the present invention. As one of the models, neurons (neuro
n) and a method of generating an error signal from a multi-layer perceptron in which connection weights connecting the nodes are hierarchically configured,

【数９】で示される前記誤差信号を与える数式に基づいて、前記
多層パーセプトロンの逆伝播学習時に出力ノードが不適
切に飽和される場合は強い誤差信号を発生させ、前記出
力ノードが適切に飽和される場合は弱い誤差信号を発生
させることを特徴とする。(Equation 9) A strong error signal is generated when the output node is improperly saturated during back propagation learning of the multilayer perceptron based on a formula that gives the error signal represented by, and when the output node is appropriately saturated, A weak error signal is generated.

【０００５】[0005]

【作用】本発明に係る請求項１記載の多層パーセプトロ
ンの神経回路網の効率的な学習のための誤差信号の発生
方法によれば、逆伝播学習時に出力ノードが不適切に飽
和される場合は強い誤差信号を発生させ、前記出力ノー
ドが適切に飽和される場合は弱い誤差信号を発生させる
ので、出力ノードが不適切に飽和される現象が減り、神
経網が学習パターンを過度に学習することが防止され
る。According to the method of generating an error signal for efficient learning of a neural network of a multilayer perceptron according to the first aspect of the present invention, when an output node is inappropriately saturated during back propagation learning, Since a strong error signal is generated and a weak error signal is generated when the output node is properly saturated, the phenomenon that the output node is inappropriately saturated is reduced, and the neural network excessively learns the learning pattern. Is prevented.

【０００６】[0006]

【実施例】本発明の説明のために次のように用語を定義
する。DESCRIPTION OF THE PREFERRED EMBODIMENTS For the purpose of describing the present invention, terms are defined as follows.

【０００７】まず、“多層パーセプトロン”とは、生命
体の情報処理を模倣した神経回路網モデルの一つであ
り、図１に図示のように、神経細胞（neuron）を意味す
るノードとノードを連結する連接部加重値（synapse we
ight value）が階層的に構成されている。First, the “multilayer perceptron” is one of the neural network models imitating the information processing of living organisms. As shown in FIG. Weights of connecting parts to be connected (synapse we
ight value) are arranged hierarchically.

【０００８】この多層パーセプトロンの各ノードはその
状態が下層ノードの状態値とその連結加重値の“加重値
の合計”を入力として受け入れて、図２のようにシグモ
イド変換した値を出力する。Each node of the multilayer perceptron receives as input the state value of the lower layer node and the "sum of the weights" of the connection weights thereof, and outputs a sigmoidally converted value as shown in FIG.

【０００９】シグモイド関数は傾きが小さい両側面の飽
和領域と傾きが大きい中央の活性領域と分けられる。The sigmoid function is divided into a saturated region on both sides having a small slope and a central active region having a large slope.

【００１０】“学習パターン”とは、パターン認識問題
を学習させるために任意に収集したパターンである。A "learning pattern" is a pattern arbitrarily collected for learning a pattern recognition problem.

【００１１】“試験パターン”とは、パターン認識問題
の学習程度を試験する基準とするために任意に収集した
パターンである。The "test pattern" is a pattern arbitrarily collected in order to test the degree of learning of the pattern recognition problem.

【００１２】これらのパターンは多数個の“集団”と分
けることができ、“パターン認識”とは、入力されたパ
ターンがどの集団に属するかを決定するものである。These patterns can be divided into a large number of "groups", and "pattern recognition" is to determine which group the input pattern belongs to.

【００１３】最終階層ノードの状態が入力パターンが属
する集団を示す。The state of the last hierarchical node indicates the group to which the input pattern belongs.

【００１４】“逆伝播学習（Back Propagating Trainin
g）”とは、この多層パーセプトロンを学習させる方法
として、学習パターンを入力させた後に、最終階層ノー
ドの出力値が所望の目標値となるように誤差信号により
最終階層ノードに連結された加重値を変更させており、
その下層のノードは上の階層から逆伝播された誤差信号
により連結加重値を変更させる方法である。"Back Propagating Trainin
g) "is a method for learning the multilayer perceptron, in which a learning pattern is input, and then the weight value connected to the final hierarchical node by an error signal so that the output value of the final hierarchical node becomes a desired target value. Has been changed,
In the lower layer, the connection weight is changed by an error signal back-propagated from the upper layer.

【００１５】“誤差関数”とは、逆伝播学習から誤差信
号を如何に発生させるかを決定する関数である。The "error function" is a function for determining how to generate an error signal from back propagation learning.

【００１６】“ノードの飽和とは、ノードの加重値の合
計の入力値がシグモイド関数の傾きが小さい領域に位置
することを指称する。"Node saturation" means that the total input value of the node weights is located in a region where the slope of the sigmoid function is small.

【００１７】ノードが目標値と同じ飽和領域に位置する
と“適切な飽和”、反対側の飽和領域に位置すると“不
適切な飽和”という。If the node is located in the same saturation region as the target value, it is called "appropriate saturation", and if it is located in the opposite saturation region, it is called "inappropriate saturation".

【００１８】多層パーセプトロンの“逆伝播学習アルゴ
リズム”の具体的な内容は図３のように示される。The specific contents of the "back propagation learning algorithm" of the multilayer perceptron are shown in FIG.

【００１９】学習パターンｘ＝［ｘ₁，ｘ₂，…，ｘ_N0］
が入力されると、Ｌ層からなる多層パーセプトロンは、
ステップＳ１において、全ての方向の計算によって１層
のｊ番目のノード状態がLearning pattern x = [x ₁ , x ₂ ,..., X _N0 ]
Is input, the multilayer perceptron composed of L layers becomes
In step S1, the j-th node state of the first layer is calculated by calculation in all directions.

【００２０】[0020]

【数１】 (Equation 1)

【００２１】のように決定される。Is determined as follows.

【００２２】ここで、Here,

【００２３】[0023]

【数２】 (Equation 2)

【００２４】であり、ｗ_ji ^(l)はｘ_i ^(l-1)とｘ_j ^(l)の間
の連結加重値、ｗ_jo ^(l)はｘ_j ^(l)のｂｉａｓを示す。 _Where w _ji ^(l) is the connection weight between x _i ^(l-1) and x _j ^(l) , and w _jo ^(l) is the bias of x _j ^(l) .

【００２５】このように最終階層ノードの状態ｘ_K ^(L)が
求められると、多層パーセプトロンの誤差関数は入力パ
ターンに対する目標パターンｔ＝［ｔ₁，ｔ₂，…，
ｔ_NL］との関係によってWhen the state x _K ^(L) of the last hierarchical node is obtained in this manner, the error function of the multilayer perceptron is calculated as the target pattern t = [t ₁ , t ₂ ,.
t _NL ]

【００２６】[0026]

【数３】 (Equation 3)

【００２７】と定義され、この誤差関数値を減らすよう
に誤差信号が発生され、この誤差信号により各加重値が
変更される。An error signal is generated so as to reduce the error function value, and each weight value is changed by the error signal.

【００２８】即ち、ステップＳ２において出力層の誤差
信号は、That is, in step S2, the error signal of the output layer is:

【００２９】[0029]

【数４】 (Equation 4)

【００３０】と計算される。Is calculated.

【００３１】次にステップＳ３において下層の誤差信号
は逆伝播によって、Next, in step S3, the error signal in the lower layer is

【００３２】[0032]

【数５】 (Equation 5)

【００３３】と計算される。Is calculated.

【００３４】次にステップＳ４において各階層の加重値
は、Next, in step S4, the weight value of each layer is

【００３５】[0035]

【数６】 (Equation 6)

【００３６】により変更されて一つのパターンに対して
学習が行なわれる。The learning is performed for one pattern after being changed.

【００３７】この過程をすべての学習パターンに対して
一回遂行したことをsweepという単位で表示する。The fact that this process has been performed once for all learning patterns is displayed in units of sweep.

【００３８】上述の逆伝播アルゴリズムから、誤差信号
δ_K ^(L)は目標値と実際値の差異にシグモイド活性化関数
の傾きが乗算された形態である。From the above-described backpropagation algorithm, the error signal δ _K ^(L) has a form in which the difference between the target value and the actual value is multiplied by the slope of the sigmoid activation function.

【００３９】もし、ｘ_K ^(L)が-1或いは+1に近接の値であ
ると、傾きに対する項のため、δ_K ^(L)は極めて小さい値
になる。If x _K ^(L) is a value close to -1 or +1 δ _K ^(L) will be a very small value due to the term for the slope.

【００４０】即ち、tk = 1であり、ｘ_K ^(L)が-1に近似し
ている場合、或いはその反対の場合に、ｘ_K ^(L)は連結さ
れた加重値を調整するのに充分に強い誤差信号を発生さ
せない。That is, if tk = 1 and x _K ^(L) is close to -1, or vice versa, x _K ^(L) is sufficient to adjust the concatenated weights. Does not generate a strong error signal.

【００４１】このような出力ノードの不適切な飽和が逆
伝播学習からEmの最小化を遅延させ、あるパターンの学
習を妨害する。Such inappropriate saturation of the output node delays the minimization of Em from the backpropagation learning and hinders the learning of a certain pattern.

【００４２】本発明は学習のための誤差関数を、The present invention provides an error function for learning,

【００４３】[0043]

【数７】 (Equation 7)

【００４４】と変えており、この誤差関数を利用して出
力ノードの誤差信号がUsing this error function, the error signal at the output node is

【００４５】[0045]

【数８】 (Equation 8)

【００４６】となるようにしたものである。ここで、数
式７で示す誤差関数は、数式８で示す誤差信号を示す式
を経験的に定め、これを積分することにより導いたもの
である。そして、数式８を定めるには、先に説明したよ
うに、数式４で示される誤差信号の問題点を認識してい
る必要があり、そのためには数式４で示される誤差信号
をグラフ化すると明確になる。 [0046] Where the number
The error function represented by Equation 7 is an equation representing an error signal represented by Equation 8.
Is determined empirically and is derived by integrating this
It is. And, to determine equation 8,
As described above, the problem of the error signal represented by Equation 4 is recognized.
It is necessary to obtain the error signal
It becomes clear when graphed.

【００４７】図４はtk = 1の場合に、ｘ _K ^(L) を変数とし
て、数式４で示される誤差信号を従来の誤差信号（Conv
entional Error Signal）ＣＥとして表し、数式８で示
される誤差信号を本発明で提案された誤差関数により得
られる提案誤差信号（Proposed Error Signal）ＰＥと
して表している。図４から判るように、従来の誤差信号
ＣＥは、出力ノードが不適切に飽和される場合、すなわ
ちｘ _k ^(L) の値が−１に近づくと、δ _V の値は急激に小さ
く（弱く）なり、出力ノードが適切に飽和される場合、
すなわちｘ _k ^(L) の値が１に近づくと、δ _V の値は徐々に
小さく（弱く）なっている。出力ノードが適切に飽和さ
れる場合は、誤差信号ＣＥは小さくて良いのでこれでも
構わないが、出力ノードが不適切に飽和される場合に
は、誤差信号ＣＥは大きく（強く）なければならない。
数式８で示す誤差信号はこのような条件を満たすために
定められたもので、ｘ _k ^(L) の値が−１に近づくと、誤差
信号ＰＥは急激に大きく（強く）なり、ｘ _k ^(L) の値が１
に近づくと、誤差信号ＰＥは徐々に小さく（弱く）なっ
ている。また、出力ノードが適切に飽和される場合に、
誤差信号ＣＥと誤差信号ＰＥとを比較すると、誤差信号
ＰＥの方が絶対値が小さくなっている。これも、誤差信
号ＣＥの問題点を解消するために考慮されたものであ
る。 FIG. 4 shows that when tk = 1, x _K ^(L) is used as a variable.
Therefore, the error signal represented by Equation 4 is converted to a conventional error signal (Conv
entional Error Signal) expressed as CE
Is obtained by the error function proposed in the present invention.
Proposed Error Signal PE
It is expressed as As can be seen from FIG. 4, the conventional error signal
CE indicates that if the output node is improperly saturated,
When the value of x _k ^(L) approaches -1, the value of δ _V sharply decreases.
And the output node is properly saturated,
That is, when the value of x _k ^(L) approaches 1, the value of δ _V gradually increases.
It is small (weak). Output nodes are properly saturated
Error signal CE can be small,
Okay, but if the output node is improperly saturated
Means that the error signal CE must be large (strong).
The error signal shown in Expression 8 is used to satisfy such a condition.
When the value of x _k ^(L) approaches -1, the error
The signal PE increases rapidly (strongly), and the value of x _k ^(L) becomes 1
, The error signal PE gradually becomes smaller (weaker).
ing. Also, if the output node is properly saturated,
Comparing the error signal CE and the error signal PE, the error signal
The absolute value of PE is smaller. This is also the error signal
No. CE was considered in order to solve the problem.
You.

【００４８】なお、学習のための他の数式は誤差関数Em
を利用した従来の逆伝播アルゴリズムと同一である。The other equations for learning are the error function Em
This is the same as the conventional back-propagation algorithm using.

【００４９】[0049]

【発明の効果】本発明に係る請求項１記載の多層パーセ
プトロンの神経回路網の効率的な学習のための誤差信号
の発生方法によれば、提案された誤差関数を利用した逆
伝播アルゴリズムは、出力層の目標値が出力値との差異
が大差になると強い誤差信号を発生させ、出力ノードが
不適切に飽和される現象を減らし、出力層の目標値が出
力値と近接の値になると弱い誤差信号を発生させて神経
網が学習パターンを過度に学習することを防止するの
で、学習時間を短縮して迅速化を図り、学習パターンに
対してもありのまま学習することができる。According to the method for generating an error signal for efficient learning of the neural network of the multilayer perceptron according to the first aspect of the present invention, the back propagation algorithm using the proposed error function is: If the difference between the target value of the output layer and the output value is large, a strong error signal is generated, the phenomenon that the output node is inappropriately saturated is reduced, and if the target value of the output layer is close to the output value, the signal is weak. Since an error signal is generated to prevent the neural network from excessively learning the learning pattern, the learning time can be shortened and speeded up, and the learning pattern can be learned as it is.

【図面の簡単な説明】[Brief description of the drawings]

【図１】多層パーセプトロンの神経回路網の構造を示
す図である。FIG. 1 is a diagram showing the structure of a neural network of a multilayer perceptron.

【図２】シグモイドの活性化関数を示す図である。FIG. 2 is a diagram showing an activation function of a sigmoid.

【図３】多層パーセプトロンの一般的な逆伝播の学習
方法を示すフローチャートである。FIG. 3 is a flowchart showing a general method of learning back propagation in a multilayer perceptron.

【図４】効率的な学習のための誤差信号を示す図であ
る。FIG. 4 is a diagram showing an error signal for efficient learning.

【符号の説明】[Explanation of symbols]

ＣＥ従来の誤差信号（Cnventionl Error Signal）、
ＰＥ提案誤差信号（Proposed Error Signal）。CE Conventional error signal (Cnventionl Error Signal),
PE Proposed Error Signal.

Claims

(57)【特許請求の範囲】(57) [Claims]

【請求項１】生命体の情報処理を模倣した神経回路網
モデルの一つとして、神経細胞（neuron）を意味するノ
ードと、各ノードを連結する連接部加重値が階層的に構
成されている多層パーセプトロンから誤差信号を発生さ
せる方法において、【数９】で示される前記誤差信号を与える数式に基づいて、前記
多層パーセプトロンの逆伝播学習時に出力ノードが不適
切に飽和される場合には強い誤差信号を発生させ、出力
ノードが適切に飽和される場合には弱い誤差信号を発生
させることを特徴とする多層パーセプトロンの神経回路
網の効率的な学習のための誤差信号の発生方法。1. A neural network model that simulates information processing of a living body, in which nodes representing nerve cells and weights of connection portions connecting the nodes are hierarchically configured. In a method for generating an error signal from a multilayer perceptron, A strong error signal is generated when the output node is improperly saturated during back propagation learning of the multilayer perceptron , based on the equation that gives the error signal indicated by, and when the output node is appropriately saturated. A method for generating an error signal for efficient learning of a neural network of a multilayer perceptron characterized by generating a weak error signal.