JP2877450B2

JP2877450B2 - Pattern recognition device using neural network

Info

Publication number: JP2877450B2
Application number: JP2154550A
Authority: JP
Inventors: 正典宮武
Original assignee: Sanyo Denki Co Ltd
Current assignee: Sanyo Denki Co Ltd
Priority date: 1990-06-13
Filing date: 1990-06-13
Publication date: 1999-03-31
Anticipated expiration: 2014-03-31
Also published as: JPH0445500A

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は、神経回路網（ニューラルネットワーク）を
利用して種々のパターン認識、たとえば音声パターンの
認識，画像パターンの認識等を行うためのパターン認識
装置に関する。The present invention relates to a pattern for performing various pattern recognition using a neural network (neural network), for example, voice pattern recognition, image pattern recognition, and the like. It relates to a recognition device.

〔従来の技術〕[Conventional technology]

神経回路網は生物の脳神経細胞を模したネットワーク
であり、ニューロンに対応した複数のユニットを相互に
接続し、それぞれのユニットの動作，ユニット間の接続
状態を適宜に設定することにより、入力データのパター
ン認識、たとえば音声データのパターン，画像データの
パターン等を認識する機能を発揮させることが可能にな
る。A neural network is a network that simulates the brain nerve cells of a living organism. A plurality of units corresponding to neurons are connected to each other, and the operation of each unit and the connection state between the units are appropriately set so that input data can be obtained. It is possible to exhibit a function of pattern recognition, for example, a function of recognizing a pattern of voice data, a pattern of image data, and the like.

更に、誤差逆伝播（Error Back Propagation）学習法
と称される神経回路網のための優れた学習アルゴリズム
が近年開発されたため、神経回路網を利用したパターン
認識装置の種々の分野への応用が期待されている。Furthermore, since an excellent learning algorithm for a neural network called Error Back Propagation learning method has been recently developed, application of the pattern recognition device using the neural network to various fields is expected. Have been.

第２図は従来の神経回路網を用いたパターン認識装置
の一例としての音声認識装置の構成例を示すブロック図
である。なお、この従来例の音声認識装置では、入力さ
れた認識対象の音声のパターンを６つの破裂性子音/b//
d//g//p//t//k/のいずれかに識別するように構成されて
いる。FIG. 2 is a block diagram showing a configuration example of a speech recognition apparatus as an example of a conventional pattern recognition apparatus using a neural network. In this conventional speech recognition apparatus, the input speech pattern to be recognized is converted into six bursting consonants / b //.
It is configured to identify any of d // g // p // t // k /.

第２図において、参照符号１は音声入力部であり、入
力された認識対象の音声のパターンからスペクトラム,L
PCケプストラム係数等のようなそのパターンの特徴を表
す音声パラメータを抽出し、神経回路網２へ与える。In FIG. 2, reference numeral 1 denotes a voice input unit, which generates a spectrum, L
Speech parameters representing characteristics of the pattern, such as PC cepstrum coefficients, are extracted and provided to the neural network 2.

神経回路網２は、その具体的構成は後述するが、上述
の６つの子音カテゴリのいずれに認識対象の入力子音の
パターンが含まれるかを識別し、それぞれのカテゴリに
対応して識別結果を表す出力信号を判定部３へ出力す
る。この神経回路網２の識別対象である各カテゴリに対
応した出力信号は、“0"〜“1"の範囲の値になる。The neural network 2 identifies which of the above-mentioned six consonant categories includes the pattern of the input consonant to be recognized, and a specific result thereof will be described later. An output signal is output to the determination unit 3. The output signal corresponding to each category to be identified by the neural network 2 has a value in a range from “0” to “1”.

判定部３では神経回路網２から与えられる出力信号の
値が最大を示す信号を選択し、最終的な認識結果として
出力する。The determination unit 3 selects a signal having the maximum value of the output signal given from the neural network 2 and outputs it as a final recognition result.

判定部３が出力する認識結果は、たとえば言語処理装
置等の外部装置４に与えられる。The recognition result output from the determination unit 3 is provided to an external device 4 such as a language processing device.

第３図は神経回路網２の詳細な構成を示すブロック図
であり、従来公知の典型的な３層構造が示されている。FIG. 3 is a block diagram showing the detailed configuration of the neural network 2, showing a typical three-layer structure known in the art.

第３図において、参照符号21は入力層を、22は隠れ層
を、23は出力層をそれぞれ示している。In FIG. 3, reference numeral 21 denotes an input layer, 22 denotes a hidden layer, and 23 denotes an output layer.

入力層21は複数のユニット211〜21nにて、隠れ層22は
複数のユニット221〜22mにて、出力層23は６つの子音に
それぞれ対応した６つのユニット231〜236にてそれぞれ
構成されている。The input layer 21 includes a plurality of units 211 to 21n, the hidden layer 22 includes a plurality of units 221 to 22m, and the output layer 23 includes six units 231 to 236 respectively corresponding to six consonants. .

入力層21の各ユニット211〜21nと隠れ層22の各ユニッ
ト221〜22mとの間、及び隠れ層22の各ユニット221〜22m
と出力層23の各ユニット231〜236との間はそれぞれ異な
った強さで結合されている。この各ユニット間の結合の
強さ（以下、ウェイトと称す）は学習、たとえば前述の
如き誤差逆伝播法による学習にて決定される。Between each unit 211-21n of the input layer 21 and each unit 221-22m of the hidden layer 22, and each unit 221-22m of the hidden layer 22
And the units 231 to 236 of the output layer 23 are connected with different strengths. The strength of the connection between the units (hereinafter, referred to as weight) is determined by learning, for example, learning by the error back propagation method as described above.

このような音声認識装置においては、音声入力部１へ
入力された認識対象の音声のパターンから抽出されたパ
ラメータセットのデータが入力層21の各ユニット211〜2
1nへまず入力される。ユニット211〜21nに入力されたパ
ラメータセットのデータは、入力層21の各ユニット211
〜21nと隠れ層22の各ユニット221〜22mとの間及び隠れ
層22の各ユニット221〜22mと出力層23の各ユニット231
〜236との間の結合の強さ、即ちウェイトの値に応じて
順次隠れ層22の各ユニット221〜22mから出力層23の各ユ
ニット231〜236へ情報が伝達され、識別結果として出力
される。In such a speech recognition apparatus, the data of the parameter set extracted from the pattern of the speech to be recognized input to the speech input unit 1 is stored in each unit 211 to 2 of the input layer 21.
First input to 1n. The data of the parameter set input to the units 211 to 21n are stored in each unit 211 of the input layer 21.
To 21n and each unit 221 to 22m of the hidden layer 22, and each unit 221 to 22m of the hidden layer 22 and each unit 231 of the output layer 23
The information is sequentially transmitted from each unit 221 to 22m of the hidden layer 22 to each of the units 231 to 236 of the output layer 23 according to the strength of the connection between 236 and 236, that is, the value of the weight, and is output as the identification result. .

第４図は神経回路網２の学習のための構成を示すブロ
ック図であり、第３図と同一の構成部分には同一の参照
符号を付与してある。FIG. 4 is a block diagram showing a configuration for learning the neural network 2, and the same components as those in FIG. 3 are denoted by the same reference numerals.

第４図において、参照符号51は誤差逆伝播学習制御部
であり、52は学習用音声パラメータメモリであり、53は
教師信号メモリである。In FIG. 4, reference numeral 51 denotes an error back propagation learning control unit, 52 denotes a learning voice parameter memory, and 53 denotes a teacher signal memory.

学習用音声パラメータメモリ52は、音声入力部１から
神経回路網２への入力または同等の機能を有する図示さ
れていない学習用音声パラメータ作成部にて作成された
複数の学習用音声パラメータセットを個々に格納してい
る。The learning speech parameter memory 52 individually stores a plurality of learning speech parameter sets created by a learning speech parameter creation unit (not shown) having an input from the speech input unit 1 to the neural network 2 or an equivalent function. Is stored in

また教師信号メモリ53は学習用音声パラメータメモリ
52内の各学習用音声パラメータセットに対応した教師信
号を格納している。The teacher signal memory 53 is a learning voice parameter memory.
A teacher signal corresponding to each learning voice parameter set in 52 is stored.

このような構成では、たとえば/b/を神経回路網２に
学習させる場合、誤差逆伝播学習制御部51はカテゴリ/b
/に属する音声のパラメータセットを入力層21の各ユニ
ット211〜21nへ入力し、それに対して出力層23の各ユニ
ット231〜236から得られる出力信号を読取り、教師信号
メモリ53内の対応する教師信号と比較する。In such a configuration, for example, when the neural network 2 learns / b /, the error backpropagation learning control unit 51 uses the category / b /
The parameter set of the voice belonging to / is input to each of the units 211 to 21n of the input layer 21, and the output signal obtained from each of the units 231 to 236 of the output layer 23 is read, and the corresponding teacher in the teacher signal memory 53 is read. Compare with signal.

なお教師信号とは、入力層21の各ユニット211〜21nへ
入力された認識対象のパターンに対する出力層23の各ユ
ニット231〜236からの出力信号の理想値のことであり、
識別結果となるカテゴリに対応するユニット（ここでは
/b/に対応するユニット231）が信号値“1"を、他の各ユ
ニット232〜236が信号値“0"をそれぞれ出力するように
教師信号が対応付けられる。Note that the teacher signal is an ideal value of an output signal from each of the units 231 to 236 of the output layer 23 with respect to the pattern to be recognized input to each of the units 211 to 21n of the input layer 21,
The unit corresponding to the category of the identification result (here,
The teacher signal is associated such that the unit 231 corresponding to / b / outputs the signal value “1” and the other units 232 to 236 output the signal value “0”.

誤差逆伝播学習制御部51は、教師信号と出力層23の各
ユニット231〜236の出力信号の値との誤差が最小となる
ように、公知の学習方法である誤差逆伝播学習法により
各ユニット間のウェイト値を変更し最適化する。The error back-propagation learning control unit 51 performs the error back-propagation learning method, which is a known learning method, so that the error between the teacher signal and the output signal value of each of the units 231 to 236 of the output layer 23 is minimized. Change and optimize the weight value between.

以上のような手順を神経回路網２の認識対象の全ての
カテゴリ（この例では/b//d//g//p//t//k/の６つ）に対
して行うことにより、神経回路網２の学習を反復する
が、神経回路網２に高度の識別能力を発揮させるには相
当回数の反復学習が必要である。By performing the above-described procedure for all categories (6 in this example, / b // d // g // p // t // k /) to be recognized by the neural network 2, The learning of the neural network 2 is repeated, but a considerable number of iterative learning is required for the neural network 2 to exhibit a high degree of discrimination ability.

〔発明が解決しようとする課題〕[Problems to be solved by the invention]

以上のように、従来の神経回路網を用いたパターン認
識装置の一例である音声認識装置においては、教師信号
の値は“0"または“1"のいずれかの値をとる２値信号で
ある。即ち、神経回路網２は学習過程において認識対象
である音声のパターンのカテゴリに対応する出力を“1"
にそれ以外の出力“0"にするように学習する。As described above, in the speech recognition device that is an example of the pattern recognition device using the conventional neural network, the value of the teacher signal is a binary signal that takes either “0” or “1”. . That is, the neural network 2 outputs “1” corresponding to the category of the voice pattern to be recognized in the learning process.
To make the other output “0”.

従って、従来の神経回路網を用いたパターン認識装置
ではパターン認識に際して、正しく認識が行われた場合
には認識対象のパターンが含まれると識別されたカテゴ
リに対応する出力層23のユニット231（又は232〜236）
の出力信号の値が“1"に近い値に、他のユニットの出力
信号の値が“0"に近い値になる。換言すれば、このよう
な神経回路網２からは認識対象のパターンと各カテゴリ
との間の類似性あるいは距離等のような統計学的なデー
タは得られない。このため、誤認識が生じた場合には、
第２位以下の認識結果を得ることは出来ないので、外部
装置４においては誤りを修正することは困難であり、神
経回路網２の高度の認識能力を充分に活用することは出
来ないという問題がある。Therefore, in a conventional pattern recognition device using a neural network, in performing pattern recognition, if the recognition is correctly performed, the unit 231 (or the output layer 23) corresponding to the category identified as including the pattern to be recognized is included. 232-236)
Becomes closer to “1”, and the output signals of other units become closer to “0”. In other words, the neural network 2 cannot obtain statistical data such as similarity or distance between the pattern to be recognized and each category. Therefore, if misrecognition occurs,
Since it is impossible to obtain a recognition result of the second or lower rank, it is difficult to correct an error in the external device 4, and the advanced recognition ability of the neural network 2 cannot be fully utilized. There is.

本発明はこのような事情に鑑みてなされたものであ
り、第１位の認識結果のみならず第２位の認識結果も、
更に必要であれば第３位以降の各順位の認識結果をも容
易に得られる神経回路網を用いたパターン認識装置の提
供を目的とする。The present invention has been made in view of such circumstances, and not only the first recognition result but also the second recognition result,
It is still another object of the present invention to provide a pattern recognition apparatus using a neural network which can easily obtain a recognition result of each of the third and subsequent ranks if necessary.

〔課題を解決するための手段〕[Means for solving the problem]

本発明の神経回路網を用いたパターン認識装置は、第
１の発明では、認識対象の入力パターンを識別対象の全
てのカテゴリを対象として識別する神経回路網にて構成
された第１の識別手段と、この第１の識別手段が識別対
象する全カテゴリの内の一部のカテゴリを対象として入
力パターンを識別する神経回路網にて構成された第２の
識別手段とを備え、両識別手段の識別結果を統合して最
終的な認識結果を出力するように構成されている。According to the first aspect of the present invention, the pattern recognition apparatus using the neural network according to the first aspect is a first identification unit configured with a neural network that identifies an input pattern to be recognized for all categories to be identified. And a second identification unit composed of a neural network for identifying an input pattern for some of the categories to be identified by the first identification unit. It is configured to integrate the identification results and output a final recognition result.

また第２の発明では、第１の識別手段により識別され
たカテゴリを第１位の認識結果とし、第２の識別手段は
第１位の認識結果とされたカテゴリ以外のカテゴリを識
別対象とする神経回路網により識別を行い、その識別結
果を第２位の認識結果とする構成を採っている。In the second invention, the category identified by the first identification unit is set as the first recognition result, and the second identification unit sets categories other than the category determined as the first recognition result as identification targets. The recognition is performed by a neural network, and the result of the recognition is used as the second recognition result.

第３の発明では、第３位以下の認識結果として、順次
第ｋ−１位の認識結果とされたカテゴリを除外した残り
のカテゴリを識別対象とする神経回路網の識別結果を第
ｋ位の認識結果とする構成を採っている。In the third invention, as the recognition results of the third and lower ranks, the classification results of the neural network, which is the classification target of the remaining categories excluding the categories that have been sequentially determined to be the k-1th rank recognition results, are the kth rank. It adopts the configuration of the recognition result.

第４の発明では、第２の識別手段の神経回路網が識別
対象とするカテゴリを第１位の認識結果のカテゴリに対
応して予め定められたカテゴリを除いたカテゴリとする
構成を採っている。In the fourth invention, a configuration is adopted in which the category to be identified by the neural network of the second identification means is a category excluding a category predetermined in accordance with the category of the first-ranked recognition result. .

第５の発明では、第３位以下の認識結果を求める際に
も、第ｋ−１位までの認識結果である各カテゴリに対応
して予め定められたカテゴリを除外したカテゴリを第２
の識別手段の神経回路網が識別対象として第ｋ位の認識
結果を得るように構成している。According to the fifth aspect, even when the recognition result of the third or lower rank is obtained, the category excluding a predetermined category corresponding to each category which is the recognition result of the k-1st rank is set to the second rank.
Is configured to obtain a k-th recognition result as an identification target.

第６の発明では、第ｋ位の認識結果を得る際に第ｋ−
１位の各識別結果のカテゴリに対応して除外されるカテ
ゴリが必ず１個は存在するように構成されている。In the sixth invention, when the k-th recognition result is obtained, the k-th
It is configured such that there is always one category to be excluded corresponding to the category of each identification result of the first place.

第７の発明では、第ｋ位の認識結果を得る際に第ｋ−
１位の各識別結果のカテゴリに対応して除外されるカテ
ゴリが第１位から第ｋ−１位までの各カテゴリの組合わ
せに対応して決定されるように構成されている。In the seventh invention, the k-th recognition result is obtained when the k-th recognition result is obtained.
The category to be excluded corresponding to the category of each identification result of the first place is determined according to the combination of each category from the first place to the (k-1) th place.

〔作用〕[Action]

本発明の神経回路網を用いたパターン認識装置では、
第１の発明によれば、認識対象の全カテゴリを対象とし
て第１の識別手段により得られる識別結果と、識別対象
の全カテゴリの一部を対象として第２の識別手段により
得られる識別結果とを統合して認識結果が得られ、認識
不可能な状態は生じない。In the pattern recognition device using the neural network of the present invention,
According to the first aspect, the identification result obtained by the first identification unit for all categories of the recognition target and the identification result obtained by the second identification unit for a part of all categories of the identification target are Are integrated to obtain a recognition result, and an unrecognizable state does not occur.

第２の発明によれば、第１位の認識結果のカテゴリ以
外のカテゴリから第２位の認識結果が得られる。According to the second aspect, the second-ranked recognition result can be obtained from a category other than the category of the first-ranked recognition result.

第３の発明によれば、必要に応じて順次第３位以下の
認識結果が得られる。According to the third invention, recognition results of the third and lower ranks are sequentially obtained as necessary.

第４の発明によれば、第２位の認識結果は第１位の認
識結果に応じて予め定めてあるカテゴリのみから得ら
れ、余分な回路構成及びデータ処理を削減することが出
来る。According to the fourth aspect, the second-ranked recognition result is obtained only from the category determined in advance in accordance with the first-ranked recognition result, and unnecessary circuit configuration and data processing can be reduced.

第５の発明によれば、第３位以下の認識結果について
もそれぞれの１位上の認識結果に応じて予め定めてある
カテゴリのみから得られ、余分な回路構成及びデータ処
理を削減することが出来る。According to the fifth aspect, the recognition result of the third and lower ranks can be obtained only from the predetermined category according to the recognition result of the first rank, thereby reducing unnecessary circuit configuration and data processing. I can do it.

第６の発明によれば、第ｋ位の認識結果を得る際に第
ｋ−１位の認識結果を得る場合と同一の処理が行われる
虞がない。According to the sixth aspect, there is no possibility that the same processing as in the case of obtaining the (k−1) th recognition result is performed when obtaining the kth recognition result.

第７の発明によれば、第ｋ位の認識結果を得る際に、
第ｋ−１位までの各認識結果のカテゴリの組合わせに応
じて、神経回路網が識別対象とするカテゴリが予め決定
されるので、余分な回路構成及びデータ処理を削減する
ことが出来る。According to the seventh aspect, when obtaining the k-th recognition result,
Since the category to be identified by the neural network is determined in advance according to the combination of the categories of the respective recognition results up to the (k-1) th order, it is possible to reduce unnecessary circuit configuration and data processing.

〔実施例〕〔Example〕

以下、本発明をその実施例を示す図面を参照して詳述
する。Hereinafter, the present invention will be described in detail with reference to the drawings showing the embodiments.

第１図は本発明の神経回路網を用いたパターン認識装
置の一実施例としての音声認識装置の一構成例を示すブ
ロック図である。なお、この実施例の音声認識装置で
は、入力された音声のパターンを６つの破裂性子音のカ
テゴリ/b//d//g//p//t//k/にて構成される集合Ｃのいず
れのカテゴリであるかを識別するように構成されてい
る。FIG. 1 is a block diagram showing a configuration example of a speech recognition apparatus as one embodiment of a pattern recognition apparatus using a neural network according to the present invention. Note that, in the speech recognition apparatus of this embodiment, the input speech pattern is converted into a set C composed of six bursting consonant categories / b // d // g // p // t // k /. It is configured to identify which category the category is.

第１図において、参照符号１は音声入力部であり、入
力された認識対象の音声のパターンからスペクトラム,L
PCケプストラム係数等のようなそのパターンの特徴を表
す音声パラメータセットを抽出し、第１位候補識別用神
経回路網２及び第２位候補識別用神経回路網61〜66へ与
える。In FIG. 1, reference numeral 1 denotes a voice input unit, which generates a spectrum, L
A speech parameter set representing a feature of the pattern, such as a PC cepstrum coefficient, is extracted and provided to the first candidate identification neural network 2 and the second candidate identification neural networks 61 to 66.

第１位候補識別用神経回路網２は上述の如く、集合Ｃ
を構成する６つの子音カテゴリのいずれに認識対象の入
力子音のパターンが含まれるかを識別し、それぞれのカ
テゴリに対応して識別結果を表す出力信号を判定部３へ
出力する。この第１位候補識別用神経回路網２の識別対
象である各カテゴリに対応した出力信号は、“0"〜“1"
の範囲の値になる。As described above, the neural network 2 for the first-ranked candidate identifies the set C
Of the six consonant categories constituting the input consonant pattern to be recognized are identified, and an output signal representing the identification result corresponding to each category is output to the determination unit 3. Output signals corresponding to each category to be identified by the first candidate identification neural network 2 are “0” to “1”.
Value in the range.

また各第２位候補識別用神経回路網61〜66は、たとえ
ば集合Ｃを構成する６つの子音/b//d//g//p//t//k/から
それぞれ一つを除いた他の５つの子音にて構成される部
分集合Ciを識別するように構成されている。具体的に
は、第２位候補識別用神経回路網61は/b/以外の、同62
は/d/以外の、同63は/g/以外の、同64は/p/以外の、同6
5は/t/以外の、同66は/k/以外のそれぞれ５つの子音を
識別する。In addition, each of the second-ranking candidate identification neural networks 61 to 66 is obtained by removing one of the six consonants / b // d // g // p // t // k / constituting the set C, for example. It is configured to identify a subset Ci composed of the other five consonants. Specifically, the neural network 61 for the second-rank candidate identification is the same as the neural network 61 other than / b /.
Is other than / d /, 63 is other than / g /, 64 is other than / p /, 6
5 identifies five consonants other than / t /, and 66 identifies five consonants other than / k /.

判定部３では第１位候補識別用神経回路網２及び各第
２位候補識別用神経回路網61〜66から与えられる出力信
号の結果を統合して最終的な認識結果を外部装置４へ出
力する。The determination unit 3 integrates the output signal results given from the first candidate neural network 2 for identification and the neural networks 61 to 66 for the second candidate identification and outputs the final recognition result to the external device 4. I do.

なお、第１位候補識別用神経回路網２及び各第２位候
補識別用神経回路網61〜66の詳細な構成及びその学習の
ための構成は前述の第３図に示されている従来例の神経
回路網２と基本的に同様であるので、ここでは省略す
る。The detailed configuration of the first candidate neural network for identification 2 and each of the second neural networks 61 to 66 and the configuration for learning the same are shown in FIG. This is basically the same as the neural network 2 of FIG.

このような本発明の神経回路網を用いたパターン認識
装置の一実施例である音声認識装置では、認識対象の音
声のパターンが音声入力部１へ入力されるとその音声の
パターンの特徴を表す音声パラメータセットが抽出さ
れ、第１位候補識別用神経回路網２及び各第２位候補識
別用神経回路網61〜66へ与えられる。In such a speech recognition apparatus as an embodiment of the pattern recognition apparatus using the neural network of the present invention, when a speech pattern to be recognized is input to the speech input unit 1, the feature of the speech pattern is represented. A voice parameter set is extracted and provided to the first candidate identification neural network 2 and each of the second candidate identification neural networks 61 to 66.

第１位候補識別用神経回路網２では６つのカテゴリ/b
//d//g//p//t//k/の内のいずれかが前述の従来の神経回
路網２と同様にして選択されて識別結果として判定部３
へ与えられる。そして、たとえば第１位候補識別用神経
回路網２が/b/を第１位の認識結果として判定部３へ出
力したとすると、判定部３はそれに対応する信号を第１
位の認識結果として外部装置４へ出力すると共に、第１
位の認識結果である/b/を除いた他の５つの子音を識別
する第２位候補識別用神経回路網61の識別結果、たとえ
ば/p/であったとするとそれに応じた信号を第２位の識
別結果として外部装置４へ出力する。In the first candidate candidate neural network 2, six categories / b
// Any one of d // g // p // t // k / is selected in the same manner as the above-described conventional neural network 2 and the determination unit 3
Given to. For example, if the first-ranking candidate identification neural network 2 outputs / b / to the determining unit 3 as the first-ranking recognition result, the determining unit 3 outputs a signal corresponding thereto to the first
Output to the external device 4 as the position recognition result,
If the recognition result of the second candidate identification neural network 61 for identifying the other five consonants excluding / b /, which is the recognition result of the position, for example, is / p /, a signal corresponding to the result is given as the second position. Is output to the external device 4 as a result of the identification.

以上により、音声入力部１へ入力された音声パターン
の認識結果としては、第１位の認識結果が/b/、第２位
の認識結果が/p/として外部装置４へ出力される。As described above, as the recognition result of the voice pattern input to the voice input unit 1, the first recognition result is output to the external device 4 as / b /, and the second recognition result is / p /.

同様に、たとえば第１図に示されている第２位候補識
別用神経回路網61〜66の他に、６つの子音の内の４つ、
たとえば/b//p/以外の４つの子音を識別するための神経
回路網を用いれば、第３位の認識結果を得ることが可能
になる。以下、順次第４位，第５位…というように更に
下位の識別結果を順次得ることが可能である。Similarly, for example, in addition to the second candidate identification neural networks 61 to 66 shown in FIG.
For example, if a neural network for identifying four consonants other than / b // p / is used, a third-ranked recognition result can be obtained. Hereinafter, it is possible to sequentially obtain lower-order identification results such as the fourth place, the fifth place, and so on.

ところで、上述の例では第２位の認識結果を得るため
に各１つの子音を除く５つの子音を対象とする第２位候
補識別用神経回路網61〜66を用いているが、たとえば第
１位の認識結果が/b/である場合には第２位の認識結果
としては/b/と同じ有声破裂音である/d/または/g/のい
ずれかが得られればよいという考え方もある。このよう
な考え方を採る場合には、第１位の認識結果が/b/であ
れば６つの子音の内の/d/と/g/の二つのみを認識対象と
する神経回路網を第２位候補識別用神経回路網として用
いてもよく、この場合にはその神経回路網の構成を簡易
にすることが出来る。By the way, in the above-mentioned example, the second-ranking candidate identification neural networks 61 to 66 for five consonants except one consonant are used in order to obtain the second-ranking recognition result. There is also an idea that if the recognition result of the position is / b /, then the same voiced plosive as / b / or / g / should be obtained as the second recognition result. . If this concept is adopted, if the first-ranked recognition result is / b /, a neural network that recognizes only two of the six consonants, / d / and / g /, will be used. It may be used as a second candidate identification neural network. In this case, the configuration of the neural network can be simplified.

更に、一つの子音、たとえば/g/に対して破裂性であ
る場合と鼻音性である場合との二つのカテゴリに分類可
能であるような場合には、たとえば第１位の認識結果が
破裂性の/g/であれば第２位の認識結果は破裂性の/g/と
鼻音性の/g/とを除外した残りを認識対象とする神経回
路網を使用してもよい。Further, in the case where a single consonant can be classified into two categories, i.e., a case in which it is bursty and a case in which it is nasal, for example, if the recognition result of the first place is bursty, If / g /, the second-ranked recognition result may use a neural network that recognizes the remainder excluding bursty / g / and nasal / g /.

なお、上述の説明では一部の子音の認識を例として音
声認識に本発明を適用した場合について記述たが、たと
えば日本語の全要素であるいは日本語以外の言語の音素
を対象とすることも勿論可能であり、更には単語のよう
にカテゴリに分類可能な単位での音声認識にも本発明は
適用可能である。更にまた、音声認識のみならず、たと
えば画像認識等のように入力データがパターン化される
種々の認識対象にも本発明を適用することが可能である
ことも言うまでもない。In the above description, the case where the present invention is applied to speech recognition using recognition of some consonants as an example has been described. However, for example, it is possible to cover all elements in Japanese or phonemes in languages other than Japanese. Of course, it is possible, and the present invention is also applicable to voice recognition in units that can be classified into categories such as words. Furthermore, it goes without saying that the present invention can be applied not only to voice recognition but also to various recognition targets in which input data is patterned, such as image recognition.

〔発明の効果〕〔The invention's effect〕

以上に詳述した如く本発明の神経回路網を用いたパタ
ーン認識装置によれば、従来は第１位の認識結果のみし
か得られなかったにも拘わらず、第２位の認識結果を、
また更に必要に応じてそれ以下の各順位の認識結果を得
ることが可能になる。このため、第１位の認識結果が誤
認識であった場合に第２位の認識結果あるいはそれ以下
の各認識結果を採用して誤認識の修正が可能になり、神
経回路網の高度な認識能力を従来以上に活用することが
可能になる。As described in detail above, according to the pattern recognition apparatus using the neural network of the present invention, despite the fact that only the first recognition result can be obtained conventionally, the second recognition result is
Further, it is possible to obtain a recognition result of each rank lower than that as needed. For this reason, when the recognition result of the first place is erroneous recognition, the recognition result of the second place or each of the lower recognition results can be adopted to correct the erroneous recognition. It is possible to utilize the ability more than before.

また、第２位以下の各順位の認識結果を得る際には、
それまでに得られている各順位のカテゴリに応じて予め
定められているカテゴリ、あるいは各順位のカテゴリの
組合わせに応じて予め定められているカテゴリを識別対
象として神経回路網にて識別が行われる構成を採る場合
には、神経回路網の構成及び処理時間の削減が可能にな
る。Also, when obtaining the recognition result of each rank below the second place,
The classification is performed by the neural network using the category determined in advance according to the category of each rank obtained up to that time or the category predetermined according to the combination of the category of each rank. When the configuration is adopted, the configuration of the neural network and the processing time can be reduced.

【図面の簡単な説明】[Brief description of the drawings]

第１図は本発明の神経回路網を用いたパターン認識装置
の構成を示すブロック図、第２図は従来の神経回路網を
用いたパターン認識装置の構成を示すブロック図、第３
図はその神経回路網の詳細な構成を示すブロック図、第
４図は神経回路網の学習のための構成を示すブロック図
である。１……音声入力部、２……第１位候補識別用神経回路
網、３……判定部、61〜66……第２位候補識別用神経回
路網FIG. 1 is a block diagram showing a configuration of a pattern recognition device using a neural network according to the present invention. FIG. 2 is a block diagram showing a configuration of a pattern recognition device using a conventional neural network.
FIG. 4 is a block diagram showing a detailed configuration of the neural network, and FIG. 4 is a block diagram showing a configuration for learning the neural network. 1. Voice input unit 2. Neural network for first candidate identification 3. Neural unit 61 to 66. Neural network for second candidate identification

───────────────────────────────────────────────────── フロントページの続き (58)調査した分野(Int.Cl.⁶，ＤＢ名) G10L 9/10 301 G06F 15/18 560 G06F 15/70 465 G06K 9/66 ＪＩＣＳＴファイル（ＪＯＩＳ)──────────────────────────────────────────────────の Continued on the front page (58) Fields surveyed (Int.Cl. ⁶ , DB name) G10L 9/10 301 G06F 15/18 560 G06F 15/70 465 G06K 9/66 JICST file (JOIS)

Claims

(57)【特許請求の範囲】(57) [Claims]

【請求項１】認識対象のパターンからその特徴を表すパ
ラメータを抽出する入力部と、該入力部により抽出されたパラメータに基づいて、前記
認識対象のパターンがＮ個のカテゴリにて構成される集
合Ｃのいずれのカテゴリに含まれるかを識別する神経回
路網にて構成される第１の識別手段と、前記入力部により変換されたパラメータに基づいて、前
記認識対象のパターンが前記集合Ｃの前記Ｎ個より少な
いカテゴリにて構成される集合Ｃの部分集合Ci（ｉ＝1,
2…Ｉ、Ｉは１以上Ｎ以下の整数）それぞれにおいてい
ずれのカテゴリに含まれるかを識別するＩ個の神経回路
網にて構成される第２の識別手段と、前記第１及び第２の識別手段による識別結果を統合して
最終の認識結果として出力する判定部とを備えたことを特徴とする神経回路網を用いたパターン
認識装置。An input unit for extracting a parameter representing the feature from a pattern to be recognized, and a set in which the pattern to be recognized is composed of N categories based on the parameters extracted by the input unit. A first identification unit configured by a neural network for identifying which category of the set C is included; and the pattern of the recognition target is defined by the set C in the set C based on a parameter converted by the input unit. Subset Ci of set C composed of less than N categories (i = 1,
2 ... I, I is an integer of 1 or more and N or less), a second identification means composed of I neural networks for identifying which category is included in each of the first and second, A determination unit that integrates the identification results obtained by the identification means and outputs the result as the final recognition result.

【請求項２】前記判定部は、前記第１の識別手段により
前記認識対象のパターンが含まれると識別されたカテゴ
リを第１位の認識結果とし、前記第１の識別手段におい
て第１位に判定されたカテゴリを除くＮ−１個のカテゴ
リを識別対象とする前記第２の識別手段内の各神経回路
網により識別されたカテゴリを第２位の認識結果とする
ことを特徴とする請求項１に記載の神経回路網を用いた
パターン認識装置。2. The method according to claim 1, wherein the determining unit sets a category identified by the first identification unit as including the pattern to be recognized as a first-order recognition result, and the first identification unit ranks the first-ranked result. The category identified by each of the neural networks in the second identification means, wherein N-1 categories other than the determined category are to be identified, is set as a second recognition result. A pattern recognition device using the neural network according to claim 1.

【請求項３】前記判定部は、第１位から第ｋ−１位（ｋ
≧３）までの認識結果である各カテゴリを除くカテゴリ
を識別対象とする前記第２の識別手段内の各神経回路網
により識別されたカテゴリを第ｋ位の認識結果とするこ
とを特徴とする請求項２に記載の神経回路網を用いたパ
ターン認識装置。3. The method according to claim 1, wherein the determining unit determines a first to (k-1) th order (k
A category identified by each neural network in the second identification means, which is a category other than each category which is a recognition result up to ≧ 3), is set as a k-th recognition result. A pattern recognition device using the neural network according to claim 2.

【請求項４】前記判定部は、前記第１の識別手段により
前記認識対象のパターンが含まれると識別されたカテゴ
リを第１位の認識結果とし、前記第１の識別手段におい
て第１位に判定されたカテゴリに対応して予め定められ
たカテゴリを除くカテゴリを識別対象とする前記第２の
識別手段内の各神経回路網により識別されたカテゴリを
第２位の認識結果とすることを特徴とする請求項１に記
載の神経回路網を用いたパターン認識装置。4. The determination section sets a category identified by the first identification means as including the pattern to be recognized as a first-order recognition result, and the first identification means ranks the category as a first-order recognition result. A category identified by each neural network in the second identification means for identifying a category other than a predetermined category corresponding to the determined category is set as a second recognition result. A pattern recognition device using the neural network according to claim 1.

【請求項５】前記判定部は、第１位から第ｋ−１位（ｋ
≧３）までの認識結果である各カテゴリに対応して予め
定められたカテゴリを除くカテゴリを識別対象とする前
記第２の識別手段内の各神経回路網により識別されたカ
テゴリを第ｋ位の認識結果とすることを特徴とする請求
項４に記載の神経回路網を用いたパターン認識装置。5. The method according to claim 1, wherein the determining unit determines that the first to (k-1) th (k
The category identified by each neural network in the second identification means, which is a category other than a predetermined category corresponding to each category as a recognition result up to ≧ 3), is identified as a k-th category. The pattern recognition device using a neural network according to claim 4, wherein the pattern recognition device obtains a recognition result.

【請求項６】前記第１位から第ｋ−１位までの認識結果
である各カテゴリに対応して予め定められたカテゴリ
は、前記各カテゴリに対して少なくとも１つ以上存在す
ることを特徴とする請求項５に記載の神経回路網を用い
たパターン認識装置。6. A method according to claim 1, wherein at least one or more categories predetermined for each of the categories which are the recognition results from the first place to the (k-1) th place are present for each category. A pattern recognition device using the neural network according to claim 5.

【請求項７】前記第１位から第ｋ−１位までの認識結果
である各カテゴリに対応して予め定められたカテゴリ
は、前記各カテゴリの組合わせに対して予め定められる
ことを特徴とする請求項５に記載の神経回路網を用いた
パターン認識装置。7. A category determined in advance corresponding to each category which is a recognition result from the first place to the (k-1) th place is predetermined for a combination of each category. A pattern recognition device using the neural network according to claim 5.