JPS5811998A

JPS5811998A - Voice recognizer

Info

Publication number: JPS5811998A
Application number: JP10926581A
Authority: JP
Inventors: 能勢　勇; 水野　金儀
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1981-07-15
Filing date: 1981-07-15
Publication date: 1983-01-22
Also published as: JPS6332400B2

Abstract

(57) [Abstract] This publication contains application data from before electronic filing, so no abstract data is recorded.

Description

[Detailed Description of the Invention] The present invention relates to a weighted dissimilarity calculation that can improve the recognition rate of a speech recognition device.

FIG. 1 shows a block diagram of a conventional speech recognition device. In FIG. 1, 11 is an input terminal, 12 is a frequency analysis section, 13 is a speech section detection section, 14 is the start-point detection signal of the speech section, 15 is the end-point detection signal of the speech section, 16 is a spectrum conversion section, 17 is a dissimilarity calculation section, and 18 is a determination section. Each section is described below.

The frequency analysis section 12 is configured as shown in FIG. 2.

The input speech signal 21 is amplified by a preamplifier 22 and analyzed by a bandpass filter group 23-1, 23-2, ..., 23-n, whose center frequencies are set at logarithmically equal intervals between approximately 200 Hz and 6000 Hz, a full-wave rectifier group 24-1, 24-2, ..., 24-n, and a low-pass filter group 25-1, 25-2, ..., 25-n. The result passes through a multiplexer 26, is quantized by an analog-to-digital converter 27 at preset time intervals (hereinafter, the sample period), and is output to an output terminal 29 through a logarithmic converter 28.

The results of the analysis by the frequency analysis section 12 are sent to the speech section detection section 13 and to the spectrum conversion section 16.

The speech section detection section 13 detects the start point and end point of the speech section and sends the start-point detection signal 14 and the end-point detection signal 15 to the dissimilarity calculation section. One simple detection method computes, for each sample period, the average of the n analysis data from the frequency analysis section 12, takes the time at which this average first exceeds a preset threshold as the start point, and takes the time at which it last falls to or below the threshold as the end point.
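The simple threshold method described above can be sketched as follows. This is a minimal illustration, not the patent's circuit: the function name `detect_speech_section`, the list-of-frames input layout, and the convention that the returned end index is exclusive are all assumptions made for the example.

```python
from typing import List, Optional, Tuple


def detect_speech_section(
    frames: List[List[float]], threshold: float
) -> Optional[Tuple[int, int]]:
    """Return (start, end) frame indices of the speech section, or None.

    frames[t] holds the n filter-bank outputs for sample period t.
    start is the first frame whose channel average exceeds the threshold.
    end is the frame right after the last above-threshold frame, i.e. the
    point where the average finally falls to or below the threshold
    (assumed convention: end is exclusive).
    """
    averages = [sum(frame) / len(frame) for frame in frames]
    above = [t for t, avg in enumerate(averages) if avg > threshold]
    if not above:
        return None  # threshold never exceeded: no speech detected
    return above[0], above[-1] + 1


if __name__ == "__main__":
    demo = [[0, 0], [1, 1], [5, 7], [6, 6], [0, 2], [8, 8], [1, 1], [0, 0]]
    print(detect_speech_section(demo, 2.0))
```

Note that a short dip below the threshold inside the utterance (frame 4 in the demo data) does not end the section, because the end point is tied to the last crossing rather than the first.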

To normalize the speaker-dependent sound source characteristics and power, the spectrum conversion section 16 uses the method published in the paper "A method of word speech recognition using nonlinear spectral matching" by 小原 et al. (Institute of Electronics and Communication Engineers of Japan, Technical Research Report PRL 79-46). The calculation method is explained first.

Let the n data analyzed at a given time by the frequency analysis section 12 be x_i (i = 1 to n). The spectrum conversion data z_i are then given by equation (1):

$$z_i = x_i - (A i + B) \tag{1}$$

In equation (1), A and B are the slope and intercept of the least-squares line fitted to x_i (i = 1 to n), and are obtained from equations (2) and (3), respectively. [Equations (2) to (5) are not reproduced in this text.]

Since Σi and Σi² are constants, the denominators of equations (2) and (3) are also constants. As is clear from equations (4) and (5), the values of A and B can therefore be obtained from the input data, and the spectrum conversion data z_i (i = 1 to n) can then be obtained from equation (1).
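For reference, the textbook closed form of a least-squares line through the points (i, x_i), i = 1 to n, is written out below. It is the standard result and is offered only as an aid to reading; it is not a transcription of the patent's equations (2) to (5), whose exact arrangement is not available here.

```latex
\begin{align}
A &= \frac{n\sum_{i=1}^{n} i\,x_i - \sum_{i=1}^{n} i \,\sum_{i=1}^{n} x_i}
          {n\sum_{i=1}^{n} i^2 - \left(\sum_{i=1}^{n} i\right)^2}, \\
B &= \frac{\sum_{i=1}^{n} i^2 \,\sum_{i=1}^{n} x_i - \sum_{i=1}^{n} i \,\sum_{i=1}^{n} i\,x_i}
          {n\sum_{i=1}^{n} i^2 - \left(\sum_{i=1}^{n} i\right)^2}, \\
\sum_{i=1}^{n} i &= \frac{n(n+1)}{2}, \qquad
\sum_{i=1}^{n} i^2 = \frac{n(n+1)(2n+1)}{6}, \qquad
n\sum_{i=1}^{n} i^2 - \left(\sum_{i=1}^{n} i\right)^2 = \frac{n^2(n^2-1)}{12}.
\end{align}
```

Once n is fixed, the denominator and the sums over i and i² are constants, so only the two sums Σ i·x_i and Σ x_i have to be computed from the input data.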

FIG. 3 shows a block diagram of the spectrum conversion section 16, which is described below with reference to the figure.

The input data x_i (i = 1 to n) entering from input terminal 31 are multiplied in multiplier 33 by the index i generated by counter 32, which counts in synchronization with the input data. Adder 34 and register 35 then accumulate the products i·x_i, so that Σ i·x_i is obtained in register 35. In the same way, adder 36 and register 37 yield Σ x_i.

Next, multiplexers 38 and 39 each select their respective values, and the value of A obtained through multiplier 40 is set in register 43. Similarly, multiplexers 38 and 39 are made to select C1 and C2 respectively, and the result obtained using multipliers 40 and 41 and subtractor-divider 44, namely the value of B, is set in register 45.

Next, counter 46 generates i, multiplier 47 computes A·i, and adder 48 then computes A·i + B. Subtractor 50 subtracts the A·i + B obtained by adder 48 from the input data x_i delayed by delay circuit 49, and the spectrum conversion data z_i are output to the output terminal.
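The two-pass data flow of FIG. 3 (accumulate the sums, derive A and B, then subtract the fitted line from the delayed input) can be sketched in software as below. This is a minimal illustration under stated assumptions: the function name `spectrum_convert` is hypothetical, and A and B are computed with the standard least-squares closed form rather than the patent's own equations (2) to (5), which are not available in this text.

```python
from typing import List


def spectrum_convert(x: List[float]) -> List[float]:
    """Remove the least-squares line A*i + B from x_1..x_n (equation (1))."""
    n = len(x)
    if n < 2:
        raise ValueError("need at least two channels to fit a line")

    # First pass: the two running sums that depend on the input data
    # (the roles played by registers 35 and 37 in the description).
    sum_ix = 0.0  # accumulates i * x_i
    sum_x = 0.0   # accumulates x_i
    for i, xi in enumerate(x, start=1):
        sum_ix += i * xi
        sum_x += xi

    # Constants that depend only on n.
    sum_i = n * (n + 1) / 2
    sum_i2 = n * (n + 1) * (2 * n + 1) / 6
    denom = n * sum_i2 - sum_i ** 2

    a = (n * sum_ix - sum_i * sum_x) / denom       # slope A
    b = (sum_i2 * sum_x - sum_i * sum_ix) / denom  # intercept B

    # Second pass: subtract the fitted line from the (delayed) input.
    return [xi - (a * i + b) for i, xi in enumerate(x, start=1)]


if __name__ == "__main__":
    print(spectrum_convert([1.0, 2.0, 1.0, 2.0]))
```

Because the fitted line absorbs both the overall level and the overall tilt of the log spectrum, the output keeps only the peaks and valleys as positive and negative values, which is the property the weighting described later relies on.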

Next, the configuration of the dissimilarity calculation section 17 is shown in FIG. 4 and described below with reference to the figure. In FIG. 4, 101 is the start-point detection signal of the speech section, 102 is the end-point detection signal of the speech section, 103 is the input data from the spectrum conversion section 16, 104 is an input memory control circuit, 105 is an input memory, 106 is a standard pattern memory control circuit, 107 is a standard pattern memory, 108 is an absolute difference calculation circuit, 109 is an adder, and 110 is a register.

From the time the start-point detection signal 101 is generated until the end-point detection signal 102 is generated, the input data 103 are stored in the input memory 105 by the input memory control circuit 104. When storage of the input data 103 for the speech section is complete, the dissimilarity between the contents of the input memory 105 and each desired standard pattern, analyzed in advance and stored in the standard pattern memory 107, is calculated in turn.

For the dissimilarity calculation, a method that uses dynamic programming to put the input data and the standard pattern into nonlinear correspondence is in common use. To simplify the explanation, however, the description below uses linear correspondence. The present invention is clearly applicable to nonlinear correspondence as well.
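For readers unfamiliar with the dynamic-programming approach mentioned above, the following is a minimal sketch of one common form of it (basic dynamic time warping with a sum-of-absolute-differences local cost). The patent does not specify its DP recurrence, so the step pattern and the function name `dtw_dissimilarity` are assumptions chosen for illustration only.

```python
from typing import List

Pattern = List[List[float]]  # pattern[l][i]: frame l, channel i


def dtw_dissimilarity(p: Pattern, q: Pattern) -> float:
    """Dissimilarity under a nonlinear time correspondence (basic DTW)."""
    inf = float("inf")
    rows, cols = len(p), len(q)
    # cost[a][b]: best accumulated cost aligning p[:a] with q[:b].
    cost = [[inf] * (cols + 1) for _ in range(rows + 1)]
    cost[0][0] = 0.0
    for a in range(1, rows + 1):
        for b in range(1, cols + 1):
            # Local cost: sum of absolute channel differences of two frames.
            local = sum(abs(u - v) for u, v in zip(p[a - 1], q[b - 1]))
            cost[a][b] = local + min(
                cost[a - 1][b],      # frame of q is reused
                cost[a][b - 1],      # frame of p is reused
                cost[a - 1][b - 1],  # both advance
            )
    return cost[rows][cols]


if __name__ == "__main__":
    print(dtw_dissimilarity([[1.0], [2.0], [3.0]], [[1.0], [1.0], [2.0], [3.0]]))
```

A per-element weight could be folded into the local cost in the same way as in the linear case, which is consistent with the statement that the invention also applies to nonlinear correspondence.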

The corresponding elements of the input data and of the standard pattern are read out through the input memory control circuit 104 and the standard pattern memory control circuit 106, and the absolute difference calculation circuit 108 computes the absolute value of their difference. Adder 109 then adds this result to the contents of register 110, and the sum is written back into register 110.

Repeating this operation for all corresponding elements yields the dissimilarity between the input data and one standard pattern. In this way, the dissimilarity is calculated against all or some of the standard patterns stored in the standard pattern memory 107.

Note, however, that register 110 must be initialized to 0 at the start of the dissimilarity calculation for each standard pattern.

That is, in the dissimilarity calculation between the standard pattern P of a given recognition word and the input data Q, assuming that the corresponding elements of the two have been normalized in advance, the dissimilarity D is given by equation (6):

$$D = \sum_{l} \sum_{i} \left| P_i(l) - Q_i(l) \right| \tag{6}$$

In equation (6), i is the number assigned to the corresponding element, and l is the number assigned to the time series of the standard pattern P and the input data Q after normalization of the speech section length.

Based on the results from the dissimilarity calculation section 17, the determination section 18 judges that the input speech is the same as the standard pattern with the lowest dissimilarity, that is, the highest similarity, and outputs the result.
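The conventional pipeline described above (linear time correspondence, accumulation of absolute differences, and selection of the minimum) can be sketched as follows. This is an illustration under assumptions: the names `linear_align`, `dissimilarity`, and `recognize` are hypothetical, and picking evenly spaced frames is only one simple way to realize a linear correspondence.

```python
from typing import Dict, List

Pattern = List[List[float]]  # pattern[l][i]: frame l, channel i


def linear_align(pattern: Pattern, length: int) -> Pattern:
    """Pick `length` frames at evenly spaced positions (linear correspondence)."""
    if length < 1 or not pattern:
        raise ValueError("pattern and target length must be non-empty")
    if length == 1:
        return [pattern[0]]
    last = len(pattern) - 1
    return [pattern[round(l * last / (length - 1))] for l in range(length)]


def dissimilarity(p: Pattern, q: Pattern) -> float:
    """Sum of absolute differences over all corresponding elements."""
    q_aligned = linear_align(q, len(p))
    total = 0.0  # plays the role of register 110, cleared for each pattern
    for p_frame, q_frame in zip(p, q_aligned):
        for p_i, q_i in zip(p_frame, q_frame):
            total += abs(p_i - q_i)
    return total


def recognize(templates: Dict[str, Pattern], q: Pattern) -> str:
    """Return the word whose standard pattern has the lowest dissimilarity."""
    return min(templates, key=lambda word: dissimilarity(templates[word], q))


if __name__ == "__main__":
    templates = {
        "a": [[1.0, -1.0], [2.0, -2.0]],
        "b": [[-1.0, 1.0], [-2.0, 2.0]],
    }
    spoken = [[1.0, -1.0], [1.0, -1.0], [2.0, -2.0], [2.0, -2.0]]
    print(recognize(templates, spoken))
```

Every element contributes with the same weight here, which is the limitation the following paragraphs address.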

However, speech varies not only from speaker to speaker but also from utterance to utterance for the same speaker, so the conventional technique described above has the drawback that words with similar analysis results are misrecognized as one another.

The present invention therefore remedies this drawback of the conventional technique, and its object is to improve the recognition rate of a speech recognition device. It adds weight region data to the standard pattern memory, and further adds to the dissimilarity calculation section a function that determines the magnitude of the weight from the relationship between the levels, signs included, of the input pattern and the standard pattern.

That is, even when two short-time spectra can clearly be recognized as different patterns by visual inspection, the overall dissimilarity may come out small, and misrecognition can result.

One effective technique for distinguishing the small number of utterances that end up looking similar under a uniform dissimilarity calculation alone is to apply weights, in the direction that increases dissimilarity, to specific regions of a standard pattern whose elements are spectrum conversion data.

The present invention performs this weighted dissimilarity calculation with the positions of the peaks and valleys of the short-time spectrum taken into account. In particular, it exploits the fact that peaks and valleys of the short-time spectrum appear in the spectrum conversion data as positive and negative signs and as the magnitude of the absolute value of the data.

FIG. 5 is a block diagram of an embodiment of the present invention, in which 11 is an input terminal, 12 is a frequency analysis section, 13 is a speech section detection section, 14 is the start-point detection signal of the speech section, 15 is the end-point detection signal of the speech section, 16 is a spectrum conversion section, 55 is a weighted dissimilarity calculation section, and 18 is a determination section. Everything other than the weighted dissimilarity calculation section 55 is the same as in FIG. 1, so the weighted dissimilarity calculation section 55 is described in detail below with reference to FIG. 6.

In FIG. 6, 101 is the start-point detection signal of the speech section, 102 is the end-point detection signal of the speech section, 103 is the input data from the spectrum conversion section 16, 104 is an input memory control circuit, 105 is an input memory, 108 is an absolute difference calculation circuit, 203 is a standard pattern memory control circuit, 204 is a standard pattern memory, 201 is the output signal line of the input memory, 205 is the output signal line for the pattern data of the standard pattern memory, 207 is the output signal line for the weight calculation designation of the standard pattern memory, 208 and 209 are level conversion circuits, 210 and 211 are the output signal lines of level conversion circuits 208 and 209, 212 is a table memory, 213 is a multiplier, 109 is an adder, and 110 is a register.

From the time the start-point detection signal 101 is generated until the end-point detection signal 102 is generated, the input data 103 are stored in the input memory 105 by the input memory control circuit 104.

When storage of the input data 103 is complete, the weighted dissimilarity between the contents of the input memory 105 and each standard pattern, analyzed in advance and stored in the standard pattern memory 204, is calculated in turn.

In the weighted dissimilarity calculation, a standard pattern is described as a time series of the spectrum conversion data x_i given by equation (1) and the weight designation data P_i, which mark the portions subject to weight calculation. The input data, on the other hand, consist only of the spectrum conversion data given by equation (1) (written z_i below to distinguish them from the standard pattern). Each spectrum conversion datum is output through output signal lines 205 and 201 to the inputs of the absolute difference calculation circuit 108 and of level conversion circuits 209 and 208, and at the same time the weight designation data are output through line 207, which forms part of the address input lines of the table memory.

Because the output data of the logarithmic converter 28 have a large number of bits (8 bits or more), level conversion circuits 208 and 209 reduce the bit count so that the capacity of table memory 212 does not become large. The outputs of level conversion circuits 208 and 209 are normally converted to about 2 to 4 bits. For example, in the 2-bit case, taking the input data as y_i, the conversion output is given by a table [not reproduced in this text].

The conversion outputs of level conversion circuits 208 and 209 pass through signal lines 210 and 211 and form part of the address input of table memory 212. Table memory 212 stores the weights. When the weight designation is P_i = 0, the weight W_i = 1 is output unconditionally; when P_i = 1, the weight varies with the combination of the conversion output values.

An example for the 2-bit case described above follows. Let X_i be the conversion output of the input data and Y_i the conversion output of the standard pattern data; the weights are then as follows [the weight table is not reproduced in this text],

where W1 > W2 > W3, with typical values of about W1 = 4, W2 = 2, and W3 = 1.

The absolute difference calculation circuit 108 performs the calculation of equation (7):

$$d_i = \left| x_i - z_i \right| \tag{7}$$

The weighted dissimilarity calculation is carried out on the time series of the weights W_i(l) and the absolute differences d_i(l) by multiplier 213, adder 109, and register 110, according to equation (8):

$$D_W = \sum_{l} \sum_{i} W_i(l)\, d_i(l) \tag{8}$$

Equation (8) is the weighted counterpart of equation (6). The weight designation is set on the portions of a standard pattern that are to be particularly emphasized (those effective for distinguishing it from other standard patterns), and the weight W_i is assigned so that differences in the polarity of the spectrum conversion data are emphasized.
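The weighted calculation of equations (7) and (8) can be sketched as below. Large parts of this example are assumptions, because the patent's level conversion table and weight table are not available in this text: the 2-bit thresholds in `level_convert`, the 5-bit address layout in `table_address`, and the rule used to fill the P_i = 1 entries (W1 when both data are strong and of opposite sign, W2 for other sign mismatches, W3 when the signs agree) are all hypothetical. Only the typical values W1 = 4, W2 = 2, W3 = 1 and the rule that P_i = 0 gives weight 1 come from the description.

```python
from typing import List

Pattern = List[List[float]]
W1, W2, W3 = 4, 2, 1  # typical values given in the description


def level_convert(y: float, y_min: float, y_max: float) -> int:
    """Hypothetical 2-bit level conversion (y_min < 0 < y_max).

    0: strongly negative, 1: weakly negative,
    2: weakly positive,   3: strongly positive.
    """
    if y < 0:
        return 0 if y < y_min / 2 else 1
    return 3 if y > y_max / 2 else 2


def table_address(p: int, x_level: int, y_level: int) -> int:
    """Assumed address layout: 1 bit of P_i, 2 bits of X_i, 2 bits of Y_i."""
    return (p << 4) | (x_level << 2) | y_level


def build_weight_table() -> List[int]:
    """32-entry table; entries for P_i = 1 follow an assumed rule."""
    table = [1] * 32  # P_i = 0: weight 1 unconditionally
    for x_level in range(4):
        for y_level in range(4):
            opposite = (x_level < 2) != (y_level < 2)
            strong = x_level in (0, 3) and y_level in (0, 3)
            weight = W3 if not opposite else (W1 if strong else W2)
            table[table_address(1, x_level, y_level)] = weight
    return table


def weighted_dissimilarity(
    standard: Pattern, flags: List[List[int]], inp: Pattern,
    table: List[int], y_min: float, y_max: float,
) -> float:
    """Equation (8); patterns are assumed to be time-normalized already."""
    total = 0.0  # register 110, cleared before each standard pattern
    for std_frame, flag_frame, in_frame in zip(standard, flags, inp):
        for x_i, p_i, z_i in zip(std_frame, flag_frame, in_frame):
            d_i = abs(x_i - z_i)  # equation (7)
            big_x = level_convert(z_i, y_min, y_max)  # input data level
            big_y = level_convert(x_i, y_min, y_max)  # standard pattern level
            total += table[table_address(p_i, big_x, big_y)] * d_i
    return total


if __name__ == "__main__":
    table = build_weight_table()
    print(weighted_dissimilarity([[4.0, -4.0]], [[1, 0]], [[-4.0, 4.0]],
                                 table, -6.0, 6.0))
```

In the demo call, both elements differ by 8, but only the first lies in a designated weight region, so it contributes 4 × 8 while the second contributes 1 × 8.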

For identification, the dissimilarity of equation (8) is obtained for all standard patterns, and the recognition word giving the minimum value is output.

As described above, a weight region is set in the standard pattern, and within that region the weight is set to become large when peak and valley portions of the spectral envelope are matched against each other, so the weight calculation is stable. By weighting the regions that are characteristic of the standard patterns of easily misrecognized words, the dissimilarity to those standard patterns can be increased without the weighting greatly amplifying the effect of utterance-to-utterance variation and of differences between speakers.

Because the present invention performs the weighted dissimilarity calculation with polarity differences, data values, and the like taken into account, it has the advantage of a stable and effective dissimilarity calculation, and it can be used in speech recognition devices that handle highly variable speech data, such as speech from unspecified speakers.

[Brief Description of the Drawings]

FIG. 1 is a block diagram of the configuration of a conventional speech recognition device, FIG. 2 is a detailed block diagram of the frequency analysis section in FIG. 1, FIG. 3 is a detailed block diagram of the spectrum conversion section in FIG. 1, FIG. 4 is a detailed block diagram of the dissimilarity calculation section in FIG. 1, FIG. 5 is a block diagram of the configuration of the present invention, and FIG. 6 is a detailed block diagram of the weighted dissimilarity calculation section of the present invention.

11: input terminal; 12: frequency analysis section; 13: speech section detection section; 14: start-point detection signal; 15: end-point detection signal; 16: spectrum conversion section; 17: dissimilarity calculation section; 18: determination section; 21: input speech signal; 22: preamplifier; 23-1, 23-2, ..., 23-n: bandpass filter group; 24-1, 24-2, ..., 24-n: full-wave rectifier group; 25-1, 25-2, ..., 25-n: low-pass filter group; 26: multiplexer; 27: analog-to-digital converter; 28: logarithmic converter; 29: output terminal; 31: input terminal; 32, 46: counters; 33, 42, 44, 47: multipliers; 34, 36, 48: adders; 35, 37, 43, 45: registers; 38, 39: multiplexers; 42, 43: subtractor-dividers; 49: delay circuit; 50: subtractor; 51: output terminal; 55: weighted dissimilarity calculation section; 101: start-point detection signal; 102: end-point detection signal; 103: input data; 104: input memory control circuit; 105: input memory; 106: standard pattern memory control circuit; 107: standard pattern memory; 108: absolute difference calculation circuit; 109: adder; 110: register; 203: standard pattern memory control circuit; 204: standard pattern memory; 201: input memory output signal line; 205: standard pattern data output signal line; 207: weight designation data output signal line; 208, 209: level conversion circuits; 210, 211: conversion output signal lines; 212: table memory; 213: multiplier.

Patent applicant: Oki Electric Industry Co., Ltd. Patent attorney for the application: 山本 恵一

Procedural Amendment (voluntary), February 1982 (Showa 57)
To: Commissioner of the Patent Office 島田春樹
1. Indication of the case: 1981 (Showa 56) Patent Application No. 109265
2. Title of the invention: Speech recognition device
3. Person making the amendment: relationship to the case: patent applicant; name: (029) Oki Electric Industry Co., Ltd.
Column "Detailed Description of the Invention" of the specification: "... y_i (where the minimum value is a negative number MIN and the maximum value is a positive number MAX, so that MIN ≤ y_i ≤ MAX), the conversion output is as shown in the following table. The conversion outputs of level conversion circuits 208 and 209 ... signal lines" [the table is not reproduced in this text]. End.

Claims

[Scope of Claims]

(1) A speech recognition device having means for frequency-analyzing input speech, speech section detection means and spectrum conversion means connected to the output thereof, dissimilarity calculation means connected to the output of the spectrum conversion means for calculating a dissimilarity by comparing the input speech with a standard pattern between the start point and end point of the speech given by the speech section detection means, and determination means connected to the output thereof for recognizing the speech, characterized in that, in the dissimilarity calculation within a preset weight region, the weight region is interpreted and the magnitude of the weight is determined according to the nature of the data in performing the dissimilarity calculation.

(2) The speech recognition device according to claim 1, wherein the magnitude of the weight is determined by whether the signs of the two kinds of data, the input data and the standard pattern data, match or differ, and by the magnitude of the absolute values of the data.