JPS60203995A

JPS60203995A - Voice pattern matching

Info

Publication number: JPS60203995A
Application number: JP59058440A
Authority: JP
Inventors: 裕飯塚
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1984-03-28
Filing date: 1984-03-28
Publication date: 1985-10-15
Also published as: JPH0424719B2

Abstract

(57) [Abstract] This publication contains application data from before electronic filing, so no abstract data is recorded.

Description

[Detailed Description of the Invention] (Technical Field) The present invention relates to a speech pattern matching method for speech recognition.

(Background Art) In speech recognition, input speech and standard speech are generally each represented as a two-dimensional feature pattern over a frequency-channel axis (hereinafter simply "channel") and a time-frame axis (hereinafter simply "frame"). The distance between the input pattern and each standard pattern is computed, and the code of the standard pattern giving the minimum distance is taken as the category of the input speech.

DP matching is well known as a way of computing a distance that allows for variation in speaking rate. As a way of avoiding its enormous amount of computation, a method using essentially linear partial paths is known, as disclosed for example in Japanese Unexamined Patent Application Publication No. 57-52096.

In that method, although some blocks may be set for convenience, transient blocks and steady blocks are set on the frames of the standard pattern in correspondence with the transient parts and steady parts of the standard speech of a word. A substantially one-to-one linear path is set as each transient-part path, and the remaining frames are assigned to the steady-part paths.

This method was useful for relatively short words. Conventionally, however, after the initial setting, several transient-part path candidates were set within a predetermined range starting from that initial path, in effect by shifting the transient block of the standard pattern along the input pattern, and the candidate giving the minimum distance was selected.

Consequently, when the number of transient blocks was increased to cope with a larger recognition vocabulary or longer words, or when one transient part of the standard speech was divided into two transient blocks and one steady block in order to limit the frame length of a single transient block, a steady-part path could become abnormally long or short, beyond normal variation in speaking rate.

(Object of the Invention) An object of the present invention is to confine the transient-part paths to the range expected from variation in speaking rate. This is done by limiting the slope of the steady-part paths and, correspondingly, by repeating several times a path setting process tentatively called optimization.

(Structure of the Invention) In the standard pattern of the present invention, transient blocks and steady blocks are set in advance along its time frames (hereinafter simply "frames"). Steady frames are sometimes set for convenience in order to limit the frame length of a transient block to a few frames, but for the most part the blocks are set in correspondence with the transient parts and steady parts of the speech.

The maximum number of transient blocks is typically about 10 when the recognition vocabulary is 100 words of 20 to 150 frames in length, with one frame being 10 msec.

FIG. 1 shows an example of a matching path according to the present invention, for the word "Tokyo" (トウキヨウ), with an input pattern length of 65 frames, a standard pattern length of 69 frames, and four transient blocks. The vertical axis is the frame axis of the input pattern and the horizontal axis is the frame axis of the standard pattern. The example of FIG. 1 uses a format in which the distance is always computed from the start of the input pattern (for which reason the standard pattern also holds information for several frames preceding the start point at the time of its creation), so the correspondence at the start point is not strictly one-to-one. Nevertheless, the transient-part paths corresponding to the transient blocks TRAN1, TRAN2, ..., TRAN4 of the standard pattern are substantially one-to-one linear paths. The steady-part paths corresponding to the steady blocks CONT1, CONT2, CONT3 of the standard pattern are set by assigning to them whatever the transient-part paths leave over.

In FIG. 1, each steady block CONTj of the standard pattern is represented by one frame of information, so each steady-part path is drawn as a vertical line.

FIG. 1 also shows the input pattern power and the distance per frame. The distance is somewhat large at the beginning of the portion corresponding to CONT3, the last steady block of the standard pattern.

In the embodiment described later, one steady block is represented by up to four frames, depending on its frame length, which reduces the increase in distance in the steady parts.

FIG. 2 is a block diagram outlining the present invention. For each standard pattern an initial path setting process is performed first, followed by several rounds of an optimization process consisting of a candidate path setting process and a transient-part path selection process.

The initial path in the present invention is set by making the transient-part paths substantially one-to-one and the steady-part paths substantially uniformly linear. FIG. 3 illustrates the concept.

In FIG. 3, a frame length INTRAN equal to the combined frame length of all transient blocks of the standard pattern is allotted to them, and the remainder INCONST of the input pattern is mapped linearly onto the steady blocks CONT1 to CONT3 of the standard pattern. This determines the number of input-pattern frames corresponding to each of the steady blocks CONT1 to CONT3, and hence the portions of the input pattern that correspond to each transient block and each steady block of the standard pattern. The result is an initial path made up of substantially one-to-one transient-part paths and steady-part paths that map the remainder uniformly and linearly.

In this case, the number of input frames corresponding to a particular steady block of the standard pattern may come out as zero. An input-pattern portion of, for example, one frame must then be assigned for convenience, so the steady-part path setting is not strictly uniformly linear, although it can be called substantially so. In the present invention the transient-part paths are determined by repeating the optimization process several times. FIG. 4 illustrates the concepts of the candidate path setting process and the transient-part path selection process for one transient block, TRAN3, of the standard pattern.

In FIG. 4, the transient-part path PATH0 is the one set in the initial path setting process, and it serves as the starting path.

Candidates for the transient-part path corresponding to transient block TRAN3, for example PATHF and PATHB, are set by shifting the frames of the input pattern within a certain range. The candidates PATHF and PATHB are therefore parallel to the starting path PATH0.

This can also be regarded as setting candidates by shifting transient block TRAN3 within the input pattern. There are normally several candidates, but depending on the conditions that determine the range there may be only one.

The range is determined by conditions that include the slopes of the preceding and following steady blocks and a specific value.

The forward range F and the backward range B are each limited to a value of, for example, 3. In addition, F is limited so that the slope of the steady-part path corresponding to the preceding steady block CONT2 stays within specified values, and B is limited so that the slope of the steady-part path corresponding to the following steady block CONT3 stays within specified values.

In the embodiment described later, variation in speaking rate within the steady parts of speech is assumed to be limited to between 50% and 200%, and the slope of each steady-part path is restricted to the range 1/2 to 2. The slope is checked by comparing the number of standard-pattern frames with the number of input-pattern frames corresponding to that steady-part path, and candidates are set by shifting within the specific-value range only where the check is satisfied.
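The slope check can be sketched as follows. This is a minimal illustration that assumes integer frame counts and the 1/2 to 2 slope range of the embodiment; the function names `steady_frame_limits` and `slope_ok` are hypothetical and do not appear in the patent.

```python
def steady_frame_limits(std_frames):
    """Return (min, max) input-frame counts for a steady block.

    std_frames is the block's frame count in the standard pattern.
    A slope of 1/2 gives the minimum and a slope of 2 the maximum;
    the minimum is kept at 1 or more so a block never vanishes.
    """
    lo = max(std_frames // 2, 1)  # integer arithmetic, as in the embodiment
    hi = std_frames * 2
    return lo, hi


def slope_ok(std_frames, input_frames):
    """True if the input portion keeps the steady-part slope within 1/2..2."""
    lo, hi = steady_frame_limits(std_frames)
    return lo <= input_frames <= hi
```

Comparing frame counts rather than computing a ratio keeps the check in integer arithmetic.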

Furthermore, when the recognition vocabulary is limited, it can be known empirically that the candidate paths lie within a specific value of the initial path, for example 14 frames (7 frames each forward and backward), so this specific value can be added to the conditions. In the transient-part path selection process, the distance along each candidate is computed and the candidate giving the minimum distance is selected as the transient-part path.
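One way to combine these conditions into a forward range F and a backward range B is sketched below. It is an illustration under stated assumptions, not the patent's circuit: "forward" is taken to mean a shift toward earlier input frames (consistent with the search interval T(4, j) - F to T(4, j) + B used in the embodiment), and the function name `shift_range` and its parameters are hypothetical.

```python
def shift_range(cur, init, prev=None, nxt=None, per_round=3, from_init=7):
    """Return (F, B): how far a transient block may move earlier / later.

    cur, init : current and initial start frame of the block in the input
    prev, nxt : (length, min_len, max_len) of the adjacent steady portions
                of the input pattern, or None at either end of the word
    per_round : maximum shift per optimization round (3 in the example)
    from_init : maximum total drift from the initial position (7 in the example)
    """
    # Moving earlier by d shortens the preceding steady portion by d
    # and lengthens the following one by d; moving later does the reverse.
    f_limits = [per_round, cur - (init - from_init)]
    b_limits = [per_round, (init + from_init) - cur]
    if prev is not None:
        length, lo, hi = prev
        f_limits.append(length - lo)
        b_limits.append(hi - length)
    if nxt is not None:
        length, lo, hi = nxt
        f_limits.append(hi - length)
        b_limits.append(length - lo)
    return max(0, min(f_limits)), max(0, min(b_limits))
```

When both values come out as 0, the only candidate is the starting path itself, which matches the remark that there may be only one candidate.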

The optimization process, consisting of the candidate path setting process and the transient-part path selection process, is first performed with the transient-part paths from the initial path setting process as starting paths. From the second round on it is performed with the paths selected in the previous optimization round as starting paths, and it is repeated several times.

In the embodiment with 100 city names, 3 to 4 repetitions were appropriate.

Next, an embodiment is described.

(Embodiment) FIG. 5 is a block diagram of one embodiment of the present invention. An input pattern supplied from input terminal 101 is stored in input buffer 102, which is connected to inter-vector distance calculation unit 103 so that the unit can read it. The inter-vector distance calculation unit 103 is also connected to table 104. Table 104 is a two-dimensional array holding the items described below. Initial setting unit 105 is connected to table 104 and initializes it. Likewise, optimization unit 107 is connected to table 104 and optimizes the correspondence between the input pattern and the standard pattern.

Standard pattern memory 109 is connected to inter-vector distance calculation unit 103 so that its data can be read. Distance calculation unit 110 is connected to table 104, and its output is connected to register 111. The output of register 111 is connected to output register 112 and comparison circuit 113. The output of output register 112 is connected to comparison circuit 113 and output terminal 114. Control circuit 115 controls the whole.

First, the input pattern supplied from input terminal 101 is written into input buffer 102. Next, the maximum positive value is set in output register 112. Distances to the K standard patterns stored in standard pattern memory 109 are then computed; here, matching against the k-th standard pattern is considered.

Pattern matching is performed by deforming the input pattern so that it matches the standard pattern well. Table 104 holds the state of the standard pattern and of the input-pattern deformation. In the standard pattern, the transient parts of the standard speech are held in advance as transient blocks of a few frames each. Each steady part of the standard speech is divided into one to four sections according to its number of frames, and one frame of data is held for each section.

Table 104 is a two-dimensional array whose elements are denoted T(i, j), where i is the item and j denotes the j-th transient block together with the steady block that follows it.

The contents of table 104 are as follows.

i = 1: number of frames in the transient block
i = 2: pointer to the standard pattern data of the transient block
i = 3: pointer into the input buffer for the transient block (initial value)
i = 4: pointer into the input buffer for the transient block
i = 5: distance of the transient-part path
i = 6: number of frames in the steady block
i = 7: number of divisions of the steady block
i = 8: pointer to the standard pattern data of the steady block
i = 9: pointer into the input buffer for the steady block (initial value)
i = 10: pointer into the input buffer for the steady block
i = 11: number of input-pattern frames for the steady block (initial value)
i = 12: number of input-pattern frames for the steady block
i = 13: minimum number of frames for the steady block
i = 14: maximum number of frames for the steady block
i = 15: distance of the steady-part path

Next, table 104 is initialized by initial setting unit 105. Let FR be the total number of frames of the k-th standard pattern, BLK the number of transient blocks, BFR the total number of frames in all transient blocks, FFR the total number of frames in all steady blocks, and INFR the total number of frames of the input pattern. The number of steady blocks is then computed by Equation 1, and the total number of input-pattern frames for the steady blocks by Equation 2.

BLKM1 ← BLK - 1 (Equation 1)
INFFR ← INFR - BFR (Equation 2)

Next, in order to map the standard pattern onto the input pattern substantially uniformly and linearly over the steady blocks, assignment unit 106 sets T(11, j) and T(12, j) for j = 1 to BLKM1.

First, for j = 1 to BLKM1:

T(11, j) ← (INFFR * 8 * T(6, j) / FFR + 4) / 8 (Equation 3)

Where necessary a steady block is set for convenience (as noted above, for example one frame when the result is zero), and further:

T(12, j) ← T(11, j) (Equation 4)

Here, if the values T(11, j) for j = 1 to BLKM1 do not sum to INFFR (Equation 5), then for convenience T(11, j) and T(12, j) (j = 1 to BLKM1) are adjusted by adding or subtracting 1, in descending order of value, so that Equation 5 holds.

All operations, including Equation 3, are integer operations. The value 4 in Equation 3 is introduced for rounding to the nearest integer.
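The allocation of input frames to the steady blocks can be sketched as below. This is a hedged illustration, not the patent's implementation: it assumes Equation 3 reads (INFFR * 8 * T(6, j) / FFR + 4) / 8 in integer arithmetic, that an allocation of zero is raised to one frame, and that Equation 5 requires the allocations to sum to INFFR. The function name `allocate_steady_frames` and the sample block lengths are hypothetical.

```python
def allocate_steady_frames(std_steady_frames, infr, bfr):
    """Distribute the non-transient input frames over the steady blocks.

    std_steady_frames : T(6, j), frames of each steady block in the standard
    infr              : total frames in the input pattern (INFR)
    bfr               : total frames in all transient blocks (BFR)
    Returns the list of T(11, j) values.
    """
    ffr = sum(std_steady_frames)              # FFR
    inffr = infr - bfr                        # Equation 2
    if ffr <= 0 or inffr < len(std_steady_frames):
        raise ValueError("input too short for one frame per steady block")

    alloc = []
    for t6 in std_steady_frames:
        n = (inffr * 8 * t6 // ffr + 4) // 8  # Equation 3: scaled, rounded
        alloc.append(max(n, 1))               # never leave a block empty

    # Adjust by +/-1, largest allocations first, until the sum is INFFR.
    diff = inffr - sum(alloc)
    order = sorted(range(len(alloc)), key=lambda j: alloc[j], reverse=True)
    k = 0
    while diff != 0:
        j = order[k % len(order)]
        step = 1 if diff > 0 else -1
        if alloc[j] + step >= 1:
            alloc[j] += step
            diff -= step
        k += 1
    return alloc
```

With the frame totals of FIG. 1 (65 input frames, 69 standard frames) and assumed block lengths of 27 transient frames and steady blocks of 10, 20 and 12 frames, the call `allocate_steady_frames([10, 20, 12], infr=65, bfr=27)` distributes the 38 remaining input frames.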

Next, the numbers of input-pattern frames corresponding to steady-part path slopes of 1/2 and 2 are obtained and set as the minimum and maximum numbers of frames. For j = 1 to BLKM1:

T(13, j) ← T(6, j) / 2 (Equation 6)
(if T(13, j) = 0, then T(13, j) ← 1)
T(14, j) ← T(6, j) * 2 (Equation 7)

Further, with STFR the first frame of the input pattern, to cope with variation in the detected start of speech:

FF ← STFR - W (Equation 8)

To obtain the first frame position of each block, the following are repeated for j = 1 to BLKM1:

T(3, j) ← FF (Equation 9)
T(4, j) ← FF (Equation 10)
FF ← FF + T(1, j) (Equation 11)
... (Equations 12 and 13)
FF ← FF + T(11, j) (Equation 14)

W is a value that allows for uncertainty in the start and end points of the word speech; here it is set to 3.

This completes the initialization of table 104.

Next, optimization unit 107 optimizes table 104 so that the input pattern and the k-th standard pattern match well.

The following is performed with the number of optimization rounds set to 4.

For j = 1 to BLK, each transient block is moved forward and backward and fixed at the position that gives the best match. The procedure first determines how far forward and backward the block may be moved.

The conditions that determine the range are:

- movement of at most 3 frames forward or backward from the starting path of the current round;
- when 1 < j < BLK (not a transient block at either end), within 7 frames forward or backward of the initial position;
- when j = 1 or j = BLK, within W frames of the initial position;
- the length of the input-pattern portion corresponding to the preceding steady-part path is at least T(13, j-1) and at most T(14, j-1);
- the length of the input-pattern portion corresponding to the following steady-part path is at least T(13, j) and at most T(14, j).

With F the forward range and B the backward range so determined, the distance of the pattern along the transient-part path is computed by inter-vector distance calculation unit 103 for each FA from T(4, j) - F to T(4, j) + B. The minimum distance is denoted MIND and the FA that gives it MINF. The inter-vector distance calculation unit 103 refers to the block frame count T(1, j) and the pointer T(2, j) into standard pattern memory 109, both held in table 104. The distance D is computed by the following equation.

... (Equation 15)

Here I is speech input buffer 102, S is standard pattern memory 109, CH is the number of channels, c is the channel number, and m is the frame number.
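The search over FA can be sketched as follows. Equation 15 itself is not reproduced in this copy, so the sketch assumes a city-block (sum of absolute differences) distance over channels and frames; the actual metric in the patent may differ. The names `block_distance` and `best_shift` are hypothetical.

```python
def block_distance(inp, std, fa, sp, n_frames):
    """Distance between n_frames of inp from frame fa and std from frame sp.

    inp, std : lists of frames, each frame a list of CH channel values
    ASSUMPTION: city-block distance; Equation 15 is not legible in the source.
    """
    total = 0
    for m in range(n_frames):              # m: frame number within the block
        for c in range(len(std[sp + m])):  # c: channel number
            total += abs(inp[fa + m][c] - std[sp + m][c])
    return total


def best_shift(inp, std, start, sp, n_frames, f, b):
    """Try every FA in start-f .. start+b and return (MIND, MINF)."""
    mind, minf = None, None
    for fa in range(start - f, start + b + 1):
        if fa < 0 or fa + n_frames > len(inp):
            continue                       # candidate runs off the input buffer
        d = block_distance(inp, std, fa, sp, n_frames)
        if mind is None or d < mind:
            mind, minf = d, fa
    return mind, minf
```

Because every candidate keeps the one-to-one frame correspondence and only the offset changes, the cost per block is (F + B + 1) block comparisons, well below that of a full DP search.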

Next, table 104 is rewritten according to these results. When j > 1:

T(12, j-1) ← T(12, j-1) - (T(4, j) - MINF) (Equation 16)

...

T(12, j) ← T(12, j) + (T(4, j) - MINF) (Equation 17)
T(10, j) ← T(10, j) - (T(4, j) - MINF) (Equation 18)

and further:

T(4, j) ← MINF (Equation 19)
T(5, j) ← MIND (Equation 20)

This completes the optimization.
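The table update of Equations 16 to 20 can be sketched as below, with table 104 modeled as a dictionary keyed by (i, j). The condition guarding Equations 17 and 18 is lost in the source; the sketch assumes they apply only when a following steady block exists (j < BLK), which is an assumption. The name `apply_shift` is hypothetical.

```python
def apply_shift(T, j, minf, mind, blk):
    """Record the selected position MINF and distance MIND for block j.

    T   : dict mapping (item i, block j) to a value, standing in for table 104
    blk : number of transient blocks (BLK)
    """
    delta = T[(4, j)] - minf          # positive when the block moved earlier
    if j > 1:
        T[(12, j - 1)] -= delta       # Equation 16: preceding steady length
    if j < blk:                       # ASSUMED condition, lost in the source
        T[(12, j)] += delta           # Equation 17: following steady length
        T[(10, j)] -= delta           # Equation 18: following steady start
    T[(4, j)] = minf                  # Equation 19
    T[(5, j)] = mind                  # Equation 20
```

The two length updates cancel, so the total number of input frames covered by the path is unchanged by a shift.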

Next, distance calculation unit 110 obtains the overall distance. First, the distances for blocks 1 and BLK are recomputed.

FN ← T(1, 1) - W + (T(4, 1) - T(3, 1)) (Equation 21)
IP ← T(4, 1) + W (Equation 22)
SP ← T(2, 1) + W - (T(4, 1) - T(3, 1)) (Equation 23)
... (Equation 24)
FN ← T(1, BLK) ... (T(4, BLK) - T(3, BLK)) (Equation 25)
IP ← T(4, BLK) (Equation 26)
SP ← T(2, BLK) (Equation 27)
... (Equation 28)

Next, for j = 1 to BLKM1, the distance along the steady-part path is obtained and stored in T(15, j).

DDD ← 0 (Equation 29)
IFP ← T(10, j) (Equation 30)
SFP ← T(8, j) (Equation 31)
... (Equation 32)
IFP ← IFP + FN (Equation 33)
SFP ← SFP + 1 (Equation 34)

Finally:

T(15, j) ← DDD (Equation 35)
DD ← DD + T(5, j) + T(15, j) (Equation 36)

This computation yields the distance DD between the k-th standard pattern and the input pattern.

Next, distance DD is set in register 111 and compared with the contents of output register 112 by comparison circuit 113. If the value in register 111 is smaller, that value and the standard pattern number are set in output register 112.

Pattern matching is performed against all standard patterns, k = 1 to K, and the minimum distance and the corresponding standard pattern number are output from output terminal 114, which completes the operation.
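The register-and-comparator decision can be sketched in software as follows. This is only an illustration of the logic; the name `recognize` is hypothetical, and the per-pattern distances DD are assumed to have been computed already.

```python
def recognize(distances):
    """Return (k, DD) for the standard pattern with the smallest distance.

    distances : DD for standard patterns k = 1..K, in order
    """
    best_dist = float("inf")  # output register preset to the maximum value
    best_k = None
    for k, dd in enumerate(distances, start=1):
        if dd < best_dist:    # comparison circuit: new distance is smaller
            best_dist, best_k = dd, k
    return best_k, best_dist
```

Because the comparison is strict, the earliest pattern wins when two distances are equal.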

FIG. 1, mentioned earlier, shows the distance per frame in this embodiment. Although the distance is large in some places, overall it is close to 0, which shows that the patterns match well.

(Effect of the Invention) As described above, the present invention requires transient parts and steady parts to be set in advance for each standard speech, but because it relies on setting essentially linear partial paths over a certain range, it has the advantage of achieving a high recognition rate with a relatively small amount of computation.

[Brief Description of the Drawings]

FIGS. 1 to 4 are diagrams for explaining the present invention in detail: FIG. 1 shows an example of a matching path, FIG. 2 illustrates the overall function of the invention, FIG. 3 illustrates the initial path setting process, and FIG. 4 illustrates the optimization process. FIG. 5 is a block diagram of one embodiment of the present invention.

101: input terminal; 102: speech input buffer; 103: inter-vector distance calculation unit; 104: table; 105: initial setting unit; 107: optimization unit; 109: standard pattern memory; 110: distance calculation unit; 111: register; 112: output register; 113: comparison circuit; 114: output terminal; 115: control circuit.

Patent applicant: Oki Electric Industry Co., Ltd.

Procedural amendment (59.8.23), addressed to the Commissioner of the Patent Office.
1. Indication of the case: Patent Application No. 58440 of Showa 59 (1984)
2. Title of the invention: Speech pattern matching method
3. Person making the amendment: the patent applicant; address: 1-7-12 Toranomon, Minato-ku, Tokyo 105
6. Content of the amendment: FIG. 2 of the drawings is amended as shown in the attached sheet.

Claims

[Scope of Claims]

(1) A speech pattern matching method in which transient blocks and steady blocks are set in advance for each standard pattern; which comprises an initial path setting process, a candidate path setting process, and a transient-part path selection process; in which a path corresponding to a transient block of the standard pattern is a transient-part path and a path corresponding to a steady block of the standard pattern is a steady-part path; and in which the transient-part paths selected in the final transient-part path selection process, together with the steady-part paths assigned as their remainder, form a matching path along which the distance between the input pattern and the standard pattern is obtained; the method being characterized by comprising:
a) the initial path setting process, which sets substantially one-to-one linear paths as the transient-part paths and substantially uniformly linear paths as the steady-part paths;
b) the candidate path setting process, provided a plurality of times, which takes as its starting point the transient-part path set in the initial path setting process on the first round, and a path including the transient-part path selected in the transient-part path selection process on the second and subsequent rounds; which determines, for each transient block of the standard pattern, a range according to conditions including the slopes of the steady blocks before and after it and a first specific value; and which sets, for each transient block of the standard pattern, one or a plurality of substantially one-to-one transient-part paths obtained by shifting the time frames of the input pattern within that range; and
c) the transient-part path selection process, provided for each candidate path setting process, which obtains the pattern distance along each transient-part path set in the candidate path setting process and selects, for each transient block of the standard pattern, the transient-part path giving the minimum value.

(2) The speech pattern matching method according to claim (1), characterized in that the starting point on the second and subsequent rounds includes the starting point of the first round, and the conditions on the second and subsequent rounds include a second specific value.