JP2001175294A

JP2001175294A - Sound analysis device and sound analysis method

Info

Publication number: JP2001175294A
Application number: JP36199799A
Authority: JP
Inventors: Goro Sakata; 吾朗坂田
Original assignee: Casio Computer Co Ltd
Current assignee: Casio Computer Co Ltd
Priority date: 1999-12-21
Filing date: 1999-12-21
Publication date: 2001-06-29

Abstract

PROBLEM TO BE SOLVED: To reduce the burden of a processing in the analysis processing of sound. SOLUTION: A multiplier 11a multiplies an input signal A by an input signal B. A multiplier 11b squares the input signal B. Multipliers (12a and 13a), an adder 14a and a delay unit 15a, and multipliers (12b and 13b), an adder 14b and a delay unit 15b constitute feedback filters respectively. Multiplication coefficients are set in the filters so that they show entirely identical time characteristics. Signals obtained by inputting signals outputted from the multiplier 11a or the multiplier 11b to and making them pass through the feedback filters are weighted by windowing with respect to the signals outputted from the multiplier 11a or the multiplier 11b and is considered to be a signal obtained by accumulating and adding them at every sample. A divider 16 obtains a division result k by correcting the past division result of the divider based on divisor data and a divided data, which are inputted to the divider 16.

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、音声の高能率符号
化の技術に関し、特に音声波形の線形予測分析の技術に
関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a technique for high-efficiency speech coding, and more particularly to a technique for linear prediction analysis of a speech waveform.

【０００２】[0002]

【従来の技術】音声の高能率符号化の分野において、分
析合成方式、いわゆるボコーダ方式の符号化の技術が数
多く提案されている。その中でも、音声波形の線形予測
を行なう、偏自己相関係数（ＰＡＲＣＯＲ係数：partia
l correlation coefficient ）を利用する分析合成方式
は、情報圧縮効果が高く、音声データの記憶・転送とい
った点で有利である。2. Description of the Related Art In the field of high-efficiency speech coding, many techniques for encoding using a so-called vocoder method have been proposed. Among them, a partial autocorrelation coefficient (PARCOR coefficient: partia) for performing linear prediction of a speech waveform
The analysis / synthesis method using l correlation coefficient) has a high information compression effect and is advantageous in terms of storage and transfer of voice data.

【０００３】図７は、ディジタルデータの形で入力され
る音声信号のＰＡＲＣＯＲ係数を計算する音声分析回路
の一例である。同図において、１０１ａ、１０１ｂ、
…、１０１ｎは遅延器、１０２ａ、１０２ｂ、…、１０
２ｎは相関器、１０３ａａ、１０３ａｂ、１０３ｂａ、
１０３ｂｂ、…、１０３ｎａ、１０３ｎｂは乗算器、１
０４ａａ、１０４ａｂ、１０４ｂａ、１０４ｂｂ、…、
１０４ｎａ、１０４ｎｂは加算器である。同図の回路は
いわゆる格子型計算法に基づいてＰＡＲＣＯＲ係数を算
出するものであり、この回路に音声信号を入力するとｋ
パラメータ（ＰＡＲＣＯＲ係数：ｋａ、ｋｂ、…、ｋ
ｎ）及び残差波形εが出力される。FIG. 7 shows an example of a voice analysis circuit for calculating a PARCOR coefficient of a voice signal input in the form of digital data. In the figure, 101a, 101b,
, 101n are delay units, 102a, 102b, ..., 10
2n is a correlator, 103aa, 103ab, 103ba,
103bb,..., 103na, 103nb are multipliers,
04aa, 104ab, 104ba, 104bb, ...,
104na and 104nb are adders. The circuit shown in the figure calculates a PARCOR coefficient based on a so-called grid type calculation method.
Parameters (PARCOR coefficients: ka, kb, ..., k
n) and the residual waveform ε are output.

【０００４】図８は、図７に示した回路により算出され
たディジタル信号の形で入力される音声信号のＰＡＲＣ
ＯＲ係数から音声信号を合成する音声合成装置の一例で
ある。図８において、２０１ｎ、２０１ｍ、…、２０１
ａは遅延器、２０２ｎａ、２０２ｎｂ、２０２ｍａ、２
０２ｍｂ、…、２０２ａａ、２０２ａｂは乗算器、２０
３ｎａ、２０３ｎｂ、２０３ｍａ、２０３ｍｂ、…、２
０３ａａ、２０３ａｂは加算器である。図８の装置で
は、図７の回路により算出されたｋパラメータ及び残差
波形εが入力されると、もとの音声信号が合成されて出
力される。FIG. 8 shows a PARC of an audio signal input in the form of a digital signal calculated by the circuit shown in FIG.
It is an example of a speech synthesizer that synthesizes a speech signal from an OR coefficient. 8, 201n, 201m,..., 201
a is a delay device, 202na, 202nb, 202ma, 2
, 202aa, 202ab are multipliers, 20
3na, 203nb, 203ma, 203mb, ... 2
03aa and 203ab are adders. In the device of FIG. 8, when the k parameter and the residual waveform ε calculated by the circuit of FIG. 7 are input, the original audio signal is synthesized and output.

【０００５】この図７及び図８に示した偏自己相関音声
分析合成系において、図７から出力されるｋパラメータ
や残差波形εの情報量を削減することによって情報圧縮
が行なわれる。[0005] In the partial autocorrelation speech analysis / synthesis system shown in FIGS. 7 and 8, information compression is performed by reducing the information amount of the k parameter and the residual waveform ε output from FIG. 7.

【０００６】一般的に音声波形のＰＡＲＣＯＲ係数を求
めて分析を行なう場合、分析する音声データ列に対し、
音声波形が定常的であると考えられる通常数十ｍｓの間
隔（この時間を、「分析ブロック長」と呼ぶこととす
る）での切り出しが行なわれる。このときの切り出され
たデータに対してはハニング窓やハミング窓等の窓関数
を用いての窓掛けが行なわれている。図９は、窓掛けに
用いられる窓関数のひとつであるハミング窓の時間波形
を示す図である。ここで、結果の連続性を確保するた
め、窓掛けがなされたデータ列と、分析ブロック長より
短い一定時間（この時間を「フレーム」と呼ぶこととす
る）だけ遅延させた元の音声データ列から分析ブロック
長での切り出し及び窓掛けが行なわれたデータ列との間
での畳み込み演算が行なわれる。更に、その畳み込み演
算の結果に対して正規化が行なわれることによって、ｋ
パラメータが算出される。ここで、正規化演算は分析ブ
ロック長に含まれる窓掛けされたデータ列の有するエネ
ルギーに応じた除算となる。図１０は、分析ブロック長
とフレームとの関係を示した図である。In general, when a PARCOR coefficient of a voice waveform is obtained and analyzed, a voice data string to be analyzed is
Speech waveforms are usually cut out at intervals of several tens of ms (this time is referred to as “analysis block length”), which is considered to be stationary. The data extracted at this time is windowed using a window function such as a Hanning window or a Hamming window. FIG. 9 is a diagram showing a time waveform of a Hamming window, which is one of window functions used for windowing. Here, in order to ensure the continuity of the result, the windowed data sequence and the original audio data sequence delayed by a certain time shorter than the analysis block length (this time is referred to as “frame”) Is convolved with the data string that has been cut out and windowed at the analysis block length. Further, by normalizing the result of the convolution operation, k
Parameters are calculated. Here, the normalization operation is division according to the energy of the windowed data sequence included in the analysis block length. FIG. 10 is a diagram illustrating a relationship between an analysis block length and a frame.

【０００７】図７においては、フレームが遅延器（１０
１ａ、１０１ｂ、…、１０１ｎ）で生じさせる遅延時間
に相当し、また、窓掛け、畳み込み演算、及び正規化演
算は相関器（１０２ａ、１０２ｂ、…、１０２ｎ）で処
理される。ここで、相関器（１０２ａ、１０２ｂ、…、
１０２ｎ）で行なわれるｋパラメータの算出処理を数式
で示すと下記の式になる。[0007] In FIG. 7, the frame is a delay unit (10
1a, 101b,..., 101n), and the windowing, convolution operation, and normalization operation are processed by the correlators (102a, 102b,..., 102n). Here, the correlators (102a, 102b,...,
The calculation process of the k parameter performed in step 102n) is expressed by the following equation.

【０００８】[0008]

【数１】 (Equation 1)

【０００９】上記の式において、ｘ_nは音声データ列を
示し、ｘ_n+iはｘ_nを基準としたときにｉサンプル後の
音声データ列を示している。これは、ｘ_n+iを基準とす
ればｘ_nがｉサンプル過去の音声データ列を示している
と考えられ、遅延器（１０１ａ、１０１ｂ、…、１０１
ｎ）により遅延された音声データ列がｘ_nとして与えら
れる。また、Ｗ_nは窓関数を示している。In the above equation, x _n indicates an audio data sequence, and x _{n + i} indicates an audio data sequence after i samples with reference to x _n . It is considered that _xn indicates an audio data sequence i samples past based on _{xn + i} , and the delay units (101a, 101b,..., 101)
The audio data sequence delayed by n) is given as _xn . In addition, W _n represents the window function.

【００１０】[0010]

【発明が解決しようとする課題】ＰＡＲＣＯＲ係数を算
出する演算は、分析・合成の品質により、予測残差に対
して同様の演算が数回から数十回漸化的に繰り返し実行
されるのが一般的であり、多量の演算が必要である。そ
のため、音声波形の分析を高速に行なうには演算処理を
実行させるハードウェアの規模を大きくすることが避け
られなかった。In the calculation for calculating the PARCOR coefficient, the same calculation is repeatedly and repetitively executed several to several tens of times for the prediction residual depending on the quality of analysis and synthesis. It is general and requires a lot of computation. Therefore, in order to analyze a speech waveform at high speed, it is inevitable to increase the scale of hardware for executing arithmetic processing.

【００１１】以上の問題を鑑み、音声の分析処理におけ
る処理の負担を軽減することが本発明が解決しようとす
る課題である。In view of the above problems, it is an object of the present invention to reduce the processing load in voice analysis processing.

【００１２】[0012]

【課題を解決するための手段】本発明は、音声波形デー
タの線形予測分析を行なう音声分析装置または方法を前
提とする。SUMMARY OF THE INVENTION The present invention is based on a speech analyzer or method for performing linear prediction analysis of speech waveform data.

【００１３】そして、本発明の第一の態様の音声分析装
置では、前記音声波形データと、遅延させた前記音声波
形データとの間で相関演算を行なう相関器と、前記相関
器に備えられ、前記相関演算において行なわれる除算を
行なう除算器と、前記除算器の除数データ入力と被除数
データ入力との各々に設けられる、特性の等しい帰還型
フィルタとを有するように構成することによって前述し
た課題を解決する。[0013] In the voice analyzing apparatus according to the first aspect of the present invention, the correlator performs a correlation operation between the voice waveform data and the delayed voice waveform data, and the correlator includes: The above-mentioned problem is solved by having a divider that performs the division performed in the correlation operation, and a feedback filter having the same characteristics provided for each of the divisor data input and the dividend data input of the divider. Resolve.

【００１４】一般的に、データ列に対する窓掛けの処理
は、窓関数の値が予め格納されているテーブルから値を
取得したり、あるいは窓関数の値を直接計算しながらそ
の値を各データに乗算する処理となり、処理負担が大き
い。これに対し、上記の構成によれば、演算量の少ない
帰還型フィルタを利用して音声波形データに対する窓掛
けの処理を代行することにより、分析のための処理負担
を大幅に軽減することができる。更に、前述したような
フレームを単位としての分析処理ではないので、リアル
タイムの分析処理を行なう場合であっても、小規模のハ
ードウェアの構成でタイムラグの短い分析処理が行なえ
る。In general, the windowing process for a data sequence is performed by obtaining a value from a table in which the value of a window function is stored in advance, or by directly calculating the value of the window function and assigning the value to each data. Multiplication is required, and the processing load is large. On the other hand, according to the above configuration, the processing load for the analysis can be greatly reduced by using the feedback filter having a small amount of calculation to perform the windowing process on the audio waveform data. . Further, since the analysis processing is not performed in units of frames as described above, even when performing real-time analysis processing, analysis processing with a short time lag can be performed with a small-scale hardware configuration.

【００１５】また、本発明の第二の態様の音声分析装置
では、前記音声波形データと、遅延させた前記音声波形
データとの間で相関演算を行なう相関器と、前記相関器
に備えられ、除算を行なう除算器と、を有し、前記除算
器は、除算結果を、該除算器に入力されている除数デー
タ及び被除数データに基づいて該除算器の過去の除算結
果を補正することによって得るように構成することによ
って前述した課題を解決する。[0015] In a second aspect of the present invention, in the voice analyzer, the correlator performs a correlation operation between the voice waveform data and the delayed voice waveform data, and the correlator includes: A divider for performing division, wherein the divider obtains a division result by correcting a past division result of the divider based on divisor data and dividend data input to the divider. With such a configuration, the above-described problem is solved.

【００１６】上記の除算器としては、例えば、該除算器
の除算結果を遅延させる遅延器と、前記遅延器の出力に
該除算器に入力された除数データを乗じる乗算器と、該
除算器に入力された被除数データから前記乗算器の乗算
結果を減じる減算器と、前記遅延器の出力と前記減算器
の減算結果とを加算する加算器と、を有し、前記加算器
の加算結果を該除算器の除算結果とする、ように構成す
ることで実現される。As the above-mentioned divider, for example, a delay unit for delaying the result of division by the divider, a multiplier for multiplying the output of the delay unit by the divisor data input to the divider, A subtractor that subtracts the multiplication result of the multiplier from the input dividend data; and an adder that adds the output of the delay unit and the subtraction result of the subtractor. This is realized by a configuration in which the result is a division result of the divider.

【００１７】除算器は多量の演算を必要とするものが一
般的であるが、上記の構成の除算器は、少ない演算量で
除算結果を得ることができるので、分析のための処理負
担を軽減することができる。Although the divider generally requires a large amount of operation, the divider having the above configuration can obtain a division result with a small amount of operation, so that the processing load for analysis is reduced. can do.

【００１８】また、本発明の第一の態様の音声分析方法
では、前記音声波形データと、遅延させた前記音声波形
データとの間での相関演算のために行なわれる除算で、
特性の等しい帰還型フィルタから出力されるデータを該
除算の除数データと被除数データとすることにより、前
記第一の態様の音声分析装置と同様の作用・効果が得ら
れる。Further, in the voice analysis method according to the first aspect of the present invention, the division performed for a correlation operation between the voice waveform data and the delayed voice waveform data includes:
By using the data output from the feedback filters having the same characteristics as the divisor data and the dividend data of the division, the same operation and effect as those of the voice analyzer of the first aspect can be obtained.

【００１９】また、本発明の第二の態様の音声分析方法
では、前記音声波形データと、遅延させた前記音声波形
データとの間の相関係数を得る演算における除算で、該
除算の除算結果を、除数データ及び被除数データに基づ
いて過去の除算結果を補正することによって得るように
することにより、前記第二の態様の音声分析装置と同様
の作用・効果が得られる。Further, in the voice analysis method according to the second aspect of the present invention, in the division in the calculation for obtaining the correlation coefficient between the voice waveform data and the delayed voice waveform data, the division result of the division is obtained. Is obtained by correcting the past division result based on the divisor data and the dividend data, thereby obtaining the same operation and effect as those of the speech analyzer of the second aspect.

【００２０】[0020]

【発明の実施の形態】以下、本発明の実施の形態を図面
に基づいて説明する。図１は本発明に係る音声分析装置
（以下、「本装置」という）の全体構成を示す図であ
る。Embodiments of the present invention will be described below with reference to the drawings. FIG. 1 is a diagram showing an overall configuration of a speech analyzer according to the present invention (hereinafter, referred to as “this device”).

【００２１】音声分析部１は、ディジタルデータの形で
入力される音声信号の分析演算を実行するものであり、
入力される音声信号を分析してｋパラメータ及び残差波
形εを出力する。本実施の形態においては、音声分析部
１として図７に示す回路をそのまま使用する。音声分析
部１から出力されるｋパラメータ（線形予測係数）はそ
のまま本装置の出力となり、残差波形εは残差波形デー
タ処理部２に送付される。The voice analysis unit 1 performs an analysis operation on a voice signal input in the form of digital data.
The input voice signal is analyzed and the k parameter and the residual waveform ε are output. In the present embodiment, the circuit shown in FIG. The k parameter (linear prediction coefficient) output from the voice analysis unit 1 becomes the output of the apparatus as it is, and the residual waveform ε is sent to the residual waveform data processing unit 2.

【００２２】残差波形データ処理部２は入力された残差
波形εを正規化し、その後低ビット（少レベル）での再
量子化を行なった正規化残差データをその正規化情報と
共に出力する。ここで、残差波形εの正規化は低ビット
での再量子化による劣化を軽減させるために行なわれる
ものである。残差波形データ処理部２の詳細構成は後述
する。The residual waveform data processing unit 2 normalizes the input residual waveform ε, and then outputs normalized residual data that has been requantized with low bits (small level) together with the normalization information. . Here, the normalization of the residual waveform ε is performed in order to reduce deterioration due to requantization at low bits. The detailed configuration of the residual waveform data processing unit 2 will be described later.

【００２３】次に図２について説明する。同図は本装置
の音声分析部１に設けられる相関器（図７における相関
器（１０２ａ、１０２ｂ、…、１０２ｎ））の回路構成
を示す図であり、本装置において本発明に特に関係する
ものである。Referring now to FIG. FIG. 2 is a diagram showing a circuit configuration of a correlator (correlators (102a, 102b,..., 102n) in FIG. 7) provided in the voice analysis unit 1 of the present apparatus. It is.

【００２４】図２において、１１ａ、１１ｂ、１２ａ、
１２ｂ、１３ａ、１３ｂは乗算器、１４ａ、１４ｂは加
算器、１５ａ、１５ｂはデジタルデータにおける１サン
プル分に相当する遅延を得る遅延器、１６は除算器であ
る。In FIG. 2, 11a, 11b, 12a,
12b, 13a and 13b are multipliers, 14a and 14b are adders, 15a and 15b are delay devices for obtaining a delay corresponding to one sample in digital data, and 16 is a divider.

【００２５】同図において、乗算器１１ａは入力Ａから
の入力信号に入力Ｂからの入力信号を乗じた値Ａ×Ｂを
計算する。乗算器１１ｂは入力Ｂからの入力信号の二
乗、すなわちＢ×Ｂを計算する。In the figure, a multiplier 11a calculates a value A × B obtained by multiplying an input signal from an input A by an input signal from an input A. The multiplier 11b calculates the square of the input signal from the input B, that is, B × B.

【００２６】また、乗算器（１２ａ、１３ａ）、加算器
１４ａ、遅延器１５ａにより一次の帰還型フィルタ（巡
回型フィルタ）が構成されており、更に、乗算器（１２
ｂ、１３ｂ）、加算器１４ｂ、遅延器１５ｂによっても
同様の帰還型フィルタが構成されている。ここで、乗算
器１２ａと乗算器１３ａとの乗数が同じくａ、また乗算
器１２ｂと乗算器１３ｂとの乗数が同じく１−ａにそれ
ぞれ設定されており、これらのフィルタのインパルス応
答（単位インパルスに対する時間応答）は、図３に示す
ような、時間の経過と共に指数的に減衰していく全く同
一の特性を呈する。A first-order feedback filter (recursive filter) is constituted by the multipliers (12a, 13a), the adder 14a, and the delay unit 15a.
b, 13b), the adder 14b, and the delay unit 15b constitute a similar feedback filter. Here, the multiplier between the multiplier 12a and the multiplier 13a is set to a, and the multiplier between the multiplier 12b and the multiplier 13b is set to 1-a. The impulse response (the unit impulse) of these filters is set. The time response) has exactly the same characteristics as shown in FIG. 3, which exponentially decays with time.

【００２７】本実施の形態では、乗算器１１ａ若しくは
乗算器１１ｂから出力される信号が、図３に示す特性を
呈する帰還型フィルタを通過する。この帰還型フィルタ
からの出力信号を、乗算器１１ａ若しくは乗算器１１ｂ
から出力される信号に対して図３に示した特性の時間窓
の窓掛けによる重み付けを行なって１サンプル毎にそれ
らを累積加算した信号とする。すなわち、図３に示した
特性を有する窓関数Ｗ _nを想定し、入力Ａからの入力信
号ａをデータ列ｘ_n+iとし、入力Ｂからの入力信号ｂを
遅延器（１０１ａ、１０１ｂ、…、１０１ｎ）によりデ
ータ列ｘ_n+iよりもｉサンプルの遅延を有しているデー
タ列ｘ_nとすれば、図５に示す回路全体であたかも前掲
した［数１］の右辺の計算を行なうかのような演算処理
が実行されることになる。In this embodiment, the multiplier 11a or
The signal output from the multiplier 11b has the characteristic shown in FIG.
Pass through the present feedback filter. This feedback filter
Is output from the multiplier 11a or the multiplier 11b
Window with the characteristics shown in FIG. 3 for the signal output from
Weighting by windowing
These signals are cumulatively added. That is, as shown in FIG.
Window function W with characteristics _nInput signal from input A
A to data string x_{n + i}And the input signal b from the input B is
The delays (101a, 101b,..., 101n)
Data row x_{n + i}Data with a delay of i samples than
Column x_nThen, the entire circuit shown in FIG.
Calculation processing as if calculating the right side of [Equation 1]
Will be executed.

【００２８】なお、この帰還型フィルタの次数を増や
し、より複雑な応答特性をフィルタに持たせるようにし
てもよい。次に除算器１６について説明する。図４は図
２に示した相関器で用いられる除算器１６の詳細構成を
示す図である。図４において、２１は減算器、２２は加
算器、２３は乗算器、２４はデジタルデータにおける１
サンプル分の遅延器であり、入力Ａに被除数である入力
信号ａが入力され、入力Ｂに除数である入力信号ｂが入
力されると出力Ｃからａ÷ｂの結果である出力信号ｃが
出力される。図４に示す回路により実行される演算を式
により示すと下記の式となる。The order of the feedback filter may be increased so that the filter has more complicated response characteristics. Next, the divider 16 will be described. FIG. 4 is a diagram showing a detailed configuration of the divider 16 used in the correlator shown in FIG. In FIG. 4, 21 is a subtractor, 22 is an adder, 23 is a multiplier, and 24 is 1 in digital data.
This is a delay unit for a sample. When an input signal a which is a dividend is input to an input A and an input signal b which is a divisor is input to an input B, an output signal c which is a result of a ÷ b is output from an output C. Is done. The operation performed by the circuit shown in FIG. 4 is expressed by the following expression.

【００２９】[0029]

【数２】 (Equation 2)

【００３０】上式において、ｃ（ｚ^-1）は遅延器２４の
出力として得られる、出力Ｃの１サンプル前の出力信号
である。［数２］の右辺の｛｝内の演算は、入力信号ａ
の現在の値を入力信号ｂの現在の値で除算したときの除
算結果を１サンプル前の除算の結果である出力信号ｃ
（ｚ^-1）としたときの解のズレを求める演算となる。そ
こで、この演算により得られた値と前述したｃ（ｚ^-1）
とを加算して解のズレを補正し、この加算の結果の値を
ａ÷ｂの出力信号ｃとする。この［数２］に示した演算
による除算をここでは「サーボ除算」と呼ぶこととす
る。In the above equation, c (z ^-1 ) is an output signal obtained one sample before the output C, which is obtained as the output of the delay unit 24. The operation in the square on the right side of [Equation 2] is based on the input signal a
Is divided by the current value of the input signal b to obtain an output signal c which is the result of the division one sample before.
This is an operation to find the deviation of the solution when (z ^-1 ). Therefore, the value obtained by this calculation and the above-mentioned c (z ^-1 )
Is added to correct the deviation of the solution, and the value of the result of the addition is set as an output signal c of a ÷ b. Here, the division by the operation shown in [Equation 2] is referred to as “servo division”.

【００３１】このサーボ除算は、解が正確な値で安定す
るまでにある程度繰り返し演算する必要があるものの、
１サンプル中の計算量が少なくて済む利点があり、除算
結果の変動が少ない除算を行なう場合に好適なものであ
る。一般的に、同一の音源を分析して得られるＰＡＲＣ
ＯＲ係数の値はあまり激しく変化することがないので、
本装置の相関器での除算にサーボ除算を実行する回路を
採用できる。This servo division needs to be repeated to some extent until the solution is stabilized at an accurate value.
This method has the advantage that the amount of calculation in one sample is small, and is suitable for performing division with little fluctuation in the division result. Generally, a PARC obtained by analyzing the same sound source
Since the value of the OR coefficient does not change very sharply,
A circuit for performing servo division for division by the correlator of the present apparatus can be employed.

【００３２】図４に示すサーボ除算器を図２に示す相関
器に使用するときには、図２の加算器１４ａの出力を図
４の入力Ａに入力し、図２の加算器１４ｂの出力を図４
の入力Ｂに入力する。図５はサーボ除算器の使用・不使
用による相関器の応答特性の違いを示す図である。同図
は、周波数をわずかにずらした２つの正弦波Ａ、Ｂにつ
いて、相関係数を算出する演算を行なった結果であり、
（１）が正弦波Ａの波形、（２）が正弦波Ｂの波形、
（３）が除算器として通常の除算を行なわせるようにし
たときの相関器の出力である相関係数のシミュレーショ
ン結果、（４）が除算器としてサーボ除算器を用いたと
きの相関器の出力である相関係数のシミュレーション結
果を示している。同図からもわかるように、（３）の場
合と（４）の場合とで得られる相関係数の値は、相関演
算の開始直後の百数十サンプルを除外すれば、ほぼ一致
する。なお、（４）の場合において、除算器の出力の初
期値は０としているが、相関係数の値が予めある程度予
測できているのであれば、その値を除算器の出力の初期
値とすることにより、相関演算の解が正確な値で安定す
るまでに要する時間を短縮することが可能である。When the servo divider shown in FIG. 4 is used for the correlator shown in FIG. 2, the output of the adder 14a of FIG. 2 is input to the input A of FIG. 4, and the output of the adder 14b of FIG. 4
To the input B. FIG. 5 is a diagram showing a difference in response characteristics of the correlator depending on whether or not the servo divider is used. The figure shows the result of calculating the correlation coefficient for two sine waves A and B with slightly shifted frequencies.
(1) the waveform of the sine wave A, (2) the waveform of the sine wave B,
The simulation result of the correlation coefficient which is the output of the correlator when (3) performs the normal division as the divider, and the (4) the output of the correlator when the servo divider is used as the divider. The simulation result of the correlation coefficient is shown. As can be seen from the figure, the values of the correlation coefficients obtained in the cases (3) and (4) are almost the same except for one hundred and several tens of samples immediately after the start of the correlation calculation. In the case of (4), the initial value of the output of the divider is set to 0, but if the value of the correlation coefficient can be predicted to some extent in advance, that value is set as the initial value of the output of the divider. This makes it possible to reduce the time required for the solution of the correlation operation to stabilize at an accurate value.

【００３３】次に図１に示した残差波形データ処理部に
ついて説明する。図６は残差波形データ処理部の構成を
示す図である。同図において、３１は絶対値算出（全波
整流）部、３２は帰還型フィルタ、３３は除算部、３４
及び３７はサンプルホールド部、３５は乗算部、３６は
再量子化部である。Next, the residual waveform data processing section shown in FIG. 1 will be described. FIG. 6 is a diagram showing a configuration of the residual waveform data processing unit. In the figure, 31 is an absolute value calculation (full wave rectification) section, 32 is a feedback filter, 33 is a division section, 34
And 37 are a sample hold unit, 35 is a multiplication unit, and 36 is a requantization unit.

【００３４】図１における音声分析部１から出力された
残差波形εは、図２の絶対値算出部３１において絶対値
が算出され、その後低域通過（ローパス）特性を有する
ように係数値が設定されている帰還型フィルタ３２を通
過させることによって、残差波形εのエンベロープ波形
の信号が得られる。The absolute value of the residual waveform ε output from the voice analysis unit 1 in FIG. 1 is calculated by the absolute value calculation unit 31 in FIG. 2, and then the coefficient value is changed so as to have a low-pass (low-pass) characteristic. By passing the signal through the set feedback filter 32, a signal having an envelope waveform of the residual waveform ε is obtained.

【００３５】除算部３３ではこのエンベロープ波形信号
を除数として被除数１．０を除算する。つまり、除算部
３３ではエンベロープ波形信号の逆数の波形信号を得
る。なお、残差波形εのエンベロープ波形信号の変化は
緩やかなので、除算部３３に前述したサーボ除算を利用
することも可能である。The divider 33 divides the dividend 1.0 by using the envelope waveform signal as a divisor. That is, the divider 33 obtains a waveform signal that is the reciprocal of the envelope waveform signal. Since the envelope waveform signal of the residual waveform ε changes slowly, the above-described servo division can be used in the division unit 33.

【００３６】除算部３３の出力信号はサンプルホールド
部３４により残差波形εのサンプリング間隔よりも長い
間隔で標本化された後に乗算部３５に送られ、もとの残
差波形εと乗算される。この結果、乗算器３５からは残
差波形εの正規化された波形データが出力される。この
正規化された波形データは再量子化器３６により低ビッ
ト（少レベル）での再量子化が行なわれる。再量子化器
３６の出力である正規化残差データは本装置の出力のひ
とつとなる。The output signal of the divider 33 is sampled by the sample and hold unit 34 at intervals longer than the sampling interval of the residual waveform ε, sent to the multiplier 35, and multiplied by the original residual waveform ε. . As a result, the normalized waveform data of the residual waveform ε is output from the multiplier 35. The normalized waveform data is requantized at a low bit (small level) by the requantizer 36. The normalized residual data output from the requantizer 36 is one of the outputs of the present apparatus.

【００３７】また、その一方で、帰還型フィルタ３２の
出力である残差波形εのエンベロープ波形信号はサンプ
ルホールド部３４のサンプリングと同期して動作するサ
ンプルホールド部３７により標本化され、正規化情報と
して本装置から出力される。On the other hand, the envelope waveform signal of the residual waveform ε, which is the output of the feedback filter 32, is sampled by the sample-and-hold unit 37 that operates in synchronization with the sampling of the sample-and-hold unit 34, and the normalized information Is output from the apparatus.

【００３８】以上のようにして本装置から出力された正
規化誤差データ及び正規化情報は、本装置からの分析出
力に基づいて元の音声を合成する音声合成装置４０に送
付される。音声合成装置４０では、正規化情報が一旦レ
ジスタ４１に格納された後に、乗算器４２によって正規
化残差データと乗算される。乗算器４２の出力は複合残
差波形ε' として、本装置の音声分析部１から出力され
るｋパラメータ（ＰＡＲＣＯＲ係数）と共に、音声合成
装置４０に備えられている不図示の音声合成部に送付さ
れ、音声合成処理が行なわれる。この音声合成部の構成
は図１０に示した音声合成装置をそのまま使用すること
ができる。本実施の形態においてはこのようにして音声
の分析・合成が行なわれる。The normalization error data and the normalization information output from the present apparatus as described above are sent to a speech synthesizer 40 that synthesizes the original speech based on the analysis output from the present apparatus. In the speech synthesizer 40, after the normalization information is once stored in the register 41, the normalized information is multiplied by the normalized residual data by the multiplier 42. The output of the multiplier 42 is transmitted as a composite residual waveform ε ′ to a voice synthesizer (not shown) provided in the voice synthesizer 40 together with the k parameter (PARCOR coefficient) output from the voice analyzer 1 of the present apparatus. Then, voice synthesis processing is performed. The voice synthesizing unit can use the voice synthesizing apparatus shown in FIG. 10 as it is. In the present embodiment, speech analysis / synthesis is performed in this manner.

【００３９】[0039]

【発明の効果】以上詳細に説明したように、本発明は、
音声波形データの線形予測分析を行なうときに、前記音
声波形データと、遅延させた前記音声波形データとの間
での相関演算のために行なわれる除算で、特性の等しい
帰還型フィルタから出力されるデータを該除算の除数デ
ータと被除数データとすることにより、演算量の少ない
帰還型フィルタを利用して音声波形データに対する窓掛
けの処理を代行させることができる。As described in detail above, the present invention provides
When performing linear prediction analysis on audio waveform data, the division is performed for a correlation operation between the audio waveform data and the delayed audio waveform data, and is output from a feedback filter having equal characteristics. By using the data as the divisor data and the dividend data of the division, it is possible to substitute the windowing processing for the audio waveform data using a feedback filter having a small amount of calculation.

【００４０】あるいは、本発明は、音声波形データの線
形予測分析を行なうときに、前記音声波形データと、遅
延させた前記音声波形データとの間の相関係数を得る演
算における除算で、該除算の除算結果を、除数データ及
び被除数データに基づいて過去の除算結果を補正するこ
とによって得るようにすることにより、少ない演算量で
除算結果を得ることができる。Alternatively, according to the present invention, when performing linear prediction analysis of audio waveform data, the division is performed in an operation for obtaining a correlation coefficient between the audio waveform data and the delayed audio waveform data. Is obtained by correcting the past division result based on the divisor data and the dividend data, the division result can be obtained with a small amount of calculation.

【００４１】以上のように、本発明のいずれの構成によ
っても、音声の分析処理における処理の負担を軽減する
ことできるという効果を奏する。As described above, according to any of the configurations of the present invention, there is an effect that the processing load in the voice analysis processing can be reduced.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明に係る音声分析装置の全体構成を示す図
である。FIG. 1 is a diagram showing an overall configuration of a speech analysis device according to the present invention.

【図２】図１に示した音声分析装置の音声分析部で用い
られる相関器の構成を示す図である。FIG. 2 is a diagram showing a configuration of a correlator used in a voice analysis unit of the voice analysis device shown in FIG.

【図３】図２に示されている帰還型フィルタのインパル
ス応答を示す図である。FIG. 3 is a diagram illustrating an impulse response of the feedback filter illustrated in FIG. 2;

【図４】図２に示した相関器で用いられる除算器の構成
を示す図である。FIG. 4 is a diagram showing a configuration of a divider used in the correlator shown in FIG.

【図５】サーボ除算器の使用・不使用による相関器の応
答特性の違いを示す図である。FIG. 5 is a diagram illustrating a difference in response characteristics of a correlator depending on whether a servo divider is used or not.

【図６】残差波形データ処理部の構成を示す図である。FIG. 6 is a diagram illustrating a configuration of a residual waveform data processing unit.

【図７】ＰＡＲＣＯＲ係数を算出する音声分析回路の一
例を示す図である。FIG. 7 is a diagram illustrating an example of a voice analysis circuit that calculates a PARCOR coefficient.

【図８】図７に示した回路により算出されたＰＡＲＣＯ
Ｒ係数から音声信号を合成する音声合成装置の一例を示
す図である。8 is a PARCO calculated by the circuit shown in FIG.
FIG. 2 is a diagram illustrating an example of a speech synthesis device that synthesizes a speech signal from R coefficients.

【図９】ハミング窓の時間波形を示す図である。FIG. 9 is a diagram showing a time waveform of a Hamming window.

【図１０】分析ブロック長とフレームとの関係とを示し
た図である。FIG. 10 is a diagram showing a relationship between an analysis block length and a frame.

【符号の説明】[Explanation of symbols]

１音声分析部２残差波形データ処理部１１ａ、１１ｂ、１２ａ、１２ｂ、１３ａ、１３ｂ乗
算器１４ａ、１４ｂ加算器１５ａ、１５ｂ遅延器１６除算器２１減算器２２加算器２３乗算器２４遅延器DESCRIPTION OF SYMBOLS 1 Voice analysis part 2 Residual waveform data processing part 11a, 11b, 12a, 12b, 13a, 13b Multiplier 14a, 14b Adder 15a, 15b Delayer 16 Divider 21 Subtractor 22 Adder 23 Multiplier 24 Delayer

Claims

【特許請求の範囲】[Claims]

【請求項１】音声波形データの線形予測分析を行なう
音声分析装置であって、前記音声波形データと、遅延させた前記音声波形データ
との間で相関演算を行なう相関器と、前記相関器に備えられ、前記相関演算において行なわれ
る除算を行なう除算器と、前記除算器の除数データ入力と被除数データ入力との各
々に設けられる、特性の等しい帰還型フィルタと、を有することを特徴とする音声分析装置。1. A speech analyzer for performing linear prediction analysis of speech waveform data, comprising: a correlator for performing a correlation operation between the speech waveform data and the delayed speech waveform data; Voice, comprising: a divider for performing a division performed in the correlation operation; and a feedback filter having equal characteristics provided for each of a divisor data input and a dividend data input of the divider. Analysis equipment.

【請求項２】音声波形データの線形予測分析を行なう
音声分析装置であって、前記音声波形データと、遅延させた前記音声波形データ
との間で相関演算を行なう相関器と、前記相関器に備えられ、除算を行なう除算器と、を有し、前記除算器は、除算結果を、該除算器に入力されている
除数データ及び被除数データに基づいて該除算器の過去
の除算結果を補正することによって得る、ことを特徴とする音声分析装置。2. A speech analyzer for performing linear prediction analysis of speech waveform data, comprising: a correlator for performing a correlation operation between the speech waveform data and the delayed speech waveform data; And a divider for performing division, wherein the divider corrects a previous division result of the divider based on divisor data and dividend data input to the divider. A speech analyzer obtained by:

【請求項３】前記除算器は、該除算器の除算結果を遅延させる遅延器と、前記遅延器の出力に該除算器に入力された除数データを
乗じる乗算器と、該除算器に入力された被除数データから前記乗算器の乗
算結果を減じる減算器と、前記遅延器の出力と前記減算器の減算結果とを加算する
加算器と、を有し、前記加算器の加算結果を該除算器の除算結果とする、ことを特徴とする請求項２に記載の音声分析装置。3. The divider, a delay unit for delaying a result of the division by the divider, a multiplier for multiplying an output of the delay unit by divisor data input to the divider, and an input to the divider. A subtractor for subtracting the multiplication result of the multiplier from the dividend data, and an adder for adding the output of the delay unit and the subtraction result of the subtractor. The speech analysis device according to claim 2, wherein the result of division is:

【請求項４】音声波形データの線形予測分析を行なう
音声分析方法であって、前記音声波形データと、遅延させた前記音声波形データ
との間での相関演算のために行なわれる除算で、特性の
等しい帰還型フィルタから出力されるデータを該除算の
除数データと被除数データとする、ことを特徴とする音声分析方法。4. A voice analysis method for performing a linear prediction analysis of voice waveform data, wherein a characteristic is obtained by a division performed for a correlation operation between the voice waveform data and the delayed voice waveform data. Wherein the data output from the feedback filter having the same values as the divisor data and the dividend data of the division are used.

【請求項５】音声波形データの線形予測分析を行なう
音声分析方法であって、前記音声波形データと、遅延させた前記音声波形データ
との間の相関係数を得る演算における除算で、該除算の
除算結果を、除数データ及び被除数データに基づいて過
去の除算結果を補正することによって得る、ことを特徴とする音声分析方法。5. A speech analysis method for performing linear prediction analysis of speech waveform data, wherein the division is performed in an operation for obtaining a correlation coefficient between the speech waveform data and the delayed speech waveform data. Wherein the division result is obtained by correcting the past division result based on the divisor data and the dividend data.