JP3087814B2

JP3087814B2 - Acoustic signal conversion encoding device and decoding device

Info

Publication number: JP3087814B2
Application number: JP07052389A
Authority: JP
Inventors: 直樹岩上; 健弘守谷; 聡三樹
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1994-03-17
Filing date: 1995-03-13
Publication date: 2000-09-11
Anticipated expiration: 2015-09-11
Also published as: JPH0844399A

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】この発明はオーディオ信号、特に
音楽信号や音声信号などの音響信号を、周波数領域係数
に変換し、その周波数領域係数をできるだけ少ない情報
量でディジタル符号化する符号化装置、及びその符号化
音響信号を復号化する復号化装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an encoding apparatus for converting an audio signal, particularly an audio signal such as a music signal or a voice signal, into a frequency domain coefficient and digitally encoding the frequency domain coefficient with a minimum amount of information. And a decoding device for decoding the encoded audio signal.

【０００２】[0002]

【従来の技術】現在、オーディオ信号を高能率に符号化
する方法として、原音をフレームと呼ばれる５〜５０ｍ
ｓ程度の一定間隔の区間に分割し、その１フレームの信
号に時間−周波数変換（フーリエ変換）を行って得た周
波数領域係数（周波数軸上のそれぞれの点におけるサン
プル値）を、その周波数特性の包絡形状（スペクトラム
概形）と、周波数領域係数をスペクトラム概形で平坦化
して得られる残差係数との２つの情報に分離し、それぞ
れを符号化することが提案されている。このような符号
化法として、適応スペクトラム聴感制御エントロピー符
号化法（ASPEC ：Adaptive Spectral Perceptual Entro
py Coding ）や、重み付きベクトル量子化による変換符
号化法（TCWVQ：Transform Coding with Weighted Vect
or Quantization)、エムペグ−オーディオ・レイヤ３方
式（MPEG-Audio Layer III）などがある。それぞれの技
術については、K.Brandenburg, J.Herre, J.D.Johnston
etal：“ASPEC：Adaptive spectral entropy coding o
f high quality music signals”, Proc. AES'91及び
T.Moriya, H.Suda：“An 8 Kbit/s transform coderfor
noisy channels,”Proc.ICASSP'89, pp196-199 及びIS
O/IEC標準IS-11172-3に述べられている。2. Description of the Related Art At present, as a method of encoding an audio signal with high efficiency, an original sound is called a frame by 5 to 50 m.
The frequency domain coefficients (sample values at each point on the frequency axis) obtained by performing time-frequency conversion (Fourier transform) on the signal of one frame by dividing the signal into one section at a constant interval of about s It has been proposed to separate the information into two pieces of information, namely, an envelope shape (spectral shape) and a residual coefficient obtained by flattening the frequency domain coefficient with the spectrum shape, and encode each of them. As such an encoding method, an adaptive spectral hearing control entropy encoding method (ASPEC: Adaptive Spectral Perceptual Entro)
py Coding) and transform coding with weighted vector quantization (TCWVQ: Transform Coding with Weighted Vect)
or Quantization), and mpeg-audio layer 3 system (MPEG-Audio Layer III). K. Brandenburg, J. Herre, JDJohnston
etal: “ASPEC: Adaptive spectral entropy coding o
f high quality music signals ”, Proc. AES'91 and
T. Moriya, H. Suda: “An 8 Kbit / s transform coderfor
noisy channels, ”Proc.ICASSP'89, pp196-199 and IS
It is described in O / IEC standard IS-11172-3.

【０００３】これらの符号化法では、高能率な符号化を
実現するためには、残差係数はできるだけ周波数特性が
平坦であること、つまり残差係数の振幅周波数特性の変
動が少ないほうが望ましい。このため、上述のASPEC や
MPEG-Audio Layer IIIでは、周波数領域係数をいくつか
の小帯域に分割し、小帯域内での信号の強さの代表値を
示すスケーリングファクタと呼ばれる値で割ることによ
って正規化する事によって周波数特性の平坦化をはか
る。即ち図１に示すように入力端子１１から入力された
ディジタル化された音響入力信号は時間−周波数変換部
（変形離散コサイン変換：ＭＤＣＴ）２により周波数領
域係数に変換され、この周波数領域係数は帯域分割部３
により複数の小帯域に分割され、これら小帯域信号はそ
れぞれ代表値計算・量子化部４₁〜４ｎでその平均値
又は最大値などの代表値がスケーリングファクタとして
計算され、かつそのスケーリングファクタは量子化され
て、全体として周波数領域係数の概形が得られる。前記
分割された各小帯域信号は正規化部５₁〜５ｎでそれ
ぞれ対応する小帯域の前記量子化されたスケーリングフ
ァクタで正規化され帯域残差係数とされ、これら正規化
によって得られた帯域残差係数は量子化部６で帯域合成
されスペクトラム残差を得る。その結果、時間−周波数
変換部２により得られた前記周波数領域係数はその周波
数特性の概形が取り除かれ、平坦化された残差係数とな
り、その残差係数は量子化される。この残差係数の量子
化を示すインデックスＩ_Rと、前記各代表値を量子化し
たインデックスとがそれぞれ復号器へ送出される。[0003] In these encoding methods, in order to realize highly efficient encoding, the frequency characteristics of the residual coefficients are as flat as possible , that is , the amplitude frequency characteristics of the residual coefficients are changed.
Less movement is desirable. For this reason, ASPEC and
In MPEG-Audio Layer III, the frequency domain coefficient is divided into several small bands, and the representative value of the signal strength within the small band is calculated.
The frequency characteristic is flattened by normalizing by dividing by a value called a scaling factor shown . That is, as shown in FIG. 1, the digitized sound input signal input from the input terminal 11 is converted into a frequency domain coefficient by a time-frequency conversion unit (modified discrete cosine transform: MDCT) 2, and the frequency domain coefficient is converted into a band. Division 3
Is divided into a plurality of subbands by these subbands each signal representative value calculation and quantization unit 4 ₁ to 4n In the representative value such as an average value or the maximum value is <br/> calculated as a scaling factor, and the scaling factor is quantized envelope of the frequency domain coefficients are obtained as a whole. Wherein each divided subband signals were normalized unit 5 ₁ through 5n Are normalized by the quantized scaling factors of the corresponding small bands to be band residual coefficients, and the band residual coefficients obtained by these normalizations are band-synthesized by the quantization unit 6 to obtain a spectrum residual. . As a result, the frequency domain coefficients obtained by the time-frequency conversion unit 2 are removed from the outline of their frequency characteristics, become flattened residual coefficients, and the residual coefficients are quantized. And index I _R indicating the quantization of the residual coefficients, wherein the index of each representative values quantized is sent to each decoder.

【０００４】この方法よりも高能率な周波数領域係数の
平坦化方法として、線形予測分析を用いる方法がある。
周知のように、線形予測係数は入力信号の周波数特性を
平坦化するように動作する線形予測フィルタ（逆フィル
タと呼ばれている）のインパルス応答を表している。こ
の方法では、図２に示すように端子１１に与えられたデ
ィジタル音響信号を線形予測分析・予測係数量子化部７
で線形予測し、得られた線形予測係数を線形予測分析フ
ィルタ、いわゆる逆フィルタ８にフィルタ係数として設
定し、この線形予測分析フィルタ８を端子１１からの入
力信号で駆動することによって周波数特性の平坦化され
た残差信号を得る。この残差信号を時間−周波数変換部
（離散コサイン変換：ＤＣＴ）２で周波数領域の信号、
即ち残差係数に変換し、残差量子化部６で量子化し、そ
の量子化を表すインデックスI_Rと線形予測係数を量子化
したインデックスI_pとを復号器へ送出する。この方法
は、TCWVQ で用いられている。なお線形予測分析につい
ては斉藤、中田“音声情報処理の基礎”第６章（オーム
社）に、ＤＣＴについてはK.R.Rao, P.Yip著安田、藤原
訳“画像符号化技術−ＤＣＴとその国際標準”第２章
（オーム社）に、およびＭＤＣＴについてはISO/IEC 標
準IS-11172-3に述べられている。 [0004] As a method of flattening frequency domain coefficients more efficient than this method, there is a method using linear prediction analysis.
As is well known, linear prediction coefficients represent the impulse response of a linear prediction filter (called an inverse filter) that operates to flatten the frequency characteristics of an input signal. In this method, as shown in FIG. 2, the digital acoustic signal supplied to the terminal 11 is
, And the obtained linear prediction coefficient is set as a filter coefficient in a linear prediction analysis filter, a so-called inverse filter 8. To obtain a normalized residual signal. This residual signal is converted to a signal in the frequency domain by a time-frequency transform unit (discrete cosine transform: DCT) 2,
I.e. transform to the residual coefficient, quantized by residual quantizer 6, and sends the index I _R and a linear prediction coefficient representing the quantizer to the decoder the index I _p quantized. This method is used in TCWVQ. Note that linear prediction analysis
Saito, Nakata "Basics of Speech Information Processing" Chapter 6 (Ohm
And KRRao, P.Yip on DCT for Yasuda and Fujiwara
"Image coding technology-DCT and its international standard" Chapter 2
(Ohm) and ISO / IEC standards for MDCT
It is described in quasi IS-11172-3.

【０００５】[0005]

【発明が解決しようとする課題】上述の何れの方法にお
いても、周波数特性の全体的な概形を正規化するにとど
まり、オーディオ信号に含まれるピッチ成分のような微
視的な周波数特性の凹凸を能率良く除去することはでき
ない。このことが障害となり、ピッチ成分の強い音楽信
号や音声信号などを符号化する際の性能の向上が困難で
あった。In any of the above-mentioned methods, only the general outline of the frequency characteristic is normalized, and the microscopic unevenness of the frequency characteristic such as a pitch component included in the audio signal is obtained. Cannot be removed efficiently. This has been an obstacle, and it has been difficult to improve the performance when encoding a music signal or audio signal having a strong pitch component.

【０００６】この発明の目的は、変換符号化において、
入力音響信号の周波数特性をその概形で正規化して得ら
れる残差係数にピッチ成分が含まれる場合でも少ない情
報量で能率良く符号化可能な符号化装置及びその符号化
音響信号を復号する装置を提供することにある。[0006] It is an object of this invention is, in transform coding,
An encoding apparatus capable of encoding efficiently with a small amount of information even when a pitch component is included in a residual coefficient obtained by normalizing the frequency characteristic of an input audio signal in its general form, and an apparatus for decoding the encoded audio signal Is to provide.

【０００７】[0007]

【課題を解決するための手段】この発明の符号化装置に
よれば、入力音響信号を周波数領域に変換して符号化す
る装置において、入力音響信号についてフレーム単位
で、その周波数特性の概形が平坦化された残差係数を得
る第１の手段と、その手段で得られた現フレームの残差
係数の概形を現在、または過去のフレームの残差係数か
ら予測し、予測残差概形を生成する第２の手段と、第１
の手段で得られた現フレームの上記残差係数を、上記予
測残差概形で正規化して微細構造係数を得る第３の手段
と、その微細構造係数を量子化し、その量子化微細構造
を表すインデックスを上記音響信号の符号化出力の少な
くとも１部として出力する手段とを有している。Means for Solving the Problems] According to the coding apparatus of the present invention, the apparatus for encoding and converts the frequency domain input audio signal, in units of frames for the input acoustic signal, the envelope of the frequency characteristics A first means for obtaining a flattened residual coefficient, and a rough form of a residual coefficient of a current frame obtained by the means is predicted from a residual coefficient of a current or past frame, and a predicted residual rough form is obtained. Second means for generating
Said residual coefficients of the current frame obtained by the means, the pre <br/> normalized third means <br/> to obtain a fine structure coefficients by Hakazansa envelope, micro fine structure of its Means for quantizing the coefficient and outputting an index representing the quantized fine structure as at least a part of the encoded output of the audio signal.

【０００８】第１の手段で残差係数を得るには、入力音
響信号を周波数領域係数に変換した後にその周波数特性
の概形を取り除くか、入力音響信号を時間領域でその周
波数特性の概形を平均化した後に、周波数領域係数に変
換してもよい。第２の手段で予測残差概形を生成するに
は、量子化微細構造係数を逆正規化して再生残差係数を
生成し、その再生残差係数からそのスペクトラム概形を
求め、そのスペクトラム概形から次フレームの残差係数
に対する予測残差概形を合成する。In order to obtain the residual coefficient by the first means , the outline of the frequency characteristic is removed after converting the input audio signal into the frequency domain coefficient, or the outline of the frequency characteristic is obtained in the time domain. May be converted to frequency domain coefficients after averaging. In order to generate a prediction residual outline by the second means , a reproduction residual coefficient is generated by denormalizing the quantized fine structure coefficient, a spectrum outline is obtained from the reproduction residual coefficient, and the spectrum outline is obtained. From the shape, a predicted residual approximate shape for the residual coefficient of the next frame is synthesized.

【０００９】上記符号化装置の第２の手段において、予
測残差概形が現フレームにおける残差係数のスペクトラ
ム概形に最も近くなるよう、そのスペクトラム概形を量
子化し、その量子化を表すインデックスを符号化出力の
他の一部として出力してもよい。その場合、現フレーム
の残差係数のスペクトラム概形と、過去のフレームの量
子化スペクトラム概形とを予め決めた予測係数を使って
線形合成し、その線形合成値が上記スペクトラム概形と
最も近くなるように上記量子化スペクトラム概形を決定
し、その時の上記線形合成値を上記予測残差概形として
もよい。あるいは、現フレームの量子化スペクトラム概
形と、過去のフレームの上記予測残差概形とを線形合成
し、その線形合成値が上記スペクトラム概形と最も近く
なるように上記量子化スペクトラム概形を決定し、その
時の上記線形合成値を上記予測残差概形として得てもよ
い。In the second means of the encoding apparatus , the spectrum outline is quantized so that the prediction residual outline becomes closest to the spectrum outline of the residual coefficient in the current frame, and an index representing the quantization is obtained. May be output as another part of the encoded output. In that case, the spectrum outline of the residual coefficient of the current frame and the quantization spectrum outline of the past frame are linearly synthesized using a predetermined prediction coefficient, and the linear synthesized value is closest to the spectrum outline. The quantized spectrum outline may be determined so that the predicted combined residual value may be used as the predicted residual outline. Alternatively, the quantized spectrum outline of the current frame is linearly synthesized with the prediction residual outline of the past frame, and the quantized spectrum outline is approximated so that the linear synthesized value is closest to the spectrum outline. It may be determined, and the linear combined value at that time may be obtained as the predicted residual outline.

【００１０】上記符号化装置において、重ね合わせ直交
変換を使って入力音響信号を周波数領域係数に変換して
もよい。その場合は、入力音響信号を線形予測分析して
得た線形予測係数のスペクトラム振幅を上記周波数領域
係数の概形として求め、それを使って周波数領域係数を
正規化するのが好ましい。この発明の復号化装置によれ
ば、周波数領域に変換されて符号化された音響信号を復
号する方法において、入力された第１の量子化インデッ
クスから再生した微細構造係数を、過去のフレームの情
報から合成した残差概形で逆正規化して現フレームの再
生残差係数を得る第１の手段と、その再生残差係数から
元の音響信号の周波数特性の概形が与えられた音響信号
を再生する第２の手段とを有する。In the above coding apparatus , the input audio signal may be converted into a frequency domain coefficient using a superposition orthogonal transform. In this case, it is preferable that the spectrum amplitude of the linear prediction coefficient obtained by performing the linear prediction analysis on the input acoustic signal is obtained as an outline of the frequency domain coefficient, and the frequency domain coefficient is normalized using the spectrum amplitude. According to the decoding apparatus of the present invention, in a method of decoding an audio signal which has been converted into a frequency domain and coded, the fine structure coefficient reproduced from the input first quantization index is converted into information of a past frame. A first means for denormalizing the residual approximate shape synthesized from the above to obtain a reproduction residual coefficient of the current frame, and an audio signal to which an approximate form of the frequency characteristic of the original audio signal is given from the reproduction residual coefficient. Second means for reproducing.

【００１１】第１の手段は、上記再生残差係数から次フ
レームに対する残差係数の概形を合成する第３の手段を
含む。第３の手段は更に、上記再生残差係数のスペクト
ラム概形を計算する第４の手段と、現フレームに対し、
予め決めた１または連続する複数の過去のフレームの上
記スペクトラム概形にそれぞれ予測係数を乗残して線形
合成により現フレームの残差係数の概形を得る第５の手
段を含む。The first means includes third means for synthesizing an approximate form of the residual coefficient for the next frame from the reproduction residual coefficient. The third means further comprises: fourth means for calculating a spectrum outline of the reproduction residual coefficient;
Fifth hand obtaining envelope of residual coefficients of the current frame by linear combination leaving multiply each prediction coefficient to the spectrum envelope of a plurality of past frames in advance of 1 or successive decided
Including steps .

【００１２】第２の手段において周波数特性の概形が与
えられた音響信号を再生するには、周波数領域の再生残
差係数に与える場合と、再生残差係数を時間領域に変換
して得た残差信号に与える場合とがある。上記復号化装
置において、上記残差概形は、符号側から送られたイン
デックスを逆量子化して得た現フレームと過去のフレー
ムの量子化スペクトラム概形を線形合成して得てもよ
い。In order to reproduce an audio signal having an approximate frequency characteristic in the second means , it is necessary to provide a reproduction residual coefficient in a frequency domain and to convert the reproduction residual coefficient into a time domain. It may be given to the residual signal. The decoding KaSo
In this case, the residual outline may be obtained by linearly synthesizing a quantized spectrum outline of a current frame and a past frame obtained by dequantizing an index sent from the code side.

【００１３】あるいは、上記残差概形は、過去のフレー
ムにおける残差概形と、符号側から送られたインデック
スを逆量子化して得た現フレームの量子化スペクトラム
概形を線形合成して得てもよい。[0013] Alternatively, the residual outline is obtained by linearly synthesizing the residual outline in the past frame and the quantized spectrum outline of the current frame obtained by dequantizing the index sent from the code side. You may.

【００１４】[0014]

【作用】一般に、周波数領域係数をスペクトラム概形で
正規化して得られる残差係数は、ピッチ成分を含んでお
り、全体のパワーに対してエネルギーの大きいスパイク
となって現れる。ピッチ成分は長時間持続することから
特異的にパワーが大きくなる現象は複数フレームにわた
って同じ周波数領域で持続するので、残差係数のパワー
はフレーム間で相関が高い。この発明では、前フレーム
の残差係数の振幅またはその概形と現フレームの相関を
用いて残差係数の冗長度を取り除くため、つまり特異的
にパワーが大きくなることによる影響を除去し、残差係
数よりも平坦化された微細構造係数を量子化するため高
い量子化能率が得られる。また、入力信号のパワーが特
異的に大きくなる周波数領域が複数存在していても、各
々の周波数領域は異なるので、単数の場合と同様に扱え
る。Generally, a residual coefficient obtained by normalizing a frequency domain coefficient by a spectrum outline includes a pitch component, and appears as a spike having a large energy with respect to the entire power. Because the pitch component lasts for a long time
Specifically Runode be sustained phenomenon power is increased at the same frequency domain over a plurality of frames, the power of the residual coefficients that are strongly correlated between frames. In the present invention, in order to remove the redundancy of the residual coefficient using the amplitude of the residual coefficient of the previous frame or its approximate shape and the correlation of the current frame ,
In this case, the effect of increasing the power is removed , and the fine structure coefficient that is flatter than the residual coefficient is quantized, so that a high quantization efficiency can be obtained. Also, the power of the input signal is
Even if there are multiple frequency regions that become abnormally large,
Since each frequency domain is different, it can be treated like a singular case
You .

【００１５】[0015]

【実施例】図３にこの発明を適用した符号器１０とこの
発明を適用した復号器５０とを示し、図中のＡ，Ｂ，
…，Ｅで示す出力の波形例を図５Ａ〜５Ｅに示す。この
発明においても、まず、入力音響信号の符号化に必要な
ビット数を低減するため、入力信号に対し、まず平坦化
されたスペクトラム概形を有する残差係数を求めるが、
その方法としては例えば次の２つがある。EXAMPLES an encoder 10 that applies this inventions 3 this
A decoder 50 to which the invention is applied is shown, and A, B,
5A to 5E show examples of output waveforms indicated by..., E. Also in the present invention, first, in order to reduce the number of bits required for encoding the input audio signal, for the input signal, first find a residual coefficient having a flattened spectrum outline,
For example, there are the following two methods.

【００１６】(a) 入力信号を周波数領域係数に変換する
と共に、入力信号のスペクトラム概形を求め、前記周波
数領域係数をそのスペクトラム概形で正規化して残差係
数を得る。 (b) 線形予測係数で制御される逆フィルタにより時間領
域で入力信号を処理して残差信号を求め、その残差信号
を周波数領域係数に変換し、残差係数を得る。(A) The input signal is converted into a frequency domain coefficient, a spectrum outline of the input signal is obtained, and the frequency domain coefficient is normalized by the spectrum outline to obtain a residual coefficient. (b) An input signal is processed in a time domain by an inverse filter controlled by a linear prediction coefficient to obtain a residual signal, and the residual signal is converted into a frequency domain coefficient to obtain a residual coefficient.

【００１７】上記方法(a) において、入力信号のスペク
トラム概形を得る方法として次の３つの方法が考えられ
る。 (c) 上述の事実を応用して、入力信号の線形予測係数を
フーリエ変換して求める。 (d) 図１で説明したように、入力信号を変換して得た周
波数領域係数を複数の帯域に分割し、それぞれの帯域の
スケーリングファクタをスペクトラム概形として得る。In the above method (a), the following three methods are conceivable as methods for obtaining a spectrum outline of an input signal. (c) Applying the above fact, the linear prediction coefficient of the input signal is obtained by Fourier transform. (d) As described in FIG. 1, the frequency domain coefficient obtained by converting the input signal is divided into a plurality of bands, and the scaling factor of each band is obtained as a spectrum outline.

【００１８】(e) 入力信号を変換して得た周波数領域係
数の絶対値を逆変換して得た時間領域信号の線形予測係
数を求め、その線形予測係数をフーリエ変換して求め
る。方法(c) と(e) は次の事実に基づいている。前述の
ように、線形予測係数は、入力信号の周波数特性を平坦
化するように動作する逆フィルタのインパルス応答（周
波数特性）を表しており、従って線形予測係数のスペク
トラム概形は、入力信号のスペクトラム概形に対応して
いる。詳しく言えば、線形予測係数をフーリエ変換して
得られるスペクトラム振幅は、入力信号のスペクトラム
概形の逆数となっている。(E) A linear prediction coefficient of the time domain signal obtained by inversely transforming the absolute value of the frequency domain coefficient obtained by converting the input signal is obtained, and the linear prediction coefficient is obtained by Fourier transform. Methods (c) and (e) are based on the following facts. As described above, the linear prediction coefficients represent the impulse response (frequency characteristics) of an inverse filter that operates to flatten the frequency characteristics of the input signal, and thus the spectral profile of the linear prediction coefficients is It corresponds to the spectrum outline. More specifically, the spectrum amplitude obtained by Fourier-transforming the linear prediction coefficient is the reciprocal of the spectrum shape of the input signal.

【００１９】この発明は上記方法(a) と(c),(d),(e) の
どの組み合わせを使ってもよいし、方法(b) のみを使っ
てもよい。図３の実施例は上記方法(a) と(c) の組み
合わせを使った場合である。符号器１０には入力端子１
１からディジタル信号とされた音響信号が入力され、フ
レーム分割部１４でそのＮ入力サンプルごとに過去２×
Ｎサンプルの入力系列を抽出し、この２×Ｎ個のサンプ
ルの系列を重ね合わせ直交変換（ＬＯＴ：Lapped Ortho
gonal Transform）処理用フレームとする。このＬＯＴ
処理用フレームは時間窓掛部１５で時間窓が掛けられ
る。ＬＯＴについては例えばH.S.Malvar,“Signal Proc
essing with Lapped Transform, ”ArtechHouseに述べ
られている。例えば０から始まってｎ番目の窓関数の値
W(n)は式(１)で表わされるものを用いるのが一般的であ
り、この実施例でもこれを用いる。In the present invention, any combination of the above methods (a) and (c), (d) and (e) may be used, or only the method (b) may be used. The embodiment shown in FIG. 3 is a case where a combination of the above methods (a) and (c) is used. The encoder 10 has an input terminal 1
1 is input as a digital signal, and the frame division unit 14 outputs the past 2 ×
An input sequence of N samples is extracted, and a sequence of 2 × N samples is superimposed and orthogonally transformed (LOT: Lapped Orthogonal Transform).
gonal Transform) Processing frame. This LOT
The processing frame is time-windowed by the time windowing unit 15. For example, HSMalvar, “Signal Proc
essing with Lapped Transform, "described in ArtechHouse. For example, the value of the nth window function starting from 0
Generally, W (n) is represented by the formula (1), and this is also used in this embodiment.

【００２０】 W(n)＝ sin｛π(n+0.5)／(2N)｝ (1) この時間窓が掛けられた信号は一方で重ね合わせ直交変
換の一種であるＮ次のＭＤＣＴ(Modified Discrete Cos
ine Transform:変形離散コサイン変換）部１６で変形離
散コサイン変換されて周波数領域係数（周波数軸上のそ
れぞれの点におけるサンプル値）に変換され、例えば図
４Ａに示すようなスペクトラム振幅が得られる。時間窓
掛け部１５の出力は他方で線形予測分析部１７で線形予
測分析され、Ｐ次の予測係数α₀,…,α_Pが求められる。
この予測係数α₀,…,α_Pを量子化部１８で例えばＬＳＰ
パラメータ、あるいはｋパラメータに変換してから量子
化してスペクトラム概形を示すインデックスI_Pを得る。W (n) = sin ｛π (n + 0.5) / (2N)｝ (1) On the other hand, an N-order MDCT (Modified Discrete Cos
An ine Transform (Modified Discrete Cosine Transform) unit 16 performs a Modified Discrete Cosine Transform and converts it into a frequency domain coefficient (sample value at each point on the frequency axis) to obtain, for example, a spectrum amplitude as shown in FIG. 4A. On the other hand, the output of the time windowing unit 15 is subjected to linear prediction analysis by the linear prediction analysis unit 17, and P-order prediction coefficients α ₀ ,..., Α _P are obtained.
The prediction coefficients α _0, ..., a quantization unit 18 for alpha _P eg LSP
Parameter, or obtain an index I _P indicating the spectrum envelope are quantized after being converted into k parameters.

【００２１】前記予測係数α₀,…,α_Pのスペクトラム概
形がスペクトラム概形計算部２１で求められる。図４Ｂ
は得られたスペクトラム概形の例を示す。予測係数のス
ペクトラム概形を得るには、例えば図５Ａに示すよう
に、Ｐ＋１個の量子化予測係数（αパラメータ）の後
に、(4×N-P-1)個の０をつなげて得た長さ４×Ｎのサン
プル系列を離散フーリエ変換（例えば高速フーリエ変換
ＦＦＴ）し、更にその２×Ｎ次のパワースペクトラムを
計算し、このスペクトラムの各奇数次をそれぞれ取り出
し、それらについてそれぞれ平方根をとる。このように
して得られたＮ点のスペクトラム振幅は、スペクトラム
概形の逆数を表している。あるいは図５Ｂに示すように
Ｐ＋１個の量子化予測係数（αパラメータ）の後に(2×
N-P-1)個の０をつなげた長さ２×Ｎのサンプル系列をＦ
ＦＴ分析し、その結果についてＮ次のパワースペクトラ
ムを計算する。０番目から始まってｉ番目のスペクトラ
ム概形の逆数は、ｉ＝Ｎ−１以外ではｉ＋１番目とｉ番
目の各パワースペクトラムの平方根を平均して、つまり
補間して得る。Ｎ−１番目のスペクトラム概形の逆数
は、Ｎ−１番目のパワースペクトラムの平方根をとって
得る。正規化部２２はこのようにして得られたスペクト
ラム概形で、ＭＤＣＴ部１６からのスペクトラム振幅を
各対応サンプルごとに割って正規化し、図４Ｃに示すよ
うな現フレームＦの残差係数R(F)を得る。ただし、前述
のように量子化予測係数αをフーリエ変換して直接得ら
れるものはスペクトラム概形の逆数であり、従って実際
には正規化部２２はＭＤＣＴ部１６の出力とスペクトラ
ム概形計算部２１の出力（スペクトラム概形の逆数）を
単に乗算すればよい。しかしながら、以降の説明におい
ても便宜上、スペクトラム概形計算部２１はスペクトラ
ム概形を出力するものとする。The spectrum outline of the prediction coefficients α ₀ ,..., Α _P is obtained by a spectrum outline calculator 21. FIG. 4B
Shows an example of the obtained spectrum outline. To obtain a spectrum outline of the prediction coefficient, for example, as shown in FIG. 5A, a length obtained by connecting (4 × NP-1) 0s after P + 1 quantization prediction coefficients (α parameter) A 4 × N sample sequence is subjected to a discrete Fourier transform (for example, fast Fourier transform FFT), and a 2 × N-order power spectrum is calculated. The spectrum amplitude at N points obtained in this way represents the reciprocal of the spectrum outline. Alternatively, as shown in FIG. 5B, after (P + 1) quantized prediction coefficients (α parameters), (2 ×
NP-1) A sequence of 2 × N samples with 0s connected to F
FT analysis is performed, and an Nth-order power spectrum is calculated for the result. The reciprocal of the i-th spectrum outline starting from the 0-th is obtained by averaging the square roots of the (i + 1) -th and i-th power spectra except for i = N−1, that is, by interpolating. The reciprocal of the (N-1) th spectrum outline is obtained by taking the square root of the (N-1) th power spectrum. The normalizing unit 22 normalizes the spectrum amplitude from the MDCT unit 16 by dividing the spectrum amplitude from the corresponding sample by each corresponding sample, and obtains the residual coefficient R () of the current frame F as shown in FIG. Get F). However, as described above, what is directly obtained by Fourier-transforming the quantized prediction coefficient α is the reciprocal of the spectrum outline, and therefore, the normalization unit 22 actually outputs the output of the MDCT unit 16 and the spectrum outline calculation unit 21 (The reciprocal of the spectrum outline) may simply be multiplied. However, also in the following description, for convenience, the spectrum outline calculation unit 21 outputs the spectrum outline.

【００２２】従来においては上述とは異なる方法で得た
残差係数を量子化してそのインデックスを送出するが、
一般に音響信号（特に音声信号、音楽信号）の残差係数
は図４Ｃに示したようにピッチ成分などの比較的大きな
変動を含んでいる。そこでこの発明では、過去又は現在
のフレームの残差係数から予測した現フレームの残差係
数R(F)の概形E_R(F) で、現フレームの残差係数R(F)を正
規化して微細構造係数を得、その微細構造係数を量子化
する。この実施例では正規化により得られた微細構造係
数をそのレベルが大きい成分程、重要視するように重み
付き量子化する場合であり、スペクトラム概形計算部２
１で得られたスペクトラム概形と残差概形計算部２３で
得られた残差係数概形E_R(F) とを重み計算部２４で各対
応サンプルごとに掛け合わせて重み付け係数w₁,…,w
_N（ベクトルＷで表す）を得て量子化部２５へ供給す
る。この重み付け係数に聴感制御を施すこともできる。
この実施例では、重み付け係数に０.6前後の定数をべき
乗する。この他の聴感制御方法として各サンプルごとに
必要な、エムペグ−オーディオ方式で用いられている聴
覚モデルによって求めた各サンプルごとに必要なＳＮＲ
（信号対雑音比）を非対数化して重み付け係数と掛け合
わせる方法などもある。この方法では、入力信号を分析
して得られた周波数特性から、各周波数サンプルごとに
聴感的にノイズが検知できる最小のＳＮＲを、聴覚モデ
ルによってマスキング量を推定することによって計算す
る。このＳＮＲが各サンプルごとに必要なＳＮＲであ
る。 Conventionally, a residual coefficient obtained by a method different from the above is quantized and its index is transmitted.
Generally, the residual coefficient of an audio signal (especially a voice signal or a music signal) includes a relatively large fluctuation such as a pitch component as shown in FIG. 4C. Therefore, in the present invention, the residual coefficient R (F) of the current frame is normalized by an approximate form E _R (F) of the residual coefficient R (F) of the current frame predicted from the residual coefficient of the past or current frame. To obtain a fine structure coefficient and quantize the fine structure coefficient. In this embodiment, the fine structure coefficient obtained by normalization is weighted and quantized so that the component having a higher level is regarded as more important.
1 is multiplied by the residual coefficient approximate form E _R (F) obtained by the residual approximate form calculating section 23 for each corresponding sample by the weight calculating section 24 to obtain a weighting coefficient w ₁ , …, W
_N (represented by the vector W) is obtained and supplied to the quantization unit 25. Auditory control can be applied to this weighting coefficient.
In this embodiment, the weighting coefficient is raised to a power of about 0.6. Another way to control hearing is for each sample
The required SNR for each sample determined by the required auditory model used in the mpeg-audio system
(Signal-to-noise ratio) may be made non-logarithmic and multiplied by a weighting coefficient. In this method, the minimum SNR at which noise can be detected audibly for each frequency sample is calculated from the frequency characteristics obtained by analyzing the input signal by estimating the masking amount using an auditory model. This SNR is the SNR required for each sample .

【００２３】正規化部２２からの現フレームＦの残差係
数R(F)を残差係数概形計算部２３からの残差係数概形で
割って正規化して微細構造係数を得ることが残差概形正
規化部２６で行われる。その現フレームの微細構造係数
はパワー正規化部２７でその振幅の平均値またはパワー
の平均値の平方根である正規化ゲインg(F)で割算すこと
により正規化され、正規化微細構造係数X(F)＝(x₁,…,x
_N)として量子化部２５へ供給される。このパワー正規化
の際の正規化ゲインg(F)は逆正規化部３１に与えられる
と共に量子化され、その量子化ゲインを示すインデック
スI_Gが出力される。The residual coefficient R (F) of the current frame F from the normalizing section 22 is divided by the residual coefficient approximate form from the residual coefficient approximate calculation section 23 to normalize to obtain a fine structure coefficient. This is performed by the difference outline normalizing unit 26. The fine structure coefficient of the current frame is normalized by dividing it by a normalization gain g (F), which is the average value of the amplitude or the square root of the average value of the power, in the power normalization unit 27. X (F) = (x ₁ , ..., x
_N ) is supplied to the quantization unit 25. The power normalization normalization gain g when the (F) are quantized together fed to the anti-normalization part 31, the index I _G indicating the quantized gains are output.

【００２４】量子化部２５では正規化微細構造係数X(F)
に対し重み付け係数Ｗにより重み付けを行った後、ベク
トル量子化する。この例ではインターリーブ型重み付き
ベクトル量子化する場合で、まず、Ｎ個のサンプルから
成る正規化微細構造係数x_j(j=1,…,N)と重み付け係数w_j
(j=1,…,N)の系列を、それぞれ同じように次のようにＮ
／Ｍ個ずつのＭ個の小系列にインターリーブ法で再配列
する。即ちそれぞれの係数のｋ番目の小系列のｉ番目の
サンプル値x^k _i，w^k _iと、もとの系列のｊサンプル目の
値x_j，w_jとの関係を次式(２)に示す。The quantizer 25 normalizes the fine structure coefficient X (F)
Is weighted by a weighting coefficient W, and then vector-quantized. In this example, interleaved weighted vector quantization is performed. First, a normalized fine structure coefficient x _j (j = 1,..., N) consisting of N samples and a weighting coefficient w _j
(j = 1,..., N)
Rearrange into M subsequences of / M by the interleave method . That is, the relationship between the i-th sample value x ^k _i , w ^k _i of the k-th small sequence of each coefficient and the j-th sample value x _j , w _j of the original sequence is expressed by the following equation (2). Show.

【００２５】 x^k _i＝x_iM+k， w^k _i＝w_iM+k， (２) 即ち、ｊ＝iM＋ｋの関係がある。ただし、ｋ＝0,1,…,M
-1であり，ｉ＝0,1,…,(N/M)-1である。Ｎ＝１６、Ｍ＝
４の場合の式(２)のインターリーブ法で正規化微細構造
係数x_j(j=1,…,N)を再配列する場合の再配列前と後の関
係を図６に示す。重み付け係数w_jの場合も同様である。
このようにして得られた微細構造係数と重み付け係数の
Ｍ個の小系列対の各々について重み付きベクトル量子化
する。インターリーブ後のｋ番目の小系列微細構造係数
のサンプル値をx^k _i，重み付け係数の値をw^k _i，符号帳
のインデックスｍのベクトルC(m)のｉ番目の要素の値を
c_i(m) としたときのベクトル量子化の際の重み付き距離
尺度d^k(m) は次式 d^k(m)＝Σ〔w^k _i｛x^k _i−c_i(m)｝〕² (３) で規定され、Σはｉ＝０から(N/M)-1 までの加算オペレ
ータである。この距離尺度d^k(m) が最小になる符号ベク
トルC(m^k) を探索する事をｋ＝1,…,Mに付いて行い、そ
れぞれの符号ベクトルのインデックスm¹,…,m^Mから量子
化インデックスImを得る。[0025] ^{_{_{x k i = x iM + k}}} , w k i = w iM + k, (2) That is, a relationship of j = iM + k. Where k = 0,1, ..., M
-1 and i = 0, 1,..., (N / M) -1. N = 16, M =
FIG. 6 shows the relationship before and after rearrangement when rearranging the normalized fine structure coefficients x _j (j = 1,..., N) by the interleaving method of equation (2) in the case of No. 4. In the case of the weighting coefficient w _j is the same.
Weighted vector quantization is performed on each of the M small sequence pairs of the fine structure coefficient and the weighting coefficient thus obtained. The sample value of the k-th small-sequence fine structure coefficient after interleaving is x ^k _i , the value of the weighting coefficient is w ^k _i , and the value of the i-th element of the codebook index m vector C (m) is
The weighted distance scale d ^k (m) at the time of vector quantization when c _i (m) is given by d ^k (m) = Σ [w ^k _i ｛x ^k _i −c _i (m)｝] ² Defined in (3), Σ is an addition operator from i = 0 to (N / M) -1. This distance measure d ^k (m) is minimized code vector C (m ^{k) k} = 1 that searches for a, ..., performed with the M, the index m ¹ of each code vector, ..., from m ^M Obtain the quantization index Im.

【００２６】図７は上述のインターリーブ型重み付きベ
クトル量子化を行う量子化部２５の構成を示す。この図
を参照してｋ番目の小系列x^k _iに付いての量子化手順を
説明する。入力微細構造係数x_jと重み付け係数w_j（j=1,
…,N）はインターリーブ部２５Ａで式(２)のように再配
列され、ｋ番目の小系列x^k _i，w^k _iがそれぞれ減算部２
５Ｂ及び２乗器２５Ｅに与えられる。符号帳２５Ｃから
選択したベクトルC(m)の要素系列c_i(m) と微細構造係数
小系列x^k _iとの差が減算部２５Ｂで求められ、その差が
２乗器２５Ｄで２乗される。一方、重み付け係数小系列
w^k _iは２乗器２５Ｅで２乗され、その出力と２乗器２５
Ｄの出力との内積が内積計算部２５Ｆで求められ、その
内積値d^k _iが最小となるベクトルC(m^k) を符号帳２５Ｃ
から探索することが最適符号帳探索部２５Ｇで行われ、
そのd^k _iが最小となるベクトルC(m^k) を示すインデック
スm^kが出力される。FIG. 7 shows a configuration of the quantization unit 25 for performing the above-described interleaved weighted vector quantization. The quantization steps of with the k-th subsequences x ^k _i with reference to the figures described. Input fine structure coefficient x _j and weighting coefficient w _j (j = 1,
.., N) are rearranged in the interleaving section 25A as in equation (2), and the k-th small series x ^k _i , w ^k _i are respectively
5B and the squarer 25E. The difference between the element sequence c _i (m) of the vector C (m) selected from the codebook 25C and the fine structure coefficient small sequence x ^k _i is obtained by the subtraction unit 25B, and the difference is squared by the squarer 25D. You. On the other hand, the weighting coefficient small series
w ^k _i is squared by the squarer 25E, and its output and the squarer 25
The inner product with the output of D is obtained by the inner product calculator 25F, and the vector C (m ^k ) that minimizes the inner product value d ^k _i is ^obtained from the codebook 25C.
From the optimal codebook search unit 25G,
An index m ^k indicating the vector C (m ^k ) in which the d ^k _i is minimum is output.

【００２７】このようにして、量子化部２５で量子化に
より得られたＭ個のベクトルC(m¹),C(m²),…,C(m^M)を構
成する要素系列である量子化小系列C(m)は、逆正規化部
３１で式(２)に従って元の配列に並べかえられて量子化
正規化微細構造係数とされ、これに対しパワー正規化部
２７で得た正規化ゲインg(F)で逆正規化し、更に残差概
形計算部２３からの残差係数概形が乗算されて量子化残
差係数R_q(F) が再生される。その量子化残差係数の概形
が残差概形計算部２３により求められる。In this manner, the quantization is a sequence of elements constituting the ^M vectors C (m ¹ ), C (m ² ),..., C (m ^M ) obtained by the quantization in the quantization section 25. The normalized small series C (m) is rearranged into the original array according to the equation (2) by the inverse normalization unit 31 to obtain a quantized normalized fine structure coefficient, and the normalized The inverse normalization is performed with the gain g (F), and the result is further multiplied by the residual coefficient approximate form from the residual approximate form calculation unit 23 to reproduce the quantized residual coefficient R _q (F). The approximate shape of the quantized residual coefficient is obtained by the residual approximate shape calculation unit 23.

【００２８】残差概形計算部２３の具体例を図８を参照
して説明する。この例では、残差概形正規化部２６へ入
力された現フレームＦの残差信号R(F)に対し、残差概形
計算部２３が前フレームＦ−１の残差係数R(F-1)を使っ
て決定した予測係数β₁(F-1)〜β₄(F-1)を使って合成し
た残差概形E_R(F) で正規化する場合である。残差概形計
算部２３の線形合成部３７は、この例では４つの縦続接
続された１フレーム遅延段３５₁〜３５₄と、それぞれの
遅延段の出力Ｅ₁〜Ｅ₄に予測係数β₁〜β₄を乗算する乗
算器３６₁〜３６₄と、全ての乗算結果の対応するサンプ
ルをそれぞれ加算し、加算結果を合成残差概形係数E_R"
(F)（Ｎサンプル）として出力する加算器３４とから構
成されている。現フレームＦにおいて、遅延段３５₁ 〜
３５₄ はそれらの出力E₁(F)〜E₄(F)として過去のフレー
ム(F-1)〜(F-4)に測定した残差スペクトラム概形E(F-1)
〜E(F-4)を出力しており、予測係数β₁〜β₄としては、
前フレーム(F-1) で決定された値β₁(F-1)〜β₄(F-1)が
設定されている。従って、現フレームの加算器３４の出
力E_R"(F)は次式 E_R"(F)＝β₁(F-1)E(F-1)＋β₂(F-1)E(F-2)＋…＋β₄(F-1)E(F-4) で表される。A specific example of the residual shape calculator 23 will be described with reference to FIG. In this example, for the residual signal R (F) of the current frame F input to the residual approximate normalization unit 26, the residual approximate calculation unit 23 outputs the residual coefficient R (F (F) of the previous frame F-1. This is a case where the residual is approximated by the residual approximate form E _R (F) synthesized using the prediction coefficients β ₁ (F-1) to β ₄ (F-1) determined using -1). In this example, the linear synthesizing unit 37 of the residual approximate shape calculating unit 23 includes four cascade-connected one-frame delay stages 35 _{1 to} 35 _4, and outputs E _{1 to} E ₄ of the respective delay stages to prediction coefficients β _1. a multiplier 36 ₁ to 36 ₄ for multiplying ～Beta _4, all of the multiplication results of the corresponding samples were respectively added, the addition result to synthesis residual envelope coefficients E _R "
(F) (N samples). In the current frame F, the delay stages 35 ₁ to 35 ₁
35 ₄ These outputs E ₁ (F) ~E ₄ past frames as (F) (F-1) ~ (F-4) to the measured residual spectrum envelope E (F-1)
~ E (F-4), and as prediction coefficients β _{1 to} β ₄ ,
Values β ₁ (F-1) to β ₄ (F-1) determined in the previous frame (F-1) are set. Accordingly, the output E _R "(F) of the adder 34 of the current frame is given by the following equation: E _R " (F) = β ₁ (F-1) E (F-1) + β ₂ (F-1) E (F- 2) + ... + β ₄ (F-1) E (F-4)

【００２９】図８の例では、加算器３４の出力Ｅ_Rは必
要に応じて定数加算部３８にて同一定数が各サンプルに
加算されて予測残差概形係数E_R' を得る。定数加算部３
８で定数を加算するのは加算器３４の出力として得られ
る予測残差概形係数Ｅ_Rの予測が大きく間違っていた場
合に、その影響を小とするためである。この加算定数
は、例えば加算器３４の出力の１フレームの平均パワー
に0.05を乗算した値であり、加算器３４の出力である予
測残差概形係数Ｅ_Rの振幅の平均が１０２４のとき、前
記加算定数は５０程度とされる。定数加算部３８の出力
E_R' は必要に応じて正規化部３９で１フレーム（Ｎ点）
のパワーの平均が１になるように正規化して最終的な現
フレームＦの予測残差概形（以降単に残差概形とも呼
ぶ）E_R(F) を得る。In the example of FIG. 8, the same constant as the output E _R of the adder 34 is added to each sample by a constant adder 38 as necessary to obtain a prediction residual approximate coefficient E _R '. Constant adder 3
The reason why the constant is added in step 8 is to reduce the influence of a large prediction error in the prediction residual approximate coefficient E _R obtained as an output of the adder 34, if the prediction is large. This addition constant is, for example, a value obtained by multiplying the average power of one frame output from the adder 34 by 0.05, and when the average of the amplitude of the prediction residual approximate coefficient E _R output from the adder 34 is 1024, The addition constant is about 50. Output of constant adder 38
E _R 'is one frame (N points) by the normalization unit 39 as necessary.
Is normalized so that the average of the power of the current frame becomes 1 to obtain a final estimated residual approximate form (hereinafter simply referred to as a residual approximate form) E _R (F) of the current frame F.

【００３０】この様にして得られる残差概形E_R(F) は、
例えば図４Ｄに示すように、正規化部２２からの図４Ｃ
に示す残差係数R(F)中の強いピッチ成分と対応する位置
に単極性のインパルスを有している。一般に、オーディ
オ信号では隣接フレーム間でピッチ成分の周波数位置が
ほとんど変わらないので、残差概形正規化部２６におい
てこの残差概形E_R(F) により入力残差係数R(F)を割算す
ることにより、ピッチ成分レベルが抑圧され、図４Ｅに
示すようなランダム成分が主体の微細構造係数が得られ
る。この正規化により得られた微細構造係数が前述のよ
うにパワー正規化部２７、量子化部２５で順次処理さ
れ、それぞれ正規化ゲインg(F)と量子化小系列ベクトル
C(m)が逆正規化部３１に与えられる。逆正規化部３１で
は、再生部３１Ａでは、量子化小系列ベクトルC(m)を再
配列して量子化正規化微細構造係数X_q(F)を求め、それ
に正規化ゲインg(F)を乗算して量子化微細構造係数を再
生し、その再生出力に乗算器３１Ｂで現フレームＦの残
差概形E_R(F) を乗算することにより量子化残差係数R
_q(F) を再生する。現フレームＦにおいてこの再生され
た量子化残差係数（再生残差係数）R_q(F) が残差概形計
算部２３のスペクトラム振幅計算部３２に与えられる。The residual approximate form E _R (F) thus obtained is
For example, as shown in FIG. 4D, FIG.
Has a unipolar impulse at a position corresponding to the strong pitch component in the residual coefficient R (F) shown in FIG. Generally, in an audio signal, the frequency position of the pitch component hardly changes between adjacent frames. Therefore, the residual approximate normalization unit 26 divides the input residual coefficient R (F) by the residual approximate E _R (F). By this calculation, the pitch component level is suppressed, and a fine structure coefficient mainly composed of random components as shown in FIG. 4E is obtained. The fine structure coefficients obtained by this normalization are sequentially processed by the power normalizing unit 27 and the quantizing unit 25 as described above, and the normalized gain g (F) and the quantized small sequence vector are respectively obtained.
C (m) is provided to the inverse normalization unit 31. In the inverse normalizing unit 31, the reproducing unit 31A rearranges the quantized small sequence vector C (m) to obtain a quantized normalized fine structure coefficient X _q (F), and calculates the normalized gain g (F) therefrom. The quantized residual coefficient R is multiplied to reproduce the quantized fine structure coefficient, and the reproduced output is multiplied by the residual approximate form E _R (F) of the current frame F by the multiplier 31B.
_q Play (F). The reproduced quantized residual coefficient (reproduced residual coefficient) R _q (F) in the current frame F is supplied to the spectrum amplitude calculator 32 of the residual approximate calculator 23.

【００３１】スペクトラム振幅計算部３２は、逆正規化
部３１からの再生された量子化残差係数R_q(F) のＮ個の
サンプルのスペクトラムの振幅を計算する。その計算さ
れたＮ個のスペクトラム振幅に対し、窓関数畳み込み部
３３で周波数窓関数が畳み込まれて現フレームＦの再生
残差係数R_q(F) の振幅の概形、即ち残差係数概形E(F)を
得て線形合成部３７に与える。スペクトラム振幅計算部
３２では、例えば、再生残差係数R_q(F) の各サンプルの
絶対値をスペクトラム振幅とする方法や、再生残差係数
R_q(F) の各サンプルの２乗値と１つ前のフレーム(F-1)
の残差係数R_q(F-1) の対応するサンプルの２乗値との和
の平方根をスペクトラム振幅とする方法などがある。ま
た、このスペクトラム振幅を対数で表してもよい。窓関
数の畳み込み方法は、幅が３〜９サンプル程度で、窓関
数の形状として、三角窓、ハミング窓、ハニング窓、指
数関数窓などを用いることもできるし、窓形状を適応的
に可変にしても良い。窓関数として指数関数窓を使う場
合の具体例としては、ｇを１以上の予め決めた整数とす
ると、窓関数を例えば次式The spectrum amplitude calculator 32 calculates the amplitude of the spectrum of the N samples of the quantized residual coefficient R _q (F) reproduced from the inverse normalizer 31. A window function convolution unit 33 convolves a frequency window function with the calculated N spectrum amplitudes, and the approximate form of the amplitude of the reproduction residual coefficient R _q (F) of the current frame F, that is, the residual coefficient approximate The form E (F) is obtained and given to the linear combination section 37. The spectrum amplitude calculation unit 32 uses, for example, a method of using the absolute value of each sample of the reproduction residual coefficient R _q (F) as the spectrum amplitude,
The square value of each sample of R _q (F) and the previous frame (F-1)
, The square root of the sum of the residual coefficient R _q (F-1) of the corresponding sample and the square value of the corresponding sample is used as the spectrum amplitude. In addition, the spectrum amplitude may be represented by a logarithm. The convolution method of the window function has a width of about 3 to 9 samples, and can use a triangular window, a Hamming window, a Hanning window, an exponential function window, or the like as the shape of the window function. May be. As a specific example of using an exponential function window as a window function, assuming that g is a predetermined integer of 1 or more, the window function is expressed by the following equation, for example.

【００３２】[0032]

【数１】で規定し、ａの値として例えばａ＝０.5が用いられる。
上式の場合の窓の幅は2g+1である。窓関数の畳み込みに
より、周波数軸上の各点におけるサンプル値は、その正
方向に隣接するｇ個のサンプル値と負方向に隣接するｇ
個のサンプル値の影響を受けた値に変換される。これに
より、残差概形計算部２３での残差概形の予測効果が過
敏にならないようにでき、従って復号音に異常音が生じ
るのを抑えることができる。窓の幅を１２サンプル以上
とすると、残差係数概形中のピッチ成分にもとづく変動
が不明となり、好ましくない。(Equation 1) And a = 0.5, for example, is used as the value of a.
The width of the window in the above equation is 2g + 1. Due to the convolution of the window function, the sample value at each point on the frequency axis becomes g sample values adjacent in the positive direction and g sample values adjacent in the negative direction.
Is converted to a value affected by the sample values. As a result, the effect of predicting the residual approximate shape in the residual approximate shape calculation unit 23 can be prevented from becoming too sensitive, and therefore, occurrence of abnormal sound in the decoded sound can be suppressed. If the width of the window is 12 samples or more, the fluctuation based on the pitch component in the residual coefficient outline becomes unclear, which is not preferable.

【００３３】窓関数が畳み込まれて得られたスペクトラ
ム概形E(F)は、現フレームのスペクトラム概形E₀(F) と
して線形合成部３７に与えられると共に、予測係数計算
部４０にも与えられる。予測係数計算部４０は線形合成
部３７への入力E₀(F) と遅延段３５₁ 〜３５₄ の出力E₁
=E(F-1) 〜E₄=E(F-4) が与えられ、後述のようにスペク
トラム概形E₀(F) に対する加算器３４の出力E_R" の自乗
誤差が最小となるように予測係数β₁(F)〜β₄(F)を適応
的に決定する。その後、遅延段３５₁〜３５₄は与えられ
ているスペクトラム概形E₀〜E₃をそれぞれ取り込んで、
更新されたスペクトラム概形E₁〜E₄として出力し、１フ
レームに付いての処理サイクルを終了する。その結果得
られる加算器３４の出力（合成残差概形）E_R" に基づい
て、前述と同様に次フレーム(F+1) の残差係数R(F+1)に
対する予測残差概形係数E_R(F+1)が生成される。The spectrum outline E (F) obtained by convolving the window function is given to the linear synthesizing unit 37 as the spectrum outline E ₀ (F) of the current frame. Given. Prediction coefficient calculation unit 40 outputs E ₁ input to the linear combination unit 37 E ₀ (F) a delay stage 35 _1-35 ₄
= E (F-1) to E ₄ = E (F-4), and the square error of the output E _R "of the adder 34 with respect to the spectrum outline E ₀ (F) is minimized as described later. adaptively determining a prediction coefficient _{_{β 1 (F) ~β 4 (}} F). then, capture delay stages 35 ₁ to 35 ₄ of the spectrum envelope E ₀ to E ₃ are given respectively,
Output as the spectrum envelope E ₁ to E _4, which are updated, the processing cycle is ended in with the frame. Based on the output (combined residual approximate form) E _R "of the adder 34 obtained as a result, a predicted residual approximate form for the residual coefficient R (F + 1) of the next frame (F + 1) is similarly obtained. A coefficient E _R (F + 1) is generated.

【００３４】予測係数β₁〜β₄は以下のようにして求め
ることができる。図８では予測次数は４次となっている
が、ここでは一般化のためにＱ次とする。ｑを１≦ｑ≦
Ｑを満足する任意の整数とし、ｑ段目における予測係数
の値をβ_qとする。乗算器３６₁〜３６_Q（Ｑ＝４）に対
する予測係数（乗算係数）をβ₁〜β_Qとし、また、ｑ段
目出力の係数系列をベクトルＥ_qで表し、遅延段３５₁
〜３５_Qの出力をＥ₁，Ｅ₂，…，Ｅ_Qとし、さらに、
窓関数畳み込み部３３の出力であるスペクトラム概形の
係数系列（現フレームの残差係数概形係数）E(F)をベク
トルＥ₀で表す。この時、次式(４)で求められる相互相
関係数ｒ_i,jを用いて、連立一次方程式(５)をβ₁〜β_Q
について解くことによって加算器３４の出力Ｅ_RのＥ₀
に対する自乗誤差（予測誤差）が最小になる予測係数β
₁〜β_Qが求められる。The prediction coefficients β _{1 to} β ₄ can be obtained as follows. In FIG. 8, the prediction order is the fourth order, but here the Q order is used for generalization. q is 1 ≦ q ≦
Q is an arbitrary integer that satisfies Q, and the value of the prediction coefficient at the q-th stage is β _q . The prediction coefficients (multiplication coefficients) for the multipliers 36 _{1 to} 36 _Q (Q = 4) are β ₁ to β _Q, and the coefficient series of the q-th output is represented by a vector E _q , and the delay stage 35 ₁
The output of ~ 35 _Q is E ₁ , E ₂ , ..., E _Q, and
A vector E ₀ represents the spectrum series coefficient sequence (residual coefficient approximation coefficient of the current frame) E (F) output from the window function convolution unit 33. At this time, using the cross-correlation coefficient r _{i, j} obtained by the following equation (4), the simultaneous linear equation (5) is converted into β ₁ to β _Q
By solving for E ₀ of the output E _R of adder 34
Prediction coefficient β that minimizes the square error (prediction error) with respect to
_{1 to} β _Q are required.

【００３５】ｒ_i,j＝Ｅ_i・Ｅ_j (４)R _{i, j} = E _i · E _j (4)

【００３６】[0036]

【数２】線形合成部３７において参照する過去のフレームのスペ
クトラム概形は４フレーム前までに限ることなく、１フ
レーム前だけでも、それ以上前でもよく、従って遅延段
の数Ｑは１以上任意の整数でよい。(Equation 2) Spectrum envelope of a past frame to be referred to in the linear combination unit 37 is not limited to before four frames, even in one frame before, may be before more, therefore at any integer number Q is one or more delay stages Good.

【００３７】この様に、図８の残差概形計算部２３を使
った符号化方法によれば、正規化部２２からの残差係数
R(F)は、その過去の残差係数から推定された残差概形E_R
(F)で正規化されるため、その正規化された微細構造係
数は残差係数R(F)よりも概形の変化が少ない平坦なもの
になる。従って、それだけ少ないビット数で量子化する
ことができる。また残差係数のスペクトラム振幅系列に
対し窓関数畳み込み部３３で周波数窓関数を畳み込んで
求めたスペクトラム概形E(F)から予測した残差概形E
_R(F) で残差係数R(F)を正規化するので、残差係数R(F)
中の例えばピッチ成分とそれぞれ対応した位置に現れる
強いパルスに対し、残差概形の推定が周波数軸方向に１
サンプル程度ずれても大きな予測誤差を生ずることなく
動作する。窓関数畳み込みを用いない場合は、推定誤り
が生じると大きな予測誤差が生じる原因となる。As described above, according to the encoding method using the residual shape calculating unit 23 in FIG.
R (F) is the residual approximation E _R estimated from the past residual coefficients
Since the normalized fine structure coefficient is normalized by (F), the normalized fine structure coefficient becomes flat with less change in the outline than the residual coefficient R (F). Therefore, quantization can be performed with a smaller number of bits. Also, the residual amplitude E predicted from the spectrum amplitude E (F) obtained by convolving the frequency window function with the window amplitude convolution unit 33 with respect to the spectrum amplitude sequence of the residual coefficient.
_{Since the} residual coefficient R (F) is normalized by R (F), the residual coefficient R (F)
For a strong pulse appearing at a position corresponding to, for example, a pitch component, the estimation of the residual approximate
It operates without generating a large prediction error even if it is shifted by about a sample. When window function convolution is not used, an estimation error causes a large prediction error.

【００３８】図３において、符号器１０から線形予測係
数の量子化値を示すインデックスI_Pと、微細構造係数の
パワー正規化利得の量子化値を示すインデックスI_Gと、
微細構造係数の量子化値を示すインデックスImとが出力
される。復号器５０においてはインデックスI_P，I_G，Im
が入力され、再生部５１で入力されたインデックスImか
ら正規化微細構造係数が再生され、正規化ゲイン再生部
５２で入力されたインデックスI_Gから正規化ゲインが再
生され、パワー逆正規化部５３で前記再生された正規化
微細構造係数が前記再生された正規化ゲインにより逆正
規化されて微細構造係数とされる。その微細構造係数
は、残差逆正規化部５４で、残差概形計算部５５からの
残差概形Ｅ_Rが掛算されて逆正規化され、残差係数R(F)
が再生される。この再生された残差係数R(F)の概形が残
差概形計算部５５で符号器１０の残差概形計算部２３と
同じ方法で計算される。[0038] In FIG. 3, and the index I _P showing the quantized value of the linear prediction coefficients from the encoder 10, and the index I _G indicating the quantized value of the power normalization gain of the fine structure coefficients,
An index Im indicating the quantization value of the fine structure coefficient is output. Index I _P in the decoder 50, I _G, Im
There are input, normalized fine structure coefficients from the index Im entered in the reproducing unit 51 is reproduced, the normalized gain from the index I _G input by normalizing the gain reproducing unit 52 is reproduced, a power de-normalization part 53 Then, the reproduced normalized fine structure coefficient is inversely normalized by the reproduced normalized gain to obtain a fine structure coefficient. Its fine structure coefficients by residual inverse normalization unit 54, the inverse normalized residual envelope E _R is multiplied from the residual envelope calculation section 55, the residual coefficients R (F)
Is played. The approximate shape of the reproduced residual coefficient R (F) is calculated by the residual approximate shape calculation unit 55 in the same manner as the residual approximate shape calculation unit 23 of the encoder 10.

【００３９】一方、再生・スペクトラム概形計算部５６
で入力されたインデックスI_Pから線形予測係数α₀〜α_P
が再生され、更にその線形予測係数から、符号器１０の
スペクトラム概形計算部２１と同じ方法でスペクトラム
概形が計算され、その計算されたスペクトラム概形が逆
正規化部５７で残差概形逆正規化部５４からの再生残差
係数R(F)に対して掛算されて逆正規化され、周波数領域
係数が再生される。その周波数領域係数はフレーム毎に
逆ＭＤＣＴ部５８でＮ次の逆変形離散コサイン変換がさ
れて２Ｎサンプルの時間領域信号（逆ＬＯＴ処理用フレ
ームと呼ぶ）に変換される。この時間領域信号は窓掛部
５９でフレームごとに例えば式(１)で表わされる形状の
時間窓がかけられる。その窓かけ出力はフレーム重ね合
わせ部６１で長さ２×Ｎサンプルの逆ＬＯＴ処理用現フ
レームの前半Ｎサンプルと前フレームの後半Ｎサンプル
とが互いに加算され、得られたＮサンプルを現フレーム
の再生音響信号として出力端子９１に出力される。On the other hand, the reproduction / spectrum outline shape calculation unit 56
The linear prediction coefficients α _{0 to} α _{P are} calculated from the index I _P input at
Is further reproduced from the linear prediction coefficients, and the spectrum outline is calculated in the same manner as the spectrum outline calculation unit 21 of the encoder 10. The calculated spectrum outline is denormalized by the denormalization unit 57. The reproduction residual coefficient R (F) from the inverse normalization unit 54 is multiplied and inversely normalized, and the frequency domain coefficient is reproduced. The frequency domain coefficients are subjected to an Nth-order inversely modified discrete cosine transform by the inverse MDCT unit 58 for each frame, and are converted into a time domain signal of 2N samples (referred to as a frame for inverse LOT processing). The time domain signal is time-framed by the windowing unit 59 for each frame, for example, in the shape represented by the equation (1). The windowed output is added to the first half N samples of the current frame for inverse LOT processing having a length of 2 × N samples and the second half N samples of the previous frame by the frame superposition unit 61, and the obtained N samples are added to the current frame. The signal is output to the output terminal 91 as a reproduced sound signal.

【００４０】上述においてＰ，Ｎ，Ｍの各値は、Ｐ＝６
０前後、Ｎ＝５１２，Ｍ＝６４程度を目安に自由に選べ
るが、Ｐ＋１＜Ｎ×４でなくてはならない。また、上記
実施例では、図６で説明した残差概形係数のインターリ
ーブベクトル量子化の際の係数系列の分割数Ｍの値は、
Ｎ／Ｍの値が整数であるものとして説明したが、Ｍの値
は、必ずしもＮ／Ｍが整数となるように設定する必要は
ない。整数とならない場合には、分割したそれぞれの小
系列の一部を１サンプルずつ長くして、不足サンプル数
を補えばよい。In the above description, each value of P, N and M is P = 6
Around 0, N = 512 and M = 64 can be freely selected, but P + 1 <N × 4. Further, in the above embodiment, the value of the number of divisions M of the coefficient sequence at the time of interleave vector quantization of the residual approximate coefficients described in FIG.
Although the description has been made assuming that the value of N / M is an integer, it is not always necessary to set the value of M so that N / M is an integer. If the number does not become an integer, a part of each divided small sequence may be lengthened by one sample to compensate for the number of missing samples.

【００４１】残差概形計算部２３，５５としては図９に
図８と対応する部分に同一符号を付けて示すように、窓
関数畳み込み部３３の出力を平均計算部４１で例えば１
０フレームにわたる平均を対応するサンプル毎に計算
し、又は１フレーム内平均をフレーム毎に求め、つまり
直流成分を検出し、その直流成分を窓関数畳み込みの出
力から減算器４２で差引き、得られた変動分のみを遅延
段３５₁ へ供給し、加算器３４の出力に加算器４３で同
じ平均計算部４１の出力を加えてもよい。加算器３４の
出力E_R" が減算器４２の出力Ｅ₀ になるべく近ずくよう
に予測係数β₁ 〜β_Qを決定する。このような予測係数
β₁〜β_Qは、前述と同様に式(４)、(５)により決定する
ことができる。しかし図９の構成によれば、変動分につ
いてのみ予測するため予測の能率がより向上する。As shown in FIG. 9 by attaching the same reference numerals to the parts corresponding to FIG. 8 as the residual approximate shape calculating sections 23 and 55, the output of the window function convolution
The average over 0 frames is calculated for each corresponding sample, or the average within one frame is obtained for each frame, that is, the DC component is detected, and the DC component is subtracted from the output of the window function convolution by the subtractor 42 to obtain the average. Only the variation may be supplied to the delay stage 35 _1, and the output of the same average calculator 41 may be added to the output of the adder 34 by the adder 43. Adder output E _R 34 'determines the prediction coefficients β ₁ ~β _Q as possible in the near Nuisance so the output E ₀ of the subtracter 42. Such prediction coefficient β ₁ ~β _Q, like above formula (4) and (5) can be determined, but according to the configuration of Fig. 9, the prediction efficiency is further improved because only the variation is predicted.

【００４２】更に図１０に示すように図９中の減算器４
２の出力のフレーム内の各サンプルの２乗の平均値の平
方根、つまり標準偏差を振幅検出部４４で計算し、この
標準偏差で減算器４２の出力を割算器４５において割る
ことにより正規化して変動を平坦化したスペクトラム概
形Ｅ₀ を遅延段３５₁ へ供給し、加算器３４の出力E_R"
が割算器４５の出力になるべく一致するように式(４)、
(５)により予測係数β ₁ 〜β_Qを決定し、加算器３４の
出力に振幅検出部４４の出力である標準偏差を乗算器４
６で乗算して逆正規化し、その逆正規化出力を加算器４
３へ供給して残差概形係数E_R(F) を得る。ただし、図１
０のように処理すると、図８に示した構成における予測
係数β₁〜β_Qを求める方程式(５)は、次式(６)Further, as shown in FIG. 10, the subtractor 4 in FIG.
The average of the squares of each sample in the frame of output 2
The root, that is, the standard deviation is calculated by the amplitude detector 44, and
Divide the output of the subtractor 42 by the standard deviation in the divider 45
Spectrum that is normalized by
Type E₀ To delay stage 35₁ And the output E of the adder 34_R"
Equation (4) so that
According to (5), the prediction coefficient β ₁ ~ Β_QIs determined, and the adder 34
The output of the multiplier 4 is a standard deviation which is the output of the amplitude detector 44.
6 and denormalized by multiplication by 6;
3 and the residual form factor E_R(F). However, FIG.
0, the prediction in the configuration shown in FIG.
Coefficient β₁~ Β_QThe equation (5) for obtaining the following is given by the following equation (6)

【００４３】[0043]

【数３】のように近似することが可能である。つまり、線形合成
部３７に与えられるスペクトラム概形のパワーが正規化
されているので、式(５)の左辺第１項中の対角要素
r_1,1，r_2,2，…は互いに等しくなり、またr_i,j＝r_j,iと
なる。式(６)中の行列はテプリッツ型なので、この方程
式はLevinson-Durbin アルゴリズムで高速に解くことが
できる。また、図８及び９では、Ｑ×Ｑ個の相関係数を
計算しなくてはならなかったが、図１１ではＱ個の相関
係数を計算するだけでよいので、予測係数β₁ 〜β_Qを
求めるのに少ない演算量ですむ。また、相関係数r
_0,jは、式(４)のように求めても良いが、下式(７)のよ
うに互いにｊフレーム離れた係数ベクトルＥ_iとＥ_i+j
の内積をｉ＝０からn_MAXに渡って加算して求めると、よ
り安定性が増す。(Equation 3) It is possible to approximate as follows. That is, since the power of the spectrum outline given to the linear synthesis unit 37 is normalized, the diagonal element in the first term on the left side of the equation (5) is used.
r _1,1 , r _2,2 ,... are equal to each other, and r _{i, j} = r _{j, i} . Since the matrix in equation (6) is of the Toeplitz type, this equation can be solved at high speed by the Levinson-Durbin algorithm. 8 and 9, Q × Q correlation coefficients have to be calculated. However, in FIG. 11, since only Q correlation coefficients need to be calculated, the prediction coefficients β _{1 to} β Only a small amount of computation is required to find _Q. Also, the correlation coefficient r
_{0, j} may be obtained as in equation (4), but as shown in equation (7) below, coefficient vectors E _i and E _{i + j} separated by j frames from each other are obtained.
Is obtained by adding the inner product of i from n = 0 to n _MAX , thereby increasing the stability.

【００４４】 r_0,j＝ (1/S)ΣＥ_i・Ｅ_i+j (７) ここで、Σはｉ＝０からn_MAXまでの総和演算子であり、
Ｓは平均化のための定数であり、Ｓ≧Ｑである。n_MAXの
値は、Ｓ−１でも良いし、(S-j-1) でも良い。なお、Le
vinson-Durbin アルゴリズムの詳細については、「音声
情報処理の基礎」（斉藤、中田共著、オーム社）に記述
してある。R _{0, j} = (1 / S) ΣE _i · E _{i + j} (7) where Σ is a sum operator from i = 0 to n _MAX ,
S is a constant for averaging, and S ≧ Q. The value of n _MAX may be S-1 or (Sj-1). Note that Le
The details of the vinson-Durbin algorithm are described in "Basics of Speech Information Processing" (by Saito and Nakata, Ohmsha).

【００４５】図１０において、振幅検出部４４で標準偏
差を求める代りに、各サンプルの絶対値の平均値を用い
てもよい。図８及び９における予測係数β₁ 〜β_Qの計
算について、相関係数r_i,jを下式(８)のように計算する
こともできる。 r_i,j＝ (1/S)ΣＥ_n+i・Ｅ_n+i+j (８) ここで、Σはｎ＝０からｎ_MAXまでの総和演算子であ
り、Ｓは平均化のための定数であり、Ｓ≧Ｑである。ｎ
_MAXの値は、S-1 でも良いし、S-j-1 でも良い。このよ
うに相関係数を計算すると、ＳがＱよりも十分大きい場
合には、r_i,j＝r_0,jと近似でき、予測係数を求める方程
式(５)は、式(６)と同一に近似でき、Levinson-Durbin
アルゴリズムで高速に解くことができる。In FIG. 10, instead of obtaining the standard deviation by the amplitude detector 44, the average value of the absolute values of each sample may be used. In the calculation of the prediction coefficients β _{1 to} β _Q in FIGS. 8 and 9, the correlation coefficient ri _{, j} can be calculated as in the following equation (8). r _{i, j} = (1 / S) ΣE _{n + i} · E _{n + i + j} (8) where Σ is a sum operator from n = 0 to n _MAX , and S is It is a constant and S ≧ Q. n
The value of _MAX may be S-1 or Sj-1. When the correlation coefficient is calculated in this way, when S is sufficiently larger than Q, r _{i, j} = r _{0, j} can be approximated, and the equation (5) for calculating the prediction coefficient is the same as the equation (6). To Levinson-Durbin
It can be solved at high speed with an algorithm.

【００４６】残差概形計算部２３，５５における残差概
形の予測係数β₁〜β_Qを全帯域一括で決定していたが、
残差概形計算部２３，５５の入力をまず小帯域に分割
し、これら各小帯域毎に独立に予測係数を設定してもよ
い。この小帯域の分割方法としては、周波数領域におい
て線形尺度で等分割、対数尺度で等分割、バーク尺度で
等分割などを用いることができる。The prediction coefficients β _{1 to} β _Q of the residual approximate shapes in the residual approximate shape calculation units 23 and 55 are determined collectively for all bands.
The input of the residual outline calculators 23 and 55 may be first divided into small bands, and a prediction coefficient may be independently set for each of these small bands. As a method of dividing the small band, equal division on a linear scale, equal division on a logarithmic scale, equal division on a Bark scale, and the like can be used in the frequency domain.

【００４７】残差概形計算部２３，５５において、予測
係数β₁〜β_Qの予測誤差の影響が小さくなるように、
窓関数畳み込み部３３における窓幅、窓中心、場合によ
っては窓形状を変更しても良い。また、窓関数の畳込み
と、予測係数β₁〜β_Qによる線形合成とを一括して行
ってもよい。その例を図１１に示す。この例は予測次数
Ｑを４、窓幅Ｔを３とした場合で、遅延段３５₁〜３５
₄の出力は、それぞれ周波数軸に沿って１サンプル正側
にシフトするシフタ７p₁〜７p₄と、１サンプル負側にシ
フトするシフタ７n₁〜７n₄とにそれぞれ供給され、正側
シフタ７p₁〜７p₄の各出力はそれぞれ乗算器８p₁〜８p₄
を通じて加算器３４へ供給され、負側シフタ７n₁〜７n₄
の各出力はそれぞれ乗算器８n₁〜８n₄を通じて加算器３
４へ供給される。いま乗算器３６₁，８n₁，８p₁，３
６₂，８n₂，８p₂，…，８p₄の各乗算係数をそれぞれ
β₁,β₂,β₃,β₄,β₅,β₆,…,β_u（この例ではｕ＝１
２）とし、その各入力スペクトラム概形ベクトルを
Ｅ₁，Ｅ₂，Ｅ₃，Ｅ₄，…，Ｅ_uとし、スペクトラム振幅
計算部３２の出力をＥ₀とする時、加算器３４の出力Ｅ
_RのＥ₀に対する２乗誤差が最小となる予測係数β₁〜
β_uは、予測係数計算部４０において次の（10）式を解
くことにより求めることができる。In the residual shape calculation units 23 and 55, the influence of the prediction errors of the prediction coefficients β _{1 to} β _Q is reduced.
The window width, window center, and in some cases, the window shape in the window function convolution unit 33 may be changed. Further, the convolution of the window function and the linear synthesis using the prediction coefficients β _{1 to} β _Q may be performed collectively. An example is shown in FIG. In this example, the prediction order Q is 4 and the window width T is 3, and the delay stages 35 _{1 to} 35 ₁
The output of the _4, respectively shifter 7p ₁ ～7P ₄ to shift one sample positive along the frequency axis, are supplied to a shifter 7n ₁ ～7N ₄ to shift to one sample negative, positive shifter 7p ₁ each output of ～7P ₄ multipliers 8p ₁ ~8p ₄
It is supplied to the adder 34 through the negative side shifter 7n ₁ ~7n ₄
Each output of the multiplier 8n ₁ ~8n ₄ through the adder 3
4. Now, the multipliers 36 ₁ , 8n ₁ , 8p ₁ , 3
_{_{_{6 2, 8n 2, 8p 2}}} , ..., respectively beta ₁ each multiplier coefficient _{_{_{8p 4, β 2, β 3}}} , β 4, β 5, β 6, ..., β u ( in this example u = 1
2), the input spectrum outline vectors are E ₁ , E ₂ , E ₃ , E ₄ ,..., _Eu, and the output of the spectrum amplitude calculator 32 is E _0.
_The prediction coefficient β ₁ to minimize the square error of _R with respect to E ₀
β _u can be obtained by solving the following equation (10) in the prediction coefficient calculation unit 40.

【００４８】r_i,j＝Ｅ_i・Ｅ_j R _{i, j} = E _i · E _j

【００４９】[0049]

【数４】この様にして決定された予測係数β₁〜β_uに対し得られ
る加算器３４の出力Ｅ _Rは、図８と同様に必要に応じて
定数加算し、更に正規化して現フレームＦの残差概形E_R
(F) とされ、残差概形正規化部２６で残差係数R(F)に対
する概形正規化のために使用される。このように窓関数
も適応化することは、図９及び１０の例にも適用でき
る。(Equation 4)The prediction coefficient β determined in this way₁~ Β_uObtained for
Output E of the adder 34 _RAs needed, as in FIG.
Add a constant and normalize it to obtain the residual E of the current frame F_R
(F), and the residual approximation normalization unit 26
It is used for approximate normalization. Thus the window function
Is also applicable to the examples of FIGS. 9 and 10.
You.

【００５０】図３、８〜１１の各例では、残差概形正規
化部２６に入力された現フレームＦの残差係数R(F)に対
し、残差概形計算部２３において１つ前のフレームＦ−
１の残差係数R(F-1)を使って決定した予測係数β₁(F-1)
〜β_Q(F-1)（又はβ_u）を使って得られた予測残差係数
概形E_R(F) を使って残差係数R(F)の正規化を行い、それ
によって微細構造係数を求める場合を説明した。しか
し、残差概形正規化部２６に入力された現フレームＦの
残差係数R(F)に対し、残差概形計算部２３において現フ
レームで予測係数β₁(F)〜β_Q(F)（図１１の場合はβ_u
であるが、以降全てβ_Qで代表する）を決定し、次式 E_R"(F)＝β₁(F)E₁(F)＋β₂(F)E₂(F)＋…＋β_Q(F)E_Q(F) により合成残差概形E_R"(F)を求め、それから得られた予
測残差概形E_R(F) を使って現フレームの残差係数R(F)を
正規化するように構成してもよい。その場合は、現フレ
ームＦの残差係数R(F)を直接使って予測係数β₁〜β_Qを
決定するため、図３中に破線で示すように正規化部２２
の出力である現フレームの残差係数R(F)を直接残差概形
計算部２３に与える。この方法は図８〜１１のいずれの
残差概形計算部２３にも適用できるが、代表して図８に
適用した場合の構成を図１２に示す。In each of the examples shown in FIGS. 3 and 8 to 11, the residual approximate shape calculation unit 23 calculates one for the residual coefficient R (F) of the current frame F input to the residual approximate normalization unit 26. Previous frame F-
Prediction coefficient β ₁ (F-1) determined using the residual coefficient R (F-1) of 1
Normalize the residual coefficient _R (F) using the estimated residual coefficient approximate form E _R (F) obtained using ~ β _Q (F-1) (or β _u ), thereby obtaining the fine structure The case where the coefficient is obtained has been described. However, with respect to the residual coefficient R (F) of the current frame F input to the residual approximate normalization unit 26, the residual approximate calculation unit 23 predicts the prediction coefficients β ₁ (F) to β _Q ( F) (in the case of FIG. 11, β _u
Hereafter, all are represented by β _Q ), and the following equation E _R "(F) = β ₁ (F) E ₁ (F) + β ₂ (F) E ₂ (F) +... + Β _Q ( F) E _Q (F) is used to obtain a combined residual approximate form E _R "(F), and the predicted residual approximate form E _R (F) is used to calculate the residual coefficient R (F) of the current frame. You may comprise so that it may normalize. In this case, since the prediction coefficients β _{1 to} β _Q are determined by directly using the residual coefficients R (F) of the current frame F, the normalization unit 22 shown in FIG.
, The residual coefficient R (F) of the current frame, which is the output of This method can be applied to any of the residual difference shape calculation units 23 in FIGS. 8 to 11, but FIG. 12 shows a configuration in which the method is applied to FIG. 8 as a representative.

【００５１】図１２において、図８と対応する部分には
同じ番号を付けてある。図８と異なる点は、残差概形計
算部２３にもう一組のスペクトラム振幅計算部３２' と
窓関数畳み込み部３３' を設け、現フレームの残差係数
R(F)を直接スペクトラム振幅計算部３２' に与えてその
スペクトラム振幅概形を求め、それに窓関数畳み込み部
３３' で窓関数を畳み込んでスペクトラム概形E^t ₀(F)を
求め、それを予測係数計算部４０に与えていることであ
る。従って、再生残差係数R_q(F) から求めた現フレーム
のスペクトラム概形E₀(F) は線形合成部３７の第１遅延
段３５₁ のみに与えられる。In FIG. 12, parts corresponding to those in FIG. 8 are given the same numbers. 8 is different from FIG. 8 in that another set of spectrum amplitude calculation unit 32 'and window function convolution unit 33' is provided in residual approximate shape calculation unit 23, and residual coefficient of current frame is set.
R (F) is directly given to the spectrum amplitude calculator 32 'to obtain the spectrum amplitude profile, and the window function convolution unit 33' convolves the window function to obtain the spectrum profile E ^t ₀ (F). Is given to the prediction coefficient calculation unit 40. Therefore, reproduced residual coefficients R _q spectrum of the current frame obtained from (F) envelope E ₀ (F) is applied only to the first delay stage 35 ₁ of the linear combining unit 37.

【００５２】まず、正規化部２２（図３参照）から残差
概形正規化部２６へ与えられた現フレームＦでの入力残
差係数R(F)は、残差概形計算部２３のスペクトラム振幅
計算部３２' 及び窓関数畳み込み部３３' でスペクトラ
ム振幅計算部３２及び窓関数計算部３３と同様の処理を
受け、残差係数R(F)のスペクトラム概形E^t ₀(F)が求めら
れ、それが予測係数計算部４０に与えられる。予測係数
計算部４０は、与えられた係数ベクトルE^t ₀ に対し加算
器３４の出力E_R" の２乗誤差が最小となるように、図８
の場合と同様に式(４)、(５)により予測係数β₁〜β₄を
計算する。その係数β₁〜β₄が乗算器３６₁〜３６₄に与
えられた時の加算器３４の出力が現フレームＦの合成残
差概形E_R"(F) として得られる。First, the input residual coefficient R (F) in the current frame F given from the normalizing section 22 (see FIG. 3) to the residual approximate normalizing section 26 is calculated by the residual approximate calculating section 23. The spectrum amplitude calculation unit 32 ′ and the window function convolution unit 33 ′ receive the same processing as the spectrum amplitude calculation unit 32 and the window function calculation unit 33, and the spectrum outline E ^t ₀ (F) of the residual coefficient R (F) is obtained. Is obtained and supplied to the prediction coefficient calculation unit 40. The prediction coefficient calculation unit 40 determines that the square error of the output E _R "of the adder 34 with respect to the given coefficient vector E ^t ₀ is minimized, as shown in FIG.
The prediction coefficients β _{1 to} β ₄ are calculated by the equations (4) and (5) as in the case of ( ₁ ). The output of the adder 34 when the coefficients β ₁ ~β ₄ is given to the multiplier 36 ₁ to 36 ₄ is obtained as the synthetic residual envelope E _R of the current frame F "(F).

【００５３】合成残差概形E_R"(F)は必要に応じて定数加
算部３８、正規化部３９で図８の場合と同様に処理され
て現フレームの残差概形E_R(F) として残差概形正規化部
２６に与え、現フレームＦの入力残差係数R(F)を正規化
し、微細構造係数を得る。微細構造係数は図３で説明し
たようにパワー正規化部２７でパワーが正規化され、更
に量子化部２５で重み付きベクトル量子化され、パワー
正規化部２７での正規化ゲインの量子化インデックスI_G
と量子化部２５での量子化インデックスImが復号器５０
に供給される。一方、量子化部２５から出力されたイン
ターリーブ型重み付きベクトルC(m)を逆正規化部３１で
再配列し、正規化ゲインg(F)で逆正規化して得られた再
生残差係数R_q(F) は、残差概形計算部２３のスペクトラ
ム振幅計算部３２に与えられてＮ個のサンプル点のスペ
クトラム振幅をそれぞれ計算し、得られた残差係数振幅
に対し窓関数畳み込み部３３で窓関数を畳み込んで残差
係数の概形E₀(F) を得る。このスペクトラム概形E₀(F)
を線形合成部３７に対する現フレームＦの入力係数ベク
トルＥ₀ として与え、遅延段３５₁〜３５₄はそれぞれ与
えられたスペクトラム概形E₀〜E₃を取り込んで、更新さ
れたスペクトラム概形E₁〜E₄として出力し、１フレーム
に付いての処理サイクルを終了する。The combined residual approximate form E _R "(F) is processed as necessary by the constant adder 38 and the normalizer 39 in the same manner as in FIG. 8 to obtain the residual approximate form E _R (F) of the current frame. ) To the residual approximate normalization section 26 to normalize the input residual coefficient R (F) of the current frame F to obtain a fine structure coefficient. 27 power is normalized, yet are weighted vector quantization by the quantization unit 25, the normalization gain at the power normalization unit 27 quantization index I _G
And the quantization index Im in the quantization unit 25 is
Supplied to On the other hand, the reproduction residual coefficient R obtained by rearranging the interleaved weighted vector C (m) output from the quantization unit 25 by the denormalization unit 31 and denormalizing by the normalization gain g (F). _q (F) is supplied to a spectrum amplitude calculator 32 of the residual approximate calculator 23 to calculate the spectrum amplitudes of the N sample points, respectively, and obtain a window function convolution unit 33 for the obtained residual coefficient amplitude. Convolves the window function to obtain the approximate form E ₀ (F) of the residual coefficient. This spectrum outline E ₀ (F)
The applied as an input coefficient vector E ₀ of the current frame F to the linear synthesis unit 37 takes in the spectrum envelope E ₀ to E ₃ delay stages 35 ₁ to 35 ₄ are given respectively, updated spectral envelope E ₁ output as to E _4, and ends the processing cycles with one frame.

【００５４】図１２の実施例では、現フレームの残差係
数R(F)からβ₁〜β₄を決定し、これを用いて現フレーム
の残差概形E_R(F) を予測合成するが、図３の復号器５０
においては、現フレームの再生残差係数R_q(F) は残差概
形逆正規化部５４においてパワー逆正規化部５３からの
現フレームの微細構造係数と残差概形計算部５５からの
現フレームの残差概形係数とを使って求めるべきもので
あり、従って残差概形計算部５５は現フレームの予測係
数β₁〜β₄を決定するための現フレームの残差係数R(F)
を与えられていない。従って、式(４)、(５)により予測
係数β₁〜β₄を決定することはできない。そこで、符号
器１０において図１２の残差概形計算部２３を使った場
合、対応する復号器５０の残差概形計算部５５には、符
号器１０側の予測係数計算部４０で決定した現フレーム
の予測係数β₁〜β₄を量子化してその量子化インデック
スI_Bを供給し、復号器５０では与えられたインデックス
I_Bから再生した係数β₁〜β₄を使って現フレームの残差
概形を計算する。In the embodiment of FIG. 12, β ₁ to β ₄ are determined from the residual coefficient R (F) of the current frame, and the residual approximate form E _R (F) of the current frame is predicted and synthesized using the determined values. Is the decoder 50 of FIG.
In, the reproduction residual coefficient R _q (F) of the current frame is converted into the fine structure coefficient of the current frame from the power inverse normalization unit 53 and the residual residual shape calculation unit 55 by the residual approximate inverse normalization unit 54. Therefore, the residual shape calculation unit 55 determines the prediction coefficients β _{1 to} β ₄ of the current frame by using the residual shape coefficient R ( F)
Have not been given. Therefore, the prediction coefficients β _{1 to} β ₄ cannot be determined by the equations (4) and (5). Therefore, when the residual approximate shape calculating unit 23 of FIG. 12 is used in the encoder 10, the prediction coefficient calculating unit 40 of the encoder 10 determines the residual approximate shape calculating unit 55 of the corresponding decoder 50. index prediction coefficients β ₁ ~β ₄ of the current frame are quantized and supplies the quantization index I _B, given the decoder 50
Using the coefficients β ₁ ~β ₄ reproduced from I _B to calculate the residual envelope of the current frame.

【００５５】即ち、図１３に復号器５０の残差概形計算
部５５を示すように、符号器１０の予測係数計算部４０
から与えられた現フレームの予測係数β₁〜β₄の量子化
インデックスI_Bは残差概形計算部５５の逆量子化部６０
で逆量子化されて予測係数β ₁〜β₄が再生され、線形合
成部６２の乗算器６６₁〜６６₄に設定される。これらの
予測係数β₁〜β₄は遅延段６５₁〜６５₄の出力とそれぞ
れ乗算され、それら乗算結果は加算器６７で加算され残
差概形Ｅ_Rが合成される。この合成残差概形Ｅ _Rは符号
器側と同様に定数加算部６８、正規化部６９で処理さ
れ、現フレームの残差概形E_R(F) として残差概形逆正規
化部５４に与えられる。残差概形逆正規化部５４はパワ
ー逆正規化部５３からの現フレームの微細構造係数と上
記残差概形E_R(F) を乗算して現フレームの残差係数R(F)
を求め、スペクトラム振幅計算部６３に与えると共に、
逆正規化部５７（図３）に与える。与えられた再生残差
係数R_q(F) に対しスペクトラム振幅計算部６３及び窓関
数畳み込み部６４は図１３の対応する部分と同様の処理
を行って残差係数のスペクトラム概形を得て、線形合成
部６２に与える。従って、図１２の残差概形計算部２３
に対応する復号器の残差概形計算部５５は予測係数計算
部を有していない。図１２の予測係数計算部４０におけ
る予測係数の量子化方法としては、例えば予測係数をＬ
ＳＰパラメータに変換した後、例えばフレーム間差分ベ
クトル量子化等の量子化を施すＬＳＰ量子化法を用いる
ことができる。That is to say, FIG.
As shown in the section 55, the prediction coefficient calculation section 40 of the encoder 10
Prediction coefficient β of the current frame given by₁~ Β_FourQuantization
Index I_BIs the inverse quantization unit 60 of the residual approximate calculation unit 55
And the prediction coefficient β ₁~ Β_FourIs played and the linear
Multiplier 66 of component 62₁~ 66_FourIs set to these
Prediction coefficient β₁~ Β_FourIs the delay stage 65₁~ 65_FourOutput and each
The result of the multiplication is added by an adder 67 and the result is multiplied.
Difference form E_RAre synthesized. This composite residual outline E _RIs the sign
The processing is performed by the constant adding unit 68 and the normalizing unit 69 in the same manner as the
And the residual form E of the current frame_R(F) as residual approximate inverse normal
To the conversion unit 54. The residual approximate inverse normalizing unit 54 is
-The fine structure coefficient of the current frame from the inverse normalization unit 53 and the
Approximate residual E_R(F) multiplied by the residual coefficient R (F) of the current frame.
And give it to the spectrum amplitude calculator 63.
This is given to the inverse normalizing section 57 (FIG. 3). Given playback residuals
Coefficient R_q(F), the spectrum amplitude calculator 63 and the window
The number convolution unit 64 performs the same processing as the corresponding part in FIG.
To obtain a spectral outline of the residual coefficients,
To the unit 62. Therefore, the residual shape calculator 23 in FIG.
The residual approximate shape calculation unit 55 of the decoder corresponding to
No parts. In the prediction coefficient calculation unit 40 of FIG.
As a quantization method of the prediction coefficient, for example, the prediction coefficient is L
After converting to SP parameters, for example,
Use LSP quantization method that performs quantization such as vector quantization
be able to.

【００５６】図８〜１０及び１２の残差概形計算部２３
において、それぞれ加算器３４の現フレームの出力であ
る合成残差概形Ｅ_Rに対する１〜４フレーム前の残差係
数のスペクトラム概形Ｅ₁ 〜Ｅ₄ の寄与の程度に応じ
て、それぞれ乗算器３６₁ 〜３６₄ の乗算係数β₁ 〜β
₄ を予め固定的に決めてもよく、例えば古いフレーム
程、重み（乗算係数）が小とされる。あるいは同一重
み、この例では１／４として４つのフレームのサンプル
係数の平均を用いてもよい。この様に係数β₁ 〜β₄を
固定した場合は、式(４)、(５)の計算を行う予測係数計
算部４０は不要である。この場合、復号器５０側の対応
する残差概形計算部５５でも同じ係数β₁ 〜β ₄ を使え
ばよいので係数β₁ 〜β₄ を復号器５０に転送する必要
はない。図１１の場合も同様に係数β₁ 〜β₁₂を固定し
てもよい。FIGS. 8 to 10 and 12 show the residual shape calculator 23.
Are the outputs of the current frame of the adder 34, respectively.
Composite residual E_ROf the frame 1 to 4 frames before
Number spectrum outline E₁ ~ E_Four Depending on the degree of contribution
And a multiplier 36₁ ~ 36_Four Multiplication factor β₁ ~ Β
_Four May be fixedly determined in advance, for example, an old frame
The smaller the weight (multiplication coefficient), the smaller the weight. Or the same weight
In this example, in this example, 1/4 and 4 frame samples
The average of the coefficients may be used. Thus, the coefficient β₁ ~ Β_FourTo
When fixed, a prediction coefficient meter that calculates equations (4) and (5)
The calculation unit 40 is unnecessary. In this case, the response on the decoder 50 side
The same coefficient β is applied to the residual₁ ~ Β _Four Use
The coefficient β₁ ~ Β_Four Must be transferred to the decoder 50
There is no. In the case of FIG.₁ ~ Β₁₂Fix
You may.

【００５７】図８〜１０及び１２の残差概形計算部２３
の各例において、その構成を最も簡単にするには、例え
ば図８において、加算器３４，遅延段３５₂〜３５₄，乗
算器３６₂〜３６₄を省略して、乗算器３６₁の出力を定
数加算部３８へ直接供給し、前フレーム(F-1) のスペク
トラム概形E₁＝E(F-1)からのみ、残差概形係数E_R(F) を
推定してもよい。この変形は図１１にも同様に適用で
き、その場合、乗算器３６₁、８ｐ₁、８ｎ₁の出力の
みを加算器３４に与えればよい。FIGS. 8 to 10 and 12 show the residual shape calculator 23.
In each example, to the simplest its configuration, for example, in FIG. 8, an adder 34, delay stage 35 _2-35 _4, skip multiplier 36 ₂ to 36 _4, the multiplier 36 ₁ outputs May be directly supplied to the constant adding unit 38 to estimate the residual approximate shape coefficient E _R (F) only from the spectrum approximate shape E ₁ = E (F−1) of the previous frame (F−1). This variant can be applied similarly to FIG. 1 1, in which case, the multiplier 36 _1, 8p _1, only the output of 8n ₁ may be supplied to the adder 34.

【００５８】図３及び８〜１２の各例では、残差概形計
算部２３は、合成残差概形係数E_R"が、入力された再生
残差係数R_q(F) 又は残差係数R(F)から求めたスペクトラ
ム概形E(F)とできるだけ一致するように線形予測により
予測係数β（β₁,β₂,…）を決めて予測残差概形係数E_R
(F) を求めていたが、この様な線形予測を行わないで、
残差概形を決定する実施例を図１４、１５及び１６を参
照して説明する。In each of FIGS. 3 and 8 to 12, the residual approximate shape calculator 23 calculates the combined residual approximate shape coefficient E _R ”by using the input reproduction residual coefficient R _q (F) or the residual coefficient. A prediction coefficient β (β ₁ , β ₂ ,...) Is determined by linear prediction so as to match as closely as possible the spectrum outline E (F) obtained from R (F), and the prediction residual outline coefficient E _R
(F), but without performing such linear prediction,
An embodiment for determining the residual shape will be described with reference to FIGS.

【００５９】図１４は図３と対応させて符号器１０と復
号器５０の全体の構成を示し、残差概形計算部２３に対
する接続は図３における破線の接続に対応する。従っ
て、図１２の場合と同様な逆正規化部３１は設けられて
いない。図３及び１２と異なる点は、図１４の実施例に
おける残差概形計算部２３は、線形合成すべき残差概形
Ｅ_Rが、与えられた残差係数R(F)のスペクトラム概形と
できるだけ一致するようにスペクトラム概形を量子化
し、その時の線形合成出力Ｅ_Rを残差概形E_R(F) として
使うと共に、その時の量子化インデックスI_Qを復号器５
０に供給する。復号器５０は、与えられたスペクトラム
概形量子化インデックスI_Qから残差概形計算部５５で逆
量子化処理によりスペクトラム概形E(F)を再生し、残差
概形逆正規化部５４に与える。その他の各部の処理は図
３の場合と同様であり、説明を省略する。FIG. 14 shows the entire configuration of the encoder 10 and the decoder 50 in correspondence with FIG. 3, and the connection to the residual approximate shape calculation unit 23 corresponds to the connection of the broken line in FIG. Therefore, the inverse normalizing unit 31 similar to the case of FIG. 12 is not provided. 3 and 12 differs from the residual envelope calculation section 23 in the embodiment of FIG. 14, residual envelope E _R to be linear synthesis, spectrum given residual coefficients R (F) envelope If possible matching quantized spectrum envelope so that, together with the use of linear synthetic output E _R at that time as a residual envelope E _R (F), the decoder the quantization index I _Q at that time 5
Supply 0. Decoder 50 reproduces the spectral envelope E (F) by the inverse quantization processing by the residual envelope calculation part 55 from the spectrum envelope quantization index I _Q given, residual envelope de-normalization part 54 Give to. The other processes of each unit are the same as those in FIG. 3, and the description is omitted.

【００６０】図１４の実施例における符号器１０と復号
器５０の残差概形計算部２３及び５５の構成例を図１５
に示す。残差概形計算部２３は残差係数R(F)が与えら
れ、そのＮサンプル点のスペクトラム振幅をそれぞれ計
算するスペクトラム振幅計算部３２と、得られたＮ点の
スペクトラム振幅に窓関数を畳み込み演算し、スペクト
ラム概形E(F)を求める窓関数畳み込み部３３と、得られ
たスペクトラム概形E(F)を量子化する量子化部３０と、
量子化スペクトラム概形が現フレームの量子化スペクト
ラム概形係数Ｅ_q0として入力され、過去のフレームの量
子化スペクトラム概形係数と線形合成を行う線形合成部
３７とから構成されている。線形合成部３７は、図１２
に示すものとほぼ同様に、遅延段３５₁〜３５₄と、乗算
器３６₁ 〜３６₄ と、加算器３４とから構成されてい
る。図１５の場合、１〜４フレーム前の量子化スペクト
ラム概形係数Ｅ_q1〜Ｅ_q4をそれぞれ予測係数β₁ 〜β₄
で乗算した結果だけでなく、乗算器３６₀ で現フレーム
の入力量子化スペクトラム概形係数Ｅ_q0に予測係数β₀
を乗算した結果も加算器３４に与えて合成し、予測残差
概形E_R(F) として出力する。また、予測係数β₀ 〜β₄
は予め決めた固定値である。量子化部３０は残差概形E_R
(F) の、入力スペクトラム概形E(F)に対する２乗誤差が
最小となるようにスペクトラム概形E(F)を量子化し、得
られた量子化スペクトラム概形係数E_q0 を線形合成部３
７に与えると共に、その量子化インデックスI_Qを復号器
５０の残差概形計算部５５に供給する。FIG. 15 shows an example of the configuration of the residual approximate calculation units 23 and 55 of the encoder 10 and the decoder 50 in the embodiment of FIG.
Shown in The residual approximate shape calculation unit 23 is given a residual coefficient R (F), and calculates a spectrum amplitude calculation unit 32 for calculating the spectrum amplitude of each of the N sample points, and convolves a window function with the obtained spectrum amplitude of the N points. A window function convolution unit 33 that calculates and obtains a spectrum outline E (F); a quantization unit 30 that quantizes the obtained spectrum outline E (F);
The quantized spectrum approximate form is input as the quantized spectrum approximate coefficient E _q0 of the current frame, and includes a linear combining section 37 that performs linear synthesis with the quantized spectrum approximate coefficient of the past frame. The linear synthesizing unit 37 is configured as shown in FIG.
Much like the one shown in, the delay stages 35 ₁ to 35 _4, a multiplier 36 ₁ to 36 _4, and an adder 34.. In the case of FIG. 15, the quantized spectrum approximate coefficients E _{q1 to} E _{q4 of one} to four frames before are respectively predicted coefficients β _{1 to} β _4.
Not only in the multiplication result, the multiplier 36 ₀ predictive coefficient to the input quantized spectrum envelope coefficients E _q0 of the current frame beta ₀
Is also given to the adder 34 to be synthesized, and output as a prediction residual approximate form E _R (F). Also, the prediction coefficients β _{0 to} β ₄
Is a predetermined fixed value. The quantization unit 30 calculates the residual approximate form E _R
(F) is quantized so that the square error with respect to the input spectrum approximate form E (F) is minimized, and the obtained quantized spectrum approximate coefficient E _q0 is converted to a linear synthesis unit 3.
7, and supplies the quantization index _IQ to a residual approximate shape calculation unit 55 of the decoder 50.

【００６１】残差概形計算部５５の再生部６０は、与え
られた量子化インデックスI_Qから現フレームの量子化ス
ペクトラム概形係数を再生し、符号器１０側と同様に遅
延段６５₁ 〜６５₄ と乗算器６６₀ 〜６６₄ と加算器６
７とから構成された線形合成部６２は再生部６０からの
現フレームの量子化スペクトラム概形係数と、遅延段６
５₁ 〜６５₄ からの過去のフレームの量子化スペクトラ
ム概形係数とを線形合成する。加算器６７から合成され
た残差概形E_R(F) が出力され、残差概形逆正規化部５４
に与えられる。なお、乗算器６６₀ 〜６６₄ には符号器
１０側と同じ係数β₀〜β₄が与えられている。また、符
号器１０の量子化部３０における量子化法として、スカ
ラ量子化を行ってもよいし、ベクトル量子化を行っても
よい。後者の場合、図７で説明したようなインターリー
ブされた係数系列のベクトル量子化を使ってもよい。[0061] reproduction of the residual envelope calculation section 55 60 reproduces the quantized spectrum envelope coefficients of the current frame from a given quantization index I _Q, the encoder 10 side as well as delay stages 65 ₁ 65 ₄ multipliers 66 ₀ to 66 ₄ and an adder 6
The linear synthesizing unit 62 constituted by the delay stage 6 and the delay stage 6 includes a quantized spectrum approximate coefficient of the current frame from the reproducing unit 60.
Of past frames from 5 _1-65 ₄ and quantized spectrum envelope coefficients linear combination. The combined residual approximate shape E _R (F) is output from the adder 67, and the residual approximate inverse normalization unit 54 is output.
Given to. The multipliers 66 _{0 to} 66 ₄ are given the same coefficients β _{0 to} β _{4 as those} on the encoder 10 side. In addition, scalar quantization or vector quantization may be performed as a quantization method in the quantization unit 30 of the encoder 10. In the latter case, vector quantization of the interleaved coefficient sequence as described in FIG. 7 may be used.

【００６２】図１６は図１５の変形実施例を示し、図１
５と対応する部分には同じ参照番号を付けてある。この
実施例では、量子化部３０においてスペクトラム概形E
(F)に対する予測残差概形（加算器３４の出力）E_R(F)
の２乗誤差が最小となるように量子化を行う点は同じで
あるが、線形合成部３７の構成が異なっている。即ち、
縦続接続された遅延段３５₁〜３５₄の入力として予測残
差概形E_R(F) が与えられ、従って、遅延段３５₁ 〜３５
₄ からそれぞれ１〜４フレーム前の予測残差概形E_R(F-
1)〜E_R(F-4)が出力されている。また、量子化部３０か
ら出力される量子化スペクトラム概形E_q(F) が直接加算
器３４に入力されている。従って、線形合成部３７は過
去のフレーム(F-1)〜(F-4)における予測残差概形E_R(F-
1)〜E_R(F-4)と現フレームＦにおける量子化概形係数と
を線形合成し、現フレームの予測残差概形E_R(F) を出力
する。復号器５０側の線形合成部６２も同様の構成とさ
れ、過去のフレームにおける合成残差概形と、再生され
た現フレームにおける量子化概形係数とを線形合成する
ことにより現フレームの残差概形を得る。FIG. 16 shows a modified embodiment of FIG.
Parts corresponding to 5 have the same reference numbers. In this embodiment, the quantizer 30 generates a spectrum outline E
Predicted residual approximation to (F) (output of adder 34) E _R (F)
Are the same in that quantization is performed so that the square error of is minimized, but the configuration of the linear synthesis unit 37 is different. That is,
The input of the cascaded delay stages 35 _{1 to} 35 ₄ is given the predicted residual approximation E _R (F), and thus the delay stages 35 _{1 to} 35 4
₄ to 1 to 4 frames before each prediction residual approximate form E _R (F-
1) to E _R (F-4) are output. Further, the quantized spectrum outline E _q (F) output from the quantization unit 30 is directly input to the adder 34. Accordingly, the linear synthesis unit 37 calculates the prediction residual approximate E _R (F−F
1) to E _R (F-4) are linearly combined with the quantized approximate coefficients in the current frame F, and a predicted residual approximate shape E _R (F) of the current frame is output. The linear synthesizing unit 62 of the decoder 50 has the same configuration, and linearly synthesizes the synthesized residual approximate shape in the past frame and the reproduced quantized approximate shape coefficient in the current frame to obtain the residual of the current frame. Get an outline.

【００６３】図８〜１２、１５及び１６の各残差概形計
算部２３において、帯域処理部を設け、窓関数畳み込み
部３３からのスペクトラム概形を複数の帯域に区分し、
変動成分の少ない高次帯域に付いてそのスペクトラム概
形を振幅が一定の平坦な概形に近似してもよい。図１７
は、例えば図８における畳み込み部３３と遅延部３５と
の間に挿入するそのような帯域処理部４７の例を示す。
この例では窓関数畳み込み部３３の出力E(F)は帯域処理
部４７に入力され、分割部４７Ａで最低次（最低周波
数）から約２／３のサンプル点を中心に例えば５０次程
度の幅の狭い中間帯域の成分E_B(F) と、これより高次の
高次帯域成分E_H(F) と低次の低次帯域成分E_L(F) に分割
し、高次帯域成分E_H(F) を平均部４７Ｂへ供給して、そ
の高次帯域のスペクトラム振幅の平均値を求めて、高次
帯域成分E_H(F) を全てその平均値に置き換えて出力し、
低次帯域成分E_L(F) はそのまゝ出力する。また中間帯域
の成分E_B(F) については、漸近部４７Ｃにより中間帯域
の高低両端におけるスペクトラム振幅がそれぞれ平均部
４７Ｂで得た平均値及び低次帯域E_L(F)の最高次のスペ
クトラム振幅に一致し、その間で直線的に変化するよう
に各スペクトラム点の振幅が修正されて出力される。つ
まり、高周波成分は変動が少ないので高次帯域内のスペ
クトラム振幅を一定値、ここでは平均値、に近似してい
る。In each of the residual shape calculating units 23 shown in FIGS. 8 to 12, 15, and 16, a band processing unit is provided, and the spectrum shape from the window function convolution unit 33 is divided into a plurality of bands.
For a high-order band having a small fluctuation component, the spectrum outline may be approximated to a flat outline having a constant amplitude. FIG.
Shows an example of such a band processing unit 47 inserted between the convolution unit 33 and the delay unit 35 in FIG. 8, for example.
In this example, the output E (F) of the window function convolution unit 33 is input to the band processing unit 47, and the division unit 47A has a width of, for example, about the 50th order centering on about 2/3 sampling points from the lowest order (lowest frequency). Is divided into a narrower intermediate band component E _B (F), a higher-order higher-order band component E _H (F) and a lower-order lower-order band component E _L (F), and the higher-order band component E _H (F) is supplied to the averaging unit 47B, the average value of the spectrum amplitude of the higher-order band is obtained, and all the higher-order band components E _H (F) are replaced with the average value and output.
The low-order band component E _L (F) is output as it is. Regarding the component E _B (F) of the intermediate band, the spectrum amplitudes at the upper and lower ends of the intermediate band are respectively obtained by the asymptotic portion 47C by the average value obtained by the averaging portion 47B and the highest-order spectrum amplitude of the low-order band E _L (F). , And the amplitude of each spectrum point is corrected and output so as to change linearly between them. That is, since the high-frequency component has little fluctuation, the spectrum amplitude in the high-order band is approximated to a constant value, here an average value.

【００６４】図８〜１２の各残差概形計算部２３におい
て、入力音響信号の典型的な複数の状態に応じた好まし
い予測係数β₁ 〜β_Q（又はβ_u）の組をそれぞれイン
デックスに対応した係数ベクトルとして符号帳に用意し
ておき、もっとも良く残差概形を予測する係数ベクトル
を符号帳から選択して使用し、その係数ベクトルを示す
インデックスを復号器５０の残差概形計算部５５に転送
してもよい。Each of the residual shape calculators 23 of FIGS. 8 to 12 uses a set of preferable prediction coefficients β _{1 to} β _Q (or β _u ) corresponding to a plurality of typical states of the input audio signal as indexes. The corresponding coefficient vector is prepared in the codebook, and the coefficient vector that best predicts the residual shape is selected from the codebook and used, and the index indicating the coefficient vector is used by the decoder 50 to calculate the residual shape. The information may be transferred to the unit 55.

【００６５】図８〜１１の各実施例のように、過去の残
差係数概形から現在の残差概形を予測する線形予測モデ
ルにおいては、その系の安定性を判断するためパラメー
タｋが用いられるが、この発明においても、予測係数を
ｋパラメータに変換して、その絶対値が１.0に近い、或
いは１.0を越えている場合、強制的に予め決められた係
数に設定したり、または別の残差概形構成方法に、例え
ば図８から図９へ切り替えたり、あるいは予め決められ
た残差概形（例えば凹凸のない平坦な信号）に切り替え
る、などの処理を施して系の安定性が増すようにするこ
ともできる。As in the embodiments shown in FIGS. 8 to 11, in the linear prediction model for predicting the current residual general shape from the past residual coefficient general shape, the parameter k is used to judge the stability of the system. However, in the present invention as well, the prediction coefficient is converted into the k parameter, and when the absolute value is close to 1.0 or exceeds 1.0, the prediction coefficient is forcibly set to a predetermined coefficient. 8 or FIG. 9 or switch to a predetermined residual shape (for example, a flat signal without unevenness), or the like. The stability of the system can be increased.

【００６６】図３及び１４の実施例の符号器１０では、
線形予測分析部１７で線形予測分析の際、窓掛け部１５
からの入力音響信号の自己相関係数を用いて予測係数を
求めた。しかし、図１８に示すようにＭＤＣＴ部１６で
求めた周波数領域係数の各サンプル（スペクトラム）の
絶対値を絶対値部８１で求め、その絶対値出力を逆フー
リエ変換部８２で逆フーリエ変換して自己相関係数と
し、その自己相関係数を線形予測分析部１７で線形予測
分析してもよい。この場合は、その分析に先立って相関
を求める必要はない。In the encoder 10 of the embodiment of FIGS.
When the linear prediction analysis is performed by the linear prediction analysis unit 17, the windowing unit 15
The prediction coefficient was obtained by using the autocorrelation coefficient of the input acoustic signal from. However, as shown in FIG. 18, the absolute value of each sample (spectrum) of the frequency domain coefficient obtained by the MDCT unit 16 is obtained by the absolute value unit 81, and the output of the absolute value is subjected to inverse Fourier transform by the inverse Fourier transform unit 82. The autocorrelation coefficient may be used as the autocorrelation coefficient, and the linear prediction analysis unit 17 may perform linear prediction analysis on the autocorrelation coefficient. In this case, there is no need to determine the correlation prior to the analysis.

【００６７】更に、図３及び１４の各実施例において
は、符号器側では入力信号の線形予測係数α₀〜α_Pを量
子化し、その量子化予測係数をフーリエ変換して入力信
号のスペクトラム概形（周波数特性の概形）を求め、そ
の概形により入力信号の周波数特性を正規化して残差係
数を得た。また、その量子化予測係数のインデックスI_P
を復号器に転送し、復号器でそのインデックスI_Pから線
形予測係数α₀ 〜α_Pを得、更にその線形予測係数から
周波数特性の概形を求めた。しかし、以下のようにして
もよい。即ち、図１９に図３と対応する部分に同一符号
を付けて示すように、ＭＤＣＴ部１６からの周波数領域
係数をスケーリングファクタ計算量子化部１９に分岐入
力して、その周波数領域係数をいくつかの小帯域に分割
し、その各小帯域ごとのサンプルの絶対値の平均値、ま
たは最大値をスケーリングファクタとして求め、このス
ケーリングファクタを量子化し、そのインデックスI_Sを
復号器５０へ送出する。また前記求めたスケーリングフ
ァクタでＭＤＣＴ部１６からの周波数領域係数を正規化
部２２において、各対応小帯域ごとに割算して残差係数
R(F)を得て残差概形正規化部２６へ供給する。更に、前
記求めたスケーリングファクタと、残差概形計算部２３
からの残差係数の概形の対応する小帯域のサンプルとを
重み計算部２４で乗算して重み付け係数Ｗ(w₁,…,w_N)を
得て量子化部２５へ供給する。復号器５０では入力され
たインデックスI_Sからスケーリングファクタ再生部７１
でスケーリングファクタを再生し、そのスケーリングフ
ァクタを逆正規化部５７で残差概形逆正規化部５４から
の再生残差係数に乗算して周波数領域係数を再生して逆
ＭＤＣＴ部５８へ供給する。Further, in each of the embodiments of FIGS. 3 and 14, the encoder quantizes the linear prediction coefficients α _{0 to} α _P of the input signal, and performs a Fourier transform on the quantized prediction coefficients to obtain an approximate spectrum of the input signal. A shape (an outline of frequency characteristics) was obtained, and the frequency characteristics of the input signal were normalized based on the outline to obtain a residual coefficient. Also, the index I _{P of the} quantized prediction coefficient
It was transferred to the decoder, resulting the index I _P linear prediction coefficients from alpha ₀ to? _P at the decoder to determine the further envelope of the frequency characteristic from the linear prediction coefficients. However, the following may be performed. That is, as shown in FIG. 19 by attaching the same reference numerals to the parts corresponding to FIG. 3, the frequency domain coefficients from the MDCT section 16 are branched and input to the scaling factor calculation and quantization section 19, and some of the frequency domain coefficients are divided into small bands of calculated average of the absolute values of samples per the respective small band or the maximum value as a scaling factor, the scaling factor is quantized, and sends the index I _S to the decoder 50. Further, the normalization unit 22 divides the frequency domain coefficient from the MDCT unit 16 for each corresponding small band by the obtained scaling factor to obtain a residual coefficient.
R (F) is obtained and supplied to the residual approximate normalization unit 26. Further, the obtained scaling factor and the residual approximate shape calculation unit 23
Are multiplied by a sample of a small band corresponding to the approximate form of the residual coefficient from the weighting unit 24 to obtain a weighting coefficient W (w ₁ ,..., W _N ) and supply it to the quantization unit 25. Scaling factor reproducing unit 71 from the decoder 50 is input index I _S
To reproduce the scaling factor, and the inverse normalization unit 57 multiplies the reproduction residual coefficient from the residual approximate inverse normalization unit 54 by the inverse normalization unit 57 to reproduce the frequency domain coefficient and supplies it to the inverse MDCT unit 58. .

【００６８】上述では入力音響信号を周波数領域係数に
変換した後、残差係数を得たが、入力音響信号から時間
領域でスペクトラム概形を取り除いた残差信号を求め、
その残差信号を周波数領域の残差係数に変換してもよ
い。即ち、図２０に図３と対応する部分に同一符号を付
けて示すように、入力端子１１からの音響入力信号は線
形予測分析部１７で線形予測分析され、得られた線形予
測係数α₀〜α_Pは量子化部１８で量子化され、その量子
化された線形予測係数が逆フィルタ２８にフィルタ係数
として設定され、この逆フィルタ２８に端子１１からの
入力音響信号が通されて、平坦化された周波数特性を有
する時間領域の残差信号を得る。この残差信号はコサイ
ン変換部２９で離散コサイン変換されて周波数領域の残
差係数R(F)に変換されて残差概形正規化部２６へ供給さ
れる。一方、量子化部１８からの量子化線形予測係数が
スペクトラム概形計算部２１へ供給され、入力信号の周
波数特性の概形が求められ、重み計算部２４に与えられ
る。符号器１０におけるその他の部分の処理は図３と同
様である。In the above description, the residual coefficient is obtained after converting the input audio signal into the frequency domain coefficient. The residual signal obtained by removing the spectral outline in the time domain from the input audio signal is obtained.
The residual signal may be converted into a residual coefficient in the frequency domain. That is, as shown with the parts corresponding to those in FIG. 3 in FIG. 20, an acoustic input signal from the input terminal 11 is linear prediction analysis in the linear prediction analysis unit 17, the linear prediction coefficients resulting alpha ₀ ~ α _P is quantized by the quantization unit 18, the quantized linear prediction coefficients are set as filter coefficients in the inverse filter 28, and the input audio signal from the terminal 11 is passed through the inverse filter 28, and To obtain a time-domain residual signal having the obtained frequency characteristic. The residual signal is subjected to a discrete cosine transform by a cosine transform unit 29, converted to a residual coefficient R (F) in the frequency domain, and supplied to a residual approximate normalization unit 26. On the other hand, the quantized linear prediction coefficient from the quantization unit 18 is supplied to the spectrum outline calculation unit 21, and the outline of the frequency characteristic of the input signal is obtained and given to the weight calculation unit 24. The other processes in the encoder 10 are the same as those in FIG.

【００６９】復号器５０では残差概形逆正規化部５４か
らの再生残差係数R_q(F) は逆コサイン変換部７２で逆離
散コサイン変換されて時間領域の残差信号とされて合成
フィルタ７３へ供給される。一方、入力されたインデッ
クスI_Pは再生部７４で線形予測係数α₀〜α_Pに再生さ
れ、その線形予測係数が合成フィルタ７３のフィルタ係
数として設定される。合成フィルタ７３は逆コサイン変
換部７２からの残差信号が与えられ、音響信号を合成し
て出力端子９１へ出力する。図２０に示した例では時間
周波数変換はＭＤＣＴよりもＤＣＴの方が適する。In the decoder 50, the reproduction residual coefficient R _q (F) from the residual approximate inverse normalizer 54 is subjected to inverse discrete cosine transform by the inverse cosine transform unit 72, and is synthesized as a time domain residual signal. It is supplied to the filter 73. On the other hand, the index I _P inputted is reproduced linear prediction coefficient alpha ₀ to? _P in the reproducing unit 74, the linear prediction coefficient is set as the filter coefficients of the synthesis filter 73. The synthesis filter 73 receives the residual signal from the inverse cosine transform unit 72, synthesizes an audio signal, and outputs the synthesized audio signal to the output terminal 91. In the example shown in FIG. 20, DCT is more suitable for time-frequency transform than MDCT.

【００７０】図３、１４、１９及び２０の実施例におけ
る量子化部２５として、図２１に示すように構成し、図
２２に示す処理手順に従って量子化を行ってもよい。即
ち、まずスカラー量子化部２５Ａにおいて量子化幅制御
部２５Ｄから与えられた予め決められた最大量子化幅で
微細構造係数X(F)をスカラー量子化する(S1)。次に誤差
計算部２５Ｂにおいて入力された微細構造係数X(F)に対
する量子化された微細構造係数Xq(F) の誤差を計算する
(S2)。誤差としては、例えば重み付け係数Ｗを使った重
み付け２乗誤差を使う。次に量子化ループ制御部２５Ｃ
においてこの量子化誤差が聴感的に許容される所定値よ
り小さいかを判断する(S3)。この時、量子化誤差が所定
値より小さければ、その量子化微細構造係数Xq(F) とそ
れを表すインデックスImを出力すると共に、使用された
量子化幅を表すインデックスI_Dを量子化幅制御部２５Ｄ
から出力し、量子化処理を終了する。量子化ループ制御
部２５Ｃは、ステップＳ３で量子化誤差が所定値より大
きいと判断した場合は、量子化微細構造係数Xq(F) に使
用されるビット数が使用可能最大ビット数を越えていな
いか判断する(S4)。使用されるビット数が使用可能最大
ビット数を越えていなければ、処理ループを継続すべき
と判断し、量子化幅制御部２５Ｄに対し前回より小さい
所定の量子化幅をスカラー量子化部２５Ａに与えさせ(S
5)、スカラー量子化部２５Ａで再量子化する。以下同様
の処理手順を繰り帰る。ステップＳ４で使用ビット数が
使用可能最大ビット数以上の場合は、前回のループによ
る量子化微細構造係数Xq(F) とそのインデックスImと共
に量子化幅インデックスI_Dを出力して量子化処理を終了
する。The quantization section 25 in the embodiments of FIGS. 3, 14, 19 and 20 may be configured as shown in FIG. 21 and perform quantization according to the processing procedure shown in FIG. That is, first, the scalar quantization unit 25A scalar-quantizes the fine structure coefficient X (F) with the predetermined maximum quantization width given from the quantization width control unit 25D (S1). Next, an error of the quantized fine structure coefficient Xq (F) with respect to the input fine structure coefficient X (F) is calculated in the error calculator 25B.
(S2). As the error, for example, a weighted square error using a weighting coefficient W is used. Next, the quantization loop control unit 25C
It is determined whether this quantization error is smaller than a predetermined value that is perceptually permissible (S3). At this time, if the quantization error is smaller than a predetermined value, the quantized fine structure coefficient Xq (F) and the index Im representing the same are output, and the index _ID representing the used quantization width is controlled by the quantization width control. Part 25D
, And the quantization process ends. If the quantization loop control unit 25C determines that the quantization error is larger than the predetermined value in step S3, the number of bits used for the quantized fine structure coefficient Xq (F) does not exceed the maximum number of usable bits. Is determined (S4). If the number of bits used does not exceed the maximum number of usable bits, it is determined that the processing loop should be continued, and a predetermined quantization width smaller than the previous one is sent to the scalar quantization unit 25A by the quantization width control unit 25D. Give (S
5), requantization is performed by the scalar quantization unit 25A. Hereinafter, the same processing procedure is repeated. If the number of bits used is equal to or greater than the maximum number of usable bits in step S4, the quantization width index _ID is output together with the quantization fine structure coefficient Xq (F) and its index Im from the previous loop, and the quantization process ends. I do.

【００７１】この量子化部２５に対応する復号器５０の
再生部５１（図３、１４、１９、２０参照）には、量子
化インデックスImと量子化幅インデックスI_Dを供給し、
再生部５１はこれらのインデックスに基づいて微細構造
係数を再生する。The reproducing unit 51 (see FIGS. 3, 14, 19 and 20) of the decoder 50 corresponding to the quantizing unit 25 is supplied with the quantization index Im and the quantization width index _ID .
The reproducing unit 51 reproduces the fine structure coefficient based on these indexes.

【００７２】[0072]

【発明の効果】以上述べたように、この発明によれば、
ピッチ成分を含む信号が入力された場合にみられる周波
数領域の残差係数におけるフレーム間の高い相関を利用
して、その残差係数の概形を正規化して、より平坦化し
た微細構造係数として量子化するため、高い量子化能率
が得られる。また、複数ピッチが存在する場合にも周波
数領域ではピッチが分離されるので問題を生じない。更
に、残差係数の概形を適応的に決定するので、ピッチ成
分の変化傾向に追従することもできる。As described above, according to the present invention,
Utilizing the high correlation between frames in the frequency domain residual coefficient seen when a signal containing a pitch component is input, the general shape of the residual coefficient is normalized to provide a more flattened fine structure coefficient. Since quantization is performed, high quantization efficiency is obtained. Further, even when a plurality of pitches exist, no problem occurs because the pitches are separated in the frequency domain. Furthermore, since the approximate shape of the residual coefficient is determined adaptively, it is possible to follow the changing tendency of the pitch component.

【００７３】また、重ね合わせ直交変換を用いて入力音
響信号を周波数領域係数に変換し、それを、音響信号の
線形予測係数から求めたスペクトラム振幅（即ち入力音
響信号の周波数特性の概形）で周波数領域において正規
化する実施例では、フレーム間ノイズを生じさせないで
周波数領域係数の高能率な平坦化を実現することが可能
である。[0073] Further, by using the I heavy combined orthogonal transformation converts the input acoustic signal to frequency domain coefficients, which, (envelope of the frequency characteristics of words input acoustic signal) spectrum amplitude obtained from the linear prediction coefficients of the acoustic signals In the embodiment in which the frequency domain is normalized in the frequency domain, it is possible to realize highly efficient flattening of the frequency domain coefficient without generating inter-frame noise.

【００７４】Ｐ＝６０，Ｎ＝５１２，Ｍ＝６４，Ｑ＝２
とし、線形予測係数α₀〜α_Pと正規化ゲインの量子化の
ための情報量を十分大きくとり、微細構造係数を２ビッ
ト／サンプルの情報量でベクトル量子化する条件で、図
８に示した残差概形計算部２３を用いて様々な音楽ソー
スを符号化復号化した場合、残差概形計算部２３，５５
を取り除いて符号化復号化した場合よりも平均約５dB，
最大約１０dBのセグメンタルＳＮＲの改善がみられる。
また、聴感的にもより自然性の高い音が得られる。P = 60, N = 512, M = 64, Q = 2
FIG. 8 shows a condition under which the amount of information for quantizing the linear prediction coefficients α _{0 to} α _P and the normalized gain is sufficiently large, and the fine structure coefficient is vector-quantized with the information amount of 2 bits / sample. When various music sources are encoded and decoded using the residual approximate shape calculation unit 23, the residual approximate shape calculation units 23 and 55
Approximately 5 dB on average compared to encoding and decoding
Up to about a 10 dB improvement in segmental SNR is seen.
Also, a sound with higher naturalness can be obtained audibly.

【図面の簡単な説明】[Brief description of the drawings]

【図１】入力信号の周波数特性をスケーリングファクタ
によって平坦化する従来の符号器を示すブロック図。FIG. 1 is a block diagram showing a conventional encoder for flattening a frequency characteristic of an input signal by a scaling factor.

【図２】入力信号の周波数特性を線形予測分析フィルタ
で平坦化する従来の符号器を示すブロック図。FIG. 2 is a block diagram showing a conventional encoder for flattening a frequency characteristic of an input signal using a linear prediction analysis filter.

【図３】この発明が適用された符号器、及びこの発明が
適用された復号器の各実施例を示すブロック図。3 is a block diagram showing each embodiment of the inventions is applied coder, and a decoder which this inventions is <br/> applied.

【図４】Ａは図３におけるＭＤＣＴ部で求めた周波数領
域波形の例、Ｂはスペクトラム概形計算部により求めた
スペクトラム概形の例、Ｃは正規化部２２により求めた
残差係数の例、Ｄは残差概形計算部２３で求めた残差概
形の例、Ｅは残差概形正規化部２６で求めた微細構造係
数の例をそれぞれ示す図。4A is an example of a frequency domain waveform obtained by an MDCT unit in FIG. 3, B is an example of a spectrum outline obtained by a spectrum outline calculation unit, and C is an example of a residual coefficient obtained by a normalization unit 22. FIG. 4D is a diagram showing an example of the residual shape obtained by the residual shape calculator 23, and FIG.

【図５】Ａは予測係数から周波数特性の概形を得る方法
を示す図、Ｂは予測係数から周波数特性の概形を得る他
の方法を示す図。FIG. 5A is a diagram illustrating a method of obtaining a rough shape of a frequency characteristic from a prediction coefficient, and FIG. 5B is a diagram illustrating another method of obtaining a rough shape of a frequency characteristic from a prediction coefficient.

【図６】ベクトル量子化の際の信号系列と分割小系列と
の関係例を示す図。FIG. 6 is a diagram showing an example of a relationship between a signal sequence and a divided small sequence at the time of vector quantization.

【図７】量子化部２５の構成例を示すブロック図。FIG. 7 is a block diagram illustrating a configuration example of a quantization unit 25.

【図８】図３中の残差概形計算部２３，５５の具体例を
示すブロック図。FIG. 8 is a block diagram showing a specific example of residual shape calculating units 23 and 55 in FIG. 3;

【図９】図８中の残差概形計算部２３，５５の変形例を
示すブロック図。FIG. 9 is a block diagram showing a modified example of the residual shape calculating units 23 and 55 in FIG. 8;

【図１０】図９中の残差概形計算部２３、５５の変形例
を示すブロック図。FIG. 10 is a block diagram showing a modified example of the residual approximate shape calculation units 23 and 55 in FIG. 9;

【図１１】図３中の残差概形計算部２３，５５で窓関数
及び予測係数の両者を適応的に制御するようにした例を
示すブロック図。FIG. 11 is a block diagram showing an example in which both the window function and the prediction coefficient are adaptively controlled by the residual shape calculators 23 and 55 in FIG. 3;

【図１２】図３中の残差概形計算部２３の更に他の例を
示すブロック図。FIG. 12 is a block diagram showing still another example of the residual shape calculating unit 23 in FIG. 3;

【図１３】図１２に対応する復号器側の残差概形計算部
５５の例を示すブロック図。FIG. 13 is a block diagram showing an example of a residual difference calculation unit 55 on the decoder side corresponding to FIG. 12;

【図１４】この発明の符号器と復号器の他の実施例を示
すブロック図。FIG. 14 is a block diagram showing another embodiment of the encoder and the decoder according to the present invention.

【図１５】図１４における残差概形計算部２３、５５の
具体例を示すブロック図。FIG. 15 is a block diagram showing a specific example of residual approximate shape calculation units 23 and 55 in FIG. 14;

【図１６】図１４における残差概形計算部２３、５５の
具体例を示すブロック図。FIG. 16 is a block diagram showing a specific example of residual approximate shape calculation units 23 and 55 in FIG. 14;

【図１７】残差概形計算部２３においてスペクトラム概
形の高次帯域部を一定値に近似した処理をする帯域処理
部の構成を示すブロック図。FIG. 17 is a block diagram showing a configuration of a band processing unit that performs a process of approximating a higher-order band portion of a spectrum outline to a constant value in a residual outline calculation unit 23;

【図１８】図３の符号器における部分変形例を示すブロ
ック図。FIG. 18 is a block diagram showing a partial modification of the encoder shown in FIG. 3;

【図１９】この発明を適用した符号器及び復号器の他の
例を示すブロック図。Figure 19 is a block diagram showing another example of the encoder and decoder of applying the inventions.

【図２０】時間領域で残差信号を得る構成とした符号器
とそれに対応する復号器の例を示すブロック図。FIG. 20 is a block diagram showing an example of an encoder configured to obtain a residual signal in a time domain and a corresponding decoder.

【図２１】図３、１４、１９及び２０の実施例における
量子化部２５の他の構成例を示すブロック図。FIG. 21 is a block diagram showing another configuration example of the quantization unit 25 in the embodiments of FIGS. 3, 14, 19 and 20;

【図２２】図２１の量子化部による量子化処理手順を示
すフロー図。FIG. 22 is a flowchart showing a quantization processing procedure by the quantization unit in FIG. 21;

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩＧ１０Ｌ 9/18 ＥＭ (56)参考文献特開平４−249300（ＪＰ，Ａ) 特開平３−119398（ＪＰ，Ａ) 特開平７−261800（ＪＰ，Ａ) 特開平５−289696（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) G10L 19/08 G10L 19/00 H03M 7/30 H04B 14/04 G10L 19/08 G10L 19/00 H03M 7/30 H04B 14/04 ＪＩＣＳＴファイル（ＪＯＩＳ)────────────────────────────────────────────────── (5) Continuation of the front page (51) Int.Cl. ⁷ Identification symbol Fig10L 9/18 EM (56) References JP-A-4-249300 (JP, A) JP-A-3-119398 (JP, A JP-A-7-261800 (JP, A) JP-A-5-289696 (JP, A) (58) Fields investigated (Int. Cl. ⁷ , DB name) G10L 19/08 G10L 19/00 H03M 7 / 30 H04B 14/04 G10L 19/08 G10L 19/00 H03M 7/30 H04B 14/04 JICST file (JOIS)

Claims

(57)【特許請求の範囲】(57) [Claims]

【請求項１】残差係数を得る手段（１４，１５，１
６，１７，１８，２１，２２：１４，１５，１６，１
９，２２：１７，１８，２８，２９）と、残差概形計算
手段（２３）と、微細構造量子化手段（２５）と、残差
概形正規化手段（２６）と、逆正規化手段（３１）とを
備え、音響信号を入力し、そのスペクトラム概形を表す
情報と、量子化微細構造係数情報を出力する音響信号符
号化装置であって、上記残差係数を得る手段はフレーム単位の入力音響信号
から上記スペクトラム概形を表す情報を算出し、そのス
ペクトラム概形を表す情報を出力すると共にそのスペク
トラム概形を表す情報を用い、周波数特性の概形が平坦
化された周波数領域の残差係数を算出し、残差概形正規化手段（２６）は上記残差係数を予測残差
概形で正規化して微細構造係数を算出し、微細構造量子化手段（２５）は上記微細構造係数を量子
化して上記量子化微細構造係数情報を出力し、逆正規化手段（３１）は上記量子化微細構造係数情報
を、残差概形計算手段（２３）よりの予測残差概形によ
り正規化して残差係数を再生し、残差概形計算手段（２３）は上記再生された残差係数が
入力され、その過去のフレームの量子化残差係数から次
フレームの残差係数に対する上記予測残差概形を合成す
る音響信号符号化装置。A means for obtaining a residual coefficient (14, 15, 1);
6,17,18,21,22: 14,15,16,1
9, 22: 17, 18, 28, 29), residual shape calculating means (23), fine structure quantization means (25), residual shape normalizing means (26), and inverse normalization Means for inputting an audio signal, and outputting information representing a spectrum outline thereof and quantized fine structure coefficient information, wherein the means for obtaining the residual coefficient is a frame. The frequency domain in which the information representing the spectrum outline is calculated from the input acoustic signal of the unit, the information representing the spectrum outline is output, and the information representing the spectrum outline is used, and the frequency characteristic outline is flattened. The residual approximate normalization means (26) calculates the fine structure coefficient by normalizing the residual coefficient with the predicted residual approximate form, and the fine structure quantization means (25) calculates the fine structure coefficient Quantizing the fine structure coefficient The inverse normalizing means (31) normalizes the quantized fine structure coefficient information by the predicted residual approximate form from the residual approximate form calculating means (23) to reproduce the residual coefficient. The residual approximate shape calculating means (23) receives the reproduced residual coefficient, and synthesizes the predicted residual approximate shape for the next frame residual coefficient from the quantized residual coefficient of the past frame. Signal encoding device.

【請求項２】請求項１記載の符号化装置において、残差概形計算手段（２３）は、上記再生された残差係数
からその包絡を求める手段（３２，３３）と、上記残差
係数の包絡から次フレームの上記残差係数に対する上記
予測残差概形を合成する合成手段（３７）からなる。2. An encoding apparatus according to claim 1, wherein said residual approximate shape calculating means (23) comprises means (32, 33) for obtaining an envelope from said reproduced residual coefficients, and said residual coefficient (37) for synthesizing the predicted residual approximate form for the residual coefficient of the next frame from the envelope of (37).

【請求項３】請求項２記載の符号化装置において、上記合成手段（３７）は、現フレームに対し予め決めた
１つまたは連続する複数の過去のフレームの上記再生さ
れた残差係数の包絡から線形合成により上記予測残差概
形を合成する線形合成手段（３７）である。3. The encoding device according to claim 2, wherein said combining means (37) includes an envelope of said reproduced residual coefficients of one or a plurality of past frames determined in advance for a current frame. A linear synthesis means (37) for synthesizing the above-mentioned approximate shape of the prediction residual by linear synthesis from.

【請求項４】請求項３記載の符号化装置において、残差概形計算手段（２３）は、上記過去のフレームの上
記再生残差係数の包絡から合成する上記予測残差概形
が、現フレームの再生残差係数の包絡を目標としてそれ
に近づくように予測係数を計算し、上記過去のフレーム
の上記残差係数の包絡に対する上記線形合成手段（３
７）の上記予測係数を制御する予測係数計算手段（４
０）を有する。4. The encoding apparatus according to claim 3, wherein the residual approximate shape calculating means (23) calculates the predicted residual approximate shape synthesized from the envelope of the reproduction residual coefficient of the past frame as a current one. The prediction coefficient is calculated so that the envelope of the reproduction residual coefficient of the frame approaches the target, and the linear combination means (3) for the envelope of the residual coefficient of the past frame is calculated.
7) A prediction coefficient calculation means (4) for controlling the prediction coefficient.
0).

【請求項５】残差係数を得る手段（１４，１５，１
６，１７，１８，２１，２２：１４，１５，１６，１
９，２２：１７，１８，２８，２９）と、残差概形計算
手段（２３）と、微細構造量子化手段（２５）と、残差
概形正規化手段（２６）と、逆正規化手段（３１）とを
備え、音響信号を入力し、そのスペクトラム概形を表す
情報と、量子化予測係数情報と、量子化微細構造係数情
報とを出力する音響信号符号化装置であって、上記残差係数を得る手段はフレーム単位の入力音響信号
から上記スペクトラム概形を表す情報を算出し、そのス
ペクトラム概形を表す情報を出力すると共にそのスペク
トラム概形を表す情報を用い、周波数特性の概形が平坦
化された周波数領域の残差係数を算出し、残差概形正規化手段（２６）は上記残差係数を予測残差
概形で正規化して微細構造係数を算出し、微細構造量子化手段（２５）は上記微細構造係数を量子
化して量子化微細構造係数情報を出力し、逆正規化手段（３１）は上記量子化微細構造係数情報
を、残差概形計算手段（２３）よりの予測残差概形によ
り正規化して、残差係数を再生し、残差概形計算手段（２３）は残差係数の包絡を求める第
１，第２手段（３２，３３：３２′，３３′）と線形合
成手段（３７）と、予測係数計算手段（４０）とを有
し、上記残差係数の包絡を求める第１手段（３２，３３）
は、上記再生された残差係数からその包絡を求める手段
であり、上記残差係数の包絡を求める第２手段（３２′，３
３′）は上記残差係数を得る手段で得られた残差係数か
らその包絡を求める手段であり、上記予測係数計算手段（４０）は現フレームの上記残差
係数の包絡及び過去のフレームの上記再生残差係数の包
絡を入力して、予測係数を算出し、かつこれを量子化し
て上記量子化予測係数情報を出力し、上記線形合成手段（３７）は上記再生残差係数の包絡を
入力し、上記予測係数を用いて上記予測残差概形を線形
合成する音響信号符号化装置。5. A means (14, 15, 1) for obtaining a residual coefficient.
6,17,18,21,22: 14,15,16,1
9, 22: 17, 18, 28, 29), residual approximate shape calculation means (23), fine structure quantization means (25), residual approximate shape normalization means (26), and inverse normalization Means for receiving an audio signal and outputting information representing a spectrum outline thereof, quantized prediction coefficient information, and quantized fine structure coefficient information. The means for obtaining the residual coefficient calculates information representing the above-mentioned spectrum outline from the input audio signal in frame units, outputs information representing the spectrum outline, and uses the information representing the spectrum outline to obtain an outline of the frequency characteristic. A residual coefficient in a frequency domain whose shape is flattened is calculated, and a residual approximate normalization means (26) calculates a fine structure coefficient by normalizing the residual coefficient with a predicted residual approximate form, and calculates a fine structure coefficient. The quantization means (25) calculates the fine structure coefficient And outputs quantized fine structure coefficient information. The inverse normalizing means (31) normalizes the quantized fine structure coefficient information by the predicted residual approximate form from the residual approximate form calculating means (23). The residual coefficient calculating means (23) calculates first and second means (32, 33: 32 ', 33') for obtaining an envelope of the residual coefficient, and a linear synthesizing means (37). First means (32, 33) for determining the envelope of the residual coefficient
Are means for obtaining the envelope of the residual coefficient from the reproduced residual coefficient, and second means (32 ', 3) for obtaining the envelope of the residual coefficient.
3 ') means for obtaining the envelope from the residual coefficient obtained by the means for obtaining the residual coefficient. The predictive coefficient calculating means (40) calculates the envelope of the residual coefficient of the current frame and the envelope of the past frame. The envelope of the reproduction residual coefficient is input, a prediction coefficient is calculated, and this is quantized to output the quantized prediction coefficient information. The linear combination means (37) calculates the envelope of the reproduction residual coefficient. An audio signal encoding apparatus which inputs and linearly synthesizes the prediction residual outline using the prediction coefficient.

【請求項６】請求項５記載の符号化装置において、上記予測係数計算手段（４０）は、上記過去のフレーム
の上記再生残差係数の包絡から合成する上記予測残差概
形が、現フレームの残差係数の包絡を目標としてそれに
近づくように予測係数を計算し、上記過去のフレームの
上記残差係数の包絡に対する上記線形合成手段（３７）
の上記予測係数を制御する手段である。6. The encoding apparatus according to claim 5, wherein said prediction coefficient calculation means (40) is configured such that said prediction residual approximate form synthesized from an envelope of said reproduction residual coefficient of said past frame is a current frame. The prediction coefficient is calculated so that the envelope of the residual coefficient approaches the target, and the linear combination means (37) for the envelope of the residual coefficient of the past frame.
Means for controlling the prediction coefficient.

【請求項７】請求項６記載の符号化装置において、上記予測係数計算手段（４０）における上記線形合成を
最適制御するための予測係数は、現フレームの上記残差
係数の包絡を目標に決定され、決定された最適制御は現
フレームにおける上記予測残差概形の線形合成に適用さ
れる。7. The encoding device according to claim 6, wherein a prediction coefficient for optimally controlling the linear combination in the prediction coefficient calculation means (40) is determined with a target of an envelope of the residual coefficient of a current frame. And the determined optimal control is applied to a linear combination of the prediction residual profile in the current frame.

【請求項８】請求項４または７記載の符号化装置にお
いて、上記線形合成手段（３７）は、上記過去のフレームの上
記再生残差係数の包絡に対してそれぞれ予測係数を乗算
し、それによって得られた乗算結果を加算して上記予測
残差概形を得る手段であり、上記予測係数計算手段（４
０）は、上記加算結果が上記目標に近づくよう上記予測
係数を決定する手段である。8. The encoding device according to claim 4, wherein said linear combination means (37) multiplies the envelope of said reproduction residual coefficient of said past frame by a prediction coefficient, respectively. This is a means for adding the obtained multiplication results to obtain the above-mentioned prediction residual approximate form.
0) is a means for determining the prediction coefficient so that the addition result approaches the target.

【請求項９】請求項８記載の符号化装置において、上記線形合成手段（３７）は各上記過去のフレームのサ
ンプル群に対し、周波数軸上において少なくとも１サン
プル正方向及び負方向にそれぞれずれた第１シフトサン
プル群と第２シフトサンプル群を生成する手段（７ｐ₁
〜７ｐ₄，７ｎ₁〜７ｎ₄）と、上記第１及び第２シフ
トサンプル群にそれぞれ予測係数を乗算する手段（８ｐ
₁〜８ｐ₄，８ｎ₁〜８ｎ₄）と、これら乗算結果を上
記過去のフレームに対する予測係数の乗算結果と共に全
て加算することによって上記予測残差概形を得る手段を
有する。9. The encoding apparatus according to claim 8, wherein said linear synthesizing means (37) is shifted at least one sample in the positive and negative directions on the frequency axis with respect to the sample group of each of said past frames. Means for generating a first shift sample group and a second shift sample group (7p ₁
-7p ₄ , 7n ₁ -7n ₄ ) and means (8p
_{1 to} 8p ₄ , 8n _{1 to} 8n ₄ ), and a means for obtaining the approximate form of the prediction residual by adding all the multiplication results together with the multiplication result of the prediction coefficient for the past frame.

【請求項１０】請求項３，５乃至９のいずれかに記載
の符号化装置において、上記残差概形計算手段（２３）は、上記再生残差係数か
ら求めた上記残差係数の包絡の現在及び過去の複数フレ
ーム間平均値を各サンプルごとに、または現フレーム内
サンプルの平均値を求める手段（４１）と、上記平均値
を現フレームの上記残差係数の包絡から減算し、その減
算結果を上記残差係数の包絡として上記線形合成手段
（３７）に与える減算手段（４２）と、上記線形合成手
段（３７）の合成結果に上記平均値を加算し、その加算
結果から上記予測残差概形を得る加算手段（４３）とを
含む。10. The encoding apparatus according to claim 3, wherein said residual approximate shape calculating means (23) calculates an envelope of said residual coefficient obtained from said reproduction residual coefficient. Means (41) for calculating the average value of the current and past multiple frames for each sample or the average value of the samples in the current frame, and subtracting the average value from the envelope of the residual coefficient of the current frame, A subtraction means (42) for giving the result to the linear combination means (37) as an envelope of the residual coefficient, and the average value is added to the combined result of the linear combination means (37). Adding means (43) for obtaining a difference outline.

【請求項１１】請求項１０記載の符号化装置におい
て、上記減算手段（４２）の減算結果のフレーム内平均振幅
を計算する振幅検出手段（４４）と、上記振幅検出手段
（４４）で得た上記減算結果の平均振幅で上記減算手段
（４２）の上記減算結果を割算し、その割算結果を上記
残差係数の包絡として上記線形合成手段（３７）に与え
る割算手段（４５）と、上記線形合成手段（３７）の合
成結果に上記振幅検出手段（４４）の上記減算結果の平
均振幅を乗算し、その乗算結果を上記線形合成の結果と
して上記加算手段（４３）に与える乗算手段（４６）と
を有する。11. An encoding device according to claim 10, wherein said amplitude detection means (44) calculates an average amplitude in a frame of a subtraction result of said subtraction means (42), and said amplitude detection means (44) obtains said difference. Dividing means (45) which divides the result of the subtraction by the subtracting means (42) by the average amplitude of the result of the subtraction and gives the result of the division to the linear synthesizing means (37) as an envelope of the residual coefficient; Multiplying means for multiplying the combined result of the linear combining means (37) by the average amplitude of the subtraction result of the amplitude detecting means (44) and giving the multiplied result to the adding means (43) as the result of the linear combining; (46).

【請求項１２】請求項３、５乃至１１のいずれかに記
載の符号化装置において、上記残差係数の包絡を求める手段（３２，３３）は上記
再生残差係数のスペクトラムの振幅を計算する手段（３
２）と、上記スペクトラム振幅に対し窓関数を畳み込
み、上記残差係数の包絡を求める手段（３３）とよりな
る。12. The encoding apparatus according to claim 3, wherein the means (32, 33) for calculating the envelope of the residual coefficient calculates the amplitude of the spectrum of the reproduced residual coefficient. Means (3
2) and means (33) for convolving a window function with the spectrum amplitude to obtain the envelope of the residual coefficient.

【請求項１３】請求項３，５乃至１２のいずれかに記
載の符号化装置において、上記残差概形計算手段（２３）は、上記線形合成手段
（３７）の合成結果に予め決めた定数を加算して上記予
測残差概形を得る定数加算手段（３８）を含む。13. The encoding apparatus according to claim 3, wherein said residual approximate shape calculating means (23) includes a constant determined in advance by a combination result of said linear combination means (37). And a constant adding means (38) for obtaining the above-mentioned predicted residual approximate form.

【請求項１４】請求項４乃至９のいずれかに記載の符
号化装置において、上記残差概形計算手段（２３）は、入力される上記再生
残差係数を複数の小帯域に分割し、この小帯域ごとに行
う手段である。14. The encoding apparatus according to claim 4, wherein the residual approximate shape calculating means (23) divides the input reproduction residual coefficient into a plurality of small bands, This means is performed for each small band.

【請求項１５】残差係数を得る手段（１４，１５，１
６，１７，１８，２１，２２：１４，１５，１６，１
９，２２：１７，１８，２８，２９）と、残差概形計算
手段（２３）と、微細構造量子化手段（２５）と、残差
概形正規化手段（２６）とを備え、音響信号を入力し、
そのスペクトラム概形を表す情報と、量子化微細構造係
数情報と、量子化残差係数概形情報とを出力する音響信
号符号化装置であって、上記残差係数を得る手段はフレーム単位の入力音響信号
から上記残差係数の包絡を表す情報を算出し、そのスペ
クトラム概形を表す情報を出力すると共にそのスペクト
ラム概形を表す情報を用い、周波数特性の概形が平坦化
された周波数領域の残差係数を算出し、残差概形正規化手段（２６）は上記残差係数を予測残差
概形で正規化して微細構造係数を算出し、微細構造量子化手段（２５）は上記微細構造係数を量子
化して上記量子化微細構造係数情報を出力し、残差概形計算手段（２３）は上記残差係数が入力され、
上記予測残差概形を算出し、かつその予測残差概形を量
子化して上記量子化残差係数概形情報を出力する音響信
号符号化装置。15. A means (14, 15, 1) for obtaining a residual coefficient.
6,17,18,21,22: 14,15,16,1
9, 22: 17, 18, 28, 29), a residual shape calculating means (23), a fine structure quantization means (25), and a residual shape normalizing means (26), Input the signal,
An acoustic signal encoding apparatus for outputting information representing the spectrum outline, quantized fine structure coefficient information, and quantized residual coefficient outline information, wherein the means for obtaining the residual coefficient is an input for each frame. The information representing the envelope of the residual coefficient is calculated from the acoustic signal, the information representing the spectrum outline is output and the information representing the spectrum outline is used, and the frequency characteristic outline is flattened. The residual coefficient normalization means (26) calculates the fine structure coefficient by normalizing the residual coefficient with the predicted residual approximate form, and the fine structure quantization means (25) calculates the fine structure coefficient. The structure coefficient is quantized, and the quantized fine structure coefficient information is output. The residual shape calculating means (23) receives the residual coefficient,
An audio signal encoding device that calculates the predicted residual approximate shape and quantizes the predicted residual approximate shape to output the quantized residual coefficient approximate shape information.

【請求項１６】請求項１５の符号化装置において、上記残差係数が入力されてその包絡を求める手段（３
２，３３）と、上記残差係数の包絡を参照信号とし、線
形合成手段（３７）の出力を比較信号として入力し、上
記量子化残差係数概形情報と、その量子化残差係数概形
情報の再生情報を出力する量子化手段（３０）と、上記
再生情報を入力し、その現フレームと過去のフレームと
を線形合成する上記線形合成手段（３７）とを備え、上記線形合成手段（３７）の出力が上記残差係数の包絡
と最も近くなるように、上記現フレームの量子化残差係
数概形情報を決定し、その時の線形合成手段（３７）の
出力を上記予測残差概形として得ると共に、その時の量
子化残差概形情報を出力する。16. An encoding apparatus according to claim 15, wherein said residual coefficient is inputted and an envelope thereof is obtained.
2, 33) and the envelope of the residual coefficient as a reference signal, and the output of the linear combining means (37) is input as a comparison signal, and the above-mentioned quantized residual coefficient approximate information and the quantized residual coefficient approximate information are input. A quantizing means (30) for outputting reproduction information of the shape information; and a linear synthesizing means (37) for inputting the reproduction information and linearly synthesizing the current frame and the past frame. The quantized residual coefficient outline information of the current frame is determined so that the output of (37) is closest to the envelope of the residual coefficient, and the output of the linear combining means (37) at that time is determined by the prediction residual. An approximate shape is obtained, and the quantized residual approximate shape information at that time is output.

【請求項１７】請求項１５の符号化装置において、上記残差係数が入力されてその包絡を求める手段（３
２，３３）と、上記残差係数の包絡を参照信号とし、線
形合成手段（３７）の出力を比較信号として入力し、上
記量子化残差係数概形情報と、その量子化残差係数概形
情報の再生情報を出力する量子化手段（３０）と、上記
再生情報と線形合成手段（３７）の出力を入力し、現フ
レームの再生情報と過去のフレームの出力とを線形合成
する上記線形合成手段（３７）とを備え、上記線形合成手段（３７）の出力が上記残差係数の包絡
と最も近くなるように、上記現フレームの量子化残差係
数概形情報を決定し、その時の線形合成手段（３７）の
出力を上記予測残差概形として得ると共に、その時の量
子化残差概形情報を出力する。17. The encoding apparatus according to claim 15, wherein said residual coefficient is inputted and an envelope thereof is obtained.
2, 33) and the envelope of the residual coefficient as a reference signal, and the output of the linear combining means (37) is input as a comparison signal, and the above-mentioned quantized residual coefficient approximate information and the quantized residual coefficient approximate information are input. A quantization means (30) for outputting reproduction information of the shape information; and a linear means for inputting the reproduction information and the output of the linear synthesizing means (37) and linearly synthesizing the reproduction information of the current frame and the output of the past frame. Synthesis means (37), and determines the quantized residual coefficient outline information of the current frame so that the output of the linear synthesis means (37) is closest to the envelope of the residual coefficient. The output of the linear synthesizing means (37) is obtained as the above-mentioned predicted residual approximate shape, and the quantized residual approximate shape information at that time is output.

【請求項１８】請求項４乃至１４のいずれかに記載の
符号化装置において、上記予測係数の値から系の安定性を判断する手段を有
し、その判断が不安定であると、予測係数を予め与えら
れた値に設定するか、残差概形構成方法を切り替え、予
め決められた残差概形に切り替えるなど系の安定性が増
すように処理する手段を有する。18. The encoding apparatus according to claim 4, further comprising: means for judging the stability of the system from the value of the prediction coefficient. Is set to a given value, or the method is switched to a residual shape forming method, and is switched to a predetermined residual shape.

【請求項１９】請求項１乃至１８のいずれかに記載の
符号化装置において、パワー正規化手段（２７）を有し、パワー正規化手段
（２７）は上記微細構造係数を入力し、その振幅の平均
値またはパワーの平均値の平方根である正規化ゲインで
上記微細構造係数を割算して正規化微細構造係数を上記
微細構造量子化手段（２５）へ出力し、かつ上記正規化
ゲインを量子化し、量子化ゲイン情報を出力する。19. The encoding apparatus according to claim 1, further comprising a power normalizing means (27), wherein the power normalizing means (27) receives the fine structure coefficient, and The fine structure coefficient is divided by a normalized gain which is a square root of the average value or the average value of the power, and the normalized fine structure coefficient is output to the fine structure quantization means (25). Quantize and output quantization gain information.

【請求項２０】請求項２乃至１９のいずれかに記載の
符号化装置において、残差概形計算手段（２３）に帯域処理手段（４７）が設
けられ、帯域処理手段（４７）は残差係数の包絡を少な
くとも２つ以上の帯域に分割手段（４７Ａ）で分割し、
その高域についてその残差係数の包絡の振幅を一定値に
平均手段（４７Ｂ）で近似する。20. The encoding device according to claim 2, wherein the residual shape calculating means (23) is provided with a band processing means (47), and the band processing means (47) is provided with a residual processing means. The envelope of the coefficient is divided into at least two or more bands by the dividing means (47A),
The amplitude of the envelope of the residual coefficient in the high band is approximated to a constant value by the averaging means (47B).

【請求項２１】請求項２０記載の符号化装置におい
て、帯域処理手段（４７）は残差係数の包絡を高域、中域、
低域の３帯域に分割し、その中域について、残差係数の
包絡の振幅を、その高域の一定値と、低域の最高周波数
の振幅との間を補間する漸近手段（４７Ｃ）を有する。21. The encoding apparatus according to claim 20, wherein the band processing means (47) converts the envelope of the residual coefficient into a high band, a middle band,
Asymptotic means (47C) which divides the amplitude of the envelope of the residual coefficient into three bands of the low band and interpolates between the constant value of the high band and the amplitude of the highest frequency of the low band. Have.

【請求項２２】請求項１乃至２１の何れかに記載の符
号化装置において、微細構造量子化手段（２５）はフレームごとにＮ個のサ
ンプルからなる微細構造係数を、Ｎ／Ｍサンプル（Ｎ＞
Ｍ＞２，Ｎ，Ｍ共に整数）からなるＭ個の小系列に分割
する手段と、上記各小系列についてそれぞれ距離が最小
となるベクトルを符号帳から選択して、Ｍ個の微細構造
係数を反映する量子化微細構造係数情報を出力する手段
とを有する。22. The encoding apparatus according to claim 1, wherein the fine structure quantization means (25) calculates a fine structure coefficient composed of N samples for each frame by N / M samples (N / M samples). >
Means for dividing into M subsequences of M > 2, N and M), and a vector having the minimum distance for each of the subsequences is selected from the codebook, and the M fine structure coefficients are calculated. Means for outputting quantized microstructure coefficient information to be reflected.

【請求項２３】請求項２２記載の符号化装置におい
て、上記小系列のサンプルとベクトルの差と、重み付け係数
との内積が上記距離とされている。23. The encoding apparatus according to claim 22, wherein the inner product of the difference between the sample of the small series and the vector and a weighting coefficient is the distance.

【請求項２４】請求項１乃至２３記載の符号化装置に
おいて、微細構造量子化手段（２５）は上記微細構造係数をある
量子化幅でスカラー量子化する手段（２５Ａ）と、上記
微細構造係数の上記スカラー量子化値からの誤差を計算
する手段（２５Ｂ）と、上記スカラー量子化誤差に基づ
き聴感的に許容される値より小さいか否かを判断する手
段（２５Ｃ）と、聴感的に許容される値より小さいと判
断されたときに上記微細構造係数を表現するために使用
した量子化幅を示す情報を出力し、大きいと判断された
ときに上記微細構造係数を表現するための情報量が使用
可能な情報量を超えていないか否かを判断して、超えて
いない場合は現在よりも小さい量子化幅に更新し、超え
ている場合には更新を行わずに、上記量子化幅を示す情
報を出力する量子化幅制御手段（２５Ｄ）を有する。24. The encoding apparatus according to claim 1, wherein said fine structure quantization means (25) scalar-quantizes said fine structure coefficient with a certain quantization width, and said fine structure coefficient. Means (25B) for calculating an error from the scalar quantization value, and means (25C) for determining whether or not the value is less than an audibly acceptable value based on the scalar quantization error. When the value is determined to be smaller than the value to be output, information indicating the quantization width used to represent the fine structure coefficient is output, and when the value is determined to be larger, the information amount for expressing the fine structure coefficient Judge whether or not exceeds the available information amount, if it does not exceed, update to a smaller quantization width than the current one, otherwise, do not update, the quantization width Output information indicating That has a quantization width control means (25D).

【請求項２５】請求項１乃至２４のいずれかに記載の
符号化装置において、上記残差係数を得る手段は、上記
入力音響信号を周波数領域係数に変換する手段（１６）
と、上記入力音響信号をフレーム毎に線形予測分析して
線形予測係数を求める手段（１７）と、上記線形予測係
数を周波数領域係数に変換して上記音響信号の残差係数
の包絡を求める手段（１８，２１）と、上記音響信号の
周波数領域係数を上記残差係数の包絡で正規化して上記
残差係数を得る手段（２２）とよりなる。25. The encoding device according to claim 1, wherein the means for obtaining the residual coefficient converts the input audio signal into a frequency domain coefficient.
Means (17) for performing linear prediction analysis on the input audio signal for each frame to obtain a linear prediction coefficient, and means for converting the linear prediction coefficient into a frequency domain coefficient and obtaining an envelope of a residual coefficient of the audio signal (18, 21) and means (22) for obtaining the residual coefficient by normalizing the frequency domain coefficient of the audio signal by the envelope of the residual coefficient.

【請求項２６】請求項１乃至２４のいずれかに記載の
符号化装置において、上記残差係数を得る手段は、上記
入力音響信号を周波数領域係数に変換する手段（１６）
と、上記周波数領域係数のスペクトラム振幅を時間領域
信号に逆変換する手段（８２）と、上記時間領域信号を
線形予測分析して線形予測係数を求める手段（１７）
と、上記線形予測係数を周波数領域係数に変換して上記
音響信号の残差係数の包絡を求める手段（１８，２１）
と、上記音響信号の周波数領域係数を上記残差係数の包
絡で正規化して上記残差係数を得る手段（２２）とより
なる。26. The encoding device according to claim 1, wherein the means for obtaining the residual coefficient converts the input audio signal into a frequency domain coefficient.
Means (82) for inversely transforming the spectrum amplitude of the frequency domain coefficient into a time domain signal; means (17) for linear prediction analysis of the time domain signal to obtain a linear prediction coefficient
Means for converting the linear prediction coefficient into a frequency domain coefficient to obtain an envelope of a residual coefficient of the acoustic signal (18, 21)
Means (22) for normalizing the frequency domain coefficients of the acoustic signal by the envelope of the residual coefficients to obtain the residual coefficients.

【請求項２７】請求項２５または２６記載の符号化装
置において、上記線形予測係数を周波数領域係数に変換
する手段は、上記線形予測係数を量子化して量子化線形
予測係数を求め、上記量子化線形予測係数を上記スペク
トラム概形を表す情報として出力する手段（１８）と、
上記量子化線形予測係数を上記線形予測係数として上記
周波数領域係数に変換する手段とよりなる。27. The encoding apparatus according to claim 25, wherein the means for converting the linear prediction coefficient into a frequency domain coefficient quantizes the linear prediction coefficient to obtain a quantized linear prediction coefficient. Means (18) for outputting the linear prediction coefficient as information representing the spectrum outline,
Means for converting the quantized linear prediction coefficients as the linear prediction coefficients into the frequency domain coefficients.

【請求項２８】請求項１乃至２４のいずれかに記載の
符号化装置において、上記残差係数を得る手段は、上記入力音響信号を周波数
領域係数に変換し、そのスケーリングファクタを量子化
して上記スペクトラム概形を表す情報として出力する手
段（１６）と、上記周波数領域係数を複数の小帯域に分
割してそれぞれのスケーリングファクタを求める手段
（１９）と、上記入力音響信号の周波数領域係数を上記
スケーリングファクタで正規化して上記残差係数を得る
手段（２２）とよりなる。28. The encoding apparatus according to claim 1, wherein the means for obtaining the residual coefficient converts the input audio signal into a frequency domain coefficient, quantizes a scaling factor of the input audio signal, and quantizes the scaling factor. Means (16) for outputting as information representing a spectrum outline, means (19) for dividing the frequency domain coefficient into a plurality of sub-bands to determine respective scaling factors, and Means (22) for normalizing with a scaling factor to obtain the residual coefficient.

【請求項２９】請求項１乃至２４のいずれかに記載の
符号化装置において、上記残差係数を得る手段は、上記入力音響信号を線形予
測分析して線形予測係数を求める手段（１７）と、上記
線形予測係数で制御された逆フィルタに上記音響信号を
通して残差信号を得る手段（２８）と、上記残差信号を
周波数領域係数に変換して上記残差係数を得る手段（２
９）とよりなる。29. The encoding apparatus according to claim 1, wherein the means for obtaining the residual coefficient includes means for performing linear prediction analysis on the input audio signal to obtain a linear prediction coefficient. Means (28) for obtaining a residual signal through the acoustic signal through an inverse filter controlled by the linear prediction coefficient, and means (2) for converting the residual signal into frequency domain coefficients to obtain the residual coefficient
9).

【請求項３０】請求項２９記載の符号化装置におい
て、上記残差信号を得る手段は、上記線形予測係数を量子化
し、その量子化によって得られた量子化線形予測係数を
上記線形予測係数として上記逆フィルタに与えてそれを
制御すると共に、上記量子化線形予測係数を上記スペク
トラム概形を表す情報として出力する手段（１８）であ
る。30. The encoding apparatus according to claim 29, wherein the means for obtaining the residual signal quantizes the linear prediction coefficient, and uses a quantized linear prediction coefficient obtained by the quantization as the linear prediction coefficient. A means (18) for providing the inverse filter to control the same and outputting the quantized linear prediction coefficient as information representing the spectrum outline.

【請求項３１】請求項１乃至３０のいずれかに記載の
符号化装置において、上記入力音響信号を周波数領域係数に変換する手段（１
６，２９）は、上記入力音響信号をフレーム単位に重ね
合わせ直交変換する手段である。31. The encoding device according to claim 1, wherein the input audio signal is converted into a frequency domain coefficient.
6, 29) are means for superimposing and orthogonally transforming the input audio signals in frame units.

【請求項３２】逆量子化手段（５１）と、残差概形逆
正規化手段（５４）と、残差概形計算手段（５５）と、
逆平坦化手段（５６，５７，５８，５９，６１：５７，
５８，５９，６１，７１：７２，７３，７４）とを備
え、スペクトラム概形を表す情報と、量子化微細構造係
数情報とをフレームごとに入力し、再生音響信号を出力
する音響信号復号化装置であって、逆量子化手段（５１）は上記量子化微細構造係数情報を
微細構造係数に再生し、残差概形逆正規化手段（５４）は上記再生された微細構
造係数を残差概形で逆正規化して残差係数を再生し、残差概形計算手段（５５）は上記再生された残差係数の
過去のフレームのものから上記残差概形を予測し、上記逆平坦化手段は上記スペクトラム概形を表す情報と
上記再生された残差係数とからその残差係数に音響信号
の周波数特性の概形が与えられた時間領域の音響信号を
再生する音響信号復号化装置。32. An inverse quantization means (51), a residual approximate inverse normalizing means (54), a residual approximate shape calculation means (55),
Inverse flattening means (56, 57, 58, 59, 61:57,
58, 59, 61, 71: 72, 73, 74), an audio signal decoding for inputting information representing a spectrum outline and quantized fine structure coefficient information for each frame and outputting a reproduced audio signal. An inverse quantization means (51) for reproducing the quantized fine structure coefficient information into a fine structure coefficient, and a residual approximate inverse normalization means (54) for converting the reproduced fine structure coefficient to a residual The residual coefficient is reconstructed by inverse normalization with the approximate form, and the residual approximate form calculating means (55) predicts the residual approximate form from the past frame of the reproduced residual coefficient, and performs the inverse flattening. An audio signal decoding apparatus for reproducing an audio signal in a time domain in which the residual coefficient is given an outline of the frequency characteristic of the audio signal from the information representing the spectrum outline and the reproduced residual coefficient. .

【請求項３３】請求項３２記載の復号化装置におい
て、上記残差概形計算手段（５５）は、上記再生残差係数の
包絡を計算する手段（３２，３３）と、現フレームに対
し、予め決めた１または連続する複数の過去のフレーム
の上記残差係数の包絡にそれぞれ予測係数を乗算して線
形合成により現フレームの上記残差概形を得る線形合成
手段（３７）とを有する。33. The decoding apparatus according to claim 32, wherein said residual shape calculating means (55) calculates means for calculating an envelope of said reproduction residual coefficients, Linear synthesis means (37) for multiplying each of the envelopes of the residual coefficients of one or a plurality of past frames determined in advance by prediction coefficients to obtain the residual approximate form of the current frame by linear synthesis.

【請求項３４】請求項３３記載の復号化装置におい
て、上記線形合成手段（３７）は、上記線形合成により得ら
れる上記残差概形が現在フレームにおける上記再生残差
係数の概形にもっとも近くなるように上記予測係数を適
応的に制御する手段である。34. The decoding device according to claim 33, wherein said linear combination means (37) is configured such that said residual approximate shape obtained by said linear combination is closest to an approximate shape of said reproduction residual coefficient in a current frame. This is a means for adaptively controlling the prediction coefficient so as to be as follows.

【請求項３５】逆量子化手段（５１）と、残差概形逆
正規化手段（５４）と、残差概形計算手段（５５）と、
逆平坦化手段（５６，５７，５８，５９，６１：５７，
５８，５９，６１，７１：７２，７３，７４）とを備
え、スペクトラム概形を表す情報と、量子化微細構造係
数情報と、量子化予測係数とをフレームごとに入力し、
再生音響信号を出力する音響信号復号化装置であって、逆量子化手段（５１）は上記量子化微細構造係数情報を
微細構造係数に再生し、残差概形逆正規化手段（５４）は上記再生された微細構
造係数を残差概形で逆正規化して残差係数を再生し、残差概形計算手段（５５）は残差係数の包絡を計算する
手段（６３，６４）と、予測係数逆量子化手段（６０）
と、線形合成手段（６２）とを備え、残差係数の包絡を計算する手段（６３，６４）は、上記
再生残差係数の包絡を計算し、予測係数逆量子化手段（６０）は上記量子化予測係数を
逆量子化して予測係数を出力し、上記線形合成手段（６２）は上記再生残差係数の包絡を
入力して、上記予測係数を用いて上記残差概形を線形合
成し、上記逆平坦化手段は上記スペクトラム概形を表す情報
と、上記再生された残差係数とからその残差係数に音響
信号の周波数特性の概形が与えられた時間領域の音響信
号を再生する音響信号復号化装置。35. An inverse quantizer (51), a residual approximate inverse normalizer (54), a residual approximate shape calculator (55),
Inverse flattening means (56, 57, 58, 59, 61:57,
58, 59, 61, 71: 72, 73, 74), and inputs information representing a spectrum outline, quantized fine structure coefficient information, and quantized prediction coefficients for each frame,
An audio signal decoding device for outputting a reproduced audio signal, wherein an inverse quantization means (51) reproduces the quantized fine structure coefficient information into a fine structure coefficient, and a residual approximate inverse normalization means (54) The reconstructed fine structure coefficient is denormalized with a residual approximation to regenerate the residual coefficient, and a residual approximation calculating means (55) calculates an envelope of the residual coefficient (63, 64); Predictive coefficient inverse quantization means (60)
And a linear synthesis means (62), wherein the means (63, 64) for calculating the envelope of the residual coefficient calculate the envelope of the reproduction residual coefficient, and the predictive coefficient inverse quantization means (60) The quantized prediction coefficient is inversely quantized to output a prediction coefficient. The linear synthesis means (62) receives the envelope of the reproduction residual coefficient, and linearly synthesizes the residual approximate shape using the prediction coefficient. The inverse flattening means reproduces a sound signal in a time domain in which the residual coefficient is given an outline of the frequency characteristic of the audio signal from the information representing the spectrum outline and the reproduced residual coefficient. Sound signal decoding device.

【請求項３６】請求項３３，３４または３５記載の復
号化装置において、上記残差概形計算手段（５５）は、上記再生残差係数か
ら求めた上記残差係数の包絡の現在及び過去の複数フレ
ーム間平均値を各サンプルごとに、またはフレーム内サ
ンプルの平均値を求める平均値計算手段（４１）と、上
記平均値を現フレームの上記残差係数の包絡から減算
し、その減算結果を上記残差係数の包絡として上記線形
合成手段（３７）に与える減算手段（４２）と、上記線
形合成手段（３７）の合成結果に上記平均値を加算して
上記予測残差概形を得る加算手段（４３）とを含む。36. The decoding apparatus according to claim 33, wherein the residual approximate shape calculating means (55) includes a current and a past of an envelope of the residual coefficient obtained from the reproduction residual coefficient. An average value calculating means (41) for obtaining an average value between a plurality of frames for each sample or an average value of samples within a frame; subtracting the average value from the envelope of the residual coefficient of the current frame; A subtraction means (42) for giving the envelope of the residual coefficient to the linear combination means (37); and an addition for adding the average value to the combined result of the linear combination means (37) to obtain the predicted residual approximate form. Means (43).

【請求項３７】請求項３６記載の復号化装置におい
て、上記残差概形計算手段（５５）は、上記減算手段（４
２）の減算結果のフレーム内平均振幅を計算する振幅検
出手段（４４）と、上記平均振幅で上記減算手段（４
２）の減算結果を割算し、その割算結果を上記残差係数
の包絡として上記線形合成手段（３７）に与える割算手
段（４５）と、上記線形合成手段（３７）の合成結果に
上記減算結果の平均振幅を乗算し、その乗算結果を上記
線形合成の結果として上記加算手段（４３）に与える乗
算手段（４６）とを有する。37. The decoding apparatus according to claim 36, wherein said residual approximate shape calculating means (55) is provided with said subtraction means (4).
2) an amplitude detecting means (44) for calculating an average amplitude within the frame as a result of the subtraction, and the subtracting means (4) using the average amplitude.
2) dividing the result of the subtraction, and applying the result of the division to the linear combination means (37) as an envelope of the residual coefficient to the linear combination means (37); Multiplying means (46) for multiplying the average amplitude of the subtraction result and for giving the multiplication result to the adding means (43) as a result of the linear combination.

【請求項３８】請求項３３乃至３７のいずれかに記載
の復号化装置において、残差概形計算手段（５５）は上記再生残差係数の振幅を
計算する手段（３２：６３）と、そのスペクトラム振幅
に窓関数を畳み込み、上記残差係数の包絡を得る手段
（３３：６４）とを有する。38. The decoding apparatus according to claim 33, wherein the residual shape calculating means (55) calculates the amplitude of the reproduction residual coefficient (32:63), and Means for convolving a window function with the spectrum amplitude to obtain an envelope of the residual coefficient (33:64).

【請求項３９】請求項３３乃至３８のいずれかに記載
の復号化装置において、上記線形合成手段は各上記過去のフレームのサンプル群
に対し、周波数軸上において少なくとも１サンプル正方
向及び負方向にずれた第１シフトサンプル群と第２シフ
トサンプル群を生成する手段（７ｐ₁〜７ｐ₄，７ｎ₁
〜７ｎ₄）と、上記第１及び第２シフトサンプル群にそ
れぞれ予測係数を乗算する手段（８ｐ₁〜８ｐ₄，８ｎ
₁〜８ｎ₄）と、これら乗算結果を上記過去の各フレー
ムに対する予測係数の乗算結果と共に全て加算すること
により上記予測残差概形を得る加算手段（３４）とを有
する。39. The decoding apparatus according to claim 33, wherein said linear combination means adds at least one sample in a positive direction and a negative direction on a frequency axis to a sample group of each of said past frames. means for generating shifted to the first shift sample group a second shift sample group (7p _₁ ~7p _4, 7n ₁
To 7n ₄ ) and means (8p _{1 to} 8p ₄ , 8n) for multiplying the first and second shift sample groups by prediction coefficients, respectively.
_{1 to} 8n ₄ ), and an adder (34) for adding the multiplication results together with the multiplication results of the prediction coefficients for the past frames to obtain the prediction residual approximate form.

【請求項４０】請求項３３乃至３９のいずれかに記載
の復号化装置において、残差概形計算手段（５５）は上
記線形合成手段（３７）の合成結果に予め決めた定数を
加算して上記予測残差概形を得る定数加算手段（３８）
を有する。40. The decoding apparatus according to claim 33, wherein said residual approximate shape calculating means (55) adds a predetermined constant to the synthesis result of said linear synthesis means (37). Constant adding means (38) for obtaining the above-mentioned predicted residual outline
Having.

【請求項４１】請求項３３乃至４０のいずれかに記載
の復号化装置において、上記線形合成手段（３７）の制
御は、上記残差係数の包絡を複数の小帯域に分割し、こ
の小帯域ごとに行う。41. The decoding device according to claim 33, wherein the control of the linear combination means (37) divides the envelope of the residual coefficient into a plurality of small bands, Do it every time.

【請求項４２】逆量子化手段（５１）と、残差概形逆
正規化手段（５４）と、残差概形計算手段（５５）と、
逆平坦化手段（５６，５７，５８，５９，６１：５７，
５８，５９，６１，７１：７２，７３，７４）とを備
え、スペクトラム概形を表す情報と、量子化残差概形情
報と、量子化微細構造係数情報とをフレームごとに入力
し、再生音響信号を出力する音響信号復号化装置であっ
て、逆量子化手段（５１）は上記量子化微細構造係数を微細
構造係数に再生し、残差概形計算手段（５５）は上記量子化残差概形情報を
残差概形に再生し、残差概形逆正規化手段（５４）は上記再生された微細構
造係数を上記残差概形で逆正規化して残差係数を再生
し、上記逆平坦化手段は上記スペクトラム概形を表す情報
と、上記再生された残差係数とからその残差係数に音響
信号の周波数特性の概形が与えられた時間領域の音響信
号を再生する音響信号復号化装置。42. An inverse quantization means (51), a residual approximate inverse normalizing means (54), a residual approximate shape calculation means (55),
Inverse flattening means (56, 57, 58, 59, 61:57,
58, 59, 61, 71: 72, 73, 74), and inputs information representing a spectrum outline, quantization residual outline information, and quantized fine structure coefficient information for each frame, and reproduces them. An audio signal decoding device for outputting an audio signal, wherein an inverse quantization means (51) reproduces the quantized fine structure coefficient into a fine structure coefficient, and a residual approximate calculation means (55) outputs the quantization residual Regenerating the difference shape information into a residual shape; a residual shape denormalization means (54) denormalizes the reconstructed fine structure coefficient with the residual shape to reconstruct a residual coefficient; The inverse flattening means reproduces a sound signal in a time domain in which the residual coefficient is given an outline of the frequency characteristic of the audio signal from the information representing the spectrum outline and the reproduced residual coefficient. Signal decoding device.

【請求項４３】請求項４２記載の復号化装置におい
て、上記残差概形計算手段（５５）は上記量子化残差概
形情報を逆量子化する手段（６０）と、その逆量子化に
より得た現フレームと過去のフレームの逆量子化された
係数を線形合成して上記残差概形を得る線形合成手段
（６２）とを有する。43. The decoding apparatus according to claim 42, wherein said residual approximate shape calculating means (55) comprises means for inversely quantizing said quantized residual approximate information, and said inverse quantizing means. A linear synthesizing means (62) for linearly synthesizing the dequantized coefficients of the obtained current frame and the past frame to obtain the residual approximate shape;

【請求項４４】請求項４２記載の復号化装置におい
て、上記残差概形計算手段（５５）は上記量子化残差概
形情報を逆量子化する手段（６０）と、過去のフレーム
における残差概形と、現フレームの上記逆量子化された
係数を線形合成して上記残差概形を得る線形合成手段
（６２）とを有する。44. The decoding apparatus according to claim 42, wherein said residual approximate shape calculating means (55) includes means for inversely quantizing said quantized residual approximate shape information, and a residual in a past frame. Linear synthesis means (62) for linearly synthesizing the dequantized coefficients of the current frame and the dequantized coefficients of the current frame to obtain the residual outline.

【請求項４５】請求項３２乃至４４のいずれかに記載
の復号化装置において、入力された量子化ゲイン情報を再生して正規化ゲインを
得る手段（５２）と、上記再生された微細構造係数に上
記正規化ゲインを乗じて逆正規化し、逆正規化された微
細構造係数を上記残差概形逆正規化手段（５４）へ供給
するパワー逆正規化手段（５３）とを有する。45. The decoding apparatus according to claim 32, wherein the means for reproducing the input quantization gain information to obtain a normalized gain, and the reproduced fine structure coefficient And a power inverse normalizing means (53) for supplying the inverse normalized fine structure coefficient to the residual approximate inverse normalizing means (54).

【請求項４６】請求項３２乃至４５のいずれかに記載
の復号化装置において、逆量子化手段（５１）は量子化微細構造係数情報に応じ
たベクトルを符号帳から取り出して上記微細構造係数を
得る手段である。46. The decoding device according to claim 32, wherein the inverse quantization means (51) extracts a vector corresponding to the quantized fine structure coefficient information from a codebook and calculates the fine structure coefficient. It is a means to obtain.

【請求項４７】請求項３２乃至４５のいずれかに記載
の復号化装置において、逆量子化手段（５１）は量子化微細構造係数情報として
微細構造係数に対応する符号と、その微細構造係数を表
現するために使用した量子化幅を示す符号とを入力し
て、これら符号に基づいた微細構造係数を再生する手段
である。47. The decoding device according to claim 32, wherein the inverse quantization means (51) outputs a code corresponding to the fine structure coefficient as the quantized fine structure coefficient information and the fine structure coefficient. This is a means for inputting a code indicating the quantization width used for expression and reproducing a fine structure coefficient based on these codes.

【請求項４８】請求項３２乃至４７の何れかに記載の
復号化装置において、上記逆平坦化手段は、入力されたスペクトラム概形を表
す情報から上記音響信号の周波数特性の概形情報を再生
する手段（５６）と、上記周波数特性の概形情報に基づ
いて上記周波数特性の概形が与えられた上記音響信号を
再生する手段とを含む。48. The decoding device according to claim 32, wherein said inverse flattening means reproduces outline information of frequency characteristics of said audio signal from information representing an input outline of said spectrum. (56) means for reproducing the acoustic signal to which the outline of the frequency characteristic is given based on the outline information of the frequency characteristic.

【請求項４９】請求項４８記載の復号化装置におい
て、上記逆平坦化手段は、入力された上記スペクトラム概形
を表す情報から上記周波数特性の概形情報として上記音
響信号の線形予測係数を再生する手段と、上記再生され
た線形予測係数から上記音響信号の周波数特性の概形を
求める手段と、上記再生残差係数を上記音響信号の周波
数特性の概形で逆正規化して上記周波数領域係数を得る
手段（５７）と、上記周波数領域係数を時間領域信号に
変換して上記音響信号を得る手段（５８）とを有する。49. The decoding device according to claim 48, wherein the inverse flattening means reproduces the linear prediction coefficient of the audio signal as the outline information of the frequency characteristic from the input information indicating the outline of the spectrum. Means for obtaining an approximate shape of the frequency characteristic of the audio signal from the reproduced linear prediction coefficient, and denormalizing the reproduction residual coefficient with the approximate shape of the frequency characteristic of the audio signal, (57) and means (58) for converting the frequency domain coefficient into a time domain signal to obtain the acoustic signal.

【請求項５０】請求項４９記載の復号化装置におい
て、上記周波数特性の概形を求める手段は、上記線形予測係
数をフーリエ変換する手段と、その結果得られたスペク
トラム振幅を上記周波数特性の概形として得る手段とよ
りなる。50. The decoding device according to claim 49, wherein the means for obtaining the approximate shape of the frequency characteristic includes: means for performing a Fourier transform on the linear prediction coefficient; It is a means to obtain as a form.

【請求項５１】請求項３２乃至４７のいずれかに記載
の復号化装置において、上記逆平坦化手段は、上記再生残差係数を時間領域の残
差信号に変換する手段（７２）と、入力されたスペクト
ラム概形を表す情報から上記周波数特性の概形情報とし
て上記音響信号の線形予測係数を再生する手段（７４）
と、上記線形予測係数をフィルタ係数として上記残差信
号を逆フィルタ処理して上記音響信号を再生する合成フ
ィルタ手段（７３）とを有する。51. A decoding device according to claim 32, wherein said inverse flattening means converts said reproduction residual coefficient into a time domain residual signal, and an input means. Means (74) for reproducing the linear prediction coefficient of the acoustic signal as the outline information of the frequency characteristic from the information representing the obtained spectrum outline.
And a synthesis filter means (73) for performing inverse filtering on the residual signal using the linear prediction coefficient as a filter coefficient to reproduce the audio signal.

【請求項５２】請求項３２乃至４７のいずれかに記載
の復号化装置において、上記逆平坦化手段は、上記再生残差係数を複数の小帯域
に分割する手段と、入力されたスペクトラム概形を表す
情報からそれぞれの小帯域に対応するスケールファクタ
を上記周波数特性の概形情報として再生する手段（７
１）と、それぞれの小帯域の上記再生残差係数を対応す
る上記スケールファクタで逆正規化して上記周波数特性
の概形が付与された周波数領域係数を求める手段（５
７）と、上記周波数領域係数を時間領域信号に変換して
上記音響信号を再生する手段（５８）とを有する。52. The decoding apparatus according to claim 32, wherein the inverse flattening means is configured to divide the reproduction residual coefficient into a plurality of small bands, and to input an approximate spectrum. Means for reproducing the scale factor corresponding to each small band from the information representing
(1) and means (5) for denormalizing the reproduction residual coefficient of each small band with the corresponding scale factor to obtain a frequency domain coefficient to which the outline of the frequency characteristic is added.
7) and means (58) for converting the frequency domain coefficient into a time domain signal to reproduce the audio signal.

【請求項５３】請求項４４または５０記載の復号化装
置において、上記周波数領域係数から上記時間領域信号への変換手段
は逆重ね合わせ直交変換手段である。53. The decoding apparatus according to claim 44, wherein the means for converting the frequency domain coefficients into the time domain signals is a deconvolution orthogonal transform means.

【請求項５４】請求項３２乃至４７のいずれかに記載
の復号化装置において、上記逆平坦化手段は上記周波数特性の概形情報に基づい
て上記再生残差計数に上記周波数特性の概形を与えて周
波数領域係数を生成する手段と、上記周波数領域係数を
時間領域信号に変換することにより再生された上記音響
信号を得る手段とを有する。54. The decoding apparatus according to claim 32, wherein said inverse flattening means adds the approximate shape of the frequency characteristic to the reproduction residual count based on the approximate shape information of the frequency characteristic. Means for generating a frequency-domain coefficient by applying the frequency-domain coefficient to the time-domain signal to obtain a reproduced sound signal.

【請求項５５】請求項５４記載の復号化装置におい
て、上記周波数領域係数から上記時間領域信号への変換手段
は逆重ね合わせ直交変換手段である。55. The decoding apparatus according to claim 54, wherein the means for converting the frequency domain coefficient into the time domain signal is an inverse superposition orthogonal transform means.