JP2736157B2

JP2736157B2 - Encoding device

Info

Publication number: JP2736157B2
Application number: JP2188438A
Authority: JP
Inventors: 淳悟鬼頭; 智一森尾
Original assignee: Consejo Superior de Investigaciones Cientificas CSIC
Current assignee: Consejo Superior de Investigaciones Cientificas CSIC
Priority date: 1990-07-17
Filing date: 1990-07-17
Publication date: 1998-04-02
Anticipated expiration: 2013-04-02
Also published as: JPH0475100A

Description

【発明の詳細な説明】［産業上の利用分野］音声信号の情報を圧縮して伝送及び蓄積する符号化装
置に関する。Description: TECHNICAL FIELD The present invention relates to an encoding apparatus for compressing, transmitting, and storing information of an audio signal.

［従来の技術］一般に音声信号は近接サンプル間の相関がかなり高い
ので、近接サンプルの信号系列から線形予測（短期予
測）により音声信号を効率良く情報圧縮する方法が広く
用いられている。[Prior Art] Generally, a speech signal has a considerably high correlation between adjacent samples. Therefore, a method of efficiently compressing information of a speech signal by linear prediction (short-term prediction) from a signal sequence of neighboring samples is widely used.

また、音声信号の有音声にはピッチ周期のように比較
的時間間隔の広い冗長性があり、この冗長性に対しても
線形予測によりピッチ予測（長期予測）を行って音声信
号を更に効率良く情報圧縮する方法が用いられている。Also, voiced speech of a speech signal has redundancy having a relatively wide time interval, such as a pitch period. For this redundancy, pitch prediction (long-term prediction) is performed by linear prediction to further efficiently reproduce the speech signal. A method of compressing information is used.

上述の二つの線形予測を用いた低ビットレートの音声
符号化方法として、CELP（コード・エキサイテッド・リ
ニア・プレディクション・コーディング（Code−Excite
d Linear Prediction coding））が盛んに研究されてい
る。CELP (Code-Excited Linear Prediction Coding (Code-Excite) is a low bit rate speech encoding method using the above two linear predictions.
d Linear Prediction coding)) is being actively studied.

CELPを用いて音声を符号化する従来の符号化装置は、
予測残差信号を短時間（例えば5msec）のベクトルとし
て取り扱うことによりベクトル量子化の技術を用いて予
測残差信号を低ビットレートで符号化している。A conventional encoding device that encodes speech using CELP is:
By treating the prediction residual signal as a vector of a short time (for example, 5 msec), the prediction residual signal is encoded at a low bit rate using a vector quantization technique.

第３図にCELPを用いたの従来の符号化装置、第４図に
第３図に示す符号化装置に対応する復号化装置をそれぞ
れ示す。FIG. 3 shows a conventional coding apparatus using CELP, and FIG. 4 shows a decoding apparatus corresponding to the coding apparatus shown in FIG.

以下、第３図に示す符号化装置を説明する。 Hereinafter, the encoding device shown in FIG. 3 will be described.

AD変換器35は、入力端子から入力されたアナログ音声
信号をデジタルデータに変換して一定のサンプル長（例
えば5msec）の入力ベクトルＶとして線形予測分析器36
及び減算器37に出力する。The AD converter 35 converts the analog audio signal input from the input terminal into digital data and converts it into an input vector V having a fixed sample length (for example, 5 msec).
And outputs it to the subtractor 37.

線形予測分析器36は、AD変換器35により変換されたデ
ジタルデータを線形予測分析する。ここで線形予測分析
として、スペクトル予測（近接予測）及びピッチ予測
（長期予測）の両者を含むものとする。なお、線形予測
分析器36で分析結果として得られた予測係数ベクトルβ
は、線形予測器38に出力されると共にマルチプレクサ39
にも出力される。コードブック40は、一定のサンプル長
を有しておりゲイン正規化された残差信号波形（以後、
コードワードと称する）を複数（例えば1024種類）記憶
する。The linear prediction analyzer 36 performs linear prediction analysis on the digital data converted by the AD converter 35. Here, it is assumed that the linear prediction analysis includes both spectrum prediction (proximity prediction) and pitch prediction (long-term prediction). The prediction coefficient vector β obtained as an analysis result by the linear prediction analyzer 36
Is output to the linear predictor 38 and the multiplexer 39
Is also output to The codebook 40 has a fixed sample length and a gain-normalized residual signal waveform (hereinafter, referred to as
(For example, 1024 types) are stored.

最適コードワード選択器41はコードブック40に記憶さ
れたコードワードをその指示により選択し、コードブッ
ク40は最適コードワード選択器41により選択されたコー
ドワードをゲイン掛け算器42に出力する。ゲイン掛け算
器42は、最適コードワード選択器41により選択されたコ
ードワードを増幅して線形予測器38及び加算器43などで
構成される合成フィルタに出力する。合成フィルタは、
ゲイン掛け算器42で増幅されたコードワードを入力して
合成信号ベクトルＶ′を算出する。減算器37は、合成信
号ベクトルＶ′と入力ベクトルＶとの誤差、即ちエラー
を算出する。The optimal codeword selector 41 selects a codeword stored in the codebook 40 according to the instruction, and the codebook 40 outputs the codeword selected by the optimal codeword selector 41 to the gain multiplier 42. The gain multiplier amplifies the codeword selected by the optimum codeword selector 41 and outputs the amplified codeword to a synthesis filter including the linear predictor 38 and the adder 43. The synthesis filter is
The codeword amplified by the gain multiplier 42 is input to calculate a composite signal vector V '. The subtracter 37 calculates an error between the synthesized signal vector V ′ and the input vector V, that is, an error.

聴覚的重み付けフィルタ44は、減算器37で算出された
エラーにスペクトルマスキング効果を考慮する。パワー
算出器45は、エラーの二乗誤差を計算する。最適コード
ワード選択器41は、二乗誤差が最小となるコードワード
及びゲインを選択して、選択されたコードワードを特定
する指標値Ｉをマルチプレクサ39に出力する。The auditory weighting filter 44 considers the spectral masking effect on the error calculated by the subtractor 37. The power calculator 45 calculates the square error of the error. The optimum codeword selector 41 selects a codeword and a gain that minimize the square error, and outputs an index value I specifying the selected codeword to the multiplexer 39.

マルチプレクサ39は、予測係数ベクトルβ、残差利得
ｈ及び残差信号を特定する指標Ｉをマルチプレックスし
て伝送路に送出する。The multiplexer 39 multiplexes the prediction coefficient vector β, the residual gain h, and the index I for specifying the residual signal, and sends the result to the transmission line.

上述のコードワードのコードブック40の作成方法とし
ては、実際の長時間の音声信号を分析して残差信号を算
出してベクトル量子化により作成する方法、ホワイト−
ガウシアン（White−Gaussian）のランダム信号で擬似
的に作成する方法がある。As a method for creating the codebook 40 of the codewords described above, a method of analyzing a real long-time audio signal to calculate a residual signal and creating the residual signal by vector quantization is used.
There is a method of pseudo-creating a Gaussian (White-Gaussian) random signal.

次に第４図に示す復号化装置を説明する。 Next, the decoding device shown in FIG. 4 will be described.

デマルチプレクサ47は、伝送路46から受信した信号を
予測係数ベクトルβ、残差利得ｈ及び残差信号を特定す
る指標Ｉにそれぞれ分解する。The demultiplexer 47 decomposes the signal received from the transmission path 46 into a prediction coefficient vector β, a residual gain h, and an index I for specifying the residual signal.

コードブック48は、上述の符号化器のコードブックス
40と同じ内容を有しており、残差信号を特定する指標値
Ｉに基づきコードワードを掛け算器49に出力する。掛け
算器49は、コードワード48から出力されたコードワード
を入力して受信した残差利得ｈにより増幅する。Codebook 48 is the codebook of the encoder described above.
It has the same contents as 40 and outputs a codeword to the multiplier 49 based on the index value I specifying the residual signal. The multiplier 49 receives the codeword output from the codeword 48 and amplifies the codeword with the received residual gain h.

受信した予測係数ベクトルβを係数とする線形予測器
50及び加算器51により構成された合成フィルタは、掛け
算器49で増幅されたコードワードを入力して再生信号ベ
クトルをDA変換器52に出力する。DA変換器52は合成フィ
ルタから出力された再生信号をアナログ信号に変換す
る。Linear predictor using received prediction coefficient vector β as a coefficient
The synthesis filter constituted by 50 and the adder 51 inputs the codeword amplified by the multiplier 49 and outputs a reproduced signal vector to the DA converter 52. The DA converter 52 converts the reproduced signal output from the synthesis filter into an analog signal.

上記のCELPを用いた従来の符号化装置は、破裂子音の
破裂時点、無音及び無声子音から母音定常部への過渡区
間などでは異なるコードブックを用いて符号化した方が
効率的に音声を圧縮できることが示唆されている（「コ
ード・エキサイテッド・リニア・プレディクション（CE
LP）：ハイ・クォリティ・スピーチ・アト・ヴェリー・
ロウ・ビット・レイツ（Code−Excited Linear Predict
ion（CELP）:High Quality Speech at Very Low Bit Ra
tes）」，エム・アール・シュレーダー及びビー・エス
・アタル（M.R.Schroeder and B.S.Atal）共著,ICASSP,
1985、参照）。The conventional coding device using CELP described above compresses speech more efficiently by encoding using a different codebook at the time of the burst of a consonant, the transition from silence and unvoiced consonants to the vowel stationary part, etc. Has been suggested ("Code Excited Linear Prediction (CE
LP): High Quality Speech at Very
Low Bit Rates (Code-Excited Linear Predict)
ion (CELP): High Quality Speech at Very Low Bit Ra
tes) ", co-authored by MR Schroeder and BSAtal, ICASSP,
1985).

［発明が解決しようとする問題点］しかしながら、CELPを用いた上述の従来の符号化装置
には、コードブックの内容が音声の長時間統計分析の結
果により最適化されて設定されており、短時間的に発声
された音声に対して最適ではないという問題点がある。
また、従来の符号化装置には、線形予測で全極モデルに
より音声信号をモデル化しているため、子音、特に鼻子
音区間の残差信号に、全極モデルによっては表現するこ
とができないスペクトルの零点の情報が含まれており、
子音及び鼻子音に対する符号化が難しいという問題点が
ある。[Problems to be Solved by the Invention] However, in the above-described conventional coding apparatus using CELP, the contents of the codebook are set by being optimized based on the result of a long-term statistical analysis of speech. There is a problem that it is not optimal for temporally uttered speech.
In addition, in the conventional encoding device, since the speech signal is modeled by an all-pole model by linear prediction, a consonant, especially a residual signal in a nasal consonant section, has a spectrum that cannot be expressed by the all-pole model. Zero information is included,
There is a problem that it is difficult to encode consonants and nasal consonants.

本発明の目的は、上述の従来の符号化装置における問
題点に鑑み、複数の残差信号コードブックの中から使用
するコードブックを選択して効率良く音声を符号化する
ことができる符号化装置を提供することにある。SUMMARY OF THE INVENTION An object of the present invention is to provide an encoding apparatus capable of selecting a codebook to be used from among a plurality of residual signal codebooks and efficiently encoding speech in view of the above-described problems in the conventional encoding apparatus. Is to provide.

［問題点を解決するための手段］本発明の上述した目的は、AD変換器、第１の線形予測
分析器、複数のコードブック、合成フィルタ、乗算器、
減算器、フィルタ、パワー算出器、最適コードワード選
択器、コードブック選択手段、マルチプレクサからなる
符号化装置であって、AD変換器は、入力されるアナログ
音声信号を所定長デジタルデータに変換出力し、第１の
線形予測分析器は、所定長デジタルデータから第１の線
形予測係数を算出し、コードブックは、複数の正規化さ
れた残差信号波形を記憶し、合成フィルタは、第１の線
形予測係数と乗算器からの出力により合成信号を算出
し、乗算器は、最適コードワード選択器で特定された残
差信号波形と残差利得を乗算し、減算器は、合成信号と
所定長デジタルデータの差であるエラーを出力し、フィ
ルタは、エラーに聴覚的特徴に基づく重み付けを行い、
パワー算出器は、重み付けされたエラーの二乗誤差を算
出し、最適コードワード選択器は、二乗誤差に基づいて
乗算器の残差利得とコードブックの残差信号波形を特定
すると共に、残差利得と特定された残差信号波形の指標
値を出力し、コードブック選択手段は、線形予測器、FF
T、スペクトル反転器、IFFT、第２の線形予測分析器、
分類器からなり、線形予測器は、所定長デジタルデータ
と第１の線形予測係数に基づいて残差信号を出力し、FF
Tは、残差信号をパワースペクトルに変換し、スペクト
ル反転器は、パワースペクトルを成分毎に反転し、IFFT
は、反転されたパワースペクトルを時間領域に変換し、
第２の線形予測分析器は、時間領域に変換されたパワー
スペクトルから第２の線形予測係数を算出し、分類器
は、第２の線形予測係数に基づいて複数のコードブック
から１つを選択すると共に、選択されたコードブックの
指標信号をマルチプレクサへ出力し、マルチプレクサ
は、第１の線形予測係数、残差利得、指標信号、指標値
を伝送路へ送出する符号化装置によって達成される。[Means for Solving the Problems] The above-described object of the present invention is to provide an AD converter, a first linear prediction analyzer, a plurality of codebooks, a synthesis filter, a multiplier,
An encoding device comprising a subtractor, a filter, a power calculator, an optimal codeword selector, a codebook selector, and a multiplexer, wherein the AD converter converts an input analog audio signal into digital data having a predetermined length and outputs the digital audio signal. , A first linear prediction analyzer calculates a first linear prediction coefficient from the predetermined length digital data, a codebook stores a plurality of normalized residual signal waveforms, and a synthesis filter outputs the first linear prediction coefficient. A composite signal is calculated based on the linear prediction coefficient and the output from the multiplier. The multiplier multiplies the residual signal waveform specified by the optimal codeword selector by the residual gain. Outputting an error that is the difference between the digital data, the filter weights the error based on auditory features,
The power calculator calculates the squared error of the weighted error, and the optimal codeword selector specifies the residual gain of the multiplier and the residual signal waveform of the codebook based on the squared error. And the index value of the residual signal waveform identified as
T, spectral inverter, IFFT, second linear prediction analyzer,
A linear predictor that outputs a residual signal based on the predetermined length digital data and the first linear prediction coefficient,
T converts the residual signal to a power spectrum, and a spectrum inverter inverts the power spectrum for each component,
Transforms the inverted power spectrum into the time domain,
The second linear prediction analyzer calculates a second linear prediction coefficient from the power spectrum converted into the time domain, and the classifier selects one from a plurality of codebooks based on the second linear prediction coefficient. At the same time, the index signal of the selected codebook is output to the multiplexer, and the multiplexer is achieved by the encoder that sends the first linear prediction coefficient, the residual gain, the index signal, and the index value to the transmission path.

［作用］ AD変換器により、入力されたアナログ音声信号が所定
長デジタルデータに変換出力され、第１の線形予測分析
器により、所定長デジタルデータから第１の線形予測係
数が算出され、合成フィルタにより、第１の線形予測係
数と乗算器からの出力とから合成信号が算出され、乗算
器により、最適コードワード選択器で特定された残差信
号波形と残差利得とが乗算され、減算器により、エラー
が合成信号と所定長デジタルデータの差から求められて
出力され、フィルタにより、エラーに聴覚的特徴に基づ
く重み付けが行われ、パワー算出器により、重み付けさ
れたエラーの二乗誤差が算出され、最適コードワード選
択器により、二乗誤差に基づいた乗算器の残差利得とコ
ードブックの残差信号波形とが特定されると共に、残差
利得と特定された残差信号波形の指標値が出力される。
コードブック選択手段において、線形予測器により、所
定長デジタルデータと第１の線形予測係数に基づいた残
差信号が出力され、FFTにより、残差信号がパワースペ
クトルに変換され、スペクトル反転器により、パワース
ペクトルが成分毎に反転され、IFFTにより、反転された
パワースペクトルが時間領域に変換され、第２の線形予
測分析器により、時間領域に変換されたパワースペクト
ルから第２の線形予測係数が算出され、分類器により、
第２の線形予測係数に基づいて複数のコードブックから
１つが選択されると共に、選択されたコードブックの指
標信号がマルチプレクサへ出力しされる。マルチプレク
サにより出力として、第１の線形予測係数、残差利得、
指標信号、指標値が伝送路へ送出される。[Operation] An analog-to-digital converter converts an input analog audio signal into digital data having a predetermined length and outputs the digital data. A first linear prediction analyzer calculates a first linear prediction coefficient from the digital data having a predetermined length. , A composite signal is calculated from the first linear prediction coefficient and the output from the multiplier, and the multiplier multiplies the residual signal waveform specified by the optimal codeword selector by the residual gain, and The error is obtained and output from the difference between the combined signal and the predetermined length digital data, and the error is weighted based on the auditory feature by the filter, and the square error of the weighted error is calculated by the power calculator. , The optimal codeword selector identifies the residual gain of the multiplier and the residual signal waveform of the codebook based on the square error, and identifies the residual gain. Index value of the residual signal waveform is output.
In the codebook selecting means, a linear predictor outputs a residual signal based on the predetermined-length digital data and the first linear prediction coefficient, and the FFT converts the residual signal into a power spectrum. The power spectrum is inverted for each component, the inverted power spectrum is converted to the time domain by IFFT, and the second linear prediction coefficient is calculated from the power spectrum converted to the time domain by the second linear prediction analyzer. And the classifier
One of the plurality of codebooks is selected based on the second linear prediction coefficient, and an index signal of the selected codebook is output to the multiplexer. The first linear prediction coefficient, the residual gain,
The index signal and the index value are transmitted to the transmission path.

［実施例］以下、本発明の符号化装置の一実施例を図面を参照し
て説明する。[Embodiment] Hereinafter, an embodiment of an encoding device of the present invention will be described with reference to the drawings.

第１図は本実施例における符号化装置の構成を示す。 FIG. 1 shows the configuration of an encoding device according to the present embodiment.

第１図に示す符号化装置は、AD変換器11、線形予測分
析器12,13、線形予測器14,15、残差信号コードブック1
6、掛け算器17、加算器18、減算器19、聴覚的重み付け
フィルタ20、パワー算出器21、最適コードワード選択器
22、マルチプレクサ23、高速フーリエ変換器（以後、FF
Tと称する）24、逆高速フーリエ変換器（以後、逆FFTと
称する）25、スペクトル反転器26及び分類器27により構
成されている。本発明のコードブック選択手段10は、線
形予測器15、FFT24、逆FFT25、スペクトル反転器26及び
分類器27からなる。The encoding device shown in FIG. 1 includes an AD converter 11, linear prediction analyzers 12, 13, linear predictors 14, 15, a residual signal codebook 1
6, multiplier 17, adder 18, subtractor 19, auditory weighting filter 20, power calculator 21, optimal codeword selector
22, multiplexer 23, fast Fourier transformer (hereinafter FF
T) 24, an inverse fast Fourier transformer (hereinafter, referred to as an inverse FFT) 25, a spectrum inverter 26, and a classifier 27. The codebook selecting means 10 of the present invention includes a linear predictor 15, an FFT 24, an inverse FFT 25, a spectrum inverter 26, and a classifier 27.

次に上述の符号化装置の動作を説明する。 Next, the operation of the above-described encoding device will be described.

AD変換器11は入力端子から入力されたアナログ音声信
号をデジタルデータに変換して、一定のサンプル長（例
えば5msec）の入力ベクトルＳとして線形予測分析器12
及び減算器19に出力する。The AD converter 11 converts the analog audio signal input from the input terminal into digital data, and converts it into an input vector S having a fixed sample length (for example, 5 msec).
And outputs it to the subtractor 19.

線形予測分析器12は、AD変換器11により変換されたデ
ジタルデータを線形予測分析する。The linear prediction analyzer 12 performs linear prediction analysis on the digital data converted by the AD converter 11.

ここでは線形予測分析としてスペクトル予測の近接予
測及びピッチ予測の長期予測を含むものとする。分析は
複数の入力ベクトル（例えば４つのベクトル、即ち20ms
ec毎）に対して実施され、分析の結果得られた線形予測
係数としての線形予測ベクトルαは、線形予測器14,15
及びマルチプレクサ23に出力される。Here, it is assumed that the linear prediction analysis includes proximity prediction of spectrum prediction and long-term prediction of pitch prediction. The analysis is based on multiple input vectors (eg, four vectors, ie, 20 ms).
ec), and the linear prediction vector α as the linear prediction coefficient obtained as a result of the analysis is
And output to the multiplexer 23.

線形予測器15は分析側に配置されており、線形予測分
析器12から出力された線形予測ベクトルαにより駆動さ
れて入力音声の残差信号を算出する。The linear predictor 15 is arranged on the analysis side, and is driven by the linear prediction vector α output from the linear prediction analyzer 12, and calculates a residual signal of the input speech.

なお、線形予測器15により算出される残差信号は、音
声信号に含まれている零点、即ち***振点によるスペク
トルの谷を含む。The residual signal calculated by the linear predictor 15 includes a zero point included in the audio signal, that is, a valley of the spectrum due to the anti-resonance point.

以下のステップによりスペクトル概形に対応する線形
予測係数を求める。The linear prediction coefficient corresponding to the spectrum outline is obtained by the following steps.

FFT24（例えばポイント数が256点）により算出した残
差信号を周波数分析してパワースペクトルに変換する
（ステップS1）。パワースペクトル成分毎にスペクトル
反転器26によりスペクトル反転を実施する。ここで反転
されたスペクトル成分をパワースペクトル成分をSiとすると、で表される。但しｉ＝０〜（Ｎ−１）、Ｎ＝FFT24のポ
イント数とする（ステップS2）。逆FFT器25によりスペ
クトル反転したものに逆FFTを施して時間波形に戻す
（ステップS3）。The residual signal calculated by the FFT 24 (for example, having 256 points) is frequency-analyzed and converted into a power spectrum (step S1). Spectrum inversion is performed by the spectrum inverter 26 for each power spectrum component. Here, the inverted spectral components are If the power spectrum component is Si, It is represented by However, i = 0 to (N-1) and N = the number of points of FFT24 (step S2). An inverse FFT is applied to the spectrum inverted by the inverse FFT unit 25 to return to a time waveform (step S3).

上述のステップS1〜ステップS3の操作によりパワース
ペクトルの零点の特徴量を表わす線形予測係数が算出で
きる。そして、線形予測分析器13により線形予測分析を
行って線形予測ベクトルα′を抽出し、抽出された線形
予測ベクトルα′をベストル量子化方法を用いた分類器
27により数種類に分類し、分類されたスペクトル形状の
指標信号としての指標ｊを出力する（ステップS4）。The linear prediction coefficient representing the feature value of the zero point of the power spectrum can be calculated by the operations in steps S1 to S3. Then, a linear prediction analysis is performed by the linear prediction analyzer 13 to extract a linear prediction vector α ′, and the extracted linear prediction vector α ′ is classified into a classifier using a Vestle quantization method.
27, and outputs an index j as an index signal of the classified spectrum shape (step S4).

上記の各ステップS1〜S4を経たあと、分類器27から出
力されたスペクトル形状の指標ｊに基づいて、予め備え
られている複数のコードブック16から最適コードワード
選択器22により使用するコードブックを選択する（複数
のコードブック16の作成方法は後述する）。なお、分類
器27より出力されたスペクトル形状の指標ｊはマルチプ
レクサ23にも出力される。After each of the above steps S1 to S4, based on the spectral shape index j output from the classifier 27, a codebook to be used by the optimal codeword selector 22 from a plurality of codebooks 16 provided in advance. Select (the method of creating the plurality of codebooks 16 will be described later). The spectrum shape index j output from the classifier 27 is also output to the multiplexer 23.

複数のコードブック16を構成するコードブックのそれ
ぞれには、一定のサンプル長を有しており、ゲイン正規
化された残差信号波形（コードワード）が複数（例えば
256種類）記憶されている。Each of the codebooks constituting the plurality of codebooks 16 has a fixed sample length and has a plurality of gain-normalized residual signal waveforms (codewords) (for example,
256 types) are stored.

最適コードワード選択器22の指示により選択された全
てのコードワードは、ゲイン掛け算器17で増幅される。All codewords selected by the instruction of the optimum codeword selector 22 are amplified by the gain multiplier 17.

ゲイン掛け算器17で増幅されたコードワードは、線形
予測器14及び加算器18などで構成された合成フィルタに
出力される。合成フィルタは増幅されたコードワードを
入力して合成信号ベクトルＳ′を出力する。The codeword amplified by the gain multiplier 17 is output to a synthesis filter including the linear predictor 14 and the adder 18. The synthesis filter receives the amplified codeword and outputs a synthesized signal vector S '.

減算器19は、合成フィルタから出力された合成信号ベ
クトルＳ′とAD変換器11から出力された入力ベクトルＳ
との誤差、即ちエラーを算出する。The subtractor 19 includes a synthesized signal vector S ′ output from the synthesis filter and an input vector S output from the AD converter 11.
, That is, the error is calculated.

減算器19により算出されたエラーは、聴覚重み付けフ
ィルタ20によりスペクトルマスキング効果が考慮され
る。The error calculated by the subtractor 19 takes into account the spectral masking effect of the auditory weighting filter 20.

パワー算出器21は、聴覚重み付けフィルタ20から出力
されたスペクトルマスキング効果が考慮されたエラーの
二乗誤差を計算して最適コードワード選択器22に出力す
る。The power calculator 21 calculates the square error of the error in consideration of the spectrum masking effect output from the auditory weighting filter 20, and outputs the calculated square error to the optimal codeword selector 22.

最適コードワード選択器22は、パワー算出器21により
算出されたエラーの二乗誤差が最小となるコードワード
及びゲインを選択して、選択されたコードワードを特定
する指標値indexをマルチプレクサ23に出力する。The optimal codeword selector 22 selects a codeword and a gain that minimize the square error of the error calculated by the power calculator 21 and outputs an index value index for specifying the selected codeword to the multiplexer 23. .

マルチプレクサ23は、線形予測ベクトルα、残差利得
ｇ、コードブック選択のための指標ｊ及びコードワード
を特定する指標値indexのそれぞれをマルチプレックス
して伝送路28に送る。The multiplexer 23 multiplexes the linear prediction vector α, the residual gain g, the index j for selecting a codebook, and the index value index for specifying a codeword, and sends them to the transmission line 28.

次に上述の複数のコードブック16の作成方法について
説明する。Next, a method of creating the plurality of codebooks 16 will be described.

複数のコードブック16は、長時間の実際の音声信号を
分析して残差信号を得て、得られた残差信号を更にベク
トル量子化することにより作成する。The plurality of codebooks 16 are created by analyzing a long-time actual audio signal to obtain a residual signal, and further performing vector quantization on the obtained residual signal.

その作成方法の手順は、まず従来のサイズ（例えば10
24〜2048種類のコードワード）の残差信号のコードブッ
クを作成する。そして作成されたコードブックを使用し
て、コードブック作成に用いた長時間の実際の音声信号
を符号化し、残差信号のスペクトル形状の指標としての
形状指標ｊごとに使用頻度の高いコードワードを一定個
数（例えば256種）選択する。上記のようにして複数の
コードブックを作成する。The procedure of the creation method is as follows.
Create a codebook of 24-2048 types of residual signals. Then, using the created codebook, the long-time actual audio signal used for creating the codebook is encoded, and a frequently used codeword is used for each shape index j as an index of the spectral shape of the residual signal. Select a certain number (for example, 256 types). Create multiple codebooks as described above.

第２図は第１図の符号化装置に対応する復号化装置の
構成を示す。FIG. 2 shows a configuration of a decoding device corresponding to the encoding device of FIG.

第２図に示す復号化装置は、デマルチプレクサ29、複
数のコードブック30、掛け算器31、加算器32、線形予測
器33及びDA変換器34により構成される。The decoding device shown in FIG. 2 includes a demultiplexer 29, a plurality of codebooks 30, a multiplier 31, an adder 32, a linear predictor 33, and a DA converter 34.

次に上述の復号化装置の動作について説明する。 Next, the operation of the above-described decoding device will be described.

デマルチプレクサ29は、伝送路28から受信した信号を
線形予測ベクトルα、残差利得ｇ、残差スペクトル形状
の形状指標ｊ及び残差信号を特定する指標値indexに分
解して、分解された残差スペクトル形状の形状指標ｊ及
び残差信号を特定する指標値indexを複数のコードブッ
ク30に、残差利得ｇを加算器32に、線形予測ベクトルα
を線形予測器33にそれぞれ出力する。The demultiplexer 29 decomposes the signal received from the transmission path 28 into a linear prediction vector α, a residual gain g, a shape index j of a residual spectrum shape, and an index value index for specifying the residual signal, and The shape index j of the difference spectrum shape and the index value index for specifying the residual signal are stored in a plurality of codebooks 30, the residual gain g is stored in an adder 32, and the linear prediction vector α
Is output to the linear predictor 33.

デマルチプレクタ29から出力された形状指標ｊに基づ
いて、あらかじめ蓄えられている複数のコードブック30
から使用するコードブックを選択する。ここで複数のコ
ードブック30は符号化装置におけるコードブック16の内
容と同じ内容を有している。選択されたコードブック30
の指標値indexに基づくコードワードは掛け算器31に出
力される。Based on the shape index j output from the demultiplexer 29, a plurality of codebooks 30 stored in advance are stored.
Select the codebook to use from. Here, the plurality of codebooks 30 have the same content as the content of the codebook 16 in the encoding device. Selected codebook 30
Is output to the multiplier 31.

掛け算器31は受信した残差利得ｇにより入力されたコ
ードワードを増幅して合成フィルタに出力する。受信し
た線形予測ベクトルαを係数とする線形予測器33で構成
された合成フィルタは、増幅されたコードワードを入力
して再生信号ベクトルを出力する。そしてDA変換器34
は、合成フィルタから出力された再生信号ベクトルを入
力してアナログの音声信号に変換する。The multiplier 31 amplifies the input codeword by the received residual gain g and outputs the amplified codeword to the synthesis filter. The synthesis filter composed of the linear predictor 33 using the received linear prediction vector α as a coefficient inputs the amplified codeword and outputs a reproduced signal vector. And DA converter 34
Converts the reproduced signal vector output from the synthesis filter into an analog audio signal.

［発明の効果］ AD変換器、第１の線形予測分析器、複数のコードブッ
ク、合成フィルタ、乗算器、減算器、フィルタ、パワー
算出器、最適コードワード選択器、コードブック選択手
段、マルチプレクサからなる符号化装置であって、AD変
換器は、入力されるアナログ音声信号を所定長デジタル
データに変換出力し、第１の線形予測分析器は、所定長
デジタルデータから第１の線形予測係数を算出し、コー
ドブックは、複数の正規化された残差信号波形を記憶
し、合成フィルタは、第１の線形予測係数と乗算器から
の出力により合成信号を算出し、乗算器は、最適コード
ワード選択器で特定された残差信号波形と残差利得を乗
算し、減算器は、合成信号と所定長デジタルデータの差
であるエラーを出力し、フィルタは、エラーに聴覚的特
徴に基づく重み付けを行い、パワー算出器は、重み付け
されたエラーの二乗誤差を算出し、最適コードワード選
択器は、二乗誤差に基づいて乗算器の残差利得とコード
ブックの残差信号波形を特定すると共に、残差利得と特
定された残差信号波形の指標値を出力し、コードブック
選択手段は、線形予測器、FFT、スペクトル反転器、IFF
T、第２の線形予測分析器、分類器からなり、線形予測
器は、所定長デジタルデータと第１の線形予測係数に基
づいて残差信号を出力し、FFTは、残差信号をパワース
ペクトルに変換し、スペクトル反転器は、パワースペク
トルを成分毎に反転し、IFFTは、反転されたパワースペ
クトルを時間領域に変換し、第２の線形予測分析器は、
時間領域に変換されたパワースペクトルから第２の線形
予測係数を算出し、分類器は、第２の線形予測係数に基
づいて複数のコードブックから１つを選択すると共に、
選択されたコードブックの指標信号をマルチプレクサへ
出力し、マルチプレクサは、第１の線形予測係数、残差
利得、指標信号、指標値を伝送路へ送出するように構成
されているので、残差信号のスペクトル形状と１対１に
対応する線形予測係数が求められ、残差信号のスペクト
ル形状に応じたコードブックを選択することが可能とな
り、子音及び鼻子音区間に対してもより正確に符号化す
ることができる。[Effect of the Invention] From an AD converter, a first linear prediction analyzer, a plurality of codebooks, a synthesis filter, a multiplier, a subtractor, a filter, a power calculator, an optimum codeword selector, a codebook selection means, and a multiplexer. An AD converter converts an input analog audio signal into digital data of a predetermined length and outputs the digital audio signal, and a first linear prediction analyzer calculates a first linear prediction coefficient from the digital data of a predetermined length. Calculating, the codebook stores a plurality of normalized residual signal waveforms, the synthesis filter calculates a synthesized signal based on the first linear prediction coefficient and an output from the multiplier, and the multiplier determines an optimal code The residual signal waveform specified by the word selector is multiplied by the residual gain, the subtractor outputs an error that is the difference between the synthesized signal and the predetermined length digital data, and the filter outputs the error based on the auditory characteristics. The power calculator calculates the square error of the weighted error, and the optimal codeword selector specifies the residual gain of the multiplier and the residual signal waveform of the codebook based on the square error. Along with the residual gain, an index value of the identified residual signal waveform is output, and the codebook selecting means includes a linear predictor, an FFT, a spectrum inverter, an IFF
T, a second linear prediction analyzer and a classifier, the linear predictor outputs a residual signal based on the predetermined length digital data and the first linear prediction coefficient, and the FFT converts the residual signal into a power spectrum. , The spectrum inverter inverts the power spectrum component by component, the IFFT transforms the inverted power spectrum into the time domain, and the second linear prediction analyzer
A second linear prediction coefficient is calculated from the power spectrum converted to the time domain, and the classifier selects one from a plurality of codebooks based on the second linear prediction coefficient, and
The index signal of the selected codebook is output to a multiplexer, and the multiplexer is configured to send the first linear prediction coefficient, the residual gain, the index signal, and the index value to the transmission line. Linear prediction coefficients corresponding to the spectral shape of the residual signal are obtained, and a codebook corresponding to the spectral shape of the residual signal can be selected. can do.

【図面の簡単な説明】[Brief description of the drawings]

第１図は本発明の符号化装置における一実施例の構成を
示すブロック図、第２図は第１図に示す符号化装置に対
応する復号化装置の一構成例を示すブロック図、第３図
は従来の符号化装置の一構成例を示すのブロック図、第
４図は第３図に示す符号化装置に対応する復号化装置の
一構成例を示すブロック図である。 11……AD変換器、12,13……線形予測分析器、14,15,33
……線形予測器、16,30……コードブック、17,31……掛
け算器、18,32……加算器、19……減算器、20……聴覚
的重み付けフィルタ、21……パワー算出器、22……最適
コードワード選択器、23……マルチプレクサ、24……FF
T、25……逆FFT、26……スペクトル反転器、27……分類
器、28……デマルチプレクサ、34……DA変換器。FIG. 1 is a block diagram showing a configuration of an embodiment of an encoding apparatus according to the present invention. FIG. 2 is a block diagram showing an example of a configuration of a decoding apparatus corresponding to the encoding apparatus shown in FIG. FIG. 1 is a block diagram showing one configuration example of a conventional encoding device, and FIG. 4 is a block diagram showing one configuration example of a decoding device corresponding to the encoding device shown in FIG. 11 …… AD converter, 12,13 …… Linear prediction analyzer, 14,15,33
…… Linear predictor, 16,30 …… Codebook, 17,31 …… Multiplier, 18,32 …… Adder, 19 …… Subtractor, 20 …… Auditory weighting filter, 21 …… Power calculator , 22 ... optimum code word selector, 23 ... multiplexer, 24 ... FF
T, 25… Inverse FFT, 26… Spectral inverter, 27… Classifier, 28… Demultiplexer, 34… DA converter.

Claims

(57)【特許請求の範囲】(57) [Claims]

【請求項１】AD変換器（11）、第１の線形予測分析器
（12）、複数のコードブック（16）、合成フィルタ（1
4、18）、乗算器（17）、減算器（19）、フィルタ（2
0）、パワー算出器（21）、最適コードワード選択器（2
2）、コードブック選択手段（10）、マルチプレクサ（2
3）からなる符号化装置であって、 AD変換器（11）は、入力されるアナログ音声信号を所定
長デジタルデータに変換出力し、第１の線形予測分析器（12）は、所定長デジタルデータ
から第１の線形予測係数を算出し、コードブック（16）は、複数の正規化された残差信号波
形を記憶し、合成フィルタ（14、18）は、第１の線形予測係数と乗算
器（17）からの出力により合成信号を算出し、乗算器（17）は、最適コードワード選択器（22）で特定
された残差信号波形と残差利得を乗算し、減算器（19）は、合成信号と所定長デジタルデータの差
であるエラーを出力し、フィルタ（20）は、エラーに聴覚的特徴に基づく重み付
けを行い、パワー算出器（21）は、重み付けされたエラーの二乗誤
差を算出し、最適コードワード選択器（22）は、二乗誤差に基づいて
乗算器の残差利得とコードブック（16）の残差信号波形
を特定すると共に、残差利得と特定された残差信号波形
の指標値を出力し、コードブック選択手段（10）は、線形予測器（15）、FF
T（24）、スペクトル反転器（26）、IFFT（25）、第２
の線形予測分析器（13）、分類器（27）からなり、線形予測器（15）は、所定長デジタルデータと第１の線
形予測係数に基づいて残差信号を出力し、 FFT（24）は、残差信号をパワースペクトルに変換し、スペクトル反転器（26）は、パワースペクトルを成分毎
に反転し、 IFFT（25）は、反転されたパワースペクトルを時間領域
に変換し、第２の線形予測分析器（13）は、時間領域に変換された
パワースペクトルから第２の線形予測係数を算出し、分類器（27）は、第２の線形予測係数に基づいて複数の
コードブック（16）から１つを選択すると共に、選択さ
れたコードブックの指標信号をマルチプレクサ（23）へ
出力し、マルチプレクサ（23）は、第１の線形予測係数、残差利
得、指標信号、指標値を伝送路へ送出する符号化装置。An A / D converter (11), a first linear prediction analyzer (12), a plurality of codebooks (16), a synthesis filter (1)
4, 18), multiplier (17), subtractor (19), filter (2
0), power calculator (21), optimal codeword selector (2
2), codebook selection means (10), multiplexer (2
An AD converter (11) converts an input analog audio signal into digital data having a predetermined length and outputs the digital data. A first linear prediction analyzer (12) outputs a digital signal having a predetermined length. Calculating a first linear prediction coefficient from the data; a codebook (16) storing a plurality of normalized residual signal waveforms; and a synthesis filter (14, 18) multiplying the first linear prediction coefficient by the first linear prediction coefficient. A multiplier (17) multiplies the residual signal waveform specified by the optimum codeword selector (22) by the residual gain, and a subtractor (19) Outputs an error that is the difference between the synthesized signal and the digital data of a predetermined length, a filter (20) weights the error based on an auditory feature, and a power calculator (21) generates a square error of the weighted error. The optimal codeword selector (22) calculates the square error The residual gain of the multiplier and the residual signal waveform of the codebook (16) are specified based on the residual gain and the index value of the residual signal waveform specified as the residual gain is output. , Linear predictor (15), FF
T (24), spectrum inverter (26), IFFT (25), second
A linear prediction analyzer (13) and a classifier (27). The linear predictor (15) outputs a residual signal based on the predetermined length digital data and the first linear prediction coefficient, and an FFT (24) Converts the residual signal into a power spectrum, a spectrum inverter (26) inverts the power spectrum for each component, IFFT (25) converts the inverted power spectrum into the time domain, The linear prediction analyzer (13) calculates a second linear prediction coefficient from the power spectrum converted into the time domain, and the classifier (27) calculates a plurality of codebooks (16) based on the second linear prediction coefficient. ) And outputs an index signal of the selected codebook to a multiplexer (23), which transmits a first linear prediction coefficient, a residual gain, an index signal, and an index value. Encoding device to send to the road.