JPH0594200A

JPH0594200A - Code excitation linear prediction encoding device

Info

Publication number: JPH0594200A
Application number: JP3256067A
Authority: JP
Inventors: Hiromi Aoyanagi; 弘美青柳; Hiroshi Katsuragawa; 浩桂川; Yoshihiro Ariyama; 義博有山
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1991-10-03
Filing date: 1991-10-03
Publication date: 1993-04-16
Anticipated expiration: 2015-06-12
Also published as: JP3050966B2

Abstract

PURPOSE:To convert a gain index by code excitation linear prediction encoding to a low bit. CONSTITUTION:The device is provided with an RMS calculating circuit 304 for predicting amplitude of a statistical excitation vector by deforming the statistical excitation vectors based on an RMS(Root Mean Square) of an adaptive excitation vector, and a multiplier 306. Also, this device is provided with a VQ gain code book 305 obtained by vector-quantizing statistical excitation gain and adaptive excitation gain. According to this constitution. with regard to an excitation gain index, even if it is converted to a low bit, an excellent composite voice quality is obtained.

Description

【発明の詳細な説明】Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、適応励振コードブッ
ク、統計励振コードブック、及び励振ゲインコードブッ
クを用い、それらのインデックスと声道予測パラメータ
とに関する、情報を符号化情報とするコード励振線形予
測符号化装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention uses an adaptive excitation codebook, a statistical excitation codebook, and an excitation gain codebook, and a code excitation linear with information about these indexes and vocal tract prediction parameters as coding information. The present invention relates to a predictive coding device.

【０００２】[0002]

【従来の技術】過去の原音声ベクトルの最適な励振ベク
トルを蓄積した適応励振コードブックと、予め定められ
た励振ベクトルを蓄積した統計励振コードブックとを備
えたコード励振線形予測符号化装置は、例えば、次記文
献で開示されている。文献名：Ｎ．Ｓ．ＪａｙａｎｔａｎｄＪ．Ｈ．Ｃｈ
ｅｎ，“ＳｐｅｅｃｈＣｏｄｉｎｇｗｉｔｈＴｉｍ
ｅ−ＶａｒｙｉｎｇＢｉｔＡｌｌｏｃａｔｉｏｎｓ
ｔｏＥｘｃｉｔａｔｉｏｎａｎｄＬＰＣＰａｒ
ａｍｅｔｅｒｓ”，Ｐｒｏｃ．ＩＣＡＳＳＰ，ｐｐ６５
−６８，１９８９。この文献では、音声合成のための残差信号が、周期性の
ある信号と、雑音性の信号とで構成されていると考え、
統計励振コードブックを雑音性の信号に対応させ、適応
励振コードブックを周期性の信号に対応させ、原音声ベ
クトルに対する最適な励振ベクトルを選択決定するため
に、そこから読みだした複数の適応励振ベクトルと複数
の統計励振ベクトルとに基づいて複数の励振ベクトルを
作成する。適応励振コードブックは、具体的には、過去
の励振ベクトル即ち最適残差信号系列をある程度蓄えて
おき、ｘサンプルだけ過去の信号を出力し、このｘの値
を変えることで複数の適応励振ベクトルを得ている（ｎ
通りのｘがあれば、ｎケの候補ベクタが得られる）。2. Description of the Related Art A code excitation linear predictive coding apparatus provided with an adaptive excitation codebook in which optimum excitation vectors of past original speech vectors are accumulated and a statistical excitation codebook in which predetermined excitation vectors are accumulated, For example, it is disclosed in the following document. Reference name: N. S. Jayant and J. H. Ch
en, "SpeechCoding with Tim
e-Varying Bit Allocations
to Excitation and LPC Par
ameters ", Proc.ICASSP, pp65
-68,1989. In this document, it is considered that the residual signal for speech synthesis is composed of a periodic signal and a noise signal,
The statistical excitation codebook is associated with a noisy signal, the adaptive excitation codebook is associated with a periodic signal, and a plurality of adaptive excitations read from the adaptive excitation codebook are selected to determine the optimal excitation vector for the original speech vector. Creating a plurality of excitation vectors based on the vector and the plurality of statistical excitation vectors. Specifically, the adaptive excitation codebook stores past excitation vectors, that is, optimum residual signal sequences to some extent, outputs past signals by x samples, and changes the value of x to obtain a plurality of adaptive excitation vectors. (N
If there are x in the street, then n candidate vectors are obtained).

【０００３】[0003]

【発明が解決しようとする課題】しかし、そこでは、適
応励振ベクトル用及び統計励振ベクトル用にそれぞれ独
立の励振ゲインコードブックを用意しているため、換言
すれば、励振ゲインをスカラ量子化しているため、量子
化ビットが少なくなると十分な性能が得られないという
問題点がある。確かに、統計励振ベクトルは予め決定さ
れている（統計励振コードブックは予め決定されてい
る。）のに対し、適応励振ベクトルは各時刻毎にそのパ
ワが変動してしまう（適応励振コードブックは符号化毎
に更新される）ので、両者にはパワの相関がほとんどな
く、従って、一般的に考えれば、統計励振ゲインと適応
励振ゲインとは独立に設定することになる。他方、任意
の原音声ベクトルに対して、最適として選ばれた適応励
振ベクトルをｅao、同じく適応励振ゲインをβｏ、同じ
く統計励振ベクトルをｅso、同じく統計励振ゲインをγ
ｏとすると、βο＊ｅaoのパワ（ベクタ要素の２乗和）
とγｏ＊ｅsoのパワには、かなりの相関がある（βο＊
ｅaoのパワが大きい時は、γｏ＊ｅsoのパワも大き
い）。そこで、適当なゲインｇを考え、γｏ＊ｅsoをγ
ｏ＊ｇ＊ｅsoと表し、ｅaoとｇ＊ｅsoに相関性を持たせ
れば、適応励振ゲインβｏと統計励振ゲインγｏとにも
相関性が現われる。本発明は、このような励振ゲインの
相関性に着目したものであり、低ビット符号化において
も優れた合成音声品質が得られるコード励振線形予測符
号化装置を提供することを目的とするものである。However, since independent excitation gain codebooks are prepared for the adaptive excitation vector and the statistical excitation vector, in other words, the excitation gain is scalar-quantized. Therefore, there is a problem that sufficient performance cannot be obtained when the number of quantization bits decreases. Certainly, while the statistical excitation vector is predetermined (the statistical excitation codebook is predetermined), the power of the adaptive excitation vector changes at each time (the adaptive excitation codebook is Since they are updated for each encoding), there is almost no power correlation between them, and therefore, in general, the statistical excitation gain and the adaptive excitation gain are set independently. On the other hand, for any original speech vector, the adaptive excitation vector selected as the optimum is eao, the adaptive excitation gain is βo, the statistical excitation vector is eso, and the statistical excitation gain is γ.
If o, βο * eao power (vector element sum of squares)
And the power of γo * eso has a considerable correlation (βο *
When the power of eao is large, the power of γo * eso is also large). Therefore, consider an appropriate gain g and set γo * eso to γ
If it is expressed as o * g * eso, and eao and g * eso have a correlation, the adaptive excitation gain βo and the statistical excitation gain γo also have a correlation. The present invention focuses on the correlation of such excitation gains, and an object of the present invention is to provide a code excitation linear predictive encoding device that can obtain excellent synthesized speech quality even in low bit encoding. is there.

【０００４】[0004]

【課題を解決するための手段】本発明は、過去の複数の
原音声ベクトルに対する最適な励振ベクトルを蓄積して
いて、それらを、選択制御手段に制御されて、適応励振
ベクトルとして出力する適応励振コードブックを有す
る。又、複数の、予め定められた励振ベクトルを蓄積し
ていて、それらを、選択制御手段に制御されて、統計励
振ベクトルとして出力する統計励振コードブックとを有
する。又、複数の、予め定められた励振ゲインを蓄積し
ていて、それらを、選択制御手段に制御されて、適応励
振ゲイン及び統計励振ゲインとして出力する励振ゲイン
コードブックを有する。又、選択制御手段に制御され
て、適応励振コードブック、統計励振コードブック、及
び励振ゲインコードブックの出力に基づいて、複数の励
振ベクトルを作成する励振ベクトル合成手段を有する。
又、原音声ベクトルの声道予測パラメータと各励振ベク
トルとに基づいて合成音声ベクトルを作成し、原音声ベ
クトルと各合成音声ベクトルとの誤差ベクトルを所定の
評価手段によって評価し、この評価値が最小となる励振
ベクトルを最適な励振ベクトルとして適応励振コードブ
ックに蓄積する選択制御手段を有する。そして、この励
振ベクトル合成手段は、適応励振ベクトルに適応励振ゲ
インを乗算し、統計励振ベクトルに、適応励振ベクトル
のパワーの平方根に比例した値及び統計励振ゲインとを
乗算し、それぞれ乗算した後の適応励振ベクトルと統計
励振ベクトルとを加算して各励振ベクトルを作成するも
のである。なお、この明細書において、パワーの平方根
とは、それとほぼ同等の値となる、適応励振ベクトルの
各要素の大きさの総和を含むものである。According to the present invention, an optimum excitation vector for accumulating optimum excitation vectors for a plurality of original speech vectors in the past is stored, and these are controlled by the selection control means and output as an adaptive excitation vector. Have a codebook. It also has a statistical excitation codebook that stores a plurality of predetermined excitation vectors and outputs them as statistical excitation vectors under the control of the selection control means. Further, it has an excitation gain codebook that stores a plurality of predetermined excitation gains and outputs them as adaptive excitation gains and statistical excitation gains under the control of the selection control means. Further, it has an excitation vector synthesizing means that is controlled by the selection control means and creates a plurality of excitation vectors based on the outputs of the adaptive excitation codebook, the statistical excitation codebook, and the excitation gain codebook.
In addition, a synthetic speech vector is created based on the vocal tract prediction parameter of the original speech vector and each excitation vector, and an error vector between the original speech vector and each synthetic speech vector is evaluated by a predetermined evaluation means. It has a selection control means for accumulating the smallest excitation vector as an optimum excitation vector in the adaptive excitation codebook. Then, the excitation vector synthesizing means multiplies the adaptive excitation vector by the adaptive excitation gain, multiplies the statistical excitation vector by a value proportional to the square root of the power of the adaptive excitation vector, and the statistical excitation gain. Each of the excitation vectors is created by adding the adaptive excitation vector and the statistical excitation vector. In this specification, the square root of power includes the sum of the magnitudes of the respective elements of the adaptive excitation vector, which have almost the same value.

【０００５】[0005]

【作用】励振ゲインコードブックは、励振ゲインをベク
トル量子化して蓄積しているものであり、即ち、共通の
ゲインコードインデックスＩｇの下に適応励振ゲインβ
ｋ及び統計励振ゲインγｋとを出力するものであり、ま
た、各励振ベクトルｅを作成する際に、式（１）で示す
修正ゲインｇｉによって統計励振ベクトルｅslの振幅を
変化させ、従って、各励振ベクトルｅは式（２）で示す
ものとなる。ｇｉ＝α＊ＲＭＳ［ｅai］（１）ただし、αは定数、ＲＭＳ［ｅai］は適応励振ベクトル
ｅaiのパワーの平方根ｅ＝βｋ＊ｅai＋γｋ＊ｇｉ＊ｅsl （２）適応励振ベクトルｅaiの振幅と統計励振ベクトルｅslの
振幅とを修正ゲインｇｉで結び付けておくことにより、
観念的に言えば、統計励振ベクトルｅslの振幅を適応励
振ベクトルｅaiに関する修正ゲインｇｉで予測すること
により、最適な励振ベクトル（原音声ベクトルの励振ベ
クトル）ｅ optにおける２励振ゲインβｏ，γｏの間に
相関性が現れる。従って、適応励振ゲインβｋと統計励
振ゲインγｋとの組み合わせを共通のゲインコードイン
デックスＩｇで指定するようにしておくことにより、各
々独立のゲインコードインデックスを対応させるより
も、励振ゲインに関して低ビット符号化できることにな
るなお、式（１）における定数αの値は、統計励振ベク
トルｅsl自身のパワにも関係するので、実験的に決定す
べきものであり、目安としては、適応励振ゲインダイナ
ミックレンジと統計励振ゲインのダイナミックレンジが
同じ位になる程度が適当である。The excitation gain codebook is obtained by vector-quantizing and accumulating the excitation gains, that is, the adaptive excitation gain β under the common gain code index Ig.
k and the statistical excitation gain γk are output, and when each excitation vector e is created, the amplitude of the statistical excitation vector esl is changed by the modified gain gi shown in the equation (1), and thus each excitation vector e The vector e is represented by the equation (2). gi = α * RMS [eai] (1) where α is a constant and RMS [eai] is the square root of the power of the adaptive excitation vector eai e = βk * eai + γk * gi * esl (2) The amplitude and statistics of the adaptive excitation vector eai By connecting the amplitude of the excitation vector esl with the correction gain gi,
Conceptually speaking, by predicting the amplitude of the statistical excitation vector esl with the correction gain gi for the adaptive excitation vector ei, the optimum excitation vector (excitation vector of the original speech vector) e opt between the two excitation gains βo and γo Correlation appears in. Therefore, by designating a combination of the adaptive excitation gain βk and the statistical excitation gain γk with a common gain code index Ig, it is possible to perform low-bit encoding with respect to the excitation gain rather than to associate independent gain code indexes with each other. It should be noted that the value of the constant α in the equation (1) is related to the power of the statistical excitation vector esl itself and should be determined experimentally. As a guide, the adaptive excitation gain dynamic range and the statistical excitation are used. It is suitable that the dynamic range of the gain is about the same.

【０００６】[0006]

【実施例】図１に、本発明のコード励振線形予測符号化
器の実施例を示す。端子３０１よりフレーム単位にまと
められて、ベクトルとして入力される原音声ベクトルＳ
は、まず声道分析回路３１３に入力され、声道予測パラ
メータａｊが計算される。声道分析回路３１３は、声道
予測パラメータａｊをＬＰＣ量子化器３１２に送出す
る。ＬＰＣ量子化器３１２は、声道予測パラメータａｊ
を量子化し、そのＬＰＣコードＩｃをＬＰＣ逆量子化器
３１１、多重化回路３１７に送出する。ＬＰＣ逆量子化
器３１１は、ＬＰＣコードＩｃを声道予測パラメータａ
qjに逆変換して合成フィルタ３１０に送出する。次に、
適応励振コードブック３０２は適応励振ベクトルｅai
（ｉ＝１〜ｎ）、統計励振コードブック３０３は統計励
振ベクトルｅsl（ｌ＝１〜ｍ）、ＶＱゲインコードブッ
ク３０５は励振ゲインβｋ、γｋ（ｋ＝１〜ｐ）を各々
出力する。ＲＭＳ計算回路３０４は、適応励振ベクトル
ｅaiをもとに式（１）で示した修正ゲインｇｉ＝α×Ｒ
ＭＳ［ｅai］を計算し出力する。適応励振ベクトルｅai
は乗算器３０８により適応励振ゲインβｋが乗ぜられベ
クトルｅaik となり、統計励振ベクトルｅslは乗算器３
０６により修正ゲインｇｉ、乗算器３０７により統計励
振ゲインγｋが乗ぜられベクトルｅslikとなる。FIG. 1 shows an embodiment of a code-excited linear predictive encoder according to the present invention. The original voice vector S that is collected from the terminal 301 in frame units and input as a vector
Is first input to the vocal tract analysis circuit 313, and the vocal tract prediction parameter aj is calculated. The vocal tract analysis circuit 313 sends the vocal tract prediction parameter aj to the LPC quantizer 312. The LPC quantizer 312 uses the vocal tract prediction parameter aj
Is quantized and the LPC code Ic is sent to the LPC dequantizer 311 and the multiplexing circuit 317. The LPC dequantizer 311 uses the LPC code Ic as the vocal tract prediction parameter a.
It is converted back to qj and sent to the synthesis filter 310. next,
The adaptive excitation codebook 302 is an adaptive excitation vector eai.
(I = 1 to n), the statistical excitation codebook 303 outputs the statistical excitation vector esl (l = 1 to m), and the VQ gain codebook 305 outputs the excitation gains βk and γk (k = 1 to p), respectively. The RMS calculation circuit 304 uses the adaptive excitation vector eai to correct the correction gain gi = α × R shown in the equation (1).
Calculate and output MS [eai]. Adaptive excitation vector eai
Is multiplied by the adaptive excitation gain βk by the multiplier 308 to form a vector eaik, and the statistical excitation vector esl is calculated by the multiplier 3
The correction gain gi is multiplied by 06, and the statistical excitation gain γk is multiplied by the multiplier 307 to obtain the vector eslik.

【０００７】加算器３０９は、ベクトルｅaik とベクト
ルｅslikの成分単位の加算を行い励振ベクトルｅを計算
する。合成フィルタ３１０は、励振ベクトルｅに対する
合成音声ベクトルＳ_Wを計算し、減算器３１４に送出す
る。減算器３１４は、両ベクトルＳ_W，Ｓの成分単位の
減算を行い、誤差ベクトルｅｒを知覚フィルタ３１５に
送出する。知覚フィルタ３１５は誤差ベクトルｅｒに対
する出力ベクトルｅｗを知覚誤差計算回路３１６に送出
する。知覚誤差計算回路３１６は、ベクトルｅｗの各成
分の２乗平均を計算し、この値が最小となるｉ，ｌ，ｋ
の組み合わせを最適な各コードブックのインデックスＩ
ａ，Ｉｓ，Ｉｇとして、適応励振コードブック３０２、
統計励振コードブック３０３、ＶＱゲインコードブック
３０５、多重化回路３１７に送出する。適応励振コード
ブック３０２は、インデックスＩａにより最適な適応励
振コードベクトルｅao、統計励振コードブック３０３
は、インデックスＩｓにより最適な統計励振コードベク
トルｅso、ＶＱゲインコードブック３０５は、インデッ
クスＩｇにより最適なＶＱゲインβｏ、γｏを各々出力
する。ＲＭＳ計算回路３０４は、ベクトルｅaoをもとに
ゲインｇｏを計算し出力する。これらにより構成される
最適励振ベクトルｅopt は適応励振コードブック３０２
に入力され、適応励振コードブック３０２の内容が更新
される。多重化回路３１７は、Ｉｃ，Ｉａ，Ｉｓ，Ｉｇ
をトータルコードＣとして出力端子３１８により受信側
に伝送する。The adder 309 adds the vector eaik and the vector eslik in component units to calculate the excitation vector e. The synthesis filter 310 calculates a synthesized speech vector S _W for the excitation vector e and sends it to the subtractor 314. The subtractor 314 subtracts the two vectors S _W and S in component units, and sends the error vector er to the perceptual filter 315. The perceptual filter 315 sends the output vector ew for the error vector er to the perceptual error calculation circuit 316. The perceptual error calculation circuit 316 calculates the root mean square of each component of the vector ew, and i, l, k with which this value is minimized
Optimal combination of each index of each codebook I
As a, Is, and Ig, the adaptive excitation codebook 302,
It is sent to the statistical excitation codebook 303, the VQ gain codebook 305, and the multiplexing circuit 317. The adaptive excitation codebook 302 includes an optimal adaptive excitation code vector eao and a statistical excitation codebook 303 according to the index Ia.
Is the optimal statistical excitation code vector eso based on the index Is, and the VQ gain codebook 305 outputs the optimal VQ gains βo and γo based on the index Ig, respectively. The RMS calculation circuit 304 calculates and outputs the gain go based on the vector eao. The optimum excitation vector eopt constituted by these is the adaptive excitation codebook 302.
And the contents of the adaptive excitation codebook 302 are updated. The multiplexing circuit 317 uses Ic, Ia, Is, Ig
Is transmitted to the receiving side through the output terminal 318 as the total code C.

【０００８】図２において、各時刻において、まず、原
音声ベクトルＳから声道パラメータａｊ，Ｉｃ，ａqjの
計算を行う（ステップＳＴ１）。次に、ｉ，ｌ，ｋの各
組み合わせにおいて、適応励振ベクトルｅaiに対応した
修正係数ｇｉ（ｅaiとｇｉとは１対１対応）を計算し、
更に励振ベクトルｅを計算し（ステップＳＴ２）、合成
音声ベクトル等の諸ベクトルＳ_w，ｅｒ，ｅｗ，そして
知覚誤差を計算し（ステップＳＴ３）、以上の過程を
ｉ，ｌ，ｋの全ての組み合わせについて行う。（ステッ
プＳＴ４）。その後、知覚誤差が最小となる各インデッ
クスを出力し（ステップＳＴ５）、最適な励振ベクトル
ｅopt を構成し（ステップＳＴ６）、その励振ベクトル
ｅopt で適応励振コードブックを更新する（ステップＳ
Ｔ１）。In FIG. 2, the vocal tract parameters aj, Ic, aqj are calculated from the original speech vector S at each time (step ST1). Next, for each combination of i, l, and k, a correction coefficient gi (eai and gi have a one-to-one correspondence) corresponding to the adaptive excitation vector eai is calculated,
Further, the excitation vector e is calculated (step ST2), the vectors S _w , er, ew such as the synthesized speech vector, and the perceptual error are calculated (step ST3), and the above process is performed for all combinations of i, l, k. Do about. (Step ST4). After that, each index that minimizes the perceptual error is output (step ST5), an optimal excitation vector eopt is constructed (step ST6), and the adaptive excitation codebook is updated with the excitation vector eopt (step S).
T1).

【０００９】図３は、図１のコード励振線形予測符号化
装置による一群の単語の符号結果について、その適応励
振ゲインβｋと統計励振ゲインγｋとの関係を示す分布
図である。図３から明らかなように、適応励振ゲインβ
ｋと統計励振ゲインγｋとの間に強い相関があるので、
その分布の代表点をサンプルして図１のＶＱゲインコー
ドブックに蓄積することにより、ゲインインデックスＩ
ｇを低ビット化することができる。FIG. 3 is a distribution diagram showing the relationship between the adaptive excitation gain βk and the statistical excitation gain γk for the code result of a group of words by the code excitation linear predictive coding apparatus of FIG. As is clear from FIG. 3, the adaptive excitation gain β
Since there is a strong correlation between k and the statistical excitation gain γk,
By sampling the representative points of the distribution and accumulating them in the VQ gain codebook of FIG. 1, the gain index I
It is possible to reduce g for g.

【００１０】図４に、本発明に係る励振線形予測符号化
装置の実施例を示す。入力端子４０１より入力されたコ
ードＣは多重分離回路４１３によりＬＰＣコードＩｃ、
適応励振コードインデックスＩａ、統計励振コードイン
デックスＩｓ、ゲインコードインデックスＩｇに分離さ
れ、各々ＬＰＣ逆量子化器４１１、適応励振コードブッ
ク４０２、統計励振コードブック４０３、ＶＱゲインコ
ードブック４０５に送出される。ＬＰＣ逆量子化器４１
１は、ＬＰＣコードＩｃを声道予測パラメータａｊに変
換して合成フィルタ４１０に送出する。適応励振コード
ブック４０２はインデックスＩａに相当する適応励振コ
ードベクトルｅａ、統計励振コードブック４０３はイン
デックスＩｓに相当する統計励振ベクトルｅｓ、ＶＱゲ
インコードブック４０５はインデックスＩｇに相当する
励振ゲインβ，γを各々出力する。ＲＭＳ計算回路４０
４は、ベクトルｅａをもとにゲインｇを計算し出力す
る。ｅａ，ｅｓ，β，γ，ｇにより励振ベクトルｅが構
成される。合成フィルタ４１０はベクトルｅに対する合
成音声ベクトルＳを計算し出力端子４１２より出力す
る。適応励振コードブック４０２はベクトルｅによりそ
の内容が更新される。FIG. 4 shows an embodiment of the excitation linear predictive coding apparatus according to the present invention. The code C input from the input terminal 401 is output by the demultiplexing circuit 413 to the LPC code Ic,
It is separated into an adaptive excitation code index Ia, a statistical excitation code index Is, and a gain code index Ig, and sent to the LPC dequantizer 411, the adaptive excitation codebook 402, the statistical excitation codebook 403, and the VQ gain codebook 405, respectively. LPC inverse quantizer 41
1 converts the LPC code Ic into a vocal tract prediction parameter aj and sends it to the synthesis filter 410. The adaptive excitation codebook 402 is the adaptive excitation code vector ea corresponding to the index Ia, the statistical excitation codebook 403 is the statistical excitation vector es corresponding to the index Is, and the VQ gain codebook 405 is the excitation gains β and γ corresponding to the index Ig. Output each. RMS calculation circuit 40
4 calculates and outputs the gain g based on the vector ea. The excitation vector e is composed of ea, es, β, γ, and g. The synthesis filter 410 calculates a synthesized speech vector S for the vector e and outputs it from the output terminal 412. The contents of the adaptive excitation codebook 402 are updated by the vector e.

【００１１】[0011]

【発明の効果】二つのコードブックのゲインをベクトル
量子化しているため、スカラ量子化に比べ量子化特性が
良く、さらにＲＭＳ計算回路を用いることにより二つの
コードブックのゲインに、より高い相関性が現れるため
ベクトル量子化の効果がさらに上がる。これらは、低ビ
ットレート化に有効である。Since the gains of the two codebooks are vector-quantized, the quantization characteristic is better than that of the scalar quantization, and the use of the RMS calculation circuit results in a higher correlation between the gains of the two codebooks. Appears, the effect of vector quantization is further enhanced. These are effective for lowering the bit rate.

【図面の簡単な説明】[Brief description of drawings]

【図１】本発明の一実施例を示すブロック図。FIG. 1 is a block diagram showing an embodiment of the present invention.

【図２】図１の動作を説明するフローチャート。FIG. 2 is a flowchart illustrating the operation of FIG.

【図３】図１の構成における適応励振ゲインと統計励振
ゲインとの関係図。FIG. 3 is a diagram showing the relationship between the adaptive excitation gain and the statistical excitation gain in the configuration of FIG.

【図４】本発明にかかる復号化装置のブロック図。FIG. 4 is a block diagram of a decoding device according to the present invention.

【符号の説明】[Explanation of symbols]

３０２適応励振コードブック３０３統計励振コードブック３０４ＲＭＳ計算回路３０５ＶＱゲインコードブック３０６乗算器３０７乗算器３０８乗算器３０９加算器３１０合成フィルタ３１１ＬＰＣ逆量子化器３１２ＬＰＣ量子化器３１３声道分析回路３１４減算器３１５知覚フィルタ３１６知覚誤差計算回路 302 Adaptive excitation codebook 303 Statistical excitation codebook 304 RMS calculation circuit 305 VQ gain codebook 306 Multiplier 307 Multiplier 308 Multiplier 309 Adder 310 Synthesis filter 311 LPC dequantizer 312 LPC quantizer 313 Vocal tract analysis circuit 314 Subtractor 315 Perceptual filter 316 Perceptual error calculation circuit

Claims

【特許請求の範囲】[Claims]

【請求項１】過去の複数の原音声ベクトルに対する最
適な励振ベクトルを蓄積していて、それらを、選択制御
手段に制御されて、適応励振ベクトルとして出力する適
応励振コードブックと、複数の、予め定められた励振ベクトルを蓄積していて、
それらを、前記選択制御手段に制御されて、統計励振ベ
クトルとして出力する統計励振コードブックと、複数の、予め定められた励振ゲインを蓄積していて、そ
れらを、前記選択制御手段に制御されて、適応励振ゲイ
ン及び統計励振ゲインとして出力する励振ゲインコード
ブックと、前記選択制御手段に制御されて、前記適応励振コードブ
ック、前記統計励振コードブック、及び前記励振ゲイン
コードブックの出力に基づいて、複数の励振ベクトルを
作成する励振ベクトル合成手段と、原音声ベクトルの声道予測パラメータと各前記励振ベク
トルとに基づいて合成音声ベクトルを作成し、前記原音
声ベクトルと各前記合成音声ベクトルとの誤差ベクトル
を所定の評価手段によって評価し、この評価値が最小と
なる前記励振ベクトルを最適な励振ベクトルとして前記
適応励振コードブックに蓄積する前記選択制御手段とを
備え、最適な前記励振ベクトルに対応した、前記適応励振ベク
トルのインデックス、前記統計励振ベクトルのインデッ
クス、及び前記励振ゲインのインデックス、並びに前記
声道予測パラメータに関する、情報を符号化情報とする
コード励振線形予測符号化装置において、前記励振ベクトル合成手段は、前記適応励振ベクトルに
前記適応励振ゲインを乗算し、前記統計励振ベクトル
に、前記適応励振ベクトルのパワーの平方根に比例した
値及び前記統計励振ゲインとを乗算し、それぞれ乗算し
た後の適応励振ベクトルと統計励振ベクトルとを加算し
て各前記励振ベクトルを作成するものであることを特徴
としたコード励振線形予測符号化装置。1. An adaptive excitation codebook for accumulating optimum excitation vectors for a plurality of past original speech vectors, outputting them as adaptive excitation vectors under the control of a selection control means, and a plurality of pre-existing codebooks. Accumulating the defined excitation vector,
They are stored in a statistical excitation codebook controlled by the selection control means and output as a statistical excitation vector, and a plurality of predetermined excitation gains, which are controlled by the selection control means. , An excitation gain codebook that outputs as an adaptive excitation gain and a statistical excitation gain, and is controlled by the selection control means, based on the output of the adaptive excitation codebook, the statistical excitation codebook, and the excitation gain codebook, An excitation vector synthesizing means for creating a plurality of excitation vectors; and a synthetic speech vector based on the vocal tract prediction parameter of the original speech vector and each of the excitation vectors, and an error between the original speech vector and each of the synthesized speech vectors. The vector is evaluated by a predetermined evaluation means, and the excitation vector with the smallest evaluation value is optimized. The selection control means for accumulating in the adaptive excitation codebook as an excitation vector, the index of the adaptive excitation vector, the index of the statistical excitation vector, and the index of the excitation gain corresponding to the optimal excitation vector, and Regarding the vocal tract prediction parameter, in a code excitation linear predictive encoding device in which information is encoded information, the excitation vector synthesizing means multiplies the adaptive excitation vector by the adaptive excitation gain, and the statistical excitation vector, A value proportional to the square root of the power of the adaptive excitation vector and the statistical excitation gain are multiplied, and the adaptive excitation vector after the multiplication and the statistical excitation vector are added to create each of the excitation vectors. Characterized code-excited linear predictive coding device.