JP3101376B2

JP3101376B2 - Audio coding method

Info

Publication number: JP3101376B2
Application number: JP03323367A
Authority: JP
Inventors: 雅子加藤; 智彦谷口; 秀明栗原; 恭士大田; 文雄天野
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1991-12-06
Filing date: 1991-12-06
Publication date: 2000-10-23
Anticipated expiration: 2015-10-23
Also published as: JPH05158496A

Abstract

PURPOSE:To prevent characteristics from being degraded when inputting fine voice signals by selecting a zero vector as an optimum code word again when error signal power exceeds input voice signal power. CONSTITUTION:When preparing a code notebook by generating a noise sequence, a vector (defined as C0), for which all the elements of the code word are zero, is given to a noise code notebook 2. When the optimum code word is completely retrieved, an error power evaluation part 5 calculates error signal power Ec<2> of an auditive sense weighted regenerative signal obtained by the optimum code word and a quantized optimum code gain and the input voice signal and judges the levels of this error signal power Ec<2> and input voice signal power (AX)<2>. When the error signal power Ec<2> exceeds the input voice signal power (AX)<2>, the selected code word C is abandonned and the code vector Co, for which all the elements are zero, is selected as the optimum code word again. Thus, noise generation is prevented.

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は音声符号化方式に関し、
特に音声信号の情報圧縮伝送を行うためのＣＥＬＰ型音
声符号化方式に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a speech coding system,
In particular, the present invention relates to a CELP-type speech coding system for performing information compression transmission of a speech signal.

【０００２】近年、ディジタル信号処理技術の発展に伴
いディジタル通信網の発達及び普及には目覚ましいもの
がある。特に、ディジタル移動無線システム、企業内通
信システムなどにおいては、音声品質を劣化させること
なく情報の高能率な圧縮を行う音声符号化方式が要求さ
れている。In recent years, with the development of digital signal processing technology, the development and spread of digital communication networks have been remarkable. In particular, in a digital mobile radio system, an in-company communication system, and the like, a speech coding scheme for performing efficient compression of information without deteriorating speech quality is required.

【０００３】音声の予測符号化方式では、フレーム毎に
短期予測分析により抽出された短期予測係数と、長期予
測分析により抽出されたピッチ予測係数及びピッチ周期
とを入力信号にその短期予測フィルタ及び長期予測フィ
ルタの逆特性フィルタ（逆フィルタ）を施して得た予測
残差信号と共に多重化して伝送する方式が一般的であ
る。In the speech predictive coding method, a short-term prediction coefficient extracted by a short-term prediction analysis for each frame and a pitch prediction coefficient and a pitch period extracted by a long-term prediction analysis are input to an input signal as a short-term prediction filter and a long-term prediction filter. In general, a method of multiplexing and transmitting a prediction residual signal obtained by applying an inverse characteristic filter (inverse filter) of the prediction filter is used.

【０００４】この予測残差信号情報を効率的に伝送する
ために、予測残差ベクトルをベクトル量子化して、その
インデックスを伝送したり（コード駆動線型予測符号化
方式（Code Excited Linear Prediction）：ＣＥＬ
Ｐ）、予測残差ベクトルを有限個のパルス列でモデル化
し、最適なパルス位置、及びパルス振幅を伝送する方式
（マルチパルス駆動符号化方式：ＭＰＣ）等のいくつか
の方式があるが、伝送効率の点で特に前者のＣＥＬＰ方
式が注目を集めている。但し、このような音声符号化方
式では、雑音系列を音源とすることに起因する特性劣化
が見られ、そのような特性劣化を克服するための改善手
法が求められている。In order to efficiently transmit the prediction residual signal information, the prediction residual vector is vector-quantized and its index is transmitted (Code Excited Linear Prediction: CEL).
P), there are several schemes such as a scheme (multi-pulse drive encoding scheme: MPC) that models the prediction residual vector with a finite number of pulse trains and transmits the optimal pulse position and pulse amplitude. In particular, the former CELP system has attracted attention. However, in such a speech coding method, characteristic deterioration due to using a noise sequence as a sound source is seen, and an improvement method for overcoming such characteristic deterioration is required.

【０００５】[0005]

【従来の技術】ＣＥＬＰ方式は、入力音声データをフレ
ーム毎にベクトル量子化を行う音声符号化方式であり、
図４にその一般的な構成を示す。2. Description of the Related Art The CELP system is a speech coding system for performing vector quantization of input speech data for each frame.
FIG. 4 shows the general configuration.

【０００６】ＣＥＬＰ型音声符号器は、適応符号帳１及
び雑音符号帳２の二つの符号帳を持っており、適応符号
帳１は、入力音声の周期性成分の同定を行うためのもの
であり、前フレームの再生音声信号によって毎フレーム
更新されて行き、雑音符号帳２は、雑音系列、即ち非周
期性成分の音源情報の同定を行うためのものであり、予
め発生させた雑音系列を固定で持っている。The CELP type speech coder has two codebooks, an adaptive codebook 1 and a noise codebook 2, and the adaptive codebook 1 is for identifying a periodic component of the input speech. The noise codebook 2 is updated for each frame by the reproduced audio signal of the previous frame, and is used to identify the noise sequence, that is, the excitation information of the non-periodic component, and fixes the noise sequence generated in advance. Have.

【０００７】適応符号帳１のコードワード、即ちピッチ
ベクトルをＰ、ピッチゲインをｂとし、雑音符号帳２の
コードワード、即ちコードベクトルをＣ、コードゲイン
をｇとする。各符号帳のコードワード（Ｐ、Ｃ）に各々
のゲイン（ｂ、ｇ）を乗じて加算器３で加算することに
より駆動音源信号（ｂＰ＋ｇＣ）を生成し、更にこれを
聴覚重み付け（声道特性の擬似化）フィルタ４を通すこ
とにより重み付け再生信号（ｂＡＰ＋ｇＡＣ）が得られ
る。The codeword of the adaptive codebook 1, that is, the pitch vector is P, the pitch gain is b, the codeword of the noise codebook 2, that is, the code vector is C, and the code gain is g. The driving sound source signal (bP + gC) is generated by multiplying the codeword (P, C) of each codebook by the respective gains (b, g) and adding the result by the adder 3, and furthermore, the driving excitation signal (bP + gC) is weighted by hearing (vocal tract characteristics). By passing through a filter 4 to obtain a weighted reproduction signal (bAP + gAC).

【０００８】この重み付け再生信号と、聴覚重み付けさ
れた入力信号であるターゲットベクトル（ＡＸ）との間
の誤差信号の電力（Ｅ）を誤差電力評価部５で求め、誤
差電力評価部５ではこれを全てのコードワードについて
検索して行くことにより該誤差信号電力を最小にする駆
動音源信号を最適駆動音源信号として決定する。The error power evaluator 5 obtains the power (E) of the error signal between the weighted reproduced signal and the target vector (AX) which is the input signal weighted by the auditory sense. The drive excitation signal that minimizes the error signal power by searching for all codewords is determined as the optimal drive excitation signal.

【０００９】この検索の結果得られた最適駆動音源信号
（ｂＰ＋ｇＣ）を、フレーム遅延器６を通して適応符号
帳１に入力することで、次フレームのための適応符号帳
１の更新が行われる。By inputting the optimal driving excitation signal (bP + gC) obtained as a result of the search to adaptive codebook 1 through frame delay unit 6, adaptive codebook 1 for the next frame is updated.

【００１０】そして、符号化器としては、この最適駆動
音源信号を構成する適応符号帳１及び雑音符号帳２のコ
ードワードのインデックス、及び各符号帳のゲイン
（ｂ、ｇ）、及びＬＰＣ分析パラメータを量子化して復
号器側へ伝送する。[0010] As an encoder, the index of the codeword of the adaptive codebook 1 and the noise codebook 2 constituting the optimal driving excitation signal, the gain (b, g) of each codebook, and the LPC analysis parameter Is quantized and transmitted to the decoder side.

【００１１】復号器側でも適応符号帳と雑音符号帳とが
用意されており、伝送されてきたインデックスを基に各
符号帳の検索を行い、これを音源信号として、他に伝送
されてきた各符号帳のゲイン（ｂ、ｇ）やＬＰＣ分析パ
ラメータによって形成される合成系により再生信号を得
る。An adaptive codebook and a random codebook are also prepared on the decoder side, and each codebook is searched based on the transmitted index, and each of the codebooks is used as a sound source signal, and each of the other transmitted codebooks is searched for. A reproduction signal is obtained by a synthesis system formed by the gains (b, g) of the codebook and the LPC analysis parameters.

【００１２】図５は誤差電力評価部５における雑音符号
帳２の探索方式の構成を示したもので、ここでは雑音符
号帳２のコードワードと入力音声ベクトルとの間で最適
コードワード探索処理が行われ、最適コードワード
（Ｃ）が決定される。FIG. 5 shows a configuration of a search method for the random codebook 2 in the error power evaluation unit 5. Here, an optimum codeword search process is performed between the codeword of the random codebook 2 and the input speech vector. Is performed to determine the optimal codeword (C).

【００１３】即ち、まずコードワード（Ｃ）にコードゲ
イン（ｇ）を掛けて振幅調整し、重み付けフィルタ４を
通して、聴覚重み付けされた信号（ＡＣ）を生成する。
この信号と、重み付け入力信号であるターゲットベクト
ル（ＡＸ）との間の誤差信号の電力（Ｅ_C）を評価し、
誤差を最小にするようなコードワードを最適コードワー
ドとして決定する。That is, first, the code word (C) is multiplied by the code gain (g) to adjust the amplitude, and through the weighting filter 4, an auditory weighted signal (AC) is generated.
Evaluate the power (E _C ) of the error signal between this signal and the target vector (AX), which is the weighted input signal,
The codeword that minimizes the error is determined as the optimal codeword.

【００１４】また、コードゲインｇは、コードゲイン算
出部７において、コードワード（Ｃ）とターゲットベク
トル（ＡＸ）の相関値により算出される。The code gain g is calculated by the code gain calculator 7 based on the correlation value between the code word (C) and the target vector (AX).

【００１５】また聴覚重み付けフィルタ処理は、周波数
領域において人間の聴覚特性にもとづいた強調処理を音
声データに施す処理のことであり、フィルタの伝達関数
は次式によって表される。Ｈ(Z) ＝１／Ａ'(Z) Ａ'(Z)＝１−Σα_iγⁱｚ^-i （γ＝０．９） (1) The auditory weighting filter process is a process of applying an emphasis process based on human auditory characteristics to audio data in the frequency domain. The transfer function of the filter is represented by the following equation. H (Z) = 1 / A '(Z) A' (Z) = 1-Σα i γ i z -i (γ = 0.9) (1)

【００１６】ここで、最適コードワード選択処理は、誤
差信号電力（Ｅ_C）の評価式（次式(2) ）の最小化問題
に帰着できる。Ｅ_C ²＝（ＡＸ−ｇＡＣ）² (2) 但し、ＡＸ：重み付け入力音声ベクトルＡＣ：重み付け雑音系列（ストカスティック）コードベ
クトルｇ：コードゲインHere, the optimal codeword selection processing can be reduced to a problem of minimizing the evaluation expression (the following expression (2)) of the error signal power (E _C ). E _C ² = (AX−gAC) ² (2) where AX: weighted input speech vector AC: weighted noise sequence (stochastic) code vector g: code gain

【００１７】ここで、最適コードゲインｇ_optは、式
(2) をｇで微分して＝０と置くことにより求められる。ｄＥ_C ²／ｄｇ＝｛（ＡＸ）^T（ＡＸ）−２ｇ（ＡＸ）^T（ＡＣ）＋ｇ²（ＡＣ）^T（ＡＣ）｝／ｄｇ＝−２（ＡＸ）^T（ＡＣ）＋２ｇ（ＡＣ）^T（ＡＣ）＝０尚、^Tは転置行列を示す。Here, the optimum code gain g _opt is given by the following equation.
(2) is obtained by differentiating g with g and setting = 0. _{^{dE C 2 / dg = {(}} AX) T (AX) -2g (AX) T (AC) + g 2 (AC) T (AC)} / dg = -2 (AX) T (AC) + 2g (AC) T (AC) = 0 Note that ^T indicates a transposed matrix.

【００１８】これにより、 ∴ｇ_opt＝（ＡＸ）^T（ＡＣ）／（ＡＣ）^T（ＡＣ） (3) となる。Thus, ∴g _opt = (AX) ^T (AC) / (AC) ^T (AC) (3)

【００１９】この式(3) を式(2) に代入することにより
次式(4)を得る。 The following equation (4) is obtained by substituting equation (3) into equation (2).

【００２０】この式(4) は図６に示す雑音符号帳探索の
演算ブロック図の如く最大値評価部５０（これは誤差電
力評価部５と等価なものである。）において演算される
こととなる。This equation (4) is to be calculated in the maximum value evaluation section 50 (which is equivalent to the error power evaluation section 5) as shown in the operation block diagram of the random codebook search shown in FIG. Become.

【００２１】ここで式(4) を最小にするコードワードＣ
の選択について考えると、第一項はコードワードＣの検
索中一定であるため、第二項を最大にするＣを選択する
ことにより誤差信号電力Ｅ_C ²を最小にできるというこ
とが分かる。Here, the code word C that minimizes the expression (4)
Considering the choice of, it can be seen that the error signal power E _C ² can be minimized by selecting C, which maximizes the second term, since the first term is constant during the search for the codeword C.

【００２２】そこで、式(4) の第二項を、Ｅ_Vと置く
と、となる。Therefore, if the second term of equation (4) is set as E _V , Becomes

【００２３】そして、自己相関係数（ＡＣ）^T（ＡＣ）
をＲcc、相互相関係数（ＡＸ）^T（ＡＣ）をＲcxと書く
ことにすると、式(5) は次のように書き改められる。Ｅ_V＝（Ｒcx）²／（Ｒcc） (6) The autocorrelation coefficient (AC) ^T (AC)
Is written as Rcc and the cross-correlation coefficient (AX) ^T (AC) is written as Rcx, Equation (5) can be rewritten as follows. _{^{E V = (Rcx) 2 /}} (Rcc) (6)

【００２４】以上より、雑音符号帳探索では、最大値評
価部５０において式(6) の計算及び最大値評価を行うこ
とにより、最適コードワードを得ることができ、このよ
うに、各コードベクトルを用いて一旦音声信号を再生し
て、最も誤差の小さかったものを選ぶ方法は、合成によ
る分析手法（Analysis-by-Synthesis: A-b-S法）として
知られている。As described above, in the random codebook search, the maximum code evaluation unit 50 calculates the expression (6) and evaluates the maximum value, thereby obtaining the optimum codeword. The method of reproducing the audio signal once and selecting the one with the smallest error is known as an analysis method by synthesis (Analysis-by-Synthesis: AbS method).

【００２５】[0025]

【発明が解決しようとする課題】上記の最適コードワー
ド検索処理において、式(5) （式(4) の右辺第二項）は
常に正となり、従って次の式(7) が成立する。Ｅ_C ²＝（ＡＸ−ｇＡＣ）² ≦ （ＡＸ）² (7) 但し、等号成立はＡＸ⊥ＡＣのときである。In the above-mentioned optimum code word search processing, equation (5) (the second term on the right side of equation (4)) is always positive, and therefore the following equation (7) is established. E _C ² = (AX−gAC) ² ≦ (AX) ² (7) However, the equality holds when AX⊥AC.

【００２６】ところが、最適コードゲインｇを量子化し
てｇ' とした場合、誤差信号電力Ｅ _Cは、Ｅ_C ² ＝（ＡＸ）^T（ＡＸ）−２ｇ' （ＡＸ）^T（ＡＣ）＋ｇ'²（ＡＣ）^T（ＡＣ） (8) となり、ｇの量子化が粗い場合には、 −２ｇ' （ＡＸ）^T（ＡＣ）＋ｇ'²（ＡＣ）^T（ＡＣ）＞０ (9) すなわち、Ｅ_C ²＞（ＡＸ）² (10) となる場合が考えられ、これにより微小入力時であって
も、雑音符号帳２の非周期性成分ＡＣを用いたことで、
却って誤差電力が大きくなり、再生音声の特性が劣化す
るという問題が生ずる。However, when the optimal code gain g is quantized,
, The error signal power E _CIs E_C ^Two = (AX)^T(AX) -2g '(AX)^T(AC) + g '^Two(AC)^T(AC) (8) When the quantization of g is coarse, −2g ′ (AX)^T(AC) + g '^Two(AC)^T(AC)> 0 (9) That is, E_C ^Two> (AX)^Two (10) can be considered.
Also uses the non-periodic component AC of the random codebook 2,
On the contrary, the error power increases and the characteristics of the reproduced sound deteriorate.
Problem arises.

【００２７】従って本発明は、入力音声信号の周期性成
分及び非周期性成分をそれぞれ同定するための適応符号
帳及び雑音符号帳を用いて聴覚重み付けされた再生信号
を生成し、該再生信号と聴覚重み付けした入力音声信号
との誤差信号電力が最小となる該適応符号帳及び該雑音
符号帳の各最適コードワードのインデックス並びに各々
の最適コードゲインの量子化値を伝送する音声符号化方
式において、微小音声信号入力時の特性劣化を防ぐこと
を目的とする。Accordingly, the present invention generates a perceptually weighted reproduction signal using an adaptive codebook and a noise codebook for identifying a periodic component and an aperiodic component of an input speech signal, respectively. In an audio coding system for transmitting an index of each optimal codeword of the adaptive codebook and the noise codebook and a quantization value of each optimal code gain in which an error signal power with an input audio signal weighted by hearing is minimized, It is an object of the present invention to prevent characteristic deterioration when a minute audio signal is input.

【００２８】[0028]

【課題を解決するための手段及び作用】雑音符号帳は予
め発生させた雑音系列を持つ符号帳である。本発明に係
る音声符号化方式の符号器では、図１に示すように、雑
音系列を発生させて符号帳を作成する際に、コードワー
ドの全要素が零であるようなベクトル（Ｃ₀とする）を
雑音符号帳２に与えておくこととする。The noise codebook is a codebook having a noise sequence generated in advance. As shown in FIG. 1, in the encoder of the speech coding system according to the present invention, when a noise sequence is generated to create a codebook, a vector (C ₀ and C _{0) in} which all elements of a codeword are zero is generated. ) Is given to the random codebook 2.

【００２９】最適コードワードの検索終了後に、誤差電
力評価部５は、最適コードワード及び量子化された最適
コードゲインとによって得られる聴覚重み付けされた再
生信号と入力音声信号との誤差信号電力Ｅ_C ²を求める
と共に、この誤差信号電力Ｅ _C ²と入力音声信号電力
（ＡＸ）²との大小判定を行う。After the search for the optimum code word is completed, the error
The force evaluator 5 determines the optimal codeword and the quantized optimal
Auditory weighted re-gain obtained by code gain
Error signal power E between the raw signal and the input audio signal_C ^TwoAsk for
And the error signal power E _C ^TwoAnd input audio signal power
(AX)^TwoIs determined.

【００３０】この結果、上記の式(10)に示すように、誤
差信号電力Ｅ_C ²が入力音声信号電力（ＡＸ）²を上回
った場合には、入力音声信号は微小入力信号であり、こ
のままコードワードを選択することは雑音を発生させて
しまう。As a result, when the error signal power E _C ² exceeds the input audio signal power (AX) ^2, as shown in the above equation (10), the input audio signal is a minute input signal. Choosing a codeword creates noise.

【００３１】このため本発明の符号器では、選択された
コードワードＣを棄却し、全要素零であるコードベクト
ルＣ₀を最適コードワードとして選択し直すことによ
り、雑音発生を防止する。Therefore, in the encoder of the present invention, the occurrence of noise is prevented by rejecting the selected code word C and re-selecting the code vector C ₀ having all the elements zero as the optimum code word.

【００３２】このとき、復号器側へは、Ｃ₀のインデッ
クスと、量子化コードゲインｇ' を伝送する。At this time, the index of C ₀ and the quantized code gain g ′ are transmitted to the decoder side.

【００３３】そして、復号器でも符号器側と同じ雑音符
号帳が用意されており、送られてきたコードベクトルＣ
₀のインデックスにより雑音符号帳からコードベクトル
Ｃ₀を取り出し、これに量子化コードゲインｇ’を逆量
子化して得られたコードゲインｇを掛け、これらと適応
符号帳のインデックス及び量子化コードゲインとにより
駆動音源信号を生成し、この駆動音源信号を再生フィル
タを通して再生音声信号を生成することができる。In the decoder, the same noise codebook as that of the encoder side is prepared, and the transmitted code vector C
_The code vector C ₀ is extracted from the noise codebook by an index of _0, and the obtained code vector is multiplied by the code gain g obtained by inversely quantizing the quantized code gain g ′, and these are multiplied by the adaptive codebook index and the quantized code gain. , A driving sound source signal is generated, and the driving sound source signal can be generated through a reproduction filter to generate a reproduced sound signal.

【００３４】[0034]

【実施例】本発明に係る音声符号化方式に用いる符号器
の実施例が図２に示されており、この実施例の構成は、
雑音符号帳２に一個だけ全要素零のコードベクトルＣ₀
を与えておき、以下の如く誤差電力評価部５において異
なった探索処理を行う点が図５の従来例と異なってい
る。FIG. 2 shows an embodiment of an encoder used for a speech coding system according to the present invention.
In the random codebook 2, only one code vector C _{0 of} all element zeros
Is different from the conventional example of FIG. 5 in that a different search process is performed in the error power evaluation unit 5 as described below.

【００３５】このような雑音符号帳２から得られる聴覚
重み付けした再生ベクトル信号（ｇＡＣ）と聴覚重み付
けされた入力音声ベクトル（ＡＸ）との誤差信号電力Ｅ
_Cが最小となる最適コードベクル（Ｃ_opt）を上記の式
(4) に対応する次の式(11)に従って誤差電力評価部５が
探索する。 The error signal power E between the perceptually weighted reproduced vector signal (gAC) obtained from the noise codebook 2 and the perceptually weighted input speech vector (AX)
_The optimal code vector (C _opt ) that minimizes _C is calculated using the above equation.
The error power evaluation unit 5 searches according to the following equation (11) corresponding to (4).

【００３６】最適コードベクトル（Ｃ_opt）が選択され
た後、上記の式(3) に対応する次の式(12)に従って最適
コードゲイン（ｇ_opt) が求められる。ｇ_opt＝（ＡＸ）^T（ＡＣ_opt）／（ＡＣ_opt）^T（ＡＣ_opt） (12)After the optimum code vector (C _opt ) is selected, the optimum code gain (g _opt ) is obtained according to the following equation (12) corresponding to the above equation (3). g _opt = (AX) ^T (AC _opt ) / (AC _opt ) ^T (AC _opt ) (12)

【００３７】そして、この最適コードゲイン（ｇ_opt)
を量子化することにより、量子化最適コードゲイン（ｇ
_opt' ）が得られ、これらのｇ_opt' とＣ_optを用い
て、上記の式(10)の判定（誤差信号電力Ｅ_C ²が入力音
声信号電力（ＡＸ）²を超える値になっていないかどう
かの判定）を行うが、この式(10)の判定は式(9) の判定
を行うことであるので、この式(9) に対応する次の式(1
3)の正負判定を行う。Then, the optimum code gain (g _opt )
Is quantized to obtain a quantization optimum code gain (g
_opt ′), and using these g _opt ′ and C _opt , the determination of the above equation (10) (error signal power E _C ² does not exceed input audio signal power (AX) ² ) (10) is determined by the equation (9). Therefore, the following equation (1) corresponding to the equation (9) is used.
Perform the positive / negative judgment in 3).

【００３８】 −２ｇ_opt ^'( ＡＸ）^T（ＡＣ_opt）＋ｇ_opt'²（ＡＣ_opt）^T（ＡＣ_opt） (13)−2g _opt ^′ (AX) ^T (AC _opt ) + g _opt ′ ² (AC _opt ) ^T (AC _opt ) (13)

【００３９】これは、次式(14)が成立しているかどうか
と等価である。ｇ'²・Ｒcc−２ｇ' ・Ｒcx＜０ (14)This is equivalent to whether the following equation (14) holds. g ' ² · Rcc-2g' · Rcx <0 (14)

【００４０】そして、式(14)が成立していることが判明
した場合は、選択された最適コードワード（Ｃ_opt）の
インデックスと、量子化最適コードゲインｇ_opt' を復
号器へ伝送するが、次式(15)のように、ｇ_opt'²・Ｒcc−２ｇ_opt' ・Ｒcx＞０ (15) となった場合は、誤差信号電力Ｅ_C ²が入力音声信号電
力（ＡＸ）²をこえてしまっており雑音を発生させてし
まうので、誤差電力評価部５は、選択されたコードワー
ド（Ｃ_opt）を棄却し、全要素零であるコードベクトル
（Ｃ₀）を最適コードワードとして選択し直して、復号
器側へはコードベクトル（Ｃ₀）のインデックスと、量
子化最適コードゲイン（ｇ_opt' ）を伝送する。If it is found that the equation (14) holds, the index of the selected optimum codeword (C _opt ) and the quantization optimum code gain g _opt ′ are transmitted to the decoder. If g _opt ' ² · Rcc−2g _opt ' · Rcx> 0 (15) as in the following equation (15), the error signal power E _C ² exceeds the input voice signal power (AX) ² . Therefore, the error power evaluator 5 rejects the selected codeword (C _opt ) and selects a code vector (C ₀ ) having all zeros as the optimal code word. Then, the index of the code vector (C ₀ ) and the quantization optimum code gain (g _opt ′) are transmitted to the decoder side.

【００４１】尚、雑音の発生を防止するためには、コー
ドゲイン（ｇ' ）を零にする方法と、最適コードワード
（Ｃ_opt）を零にする方法の二つが考えられるが、ゲイ
ン量子化器は量子化感度が高く、この量子化レベルに対
して零レベルを設けることは量子化の効率を下げること
につながるので、選択されたコードワード（Ｃ_opt）を
棄却し、全要素零であるコードベクトル（Ｃ₀）を最適
コードワードとして選択し直すこととしている。In order to prevent the generation of noise, there are two methods, a method of making the code gain (g ') zero and a method of making the optimum code word (C _opt ) zero. Since the detector is highly sensitive to quantization and providing a zero level for this quantization level reduces the efficiency of quantization, the selected codeword (C _opt ) is rejected and all elements are zero. The code vector (C ₀ ) is to be selected again as the optimum code word.

【００４２】図３は復号器の構成が示されており、この
復号器においても符号器と同じ適応符号帳１と雑音符号
帳２とが用意されており、図２に示した符号器から送ら
れて来た各符号帳１，２の最適コードワードのインデッ
クスにより最適コードワードＰ_opt及びＣ_optが選択さ
れ、同様にして送られてきた各コードゲインの量子化値
ｂ’，ｇ’により得られるコードゲインｂ，ｇに最適コ
ードワードＰ_opt及びＣ_optをそれぞれ乗じて得られる
ｂ_optＰ_opt及びｇ_optＣ_optを加算器１０で加算して
駆動音源信号Ｘを生成し、これをＬＰＣ再生フィルタ
（聴覚重み付けフィルタ）１１を通すことにより再生音
声信号が得られることとなる。尚、適応符号帳１は符号
器と同じようにフレーム遅延器１２により前フレームの
周期性成分で更新される。FIG. 3 shows the configuration of a decoder. In this decoder, the same adaptive codebook 1 and noise codebook 2 as those of the encoder are prepared, and transmitted from the encoder shown in FIG. The optimum codewords P _opt and C _opt are selected by the index of the optimum code word of each of the codebooks 1 and 2 obtained, and are obtained by the quantized values b ′ and g ′ of each code gain transmitted in the same manner. B _opt P _opt and g _opt C _opt obtained by multiplying the obtained code gains b and g by the optimum code words P _opt and C _opt , respectively, are added by an adder 10 to generate a driving sound source signal X, which is reproduced by LPC By passing through a filter (auditory weighting filter) 11, a reproduced audio signal is obtained. Note that the adaptive codebook 1 is updated by the frame delay unit 12 with the periodicity component of the previous frame, similarly to the encoder.

【００４３】[0043]

【発明の効果】以上述べたように、本発明に係る音声符
号化方式によれば、雑音符号帳が１個の零ベクトルを含
み、最適コードワードの検索終了後に、最適コードワー
ド及び量子化された最適コードゲインを用いて誤差信号
電力と入力音声信号電力との大小判定を行い、該誤差信
号電力が該入力音声信号電力を上回った場合には、該零
ベクトルを最適コードワードとして選択し直すように構
成したので、微小音声入力時における誤差電力の増大を
抑制することができ、再生音声の特性の劣化を防止する
ことができる。As described above, according to the speech coding method of the present invention, the random codebook includes one zero vector, and after the search for the optimum codeword is completed, the optimum codeword and the quantized codeword are quantized. The magnitude of the error signal power and the input voice signal power are determined using the optimized code gain, and when the error signal power exceeds the input voice signal power, the zero vector is selected again as the optimal codeword. With such a configuration, it is possible to suppress an increase in error power when a minute voice is input, and to prevent deterioration in characteristics of a reproduced voice.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明に係る音声符号化方式の符号化原理を示
したブロック図である。FIG. 1 is a block diagram showing an encoding principle of a speech encoding method according to the present invention.

【図２】本発明に係る音声符号化方式における符号器の
一実施例を示したブロック図である。FIG. 2 is a block diagram showing one embodiment of an encoder in a speech encoding system according to the present invention.

【図３】本発明に係る音声符号化方式における復号器の
一実施例を示したブロック図である。FIG. 3 is a block diagram showing an embodiment of a decoder in the speech coding system according to the present invention.

【図４】従来より知られたＣＥＬＰ型音声符号器を示し
たブロック図である。FIG. 4 is a block diagram showing a conventionally known CELP type speech encoder.

【図５】従来より知られたＣＥＬＰ型音声符号化方式に
おける雑音符号帳の探索処理部分を示したブロック図で
ある。FIG. 5 is a block diagram showing a noise codebook search processing part in a conventionally known CELP type speech coding scheme.

【図６】従来より知られたＣＥＬＰ型音声符号化方式に
おける雑音符号帳の探索演算過程を説明するためのブロ
ック図である。FIG. 6 is a block diagram for explaining a noise codebook search operation process in a conventionally known CELP-type speech coding scheme.

【符号の説明】[Explanation of symbols]

１適応符号帳２雑音符号帳４聴覚重み付けフィルタ５誤差電力評価部７コードゲイン算出部図中、同一符号は同一又は相当部分を示す。 DESCRIPTION OF SYMBOLS 1 Adaptive codebook 2 Noise codebook 4 Auditory weighting filter 5 Error power evaluation part 7 Code gain calculation part In the figure, the same code shows the same or corresponding part.

───────────────────────────────────────────────────── フロントページの続き (72)発明者大田恭士神奈川県川崎市中原区上小田中1015番地富士通株式会社内 (72)発明者天野文雄神奈川県川崎市中原区上小田中1015番地富士通株式会社内 (56)参考文献特開平３−33900（ＪＰ，Ａ) 特開平３−101800（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) G10L 19/00 - 19/14 H03M 7/30 H04B 14/04 ＪＩＣＳＴファイル（ＪＯＩＳ)──────────────────────────────────────────────────続き Continuing on the front page (72) Inventor Kyoji Ota 1015 Uedanaka, Nakahara-ku, Kawasaki City, Kanagawa Prefecture Inside Fujitsu Limited (72) Inventor Fumio Amano 1015 Uedanaka, Nakahara-ku, Kawasaki City, Kanagawa Prefecture Fujitsu Limited (56) References JP-A-3-33900 (JP, A) JP-A-3-101800 (JP, A) (58) Fields investigated (Int. Cl. ⁷ , DB name) G10L 19/00-19 / 14 H03M 7/30 H04B 14/04 JICST file (JOIS)

Claims

(57)【特許請求の範囲】(57) [Claims]

【請求項１】入力音声信号の周期性成分及び非周期性
成分をそれぞれ同定するための適応符号帳及び雑音符号
帳を用いて聴覚重み付けされた再生信号を生成し、該再
生信号と聴覚重み付けした入力音声信号との誤差信号電
力が最小となる該適応符号帳及び該雑音符号帳の各最適
コードワードのインデックス並びに各々の最適コードゲ
インの量子化値を伝送する音声符号化方式において、該雑音符号帳が１個の零ベクトルを含み、最適コードワ
ードの検索終了後に、該最適コードワード及び該量子化
された最適コードゲインを用いて該誤差信号電力と入力
音声信号電力との大小判定を行い、該誤差信号電力が該
入力音声信号電力を上回った場合には、該零ベクトルを
最適コードワードとして選択し直すことを特徴とした音
声符号化方式。1. An audio-weighted reproduction signal is generated using an adaptive codebook and a noise codebook for identifying a periodic component and an aperiodic component of an input audio signal, respectively, and the reproduction signal and the audio-weighting are generated. An audio coding system for transmitting an index of each optimal codeword of the adaptive codebook and the noise codebook and a quantization value of each optimal code gain, in which an error signal power with respect to an input audio signal is minimized. The book includes one zero vector, and after the search for the optimal codeword is completed, the magnitude of the error signal power and the input speech signal power are determined using the optimal codeword and the quantized optimal code gain, When the error signal power exceeds the input voice signal power, the zero vector is selected again as an optimal codeword.

【請求項２】請求項１に記載の音声符号化方式と同じ
適応符号帳と雑音符号帳とを有し、該符号化側から伝送
された各符号帳のインデックス及び各最適コードゲイン
の量子化値により最適駆動音源信号を生成し、該最適駆
動音源信号を再生フィルタにより再生音声信号とするこ
とを特徴とした音声復号化方式。2. A quantization apparatus according to claim 1, further comprising an adaptive codebook and a noise codebook which are the same as those of the speech coding method according to claim 1, wherein the index of each codebook and the optimum code gain transmitted from the encoding side are quantized. A speech decoding method characterized by generating an optimum driving sound source signal based on a value, and using the optimum driving sound source signal as a reproduction sound signal by a reproduction filter.