JP3335650B2

JP3335650B2 - Audio coding method

Info

Publication number: JP3335650B2
Application number: JP15656591A
Authority: JP
Inventors: 一範小澤
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1991-06-27
Filing date: 1991-06-27
Publication date: 2002-10-21
Anticipated expiration: 2017-10-21
Also published as: JPH056200A

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は音声信号を低いビットレ
ート、特に８ｋｂ／ｓ以下で高品質に符号化するための
音声符号化方式に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a speech encoding system for encoding a speech signal at a low bit rate, particularly at a high quality of 8 kb / s or less.

【０００２】[0002]

【従来の技術】音声信号を８ｋｂ／ｓ以下の低いビット
レートで符号化する方式としては、例えば、Ｍ．Ｓｃｈ
ｒｏｅｄｅｒａｎｄＢ．Ａｔａｌ氏による“Ｃｏｄ
ｅ−ｅｘｃｉｔｅｄｌｉｎｅａｒｐｒｅｄｉｃｔｉ
ｏｎ：Ｈｉｇｈｑｕａｌｉｔｙｓｐｅｅｃｈａｔ
ｖｅｒｙｌｏｗｂｉｔｒａｔｅｓ”（Ｐｒｏ
ｃ．ＩＣＡＳＳＰ，ｐｐ．９３７−９４０，１９８５
年）と題した論文（文献１）や、Ｋｌｅｉｊｎ氏らによ
る“Ｉｍｐｒｏｖｅｄｓｐｅｅｃｈｑｕａｌｉｔｙ
ａｎｄｅｆｆｉｃｉｅｎｔｖｅｃｔｏｒｑｕａ
ｎｔｉｚａｔｉｏｎｉｎＳＥＬＰ”（Ｐｒｏｃ．ＩＣ
ＡＳＳＰ，ｐｐ．１５５−１５８，１９８８年）と題し
た論文（文献２）等に記載されているＣＥＬＰ（Ｃｏｄ
ｅＥｘｃｉｔｅｄＬＰＣＣｏｄｉｎｇ）が知られて
いる。この方法では、送信側では、フレーム毎（例えば
２０ｍｓ）に音声信号から音声信号のスペクトル特性を
表すスペクトルパラメータを抽出し、フレームをさらに
サブフレーム（例えば５ｍｓ）に分割し、サブフレーム
毎に過去の音源信号をもとに適応コードブックにおける
パラメータ（遅延パラメータとゲインパラメータ）を抽
出し、適応コードブックによりサブフレームの音声信号
をピッチ予測し、ピッチ予測して求めた残差信号に対し
て、予め定められた種類の雑音信号からなる音源コード
ブックから最適音源コードベクトルを選択し最適なゲイ
ンを計算する。音源コードベクトルの選択の仕方は、選
択した雑音信号により合成した信号と、残差信号との誤
差電力を最小化するように一種類の雑音信号を選択す
る。そして選択されたコードベクトルの種類を表すイン
デクスとゲインならびに、前記スペクトルパラメータと
適応コードブックのパラメータを伝送する。受信側の説
明は省略する。2. Description of the Related Art As a method of encoding a speech signal at a low bit rate of 8 kb / s or less, for example, M. Sch
roeder and B.R. "Cod by Atal
e-excited linear predictic
on: High quality speech at
very low bit rates ”(Pro
c. ICASSP, pp. 937-940, 1985
), And "Improved speech quality" by Kleijn et al.
and efficient vector qua
ntization in SELP "(Proc. IC
ASSP, pp. 155-158, 1988) (Reference 2).
eExcited LPC Coding) is known. In this method, the transmitting side extracts a spectrum parameter representing a spectrum characteristic of a voice signal from a voice signal for each frame (for example, 20 ms), further divides the frame into subframes (for example, 5 ms), and The parameters (delay parameter and gain parameter) in the adaptive codebook are extracted based on the sound source signal, the pitch of the speech signal of the subframe is predicted by the adaptive codebook, and the residual signal obtained by pitch prediction is determined in advance. An optimal excitation code vector is selected from an excitation codebook including a predetermined type of noise signal, and an optimal gain is calculated. The method of selecting the excitation code vector is to select one type of noise signal so as to minimize the error power between the signal synthesized from the selected noise signal and the residual signal. Then, an index and a gain representing the type of the selected code vector, the spectrum parameters and the parameters of the adaptive codebook are transmitted. Description on the receiving side is omitted.

【０００３】[0003]

【発明が解決しようとする課題】上述した文献１，２の
従来方式では、良好な音質を得るためには音源コードブ
ックのサイズが十分大きい（例えば１０ビット）必要が
あった。このため、音源コードブックの探索に膨大な演
算量を必要とした。さらにメモリ量も膨大であり（例え
ば１０ビット４０次元の場合は、４０ｋワードのメモリ
量）、ハードウェアをコンパクトに実現するのが困難で
あった。また、ビットレートを低減するために、フレー
ム長，サブフレーム長を増大し、コードブックのビット
数を低減せずに次元数を増大すると演算量はきわめて顕
著に増加するという問題点があった。In the conventional methods described in the above-mentioned references 1 and 2, the size of the sound source codebook needs to be sufficiently large (for example, 10 bits) in order to obtain good sound quality. Therefore, an enormous amount of computation was required to search for the sound source codebook. Furthermore, the amount of memory is enormous (for example, in the case of 10 bits and 40 dimensions, the amount of memory is 40 k words), and it has been difficult to realize compact hardware. Further, when the frame length and the sub-frame length are increased to reduce the bit rate, and the number of dimensions is increased without reducing the number of bits of the codebook, there is a problem that the amount of calculation increases significantly.

【０００４】コードブックのサイズを低減する方法とし
て、例えばＢ．Ｊｕａｎｇ氏らによる“Ｍｕｌｔｉｐｌ
ｅｓｔａｇｅｖｅｃｔｏｒｑｕａｎｔｉｚａｔｉ
ｏｎｆｏｒｓｐｅｅｃｈｃｏｄｉｎｇ”（Ｐｒｏ
ｃ．ＩＣＡＳＳＰ，ｐｐ．５９７−６００，１９８２
年）と題した論文（文献３）等に記載されているよう
に、コードブックを多段に分割して構成し、各コードブ
ックを独立に探索する多段ベクトル量子化法が知られて
いる。この方法では、コードブックが複数段に分割され
ているので、１段あたりのコードブックのサイズは例え
ば、Ｂ／Ｌビット（ここでＢは全体のビット数、Ｌは段
数）に低減されるため、コードブック探索に要する演算
量は、Ｌ段全体でも、Ｂビット１段に比べ、Ｌ×２^B/L
に低減される。また、コードブック格納に必要なメモリ
量も同様に低減される。しかし、この方法では、各段の
コードブックを独立に学習，探索しているので、Ｂビッ
ト１段に比べ、性能は大きく低下するという問題点があ
った。[0004] As a method of reducing the size of the code book, for example, B.I. "Multipl by Juang et al.
e stage vector quantizati
onfor speech coding ”(Pro
c. ICASSP, pp. 597-600, 1982
As described in a paper entitled “Year” (Reference 3) and the like, a multi-stage vector quantization method in which a code book is divided into multiple stages and configured to search each code book independently is known. In this method, since the codebook is divided into a plurality of stages, the size of the codebook per stage is reduced to, for example, B / L bits (where B is the total number of bits and L is the number of stages). , The amount of computation required for codebook search is L × 2 ^{B / L} compared to one B-bit stage in the entire L stages.
To be reduced. Also, the amount of memory required to store the codebook is similarly reduced. However, in this method, since the codebook of each stage is independently learned and searched, there is a problem that the performance is greatly reduced as compared with one stage of B bits.

【０００５】本発明の目的は、上述した問題点を解決
し、比較的少ない演算量及びメモリ量により、８ｋｂ／
ｓ以下、特に４．８ｋｂ／ｓ以下で音質の良好な音声符
号化方式を提供することにある。SUMMARY OF THE INVENTION An object of the present invention is to solve the above-mentioned problems and to provide a relatively small amount of calculation and a small amount of memory, thereby achieving an 8 kb / s
It is an object of the present invention to provide a speech coding system with good sound quality at s or less, especially at 4.8 kb / s or less.

【０００６】[0006]

【０００７】[0007]

【課題を解決するための手段】本発明は、入力した音声
信号を予め定められた時間長のフレームに分割し、前記
音声信号のスペクトル包絡を表すスペクトルパラメータ
を求め、前記フレームを予め定められた時間長のサブフ
レームに分割し、前記サブフレーム毎に過去の音源信号
をもとにして適応コードブックにおけるパラメータを求
め、予め構成された音源コードブックから最適なコード
ベクトルを探索して前記音声信号を量子化する音声符号
化方式において、前記音源コードブックは次元数が前記
サブフレーム長よりも短い複数のコードブックの多段縦
続接続で構成され、少なくとも一つの段では量子化歪の
小さい順にコードベクトルを複数候補出力しサブフレー
ム全体で歪を累積し、前記累積歪を最小化する候補の組
を出力することにより前記音声信号を量子化することを
特徴とする。 According to the present invention, an input audio signal is divided into frames of a predetermined time length, spectrum parameters representing a spectrum envelope of the audio signal are obtained, and the frame is determined in advance. Dividing into time-length subframes, obtaining parameters in the adaptive codebook based on past excitation signals for each subframe, searching for an optimal code vector from a preconfigured excitation codebook, In the speech coding method for quantizing the sound source codebook, the excitation codebook is configured by a multistage cascade connection of a plurality of codebooks whose dimensionality is shorter than the subframe length, and at least one stage has a code vector in the order of smaller quantization distortion. Output a plurality of candidates, accumulate distortion over the entire subframe, and output a set of candidates for minimizing the accumulated distortion. The audio signal, wherein the quantizing Ri.

【０００８】[0008]

【作用】本発明による音声符号化方式の作用を示す。The operation of the speech coding system according to the present invention will be described.

【０００９】図１は、本発明に関連する発明の構成を示
すブロック図である。図において、音声信号をフレーム
（例えば３０ｍｓ）に分割し、スペクトルパラメータ計
算部１５０において、周知のＬＰＣ分析によりスペクト
ルパラメータとして、ＬＰＣパラメータを求める。さら
にフレームの音声をサブフレーム（例えば７．５ｍｓ）
に分割する。FIG. 1 is a block diagram showing the configuration of the invention related to the present invention . In the figure, an audio signal is divided into frames (for example, 30 ms), and an LPC parameter is obtained as a spectrum parameter by a known LPC analysis in a spectrum parameter calculation unit 150. In addition, the audio of the frame is converted to a sub-frame
Divided into

【００１０】次に、適応コードブック部１６０では、サ
ブフレーム毎の音声信号に対して、過去のサブフレーム
で計算した音源信号をもとにパラメータとして遅延Ｔ，
ゲインβを求める。適応コードブックのパラメータの計
算法は、前記文献１，２等を参照できる。そして下式に
より適応コードブックによる予測値を減算する。Next, adaptive codebook section 160 applies a delay T, as a parameter to the audio signal for each subframe as a parameter based on the excitation signal calculated in the past subframe.
Find the gain β. The method of calculating the parameters of the adaptive codebook can be referred to the above-mentioned references 1, 2 and the like. Then, the prediction value by the adaptive codebook is subtracted by the following equation.

【００１１】[0011]

【数１】 (Equation 1)

【００１２】本発明に関連する発明の特徴は、音源コー
ドブックが複数個の異なるコードブックの多段縦続接続
により構成され、少なくとも一つの段において歪の小さ
い順に複数個のコードブック候補を出力することにあ
る。以下では、簡単のために、２段のコードブックの縦
続接続により音源コードブックが構成されるとする。音
源コードブック探索部１７０は、あらかじめ学習して構
成したコードブック１８０₁ ，１８０₂ を用いて、ｅ_w
（ｎ）を表すコードベクトルの候補を出力する。この動
作を以下で説明する。以下では初段においてＭ種類のコ
ードベクトル候補を出力するものとする。まず、初段で
はコードブック１を用いて下式により量子化歪を求め
る。 A feature of the invention related to the present invention is that the sound source codebook is constituted by a plurality of cascade connections of a plurality of different codebooks, and at least one stage outputs a plurality of codebook candidates in ascending order of distortion. It is in. Hereinafter, for simplicity, it is assumed that a sound source codebook is configured by cascade connection of two-stage codebooks. The sound source codebook search unit 170 uses the codebooks 180 ₁ and 180 ₂ that have been learned in advance to construct e _w
A candidate for a code vector representing (n) is output. This operation will be described below. Hereinafter, it is assumed that M types of code vector candidates are output in the first stage. First, in the first stage, the quantization distortion is obtained using the codebook 1 by the following equation.

【００１３】[0013]

【数２】 (Equation 2)

【００１４】ここでγ_1j，ｃ_1j（ｎ），ｈ_w（ｎ），Ｎ
はそれぞれ、最適ゲイン，コードブック１のｊ番目のコ
ードベクトル（ｊ＝１〜２^B1：Ｂ１はコードブック１の
ビット数），スペクトルパラメータから求めた聴感重み
付けインパルス応答，サブフレーム長を表すサンプル数
である。ここで聴感重み付けインパルス応答の求め方は
前記文献１，２等を参照できる。記号＊は畳み込み積分
を示す。Ｎはサブフレーム長を示すサンプル数である。
初段では、数２の量子化歪の小さい順に複数種類（例え
ばＭ種類）のコードベクトルを求め、このときの各候補
の量子化歪を記憶する。また、各候補に対して下式によ
り誤差信号を計算する。Here, γ _1j , c _1j (n), h _w (n), N
Are the optimal gain, the j-th code vector of codebook 1 (j = 1 to 2 ^B1 : B1 is the number of bits of codebook 1), the perceptual weighting impulse response obtained from the spectral parameters, and the number of samples representing the subframe length. It is. Here, the method of obtaining the auditory weighting impulse response can be referred to the above-mentioned documents 1 and 2. The symbol * indicates convolution integration. N is the number of samples indicating the subframe length.
In the first stage, a plurality of types (for example, M types) of code vectors are obtained in ascending order of the quantization distortion of Expression 2, and the quantization distortion of each candidate at this time is stored. An error signal is calculated for each candidate by the following equation.

【００１５】[0015]

【数３】 (Equation 3)

【００１６】ここでｅ_w′_k（ｎ）は、ｋ番目の候補に
対して計算した誤差信号を示す。Here, e _w ′ _k (n) indicates an error signal calculated for the k-th candidate.

【００１７】次に２段目では、コードブック１８０₂を
用いて初段の各候補に対して下式を最小化する最適コー
ドベクトルを求め、そのときの量子化歪を、対応する初
段での候補の歪に加算して累積歪を求める。[0017] In the next second step, determine the optimum code vector minimizing the following equation with respect to each candidate of the first stage using a codebook 180 _2, the quantization distortion at that time, candidates for the corresponding first stage To obtain the cumulative distortion.

【００１８】[0018]

【数４】 (Equation 4)

【００１９】ただし、ｌ＝１〜２^B2：Ｂ２はコードブッ
ク２のビット数を示す。また、Ｂ＝Ｂ１＋Ｂ２：Ｂは２
段のコードブック全体のビット数である。Here, l = 1 to 2 ^B2 : B2 indicates the number of bits of the codebook 2. B = B1 + B2: B is 2
This is the number of bits in the entire codebook of the column.

【００２０】次に、前記累積歪を最小化する初段，２段
目の候補の組合せを選択し、選択されたコードベクトル
を出力する。Next, a combination of first-stage and second-stage candidates for minimizing the cumulative distortion is selected, and the selected code vector is output.

【００２１】なお、候補の組合せの選択の尺度には、累
積歪以外にも、数４で示される２段目の量子化歪を最小
化する候補の組合せを選択してもよい。また、縦続接続
の段数は任意に設定できる。As a measure for selecting a combination of candidates, a combination of candidates that minimizes the second-stage quantization distortion shown in Expression 4 may be selected in addition to the cumulative distortion. The number of cascade connections can be set arbitrarily.

【００２２】なお、コードブック１，２としては、前記
文献１，２のようにガウス乱数信号から構成してもよい
し、あらかじめ多量のトレーニング信号を用いて学習し
て構成してもよい。後者の構成法については、例えば特
願平２−４２９５６号明細書（文献４）等を参照でき
る。以上で本発明に関連する発明の作用の説明を終え
る。The codebooks 1 and 2 may be composed of Gaussian random number signals as in the above-mentioned documents 1 and 2, or may be constructed by learning using a large amount of training signals in advance. For the latter configuration method, reference can be made, for example, to Japanese Patent Application No. 2-42956 (Document 4). This concludes the description of the operation of the invention relating to the present invention .

【００２３】さらに性能を改善するためには、複数種類
の候補を出力するという条件でコードブック全体で最適
になるように、各段のコードブックを学習してもよい。In order to further improve the performance, the codebook at each stage may be learned so as to be optimal for the entire codebook under the condition that a plurality of types of candidates are output.

【００２４】次に、本発明の作用を図２を用いて説明す
る。図２において図１と同一の番号を付した構成要素
は、同一の動作を行うので説明は省略する。本発明で
は、音源コードブックの構成と探索法が本発明に関連す
る発明と異なるので、この点について説明する。音源コ
ードブックの次元数はサブフレーム長よりも短くする。
以下では一例として、次元数はサブフレーム長Ｎの１／
２のＮ／２とする。従って、コードブック２１０₁ ，コ
ードブック２１０₂ の次元数はＮ／２とする。また、縦
続接続の段数は以下では一例として２とする。Next, the operation of the present invention will be described with reference to FIG. In FIG. 2, components denoted by the same reference numerals as those in FIG. 1 perform the same operations, and thus description thereof will be omitted. In the present invention, the structure of the sound source codebook and the search method are related to the present invention.
Invention differs so that this point will be described. The number of dimensions of the sound source codebook is shorter than the subframe length.
In the following, as an example, the number of dimensions is 1/1 / subframe length N.
2 N / 2. Therefore, the number of dimensions of the codebook 210 ₁ and the codebook 210 ₂ is N / 2. In addition, the number of stages of the cascade connection is set to 2 as an example below.

【００２５】まず、サブフレームの前半Ｎ／２サンプル
に対して、初段ではコードブック１を用いて下式により
量子化歪を求める。First, for the first half of the N / 2 samples of the subframe, quantization distortion is obtained by the following equation using the codebook 1 in the first stage.

【００２６】[0026]

【数５】 (Equation 5)

【００２７】ここでγ_1j，ｃ_1j（ｎ），ｈ_w（ｎ）はそ
れぞれ、最適ゲイン，コードブック１のｊ番目のコード
ベクトル（ｊ＝１〜２^B1：Ｂ１はコードブック１のビッ
ト数），スペクトルパラメータから求めた聴感重み付け
インパルス応答である。初段では、数５の量子化歪の小
さい順に複数種類（例えばＭ種類）のコードベクトルを
求め、このときの各候補の量子化歪を記憶する。また、
各候補に対して下式により誤差信号を計算する。Here, γ _1j , c _1j (n) and h _w (n) are the optimum gain and the j-th code vector of codebook 1 (j = 1 to 2 ^B1 : B1 is the number of bits of codebook 1). ), The auditory weighting impulse response obtained from the spectrum parameter. In the first stage, a plurality of types (for example, M types) of code vectors are obtained in ascending order of the quantization distortion of Expression 5, and the quantization distortion of each candidate at this time is stored. Also,
An error signal is calculated for each candidate by the following equation.

【００２８】[0028]

【数６】 (Equation 6)

【００２９】次に２段目では、コードブック２１０₂を
用いて初段の各候補に対して下式を最小化する最適コー
ドベクトルを求め、そのときの量子化歪を、対応する初
段での候補の歪に加算して累積歪を求める。Next, in the second stage, an optimal code vector for minimizing the following equation is determined for each candidate in the first stage using the code book 210 _2, and the quantization distortion at that time is determined by the corresponding candidate in the first stage. To obtain the cumulative distortion.

【００３０】[0030]

【数７】 (Equation 7)

【００３１】ただし、ｌ＝１〜２^B2：Ｂ２はコードブッ
ク２のビット数を示す。また、Ｂ／２＝Ｂ１＋Ｂ２：Ｂ
はサブフレーム全体のビット数である。Here, l = 1 to 2 ^B2 : B2 indicates the number of bits of the codebook 2. B / 2 = B1 + B2: B
Is the number of bits in the entire subframe.

【００３２】次に、サブフレーム後半のＮ／２サンプル
に対して、上記で求まったコードベクトルの各候補に対
して下式を計算する。Next, for the N / 2 samples in the latter half of the subframe, the following equation is calculated for each code vector candidate determined above.

【００３３】[0033]

【数８】 (Equation 8)

【００３４】ここでｓ_wk（ｎ）は前記コードベクトルの
ｋ番目の候補について、サブフレーム後半に及ぼす影響
信号である。Here, s _wk (n) is an influence signal which affects the k-th candidate of the code vector in the latter half of the subframe.

【００３５】次に、コードブック１を用いて下式の量子
化歪の小さい順にＭ種の候補を出力し、そのときの量子
化歪を累積する。Next, M kinds of candidates are output in ascending order of the quantization distortion in the following equation using the codebook 1, and the quantization distortion at that time is accumulated.

【００３６】[0036]

【数９】 (Equation 9)

【００３７】また、各候補に対して下式により誤差信号
を計算する。An error signal is calculated for each candidate by the following equation.

【００３８】[0038]

【数１０】 (Equation 10)

【００３９】次に２段目では、コードブック２１０₂を
用いて初段の各候補に対して下式を最小化する最適コー
ドベクトルを求め、そのときの量子化歪を、対応する初
段での候補の歪に加算して累積歪を求める。Next, in the second stage, an optimal code vector for minimizing the following equation is obtained for each of the first-stage candidates using the codebook 210 _2, and the quantization distortion at that time is determined by the corresponding first-stage candidate. To obtain the cumulative distortion.

【００４０】[0040]

【数１１】 [Equation 11]

【００４１】次に、サブフレーム全体での累積歪を求
め、前記累積歪を最小化するサブフレーム全体での候補
の組合せを選択し、選択されたコードベクトルを出力す
る。前述の説明では、サブフレームの前半と後半で同一
のコードブックを用いたが、前半と後半で異なるコード
ブックを用いることもできる。Next, the cumulative distortion in the entire subframe is obtained, a combination of candidates in the entire subframe that minimizes the cumulative distortion is selected, and the selected code vector is output. In the above description, the same codebook is used in the first half and the second half of the subframe. However, different codebooks can be used in the first half and the second half.

【００４２】以上で本発明の作用の説明を終える。The operation of the present invention has been described above.

【００４３】[0043]

【実施例】図３は本発明に関連する発明による音声符号
化方式を実施する音声符号化装置の一例を示すブロック
図である。FIG. 3 is a block diagram showing an example of a speech coding apparatus for implementing a speech coding method according to the present invention .

【００４４】図において、送信側では、入力端子４００
から音声信号を入力し、１フレーム分（例えば３０ｍ
ｓ）の音声信号をバッファメモリ４１０に格納する。In the figure, on the transmitting side, an input terminal 400
And input an audio signal from one frame (for example, 30 m
The audio signal of s) is stored in the buffer memory 410.

【００４５】ＬＰＣ分析回路４３０は、フレームの音声
信号のスペクトル特性を表すパラメータとして、ＬＳＰ
パラメータを前記フレームの音声信号から周知のＬＰＣ
分析を行い予め定められた次数Ｐだけ計算する。この具
体的な計算法については、Ｓｕｇａｍｕｒａ氏らによる
“ＱｕａｎｔｉｚｅｒｄｅｓｉｇｎｉｎＬＳＰｓ
ｐｅｅｃｈａｎａｌｙｓｉｓ−ｓｙｎｔｈｅｓｉｓ”
と題した論文（文献５）等を参照することができる。The LPC analysis circuit 430 uses LSP as a parameter representing the spectral characteristic of the audio signal of the frame.
The parameters are calculated from the audio signal of the frame using a well-known LPC.
The analysis is performed and a predetermined order P is calculated. The specific calculation method is described in “Quantizer design in LSPs” by Sugamura et al.
"peech analysis-synthesis"
And the like (Reference 5).

【００４６】次にＬＳＰ量子化回路４４０は、ＬＳＰパ
ラメータを予め定められた量子化ビット数で量子化し、
得た符号ｌ_kをマルチプレクサ５６０へ出力するととも
に、これを復号化してさらに線形予測係数ａ_i′（ｉ＝
１〜Ｐ）に変換して重み付け回路５００，インパルス応
答計算回路４７０へ出力する。ＬＳＰパラメータの符号
化，ＬＳＰパラメータから線形予測係数への変換の方法
については、前記文献５等を参照できる。また、ＬＳＰ
をより効率的に符号化するためにベクトル−スカラ量子
化やベクトル量子化を用いることもできる。前者の具体
的な方法としては、例えばＭｏｒｉｙａ氏らによる“Ｔ
ｒａｎｓｆｏｒｍｃｏｄｉｎｇｏｆｓｐｅｅｃｈ
ｕｓｉｎｇａｗｅｉｇｈｔｅｄｖｅｃｔｏｒｑ
ｕａｎｔｉｚｅｒ”と題した論文（ＩＥＥＥＪ．Ｓｅ
ｌ．ＡｒｅａｓＣｏｍｍｕｎ．，ｐｐ．４２５−４３
１，１９８８年）（文献６）等を参照できる。Next, the LSP quantization circuit 440 quantizes the LSP parameter with a predetermined number of quantization bits,
The obtained code l _k is output to the multiplexer 560, which is decoded to further obtain the linear prediction coefficient a _i ′ (i =
1 to P) and output to the weighting circuit 500 and the impulse response calculation circuit 470. For the method of encoding the LSP parameter and converting the LSP parameter into the linear prediction coefficient, reference can be made to the above-mentioned reference 5. Also, LSP
May be used to more efficiently encode the vector-scalar quantization or the vector quantization. As a specific method of the former, for example, “T
transform coding ofspeech
using a weighted vector q
A paper entitled "Uniqueizer" (IEEE J. Se
l. Areas Commun. Pp. 425-43
1, 1988) (Document 6).

【００４７】サブフレーム分割回路４５０は、フレーム
の音声信号をサブフレームに分割する。ここで例えばサ
ブフレーム長は７．５ｍｓとする。The sub-frame division circuit 450 divides the audio signal of the frame into sub-frames. Here, for example, the subframe length is set to 7.5 ms.

【００４８】重み付け回路５００は、サブフレームに分
割した信号に対して前記文献１，２に記載の方法を用い
て聴感重み付けを行う。The weighting circuit 500 weights the perceptual weight of the signal divided into subframes by using the method described in the above-mentioned documents 1 and 2.

【００４９】減算器４９０は、聴感重み付けされた信号
から合成フィルタ５８１の出力を減算して出力する。The subtracter 490 subtracts the output of the synthesis filter 581 from the signal weighted by the auditory sense and outputs the result.

【００５０】インパルス応答計算回路４７０は、聴感重
み付けした合成フィルタのインパルス応答ｈ_w（ｎ）
を、予め定められたサンプル数Ｌだけ計算する。具体的
な計算法は、前記文献１，２，３等を参照できる。The impulse response calculation circuit 470 calculates the impulse response h _w (n) of the synthesis filter weighted by the auditory sense.
Is calculated for a predetermined number L of samples. The specific calculation method can be referred to the aforementioned documents 1, 2, 3, and the like.

【００５１】適応コードブック５１０は、合成フィルタ
５８１の入力信号ｖ（ｎ）を入力し、さらにインパルス
応答出力回路４７０から重み付けインパルス応答ｈ
_w（ｎ）、減算器４９０から重み付け信号を入力し、長
期相関にもとづくピッチ予測を行い、ピッチパラメータ
として遅延Ｍとゲインβを計算する。適応コードブック
の次数は１とする。１次の適応コードブックにおける遅
延Ｍ，ゲインβの計算法は、前記文献２等を参照でき
る。The adaptive codebook 510 receives the input signal v (n) of the synthesis filter 581, and receives a weighted impulse response h from the impulse response output circuit 470.
_w (n), a weighting signal is input from the subtractor 490, pitch prediction is performed based on long-term correlation, and delay M and gain β are calculated as pitch parameters. The order of the adaptive codebook is 1. The method of calculating the delay M and the gain β in the first-order adaptive codebook can be referred to the above-mentioned reference 2.

【００５２】また、数１に従い残差信号を計算し出力す
る。数１でｖ（ｎ）は過去の音源信号で、合成フィルタ
５８１の入力信号である。ｈ_w（ｎ）はインパルス応答
計算回路４７０で求めた重み付けインパルス応答であ
る。Further, the residual signal is calculated and output according to equation (1). In Expression 1, v (n) is a past sound source signal, which is an input signal of the synthesis filter 581. h _w (n) is a weighted impulse response obtained by the impulse response calculation circuit 470.

【００５３】音源コードブックは、ここでは２種類の異
なるコードブック５４０₁と５４０₂の２段縦続接続で
表せるものとする。音源コードブック探索回路５３０の
動作を図４に示す。図４において、数２の量子化歪計算
回路６０５は、端子６００から信号ｅ_w（ｎ）を入力
し、端子６０２から重み付けインパルス応答ｈ_w（ｎ）
を入力し、コードブック５４０₁を用いて、数２の量子
化歪を計算する。ここで数２を直接計算しても良いし、
数２を変形して相関演算により計算してもよい。後者の
方法は前記文献４等を参照できる。Here, it is assumed that the sound source codebook can be represented by a two-stage cascade connection of two different codebooks 540 ₁ and 540 ₂ . FIG. 4 shows the operation of the sound source codebook search circuit 530. In FIG. 4, the quantization distortion calculation circuit 605 of Equation 2 receives the signal e _w (n) from the terminal 600 and the weighted impulse response h _w (n) from the terminal 602.
Is input, and the quantization distortion of Expression 2 is calculated using the codebook 540 ₁ . Here, Equation 2 may be directly calculated,
Equation 2 may be modified and calculated by a correlation operation. For the latter method, reference can be made to the aforementioned reference 4.

【００５４】次にＭ候補選択回路６１０では、数２の量
子化歪の小さい順にコードベクトルをＭ候補選択し減算
器６１５へ出力すると共に、各候補の量子化歪を累積歪
計算回路６２５へ出力する。Next, the M candidate selection circuit 610 selects M candidate code vectors in ascending order of the quantization distortion of Expression 2 and outputs them to the subtractor 615, and outputs the quantization distortion of each candidate to the cumulative distortion calculation circuit 625. I do.

【００５５】減算器６１５は、数３を用いて、各候補に
対して誤差信号ｅ′_kw（ｎ）を計算する。The subtractor 615 calculates the error signal e ′ _kw (n) for each candidate by using _equation (3).

【００５６】数４の量子化歪計算回路６２０は、各候補
に対する誤差信号に対して、コードブック２を用いて数
４を用いて、最適なコードベクトルを求め、このときの
量子化歪を累積歪計算回路６２５へ出力する。The quantization distortion calculation circuit 620 of Expression 4 finds an optimal code vector for the error signal for each candidate by using Expression 4 using Codebook 2, and accumulates the quantization distortion at this time. Output to the distortion calculation circuit 625.

【００５７】累積歪計算回路６２５は、２段全体で累積
した量子化歪あるいは、２段目の量子化歪を最小化する
コードベクトルの候補の組合せを選択して出力する。The cumulative distortion calculation circuit 625 selects and outputs a combination of quantization distortion accumulated in the entire two stages or a combination of code vector candidates for minimizing the second stage quantization distortion.

【００５８】音源コードブックは、作用の項に述べたよ
うに、ガウス正の乱数信号から構成（乱数コードブッ
ク）してもよいし、トレーニング信号に対して予め学習
して構成（学習コードブック）してもよい。また、学習
コードブックと乱数コードブックの両方を用いて構成し
てもよい。The sound source code book may be composed of Gaussian positive random number signals (random number code book), or may be constructed by previously learning training signals (learning code book), as described in the section of operation. May be. Alternatively, the learning code book and the random number code book may be used.

【００５９】図３にもどって、ゲインコードブック５５
０は、適応コードブック５１０と音源コードブック５３
０のゲインをベクトル量子化する。ここで、ゲインコー
ドブック５５５において、下式を最小化するゲインコー
ドベクトルを選択し、出力する。Returning to FIG. 3, gain code book 55
0 is the adaptive codebook 510 and the sound source codebook 53
The vector of the gain of 0 is quantized. Here, in the gain codebook 555, a gain code vector that minimizes the following equation is selected and output.

【００６０】[0060]

【数１２】 (Equation 12)

【００６１】ここでβ′，γ₁′，γ₂′は、それぞれ
適応コードブックのゲイン、コードブック１のゲイン、
コードブック２のゲインに対応したゲインコードベクト
ルである。またゲインコードブック５５０は下式により
音源信号を計算し、合成フィルタ５８１へ出力する。Here, β ′, γ ₁ ′, and γ ₂ ′ are the gain of the adaptive codebook, the gain of the codebook 1,
It is a gain code vector corresponding to the gain of codebook 2. The gain codebook 550 calculates a sound source signal by the following equation, and outputs it to the synthesis filter 581.

【００６２】[0062]

【数１３】 (Equation 13)

【００６３】合成フィルタ５８１は、ｖ（ｎ）を入力
し、下式により聴感重み付け合成音声を１サブフレーム
分求め、さらにもう１サブフレーム分は０の系列をフィ
ルタに入力して重み付け応答信号系列を求め、１サブフ
レーム分の重み付け応答信号系列を減算器４９０に出力
する。The synthesis filter 581 receives v (n), obtains perceptually weighted synthesized speech for one subframe by the following equation, and inputs a sequence of 0 to the filter for the other subframe to obtain a weighted response signal sequence. And outputs a weighted response signal sequence for one subframe to the subtractor 490.

【００６４】[0064]

【数１４】 [Equation 14]

【００６５】ただしHowever,

【００６６】[0066]

【数１５】 (Equation 15)

【００６７】ここでδは聴感重み付けの程度を決める係
数であり、０＜γ＜１に選ぶ。Here, δ is a coefficient for determining the degree of hearing weighting, and is selected to be 0 <γ <1.

【００６８】マルチプレクサ５６０は、ＬＳＰ量子化器
４４０，適応コードブック５１０，音源コードブック探
索回路５３０，ゲインコードブック５５０の出力符号系
列を組み合わせて出力する。Multiplexer 560 combines and outputs the output code sequences of LSP quantizer 440, adaptive codebook 510, excitation codebook search circuit 530, and gain codebook 550.

【００６９】以上で本発明に関連する発明の例の説明を
終える。 This concludes the description of examples of the present invention related to the present invention .

【００７０】本発明では、音源コードブック探索回路及
び、コードブック１，コードブック２の動作が本発明に
関連する発明と異なるので、これらについて説明する。
図５は、本発明における音源コードブック探索回路の構
成を説明したブロック図である。音源コードブックの次
元数はサブフレーム長よりも短くする。以下では一例と
して、次元数はサブフレーム長Ｎの１／２のＮ／２とす
る。従って、コードブック７４０₁ ，コードブック７４
０₂ の次元数はＮ／２とする。また、縦続接続の段数は
以下では一例として２とする。図において、端子７００
からサブフレームの前半Ｎ／２のサンプルを入力する。
このとき、スイッチ７０３は下側に倒す。数５式の量子
化計算回路７０５は、コードブック１を用いて数５によ
り量子化歪を求める。Ｍ候補選択回路７１０では、量子
化歪の小さい順に複数種類（例えばＭ種類）のコードベ
クトルを求め、このときの各候補の量子化歪を累積歪計
算回路７２５に出力する。In the present invention, the sound source codebook search circuit and the operations of the codebook 1 and the codebook 2 correspond to the present invention.
Since these are different from the related invention, these will be described.
FIG. 5 is a block diagram illustrating the configuration of the sound source codebook search circuit according to the present invention. The number of dimensions of the sound source codebook is shorter than the subframe length. In the following, as an example, the number of dimensions is set to N / 2, which is 1/2 of the subframe length N. Therefore, the codebook 740 ₁ and the codebook 74
The number of dimensions of O ₂ is N / 2. In addition, the number of stages of the cascade connection is set to 2 as an example below. In the figure, terminal 700
From the first half of the subframe.
At this time, the switch 703 is moved down. The quantization calculation circuit 705 of Expression 5 calculates quantization distortion by Expression 5 using Codebook 1. The M candidate selection circuit 710 obtains a plurality of types (for example, M types) of code vectors in ascending order of the quantization distortion, and outputs the quantization distortion of each candidate at this time to the cumulative distortion calculation circuit 725.

【００７１】減算器７１５は、各候補に対して数６によ
り誤差信号を計算する。The subtractor 715 calculates an error signal for each candidate according to equation (6).

【００７２】次に、数７の量子化歪計算回路７２０で
は、コードブック７４０₂を用いて、各候補に対して数
７を最小化するコードベクトルを選択し、このときの量
子化歪を累積歪計算回路７２５へ出力する。Next, the number 7 quantization distortion calculation circuit 720, and with reference to codebook 740 _2, selects a code vector that minimizes the number 7 for each candidate, accumulated quantization distortion at this time Output to the distortion calculation circuit 725.

【００７３】累積歪計算回路７２５は各候補ごとに２段
累積した量子化歪を求める。The cumulative distortion calculation circuit 725 calculates the quantization distortion accumulated in two stages for each candidate.

【００７４】次に、減算器７０４では、サブフレーム後
半のＮ／２サンプルを端子７０１から入力し、上記で求
まったコードベクトルの各候補に対して数８を計算す
る。スイッチ７０３は上側に倒し、サブフレーム後半の
Ｎ／２サンプルに対して、量子化歪計算回路７０５，Ｍ
候補選択回路７１０，量子化歪計算回路７２０，累積歪
計算回路７２５は上記の動作を繰り返し、累積歪計算回
路７２５において、サブフレーム全体の累積歪を計算
し、累積歪を最小化するコードベクトルの組合せを求
め、端子７３０から出力する。Next, the subtractor 704 receives the N / 2 samples of the latter half of the subframe from the terminal 701, and calculates Equation 8 for each of the code vector candidates obtained above. The switch 703 is tilted upward, and the quantization distortion calculation circuit 705 and M
The candidate selection circuit 710, the quantization distortion calculation circuit 720, and the cumulative distortion calculation circuit 725 repeat the above operation, and calculate the cumulative distortion of the entire subframe in the cumulative distortion calculation circuit 725, and calculate the code vector of the code vector for minimizing the cumulative distortion. The combination is determined and output from terminal 730.

【００７５】音源コードブックは、作用の項に述べたよ
うに、ガウス性の乱数信号から構成（乱数コードブッ
ク）してもよいし、トレーニング信号に対して予め学習
して構成（学習コードブック）してもよい。また、学習
コードブックと乱数コードブックの両方を用いて構成し
てもよい。The sound source code book may be composed of Gaussian random number signals (random number code book), or may be constructed by previously learning training signals (learning code book). May be. Alternatively, the learning code book and the random number code book may be used.

【００７６】以上で本発明の説明を終える。The description of the present invention has been completed.

【００７７】音源コードブックの縦続接続の段数は、２
段以上の任意の段数を用いることができる。The number of cascade connections of the sound source code book is 2
Any number of stages above the stage can be used.

【００７８】また、音源コードブックの学習は、トレー
ニング信号を用いて、１段毎に行っても良いし、複数段
での歪を最小化するように、同時に最適化するように学
習してもよい。さらに、各段で複数種類の候補を出力す
ることを考慮して学習してもよい。例えば、２段のとき
は、１段目を最適化した後に、１段目と共に２段目を最
適化し、これらの最適化を歪の低減が飽和するまで繰り
返す。The learning of the sound source codebook may be performed for each stage using a training signal, or may be performed so as to simultaneously optimize and minimize distortion in a plurality of stages. Good. Further, learning may be performed in consideration of outputting a plurality of types of candidates in each stage. For example, in the case of the second stage, after the first stage is optimized, the second stage is optimized together with the first stage, and these optimizations are repeated until the distortion reduction is saturated.

【００７９】また、コードブック１，２は全探索型のコ
ードブックでもよいし、探索に要する演算量をさらに低
減するために、木探索型のコードブックでもよい。木探
索型のコードブックの構成法は、例えばＲ．Ｇｒａｙ
“Ｖｅｃｔｏｒｑｕａｎｔｉｚａｔｉｏｎ”と題した
論文（ＩＥＥＥＡＳＳＰＭａｇａｚｉｎｅ，ｐｐ．
４−２９，１９８４年）（文献７）等を参照できる。The codebooks 1 and 2 may be full search type codebooks, or may be tree search type codebooks in order to further reduce the amount of calculation required for search. A method of constructing a tree search type codebook is described in, for example, R. Gray
A paper entitled "Vector quantification" (IEEE ASSP Magazine, pp. 146-64).
4-29, 1984) (Reference 7).

【００８０】また、本発明の実施例では、サブフレーム
の前半では、コードブック２でのコードベクトル選択に
おいても複数種類の候補を選択し累積歪を計算するよう
にしてもよい。また、サブフレーム前半と後半は、異な
るコードブックを用いてもよい。In the embodiment of the present invention, in the first half of the subframe, a plurality of types of candidates may be selected in the code vector selection in the codebook 2 to calculate the cumulative distortion. Further, different codebooks may be used for the first half and the second half of the subframe.

【００８１】上述の実施例では、適応コードブックのゲ
イン、第１，第２のコードブックのゲインには同時最適
化を施さなかったが、適応コードブック，第１のコード
ブック，第２のコードブックのゲインについて、同時最
適化を行うことにより、さらに特性が改善される。この
同期最適化は、コードブック１，コードブック２のコー
ドベクトルを選択するときに適用するときわめて効果的
である。同時最適化の方法としては、例えば、適応コー
ドブックの遅延，ゲインβを求めた後に、コードブック
１のコードベクトルｃ_1j（ｎ）、ゲインγ₁を探索する
ときに、各コードベクトル毎に、次式を最小化するよう
に解いてβとγ₁を同時最適化する。In the above embodiment, the adaptive codebook gain and the first and second codebook gains were not simultaneously optimized. However, the adaptive codebook, the first codebook, and the second codebook were not optimized simultaneously. By performing simultaneous optimization on the book gain, the characteristics are further improved. This synchronization optimization is very effective when applied when selecting code vectors of codebook 1 and codebook 2. As a method of the simultaneous optimization, for example, after finding the delay and the gain β of the adaptive codebook, when searching for the code vector c _1j (n) and the gain γ ₁ of the codebook 1, for each code vector, Β and γ ₁ are simultaneously optimized by solving the following equation to minimize it.

【００８２】[0082]

【数１６】 (Equation 16)

【００８３】次に、コードブック２のコードベクトルを
選択するときに、各コードベクトル毎に次式を最小化す
るように適応コードブックのゲイン、コードブック１，
コードブック２のゲインを同時最適化する。Next, when the code vector of the code book 2 is selected, the gain of the adaptive code book, the code book 1 and the code book 1 are set so as to minimize the following equation for each code vector.
Simultaneously optimize the codebook 2 gain.

【００８４】[0084]

【数１７】 [Equation 17]

【００８５】また、演算量を低減化するためには、コー
ドブック１，２の探索のときにはゲインの同時最適化を
行わずに、コードブック１でＭ候補選択した後に、選択
した候補に対してのみ数１６から適応コードブックとの
間でゲインの同時最適化を行い、次にコードブック２で
コードベクトルを選択した後に、数１７から、適応コー
ドブック、コードブック１との間でゲインの同時最適化
を行う構成をとることもできる。Further, in order to reduce the amount of calculation, when searching for codebooks 1 and 2, simultaneous optimization of gains is not performed. Simultaneous gain optimization is performed between the adaptive codebook and the adaptive codebook from Equation (16), and then a code vector is selected in the codebook2. A configuration for performing optimization may be adopted.

【００８６】また、さらに演算量を低減化するために
は、累積歪計算回路において、累積歪を最小化するコー
ドブック１，２のコードベクトルの組が選択された後
に、適応コードブックのゲインβと、第１，第２のコー
ドブックのゲインγ₁，γ₂の３種を数１７から同時に
最適化するような構成とすることもできる。Further, in order to further reduce the amount of computation, after the code vector set of the codebooks 1 and 2 that minimizes the cumulative distortion is selected in the cumulative distortion calculating circuit, the gain β of the adaptive codebook is selected. And the three kinds of gains γ ₁ , γ ₂ of the _first and _second codebooks can be simultaneously optimized from Expression 17.

【００８７】また、ゲインの同期最適化を行うための別
の方法として、コードブック１の探索のときにコードベ
クトルを適応コードブックに直交化させた後に探索し、
コードブック２の探索のときには、コードベクトルを適
応コードブック及びコードブック１で選択されたコード
ベクトルに直交化させて探索する構成とすることもでき
る。直交化の方法としては、例えば、Ｉ．Ｇｅｒｓｏｎ
氏らによる“Ｖｅｃｔｏｒｓｕｍｅｘｃｉｔｅｄ
ｌｉｎｅａｒｐｒｅｄｉｃｔｉｏｎ（ＶＳＥＬＰ）ｓ
ｐｅｅｃｈｃｏｄｉｎｇａｔ８ｋｂ／ｓ”と題し
た論文（Ｐｒｏｃ．ＩＣＡＳＳＰ，ｐｐ．４６１−４６
４，１９９０年）（文献８）等を参照できる。As another method for performing the gain synchronization optimization, a code vector is orthogonalized to an adaptive code book at the time of search of the code book 1, and then search is performed.
At the time of searching the codebook 2, a configuration may be adopted in which the codevector is orthogonalized to the adaptive codebook and the codevector selected in the codebook 1, and the search is performed. As the orthogonalization method, for example, I.I. Gerson
"Vector sum excited"
linear prediction (VSELP) s
paper titled "peech coding at 8 kb / s" (Proc. ICASP, pp. 461-46).
4, 1990) (Literature 8).

【００８８】実施例では、各段のコードブック毎に最適
ゲインを求めたが、複数段のコードブックをまとめて１
つのゲインとしてもよいし、全段のコードブックをまと
めて１つのゲインとすることもできる。このようにする
と、ゲインを伝送するときのビット数を低減できるの
で、さらにビットレートを低減できる。In the embodiment, the optimum gain is obtained for each codebook of each stage.
One gain may be used, or a codebook of all stages may be collectively used as one gain. By doing so, the number of bits used to transmit the gain can be reduced, so that the bit rate can be further reduced.

【００８９】また、上記の実施例では、スペクトルパラ
メータとしてＫパラメータ，ＬＳＰパラメータを符号化
し、その分析法としてＬＰＣ分析を用いたが、スペクト
ルパラメータとしては他の周知なパラメータ、例えばＬ
ＰＣケプストラム，ケプストラム，改良ケプストラム，
一般化ケプストラム，メルケプストラムなどを用いるこ
ともできる。また各パラメータに最適な分析法を用いる
ことができる。In the above embodiment, the K parameter and the LSP parameter are encoded as spectral parameters, and the LPC analysis is used as the analysis method. However, other well-known parameters such as L
PC cepstrum, cepstrum, improved cepstrum,
Generalized cepstrum, mel cepstrum and the like can also be used. In addition, an optimal analysis method can be used for each parameter.

【００９０】また、フレームで求めたＬＰＣ係数をＬＳ
Ｐ上や線形予測係数上でサブフレーム毎に線形、あるい
は非線形に補間し、補間した係数を用いて適応コードブ
ック、コードブック１，２の探索を行う構成としてもよ
い。このような構成とすることにより、音質がさらに改
善される。The LPC coefficient obtained for the frame is given by LS
A configuration may be used in which interpolation is performed linearly or non-linearly for each subframe on P or a linear prediction coefficient, and the adaptive codebook, codebooks 1, 2 are searched using the interpolated coefficients. With such a configuration, the sound quality is further improved.

【００９１】また、ＬＳＰ係数は周知の方法により、ベ
クトル量子化、あるいはベクトル−スカラ量子化、さら
には複数のサブフレームのＬＳＰをまとめてマトリクス
量子化することにより、さらに効率的に符号化すること
ができる。ベクトル−スカラ量子化の方法については例
えば前記文献４等を参照できる。The LSP coefficients can be encoded more efficiently by a vector quantization or a vector-scalar quantization by a well-known method, and furthermore, by a matrix quantization of LSPs of a plurality of subframes collectively. Can be. The method of vector-scalar quantization can be referred to, for example, the above-mentioned reference 4.

【００９２】また、演算量を低減するために、重み付け
回路５００をサブフレーム分割回路４５０の前に配置さ
せ、合成フィルタ５８１では下式により重み付け合成信
号を計算するようにしてもよい。In order to reduce the amount of calculation, the weighting circuit 500 may be arranged before the sub-frame division circuit 450, and the synthesis filter 581 may calculate a weighted synthesized signal by the following equation.

【００９３】[0093]

【数１８】 (Equation 18)

【００９４】ここでδは聴感重み付けの程度を決める重
み付け係数である。Here, δ is a weighting coefficient for determining the degree of hearing weighting.

【００９５】また、実施例では、適応コードブックのパ
ラメータは音源コードブックを探索する前に１種類に決
定していたが、例えば遅延パラメータについて、複数種
類の候補を出力し、遅延パラメータの各候補に対して音
源コードブックを探索し、適応コードブックと音源コー
ドブックとの累積歪を最小化する適応コードブックと音
源コードベクトルとの組合せをサブフレーム毎に選択す
る構成とすることもできる。このようにすると演算量は
増加するが、性能は改善される。In the embodiment, the adaptive codebook parameter is determined to be one before searching the sound source codebook. However, for example, a plurality of types of delay parameter candidates are output, and each of the delay parameter candidates is output. , A combination of the adaptive codebook and the excitation code vector that minimizes the accumulated distortion between the adaptive codebook and the excitation codebook may be selected for each subframe. This increases the amount of computation but improves performance.

【００９６】また、さらに性能を改善するために、ＬＳ
Ｐ係数，適応コードブックのパラメータ，音源コードブ
ックのコードベクトル，ゲインコードブックの少なくと
も１つをサブフレームで一意に決定するのではなく、デ
レイドデシジョン（遅延決定法）を取り入れることもで
きる。例えば、適応コードブックのパラメータと音源コ
ードブックのコードベクトルに対してデレイドデシジョ
ンを行うときは、サブフレームでは適応コードブックと
音源コードブックで複数種類の候補を求め、各候補の組
合せに対して複数サブフレーム（例えばＬサブフレー
ム）にわたり累積歪を計算し、Ｌサブフレーム全体で累
積歪を最小化する適応コードブックのパラメータと音源
コードベクトルの組合せを選択する構成とすることもで
きる。このような構成とすると、演算量はさらに増大す
るが性能はさらに改善される。To further improve the performance, LS
Instead of uniquely determining at least one of the P coefficient, the parameter of the adaptive codebook, the code vector of the sound source codebook, and the gain codebook in a subframe, a delayed decision (delay determination method) can be adopted. For example, when performing a delayed decision on the parameters of the adaptive codebook and the code vector of the excitation codebook, in the subframe, a plurality of types of candidates are obtained in the adaptive codebook and the excitation codebook, and for each candidate combination, It is also possible to adopt a configuration in which the cumulative distortion is calculated over a plurality of subframes (for example, L subframes), and a combination of an adaptive codebook parameter and an excitation code vector that minimizes the cumulative distortion over the entire L subframes. With such a configuration, the operation amount is further increased, but the performance is further improved.

【００９７】[0097]

【発明の効果】以上述べたように、本発明によれば、さ
らにコードブックサイズを小さくすることができるの
で、さらに少ない演算量，メモリ量で良好な特性が得ら
れるという効果がある。従って本発明によれば、８〜
４．８ｋｂ／ｓのビットレートで、良好な音質の符号化
再生音声を得ることができるという大きな効果がある。As described above , according to the present invention, since the codebook size can be further reduced, good characteristics can be obtained with a smaller operation amount and memory amount. Therefore, according to the present invention, 8 to
At a bit rate of 4.8 kb / s, there is a great effect that encoded and reproduced speech with good sound quality can be obtained.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明に関連する発明による音声符号化方式の
作用を説明するためのブロック図である。FIG. 1 is a block diagram for explaining an operation of a speech coding system according to the present invention .

【図２】本発明による音声符号化方式の作用を説明する
ためのブロック図である。FIG. 2 is a block diagram for explaining the operation of the speech coding system according to the present invention.

【図３】本発明に関連する発明による音声符号化方式を
実施する装置の一例を示すブロック図である。FIG. 3 is a block diagram showing an example of an apparatus for implementing a speech coding method according to the present invention .

【図４】本発明に関連する発明による音源コードブック
探索回路の構成を示すブロック図である。FIG. 4 is a block diagram showing a configuration of a sound source codebook search circuit according to the invention related to the present invention .

【図５】本発明による音源コードブック探索回路の構成
を示すブロック図である。FIG. 5 is a block diagram showing a configuration of a sound source codebook search circuit according to the present invention.

【符号の説明】[Explanation of symbols]

１５０スペクトルパラメータ計算部１６０適応コードブック部１７０，２００音源コードブック探索部１８０₁ ，２１０₁ コードブック１１８０₂ ，２１０₂ コードブック２４１０バッファメモリ４３０ＬＰ分析算回路４４０ＬＳＰ量子化回路４５０サブフレーム分割回路４７０インパルス応答計算回路４９０，６１５，７０４，７１５減算器５００重み付け回路５１０適応コードブック５３０音源コードブック探索回路５４０₁ ，７４０₁ コードブック１５４０₂ ，７４０₂ コードブック２５５０ゲインコードブック５６０マルチプレクサ６０５数２の量子化歪計算回路６１０，７１０Ｍ候補選択回路６２０数４の量子化歪計算回路６２５，７２５累積歪計算回路７０３スイッチ７０５数５の量子化歪計算回路７２０数７の量子化歪計算回路150 spectral parameter calculation unit 160 the adaptive codebook section 170, 200 excitation codebook searching unit 180 _1, 210 ₁ codebook 1 180 _2, 210 ₂ codebook 2 410 buffer memory 430 LP analysis calculation circuit 440 LSP quantization circuit 450 subframes Dividing circuit 470 Impulse response calculation circuit 490, 615, 704, 715 Subtractor 500 Weighting circuit 510 Adaptive codebook 530 Sound source codebook search circuit 540 ₁ , 740 ₁ Codebook 1 540 ₂ , 740 ₂ Codebook 2 550 Gain codebook 560 Multiplexer 605 Equation 2 quantization distortion calculation circuit 610,710 M candidate selection circuit 620 Equation 4 quantization distortion calculation circuit 625,725 Cumulative distortion calculation circuit 703 Switch 705 Equation 5 quantization distortion calculation circuit 720 Quantization distortion calculation circuit of Expression 7

───────────────────────────────────────────────────── フロントページの続き (56)参考文献特開平１−205638（ＪＰ，Ａ) 小沢、熊谷「ＬＳＰパラメータの効率的なベクトル量子化法の検討」1990年電子情報通信学会秋季全国大会講演論文集，Ｐ１−132 ────────────────────────────────────────────────── ─── Continuation of the front page (56) References JP-A-1-205638 (JP, A) Ozawa, Kumagaya "Study on efficient vector quantization of LSP parameters" 1990 IEICE Autumn National Convention Lecture Papers, P1-132

Claims

(57)【特許請求の範囲】(57) [Claims]

【請求項１】入力した音声信号を予め定められた時間長
のフレームに分割し、前記音声信号のスペクトル包絡を
表すスペクトルパラメータを求め、前記フレームを予め
定められた時間長のサブフレームに分割し、前記サブフ
レーム毎に過去の音源信号をもとにして適応コードブッ
クにおけるパラメータを求め、予め構成された音源コー
ドブックから最適なコードベクトルを探索して前記音声
信号を量子化する音声符号化方式において、前記音源コードブックは次元数が前記サブフレーム長よ
りも短い複数のコードブックの多段縦続接続で構成さ
れ、少なくとも一つの段では量子化歪の小さい順にコー
ドベクトルを複数候補出力しサブフレーム全体で歪を累
積し、前記累積歪を最小化する候補の組を出力すること
により前記音声信号を量子化することを特徴とする音声
符号化方式。An input audio signal is divided into frames of a predetermined time length, spectrum parameters representing a spectrum envelope of the audio signal are obtained, and the frame is divided into sub-frames of a predetermined time length. A speech coding method for obtaining parameters in an adaptive codebook based on a past excitation signal for each subframe, searching for an optimal code vector from a previously configured excitation codebook, and quantizing the audio signal. In the sound source codebook, the number of dimensions is configured by a multistage cascade connection of a plurality of codebooks shorter than the subframe length, and at least one stage outputs a plurality of candidate code vectors in ascending order of quantization distortion and outputs the entire subframe. Accumulates the distortion in and outputs a set of candidates for minimizing the accumulated distortion to quantize the audio signal. A speech coding method characterized by: