JPH11119799A

JPH11119799A - Method and device for voice encoding

Info

Publication number: JPH11119799A
Application number: JP9280079A
Authority: JP
Inventors: Takahiro Kondo; 孝宏近藤
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1997-10-14
Filing date: 1997-10-14
Publication date: 1999-04-30

Abstract

PROBLEM TO BE SOLVED: To suppress the increase in computation associated with address generation when the fixed codebook search of the ACELP (algebraic code excited linear prediction) speech coding method is implemented on a signal processing LSI such as a DSP. SOLUTION: The apparatus comprises threshold calculating means 101, which calculates a threshold that controls how often the algebraic codebook is searched in the fixed codebook search; narrowing-down search approach means 102, which narrows down the search locations of the algebraic codebook according to the threshold; and pulse position search means 103, which searches the algebraic codebook based on the result of the narrowing-down search approach means 102.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to a speech coding method and a speech coding apparatus, and in particular to speech coding methods and apparatus such as the CELP (Code Excited Linear Prediction) method, which achieves natural-sounding speech at a low bit rate.

[0002]

2. Description of the Related Art

Speech coding is a technique that converts a speech waveform into a digital code and reduces the amount of information when the speech is transmitted or recorded. With the rapid progress of LSI technology in recent years, high-efficiency coding methods with high compression ratios have become feasible and are being applied to digital mobile communications and a variety of multimedia communications.

[0003] This section describes the CELP method, one of the high-efficiency coding methods, and in particular the ACELP (Algebraic CELP: algebraic code excited linear prediction) method, which uses an algebraic codebook as the fixed codebook. The ACELP method is described below with reference to the techniques given in the Recommendations for the standardized coding schemes ITU-T G.729 and ITU-T G.723.1.

[0004] On the encoder side, linear prediction analysis is first performed, frame by frame, on the speech signal to be encoded, and the short-term synthesis filter coefficients are extracted. Linear prediction analysis is a method of analyzing a speech signal with a linear prediction model that predicts the current sample as a linear combination of past samples: the coefficients of the linear expression are chosen so that the error between the current sample and its predicted value is minimized, and those coefficients are the result of the analysis. The short-term synthesis filter is the filter whose coefficients are the ones obtained by linear prediction analysis; it represents the frequency characteristics of the speech.
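The analysis described in [0004] can be illustrated with a small sketch. The patent does not say how the coefficients are computed; the autocorrelation method with the Levinson-Durbin recursion used here is a common choice and is an assumption, as are the function names `autocorrelation` and `levinson_durbin`.

```python
def autocorrelation(frame, order):
    """Return r[k] = sum_n frame[n] * frame[n + k] for k = 0..order."""
    n = len(frame)
    return [sum(frame[i] * frame[i + k] for i in range(n - k))
            for k in range(order + 1)]


def levinson_durbin(r):
    """Solve the normal equations for the predictor coefficients.

    The prediction is s_hat[n] = sum_{k=1..p} a[k] * s[n - k].
    Returns (a[1..p], residual_energy).
    """
    p = len(r) - 1
    a = [0.0] * (p + 1)          # a[0] is unused
    err = r[0]                   # prediction error energy so far
    for i in range(1, p + 1):
        if err == 0.0:           # silent frame: nothing left to predict
            break
        acc = r[i] - sum(a[k] * r[i - k] for k in range(1, i))
        k_i = acc / err          # reflection coefficient of stage i
        new_a = a[:]
        new_a[i] = k_i
        for k in range(1, i):
            new_a[k] = a[k] - k_i * a[i - k]
        a = new_a
        err *= (1.0 - k_i * k_i)
    return a[1:], err
```

For a frame that decays as 0.5^n, the first coefficient comes out close to 0.5 and the second close to 0, which is what a one-tap predictor of that signal should look like.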

[0005] Next, the speech signal is perceptually weighted by a perceptual weighting filter derived from the short-term synthesis filter, producing a perceptually weighted speech signal. From the perceptually weighted speech signal, a pitch period is then preselected in order to reduce the amount of computation in the adaptive codebook search described later. The perceptual weighting filter reduces the audible noise by exploiting the fact that quantization noise in the formant regions of the speech signal is perceived as smaller because of the masking effect.

[0006] The frame is further divided into several subframes, and the subsequent processing is performed for each subframe. First, an adaptive codebook search based on the pitch period is performed on the linear prediction residual signal, which is obtained by subtracting the short-term prediction contribution of the short-term synthesis filter from the speech signal. This search extracts the pitch parameter and gain, from which the long-term synthesis filter is generated. The short-term prediction contribution equals {input speech signal − (1/short-term synthesis filter) × input speech signal}, where (1/short-term synthesis filter) is the inverse of the short-term synthesis filter. A speech signal contains a periodic component caused by vocal cord vibration, and the period of that component is the pitch period. Here, the period extracted directly from the (perceptually weighted) speech signal is called the pitch period, while the period extracted from the linear prediction residual signal, which is used in order to obtain a more accurate period, is called the pitch parameter. The adaptive codebook is a codebook whose parameters are the pitch parameter and gain of the linear prediction residual signal. The long-term synthesis filter is the filter built from these adaptive codebook parameters (pitch parameter and gain); it predicts the current linear prediction residual signal from the past linear prediction residual signal.

[0007] Next, the residual signal obtained by subtracting the long-term prediction contribution of the long-term synthesis filter from the linear prediction residual signal is used to determine the optimal excitation signal in the fixed codebook search. The fixed codebook search minimizes the mean squared error between the perceptually weighted speech signal and the perceptually weighted reconstructed speech signal, and is carried out by the procedure below. The long-term prediction contribution equals (long-term synthesis filter × past linear prediction residual signal). The fixed codebook is a codebook in which several excitation signals are registered in advance as parameters. The fixed codebook search selects the excitation signal that minimizes the error power between the registered excitation signal and the residual signal obtained by subtracting the long-term prediction contribution from the linear prediction residual signal.

[0008] First, the correlation signal d[n] between the residual signal x[n] and the impulse response h[n] of the perceptually weighted synthesis filter, which is the cascade of the short-term synthesis filter and the perceptual weighting filter, is calculated according to (Equation 1). In (Equation 1), n is the consecutive time-domain sample index of each signal, n = 0, 1, 2, 3, ..., N−1. Likewise, i and j in the equations are consecutive time-domain sample indices of each signal.

[0009]

(Equation 1)
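The image for (Equation 1) is not reproduced in this text. As a hedged sketch, the code below uses the form found in ITU-T G.729, which [0003] cites as the reference: d[n] = Σ_{i=n}^{N−1} x[i]·h[i−n]. Treat that formula, and the function name `correlation_signal`, as assumptions rather than the patent's exact expression.

```python
def correlation_signal(x, h):
    """Correlate the target x[n] with the impulse response h[n].

    Assumed form (as in ITU-T G.729):
        d[n] = sum_{i=n}^{N-1} x[i] * h[i - n],  n = 0..N-1
    """
    N = len(x)
    return [sum(x[i] * h[i - n] for i in range(n, N)) for n in range(N)]
```

With `x = [1, 2, 3]` and `h = [1, 0.5, 0.25]`, the three values are 1·1 + 2·0.5 + 3·0.25, 2·1 + 3·0.5, and 3·1.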

[0010] Next, the matrix Φ, which contains the correlations of the impulse response h[n] of the perceptually weighted synthesis filter, is calculated according to (Equation 2).

[0011]

(Equation 2)
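The image for (Equation 2) is likewise missing. A minimal sketch, assuming the ITU-T G.729 form Φ(i, j) = Σ_{n=j}^{N−1} h[n−i]·h[n−j] for j ≥ i, with the lower triangle filled in by symmetry; the function name is hypothetical.

```python
def impulse_response_correlations(h):
    """Build the N x N matrix of impulse response correlations.

    Assumed form (as in ITU-T G.729), for j >= i:
        phi[i][j] = sum_{n=j}^{N-1} h[n - i] * h[n - j]
    The matrix is symmetric, so phi[j][i] is set to the same value.
    """
    N = len(h)
    phi = [[0.0] * N for _ in range(N)]
    for i in range(N):
        for j in range(i, N):
            value = sum(h[n - i] * h[n - j] for n in range(j, N))
            phi[i][j] = value
            phi[j][i] = value
    return phi
```

Only the upper triangle (j ≥ i) is actually computed, which matches the index range j = i, ..., N−1 given for Φ(i, j) in [0021].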

[0012] Let c_k be the k-th fixed codebook vector. The codebook is searched by finding the k that maximizes (Equation 3). Note that C_k denotes a scalar and c_k denotes a vector.

[0013]

(Equation 3)

[0014] In (Equation 3), c_k is the k-th excitation vector registered in the fixed codebook, and c_k^t is the transpose of c_k. c_k[n] is the n-th element of the excitation vector c_k. C_k² and E_k are simply single variables assigned, for convenience, to the numerator and denominator of the right-hand side of (Equation 3). N denotes the subframe length. Because the fixed codebook has an algebraic structure, the search procedure can be simplified: the correlation in the numerator of (Equation 3) and the energy in the denominator are given by (Equation 4) and (Equation 5), respectively.

[0015]

(Equation 4)

[0016]

(Equation 5)

[0017] Here, M is the number of pulses, m_i is the position of the i-th pulse, and s_i is its amplitude. The relationship between (Equation 3) and (Equation 4), (Equation 5) is now explained in detail, starting with the numerator of (Equation 3) and (Equation 4). Computed directly, the numerator of (Equation 3) is C_k = d[0]·c_k[0] + d[1]·c_k[1] + ... + d[N−1]·c_k[N−1] (the squaring is omitted here). When the fixed codebook has an algebraic structure, however, each excitation vector c_k[n] registered in the fixed codebook has only a fixed number of nonzero pulses (here, M), and all other elements are zero.

[0018] As an example, let the number of pulses be M = 4 and let the positions of pulses 0, 1, 2, and 3 be m₀, m₁, m₂, m₃ (m₀ < m₁ < m₂ < m₃). Then only c_k[m₀], c_k[m₁], c_k[m₂], and c_k[m₃] are nonzero, and c_k[n] is zero for every other n. The numerator of (Equation 3) therefore becomes

C_k = d[0]·0 + ... + d[m₀−1]·0 + d[m₀]·c_k[m₀] + d[m₀+1]·0 + ... + d[m₁−1]·0 + d[m₁]·c_k[m₁] + d[m₁+1]·0 + ... + d[m₂−1]·0 + d[m₂]·c_k[m₂] + d[m₂+1]·0 + ... + d[m₃−1]·0 + d[m₃]·c_k[m₃] + d[m₃+1]·0 + ... + d[N−1]·0
= d[m₀]·c_k[m₀] + d[m₁]·c_k[m₁] + d[m₂]·c_k[m₂] + d[m₃]·c_k[m₃].

Writing the amplitude of the excitation signal c_k[m_i] as s_i (i = 0, 1, 2, 3) gives

C_k = d[m₀]·s₀ + d[m₁]·s₁ + d[m₂]·s₂ + d[m₃]·s₃,

which is (Equation 4) for M = 4.
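The reduction worked through in [0017] and [0018] can be checked numerically. The sketch below compares the full N-term sum with the M-term sparse sum for one made-up vector; the data values, positions, and function names are illustrative assumptions only.

```python
def full_correlation(d, c):
    """Numerator of (Equation 3) computed over all N samples."""
    return sum(dn * cn for dn, cn in zip(d, c))


def sparse_correlation(d, positions, amplitudes):
    """The same numerator using only the M nonzero pulses: sum of s_i * d[m_i]."""
    return sum(s * d[m] for m, s in zip(positions, amplitudes))


# Made-up example with N = 8 and M = 4.
d = [0.5, -1.0, 2.0, 0.25, -0.75, 1.5, 0.0, 3.0]
positions = [1, 2, 5, 7]        # m0 < m1 < m2 < m3
amplitudes = [-1, 1, 1, -1]     # s0..s3

# Expand the pulses into a dense excitation vector c_k[n].
c = [0] * len(d)
for m, s in zip(positions, amplitudes):
    c[m] = s

dense = full_correlation(d, c)
sparse = sparse_correlation(d, positions, amplitudes)
```

Both sums give the same value, but the sparse form needs M multiplications instead of N, which is why the algebraic structure simplifies the search.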

[0019] The relationship between the denominator of (Equation 3) and (Equation 5) holds in the same way once the same considerations are applied. As stated above, the number of pulses means the number of nonzero elements in an excitation vector registered in the fixed codebook; the actual number differs among the various coding schemes that use the ACELP method.

[0020] As noted earlier, n, i, and j in (Equation 1) and (Equation 2) are consecutive time-domain sample indices of each signal (d[] and so on), whereas m_i in (Equation 4) and the later equations denotes the position of the i-th pulse, that is, some particular position within d[n] (n = 0, 1, 2, ..., N−1). For example, d[n] with n = a and d[m_i] with m_i = a are the same value. The values that m_i can take in each equation are therefore the values that n, i, and j can take (0, 1, 2, ..., N−1), all of them integers. The same applies to m_j.

[0021] For example, d[m_i] in (Equation 4) is d[n] (n = 0, 1, 2, ..., N−1) for the case n = m_i. Likewise, Φ(m_i, m_j) in (Equation 5) is Φ(i, j) (i = 0, 1, 2, ..., N−1; j = i, ..., N−1) for the case i = m_i, j = m_j. Because the M pulse positions are searched one at a time in an M-fold nested loop, a narrowing-down search approach is used to simplify the search further. In this approach, before the final (innermost) loop is entered, a threshold calculated in advance is compared with the correlation value of (Equation 4) at that point, and the final loop is entered only if the threshold is exceeded. In addition, setting an upper limit on the number of times the final loop may be entered bounds the number of codebook searches.

[0022] After the fixed codebook search has been performed by the above procedure, the short-term prediction code extracted by linear prediction analysis, the long-term prediction code extracted by the adaptive codebook search, and the pulse positions and gain information extracted by the fixed codebook search are transmitted to the decoder. The short-term prediction code is obtained by encoding the short-term synthesis filter coefficients, and the long-term prediction code is obtained by encoding the pitch parameter and gain of the linear prediction residual signal.

[0023]

Problems to Be Solved by the Invention

In the conventional fixed codebook search of the ACELP speech coding method, however, the narrowing-down search approach processing and the pulse position search processing are performed at the same time. Inside the M-fold loop that searches the M pulse positions, a total of M² + M − (1 + 2 + ... + (M−1)) data items must be handled: d[m_i], Φ(m_i, m_i), and Φ(m_i, m_j) of (Equation 4) and (Equation 5). Consequently, when this search is implemented on a signal processing LSI such as a DSP (Digital Signal Processor), M² + M − (1 + 2 + ... + (M−1)) address pointers are needed to access the data stored in memory.
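The count in [0023] can be reproduced with a short calculation. The breakdown into M values of d[m_i], M diagonal terms Φ(m_i, m_i), and M(M−1)/2 cross terms Φ(m_i, m_j) with i < j is one reading of the text, not something the patent spells out; the function names are hypothetical.

```python
def items_by_kind(M):
    """Count the data items per kind: d[m_i], phi(m_i, m_i), phi(m_i, m_j) for i < j."""
    d_terms = M
    diagonal_terms = M
    cross_terms = M * (M - 1) // 2
    return d_terms + diagonal_terms + cross_terms


def items_by_patent_formula(M):
    """M^2 + M - (1 + 2 + ... + (M - 1)), as written in the text."""
    return M * M + M - sum(range(1, M))
```

For M = 4 both functions give 14, the figure quoted later in the text for the four-pulse case.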

[0024] Fig. 4 shows the processing flow of the conventional fixed codebook search in the ACELP speech coding method. In Fig. 4, 101 is the threshold calculating means, which calculates the threshold that controls how the algebraic codebook is searched. 418 is the narrowing-down search approach / pulse position search means, which performs both the narrowing-down search approach processing, which narrows down the search locations of the algebraic codebook based on the threshold, and the pulse position search processing, which searches the algebraic codebook based on that result. Fig. 4 shows the case where the number of pulses M of the algebraic codebook to be searched is four. First, in step 400 of the threshold calculating means 101, the threshold thres given by (Equation 6) is calculated.

[0025]

(Equation 6)

[0026] Here, max is the maximum absolute value of the correlation due to the first three pulses, ave is the average of the correlation due to the first three pulses, and K is a constant with 0 ≤ K ≤ 1. Processing then moves to the narrowing-down search approach / pulse position search means 418. Step 402 calculates the correlation value C0 of (Equation 7) and the energy E0 of (Equation 8); step 404 calculates the correlation value C1 of (Equation 9) and the energy E1 of (Equation 10); and step 406 calculates the correlation value C2 of (Equation 11) and the energy E2 of (Equation 12). Step 401 starts the search loop for the first pulse position m₀, step 403 starts the search loop for the second pulse position m₁, and step 405 starts the search loop for the third pulse position m₂.
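The image for (Equation 6) is missing, so the exact threshold formula cannot be read from this text. The sketch below assumes the form used in ITU-T G.729, thres = ave + K·(max − ave), which is consistent with the three quantities defined in [0026] but remains an assumption. The function name and the choice to pass max and ave in as arguments are also illustrative.

```python
def search_threshold(max_corr, ave_corr, K):
    """Threshold between the average and the maximum three-pulse correlation.

    Assumed form: thres = ave + K * (max - ave), with 0 <= K <= 1.
    K = 0 gives the average; K = 1 gives the maximum.
    """
    if not 0.0 <= K <= 1.0:
        raise ValueError("K must satisfy 0 <= K <= 1")
    return ave_corr + K * (max_corr - ave_corr)
```

Under this assumed form, a larger K raises the threshold, so fewer three-pulse combinations reach the final loop.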

[0027]

(Equation 7)

[0028]

(Equation 8)

[0029]

(Equation 9)

[0030]

(Equation 10)

[0031]

(Equation 11)

[0032]

(Equation 12)

[0033] Here, d[] is the correlation signal between the impulse response of the perceptually weighted synthesis filter and the residual signal obtained by subtracting the short-term and long-term prediction contributions from the input speech signal; m_i is the position of the i-th pulse and s_i is its amplitude; and Φ′ is the matrix Φ of (Equation 2), containing the correlations of the impulse response of the perceptually weighted synthesis filter, with the polarity information of the correlation signal d[] added. Next, in step 407, the threshold thres calculated in step 400 is compared with the correlation value C2. If C2 is less than or equal to thres, the search moves on to the next pulse position (steps 415, 416, 417). If C2 is greater than thres, processing moves to step 408, where the search for the fourth pulse position m₃ starts, and step 409 calculates the correlation value C3 of (Equation 13) and the energy E3 of (Equation 14).

[0034]

(Equation 13)

[0035]

(Equation 14)

[0036] In step 410, C3²·E_max is compared with C_max·E3 in order to find the optimal pulse positions, which maximize (Equation 3). C_max and E_max are the correlation value and energy of the optimal pulses. If C_max·E3 is greater than or equal to C3²·E_max, the search moves on to the next pulse position (step 412). Conversely, if C_max·E3 is less than C3²·E_max, step 411 saves the current pulse positions m₀, m₁, m₂, m₃, together with their correlation value and energy, as the new optimum, and the search moves on to the next pulse position (step 412). Step 412 is the end of the fourth pulse position search loop. When all fourth pulse positions to be searched have been examined, step 413 increments a counter that counts how many times the fourth pulse position search has been performed. In step 414, if the counter exceeds the maximum search count, which is the upper limit on the number of fourth pulse position searches, the pulse position search ends. If the counter is less than or equal to the maximum search count, the search moves on to the next pulse position (step 415). The narrowing-down search approach / pulse position search means 418 repeats this series of steps until the counter exceeds the maximum search count or until all first, second, and third pulse positions to be searched have been examined. Step 415 is the end of the third pulse position search loop, step 416 is the end of the second pulse position search loop, and step 417 is the end of the first pulse position search loop. As is clear from the above, when the fixed codebook processing for a four-pulse algebraic codebook is performed all at once in the narrowing-down search approach / pulse position search means 418, a total of 14 data items of d[] and Φ′() appearing in (Equation 7) to (Equation 14) must be handled inside the fourfold loop.
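The conventional flow of Fig. 4 (steps 401 to 417) can be sketched as a single fourfold loop. Because the images for (Equation 7) to (Equation 14) are missing, the running sums below are assumptions based on (Equation 4) and the usual quadratic form for the energy, with the pulse signs already folded into `d` and `phi` (the Φ′ of the text). The `tracks` argument, one list of candidate positions per pulse, is also an assumed layout that the text does not specify.

```python
def combined_search(d, phi, tracks, thres, max_searches):
    """Threshold test and pulse position search in one nested loop (Fig. 4 style)."""
    best_positions = None
    c_max, e_max = -1.0, 1.0     # squared correlation and energy of the best pulses
    searches = 0                 # number of times the final loop has been run
    for m0 in tracks[0]:                                  # step 401
        c0 = d[m0]                                        # step 402
        e0 = phi[m0][m0]
        for m1 in tracks[1]:                              # step 403
            c1 = c0 + d[m1]                               # step 404
            e1 = e0 + phi[m1][m1] + 2.0 * phi[m0][m1]
            for m2 in tracks[2]:                          # step 405
                c2 = c1 + d[m2]                           # step 406
                e2 = e1 + phi[m2][m2] + 2.0 * (phi[m0][m2] + phi[m1][m2])
                if c2 <= thres:                           # step 407
                    continue
                for m3 in tracks[3]:                      # step 408
                    c3 = c2 + d[m3]                       # step 409
                    e3 = e2 + phi[m3][m3] + 2.0 * (
                        phi[m0][m3] + phi[m1][m3] + phi[m2][m3])
                    if c_max * e3 < c3 * c3 * e_max:      # step 410
                        c_max, e_max = c3 * c3, e3        # step 411
                        best_positions = (m0, m1, m2, m3)
                searches += 1                             # step 413
                if searches > max_searches:               # step 414
                    return best_positions
    return best_positions
```

Counting the distinct array reads inside the loops (four `d[...]` values, four diagonal `phi` terms, and six cross terms) gives the 14 data items that [0036] says must be addressed at the same time.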

[0037] This means that, for example, when the number of pulses M of the algebraic codebook is four, 14 address pointers are needed. However, the number of address pointers available for search processing on a typical DSP is limited. Until now, the processing has therefore had to save the contents of address pointers to memory and restore them as it goes, which increases the amount of computation.

[0038] The present invention solves this conventional problem. Its object is to provide a speech coding method and a speech coding apparatus that can reduce the amount of computation in the fixed codebook search when the ACELP speech coding method is implemented on a signal processing LSI such as a DSP.

[0039]

Means for Solving the Problems

To achieve this object, the speech coding method according to claim 1 of the present invention performs linear prediction analysis on a speech signal of a given frame length to generate a short-term synthesis filter; perceptually weights the speech signal to generate a perceptually weighted speech signal; generates a long-term synthesis filter based on the perceptually weighted speech signal; and, based on the residual signal obtained by subtracting the short-term prediction contribution of the short-term synthesis filter and the long-term prediction contribution of the long-term synthesis filter from the speech signal, searches an algebraic codebook so that the mean squared error between the residual signal and the reconstructed speech signal is minimized. The method is characterized by calculating a threshold that controls how the algebraic codebook is searched, narrowing down the search locations of the algebraic codebook based on the threshold, and searching the algebraic codebook based on the result of the narrowing down.

[0040] With this method, the narrowing-down search approach processing, which narrows down the codebook search locations, is separated from the pulse position search processing, which searches for the pulse positions of the algebraic codebook, within the fixed codebook search of the ACELP speech coding method. This reduces the number of address pointers needed when the fixed codebook search is implemented on a signal processing LSI such as a DSP. Because fewer saves and restores of address pointer contents are needed for address generation, the increase in computation associated with address generation can be suppressed.

[0041] The speech coding apparatus according to claim 2 of the present invention comprises: linear prediction analysis means for performing linear prediction analysis on a speech signal of a given frame length to generate a short-term synthesis filter; perceptual weighting means for perceptually weighting the speech signal to generate a perceptually weighted speech signal; adaptive codebook search means for generating a long-term synthesis filter based on the perceptually weighted speech signal; and fixed codebook search means for searching an algebraic codebook, based on the residual signal obtained by subtracting the short-term prediction contribution of the short-term synthesis filter and the long-term prediction contribution of the long-term synthesis filter from the speech signal, so that the mean squared error between the residual signal and the reconstructed speech signal is minimized. The apparatus is characterized in that the fixed codebook search means comprises threshold calculating means for calculating a threshold that controls how the algebraic codebook is searched, narrowing-down search approach means for narrowing down the search locations of the algebraic codebook based on the threshold, and pulse position search means for searching the algebraic codebook based on the result of the narrowing-down search approach means.

[0042] With this configuration, the narrowing-down search approach processing, which narrows down the codebook search locations, is separated from the pulse position search processing, which searches for the pulse positions of the algebraic codebook, within the fixed codebook search of the ACELP speech coding method. This reduces the number of address pointers needed when the fixed codebook search is implemented on a signal processing LSI such as a DSP. Because fewer saves and restores of address pointer contents are needed for address generation, the increase in computation associated with address generation can be suppressed.

[0043] The speech coding apparatus according to claim 3 of the present invention comprises: a linear prediction analysis processing unit that performs linear prediction analysis on a speech signal of a given frame length to generate a short-term synthesis filter; a perceptual weighting processing unit that perceptually weights the speech signal to generate a perceptually weighted speech signal; an adaptive codebook search processing unit that generates a long-term synthesis filter based on the perceptually weighted speech signal; and a fixed codebook search processing unit that searches an algebraic codebook, based on the residual signal obtained by subtracting the short-term prediction contribution of the short-term synthesis filter and the long-term prediction contribution of the long-term synthesis filter from the speech signal, so that the mean squared error between the residual signal and the reconstructed speech signal is minimized. The apparatus is characterized in that the fixed codebook search processing unit comprises: a threshold calculation processing unit that calculates a threshold that controls how the algebraic codebook is searched; a narrowing-down search approach processing unit that narrows down the search locations of the algebraic codebook based on the threshold; a codebook search location information memory that stores information on the search locations of the algebraic codebook narrowed down by the narrowing-down search approach processing unit; and a pulse position search processing unit that searches the algebraic codebook based on the information stored in the codebook search location information memory.

[0044] According to this configuration, in the fixed codebook search of the ACELP speech coding method, the narrowing-down search approach processing, which narrows the search locations of the codebook, is separated from the pulse position search processing, which searches the pulse positions of the algebraic codebook. This reduces the number of address pointers required when the fixed codebook search is implemented on a signal processing LSI such as a DSP. Because the address pointer contents then need to be saved and restored less often during address generation, the increase in the amount of computation related to address generation can be suppressed.

[0045] The medium of the present invention records a program that causes a computer to execute each means for realizing the speech encoding method according to claim 1. With this configuration, operating a computer using this medium achieves the same operation and effects as claim 1.

[0046]

DESCRIPTION OF THE PREFERRED EMBODIMENTS

Embodiments of the present invention are described below with reference to FIGS. 1, 2, and 3.

(First Embodiment) A first embodiment of the present invention is described below with reference to FIGS. 1 and 2.

[0047] FIG. 1 is a block diagram showing the basic configuration of the fixed codebook search means 100 in the ACELP speech coding method according to the first embodiment of the present invention. In FIG. 1, 101 is a threshold calculation means that calculates a threshold for controlling how the algebraic codebook is searched, 102 is a narrowing-down search approach means that narrows the search locations of the algebraic codebook based on the threshold, and 103 is a pulse position search means that searches the algebraic codebook based on the result of the narrowing-down search approach means.

[0048] FIG. 2 is a flowchart showing the processing flow of the fixed codebook search in the ACELP speech coding method according to the first embodiment of the present invention. The processing that precedes the threshold calculation means 101, the narrowing-down search approach means 102, and the pulse position search means 103 of this embodiment is performed in the same way as in the conventional method, and each data item handled in this embodiment is assumed to be already available. In this embodiment, the algebraic codebook to be searched is assumed to have four pulses.

[0049] As shown in FIG. 2, in this embodiment the threshold calculation means 101 first calculates, in step 200, the threshold thres given by (Equation 6). Here, max is the maximum of the absolute values of the correlation values from the first three pulses, ave is the average of the correlation values from the first three pulses, and K is a constant with 0 ≤ K ≤ 1. Processing then moves to the narrowing-down search approach means 102, which uses the threshold thres to preselect the first, second, and third pulse positions for which the final, fourth pulse position will be searched. The narrowing-down search approach means 102 searches the first, second, and third pulse positions sequentially, one pulse at a time, and in step 204 calculates the correlation value C given by (Equation 15). Step 201 is the start step of the first pulse position m₀ search loop, step 202 is the start step of the second pulse position m₁ search loop, and step 203 is the start step of the third pulse position m₂ search loop.

[0050]

[Equation 15]

[0051] Here, d[] is the correlation signal between the residual signal, obtained by subtracting the short-term and long-term prediction contributions from the input speech signal, and the impulse response of the perceptually weighted synthesis filter; m_i is the i-th pulse position and s_i is its amplitude. Next, in step 205, the correlation value C is compared with the threshold thres calculated in step 200. If C is less than or equal to thres, the search proceeds to the next pulse position (steps 209, 210, 211). If C is greater than thres, processing moves to step 206, where the first, second, and third pulse positions m₀, m₁, m₂ at that time and the correlation value C are stored. Then, if the counter that counts the number of times first to third pulse positions have been selected and stored is greater than or equal to the maximum search count, which is the upper limit on the number of fourth pulse position searches, the narrowing-down search approach means 102 terminates. If the counter is less than the maximum search count, the counter is incremented by 1 in step 208 and the search proceeds to the next pulse position. The narrowing-down search approach means 102 repeats this series of operations until the counter reaches or exceeds the maximum search count, or until all first, second, and third pulse positions to be searched have been searched. Step 209 is the end step of the third pulse position search loop, step 210 is the end step of the second pulse position search loop, and step 211 is the end step of the first pulse position search loop.
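The narrowing-down stage described above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the patented implementation: the function name `narrow_candidates`, the `tracks` argument (one list of allowed positions per pulse), and `max_candidates` are hypothetical; the threshold is assumed to have the form thres = ave + K·(max − ave), since (Equation 6) is not reproduced in this section; and (Equation 15) is assumed to reduce to a sum of |d[m_i]| when each amplitude s_i takes the sign of d[m_i]. The sketch also simply stops once `max_candidates` triples are stored, which simplifies the counter handling of steps 206 to 208.

```python
from itertools import product


def narrow_candidates(d, tracks, K=0.5, max_candidates=4):
    """Preselect (m0, m1, m2) triples whose correlation exceeds the threshold."""
    # Assumed inputs to the threshold: per-track peak and per-track mean of |d|.
    peak = sum(max(abs(d[m]) for m in t) for t in tracks)
    mean = sum(sum(abs(d[m]) for m in t) / len(t) for t in tracks)
    thres = mean + K * (peak - mean)  # assumed form of (Equation 6)

    selected = []
    for m0, m1, m2 in product(*tracks):  # nested loops, steps 201 to 203
        c = abs(d[m0]) + abs(d[m1]) + abs(d[m2])  # step 204, assumed (Equation 15)
        if c <= thres:  # step 205: not promising, try the next positions
            continue
        selected.append((m0, m1, m2, c))  # step 206: keep positions and C
        if len(selected) >= max_candidates:  # upper limit on later searches
            break
    return thres, selected
```

Only `d` is read inside the loops, which matches the observation later in the text that this stage touches d[m₀], d[m₁], d[m₂] and the output buffer and nothing else.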

[0052] Processing then moves to the pulse position search means 103. The pulse position search means 103 first calculates, in step 213, the energy E1 given by (Equation 16) for the first, second, and third pulse positions selected by the narrowing-down search approach means 102.

[0053]

[Equation 16]

[0054] Here, Φ′ denotes the matrix Φ of (Equation 2), which contains the correlations of the impulse response of the perceptually weighted synthesis filter, with the polarity information of the correlation signal d[] added; m₀, m₁, m₂ are the first, second, and third pulse positions selected by the narrowing-down search approach means 102. In addition, the correlation value C of the first, second, and third pulses at positions m₀ to m₂ is read into C1.

[0055] The relationship between Φ′ in (Equation 16) and Φ in (Equation 2) is

Φ′(i, j) = sign[d[i]] · sign[d[j]] · Φ(i, j) (i = 0, 1, …, n−1)

where sign[X] denotes the polarity (+ or −) of the variable X. Step 212 is the start step of the codebook search loop.
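The sign relation above can be illustrated with a few lines of Python. This is a minimal sketch, not code from the patent: `sign` and `fold_signs` are hypothetical names, the matrix is held as a plain nested list, and a zero entry of d is treated as positive, which the text does not specify.

```python
def sign(x):
    """Polarity of x as +1.0 or -1.0 (zero treated as positive by assumption)."""
    return 1.0 if x >= 0 else -1.0


def fold_signs(phi, d):
    """Return phi'(i, j) = sign(d[i]) * sign(d[j]) * phi(i, j)."""
    n = len(d)
    return [[sign(d[i]) * sign(d[j]) * phi[i][j] for j in range(n)]
            for i in range(n)]
```

Folding the polarities into the matrix once means the later energy computations need no per-pulse sign handling.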

[0056] Then, the search for the fourth pulse position starts from step 214, the start step of the fourth pulse position m₃ search loop, and in step 215 the energy E2 and the correlation value C2 given by (Equation 17) and (Equation 18) are calculated.

[0057]

[Equation 17]

[0058]

[Equation 18]

[0059] Here, m₃ is the fourth pulse position and s₃ is its amplitude. In step 216, C2²·E_max is compared with C_max·E2 in order to find the optimal pulse positions that maximize the expression given by (Equation 3). C_max and E_max are the correlation value and the energy of the optimal pulses, respectively. If C_max·E2 is greater than or equal to C2²·E_max, the search proceeds to the next pulse position. Conversely, if C_max·E2 is less than C2²·E_max, then in step 217 all the pulse positions m₀, m₁, m₂, m₃ at that time are stored as the optimal pulse positions, and C2² and E2 are stored as the updated correlation value and energy of the optimal pulses, after which the search proceeds to the next pulse position. Step 218 is the end step of the fourth pulse position search loop.
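The comparison in step 216 can be read as a division-free test of a ratio. Assuming that the expression in (Equation 3) is the usual squared-correlation-to-energy ratio C²/E and that both energies are positive (neither is restated in this section), and noting that C_max holds a squared correlation after step 217, the equivalence is:

```latex
\frac{C_2^{2}}{E_2} > \frac{C_{\max}}{E_{\max}}
\quad\Longleftrightarrow\quad
C_2^{2}\,E_{\max} > C_{\max}\,E_2
\qquad (E_2 > 0,\ E_{\max} > 0)
```

Cross-multiplying in this way replaces a division per candidate with two multiplications.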

[0060] When the full search of the fourth pulse position is complete, the fourth pulse position is searched in the same way for the next set of first to third pulse positions m₀, m₁, m₂ selected by the narrowing-down search approach means 102. The pulse position search means 103 performs this series of operations for all the pulse positions selected by the narrowing-down search approach means 102.
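The pulse position search stage can be sketched as follows. The sketch rests on assumptions that are not confirmed by this section: `search_fourth_pulse` and its parameters are hypothetical names; `phi` is assumed to already be the sign-adjusted matrix Φ′; E1 is assumed to be the three diagonal terms plus twice the three cross terms; E2 is assumed to add Φ′(m₃, m₃) and twice the cross terms with m₃; and C2 is assumed to be C1 + |d[m₃]|. The initial values of `c_max` and `e_max` are chosen only so that the first candidate is always accepted.

```python
def search_fourth_pulse(candidates, track3, d, phi):
    """Search m3 for each preselected (m0, m1, m2, C1) and keep the best set."""
    best = None
    c_max, e_max = -1.0, 1.0  # assumed start values: first candidate always wins
    for m0, m1, m2, c1 in candidates:  # loop over preselected triples
        # Step 213: energy of the first three pulses (assumed Equation 16).
        e1 = (phi[m0][m0] + phi[m1][m1] + phi[m2][m2]
              + 2.0 * (phi[m0][m1] + phi[m0][m2] + phi[m1][m2]))
        for m3 in track3:  # fourth pulse position loop, steps 214 to 218
            # Step 215: incremental updates (assumed Equations 17 and 18).
            c2 = c1 + abs(d[m3])
            e2 = e1 + phi[m3][m3] + 2.0 * (
                phi[m0][m3] + phi[m1][m3] + phi[m2][m3])
            # Step 216: division-free comparison of C2^2 / E2 with C_max / E_max.
            if c2 * c2 * e_max > c_max * e2:
                # Step 217: store the positions, squared correlation and energy.
                best, c_max, e_max = (m0, m1, m2, m3), c2 * c2, e2
    return best, c_max, e_max
```

Inside the inner loop the only table reads are Φ′(m₃, m₃), Φ′(m₀, m₃), Φ′(m₁, m₃), Φ′(m₂, m₃), and d[m₃], five streams in all.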

[0061] When the above processing is implemented on a signal processing LSI such as a DSP, the only data handled in the narrowing-down search approach means 102 are d[m₀], d[m₁], and d[m₂]. The address pointers required are three for d[m₀] to d[m₂] and one for storing the pulse positions and related data, four in total. In the pulse position search means 103, the address pointers required during the fourth pulse position search are five in total, for Φ′(m₃, m₃), Φ′(m₀, m₃), Φ′(m₁, m₃), Φ′(m₂, m₃), and d[m₃]. The terms Φ′(m₀, m₀) to Φ′(m₁, m₂) used when the pulse position search means 103 calculates the energy E1 can share these five address pointers, so no additional address pointer is needed.

[0062] As described above, according to this embodiment only five address pointers are needed during the fixed codebook search. When the processing runs on a DSP with five or more address pointers, the address pointer contents never need to be saved or restored, so the amount of computation related to address generation does not increase. Even when the processing runs on a DSP with fewer than five address pointers, the address pointer contents need to be saved and restored fewer times than in the conventional method, so the increase in the amount of computation related to address generation can be suppressed.

[0063] (Second Embodiment) A second embodiment of the present invention is described below with reference to FIG. 3. FIG. 3 is a block diagram showing the second embodiment of the present invention. In FIG. 3, 301 is a threshold calculation processing unit that calculates the threshold used in the narrowing-down search approach processing, 302 is a narrowing-down search approach processing unit that narrows the search locations of the codebook based on the threshold, 303 is a codebook search location information storage memory that stores information on the codebook search locations narrowed down by the narrowing-down search approach processing unit 302, and 304 is a pulse position search processing unit that searches the codebook search locations narrowed down by the narrowing-down search approach processing unit 302, based on the information stored in the codebook search location information storage memory 303. The processing that precedes the threshold calculation processing unit 301, the narrowing-down search approach processing unit 302, and the pulse position search processing unit 304 of this embodiment is performed in the same way as in the conventional method, and each data item handled in this embodiment is assumed to be already available.

[0064] In this embodiment, the algebraic codebook to be searched is assumed to have four pulses, and the processing is described below. First, the threshold calculation processing unit 301 calculates the threshold thres given by (Equation 6). Here, max is the maximum of the absolute values of the correlation values from the first three pulses, ave is the average of the correlation values from the first three pulses, and K is a constant with 0 ≤ K ≤ 1. Next, the narrowing-down search approach processing unit 302 uses the threshold thres to preselect the first, second, and third pulse positions for which the final, fourth pulse position will be searched. The narrowing-down search approach processing unit 302 searches the first, second, and third pulse positions sequentially, one pulse at a time, and first calculates the correlation value C given by (Equation 15).

[0065] Here, d[] is the correlation signal between the residual signal, obtained by subtracting the short-term and long-term prediction contributions from the input speech signal, and the impulse response of the perceptually weighted synthesis filter; m_i is the i-th pulse position and s_i is its amplitude. Next, the correlation value C is compared with the threshold thres calculated by the threshold calculation processing unit 301. If C is less than or equal to thres, the search proceeds to the next pulse position. If C is greater than thres, the first, second, and third pulse positions m₀, m₁, m₂ at that time and the correlation value C are stored in the codebook search location information storage memory 303. Then, if the number of times first to third pulse positions have been selected and stored is greater than or equal to the maximum search count, which is the upper limit on the number of fourth pulse position searches, the narrowing-down search approach processing terminates. If that number is less than the maximum search count, the search proceeds to the next pulse position. The narrowing-down search approach processing unit 302 repeats this series of operations until the number of times first to third pulse positions have been selected and stored reaches or exceeds the maximum search count, or until all first, second, and third pulse positions to be searched have been searched. Processing then moves to the pulse position search processing unit 304. Based on the information stored in the codebook search location information storage memory 303, the pulse position search processing unit 304 first calculates the energy E1 given by (Equation 16) for the first, second, and third pulse positions selected by the narrowing-down search approach processing unit 302.

[0066] Here, Φ′ denotes the matrix Φ of (Equation 2), which contains the correlations of the impulse response of the perceptually weighted synthesis filter, with the polarity information of the correlation signal d[] added; m₀, m₁, m₂ are the first, second, and third pulse positions selected by the narrowing-down search approach processing unit 302. In addition, the correlation value C of the first, second, and third pulses at positions m₀ to m₂ is read from the codebook search location information storage memory 303 into C1. The search for the fourth pulse position then starts, and the energy E2 and the correlation value C2 given by (Equation 17) and (Equation 18) are calculated.

[0067] Here, m₃ is the fourth pulse position and s₃ is its amplitude. C2²·E_max is then compared with C_max·E2 in order to find the optimal pulse positions that maximize the expression given by (Equation 3). C_max and E_max are the correlation value and the energy of the optimal pulses, respectively. If C_max·E2 is greater than or equal to C2²·E_max, the search proceeds to the next pulse position. Conversely, if C_max·E2 is less than C2²·E_max, all the pulse positions m₀, m₁, m₂, m₃ at that time are stored as the optimal pulse positions, and C2² and E2 are stored as the updated correlation value and energy of the optimal pulses, after which the search proceeds to the next pulse position.

[0068] When the full search of the fourth pulse position is complete, the fourth pulse position is searched in the same way for the next set of first to third pulse positions m₀, m₁, m₂ selected by the narrowing-down search approach processing unit 302. The pulse position search processing unit 304 performs this series of operations for all the pulse positions selected by the narrowing-down search approach processing unit 302.
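The role of the codebook search location information storage memory 303 can be pictured as a small bounded buffer that the narrowing-down stage writes and the pulse position search stage reads. The class below is a hypothetical illustration only: the name `SearchLocationMemory`, its fields, and the choice to reject writes once the capacity is reached are assumptions, with the capacity standing in for the maximum search count.

```python
from dataclasses import dataclass, field


@dataclass
class SearchLocationMemory:
    """Bounded store of (m0, m1, m2, C) records, standing in for memory 303."""
    capacity: int  # plays the role of the maximum search count
    records: list = field(default_factory=list)

    def is_full(self):
        """True once the number of stored records reaches the capacity."""
        return len(self.records) >= self.capacity

    def save(self, m0, m1, m2, c):
        """Store one preselected triple with its correlation; False if full."""
        if self.is_full():
            return False
        self.records.append((m0, m1, m2, c))
        return True

    def __iter__(self):
        """Iterate over stored records in the order they were written."""
        return iter(self.records)
```

Because the two stages communicate only through this buffer, each stage can use its own small set of address pointers without saving or restoring the other's.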

[0069] When each of the above processing units is implemented on a signal processing LSI such as a DSP, the only data handled in the narrowing-down search approach processing unit 302 are d[m₀], d[m₁], and d[m₂]. The address pointers required are three for d[m₀] to d[m₂] and one for storing the pulse positions and related information in the codebook search location information storage memory 303, four in total. In the pulse position search processing unit 304, the address pointers required during the fourth pulse position search are five in total, for Φ′(m₃, m₃), Φ′(m₀, m₃), Φ′(m₁, m₃), Φ′(m₂, m₃), and d[m₃]. The terms Φ′(m₀, m₀) to Φ′(m₁, m₂) used when the pulse position search processing unit 304 calculates the energy E1 can share these five address pointers, so no additional address pointer is needed.

[0070] As described above, according to this embodiment only five address pointers are needed during the fixed codebook search. When the processing runs on a DSP with five or more address pointers, the address pointer contents never need to be saved or restored, so the amount of computation related to address generation does not increase. Even when the processing runs on a DSP with fewer than five address pointers, the address pointer contents need to be saved and restored fewer times than in the conventional method, so the increase in the amount of computation related to address generation can be suppressed.

[0071] The present invention can also be applied to a recording medium. That is, the medium records a program that causes a computer to execute each means for realizing the speech encoding method of the first embodiment described above. With this configuration, operating a computer using this medium achieves the same operation and effects as claim 1.

[0072]

EFFECTS OF THE INVENTION

As described above, according to the present invention, in the fixed codebook search of the ACELP speech coding method, the narrowing-down search approach processing, which narrows the search locations of the codebook, is separated from the pulse position search processing, which searches the pulse positions of the algebraic codebook. This reduces the number of address pointers required when the fixed codebook search is implemented on a signal processing LSI such as a DSP. Because the address pointer contents then need to be saved and restored less often during address generation, the increase in the amount of computation related to address generation can be suppressed.

[Brief Description of the Drawings]

FIG. 1 is a block diagram showing the basic configuration of the fixed codebook search means in the speech encoding apparatus according to the first embodiment of the present invention.

FIG. 2 is a flowchart showing the fixed codebook search processing in the speech encoding apparatus according to the first embodiment of the present invention.

FIG. 3 is a block diagram showing the configuration of the fixed codebook search processing unit in the speech encoding apparatus according to the second embodiment of the present invention.

FIG. 4 is a flowchart showing the conventional fixed codebook search processing flow.

[Explanation of Symbols]

100 Fixed codebook search means
101 Threshold calculation means
102 Narrowing-down search approach means
103 Pulse position search means
200 to 219 Processing steps
300 Fixed codebook search processing unit
301 Threshold calculation processing unit
302 Narrowing-down search approach processing unit
303 Codebook search location information storage memory
304 Pulse position search processing unit
400 to 417 Processing steps
418 Narrowing-down search approach and pulse position search means

Claims

[Claims]

1. A speech encoding method in which linear prediction analysis is performed on a speech signal of a certain frame length to generate a short-term synthesis filter, perceptual weighting is applied to the speech signal to generate a perceptually weighted speech signal, a long-term synthesis filter is generated based on the perceptually weighted speech signal, and an algebraic codebook is searched, based on a residual signal obtained by subtracting the short-term prediction contribution of the short-term synthesis filter and the long-term prediction contribution of the long-term synthesis filter from the speech signal, so that the mean square error between the residual signal and the reproduced speech signal is minimized, the method comprising: calculating a threshold for controlling how the algebraic codebook is searched; narrowing the search locations of the algebraic codebook based on the threshold; and searching the algebraic codebook based on the result of the narrowing.

2. A speech encoding apparatus comprising: linear prediction analysis means for performing linear prediction analysis on a speech signal of a certain frame length to generate a short-term synthesis filter; perceptual weighting means for applying perceptual weighting to the speech signal to generate a perceptually weighted speech signal; adaptive codebook search means for generating a long-term synthesis filter based on the perceptually weighted speech signal; and fixed codebook search means for searching an algebraic codebook, based on a residual signal obtained by subtracting the short-term prediction contribution of the short-term synthesis filter and the long-term prediction contribution of the long-term synthesis filter from the speech signal, so that the mean square error between the residual signal and the reproduced speech signal is minimized, wherein the fixed codebook search means comprises: threshold calculation means for calculating a threshold for controlling how the algebraic codebook is searched; narrowing-down search approach means for narrowing the search locations of the algebraic codebook based on the threshold; and pulse position search means for searching the algebraic codebook based on the result of the narrowing-down search approach means.

3. A speech encoding apparatus comprising: a linear prediction analysis processing unit that performs linear prediction analysis on a speech signal of a certain frame length to generate a short-term synthesis filter; a perceptual weighting processing unit that applies perceptual weighting to the speech signal to generate a perceptually weighted speech signal; an adaptive codebook search processing unit that generates a long-term synthesis filter based on the perceptually weighted speech signal; and a fixed codebook search processing unit that searches an algebraic codebook, based on a residual signal obtained by subtracting the short-term prediction contribution of the short-term synthesis filter and the long-term prediction contribution of the long-term synthesis filter from the speech signal, so that the mean square error between the residual signal and the reproduced speech signal is minimized, wherein the fixed codebook search processing unit comprises: a threshold calculation processing unit that calculates a threshold for controlling how the algebraic codebook is searched; a narrowing-down search approach processing unit that narrows the search locations of the algebraic codebook based on the threshold; a codebook search location information storage memory that stores information on the search locations of the algebraic codebook narrowed down by the narrowing-down search approach processing unit; and a pulse position search processing unit that searches the algebraic codebook based on the information stored in the codebook search location information storage memory.

4. A medium on which is recorded a program that causes a computer to execute each means for realizing the speech encoding method according to claim 1.