JPH06130996A

JPH06130996A - Code excitation linear predictive encoding and decoding device

Info

Publication number: JPH06130996A
Application number: JP4278455A
Authority: JP
Inventors: Hiroshi Katsuragawa; 浩桂川; Hiromi Aoyanagi; 弘美青柳; Yoshihiro Ariyama; 義博有山; Katsutoshi Ito; 克俊伊東
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1992-10-16
Filing date: 1992-10-16
Publication date: 1994-05-13
Anticipated expiration: 2015-09-18
Also published as: JP3088204B2

Abstract

PURPOSE:To reduce processing and time for deciding the code of excitation source information contributed for a voiced sound rather than conventional processing and time and to less reduce the number of bits required for the excitation source information contributed for the voiced sound rather than the conventional device. CONSTITUTION:In the code excitation linear predictive encoding device, a long- term analyzer 12 obtain a frame lag L of an input voice signal at every frame period, and an excitation source evaluation circuit 13 searches an adaptive code book 14 from among the adaptive excitation code vectors before and behind the prescribed number with an adaptive excitation code vector corresponding to the frame lag L as a center in the case of searching the adaptive code book 14. In this case, when the encoded voice signal contains both the frame lag and a sub frame corrected lag DELTAL as the information of time difference between the optimum adaptive excitation code vector obtained as the result of search and the adaptive excitation code vector corresponding to the frame lag, the efficiency of compression can be improved in comparison with the conventional device.

Description

【発明の詳細な説明】Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、コード励振線形予測符
号化方式（ＣＥＬＰ符号化方式）に従うコード励振線形
予測符号化装置及び復号化装置に適用して好適なもので
ある。BACKGROUND OF THE INVENTION The present invention is suitable for application to a code-excited linear predictive coding apparatus and a decoding apparatus according to a code-excited linear predictive coding method (CELP coding method).

【０００２】[0002]

【従来の技術】８ｋビット／秒以下の圧縮率での、音声
信号の高能率符号化方式としては、Ａｔａｌ等によって
提案されたＣＥＬＰ符号化方式が有効な手法である。2. Description of the Related Art The CELP coding method proposed by Atal et al. Is an effective method as a high-efficiency coding method for audio signals at a compression rate of 8 kbit / sec or less.

【０００３】これは、音声信号を固定の長さのフレーム
毎に符号化するフレーム符号化方式に属しており、フレ
ームに１回求められる声道のパラメータと、フレームを
いくつかに分解したサブフレーム毎に求められる励振源
のパラメータによって表現するものである。入力音声か
ら声道パラメータを得るフォワード型のＣＥＬＰ符号化
方式においては、励振源のパラメータとして、無声音の
符号化のための統計コードブックと、有声音の符号化の
ための適応コードブックの２つによってベクトル量子化
するものが多く、例えば、下記文献に記載されている。
音声信号は、定常的は有声音の区間と、過渡的な無声音
の区間に分けて考えることができ、両者はその統計的な
性質が大きく異なっており、そのため、上述のように、
統計コードブックと適応コードブックとが別個に用意さ
れている。This belongs to a frame coding system for coding a voice signal for each frame having a fixed length. The vocal tract parameter required once for a frame and a subframe obtained by dividing the frame into several parts. It is expressed by the parameters of the excitation source obtained for each. In the forward type CELP coding method for obtaining vocal tract parameters from input speech, there are two excitation source parameters: a statistical codebook for coding unvoiced sounds and an adaptive codebook for coding voiced sounds. Many are vector-quantized by, for example, it is described in the following document.
The voice signal can be considered as being divided into a stationary voiced sound section and a transient unvoiced section, and both have greatly different statistical properties. Therefore, as described above,
A statistical codebook and an adaptive codebook are prepared separately.

【０００４】文献：N.S.Jayant & J.H.Chen,"Speech Co
ding with Time-Varying BitAllocations to Excitatio
n and LPC parameters",Proc,ICASSP-89,(1989) 図２は適応コードブックによる符号化の概念を説明する
ためのものである。周期性（自己相関性）を有する有声
音区間の音質に対する寄与が大きい適応コードブックに
は、現時点で対象となっているサブフレームの直前過去
の長時間Ｔの励振源波形Ｓ１が例えばシフトレジスタに
よって格納されている。この長時間の励振源波形Ｓ１に
おいて、サブフレーム期間を有する部分的波形（統計励
振コードベクトル）Ｓ11〜Ｓ1xを考える。現時点で対象
となっているサブフレームの励振源波形として、いずれ
の部分的波形Ｓ11、Ｓ12、…、Ｓ1xを用いると、入力音
声と局部再生音声とが最も近似するかを探索し、探索さ
れた部分的波形Ｓ1nを特定する情報（例えば長時間Ｔの
最初のサンプルからの部分的波形Ｓ1nの最初のサンプル
までのサンプル数をビット表現したもの）を適応コード
ブックからの現時点の処理対象サブフレームについての
出力符号とする。ここで、出力符号は、結局、現時点の
処理対象サブフレームとほぼ同様な波形形状を有する過
去の時点との時間差（ラグ：ピッチ周期）である。Reference: NSJayant & JHChen, "Speech Co
ding with Time-Varying BitAllocations to Excitatio
n and LPC parameters ", Proc, ICASSP-89, (1989) Fig. 2 is for explaining the concept of coding by the adaptive codebook. For the sound quality of the voiced section having periodicity (autocorrelation) In the adaptive codebook with a large contribution, the excitation source waveform S1 for a long time T immediately before and after the target subframe at this time is stored by, for example, a shift register. Consider partial waveforms (statistical excitation code vectors) S11 to S1x having a sub-frame period, where any one of the partial waveforms S11, S12, ..., S1x is used as the excitation source waveform of the target sub-frame at present. , Information for identifying the input sound and the locally reproduced sound that are closest to each other and specifying the searched partial waveform S1n (for example, the first partial waveform S1n from the first sample of the long time T). (The number of samples up to the sample is expressed in bits) is the output code for the current processing target subframe from the adaptive codebook, where the output code is, after all, almost the same waveform as the current processing target subframe. It is a time difference (lag: pitch period) from a past time point having a shape.

【０００５】[0005]

【発明が解決しようとする課題】ところで、適応コード
ブックが大きく寄与する有声音は定常性が高いため、ラ
グ（ピッチ周期）の変動速度は遅く相前後するサブフレ
ーム間のラグは近似したものとなる。By the way, since the voiced sound to which the adaptive codebook greatly contributes has a high stationarity, the lag (pitch period) fluctuation speed is slow and the lags between the subframes that follow each other are approximate. Become.

【０００６】しかしながら、従来では、このようなラグ
の性質を利用しておらず、サブフレーム毎に独立にラグ
を求めていた。すなわち、上述のように、サブフレーム
毎に、その処理時点での多くの部分的波形Ｓ11、Ｓ12、
…、Ｓ1xから最適なものを探索してラグを決定してい
た。そのため、ラグを得るために多くの処理及び時間が
かかっていた。However, conventionally, such a property of the lag is not used, and the lag is independently calculated for each subframe. That is, as described above, for each subframe, many partial waveforms S11, S12,
..., the optimum one was searched from S1x to determine the lag. Therefore, it took a lot of processing and time to obtain the lag.

【０００７】また、上述のように、相前後するサブフレ
ーム間のラグは近似しているという性質を利用せずに、
サブフレーム毎に得られたラグを送信しているため、伝
送情報に冗長な情報が入り込んでいるということができ
る。ＣＥＬＰ符号化方式は、低ビットレートの音声符号
化として採用されることが多く、伝送情報の各ビットを
有効に利用することが求められ、この点、ラグに対する
符号化は不十分ということができる。Further, as described above, without utilizing the property that the lags between successive subframes are similar,
Since the lag obtained for each subframe is transmitted, it can be said that redundant information is included in the transmission information. The CELP coding method is often adopted as low bit rate speech coding, and it is required to effectively use each bit of transmission information. In this respect, it can be said that coding for lag is insufficient. .

【０００８】本発明は、以上の点を考慮してなされたも
のであり、有声音に寄与する励振源情報の符号を決定す
る処理及び時間を従来より削減することができるコード
励振線形予測符号化器を提供しようとしたものである。The present invention has been made in consideration of the above points, and code excitation linear predictive coding capable of reducing the processing and time for determining the code of excitation source information that contributes to voiced sound compared to the conventional method. It was an attempt to provide a container.

【０００９】また、本発明は、有声音に寄与する励振源
情報の符号に必要なビット数を従来より削減することが
できるコード励振線形予測符号化器及び復号化器を提供
しようとしたものである。Further, the present invention is intended to provide a code excitation linear predictive encoder and a decoder which can reduce the number of bits required for encoding excitation source information contributing to voiced sound as compared with the conventional code excitation encoder. is there.

【００１０】[0010]

【課題を解決するための手段】かかる課題を解決するた
め、本発明においては、適応コードブックを備えたコー
ド励振線形予測符号化装置において、適応コードブック
を利用した励振源情報の符号化周期の整数倍の周期毎
に、入力音声信号の長周期相関値を求める長周期分析器
を備え、適応コードブックに格納されている複数の適応
励振コードベクトルから最適な適応励振コードベクトル
を探索する際に、長周期相関値に応じた適応励振コード
ベクトルを中心とした前後所定数の適応励振コードベク
トルの中から探索するようにした。In order to solve such a problem, according to the present invention, in a code excitation linear predictive coding apparatus provided with an adaptive codebook, the coding cycle of excitation source information using the adaptive codebook When searching for the optimum adaptive excitation code vector from multiple adaptive excitation code vectors stored in the adaptive codebook, it is equipped with a long-period analyzer that calculates the long-period correlation value of the input speech signal for each cycle of an integer multiple. , A predetermined number of adaptive excitation code vectors before and after the adaptive excitation code vector corresponding to the long-period correlation value are searched.

【００１１】ここで、コード励振線形予測復号化装置に
向けて伝送する符号化音声に、長周期相関値と、探索の
結果得られた最適な適応励振コードベクトルと長周期相
関値に応じた適応励振コードベクトルとの時間差情報と
を含めることが好ましい。Here, the coded speech transmitted to the code-excited linear predictive decoding apparatus is adapted according to the long-period correlation value, the optimum adaptive excitation code vector obtained as a result of the search, and the long-period correlation value. It is preferable to include time difference information from the excitation code vector.

【００１２】このような好ましい態様のコード励振線形
予測符号化装置に対応した復号化装置においては、符号
化音声に含まれている長周期相関値と時間差情報とを加
算して得た値を、適応コードブックに対するインデック
スとして適応励振コードベクトルを出力させることが好
ましい。In the decoding device corresponding to the code-excited linear predictive coding device of such a preferable mode, the value obtained by adding the long-period correlation value and the time difference information included in the coded speech is It is preferable to output the adaptive excitation code vector as an index to the adaptive codebook.

【００１３】[0013]

【作用】本発明は、有声音区間の長周期相関値（ピッチ
周期）を、適応コードブックの探索周期の整数倍の周期
で求めても、探索周期で求めてもほとんど同じ値をとる
ことに着目してなされたものである。According to the present invention, the long-period correlation value (pitch period) of the voiced sound section has almost the same value whether it is obtained by a period that is an integral multiple of the search period of the adaptive codebook or by the search period. It was made paying attention.

【００１４】そして、この性質を利用することで適応コ
ードブックの探索時間及び処理を軽減させることを意図
した。本発明のコード励振線形予測符号化装置におい
て、長周期分析器は、適応コードブックを利用した励振
源情報の符号化周期（探索周期）の整数倍の周期毎に、
入力音声信号の長周期相関値を求め、適応コードブック
についての符号を決定する構成は、適応コードブックに
格納されている複数の適応励振コードベクトルから最適
な適応励振コードベクトルを探索する際に、長周期相関
値に応じた適応励振コードベクトルを中心とした前後所
定数の適応励振コードベクトルの中から探索するように
した。It is intended to reduce the search time and processing of the adaptive codebook by utilizing this property. In the code-excited linear predictive encoding device of the present invention, the long-period analyzer uses an adaptive codebook for each period that is an integer multiple of the encoding period (search period) of the excitation source information,
The configuration for determining the long-period correlation value of the input speech signal and determining the code for the adaptive codebook has a configuration in which an optimal adaptive excitation code vector is searched from a plurality of adaptive excitation code vectors stored in the adaptive codebook, A search is made from a predetermined number of adaptive excitation code vectors before and after the adaptive excitation code vector corresponding to the long-period correlation value.

【００１５】ここで、コード励振線形予測復号化装置に
向けて伝送する符号化音声に、長周期相関値と、探索の
結果得られた最適な適応励振コードベクトルと長周期相
関値に応じた適応励振コードベクトルとの時間差情報と
を含めるようにすると、従来に比べて、圧縮効率を高め
ることができて好ましい。Here, the coded speech transmitted to the code-excited linear predictive decoding apparatus is adapted according to the long-period correlation value and the optimum adaptive excitation code vector and long-period correlation value obtained as a result of the search. It is preferable to include the time difference information with respect to the excitation code vector because the compression efficiency can be improved as compared with the conventional case.

【００１６】このような好ましい態様のコード励振線形
予測符号化装置に対応した復号化装置においては、適応
コードブックを直接アクセスできる情報が伝送されてこ
ないので、符号化音声に含まれている長周期相関値と時
間差情報とを加算して得た値を、適応コードブックに対
するインデックスとして適応励振コードベクトルをアク
セスすることになる。In the decoding device corresponding to the code-excited linear predictive coding device of such a preferable mode, since the information that can directly access the adaptive codebook is not transmitted, the long period correlation included in the coded speech is generated. The value obtained by adding the value and the time difference information is used as an index for the adaptive codebook to access the adaptive excitation code vector.

【００１７】[0017]

【実施例】以下、本発明の実施例によるコード励振線形
予測符号化装置及び復号化装置を、図面を参照しながら
この順序で詳述する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS A code-excited linear predictive coding apparatus and a decoding apparatus according to an embodiment of the present invention will be described below in this order with reference to the drawings.

【００１８】図１は、実施例のコード励振線形予測符号
化装置の構成を示すものである。FIG. 1 shows the configuration of a code-excited linear predictive coding apparatus according to the embodiment.

【００１９】図１において、アナログ／デジタル変換さ
れた入力音声信号系列は、特定のフレーム長単位で当該
符号化装置に入力される。In FIG. 1, the analog-to-digital converted input audio signal sequence is input to the encoding apparatus in a specific frame length unit.

【００２０】声道分析器１１は、入力音声信号を声道分
析（例えばＬＰＣ分析）し、分析で得た声道パラメータ
αを、長周期分析器１２、励振源評価回路１３及び多重
化器１７に与える。ここで、声道パラメータαはフレー
ム毎に得られてフレーム毎に出力される。The vocal tract analyzer 11 performs vocal tract analysis (for example, LPC analysis) on the input voice signal, and the vocal tract parameter α obtained by the analysis is used as a long period analyzer 12, an excitation source evaluation circuit 13, and a multiplexer 17. Give to. Here, the vocal tract parameter α is obtained for each frame and output for each frame.

【００２１】長周期分析器１２は、入力音声信号の長周
期の相関を計算し、入力音声信号のフレーム全体につい
てのピッチ周期情報であるラグ（以下、フレームラグと
呼ぶ）Ｌを求め、励振源評価回路１３及び多重化器１４
に与える。このようなフレームラグＬも各フレームで１
回ずつ求められる。なお、この長周期分析器１２には、
短周期パラメータである声道パラメータαも与えられて
おり、この声道パラメータαを利用して入力音声信号に
おける短周期成分の影響を除去して長周期の相関を検出
するようになされている。The long period analyzer 12 calculates a long period correlation of the input speech signal, obtains a lag (hereinafter referred to as a frame lag) L which is pitch period information for the entire frame of the input speech signal, and an excitation source. Evaluation circuit 13 and multiplexer 14
Give to. Such frame lug L is also 1 for each frame.
You will be asked each time. In addition, this long period analyzer 12
A vocal tract parameter α which is a short period parameter is also given, and the influence of the short period component in the input speech signal is removed by using this vocal tract parameter α to detect a long period correlation.

【００２２】なお、長周期分析器１２によって求められ
るフレームラグＬの表記方法や検出精度については後述
する。The notation method and detection accuracy of the frame lag L obtained by the long period analyzer 12 will be described later.

【００２３】適応コードブック１４は、有声音区間に大
きく寄与する複数の適応励振コードベクトルを格納して
いるものであり、例えば、シフトレジスタやテーブル等
によって構成されている。後述するように、最適な適応
励振コードベクトルは、励振源評価回路１３による制御
下で１フレーム期間をいくつかに等分したサブフレーム
毎に求められる。The adaptive code book 14 stores a plurality of adaptive excitation code vectors that greatly contribute to the voiced sound section, and is composed of, for example, a shift register and a table. As will be described later, the optimum adaptive excitation code vector is obtained for each subframe obtained by equally dividing one frame period into several under the control of the excitation source evaluation circuit 13.

【００２４】統計コードブック１５は、無声音区間に大
きく寄与する複数の統計励振コードベクトルを格納して
いるものである。後述するように、最適な統計励振コー
ドベクトルも、励振源評価回路１３による制御下でサブ
フレーム毎に求められる。The statistical code book 15 stores a plurality of statistical excitation code vectors that greatly contribute to the unvoiced section. As will be described later, the optimum statistical excitation code vector is also obtained for each subframe under the control of the excitation source evaluation circuit 13.

【００２５】ゲインコードブック１６は、適応励振コー
ドベクトルと統計励振コードベクトルのそれぞれに対す
る複数のゲイン情報を格納している。後述するように、
最適な適応励振コードベクトル及び統計励振コードベク
トルに対する最適なゲイン（β，γ）も、励振源評価回
路１３による制御下でサブフレーム毎に求められる。The gain code book 16 stores a plurality of gain information for each of the adaptive excitation code vector and the statistical excitation code vector. As described below,
The optimum gains (β, γ) for the optimum adaptive excitation code vector and the statistical excitation code vector are also obtained for each subframe under the control of the excitation source evaluation circuit 13.

【００２６】励振源評価回路１３は、フレームラグＬを
除いた励振源情報ΔＬ、Ｉ、（β，γ）をサブフレーム
毎に決定させるものである。励振源評価回路１３は、適
応コードブック１４に係る励振源情報（以下、サブフレ
ーム補正ラグと呼ぶ）ΔＬ、統計コードブック１５に係
る励振源情報（以下、統計コードインデックスと呼ぶ）
Ｉ、ゲインコードブック１６に係る励振源情報（以下、
最適度ゲインと呼ぶ）（β，γ）の順に決定動作を行な
うものである。The excitation source evaluation circuit 13 determines the excitation source information ΔL, I, (β, γ) excluding the frame lag L for each subframe. The excitation source evaluation circuit 13 has excitation source information (hereinafter, referred to as subframe correction lag) ΔL related to the adaptive codebook 14 and excitation source information (hereinafter, referred to as statistical code index) related to the statistical codebook 15.
I, excitation source information related to gain codebook 16 (hereinafter,
The determination operation is performed in the order of (optimity gain) (β, γ).

【００２７】励振源評価回路１３は、励振コードベクト
ル及び声道パラメータαから局部再生の音声を得る構成
や、その局部再生の音声と入力音声との類似度を求める
構成を備えており（例えば、特願平３−３２７４４３号
明細書及び図面等を参照）、類似度が最も大きくなるよ
うな情報を、適応コードブック１４、統計コードブック
１５、ゲインコードブック１６等について探索する。The excitation source evaluation circuit 13 has a structure for obtaining a locally reproduced voice from the excitation code vector and the vocal tract parameter α and a structure for obtaining the similarity between the locally reproduced voice and the input voice (for example, (See Japanese Patent Application No. 3-327443, drawings, etc.), and information that maximizes the degree of similarity is searched for in the adaptive codebook 14, the statistical codebook 15, the gain codebook 16, and the like.

【００２８】多重化器１７は、フレームに１回与えられ
る声道パラメータα及びフレームラグＬ、サブフレーム
に１回与えられるサブフレーム補正ラグΔＬ、統計コー
ドインデックスＩ及び最適度ゲイン（β，γ）を適宜量
子化しながら多重化して符号化音声信号を得る。The multiplexer 17 has a vocal tract parameter α and a frame lag L given once to a frame, a subframe correction lag ΔL given once to a subframe, a statistical code index I, and an optimality gain (β, γ). Are appropriately quantized and multiplexed to obtain an encoded voice signal.

【００２９】以上の構成を有する実施例のコード励振線
形予測符号化装置は、以下のように動作する。The code-excited linear predictive coding apparatus of the embodiment having the above configuration operates as follows.

【００３０】１フレーム分の入力音声信号が与えられる
と、音声分析器１１及び長周期分析器１２はそれぞれ、
分析処理して声道パラメータα及びフレームラグＬを求
めて出力する。When an input voice signal for one frame is given, the voice analyzer 11 and the long period analyzer 12 respectively,
Analysis processing is performed to obtain and output the vocal tract parameter α and the frame lag L.

【００３１】励振源評価回路１３は、サブフレーム毎に
励振源情報を決定する。まず、励振源評価回路１３は、
統計コードブック１５及びゲインコードブック１６から
の出力を停止させた状態で（又はデフォルト値を出力さ
せた状態で）、適応コードブック１４から最適な適応励
振コードベクトルの候補としての複数の適応励振コード
ベクトルを、例えば時間順次に又は並列に出力させ、各
候補の適応励振コードベクトルを利用してそれぞれ局部
再生の音声を得て評価し、入力音声信号に対する類似度
が最も大きい局部再生音声を合成させた適応励振コード
ベクトルを最適な適応励振コードベクトルとする。The excitation source evaluation circuit 13 determines the excitation source information for each subframe. First, the excitation source evaluation circuit 13
With the outputs from the statistical codebook 15 and the gain codebook 16 being stopped (or with default values being output), a plurality of adaptive excitation codes from the adaptive codebook 14 as candidates for the optimal adaptive excitation code vector Vectors are output, for example, in time sequence or in parallel, and the locally reproduced speech is obtained and evaluated by using the adaptive excitation code vector of each candidate, and the locally reproduced speech with the highest similarity to the input speech signal is synthesized. The adaptive excitation code vector is set as the optimum adaptive excitation code vector.

【００３２】ここで、励振源評価回路１３は、今回のサ
ブフレームに対してフレームラグＬだけずれた時間位置
の適応励振コードベクトルを中心とした前後所定数の適
応励振コードベクトルを候補とする。すなわち、従来と
は異なって、適応コードブック１４に格納されている全
ての適応励振コードベクトルを候補とすることはない。
なお、候補ベクトルの決定方法については後述する。Here, the excitation source evaluation circuit 13 selects a predetermined number of adaptive excitation code vectors before and after the adaptive excitation code vector centered on the adaptive excitation code vector at the time position displaced by the frame lag L with respect to the current subframe. That is, unlike the conventional method, all the adaptive excitation code vectors stored in the adaptive codebook 14 are not selected as candidates.
The method of determining the candidate vector will be described later.

【００３３】励振源評価回路１３は、最適な適応励振コ
ードベクトルを決定すると、今回のサブフレームに対し
てフレームラグＬだけずれた時間位置の適応励振コード
ベクトルと、最適な適応励振コードベクトルとの時間差
情報を得てサブフレーム補正ラグΔＬとして多重化器１
７に出力する。When the excitation source evaluation circuit 13 determines the optimum adaptive excitation code vector, the excitation source evaluation circuit 13 calculates the optimum adaptive excitation code vector at the time position deviated by the frame lag L from the current subframe and the optimum adaptive excitation code vector. The time difference information is obtained and the subframe correction lag ΔL is used as the multiplexer 1
Output to 7.

【００３４】次に、励振源評価回路１３は、ゲインコー
ドブック１６からの出力を停止させた状態で（又はデフ
ォルト値を出力させた状態で）、統計コードブック１５
から最適な統計励振コードベクトルの候補として全ての
統計励振コードベクトルを、例えば時間順次に又は並列
に出力させ、各候補の統計励振コードベクトルと最適な
適応励振コードベクトルとを加算し、加算により得られ
た各励振コードベクトルを利用してそれぞれ局部再生の
音声を得て評価し、入力音声信号に対する類似度が最も
大きい局部再生音声を合成させた統計励振コードベクト
ルを最適な統計励振コードベクトルとする。そして、最
適な統計励振コードベクトルに対するインデックス（統
計コードインデックス）Ｉを多重化器１７に出力する。Next, the excitation source evaluation circuit 13 keeps the output from the gain codebook 16 stopped (or outputs the default value), and the statistical codebook 15
To output all statistical excitation code vectors as optimal statistical excitation code vector candidates, for example, in time sequence or in parallel, add the statistical excitation code vector of each candidate and the optimal adaptive excitation code vector, and obtain by addition. Each locally reproduced speech is obtained and evaluated by using each excitation code vector obtained, and the statistically excited code vector obtained by synthesizing the locally reproduced speech having the highest similarity to the input speech signal is set as the optimal statistically excited code vector. . Then, the index (statistical code index) I for the optimum statistical excitation code vector is output to the multiplexer 17.

【００３５】続いて、励振源評価回路１３は、ゲインコ
ードブック１６から最適度ゲインの候補として全てのゲ
イン情報を、例えば時間順次に又は並列に出力させ、各
候補のゲイン情報によって最適な統計励振コードベクト
ルと最適な適応励振コードベクトルとを重付け加算し、
加算により得られた各励振コードベクトルを利用してそ
れぞれ局部再生の音声を得て評価し、入力音声信号に対
する類似度が最も大きい局部再生音声を合成させたゲイ
ン情報を最適度ゲインに決定する。この実施例の場合に
は、最適度ゲイン（β，γ）そのものを多重化器１７に
与える。Then, the excitation source evaluation circuit 13 outputs all gain information from the gain codebook 16 as candidates for the optimality gain, for example, in time sequence or in parallel, and the optimal statistical excitation is performed according to the gain information of each candidate. The code vector and the optimal adaptive excitation code vector are weighted and added,
Using each excitation code vector obtained by the addition, a locally reproduced voice is obtained and evaluated, and the gain information obtained by synthesizing the locally reproduced voice having the highest similarity to the input voice signal is determined as the optimum gain. In the case of this embodiment, the optimum gain (β, γ) itself is given to the multiplexer 17.

【００３６】以上のようにして、今回のサブフレームに
ついての最適な適応励振コードベクトル、統計励振コー
ドベクトル及びゲインを得ると、これら情報から、今回
のサブフレームについての最終的な励振コードベクトル
（声道パラメータαと合成処理される励振コードベクト
ル）を得て適応コードブック１４に与える。適応コード
ブック１４は、この励振コードベクトルを最新のものと
して取込むと共に、最も古い方の適応励振コードベクト
ルを破棄させる。As described above, when the optimum adaptive excitation code vector, statistical excitation code vector, and gain for this subframe are obtained, the final excitation code vector (voice A road parameter α and an excitation code vector to be combined are obtained and given to the adaptive codebook 14. The adaptive codebook 14 takes in this excitation code vector as the latest one and discards the oldest adaptive excitation code vector.

【００３７】このような適応処理が終了すると、次のサ
ブフレームの処理に進む。Upon completion of such an adaptive process, the process proceeds to the next subframe.

【００３８】そして、多重化器１７によって、フレーム
に１回与えられる声道パラメータα及びフレームラグ
Ｌ、サブフレームに１回与えられるサブフレーム補正ラ
グΔＬ、統計コードインデックスＩ及び最適度ゲイン
（β，γ）から符号化音声信号が組み立てられて通信路
に出力される。Then, by the multiplexer 17, the vocal tract parameter α and the frame lag L given once to the frame, the subframe correction lag ΔL given once to the subframe, the statistical code index I and the optimum gain (β, The encoded voice signal is assembled from γ) and output to the communication path.

【００３９】図３は、長周期分析器１２によって求めら
れるフレームラグＬの表記方法（検出精度）についての
説明図である。図３は、フレームラグＬの表記方法や検
出精度を３種類示しており、実際上、いずれを採用する
こともできる。FIG. 3 is an explanatory diagram of a notation method (detection accuracy) of the frame lag L obtained by the long period analyzer 12. FIG. 3 shows three types of notation method and detection accuracy of the frame lag L, and any of them can be practically adopted.

【００４０】音声のピッチ周波数が５０Ｈｚから４００
Ｈｚ程度なので、例えば、入力音声信号が８０００Ｈｚ
でアナログ／デジタル変換されている場合には、フレー
ムラグＬの値としては、２０（＝８０００÷４００）サ
ンプリング周期から１５０（８０００÷５０に近い値）
サンプリング周期を取り得る値の範囲とすることが適当
である。この実施例の場合、フレームラグＬも当然にデ
ジタル値として処理されるのでビット表現される。The pitch frequency of voice is 50 Hz to 400
Since it is about Hz, for example, the input audio signal is 8000 Hz
When the analog / digital conversion is performed with, the value of the frame lag L is 150 (value close to 8000/50) from 20 (= 8000 ÷ 400) sampling period.
It is appropriate to set the range of values that can take the sampling period. In the case of this embodiment, since the frame lag L is also processed as a digital value, it is expressed in bits.

【００４１】図３（ａ）は、フレームラグＬの第１の表
記方法を示している。ビット表現を考慮して、フレーム
ラグＬの値の範囲を２０サンプリング周期から１４７サ
ンプリング周期とし、１サンプリング周期ずつ異なる全
ての値（１２８個の値）を取るものとする。従って、こ
の場合にはフレームラグＬは７ビットで表現される。FIG. 3A shows the first notation method of the frame lag L. Considering bit representation, the range of the value of the frame lag L is set to 20 sampling periods to 147 sampling periods, and all values (128 values) that differ by one sampling period are assumed. Therefore, in this case, the frame lag L is represented by 7 bits.

【００４２】図３（ｂ）は、フレームラグＬの第２の表
記方法を示している。ビット表現を考慮して、フレーム
ラグＬの値の範囲を２０サンプリング周期から１４６サ
ンプリング周期とし、２サンプリング周期ずつ異なる全
ての値（６４個の値）を取るものとする。従って、この
場合にはフレームラグＬは６ビットで表現される。フレ
ームラグＬは、適応コードブック１４における最適な適
応コードベクトルが存在する時間位置の目安を与えるも
のであるので、このように２サンプリング周期ずつ異な
るように表記しても（精度を落としても）問題となるこ
とはない。FIG. 3B shows the second notation method of the frame lag L. In consideration of the bit representation, the range of the value of the frame lag L is set from 20 sampling periods to 146 sampling periods, and all values (64 values) that differ by 2 sampling periods are assumed. Therefore, in this case, the frame lag L is represented by 6 bits. Since the frame lag L gives an indication of the time position where the optimum adaptive code vector in the adaptive codebook 14 exists, even if it is expressed differently by 2 sampling cycles in this way (even if the accuracy is lowered). There is no problem.

【００４３】図３（ｃ）は、フレームラグＬの第３の表
記方法を示している。この方法は、フレームラグＬの最
小値を２０サンプリング周期とし、高いピッチ周波数に
対応する部分では１サンプリング周期ずつの間隔で隣合
うフレームラグＬの値を定め、ピッチ周波数が低い値に
対応する部分になるに従い広い間隔で隣合うフレームラ
グＬの値を定めるようにしたものである。このようにし
た場合には、少ないビット数でフレームラグＬを表現す
る場合に効率良く表現することができる。FIG. 3C shows the third notation method of the frame lag L. In this method, the minimum value of the frame lag L is set to 20 sampling periods, and in the portion corresponding to the high pitch frequency, the values of the adjacent frame lags L are determined at intervals of 1 sampling period, and the portion corresponding to the low pitch frequency is determined. The value of the adjacent frame lag L is determined at a wider interval as In this case, the frame lag L can be efficiently expressed when the number of bits is small.

【００４４】図４は、最適な適応励振コードベクトルの
探索範囲及びサブフレーム補正ラグΔＬの表現方法を説
明するものであり、４種類の方法を示している。FIG. 4 illustrates the optimum adaptive excitation code vector search range and the expression method of the subframe correction lag ΔL, and shows four kinds of methods.

【００４５】図４（ａ）は、第１の方法の説明図であ
り、あフレームラグＬに対応した適応励振コードベクト
ルの先頭サンプルから８サンプリング周期だけ前のサン
プルを先頭サンプルとする適応励振コードベクトルか
ら、フレームラグＬに対応した適応励振コードベクトル
の先頭サンプルから７サンプリング周期だけ後のサンプ
ルを先頭サンプルとする適応励振コードベクトルまでの
計１６個の適応励振コードベクトルを候補ベクトルとす
ることを示している。従って、サブフレーム補正ラグΔ
Ｌを４ビットで表現することができる。FIG. 4 (a) is an explanatory diagram of the first method, in which the adaptive excitation code corresponding to the frame lag L has a sample eight sampling cycles before the leading sample of the adaptive excitation code vector as the leading sample. A total of 16 adaptive excitation code vectors from the vector to the adaptive excitation code vector whose sample is seven sampling cycles after the initial sample of the adaptive excitation code vector corresponding to the frame lag L are set as candidate vectors. Shows. Therefore, the subframe correction lag Δ
L can be represented by 4 bits.

【００４６】図４（ｂ）は、第２の方法の説明図であ
り、フレームラグＬに対応した適応励振コードベクトル
の先頭サンプルから８サンプリング周期だけ前のサンプ
ルを先頭サンプルとする適応励振コードベクトルから、
フレームラグＬに対応した適応励振コードベクトルの先
頭サンプルから７．５サンプリング周期だけ後のサンプ
ルを先頭サンプルとする適応励振コードベクトルまで
の、適応励振コードベクトルであって相前後する適応励
振コードベクトルの時間差が０．５サンプリング周期で
ある計３２個の適応励振コードベクトルを候補ベクトル
とすることを示している。従って、サブフレーム補正ラ
グΔＬを５ビットで表現することができる。これは、ア
ップサンプリングフィルターを適応コードブック１４に
用いた場合の方法である。FIG. 4 (b) is an explanatory diagram of the second method, in which the adaptive excitation code vector corresponding to the frame lag L has its leading sample a sample eight sampling cycles before the leading sample. From
Of the adaptive excitation code vector which is the adaptive excitation code vector and is succeeding and succeeding to the adaptive excitation code vector whose leading sample is the sample after 7.5 sampling periods from the leading sample of the adaptive excitation code vector corresponding to the frame lag L. This shows that a total of 32 adaptive excitation code vectors with a time difference of 0.5 sampling period are used as candidate vectors. Therefore, the subframe correction lag ΔL can be represented by 5 bits. This is the method when the upsampling filter is used in the adaptive codebook 14.

【００４７】図４（ｃ）は第３の方法の説明図である。
この方法も、アップサンプリングフィルターを適応コー
ドブック１４に用いることを前提としており、フレーム
ラグＬに対応した適応励振コードベクトルの近傍では、
１サンプリング周期より小さい時間ずつ先頭サンプルの
位置をずらして候補とする適応励振コードベクトルを考
え、フレームラグＬに対応した適応励振コードベクトル
から遠くなるに従い、隣合う候補の適応励振コードベク
トル間の時間差を大きくするようにしたものである。従
って、サブフレーム補正ラグΔＬを効率良く表現するこ
とができる。FIG. 4C is an explanatory diagram of the third method.
This method is also premised on using an upsampling filter in the adaptive codebook 14, and in the vicinity of the adaptive excitation code vector corresponding to the frame lag L,
Consider the adaptive excitation code vector as a candidate by shifting the position of the first sample by a time smaller than one sampling period, and as the distance from the adaptive excitation code vector corresponding to the frame lag L increases, the time difference between the adaptive excitation code vectors of the adjacent candidates. Is designed to be large. Therefore, the subframe correction lag ΔL can be efficiently expressed.

【００４８】図４（ｄ）は第４の方法の説明図である。
この方法は、フレームラグＬが小さい範囲（ピッチ周波
数が高い範囲）では、隣合う候補の適応励振コードベク
トル間の時間差を小さくし、フレームラグＬが大きい範
囲（ピッチ周波数が低い範囲）では、隣合う候補の適応
励振コードベクトル間の時間差を大きく設定したもので
ある。従って、サブフレーム補正ラグΔＬを所定ビット
数で表現しても、フレームラグＬの値によって１ビット
が意味する値が異なったものとなっている。FIG. 4D is an explanatory view of the fourth method.
This method reduces the time difference between the adaptive excitation code vectors of the adjacent candidates in the range where the frame lag L is small (range where the pitch frequency is high), and reduces the time difference between adjacent adaptive excitation code vectors in the range where the frame lag L is large (range where the pitch frequency is low). The time difference between the adaptive excitation code vectors of the matching candidates is set to be large. Therefore, even if the sub-frame correction lag ΔL is expressed by a predetermined number of bits, the value of 1 bit varies depending on the value of the frame lag L.

【００４９】以上のように、この実施例においては、フ
レームラグＬを求めて適応コードブック１４の探索範囲
を設定している。これは、以下の理由による。有声音区
間におけるピッチ周期の変動は遅く、フレームについて
ピッチ周期（フレームラグＬ）を求めても、サブフレー
ムについてピッチ周期（フレームラグＬ＋サブフレーム
補正ラグΔＬ）を求めてもその値は似たものとなってい
る。従って、サブフレームのピッチ周期の予測値として
フレームについてのピッチ周期（フレームラグＬ）を用
い、その前後でサブフレームについてのピッチ周期を探
索しても正確なサブフレームについてのピッチ周期を得
ることができ、これにより探索時間及び探索処理量を軽
減することができる。As described above, in this embodiment, the search range of the adaptive codebook 14 is set by obtaining the frame lag L. This is for the following reason. The fluctuation of the pitch cycle in the voiced sound section is slow, and even if the pitch cycle (frame lag L) is calculated for a frame or the pitch cycle (frame lag L + subframe correction lag ΔL) is calculated for a subframe, the values are similar. Has become. Therefore, even if the pitch period for a frame (frame lag L) is used as the predicted value of the pitch period for a subframe and the pitch period for the subframe is searched before and after that, the accurate pitch period for the subframe can be obtained. This makes it possible to reduce the search time and the search processing amount.

【００５０】また、この実施例において、符号化音声信
号にフレームラグＬとサブフレーム補正ラグΔＬとを含
めて伝送するようにしたのは、以下の理由による。従来
では、サブフレーム毎にピッチ周期（フレームラグＬ＋
サブフレーム補正ラグΔＬに相当する）を伝送してい
た。仮に、１フレームが４サブフレームとし、ピッチ周
期情報を７ビットとすると、１フレーム当り従来では２
８ビット伝送することになる。一方、サブフレーム補正
ラグΔＬを４ビットとすると、この実施例の場合、１フ
レームについて、フレームについてのピッチ情報（フレ
ームラグＬ）を１回、サブフレームについてのピッチ情
報（サブフレーム補正ラグΔＬ）を４回伝送するので、
１フレームに伝送するピッチ情報のビット数は、７＋４
×４＝２３ビットとなる。すなわち、従来より圧縮効率
を高めることができる。In this embodiment, the reason why the encoded voice signal is transmitted by including the frame lag L and the subframe correction lag ΔL is as follows. Conventionally, the pitch period (frame lag L +
(Corresponding to the subframe correction lag ΔL) was transmitted. If one frame is 4 subframes and the pitch period information is 7 bits, it is conventionally 2 per frame.
8 bits will be transmitted. On the other hand, assuming that the subframe correction lag ΔL is 4 bits, in this embodiment, the pitch information (frame lag L) for the frame is once for one frame, and the pitch information (subframe correction lag ΔL) for the subframe is once. Is transmitted four times,
The number of bits of pitch information transmitted in one frame is 7 + 4
× 4 = 23 bits. That is, the compression efficiency can be improved as compared with the conventional case.

【００５１】次に、実施例のコード励振線形予測復号化
装置の構成及び動作を、図５を参照しながら詳述する。Next, the configuration and operation of the code-excited linear predictive decoding apparatus according to the embodiment will be described in detail with reference to FIG.

【００５２】コード励振線形予測符号化装置（図１）か
ら送信されてきた符号化音声信号は、多重分離器２０に
与えられ、声道パラメータα、フレームラグＬ、サブフ
レーム補正ラグΔＬ、統計コードインデックスＩ、最適
度ゲイン（β，γ）に分離される。分離された声道パラ
メータαは合成フィルタ２６に与えられ、分離されたフ
レームラグＬ及びサブフレーム補正ラグΔＬは多重分離
器２０内で加算されて適応コードブック２１に与えら
れ、分離された統計コードインデックスＩは統計コード
ブック２２に与えられ、分離された最適度ゲイン（β，
γ）の内適応励振コードベクトル用のゲインβは乗算器
２３に与えられ、他方の統計励振コードベクトル用のゲ
インγはの乗算器２５に与えられる。The coded speech signal transmitted from the code-excited linear predictive coding apparatus (FIG. 1) is given to the demultiplexer 20, and vocal tract parameter α, frame lag L, subframe correction lag ΔL, and statistical code. The index I and the optimum gain (β, γ) are separated. The separated vocal tract parameter α is given to the synthesis filter 26, and the separated frame lag L and subframe correction lag ΔL are added in the demultiplexer 20 and given to the adaptive codebook 21 to obtain the separated statistical code. The index I is given to the statistical codebook 22 and separated into the optimality gains (β,
The gain β for the adaptive excitation code vector of γ) is given to the multiplier 23, and the gain γ for the other statistical excitation code vector is given to the multiplier 25 of.

【００５３】かくして、適応コードブック２１からその
サブフレームについて最適な適応励振コードベクトルが
出力され、これが乗算器２３によってβ倍されて加算器
２４に入力され、また、統計コードブック２２からその
サブフレームについて最適な統計励振コードベクトルが
出力され、これが乗算器２５によってγ倍されて加算器
２４に入力される。これにより、加算器２４において、
最終的にこのサブフレームについての励振コードベクト
ルが得られ、合成フィルタ２６がこの励振コードベクト
ルに対して声道パラメータαを適用して合成処理し、入
力音声信号に対応した再生音声信号を得る。加算器２４
から出力された励振コードベクトルは適応コードブック
２１にフィードバックされ、適応コードブック２１内の
適応コードベクトルが適応的に変更される。Thus, the optimum adaptive excitation code vector for the subframe is output from the adaptive codebook 21, multiplied by β by the multiplier 23 and input to the adder 24, and the statistical codebook 22 outputs the subframe. The optimum statistical excitation code vector for is output, which is multiplied by γ by the multiplier 25 and input to the adder 24. Thereby, in the adder 24,
Finally, the excitation code vector for this subframe is obtained, and the synthesis filter 26 applies the vocal tract parameter α to this excitation code vector for synthesis processing to obtain a reproduced speech signal corresponding to the input speech signal. Adder 24
The excitation code vector output from is fed back to the adaptive code book 21, and the adaptive code vector in the adaptive code book 21 is adaptively changed.

【００５４】従って、上記実施例によれば、フレーム毎
に求めたフレームラグＬによって適応コードブック１４
の探索範囲を変更すると共に、符号化音声信号にフレー
ムラグＬ及びサブフレーム補正ラグΔＬを含めて伝送
し、復号化装置においてフレームラグＬ及びサブフレー
ム補正ラグΔＬからサブフレームラグ（サブフレームに
ついてのピッチ周期）Ｌ＋ΔＬを得て適応コードブック
２１をアクセスするようにしたので、有声音に寄与する
励振源情報の符号を決定する処理及び時間を従来より削
減することができ、また、有声音に寄与する励振源情報
の符号に必要なビット数を従来より削減することができ
る。Therefore, according to the above-described embodiment, the adaptive codebook 14 is calculated according to the frame lag L obtained for each frame.
Of the subframe lag L and the subframe correction lag ΔL are transmitted in the encoded audio signal while changing the search range of the subframe lag L and the subframe correction lag ΔL in the decoding device. Since the pitch code) L + ΔL is obtained and the adaptive codebook 21 is accessed, it is possible to reduce the processing and time for determining the sign of the excitation source information that contributes to the voiced sound, and to contribute to the voiced sound. It is possible to reduce the number of bits required for the code of the excitation source information to be reduced as compared with the related art.

【００５５】本発明は、適応コードブックを有するコー
ド励振線形予測符号化装置及び復号化装置であれば適用
することができ、フォワード型のものに限定されるもの
ではない。The present invention can be applied to any code excitation linear predictive coding device and decoding device having an adaptive codebook, and is not limited to the forward type.

【００５６】上記実施例においては、長周期相関値を声
道パラメータの分析周期であるフレーム毎に求めるもの
を示したが、本発明はこれに限定されるものではなく、
声道パラメータの分析周期と長周期相関値を求める周期
とが異なっていても良い。但し、長周期相関値を求める
周期が、適応コードブックの探索周期の整数倍であるこ
とは必要である。In the above embodiment, the long period correlation value is obtained for each frame which is the vocal tract parameter analysis period, but the present invention is not limited to this.
The analysis cycle of the vocal tract parameter and the cycle of obtaining the long-period correlation value may be different. However, it is necessary that the cycle for obtaining the long cycle correlation value is an integral multiple of the search cycle of the adaptive codebook.

【００５７】また、上記実施例においては、フレームラ
グＬとサブフレーム補正ラグΔＬを符号化音声信号に含
めて伝送するものを示したが、フレームラグＬとサブフ
レーム補正ラグΔＬとを加算したサブフレームラグＬ＋
ΔＬを符号化音声信号に含めて伝送するようにしても良
い。この場合には、コード励振線形予測復号化装置の構
成は従来と同様になる。このようにした場合、励振源情
報の圧縮効率を従来より高めるという効果を得ることが
できないが、適応コードブック１４の探索時間及び処理
を軽減できるという効果は得ることができる。In the above embodiment, the frame lag L and the subframe correction lag ΔL are included in the encoded audio signal for transmission, but the subframe obtained by adding the frame lag L and the subframe correction lag ΔL is shown. Frame lug L +
ΔL may be included in the encoded audio signal and transmitted. In this case, the configuration of the code excitation linear predictive decoding device is the same as the conventional one. In this case, the effect of increasing the compression efficiency of the excitation source information cannot be obtained, but the effect of reducing the search time and processing of the adaptive codebook 14 can be obtained.

【００５８】[0058]

【発明の効果】以上のように、請求項１の本発明によれ
ば、適応コードブックを利用した励振源情報の符号化周
期の整数倍の周期毎に、入力音声信号の長周期相関値を
求める長周期分析器を備え、適応コードブックに格納さ
れている複数の適応励振コードベクトルから最適な適応
励振コードベクトルを探索する際に、長周期相関値に応
じた適応励振コードベクトルを中心とした前後所定数の
適応励振コードベクトルの中から探索するようにしたの
で、適応コードブックに対する探索時間及び処理を軽減
することができるコード励振線形予測符号化装置を実現
できる。As described above, according to the first aspect of the present invention, the long-period correlation value of the input speech signal is calculated for each cycle that is an integral multiple of the coding cycle of the excitation source information using the adaptive codebook. Equipped with a long-period analyzer to find the optimum adaptive excitation code vector from multiple adaptive excitation code vectors stored in the adaptive codebook, the adaptive excitation code vector corresponding to the long-period correlation value is used as the center. Since a search is performed from a predetermined number of adaptive excitation code vectors before and after, a code excitation linear predictive coding device that can reduce the search time and processing for the adaptive codebook can be realized.

【００５９】請求項２の本発明によれば、さらに、コー
ド励振線形予測復号化装置に向けて伝送する符号化音声
に、長周期相関値と、探索の結果得られた最適な適応励
振コードベクトルと上記長周期相関値に応じた適応励振
コードベクトルとの時間差情報とを含めるようにしたの
で、圧縮効率を高めることができるという効果をも得る
ことができるコード励振線形予測符号化装置を実現でき
る。According to the second aspect of the present invention, the long-period correlation value and the optimum adaptive excitation code vector obtained as a result of the search are added to the coded speech transmitted to the code-excited linear predictive decoding apparatus. Since the time difference information with respect to the adaptive excitation code vector corresponding to the long-period correlation value is included, it is possible to realize a code excitation linear predictive encoding device that can also obtain the effect of improving compression efficiency. .

【００６０】請求項３の本発明によれば、請求項２の本
発明のコード励振線形予測符号化装置から送信されてき
た符号化音声を復号処理するコード励振線形予測復号化
装置において、符号化音声に含まれている長周期相関値
と時間差情報とを加算して得た値を、適応コードブック
に対するインデックスとして適応励振コードベクトルを
出力させるようにしたので、請求項２の本発明による効
果を実効あるものとすることができる。According to the present invention of claim 3, in the code-excited linear predictive decoding apparatus for decoding the coded speech transmitted from the code-excited linear predictive coding apparatus of the present invention of claim 2, the coding is performed. Since the value obtained by adding the long-period correlation value and the time difference information included in the voice is used as the index for the adaptive codebook to output the adaptive excitation code vector, the effect of the present invention according to claim 2 is obtained. It can be effective.

【図面の簡単な説明】[Brief description of drawings]

【図１】実施例のコード励振線形予測符号化装置の構成
を示すブロック図である。FIG. 1 is a block diagram showing the configuration of a code-excited linear predictive coding apparatus according to an embodiment.

【図２】適応コードブックの概念の説明図である。FIG. 2 is an explanatory diagram of a concept of an adaptive codebook.

【図３】実施例のフレームラグの表記方法（検出精度）
の説明図である。FIG. 3 is a diagram showing a frame lag notation method (detection accuracy) of the embodiment.
FIG.

【図４】実施例の適応コードブックの探索範囲及びサブ
フレーム補正ラグの説明図である。FIG. 4 is an explanatory diagram of a search range and a subframe correction lag of the adaptive codebook according to the embodiment.

【図５】実施例のコード励振線形予測復号化装置の構成
を示すブロック図である。FIG. 5 is a block diagram showing a configuration of a code-excited linear predictive decoding apparatus according to an embodiment.

【符号の説明】[Explanation of symbols]

１２…長周期分析器、１３…励振源評価回路、１４…適
応コードブック、２０…多重分離器、Ｌ…フレームラグ
（長周期相関値）、ΔＬ…サブフレーム補正ラグ（最適
な適応励振コードベクトルと長周期相関値に応じた適応
励振コードベクトルとの時間差情報）。12 ... Long period analyzer, 13 ... Excitation source evaluation circuit, 14 ... Adaptive codebook, 20 ... Demultiplexer, L ... Frame lag (long period correlation value), .DELTA.L ... Subframe correction lag (optimal adaptive excitation code vector) And the time difference information between the adaptive excitation code vector according to the long period correlation value).

フロントページの続き (72)発明者伊東克俊東京都港区虎ノ門１丁目７番12号沖電気工業株式会社内Front page continued (72) Inventor Katsutoshi Ito 1-7-12 Toranomon, Minato-ku, Tokyo Oki Electric Industry Co., Ltd.

Claims

【特許請求の範囲】[Claims]

【請求項１】適応コードブックを備えたコード励振線
形予測符号化装置において、上記適応コードブックを利用した励振源情報の符号化周
期の整数倍の周期毎に、入力音声信号の長周期相関値を
求める長周期分析器を備え、上記適応コードブックに格納されている複数の適応励振
コードベクトルから最適な適応励振コードベクトルを探
索する際に、上記長周期相関値に応じた適応励振コード
ベクトルを中心とした前後所定数の適応励振コードベク
トルの中から探索するようにしたことを特徴としたコー
ド励振線形予測符号化装置。1. A code excitation linear predictive coding apparatus including an adaptive codebook, wherein a long-period correlation value of an input speech signal is obtained every integer multiple of a coding period of excitation source information using the adaptive codebook. When searching for the optimum adaptive excitation code vector from the plurality of adaptive excitation code vectors stored in the adaptive codebook, the adaptive excitation code vector corresponding to the long-period correlation value is calculated. A code excitation linear predictive coding device characterized in that a predetermined number of adaptive excitation code vectors around the center are searched for.

【請求項２】コード励振線形予測復号化装置に向けて
伝送する符号化音声に、上記長周期相関値と、探索の結果得られた最適な適応励振コードベクトルと上
記長周期相関値に応じた適応励振コードベクトルとの時
間差情報とを含めたことを特徴とした請求項１に記載の
コード励振線形予測符号化装置。2. The coded speech to be transmitted to a code-excited linear predictive decoding device, wherein the long-period correlation value, the optimum adaptive excitation code vector obtained as a result of the search, and the long-period correlation value are used. The code-excited linear predictive coding apparatus according to claim 1, wherein the code-excited linear predictive coding apparatus includes time difference information with respect to the adaptive excitation code vector.

【請求項３】請求項２に記載のコード励振線形予測符
号化装置から送信されてきた符号化音声を復号処理する
コード励振線形予測復号化装置において、符号化音声に含まれている上記長周期相関値と上記時間
差情報とを加算して得た値を、適応コードブックに対す
るインデックスとして適応励振コードベクトルを出力さ
せることを特徴としたコード励振線形予測復号化装置。3. A code-excited linear predictive decoding apparatus that decodes coded speech transmitted from the code-excited linear predictive coding apparatus according to claim 2, wherein the long period included in the coded speech is included. A code excitation linear predictive decoding device characterized in that an adaptive excitation code vector is output using a value obtained by adding a correlation value and the time difference information as an index for an adaptive codebook.