JP3338074B2

JP3338074B2 - Audio transmission method

Info

Publication number: JP3338074B2
Application number: JP32336891A
Authority: JP
Inventors: 恭士大田; 智彦谷口; 良紀田中; 秀明栗原; 雅子加藤
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1991-12-06
Filing date: 1991-12-06
Publication date: 2002-10-28
Anticipated expiration: 2017-10-28
Also published as: JPH05158497A

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は音声信号の情報圧縮を行
うための音声伝送方式に関し、特に４〜１６Kb/sの伝送
速度で符号化を行うためのＡ−ｂ−Ｓ（Analysis-by-Sy
nthesis:合成による分析）型ベクトル量子化を用いた音
声伝送方式に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an audio transmission system for compressing information of an audio signal, and more particularly, to an Abs (Analysis-by-Abs) for encoding at a transmission speed of 4 to 16 Kb / s. Sy
(nthesis: analysis by synthesis) This relates to an audio transmission system using vector quantization.

【０００２】Ａ−ｂ−Ｓ型ベクトル量子化を用いた音声
伝送方式による音声符号器、例えばＣＥＬＰ（Code Exc
ited Linear Production）符号器は、企業内通信システ
ム、ディジタルの移動無線システムなどにおいて、音声
品質を保ちつつ情報圧縮を実現するものとして期待され
ている。[0002] A speech encoder based on a speech transmission system using AbS type vector quantization, for example, CELP (Code Exc
Ited Linear Production) encoders are expected to realize information compression while maintaining voice quality in in-house communication systems, digital mobile radio systems, and the like.

【０００３】[0003]

【従来の技術】音声には有声音と無声音とがあり、有声
音は声帯の振動によるパルス音源が基となって発生し、
個人個人のノドや口の声道特性が付加されて声になる。
又、無声音は声帯を振るわせないで出す音で、単なるガ
ウス性の雑音列が音源となって声道を通って声となる。
従って、音声発生メカニズムは図４に示すように、有声
音の元となるパルス音源ＰＳＧと無声音の元となる雑音
源ＮＳＧと、各音源から出力される信号に声道特性を付
加する線形予測合成フィルタＬＰＣＦによりモデル化で
きる。尚、人の声は周期性有し、該周期性はパルス音源
から出力されるパルスの周期性に対応しており、人や話
内容によって異なる。2. Description of the Related Art Voices are classified into voiced sounds and unvoiced sounds, and voiced sounds are generated based on a pulse sound source caused by vibration of a vocal cord.
An individual's throat and mouth vocal tract characteristics are added to form a voice.
An unvoiced sound is a sound emitted without shaking the vocal cords, and a mere Gaussian noise sequence becomes a sound source and becomes a voice through the vocal tract.
Therefore, as shown in FIG. 4, the sound generation mechanism includes a pulse sound source PSG as a source of voiced sound, a noise source NSG as a source of unvoiced sound, and linear predictive synthesis for adding a vocal tract characteristic to a signal output from each sound source. It can be modeled by the filter LPCF. It should be noted that a human voice has periodicity, and the periodicity corresponds to the periodicity of the pulse output from the pulse sound source, and differs depending on the person and the content of the speech.

【０００４】以上のことから、入力音声に対応するパル
ス音源の周期と雑音源の雑音列を特定することができれ
ば、これらパルス周期と雑音源の雑音列を識別する符号
（インデックス）により入力音声を符号化することがで
きる。[0004] From the above, if the period of the pulse sound source and the noise sequence of the noise source corresponding to the input speech can be specified, the input speech can be identified by a code (index) for identifying the pulse period and the noise sequence of the noise source. Can be encoded.

【０００５】そこで、図５に示すように、適応符号帳１
１を用いて入力音声信号の周期性に基づいてパルス音源
のパルス周期を同定し、該周期を備えたパルス列Ｐにゲ
インｂを乗じたベクトルｂＰを線形予測合成フィルタ１
３に入力してフィルタ演算処理を施し、得られたフィル
タ演算結果を入力音声信号から減算してその誤差信号か
ら誤差電力評価部１３が最小となる適応符号帳１１のピ
ッチベクトルを選択することにより、周期成分を除去す
る。[0005] Therefore, as shown in FIG.
1 is used to identify the pulse period of the pulse sound source based on the periodicity of the input audio signal, and a vector bP obtained by multiplying a pulse train P having the period by a gain b is used as a linear prediction synthesis filter 1
3 to perform a filter operation process, subtract the obtained filter operation result from the input speech signal, and select a pitch vector of the adaptive codebook 11 that minimizes the error power evaluator 13 from the error signal. , Remove the periodic component.

【０００６】その後又はこれと同時に、予め複数の雑音
列（各雑音列はＮ次元のコードベクトルで表現されてい
る）を固定（ストカスティック）符号帳１に用意してお
き、各コードベクトルＣに合成フィルタ３の処理を施し
た再生信号ベクトルと上記の周期成分が除去された入力
信号ベクトル（Ｎ次元ベクトル）との誤差が最小となる
コードベクトルを誤差電力評価部５が求めれば、前記周
期とコードベクトルを特定するデータにより音声を符号
化することが出来る。Thereafter or at the same time, a plurality of noise sequences (each noise sequence is represented by an N-dimensional code vector) are prepared in a fixed (stochastic) codebook 1 in advance. If the error power evaluator 5 finds a code vector that minimizes the error between the reproduced signal vector that has been processed by the synthesis filter 3 and the input signal vector (N-dimensional vector) from which the periodic component has been removed, Audio can be encoded by data specifying a code vector.

【０００７】図６はＡ−ｂ−Ｓ法によるベクトル量子化
を用いた音声伝送（符号化）方式の構成図で図５の下半
分に相当しており、詳しく述べると１はランダムに発生
した複数の例えば１０２４種類の雑音列Ｃ（各雑音列は
Ｎ次元コードベクトルで表現されている）を記憶する雑
音符号帳、２はゲインｇの増幅部、３は増幅部出力に声
道特性を模擬した聴覚重み付け演算処理を施す線形予測
合成フィルタ、４は線形予測合成フィルタ３から出力さ
れる再生信号ベクトルと入力信号ベクトルの誤差を出力
する誤差発生部、５は該誤差を評価し、該誤差が最小と
なる雑音列（コートベクトル）を求める誤差電力評価部
である。FIG. 6 is a block diagram of an audio transmission (coding) system using vector quantization by the AbS method, which corresponds to the lower half of FIG. 5, and in detail, 1 is randomly generated. A noise codebook that stores a plurality of, for example, 1024 types of noise sequences C (each noise sequence is represented by an N-dimensional code vector), 2 is an amplifying unit of gain g, and 3 is a simulated vocal tract characteristic in the output of the amplifying unit. A linear prediction synthesis filter for performing the perceptual weighting calculation process, an error generation unit for outputting an error between the reproduced signal vector output from the linear prediction synthesis filter 3 and the input signal vector, and an evaluation unit for evaluating the error. It is an error power evaluator for finding the minimum noise sequence (coat vector).

【０００８】Ａ−ｂ−Ｓ法による量子化では通常のベク
トル量子化と異なり、雑音符号帳１の各コードベクトル
（Ｃ）に最適のゲイン（ｇ）を掛けた後、線形予測合成
フィルタ３でフィルタ処理を施し、フィルタ処理で得ら
れる再生信号ベクトル（ｇＡＣ）と入力信号ベクトル
（Ｘ）との間の誤差信号（Ｅ）を誤差発生部４で求め、
誤差電力評価部５で誤差信号の電力を評価関数（距離尺
度）として雑音符号帳１の探索を行い、誤差電力が最小
となる雑音列（コードベクトル）を求め、該雑音列（コ
ードベクトル）を特定する符号（インデックス）により
入力信号を符号化して伝送する。In the quantization by the AbS method, unlike the ordinary vector quantization, each code vector (C) of the noise codebook 1 is multiplied by an optimum gain (g), and then the linear prediction synthesis filter 3 The filter processing is performed, and an error signal (E) between the reproduced signal vector (gAC) obtained by the filter processing and the input signal vector (X) is obtained by the error generation unit 4,
Carries out the search of the random codebook 1 power of the error signal by the error power evaluation unit 5 as an evaluation function (distance measure), obtains the noise sequences which error power is minimized (code vector), The noise sequence (co <br /> encoding and transmitting an input signal by a code (index) specifying the over de vector).

【０００９】このときの誤差電力は次式｜Ｅ｜²＝｜Ｘ−ｇＡＣ｜² （１）により与えられる。最適なコードベクトル及びゲインｇ
は、この（１）式に示す誤差電力を最小化するものとし
て決定される。尚、声の大きさによりパワーが異なるの
で、ゲインｇを最適化して再生信号パワーを入力信号の
パワーに合わせる。最適ゲインは（１）式をｇで偏微分
して０と置くことにより求めることができる。すなわ
ち、ｄ｜Ｅ｜²／ｄｇ＝０より、ｇはｇ＝（Ｘ^TＡＣ）／（（ＡＣ）^T（ＡＣ））（２）で与えられる。このｇを（１）式に代入すると、｜Ｅ｜²＝｜Ｘ｜²−（Ｘ^TＡＣ）²／（（ＡＣ）^T（ＡＣ））（３）となる。入力信号Ｘと合成フィルタ３の出力ＡＣの相互
相関をＲ_XC、合成フィルタ３の出力ＡＣの自己相関をＲ
_ccとすれば、相互相関及び自己相関は次式Ｒ_XC＝Ｘ^TＡＣ（４）Ｒ_cc＝（ＡＣ）^T（ＡＣ）（５）により表現される。The error power at this time is given by the following equation: | E | ² = | X-gAC | ² (1) Optimal code vector and gain g
Is determined to minimize the error power shown in the equation (1). Since the power varies depending on the loudness of the voice, the gain g is optimized to adjust the reproduction signal power to the power of the input signal. The optimum gain can be obtained by partially differentiating equation (1) with g and setting it to 0. That is, from d | E | ² / dg = 0, g is given by g = (X ^T AC) / ((AC) ^T (AC)) (2) When this g is substituted into the equation (1), | E | ² = | X | ² − (X ^T AC) ² / ((AC) ^T (AC)) (3) The cross-correlation between the input signal X and the output AC of the synthesis filter 3 is R _XC , and the auto-correlation of the output AC of the synthesis filter 3 is R
_{If cc} is used, the cross-correlation and the auto-correlation are expressed by the following equation: R _XC = X ^T AC (4) R _cc = (AC) ^T (AC) (5)

【００１０】（３）式の誤差電力を最小にするコードベ
クトルＣは（３）式の右辺第２項を最大にするものであ
るから、該コードベクトルＣは次式Ｃ＝argmax（Ｒ_xc ²／Ｒ_cc）（６）と表現でき、最適のゲインは（６）式を満たす相互相
関、及び自己相関を用いて（２）式よりｇ＝Ｒ_xc／Ｒ_cc （７）で与えられる。[0010] (3) Since equation code vector C that minimizes the error power is to maximize the second term on the right side of equation (3), the code vector C the following equation C = argmax (R _xc ² / R _cc ) (6), and the optimum gain is given by g = R _xc / R _cc (7) from equation (2) using the cross-correlation and auto-correlation satisfying equation (6).

【００１１】図７は以上の式により、誤差電力が最小と
なる雑音列（コードベクトル）を求めて入力信号を符号
化する雑音符号帳探索処理アルゴリズムをモデル化した
構成図であり、図６に加えて、相互相関Ｒ_xc（＝Ｘ^TＡ
Ｃ）を演算する自己相関演算部、Ｒ_xc ²／Ｒ_CCを演算す
る演算部９とＲ_xc ²／Ｒ_CCが最大となる、換言すれば誤
差電力が最小となる雑音列（コードベクトル）を決定し
て、該コードベクトルを特定する符号を出力する誤差電
力評価部１０が設けられているが等価的に図６と同じも
のである。FIG. 7 is a block diagram showing a model of a noise codebook search processing algorithm for obtaining a noise sequence (code vector) with the minimum error power and encoding an input signal by the above equation. In addition, the cross-correlation R _xc (= X ^T A
Autocorrelation calculating unit for calculating a _{^{C), R xc 2 / R}} CC and an arithmetic unit 9 which calculates R _xc ² / R _CC is maximized, noise sequences which error power is minimized in other words (code vector) An error power evaluator 10 for determining and outputting a code specifying the code vector is provided, but is equivalent to FIG.

【００１２】[0012]

【発明が解決しようとする課題】上述したように、符号
帳探索の処理の内で主なものは、コードベクトルＣに
対するフィルタ処理、相互相関Ｒ_XCの算出処理、及び
自己相関Ｒ_CCの算出処理の三つであり、これら三種類
の演算を符号帳に格納されている全てのコードベクトル
について行い、評価関数値を算出し最適のベクトルを選
択するため、膨大な演算量を必要とする。As described above, the main processes of the codebook search process are the filtering process for the code vector C, the cross-correlation R _XC calculation process, and the auto-correlation R _CC calculation process. Since these three types of operations are performed on all code vectors stored in the codebook, the evaluation function value is calculated, and the optimum vector is selected, an enormous amount of operation is required.

【００１３】Ａ−ｂ−Ｓ型ベクトル量子化を用いた音声
符号化器の適用領域と考えられる自動車・携帯電話にお
いては、装置の小型化・低消費電力化が必須の条件であ
り、この膨大な演算量は、符号化器実現上の重大な障害
となっている。In automobiles and mobile phones, which are considered to be applicable areas of speech encoders using the AbS type vector quantization, miniaturization and low power consumption of the apparatus are indispensable conditions. The large amount of computation is a serious obstacle to the realization of the encoder.

【００１４】一方、従来のＣＥＬＰアルゴリズムでは、
雑音符号帳としてすべてのサンプル（フルサンプル）を
ガウス性の白色雑音によって同仕様に生成したコードベ
クトルで構成されているが、この従来のコードベクトル
より、振幅の絶対値が閾値より大きいサンプル以外を０
値にすることで間引きを行ったスパースコードベクトル
の方が再生音声の特性が良いことが報告されている。On the other hand, in the conventional CELP algorithm,
As a noise codebook, all samples (full samples) are composed of code vectors generated to the same specifications using Gaussian white noise. 0
It has been reported that the sparse code vector thinned out by setting the value to the value has better characteristics of reproduced voice.

【００１５】これは、図４に示したように音声の生成系
では、声帯の振幅により作られるパルス列ＤＳＧを音源
とする有声音と口先などで作られるランダム雑音列ＮＳ
Ｇを音源とする無声音の二種類に分けられることに起因
して、特に有声音の定常部では音源情報としてはパルス
系列が支配的となるため、雑音符号帳の雑音コードベク
トルが却って再生音声品質に悪影響を及ぼしてしまうと
考えられ、高周波成分を除去した形のスパースコードベ
クトルがこの悪影響を抑えて特性を補償するのではない
かと考えられる。This is because, as shown in FIG. 4, in a speech generation system, a voiced sound using a pulse train DSG generated by the amplitude of a vocal cord as a sound source and a random noise sequence NS generated by a mouth and the like are used.
Since the pulse sequence is dominant as the sound source information, especially in the stationary part of voiced sound, because the noise source is divided into two types of unvoiced sound using G as a sound source, the noise code vector of the noise codebook is rather reduced to the reproduced voice quality. It is thought that the sparse code vector from which high-frequency components have been removed suppresses this adverse effect and compensates for the characteristics.

【００１６】しかしながら、雑音符号帳をスパースコー
ドベクトルにしても、前述のように音声生成系において
は時間的に有声音／無声音（パルス音源／ランダム雑音
音源）の例のように音源情報の性質が大きく変わってい
るにも関わらず従来のＣＥＬＰではこのスパース音源情
報を一律の雑音列で表現しているため、パルス音源の依
存度が高い部分（母音の定常部など）では適応符号帳だ
けでは周期性成分の表現が十分に行えず、音質の劣化が
見られるという問題点があった。However, even if the random codebook is used as a sparse code vector, as described above, in the speech generation system, the characteristics of sound source information are temporally different from those of voiced sound / unvoiced sound (pulse sound source / random noise source). In spite of the large change, the conventional CELP expresses this sparse excitation information as a uniform noise sequence. Therefore, in a portion where the pulse excitation is highly dependent (such as a stationary part of a vowel), the period is determined only by the adaptive codebook. There has been a problem that the expression of the sexual component cannot be sufficiently performed, and the sound quality is deteriorated.

【００１７】そこで本発明では、雑音列を表現するコー
ドベクトルに対して声道特性を模擬した線形予測合成フ
ィルタを通して得られる再生信号と入力音声信号との誤
差が最小となるコードベクトルを決定し、該コードベク
トルを特定するインデックス情報により該入力音声信号
を符号化する音声伝送方式において、雑音符号帳をスパ
ースコードベクトルで生成するだけでなく適応符号帳で
も充分に表現できない周期性成分を雑音符号帳で補償で
きるようなコードベクトルで生成することを目的とす
る。Accordingly, in the present invention, a code vector that minimizes an error between a reproduced signal obtained through a linear prediction synthesis filter simulating vocal tract characteristics and an input speech signal with respect to a code vector representing a noise sequence is determined. In a speech transmission system that encodes the input speech signal by using index information that specifies the code vector, a periodic component that cannot be sufficiently expressed by an adaptive codebook as well as generating a noise codebook by a sparse codebook is generated by a noise codebook. It is intended to generate a code vector that can be compensated by the following.

【００１８】[0018]

【課題を解決するための手段及び作用】このような問題
点を解決するために、本発明では雑音符号帳を、（１）大きな演算量削減効果が図れるスパース化された
（間引かれた）デルタ符号帳とし、更に（２）このデルタ符号帳を一律の雑音列でないスパース
コードベクトルで生成することとした。In order to solve such a problem, the present invention uses a noise codebook in which (1) a sparse (thinned out) signal which can achieve a large computational amount reduction effect. (2) This delta codebook is generated by a sparse code vector that is not a uniform noise sequence.

【００１９】（１）スパースデルタ符号帳を用いるこ
と：デルタ符号帳とは、図１(1) に示す従来のフルサン
プルの雑音符号帳の代わりに同図(2) に示す特殊な構造
を持った符号帳であり、同図(3) に示す如く、その各コ
ードベクトル（符号語）Ｃ₀〜Ｃ_N（Ｃ_K）が、基本と
なる初期ベクトル（Ｃ₀）を元に、以前に生成されてい
るコードベクトル（Ｃ_K-1)に対しデルタベクトルΔＣ₁
〜ΔＣ_N（ΔＣ_K) を加えることにより生成されるもの
として既に知られており、本発明ではまず、このデルタ
符号帳の各コードベクトルをスパース化したものとす
る。[0019](1) Use a sparse delta codebook
When:The delta codebook is a conventional full sampler as shown in Fig. 1 (1).
Special structure shown in Fig. 2 (2) instead of pull noise codebook
As shown in Fig. 3 (3), each codebook has
Code vector (codeword) C₀~ C_N(C_K) But the basics
Initial vector (C₀), Previously generated
Code vector (C_K-1) For the delta vector ΔC₁
~ ΔC_N(ΔC_K)
In the present invention, first, this delta
Let each codebook in the codebook be sparse.
You.

【００２０】従って、各コードベクトルの差がデルタベ
クトルとなる関係で構成されるため、各コードベクトル
における評価式の主要要素となる相互相関値及び自己相
関値は、スパースデルタコードベクトルの要素分の演算
で更新が可能となり、逐一相関値を計算する従来法に比
べ大幅な演算量削減効果が望める。Therefore, since the difference between the code vectors is configured to be a delta vector, the cross-correlation value and the auto-correlation value, which are the main elements of the evaluation formula in each code vector, are the same as those of the sparse delta code vector. Updates can be performed by calculation, and a significant reduction in the amount of calculation can be expected as compared with the conventional method of calculating correlation values one by one.

【００２１】即ち、各インデックスの間が全くランダム
な関係である従来符号帳と比べ、或る固定数サンプルの
違いだけのスパースデルタ符号帳では探索処理が前イン
デックスの相関値を用いて漸化的に行え、実際の入力音
声との誤差評価処理は、次式に示すように一つ前のコー
ドベクトルの評価時の相関値とデルタコードベクトルの
相関要素との和の形で表すことができる。That is, as compared with the conventional codebook in which each index has a completely random relationship, in the sparse delta codebook having only a certain fixed number of sample differences, the search process uses the correlation value of the previous index to perform recursive processing. The error evaluation process with respect to the actual input voice can be expressed in the form of the sum of the correlation value at the time of evaluation of the immediately preceding code vector and the correlation element of the delta code vector as shown in the following equation.

【００２２】 Rcx(k)=(ACk)^T(AX) =(Ck-1+ ΔCk) ^TA ^TAX =R_CX(k-1)+ΔCk^TA ^TAX （８） Rcc(k)=(ACk)^T(ACk) =(Ck-1+ ΔCk) ^TA ^TA(Ck-1+ ΔCk) =Rcc(k)+ 2ΔCk^TA ^TACk-1+ΔCk^TA ^TA ΔCk （９）[0022] Rcx (k) = (ACk) T (AX) = (Ck-1 + ΔCk) T A T AX = R CX (k-1) + ΔCk T A T AX (8) Rcc (k) = ( ACk) ^T (ACk) = (Ck-1 + ΔCk) ^T A ^T A (Ck-1 + ΔCk) = Rcc (k) + 2ΔCk ^T A ^T ACk-1 + ΔCk ^T A ^T A ΔCk (9)

【００２３】（２）一律の雑音列でないスパースコード
ベクトルで生成すること：上記のように大きな演算量削減効果が図れるスパースデ
ルタ符号帳の長所をそのままにして、加え合わせるデル
タ符号帳の位相条件を工夫し、各コードベクトルのスパ
ースレート（各コードベクトルの次元数における非零値
サンプルの割合）を一定にせず、図２(1) に示すように
疎／密の二つの性質を合わせ持つ雑音符号帳が仮想的に
生成されるようなデルタ符号帳を用意すると、同図(2)
に示すようなパルス系列からフルサンプル雑音系列まで
広いバリエーションを持つ符号帳を用いた処理を一元的
に行うことができ、スパースデルタ符号帳の大幅な演算
量削減効果を生かしつつ、量子化特性、従って再生音声
品質の向上を図ることが可能となる。この場合、本発明
では、該スパースレートを疎から蜜に変化させるため偶
数インデックスのコードベクトルでは前のコードベクト
ルと重ならないサンプル位置のデルタコードベクトルを
用意してサンプル数が増加するようにし、奇数インデッ
クスでは前のコードベクトルの非零値のサンプル位置と
重なる位置にサンプルを持つデルタコードベクトルを用
意してサンプル数が増加しないようにしている。 (2) Sparse code that is not a uniform noise sequence
Generating as a vector: While maintaining the advantages of the sparse delta codebook that can achieve a large reduction in the amount of computation as described above, devising the phase condition of the delta codebook to be added, the sparse rate of each code vector (each code vector Delta codebook such that a random codebook having both sparse / dense properties is virtually generated as shown in FIG. Prepare (2)
Processing using a codebook having a wide variation from a pulse sequence to a full sample noise sequence as shown in (1) can be performed centrally, while taking advantage of the significant computational amount reduction effect of the sparse delta codebook, quantization characteristics, Therefore, it is possible to improve the reproduction sound quality. In this case, the present invention
In order to change the sparse rate from sparse to honey,
The previous code vector in a code vector with a numerical index
The delta code vector of the sample position that does not overlap
Be prepared to increase the number of samples, and
The non-zero sample position of the previous code vector
Use delta code vector with samples at overlapping positions
The number of samples does not increase intentionally.

【００２４】[0024]

【実施例】図２に示したような疎／密の二つの性質を合
わせ持つスパースデルタ符号帳を生成する方法の実施例
が図３に示されており、特に図２(2) に示すようなパル
スコードベクトル部と疎／蜜コードベクトル部とフルサ
ンプルコードベクトル部のそれぞれの生成方法が図３
(1) 〜(4) に示されている。FIG. 3 shows an embodiment of a method for generating a sparse delta codebook having both sparse / dense characteristics as shown in FIG. 2, and in particular, as shown in FIG. 2 (2). FIG. 3 shows a method of generating a simple pulse code vector part, a sparse / honey code vector part, and a full sample code vector part.
These are shown in (1) to (4).

【００２５】（１）パルスコードベクトルの生成（図３
(1) ）：まず初期ベクトルを第０サンプルに持つベクト
ル（例えば同図のインパルスベクトルｍ）とし、±1.0
のペアリングのデルタコードベクトルΔｍを考える。そ
うすると、デルタ符号帳の最初の次元個（Ｎ個）のコー
ドベクトルは、図示のように一つ前のコードベクトルｍ
とデルタコードベクトルΔｍとを加えたコードベクトル
ｍ＋Δｍとなり、位相が一つシフトしただけのインパル
スコードベクトル群が仮想的に形成されることとなる。
従って、デルタ符号帳としてはＮ個分のコードベクトル
Δｍが用意されることとなる。 (1) Generation of Pulse Code Vector (FIG. 3
(1)): First, a vector having an initial vector in the 0th sample (for example, an impulse vector m in FIG. 3), and ± 1.0
Consider the delta code vector Δm of the pairing of. Then, the code vector of the first dimension (N) of the delta codebook is, as shown in FIG.
And a delta code vector Δm are added to form a code vector m + Δm, and an impulse code vector group having only one phase shift is virtually formed.
Therefore, N code vectors Δm are prepared as the delta codebook.

【００２６】即ち、この実施例としては、加えられるデ
ルタコードベクトルΔｍのサンプル数を二本に固定し、
前インデックスと相殺するサンプル数と０値の位置に新
たに生成するサンプル数をそれぞれ一本づつ割り当て、
サンプル位置を一つづつシフトさせて図２(2) に示した
インパルスコードベクトルを仮想的に生成している。That is, in this embodiment, the number of samples of the delta code vector Δm to be added is fixed to two.
Allocate the number of samples to be canceled with the previous index and the number of newly generated samples at the position of 0 value one by one,
The impulse code vectors shown in FIG. 2 (2) are virtually generated by shifting the sample positions one by one.

【００２７】（２）疎／蜜コードベクトルの生成（図３
(2) ，(3) ）：そして、Ｎ番目からは偶数インデックス
のコードベクトルでは前のコードベクトルと重ならない
サンプル位置のデルタコードベクトルを用意してサンプ
ル数が増加するようにし（図３(2) 参照）、奇数インデ
ックスのコードベクトルでは前のコードベクトルの非零
値のサンプル位置と重なる位置にサンプルを持つデルタ
コードベクトルを用意してサンプル数が増加しないよう
にする（同図(3) 参照）、という拘束条件を付ける。 (2) Generation of Sparse / Honey Code Vector (FIG. 3)
(2), (3)): From the N-th, a delta code vector at a sample position that does not overlap with the previous code vector in the code vector of the even index is prepared so that the number of samples is increased (FIG. )), For the odd-indexed code vector, prepare a delta code vector having a sample at a position overlapping the non-zero sample position of the previous code vector so that the number of samples does not increase (see FIG. 3 (3)). ).

【００２８】これにより、Ｎ番目から２Ｎ−１番目まで
のインデックスのコードベクトルでは、２インデックス
で２本のサンプル増加率（１インデックス当たり一本換
算）で疎から密の雑音符号帳が形成できる。尚、このよ
うに疎から密へのサンプル数の変化を２段階に分けたの
は、図３(2) の密度増加だけの場合にはすぐにフルサン
プル列になってしまうからである。As a result, with the code vectors of the Nth to 2N-1st indices, a sparse to dense noise codebook can be formed at a rate of two samples (one conversion per index) with two indices. The reason why the change in the number of samples from sparse to dense is divided into two stages is that if only the density is increased as shown in FIG.

【００２９】（３）フルサンプルコードベクトルの生成
（図３(4) ）：２Ｎ番目以降はこの拘束条件を外し、ラ
ンダムにサンプル位置と振幅値を定めれば、Ｍを雑音符
号帳サイズとして、Ｍ−２Ｎ＋１本のフルサンプルコー
ドベクトルが用意できる。 (3) Generation of Full Sample Code Vector
(FIG. 3 (4)): If the constraint conditions are removed from the 2Nth and thereafter and the sample position and the amplitude value are determined at random, M−2N + 1 full sample code vectors can be prepared with M as the noise codebook size. .

【００３０】尚、上記のようにして生成されるデルタ符
号帳は、復号化側においても全く同じものが用意されて
雑音符号帳を仮想的に生成することにより再生音声が得
られることとなる。The same delta codebook is prepared on the decoding side as described above, and reproduced voice is obtained by virtually generating a noise codebook.

【００３１】[0031]

【発明の効果】このように本発明に係る音声伝送方式に
よれば、予め基準雑音列を表現するＮ次元のスパース初
期ベクトルとデルタ雑音列を表現するスパースデルタベ
クトルとをスパースデルタ符号帳として用意し、このス
パースデルタ符号帳によって仮想的に生成される各雑音
列コードベクトルのスパースレートが疎から蜜に順次配
列されるように構成したので、非常に効率的な探索技術
を適用できるスパースデルタコードベクトルを用いて有
声音／無声音における音声生成系のパルス音源／ランダ
ム雑音音源の変化に対してより適切なコードベクトルを
提供でき、符号化特性の向上を図ることができる。ま
た、このデルタコードベクトルのサンプル数の増減、相
殺、生成サンプルの割り当てなどの条件を変えることに
よりあらゆる特性の雑音符号帳を形成することができ
る。As described above, according to the speech transmission system of the present invention, an N-dimensional sparse initial vector representing a reference noise sequence and a sparse delta vector representing a delta noise sequence are prepared in advance as a sparse delta codebook. Since the sparse rate of each noise sequence code vector virtually generated by the sparse delta codebook is arranged in order from sparse to honey, a sparse delta code to which a very efficient search technique can be applied. By using the vector, a code vector more suitable for a change in the pulse sound source / random noise sound source of the voice generation system for voiced / unvoiced sound can be provided, and the encoding characteristics can be improved. Also, by changing conditions such as increase / decrease of the number of samples of the delta code vector, cancellation, allocation of generated samples, and the like, it is possible to form a noise codebook having all characteristics.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明に係る音声伝送方式に用いるデルタコー
ドベクトルの原理と符号帳の概念を示した図である。FIG. 1 is a diagram showing the principle of a delta code vector and the concept of a codebook used in a voice transmission system according to the present invention.

【図２】本発明に係る音声伝送方式によりデルタ符号帳
から仮想的に生成される雑音符号帳の構成を示した図で
ある。FIG. 2 is a diagram illustrating a configuration of a noise codebook virtually generated from a delta codebook by the voice transmission method according to the present invention.

【図３】本発明に係る音声伝送方式によりデルタ符号帳
から雑音コードベクトルを生成する実施例を説明するた
めの図である。FIG. 3 is a diagram for explaining an embodiment of generating a noise code vector from a delta codebook by a voice transmission method according to the present invention.

【図４】音声生成系の概念を示した図である。FIG. 4 is a diagram showing a concept of a voice generation system.

【図５】一般的なＣＥＬＰの原理について示したブロッ
ク図である。FIG. 5 is a block diagram showing the principle of general CELP.

【図６】従来のＡ−ｂ−Ｓ型ベクトル量子化の構成につ
いて示したブロック図である。FIG. 6 is a block diagram showing a configuration of a conventional AbS type vector quantization.

【図７】雑音符号帳の符号帳探索に要する演算機能につ
いて示したブロック図である。FIG. 7 is a block diagram showing an arithmetic function required for searching a codebook of a random codebook.

【符号の説明】[Explanation of symbols]

１雑音符号帳２適応符号帳Ｃ₀初期ベクトル ΔＣ₁〜ΔＣ_N デルタ符号帳図中、同一符号は同一又は相当部分を示す。Reference Signs List 1 noise codebook 2 adaptive codebook C ₀ initial vector ΔC _{1 to} ΔC _N delta codebook In the drawings, the same codes indicate the same or corresponding parts.

───────────────────────────────────────────────────── フロントページの続き (72)発明者栗原秀明神奈川県川崎市中原区上小田中1015番地富士通株式会社内 (72)発明者加藤雅子神奈川県川崎市中原区上小田中1015番地富士通株式会社内 (56)参考文献特開平３−177900（ＪＰ，Ａ) 特開平２−282799（ＪＰ，Ａ) 特開平３−245195（ＪＰ，Ａ) 特開平３−273300（ＪＰ，Ａ) 赤嶺他，”適応密度パルス列モデルに基づくＣＥＬＰ符号化方式”，1989年電子情報通信学会秋季全国大会講演論文集（1989−９），分冊１，Ａ−３ (58)調査した分野(Int.Cl.⁷，ＤＢ名) G10L 19/00 - 19/14 H03M 7/30 H04B 14/04 ──────────────────────────────────────────────────続き Continuing on the front page (72) Inventor Hideaki Kurihara 1015 Uedanaka, Nakahara-ku, Kawasaki City, Kanagawa Prefecture Inside Fujitsu Limited (72) Inventor Masako Kato 1015 Kamikodanaka, Nakahara-ku, Kawasaki City, Kanagawa Prefecture Fujitsu Limited ( 56) References JP-A-3-177900 (JP, A) JP-A-2-282799 (JP, A) JP-A-3-245195 (JP, A) JP-A-3-273300 (JP, A) Akamine et al. , "CELP Coding Scheme Based on Adaptive Density Pulse Train Model", Proc. Of the 1989 IEICE Autumn National Convention (1989-9), Volume 1, A-3 (58). ⁷ , DB name) G10L 19/00-19/14 H03M 7/30 H04B 14/04

Claims

(57)【特許請求の範囲】(57) [Claims]

【請求項１】雑音列を表現するコードベクトルに対し
て声道特性を模擬した線形予測合成フィルタを通して得
られる再生信号と入力音声信号との誤差が最小となるコ
ードベクトルを決定し、該コードベクトルを特定するイ
ンデックス情報により該入力音声信号を符号化する音声
伝送方式において、予め基準雑音列を表現するＮ次元のスパース初期ベクト
ルとデルタ雑音列を表現するスパースデルタベクトルと
をスパースデルタ符号帳として用意し、該スパースデル
タ符号帳によって仮想的に生成される各雑音列コードベ
クトルのスパースレートが疎から蜜に順次配列されてお
り、該スパースレートを疎から蜜に変化させるため偶数
インデックスのコードベクトルでは前のコードベクトル
と重ならないサンプル位置のデルタコードベクトルを用
意してサンプル数が増加するようにし、奇数インデック
スでは前のコードベクトルの非零値のサンプル位置と重
なる位置にサンプルを持つデルタコードベクトルを用意
してサンプル数が増加しないようにすることを特徴とし
た音声伝送方式。1. A code vector which minimizes an error between a reproduced signal obtained through a linear predictive synthesis filter simulating vocal tract characteristics and an input speech signal with respect to a code vector expressing a noise sequence, and In an audio transmission system that encodes the input audio signal by using index information that specifies a sparse delta codebook, an N-dimensional sparse initial vector expressing a reference noise sequence and a sparse delta vector expressing a delta noise sequence are prepared in advance as a sparse delta codebook. The sparse rate of each noise sequence code vector virtually generated by the sparse delta codebook is sequentially arranged from sparse to fine .
To change the sparse rate from sparse to honey
The previous code vector in the index code vector
Use the delta code vector of the sample position that does not overlap with
To increase the number of samples
Is the nonzero sample position and weight of the previous code vector.
Delta code vector with samples at different positions
Voice transmission scheme number of samples were characterized to Rukoto to prevent increase in.

【請求項２】該デルタベクトルがパルス系列からフル
サンプル雑音系列までを含んでいることを特徴とした請
求項１記載の音声伝送方式。2. The audio transmission system according to claim 1, wherein said delta vector includes a sequence from a pulse sequence to a full sample noise sequence.

【請求項３】該パルス系列が、一つ前のインデックス
のコードベクトルと相殺するサンプルと０値の位置に立
つサンプルとのペアリングという条件でランダムに生成
したデルタベクトルによって生成されることを特徴とし
た請求項２に記載の音声伝送方式。3. The pulse sequence is generated by a randomly generated delta vector under a condition of pairing a sample to be canceled with a code vector of an immediately preceding index and a sample standing at a position of 0 value. The audio transmission system according to claim 2, wherein