JPH037120B2

JPH037120B2 -

Info

Publication number: JPH037120B2
Application number: JP56046810A
Authority: JP
Inventors: Kazunori Ozawa
Original assignee: Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1981-03-30
Filing date: 1981-03-30
Publication date: 1991-01-31
Also published as: JPS57161795A

Description

【発明の詳細な説明】高能率音声信号符号化方式の分野において、特
に伝送レイトが9.6kビツト／秒前後の帯域におい
て、音声信号のスペクトラム構造を表わす係数を
量子化伝送する方式が知られている。この方式の
代表的なものとしては、適応型直交変換符号化方
式（Adaptive Transform Coding：以下、ATC
と記す）が挙げられる。ATCに関する基本的な
説明は、ロナルド・イー・クロシヤー
（RONALD.E.CROCHIERE）氏らによるアイ・
イー・イー・イー・トランザクシヨンズ・オン・
アコーステイクス・スピーチ・アンド・シグナル
プロセシング（IEEE TRANSACTIONS ON
ACOUSTICS，SPEECH，AND SIGNAL
PROCESSING）誌1979年10月号512〜530頁に掲
載の「フリーケンシイ・ドメイン・コーデイン
グ・オブ・スピーチ」（“Frequency Domain
Goding of Speech。”）と題した論文（文献１）、
あるいは発明の名称が「音声信号処理回路」であ
る特開昭55−57900号に述べられている。これは、
短時間音声信号系列の離散的コサイン変換
（Discrete Cosine Transform：以下DCTと記
す）係数を求め、DCT係数を量子化伝送する方
式である。[Detailed Description of the Invention] In the field of high-efficiency audio signal encoding systems, a method is known that quantizes and transmits coefficients representing the spectral structure of an audio signal, especially in a band where the transmission rate is around 9.6 kbit/s. There is. A typical example of this method is Adaptive Transform Coding (ATC).
). A basic explanation of ATC is provided by Ronald E. CROCHIERE et al.
E.E. Transactions on
Acoustics Speech and Signal Processing (IEEE TRANSACTIONS ON
ACOUSTICS, SPEECH, AND SIGNAL
``Frequency Domain Coding of Speech'' published in October 1979 issue of PROCESSING, pages 512-530.
Goding of Speech. ”) (Reference 1),
Alternatively, the invention is described in Japanese Patent Application Laid-Open No. 57900/1983, which is entitled "Audio Signal Processing Circuit." this is,
This method calculates discrete cosine transform (hereinafter referred to as DCT) coefficients of a short-time audio signal sequence and quantizes and transmits the DCT coefficients.

また、DCT係数の推定値を計算し、DCT係数
と推定値との差分を量子化伝送する方式が知られ
ており、この方式に関しては発明の名称が「適応
型音声伝送方式およびその装置」である特願昭55
−136282号（文献２）に述べられている。ここで
は従来方式として前記文献１に引用した方式を説
明する。 In addition, a method is known in which an estimated value of a DCT coefficient is calculated and the difference between the DCT coefficient and the estimated value is quantized and transmitted. A special request in 1977
-136282 (Reference 2). Here, the method cited in the above-mentioned document 1 will be explained as a conventional method.

ブロツク図を第１図ａ，ｂに示す。図におい
て、第１図ａで示す送信側では入力端子１０に標
本化された音声信号系列ｘ（ｎ）が入力される。
１５は入力信号をＭサンプルずつブロツク処理す
るためにＭサンプル分記憶するバツフアメモリで
あり、音声信号系列はＭサンプルずつバツフアメ
モリ１５に格納される。２０は窓関数回路であ
り、バツフアメモリ１５に格納されているＭサン
プルの信号に対してあらかじめ定められた窓関数
を掛ける操作を行なう。前記文献１で述べられて
いる窓関数の一例を第２図に示す。第２図でｎは
サンプル時刻を示し、ｍは重なりあうサンプル数
を示す。以下、Ｍ点のサンプル値の並びをブロツ
クと呼ぶことにすると、第２図の実線部は現ブロ
ツクの窓関数を示し、破線部はその前後のブロツ
クの窓関数を示す。窓関数回路２０の出力値は離
散的コサイン変換回路（以下、DCT回路と記す）
30に入力される。ここでＭサンプルの離散的コサ
イン変換（Discrete Cosine Transform；以下
DCTと記す）係数の計算法の一つとしては、前
記文献１によれば下式に従つて計算される。 The block diagram is shown in Figures 1a and 1b. In the figure, on the transmitting side shown in FIG. 1a, a sampled audio signal sequence x(n) is input to an input terminal 10.
Reference numeral 15 denotes a buffer memory for storing M samples of the input signal in order to block process the input signal in units of M samples, and the audio signal sequence is stored in the buffer memory 15 in units of M samples. 20 is a window function circuit, which performs an operation of multiplying the M sample signal stored in the buffer memory 15 by a predetermined window function. An example of the window function described in Document 1 is shown in FIG. In FIG. 2, n indicates the sample time, and m indicates the number of overlapping samples. Hereinafter, the sequence of sample values at M points will be referred to as a block. In FIG. 2, the solid line section shows the window function of the current block, and the broken line section shows the window functions of the blocks before and after it. The output value of the window function circuit 20 is converted to a discrete cosine transform circuit (hereinafter referred to as a DCT circuit).
Entered at 30. Here, M-sample discrete cosine transform (Discrete Cosine Transform; hereinafter
According to the above-mentioned document 1, the coefficient (denoted as DCT) is calculated according to the following formula.

Vc（ｋ）＝_M-1 〓ⁿ⁼⁰ ｖ（ｎ）Ｃ（ｋ）cos〔（2n＋１）πk／2M〕 −(1) （０≦ｋ≦Ｍ−１）ここでＣ（ｋ）＝１（ｋ＝０） √２（１≦ｋ≦Ｍ−１） −(2) (1)式において、Ｖ（ｎ）（０≦ｎ≦Ｍ−１）は、
窓関数回路２０の出力値であり、Vc〔ｋ）はｖ
（ｎ）のＭサンプルDCT係数を示す。また逆コサ
イン変換（以下、IDCTと記す）は、次式に従い
計算される。 Vc(k)= _M-1 〓 ⁿ⁼⁰ v(n)C(k)cos[(2n+1)πk/2M] −(1) (0≦k≦M-1) Here, C(k)=1 (k=0) √2 (1≦k≦M−1) −(2) In equation (1), V(n) (0≦n≦M−1) is
It is the output value of the window function circuit 20, and Vc[k] is v
The M-sample DCT coefficients of (n) are shown. Inverse cosine transform (hereinafter referred to as IDCT) is calculated according to the following formula.

ｖ（ｎ）＝１／Ｍ_M-1 〓^K=0 Ve（ｋ）Ｃ（ｋ）cos〔（2n＋１）πk／2M〕 −(3) （０≦ｎ≦Ｍ−１）ＭサンプルDCT係数の計算法としては、2Mサ
ンプルの離散的フーリエ変換（Discrete Fourier
Transfrom；以下、DFTと記す）を用いる計算
法が知られている。DFTを用いた計算法につい
ては前記文献１等で詳述されているのでここでは
説明を省略する。DCT回路３０はＭサンプル
DCT係数Vc（ｋ）を自己相関関数計算回路４０
とDCT係数適応型量子化符号化回路１１０へ出
力する。自己相関関数計算回路４０は、まずVc
（ｋ）の各々の２乗値を計算しさらにこれらに対
して2Mサンプルの逆離散的フーリエ変換
（Inverse Fourier Transform；以下IDFTと記
す）を施し、Ｍ個の擬似自己相関関数を計算す
る。ここで“擬似”と記したのは一般にDFT係
数から得られるパワースペクトラムと自己相関関
数とはウイーナー（Wiener）の定理によつて関
係づけられているが、本実施例においてはDCT
係数から自己相関関数を計算したためである。次
に、Ｍ個の擬似自己相関関数はピツチ検出回路５
０と予測パラメータ計算回路７０へ出力される。
ピツチ検出回路５０は、Ｍ個の擬似自己相関関数
を用いてあらかじめ定められた個数の予測パラメ
ータ値系列（一例として、Ｋパラメータ値系列と
する。）を計算し、Ｋパラメータ植系列を量子化
器６０へ出力する。なお、ｋパラメータは別名パ
ーコール・パラメータとも呼ばれる。量子化器６
０は、ピツチ周期Ｐ、ピツチゲインP_G及びＫパ
ラメータ値系列をあらかじめ定められた量子化ビ
ツト数と量子化ステツプサイズを用いて量子化
し、得られた量子化値系列を逆量子化器６５及び
サイド情報符号器１３０へ出力する。逆量子化器
６５は、入力した量子化値系列を逆量子化（復号
化）し、ピツチ周期P′、ピツチゲインP_G′及びＫ
パラメータ値系列K′を得る。さらにP′，P_G′をス
ペクトラム再生回路９０へ出力し、K′をパラメ
ータ変換回路８０へ出力する。パラメータ変換回
路８０は、K′を入力し、スペクトラム再生に有
利なパラメータ値系列（例えばαパラメータ値系
列、αパラメータは線形予測係数と呼ばれること
もある。）に変換し、αパラメータ値系列をスペ
クトラム再生回路９０へ出力する。スペクトラム
再生回路９０は、ピツチ周期P′とピツチゲイン
P_G′とからピツチ信号を発生させDFTによつてピ
ツチスペクトラムσp（ｋ）に変換する。さらに、
αパラメータ値系列を用いてDFTによつて包絡
スペクトラムσf（ｋ）を計算する。ここで包絡ス
ペクトラムはフオルマントスペクトラムとも呼ば
れる。その後にσp（ｋ）とσf（ｋ）とを組みあわ
せて再生スペクトラムσ^（ｋ）を計算する。次にσ^f
（ｋ）及びσ（ｋ）はビツト割り当てステツプサイ
ズ計算回路１００へ出力される。ビツト割り当て
ステツプサイズ計算回路１００の計算アルゴリズ
ムを前記文献１に従つて次に説明する。最初にス
テツプサイズ計算式を示す。 v(n)=1/M _M-1 〓 ^K=0 Ve(k)C(k)cos[(2n+1)πk/2M] −(3) (0≦n≦M-1) M sample DCT coefficient The calculation method is Discrete Fourier transform of 2M samples.
A calculation method using Transfrom (hereinafter referred to as DFT) is known. The calculation method using DFT is detailed in the above-mentioned document 1, so the explanation is omitted here. DCT circuit 30 has M samples
The DCT coefficient Vc(k) is calculated by the autocorrelation function calculation circuit 40.
and is output to the DCT coefficient adaptive quantization encoding circuit 110. The autocorrelation function calculation circuit 40 first calculates Vc
The square value of each of (k) is calculated, and further, a 2M sample inverse discrete Fourier transform (hereinafter referred to as IDFT) is applied to these values to calculate M pseudo-autocorrelation functions. The term "pseudo" here means that the power spectrum obtained from the DFT coefficients and the autocorrelation function are generally related by Wiener's theorem, but in this example, the DCT
This is because the autocorrelation function was calculated from the coefficients. Next, the M pseudo-autocorrelation functions are calculated by the pitch detection circuit 5.
0 and is output to the prediction parameter calculation circuit 70.
The pitch detection circuit 50 calculates a predetermined number of predicted parameter value sequences (as an example, a K parameter value sequence) using M pseudo-autocorrelation functions, and converts the K parameter value sequence into a quantizer. Output to 60. Note that the k parameter is also called a percall parameter. Quantizer 6
0 quantizes the pitch period P, pitch gain P _G and K parameter value series using a predetermined number of quantization bits and quantization step size, and applies the obtained quantization value series to the inverse quantizer 65 and the side The information is output to the information encoder 130. The inverse quantizer 65 inversely quantizes (decodes) the input quantized value sequence, and calculates pitch period P', pitch gain P _G ', and K
Obtain the parameter value series K′. Further, P' and P _G ' are outputted to the spectrum reproduction circuit 90, and K' is outputted to the parameter conversion circuit 80. The parameter conversion circuit 80 inputs K' and converts it into a parameter value series (for example, an α parameter value series, the α parameter is also called a linear prediction coefficient) that is advantageous for spectrum reproduction, and converts the α parameter value series into a spectrum. It is output to the reproduction circuit 90. The spectrum regeneration circuit 90 has a pitch period P′ and a pitch gain.
A pitch signal is generated from P _G ' and converted to a pitch spectrum σp(k) by DFT. moreover,
An envelope spectrum σf(k) is calculated by DFT using the α parameter value series. Here, the envelope spectrum is also called formant spectrum. Thereafter, the reproduction spectrum σ^(k) is calculated by combining σp(k) and σf(k). Then σ^f
(k) and σ(k) are output to bit allocation step size calculation circuit 100. The calculation algorithm of the bit allocation step size calculation circuit 100 will be explained below according to the above-mentioned document 1. First, the step size calculation formula is shown.

Δ（ｋ）＝Ｑ・Ａ（ｂ（ｋ））・σ^（ｋ） −(4) （０≦ｋ≦Ｍ−１）ここでΔ（ｋ）はｋ番目のDCT係数を量子化す
る際の量子化ステツプサイズを示し、σ^（ｋ）は
前記再生スペクトラムを示す。ｂ（ｋ）は割り当
てビツト数であり、Ａ（ｂ（ｋ））はｂ（ｋ）によつ
て決まる定数である。Ｑは負荷定数であり、負荷
に対する量子化器の雑音特性を決定する。次にビ
ツト割り当て計算アルゴリズムを示す。ｋ番目の
DCT係数Vc（ｋ）に対する割り当てビツト数は
次式によつて計算される。 Δ(k)=Q・A(b(k))・σ^(k) −(4) (0≦k≦M−1) Here, Δ(k) is when quantizing the k-th DCT coefficient. σ^(k) represents the reproduction spectrum. b(k) is the number of allocated bits, and A(b(k)) is a constant determined by b(k). Q is a loading constant and determines the noise characteristics of the quantizer with respect to loading. Next, the bit allocation calculation algorithm is shown. kth
The number of bits allocated to the DCT coefficient Vc(k) is calculated by the following equation.

ｂ（ｋ）＝δ＋１／２log₂Ｗ（ｋ）σ^²（ｋ）／Ｄ（０≦ｋ≦Ｍ−１） −(5) ここでδはあらかじめ定められた定数、Ｗ（ｋ）
は周波数重み関数、Ｄは次式で表わされる量子化
雑音電力である。 b(k)=δ+1/2log ₂ W(k)σ^ ² (k)/D (0≦k≦M−1) −(5) Here, δ is a predetermined constant, W(k)
is a frequency weighting function, and D is a quantization noise power expressed by the following equation.

Ｄ＝１／Ｍ_M-1 〓^k=0 Ｗ（ｋ）σ² _e（ｋ）（０≦ｋ≦Ｍ−１） −(6) ここでσ² _e（ｋ）は、ｋ番目のDCT係数の量子化
雑音電力を示す。周波数重み関数Ｗ（ｋ）は次式
によつて表わされる。 D=1/M _M-1 〓 ^k=0 W(k)σ ² _e (k) (0≦k≦M-1) −(6) Here, σ ² _e (k) is the k-th DCT coefficient shows the quantization noise power of The frequency weighting function W(k) is expressed by the following equation.

Ｗ（ｋ）＝σ^2r _f（ｋ）（０≦ｋ≦Ｍ−１） −(7) ここでσf（ｋ）は前述の包絡スペクトラムであ
る。ｒは、量子化雑音電力の周波数特性を決定す
る因子であり−１≦ｒ≦０の値をとる。ｒの値に
よる量子化雑音電力σ² _e（ｋ）の変化を第３図示す。
第３図において、横軸はｋの値を示し、縦軸は対
数電力値を示す。また実線部は信号電力スペクト
ラムを示し、破線部は量子化雑音電力スペクトラ
ムを示す。ｒ＝０の時は量子化雑音電力スペクト
ラムは平坦な周波数特性を有し、ｒ＝−１の時は
信号電力スペクトラムと相似な周波数特性を有
す。ｒの値としては、普通は−１＜ｒ＜０の値を
用いる。ｒの値によつて量子化雑音電力スペクト
ラムの周波数特性を変化させる手法を、文献１に
従つてノイズ・シエイピング（Noise Shaping）
と呼ぶ。第１図ａに戻つて、DCT係数適応型量
子化符号化回路１１０は、ビツト割り当て量子化
ステツプサイズ計算回路１００から量子化ステツ
プサイズΔ（ｋ）と割り当てビツト数ｂ（ｋ）とを
入力し、こられの値を用いてDCT係数Vc（ｋ）
を適応的に量子化符号化しマルチプレクサ１２０
へ出力する。サイド情報符号器１３０は、量子化
器６０からの量子化パラメータ値系列をサイド情
報として符号化しマルチプレクサ１２０へ出力す
る。マルチプレクサ１２０はサイド情報符号器１
３０の出力符号と、DCT係数適応型量子化符号
化回路１１０の出力符号とをＭサンプル時刻毎に
送信側出力端子１４０を通して受信側へ伝送す
る。 W(k)=σ ^2r _f (k) (0≦k≦M−1) −(7) where σf(k) is the aforementioned envelope spectrum. r is a factor that determines the frequency characteristics of quantization noise power, and takes a value of -1≦r≦0. Figure 3 shows changes in the quantization noise power σ ² _e (k) depending on the value of r.
In FIG. 3, the horizontal axis shows the value of k, and the vertical axis shows the logarithmic power value. Moreover, the solid line part shows the signal power spectrum, and the broken line part shows the quantization noise power spectrum. When r=0, the quantization noise power spectrum has flat frequency characteristics, and when r=-1, it has frequency characteristics similar to the signal power spectrum. As the value of r, a value of -1<r<0 is normally used. Noise shaping is a method for changing the frequency characteristics of the quantized noise power spectrum depending on the value of r, according to Reference 1.
It is called. Returning to FIG. 1a, the DCT coefficient adaptive quantization encoding circuit 110 inputs the quantization step size Δ(k) and the number of allocated bits b(k) from the bit allocation quantization step size calculation circuit 100. , using these values, the DCT coefficient Vc(k)
A multiplexer 120 adaptively quantizes and encodes
Output to. Side information encoder 130 encodes the quantization parameter value sequence from quantizer 60 as side information and outputs it to multiplexer 120. Multiplexer 120 is side information encoder 1
30 and the output code of the DCT coefficient adaptive quantization coding circuit 110 are transmitted to the receiving side through the transmitting side output terminal 140 every M sample times.

次に受信側の動作を説明する。受信側では、第
１図ｂに示すように受信側入力端子１５０を通し
て伝送された符号系列を受信し、デマルチプレク
サ１６０は受信符号系列からDCT係数を表わす
符号系列とサイド情報を表わす符号系列とを分離
し、前者をDCT係数適応型復号回路２００へ出
力し、後者をサイド情報復号器１７０へ出力す
る。サイド情報復号器１７０は入力された符号系
列を復号しピツチ周期P′及びピツチゲインP_G′と
Ｋパラメータ値系列とを分離し、Ｋパラメータ値
系列をパラメー変換回路１７５へ出力しP′及び
P_G′をスペクトラム再生回路１８０へ出力する。
ここで、パラメータ変換回路１７５及びスペクト
ラム再生回路１８０は、送信側におけるパラメー
タ変換回路８０及びスペクトラム再生回路９０と
同一の動作をするので説明を省略する。また、ビ
ツト割り当て量子化ステツプサイズ計算回路１９
０は送信側におけるビツト割り当て量子化ステツ
プサイズ計算回路１００と同一の動作をする。
DCT係数適応型復号回路２００は、ビツト割り
当て量子化ステツプサイズ計算回路１９０から出
力される量子化ステツプサイズΔ（ｋ）と割り当
てビツト数ｂ（ｋ）とを用いて、デマルチプレク
サ１６０から入力した符号系列を適応的に復号化
しDCT係数を得た後に逆DCT（IDCT）回路２１
０へ出力する。IDCT回路２１０はＭサンプルの
DCT係数を入力し、これらに逆コサイン変換を
施し再生音声信号系列ｘ（ｎ）（０≦ｋ≦Ｍ−１）
を得る。バツフアメモリ回路２２０は、Ｍサンプ
ルのｘ（ｎ）を一旦蓄積した後に受信側出力端子
２３０を通して出力する。 Next, the operation on the receiving side will be explained. On the receiving side, the code sequence transmitted through the receiving side input terminal 150 is received as shown in FIG. The former is output to the DCT coefficient adaptive decoding circuit 200, and the latter is output to the side information decoder 170. The side information decoder 170 decodes the input code sequence, separates the pitch period P', pitch gain P _G ', and K parameter value sequence, outputs the K parameter value sequence to the parameter conversion circuit 175, and outputs the K parameter value sequence to the parameter conversion circuit 175.
P _G ' is output to the spectrum regeneration circuit 180.
Here, the parameter conversion circuit 175 and the spectrum regeneration circuit 180 operate in the same manner as the parameter conversion circuit 80 and the spectrum regeneration circuit 90 on the transmitting side, so a description thereof will be omitted. Further, the bit allocation quantization step size calculation circuit 19
0 performs the same operation as the bit allocation quantization step size calculation circuit 100 on the transmitting side.
The DCT coefficient adaptive decoding circuit 200 uses the quantization step size Δ(k) output from the bit allocation quantization step size calculation circuit 190 and the allocated bit number b(k) to calculate the code input from the demultiplexer 160. After adaptively decoding the sequence and obtaining DCT coefficients, the inverse DCT (IDCT) circuit 21
Output to 0. The IDCT circuit 210 has M samples.
Input the DCT coefficients, perform inverse cosine transformation on them, and reproduce the audio signal sequence x(n) (0≦k≦M-1)
get. The buffer memory circuit 220 once stores M samples of x(n) and then outputs it through the receiving side output terminal 230.

以上説明した従来方式によれば、9.6kビツト／
秒前後の伝送レイトにおいては非常に良好な特性
を得ることができるが、より低い伝送レイト（例
えば7.2kビツト／秒）においては特性が劣化する
という欠点があつた。これは文献２に示した従来
方式においても同様である。 According to the conventional method explained above, 9.6k bits/
Although very good characteristics can be obtained at transmission rates of around 100 seconds, the disadvantage is that the characteristics deteriorate at lower transmission rates (for example, 7.2 kbits/second). This also applies to the conventional method shown in Document 2.

本発明の目的は、より低い伝送レイトにおいて
も良好な特性が得られ、かつより簡単な装置構成
によつて実現可能な適応型変換符号化方法とその
装置を提供するところにある。 SUMMARY OF THE INVENTION An object of the present invention is to provide an adaptive transform-coding method and a device thereof that can obtain good characteristics even at lower transmission rates and can be implemented with a simpler device configuration.

本発明は、音声信号を直交変換し直交変換系列
を適応的に量子化し伝送する適応型変換符号化方
式において、送信側ではデイジタル信号に変換さ
れた音声信号離散系列をフレームに分割し、前記
フレーム内の離散信号系列を第１の直交変換離散
系列に変換し、前記第１の直交変換離散系列のス
ペクトラム的特徴を表わすパラメータ系列を求
め、前記パラメータ系列から音源特性を表わすパ
ラメータ系列を求め、前記スペクトラム的特徴を
表わすパラメータ系列かまたは前記第１の直交変
換離散系列から声道特性を表わすパラメータ系列
とを求めると共に、前記音源特性を表わすパラメ
ータ系列を第２の直交変換離散系列に変換し、更
に前記第２の直交変換離散系列を適応的に量子化
した量子化信号系列を求め、前記量子化信号系列
と前記声道特性を表わすパラメータ系列とを組み
合わせて伝送し、受信側では伝送された信号系列
から前記量子化信号系列と前記声道特性を表わす
パラメータ系列とを分離し、前記声道特性を表わ
すパラメータ系列を用いて声道の応答特性を表わ
す信号系列を求め、前記量子化信号系列を適応的
に復合し、更に逆変換離散信号系列に変換し、前
記逆変換離散信号系列と前記声道の応答特性を表
わす信号系列とを用いて所定フレーム内の音声信
号離散系列を再生させることを特徴とするもので
ある。 The present invention provides an adaptive transform coding method that orthogonally transforms an audio signal, adaptively quantizes and transmits the orthogonal transform sequence, and the transmission side divides the audio signal discrete sequence converted into a digital signal into frames, and the converting a discrete signal sequence in a first orthogonal transform discrete series into a first orthogonal transform discrete series, determining a parameter series representing the spectral characteristics of the first orthogonally transforming discrete series, determining a parameter series representing sound source characteristics from the parameter series, Determining a parameter series representing spectral characteristics or a parameter series representing vocal tract characteristics from the first orthogonally transformed discrete series, converting the parameter series representing the sound source characteristics into a second orthogonally transformed discrete series, and further A quantized signal sequence is obtained by adaptively quantizing the second orthogonal transform discrete sequence, the quantized signal sequence and the parameter sequence representing the vocal tract characteristics are combined and transmitted, and the receiving side receives the transmitted signal. Separating the quantized signal sequence and the parameter sequence representing the vocal tract characteristics from the sequence, using the parameter sequence representing the vocal tract characteristics to obtain a signal sequence representing the response characteristics of the vocal tract, and converting the quantized signal sequence into adaptively decoding, further converting into an inversely transformed discrete signal sequence, and using the inversely transformed discrete signal sequence and a signal sequence representing the response characteristic of the vocal tract to reproduce a discrete audio signal sequence within a predetermined frame. This is a characteristic feature.

また、本発明は、離散的デイジタル信号に変換
された音声信号離散系列をフレームに分割する回
路と、前記フレーム内の離散信号系列を第１の直
交変換離散系列に変換する回路と、前記第１の直
交変換離散系列のスペクトラム的特徴を表わす第
１のスペクトラムパラメータ系列を発生させる回
路と、前記第１のスペクトラムパラメータ系列か
ら音源情報を含む第２のスペクトラムパラメータ
系列を求めかつ前記第１のスペクトラムパラメー
タかまたは前記第１の直交変換離散系列から声道
情報を含む第３のスペクトラムパラメータ系列と
を求める回路と、前記第３のスペクトラムパラメ
ータ系列を少ないビツト数で量子化するパラメー
タ量子化回路と、前記第１のスペクトラムパラメ
ータ系列から抽出して得られる情報を少ないビツ
ト数で量子化する量子化回路と、前記第２のスペ
クトラムパラメータ系列を用いて音源のピツチ構
造を含むスペクトラム情報に変換する回路と、前
記スペクトラム情報を用いて第２の直交変換離散
系列を計算する回路と、前記量子化回路からの量
子化値を復号化し前記第２の直交変換離散系列を
適応的に量子化するために必要な情報を適応的に
計算する回路と、前記量子化に必要な情報を用い
て前記第２の直交変換離散系列を適応的に量子化
した第４の信号系列を計算する適応量子化回路
と、前記第４の信号系列と前記パラメータ量子化
回路からの出力系列と前記量子化回路からの出力
情報とを組み合わせて伝送する回路とを有するこ
とを特徴とするものである。 The present invention also provides a circuit for dividing an audio signal discrete sequence converted into a discrete digital signal into frames, a circuit for converting the discrete signal sequence within the frame into a first orthogonal transform discrete sequence, and the first orthogonal transform discrete sequence. a circuit for generating a first spectral parameter series representing spectral characteristics of an orthogonally transformed discrete series; a circuit for generating a second spectral parameter series including sound source information from the first spectral parameter series; or a circuit for determining a third spectrum parameter series including vocal tract information from the first orthogonal transform discrete series; a parameter quantization circuit for quantizing the third spectrum parameter series with a small number of bits; a quantization circuit that quantizes information extracted from the first spectrum parameter series using a small number of bits; and a circuit that converts the second spectrum parameter series into spectrum information including the pitch structure of the sound source; a circuit for calculating a second orthogonal transform discrete sequence using the spectrum information; and a circuit necessary for decoding the quantized value from the quantization circuit and adaptively quantizing the second orthogonal transform discrete sequence. a circuit that adaptively calculates information; an adaptive quantization circuit that calculates a fourth signal sequence obtained by adaptively quantizing the second orthogonal transform discrete sequence using the information necessary for the quantization; The present invention is characterized by comprising a circuit that transmits a combination of a fourth signal sequence, an output sequence from the parameter quantization circuit, and output information from the quantization circuit.

さらに本発明は、伝送された信号系列から適応
量子化信号系列を表わす第５の信号系列と声道情
報を有す量子化パラメータ系列を表わす第６の信
号系列と音源情報を有すスペクトラムパラメータ
の一部を表わす量子化情報とを分離する回路と、
前記分離回路によつて分離された第６の信号系列
を用いて第７の信号系列を計算する回路と、前記
量子化情報を復号し前記第５の信号系列を適応的
に復号化するために必要な情報を適応的に計算す
る回路と、前記復号化に必要な情報を用いて前記
第５の信号系列を適応的に復号化した信号系列を
計算する回路と、前記復号化した信号系列を用い
て逆変換離散信号系列を表わす第８の信号系列を
計算する回路と、前記第８の信号系列と前記第７
の信号系列とを用いて所定フレーム内の再生音声
信号離散系列を計算する回路とを有することを特
徴とするものである。 Furthermore, the present invention provides a fifth signal sequence representing an adaptive quantization signal sequence, a sixth signal sequence representing a quantization parameter sequence having vocal tract information, and a spectral parameter having sound source information from the transmitted signal sequence. a circuit that separates quantized information representing a part;
a circuit for calculating a seventh signal sequence using the sixth signal sequence separated by the separation circuit; and a circuit for decoding the quantization information and adaptively decoding the fifth signal sequence. a circuit that adaptively calculates necessary information; a circuit that calculates a signal sequence that adaptively decodes the fifth signal sequence using the information necessary for the decoding; a circuit for calculating an eighth signal sequence representing an inversely transformed discrete signal sequence;
and a circuit for calculating a discrete sequence of reproduced audio signals within a predetermined frame by using the signal sequence of .

次に、本発明の実施例を図面を用いて詳細に説
明する。第４図ａ，ｂは、本発明の第１の実施例
を示すブロツク図である。本図において第１図と
同一の番号を付した構成要素は第１図と同一の動
作を行なうのでここでの説明は省略する。第４図
ａは、送信側のブロツク図を第４図ｂは受信側の
ブロツク図をそれぞれ示す。 Next, embodiments of the present invention will be described in detail using the drawings. FIGS. 4a and 4b are block diagrams showing a first embodiment of the present invention. Components in this figure that are given the same numbers as in FIG. 1 perform the same operations as in FIG. 1, so their explanations will be omitted here. FIG. 4a shows a block diagram of the transmitting side, and FIG. 4b shows a block diagram of the receiving side.

送信側において窓関数回路２０の出力値ｖ（ｎ）
（０≦ｎ≦Ｍ−１）は、DFT回路５１０へ出力さ
れる。５１０は2M点DFT計算回路であり次式に
従つて2M点DFT係数の計算を行なう。今、ｕ
（ｎ）を2M点のサンプル値系列とする。 On the transmitting side, the output value v(n) of the window function circuit 20
(0≦n≦M-1) is output to the DFT circuit 510. 510 is a 2M point DFT calculation circuit which calculates 2M point DFT coefficients according to the following equation. Now u
Let (n) be a sample value series of 2M points.

ｕ(n)＝ｖ（ｎ）（０≦ｋ≦Ｍ−１）Ｏ（Ｍ≦ｎ≦2M−１） −(8) 2M点DFT係数をＵ（ｋ）とすると、Ｕ(k)＝_2M-1 〓ⁿ⁼⁰ ｕ(n)EXP（−j2πkn／2M） −(9) （０≦ｋ≦2M−１）Ｕ（ｋ）＝Re（Ｕ（ｋ））＋gIm（Ｕ（ｋ））−（10
）ここでRe（Ｕ）及びIm（Ｕ）は実数部、虚数部
をそれぞれ表わす。Re（Ｕ（ｋ））及びIm（Ｕ
（ｋ））は絶対値位相計算回路５２０へ出力され
る。絶対値位相計算回路５２０は次式に従つて絶
対値及び位相の計算をする。u(n)=v(n) (0≦k≦M−1) O (M≦n≦2M−1) −(8) If the 2M point DFT coefficient is U(k), then U(k)= _{2M -1} 〓 ⁿ⁼⁰ u(n)EXP(−j2πkn/2M) −(9) (0≦k≦2M−1) U(k)=Re(U(k))+gIm(U(k))− (Ten
) Here, Re(U) and Im(U) represent the real part and imaginary part, respectively. Re(U(k)) and Im(U
(k)) is output to the absolute value phase calculation circuit 520. The absolute value phase calculation circuit 520 calculates the absolute value and phase according to the following equation.

｜Ｕ（ｋ）｜＝√² _e（（））＋² _n（（））−（11
） arg（Ｕ（ｋ））＝tan^-1（Im（Ｕ（ｋ））／Re（Ｕ（
ｋ））ー（12）ここで｜Ｕ（ｋ）｜及びarg（Ｕ（ｋ））はＵ（ｋ）
の絶対値及び位相を示す。｜Ｕ（ｋ）｜は対数計算
回路３２０へ出力され対数計算が行なわれる。
IDFT回路３３０は前記対数計算回路３２０の出
力値に対してIDFTを計算しケプストラムCp（ｎ）
をリフタ回路５３０へ出力する。ここでケプスト
ラムCp（ｎ）は、音声信号のスペクトラム構造及
びピツチ構造を表すスペクトラムパラメータの一
つとしてよく知られているが、その詳細な説明は
エィ・ブイ・オツペンハイム（Ａ・Ｖ，
OPPENHEIM）氏らによるプロシーデイング
ズ・オブ・デイ・アイ・イー・イー・イー
（PROCEEDINGS OF THE IEEE）誌の1968年
８月号1264〜1291頁に掲載の「ノンリニアー・フ
イルタリング・オブ・マルチプライド・アンド・
コンボルブド・シグナルズ」（“Nonlinear
Filtering of Multiplied and Convolved
Signals.”）と題した論文（文献３）、あるいは、
デイー・ジー・チルダーズ（D.G.CHILDERS）
氏らによるプロシーデイングズ・オブ・デイ・ア
イ・イー・イー・イ−（PROCEEDINGS OF
THE IEEE）誌の1977年10月号1428〜1443頁に
掲載の「ザ・ケプストラム：ア・ガイド・トウ
ー・プロセシング」（“The Cepstrum：Ａ
Guide to Processing.”）と題した論文（文献４）
等に述べられているので、ここでは省略する。リ
フタ回路５３０はケプストラムCp（ｎ）に対して
あらかじめ定められた窓関数を乗じ、ケプストラ
ムの低時間領域と高時間領域とを分離して抽出し
低時間領域を量子化回路５３０へ出力し、高時間
領域をDFT回路５３５とピツチ抽出量子化回路
５２５へ出力する。ピツチ抽出量子化回路５２５
はピツチ周期ＰとピツチゲインP_Gを検出しこれ
らをあらかじめ定められた量子化ビツト数で量子
化しP′及びP_G′を得てP′及びP_G′をピツチ信号発
生回路５５０及びマルチプレクサ６０５へ出力す
る。量子化回路５３０は低時間領域ケプストラム
を入力しあらかじめ定められた量子化ビツト数で
量子化し、量子化値をマルチプレクサ４２０へ出
力する。ここで量子化はケプストラムを直接量子
化してもよいし、前記従来方式で説明したように
他のパラメータに変換してから量子化してもよ
い。なお、ケプストラムの低時間部は、原音声信
号のスペクトラム包絡情報を含むことが知られて
いる。DFT回路５３５は、リフタ回路５３０よ
り出力された高時間領域ケプストラムを入力しこ
れに対して2M点のDFTの計算を行なう。本回路
においては、Ｍ点の入力信号系列を偶対称化して
2M点系列とし、その2M点系列に対して2M点
DFT計算を行なう。なおDFT計算には前記(9)式
を用いる。指数計算回路３７０はDFT回路５３
５の出力に対して指数計算を行なう。ここで指数
計算回路３７０の出力値Ｅ（ｋ）は音源のピツチ
構造を表わすスペクトラムである。これは、よく
知られているように高時間領域のケプストラムが
原音声信号の音源に関する情報を含んでいるから
である。指数計算回路３７０の出力値は前記絶対
値位相計算回路５２０の位相出力とともに実部、
虚部計算回路５４０へ出力される。実部、虚部計
算回路５４０は、Ｅ（ｋ）及び位相を用いて実数
部、虚数部を次式に従つて計算する。 |U(k)| =√ ² _e (()) + ² _n (()) − (11
) arg(U(k))=tan ^-1 (Im(U(k))/Re(U(
k)) - (12) where |U(k)| and arg(U(k)) is U(k)
shows the absolute value and phase of |U(k)| is output to the logarithm calculation circuit 320 and logarithm calculation is performed.
The IDFT circuit 330 calculates the IDFT for the output value of the logarithm calculation circuit 320 and calculates the cepstrum Cp(n).
is output to the lifter circuit 530. Here, the cepstrum Cp(n) is well known as one of the spectral parameters representing the spectral structure and pitch structure of an audio signal, but its detailed explanation can be found in A.V.
``Nonlinear Filtering of Multiply,'' published in the August 1968 issue of PROCEEDINGS OF THE IEEE, pp. 1264-1291, by Mr. OPPENHEIM et al. ·and·
“Convolved Signals” (“Nonlinear
Filtering of Multiplied and Convolved
”) (Reference 3), or
DGCHILDERS
PROCEEDINGS OF D.I.E.E.
“The Cepstrum: A Guide to Processing” published in the October 1977 issue of THE IEEE, pages 1428-1443.
A paper entitled “Guide to Processing.” (Reference 4)
etc., so it will be omitted here. The lifter circuit 530 multiplies the cepstrum Cp(n) by a predetermined window function, separates and extracts the low time region and high time region of the cepstrum, outputs the low time region to the quantization circuit 530, and outputs the low time region to the quantization circuit 530. The time domain is output to the DFT circuit 535 and the pitch extraction quantization circuit 525. Pitch extraction quantization circuit 525
detects pitch period P and pitch gain P _G , quantizes them with a predetermined number of quantization bits to obtain P' and P _G ', and outputs P' and P _G ' to pitch signal generation circuit 550 and multiplexer 605. do. The quantization circuit 530 inputs the low time domain cepstrum, quantizes it by a predetermined number of quantization bits, and outputs the quantized value to the multiplexer 420. Here, the quantization may be performed by directly quantizing the cepstrum, or by converting it into other parameters and then quantizing it as described in the conventional method. Note that it is known that the low time portion of the cepstrum includes spectral envelope information of the original audio signal. The DFT circuit 535 receives the high time domain cepstrum output from the lifter circuit 530 and performs a 2M-point DFT calculation on it. In this circuit, the input signal sequence at point M is made even symmetrical.
2M point series, 2M points for that 2M point series
Perform DFT calculation. Note that the above equation (9) is used for the DFT calculation. The index calculation circuit 370 is a DFT circuit 53
Exponential calculation is performed on the output of 5. Here, the output value E(k) of the index calculation circuit 370 is a spectrum representing the pitch structure of the sound source. This is because, as is well known, the cepstrum in the high time domain contains information regarding the source of the original audio signal. The output value of the exponent calculation circuit 370, together with the phase output of the absolute value phase calculation circuit 520, has a real part,
It is output to the imaginary part calculation circuit 540. The real part and imaginary part calculation circuit 540 calculates the real part and imaginary part using E(k) and the phase according to the following equation.

Re（Ｅ（ｋ））＝Ｅ（ｋ） cos（arg（Ｕ（ｋ） −（13） Im（Ｅ（ｋ））＝Ｅ（ｋ） sin（arg（Ｕ（ｋ） −（14）ここでRe（Ｅ）はＥの実数部を、Im（Ｅ）はＥ
の虚数部を表わす。Re（Ｅ（ｋ））及びIm（Ｅ
（ｋ））は、DCT変換回路５９０へ出力される。
DCT変換回路５９０は、次式に従つて、DFT係
数の実数部及び虚数部からDCT係数Vc（ｋ）を
計算する。 Re(E(k))=E(k) cos(arg(U(k) −(13) Im(E(k))=E(k) sin(arg(U(k) −(14) where Re(E) is the real part of E, Im(E) is E
represents the imaginary part of Re(E(k)) and Im(E
(k)) is output to the DCT conversion circuit 590.
The DCT conversion circuit 590 calculates the DCT coefficient Vc(k) from the real part and imaginary part of the DFT coefficient according to the following equation.

Vc（ｋ）＝Ｃ（ｋ）（Re（Ｅ（ｋ））cos （2πk／4N）＋Im（Ｅ（ｋ））sin（2πk／4N）−
（15）（０≦ｋ≦Ｍ−１）Ｍ点のDCT係数Vc（ｋ）は適応型量子化器６
００へ出力される。 Vc(k)=C(k)(Re(E(k))cos(2πk/4N)+Im(E(k))sin(2πk/4N)−
(15) (0≦k≦M−1) The DCT coefficient Vc(k) at M point is the adaptive quantizer 6
Output to 00.

Ｍ点DCT係数Vc（ｋ）は、それ自身のスペク
トラムパワーに応じて適応型量子化器６００によ
り適応的に量子化される。スペクトラムパワーに
応じた適応的な量子化は、前記従来方式とほぼ同
一の方法に従つて行なわれる。次に、この適応的
量子化法について説明する。Ｍ点DCT係数系列
Vc（ｋ）は、ピツチ構造と位相から成る系列であ
り、適応的に、ビツト割り当て及び量子化を行な
うためには｜Vc（ｋ）｜を推定する必要がある。
本実施例においては、破線部で囲んだ回路５７５
によつて｜Vc（ｋ）｜の推定値V^c（ｋ）を計算し
ている。V^c（ｋ）計算に必要な回路を順に説明す
る。ピツチ信号発生回路５５０は、ピツチ抽出量
子化回路５２５の出力値P′及びP_G′ を入力し、まずこれらを復号化し及び_Gを
得る。次に及びP_Gを用いてピツチ信号ｅ（ｎ）
を発生させる。これはケプストラム領域で行なつ
てもよいしあるいは、線形（Linear）領域で行
なつてもよい。なお、線形領域で行なつた場合に
は後に説明する指数計算回路３７０を省略するこ
とができる。ピツチ信号ｅ（ｎ）はDFT回路５３
５へ出力される。DFT回路５３５は2M点DFT
計算を行ない指数計算回路３７０へ出力する。指
数計算回路３７０は指数計算DFT回路５３５の
出力値に対して指数計算を行ないDCT変換回路
５６０へ出力する。DCT変換回路５６０は、よ
く知られているように下式に従つてDFT係数か
らDCT係数への変換をする。 The M-point DCT coefficients Vc(k) are adaptively quantized by an adaptive quantizer 600 according to their own spectral power. Adaptive quantization according to spectral power is performed in substantially the same manner as the conventional method. Next, this adaptive quantization method will be explained. M-point DCT coefficient series
Vc(k) is a sequence consisting of pitch structure and phase, and it is necessary to estimate |Vc(k)| in order to adaptively perform bit allocation and quantization.
In this embodiment, a circuit 575 surrounded by a broken line
The estimated value V^c(k) of |Vc(k)| is calculated by The circuits required for V^c(k) calculation will be explained in order. The pitch signal generation circuit 550 inputs the output values P' and P _G ' of the pitch extraction and quantization circuit 525, and first decodes them to obtain _G. Then, the pitch signal e(n) using and P _G
to occur. This may be done in the cepstral domain or in the linear domain. Note that when the calculation is performed in a linear region, the exponent calculation circuit 370, which will be described later, can be omitted. The pitch signal e(n) is the DFT circuit 53
5. DFT circuit 535 is 2M point DFT
The calculation is performed and output to the index calculation circuit 370. The exponent calculation circuit 370 performs exponent calculation on the output value of the exponent calculation DFT circuit 535 and outputs it to the DCT conversion circuit 560. As is well known, the DCT conversion circuit 560 converts DFT coefficients into DCT coefficients according to the following equation.

Vc（ｋ）＝EXP（−j2πk／4N）・Ｕ（ｋ） −（16）（０≦ｋ≦Ｍ−１）なお、この変換式は、ジエー・マクホウル
（Ｊ・MAKHOUL）氏によるアイ・イー・イ
ー・イー・トランザクツシヨンズ・オン・アコー
ステイツクス・スピーチ・アンド・シグナル・プ
ロセシング（IEEE TRANSACTIONS ON
ACOSTICS，SPEECH，AND SIGNAL
RROCESSING）誌の1980年２目号、27〜34頁に
掲載の「ア・フアースト・コサイン・トランスフ
オーム・イン・ワン・アンド・トウー・デイメン
シヨンズ」（“Ａ Fast Cosine Transform in
One and Two Dimensions.”）と題した論文
（文献５）の（９，ｂ）式と対応している。ここ
では虚数部は０である。レベル調整回路はDCT
変換回路５６０の出力値σ（ｋ）及び前述した
DCT変換回路５９０の出力値Vc（ｋ）を入力し、
σ（ｋ）のレベル調整を次式に従つて行なう。 Vc(k)=EXP(−j2πk/4N)・U(k)−(16) (0≦k≦M−1) This conversion formula was written by J. MAKHOUL.・IEEE TRANSACTIONS ON Acoustics Speech and Signal Processing
ACOSTICS, SPEECH, AND SIGNAL
``A Fast Cosine Transform in One and Two Dimensions'' published in 1980 issue 2 of RROCESSING magazine, pages 27-34.
This corresponds to equation (9, b) in the paper entitled “One and Two Dimensions.” (Reference 5). Here, the imaginary part is 0. The level adjustment circuit is a DCT
The output value σ(k) of the conversion circuit 560 and the above-mentioned
Input the output value Vc(k) of the DCT conversion circuit 590,
The level adjustment of σ(k) is performed according to the following equation.

Vc（ｋ）＝Ｇ・σ（ｋ） −（18）ここでＧは、レベル調整係数である。また、
V^c（ｋ）は、レベル調整後の｜Vc（ｋ）｜の推定
値である。なお、ここではＭ点の平均電力を用い
てレベル調整を行なつたが他の方法を用いてもよ
い。レベル調整回路５７０はレベル調整係数Ｇを
量子化器５８０へ、推定値V^c（ｋ）をビツト割り
当てステツプサイズ計算回路５８５へそれぞれ出
力する。ビツト割り当てステツプサイズ計算回路
５８５は、前記(5)，(6)及び(7)式に従つてVc（ｋ）
に対する割り当てビツト数を計算する。但し、本
実施例では、ノイズ・シエイピングを考慮してい
ないのでＷ（ｋ）＝１としなくてはならない。ま
た、量子化ステツプサイズの計算には前記(4)式を
用いる。なお、前記(4)及び(5)式を適用する場合
は、σ（ｋ）のかわりにV^c（ｋ）を代入しなけれ
ばならない。適応型量子化器６００は、ビツト割
り当てステツプサイズ計算回路５８５から出力さ
れる割り当てビツト数ｂ（ｋ）及び量子化ステツ
プサイズΔ（ｋ）を用いて、DCT係数Vc（ｋ）を
適応的に量子化し、量子化値をマルチプレクサ６
０５へ出力する。量子化器５８０は、レベル調整
係数Ｇをあらかじめ定められた量子化ビツト数で
量子化し、G′を得てこれをマルチプレクサ６０
５へ出力する。マルチプレクサ６０５は、マルチ
プレクサ入力情報を組み合わせて一定フレーム時
間毎に出力端子１４０を通して受信側へ伝送す
る。 Vc(k)=G·σ(k)−(18) Here, G is a level adjustment coefficient. Also,
V^c(k) is the estimated value of |Vc(k)| after level adjustment. Although the level adjustment is performed here using the average power of M points, other methods may be used. The level adjustment circuit 570 outputs the level adjustment coefficient G to the quantizer 580 and the estimated value V^c(k) to the bit allocation step size calculation circuit 585, respectively. The bit allocation step size calculation circuit 585 calculates Vc(k) according to equations (5), (6), and (7) above.
Calculate the number of bits allocated to However, in this embodiment, since noise shaping is not considered, W(k) must be set to 1. Furthermore, the above equation (4) is used to calculate the quantization step size. Note that when applying equations (4) and (5) above, V^c(k) must be substituted in place of σ(k). The adaptive quantizer 600 adaptively quantizes the DCT coefficient Vc(k) using the allocated bit number b(k) and the quantization step size Δ(k) output from the bit allocation step size calculation circuit 585. and send the quantized value to multiplexer 6
Output to 05. The quantizer 580 quantizes the level adjustment coefficient G by a predetermined number of quantization bits to obtain G', which is sent to the multiplexer 60.
Output to 5. The multiplexer 605 combines the multiplexer input information and transmits the combined information to the receiving side through the output terminal 140 every fixed frame time.

受信側においては、デマルチプレクサ６０６
は、一定フレーム時間毎に受信側入力端子１５０
を通して情報を受信し、これらを分離し各々の回
路へ出力する。復号器６３０は、低時間領域をケ
プストラムの量子化値を入力し、これを復号化し
DFT回路６４０へ出力する。DFT回路６４０
は、入力信号系列に０を補充してＭ点系列とし、
さらに偶対称化し2M点系列とした後に、2M点
DFT計算を行なう。指数計算回路６５０は、
DFT回路６４０の2M点出力系列のＭ点分に対し
て指数計算を行ない計算結果を偶対称化し2M点
系列Ｖ（ｋ）とした後に掛け算回路６９０へ出力
する。次に、ピツチ信号発生回路６２５、DFT
回路６４５、指数計算回路６５５及びDCT変換
回路６６０は、送信側におけるピツチ信号発生回
路５５０、DFT回路５３５、指数計算回路３７
０及びDCT変換回路５６０と同一の動作を行な
うので説明を略する。復号器６２０は、量子化さ
れたレベル係数G′を入力し、これを復号化しＧ
をレベル調整回路６７０へ出力する。レベル調整
回路６７０は、前記（18）式に従つてσ（ｋ）の
レベル調整を行ない、V^c（ｋ）をビツト割り当て
ステツプサイズ計算回路６１５へ出力する。ビツ
ト割り当てステツプサイズ計算回路６１５は送信
側におけるビツト割り当てステツプサイズ計算回
路５８５と同一の動作を行ない、割り当てビツト
数ｂ（ｋ）及び量子化ステツプサイズΔ（ｋ）を適
応型復号器６１０へ出力する。適応型復号器６１
０は、ｂ（ｋ）及びΔ（ｋ）を用いて、送信側にお
ける適応型量子化器６００と逆の動作を行ない、
Vc（ｋ）を復号する。Vc（ｋ）は、IDCT変換回
路６８５へ入力される。IDCT変換回路６８５
は、送信側におけるDCT変換回路５９０と逆の
動作を行ないDCT係数系列Vc（ｋ）からＭ点
DFT係数系列を計算し偶対称化し2M点系列Ｇ
（ｋ）を得た後に掛け算器６９０へ出力する。掛
け算器６９０は、2M点系列Ｇ（ｋ）とＶ（ｋ）と
を入力し、掛け算（Ｇ（ｋ）・Ｖ（ｋ））を計算し、
２Ｍ点系列（ｋ）を得てこれをIDFT計算回路
６８０へ出力する。IDFT計算回路６８０は、、
次式に従い2M点IDFT計算をする。 On the receiving side, a demultiplexer 606
is the receiving side input terminal 150 every fixed frame time.
It receives information through the circuit, separates it, and outputs it to each circuit. The decoder 630 inputs the quantized value of the cepstrum in the low time domain and decodes it.
Output to DFT circuit 640. DFT circuit 640
is an M-point series by supplementing the input signal series with 0,
After further making it even symmetrical and making it a 2M point series, 2M points
Perform DFT calculation. The index calculation circuit 650 is
Exponential calculation is performed on M points of the 2M point output series of the DFT circuit 640, and the calculation result is made even symmetrical to form a 2M point series V(k), which is then output to the multiplication circuit 690. Next, pitch signal generation circuit 625, DFT
The circuit 645, the index calculation circuit 655, and the DCT conversion circuit 660 are the pitch signal generation circuit 550, the DFT circuit 535, and the index calculation circuit 37 on the transmitting side.
0 and DCT conversion circuit 560, so the explanation will be omitted. The decoder 620 inputs the quantized level coefficient G' and decodes it into G
is output to the level adjustment circuit 670. Level adjustment circuit 670 adjusts the level of σ(k) according to equation (18), and outputs V^c(k) to bit allocation step size calculation circuit 615. The bit allocation step size calculation circuit 615 performs the same operation as the bit allocation step size calculation circuit 585 on the transmitting side, and outputs the allocated bit number b(k) and quantization step size Δ(k) to the adaptive decoder 610. . Adaptive decoder 61
0 performs the opposite operation to the adaptive quantizer 600 on the transmitting side using b(k) and Δ(k),
Decode Vc(k). Vc(k) is input to the IDCT conversion circuit 685. IDCT conversion circuit 685
performs the opposite operation to the DCT conversion circuit 590 on the transmitting side and converts the DCT coefficient series Vc(k) to the M point.
Calculate the DFT coefficient series and make it even symmetrical to generate a 2M point series G
After obtaining (k), it is output to multiplier 690. The multiplier 690 inputs the 2M point series G(k) and V(k), calculates the multiplication (G(k)·V(k)),
A 2M point series (k) is obtained and output to the IDFT calculation circuit 680. The IDFT calculation circuit 680 is,
Calculate 2M point IDFT according to the following formula.

(n)＝_2M-1 〓^k=0 Ｘ(k)EXP（j2πkn／2M） −(19) （０≦ｋ≦2M−１）ｘ（ｎ）は再生音声信号系列を表わし、2M点系
列である。重ねあわせた回路６９５は、2M点系
列ｘ（ｎ）を入力しこのうちのＭ点を前フレーム
のＭ点系列と重ね合わせる。ここで、重ねあわせ
法に関しては重畳加算法（overlap−add）及び
重畳保留法（overlap−save）がよく知られてい
るが、ここではどちらの方法を用いてもよい。な
お、重畳加算法及び重畳保留法に関してはプレン
テイス−ホール（PRENTICE−HALL）社から
1975年に発行された「セオリイ・アンド・アプリ
ケーシヨン・オブ・デイジタル・シグナル・プロ
セシング」（“Theory and Application of
Digital Signal Processing。」と題した単行本の
63〜67頁に説明されているので、ここでは説明を
省略する。重の合わせ回路６９５の出力値系列
は、バツフアメモリ回路２２０へ出力され一旦記
憶された後に出力端子２３０を通して出力され
る。(n)= _2M-1 〓 ^k=0 X(k)EXP(j2πkn/2M) −(19) (0≦k≦2M-1) be. The superimposition circuit 695 receives the 2M point series x(n) and superimposes M points thereon with the M point series of the previous frame. Here, regarding the overlapping method, the overlapping addition method (overlap-add) and the overlapping-save method (overlap-save) are well known, but either method may be used here. Regarding the superimposed addition method and superimposed retention method, please refer to PRENTICE-HALL Co., Ltd.
“Theory and Application of Digital Signal Processing,” published in 1975.
Digital Signal Processing. A book entitled ``
Since it is explained on pages 63 to 67, the explanation will be omitted here. The output value series of the superposition circuit 695 is outputted to the buffer memory circuit 220 and once stored, and then outputted through the output terminal 230.

本実施例の構成によれば、原音声信号のスペク
トラム包絡情報は、低時間領域ケプストラムをサ
イド情報として用いて少ないビツト数（例えば２
〜3Kビツト／秒程度）で伝送する。更に、音源
情報のみをDCT係数に変換し適応的なビツト割
り当て及び量子化を施しているので、従来の音源
情報とスペクトラム包絡情報の双方を伝送してい
る方式と比較して伝送情報量を低減させることが
可能である。また、原音声信号のスペクトラム構
造及びピツチ構造を表わすスペクトラムパラメー
タとしてケプストラムを用いているので、ケプス
トラム領域上で、音源情報とスプストラム包絡情
報の分離が容易である。更に、有声／無声判別も
容易であり、もしあるフレームの音声信号が有声
音であれば、音源情報のDCT係数は周期的なス
ペクトラム構造を有しているので、DCT係数の
伝送の際に低周波数帯域（例えば1kHz以下）の
みを伝送し、受信側において伝送されなかつた周
波数帯域を補間するようにしてもよい。このよう
にすることによつて伝送情報量をさらに低減させ
ることが可能となる。 According to the configuration of this embodiment, the spectral envelope information of the original audio signal is obtained using a small number of bits (for example, 2 bits) using the low time domain cepstrum as side information.
~3K bits/second). Furthermore, since only sound source information is converted into DCT coefficients and adaptive bit allocation and quantization are applied, the amount of transmitted information is reduced compared to conventional methods that transmit both sound source information and spectrum envelope information. It is possible to do so. Furthermore, since the cepstrum is used as a spectrum parameter representing the spectral structure and pitch structure of the original audio signal, it is easy to separate the sound source information and the spstrum envelope information on the cepstrum domain. Furthermore, voiced/unvoiced discrimination is easy; if the audio signal of a certain frame is voiced, the DCT coefficients of the sound source information have a periodic spectral structure, so when transmitting the DCT coefficients, low It is also possible to transmit only a frequency band (eg, 1 kHz or less) and interpolate the untransmitted frequency band on the receiving side. By doing so, it becomes possible to further reduce the amount of transmitted information.

なお、本実施例においては、DCT係数量子化
の際に、ノイズ・シエイピングを考慮していない
が、これを施すこともできる。そのようにするた
めには、送、受信側においてはDFT計算回路を
さらに１回路ずつ必要とする。 Note that in this embodiment, noise shaping is not taken into account when quantizing DCT coefficients, but noise shaping can also be applied. In order to do this, one additional DFT calculation circuit is required on each of the transmitting and receiving sides.

本実施例の受信側では、音源情報を表わす
DFT系数系列Ｇ（ｋ）と声道のインパルス応答情
報を表わすDFT係数系列Ｖ（ｋ）とを用いて再生
音声信号系列（ｎ）を求める方法として、掛け
算（掛け算回路６９０）とIDFT計算（IDFT計
算回路６８０）とを用いた方法によつたが、他の
方法を用いて（ｎ）を求めてもよい。その一つ
の方法は、よく知られているようにケプストラム
を係数とする対数伝達特性を有するフイルタを構
成し、このフイルタをＧ（ｋ）をIDFTして得ら
れるｇ（ｎ）によつて励振し、（ｎ）を得る。
この方法の詳細な説明は、今井氏らによる電子通
信学会総合全国大会予稿集昭和52年、５−321頁
に掲載の「対数伝達特性近似デイジタルフイルタ
による音声の合成」と題した論文等に述べられて
いるので、ここでは省略する。この方法によれば
受信側においてDFT回路６４０、指数計算回路
６５０が不要となり演算量を大幅に低減させるこ
とができる。 On the receiving side of this embodiment, the sound source information is
Methods for obtaining the reproduced audio signal sequence (n) using the DFT coefficient sequence G(k) and the DFT coefficient sequence V(k) representing impulse response information of the vocal tract include multiplication (multiplication circuit 690) and IDFT calculation (IDFT (n) may be obtained using other methods. One method is to construct a filter with logarithmic transfer characteristics using the cepstrum as a coefficient, and to excite this filter with g(n) obtained by performing IDFT on G(k), as is well known. , (n) is obtained.
A detailed explanation of this method is given in the paper entitled "Speech synthesis using a digital filter approximating logarithmic transfer characteristics," published in Proceedings of the National Conference of the Institute of Electronics and Communication Engineers, 1978, pages 5-321, by Mr. Imai et al. , so it is omitted here. According to this method, the DFT circuit 640 and the exponent calculation circuit 650 are not required on the receiving side, and the amount of calculation can be significantly reduced.

なお、本実施例では原音声信号のスペクトラム
包絡情報をケプストラムの低時間領域から求めた
が、これは他の方法によつてもよい。一例として
は、第４図において絶対値位相計算回路５２０の
絶対値出力の２乗値から自己相関関数して変換し
自己相関関数からよく知られている手法でＫパラ
メータ値系列に変換し、Ｋパラメータ値系列から
スペクトラム包絡情報を求めるようにしてもよ
い。またこのようにした場合には受信側において
再生音声信号系列（ｎ）を求める際に、Ｋパラ
メータ値系列を用いて合成フイルタを構成し、Ｇ
（ｋ）をIDFTして得られるｇ（ｎ）によつてこの
合成フイルタを励振し（ｎ）を求めるようにし
てもよい。このようにした場合にもDFT回路６
４０及び指数計算回路６５０とは不要となる。 In this embodiment, the spectral envelope information of the original audio signal was obtained from the low time domain of the cepstrum, but this may be done using other methods. As an example, in FIG. 4, the square value of the absolute value output of the absolute value phase calculation circuit 520 is converted into an autocorrelation function, and the autocorrelation function is converted into a K parameter value series using a well-known method. Spectrum envelope information may be obtained from the parameter value series. In addition, in this case, when obtaining the reproduced audio signal sequence (n) on the receiving side, a synthesis filter is configured using the K parameter value sequence, and the G
This synthesis filter may be excited by g(n) obtained by performing IDFT on (k) to obtain (n). Even in this case, the DFT circuit 6
40 and the index calculation circuit 650 are no longer necessary.

第５図ａ，ｂは、本発明の２の実施例を示すブ
ロツク図である。本実施例において第４図と同一
の番号を付した構成要素は、第４図と同一の動作
を行なうので、ここでは説明を省略する。第５図
ａは送信側のブロツク図を、第５図ｂは受信側の
ブロツク図をそれぞれ示す。送信側において、絶
対値位相計算回路５２０の出力値である絶対値と
位相は複素対数計算回路７００へ入力される。複
素対数計算回路７００は絶対値と位相の各々に対
して対数を計算しIDFT回路７１０へ出力する。
IDFT回路７１０はまず最初にRe入力に対しては
偶対称化し2M点系列とし、またIm入力に対して
は奇対称化し2M点系列とする。その後に2M点
IDFT計算を行ない、結果の2M点系列xp（ｎ）を
リフタ回路７２０へ出力する。この場合のIDFT
回路７１０の出力例xp（ｎ）は、複素ケプストラ
ムと呼ばれており、この詳細な説明は、前記文献
３及び４に述べられているのでここでは省略す
る。一般に、複素ケプストラムは原音声信号系列
の振幅と位相情報を含むことが知られている。さ
らに、ケプストラムと同様に、複素ケプストラム
においてもその低時間領域はスペクトラム包絡情
報を、また、その高時間領域は音源情報を含むこ
とがよく知られている。リフタ回路７２０は、窓
関数を乗じ複素ケプストラムの低時間領域と高時
間領域とを分離し、低時間領域を量子化器７６０
へ出力し、高時間領域をピツチ抽出量子化回路７
３０とDFT回路７４０へ出力する。今、第６図
ａ，ｂに複素ケプストラムxp（ｎ）とリフタ回路
７２０の窓関数の一例をそれぞれ示す。いずれも
横軸は間を縦軸は振幅を表わす。第５図ａにおい
て、量子化器７６０は、複素ケプストラムXp
（ｎ）の低時間領域をあらかじめ定められた量子
化ビツト数で量子化し、量子化値をマルチプレク
サ６０５へ出力する。ピツチ抽出量子化回路７３
０は、複素ケプストラムxp（ｎ）の片方向（正あ
るいは負方向）の高時間領域部を用いてピツチ抽
出をし、ピツチ周期Ｐ及びピツチゲインP_Gを求
める。さらに、これをあらかじめ定められた量子
化ビツト数及び量子化ステツプサイズを用いて量
子化し、P′及びP_G′を得て、これらをピツチ信号
発生回路５５０とマルチプレクサ６０５へ出力す
る。ピツチ信号発生回路５５０、DFT回路５３
５、指数計算回路３７０、DCT変換回路５６０、
レベル調整回路５７０、量子化器５８０、ビツト
割り当てステツプサイズ計算回路５８５は第４図
ａの同一番号の構成要素と同一動作をする。 FIGS. 5a and 5b are block diagrams showing a second embodiment of the present invention. In this embodiment, the components given the same numbers as in FIG. 4 perform the same operations as in FIG. 4, and therefore their explanations will be omitted here. FIG. 5a shows a block diagram of the transmitting side, and FIG. 5b shows a block diagram of the receiving side. On the transmitting side, the absolute value and phase, which are the output values of the absolute value phase calculation circuit 520, are input to the complex logarithm calculation circuit 700. The complex logarithm calculation circuit 700 calculates logarithms for each of the absolute value and phase and outputs the logarithms to the IDFT circuit 710.
The IDFT circuit 710 first makes the Re input even symmetrical and makes it a 2M point series, and the Im input makes it odd symmetrical and makes it a 2M point series. Then 2M points
IDFT calculation is performed and the resulting 2M point series xp(n) is output to the lifter circuit 720. IDFT in this case
The output example xp(n) of the circuit 710 is called a complex cepstrum, and detailed explanation thereof is omitted here because it is described in the above-mentioned documents 3 and 4. It is generally known that the complex cepstrum includes amplitude and phase information of the original audio signal sequence. Furthermore, like the cepstrum, it is well known that the complex cepstrum also includes spectrum envelope information in its low time domain, and sound source information in its high time domain. The lifter circuit 720 separates the low time region and the high time region of the complex cepstrum by multiplying it by a window function, and converts the low time region into a quantizer 760.
The high time region is output to the pitch extraction quantization circuit 7.
30 and output to the DFT circuit 740. Now, an example of the complex cepstrum xp(n) and the window function of the lifter circuit 720 are shown in FIGS. 6a and 6b, respectively. In both cases, the horizontal axis represents the interval and the vertical axis represents the amplitude. In FIG. 5a, the quantizer 760 calculates the complex cepstrum Xp
The low time region of (n) is quantized with a predetermined number of quantization bits, and the quantized value is output to multiplexer 605. Pitch extraction quantization circuit 73
0 performs pitch extraction using the high time region in one direction (positive or negative direction) of the complex cepstrum xp(n), and determines the pitch period P and pitch gain _PG . Further, this is quantized using a predetermined number of quantization bits and a quantization step size to obtain P' and P _G ', which are output to pitch signal generation circuit 550 and multiplexer 605. Pitch signal generation circuit 550, DFT circuit 53
5, index calculation circuit 370, DCT conversion circuit 560,
The level adjustment circuit 570, quantizer 580, and bit allocation step size calculation circuit 585 operate in the same manner as the similarly numbered components in FIG. 4a.

次に、DFT回路７４０はリフタ回路７２０よ
り、複素ケプストラムxp（ｎ）の高時間領域部を
出力され、これに０を付加して全体で2M点系列
とした後に2M点DFT計算を行ない、結果の実
部、虚部を指数計算回路７５０へ出力する。指数
計算回路７５０は、複素対数計算回路７００と逆
の動作をする。すなわち、DFT回路７４０の２
組（実部及び虚部）の出力値に対してそれぞれ指
数計算をし、絶対値（｜｜）と位相（arg）を得
てこれらを実部、虚部計算回路７５５へ出力す
る。実部、虚部計算回路７５５は、入力された絶
対値と位相を用いて前記（13），（14）式に従つて
実部、虚部を計算する。但し、ここではＥ（ｋ）
の代わりに絶対値（｜｜）を用いなくてはならな
い。DCT変換回路５９０及び適応型量子化器６
００は、第４図ａにおける同一番号を記した構成
要素と同一の動作をする。 Next, the DFT circuit 740 outputs the high time domain part of the complex cepstrum xp(n) from the lifter circuit 720, adds 0 to this to make a 2M point series in total, performs a 2M point DFT calculation, and obtains the result. The real and imaginary parts of are output to the exponent calculation circuit 750. Exponent calculation circuit 750 operates inversely to complex logarithm calculation circuit 700. That is, 2 of the DFT circuit 740
Exponents are calculated for each output value of the set (real part and imaginary part) to obtain an absolute value (||) and a phase (arg), which are output to the real part and imaginary part calculation circuit 755. The real part and imaginary part calculating circuit 755 calculates the real part and imaginary part using the input absolute value and phase according to the above equations (13) and (14). However, here E(k)
The absolute value (||) must be used instead. DCT conversion circuit 590 and adaptive quantizer 6
00 operates in the same way as the components labeled with the same numbers in FIG. 4a.

次に、受信側においては、デマルチプレクサ６
０６、適応型複号器６１０、ビツト割り当てステ
ツプサイズ計算回路６１５、複号器６２０、レベ
ル調整回路６７０、ピツチ信号発生回路６２５、
DFT回路６４５、、指数計算回路６５５、DCT
変換回路６６０、IDCT変換回路６８５は、第４
図ｂにおける同一番号を記した構成要素と同一の
動作をする。複号器７７０は、低時間領域複素ケ
プストラムの量子化値を入力し、これを復号して
DFT回路７８０へ出力する。DFT回路７８０は
入力信号系列に０を付加して全体で2M点系列と
した後に、2M点DFT計算を行ない、結果の実
部、虚部を指数計算回路７９０へ出力する。指数
計算回路７９０は、送信側における指数計算回路
７５０と同一の動作を行ない絶対値（｜｜）と位
相（arg）を実部、虚部計算回路８００へ出力す
る。実部、虚部計算回路８００は、送信側におけ
る実部、虚部計算回路７５５と同一の動作を行な
い、計算結果Vp（ｋ）を掛け算回路６９０へ出力
する。なおVp（ｋ）は次のようにかける。 Next, on the receiving side, the demultiplexer 6
06, adaptive decoder 610, bit allocation step size calculation circuit 615, decoder 620, level adjustment circuit 670, pitch signal generation circuit 625,
DFT circuit 645, index calculation circuit 655, DCT
The conversion circuit 660 and the IDCT conversion circuit 685 are the fourth
It operates in the same way as the component with the same number in FIG. b. The decoder 770 inputs the quantized value of the low time domain complex cepstrum and decodes it.
Output to DFT circuit 780. The DFT circuit 780 adds 0 to the input signal sequence to make the entire 2M point sequence, performs 2M point DFT calculations, and outputs the real and imaginary parts of the result to the index calculation circuit 790 . The exponent calculation circuit 790 performs the same operation as the exponent calculation circuit 750 on the transmitting side and outputs the absolute value (||) and phase (arg) to the real and imaginary part calculation circuit 800. The real part/imaginary part calculation circuit 800 performs the same operation as the real part/imaginary part calculation circuit 755 on the transmitting side, and outputs the calculation result Vp(k) to the multiplication circuit 690. Note that Vp(k) is multiplied as follows.

Vp（ｋ）＝Re（Vp（ｋ）＋jIm（Vp（ｋ）） −（20）掛け算回路６９０、IDFT計算回路６８０、重
ね合わせ回路６９５、バツフアメモリ回路２２０
は、第４図ｂにおける同一番号を付した構成要素
と同一の動作をするので、ここでは説明を省略す
る。 Vp (k) = Re (Vp (k) + jIm (Vp (k)) - (20) Multiplication circuit 690, IDFT calculation circuit 680, superposition circuit 695, buffer memory circuit 220
Since these operate in the same manner as the constituent elements with the same numbers in FIG. 4b, their explanation will be omitted here.

本実施例によれば、第１の実施例の場合と異な
り、スペクトラム構造及びピツチ構造を表わすス
ペクトラムパラメータとして複素ケプストラムを
用いている。従つて本実施例においては、サイド
情報として低時間領域複素ケプストラムを伝送す
るので、第１の実施例と比較してサイド情報伝送
に必要な伝送情報量は増加するが、従来方式と比
較した場合は伝送情報量を低減させることができ
る。しかしながら受信側において実部、虚部計算
回路８００の出力に得られる声道のインパルス応
答情報を表わす系列Vp（ｋ）を位相を含めて再現
することができるという効果がある。一方、第１
の実施例においては、第４図ｂのおける指数計算
回路６５０の出力系列Ｖ（ｋ）はは０位相で再現
される。しかしながら、第１の実施例では、受信
側でＧ（ｋ）に原音声信号の位相が含まれて再生
される。 According to this embodiment, unlike the first embodiment, a complex cepstrum is used as a spectrum parameter representing the spectrum structure and pitch structure. Therefore, in this embodiment, since the low time domain complex cepstrum is transmitted as side information, the amount of transmission information required for side information transmission increases compared to the first embodiment, but when compared with the conventional method. can reduce the amount of transmitted information. However, on the receiving side, there is an advantage that the sequence Vp(k) representing the impulse response information of the vocal tract obtained from the output of the real part/imaginary part calculation circuit 800 can be reproduced including the phase. On the other hand, the first
In the embodiment shown in FIG. 4B, the output series V(k) of the index calculation circuit 650 in FIG. 4B is reproduced at 0 phase. However, in the first embodiment, G(k) includes the phase of the original audio signal and is reproduced on the receiving side.

なお、本実施例においてサイド情報の伝送情報
量を低減させる方法としては、複素ケプストラム
のかわりにケプストラムを伝送する方法が考えら
れる。よく知られているように複素ケプストラム
Xp（ｎ）は次式に従つてケプストラムCp（ｎ）に
変換することができる。 In this embodiment, a possible method for reducing the amount of side information to be transmitted is to transmit a cepstrum instead of a complex cepstrum. As is well known, the complex cepstrum
Xp(n) can be converted to a cepstrum Cp(n) according to the following equation.

Cp（ｎ）＝1/2〔Xp（ｎ）＋xp（−ｎ）〕 −（21）従つて複素ケプストラムxp（ｎ）の低時間領域
を前記（21）式によつてケプストラムCp（ｎ）に
変換し、サイド情報としてケプストラムCp（ｎ）
を伝送してもよい。またケプストラムの代わりに
他のパラメータ系列（例えばＫパラメータ系列）
を伝送してもよい。このようにした場合には、受
信側でVp（ｋ）は０位相で再現される。 Cp (n) = 1/2 [Xp (n) + xp (-n)] - (21) Therefore, the low time region of the complex cepstrum xp (n) can be converted to the cepstrum Cp (n) by the above equation (21). Convert and use the cepstrum Cp(n) as side information
may be transmitted. Also, instead of the cepstrum, other parameter series (e.g. K parameter series) can be used.
may be transmitted. In this case, Vp(k) is reproduced at 0 phase on the receiving side.

更に、本実施によれば、受信側で再生される音
源信号のDFT係数系列Ｇ（Ｋ）には、第１の実施
例と異なり音源信号位相が含まれている。 Further, according to the present embodiment, the DFT coefficient series G(K) of the sound source signal reproduced on the receiving side includes the sound source signal phase, unlike the first embodiment.

また、入力音声信号が有声音の場合には、
DCT係数の低周波帯域のみを伝送すればよいこ
とは第１の実施例の場合と同様である。 Also, if the input audio signal is a voiced sound,
As in the case of the first embodiment, only the low frequency band of DCT coefficients needs to be transmitted.

さらにまた、ノイズ・シエイピングに関しても
第１の実施例の場合と同様である。 Furthermore, noise shaping is also the same as in the first embodiment.

演算量的には、本実施例の構成要素７００，７
１０，７５０，７８０及び７９０は複素演算を必
要とする。 In terms of the amount of calculation, the components 700, 7 of this embodiment
10,750, 780 and 790 require complex operations.

なお、第１，第２の実施例において受信側で
（ｎ）を求める際に、Ｇ（ｋ）をIDFTしたｇ（ｎ）
とＶ（ｋ）あるいはVp（ｋ）をIDFTしたｖ（ｎ）
とを時間軸上でたたみこむようにしてもよいが、
このようにした場合は演算量は大幅に増加する。 In addition, in the first and second embodiments, when calculating (n) on the receiving side, g(n) obtained by IDFT of G(k)
and v(n) which is IDFT of V(k) or Vp(k)
It is also possible to convolve the and on the time axis, but
In this case, the amount of calculation increases significantly.

【図面の簡単な説明】[Brief explanation of drawings]

第１図は従来方式を示す図、第２図は窓関数の
一例を示す図、第３図はノイズ・シエイピングの
効果を示す図、第４図ａ，ｂは本発明の第１の実
施例を示すブロツク図、第５図ａ，ｂは本発明の
第２の実施例を示すブロツク図、第６図はリフタ
回路の動作を示す図である。図において、３０……DCT回路、９０，１８
０……スペクトラム再生回路、１００，１９０，
５８５，６１５……ビツト割り当てステツプサイ
ズ計算回路、１１０……DCT係数適応型量子化
回路、２００……DCT係数適応型復号回路、２
１０……IDCT回路、５１０，５３５，６４０，
６４５，７４０，７８０……DFT回路、５２０
……絶対値位相計算回路、３２０……対数計算回
路、３３０，６８０，７１０……IDFT計算回
路、５３０，７２０……リフタ回路、５２５，７
３０……ピツチ抽出量子化回路、３７０，６５
５，７５０，７９０……指数計算回路、５４０，
７５５，８００……実部、虚部変換回路、５９
０，５６０，６６０……DCT変換回路、６００
……適応型量子化器、６１０……適応型復号器、
５７０，６７０……レベル調整回路、６８５……
IDCT変換回路、６９５……重ねあわせ回路、を
それぞれ示す。 FIG. 1 is a diagram showing the conventional method, FIG. 2 is a diagram showing an example of a window function, FIG. 3 is a diagram showing the effect of noise shaping, and FIGS. 4 a and b are diagrams showing the first embodiment of the present invention. FIGS. 5a and 5b are block diagrams showing the second embodiment of the present invention, and FIG. 6 is a diagram showing the operation of the lifter circuit. In the figure, 30...DCT circuit, 90, 18
0...Spectrum regeneration circuit, 100, 190,
585, 615... Bit allocation step size calculation circuit, 110... DCT coefficient adaptive quantization circuit, 200... DCT coefficient adaptive decoding circuit, 2
10...IDCT circuit, 510, 535, 640,
645,740,780...DFT circuit, 520
... Absolute value phase calculation circuit, 320 ... Logarithm calculation circuit, 330, 680, 710 ... IDFT calculation circuit, 530, 720 ... Lifter circuit, 525, 7
30... Pitch extraction quantization circuit, 370, 65
5,750,790...Exponent calculation circuit, 540,
755,800...real part, imaginary part conversion circuit, 59
0,560,660...DCT conversion circuit, 600
... adaptive quantizer, 610 ... adaptive decoder,
570, 670... Level adjustment circuit, 685...
An IDCT conversion circuit and 695 . . . superimposition circuit are shown, respectively.

Claims

【特許請求の範囲】１音声信号を直交変換し直交変換系列を適応的
に量子化し伝送する適応型変換符号化方式におい
て、送信側ではデイジタル信号に変換された音声
信号離散系列をフレームに分割し、前記フレーム
内の離散信号系列を第１の直交変換離散系列に変
換し、前記第１の直交変換離散系列のスペクトラ
ム的特徴を表わすパラメータ系列を求め、前記パ
ラメータ系列から音源特性を表わすパラメータ系
列を求め、前記スペクトラム的特微を表わすパラ
メータ系列かまたは前記第１の直交変換離散系列
から声道特性を表わすパラメータ系列とを求める
と共に、前記音源特性を表わすパラメータ系列を
第２の直交変換離散系列に変換し、更に前記第２
の直交変換離散系列を適応的に量子化した量子化
信号系列を求め、前記量子化信号系列と前記声道
特性を表わすパラメータ系列とを組み合わせて伝
送し、受信側では伝送された信号系列から前記量
子化信号系列と前記声道特性を表わすパラメータ
系列とを分離し、前記声道特性を表わすパラメー
タ系列を用いて声道の応答特性を表わす信号系列
を求め、前記量子化信号系列を適応的に複合し、
更に逆変換離散信号系列に変換し、前記逆変換離
散信号系列と前記声道の応答特性を表わす信号系
列とを用いて所定フレーム内の音声信号離散系列
を再生させるようにしたことを特徴とする適応型
変換符号化方法。２離散的デイジタル信号に変換された音声信号
離散系列をフレームに分割する回路と、前記フレ
ーム内の離散信号系列を第１の直交変換離散系列
に変換する回路と、前記第１の直交変換離散系列
のスペクトラム的特徴を表わす第１のスペクトラ
ムパラメータ系列を発生させる回路と、前記第１
のスペクトラムパラメータ系列から音源情報を含
む第２のスペクトラムパラメータ系列を求めかつ
前記第１のスペクトラムパラメータかまたは前記
第１の直交変換離散系列から声道情報を含む第３
のスペクトラムパラメータ系列とを求める回路
と、前記第３のスペクトラムパラメータ系列を少
ないビツト数で量子化するパラメータ量子化回路
と、前記第１のスペクトラムパラメータ系列から
抽出して得られる情報を量子化する量子化回路
と、前記第２のスペクトラムパラメータ系列を用
いて音源のピツチ構造を含むスペクトラム情報に
変換する回路と、前記スペクトラム情報を用いて
第２の直交変換離散系列を計算する回路と、前記
量子化回路からの量子化値を復号化し前記第２の
直交変換離散系列を適応的に量子化するために必
要な情報を適応的に計算する回路と、前記量子化
に必要な情報を用いて前記第２の直交変換離散系
列を適応的に量子化した第４の信号系列を計算す
る適応量子化回路と、前記第４の信号系列と前記
パラメータ量子化回路からの出力系列と前記量子
化回路からの出力情報とを組み合わせて伝送する
回路とを有することを特徴とする符号化装置。３伝送された信号系列から適応量子化信号系列
を表わす第５の信号系列と声道情報を有す量子化
パラメータ系列を表わす第６の信号系列と音源情
報を有すスペクトラムパラメータの一部を表わす
量子化情報とを分離する回路と、前記分離回路に
よつて分離された第６の信号系列を用いて第７の
信号系列を計算する回路と、前記量子化情報を復
号し前記第５の信号系列を適応的に復号化するた
めに必要な情報を適応的に計算する回路と、前記
復号化に必要な情報を用いて前記第５の信号系列
を適応的に復号化した信号系列を計算する回路
と、前記復号化した信号系列を用いて逆変換離散
信号系列を表わす第８の信号系列を計算する回路
と、前記第８の信号系列と前記第７の信号系列と
を用いて所定フレーム内の再生音声信号離散系列
を計算する回路とを有することを特徴とする復号
化装置。[Claims] 1. In an adaptive transform coding method that orthogonally transforms an audio signal and adaptively quantizes and transmits the orthogonal transform sequence, the transmitting side divides the audio signal discrete sequence converted into a digital signal into frames. , converting the discrete signal sequence in the frame into a first orthogonally transformed discrete sequence, obtaining a parameter sequence representing the spectral characteristics of the first orthogonally transforming discrete sequence, and determining a parameter sequence representing the sound source characteristic from the parameter sequence. and obtaining a parameter series representing the spectral features or a parameter series representing vocal tract characteristics from the first orthogonally transformed discrete series, and converting the parameter series representing the sound source characteristics into a second orthogonally transformed discrete series. Convert and further convert the second
A quantized signal sequence is obtained by adaptively quantizing an orthogonal transform discrete sequence of The quantized signal sequence and the parameter sequence representing the vocal tract characteristics are separated, the parameter sequence representing the vocal tract characteristics is used to obtain a signal sequence representing the response characteristics of the vocal tract, and the quantized signal sequence is adaptively converted into the quantized signal sequence. compound,
The voice signal is further converted into an inversely transformed discrete signal sequence, and the audio signal discrete sequence within a predetermined frame is reproduced using the inversely transformed discrete signal sequence and the signal sequence representing the response characteristic of the vocal tract. Adaptive transform coding method. 2. A circuit that divides the audio signal discrete sequence converted into a discrete digital signal into frames, a circuit that converts the discrete signal sequence in the frame into a first orthogonal transformed discrete sequence, and the first orthogonal transformed discrete sequence. a circuit for generating a first spectral parameter sequence representing a spectral feature of the first spectral parameter;
A second spectrum parameter series including sound source information is obtained from the spectrum parameter series, and a third spectrum parameter series including vocal tract information is obtained from the first spectrum parameter or the first orthogonal transform discrete series.
a parameter quantization circuit that quantizes the third spectrum parameter series using a small number of bits; and a quantizer that quantizes information extracted from the first spectrum parameter series. a circuit for converting the second spectrum parameter series into spectrum information including pitch structure of the sound source; a circuit for calculating a second orthogonal transform discrete series using the spectrum information; a circuit that adaptively calculates information necessary for decoding the quantized value from the circuit and adaptively quantizing the second orthogonal transform discrete sequence; an adaptive quantization circuit that calculates a fourth signal sequence obtained by adaptively quantizing the second orthogonal transform discrete sequence; 1. An encoding device comprising: a circuit for transmitting a combination of output information and output information. 3 A fifth signal sequence representing an adaptive quantization signal sequence from the transmitted signal sequence, a sixth signal sequence representing a quantization parameter sequence having vocal tract information, and a portion of a spectrum parameter having sound source information. a circuit that separates the quantization information from the quantization information; a circuit that calculates a seventh signal sequence using the sixth signal sequence separated by the separation circuit; and a circuit that decodes the quantization information and calculates the fifth signal sequence. A circuit that adaptively calculates information necessary for adaptively decoding a sequence, and a signal sequence that is obtained by adaptively decoding the fifth signal sequence using the information necessary for the decoding. a circuit for calculating an eighth signal sequence representing an inversely transformed discrete signal sequence using the decoded signal sequence; and a circuit for calculating an eighth signal sequence representing an inversely transformed discrete signal sequence using the decoded signal sequence; A decoding device comprising: a circuit for calculating a discrete sequence of reproduced audio signals.