JP2904427B2

JP2904427B2 - Missing voice interpolation device

Info

Publication number: JP2904427B2
Application number: JP3273503A
Authority: JP
Inventors: 隆裕野村; 陽太郎八塚
Original assignee: KEI DEI DEI KK
Current assignee: KEI DEI DEI KK
Priority date: 1991-09-26
Filing date: 1991-09-26
Publication date: 1999-06-14
Anticipated expiration: 2014-06-14
Also published as: JPH0588697A

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、送信側で入力された音
声信号を符号化した音声符号化情報を受信側で離散的に
受信するシステムにおいて、音声符号化情報を受信しな
い時間区間における聴感上の通話品質の改善を図るため
に有効である欠落音声補間装置に関わるものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a system for discretely receiving audio encoded information obtained by encoding an audio signal inputted on a transmitting side on a receiving side, in a time interval in which no audio encoded information is received. The present invention relates to a missing voice interpolation device which is effective for improving the above communication quality.

【０００２】[0002]

【従来の技術】音声信号を標本化し一定の時間長のフレ
ームに分割し符号化した音声符号化情報（これを以下パ
ケットと呼ぶ）を伝送する通信網においては、例えば網
内で生じる不規則な遅延のために受信側にパケットが一
定時間内に到達せずにパケットの紛失を生じることがあ
り、そのために受信側において音声信号の一部を再生で
きなくなり、聴感上の音声品質の劣化を生ずる。従っ
て、欠落した再生音声信号を何らかの方法で補間する必
要がある。この問題を解決するために、従来よりいくつ
か欠落音声の補間方法が提案されている。2. Description of the Related Art In a communication network for transmitting voice coded information (hereinafter referred to as a packet) obtained by sampling a voice signal, dividing the frame into frames of a fixed time length, and transmitting, for example, irregularities generated in the network. Due to the delay, the packet may not be able to reach the receiving side within a certain period of time, and the packet may be lost. As a result, a part of the audio signal cannot be reproduced on the receiving side, resulting in deterioration of the audible sound quality. . Therefore, it is necessary to interpolate the missing reproduced audio signal by some method. In order to solve this problem, interpolation methods for some missing voices have been conventionally proposed.

【０００３】まず、如何なる音声符号化方式にも適用で
きる欠落音声補間方式について説明する。第一に、欠落
音声部分に零値を挿入する方法（以下、零値挿入方法と
呼ぶ）及び欠落音声部分に１つ前のパケットの再生音声
信号をそのまま挿入する方法（以下、直前パケット挿入
方法と呼ぶ）がある。これらは如何なる音声符号化方式
に対しても適用でき補間処理も容易である利点がある
が、音声の性質を無視した方法のため補間した再生音声
品質が悪く、更に零値もしくは前パケットの音声信号を
挿入するための補間処理回路の付加が必要である。ま
た、零値挿入方法や直前パケット挿入方法と同様に如何
なる音声符号化方式にも適用できる補間方法として、過
去の再生音声信号に対してパターンマッチングやピッチ
検出を行うことにより抽出した類似信号を、欠落音声信
号の代用として挿入する方法（以下、各々パターンマッ
チング補間方法及びピッチ検出補間方法と呼ぶ）があ
る。これは、前の２つの補間方法に比べ再生音声品質は
比較的よいが、送信側で入力された音声信号に対してで
はなく符号化歪を含む再生音声に対してパターンマッチ
ングやピッチ検出を行うため補間音声品質が劣化し、更
にパターンマッチングやピッチ検出のための処理計算量
及び処理回路の付加量が増大する。以上の４種類の補間
方式の欠点は、如何なる音声符号化方式にも適用可能を
するために起こったとも言える。First, a description will be given of a missing voice interpolation system applicable to any voice coding system. First, a method of inserting a zero value into a missing voice part (hereinafter, referred to as a zero value insertion method) and a method of inserting the reproduced voice signal of the immediately preceding packet as it is into the missing voice part (hereinafter, the immediately preceding packet inserting method) There is). These have the advantage that they can be applied to any audio coding method and the interpolation processing is easy. It is necessary to add an interpolation processing circuit for inserting. In addition, as an interpolation method applicable to any audio encoding method like the zero value insertion method or the immediately preceding packet insertion method, a similar signal extracted by performing pattern matching or pitch detection on a past reproduced audio signal, There is a method (hereinafter, referred to as a pattern matching interpolation method and a pitch detection interpolation method, respectively) that is inserted as a substitute for the missing voice signal. Although the reproduction voice quality is relatively good as compared with the previous two interpolation methods, the pattern matching and the pitch detection are performed not on the voice signal input on the transmission side but on the reproduction voice including coding distortion. As a result, the quality of the interpolated voice deteriorates, and the amount of processing calculations and the amount of additional processing circuits for pattern matching and pitch detection increase. It can be said that the above-mentioned drawbacks of the four types of interpolation methods have occurred because they can be applied to any audio coding method.

【０００４】次に、音声符号化方式に依存した欠落音声
補間方式について説明する。代表的な音声符号化方式の
１つである適応差分ＰＣＭ（ＡＤＰＣＭ）方式に対する
補間方法としては、復号器側で受信された前パケットま
での符号化情報により再生された残差信号に対してピッ
チ検出を行いその再生された残差信号から抽出された疑
似信号を欠落パケットに対応する残差信号の代用として
用い通常の復号処理を行う方法も提案されている。この
構成も上記の４種類の補間方式に比べて再生音声品質は
比較的よいが、上記と同様に原音声信号ではなく量子化
雑音を含む再生された残差信号に対してパターンマッチ
ングやピッチ検出を行い補間のための疑似信号を生成し
ているので、再生音声品質が劣化し、更にパターンマッ
チングやピッチ検出のための処理計算量及び処理回路の
付加量も増大する結果となる。[0004] Next, a description will be given of a missing voice interpolation system depending on the voice coding system. As an interpolation method for the adaptive difference PCM (ADPCM) system, which is one of the typical voice coding systems, a residual signal reproduced on the basis of coding information up to the previous packet received at the decoder side is used. method of performing normal decoding processing is used as a substitute of the residual signal corresponding pseudo signal extracted from the reproduced residual signal performs pitch detection to the missing packet has been proposed. this
Configuration is also reproduced speech quality is relatively good in comparison with the four interpolation method described above, the pattern matching and pitch detection for the the reproduced residual signal including quantization noise instead of the original speech signal as well generating a pseudo-signal for the line have interpolation
As a result, the quality of reproduced voice is degraded, and the amount of processing calculations for pattern matching and pitch detection and the amount of additional processing circuits are increased.

【０００５】一方、代表的な音声符号化方式の１つであ
る適応予測符号化（ＡＰＣ）方式に対するパケット欠落
補間装置について以下に図を用いて説明する。まず、Ａ
ＰＣ符号化方式においては送信側の符号器により入力音
声信号の短時間スペクトルエンベロープを表す短時間相
関（フォルマント相関）を短時間予測器（フォルマント
予測器）を用いて取り除くために短時間予測器の出力で
ある短時間予測信号（フォルマント予測信号）を入力音
声信号から差し引いて短時間予測残差信号（フォルマン
ト予測残差信号）を作成し、更に有声音などにおいてピ
ッチ周期毎に波形が繰り返されるという長時間相関（ピ
ッチ相関）を長時間予測器（ピッチ予測器）を用いて取
り除くため、長時間予測器の出力である長時間予測信号
（ピッチ予測信号）を短時間予測残差信号から差し引
き、その結果の長時間予測残差信号（ピッチ予測残差信
号）と呼ばれる信号を量子化・符号化し短時間予測器内
の短時間予測係数及び長時間予測器内の長時間予測係数
からなる予測情報と共に受信側に伝送している。[0005] On the other hand, a packet loss interpolation apparatus for an adaptive predictive coding (APC) system, which is one of the typical speech coding systems, will be described below with reference to the drawings. First, A
Correlation briefly representing the short-time spectral envelope of the input speech signal by the encoder in the transmitting-end PC coding scheme (formant correlation) short predictor (formant predictor) predicted short time to take dividing Kutame using At the output of the vessel
Input a short-term prediction signal (formant prediction signal)
Short-term prediction residual signal (Formman
And a long-term correlation (pitch correlation) in which a waveform is repeated for each pitch period in a voiced sound or the like using a long-term predictor (pitch predictor). removing the camera in order LTP signal output from the LTP
(Pitch prediction signal) is subtracted from the short-term prediction residual signal
Come, the LTP residual signal results (pitch prediction residual Sashin
Signal ) is quantized and coded into the short-time predictor
Short-term prediction coefficient and long-term prediction coefficient in long-term predictor
Is transmitted to the receiving side together with the prediction information consisting of

【０００６】図８はＡＰＣ符号化方式の復号器における
欠落音声補間装置を説明するための図である。図中、１
は端子、２は音声符号化情報分離手段として用いられる
多重分離回路、３は雑音発生器、４は残差信号電力復号
器、５は逆量子化器、６は長時間予測係数復号化手段と
して用いられる長時間予測係数復号器、７は短時間予測
係数復号化手段として用いられる短時間予測係数復号
器、８は切り換えスイッチ、９は長時間合成フィルタ手
段として用いられる長時間合成フィルタ、１０は短時間
合成フィルタ手段として用いられる短時間合成フィル
タ、１１は加算器、１２は長時間合成フィルタ手段とし
て用いられる長時間予測器、１３は加算器、１４は短時
間予測器、１５は端子、１６は端子である。音声符号化
情報は端子１に受信され、長時間予測残差信号電力と、
符号化された正規化残差信号の情報（長時間予測残差信
号量子化情報）、長時間予測係数（ピッチ係数とピッチ
周期）及び短時間予測係数（フォルマント予測係数）か
らなる符号化パラメータの情報は多重分離回路２により
各々分離される。残差信号電力復号器４からの残差信号
電力を基に逆量子化器５のステップサイズを設定し、符
号化された正規化長時間予測残差信号から逆量子化長時
間予測残差信号を長時間予測残差信号復号化手段として
用いられる逆量子化器５において再生する。端子１５か
らの欠落検出信号により音声符号化情報の欠落を検出し
ない場合には、切り換えスイッチ８を介して加算器１１
と逆量子化器５が接続される。この結果、逆量子化長時
間予測残差信号は長時間予測器１２と加算器１１からな
る長時間合成フィルタ９に長時間予測残差信号（ピッチ
予測残差信号）として入力され、その後、更に短時間予
測器１４と加算器１３からなり短時間合成フィルタ手段
として用いられる短時間合成フィルタ１０に短時間予測
残差信号（フォルマント予測残差信号）として入力され
る。これらの構成はＡＰＣ復号器として動作するもので
短時間合成フィルタ１０の出力として再生されたディジ
タル音声信号が得られる。長時間予測係数復号器６は送
信側から伝送されてきた長時間予測係数に関する情報か
ら長時間予測係数を復号するもので、復号された係数を
長時間予測器１２に設定する。同様に、短時間予測係数
復号器７は送信側から伝送されてきた短時間予測係数に
関する情報から短時間予測係数を復号し、復号された係
数を短時間予測器１４に設定する。尚、残差信号電力、
長時間予測係数及び短時間予測係数からなる符号化パラ
メータの情報はフレームあるいはサブフレーム毎に伝送
されている。次に、端子１５からの欠落検出信号により
音声符号化情報（符号化パラメータの情報及び符号化さ
れた正規化残差信号の情報）の欠落を検出した場合に
は、端子１には通常の符号化情報が伝送されてこないこ
とになる。従って、切り換えスイッチ８を介して雑音発
生器３と加算器１１が接続される。長時間予測器１２及
び短時間予測器１４には前フレームの長時間予測係数及
び短時間予測係数をそのまま設定しておき、動作させ
る。更に残差信号電力復号器４からの前フレームの残差
信号電力をそのまま用いて同一の電力を持った雑音を雑
音発生器３から送出する。この雑音を逆量子化器５によ
り再生された逆量子化残差信号と見なし長時間合成フィ
ルタ９に加え、上記のように通常のＡＰＣ復号を行うこ
とにより短時間合成フィルタ１０の出力として再生ディ
ジタル音声信号が補間される。この補間方式では、残差
信号は相関が殆ど取り除かれた雑音であると仮定してい
る。ところが、実際には音声信号から相関を取り除くこ
とは難しく結局のところ残差信号にも相関が残ってしま
うこと、及び残差信号の代わりとして用いる雑音の位相
が本来の再生された逆量子化残差信号の位相と異なるこ
とから、補間された音声信号の品質があまりよくない。
更に、逆量子化残差信号（逆量子化器５により再生され
た残差信号）の代わりとなる雑音を生成する回路、即ち
雑音発生器３の付加が必要である。FIG. 8 is a diagram for explaining a missing voice interpolation device in a decoder of the APC encoding system. In the figure, 1
Is a terminal, 2 is a demultiplexing circuit used as speech coded information separating means , 3 is a noise generator, 4 is a residual signal power decoder, 5 is an inverse quantizer, and 6 is a long-term prediction coefficient. Decryption means and
Long-term prediction coefficient decoder used for short-term prediction
Short-term prediction coefficient decoder used as coefficient decoding means , 8 is a changeover switch, 9 is a long-time synthesis filter
Long time synthesis filter used as a stage , 10 short time
Short-time synthesis filter used as synthesis filter means , 11 is an adder, and 12 is long-time synthesis filter means.
LTP used Te, 13 an adder, 14 a short time predictor, 15 terminal, 16 is a terminal. The speech coded information is received at terminal 1 and the long-term predicted residual signal power ,
Information on the coded normalized residual signal (long-term prediction residual signal
No. quantization information) , long-term prediction coefficient (pitch coefficient and pitch)
The information of the encoding parameter including the period) and the short-term prediction coefficient (formant prediction coefficient) is separated by the demultiplexing circuit 2. The step size of the inverse quantizer 5 is set based on the residual signal power from the residual signal power decoder 4, and the inverse quantized length is calculated from the encoded normalized long-term prediction residual signal.
Inter prediction residual signal as long-term prediction residual signal decoding means
The reproduction is performed by the used inverse quantizer 5. When the loss of the voice coded information is not detected by the loss detection signal from the terminal 15, the adder 11
And the inverse quantizer 5 are connected. As a result, when the inverse quantization length is
The long-term prediction residual signal (pitch ) is sent to a long-term synthesis filter 9 comprising a long-term predictor 12 and an adder 11.
Is input as the prediction residual signal), then further short predictor 14 and adder 13 Tona Ri short synthesis filter means
-Time prediction in short- time synthesis filter 10 used as
It is input as a residual signal (formant prediction residual signal) . These components operate as an APC decoder, and a reproduced digital audio signal is obtained as an output of the short-time synthesis filter 10. The long-term prediction coefficient decoder 6 decodes the long-term prediction coefficient from the information about the long-term prediction coefficient transmitted from the transmission side, and sets the decoded coefficient in the long-term prediction unit 12. Similarly, the short-term prediction coefficient decoder 7 decodes the short-term prediction coefficients from the information on the short-term prediction coefficients transmitted from the transmission side, and sets the decoded coefficients in the short-term prediction unit 14. Note that the residual signal power,
The information of the coding parameter including the long-term prediction coefficient and the short-term prediction coefficient is transmitted for each frame or subframe. Next, speech coded information (information of coding parameters and coded
In this case, the normal encoded information is not transmitted to the terminal 1 when the missing of the normalized residual signal information is detected. Therefore, the noise generator 3 and the adder 11 are connected via the changeover switch 8. The long-term predictor 12 and the short-term predictor 14 are set and operated with the long-term prediction coefficient and the short-term prediction coefficient of the previous frame as they are. Further, using the residual signal power of the previous frame from the residual signal power decoder 4 as it is, a noise having the same power is transmitted from the noise generator 3. This noise is generated by the inverse quantizer 5.
The reproduced digital audio signal is interpolated as an output of the short-time synthesis filter 10 by performing normal APC decoding as described above in addition to the long-time synthesis filter 9 assuming that the reproduced digital audio signal is an inversely quantized residual signal. In this interpolation method, it is assumed that the residual signal is noise from which the correlation has almost been removed. However, in practice, it is difficult to remove the correlation from the audio signal, and eventually the correlation remains in the residual signal, and the phase of the noise used as a substitute for the residual signal is reduced by the originally reproduced inverse quantization residual. The quality of the interpolated audio signal is not very good because it is different from the phase of the difference signal.
Further, an inverse quantization residual signal (reproduced by the inverse quantizer 5)
It is necessary to add a circuit that generates noise instead of the residual signal , that is, a noise generator 3.

【０００７】[0007]

【発明が解決しようとする課題】以上のように従来技術
には、如何なる音声符号化方式にも適用可能なように再
生音声信号を用いることにより補間信号を生成する方式
の場合には、処理計算量及び処理回路の付加量の増大を
伴うわりには補間信号の品質がよくなく、また音声符号
化方式に依存した方式の場合においても、処理回路の付
加量の増大のわりには補間信号の品質があまりよいとは
言えないという欠点があった。As described above, in the prior art, in the case of a system in which an interpolation signal is generated by using a reproduced audio signal so as to be applicable to any audio encoding system, a processing calculation is performed. The quality of the interpolated signal is not good in spite of the increase in the amount of the processing circuit and the additional amount of the processing circuit. There was a drawback that it was not very good.

【０００８】本発明は、上記の欠点を解決するためのも
ので、適用可能な音声符号化方式は限定されるものの、
処理回路の付加が殆どなく補間信号の品質が格段に良好
な欠落音声補間装置を提供するものである。更に、本発
明の回路構成のみを一般の音声符号化方式に適用するこ
とにより、従来より補間のための回路の付加量が少なく
従来と同じ補間品質が得られる欠落音声補間装置を提供
するものである。The present invention has been made to solve the above-mentioned drawbacks, and although applicable speech coding systems are limited,
An object of the present invention is to provide a missing voice interpolation apparatus in which the quality of an interpolation signal is remarkably excellent without adding a processing circuit. Furthermore, by applying only the circuit configuration of the present invention to a general speech coding method, a missing speech interpolation apparatus is provided in which the amount of additional circuits for interpolation is smaller than before and the same interpolation quality can be obtained as before. is there.

【０００９】[0009]

【課題を解決するための手段】この課題を解決するため
本発明は、送信側で入力された音声信号を符号化し音声
符号化情報として受信側へ伝送し、該受信側で該音声符
号化情報を離散的に受信し、前記音声符号化情報を受信
しない時間区間には生成した補間信号を出力する欠落音
声補間方式である。（１）さらに、具体的には本願第１の発明は、音声符
号化情報分離手段（２），残差信号復号化手段（１９，
２０），長時間予測係数復号／選択手段（１８），短時
間予測係数復号化手段（７），長時間合成／補間フィル
タ手段（１７），短時間合成フィルタ手段（１０），端
子（１５）を備える欠落音声補間装置であって、音声符
号化情報分離手段（２）は、入力される音声符号化情報
を長時間予測係数情報，短時間予測係数情報，残差信号
量子化情報に分離し、残差信号復号化手段（１９，２
０）は、残差信号量子化情報を復号化して残差信号を出
力し、長時間予測係数復号／選択手段（１８）は、長時
間予測係数情報を復号化して得られる長時間予測係数を
長時間合成／補間フィルタ手段（１７）に出力するとと
もに、端子（１５）からの欠落検出信号を検出した場合
には、過去の長時間予測係数から得た補間用長時間予測
係数を長時間予測係数として長時間合成／補間フィルタ
手段（１７）に出力し、短時間予測係数復号化手段
（７）は、短時間予測係数情報を復号して得られる短時
間予測係数を出力し、長時間合成／補間フィルタ手段
（１７）は、切り換え手段（２１），長時間合成／フィ
ルタ（１１，１２）を備え、切り換え手段（２１）は、
端子（１５）からの欠落検出信号の検出の有無に対応し
て残差信号復号化手段（１９，２０）と長時間合成／フ
ィルタ（１１，１２）との相互間を切断又は接続し、長
時間合成／フィルタ（１１，１２）は、フィードバック
状に接続された長時間予測手段（１２）からなり、切り
換え手段（２１）からの残差信号を入力とし、長時間予
測係数を用いて短時間予測残差信号を合成して出力し、
短時間合成フィルタ手段（１０）は、短時間予測係数と
短時間予測残差信号とから再生音声信号を合成し出力す
る欠落音声補間装置である。（２）次に、本願第２の発明は、音声符号化情報分離
手段（２），残差信号復号化手段（４，５），短時間予
測係数復号化手段（７），長時間予測係数復号／選択手
段（１８），短時間合成フィルタ手段（１０），長時間
合成／補間フィルタ手段（１７），端子（１５）を備え
る欠落音声補間装置であって、音声符号化情報分離手段
（２）は、入力される音声符号化情報を長時間予測係数
情報，短時間予測係数情報，残差信号量子化情報に分離
し、残差信号復号化手段（４，５）は、残差信号量子化
情報を復号化して残差信号を出力し、長時間予測係数復
号／選択手段（１８）は、長時間予測係数情報を復号化
して得られる長時間予測係数を長時間合成／補間フィル
タ手段（１７）に出力するとともに、端子（１５）から
の欠落検出信号を検出した場合には、過去の長時間予測
係数から得た補間用長時間予測係数を長時間予測係数と
して長時間合成／補間フィルタ手段（１７）に出力し、
短時間予測係数復号化手段（７）は、短時間予測係数情
報を復号して得られる短時間予測係数を出力し、短時間
合成フィルタ手段（１０）は、短時間予測係数と残差信
号とから長時間予測残差信号を合成して出力し、長時間
合成／補間フィルタ手段（１７）は、切り換え手段（２
１），長時間合成／フィルタ（１１，１２）を備え、切
り換え手段（２１）は、端子（１５）からの欠落検出信
号の検出の有無に対応して短時間合成フィルタ手段（１
０）と長時間合成／フィルタ（１１，１２）との相互間
を切断又は接続し、長時間合成／フィルタ（１１，１
２）は、フィードバック状に接続された長時間予測手段
（１２）からなり、切り換え手段（２１）からの長時間
予測残差信号を入力とし、長時間予測係数を用いて再生
音声信号を合成して出力する欠落音声補間装置である。（３）また、本願第３の発明は、音声復号化手段（３
３），補間用フィルタ手段（３４），ピッチ情報検出手
段（３７），端子（３９）を備える欠落音声補間装置で
あって、音声復号化手段（３３）は、入力される音声符
号化情報を復号化して得られる再生音声信号を出力し、
補間用フィルタ手段（３４）は、切り換え手段（３
５），補間用長時間予測手段（３６）を備え、切り換え
手段（３５）の出力を補間再生音声信号として出力し、
切り換え手段（３５）は、端子（３９）からの欠落検出
信号を検出しない場合には、音声復号化手段（３３）か
らの再生音声信号を補間再生音声信号として出力し、欠
落検出信号を検出した場合には、補間用長時間予測手段
（３６）からの長時間予測信号を補間再生音声信号とし
て出力し、補間用長時間予測手段（３６）は、ピッチ情
報検出手段（３７）からの長時間予測係数に基づいて切
り換え手段（３５）の出力する補間再生音声信号から長
時間予測信号を合成して出力し、ピッチ情報検出手段
（３７）は、補間再生音声信号から長時間予測係数を抽
出して補間用長時間予測手段（３６）に出力する欠落音
声補間装置である。SUMMARY OF THE INVENTION In order to solve this problem, the present invention encodes a speech signal input on a transmission side, transmits the encoded speech signal to a reception side as speech encoded information, and transmits the speech encoded information on the reception side. Is a discrete speech interpolation method in which the generated interpolation signal is output in a time period in which the speech coding information is not received and the generated speech signal is not received. (1) More specifically, the first invention of the present application is a voice code
Encoding information separating means (2), residual signal decoding means (19,
20), long-term prediction coefficient decoding / selection means (18), short time
Inter prediction coefficient decoding means (7), long time synthesis / interpolation file
Filter means (17), short-time synthesis filter means (10), end
A lost audio interpolation apparatus including a child (15), sound marks
Encoding information separating means (2) for inputting audio encoded information;
Is the long-term prediction coefficient information, short-term prediction coefficient information, residual signal
It is separated into quantized information, and the residual signal decoding means (19, 2)
0) decodes the residual signal quantization information and outputs a residual signal.
The long-term prediction coefficient decoding / selection means (18)
The long-term prediction coefficient obtained by decoding the
Output to the long-time synthesis / interpolation filter means (17)
When a missing detection signal from terminal (15) is detected
Contains long-term interpolation predictions obtained from past long-term prediction coefficients.
Long-term synthesis / interpolation filter with coefficients as long-term prediction coefficients
Means (17) for decoding the short-term prediction coefficient
(7) is a short time obtained by decoding the short-time prediction coefficient information.
Output inter prediction coefficient, and long time synthesis / interpolation filter means
(17) Switching means (21), long-time synthesizing / filtering
(11, 12), and the switching means (21)
Corresponding to the presence or absence of the missing detection signal from the terminal (15)
And a long-time synthesis / processing with the residual signal decoding means (19, 20).
The mutual disconnection or connection between the filter (11, 12), the length
Time synthesis / filters (11, 12) are feedback
Long time prediction means (12) connected in a
Receiving the residual signal from the changing means (21) ,
The short-term prediction residual signal is synthesized using the
The short-time synthesis filter means (10) includes a short-time prediction coefficient and
Synthesizes and outputs the reproduced audio signal from the short-term prediction residual signal
It is a missing voice interpolation apparatus that. (2) Next, the second invention of the present application is a method for separating speech encoded information.
Means (2), residual signal decoding means (4, 5),
Measurement coefficient decoding means (7), long-term prediction coefficient decoding / selection means
Stage (18), short-time synthesis filter means (10), long time
Equipped with synthesis / interpolation filter means (17) and terminal (15)
Lost speech interpolation device, comprising speech encoded information separation means
(2) converts the input speech coding information into a long-term prediction coefficient
Information, short-term prediction coefficient information, and residual signal quantization information
The residual signal decoding means (4, 5) performs residual signal quantization.
Decode the information and output the residual signal, and then perform long-term prediction coefficient decoding.
Signal / selection means (18) decodes long-term prediction coefficient information
Long-term prediction coefficient obtained by long-term synthesis / interpolation fill
Output to the terminal means (17) and from the terminal (15)
If a missing detection signal is detected, a long time
The long-term prediction coefficient for interpolation obtained from the coefficient is
And outputs it to the long-time synthesis / interpolation filter means (17).
The short-term prediction coefficient decoding means (7) outputs the short-term prediction coefficient information.
Outputs short prediction coefficients obtained by decoding the broadcast, short
The synthesis filter means (10) includes a short-term prediction coefficient and a residual signal.
And it outputs the synthesized prediction residual signal long from the items, long
The synthesizing / interpolating filter means (17) includes switching means (2)
1), with long synthetic / filter (11, 12), switching
The replacement means (21) is provided with a missing detection signal from the terminal (15).
Short-time synthesis filter means (1
0) and the long-time synthesis / filter (11, 12)
Is disconnected or connected, and the synthesis / filter (11, 1
2) long-term prediction means connected in a feedback manner
(12), a long time from the switching means (21)
Inputs the prediction residual signal and plays back using the long-term prediction coefficient
This is a missing voice interpolation device that synthesizes and outputs a voice signal . (3) Further, the third invention of the present application is the audio decoding device (3
3), interpolation filter means (34), pitch information detecting means
A missing voice interpolation device having a stage (37) and a terminal (39)
The voice decoding means (33) is adapted to input the voice code
Output a reproduced audio signal obtained by decoding the encoded information,
The interpolation filter means (34) is provided with a switching means (3
5), provided with interpolation long time prediction means (36), switching
Outputting the output of the means (35) as an interpolation reproduction audio signal;
The switching means (35) detects missing from the terminal (39).
If no signal is detected, the voice decoding means (33)
These playback audio signals are output as interpolation playback audio signals,
If a falling detection signal is detected, the interpolation long time prediction means
The long-term prediction signal from (36) is used as the interpolated reproduction audio signal.
The interpolation long time prediction means (36) outputs the pitch information.
Based on the long-term prediction coefficient from the alarm detection means (37).
From the interpolated reproduced audio signal output by the switching means (35).
A time prediction signal is synthesized and output, and pitch information detection means
(37) extracts a long-term prediction coefficient from the interpolated reproduced audio signal.
This is a missing voice interpolation device that outputs a signal to the interpolation long time prediction means (36) .

【００１０】[0010]

【実施例１】まず本発明は、ＡＰＣ方式、マルチパルス
駆動型線形予測符号化（ＭＰＥＣ）方式もしくはコード
駆動型線形予測符号化（ＣＥＬＰ）方式等のピッチ分析
を持つ音声符号化方式に適用可能な欠落音声補間方式で
ある。以下に図を用いて本発明を詳細に説明する。尚、
以下の説明では従来方式（図８）と同一機能を持つ構成
要素については同一番号を付して説明の重複を省く。図
１は本発明を上記の音声符号化方式へ適用した一実施例
であり、送信側で入力された音声信号を符号化し音声符
号化情報として受信側へ送信し受信側でそれを離散的に
受信し、音声符号化情報を受信しない時間区間には受信
側で生成した補間信号を出力する欠落音声補間装置の概
略図である。同図中、１７は長時間合成フィルタ手段と
して用いられる長時間合成／補間フィルタであり、長時
間予測器１２と加算器１１とからなり、短時間予測残差
信号を出力又は補間出力とする。、１８は長時間係数復
号化手段とし用いられる長時間予測係数復号／選択器、
１９は残差信号電力復号器、２０は長時間予測残差信号
復号化手段として用いられる逆量子化器、２１は加算器
１１への入力をしゃ断制御する接続しゃ断スイッチであ
る。図２は図１中の長時間予測係数復号化手段として用
いられる長時間予測係数復号／選択器１８の構成概略図
である。同図中、２２は長時間予測係数復号器、２３は
補間用長時間予測係数生成器、２４は切り換えスイッチ
である。First Embodiment First, the present invention is applicable to a speech coding system having a pitch analysis such as an APC system, a multi-pulse driven linear predictive coding (MPEC) system or a code driven linear predictive coding (CELP) system. This is a missing voice interpolation method. Hereinafter, the present invention will be described in detail with reference to the drawings. still,
In the following description, components having the same functions as those of the conventional system (FIG. 8) are assigned the same reference numerals, and the description will not be repeated. FIG. 1 shows an embodiment in which the present invention is applied to the above-described speech coding method. A speech signal inputted on the transmission side is encoded and transmitted to the reception side as speech coded information, which is discretely received on the reception side. FIG. 3 is a schematic diagram of a missing voice interpolation device that outputs an interpolation signal generated on the receiving side during a time period in which reception is performed and voice coding information is not received. In the figure, reference numeral 17 denotes a long-time synthesis filter means.
A long synthesis / interpolation filter used to, when the length
A short-time prediction residual consisting of an inter prediction unit 12 and an adder 11
The signal is output or interpolated output. , 18 is a long-term coefficient recovery
Long-term prediction coefficient decoder / selector used as encoding means ,
19 is a residual signal power decoder, 20 is a long-term predicted residual signal
Dequantizer used as decoding means , 21 is an adder
This is a connection cutoff switch that cuts off the input to 11 . FIG. 2 is used as a long-term prediction coefficient decoding means in FIG.
FIG. 2 is a schematic diagram of a configuration of a long-term prediction coefficient decoding / selector 18 to be used. In the figure, 22 is a long-term prediction coefficient decoder, 23 is a long-term prediction coefficient generator for interpolation, and 24 is a changeover switch.

【００１１】この実施例の動作を説明する。端子１５か
らの欠落検出信号が音声符号化情報の欠落を指示しない
場合には、従来方式と同様な動作をする。即ち、まず端
子１５から音声符号化情報の欠落を検出していないこと
を示す欠落検出信号が接続しゃ断スイッチ２１及び切り
換えスイッチ２４に入力される。接続しゃ断スイッチ２
１を介して長時間予測残差信号復号化手段として用いら
れる逆量子化器２０から長時間予測残差信号が出力され
加算器１１へ長時間予測残差信号として入力され、一方
切り換えスイッチ２４により長時間予測係数復号器２２
の出力が長時間予測器１２へ長時間予測係数（ピッチ係
数とピッチ周期）として入力されることにより、図８と
全く同じ動作となって音声信号を再生することになる。
尚、残差信号電力復号器１９、逆量子化器２０及び長時
間予測係数復号器２２はそれぞれ残差信号電力復号器
４、逆量子化器５及び長時間予測係数復号器６と全く同
じである。The operation of this embodiment will be described. When the loss detection signal from the terminal 15 does not indicate the loss of the voice coded information, the same operation as the conventional method is performed. That is, first, a loss detection signal indicating that the loss of the voice coded information is not detected from the terminal 15 is output from the connection cutoff switch 21 and the disconnection switch.
Input to the change switch 24. Connection cutoff switch 2
1 as a means for decoding a long-term prediction residual signal.
A long-term prediction residual signal is output from the inverse quantizer 20 to be input to the adder 11 as a long-term prediction residual signal.
Is output to the long-term predictor 12 by a long-term prediction coefficient (pitch-related
By inputting as ( number and pitch period), the operation is exactly the same as in FIG. 8 and the audio signal is reproduced.
The residual signal power decoder 19, the inverse quantizer 20, and the long-term prediction coefficient decoder 22 are exactly the same as the residual signal power decoder 4, the inverse quantizer 5, and the long-term prediction coefficient decoder 6, respectively. is there.

【００１２】逆に、端子１５からの欠落検出信号が音声
符号化情報の欠落を指示した場合における欠落した音声
信号の補間方法について、以下に詳細に説明する。この
場合には、端子１には通常の符号化情報が伝送されてこ
ないことになり、接続しゃ断スイッチ２１を介して逆量
子化器２０と加算器１１とが切り放され、一方切り換え
スイッチ２４を介して補間用長時間予測係数生成器２３
と補間用長時間予測係数と短時間予測残差信号を用いて
長時間予測信号を生成する長時間予測器１２が接続され
る。尚、補間用長時間予測係数生成器２３における補間
用の長時間予測係数としては、（１）予め数値が設定さ
れおり、常に一定の値を出力する場合、もしくは（２）
長時間予測係数復号器２２から得た前フレームの長時間
予測係数に応じた補間用の長時間予測係数を出力する場
合などがある。また短時間予測係数と再生音声信号とを
用いて短時間予測信号を生成する短時間予測器１４には
前フレームの短時間予測係数をそのまま設定する。この
ようにすると、長時間合成／補間フィルタ１７は何の入
力もされずに長時間予測信号を短時間予測残差信号の補
間信号としたフィードバックループによる自己駆動の動
作をすることから、長時間予測信号であるその出力信号
を短時間予測残差信号として短時間合成フィルタ（１
０）内の加算器１３に入力することになる。そして上記
のように従来の通常の合成処理を行うことにより短時間
合成フィルタ１０の出力として再生ディジタル音声信号
が補間される。Conversely, a method of interpolating the missing audio signal when the missing detection signal from the terminal 15 indicates the lack of the audio coding information will be described in detail below. In this case, normal encoded information is not transmitted to the terminal 1, the inverse quantizer 20 and the adder 11 are cut off via the disconnection switch 21, and the switch 24 is turned off. Interpolated long-term prediction coefficient generator 23
Using long-term prediction coefficient for interpolation and short-term prediction residual signal
A long-term predictor 12 for generating a long-term prediction signal is connected. As the long-term prediction coefficient for interpolation in the interpolation long-term prediction coefficient generator 23, (1) a numerical value is set in advance and a constant value is always output, or (2)
In some cases, a long-term prediction coefficient for interpolation corresponding to the long-term prediction coefficient of the previous frame obtained from the long-term prediction coefficient decoder 22 is output. Also, the short-term prediction coefficient and the reproduced audio signal
The short-term prediction coefficient of the previous frame is set as it is in the short-term predictor 14 that generates a short- term prediction signal using the same. In this way, the long-time synthesis / interpolation filter 17 does not input anything and converts the long-term prediction signal to the short-term prediction residual signal.
Movement of self-driven by between signals and the feedback loop
Since the work, a short time synthesis filter its output signal is a long prediction signal as a short prediction residual signal (1
0) is input to the adder 13. As described above, the reproduced digital audio signal is interpolated as the output of the short-time synthesis filter 10 by performing the conventional ordinary synthesis processing.

【００１３】尚、一般に長時間予測係数にはピッチ周期
とピッチ係数がある。式（１）に、長時間予測器１２が
１タップフィルタである場合の伝達関数Ｐ（ｚ）を示
す。Ｐ（ｚ）＝ｂ・ｚ^-p （１）同式中、ｂはピッチ係数（０≦ｂ≦１）、ｐはピッチ周
期を示す。シミュレーションの結果、補間用長時間予測
係数生成器２３がピッチ周期ｐとして前フレームの値を
そのまま出力し、またピッチ係数ｂとして１を出力する
ことが、補間信号の聴感的品質上最適であった。In general, the long-term prediction coefficients include a pitch period and a pitch coefficient. Equation (1) shows the transfer function P (z) when the long-term predictor 12 is a one-tap filter. P (z) = b · z ^−p (1) In the equation, b represents a pitch coefficient (0 ≦ b ≦ 1), and p represents a pitch period. Simulation results interpolation LTP coefficient generator 23 directly outputs the value of the previous frame as the pitch period p, also be output 1 as pitch coefficient b, was optimal on perceptual quality of the interpolation signal .

【００１４】[0014]

【実施例２】実施例１の等価構成として図３に示す構成
概略図がある。尚、以下の説明では実施例１（図１）と
同一機能を持つ構成要素については同一番号を付して説
明の重複を省く。同図中、２５は切り換えスイッチ、２
６は補間用長時間合成フィルタ、２７は補間用長時間予
測器である。その他については図８に示す従来技術の復
号化構成に等しい。まず、端子１５からの欠落検出信号
が音声符号化情報の欠落を指示しない場合には、従来方
式と同様な動作をする。即ち、端子１５から音声符号化
情報の欠落を検出していないことを示す欠落検出信号が
切り換えスイッチ２５に入力される。切り換えスイッチ
２５を介して加算器１１と加算器１３が接続されること
により、図９と全く同じ動作となって音声信号を再生す
ることになる。逆に、端子１５からの欠落検出信号が音
声符号化情報の欠落を指示した場合には、切り換えスイ
ッチ２５を介して補間用長時間合成フィルタ２６（また
は補間用長時間予測器２７の出力）と加算器１３が接続
される。又、長時間予測器１２のディレイバッファの内
容が長時間予測器２７へコピーされ設定される。尚、補
間用長時間予測器２７が１タップフィルタの場合にはそ
の伝達関数は式（１）に等しく、また補間用の長時間予
測係数は実施例１と同様に設定される。このようにする
と、実施例１と同様に、何の入力もされることなくフィ
ードバックループにより自己駆動する補間用長時間合成
フィルタ２６の長時間予測信号である出力信号が短時間
予測残差信号として短時間合成フィルタ１０内の加算器
１３に入力されることになる。そして上記のように通常
の合成処理を行うことにより短時間合成フィルタ１０の
出力として再生ディジタル音声信号が補間される。尚、
補間開始以降は、必要に応じて、長時間予測器２７のデ
ィレイバッファの内容が長時間予測器１２へコピーされ
設定される。[Embodiment 2] FIG. 3 is a schematic diagram showing an equivalent configuration of the first embodiment. In the following description, components having the same functions as those of the first embodiment (FIG. 1) will be assigned the same reference numerals, and redundant description will be omitted. In the figure, 25 is a changeover switch, 2
Reference numeral 6 denotes a long-time synthesis filter for interpolation, and 27 denotes a long-time predictor for interpolation. Others are the same as the conventional decoding configuration shown in FIG. First, when the loss detection signal from the terminal 15 does not indicate the loss of the audio coded information, the same operation as the conventional method is performed. In other words, a loss detection signal indicating that no loss of the audio coded information has been detected is input from the terminal 15 to the changeover switch 25. When the adder 11 and the adder 13 are connected via the changeover switch 25, the operation is exactly the same as in FIG. 9 and the audio signal is reproduced. Conversely, when the missing detection signal from the terminal 15 indicates the lack of the voice coded information, the interpolation long-time synthesis filter 26 (or the output of the interpolation long-time predictor 27) via the changeover switch 25. The adder 13 is connected. Also, the contents of the delay buffers of the LTP 12 Ru is set are copied to the LTP 27. When the interpolation long-term predictor 27 is a one-tap filter, the transfer function is equal to the equation (1), and the interpolation long-term prediction coefficient is set as in the first embodiment. In this way, in the same manner as in Example 1, without Rukoto is any input Fi
The over-back loop output signal, a long-predicted signal for a long time for interpolation for self-driven synthesis filter 26 is a short time
The prediction residual signal is input to the adder 13 in the short-time synthesis filter 10 . Then, the reproduced digital audio signal is interpolated as the output of the short-time synthesis filter 10 by performing the normal synthesis processing as described above. still,
After the start of the interpolation, the contents of the delay buffer of the long time predictor 27 are copied to the long time predictor 12 as necessary.
Ru is set.

【００１５】[0015]

【実施例３】図４は本発明の欠落音声補間方式をピッチ
分析を持つ音声符号化方式に適用した別の実施例の概略
図である。尚、同図は、図１での長時間合成／補間フィ
ルタ１７と短時間合成フィルタ１０を入れ換えた構成と
なっている。まず、端子１５からの欠落検出信号が音声
符号化情報の欠落を指示しない場合には、従来方式と同
様な動作をする。即ち、端子１５から音声符号化情報の
欠落を検出していないことを示す欠落検出信号が接続し
ゃ断スイッチ２１及び切り換えスイッチ２４に入力され
ると、接続しゃ断スイッチ２１を介して加算器１３と加
算器１１が接続され、一方切り換えスイッチ２４により
長時間予測係数復号器２２の出力が長時間予測器１２へ
入力されることにより、図８と全く同じ動作となって音
声信号を再生することになる。逆に、端子１５からの欠
落検出信号が音声符号化情報の欠落を指示した場合に
は、端子１には通常の符号化情報が伝送されてこないこ
とになるので、接続しゃ断スイッチ２１により加算器１
３と加算器１１とが切り放され、一方長時間予測係数復
号／選択器１８内の長時間予測器１２が切り換えスイッ
チ２４を介して補間用長時間予測係数生成器２３と接続
される。尚、補間用長時間予測係数生成器２３は実施例
１と同様に動作し、短時間予測器１４には前フレームの
短時間予測係数をそのまま設定されている。このように
すると、長時間合成／補間フィルタ１７に何の入力もさ
れず、フィードバックループによる自己駆動することに
より、その出力が再生ディジタル音声信号として補間さ
れる。Embodiment 3 FIG. 4 is a schematic diagram of another embodiment in which the missing speech interpolation system of the present invention is applied to a speech encoding system having pitch analysis. In this figure, the long-time synthesis / interpolation filter 17 and the short-time synthesis filter 10 in FIG. 1 are replaced. First, when the loss detection signal from the terminal 15 does not indicate the loss of the audio coded information, the same operation as the conventional method is performed. That is, a loss detection signal indicating that the loss of the audio coded information is not detected is connected from the terminal 15.
When input to the disconnection switch 21 and the changeover switch 24, the adder 13 and the adder 11 are connected via the connection cutoff switch 21. When the signal is input to the device 12, the operation is exactly the same as that of FIG. Conversely, if the dropout detection signal from the terminal 15 instructs the missing speech coding information, since the normal encoded information is not come transmitted to the terminal 1, the adder by connecting cutoff switch 21 1
3 an adder 11 and is split off, whereas LTP coefficient decoding / selector 18 LTP 12 in is Connect the interpolation LTP coefficient generator 23 through the changeover switch 24. The interpolation long-term prediction coefficient generator 23 operates in the same manner as in the first embodiment, and the short-term prediction coefficient of the previous frame is set as it is in the short-term prediction unit 14. In this way, no input is made to the long-time synthesis / interpolation filter 17 and the output is interpolated as a reproduced digital audio signal by self-driving by the feedback loop .

【００１６】[0016]

【実施例４】実施例３の等価構成として図５に示す構成
概略図がある。尚、以下の説明では実施例３（図４）と
同一機能を持つ構成要素については同一番号を付して説
明の重複を省く。まず、端子１５からの欠落検出信号が
音声符号化情報の欠落を指示しない場合には、従来方式
と同様な動作をする。即ち、端子１５から音声符号化情
報の欠落を検出していないことを示す欠落検出信号が切
り換えスイッチ２９に入力される。切り換えスイッチ２
９を介して長時間合成フィルタ９の出力が再生音声とし
て端子１６に出力されることにより、図８と全く同じ動
作となって音声信号を再生することになる。また、逆
に、端子１５からの欠落検出信号が音声符号化情報の欠
落を指示した場合には、長時間予測器１２のディレイバ
ッファの内容が長時間予測器２７へコピーされ設定され
る。切り換えスイッチ２９を介して補間用長時間合成フ
ィルタ２６（または補間用長時間予測器２７）の出力が
補間音声として端子１６に出力される。尚、長時間予測
器２７における長時間予測係数は実施例３と同様に設定
される。また、長時間予測器２７が１タップフィルタの
場合にはその伝達関数は式（１）に等しい。このように
すると、実施例３と同様に、何の入力もされず、フィー
ドバックループにより自己駆動する補間用長時間合成フ
ィルタ２６の長時間予測信号である出力信号により再生
ディジタル音声信号が補間される。尚、補間開始以降
は、必要に応じて、長時間予測器２７のディレイバッフ
ァの内容が長時間予測器１２へコピーされ設定される。[Embodiment 4] FIG. 5 is a schematic diagram showing an equivalent configuration of the embodiment 3. As shown in FIG. In the following description, components having the same functions as those of the third embodiment (FIG. 4) are denoted by the same reference numerals, and the description will not be repeated. First, when the loss detection signal from the terminal 15 does not indicate the loss of the audio coded information, the same operation as the conventional method is performed. That is, a loss detection signal indicating that the loss of the audio coded information has not been detected is input to the switch 29 from the terminal 15. Changeover switch 2
When the output of the long-time synthesis filter 9 is output to the terminal 16 as the reproduced sound via the terminal 9, the operation is exactly the same as that of FIG. 8 and the sound signal is reproduced. Conversely, when the loss detection signal from the terminal 15 indicates the loss of the audio coding information, the contents of the delay buffer of the long time predictor 12 are copied to the long time predictor 27 and set. > The output of the interpolation long-time synthesis filter 26 (or the interpolation long-time predictor 27) is output to the terminal 16 as the interpolation sound via the changeover switch 29. The long-term prediction coefficient in the long-term predictor 27 is set in the same manner as in the third embodiment. When the long-term predictor 27 is a one-tap filter, its transfer function is equal to the equation (1). In this case, as in the third embodiment, no input is made , and
The reproduced digital audio signal is interpolated by the output signal which is the long-term prediction signal of the long- time interpolation filter for interpolation 26 which is self-driven by the feedback loop . Incidentally, interpolation start thereafter, if necessary, the contents of the delay buffers of the LTP 27 Ru is set are copied to the LTP 12.

【００１７】[0017]

【実施例５】図６は本発明の欠落音声補間方式の回路構
成を一般の音声符号化方式に適用した一実施例であり、
従来方式と比較してより簡単な回路構成で同様な効果が
得られるものである。同図中、３２は端子、３３は音声
復号化手段として用いられる復号器、３４は補間用フィ
ルタ、３５は切り換えスイッチ、３６は補間用長時間予
測手段として用いられる補間用長時間予測器、３７はピ
ッチ情報検出器、３８は端子、３９は端子である。ま
ず、端子３９からの欠落検出信号が音声符号化情報の欠
落を指示しない場合には、通常の音声信号の復号化が行
われる。即ち、端子３９から音声符号化情報の欠落を検
出していないことを示す欠落検出信号が切り換えスイッ
チ３５に入力されると、切り換えスイッチ３５により復
号器３３の出力信号が端子３８を介して再生音声信号と
して出力される。一方、復号器３３の出力は補間用長時
間予測器３６及びピッチ情報検出器３７へ入力される。
補間用長時間予測器３６では再生音声信号を一定時間蓄
積しておく。またピッチ情報検出器３７では再生音声信
号からピッチ情報（ピッチ周期及びピッチ係数）を検出
し補間用長時間予測器３６に出力する。逆に、端子３９
からの欠落検出信号が音声符号化情報の欠落を指示した
場合における欠落音声信号の補間方法について、以下に
詳細に説明する。この場合には、端子３２に音声符号化
情報が伝送されてこないことになるので、切り換えスイ
ッチ３５により補間用長時間予測器３６の出力信号が再
生音声信号の補間信号として端子３８を介して出力され
る。同時にその出力信号が再び補間用長時間予測器３６
に入力される。尚、補間用長時間予測器３６は、ピッチ
情報検出器３７において補間すべき音声信号の直前まで
の再生音声信号から求められたピッチ情報（長時間予測
係数）を用いて、その出力信号として長時間予測信号を
生成する。即ち、補間用フィルタ３４は切り換えスイッ
チ３５により復号器３３からの入力がしゃ断され、補間
用長時間予測器３６を自己駆動することにより再生音声
出力信号を生成している。ここで補間用長時間予測器３
６が１タップフィルタである場合には、その伝達関数は
式（１）に等しく、補間時に使用されるピッチ係数は１
が補間音声品質上よい。Embodiment 5 FIG. 6 shows an embodiment in which the circuit configuration of the missing speech interpolation system of the present invention is applied to a general speech encoding system.
Similar effects can be obtained with a simpler circuit configuration as compared with the conventional method. In the figure, 32 is a terminal, 33 is a voice.
A decoder used as a decoding means , 34 is an interpolation filter, 35 is a changeover switch, and 36 is a long time interpolation interpolation.
Reference numeral 37 denotes a pitch information detector, reference numeral 38 denotes a terminal, and reference numeral 39 denotes a terminal. First, when the loss detection signal from the terminal 39 does not indicate the loss of the voice coded information, the normal voice signal is decoded. That is, when a loss detection signal indicating that the loss of the audio coded information is not detected is input from the terminal 39 to the changeover switch 35, the output signal of the decoder 33 is output by the changeover switch 35 via the terminal 38 to the reproduced sound. Output as a signal. On the other hand, the output of the decoder 33 is input to the long time predictor for interpolation 36 and the pitch information detector 37.
In the interpolation long time predictor 36, the reproduced audio signal is accumulated for a certain period of time. The pitch information detector 37 detects pitch information (pitch cycle and pitch coefficient) from the reproduced audio signal and outputs it to the interpolation long time predictor 36. Conversely, terminal 39
A method of interpolating a missing audio signal when the missing detection signal from the device indicates a lack of audio encoded information will be described in detail below. In this case, since the audio coded information is not transmitted to the terminal 32, the output signal of the interpolation long time predictor 36 is output via the terminal 38 as the interpolation signal of the reproduced audio signal by the changeover switch 35. Is done. At the same time, the output signal is again supplied to the interpolation long time predictor 36.
Is input to Note that the interpolation long time predictor 36 generates the pitch information (long time prediction) obtained from the reproduced sound signal immediately before the sound signal to be interpolated by the pitch information detector 37.
) , A long-term prediction signal is generated as the output signal. That is, the interpolation filter 34 is switched by the switching switch.
The input from the decoder 33 is cut off by the switch 35 and interpolation is performed.
The self-driving of the long time predictor 36 generates a reproduced sound output signal. Here, interpolation long time predictor 3
If 6 is a one-tap filter, its transfer function is equal to equation (1), and the pitch coefficient used during interpolation is 1
Is good in terms of interpolation voice quality.

【００１８】[0018]

【実施例６】図７は本発明の欠落音声補間方式の回路構
成をＤＰＣＭやＡＤＰＣＭ等のピッチ分析を行っていな
い音声符号化方式へ適用した場合の一実施例であり、従
来方式と比較してより簡単な回路構成で同様な効果が獲
られるものである。同図中、４０は端子、４１は逆量子
化器、７は短時間予測係数復号器、４２は補間用フィル
タ、４３は切り換えスイッチ、４４は補間用長時間予測
器、４５はピッチ情報検出器、４６は短時間合成フィル
タ、４７は加算器、４８は短時間予測器、４９は端子、
５０は端子である。図７において、多重分離回路を省略
したものを示す。なお、端子４０には符号化された正規
化残差信号の情報を、端子５１には符号化された短時間
予測係数情報を入力する。まず、端子５０からの欠落検
出信号が音声符号化情報の欠落を指示しない場合には、
通常の音声信号の復号化が行われる。即ち、まず端子４
０からの音声符号化情報を入力した逆量子化器４１は残
差信号を出力する。端子５０から音声符号化情報の欠落
を検出していないことを示す欠落検出信号が切り換えス
イッチ４３に入力されると、残差信号が切り換えスイッ
チ４３を介して加算器４７に入力され、更に短時間予測
器４８の出力である短時間予測信号と足し加えられて、
再生音声信号として端子４９を介して出力される。同時
に、逆量子化器４１の出力は補間用長時間予測器４４及
びピッチ情報検出器４５へ、また加算器４７の出力は短
時間予測器４８へそれぞれ入力される。そして短時間予
測器４８では、短時間予測係数復号器７からの短時間予
測係数に基づいて短時間予測器４８の係数を推定し短時
間予測信号を生成・出力する。一方、補間用長時間予測
器４４では逆量子化器４１にて再生された短時間予測残
差信号を一定時間蓄積しておき、またピッチ情報検出器
４５では前記短時間予測残差信号からピッチ情報（ピッ
チ周期及びピッチ係数）を検出し補間用長時間予測器４
４に出力する。逆に、端子５０からの欠落検出信号が
音声符号化情報の欠落を指示した場合における欠落音声
信号の補間方法について、以下に詳細に説明する。この
場合には、端子４０に音声符号化情報が伝送されてこな
いことになるので、切り換えスイッチ４３により補間用
長時間予測器４４の出力信号が、前記の短時間予測残差
信号の補間信号として加算器４７に入力されると同時
に、再び補間用長時間予測器４４に入力され自己駆動さ
れる。尚、補間用長時間予測器４４は、ピッチ情報検出
器４５において補間すべき短時間予測残差信号の直前ま
での短時間予測残差音声信号から求められたピッチ情報
を用いて、長時間予測信号としてのその出力信号を生成
する。即ち、補間用フィルタ４２は自己駆動することに
より補償信号としての短時間予測残差信号を生成してい
る。ここで補間用長時間予測器４４が１タップフィルタ
である場合には、その伝達関数は式（１）に等しく、補
間時に使用されるピッチ係数は１が補間音声品質上よ
い。Embodiment 6 FIG. 7 shows an embodiment in which the circuit configuration of the missing voice interpolation system of the present invention is applied to a voice coding system such as DPCM or ADPCM which does not perform pitch analysis. A similar effect can be obtained with a simpler circuit configuration. In the figure, 40 is a terminal, 41 is an inverse quantizer, 7 is a short-term prediction coefficient decoder, 42 is an interpolation filter, 43 is a changeover switch, 44 is an interpolation long-time predictor, and 45 is a pitch information detector. , 46 are short-time synthesis filters, 47 is an adder, 48 is a short-time predictor, 49 is a terminal,
50 is a terminal. In FIG. 7, the demultiplexing circuit is omitted.
This is shown. It should be noted that the encoded normal
The information of the encoded residual signal is input to
Enter prediction coefficient information. First, when the loss detection signal from the terminal 50 does not indicate the loss of the audio coding information,
Normal audio signal decoding is performed. That is, first, terminal 4
The inverse quantizer 41 which has inputted the speech coding information from 0 outputs a residual signal. When a loss detection signal indicating that the loss of the voice coded information is not detected is input from the terminal 50 to the changeover switch 43, the residual signal is input to the adder 47 via the changeover switch 43, and the time is shorter. The short-term prediction signal output from the predictor 48 is added, and
The signal is output via the terminal 49 as a reproduced audio signal. At the same time, the output of the inverse quantizer 41 is input to the interpolation long time predictor 44 and the pitch information detector 45, and the output of the adder 47 is input to the short time predictor 48. Then, the short-term predictor 48 outputs the short-term prediction from the short-term prediction coefficient decoder 7.
The coefficient of the short-term predictor 48 is estimated based on the measurement coefficients, and a short-term prediction signal is generated and output. On the other hand, the interpolation long time predictor 44 accumulates the short time prediction residual signal reproduced by the inverse quantizer 41 for a certain period of time, and the pitch information detector 45 calculates the pitch from the short time prediction residual signal. Information (pitch period and pitch coefficient) is detected and a long time predictor 4 for interpolation is used.
4 is output. Conversely, a method of interpolating the missing audio signal when the missing detection signal from the terminal 50 indicates the lack of the encoded audio information will be described in detail below. In this case, since the speech coded information is not transmitted to the terminal 40, the output signal of the interpolation long time predictor 44 is changed by the changeover switch 43 as the interpolation signal of the short time prediction residual signal. a is input simultaneously <br/> to an adder 47, a self-drive is entered again interpolation LTP 44
Re that. Incidentally, interpolation LTP 44 uses pitch information obtained from the short-time prediction residual speech signal just before the short prediction residual signal to be interpolated at pitch information detector 45, LTP Generate its output signal as a signal. That is, the interpolation filter 42 generates a short-time prediction residual signal as a compensation signal by self-driving. Here, when the interpolation long time predictor 44 is a one-tap filter, its transfer function is equal to the equation (1), and the pitch coefficient used at the time of interpolation is 1 which is good in terms of interpolation voice quality.

【００１９】[0019]

【発明の効果】以上詳細に説明したように本発明は、送
信側で入力された音声信号を符号化し音声符号化情報と
して受信側へ伝送し、該受信側で該音声符号化情報を離
散的に受信し、該音声符号化情報を受信しない時間区間
には生成した補間信号を出力する欠落音声補間装置にお
いて、（１）音声符号化情報分離手段（２），残差信号復号
化手段（１９，２０），長時間予測係数復号／選択手段
（１８），短時間予測係数復号化手段（７），長時間合
成／補間フィルタ手段（１７），短時間合成フィルタ手
段（１０），端子（１５）を備える欠落音声補間装置で
あって、音声符号化情報分離手段（２）は、入力される
音声符号化情報を長時間予測係数情報，短時間予測係数
情報，残差信号量子化情報に分離し、残差信号復号化手
段（１９，２０）は、残差信号量子化情報を復号化して
残差信号を出力し、長時間予測係数復号／選択手段（１
８）は、長時間予測係数情報を復号化して得られる長時
間予測係数を長時間合成／補間フィルタ手段（１７）に
出力するとともに、端子（１５）からの欠落検出信号を
検出した場合には、過去の長時間予測係数から得た補間
用長時間予測係数を長時間予測係数として長時間合成／
補間フィルタ手段（１７）に出力し、短時間予測係数復
号化手段（７）は、短時間予測係数情報を復号して得ら
れる短時間予測係数を出力し、長時間合成／補間フィル
タ手段（１７）は、切り換え手段（２１），長時間合成
／フィルタ（１１，１２）を備え、切り換え手段（２
１）は、端子（１５）からの欠落検出信号の検出の有無
に対応して残差信号復号化手段（１９，２０）と長時間
合成／フィルタ（１１，１２）との相互間を切断又は接
続し、長時間合成／フィルタ（１１，１２）は、フィー
ドバック状に接続された長時間予測手段（１２）からな
り、切り換え手段（２１）からの残差信号を入力とし、
長時間予測係数を用いて短時間予測残差信号を合成して
出力し、短時間合成フィルタ手段（１０）は、短時間予
測係数と短時間予測残差信号とから再生音声信号を合成
し出力するように構成するか、もしくは、（２）音声符号化情報分離手段（２），残差信号復号
化手段（４，５），短時間予測係数復号化手段（７），
長時間予測係数復号／選択手段（１８），短時間合成フ
ィルタ手段（１０），長時間合成／補間フィルタ手段
（１７），端子（１５）を備える欠落音声補間装置であ
って、音声符号化情報分離手段（２）は、入力される音
声符号化情報を長時間予測係数情報，短時間予測係数情
報，残差信号量子化情報に分離し、残差信号復号化手段
（４，５）は、残差信号量子化情報を復号化して残差信
号を出力し、長時間予測係数復号／選択手段（１８）
は、長時間予測係数情報を復号化して得られる長時間予
測係数を長時間合成／補間フィルタ手段（１７）に出力
するとともに、端子（１５）からの欠落検出信号を検出
した場合には、過去の長時間予測係数から得た補間用長
時間予測係数を長時間予測係数として長時間合成／補間
フィルタ手段（１７）に出力し、短時間予測係数復号化
手段（７）は、短時間予測係数情報を復号して得られる
短時間予測係数を出力し、短時間合成フィルタ手段（１
０）は、短時間予測係数と残差信号とから長時間予測残
差信号を合成して出力し、長時間合成／補間フィルタ手
段（１７）は、切り換え手段（２１），長時間合成／フ
ィルタ（１１，１２）を備え、切り換え手段（２１）
は、端子（１５）からの欠落検出信号の検出の有無に対
応して短時間合成フィルタ手段（１０）と長時間合成／
フィルタ（１１，１２）との相互間を切断又は接続し、
長時間合成／フィルタ（１１，１２）は、フィードバッ
ク状に接続された長時間予測手段（１２）からなり、切
り換え手段（２１）からの長時間予測残差信号を入力と
し、長時間予測係数を用いて再生音声信号を合成して出
力するように構成することにより、従来方式に比べて欠
落音声信号の補間のために必要な回路の付加が殆どない
にも関わらず格段に良好な補間信号の品質を実現するこ
とができる。更に、本発明は、ＡＰＣ方式のみならずフ
レーム単位に符号化処理を行っている殆ど全ての音声符
号化方式にも適用可能であり、その効果は極めて大であ
る。また、（３）音声復号化手段（３３），補間用フィルタ手段
（３４），ピッチ情報検出手段（３７），端子（３９）
を備える欠落音声補間装置であって、音声復号化手段
（３３）は、入力される音声符号化情報を復号化して得
られる再生音声信号を出力し、補間用フィルタ手段（３
４）は、切り換え手段（３５），補間用長時間予測手段
（３６）を備え、切り換え手段（３５）の出力を補間再
生音声信号として出力し、切り換え手段（３５）は、端
子（３９）からの欠落検出信号を検出しない場合には、
音声復号化手段（３３）からの再生音声信号を補間再生
音声信号として出力し、欠落検出信号を検出した場合に
は、補間用長時間予測手段（３６）からの長時間予測信
号を補間再生音声信号として出力し、補間用長時間予測
手段（３６）は、ピッチ情報検出手段（３７）からの長
時間予測係数に基づいて切り換え手段（３５）の出力す
る補間再生音声信号から長時間予測信号を合成して出力
し、ピッチ情報検出手段（３７）は、補間再生音声信号
から長時間予測係数を抽出して補間用長時間予測手段
（３６）に出力するように構成することにより、従来方
式に比べて欠落音声信号の補間のために必要な付加回路
が殆どないにも関わらず同じ程度の補間信号の品質を実
現でき、また如何なる音声符号化方式にも適用可能であ
り、その効果は極めて大である。As described above in detail, according to the present invention, a speech signal input on the transmission side is encoded and transmitted as speech encoded information to the receiving side, and the speech encoded information is discretely encoded on the receiving side. And a missing speech interpolator for outputting a generated interpolation signal in a time interval in which the speech coded information is not received.
Means (19, 20), long-term prediction coefficient decoding / selection means
(18), short-term prediction coefficient decoding means (7),
Synthesis / interpolation filter means (17), short-time synthesis filter
A missing voice interpolation device having a stage (10) and a terminal (15)
Then, the speech coded information separating means (2) is inputted.
Speech coding information is long-term prediction coefficient information, short-term prediction coefficient
Information and residual signal quantization information, and the residual signal
Stages (19, 20) decode the residual signal quantization information and
The residual signal is output, and the long-term prediction coefficient decoding / selection means (1
8) is a long time obtained by decoding the long-term prediction coefficient information
Inter prediction coefficient to long-time synthesis / interpolation filter means (17)
Output and the missing detection signal from the terminal (15)
If detected, the interpolation obtained from past long-term prediction coefficients
Long-term prediction coefficient as long-term prediction coefficient
Outputs the interpolation filter means (17), a short time prediction coefficient restored
The decoding means (7) decodes the short-term prediction coefficient information to obtain
And outputs a short time prediction coefficient that is, for a long time synthesis / interpolation fill
Switching means (21), long-time synthesis
/ Filters (11, 12) and switching means (2
1) The presence or absence of detection of a missing detection signal from the terminal (15)
And the residual signal decoding means (19, 20)
Cutting / connecting with the synthesis / filter (11, 12)
The long time synthesis / filters (11, 12)
The long-term prediction means (12) connected in a
Input the residual signal from the switching means (21),
Combining short-term prediction residual signals using long-term prediction coefficients
The short-time synthesis filter means (10) outputs
Synthesizes reproduced audio signal from measurement coefficient and short-term prediction residual signal
Or configured to output, or, (2) the speech encoded information separation means (2), the residual signal decoding
Decoding means (4, 5), short-term prediction coefficient decoding means (7),
Long-term prediction coefficient decoding / selection means (18), short-time synthesis
Filter means (10), long-time synthesis / interpolation filter means
(17) A missing voice interpolation device having a terminal (15).
Thus, the speech coded information separating means (2)
The voice coded information is converted to long-term prediction coefficient information and short-term prediction coefficient information.
Information and residual signal quantization information, and the residual signal decoding means
(4, 5) decodes the residual signal quantization information to obtain the residual signal.
And a long-term prediction coefficient decoding / selection means (18)
Is the long-term prediction obtained by decoding the long-term prediction coefficient information.
Outputs measurement coefficients to long-time synthesis / interpolation filter means (17)
And also detects the missing detection signal from the terminal (15).
The interpolation length obtained from past long-term prediction coefficients.
Long-term synthesis / interpolation using long-term prediction coefficients as long-term prediction coefficients
Output to the filter means (17) and decode the short-term prediction coefficients
The means (7) is obtained by decoding the short-term prediction coefficient information.
The short-term prediction coefficient is output, and the short-time synthesis filter means (1
0) is the long-term prediction residual from the short-term prediction coefficient and the residual signal.
The difference signal is synthesized and output, and a long time synthesis / interpolation filter
The stage (17) comprises a switching means (21), a long-time synthesis /
Filter (11, 12), switching means (21)
Indicates whether or not the missing detection signal is detected from the terminal (15).
In response, the short-time synthesis filter means (10) and the long-time synthesis /
Disconnecting or connecting between the filters (11, 12),
The long-time synthesis / filters (11, 12)
It consists of long-term prediction means (12) connected in a
The long-term prediction residual signal from the switching means (21) is input and
And synthesizes the reproduced audio signal using the long-term prediction coefficient.
With such a configuration, significantly better quality of the interpolated signal can be realized as compared with the conventional method, even though there is almost no additional circuit required for interpolation of the missing audio signal. Further, the present invention is applicable not only to the APC system but also to almost all audio coding systems that perform encoding processing in units of frames, and the effect is extremely large. (3) speech decoding means (33), interpolation filter means
(34), pitch information detecting means (37), terminal (39)
A lost audio interpolation apparatus comprising a speech decoding means
(33) is obtained by decoding the input speech encoded information.
The reproduced audio signal is output and the interpolation filter means (3
4) is a switching means (35), interpolation long time predicting means
(36), and the output of the switching means (35) is interpolated again.
The signal is output as a raw audio signal, and the switching means (35)
When the missing detection signal from the child (39) is not detected,
Interpolation reproduction of the reproduced audio signal from the audio decoding means (33)
Output as an audio signal, and when a missing detection signal is detected
Is a long-term prediction signal from the interpolation long-term prediction means (36).
Signal as an interpolated playback audio signal, and a long time prediction for interpolation
The means (36) is provided with a length information from the pitch information detecting means (37).
The output of the switching means (35) is output based on the time prediction coefficient.
Synthesizes and outputs a long-term prediction signal from the interpolated playback audio signal
The pitch information detecting means (37) outputs the interpolated reproduced audio signal
Long-term prediction means for interpolation by extracting long-term prediction coefficients from
By providing the output to (36), the same level of interpolated signal quality can be realized despite the fact that there is almost no additional circuit required for interpolating missing audio signals as compared with the conventional method. The present invention is also applicable to a speech encoding method, and the effect is extremely large.

【図面の簡単な説明】[Brief description of the drawings]

【図１】ピッチ分析を持つ音声符号化方式の復号器へ本
発明の欠落音声補間装置を適用した１つの実施例の構成
概略図である。FIG. 1 is a schematic diagram of a configuration of an embodiment in which a missing speech interpolation device of the present invention is applied to a speech coding type decoder having pitch analysis.

【図２】図１中の長時間予測係数復号／選択器１８の構
成概略図である。FIG. 2 is a schematic diagram of a configuration of a long-term prediction coefficient decoding / selector 18 in FIG.

【図３】図１の等価構成概略図である。FIG. 3 is a schematic diagram of an equivalent configuration of FIG. 1;

【図４】ピッチ分析を持つ音声符号化方式の復号器へ本
発明の欠落音声補間装置を適用したもう１つの実施例の
構成概略図である。FIG. 4 is a schematic structural diagram of another embodiment in which the missing speech interpolation device of the present invention is applied to a speech coding type decoder having pitch analysis.

【図５】図４の等価構成概略図である。FIG. 5 is a schematic diagram of an equivalent configuration of FIG. 4;

【図６】本発明の欠落音声補間装置を一般の音声符号化
方式の復号器に適用した１つの実施例の構成概略図であ
る。FIG. 6 is a schematic structural diagram of one embodiment in which the missing speech interpolation device of the present invention is applied to a decoder of a general speech coding system.

【図７】本発明の欠落音声補間装置をＤＰＣＭやＡＤＰ
ＣＭへ適用した１つの実施例の構成概略図である。FIG. 7 shows a missing voice interpolation device of the present invention using DPCM or ADP.
It is a schematic diagram of a configuration of one embodiment applied to a CM.

【図８】ＡＰＣ符号化方式の復号器における従来の欠落
音声補間方式の構成概略図である。FIG. 8 is a schematic configuration diagram of a conventional missing voice interpolation system in a decoder using the APC encoding system.

【符号の説明】[Explanation of symbols]

１端子２多重分離回路３雑音発生器４残差信号電力復号器５逆量子化器６長時間予測係数復号器７短時間予測係数復号器８切り換えスイッチ９長時間合成フィルタ１０短時間合成フィルタ１１加算器１２長時間予測器１３加算器１４短時間予測器１５端子１６端子１７長時間合成／補間フィルタ１８長時間予測係数復号／選択器１９残差信号電力復号器２０逆量子化器２１接続しゃ断スイッチ２２長時間予測係数復号器２３補間用長時間予測係数生成器２４切り換えスイッチ２５切り換えスイッチ２６補間用長時間合成フィルタ２７補間用長時間予測器２９切り換えスイッチ３２端子３３復号器３４補間用フィルタ３５切り換えスイッチ３６補間用長時間予測器３７ピッチ情報検出器３８端子３９端子４０端子４１逆量子化器４２補間用フィルタ４３切り換えスイッチ４４補間用長時間予測器４５ピッチ情報検出器４６短時間合成フィルタ４７加算器４８短時間予測器４９端子５０端子５１端子Reference Signs List 1 terminal 2 demultiplexing circuit 3 noise generator 4 residual signal power decoder 5 inverse quantizer 6 long-term prediction coefficient decoder 7 short-time prediction coefficient decoder 8 changeover switch 9 long-time synthesis filter 10 short-time synthesis filter 11 Adder 12 Long time predictor 13 Adder 14 Short time predictor 15 Terminal 16 Terminal 17 Long time synthesis / interpolation filter 18 Long time prediction coefficient decoding / selector 19 Residual signal power decoder 20 Inverse quantizer 21 Disconnection Switch 22 Long-term prediction coefficient decoder 23 Long-term prediction coefficient generator for interpolation 24 Changeover switch 25 Changeover switch 26 Long-time synthesis filter for interpolation 27 Long-time prediction unit 29 for interpolation 29 Switch 32 Terminal 33 Decoder 34 Interpolation filter 35 Changeover switch 36 Long-time predictor for interpolation 37 Pitch information detector 38 Terminal 39 terminal 40 terminal 41 inverse quantizer 42 interpolation filter 43 changeover switch 44 interpolation long time predictor 45 pitch information detector 46 short time synthesis filter 47 adder 48 short time predictor 49 terminal 50 terminal 51 terminal

───────────────────────────────────────────────────── フロントページの続き (56)参考文献特開平２−1661（ＪＰ，Ａ) 特開平３−245199（ＪＰ，Ａ) 特開平３−51900（ＪＰ，Ａ) 特開平１−248200（ＪＰ，Ａ) 特開昭60−67999（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁶，ＤＢ名) G10L 9/14 G10L 9/18 ────────────────────────────────────────────────── ─── Continuation of the front page (56) References JP-A-2-1661 (JP, A) JP-A-3-245199 (JP, A) JP-A-3-51900 (JP, A) JP-A-1- 248200 (JP, A) JP-A-60-67999 (JP, A) (58) Fields investigated (Int. Cl. ⁶ , DB name) G10L 9/14 G10L 9/18

Claims

(57)【特許請求の範囲】(57) [Claims]

【請求項１】音声符号化情報分離手段（２），残差信
号復号化手段（１９，２０），長時間予測係数復号／選
択手段（１８），短時間予測係数復号化手段（７），長
時間合成／補間フィルタ手段（１７），短時間合成フィ
ルタ手段（１０），端子（１５）を備える欠落音声補間
装置であって、音声符号化情報分離手段（２）は、入力される音声符号
化情報を長時間予測係数情報，短時間予測係数情報，残
差信号量子化情報に分離し、残差信号復号化手段（１９，２０）は、残差信号量子化
情報を復号化して残差信号を出力し、長時間予測係数復号／選択手段（１８）は、長時間予測
係数情報を復号化して得られる長時間予測係数を長時間
合成／補間フィルタ手段（１７）に出力するとともに、
端子（１５）からの欠落検出信号を検出した場合には、
過去の長時間予測係数から得た補間用長時間予測係数を
長時間予測係数として長時間合成／補間フィルタ手段
（１７）に出力し、短時間予測係数復号化手段（７）は、短時間予測係数情
報を復号して得られる短時間予測係数を出力し、長時間合成／補間フィルタ手段（１７）は、切り換え手
段（２１），長時間合成／フィルタ（１１，１２）を備
え、切り換え手段（２１）は、端子（１５）からの欠落検出
信号の検出の有無に対応して残差信号復号化手段（１
９，２０）と長時間合成／フィルタ（１１，１２）との
相互間を切断又は接続し、長時間合成／フィルタ（１１，１２）は、フィードバッ
ク状に接続された長時間予測手段（１２）からなり、切
り換え手段（２１）からの残差信号を入力とし、長時間
予測係数を用いて短時間予測残差信号を合成して出力
し、短時間合成フィルタ手段（１０）は、短時間予測係数と
短時間予測残差信号とから再生音声信号を合成し出力す
る欠落音声補間装置。1. A speech coded information separating means (2), a residual signal
Decoding means (19, 20), long-term prediction coefficient decoding / selection
Selection means (18), short-term prediction coefficient decoding means (7), length
Time synthesis / interpolation filter means (17), short-time synthesis filter
Missing voice interpolation with filter means (10) and terminal (15)
A speech encoding information separating unit (2) for receiving an inputted speech code;
The long-term prediction coefficient information, short-term prediction coefficient information,
The signal is separated into quantized difference signal information, and the residual signal decoding means (19, 20) performs quantization of the residual signal.
The long-term prediction coefficient decoding / selection means (18) decodes the information and outputs a residual signal,
Long-term prediction coefficients obtained by decoding coefficient information
Output to the synthesis / interpolation filter means (17),
When a missing detection signal from the terminal (15) is detected,
Interpolated long-term prediction coefficients obtained from past long-term prediction coefficients
Long-term synthesis / interpolation filter means as long-term prediction coefficients
(17), and the short-term prediction coefficient decoding means (7) outputs the short-term prediction coefficient information.
The long-term synthesis / interpolation filter means (17) outputs a short-term prediction coefficient obtained by decoding the report.
Equipped with a stage (21) and a long-time synthesis / filter (11, 12)
The switching means (21) detects missing from the terminal (15).
Residual signal decoding means (1
9, 20) and long-time synthesis / filter (11, 12)
Disconnect or connect each other, and the long-time synthesis / filters (11, 12) provide feedback.
It consists of long-term prediction means (12) connected in a
Receiving the residual signal from the switching means (21) for a long time
Combining and outputting short-term prediction residual signals using prediction coefficients
And the short-time synthesis filter means (10)
Synthesizes and outputs the reproduced audio signal from the short-term prediction residual signal
Lost audio interpolation apparatus that.

【請求項２】音声符号化情報分離手段（２），残差信
号復号化手段（４，５），短時間予測係数復号化手段
（７），長時間予測係数復号／選択手段（１８），短時
間合成フィルタ手段（１０），長時間合成／補間フィル
タ手段（１７），端子（１５）を備える欠落音声補間装
置であって、音声符号化情報分離手段（２）は、入力される音声符号
化情報を長時間予測係数情報，短時間予測係数情報，残
差信号量子化情報に分離し、残差信号復号化手段（４，５）は、残差信号量子化情報
を復号化して残差信号を出力し、長時間予測係数復号／選択手段（１８）は、長時間予測
係数情報を復号化して得られる長時間予測係数を長時間
合成／補間フィルタ手段（１７）に出力するとともに、
端子（１５）からの欠落検出信号を検出した場合には、
過去の長時間予測係数から得た補間用長時間予測係数を
長時間予測係数として長時間合成／補間フィルタ手段
（１７）に出力し、短時間予測係数復号化手段（７）は、短時間予測係数情
報を復号して得られる短時間予測係数を出力し、短時間合成フィルタ手段（１０）は、短時間予測係数と
残差信号とから長時間予測残差信号を合成して出力し、長時間合成／補間フィルタ手段（１７）は、切り換え手
段（２１），長時間合成／フィルタ（１１，１２）を備
え、切り換え手段（２１）は、端子（１５）からの欠落検出
信号の検出の有無に対応して短時間合成フィルタ手段
（１０）と長時間合成／フィルタ（１１，１２）との相
互間を切断又は接続し、長時間合成／フィルタ（１１，１２）は、フィードバッ
ク状に接続された長時間予測手段（１２）からなり、切
り換え手段（２１）からの長時間予測残差信号を入力と
し、長時間予測係数を用いて再生音声信号を合成して出
力する欠落音声補間装置。2. A speech coded information separating means (2), a residual signal
Decoding means (4, 5), short-term prediction coefficient decoding means
(7), long-term prediction coefficient decoding / selection means (18 ), short time
Intermediate synthesis filter means (10), long time synthesis / interpolation filter
Missing voice interpolation device provided with data means (17) and terminal (15)
The encoded speech information separating means (2)
The long-term prediction coefficient information, short-term prediction coefficient information,
The residual signal quantization means (4, 5) separates the residual signal quantized information into quantized residual signal information.
And outputs a residual signal, and the long-term prediction coefficient decoding / selecting means (18)
Long-term prediction coefficients obtained by decoding coefficient information
Output to the synthesis / interpolation filter means (17),
When a missing detection signal from the terminal (15) is detected,
Interpolated long-term prediction coefficients obtained from past long-term prediction coefficients
Long-term synthesis / interpolation filter means as long-term prediction coefficients
(17), and the short-term prediction coefficient decoding means (7) outputs the short-term prediction coefficient information.
And outputs a short- term prediction coefficient obtained by decoding the information, and the short-time synthesis filter means (10) outputs the short-term prediction coefficient
A long-time prediction residual signal is synthesized from the residual signal and output, and the long-time synthesis / interpolation filter means (17) performs a switching operation.
Equipped with a stage (21) and a long-time synthesis / filter (11, 12)
The switching means (21) detects missing from the terminal (15).
Short-time synthesis filter means corresponding to the presence or absence of signal detection
Phase of (10) and long-time synthesis / filter (11, 12)
Disconnect or connect each other, and the long-time synthesis / filter (11, 12)
It consists of long-term prediction means (12) connected in a
The long-term prediction residual signal from the switching means (21) is input and
And synthesizes the reproduced audio signal using the long-term prediction coefficient.
A missing speech interpolator to power .

【請求項３】音声復号化手段（３３），補間用フィル
タ手段（３４），ピッチ情報検出手段（３７），端子
（３９）を備える欠落音声補間装置であって、音声復号化手段（３３）は、入力される音声符号化情報
を復号化して得られる再生音声信号を出力し、補間用フィルタ手段（３４）は、切り換え手段（３
５），補間用長時間予測手段（３６）を備え、切り換え
手段（３５）の出力を補間再生音声信号として出力し、切り換え手段（３５）は、端子（３９）からの欠落検出
信号を検出しない場合には、音声復号化手段（３３）か
らの再生音声信号を補間再生音声信号として出力し、欠
落検出信号を検出した場合には、補間用長時間予測手段
（３６）からの長時間予測信号を補間再生音声信号とし
て出力し、補間用長時間予測手段（３６）は、ピッチ情報検出手段
（３７）からの長時間予測係数に基づいて切り換え手段
（３５）の出力する補間再生音声信号から長時間予測信
号を合成して出力し、ピッチ情報検出手段（３７）は、補間再生音声信号から
長時間予測係数を抽出して補間用長時間予測手段（３
６）に出力する欠落音声補間装置。3. A speech decoding means (33), an interpolation filter
(34), pitch information detecting means (37), terminal
(39), wherein the speech decoding means (33) comprises:
The interpolation filter means (34) outputs a reproduced audio signal obtained by decoding the
5), provided with a long interpolation prediction hand stage (36), switching
The output of the means (35) is output as an interpolated reproduction audio signal, and the switching means (35) detects the loss from the terminal (39).
If no signal is detected, the voice decoding means (33)
These playback audio signals are output as interpolation playback audio signals,
If a falling detection signal is detected, the interpolation long time prediction means
The long-term prediction signal from (36) is used as the interpolated reproduction audio signal.
The interpolation long time predicting means (36) comprises a pitch information detecting means.
Switching means based on the long-term prediction coefficient from (37)
A long-term prediction signal from the interpolated reproduction audio signal output from (35)
The pitch information detecting means (37) outputs the synthesized signal from the interpolated and reproduced audio signal.
A long-term prediction coefficient is extracted and a long-term prediction means for interpolation (3
6) A missing voice interpolation device for outputting to (6) .