JP3223966B2

JP3223966B2 - Audio encoding / decoding device

Info

Publication number: JP3223966B2
Application number: JP20989998A
Authority: JP
Inventors: 聖子田中
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1997-07-25
Filing date: 1998-07-24
Publication date: 2001-10-29
Anticipated expiration: 2018-07-24
Also published as: JPH1198090A

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、主として移動体通
信分野のＰＤＣ（ＰｅｒｓｏｎａｌＤｉｇｉｔａｌ
ＣｅｌｌｕｌａｒＴｅｌｅｃｏｍｍｕｎｉｃａｔｉｏ
ｎＳｙｓｔｅｍｓ）における無線ディジタル伝送技術
に適用されると共に、送信者による有声又は無声を判別
するＶＯＸ（ＶｏｉｃｅＯｐｅｒａｔｅｄＴｒａｎ
ｓｍｉｔｔｅｒ）機能を有し、且つ有声・無声に応じて
背景雑音を生成する音声符号化／復号化装置に関する。The present invention relates to a PDC (Personal Digital) mainly in the field of mobile communication.
Cellular Telecommunication
VOX (Voice Operated Tran) that is applied to a wireless digital transmission technology in N Systems and that determines whether a sender is voiced or unvoiced.
The present invention relates to a speech encoding / decoding device having a function of generating a background noise according to voiced / unvoiced.

【０００２】[0002]

【従来の技術】従来、この種の音声符号化／復号化装置
に関連する技術としては、例えば図９に示されるような
音声符号化装置と、図１０に示されるような音声復号化
装置とによる構成のものが挙げられる。2. Description of the Related Art Conventionally, techniques related to this type of speech encoding / decoding device include, for example, a speech encoding device as shown in FIG. 9 and a speech decoding device as shown in FIG. The configuration according to

【０００３】ここでの音声符号化装置では、有声・無声
判別部１で入力音声信号から有声・無声区間を判別して
有声・無声に応じた識別制御信号をスイッチ制御部７へ
送出すると共に、この有声・無声の判別如何に拘らずＬ
ＰＣ分析部２で入力音声信号から合成フィルタのフィル
タ係数を算出することでＬＰＣパラメータを取得すると
共に、ＬＰＣパラメータをＬＳＰパラメータに換算し、
ＬＰＣ分析部２で算出されるフィルタ係数の合成フィル
タとＬＰＣパラメータを符号化するＬＰＣ用コードブッ
ク及びその他の音声特性パラメータを符号化する励振コ
ードブックとによる２種のコードブックを含むコードブ
ック照合処理部とを有する高能率符号化処理部４によっ
て、ＬＰＣパラメータに基づいた符号化処理として、フ
ィルタ係数をＬＰＣ用コードブックのパラメータで符号
化した後、合成フィルタで励振コードブックのベクトル
をフィルタリングして入力音声信号との差を最小にする
ように励振コードブックを検索するコードブック照合を
行って得られる符号化音声信号をスイッチ制御部７へ送
出している。又、ユニークワード発生部６は有声・無声
を識別したユニークワード制御信号として、有声の場合
にはプリアンブル信号，無声の場合にはポストアンブル
信号を送出する。スイッチ制御部７では、識別制御信号
による有声・無声の識別に応じて有声の場合にプリアン
ブル信号と符号化音声信号とを出力符号化信号として随
時送出し、無声の場合にポストアンブル信号と背景雑音
とを出力符号化信号として送出する。但し、背景雑音は
ＶＯＸ機能により一定の間隔で復号側に送出される。In the speech coding apparatus, a voiced / unvoiced discriminating section 1 discriminates a voiced / unvoiced section from an input speech signal and sends a discrimination control signal corresponding to voiced / unvoiced to a switch control section 7. Regardless of this voiced / voiceless discrimination, L
The PC analysis unit 2 obtains LPC parameters by calculating filter coefficients of a synthesis filter from the input audio signal, and converts the LPC parameters into LSP parameters.
Codebook collation processing including two types of codebooks including a synthesis filter of filter coefficients calculated by the LPC analysis unit 2, an LPC codebook for encoding LPC parameters, and an excitation codebook for encoding other audio characteristic parameters. The coding process based on the LPC parameters is performed by the high-efficiency coding processing unit 4 having the filter unit, the filter coefficients are coded by the parameters of the LPC codebook, and then the excitation codebook vector is filtered by the synthesis filter. An encoded audio signal obtained by performing codebook collation for searching an excitation codebook so as to minimize the difference from the input audio signal is transmitted to the switch control unit 7. The unique word generator 6 sends out a preamble signal in the case of voiced and a postamble signal in the case of unvoiced, as a unique word control signal identifying voiced / unvoiced. The switch control unit 7 sends out a preamble signal and a coded voice signal as output coded signals as needed according to the discrimination of voiced / unvoiced by the discrimination control signal, and outputs a postamble signal and background noise in the case of unvoiced. Are transmitted as an output coded signal. However, the background noise is sent to the decoding side at regular intervals by the VOX function.

【０００４】因みに、ここでの「有声」は話者が声を発
していることを示し、「無声」は話者が声を発しておら
ずに入力音源が背景雑音のみとなることを示す。[0004] Incidentally, "voiced" here indicates that the speaker is uttering voice, and "unvoiced" indicates that the speaker is not uttering voice and the input sound source is only background noise.

【０００５】一方、音声復号化装置では、音声符号化装
置からの出力符号化信号を入力符号化信号として入力す
るもので、入力符号化信号に対してユニークワード発生
部６から送出されるユニークワード制御信号に基づいて
有声・無声判別部８において有声・無声の識別を行う。
有声の場合にはＬＰＣ復号部１０でＬＰＣパラメータの
復号処理を行った後、高能率復号化処理部１１で復号化
処理を行って復号化音声信号を得る。無声の場合には背
景雑音更新部９で背景雑音を蓄積し、次の背景雑音が送
られるまで同じ背景雑音を繰り返して更新生成するが、
更新生成された背景雑音はＬＰＣ復号部１０へ送出さ
れ、ＬＰＣ復号部１０でＬＰＣパラメータを算出した後
に高能率復号化処理部１１によって復号化処理を行う。On the other hand, in the speech decoding apparatus, an output coded signal from the speech coding apparatus is input as an input coded signal, and a unique word generated from a unique word generator 6 is transmitted to the input coded signal. Based on the control signal, the voiced / unvoiced discriminating section 8 discriminates between voiced and unvoiced.
In the case of voiced, the LPC decoding unit 10 performs LPC parameter decoding processing, and then the high-efficiency decoding processing unit 11 performs decoding processing to obtain a decoded audio signal. In the case of unvoiced, the background noise is accumulated in the background noise updating unit 9 and the same background noise is repeatedly updated and generated until the next background noise is sent.
The updated and generated background noise is sent to the LPC decoding unit 10, and the LPC decoding unit 10 calculates the LPC parameters, and then performs the decoding process by the high-efficiency decoding unit 11.

【０００６】即ち、このような音声符号化／復号化装置
において、無声時に生成される背景雑音は、消費電力低
減化を計るためのＶＯＸ機能に基づいて一定の間隔を持
って符号化装置側から復号化装置側へ送信される。復号
化装置側では次の背景雑音が送信されるまで同じ背景雑
音を繰り返して使用する。音声信号は、一般に有声と無
声とに大別され、それぞれの信号を符号化，復号化する
際には各々の特性を考慮に入れながら省電力化，回線の
有効利用等の諸条件を満たしつつ、如何に高品質な音声
を話者に対して提供できるかが問題となっている。現在
使用されている無線ディジタル通信における背景雑音生
成には、送信者の音声を有声・無声の識別するＶＯＸ機
能が採用されている。背景雑音は、無声時の場合にのみ
符号化装置側で全ての入力音声信号のフレームに対して
生成されるが、復号化装置側には任意の一定時間の間隔
で送信される。That is, in such a speech encoding / decoding device, background noise generated during unvoiced speech is transmitted from the encoding device side at regular intervals based on a VOX function for reducing power consumption. It is transmitted to the decoding device side. The decoding apparatus repeatedly uses the same background noise until the next background noise is transmitted. Speech signals are generally classified into voiced and unvoiced, and when encoding and decoding each signal, various conditions such as power saving and effective use of a line are satisfied while taking into account the characteristics of each signal. The problem is how to provide high quality speech to the speaker. Background noise generation in currently used wireless digital communication employs a VOX function for discriminating voice of a sender between voiced and unvoiced. The background noise is generated on the encoding device side for all input speech signal frames only when there is no voice, but is transmitted to the decoding device side at an arbitrary fixed time interval.

【０００７】尚、こうした音声符号化や復号化に関連す
るその他の周知技術としては、例えば特開平７−１１５
４０３号公報に開示された無音区間情報の符号化及び復
号化回路，特開平７−３３４１９７号公報及び特開平８
−１３９６８８号公報に開示された音声符号化装置が挙
げられる。[0007] Other well-known techniques related to such audio coding and decoding include, for example, Japanese Patent Application Laid-Open No. 7-115.
No. 403, Japanese Patent Laid-Open No. 7-334197 and Japanese Patent Laid-Open No. 8
No. 139688 discloses a speech encoding device.

【０００８】ところで、ＶＯＸ機能において、背景雑音
が或る周期で送信されて滑らかに変化しないため、背景
雑音の違和感を低減するために、音声信号の振幅や電力
に着目して背景雑音を生成する方法や、有声・無声の切
り替わりのフレームに着目して時系列上で平滑や補間処
理を施して背景雑音を生成する方法が提案されている。
このような主旨に関連する周知技術としては、特願平７
−５７７０２号で提案された音声符号化・復号化方法，
特願平７−１６５７３６号並びに特願平８−７２０８９
号で提案された音声復号化装置，特願平７−１７３８８
６号で提案された音声復号装置，特願平８−７６１９５
号で提案された雑音抑制処理機能を備えた音声符号化装
置，特願平９−８５８９号で提案された非活性音声の効
率的符号化の為の方法及び装置等（作成者；注／これら
の公知出願が上記の各公報と重複していないか否かを御
確認下さい。これらの公知出願の公開番号がお判りであ
れば上記のように公開番号で記す方が良いと思われま
す）が挙げられる。In the VOX function, since background noise is transmitted at a certain period and does not change smoothly, background noise is generated by paying attention to the amplitude and power of the audio signal in order to reduce the discomfort of the background noise. A method and a method of generating background noise by performing smoothing or interpolation processing on a time series by focusing on a voiced / unvoiced switching frame have been proposed.
As a well-known technology related to such a purpose, Japanese Patent Application No.
Speech encoding / decoding method proposed in -57702,
Japanese Patent Application No. 7-165736 and Japanese Patent Application No. 8-72089
Decoder proposed in Japanese Patent Application No. Hei 7-17388
Speech decoding device proposed in No. 6, Japanese Patent Application No. 8-76195.
Coding apparatus with a noise suppression processing function proposed in Japanese Patent Application Laid-Open No. 9-8589, a method and apparatus for efficient coding of inactive voice proposed in Japanese Patent Application No. 9-8589 (author; Please make sure that the publicly known applications of the above do not overlap with the above publications. If you know the publication numbers of these publicly known applications, it is better to use the public numbers as described above.) Is mentioned.

【０００９】[0009]

【発明が解決しようとする課題】上述した音声符号化／
復号化装置の場合、音声符号化装置において入力音声信
号が有声・無声判別部による有声・無声の識別後、有声
・無声の判別如何に拘らずＬＰＣ分析部へ送出され、Ｌ
ＰＣ分析部でＬＰＣパラメータを算出して音声に対して
コードブックと照合し、高能率符号化処理部により音声
以外の雑音や声道データ及びピッチデータに対して合成
フィルタにかけた後、コードブックと照合している。SUMMARY OF THE INVENTION
In the case of a decoding device, an input speech signal is sent to an LPC analysis unit regardless of voiced / unvoiced discrimination after a voiced / unvoiced discrimination unit discriminates between voiced and unvoiced in a speech coding device.
The PC analysis unit calculates the LPC parameters, compares the speech with the codebook, and applies a synthesis filter to noise, vocal tract data, and pitch data other than speech by the high-efficiency coding processing unit. Is collated.

【００１０】ところが、ＬＰＣ分析部で照合するコード
ブックは音声特性に基づいて作成されているため、有声
のときの音声特性とは異なり、無声のときの雑音特性に
対しては適さないという問題があり、このように有声の
ときの音声特性とは異なる無声のときの雑音に対して有
声のときの音声特性に基づくコードブックと照合するこ
とが復号後の背景雑音に違和感を生じる要因の一つと考
えられている。これは、送信者の有声・無声を識別する
ＶＯＸ機能を有するＣＥＬＰ方式に基づく音声符号化／
復号化装置において（上述した音声信号の振幅や電力に
着目して背景雑音を生成する方法や、有声・無声の切り
替わりのフレームに着目して時系列上で平滑や補間処理
を施して背景雑音を生成する方法を含む）も同様であ
り、この場合にも復号化された背景雑音に違和感を生じ
る。However, since the codebook to be collated by the LPC analysis unit is created based on the voice characteristics, the codebook differs from voiced voice characteristics and is not suitable for unvoiced noise characteristics. There is one of the factors that make the uncomfortable background noise after decoding match against a codebook based on voiced voice characteristics against unvoiced noise different from voiced voice characteristics. It is considered. This is a speech coding / coding based on CELP scheme having a VOX function for distinguishing voiced / unvoiced of a sender.
In the decoding device, the background noise is generated by focusing on the amplitude and power of the audio signal described above, or the background noise is generated by performing smoothing or interpolation on a time series by focusing on the voiced / unvoiced switching frame. This includes the method of generating the background noise. In this case as well, the decoded background noise gives a sense of incongruity.

【００１１】一般に音声信号が有声時の場合と無声時の
場合とでは、スペクトル特性が異なり、その特徴として
有声時の場合にはスペクトルに複数の山状の波形（フォ
ルマント）となり、無声時の場合にはフラットな波形と
なる。従来、スペクトルの符号化に用いられるコードブ
ックは、有声時の音声のスペクトル特性に基づいて作成
されているため、有声時の音声とは異なる特性を持つ無
声時の雑音に対しては適当ではなく、無理にコードブッ
クと照合すると、本来の雑音とかけ離れた特性を示す雑
音に符号化され、それが復号化されると違和感のある背
景雑音を生成してしまう。In general, the spectrum characteristics are different between voiced and unvoiced speech signals. When voice signals are voiced, the spectrum has a plurality of mountain-like waveforms (formants). Has a flat waveform. Conventionally, codebooks used for spectrum coding are created based on the spectral characteristics of voiced speech, so they are not suitable for unvoiced noise having characteristics different from voiced speech. If it is forcibly collated with the codebook, it is encoded into noise having characteristics far from the original noise, and when it is decoded, uncomfortable background noise is generated.

【００１２】背景雑音が人の耳に違和感を感じるのは、
ＶＯＸ機能において無声と識別された後、繰り返し同じ
背景雑音が生成されるため、一定間隔で送られる背景雑
音が滑らかに変化しないこと、無声時の音源が有声時の
特性を示すＬＰＣパラメータにより処理されていること
（ＬＰＣパラメータの符号化の際、使用されるコードブ
ックが有声の特性に基づいて作成されているため、無声
時の信号として不適切なものとなっている）等が要因と
して挙げられる。[0012] The reason why the background noise feels strange to human ears is
After being identified as unvoiced by the VOX function, the same background noise is repeatedly generated, so that the background noise transmitted at regular intervals does not change smoothly, and the unvoiced sound source is processed according to the LPC parameters indicating the characteristics when voiced. (At the time of encoding LPC parameters, the codebook used is created based on voiced characteristics, which makes it unsuitable as an unvoiced signal). .

【００１３】こうした背景雑音における違和感の低減を
含む時系列上の音声データの特性評価に際しては、時間
軸上の特性（エネルギーや振幅等）と併せて周波数（ス
ペクトル）特性を考慮することが極めて重要と考えられ
る。In evaluating the characteristics of time-series audio data including the reduction of discomfort due to background noise, it is extremely important to consider frequency (spectrum) characteristics in addition to time-axis characteristics (energy, amplitude, etc.). it is conceivable that.

【００１４】本発明は、このような問題点を解決すべく
なされたもので、その技術的課題は、無声時に生成され
る背景雑音の特性を考慮した上で違和感を充分に低減化
して符号化，復号化を行い得る音声符号化／復号化装置
を提供することにある。SUMMARY OF THE INVENTION The present invention has been made to solve such a problem, and a technical problem of the present invention is to take into consideration the characteristics of background noise generated during unvoiced speech and to reduce the feeling of incongruity sufficiently for coding. , An audio encoding / decoding device capable of decoding.

【００１５】[0015]

【課題を解決するための手段】本発明によれば、入力音
声信号に対して有声・無声区間を判別して有声・無声に
応じた識別制御信号を出力する有声・無声判別部と、識
別制御信号により有声のときに入力される入力音声信号
に対して線形予測分析法に基づいて合成フィルタのフィ
ルタ係数を算出することでＬＰＣパラメータを取得する
と共に、該ＬＰＣパラメータをＬＳＰパラメータに換算
して出力するＬＰＣ分析部と、識別制御信号により有声
から無声に切り替わったときに直前の有声時におけるＬ
ＰＣパラメータを一時蓄積するＬＰＣ蓄積部と、識別制
御信号によりＬＰＣ蓄積部に蓄積されたＬＰＣパラメー
タに基づいて無声のときの雑音特性を有声のときの雑音
特性に近付けて線形予測分析法に供する背景雑音を生成
するための濾波を行うＬＰＦと、ＬＰＣ分析部から出力
されるＬＳＰパラメータ又はＬＰＦ経由の背景雑音を符
号化処理して符号化音声信号又は雑音信号を出力する高
能率符号化処理部と、識別制御信号に応じて有声のとき
に符号化音声信号，無声ときに雑音信号をそれぞれ出力
符号化信号として切り替え送出するスイッチ制御部とを
備えた音声符号化装置が得られる。According to the present invention, a voiced / unvoiced discriminator for discriminating a voiced / unvoiced section from an input voice signal and outputting a discrimination control signal corresponding to voiced / unvoiced, and a discrimination control. An LPC parameter is obtained by calculating a filter coefficient of a synthesis filter based on a linear prediction analysis method on an input speech signal input when voiced by a signal, and the LPC parameter is converted into an LSP parameter and output. The LPC analysis unit which performs the switching from voiced to unvoiced according to the identification control signal
An LPC accumulator for temporarily accumulating PC parameters, and a background in which a noise characteristic in unvoiced is approximated to a noise characteristic in voiced based on the LPC parameter accumulated in the LPC accumulator by an identification control signal, and is used for a linear prediction analysis method. An LPF that performs filtering for generating noise, and a high-efficiency encoding unit that encodes an LSP parameter output from the LPC analysis unit or background noise via the LPF and outputs an encoded voice signal or a noise signal. And a switch control unit for switching and transmitting an encoded speech signal when voiced and a noise signal when unvoiced as output encoded signals in accordance with the discrimination control signal.

【００１６】この音声符号化装置において、高能率符号
化処理部は、符号化処理に際して有声時の音声特性に基
づいて作成されたコードブックを照合すること、高能率
符号化処理部は、雑音を含む音声以外の情報（声道情
報、音源情報）に対してＬＰＣパラメータによる合成フ
ィルタを通してコードブックの照合を行うこと、ＬＰＣ
蓄積部は、ＬＰＣパラメータとして有声時のＬＰＣ分析
部からのＬＰＣ分析後の出力を蓄積すること、ＬＰＣ蓄
積部は、ＬＰＣパラメータをＬＰＦのフィルタ係数とし
て出力すること、ＬＰＦは、識別制御信号により無声の
ときに入力音声信号の雑音を入力し、該雑音を有声時の
該入力音声信号の雑音に近付けて背景雑音を生成出力す
ること、ＬＰＦは、送信者による発声を示す有声又は無
発声を示す無声を識別するＶＯＸ機能を有すると共に、
ＣＥＬＰ方式に基づいて無声に伴う雑音のスペクトル特
性を音声に伴う雑音のスペクトル特性に近似して背景雑
音を生成出力すること、並びに符号化音声信号又は雑音
信号に対する有声・無声区間を制御するためのユニーク
ワード制御信号を出力するユニークワード発生部を備
え、スイッチ制御部は、ユニークワード制御信号と出力
符号化信号とを切り替え送出することは何れも好まし
い。In this speech coding apparatus, the high-efficiency coding section checks a codebook created based on voiced speech characteristics during the coding process, and the high-efficiency coding section reduces noise. Collating a codebook with information (vocal tract information, sound source information) other than speech through a synthesis filter based on LPC parameters;
The accumulation unit accumulates the output after LPC analysis from the voiced LPC analysis unit as the LPC parameter. The LPC accumulation unit outputs the LPC parameter as a filter coefficient of the LPF. Inputting the noise of the input voice signal at the time of, and generating and outputting the background noise by bringing the noise close to the noise of the input voice signal when voiced, and the LPF indicates voiced or unvoiced indicating the utterance by the sender It has a VOX function to identify voicelessness,
For generating and outputting background noise by approximating the spectral characteristics of unvoiced noise to the spectral characteristics of noise associated with speech based on the CELP method, and for controlling voiced / unvoiced sections for coded voice signals or noise signals. It is preferable that a unique word generation unit that outputs a unique word control signal is provided, and that the switch control unit switches and transmits between the unique word control signal and the output coded signal.

【００１７】又、本発明によれば、上記音声符号化装置
からの出力符号化信号を入力符号化信号として入力した
上で復号化する音声復号化装置であって、入力符号化信
号に対してユニークワード制御信号に基づいて有声・無
声を判別して有声・無声に応じた制御信号を出力する有
声・無声判別部と、制御信号により無声のときに入力符
号化信号の背景雑音を蓄積する背景雑音更新部と、制御
信号により有声のときに入力符号化信号をＬＰＣ復号化
するＬＰＣ復号部と、ＬＰＣ復号化に際してのＬＰＣパ
ラメータを蓄積するＬＰＣ蓄積部と、ＬＳＰパラメータ
に基づいてＬＰＣ復号化された後の入力符号化信号に対
する復号化処理を行って復号化音声信号又は雑音信号を
出力する高能率復号化処理部と、ＬＰＣパラメータに基
づいて無声のときに雑音信号を濾波して背景雑音として
出力するＨＰＦと、有声のときに復号化音声信号，無声
のときに背景雑音をそれぞれ切り替え送出するスイッチ
制御部とを備えた音声復号化装置が得られる。Further, according to the present invention, there is provided a speech decoding apparatus for decoding an input coded signal output from the speech coding apparatus after inputting the coded signal as an input coded signal. A voiced / unvoiced discriminator for discriminating voiced / unvoiced based on the unique word control signal and outputting a control signal corresponding to voiced / unvoiced, and a background for accumulating background noise of the input coded signal when unvoiced by the control signal A noise update unit, an LPC decoding unit that LPC decodes the input coded signal when voiced by the control signal, an LPC storage unit that stores LPC parameters for LPC decoding, and LPC decoding based on the LSP parameters. A high-efficiency decoding processing unit that performs a decoding process on the input coded signal after the decoding and outputs a decoded speech signal or a noise signal, and is unvoiced based on LPC parameters. And HPF to output a background noise filters the noise signal, decoded speech signal when voiced, speech decoding apparatus and a switch controller for each switch sends the background noise when the silent obtained.

【００１８】この音声復号化装置において、ＬＰＣ蓄積
部は、有声のときにＬＰＣ復号部からの復号化信号を蓄
積してＨＰＦのフィルタ係数として出力すること、高能
率符号化処理部は、雑音を含む音声以外の情報（声道情
報、音源情報）に対してＬＰＣパラメータによる合成フ
ィルタを通してコードブックの照合を行うこと、ＨＰＦ
は、送信者による発声を示す有声又は無発声を示す無声
を識別するＶＯＸ機能を有すると共に、ＣＥＬＰ方式に
基づいて無声に伴う雑音のスペクトル特性を音声に伴う
雑音のスペクトル特性に近似して背景雑音を生成出力す
ることは何れも好ましい。In this speech decoding apparatus, the LPC accumulating section accumulates the decoded signal from the LPC decoding section when voiced and outputs the signal as a filter coefficient of the HPF. Performing codebook collation on information (vocal tract information, sound source information) other than included speech through a synthesis filter based on LPC parameters, HPF
Has a VOX function for distinguishing voiced or unvoiced voices indicating utterances by the sender, and approximates the spectral characteristics of unvoiced noise to the spectral characteristics of noise accompanying voice based on the CELP method to obtain background noise. It is preferable to generate and output.

【００１９】[0019]

【発明の実施の形態】以下に実施例を挙げ、本発明の音
声符号化／復号化装置について、図面を参照して詳細に
説明する。Embodiments of the present invention will be described below in detail with reference to the accompanying drawings.

【００２０】図１は、本発明の一実施例に係る音声符号
化／復号化装置の音声符号化装置の基本構成を示したブ
ロック図である。この音声符号化装置は、入力音声信号
に対して有声・無声区間を判別して有声・無声に応じた
識別制御信号を出力する有声・無声判別部１と、識別制
御信号により有声のときに入力される入力音声信号に対
して線形予測分析法に基づいて合成フィルタのフィルタ
係数を算出することでＬＰＣパラメータを取得すると共
に、そのＬＰＣパラメータをＬＳＰパラメータに換算す
るＬＰＣ分析部２と、識別制御信号により有声から無声
に切り替わったときに直前の有声時におけるＬＰＣパラ
メータを一時蓄積するＬＰＣ蓄積部３と、識別制御信号
によりＬＰＣ蓄積部３に蓄積されたＬＰＣパラメータに
基づいて無声のときの雑音特性を有声のときの雑音特性
に近付けて線形予測分析法に供する背景雑音を生成する
ための濾波を行うＬＰＦ５と、ＬＰＣ分析部２から出力
されるＬＳＰパラメータ又はＬＰＦ５経由の背景雑音を
符号化処理して符号化音声信号又は雑音信号を出力する
高能率符号化処理部４と、符号化音声信号又は雑音信号
に対する有声・無声区間を制御するためのユニークワー
ド制御信号を出力するユニークワード発生部６と、識別
制御信号に応じて有声のときに符号化音声信号，無声と
きに雑音信号をそれぞれ出力符号化信号として切り替え
送出すると共に、ユニークワード制御信号と出力符号化
信号とを切り替え送出するスイッチ制御部７とを備えて
成る。FIG. 1 is a block diagram showing a basic configuration of a speech coding apparatus of a speech coding / decoding apparatus according to one embodiment of the present invention. This speech coding apparatus includes a voiced / unvoiced discriminating unit 1 for discriminating a voiced / unvoiced section from an input speech signal and outputting a discrimination control signal corresponding to voiced / unvoiced, and an input when voiced by the discrimination control signal. An LPC analysis unit 2 that calculates LFP parameters by calculating filter coefficients of a synthesis filter based on a linear prediction analysis method for the input speech signal to be converted, and converts the LPC parameters into LSP parameters; The LPC storage unit 3 for temporarily storing the LPC parameter at the time of the immediately preceding voice when the voice is switched from voiced to unvoiced, and the noise characteristic at the time of unvoiced based on the LPC parameter stored in the LPC storage unit 3 by the identification control signal. An LPF 5 that performs filtering to generate background noise to be subjected to a linear prediction analysis method by approaching a noise characteristic when voiced; A high-efficiency encoding processing unit 4 that encodes the LSP parameter output from the analysis unit 2 or the background noise via the LPF 5 and outputs an encoded audio signal or a noise signal; A unique word generator 6 for outputting a unique word control signal for controlling unvoiced sections, and a coded speech signal when voiced and a noise signal when unvoiced are switched and output as coded signals according to the discrimination control signal. And a switch control unit 7 for switching and transmitting the unique word control signal and the output coded signal.

【００２１】このうち、高能率符号化処理部４は、符号
化処理に際して有声時の音声特性に基づいて作成された
コードブックを照合するが、具体的には雑音を含む音声
以外の情報である声道情報や音源情報等に対してＬＰＣ
パラメータによる合成フィルタを通してコードブックの
照合を行う。即ち、高能率符号化処理部４では、ＬＰＣ
分析部２から送出された雑音を含む音声以外の声道情報
や音源情報に対し、一連の動作中での無声時における高
能率符号化処理において、音声特有の４０Ｈｚ以下の信
号を除去するためにＨＰＦをかけ、ＬＰＣ分析によって
算出されたフィルタ係数に基づく合成フィルタでフィル
タリングし、更にフィルタリングされた出力信号に対し
て既存の（有声時の音声の特性に基づいて作成された）
コードブックと照合する処理を行う。但し、無声時の場
合には雑音を一度ＬＰＦ５に通して高周波を除去してい
るため、ここで改めてＨＰＦをかけると、再び雑音特性
が現れる可能性が生じるので、このときには高能率符号
化処理部４内のＨＰＦ処理を除去するかフィルタ係数を
適当な値に変化させるようにする。Among these, the high-efficiency encoding processing section 4 collates a codebook created based on the voice characteristics at the time of voice during the encoding process. Specifically, it is information other than speech including noise. LPC for vocal tract information and sound source information
Codebook collation is performed through a synthesis filter based on parameters. That is, in the high-efficiency encoding processing unit 4, the LPC
In order to remove a signal of 40 Hz or less peculiar to voice, in a high-efficiency encoding process during unvoiced speech during a series of operations on vocal tract information and sound source information other than voice including noise transmitted from the analysis unit 2. HPF is applied, filtered by a synthesis filter based on the filter coefficient calculated by LPC analysis, and an existing (created based on voiced voice characteristics) is applied to the filtered output signal.
Perform the process of collating with the codebook. However, in the unvoiced state, since the noise is once passed through the LPF 5 to remove the high frequency, if the HPF is applied again, there is a possibility that the noise characteristic appears again. Either remove the HPF processing in 4 or change the filter coefficient to an appropriate value.

【００２２】ＬＰＣ蓄積部３は、ＬＰＣパラメータとし
て有声時のＬＰＣ分析部２からのＬＰＣ分析後の出力を
蓄積し、蓄積したＬＰＣパラメータをＬＰＦ５のフィル
タ係数として出力する。ＬＰＦ５は、識別制御信号によ
り無声のときに入力音声信号の雑音を入力し、その雑音
を有声時の入力音声信号の雑音に近付けて背景雑音を生
成出力する。但し、ここでのＬＰＦ５は、送信者による
発声を示す有声又は無発声を示す無声を識別するＶＯＸ
機能を有すると共に、ＣＥＬＰ方式に基づいて無声に伴
う雑音のスペクトル特性を音声に伴う雑音のスペクトル
特性に近似して背景雑音を生成出力するものとする。The LPC accumulating section 3 accumulates the output after LPC analysis from the voiced LPC analyzing section 2 as an LPC parameter, and outputs the accumulated LPC parameter as a filter coefficient of the LPF 5. The LPF 5 inputs the noise of the input voice signal when the voice is unvoiced by the discrimination control signal, and generates and outputs the background noise by bringing the noise closer to the noise of the input voice signal when voiced. However, the LPF 5 here is a VOX that identifies voiced or unvoiced voices indicating utterance by the sender.
It has a function and, based on the CELP method, generates and outputs background noise by approximating the spectral characteristics of noise accompanying voicelessness to the spectral characteristics of noise accompanying voice.

【００２３】即ち、この音声符号化装置では、入力音声
信号の有声・無声を有声・無声判別部１で判別した結
果、有声の場合にＬＰＣ分析部２で入力音声信号のＬＰ
Ｃパラメータを算出し、そのＬＰＣパラメータがＬＰＣ
蓄積部３に蓄積される。ＬＰＣ分析された入力音声信号
は高能率符号化処理４で符号化処理されて符号化音声信
号としてスイッチ制御部７へ送出される。又、無声時の
場合にＬＰＣ蓄積部３に蓄積された有声時のＬＰＣパラ
メータを用いてＬＰＦ５でフィルタリングされ、ＬＰＣ
分析部２で合成フィルタのフィルタ係数を求める。この
ときＬＰＦ５では、無声時に切り替わる直前の有声時に
おける音声の特性を利用するため、無声時における雑音
の特性をより自然な状態で音声の特性に近付けることが
できる。この場合もＬＰＣ分析された入力音声信号は高
能率符号化処理４で符号化処理されて雑音信号としてス
イッチ制御部７へ送出される。That is, in this speech coding apparatus, the voiced / unvoiced discriminating section 1 discriminates the voiced / unvoiced state of the input speech signal.
C parameter is calculated, and the LPC parameter is
It is stored in the storage unit 3. The input voice signal subjected to the LPC analysis is coded in a high-efficiency coding process 4 and sent to the switch control unit 7 as a coded voice signal. Further, in the case of unvoiced voice, the LPF 5 filters using the voiced LPC parameter stored in the LPC storage unit 3 and performs LPC filtering.
The analysis unit 2 obtains a filter coefficient of the synthesis filter. At this time, the LPF 5 uses the characteristics of the voice at the time of voiced immediately before the switching at the time of unvoiced, so that the characteristics of the noise at the time of unvoiced can be approximated to the characteristics of the voice in a more natural state. Also in this case, the input voice signal subjected to the LPC analysis is coded by the high-efficiency coding process 4 and sent to the switch control unit 7 as a noise signal.

【００２４】図２は、本発明の一実施例に係る音声符号
化／復号化装置の音声復号化装置の基本構成を示したブ
ロック図である。この音声復号化装置は、先の音声符号
化装置からの出力符号化信号を入力符号化信号として入
力した上で復号化するものであって、入力符号化信号に
対してユニークワード制御信号に基づいて有声・無声を
判別して有声・無声に応じた制御信号を出力する有声・
無声判別部８と、制御信号により無声のときに入力符号
化信号の背景雑音を蓄積する背景雑音更新部９と、制御
信号により有声のときに入力符号化信号をＬＰＣ復号化
するＬＰＣ復号部１０と、ＬＰＣ復号化に際してのＬＰ
Ｃパラメータを蓄積するＬＰＣ蓄積部１２と、ＬＳＰパ
ラメータに基づいてＬＰＣ復号部１０でＬＰＣ復号化さ
れた後の入力符号化信号に対する復号化処理を行って復
号化音声信号又は雑音信号を出力する高能率復号化処理
部１１と、ＬＰＣパラメータに基づいて無声のときに雑
音信号を濾波して背景雑音として出力するＨＰＦ１３
と、有声のときに復号化音声信号，無声のときに背景雑
音をそれぞれ切り替え送出するスイッチ制御部１４とを
備えて成る。FIG. 2 is a block diagram showing a basic configuration of a speech decoding device of the speech encoding / decoding device according to one embodiment of the present invention. This speech decoding apparatus decodes an input coded signal output from the preceding speech coder as an input coded signal and decodes the input coded signal based on a unique word control signal. Voiced to output a control signal according to voiced / unvoiced
An unvoiced discriminating unit 8, a background noise updating unit 9 for accumulating background noise of the input coded signal when unvoiced by the control signal, and an LPC decoding unit 10 for LPC decoding the input coded signal when voiced by the control signal And LP for LPC decoding
An LPC accumulating unit 12 for accumulating the C parameter, and a decoding unit for performing a decoding process on the input coded signal after the LPC decoding by the LPC decoding unit 10 based on the LSP parameter to output a decoded speech signal or a noise signal. An efficiency decoding processing unit 11 and an HPF 13 that filters a noise signal and outputs it as background noise when there is no voice based on the LPC parameter
And a switch control unit 14 for switching and transmitting the decoded voice signal when voiced and the background noise when voiceless.

【００２５】このうち、ＬＰＣ蓄積部１２は、有声のと
きにＬＰＣ復号部１０からの復号化信号を蓄積してＨＰ
Ｆ１３のフィルタ係数として出力する。高能率符号化処
理部１１は、雑音を含む音声以外の情報である声道情報
や音源情報に対してＬＰＣパラメータによる合成フィル
タを通してコードブックの照合を行う。ＨＰＦ１３は、
送信者による発声を示す有声又は無発声を示す無声を識
別するＶＯＸ機能を有すると共に、ＣＥＬＰ方式に基づ
いて無声に伴う雑音のスペクトル特性を音声に伴う雑音
のスペクトル特性に近似して背景雑音を生成出力する。The LPC storage unit 12 stores the decoded signal from the LPC decoding unit 10 when voiced, and
Output as a filter coefficient of F13. The high-efficiency coding processing unit 11 performs codebook collation on vocal tract information and sound source information, which are information other than speech including noise, through a synthesis filter based on LPC parameters. HPF13,
It has a VOX function that identifies voiced or unvoiced voices indicating utterances by the sender, and generates background noise by approximating the spectral characteristics of unvoiced noise to those of voice based on the CELP method. Output.

【００２６】即ち、この音声復号化装置では、有声・無
声判別部８により音声符号化装置からの出力符号化信号
をユニークワード制御信号により識別し、有声（プリア
ンブル信号）の場合にＬＰＣ復号部１０で入力符号化信
号のＬＰＣパラメータを算出し、そのＬＰＣパラメータ
がＬＰＣ蓄積部１２に蓄積される。入力符号化信号は高
能率復号化処理部１１で復号化処理されて復号化音声信
号としてスイッチ制御部１４へ送出される。又、無声
（ポストアンブル信号）の場合に入力符号化信号におけ
る１フレーム分の背景雑音が背景雑音更新部９で蓄積さ
れ、これ以後は有声・無声判別部８で無声と判別される
が、背景雑音更新部９の背景雑音はＬＰＣ復号部１０に
送出され、高能率復号化処理部１１において処理された
後、ＬＰＣ蓄積部１２に蓄積された有声時のＬＰＣパラ
メータを用いてＨＰＦ１３でフィルタリングされ、背景
雑音としてスイッチ制御部１４へ送出される。That is, in this voice decoding apparatus, the voiced / unvoiced discriminating section 8 identifies the output coded signal from the voice coded apparatus by the unique word control signal, and in the case of voiced (preamble signal), the LPC decoding section 10 Calculates the LPC parameter of the input coded signal, and the LPC parameter is stored in the LPC storage unit 12. The input coded signal is decoded by the high-efficiency decoding unit 11 and sent to the switch control unit 14 as a decoded audio signal. Also, in the case of unvoiced (postamble signal), one frame of background noise in the input coded signal is accumulated in the background noise updating unit 9 and thereafter, the voiced / unvoiced discriminating unit 8 discriminates it as unvoiced. The background noise of the noise updating unit 9 is sent to the LPC decoding unit 10 and processed by the high-efficiency decoding processing unit 11, and then filtered by the HPF 13 using the voiced LPC parameters stored in the LPC storage unit 12, It is sent to the switch control unit 14 as background noise.

【００２７】図３は上述した音声符号化装置の動作処理
を示したフローチャートである。ここでは入力音声信号
に対し、有声・無声判別部１で有声・無声を判定（ステ
ップＳ１）し、有声時の場合、ＬＰＣ分析部２でＬＰＣ
分析（ステップＳ２）としてＬＰＣパラメータを求めた
後、高能率符号化処理部４で上述したＨＰＦ，合成フィ
ルタ，コードブック照合の各処理を含む高能率符号化処
理（ステップＳ４）して符号化変換し、スイッチ制御部
７へ送出してスイッチング（ステップＳ８）を行う。
又、ＬＰＣ分析部２によって算出した有声時の結果はＬ
ＰＣ蓄積部３でＬＰＣ蓄積（ステップＳ３）され、これ
がＬＰＦ５のフィルタ係数となる。ところで、無声時の
雑音は、ＬＰＦ５によってフィルタリングされ、これに
より，ＬＰＦ処理（ステップＳ５）が行われる。その
後、ＬＰＣ分析部２でＬＰＣ分析（ステップＳ６）にか
けられ、有声時と同様に高能率符号化処理部４で上述し
たＨＰＦ→除去，合成フィルタ，コードブック照合の各
処理を含む高能率符号化処理（ステップＳ７）され、ス
イッチ制御部７へ送出してスイッチング（ステップＳ
８）を行う。このとき、スイッチ制御部７では、有声・
無声判別部１の判別によるユニークワード発生部６の有
声・無声識別情報を示すユニークワード制御信号と共
に、有声・無声時の信号をスイッチング（ステップＳ
８）により切り替えて出力符号化信号として復号化装置
側へ出力送信する。FIG. 3 is a flowchart showing the operation processing of the above-described speech coding apparatus. Here, the voiced / unvoiced discriminating unit 1 determines whether the input voice signal is voiced or unvoiced (step S1).
After the LPC parameters are obtained as an analysis (step S2), the high-efficiency encoding processing unit 4 performs high-efficiency encoding processing (step S4) including the above-described processing of the HPF, the synthesis filter, and the codebook collation, and performs encoding conversion. Then, the data is transmitted to the switch control unit 7 to perform switching (step S8).
The voiced result calculated by the LPC analysis unit 2 is L
The LPC is stored in the PC storage unit 3 (step S3), and this becomes the filter coefficient of the LPF 5. By the way, the unvoiced noise is filtered by the LPF 5, whereby the LPF processing (step S5) is performed. Thereafter, the LPC analysis unit 2 performs an LPC analysis (step S6), and the high-efficiency encoding processing unit 4 performs the high-efficiency encoding including the above-described processes of the HPF → removal, the synthesis filter, and the codebook collation as in the case of voiced speech. The processing (step S7) is performed and transmitted to the switch control unit 7 for switching (step S7).
Perform 8). At this time, the switch control unit 7
Switching the voiced / unvoiced signal together with the unique word control signal indicating the voiced / unvoiced discrimination information of the unique word generating unit 6 based on the discrimination by the unvoiced discriminating unit 1 (step S
8), and outputs and transmits the output coded signal to the decoding device side.

【００２８】図４は上述した音声復号化装置の動作処理
を示したフローチャートである。ここでは入力符号化信
号に対し、有声・無声判別部８でユニークワード制御信
号により有声・無声を判定（ステップＳ１）し、有声
（プリアンブル信号）の場合、ＬＰＣ復号部１０でＬＰ
Ｃ復号化（ステップＳ２）としてＬＰＣパラメータを算
出した後、高能率復号化処理部１１で合成フィルタ，コ
ードブック照合の各処理を含む高能率復号化処理（ステ
ップＳ５）して復号化変換（音声生成）し、スイッチ制
御部１４へ送出してスイッチング（ステップＳ９）を行
う。又、ＬＰＣ復号部１０によって復号化された有声時
の結果はＬＰＣ蓄積部１２でＬＰＣ蓄積（ステップＳ
４）され、これがＨＰＦ１３のフィルタ係数となる。無
声時（ポストアンブル信号）の場合、入力符号化信号に
関する１フレーム分の背景雑音が背景雑音更新部９に蓄
積されて背景雑音更新（ステップＳ４）が行われるが、
これ以後は符号化装置側からの背景雑音の送信が途絶え
るため、復号化装置側は背景雑音ではない信号を受信し
続ける。この信号は有声・無声判別部８で有声・無声の
判定（ステップＳ１）により無声と判別され、背景雑音
更新部９から送出された背景雑音に基づいてＬＰＣ復号
部１０でＬＰＣ復号化（ステップＳ６）が行われ、更に
高能率復号化処理部１１で合成フィルタ，コードブック
照合の各処理を含む高能率復号化処理（ステップＳ７）
され、ＬＰＣ蓄積部１２のＬＰＣパラメータを使用して
ＨＰＦ１３でフィルタリングされた後、スイッチ制御部
１４に送出されてスイッチング（ステップＳ９）が行わ
れ、出力音声信号（復号化音声信号）として出力送信さ
れる。FIG. 4 is a flowchart showing the operation processing of the above-described speech decoding apparatus. Here, the voiced / unvoiced discriminating section 8 determines whether the input coded signal is voiced / unvoiced by the unique word control signal (step S1).
After calculating LPC parameters as C decoding (step S2), the high-efficiency decoding processing unit 11 performs high-efficiency decoding processing including synthesis filter and codebook collation processing (step S5) to perform decoding conversion (speech). (Step S9). The voiced result decoded by the LPC decoding unit 10 is stored in the LPC storage unit 12 by LPC storage (step S10).
4) This is the filter coefficient of the HPF 13. In the case of unvoiced (postamble signal), one frame of background noise related to the input coded signal is accumulated in the background noise updating unit 9 and the background noise is updated (step S4).
After that, since the transmission of the background noise from the encoding device is interrupted, the decoding device continues to receive a signal that is not the background noise. This signal is determined to be unvoiced by the voiced / unvoiced determination unit 8 by voiced / unvoiced determination (step S1), and LPC decoded by the LPC decoding unit 10 based on the background noise sent from the background noise updating unit 9 (step S6). ), And the high-efficiency decoding unit 11 performs high-efficiency decoding processing including synthesis filter and codebook collation processing (step S7).
After being filtered by the HPF 13 using the LPC parameters of the LPC storage unit 12, the data is sent to the switch control unit 14 for switching (step S9), and output and transmitted as an output audio signal (decoded audio signal). You.

【００２９】尚、復号化装置側においては、初段の有声
・無声判別部８により随時送信するタイミングが切り替
わるため、スイッチ制御部１４においてユニークワード
制御信号を必要としない。On the decoding device side, the transmission timing is switched at any time by the voiced / unvoiced discriminating section 8 in the first stage, so that the switch control section 14 does not need a unique word control signal.

【００３０】因みに、符号化装置のＬＰＦ５及び復号化
装置のＨＰＦ１３の細部について説明する。符号化装置
のＬＰＣ分析部２で生成したフィルタ係数をαｉ（ｉ＝
０〜Ｎｐ：但し、ＮｐはＬＰＣのオーダー）とすると、
それを用いた合成フィルタの伝達係数Ｈ（Ｚ）は、Ｈ
（Ｚ）＝１／（１＋Σαｉ・Ｚ＾（−１））のように表
わすことができる。因みに、このような技術は例えば
『ディジタル音声処理』東海大学出版、古井著、第５章
「線形予測分析」等に記載されている。しかし、ここで
の雑音のスペクトルは、フラットな形状を取るため、う
まくコード化することができない。The details of the LPF 5 of the encoding device and the HPF 13 of the decoding device will be described. The filter coefficient generated by the LPC analysis unit 2 of the encoding device is denoted by αi (i =
0 to Np: where Np is the order of LPC)
The transfer coefficient H (Z) of the synthesis filter using it is H
(Z) = 1 / (1+ {αi · Z} (− 1)). Incidentally, such a technique is described in, for example, "Digital Speech Processing", published by Tokai University Press, written by Furui, Chapter 5, "Linear Prediction Analysis". However, the spectrum of the noise here has a flat shape and cannot be coded well.

【００３１】そこで、ＬＰＦ５をかけることによりコー
ド化し易くする。よってＬＰＦ５のフィルタ係数Ａ
（Ｚ）は、Ａ（Ｚ）＝１／（１＋Σλ＾ｉ・αｉ・Ｚ＾
（−１））なる関係で表わすことができる。但し、Ｎ≦
Ｎｐ₀λは、０＜λ＜１の範囲で決まる定数である。こ
こでの雑音のスペクトルは、０に近いほどフラットにな
りフィルタリングの効果が薄れ、１に近いほど音声特有
の周期的な山状のスペクトルになることを示している。
又、ＨＰＦ１３のフィルタ係数Ａ′（Ｚ）は、逆数（逆
フィルタ）を用いるため、Ａ′（Ｚ）＝１＋Σλ＾ｉ・
αｉ・Ｚ＾（−１）なる関係で表わすことができる。Therefore, coding is facilitated by applying LPF5. Therefore, the filter coefficient A of the LPF 5
(Z) is A (Z) = 1 / (1+ {λ {i.αi.Z}}
(-1)). Where N ≦
Np ₀ λ is a constant determined in the range of 0 <λ <1. Here, the noise spectrum becomes flatter as the value is closer to 0, and the effect of the filtering is weaker. As the noise value is closer to 1, the spectrum becomes a periodic mountain-like spectrum specific to speech.
The filter coefficient A '(Z) of the HPF 13 uses the reciprocal (inverse filter), so that A' (Z) = 1 + {λ} i ·
αi · Z ＾ (− 1).

【００３２】図５は、本発明の他の実施例に係る音声符
号化／復号化装置の音声符号化装置の基本構成を示した
ブロック図である。この音声符号化装置は、入力音声信
号に対して有声・無声区間を判別して有声・無声に応じ
た識別制御信号を出力する有声・無声判別部１と、入力
音声信号に対して線形予測分析法に基づいて合成フィル
タのフィルタ係数を算出することでＬＰＣパラメータを
取得すると共に、そのＬＰＣパラメータをＬＳＰパラメ
ータに換算するＬＰＣ分析部２と、ＬＰＣ分析部２で算
出されるフィルタ係数の合成フィルタとＬＰＣパラメー
タを符号化するＬＰＣ用コードブック及びその他の音声
特性パラメータを符号化する励振コードブックとによる
２種のコードブックを含むコードブック照合処理部とを
有し、ＬＰＣパラメータに基づいた符号化処理として、
フィルタ係数をＬＰＣ用コードブックのパラメータで符
号化した後、合成フィルタで励振コードブックのベクト
ルをフィルタリングして入力音声信号との差を最小にす
るように励振コードブックを検索（これにより信号量が
圧縮される）するコードブック照合処理を行って得られ
る符号化音声信号を出力する高能率符号化処理部４と、
有声・無声の識別制御信号に応じたユニークワード制御
信号として、有声の場合にはプリアンブル信号，無声の
場合にはポストアンブル信号を生成出力するユニークワ
ード発生部６と、無声の識別制御信号に応じて無声時の
ＬＰＣパラメータを所定のフレームフォーマットに符号
化変換した無声時パラメータ符号化信号を出力する無声
符号化変換部１５と、有声・無声判別部１からの識別制
御信号に応じて高性能符号化処理部４からの符号化音声
信号，ユニークワード発生部６からのユニークワード制
御信号（プリアンブル信号又はポストアンブル信号），
並びに無声符号化変換部１５からの無声時パラメータ符
号化信号による背景雑音を有声・無声別に出力符号化信
号として切り替え送出するスイッチ制御部７とを備えて
成る。FIG. 5 is a block diagram showing a basic configuration of a speech encoding apparatus of a speech encoding / decoding apparatus according to another embodiment of the present invention. The speech coding apparatus includes a voiced / unvoiced discriminating unit 1 for discriminating a voiced / unvoiced section from an input speech signal and outputting a discrimination control signal corresponding to the voiced / unvoiced section, and a linear prediction analysis for the input speech signal. An LPC analysis unit 2 that obtains LPC parameters by calculating filter coefficients of a synthesis filter based on a method and converts the LPC parameters into LSP parameters; and a synthesis filter of filter coefficients calculated by the LPC analysis unit 2. A codebook matching processing unit including two types of codebooks including an LPC codebook for encoding LPC parameters and an excitation codebook for encoding other audio characteristic parameters, and an encoding process based on the LPC parameters. As
After the filter coefficients are encoded with the parameters of the LPC codebook, the excitation codebook is searched so as to minimize the difference from the input speech signal by filtering the excitation codebook vector with the synthesis filter. A high-efficiency encoding processing unit 4 that outputs an encoded audio signal obtained by performing a codebook collation process (to be compressed);
As a unique word control signal corresponding to a voiced / unvoiced discrimination control signal, a unique word generator 6 for generating and outputting a preamble signal in the case of voiced and a postamble signal in the case of unvoiced, and a unique word control signal corresponding to the unvoiced discrimination control signal. An unvoiced encoding conversion unit 15 that outputs an unvoiced parameter encoded signal obtained by encoding conversion of unvoiced LPC parameters into a predetermined frame format, and a high-performance codec according to an identification control signal from the voiced / unvoiced discrimination unit 1. The encoded voice signal from the encoding processing unit 4, the unique word control signal (preamble signal or postamble signal) from the unique word generation unit 6,
And a switch control unit 7 for switching and transmitting background noise caused by the unvoiced parameter coded signal from the unvoiced coded conversion unit 15 as an output coded signal for voiced and unvoiced.

【００３３】即ち、ここでのスイッチ制御部７は、無声
の識別制御信号に応じて無声時にはＬＰＣ用コードブッ
クを使用せずに無声符号化変換部１５で得られた無声時
パラメータ符号化信号による背景雑音を切り替え送出す
る。尚、ここでも背景雑音は任意な一定時間の間隔で送
出される。That is, the switch control section 7 here uses the unvoiced parameter coded signal obtained by the unvoiced code conversion section 15 without using the LPC codebook in unvoiced state in response to the unvoiced identification control signal. The background noise is switched and transmitted. Here, the background noise is transmitted at an arbitrary fixed time interval.

【００３４】図６は、本発明の他の実施例に係る音声符
号化／復号化装置の音声復号化装置の基本構成を示した
ブロック図である。この音声復号化装置は、先の図５に
示す音声符号化装置からの出力符号化信号を入力符号化
信号として入力した上で復号化するものであって、入力
符号化信号に対してユニークワード制御信号に基づいて
有声・無声を判別して有声・無声に応じた制御信号を出
力する有声・無声判別部８と、入力符号化信号に基づい
て符号化装置側から一定間隔で送られる無声時パラメー
タ符号化信号による背景雑音のレベルを制御する背景雑
音制御部１６と、背景雑音を１フレーム分更新生成する
背景雑音更新部９と、入力符号化信号に基づいて有声時
に符号化装置側におけるＬＰＣ分析部２によって算出さ
れたＬＰＣパラメータの復号処理を行う有声ＬＰＣ復号
部１７と、入力符号化信号に基づいて無声時にＬＰＣ分
析部２によって算出されたＬＰＣパラメータの復号処理
を行う無声ＬＰＣ復号部１８と、有声ＬＰＣ復号部１７
及び無声ＬＰＣ復号部１８で算出されたフィルタ係数に
よる合成フィルタとＬＰＣパラメータを復号化するＬＰ
Ｃ用コードブック及びその他の音声特性パラメータを復
号化する励振コードブックとによる２種のコードブック
を含むコードブック照合処理部とを有し、ＬＰＣパラメ
ータに基づいた復号化処理として、フィルタ係数をＬＰ
Ｃ用コードブックのパラメータで復号化した後、合成フ
ィルタで励振コードブックのベクトルをフィルタリング
して入力符号化信号との差を最小にするように励振コー
ドブックを検索（これにより信号量が圧縮される）する
コードブック照合処理を行って得られる復号化音声信号
を出力復号化信号として出力する高性能復号化処理部１
１とを備えて成る。FIG. 6 is a block diagram showing a basic configuration of a speech decoding apparatus of a speech encoding / decoding apparatus according to another embodiment of the present invention. This speech decoding apparatus decodes an input coded signal output from the speech coding apparatus shown in FIG. 5 after inputting the coded signal as an input coded signal. A voiced / unvoiced discriminating unit 8 that discriminates voiced / unvoiced based on the control signal and outputs a control signal corresponding to voiced / unvoiced, and an unvoiced state transmitted from the encoding device side at regular intervals based on the input encoded signal. A background noise control unit 16 for controlling the level of background noise by the parameter coded signal, a background noise updating unit 9 for updating and generating one frame of background noise, and an LPC on the coding apparatus side when voiced based on the input coded signal. A voiced LPC decoding unit 17 for decoding the LPC parameters calculated by the analysis unit 2, and an LPC parameter calculated by the LPC analysis unit 2 when there is no voice based on the input coded signal. And unvoiced LPC decoding section 18 for decoding processing over data, voiced LPC decoding section 17
And a LP for decoding LPC parameters and a synthesis filter based on the filter coefficients calculated by the unvoiced LPC decoding unit 18
A codebook collation processing unit including two types of codebooks including a codebook for C and an excitation codebook for decoding other audio characteristic parameters. As a decoding process based on the LPC parameter, a filter coefficient is set to LP.
After decoding using the parameters of the codebook for C, the excitation codebook is filtered by a synthesis filter to search the excitation codebook so as to minimize the difference from the input coded signal (this reduces the signal amount. 1) a high-performance decoding processing unit 1 that outputs a decoded audio signal obtained by performing a codebook matching process as an output decoded signal
1 is provided.

【００３５】但し、ここでの背景雑音制御部１６は、背
景雑音更新部９による背景雑音の更新に際して音声符号
化装置側から一定周期で送信される背景雑音の１フレー
ム分に基づいて直前に送られたデータとの間で平均化し
た平均値を算出する。However, the background noise control unit 16 transmits the background noise immediately before the background noise is updated by the background noise updating unit 9 based on one frame of the background noise transmitted from the speech encoding apparatus at a constant period. An average value calculated from the averaged data is calculated.

【００３６】図７は上述した図５に示す音声符号化装置
の動作処理を示したフローチャートである。ここでは入
力音声信号に対し、有声・無声判別部１で有声・無声を
識別するための有音・無音検出（ステップＳ１）を行
い、この結果を示す識別制御信号がユニークワード発生
部６に送出され、ユニークワード発生部６ではユニーク
ワード生成（ステップＳ２）として、復号化装置側で有
声・無声の切替え及び背景雑音の更新をするためのユニ
ークワード制御信号（無声→有声：プリアンブル信号、
有声→無声及び背景雑音更新時：ポストアンブル信号）
を生成する。このユニークワード制御信号はスイッチ制
御部７に送出され、スイッチ制御部７ではスイッチ制御
（ステップＳ３）として、有声・無声判別部１からの識
別制御信号に応じてユニークワード制御信号を送信する
必要がある場合に切り替え制御を行うことにより、ユニ
ークワード制御信号を出力符号化信号と共に復号化装置
側へ送出する。FIG. 7 is a flowchart showing the operation processing of the speech coding apparatus shown in FIG. Here, the voiced / unvoiced discriminating unit 1 performs voiced / unvoiced detection for discriminating voiced / unvoiced (step S1) on the input voice signal, and an identification control signal indicating the result is sent to the unique word generating unit 6. Then, the unique word generation unit 6 generates a unique word (step S2) as a unique word control signal (unvoiced → voiced: preamble signal, unvoiced / voiced) for switching between voiced / unvoiced and updating background noise on the decoding device side.
Voiced → unvoiced and background noise updated: postamble signal)
Generate This unique word control signal is sent to the switch control section 7, and the switch control section 7 needs to transmit a unique word control signal as switch control (step S3) in accordance with the identification control signal from the voiced / unvoiced discrimination section 1. By performing the switching control in a certain case, the unique word control signal is transmitted to the decoding device together with the output coded signal.

【００３７】有声・無声判別部１では、有音・無音検出
（ステップＳ１）に引き続いて、入力音声信号に対して
有声・無声の判定（ステップＳ４）を行う。この結果、
有声であれば、有声・無声判別部１は、ここでの判定に
拘らずＬＰＣ分析部２で入力音声信号に対してＬＰＣ分
析（ステップＳ４）により算出されたＬＰＣパラメータ
を高能率符号化処理部４へ引き渡し、高能率符号化処理
部４ではＨＰＦ，合成フィルタ（有声時ＬＰＣパラメー
タ），コードブック照合の各処理を含む高能率符号化処
理（ステップＳ６）を行い、符号化音声信号をスイッチ
制御部７に送出する。高能率符号化処理部４では、ＬＰ
ＣパラメータをＬＰＣ用コードブックにより符号化した
後、ＬＰＣ分析部２で算出したフィルタ係数による合成
フィルタで励振コードブックにあるベクトルをフィルタ
リングし、実際の入力音声データとの残差を取り、その
残差が最小となるように既存の励振コードブックを検索
するコードブック照合を行って符号化音声信号を生成す
る。スイッチ制御部７では、符号化音声信号をスイッチ
制御（ステップＳ３）することによって復号化装置側に
送出する。The voiced / unvoiced discriminating section 1 performs voiced / unvoiced judgment (step S4) on the input voice signal following the detection of voiced / unvoiced (step S1). As a result,
If it is voiced, the voiced / unvoiced discriminating unit 1 applies the LPC parameter calculated by the LPC analysis (step S4) to the input speech signal by the LPC analyzing unit 2 regardless of the judgment here, to the high-efficiency encoding processing unit. The high-efficiency encoding processing unit 4 performs high-efficiency encoding processing (step S6) including each processing of an HPF, a synthesis filter (LPC parameter when voiced), and codebook collation, and performs switch control of the encoded audio signal. Send it to the unit 7. In the high-efficiency encoding processing unit 4, LP
After the C parameter is encoded by the LPC codebook, the vector in the excitation codebook is filtered by a synthesis filter based on the filter coefficient calculated by the LPC analysis unit 2, and the residual from the actual input voice data is obtained. Codebook matching for searching an existing excitation codebook is performed so as to minimize the difference, and an encoded audio signal is generated. The switch control section 7 performs switch control (step S3) of the coded voice signal to transmit it to the decoding device side.

【００３８】一方、有声・無声の判定（ステップＳ４）
の結果、無声であれば、有声・無声判別部１は、ここで
の判定に拘らずＬＰＣ分析部２で入力音声信号に対して
ＬＰＣ分析（ステップＳ７）により算出された無声時の
ＬＰＣパラメータを無声符号化変換部１５へ引き渡し、
無声符号化変換部１５では無声時のＬＰＣパラメータを
ＬＰＣ蓄積（ステップＳ８）により蓄積した後、所定の
フレームフォーマットに符号化変換するＬＰＣ変換（ス
テップＳ９）を行って無声時パラメータ符号化信号とし
てスイッチ制御部７へ出力する。無声符号化変換部１５
では、無声時のＬＰＣパラメータを出力フレームのフォ
ーマットに変換する処理を行うが、図９に示した従来装
置の出力符号化信号のフォーマットは、高性能符号化処
理部４においてＬＰＣパラメータのコードブックとの照
合値やその他の音声情報（基本周波数，振幅，電力等）
がフォーマット化されている。スイッチ制御部７では、
無声時パラメータ符号化信号による背景雑音をスイッチ
制御（ステップＳ３）することによって復号化装置側に
送出する。On the other hand, determination of voiced / unvoiced (step S4)
As a result, if it is unvoiced, the voiced / unvoiced discriminating unit 1 applies the unvoiced LPC parameter calculated by the LPC analysis (step S7) to the input voice signal by the LPC analyzing unit 2 regardless of the determination here. Handover to the unvoiced coded conversion unit 15,
The unvoiced encoding conversion unit 15 accumulates the LPC parameters at the time of unvoiced by LPC accumulation (step S8), and then performs LPC conversion (step S9) for encoding conversion into a predetermined frame format, and switches as an unvoiced parameter encoded signal. Output to the control unit 7. Unvoiced coding conversion unit 15
In this embodiment, the process of converting the unvoiced LPC parameters into the output frame format is performed. However, the format of the output coded signal of the conventional device shown in FIG. Verification value and other audio information (fundamental frequency, amplitude, power, etc.)
Is formatted. In the switch control unit 7,
Switch control (step S3) of the background noise caused by the unvoiced parameter coded signal is transmitted to the decoding device side.

【００３９】この音声符号化装置では無声時の場合、有
声の特性により作成されたコードブックと照合せずに送
信を行うため、データの量が大きくなってしまう。そこ
で、音声情報を省いてＬＰＣパラメータのみをフレーム
長にフォーマット変換し、復号化装置側へ送信する。即
ち、無声時の場合、ＬＰＣパラメータをＬＰＣ用コード
ブックを使用して圧縮符号化ぜずにＬＰＣパラメータを
そのまま送信する。尚、無声符号化変換部１５のＬＰＣ
パラメータのフレームフォーマットへの変換に関連する
技術は、例えば電気通信協会編「ＰＣＭディジタル通信
の基礎知識」等に開示されている。In this speech coding apparatus, in the case of unvoiced speech, data is transmitted without collation with a codebook created based on voiced characteristics, so that the amount of data becomes large. Therefore, the format of only the LPC parameter is converted to the frame length by omitting the audio information and transmitted to the decoding device side. That is, in the case of no voice, the LPC parameters are transmitted as they are without compression coding using LPC codebooks. Note that the LPC of the unvoiced coding conversion unit 15
Techniques relating to the conversion of parameters into frame formats are disclosed, for example, in "Basic Knowledge of PCM Digital Communication" edited by the Telecommunications Association.

【００４０】図８は上述した図６に示す音声復号化装置
の動作処理を示したフローチャートである。ここでは入
力符号化信号に対し、符号化装置側から送られたユニー
クワード制御信号に基づいて有声・無声判別部８で有声
・無声の判定（ステップＳ１）を行う。この結果、有声
であれば、有声・無声判別部８は制御信号を送出して有
声ＬＰＣ復号部１７によるＬＰＣ有声復号化（ステップ
Ｓ２）を選択し、有声ＬＰＣ復号部１７で有声時に符号
化装置側におけるＬＰＣ分析部２によって算出されたＬ
ＰＣパラメータの復号処理を行った結果を高性能復号化
処理部１１へ送出させる。高性能復号化処理部１１では
合成フィルタ（有声時ＬＰＣパラメータ），コードブッ
ク照合の各処理を含む高能率復号化処理（ステップＳ
３）を行い、復号化音声信号を出力復号化信号として出
力する。FIG. 8 is a flowchart showing the operation processing of the speech decoding apparatus shown in FIG. Here, the voiced / unvoiced discriminating unit 8 determines whether the input coded signal is voiced or unvoiced based on the unique word control signal sent from the coding device (step S1). As a result, if it is voiced, the voiced / unvoiced discriminating unit 8 sends out a control signal, selects LPC voiced decoding by the voiced LPC decoding unit 17 (step S2), and the voiced LPC decoding unit 17 outputs Calculated by the LPC analysis unit 2 on the side
The result of decoding the PC parameter is sent to the high-performance decoding unit 11. The high-performance decoding unit 11 performs a high-efficiency decoding process including the synthesis filter (LPC parameter when voiced) and codebook matching (step S
3) is performed, and the decoded audio signal is output as an output decoded signal.

【００４１】一方、有声・無声の判定（ステップＳ１）
の結果、無声であれば、有声・無声判別部８は背景雑音
制御部１６，背景雑音更新部９，及び無声ＬＰＣ復号部
１８側を選択し、これにより背景雑音制御部１６では無
声時の入力符号化信号に基づいて無声時パラメータ符号
化信号に対して背景雑音制御（ステップＳ４）を行い、
背景雑音の更新に際して符号化装置側から一定周期で送
信される１フレーム分の背景雑音に基づいて直前に送ら
れたデータとの間で平均化した平均値を算出した上で背
景雑音更新部９へ送出し、背景雑音更新部９では１フレ
ーム分の背景雑音更新（ステップＳ５）を行い、その結
果を無声ＬＰＣ復号部１８へ引き渡し、無声ＬＰＣ復号
部１８では符号化装置側で有声時のＬＰＣ用コードブッ
クにより符号化されていない無声時のＬＰＣパラメータ
に対してＬＰＣ復号化処理（ステップＳ６）を行い、Ｌ
ＰＣパラメータの復号処理を行った結果を高性能復号化
処理部１１へ送出させる。高性能復号化処理部１１では
合成フィルタ（無声時ＬＰＣパラメータ），コードブッ
ク照合の各処理を含む高能率復号化処理（ステップＳ
７）を行い、復号化音声信号を出力復号化信号として出
力する。On the other hand, determination of voiced / unvoiced (step S1)
As a result, if it is unvoiced, the voiced / unvoiced discriminating unit 8 selects the background noise control unit 16, the background noise updating unit 9, and the unvoiced LPC decoding unit 18 side. Background noise control (step S4) is performed on the unvoiced parameter coded signal based on the coded signal,
Upon updating the background noise, the background noise updating unit 9 calculates an average value obtained by averaging the average value with the immediately preceding data based on one frame of the background noise transmitted from the encoding device at a constant period. The background noise updating unit 9 updates the background noise for one frame (step S5), and delivers the result to the unvoiced LPC decoding unit 18. The unvoiced LPC decoding unit 18 uses the LPC for voiced LPC decoding on the encoding device side. LPC decoding processing (step S6) is performed on unvoiced LPC parameters not encoded by the codebook for LPC.
The result of decoding the PC parameter is sent to the high-performance decoding unit 11. The high-performance decoding unit 11 performs a high-efficiency decoding process including a synthesis filter (unvoiced LPC parameter) and codebook matching (step S
7), and outputs the decoded audio signal as an output decoded signal.

【００４２】即ち、この音声符号化／復号化装置では、
音声符号化装置側で背景雑音の違和感を低減化させるべ
く、無声時の音源に対するＬＰＣパラメータの符号化の
際、有声時の特性に見合って作成されたＬＰＣ用コード
ブックを使用せず、無声符号化変換部１５で背景雑音の
特性を考慮しながら無声時の音源そのものの特性を活か
してＬＰＣパラメータをフォーマット変換する処理を施
してスイッチ制御部７を介して音声復号化装置側に送出
し、音声復号化装置側で背景雑音制御部１６により一定
時間で繰り返し生成される背景雑音のレベルを制御して
背景雑音を滑らかに変化させると共に、背景雑音の更新
に際して前のデータの平均値を用いるようにしているの
で、周波数（スペクトル）を考慮した上で無声時の音源
の特性を活かしながら一層自然に背景雑音を生成制御す
ることが可能になる。That is, in this speech encoding / decoding device,
In order to reduce the discomfort of background noise on the speech encoding device side, when encoding LPC parameters for a sound source when unvoiced, an unvoiced code is used without using an LPC codebook created in accordance with the characteristics when voiced. The conversion conversion unit 15 performs a process of converting the format of the LPC parameter by utilizing the characteristics of the sound source itself in the unvoiced state while taking into account the characteristics of the background noise, and sends the LPC parameters to the audio decoding device side via the switch control unit 7. On the decoding device side, the background noise control unit 16 controls the level of the background noise repeatedly generated in a fixed time to smoothly change the background noise and use the average value of the previous data when updating the background noise. Therefore, it is possible to generate and control the background noise more naturally while taking into account the frequency (spectrum) while taking advantage of the characteristics of the sound source when unvoiced. .

【００４３】[0043]

【発明の効果】以上に述べた通り、本発明の音声符号化
／復号化装置によれば、音声符号化装置において雑音の
スペクトル特性を音声に近いスペクトル特性となるよう
に有声時のＬＰＣ分析によるＬＰＣパラメータを利用し
てフィルタによりフィルタリングを行ってからコードブ
ックと照合して符号化処理を行い、復号化装置でも復号
化処理後、無声時の場合に有声時のＬＰＣパラメータに
よる逆フィルタを通すことによって自然な背景雑音を生
成しており、背景雑音のスペクトル特性を音声のスペク
トル特性に近づけることによって、コードブックと照合
の際に最適なコードブックを選択可能にしているので、
無声時に生成される背景雑音の違和感が充分に低減化さ
れるようになる。又、本発明の他の音声符号化／復号化
装置によれば、音声符号化装置側で無声時の特性を活か
すため、無声時の音源に対してＬＰＣパラメータを有声
時のコードブックと照合せずに無声時のＬＰＣパラメー
タをそのままフォーマット変換して送信し、音声復号化
装置で一定時間で繰り返し生成される背景雑音のレベル
を制御して背景雑音を滑らかに変化させるようにしてい
るので、周波数（スペクトル）を考慮した上で一層自然
に背景雑音を生成制御できるようになる。As described above, according to the speech encoding / decoding apparatus of the present invention, the speech encoding apparatus uses the LPC analysis at the time of voice so that the noise has a spectral characteristic close to that of speech. After performing filtering by a filter using the LPC parameter, collating with the codebook and performing the encoding process, the decoding device performs the decoding process, and in the case of unvoiced, passes the inverse filter by the LPC parameter when voiced. Generates a natural background noise, and by making the spectral characteristics of the background noise close to the spectral characteristics of the voice, it is possible to select the optimal codebook when collating with the codebook.
The uncomfortable feeling of the background noise generated when there is no voice can be sufficiently reduced. Further, according to another speech encoding / decoding device of the present invention, in order to make use of the unvoiced characteristics on the speech encoding device side, the LPC parameter of the unvoiced sound source is compared with the codebook of voiced. The LPC parameter at the time of unvoice is converted without format and transmitted as it is, and the level of background noise repeatedly generated in a fixed time by the speech decoding device is controlled to smoothly change the background noise. In consideration of (spectrum), background noise can be generated and controlled more naturally.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明の一実施例に係る音声符号化／復号化装
置の音声符号化装置の基本構成を示したブロック図であ
る。FIG. 1 is a block diagram showing a basic configuration of a speech encoding device of a speech encoding / decoding device according to an embodiment of the present invention.

【図２】本発明の一実施例に係る音声符号化／復号化装
置の音声復号化装置の基本構成を示したブロック図であ
る。FIG. 2 is a block diagram showing a basic configuration of a speech decoding device of the speech encoding / decoding device according to one embodiment of the present invention.

【図３】図１に示す音声符号化装置の動作処理を示した
フローチャートである。FIG. 3 is a flowchart showing an operation process of the speech encoding device shown in FIG.

【図４】図２に示す音声復号化装置の動作処理を示した
フローチャートである。FIG. 4 is a flowchart showing an operation process of the speech decoding device shown in FIG. 2;

【図５】本発明の他の実施例に係る音声符号化／復号化
装置の音声符号化装置の基本構成を示したブロック図で
ある。FIG. 5 is a block diagram showing a basic configuration of a speech encoding device of a speech encoding / decoding device according to another embodiment of the present invention.

【図６】本発明の他の実施例に係る音声符号化／復号化
装置の音声復号化装置の基本構成を示したブロック図で
ある。FIG. 6 is a block diagram showing a basic configuration of a speech decoding device of a speech encoding / decoding device according to another embodiment of the present invention.

【図７】図５に示す音声符号化装置の動作処理を示した
フローチャートである。FIG. 7 is a flowchart showing an operation process of the speech encoding device shown in FIG.

【図８】図６に示す音声復号化装置の動作処理を示した
フローチャートである。FIG. 8 is a flowchart showing an operation process of the speech decoding device shown in FIG. 6;

【図９】従来の音声符号化装置の基本構成を示したブロ
ック図である。FIG. 9 is a block diagram showing a basic configuration of a conventional speech encoding device.

【図１０】従来の音声復号化装置の基本構成を示したブ
ロック図である。FIG. 10 is a block diagram showing a basic configuration of a conventional speech decoding device.

【符号の説明】[Explanation of symbols]

１，８有声・無声判別部２ＬＰＣ分析部３，１２ＬＰＣ蓄積部４高能率符号化処理部５ＬＰＦ６ユニークワード発生部７，１４スイッチ制御部９背景雑音更新部１０ＬＰＣ復号部１１高能率復号化処理部１３ＨＰＦ１５無声符号化変換部１６背景雑音制御部１７有声ＬＰＣ復号部１８無声ＬＰＣ復号部 Reference Signs List 1,8 Voiced / unvoiced discrimination unit 2 LPC analysis unit 3,12 LPC storage unit 4 High-efficiency coding processing unit 5 LPF 6 Unique word generation unit 7,14 Switch control unit 9 Background noise update unit 10 LPC decoding unit 11 High efficiency Decoding processing unit 13 HPF 15 Unvoiced coding conversion unit 16 Background noise control unit 17 Voiced LPC decoding unit 18 Unvoiced LPC decoding unit

Claims

(57)【特許請求の範囲】(57) [Claims]

【請求項１】入力音声信号に対して有声・無声区間を
判別して有声・無声に応じた識別制御信号を出力する有
声・無声判別部と、前記識別制御信号により有声のとき
に入力される前記入力音声信号に対して線形予測分析法
に基づいて合成フィルタのフィルタ係数を算出すること
でＬＰＣ（ＬｉｎｅａｒＰｒｅｄｉｃｔｉｖｅＣｏ
ｄｉｎｇ）パラメータを取得すると共に、該ＬＰＣパラ
メータをＬＳＰ（ＬｉｎｅＳｐｅｃｔｒｕｍＰａｉ
ｒ）パラメータに換算して出力するＬＰＣ分析部と、前
記識別制御信号により有声から無声に切り替わったとき
に直前の有声時における前記ＬＰＣパラメータを一時蓄
積するＬＰＣ蓄積部と、前記識別制御信号により前記Ｌ
ＰＣ蓄積部に蓄積された前記ＬＰＣパラメータに基づい
て無声のときの雑音特性を有声のときの雑音特性に近付
けて前記線形予測分析法に供する背景雑音を生成するた
めの濾波を行うＬＰＦと、前記ＬＰＣ分析部から出力さ
れる前記ＬＳＰパラメータ又は前記ＬＰＦ経由の前記背
景雑音を符号化処理して符号化音声信号又は雑音信号を
出力する高能率符号化処理部と、前記識別制御信号に応
じて有声のときに前記符号化音声信号，無声ときに前記
雑音信号をそれぞれ出力符号化信号として切り替え送出
するスイッチ制御部とを備えたことを特徴とする音声符
号化装置。1. A voiced / unvoiced discriminating unit for discriminating a voiced / unvoiced section from an input voice signal and outputting a discrimination control signal corresponding to voiced / unvoiced, and input when the discrimination control signal is voiced. By calculating filter coefficients of a synthesis filter for the input speech signal based on a linear prediction analysis method, an LPC (Linear Predictive Coding) is calculated.
Ding) parameters, and the LPC parameters are converted to LSP (Line Spectrum Pai).
r) an LPC analysis unit that converts the output into a parameter and outputs the LPC parameter; an LPC storage unit that temporarily stores the LPC parameter at the time of the immediately preceding voice when switching from voiced to unvoiced by the identification control signal; L
An LPF that performs filtering to generate a background noise to be subjected to the linear prediction analysis method by making a noise characteristic when unvoiced close to a noise characteristic when voiced based on the LPC parameter stored in the PC storage unit; A high-efficiency encoding processing unit that encodes the LSP parameter output from the LPC analysis unit or the background noise via the LPF and outputs an encoded voice signal or a noise signal; and voiced according to the identification control signal. And a switch control unit for switching and transmitting the coded speech signal as an output coded signal when unvoiced and the noise signal when unvoiced.

【請求項２】請求項１記載の音声符号化装置におい
て、前記高能率符号化処理部は、前記符号化処理に際し
て有声時の音声特性に基づいて作成されたコードブック
を照合することを特徴とする音声符号化装置。2. The speech encoding apparatus according to claim 1, wherein said high-efficiency encoding processing section checks a codebook created based on speech characteristics at the time of voice during said encoding processing. Speech encoding device.

【請求項３】請求項２記載の音声符号化装置におい
て、前記高能率符号化処理部は、雑音を含む音声以外の
情報に対して前記ＬＰＣパラメータによる合成フィルタ
を通して前記コードブックの照合を行うことを特徴とす
る音声符号化装置。3. The speech encoding apparatus according to claim 2, wherein the high-efficiency encoding processing unit performs collation of the codebook on information other than speech including noise through a synthesis filter based on the LPC parameter. A speech encoding device characterized by the above-mentioned.

【請求項４】請求項１〜３の何れか一つに記載の音声
符号化装置において、前記ＬＰＣ蓄積部は、前記ＬＰＣ
パラメータとして有声時の前記ＬＰＣ分析部からのＬＰ
Ｃ分析後の出力を蓄積することを特徴とする音声符号化
装置。4. The speech encoding device according to claim 1, wherein the LPC storage unit stores the LPC data.
LP from the LPC analyzer when voiced as a parameter
A speech coding apparatus for storing an output after C analysis.

【請求項５】請求項４記載の音声符号化装置におい
て、前記ＬＰＣ蓄積部は、前記ＬＰＣパラメータを前記
ＬＰＦのフィルタ係数として出力することを特徴とする
音声符号化装置。5. The speech encoding apparatus according to claim 4, wherein said LPC storage unit outputs said LPC parameter as a filter coefficient of said LPF.

【請求項６】請求項１〜５の何れか一つに記載の音声
符号化装置において、前記ＬＰＦは、前記識別制御信号
により無声のときに前記入力音声信号の雑音を入力し、
該雑音を有声時の該入力音声信号の雑音に近付けて前記
背景雑音を生成出力することを特徴とする音声符号化装
置。6. The speech encoding device according to claim 1, wherein the LPF inputs noise of the input speech signal when the LPF is unvoiced by the identification control signal,
A speech encoding apparatus characterized in that the noise is brought close to the noise of the input speech signal when voiced, and the background noise is generated and output.

【請求項７】請求項１〜６の何れか一つに記載の音声
符号化装置において、前記ＬＰＦは、送信者による発声
を示す有声又は無発声を示す無声を識別するＶＯＸ（Ｖ
ｏｉｃｅＯｐｅｒａｔｅｄＴｒａｎｓｍｉｔｔｅ
ｒ）機能を有すると共に、ＣＥＬＰ（Ｃｏｄｅ−ｂｏｏ
ｋＥｘｃｉｔｅｄＬｉｎｅａｒＰｒｅｄｉｃｔｉ
ｏｎ）方式に基づいて無声に伴う雑音のスペクトル特性
を音声に伴う雑音のスペクトル特性に近似して前記背景
雑音を生成出力することを特徴とする音声符号化装置。7. The speech encoding device according to claim 1, wherein the LPF identifies a voiced or unvoiced voice that indicates utterance or unvoiced voice by the sender.
Oice Operated Transmit
r) function and CELP (Code-boo
k Excited Linear Predicti
on) A speech coding apparatus characterized by generating and outputting the background noise by approximating the spectrum characteristics of noise accompanying unvoiced speech to the spectrum characteristics of noise accompanying speech based on a method.

【請求項８】請求項１〜７の何れか一つに記載の音声
符号化装置において、前記符号化音声信号又は前記雑音
信号に対する有声・無声区間を制御するためのユニーク
ワード制御信号を出力するユニークワード発生部を備
え、前記スイッチ制御部は、前記ユニークワード制御信
号と前記出力符号化信号とを切り替え送出することを特
徴とする音声符号化装置。8. The speech encoding apparatus according to claim 1, wherein a unique word control signal for controlling a voiced / unvoiced section for the encoded speech signal or the noise signal is output. A speech coding apparatus comprising a unique word generation unit, wherein the switch control unit switches and transmits the unique word control signal and the output coded signal.

【請求項９】請求項８記載の音声符号化装置からの出
力符号化信号を入力符号化信号として入力した上で復号
化する音声復号化装置であって、前記入力符号化信号に
対して前記ユニークワード制御信号に基づいて有声・無
声を判別して有声・無声に応じた制御信号を出力する有
声・無声判別部と、前記制御信号により無声のときに前
記入力符号化信号の背景雑音を蓄積する背景雑音更新部
と、前記制御信号により有声のときに前記入力符号化信
号をＬＰＣ復号化するＬＰＣ復号部と、前記ＬＰＣ復号
化に際してのＬＰＣパラメータを蓄積するＬＰＣ蓄積部
と、前記ＬＳＰパラメータに基づいて前記ＬＰＣ復号化
された後の前記入力符号化信号に対する復号化処理を行
って復号化音声信号又は雑音信号を出力する高能率復号
化処理部と、前記ＬＰＣパラメータに基づいて無声のと
きに前記雑音信号を濾波して背景雑音として出力するＨ
ＰＦと、有声のときに前記復号化音声信号，無声のとき
に前記背景雑音をそれぞれ切り替え送出するスイッチ制
御部とを備えたことを特徴とする音声復号化装置。9. A speech decoding apparatus for decoding an input coded signal output from the speech coding apparatus according to claim 8 after inputting the coded signal as an input coded signal. A voiced / unvoiced discriminator that discriminates voiced / unvoiced based on the unique word control signal and outputs a control signal corresponding to voiced / unvoiced, and accumulates background noise of the input coded signal when the control signal is unvoiced A background noise updating unit that performs LPC decoding on the input coded signal when voiced by the control signal; an LPC storage unit that stores LPC parameters for the LPC decoding; A high-efficiency decoding unit that performs a decoding process on the input coded signal that has been subjected to the LPC decoding based on the LPC and outputs a decoded speech signal or a noise signal; H that filters out the noise signal and outputs it as background noise when unvoiced based on PC parameters
A speech decoding apparatus comprising: a PF; and a switch control unit for switching and transmitting the decoded speech signal when voiced and the background noise when unvoiced.

【請求項１０】請求項９記載の音声復号化装置におい
て、前記ＬＰＣ蓄積部は、有声のときに前記ＬＰＣ復号
部からの復号化信号を蓄積して前記ＨＰＦのフィルタ係
数として出力することを特徴とする音声復号化装置。10. The audio decoding apparatus according to claim 9, wherein the LPC storage unit stores the decoded signal from the LPC decoding unit when voiced, and outputs the decoded signal as a filter coefficient of the HPF. Audio decoding device.

【請求項１１】請求項９又は１０記載の音声復号化装
置において、前記高能率符号化処理部は、雑音を含む音
声以外の情報に対して前記ＬＰＣパラメータによる合成
フィルタを通してコードブックの照合を行うことを特徴
とする音声復号化装置。11. The speech decoding apparatus according to claim 9, wherein the high-efficiency encoding processing unit performs codebook collation on information other than speech including noise through a synthesis filter based on the LPC parameter. An audio decoding device characterized by the above-mentioned.

【請求項１２】請求項９〜１１の何れか一つに記載の
音声復号化装置において、前記ＨＰＦは、送信者による
発声を示す有声又は無発声を示す無声を識別するＶＯＸ
機能を有すると共に、ＣＥＬＰ方式に基づいて無声に伴
う雑音のスペクトル特性を音声に伴う雑音のスペクトル
特性に近似して前記背景雑音を生成出力することを特徴
とする音声復号化装置。12. The voice decoding device according to claim 9, wherein the HPF identifies a voiced voice indicating a voice by the sender or a voiceless voice indicating an unvoiced voice.
A speech decoding apparatus having a function and generating and outputting the background noise by approximating a spectrum characteristic of noise accompanying unvoiced to a spectrum characteristic of noise accompanying speech based on a CELP method.