JP3158434B2

JP3158434B2 - Digital audio decoder with post-filter having reduced spectral distortion

Info

Publication number: JP3158434B2
Application number: JP51307390A
Authority: JP
Inventors: アランジャーソン・イラ; アントニジャシウク・マーク
Original assignee: モトローラ・インコーポレーテッド
Priority date: 1989-10-17
Filing date: 1990-09-17
Publication date: 2001-04-23
Anticipated expiration: 2016-04-23
Also published as: ATE177867T1; JPH05500573A; EP0570362B1; AU635342B2; ES2131498T3; AU6411490A; DE69033011D1; CN1078371C; EP0570362A1; EP0570362A4; WO1991006093A1; DE69033011T2; CN1051101A

Abstract

An adaptive spectral postfilter (108) in a synthesized speech platform has a denominator characteristic that corresponds to a preceding LPC filter stage (106) and a numerator characteristic that is developed as a function of the denominator characteristic through application of spectral smoothing techniques. This allows the numerator to track the denominator without the introduction of spectral distortion that would otherwise affect the processing in an adverse way.

Description

【発明の詳細な説明】技術分野この発明は一般的には音声コーダに関し、かつより特
定的には音声品質を改善するためにポストフィルタを使
用するデジタル音声コーダに関する。Description: TECHNICAL FIELD The present invention relates generally to audio coders, and more particularly to digital audio coders that use post filters to improve audio quality.

発明の背景音声コーダおよびデコーダが技術上知られている。い
くつかの音声コーダはアナログ音声サンプルをデジタル
化された表現に変換し、かつその後リニア予測符号化を
用いることによりスペクトル音声情報を表わす。他の音
声コーダは通常のリニア予測符号化（linear predicti
ve coding:LPC）技術に対し元の音声信号に関係する励
起信号（excitation signal）を提供することにより改
善を行なう。BACKGROUND OF THE INVENTION Speech coders and decoders are known in the art. Some audio coders convert analog audio samples into a digitized representation and then represent the spectral audio information by using linear predictive coding. Other speech coders use conventional linear predictive coding (linear predicti
An improvement is made to ve coding (LPC) technology by providing an excitation signal related to the original speech signal.

米国特許第4,817,157号は改善されたベクトル励起源
を有するデジタル音声コーダおよびデコーダを開示して
おり、これはコードブック励起ベクトルのコードブック
が利用可能な情報に最も適合するコードブック励起信号
を選択するためにアクセスされ、かつオリジナルを忠実
に表現するLPCフィルタから合成された音声信号を与え
るために使用される。U.S. Pat.No. 4,817,157 discloses a digital speech coder and decoder having an improved vector excitation source, which selects a codebook excitation signal whose codebook of codebook excitation vectors best fits the available information. Used to provide an audio signal that has been accessed and synthesized from an LPC filter that faithfully represents the original.

いったん合成された音声信号が出力されると、種々の
ポストLPCフィルタがしばしば使用されてさらに該信号
を調整する。１つのそのようなフィルタは適応スペクト
ルポストフィルタであり（これは典型的には合成音声の
知覚的な品質を増大することを意図している）、そして
他のものはポストエンファシス・フィルタである（これ
は合成音声結果に対するブライトネス（brightness）に
寄与する）。Once the synthesized audio signal is output, various post-LPC filters are often used to further condition the signal. One such filter is an adaptive spectral postfilter (which is typically intended to increase the perceived quality of the synthesized speech), and the other is a post-emphasis filter ( This contributes to the brightness of the synthesized speech result).

適応スペクトルポストフィルタは典型的には次の一般
式で表わされる。An adaptive spectral postfilter is typically represented by the following general formula:

（ｚ）＝｛１−Ａ（z/η）｝／｛１−Ａ（z/ν）｝この場合、０≦η≦ν＜１であり、かつ 1/｛１−Ａ（ｚ）｝は関連するLPCフィルタを表わす。 (Z) = {1-A (z / η)} / {1-A (z / ν)} In this case, 0 ≦ η ≦ ν <1 and 1 / {1-A (z)} is Represents the associated LPC filter.

上のポストフィルタ表現における分母項は合成信号ス
ペクトラムにおけるホルマントを強調し、一方でスペク
トルの谷を減衰する。（２つの極端な場合には、ν＝０
のセッティングはオールパスフィルタとなり、一方ν＝
１のセッティングは関連するLPCフィルタと同じ分母項
を生ずる結果となる。）分子項は分母によって導入され
る一般的なスペクトル形状を打ち消すよう作用する。従
来技術の用途においては、νはしばしばおよそ0.8にセ
ットされ、かつηはおよそ0.5にセットされる。The denominator term in the above postfilter representation enhances the formants in the composite signal spectrum, while attenuating spectral valleys. (In the two extreme cases, ν = 0
Is an all-pass filter, while ν =
A setting of 1 results in the same denominator term as the associated LPC filter. 2.) The numerator term acts to counteract the general spectral shape introduced by the denominator. In prior art applications, ν is often set to about 0.8 and η is set to about 0.5.

実際には、分子の多項式は分母のスペクトル形状を追
跡する上でほんの部分的にしか成功せず（実際には、フ
ィルタのスペクトル特性は時間とともに傾斜する）、そ
してその食い違いは典型的にはポストフィルタによりろ
波された音声のブライトネスの時間的に変化する変調と
して表われる。In practice, the numerator polynomial is only partially successful in tracking the spectral shape of the denominator (actually, the spectral characteristics of the filter will slope over time), and the discrepancy is typically Appears as a time-varying modulation of the brightness of the sound filtered by the filter.

従って、音声のブライトネスに対する有害な影響を最
小にしながら、同時に合成音声の知覚的な品質を向上す
る合成音声のポストフィルタリングの方法の必要性が存
在する。同様に音声のブライトネスそれ自体もよりよく
制御できることが好ましい。Accordingly, a need exists for a method of post-filtering synthesized speech that minimizes the detrimental effect on speech brightness while at the same time improving the perceived quality of the synthesized speech. Similarly, it is preferred that the brightness of the sound itself be better controllable.

発明の概要これらの必要性および他のものは実質的にここに開示
されたポストフィルタの装置により満たされる。この発
明によれば、あるポストフィルタ（postfilter）が提供
でき、該ポストフィルタは第１および第２の成分または
構成要素（component）によって特徴付けられる。第１
の構成要素は１組の係数を含む。これらの係数は代わり
のドメインの組のパラメータに変換され、かつ次に修正
された組のパラメータを与えるために操作される。これ
らは次の前記第２の構成要素を特徴付ける１組の係数を
提供するために使用される。SUMMARY OF THE INVENTION These needs and others are substantially satisfied by the post-filter apparatus disclosed herein. According to the present invention, a postfilter can be provided, wherein the postfilter is characterized by first and second components or components. First
Include a set of coefficients. These coefficients are converted to an alternate set of domain parameters and then manipulated to provide a modified set of parameters. These are used to provide a set of coefficients that characterize the second component:

１つの実施例においては、前記第１の構成要素を表わ
すＺ変換（フィルタ係数が自己相関ドメインに変換され
る。帯域幅拡張機能を用いるスペクトル平滑技術が次に
自己相関シーケンスに適用され、かつ第２の構成要素の
多項式の係数がレビンソン（Levinson）の帰納法を用い
て修正された自己相関シーケンスから計算される。上に
述べたフィルタ特性において第１の構成要素が次に分母
として使用され、かつ第２の構成要素が分子として使用
される。In one embodiment, a Z-transform representing the first component (the filter coefficients are transformed into the autocorrelation domain. A spectral smoothing technique using a bandwidth extension function is then applied to the autocorrelation sequence, and The coefficients of the two-component polynomial are calculated from the modified autocorrelation sequence using Levinson's induction.In the filter characteristics described above, the first component is then used as the denominator; And the second component is used as a molecule.

このプロセスにより、分子の多項式はＡ（z/ν）多項
式のスペクトル的に平滑化されたものによって置き代え
られる。フォーマットの帯域幅拡張は平滑化されたスペ
クトルのエンベロープを変えない。従って、Ａ（z/ν）
多項式のスペクトル的に平滑化された帯域幅が拡張され
たものは実効的に時間的に変化するスペクトルの傾斜を
最小化し、かつ分子が分母の概略的なスペクトル形状を
適応的に追跡しかつそれを打ち消すことができるように
する。By this process, the numerator polynomial is replaced by a spectrally smoothed version of the A (z / ν) polynomial. Format bandwidth extension does not change the envelope of the smoothed spectrum. Therefore, A (z / ν)
An extended spectrally smoothed bandwidth of the polynomial effectively minimizes the time-varying spectral slope, and the numerator adaptively tracks the general spectral shape of the denominator and To be able to negate.

他の実施例においては、付加的なポストエンファシス
フィルタが使用されポストフィルタがかけられた音声の
ブライトネスにより一層の制御を与えることができるよ
うにする。このフィルタは次の式で表わされる一次フィ
ルタである。In another embodiment, an additional post-emphasis filter is used to provide more control over the brightness of the post-filtered audio. This filter is a first-order filter represented by the following equation.

（ｚ）＝１−uz^-1 なお、この場合典型的には、0.2≦ｕ≦0.5である。(Z) = 1-uz ^{-1 In} this case, typically, 0.2 ≦ u ≦ 0.5.

図面の簡単な説明第１図は、本発明に従って構成された無線機のブロッ
ク図である。BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 is a block diagram of a wireless device configured according to the present invention.

第２図は、本発明に従った適応スペクトルポストフィ
ルタの特徴を表すフローチャートである。FIG. 2 is a flowchart illustrating the features of the adaptive spectral postfilter according to the present invention.

発明を実施するための最良の形態 1989年３月28日に、イラジャーソン（Ira Gerso
n）に発行された、「改良されたベクトル励起源を有す
るデジタル音声コーダ」と題する、米国特許第4,817,15
7号はデジタル音声コーダおよびデコーダにつきかなり
詳細に述べている。この参照文献に詳細に述べられてい
るように、この発明はモトローラ社のDSP56000型ファミ
リ装置のような適切なデジタル信号プロセッサを使用す
る音声コーダ（またはデコーダ）において実施できる。BEST MODE FOR CARRYING OUT THE INVENTION On March 28, 1989, Ira Gerso
US Patent No. 4,817,15 entitled "Digital Speech Coder with Improved Vector Excitation Source"
No. 7 describes the digital audio coder and decoder in considerable detail. As described in detail in this reference, the present invention can be implemented in an audio coder (or decoder) using a suitable digital signal processor, such as a Motorola DSP56000 family device.

第１図において、本発明を実施する無線機（100）
は、音声符号化無線周波（RF）信号（101）を受信する
ためのアンテナ（102）を含む。RFユニット（103）は受
信した信号を処理して音声符号化情報を復元する。この
情報は種々の後続の処理のための制御パラメータを出力
するパラメータデコーダ（105）に提供される。上に述
べた励起源（excitation source）（104）はそれに与
えられたパラメータを利用して励起信号を生成する。励
起源（104）から得られた励起信号はLPCフィルタ（10
6）に提供され、該LPCフィルタ（106）は符号化情報に
従って合成された音声信号を生成する。合成された音声
信号は次にピッチポストフィルタ（107）によりろ波さ
れかつスペクトルポストフィルタ（108）によりろ波さ
れて再構成または再現された音声の品質を向上させる。
もし必要であれば、ポストエンファシスフィルタ（10
9）を含むことができさらに音声信号品質を向上させ
る。（スペクトルポストフィルタ（108）およびポスト
エンファシスフィルタ（109）に関するこれ以上の詳細
については後に説明する。）音声信号は次にオーディオ処理ユニット（111）にお
いて処理されかつオーディオ変換器（112）により可聴
的にされる。励起源（104）、LPCフィルタ（106）、ピ
ッチポストフィルタ（107）、適応スペクトルポストフ
ィルタ（108）、そしてポストエンファシスフィルタ（1
09）はすべてDSP（113）の適切なプログラミングによっ
て提供できる。In FIG. 1, a wireless device (100) for implementing the present invention
Includes an antenna (102) for receiving a speech coded radio frequency (RF) signal (101). The RF unit (103) processes the received signal to restore the audio coded information. This information is provided to a parameter decoder (105) that outputs control parameters for various subsequent processing. The above mentioned excitation source (104) utilizes the parameters given to it to generate an excitation signal. The excitation signal obtained from the excitation source (104) is
6), the LPC filter (106) generates a synthesized speech signal according to the encoding information. The synthesized speech signal is then filtered by a pitch post-filter (107) and filtered by a spectral post-filter (108) to improve the quality of the reconstructed or reproduced speech.
If necessary, use a post-emphasis filter (10
9) can further improve the audio signal quality. (More details regarding the spectral post-filter (108) and the post-emphasis filter (109) will be described later.) The audio signal is then processed in an audio processing unit (111) and is audible by an audio converter (112). To be. Excitation source (104), LPC filter (106), pitch postfilter (107), adaptive spectral postfilter (108), and post-emphasis filter (1
09) can all be provided by appropriate programming of the DSP (113).

この発明によれば、適応スペクトルポストフィルタ
（108）は第１の構成要素（LPCフィルタ（106）のフィ
ルタ特性に関係する分母）および第２の構成要素（前記
分母の概略的なスペクトル形状を適応的に追跡しそれに
よりそれを打ち消す分子）により特徴付けられる。その
ようなフィルタの一般的な形式は、音響、音声、および
信号処理に関する国際会議紀要の1987年４月版の、ペー
ジ2185−2188に見られる、chenおよびGershoによる、
「適応ポストフィルタリングを備えた4800bpsの実時間
ベクトルAPC音声コーディング）と題する論文に述べら
れている。According to the present invention, the adaptive spectral post-filter (108) adapts the first component (the denominator related to the filter characteristics of the LPC filter (106)) and the second component (the approximate spectral shape of the denominator). Molecules that track and thereby counteract it). The general form of such a filter is described by chen and Gersho, found in the April 1987 edition of the Bulletin of the International Conference on Sound, Speech, and Signal Processing, pages 2185-2188.
"4800bps real-time vector APC speech coding with adaptive post-filtering).

この発明によれば、前記分子は分母の多項式にスペク
トル平滑技術を適用することにより出力される。そのよ
うな技術は音響、音声、および信号処理に関するIEEE紀
要の1978年12月版に見られる、トークラ、イタクラ、お
よびハシモトによる、「PARCOR音声分析−合成における
スペクトル平滑技術」と題する論文に述べられている。According to the invention, the numerator is output by applying a spectral smoothing technique to the denominator polynomial. Such a technique is described in a paper by Tokla, Itakura and Hashimoto entitled "PARCOR Speech Analysis-Spectral Smoothing Techniques in Synthesis", found in the December 1978 edition of the IEEE Bulletin of Sound, Speech, and Signal Processing. ing.

１つの実施例においては、分母を表わすＺ変換係数は
自己相関ドメインに変換される。（そのような変換の例
はMarkel,J.D.Gray,A.H.,Jr、音声のリニア予測（Sprin
ger−Verlag,Berlin,Heidelberg,New York,1976）に見
ることができる。）スペクトル平滑技術の帯域幅拡張機
能が次に自己相関シーケンスに適用され、分子の多項式
の係数はレビンソン帰納法によって前記修正された自己
相関シーケンスから計算される。１つの実施例において
は、自己相関係数は以下のファクタによって乗算され結
果的に分子係数を提供する。In one embodiment, the Z transform coefficients representing the denominator are transformed into the autocorrelation domain. (Examples of such transformations are Markel, JDGray, AH, Jr, linear prediction of speech (Sprin
ger-Verlag, Berlin, Heidelberg, New York, 1976). 2.) The bandwidth extension function of the spectral smoothing technique is then applied to the autocorrelation sequence and the numerator polynomial coefficients are calculated from the modified autocorrelation sequence by Levinson induction. In one embodiment, the autocorrelation coefficients are multiplied by the following factors, resulting in numerator coefficients.

表１自己相関の遅れスペクトル平滑ファクタ 0 1.0000000 1 0.9230769 2 0.7252747 3 0.4835164 4 0.2719780 5 0.1279896 6 4.9773753E−02 7 1.5718028E−02 8 3.9294070E−03 9 7.4847753E−04 10 1.0206513E−04 分母および分子は次に適応スペクトルポストフィルタ
（108）を特徴付けるために使用される。Table 1 Autocorrelation delay spectrum smoothing factor 0 1.0000000 1 0.9230769 2 0.7252747 3 0.4835164 4 0.2719780 5 0.1279896 6 4.9773753E-02 7 1.5718028E-02 8 3.9294070E-03 9 7.4847753E-04 10 1.0206513E-04 The denominator and numerator are It is then used to characterize the adaptive spectral postfilter (108).

勿論、LPCフィルタ情報を直接使用しかつ同様のプロ
セスによってそこから分子項を出力することも可能であ
り、これはLPCフィルタ情報が上に述べた分母項を出力
するのに使用されるからである。Of course, it is also possible to use the LPC filter information directly and output the numerator terms therefrom by a similar process, since the LPC filter information is used to output the denominator terms mentioned above. .

この処理により、分子多項式が分母多項式のスペクト
ル的に平滑化されたものにより提供される。分母多項式
のスペクトル的に平滑化された帯域幅拡張版は実効的に
時間的に変化するスペクトルの傾きを最小にしかつ分子
が適応的に分母の概略的なスペクトル形状を追跡してそ
れを打ち消すことを可能にする。リスニング試験に基づ
き、約1200Hzの帯域幅拡張ファクタ（これは分母に対し
て行なわれる平滑の程度を特定する）が使用された。With this processing, the numerator polynomial is provided by the spectrally smoothed version of the denominator polynomial. A spectrally smoothed bandwidth extension of the denominator polynomial effectively minimizes the time-varying spectral slope and the numerator adaptively tracks and cancels the approximate spectral shape of the denominator Enable. Based on listening tests, a bandwidth extension factor of about 1200 Hz, which specifies the degree of smoothing performed on the denominator, was used.

第２図のフローチャートは、上記の適応スペクトルポ
ストフィルタの特徴の理解に役立つ。前述したように、
適応スペクトルポストフィルタは、第１の構成要素すな
わち分母と、第２の構成要素すなわち分子とによって特
徴づけられる。第１の構成要素は、工程202に示すよう
に、以下の式で表現できる。The flowchart of FIG. 2 helps to understand the characteristics of the above-mentioned adaptive spectral post-filter. As previously mentioned,
The adaptive spectral postfilter is characterized by a first component or denominator and a second component or numerator. The first component can be represented by the following equation, as shown in step 202.

１−Ａ（z/ν）続く工程203において、第１の構成要素を表すＺ変換
係数を自己相関ドメインに変換する。工程204におい
て、スペクトル平滑帯域幅拡張を自己相関順列に適用す
る。続く工程205において、工程204で変形された自己相
関順列から、レビンソン帰納法を使用して分子（第２の
構成要素）多項係数を計算する。この分子すなわち第２
の構成要素は、以下の式で表現できる。1-A (z / ν) In the following step 203, the Z-transform coefficient representing the first component is transformed into the autocorrelation domain. In step 204, a spectral smoothing bandwidth extension is applied to the autocorrelation permutation. In the subsequent step 205, a numerator (second component) polynomial coefficient is calculated from the autocorrelation permutated in the step 204 using Levinson induction. This molecule, the second
Can be expressed by the following equation.

１−Ｂ（ｚ）最後の工程206において、第１および第２の構成要素
（分母および分子）を用いて、適応スペクトルポストフ
ィルターを特徴づける（以下の式で表現できる）。1-B (z) In a final step 206, the adaptive spectral postfilter is characterized using the first and second components (denominator and numerator) (which can be expressed by the following equation).

｛１−Ｂ（ｚ）｝／｛１−Ａ（z/ν）｝ポストエンファシスフィルタ（109）はポストフィル
タによりろ波された音声のブライトネスに対しより一層
の制御を可能にするため提供される。該フィルタは次の
形式の一次のフィルタである。{1-B (z)} / {1-A (z / ν)} A post-emphasis filter (109) is provided to allow more control over the brightness of the audio filtered by the post-filter. . The filter is a primary filter of the form:

（ｚ）＝１−uz^-1 この場合、典型的には、0.2≦ｕ≦0.5である。(Z) = 1-uz ^{-1 In} this case, typically 0.2 ≦ u ≦ 0.5.

フロントページの続き (56)参考文献特開昭64−13200（ＪＰ，Ａ) 特公昭58−51280（ＪＰ，Ｂ２) 特公昭59−41597（ＪＰ，Ｂ２) 特公昭59−24439（ＪＰ，Ｂ２) 欧州特許570362（ＥＰ，Ｂ１) 米国特許4301329（ＵＳ，Ａ) Ｐｒｏｃｅｅｄｉｎｇｓｏｆ 1987 ＩＥＥＥＩｎｔｅｒｎａｔｉｏｎａｌＣｏｎｆｅｒｅｎｃｅｏｎＡｃｏｕｓｔｉｃｓ，ＳｐｅｅｃｈａｎｄＳｉｇｎａｌＰｒｏｃｅｓｓｉｎｇ，Ｖｏｌ．４，Ｊｕｉｎ−ＨｗｅｙＣｈｅｎｅｔａｌ，”Ｒｅａｌ−ＴｉｍｅＶｅｃｔｏｒＡＰＣＳｐｅｅｃｈＣｏｄｉｎｇａｔ 4800 ｂｐｓｗｉｔｈＡｄａｐｔｉｖｅＰｏｓｔｉｆｉｌｔｅｒｉｎｇ”，ｐ. 2185−2188，Ａｐｒｉｌ 1987 Ｐｒｏｃｅｅｄｉｎｇｓｏｆ 1988 ＩＥＥＥＩｎｔｅｒｎａｔｉｏｎａｌＣｏｎｆｅｒｅｎｃｅｏｎＡｃｏｕｓｔｉｃｓ，ＳｐｅｅｃｈａｎｄＳｉｇｎａｌＰｒｏｃｅｓｓｉｎｇ，Ｖｏｌ．１，Ｄ．ＪＺａｒｋｉａｄｉｓｅｔａｌ，”Ａ 16ＫＢ／ＳＡＰＣＳｙｓｔｅｍｗｉｔｈＡｄａｐｔｉｖｅＰｏｓｔｆｉｌｔｅｒａｎｄＥｖａｌｕａｔｉｏｎｏｆｉｔｓＰｅｒｆｏｍａｎｃｅ”, ｐ．631−634 ＡｄｖａｎｃｅｓｉｎＳｐｅｅｃｈＣｏｄｉｎｇ，ＩＥＥＥＷｏｒｋｓｈｏｐｏｎＳｐｅｅｃｈＣｏｄｉｎｇｆｏｒＴｅｌｅｃｏｍｍｎｉｃａｔｉｏｎｓ 1991，Ｊ．Ｇｅｒｓｏｎ，”ＶｅｃｔｏｒＳｕｍＥｘｃｉｔｅｄＬｉｎｅａｒＰｒｅｄｉｃｔｉｏｎ”，ｐ．69−79，ＶａｎｃｏｖｅｒＣａｎａｄａ (58)調査した分野(Int.Cl.⁷，ＤＢ名) G10L 11/00 - 13/08 G10L 19/00 - 21/06 ＩＥＥＥ／ＩＥＥＥｌｅｃｔｒｏｎｉｃＬｉｂｒａｒｙＯｎｌｉｎｅＪＩＣＳＴファイル（ＪＯＩＳ)Continuation of the front page (56) References JP-A-64-13200 (JP, A) JP-B-58-51280 (JP, B2) JP-B-59-41597 (JP, B2) JP-B-59-24439 (JP, A) , B2) European Patent 570362 (EP, B1) US Patent 4,301,329 (US, A) Proceedings of 1987 IEEE International Conference on Acoustics, Speech and Signal Processing, Vol. 4, Juin-Hway Chen et al, "Real-Time Vector APC Speech Coding at 4800 bps with Adaptive Pose Filtering", p. Signal Processing, Vol. 1, D. J Zarkia dis et al, "A 16KB / S APC System with A Adaptive Postfilter and Evaluation of Performances Performance", p. 631-634 Advances in Speech Coding, IEEE Work shop on Speech Coding for Telecommunications 1991, J. Am. Gerson, "Vector Sum Excited Linear Prediction", p. 69-79, Vancover Canada (58) Fields investigated (Int. Cl. ⁷ , DB name) G10L 11/00-13/08 G10L 19/00-21/06 IEEE / IEEE Electronic Library Online JICST file (JOIS )

Claims

(57)【特許請求の範囲】(57) [Claims]

【請求項１】合成音声信号を生成する方法であって：Ａ）励起信号をLPCフィルタに提供する段階；Ｂ）前記LPCフィルタから、合成音声信号を提供する段
階；Ｃ）第１の構成要素および第２の構成要素を必要とする
音声合成ポストフィルタを提供する段階；Ｄ）第１の組の係数を含む前記第１の構成要素を提供す
る段階；Ｅ）少なくともいくつかの前記第１の組の係数を自己相
関ドメインの組のパラメータに変換する段階；Ｆ）前記自己相関ドメインの組のパラメータにスペクト
ル平滑演算を行って修正された第１の組の係数を提供す
る段階；Ｇ）前記修正された第１の組の係数を用いて、前記音声
合成ポストフィルタにより使用するための前記第２の構
成要素を提供する段階；Ｈ）前記第１の構成要素および前記第２の構成要素を使
用して音声合成ポストフィルタにおいて合成音声信号を
濾波し、濾波された合成音声信号を提供する段階；およ
びＩ）前記濾波された合成音声信号を可聴にする段階；を備えたことを特徴とする方法。1. A method for generating a synthesized speech signal, comprising: A) providing an excitation signal to an LPC filter; B) providing a synthesized speech signal from the LPC filter; C) a first component. Providing a speech synthesis postfilter requiring a second component and a second component; D) providing the first component comprising a first set of coefficients; E) at least some of the first components. Converting the set of coefficients to autocorrelation domain set parameters; F) performing a spectral smoothing operation on the autocorrelation domain set parameters to provide a modified first set of coefficients; Using the modified first set of coefficients to provide the second component for use by the speech synthesis postfilter; H) providing the first component and the second component; use Wherein further comprising a; step of audible a and I) said filtered synthesized speech signal; filters the synthesized speech signal in the speech synthesis postfilter stage provides a filtered synthesized speech signal Te.

【請求項２】請求項１に記載の合成音声信号を生成する
方法であって、さらに：Ｊ）符号化音声情報を含むRF信号を受信する段階；Ｋ）前記符号化音声情報から励起信号を復元する段階；
およびＬ）該励起信号をLPCフィルタに提供する段階；を備え、第１の構成要素が前記音声合成ポストフィルタ
により使用されることを特徴とするもの。2. The method for generating a synthesized speech signal according to claim 1, further comprising: J) receiving an RF signal including encoded speech information; K) extracting an excitation signal from the encoded speech information. Restoring;
And L) providing the excitation signal to an LPC filter, wherein a first component is used by the speech synthesis post-filter.

【請求項３】前記LPCフィルタは少なくとも部分的に次
の式、 1/｛１−Ａ（ｚ）｝により定義される、請求項１または２に記載の方法。3. The method according to claim 1, wherein the LPC filter is defined at least in part by the following equation: 1 / {1-A (z)}.

【請求項４】前記音声合成ポストフィルタの第１の構成
要素はＺ変換表記法により次の式、 1/A（z/ν）で表される、請求項１または２に記載の方法。4. The method according to claim 1, wherein the first component of the speech synthesis post-filter is expressed by the following formula in Z-transform notation: 1 / A (z / ν).

【請求項５】さらに、Ｍ）前記合成音声信号をＺ変換表記法で、Ｈ（ｚ）＝１−uz^-1 ただし0.2≦ｕ≦0.5、で実質的に定義されるポストエン
ファシスフィルタによって濾波する段階；を含む、請求項１または２に記載の方法。5. In addition, M) the synthesized speech signal in Z transform notation, is filtered by H (z) = 1-uz -1 However 0.2 ≦ u ≦ 0.5 in the post emphasis filter substantially defined, 3. The method according to claim 1 or 2, comprising the step of:

【請求項６】前記演算を行う段階は乗算手段を含む請求
項１または２に記載の方法。6. The method according to claim 1, wherein the step of performing an operation includes a multiplication means.

【請求項７】請求項１に記載の方法であって：前記音声合成ポストフィルタの第２構成要素は１−Ｂ（ｚ）の形式であり、前記音声合成ポストフィルタは（１−Ｂ（ｚ））／（１−Ａ（z/ν））の形式である、ことを特徴とする方法。7. The method of claim 1, wherein: the second component of the speech synthesis postfilter is of the form 1-B (z); and the speech synthesis postfilter is (1-B (z )) / (1-A (z / ν)).