JPS5814196A

JPS5814196A - Voice synthesization circuit

Info

Publication number: JPS5814196A
Application number: JP56112703A
Authority: JP
Inventors: 日比野　昌弘
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1981-07-17
Filing date: 1981-07-17
Publication date: 1983-01-26
Also published as: JPS6046438B2

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】この発明は音声波形を分析して特徴パラメータを抽出し
、この特徴パラメータを一定時間（以下、フレーム周期
と称す）毎にメモリ手段に転送し、ディジタルフィルタ
によりこの特徴パラメータに基づいて音声波形全合成出
力する偏自己相関分析合成方式の音声合成器に関するも
のである。DETAILED DESCRIPTION OF THE INVENTION This invention analyzes audio waveforms to extract feature parameters, transfers these feature parameters to memory means at fixed time intervals (hereinafter referred to as frame periods), and uses digital filters to extract feature parameters. This invention relates to a speech synthesizer using a partial autocorrelation analysis synthesis method that outputs speech waveform total synthesis based on .

現在実用に供されている音声合成器の多くは、偏自己相
関分析合成方式にもとづくもので、合成計算を行なう回
路は１個のシリコンチップに集積化されるに至っている
。このような音声合成器は一般に第１図の分析合成シス
テムの合成側の各機能図ＦＮ１（１００）　？集積化し
たものとなっている。Most of the speech synthesizers currently in practical use are based on the partial autocorrelation analysis synthesis method, and the circuit for performing synthesis calculations has come to be integrated on a single silicon chip. Such a speech synthesizer generally has the function diagram FN1 (100) on the synthesis side of the analysis and synthesis system shown in Figure 1. It has become integrated.

同図中、（３００）はパラメータファイルで、分析器（
２００）で分析抽出された音声の特徴パラメータを記憶
する手段、たとえば読み出し専用メモリである。In the figure, (300) is a parameter file for the analyzer (
200), such as a read-only memory.

この音声合成器の主要部は一般に第２図のブロック図に
示すような回路構成で、第１図の分析器（２００）で音
声波形から分析抽出され、さらに量子化された特徴デー
タＤのピッチ、有声・無声判定コード、振幅、偏自己相
関係＆（いわゆるにパラメータ）を復号化する復号器（
１１０）、（１２０）、（１３０）。The main part of this speech synthesizer generally has a circuit configuration as shown in the block diagram of FIG. , voiced/unvoiced judgment code, amplitude, partial autocorrelation & (so-called parameters)
110), (120), (130).

それぞれの復号されたパラメータを一時記憶するメモリ
（１１１）、（１２１）、（１３１）、メモリ（１１１
）の出力であるピッチパラメータの値に対応したパルス
列を発生するパルス発生回路（１１２）　、無声音用音
源として使用する白雑音を発生する１雑音発生回路（１
１３）　、有声・無声判定コードに対応して音源信号と
してパルス列か１雑音信号かを選択する音源選択回路（
１１４）、音源信号に振幅値メモＩＪ　（１２１）の内
容を掛は合わせる振幅乗算回路（１４０）、　Ｋ、＜ラ
メータメモリ（１３１）の内容に対応したフィルタ係数
を用いて音源信号から所定の周波数スペクトラム成分を
抽出するディジタルフィルタ（１５０）、ディジタルフ
ィルタ（１５０）のディジタル波高値Ｙｌ全アナログ信
号、ＹＶｃ変換するＤ／Ａ変換器（１６０）から構成さ
れている。なお、図示はされてい表いが、これら以外に
、これらの各機能回路を時間的なタイミングをはかつて
操作させるために必要なタイミング信号発生回路や、復
号器（１１０）、（１２０）。Memories (111), (121), (131) for temporarily storing the respective decoded parameters;
), a pulse generation circuit (112) that generates a pulse train corresponding to the value of the pitch parameter which is the output of
13) , a sound source selection circuit (
114), an amplitude multiplier circuit (140) that multiplies the sound source signal by the contents of the amplitude value memo IJ (121); It consists of a digital filter (150) for extracting spectrum components, a digital peak value Yl of the digital filter (150), an all-analog signal, and a D/A converter (160) for YVc conversion. Although not shown in the figure, there are also timing signal generation circuits and decoders (110) and (120) necessary for controlling the temporal timing of each of these functional circuits.

（１３０）　ＶＣ外部メモリに貯えられている音声分析
によって得られた時系列データを順次取り込むためのイ
ンタフェース回路などが、加わって音声合成器を構成し
ている。(130) An interface circuit for sequentially importing time-series data obtained by speech analysis stored in the VC external memory is added to constitute a speech synthesizer.

このような音声合成器では、音声データＤｆ記憶するメ
モリを節約するために分析データの情報圧縮が行なわれ
ており、１秒間の音声について約２０００ビツト程度に
圧縮した場合でも明瞭度はあまり損われず、実用に供し
得る。圧縮方法は種々あるが、１例として振幅パラメー
タは４〜６ビツト、ピッチパラメータは５〜６ビツト、
Ｋパラメータについては不均一ビット配分と称して、Ｋ
。In such speech synthesizers, information compression is performed on the analysis data in order to save memory for storing the speech data Df, and even if one second of speech is compressed to approximately 2000 bits, the intelligibility is not significantly impaired. However, it can be put to practical use. There are various compression methods; for example, the amplitude parameter is 4 to 6 bits, the pitch parameter is 5 to 6 bits,
Regarding the K parameter, it is called non-uniform bit allocation.
.

〜Ｋｌｏノ順Ｋ　５，５．４．４．４．４．４．３．３
．３ピツトあるいは、７．５．４．４．４．３．３．３
．３．３ビツトに割り当てられている。~Klo order K 5, 5.4.4.4.4.4.3.3
．． 3 pits or 7.5.4.4.4.3.3.3
．． It is assigned to 3.3 bits.

第２図中の復号器（１１０）、（１２０）、（１３０）
は量子化されたこれらのパラメータコードを分析データ
の真値に復号するもので、それぞれのビット数に応じた
語数のテーブルを成している。通常回路構成上の制約か
ら、復号されるディジタル数値は１０ピット程度の精度
ケ有している。普た復号テーブルの６値は分析値の上限
値と下限値の間を線形量子化あるいは、逆双曲線関数変
換した後に線形量子化したものが設定されている。Decoders (110), (120), (130) in FIG.
is used to decode these quantized parameter codes into the true values of analysis data, and forms a table with the number of words corresponding to the number of bits. Usually, due to circuit configuration constraints, the decoded digital value has an accuracy of about 10 pits. The six values of the general decoding table are set by linear quantization between the upper limit value and lower limit value of the analysis value, or by linear quantization after inverse hyperbolic function transformation.

上述の音声合成器は音声を合成する場合、小容量の音声
データメモリでかなり自然度の高い合成音声ケ得ること
ができる。１．かし正弦波等の楽音については、量子化
に伴うスペクトル歪や、音源周波数とディジタルフィル
タ（１５０）の極周波数の不整合による変調ノイズが大
きく、十分な音質を得ることができなかった。また後に
詳述するように、正弦波等の純音で音階の構成や数百Ｈ
ｚ以上の基本周波数の楽音の発生が不可能であった。When the above-mentioned speech synthesizer synthesizes speech, it is possible to obtain synthesized speech with a high degree of naturalness using a small capacity speech data memory. 1. For musical tones such as sine waves, sufficient sound quality could not be obtained due to spectral distortion caused by quantization and large modulation noise due to mismatch between the sound source frequency and the pole frequency of the digital filter (150). In addition, as will be explained in detail later, we will explain the composition of scales using pure tones such as sine waves, and
It was impossible to generate musical tones with a fundamental frequency higher than z.

この発明は上述の音声合成器に改良を加え音声のみなら
ず、正弦波などの楽音の合成および音階音（メロディ）
のｔｌｔ成も可能とするものである。This invention improves the above-mentioned speech synthesizer and synthesizes not only speech but also musical tones such as sine waves and scale tones (melody).
It is also possible to perform tlt construction.

以下、この、発明の詳細な説明する。。This invention will be explained in detail below. .

全権型ディジタルフィルタの伝達関数は極数が１１のと
き、Ｈ（Ｚ）−Ａ／　（１＋ａ、　Ｚ　’　＋ａ２Ｚ＋
２１・−−・・（１）Ｚ＝ｅ　　Ｐ”：Ｊ２ｙｒｆＴである。上式において極周波数１ｆｒとすると（１）式
の分母−〇とおいた連立方稈式よりａｌ−２ｅ−ρｃｏｓ２πｆｒ７・・・・・・・・・・
・・・・・・・（２）ａ２＝−〇−１ａρ なる関係式が成立する。一方このフィルタのインパルス
レスポンスはｘｉ＝４ｅ−９１画２πｆｒｉ７・・・・・・・・・・
・・・・・（３）で表わされる。（３）式は減衰振動波
形全意味しており、楽音として好適な波形である。つぎ
ＶＣ線形予測係数０１は数学的な変換処理により偏自己
相関係数のにパラメータと次式によって関係付けられる
。When the number of poles is 11, the transfer function of the all-powerful digital filter is H(Z)-A/ (1+a, Z' +a2Z+
21・---(1) Z=e P'': J2yrfT.If the polar frequency is 1fr in the above equation, then from the simultaneous cubic equation with the denominator -〇 in equation (1), al-2e-ρcos2πfr7...・・・・・・・・・
(2) The relational expression a2=-〇-1aρ holds true. On the other hand, the impulse response of this filter is xi=4e-91 strokes 2πfri7...
...It is expressed as (3). Equation (3) represents the entire damped vibration waveform, and is a waveform suitable for musical tones. Next, the VC linear prediction coefficient 01 is related to the parameter of the partial autocorrelation coefficient by the following equation through mathematical conversion processing.

Ｋ＋＝６＋　／（１−０２）　　・・・・・・・・・・
・・・・・・・・・（４）２−０２１７たがってｆｒ＝（１／２πＴ）ｃｏｓ　　［（１−＋−６”ρ）
Ｋ＋／（２ｅ−ρ；＝（１／２πＴ）ｃｏｓ　　［ｌ−
に２）Ｋ＋／（２ｖ’　Ｋｚ））・・−・・（５）ρ−
−（１／２　）７？ｎ（−に雪）である。（５）式によれば減衰振切波形の周波数はＫｌ
　１１（２パラメータの値によって、また減衰定数はに
２パラメータによって一意的に定まる。なお間代におい
てに２が−０，９５〜−１，０の範囲では、Ｋｚのｙ化
が極周波＆に影響を与える程度は１％以下であり、＠路
上の音程の狂い感はない。この場合（５）式のｆｒは近
似的に次式で与えられ、ｆｒはに１のみに対応する。K+=6+/(1-02) ・・・・・・・・・・・・
・・・・・・・・・(4) 2-02 17 Therefore fr=(1/2πT) cos [(1-+-6”ρ)
K+/(2e-ρ;=(1/2πT) cos [l-
2) K+/(2v' Kz))...-(5) ρ-
-(1/2)7? It is n (- snow). According to equation (5), the frequency of the damped cut-off waveform is Kl
11 (The attenuation constant is uniquely determined by the values of the two parameters. In addition, in the range of -0,95 to -1,0 for 2, the y conversion of Kz becomes polar frequency & The degree of influence is 1% or less, and there is no sense of pitch deviation on the road.In this case, fr in equation (5) is approximately given by the following equation, and fr corresponds only to 1.

ｆｒ中（１／２ｙｒＴ）ｃｏｓ　’に、　　　　　（６
）Ｋｚの値の上述の範囲は減衰定数のθ〜０．０２５６
　Ｋ対応し、すなわち減衰のない定常正弦波形から約４
０サンプリング周期で１／εに減衰する波形に対応する
。これはピアノ楽器などの自然楽器音の減衰特性に近い
ものであり楽音として好適である。(6
) The above range of the value of Kz is the damping constant θ ~ 0.0256
K corresponding, i.e. from a steady sinusoidal waveform with no damping, approximately
This corresponds to a waveform that attenuates to 1/ε with a sampling period of 0. This is close to the attenuation characteristic of the sound of a natural musical instrument such as a piano instrument, and is suitable for musical sounds.

一方音声用として構成された１０段のディジタルフィル
タの演算アルゴリズムは表ＩＶＣ示す逐次計算式である
。On the other hand, the calculation algorithm of the 10-stage digital filter configured for audio use is the sequential calculation formula shown in Table IVC.

この式中のｙｊ、ｂｊ　　はそれぞれ格子型フィルタに
おける前進波、後進波のｊステージにおける中間値で（
りの１はサンプリング番号である。フィルタ出力はｂｌ
（ｉ）である。表１の遂次計算式はに３〜ＫＩＯ＝０の
場合１極のディジタルフィルタとして機能し、線形予測
係数ａｌ＋１１１２を用いて表わした場合、（４）式を
考慮してＸｎ＝１Ｊ＋ａｌ　ｘｎ、　＋Ｑ２　Ｘｎ−−・−−・
（７）なる式と等価である。ただし、ｘｎ　　はｎ番目
のサンプル周期に対応する波形値、ＸｎｔＸｎ−、はそ
］れぞれｘｎから１つ前、２つ前のサンプル時点の値？、
Ｕは音源信号値を意味する。In this equation, yj and bj are the intermediate values at the j stage of the forward wave and backward wave in the lattice filter, respectively (
1 is the sampling number. The filter output is bl
(i). The sequential calculation formula in Table 1 functions as a one-pole digital filter when 3 to KIO = 0, and when expressed using the linear prediction coefficient al+1112, considering formula (4), Xn = 1J + al xn, +Q2 Xn---・---・
This is equivalent to equation (7). However, xn is the waveform value corresponding to the nth sampling period, and ,
U means the sound source signal value.

（１）式の伝達関数で決まるディジタルフィルタのイン
パルス応答（３）式のｘｌは（７）式において音源信号
値ＵをインパルスとしたときのＸｎ　Ｋ一致する。The impulse response of the digital filter determined by the transfer function of equation (1), xl in equation (3), matches Xn K when the sound source signal value U is taken as an impulse in equation (7).

この場合、重要なことは、音源にインパルスを用いるこ
とであり、音声合成の目的で第２図の音源信号発生口＠
　（１１２）、（１１３）で生成された有声の場合のイ
ンパルス列、無声の場合のホワイトノイ？（“白雑音）
を用いたのでは目的を充分に達成することができない。In this case, the important thing is to use impulses as the sound source, and for the purpose of speech synthesis, the sound source signal generation port @
Impulse trains generated in (112) and (113) for voiced cases, white noise for unvoiced cases? (“white noise”)
If you use , you will not be able to fully achieve your purpose.

以下に従来の偏自己相関分析合成方式の音声合成器で楽
音を発生する際の間粗点を詳説する。Below, a detailed explanation will be given of the roughness when musical tones are generated by a speech synthesizer using a conventional partial autocorrelation analysis synthesis method.

音声合成用として用いられるホワイトノイズは、そのホ
ワイトノイズがランダムパルス列として模擬的に作られ
るものであり、完全なランダム性、すなわち周波数スペ
クトラムの均一性を有し得す、何らかの音声を持ってい
る。また有声音用としてのインパルス列の場合は、イン
パルス周期で決まる周波数およびその整数倍に特に強い
成分のあるスペクトラムを有する。White noise used for speech synthesis is simulated as a random pulse train, and has some kind of sound that can have complete randomness, that is, uniformity of the frequency spectrum. In addition, an impulse train for voiced sound has a spectrum with particularly strong components at frequencies determined by the impulse period and integral multiples thereof.

このような音源侶′８を入力し、たときのディジタルフ
ィルタの出力は音源に含まれる周波＆成分とフィルタ係
数に１によって決まる極周波数の一致した周波数におい
て大きな成分を生じるが、音源に含まれる他の周波数成
分も少なからず存在し、これが聴感上には変調ノイズ感
やうなり感を生ずることになる。したがって、定常的に
エネルギーを有する音源信号ケフィルタに入力する場合
は音源そのものの周波数スペクトラムがフィルタの極周
波＆あるいはその整数倍の周波数のみを有するものでな
ければ良質な楽音とすることができない。When such a sound source is input, the output of the digital filter will produce a large component at the frequency where the frequency and component contained in the sound source and the pole frequency determined by the filter coefficient coincide with 1, but the frequency and component contained in the sound source will be There are also quite a few other frequency components, which give rise to a sense of modulation noise and beat to the auditory sense. Therefore, when a sound source signal having constant energy is input to a filter, a high-quality musical sound cannot be obtained unless the frequency spectrum of the sound source itself has only the polar frequency of the filter and/or frequencies that are integral multiples thereof.

しかし有声用音源として作られるインパルス列はサンプ
リング周期を最小時間間隔としたものしか作り得す、１
−たがって音源信号の基本周波数は表２の例に示すよう
な段階的なものとなり、通常の方法で量子化および復号
化されたに１パラメータ値で決まる極周波数とは一致し
ない。表２のピッチ周期はインパルス列のパノ１ス間隔
をサンプリング点数の形で表現した数値であり、基本周
波数は表２に示される周波数列では１オクタ一ブ以上に
わたる平均律音階や純生調音階を構成できないし、サン
プリング周波数が８〜ｌ０ＫＨ２では攻１００Ｈ２以上
の基本周波数の楽音は極〈僅かのものしか得られない。However, the impulse train created as a voiced sound source can only be created with the sampling period as the minimum time interval.
- Therefore, the fundamental frequency of the sound source signal becomes stepwise as shown in the example in Table 2, and does not match the polar frequency determined by the value of one parameter quantized and decoded using the usual method. The pitch period in Table 2 is a numerical value that expresses the pano-first interval of an impulse train in the form of a sampling point, and the fundamental frequency is a value that represents an equal-tempered scale or a pure tonal scale over one octave or more in the frequency sequence shown in Table 2. It cannot be configured, and if the sampling frequency is 8 to 10KH2, only very few musical tones with a fundamental frequency of 100H2 or higher can be obtained.

この発明による音声合成器は前述の原理にもとづき、従
来の音声合成器に以下の手段を工夫することにより、歪
のない楽音が発生できるようにしたものである。The speech synthesizer according to the present invention is based on the above-mentioned principle and is capable of generating distortion-free musical tones by adding the following means to the conventional speech synthesizer.

この発明に係る音声合成器の一実施例第３図によって説
明する。この実施例は第２図に示した従来の音声合成器
に楽音用としてに＋　、に２パラメータの復号器（１７
０）を付加した構成となしたものである。パラメータ、
復号器（１７０）はコード化されている特徴パラメータ
Ｉｎ入力して・、ディジタルフィルタ（１５０＞　ＶＣ
係数として用いられるに、、に２パラメータを復号して
作り出す機能回路である。１これはたとえば復号値を１
０進数値で表わしたものを読出し専用メモリに記憶させ
、コードデータをそのメモリのアドレスとして対応する
メモリ内容値をメモリから読み出す、いわゆるテーブル
方式の機能回路で実現できる。An embodiment of the speech synthesizer according to the present invention will be explained with reference to FIG. This embodiment adds a 2-parameter decoder (17) to the conventional speech synthesizer shown in FIG.
0) is added. parameters,
The decoder (170) inputs the encoded feature parameters In and converts them into digital filters (150>VC
This is a functional circuit that decodes and creates two parameters to be used as coefficients. 1 This means, for example, that the decoded value is 1
This can be realized by a so-called table-type functional circuit that stores the value expressed in decimal digits in a read-only memory and reads out the corresponding memory content value from the memory by using the code data as an address of the memory.

メモリに記憶させておく、復号値は原理脱ｔＪ［おいて
述べた式（２）及び式（４）から導き出される次式によ
って計算される値とする。The decoded value to be stored in the memory is a value calculated by the following equation derived from equations (2) and (4) described in the principle tJ[.

Ｋ、４０ｏｓ２πｆ、ｒＴ　　、、、に２＝　　”ρ゛
°＝−°“−＜８）　　　　−ここで極周波＆　、ｆ　
ｒはメロディなどの作１ｉＫとって必要な音階音の周波
数とする。たとえばサンプリング周波数ｉ　８　ＫＨ２
とし、４４０Ｈ２ｌ／）、Ａ４音から８８０Ｈ２のＡ６
音の１オクターブにわたる音階音を実現するためには表
３ＶＣ示されるに、パラメータ値を復号器（１７０）の
メモリに記憶さ、せる。K, 40os2πf, rT , , 2 = ”ρ゛°=-°”-<8) - where the polar frequency & , f
Let r be the frequency of the scale note necessary for creating a melody or the like. For example, sampling frequency i 8 KH2
and 440H2l/), A4 sound to 880H2 A6
To achieve a scale tone spanning an octave of notes, parameter values are stored in the memory of the decoder (170) as shown in Table 3VC.

表　　　　　３に２パラメータの復号値としては、たとえば表４に示す
ような数種類の値をメモリに記憶させる。友４のに２の
値を用いれば１／εに減衰するに要する時シ間が表４のτに示されるように自然楽器に近い好適な楽
音が得られる。As the decoded values of the two parameters shown in Table 3, several types of values shown in Table 4 are stored in the memory, for example. If a value of 2 is used for 4, as shown by τ in Table 4, it is possible to obtain a suitable musical tone close to that of a natural musical instrument, as shown by τ in Table 4.

表　　　　　４（Ｔ＝］／８０００）Ｋ１＋に２の復号値の（８）式による計算の忠実度は、
必要とされる周波数および、減衰定数の精度に応じたも
ので良い。［、かじ、ディジタルフィルタ（１５０）に
適用される格子形フィルタの加減算の極性によっては、
Ｋｌ、に２はそねぞれ符号を反転したものを用いる必要
がある。Table 4 (T=]/8000) The fidelity of calculation using equation (8) for the decoded value of 2 to K1+ is:
It may be determined according to the required frequency and accuracy of the attenuation constant. [, depending on the polarity of addition and subtraction of the lattice filter applied to the digital filter (150),
It is necessary to use the values of Kl, and 2 whose signs are inverted.

このようにして決定される復号値テーブルを有する復す
器（１７０）　ｆ／ｊよって得られるＫＩ　＋　Ｋ２パ
ラメータ金ディジタルフィルタ（１５０）　Ｋ与λ０、
（単一の）インパルス音源信号でフィルタを駆動するこ
とによって、（３）式に示されるような減衰振切波形が
得られ、またメロディ音の８−成かり能となる。A decoder (170) having a decoded value table determined in this way KI + K2 parameters obtained by f/j Gold digital filter (150) K given λ0,
By driving the filter with a (single) impulse sound source signal, a damped cut-off waveform as shown in equation (3) is obtained, and an 8-component of melody sounds is obtained.

なお、第３図の実施例においては、楽音用のＫ１．に２
復合器（１７０）’ｒ別［２けたが、本復号器のメモリ
手段を音声用Ｋｌ　＋　Ｋ２パラメータ復号器と一括し
、そのメモリの中の特定のエリアを楽音用に１．に２値
の記憶に用い残りのエリアを音声用として用いても良い
。従来の音声合成器に用いられている音声用のに、ｌＫ
２値を楽音用として用いることには難があるが、逆にこ
の発明の原理によって決定された楽音用のに１＋　Ｋ２
値は音声を合成する目的にも利用できることる確認して
いる。In the embodiment shown in FIG. 3, K1. to 2
Decoder (170)'r separate [2 digits] The memory means of this decoder is combined with the audio Kl + K2 parameter decoder, and a specific area in the memory is designated as 1. for musical tones. Alternatively, the area may be used for storing binary values, and the remaining area may be used for audio. lK for speech used in conventional speech synthesizers
There are difficulties in using binary values for musical tones, but conversely, for musical tones determined by the principle of this invention, 1+K2
It has been confirmed that the value can also be used for the purpose of synthesizing speech.

以上のように、この発明は装置の大規模化を招くことな
く、音声、楽音双方の合成を可能とするもので、実用的
効果は高い。As described above, the present invention enables the synthesis of both voice and musical tones without increasing the scale of the device, and has a high practical effect.

【図面の簡単な説明】[Brief explanation of drawings]

第１図は従来の偏自己相関分析合成方式の音声分析合成
システムのブロック図、第２図は従来の音声合成器の要
部のブロック構成図、第３図はこの発明に係る音声合成
器−例を示す要部のブロック構成図である。（１１１）、（１２１）、（１３１）　　・メモリ、（
１５０）・・・ディジタルフィルタ、（１１２）・・・
パルス発生器、　　（１１３）・・・１雑音発生器、（
１７０）・・・Ｋパラメータ復号器、（２００）・・・
音声分析器、（３００）・・・パラメータファイル０なお、図中同一符号は同一もしくは相当部分を示す。代理人　　葛野侶−（外１名）手続補正書（自発）特許庁長官殿１、事件の表示　　　　特願昭　５６−１１２７０５　
　号２、発明の名称音声合成器３、補正をする者（］）明細書の「発明の詳細な説明」６、補正の内容（１）明細書をつぎのとおり訂正する。FIG. 1 is a block diagram of a conventional partial autocorrelation analysis and synthesis system for speech analysis and synthesis, FIG. 2 is a block diagram of the main parts of a conventional speech synthesizer, and FIG. 3 is a speech synthesizer according to the present invention. FIG. 2 is a block configuration diagram of main parts showing an example. (111), (121), (131) ・Memory, (
150)...Digital filter, (112)...
Pulse generator, (113)...1 noise generator, (
170)...K parameter decoder, (200)...
Speech analyzer, (300)...Parameter file 0 Note that the same reference numerals in the figures indicate the same or corresponding parts. Agent: Tsuyoshi Kuzuno (1 other person) Procedural amendment (spontaneous) Commissioner of the Japan Patent Office 1, case description Patent application No. 112705, 1987
No. 2. Name of the invention Speech synthesizer 3. Person making the amendment (]) "Detailed description of the invention" in the specification 6. Contents of the amendment (1) The specification is amended as follows.

Claims

【特許請求の範囲】[Claims]

（１）ディジタル音源信号発生回路と、音源信号から所
定の周波数スペクトラム成分を抽出するディジタルフィ
ルタと、前記ディジタルフィルタの係数を表わすディジ
タル値を記憶するメモリ一手段とを基本構成要素とする
偏自己相関分析合成方式の音声合成器において、前記デ
ィジタルフィルタの最終段の係数に１および１つ前の段
の係数に２ｆｚｒ表わすディジタル値をに、＝ｃｏｓ２ｙｒｆｒ’ｌ’、　　　　　　　　Ｋ２
”　　”　　’ρで決定し、少なくとも１オクタ一ブ分
の平均律に近い音階周波数ｆｒＶｃ対応ｌ−た値として
記憶するメモリ手段を設け、インパルスによって前記デ
ィジタルフィルタを駆動し、正弦波形を合成出力させる
ようＫ　Ｌ、たことを特徴とする音声合成器。(1) Partial autocorrelation whose basic components are a digital sound source signal generation circuit, a digital filter that extracts a predetermined frequency spectrum component from the sound source signal, and a memory that stores digital values representing the coefficients of the digital filter. In an analysis-synthesis speech synthesizer, a digital value representing 1 for the coefficient of the last stage of the digital filter and 2fzr for the coefficient of the previous stage is expressed as: =cos2yrfr'l', K2
A memory means is provided for storing the value determined by ρ and corresponding to a scale frequency frVc close to equal temperament for at least one octave, and drives the digital filter with an impulse to output a synthesized sine waveform. A speech synthesizer characterized by YoKL and Tata.