JPS5913759B2

JPS5913759B2 - audio processing device

Info

Publication number: JPS5913759B2
Application number: JP55022697A
Authority: JP
Inventors: 和男中田; 大和佐藤; 憲也村上; 清志印藤
Original assignee: Hitachi Ltd; Nippon Telegraph and Telephone Corp
Current assignee: Hitachi Ltd; Nippon Telegraph and Telephone Corp
Priority date: 1980-02-27
Filing date: 1980-02-27
Publication date: 1984-03-31
Also published as: JPS56121096A

Description

【発明の詳細な説明】・ｏ本発明は音声の分析や合成に用いられる偏自己相
関係数（以下、にパラメータと呼ぷ）を効率よく符号化
または復号化するための音声処理装置に関し、特に、に
パラメータ値に所定の変換を施すために必要なＲＯＭの
容量を低減することにより゛５ＩＣ化を容易にしたにパ
ラメータ符号化または復号化のための装置に関する。[Detailed Description of the Invention] o The present invention relates to a speech processing device for efficiently encoding or decoding partial autocorrelation coefficients (hereinafter referred to as parameters) used in speech analysis and synthesis. In particular, the present invention relates to a parameter encoding or decoding device that facilitates 5IC implementation by reducing the ROM capacity required to perform predetermined transformations on parameter values.

にパラメータを用いて音声の分析や合成をおこなうこと
は従来からよく知られている方法であり、音声に対する
高度の情報圧縮をおこなう一手法と’０考えることが
できる。Analyzing and synthesizing speech using parameters is a well-known method, and can be considered as a method for highly compressing speech information.

この観点からは分析側における情報圧縮率はできるだけ
大きい、すなわち実用上許容できる音質が保証される範
囲内で分析後のにパラメータの情報量を少なくすること
が望ましい。From this point of view, it is desirable that the information compression ratio on the analysis side be as high as possible, that is, the amount of information of parameters after analysis should be reduced within a range that guarantees practically acceptable sound quality.

一方、合成側フ５においては、にパラメータは音声合
成のためのディジタル・フィルタの特性を規定する係数
となるから、合成のためのディジタル・フィルタの特性
を向上かつ安定化させるためには、にパラメータに関す
る情報量を多くして高精度化することが望１０ましい。
このような相反する要求を満足するために、通常下記の
手法が用いられる。On the other hand, on the synthesis side F5, the parameters are coefficients that define the characteristics of the digital filter for speech synthesis, so in order to improve and stabilize the characteristics of the digital filter for synthesis, it is necessary to It is desirable to increase the amount of information regarding parameters to improve accuracy.
In order to satisfy such conflicting demands, the following method is usually used.

（１）分析側においてにパラメータの情報を限られたビ
ット数で有効に符号化できるように高精度１５のにパラ
メータにたいして所定の変換もしくは範囲の限定をおこ
なつてから符号化する。(1) In order to effectively encode parameter information with a limited number of bits on the analysis side, the parameters are encoded after performing a predetermined conversion or range limitation with a high precision of 15.

（２）符号化された情報にたいし、合成側において（１
）の変換とは逆の変換をおこなつて高精度のｋパラメー
タを復号化する。(2) For the encoded information, (1
) to decode the high-precision k parameter.

上記の手法（１）における所定の変換と、手法（２）に
おける逆変換を実行するためには通常、読出専用記憶装
置（ＲＯＭ）が用いられ、このＲＯＭに所定の変換前の
値に対する変換後の値を変換表の形式で書きこんでおき
、上記ＲＯＭの表引（テーブルルツクアツプ）をお
こなえばよい。In order to perform the predetermined conversion in method (1) and the inverse conversion in method (2) above, a read-only memory (ROM) is usually used, and this ROM stores the predetermined values before conversion and the inverse conversion in method (2). All you have to do is write the values in the form of a conversion table and look up the table in the ROM.

音声の分析装置や合成装置に上記ＲＯＭを組みこんで１
チツプＩＣ化する場合、チツプ寸法を小さくしてコスト
低減をはかるためにはＲＯＭ容量をできるだけ小さくす
ることが望ましい。By incorporating the above ROM into a voice analysis device or synthesis device, 1
When implementing a chip IC, it is desirable to reduce the ROM capacity as much as possible in order to reduce the chip size and reduce costs.

従来、上記変換表は所定次数までのｋパラメータのすべ
てについて作られていたため、考慮すべきｋパラメータ
の次数が増大すると、これに比例して変換表を格納する
ＲＯＭの容量も増大することになつてしまう。Conventionally, the conversion table described above has been created for all k parameters up to a predetermined order, so as the order of the k parameters to be considered increases, the capacity of the ROM that stores the conversion table also increases proportionally. I end up.

これに対して、変換表を格納するＲＯＭの容量を一定値
以下におさえるためには、各ｋパラメータの変換値と逆
変換値をあられすビツト数を少なくしなければならず、
そのため精度が悪くなり、この結果として音声の分析お
よび合成装置の性能を劣化させることになつてしまう欠
点があつた。On the other hand, in order to keep the capacity of the ROM that stores the conversion table below a certain value, it is necessary to reduce the number of bits used to store the converted and inversely converted values of each k parameter.
As a result, the accuracy deteriorates, resulting in a drawback that the performance of the speech analysis and synthesis device deteriorates.

したがつて、本発明の目的は、ｋパラメータの精度を劣
化させることなく、変換表を格納するＲＯＭ容量を低減
することにより音声の分析および合成用１Ｃのチツプ寸
法を小さくし、コスト低減をはかつた音声処理装置を提
供することにある。上記目的を達成するため、本発明で
は、たがいに連続する奇数次のｋパラメータと偶数次の
ｋパラメータの各値の出現頻度分布がＯの値（ＫｉＯ）
に関してほぼ対称になつていることを利用して、奇数次
または偶数次いずれか一方のｋパラメータのみに対する
変換表をＲＯＭに格納し、他方のｋパラメータに対する
所定の変換は、ＲＯＭに格納されている変換値に対して
所定の演算を施すことにより実行させるようにして、Ｒ
ＯＭ容量を低減するものである。第１図は、長時間にわ
たり男性および女性の音声波形を分析して、各フレーム
期間毎に得られる８次までのｋパラメータの値ｋ１〜Ｋ
８の相対出現頻度分布を示す。Therefore, an object of the present invention is to reduce the chip size of a 1C for speech analysis and synthesis by reducing the ROM capacity for storing conversion tables without deteriorating the accuracy of k-parameters, thereby reducing costs. It is an object of the present invention to provide a voice processing device that has a high level of performance. In order to achieve the above object, in the present invention, the appearance frequency distribution of each value of an odd-order k parameter and an even-order k parameter that are consecutive to each other is a value of O (KiO).
Taking advantage of the fact that it is almost symmetrical with respect to By performing a predetermined operation on the converted value, R
This reduces the OM capacity. Figure 1 shows the k-parameter values k1 to K up to the 8th order obtained for each frame period by analyzing male and female voice waveforms over a long period of time.
The relative appearance frequency distribution of 8 is shown.

（板倉、東倉、他電子通信学会論文誌ＶＯｌＪ６ｌ
−Ａ，．ｆｌ）．３、Ｐ２５４〜２６１）たとえば、ｋ
１の出現頻度はｋ１＝０．９近傍において最大で、−１
≦ｋ１〈ＯにおいてはほぼＯになつている。(Itakura, Higashikura, et al. Journal of the Institute of Electronics and Communication Engineers VOlJ6l
-A,. fl). 3, P254-261) For example, k
The appearance frequency of 1 is maximum near k1=0.9, and -1
When ≦k1〈O, it becomes almost O.

第１図における顕著な特徴は、男性、女性いずれの音声
においても、ＫｉとＫｉ＋１（１−１、３、５、７）の
出現頻度分布の関係がそれぞれの値を表わす横軸上の０
点を通る垂線に関してほぼ線対称の関係になつているこ
とである。A remarkable feature in Figure 1 is that in both male and female voices, the relationship between the appearance frequency distributions of Ki and Ki+1 (1-1, 3, 5, 7) is 0 on the horizontal axis representing each value.
This means that there is a nearly line-symmetrical relationship with respect to the perpendicular line passing through the point.

ここでであることは理論的に証明され、また実験的にも確認さ
れている。This has been theoretically proven and also confirmed experimentally.

したがつて、たとえばＫｉ＋１の符号を反転した（−Ｋ
ｉ＋１）の分布はほぼＫｉの分布と等しくなる。Therefore, for example, by inverting the sign of Ki+1 (−K
The distribution of i+1) is approximately equal to the distribution of Ki.

このような関係は、Ｋｉ．！ｌ：．Ｋｉ＋１に対して所
定の変換をおこなつた結果得られるＫｉ′とＫｉ′＋１
に対しても成立する。Such a relationship is based on Ki. ! l:. Ki′ and Ki′+1 obtained as a result of performing a predetermined conversion on Ki+1
It also holds true for .

さらに一般に、特定のＫｉとＫｉ＋１に対して（２）の
変換をおこない、他のＫｊとＫｊ＋ｌ（ＪＮｉｌｊ＝１
、３、５、７）に対しては他の変換をおこなつた結果得
られるＫｉ′とＫｉ′＋１、Ｋｊ′とＫｊ′＋１に対し
て上記の関係は成立する。Furthermore, in general, the transformation (2) is performed for specific Ki and Ki+1, and other Kj and Kj+l (JNilj=1
, 3, 5, and 7), the above relationship holds for Ki' and Ki'+1, and Kj' and Kj'+1 obtained as a result of performing other transformations.

この関係を利用すれば、ＫｉとＫｉ＋１の変換表は同じ
もので良いから、（ｋ１、Ｋ３、Ｋ５、Ｋ７）か（Ｋ２
、Ｋ４、Ｋ６、Ｋ８）のいずれの一方に対する変換表を
用意すればよいことになり、結局、変換表を格納するた
めに必要なＲＯＭの容量は１／２でよいことになる。第
１図にもどり各ｋパラメータの出現頻度分布を詳細に見
ると、たとえばｋ１は０．８≦ｋ１≦１のきわめて限ら
れた範囲（面積でみて、全体の８０％〜９０％を占める
領域）に分布が集中していることがわかる。Using this relationship, the conversion tables for Ki and Ki+1 can be the same, so either (k1, K3, K5, K7) or (K2
. Returning to Figure 1 and looking in detail at the appearance frequency distribution of each k parameter, for example, k1 is in an extremely limited range of 0.8≦k1≦1 (a region that occupies 80% to 90% of the total area) It can be seen that the distribution is concentrated in

これは、音声から抽出されたｋ１が８０％〜９０％の確
率で、上記範囲の値になることを示す。このような分布
特性を有するｋ１に対する変換表を作成する場合、原理
的にｋ１のとり得る値の範囲（−１≦ｋ１≦１）をたと
えば１２８サンプル区間に等分して各サンプル区間にお
けるｋ１の代表値をＲＯＭ中の所定の１２８番地にわた
つて書きこむ方法における上記サンプル区間はｋ１く０
．８においては細かすぎ、０。８≦ｋ１≦１においては
粗すぎてＲＯＭを効率よく使用しているとはいえない。This indicates that k1 extracted from the voice will have a value within the above range with a probability of 80% to 90%. When creating a conversion table for k1 having such distribution characteristics, in principle, the range of possible values of k1 (-1≦k1≦1) is equally divided into, for example, 128 sample intervals, and k1 in each sample interval is calculated. The above sample interval in the method of writing representative values across 128 predetermined addresses in the ROM is k1 x 0.
．． 8 is too fine, and 0.8≦k1≦1 is too coarse, so it cannot be said that the ROM is used efficiently.

この点を改善するためにぱ、出現頻度分布が大きいｋ１
の範囲を相対的に拡大するような変換をｋ１に対してお
こない、この拡大された範囲に対して適切なサンプル区
間を設定する処理をおこなえばよい。In order to improve this point, k1 has a large appearance frequency distribution.
It is only necessary to perform a conversion on k1 to relatively expand the range of k1, and to set an appropriate sample section for this expanded range.

Ｋ２についても、−１≦Ｋ２≦−０．６の比較的限られ
た範囲に分布が偏在しているから、ｋ１と同様な変換を
おこなうことが有効である。Regarding K2, since the distribution is unevenly distributed in a relatively limited range of -1≦K2≦-0.6, it is effective to perform the same conversion as for k1.

ｋ１、Ｋ２に対しておこなうべき変換としては次式で与
えられる非線形変換が有効であることが実験により確認
されている。It has been confirmed through experiments that the nonlinear transformation given by the following equation is effective as the transformation to be performed on k1 and K2.

ｖ▲ たとえば、ｃ＝３としたとき、０．８≦ｋ１≦１の範囲
は（４）式の変換により−１０ｇ３≦ｇ１〈閃に拡大さ
れるから、前記１２８サンプル区間の８０％〜９０％を
この拡大された範囲に設定することにより有効な符号化
を用いＲＯＭを効率よく使用できることになる。v▲ For example, when c = 3, the range of 0.8≦k1≦1 is expanded to -10g3≦g1 by the conversion of equation (4), so it is 80% to 90% of the 128 sample interval. By setting the value to this expanded range, effective encoding can be used and the ROM can be used efficiently.

（Ｋ３、Ｋ５、Ｋ７）と（Ｋ４、Ｋ６、Ｋ８）について
はｋ１、Ｋ２に比較すれば広範囲にわたつてよ（５）式
のｋ１とＫ２に対して、変換をおこなうと、ｃ＝３として（４）式のしたがつて、ｇ１、Ｋ３、Ｋ５、Ｋ７を各対の代表パラ
メータと考え、これら代表パラメータの（７）式で与え
られる範囲を所定数のサンプル区間に分割し、ｇ１に対
しては（４）式と逆の変換・・・・・・・・・Ｔａｎｈ
（３ｇ１）・・・・・・・・・をおこなつて得られるｋ
１′と上記Ｋ３、Ｋ５、Ｋ７の各サンプル区間における
代表値をもつて変換表を作成し、これをＲＯＭ中に格納
すればよい。(K3, K5, K7) and (K4, K6, K8) have a wide range compared to k1 and K2.If we convert k1 and K2 in equation (5), we get c=3. According to equation (4), consider g1, K3, K5, and K7 as representative parameters of each pair, divide the range given by equation (7) of these representative parameters into a predetermined number of sample sections, and is the inverse transformation of equation (4)......Tanh
(3g1) k obtained by doing...
1' and the representative values in each sample section of K3, K5, and K7, a conversion table may be created and stored in the ROM.

このようにして作成される変換表の形式とこれをＲＯＭ
に格能するためのＲＯＭアドレスとの対応関係の１例を
第２図に示す。The format of the conversion table created in this way and its ROM
FIG. 2 shows an example of the correspondence with the ROM address for the function.

第２図によれば、たとえばｇ１は、（７）式における範
囲（−０．１６〜１．３４）が１２８個の等間隔なサン
プル区間に分割されて各サンプル区間におけるｇ１の代
表値｛曾１、１１．父１、２、゜゛゜゜゜゜゜゜゜、貧
１、１２８｝がそれぞれＴａｎｈ（３貧，、１）の変換
を受けて｛仏、１、Ｃ１、２、゜゜゜゜゜゜゜゜゜、妄
１、１２８｝となつて、各成分｛ｋ１、ｉ｝がたとえば
１０ビツト程度で表わされて、変換表中に書きこまれる
。Ｋ３、Ｋ５、Ｋ７についても同様で、それぞれ３２、
１６、１６のサンプル区間に分割されて、各サンプル区
間における代表値：がそれぞれ、たとえば１０ビツト程
度で表わされて、変換表中に書きこまれる。According to FIG. 2, for example, g1 is calculated by dividing the range (-0.16 to 1.34) in equation (7) into 128 equally spaced sample sections, and calculating the representative value of g1 in each sample section. 1, 11. Father 1, 2, ゜゛゜゜゜゜゜゜゜゜, poverty 1, 128} respectively undergo the transformation of Tanh (3 poverty,, 1) and become {Buddha, 1, C1, 2, ゜゜゜゜゜゜゜゜゜゜, delusion 1, 128}, and each component { k1,i} is represented by, for example, about 10 bits and written into the conversion table. The same goes for K3, K5, and K7, each with 32 and
It is divided into 16, 16 sample sections, and the representative value in each sample section is expressed, for example, in about 10 bits, and written into the conversion table.

１語を１０ビツトで構成するとき、第２図の変換表を格
納するために必要なＲＯＭの容量はとなる。When one word is composed of 10 bits, the capacity of the ROM required to store the conversion table shown in FIG. 2 is as follows.

一方、従来の方法においては、たとえば２４００ビツト
／秒の割合で、音声情報から抽出される８次までの各ｋ
パラメータに対する最適なサンプル区間数は、通常つぎ
のようになることが知られている。On the other hand, in the conventional method, each of the up to 8th order k
It is known that the optimal number of sample intervals for a parameter is usually as follows.

したがつて、上記サンプル区間の代表値をすべてＲＯＭ
に格納する場合に必要なＲＯＭの容量はとなる。Therefore, all the representative values of the above sample section are stored in ROM.
The required ROM capacity is:

すなわち、１９２／２６４キ０．７３であるから、本発明によつて必要なＲＯＭの容量は従来
の７３％程度まで低減できることになる。That is, since it is 192/264 x 0.73, the required ROM capacity can be reduced to about 73% of the conventional capacity by the present invention.

ここで、上記２４００ビツト／秒は、１フレーム期間（
２０ミリ秒）について１１次〜８次のｋパラメータを表
現するために必要なビツト数が７＋５×２＋４×４＋３
＝３６ビツト２サンプルピツチを指定するために必要なビツト数が
６ビツト３音源強度を表現
するために必要なビツト数が６ビツトしたがつて、合計３６＋６＋６＋−４８ビツト／２０ミリ秒すなわち、４
８Ｘ５０−２４００ビツト／秒の情報量が必要なことを
意味している。Here, the above 2400 bits/second is one frame period (
The number of bits required to express the 11th to 8th order k parameters for 20 milliseconds is 7 + 5 x 2 + 4 x 4 + 3.
= 36 bits The number of bits required to specify the 2-sample pitch is
The number of bits required to express the 6-bit 3 sound source strength is 6 bits, so the total is 36+6+6+-48 bits/20 milliseconds, or 4
This means that an amount of information of 8x50-2400 bits/sec is required.

なお、従来の１／２（５０％）まで低減できないのはＫ
ｉとＫｉ＋１（１＝１、３、５、７）の出現頻度分布曲
線が前記垂線に関して完全には線対称になつていないこ
とによるもので、たとえば、Ｋ２（Ｇ２）に対しては従
米の３２区間でなく１２８区間、Ｋ４に対しては従来の
１６区間でなく３２区間など、より広い範囲を考慮しな
ければならなくなる場合があるからである。Note that K cannot be reduced to 1/2 (50%) of the conventional level.
This is because the appearance frequency distribution curves of i and Ki+1 (1=1, 3, 5, 7) are not completely symmetrical with respect to the perpendicular line.For example, for K2 (G2), 32 This is because it may be necessary to consider a wider range, such as 128 sections instead of sections, and 32 sections instead of the conventional 16 sections for K4.

また、ｋパラメータの次数がさらに増大した場合も同様
な処理をおこなえばよく、たとえば１０次までのｋパラ
メータを対象とする場合、Ｋ，、ＫｌＯのとり得る値の
範囲はになることが実験により確認されている。Furthermore, even if the order of the k parameter increases further, the same process can be performed. For example, when targeting k parameters up to the 10th order, experiments have shown that the range of possible values of K,,KlO is as follows. Confirmed.

したがつて、前記と同様に、Ｋ９と（−ＫｌＯ）を対に
して考えると、−０．３２≦（−ＫｌＯ）≦０．６４で
あるから、Ｋ９、（−ＫｌＯ）がつくる対のとり得る値
の範囲は−０．５２≦Ｋ，、（−ＫｌＯ）≦０．７３（
７ｙしたがつて、たとえばＫ，をこの対の代表パラメー
タと考え、（７）″で与えられる範囲を８サンプル区間
に分割し、各サンプル区間における代表値をＲＯＭ中に
格納すればよい。Therefore, in the same way as above, considering K9 and (-KlO) as a pair, since -0.32≦(-KlO)≦0.64, the pair formed by K9 and (-KlO) is The range of values obtained is -0.52≦K, (-KlO)≦0.73 (
7y Therefore, for example, consider K as the representative parameter of this pair, divide the range given by (7)'' into 8 sample sections, and store the representative value in each sample section in the ROM.

以下、本発明を実施例を参照して詳細に説明する。Hereinafter, the present invention will be explained in detail with reference to Examples.

第３図は本発明を用いた音声分析と合成装置の構成を示
す回路プロツク図である。FIG. 3 is a circuit block diagram showing the configuration of a speech analysis and synthesis apparatus using the present invention.

第３図において、入力音声からＫｉ抽出回路２０で抽出
された高精度のｋパラメータはその次数を示す量ととも
に分析部２１の奇数／偶数判定回路２１１に並列に入力
されて、奇数次パラメータ（ｋ１、Ｋ３、Ｋ５、Ｋ７）
が偶数次パラメータ（Ｋ２、Ｋ４、Ｋ６、Ｋ８）かの判
定がおこなわれるとともに、アンドゲート２１２および
２１３に入力される。In FIG. 3, the high-precision k parameter extracted from the input voice by the Ki extraction circuit 20 is input in parallel to the odd/even determination circuit 211 of the analysis unit 21 together with the quantity indicating its order, and the odd-order parameter (k1 , K3, K5, K7)
It is determined whether is an even-order parameter (K2, K4, K6, K8), and is input to AND gates 212 and 213.

奇数次パラメータのときはそのまま、偶数次パラメータ
のときは符号反転部２１４で符号が反転されてからオア
ゲート２１５を通してアドレス設定回路２１６に入力さ
れる。If the parameter is an odd-numbered parameter, the signal is input as is, and if it is an even-numbered parameter, the sign is inverted by the sign inverting section 214 and then input to the address setting circuit 216 through the OR gate 215.

アドレス設定回路２１６は、ｋパラメータの値とその次
数を示す値から、（００００００００）−０〜（１０１
１１１１１）＝１９１のうちのいずれかのコードを作成
して、これを符号化回路２１８中にあるメモリのアドレ
スとするための回路で、制御部２１９で指定された次数
の変換用ＲＯＭ２ｌ７のアドレスより読み出された代表
値｛′Ｋｉ、１、宴１、２、′Ｋｉ、３、・・・・・・
・・・｝のそれぞれを入力ｋパラメータＫｉと比較する
一連の比較器からなる。The address setting circuit 216 calculates from (00000000)-0 to (101
11111) = 191, and uses this as the address of the memory in the encoding circuit 218, from the address of the conversion ROM 2l7 of the order specified by the control unit 219. Read representative values {'Ki, 1, party 1, 2, 'Ki, 3,...
...} with the input k-parameter Ki.

符号化回路２１８はアドレス設定回路２１６の出力から
、第２図におけるアドレス（０〜１９１）のいずれかを
指定する符号を作成する回路で、ＲＯＭにより構成され
る。制御部２１９は抽出されたｋパラメータの次数を示
す量からＯ〜１９１のいずれかの値を出力する回路で、
第４図のごとく構成される。The encoding circuit 218 is a circuit that creates a code specifying any of the addresses (0 to 191) in FIG. 2 from the output of the address setting circuit 216, and is constructed of a ROM. The control unit 219 is a circuit that outputs any value from O to 191 from the amount indicating the order of the extracted k parameter,
It is configured as shown in Figure 4.

第４図において、ｋパラメータの次数を示す信号４１は
ＲＯＭ４２のアドレス信号ＡＯ，Ａｌ，Ａ２となるとと
もにオアゲート４３を通してフリツプフロツプ４４のト
リカー信号となる。In FIG. 4, a signal 41 indicating the order of the k parameter becomes the address signals AO, Al, A2 of the ROM 42, and also becomes the trigger signal of the flip-flop 44 through an OR gate 43.

フリツプフロツプ４４の出力信号によりＲＯＭ４２の指
定されたアドレスの内容が読み出されてカウンタ４５１
と４５２にプリセツトされるとともに、クロツク発生器
４６のクロツクパルスがアンドゲート４７を通してカウ
ンタ４５１と４５２に印加され、クロツクパルスの発生
毎に１づつ、カウントダウンされていく、カウンタ４５
１の出力信号４８が第３図における変換用ＲＯＭ２ｌ７
のアドレス信号となる。カウンタ４５２の内容がＯにな
るとアンドゲート４９の出力信号によりフリツプフロツ
プ４４がりセツトされて、カウント動作が停止し、与え
られたｋパラメータに対する所定の処理が終了したこと
になる。たとえば、ｋ１の場合、ＲＯＭ４２の（００１
）番より下位８ビツト、上位８ビツトともに１２７を表
わす信号が読み出され、それぞれカウンタ４５１と４５
２にプリセツトされ、カウンタ４５１の出力信号が前記
変換用ＲＯＭ２ｌ７中にあるｋ１の一連の代表値のうち
、宝，、１２８の格納されているアドレスを指定する信
号となる。The contents of the specified address of the ROM 42 are read out by the output signal of the flip-flop 44 and are sent to the counter 451.
At the same time, the clock pulse of the clock generator 46 is applied to the counters 451 and 452 through the AND gate 47, and the counter 45 is counted down by 1 each time a clock pulse occurs.
The output signal 48 of 1 is the conversion ROM 2l7 in FIG.
This becomes the address signal. When the contents of the counter 452 reach O, the flip-flop 44 is reset by the output signal of the AND gate 49, the counting operation is stopped, and the predetermined processing for the given k parameter is completed. For example, in the case of k1, (001
), the lower 8 bits and upper 8 bits are both read out and a signal representing 127 is read out, and the signals are output to the counters 451 and 45, respectively.
2, and the output signal of the counter 451 becomes a signal specifying the address where 128 of the series of representative values of k1 in the conversion ROM 217 is stored.

以後、カウンタ４５１と４５２が１づつカウントダウン
されていくとともに順次′ｋ１、１２７、？１、１２ぃ
・・・・・・・・・、′ｋ１、１の格納されているアド
レスが指定される。１２７番地からＯ番地までのアドレ
ス指定がすべて終了すると、カウンタ４５２の内容がＯ
になり、アンドゲート４９の出力信号によりフリツプフ
ロツプ４４がりセツトされて、ｋ１に対する所定の処理
が終了する。Thereafter, the counters 451 and 452 count down one by one, and sequentially 'k1, 127, ? The address where 1, 12..., 'k1, 1 is stored is specified. When all addresses from address 127 to address O are specified, the contents of counter 452 become O.
Then, the flip-flop 44 is reset by the output signal of the AND gate 49, and the predetermined processing for k1 is completed.

同様にＫ３の場合はＲＯＭ４２の（０１１）番地より下
位８ビツトが１５９を表わし、上位８ビツトが３１を表
わす信号が読み出され、それぞれカウンタ４５１と４５
２にプリセツトされ、カウンタ４５１の出力が前記変換
用ＲＯＭ２ｌ７中にあるＫ３の一連の代表値のうち、宝
。Similarly, in the case of K3, a signal is read from address (011) of the ROM 42 in which the lower 8 bits represent 159 and the upper 8 bits represent 31, and the signals are read out from the counters 451 and 45, respectively.
2, and the output of the counter 451 is one of the series of representative values of K3 in the conversion ROM 2l7.

、３２の格納されているアドレスを指定する信号となる
。以後、カウンタ４５１と４５２が１づつカウントダウ
ンされていくとともに順次安．、３１、↑３、３０１・
・・・・・．・・貧。、１の格納されているアドレスが
指定される。１５９番地から１２８番地までのアドレス
指定が終了すると、カウンタ４５２の内容がＯになり、
アンドゲート４９の出力信号によりフロツプフロツプ４
４がりセツトされて、Ｋ３に対する所定の処理が終了す
る。, 32 are stored. Thereafter, the counters 451 and 452 count down one by one, and the value decreases one by one. , 31, ↑3, 301・
・・・・・・． ...Poverty. , 1 are stored. When the address specification from address 159 to address 128 is completed, the content of counter 452 becomes O.
The output signal of the AND gate 49 causes the flop flop 4 to
4 is set, and the predetermined processing for K3 is completed.

第５図はアドレス設定回路２１６の一実施例を示す回路
図で、ｉ次のｋパラメータＫｉと変換用ＲＯＭ２ｌ７よ
り、たとえば大きい順に読み出されたＫｉのｊ番目のサ
ンプル区間における代表値′ＶＣｉｓｊ（〉宝１１ｊ−
１〉宝１１ｊ−２・・・・・・・・・）とが、加算器５
１に入力されて（宝１，．ｊ−Ｋｉ）の演算が実行され
る。FIG. 5 is a circuit diagram showing an embodiment of the address setting circuit 216, in which the i-th k parameter Ki and the representative value 'VCisj( 〉Treasure 11j-
1〉Treasure 11j-2......) is the adder 5
1 is input and the calculation (treasure 1, .j-Ki) is executed.

この結果、最初に安１．ｊ−Ｋｉ〈０すなわち、ボロ一
（ＢＯｒｒＯｗ）出力端子Ｂに１が出力されるタイミン
グで、前記第４図におけるカウンタ４５１の出力信号４
８がバツフアレジスタ５２にセツトされ、その出力信号
５３が、符号化回路２１８に入力される。As a result, the first one is 1. j−Ki<0, that is, at the timing when 1 is output to the BOrrOw output terminal B, the output signal 4 of the counter 451 in FIG.
8 is set in the buffer register 52, and its output signal 53 is input to the encoding circuit 218.

なお、バツフアレジスタ５２の出力をそのまま符号化さ
れたＫｉとして合成側に送出することも可能で、この場
合には第３図の符号化回路２１８は不要になる。Note that it is also possible to send the output of the buffer register 52 as it is as encoded Ki to the synthesis side, and in this case, the encoding circuit 218 shown in FIG. 3 becomes unnecessary.

符号化回路２１８により得られた符号（以下、これを符
号化パラメータと呼ぶ）は分析部２１の出力として、合
成部２３に送られる。The code obtained by the encoding circuit 218 (hereinafter referred to as an encoding parameter) is sent to the synthesis section 23 as an output of the analysis section 21.

合成部２３においては、上記符号化パラメータとこれに
対応したｋパラメータの次数を示すビツトがアドレスレ
ジスタ２３０にセツトされる。In the synthesis section 23, bits indicating the order of the encoding parameter and the corresponding k parameter are set in the address register 230.

上記符号化パラメータに相当するアドレスレジスタ２３
０の内容は、アドレスデコーダ２３１によりデコードさ
れて変換用ＲＯＭ２３２のアドレスとなり、第２図と同
様のアドレス構造を有する変換用ＲＯＭ２３２にあらか
じめ格納されている高精度のｋパラメータの値（ｋ１、
Ｋ３、Ｋ５、Ｋ７の所定サンブル区間における代表値）
が並列に読み出されて、アンドゲート２３４および２３
５に入力され、かつこれに対応した次数を示すビツトの
内容が奇数／偶数判定回路２３３において判定される。
その結果が奇数のときには、アンドゲート２３５の出力
がそのまま、また偶数のときにはアンドゲート２３４の
出力は符号反転部２３６で符号が反転されてからオアゲ
ート２３７を通して、前記高精度のｋパラメータの真値
として復号されバツフア２３８にセツトされる。Address register 23 corresponding to the above encoding parameters
The contents of 0 are decoded by the address decoder 231 to become the address of the conversion ROM 232, and are converted into high-precision k parameter values (k1, k1,
Representative values in the predetermined sample section of K3, K5, K7)
are read out in parallel and the AND gates 234 and 23
The contents of the bits input to the bits 5 and indicating the corresponding orders are determined by the odd/even determining circuit 233.
When the result is an odd number, the output of the AND gate 235 remains as it is, and when the result is an even number, the output of the AND gate 234 is inverted in sign by a sign inverter 236 and then passed through an OR gate 237 as the true value of the high-precision k parameter. It is decoded and set in buffer 238.

バツフア２３８の内容はレジスタ２３９により並列から
直列に変換され、復号化されたｋパラメータとして出力
端子２４より送出される。The contents of the buffer 238 are converted from parallel to serial by a register 239, and sent out from the output terminal 24 as a decoded k parameter.

なお、アンドゲート２１２，２１３とオアゲート２１５
、およびアンドゲート２３４，２３５とオアゲート２３
７とは簡単のため１ビツト分のみを示したものであるが
、実際はｋパラメータを表わすビツト数分だけ必要なこ
とはいうまでもない。In addition, AND gates 212, 213 and OR gate 215
, and AND gates 234, 235 and OR gate 23
7 shows only one bit for simplicity, but it goes without saying that in reality, only the number of bits representing the k parameter is required.

上記の処理がｋ１〜Ｋ８、ｋ１〜ＫｌＯなど必要なすべ
てのｋパラメータに対しておこなわれる。たとえば入力
音声より抽出されたｋパラメータＫ，〜Ｋ４について第
３図の作動をさらに具体的に説明する。（１）ｋ１の
場合分析部２１においては、奇数次パラメータであることが
判定された後、アンドゲート２１３とオアゲート２１５
を通して、アドレス設定回路２１６に入力され、ｋ１の
大きさに応じて、第２図に示されるようにＯ〜１２７の
いずれかのアドレスが設定される。The above processing is performed for all necessary k parameters such as k1 to K8 and k1 to KlO. For example, the operation shown in FIG. 3 will be explained in more detail with respect to the k parameters K, -K4 extracted from the input voice. (1) In the case of k1, in the analysis unit 21, after determining that it is an odd-order parameter, the AND gate 213 and the OR gate 215
The signal is input to the address setting circuit 216 through the signal k1, and one of the addresses 0 to 127 is set as shown in FIG. 2, depending on the size of k1.

変換用ＲＯＭ２ｌ７の該当するアドレスには、前記（４
）式によるＧ，の逆変換値｛宝，−Ｔａｎｈ（Ｅｇｌ）
｝が書きこまれているから、アドレス設定回路２１６に
おいて貧，が入力値ｋ１と比較され、比較結果が符号化
回路２１８において符号化される。The corresponding address of the conversion ROM 2l7 has the above (4
) inverse transformation value of G, {Takara, −Tanh(Egl)
} has been written, the address setting circuit 216 compares the input value k1 with the input value k1, and the comparison result is encoded in the encoding circuit 218.

合成部２３においては、妄，に対応した符号化パラメー
タはアドレスデコーダ２３１においてデコードされ、第
２図と同様のアドレス構造を有する変換用ＲＯＭ２３２
のアドレスとなる。In the synthesis unit 23, the encoding parameters corresponding to the error are decoded in an address decoder 231, and are stored in a conversion ROM 232 having an address structure similar to that shown in FIG.
address.

従つて、所定の妄，に所定のｋ１を対応させて変換用Ｒ
ＯＭ２３２が作成されているから賞，で指定されたアド
レスからこれに対応したｋ１が読み出されることになる
。Ｋ，は奇数次パラメータであることが判定されて、バ
ツフア２３８にセツトされる。Therefore, by associating a predetermined delusion with a predetermined k1, the conversion R
Since the OM232 has been created, the corresponding k1 will be read from the address specified by the prize. K, is determined to be an odd-order parameter and is set in the buffer 238.

すなわち、高精度のｋ１が真値として復号されたことに
なる。In other words, the highly accurate k1 is decoded as the true value.

（４）Ｋ２の場合分析部２１においては、偶数次パラメータであることが
判定された後、符号反転されて（−Ｋ２）としてオアゲ
ート２１５を通して、アドレス設定回路２１６に入力さ
れ、（−Ｋ２）の大きさに応じて、第２図に示されるｋ
１と共用されるＯ〜１２７のいずれかのアドレスが設定
される。(4) In the case of K2 After determining that it is an even-order parameter in the analysis unit 21, the sign is inverted and input as (-K2) to the address setting circuit 216 through the OR gate 215, and the value of (-K2) is Depending on the size, k shown in Figure 2
Any address from 0 to 127 shared with 1 is set.

変換用ＲＯＭ２ｌ７の該当するアドレスには前記（４）
式による（−Ｇ２）の逆変換値｛（一貧，）＝Ｔａｎｈ
（−Ｃｇ２）｝が書きこまれているからアドレス設定回
路２１６において（一妥，）が入力値（−Ｋ２）と比較
され、比較結果が符号化回路２１８において符号化され
る。The corresponding address of the conversion ROM2l7 has the above (4)
Inverse transformation value of (-G2) according to the formula {(Ippo,)=Tanh
(-Cg2)} has been written, the address setting circuit 216 compares (1) with the input value (-K2), and the encoding circuit 218 encodes the comparison result.

合成部２３においては、（一貧，）に対応した符号化パ
ラメータはアドレスデコーダ２３１においてデコードさ
れ、変換用ＲＯＭ２３２のアドレスとなる。（−Ｋ２）
からＧ２を得る変換式は（１）において、ｋ１からｇ１
を得た変換式と同一であるから、（一貧，）で指定され
たアドレスから読み出される量は（−Ｋ２）になるはず
である。（−Ｋ２）は偶数次パラメータであることが判
定されて、符号反転部２３６において、所望のＫ２にも
どされて、バツフア２３８にセツトされる。In the synthesis unit 23, the encoding parameter corresponding to (Ippou,) is decoded by the address decoder 231 and becomes an address of the conversion ROM 232. (-K2)
In (1), the conversion formula to obtain G2 from k1 to g1
Since it is the same as the conversion formula obtained, the amount read from the address specified by (Ippou,) should be (-K2). (-K2) is determined to be an even-order parameter, and is returned to the desired K2 in the sign inverter 236 and set in the buffer 238.

すなわち、高精度のＫ２が真値として復号されたことに
なる。In other words, the highly accurate K2 is decoded as the true value.

上記（１）、（４）はｋ１に対する変換表からＫ２に対
する変換もおこなえることを示す。The above (1) and (4) indicate that conversion for K2 can also be performed from the conversion table for k1.

（１１１）Ｋ３の場合分析部２１においては、奇数次パラメータであることが
判定された後、（１）と同様にＫ３の大きさに応じて、
第２図に示されるように１２８〜１５９のいずれかのア
ドレスが設定される。(111) In the case of K3 In the analysis unit 21, after determining that it is an odd-order parameter, depending on the size of K3, as in (1),
As shown in FIG. 2, any address from 128 to 159 is set.

変換用ＲＯＭ２ｌ７の該当するアドレスには、何らの変
換も受けていないＫ３の所定サンプル区間における代表
値｛貧。｝が書きこまれているから、この値がそのまま
読み出されて、アドレス設定回路２１６において入力値
Ｋ３と比較され、比較結果が符号化回路２１８において
符号化される。合成部２３においては、貧。The corresponding address of the conversion ROM 2l7 contains the representative value {poor. } has been written, this value is read out as is, compared with the input value K3 in the address setting circuit 216, and the comparison result is encoded in the encoding circuit 218. In the synthesis section 23, it is poor.

に対応した符号化パラメータはアドレスデコーダ２３１
においてデコードされ、変換用ＲＯＭ２３２のアドレス
となり、指定されたアドレスからＫ３が読み出される。
Ｋ３は奇数次パラメータであることが判定されて、その
ままバツフア２３８にセツトされる。The encoding parameters corresponding to the address decoder 231
K3 is decoded and becomes the address of the conversion ROM 232, and K3 is read from the specified address.
It is determined that K3 is an odd-numbered parameter and is set in the buffer 238 as is.

すなわち、高精度のＫ３が真値として復号されたことに
なる。（１Ｖ）Ｋ４の場合分析部２１においては、偶数次パラメータであることが
判定された後、符号反転されて（−Ｋ４）として、アド
レス設定回路２１６に入力され、（−Ｋ４）の大きさに
応じて、第２図に示されるＫ３と共用さ，れる１２８〜
１５９のいずれかのアドレスが設定される。In other words, the highly accurate K3 is decoded as the true value. (1V) In the case of K4, the analyzer 21 determines that it is an even-order parameter, then inverts the sign and inputs it to the address setting circuit 216 as (-K4), and sets it to the magnitude of (-K4). Accordingly, 128~ which is shared with K3 shown in FIG.
One of 159 addresses is set.

変換用ＲＯＭ２ｌ７の該当するアドレスには、Ｋ４の所
定サンプル区間における代表値が｛一貧。｝として書き
こまれているから、｛一↑４｝がアドレス設定回路２１
６において入力値（−Ｋ４）と比較され、比較結果が符
号化回路２１８において符号化される。合成部２３にお
いては、（一貨。The representative value in the predetermined sample section of K4 is stored in the corresponding address of the conversion ROM 2l7. }, so {1↑4} is the address setting circuit 21
6 is compared with the input value (-K4), and the comparison result is encoded in the encoding circuit 218. In the synthesis section 23, (one coin.

）に対応した符号化パラメータはアドレスデコーダ２３
１においてデコードされ、変換用ＲＯＭ２３２のアドレ
スとなり、指定されたアドレスから（−Ｋ４）が読み出
される。) The encoding parameters corresponding to
1 and becomes the address of the conversion ROM 232, and (-K4) is read from the specified address.

（−Ｋ４）は偶数次パラメータであることが判定されて
、符号反転部２３６において所望のＫ４にもどされて、
バツフア２３８にセツトされる。(-K4) is determined to be an even-order parameter, and is returned to the desired K4 in the sign inverter 236,
It is set in buffer 238.

すなわち、高精度のＫ４が真値として復号されたことに
なる。In other words, the highly accurate K4 is decoded as the true value.

上記（１ｉｉ）、（ＩＶ）は、Ｋ３に対する変換表から
Ｋ４に対する変換もおこなえることを示す。以下、Ｋ５
とＫ６、Ｋ７とＫ９・・・・・・・・・についても同様
である。The above (1ii) and (IV) indicate that conversion to K4 can also be performed from the conversion table for K3. Below, K5
The same applies to K6, K7 and K9, and so on.

変換用ＲＯＭ２３２にもｋパラメータの各サンプル区間
における代表値を書きこむことにすれば、これは変換用
ＲＯＭ２ｌ７と全く同一になる。If the representative value of the k parameter in each sample interval is also written in the conversion ROM 232, it will be exactly the same as the conversion ROM 217.

また符号反転部２１４および２３６は入力信号の符号ビ
ツトも含めた各ビツトを反転した後、最下位桁に１を加
算することにより実現され、奇数／偶数判定回路２１１
および２３３は入力信号の最下位桁が１なら奇数、０な
ら偶数と判定する回路にすればよい。なお、たとえば７
ビツト長のアドレスからなる変換表を５ビツト長のアド
レスで引くには、５ビツトのアドレス情報を左へ２ビツ
トシフトし、下位２ビツトを（００）としたアドレスを
用いればよい。Further, the sign inverting units 214 and 236 are realized by inverting each bit including the sign bit of the input signal and then adding 1 to the least significant digit.
and 233 may be a circuit that determines that if the least significant digit of the input signal is 1, it is an odd number, and if it is 0, it is an even number. For example, 7
To draw a conversion table consisting of bit-length addresses using 5-bit addresses, the 5-bit address information may be shifted to the left by 2 bits and the lower 2 bits set to (00).

換言すれば、７ビツト長のアドレスで指定される１２８
語の表を４語おきに間引いて使えばよいことはいうまで
もない。また、これまでの説明においては、奇数次のｋ
パラメータの代表値（脅，、父。In other words, 128 specified by a 7-bit address
It goes without saying that you can use a table of words by thinning it out to every four words. In addition, in the explanation so far, odd-order k
Typical values of parameters (threat,, father.

、乍，、宝，、・・・・・・・・・）を変換表中に書き
こむことを前提にしていたが、偶数次のｋパラメータの
代表値（脅，、↑４、食。、脅。、・・・・・・・・・
）を変換表中に書きこむ場合に対しても、本発明が適用
可能であることはいうまでもない。以上に述べたごとく
、本発明によれば少ないＲＯＭ容量でｋパラメータの符
号化および復号化をおこなうことが可能になり、音声分
析や合成装置のＩＣ化をすすめる上できわめて有効であ
る。,乍,,Takara,,......) was assumed to be written in the conversion table, but the representative value of even-order k parameter (Shin,, ↑4, Eclipse,...) was assumed to be written in the conversion table. Threat......
) is written into the conversion table, it goes without saying that the present invention is also applicable. As described above, according to the present invention, it is possible to encode and decode k-parameters with a small ROM capacity, and it is extremely effective in promoting the use of ICs in speech analysis and synthesis devices.

【図面の簡単な説明】[Brief explanation of the drawing]

第１図は１次〜８次のｋパラメータの相対出現頻度分布
を示す図、第２図はｋパラメータ変換表を格納するＲＯ
Ｍ内の番地付の→１を示す図、第３図は本発明による音
声分析および合成装置の一実施例の回路プロツクを示す
図、第４図は第３図における制御部の一実施例を示す回
路図、第５図は第３図におけるアドレス設定回路の一実
施例を示す回路図である。２０：Ｋｉ抽出回路、２１：分析部、２２：メモリ、２
３：合成部。Figure 1 shows the relative appearance frequency distribution of the 1st to 8th order k-parameters, and Figure 2 shows the RO that stores the k-parameter conversion table.
3 is a diagram showing the circuit block of an embodiment of the speech analysis and synthesis device according to the present invention, and FIG. 4 is a diagram showing an embodiment of the control section in FIG. 3. FIG. 5 is a circuit diagram showing an embodiment of the address setting circuit in FIG. 3. 20: Ki extraction circuit, 21: analysis section, 22: memory, 2
3: Synthesis section.

Claims

【特許請求の範囲】１音声特徴パラメータとしての偏自己相関関数の奇数
次および偶数次のいずれか一方の第１の次数の偏自己相
関関係数に対応する変換値のみを記憶する記憶手段と、
音声信号から抽出された偏自己相関係数の次数に応じて
前記記憶手段から変換値を読み出す読み出し手段とを備
え、前記読み出し手段は、前記第１の次数以外の第２の
次数の偏自己相関係数を変換するに際し、前記第２の次
数に隣接する第１の次数の偏自己相関係数に対応する変
換値を上記記憶手段から読み出すように構成したことを
特徴とする音声処理装置。２音声特徴パラメータとしての符号化された奇数次お
よび偶数次のいずれか一方の第１の次数の偏自己相関係
数に対応する変換値を記憶する記憶手段と、符号化され
た偏自己相関係数に応じて前記記憶手段から変換値を読
み出す読み出し手段を備え、前記読み出し手段は、前記
第１の次数以外の第２の次数の符号化された偏自己相関
係数を変換するに際し、前記変換値を符号反転して出力
するように構成したことを特徴とする音声処理装置。[Scope of Claims] 1. Storage means for storing only a converted value corresponding to a first-order partial autocorrelation coefficient of either odd-order or even-order of a partial autocorrelation function as a voice feature parameter;
reading means for reading out a converted value from the storage means according to the order of the partial autocorrelation coefficient extracted from the audio signal, the reading means reading out a converted value from the partial autocorrelation coefficient of a second order other than the first order; An audio processing device characterized in that, when converting a relational coefficient, a converted value corresponding to a partial autocorrelation coefficient of a first order adjacent to the second order is read out from the storage means. 2. A storage means for storing a converted value corresponding to a first-order partial autocorrelation coefficient of either odd-order or even-order coded as a speech feature parameter, and a coded partial autocorrelation coefficient. readout means for reading out a converted value from the storage means according to the number of converted values, and the readout means is configured to read out a converted value from the storage means according to the converted value when converting the encoded partial autocorrelation coefficient of a second order other than the first order. An audio processing device characterized in that it is configured to output a value with its sign inverted.