JPH0279099A - Multi-pulse voice processor - Google Patents
Multi-pulse voice processor
- Publication number
- JPH0279099A
- Authority
- JP
- Japan
- Prior art keywords
- sound source
- driving sound
- correlation function
- source pulse
- cross
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Abstract
Description
DETAILED DESCRIPTION OF THE INVENTION
[Field of Industrial Application]
The present invention relates to a voice processing device, and more particularly to a multi-pulse voice processing device that extracts and transmits the driving sound source pulses of speech.
Conventionally, in this type of multi-pulse voice processing device, the number of driving sound source pulses to be found within one frame is determined in advance, and that fixed number of pulses is transmitted. In other words, in a conventional multi-pulse voice processing device, the number of driving sound source pulses per frame is always constant, regardless of the pitch of the input speech.
In the conventional multi-pulse voice processing device described above, the sound source pulse search unit always produces a constant number of driving sound source pulses per frame, regardless of the magnitude of the difference signal between the input speech and the synthesized speech, or of the cross-correlation function corresponding to that difference signal. For speech with a low pitch frequency, the waveform can be reproduced well with the predetermined number of pulses. Where the pitch frequency is high, however, the number of driving sound source pulses is insufficient, the waveform cannot be reproduced faithfully, and the quality of the synthesized speech deteriorates.
The multi-pulse voice encoding device of the present invention has, in addition to the elements of the conventional multi-pulse voice processing device, means for finding the maximum absolute value of the cross-correlation function, which is updated sequentially during the search for driving sound source pulses, and means for comparing that maximum absolute value with a predetermined threshold. It is characterized in that the search for driving sound source pulses continues until the maximum absolute value of the cross-correlation function becomes equal to or less than the threshold. Furthermore, the driving sound source pulses are quantized and encoded according to their number: where there are many driving sound source pulses, each is quantized with fewer bits, so that overall the transmission rate is always kept constant regardless of the number of driving sound source pulses to be transmitted.
Next, an embodiment of the present invention will be described with reference to the drawings.
Referring to FIG. 1, the speech signal input from input terminal 1 is fed to the short-time spectral information extraction unit 2 and the cross-correlation function extraction unit 3. The output of the short-time spectral information extraction unit 2 is fed to the autocorrelation function extraction unit 4 and the cross-correlation function extraction unit 3. The outputs of the cross-correlation function extraction unit 3 and the autocorrelation function extraction unit 4 are each fed to the driving sound source pulse search unit 5. The cross-correlation function absolute maximum value calculation unit 6 finds the maximum absolute value of the cross-correlation function, which is updated sequentially during the search for driving sound source pulses. The threshold comparison unit 7 compares this maximum absolute value with a predetermined threshold.
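As a rough illustration of the two signals that feed the search unit 5, the sketch below computes a normalized autocorrelation of a synthesis-filter impulse response and a normalized cross-correlation between a speech frame and that impulse response. The patent text does not give these formulas. The definitions used here (plain sums of products, divided by the zero-lag autocorrelation) and the names `autocorrelation`, `cross_correlation`, `h`, and `s` are assumptions made for illustration only.

```python
def autocorrelation(h, max_lag):
    """Assumed form: phi_hh(k) = sum_n h[n] * h[n + k], divided by phi_hh(0)."""
    raw = [sum(h[n] * h[n + k] for n in range(len(h) - k))
           for k in range(max_lag + 1)]
    return [r / raw[0] for r in raw]


def cross_correlation(s, h):
    """Assumed form: phi_sh(m) = sum_n s[n] * h[n - m], divided by raw phi_hh(0)."""
    energy = sum(x * x for x in h)
    out = []
    for m in range(len(s)):
        acc = 0.0
        for n in range(m, min(len(s), m + len(h))):
            acc += s[n] * h[n - m]
        out.append(acc / energy)
    return out


# Toy frame (hypothetical data): one unit pulse at position 3 passed through h.
h = [1.0, 0.5, 0.25]
s = [0.0] * 8
for i, tap in enumerate(h):
    s[3 + i] += tap

phi_hh = autocorrelation(h, max_lag=2)
phi_sh = cross_correlation(s, h)
peak = max(range(len(phi_sh)), key=lambda m: abs(phi_sh[m]))
```

With this toy input the cross-correlation peaks at the position of the original pulse, which is the property a pulse search relies on.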
In the driving sound source pulse search unit 5, driving sound source pulses are found one after another on the basis of equation (1) below and the flowchart shown in FIG. 2.
$$ g_K = \phi_{sh}(m_K) \qquad (1) $$

where $g_K$ is the driving sound source pulse amplitude, $m_K$ is the driving sound source pulse position, $\phi_{sh}$ is the cross-correlation function normalized by $\phi_{hh}(0)$, and $\phi_{hh}$ is the autocorrelation function normalized by $\phi_{hh}(0)$.
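The search built around equation (1) can be sketched as a greedy loop. This is a minimal sketch under stated assumptions, not the flowchart of FIG. 2, which is not reproduced in the text. It assumes that the next pulse position is where the cross-correlation has its largest absolute value, that the amplitude is the cross-correlation value at that position as in equation (1), and that the cross-correlation is then updated by subtracting the amplitude times the shifted autocorrelation, as is common in multi-pulse coders. The function name `search_pulses` and the parameters `threshold` and `max_pulses` are hypothetical.

```python
def search_pulses(phi_sh, phi_hh, threshold, max_pulses):
    """Greedy multi-pulse search with a threshold-based stop (illustrative).

    phi_sh: cross-correlation per position, normalized by phi_hh(0)
    phi_hh: autocorrelation per lag, normalized so phi_hh[0] == 1
    Returns a list of (position, amplitude) pairs.
    """
    residual = list(phi_sh)          # working copy, updated after each pulse
    pulses = []
    while len(pulses) < max_pulses:  # cap on pulse count (assumed guard)
        m_k = max(range(len(residual)), key=lambda m: abs(residual[m]))
        if abs(residual[m_k]) <= threshold:
            break                    # maximum absolute value is at or below threshold
        g_k = residual[m_k]          # equation (1): amplitude at the chosen position
        pulses.append((m_k, g_k))
        for m in range(len(residual)):
            lag = abs(m - m_k)
            if lag < len(phi_hh):    # assumed update rule, not stated in the text
                residual[m] -= g_k * phi_hh[lag]
    return pulses


# Hypothetical inputs: one pulse explains the whole cross-correlation.
pulses = search_pulses([0.0, 0.4, 0.8, 0.4, 0.0], [1.0, 0.5],
                       threshold=0.1, max_pulses=10)
```

Because the loop stops on the threshold rather than on a fixed count, a frame whose cross-correlation stays large after many pulses receives more pulses than a frame that is explained by a few.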
The number of sound source pulses and the sound source pulses themselves, as found by the driving sound source pulse search unit 5, are input to the quantizer 8. The quantizer 8 determines the number of sound source pulses from the number of bits allocated to pulses in the whole frame and the number of pulses to be transmitted, quantizes and encodes the pulses, and outputs them to output terminal 9 together with the quantization information.
In FIG. 2, $g_0 = 0$, $\mathrm{MAX}\{\mathrm{ABS}[\phi_{sh}]\}$ is the maximum absolute value, and $Th$ is the threshold.
The flowchart of FIG. 2 also includes a guard on the pulse search time, which is limited in a voice encoding device operating in real time. This time limit also fixes the maximum number of pulses that can be transmitted, so the bit allocation of the quantizer 8 can be tabulated in advance.
For example, with the bit allocation shown in Table 1, the number of driving sound source pulses increases by up to 48%. This is sufficient to compensate for the loss of synthesized speech quality caused by the reduced number of coding bits per sound source pulse. Table 1 applies to the case of 16 kbps and 20 msec/frame.
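The constant-rate constraint can be checked with a little arithmetic. At 16 kbps and 20 msec per frame, each frame carries 16000 × 0.020 = 320 bits. The sketch below precomputes a table from pulse count to bits per pulse so that the total never exceeds a fixed pulse budget. The split of the 320 bits between spectral information and pulses, and the values `PULSE_BUDGET_BITS = 240` and `MAX_PULSES = 24`, are invented for illustration and are not the contents of Table 1.

```python
BIT_RATE_BPS = 16_000       # from the text: 16 kbps
FRAME_MS = 20               # from the text: 20 msec per frame
FRAME_BITS = BIT_RATE_BPS * FRAME_MS // 1000   # 320 bits per frame

PULSE_BUDGET_BITS = 240     # hypothetical share of the frame spent on pulses
MAX_PULSES = 24             # hypothetical cap implied by the search-time limit


def build_allocation_table(budget_bits, max_pulses):
    """Map each pulse count to the bits available per pulse (floor division)."""
    return {n: budget_bits // n for n in range(1, max_pulses + 1)}


TABLE = build_allocation_table(PULSE_BUDGET_BITS, MAX_PULSES)

for n in (12, 16, 24):
    bits = TABLE[n]
    # Total pulse bits never exceed the budget, so the frame size stays fixed.
    assert n * bits <= PULSE_BUDGET_BITS
    print(n, "pulses ->", bits, "bits per pulse")
```

In this toy table, doubling the pulse count from 12 to 24 halves the bits per pulse from 20 to 10 while the frame total stays within the same budget, which is the trade-off the text describes.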
Table 1
[Effects of the Invention]
As explained above, the present invention examines the maximum absolute value of the cross-correlation function, which is updated sequentially during the sound source pulse search, and thereby makes the number of driving sound source pulses and the number of coding bits per driving sound source pulse variable. This improves the synthesized speech quality, particularly in cases where a predetermined number of sound source pulses would be insufficient, such as for a female speaker with a high pitch frequency.
FIG. 1 is a block diagram showing the configuration of a multi-pulse voice processing device according to an embodiment of the present invention, and FIG. 2 is a flowchart of the sound source pulse search in this embodiment.
DESCRIPTION OF SYMBOLS: 1: input terminal; 2: short-time spectral information extraction unit; 3: cross-correlation function extraction unit; 4: autocorrelation function extraction unit; 5: driving sound source pulse search unit; 6: cross-correlation function absolute maximum value calculation unit; 7: threshold comparison unit; 8: sound source pulse quantizer; 9: output terminal.
Claims (1)
[Scope of Claims]
1. A multi-pulse voice processing device that divides input speech into frames of a fixed time length and extracts and transmits driving sound source pulses of the input speech for each frame, the device having:
short-time spectral information extraction means for extracting short-time spectral information from the input speech of each frame;
autocorrelation function extraction means for finding the autocorrelation function of the impulse response of a synthesis filter constructed from the short-time spectral information;
cross-correlation function extraction means for finding a cross-correlation function from the input speech, the short-time spectral information, and the autocorrelation function; and
driving sound source pulse search means for finding the driving sound source pulses from the cross-correlation function and the autocorrelation function,
the device further comprising:
cross-correlation function absolute maximum value calculation means for finding the maximum absolute value of the cross-correlation function, which is updated sequentially while the driving sound source pulse search means finds the driving sound source pulses; and
threshold comparison means for comparing the maximum absolute value of the cross-correlation function with a predetermined threshold,
wherein the driving sound source pulse search means searches for driving sound source pulses until the maximum absolute value of the cross-correlation function becomes equal to or less than the threshold.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP63231250A JPH087598B2 (en) | 1988-09-14 | 1988-09-14 | Multi-pulse voice processor |
US07/305,924 US4924517A (en) | 1988-02-04 | 1989-02-03 | Encoder of a multi-pulse type capable of controlling the number of excitation pulses |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP63231250A JPH087598B2 (en) | 1988-09-14 | 1988-09-14 | Multi-pulse voice processor |
Publications (2)
Publication Number | Publication Date |
---|---|
JPH0279099A true JPH0279099A (en) | 1990-03-19 |
JPH087598B2 JPH087598B2 (en) | 1996-01-29 |
Family
ID=16920674
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP63231250A Expired - Lifetime JPH087598B2 (en) | 1988-02-04 | 1988-09-14 | Multi-pulse voice processor |
Country Status (1)
Country | Link |
---|---|
JP (1) | JPH087598B2 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115050386A (en) * | 2022-05-17 | 2022-09-13 | 哈尔滨工程大学 | Automatic detection and extraction method for Chinese white dolphin whistle sound signal |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS62150297A (en) * | 1985-12-24 | 1987-07-04 | 日本電気株式会社 | Voice waveform encoding system |
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS62150297A (en) * | 1985-12-24 | 1987-07-04 | 日本電気株式会社 | Voice waveform encoding system |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115050386A (en) * | 2022-05-17 | 2022-09-13 | 哈尔滨工程大学 | Automatic detection and extraction method for Chinese white dolphin whistle sound signal |
CN115050386B (en) * | 2022-05-17 | 2024-05-28 | 哈尔滨工程大学 | Automatic detection and extraction method for whistle signal of Chinese white dolphin |
Also Published As
Publication number | Publication date |
---|---|
JPH087598B2 (en) | 1996-01-29 |