JPS62287297A

JPS62287297A - Voice detector

Info

Publication number: JPS62287297A
Application number: JP61130775A
Authority: JP
Inventors: 茨木　悟; 博基古川; 博之直野
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1986-06-05
Filing date: 1986-06-05
Publication date: 1987-12-14

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】３、発明の詳細な説明産業上の利用分野本発明は音声伝送装置、音声認識・装置（′Ｃ便用する
、雑音の混入した信号から音声信号の区間を正確に検出
する音声検出装置に関するものである。[Detailed Description of the Invention] 3. Detailed Description of the Invention Industrial Application Field The present invention is useful for voice transmission equipment, voice recognition equipment ('C), and is useful for accurately determining the section of a voice signal from a signal mixed with noise. The present invention relates to a voice detection device for detecting a voice.

従来の技術近年、スピーカホン、テレコンファレンスンステムの普
及が目ざましく、これ等の装置では・・ウリングの防止
、および室内騒音の除去のためにボイススイッチが使用
されている。また音声伝送のディジタル化も進んでおり
、無音声区間と音声区間を判別し、音声区間のみを伝送
し伝送効率を向上する研究開発も行なわれている。BACKGROUND OF THE INVENTION In recent years, speakerphones and teleconference systems have become widespread, and voice switches are used in these devices to prevent ringing and eliminate indoor noise. In addition, digitalization of voice transmission is progressing, and research and development is being conducted to improve transmission efficiency by distinguishing between non-voice sections and voice sections and transmitting only the voice sections.

これ等の装置の性能向上のために、雑音による誤動作の
無い音声検出装置が強く要望されている。In order to improve the performance of these devices, there is a strong demand for a voice detection device that does not malfunction due to noise.

この音声検出装置の従来の技術としては、例えば、電子
通信学会論文誌、　Ｍｏ１．　Ｊ　６３−Ａ　、　４７
　。As a conventional technique of this voice detection device, for example, the journal of the Institute of Electronics and Communication Engineers, Mo1. J63-A, 47
.

Ｐ、Ｐ、　１５９−１７６　（１９８０）、電子通信学
会技術研究報告、Ｃ人Ｓ　８４−２３８　、　Ｐ、　Ｐ
、　７１〜７８（１９８４）、岩通技報、　Ｔ１０１．
２０　、　、ｃｌ、　ｐ、ｐ、　１８−２１　（１９８
１）に示されている、雑音による誤動作を軽減した音声
検出装置がある。P, P, 159-176 (1980), Institute of Electronics and Communication Engineers Technical Research Report, C Person S 84-238, P, P
, 71-78 (1984), Iwatsu Giho, T101.
20, , cl, p, p, 18-21 (198
There is a voice detection device shown in 1) that reduces malfunctions caused by noise.

以下、図面を参照しながら、上述したような従来の音声
検出装置について説明を行なうつ第８図は従来の音声検
出装置のブ（１７り図を示すものである。第８図２でお
いて、８１は信号の入力端子、８２はこの信号を整流す
る整流回路、８３は立上がりの時定数が短く、立下がり
の時定数の長い樺犬値ホールド回路、８４は立下がりの
時定数が短かく、立上がりの時定数の長い極小値ホール
ド回路、８６ばこの極小値ホールド回路８４の出力信号
からしきい値信号を算出するしきい値設定回路、８６は
このしきい値信号と極大値ホールド回路で求めた極大値
信号とを比較して、極大値信号がしきい値信号より大き
い場合に音声信号が存在すると判定する比較回路、８７
はこの比較回路８６の判定信号の出力端子でちる。The conventional voice detection device as described above will be explained below with reference to the drawings. FIG. 8 shows a block diagram of the conventional voice detection device. , 81 is a signal input terminal, 82 is a rectifier circuit that rectifies this signal, 83 is a Kabainu value hold circuit that has a short rising time constant and a long falling time constant, and 84 has a short falling time constant. , a local minimum value hold circuit with a long rising time constant, a threshold setting circuit that calculates a threshold signal from the output signal of the local minimum value hold circuit 84 of 86, and 86 is a combination of this threshold signal and the local maximum value hold circuit. a comparison circuit that compares the obtained local maximum value signal and determines that an audio signal exists if the local maximum value signal is larger than the threshold signal; 87;
is determined at the output terminal of the judgment signal of this comparison circuit 86.

以上のように構成された音声検出装置について、以下そ
の動作について説明するっ第９図は第８図の各部の信号波形を示すもので、この図
を利用して説明を行なう。第９図において、９１は第８
図のＡ点の入力信号の波形、９２は第８図のＢ点の整流
信号の波形、９３は０点の琢犬値信号の波形、９４ばＤ
点の極小値信号の波形、９５はＥ点のしきい値信号の波
形、９６ばＦ点の判定信号の波形を示す。The operation of the voice detection apparatus constructed as above will be described below. FIG. 9 shows signal waveforms at each part of FIG. 8, and the explanation will be made using this figure. In Figure 9, 91 is the 8th
The waveform of the input signal at point A in the figure, 92 the waveform of the rectified signal at point B in Figure 8, 93 the waveform of the final value signal at point 0, 94 D
95 shows the waveform of the minimum value signal at the point, 95 shows the waveform of the threshold signal at the E point, and 96 shows the waveform of the determination signal at the F point.

まず第８図の入力端子８１に第９図の入力信号９１が入
力される。この入力信号９１において時刻Ｔｏから時刻
Ｔ１までは雑音区間であり、音声信号は含まれていない
。時刻Ｔ１から時刻Ｔ２までが音声信号区間であり、時
刻で２　　以後は雑音の区間である。雑音としては機器
の電気的な雑音と音声収音時の騒音による雑音がある。First, the input signal 91 shown in FIG. 9 is input to the input terminal 81 shown in FIG. In this input signal 91, the period from time To to time T1 is a noise section and does not include a voice signal. The period from time T1 to time T2 is an audio signal section, and the period after time 2 is a noise section. Noise includes electrical noise from equipment and noise caused by sound recording.

騒音としては空調設備、換気扇、自動車、工場設備等か
ら発生するものが考えられ、通常この掻音による雑音の
ほうが電気的な雑音よりも太さい１．入力信号９１に示
す波形はこの１音による雑音が王に混入しており、雑音
の振中冨は時間的に変動する。Noise can be generated from air conditioning equipment, ventilation fans, automobiles, factory equipment, etc., and the noise caused by this scratching is usually louder than electrical noise1. The waveform shown in the input signal 91 is mixed with noise due to this single note, and the amplitude of the noise varies over time.

この入力信号９１は第８図の整流回路８２で整流され第
９図の整流信号９２を得る。この整流信号９２は翫犬匝
ホールド回路８３で処理され極大値信号９３を得る。一
方整備信号９２は極小値ホールド回路８４にも入力され
、極小値信号９４となる。次にこの極小値信号９４はし
きい値設定回路８５により増幅されしきい値信号９５と
なる。This input signal 91 is rectified by a rectifier circuit 82 shown in FIG. 8 to obtain a rectified signal 92 shown in FIG. This rectified signal 92 is processed by a hold circuit 83 to obtain a maximum value signal 93. On the other hand, the maintenance signal 92 is also input to the minimum value hold circuit 84 and becomes a minimum value signal 94. Next, this minimum value signal 94 is amplified by a threshold setting circuit 85 and becomes a threshold signal 95.

このしきい値設定回路８５の増幅率は時刻Ｔｏ　　から
時刻Ｔ１　　および時刻Ｔ２から時刻ＴＳの雑音区間で
、しきい値信号９５が必ず極大値信号９３以上となるよ
うにあらかじめ設定されている。すなわち、あらかじめ
雑音振幅の極大髄と極小値の比を想定し、この比に見合
った増重率としている。次にこのしきい値信号９５と極
大嘆信号９３は比較回路８６で比較され、極大値信号９
３がしきい垣信号９５を超えた時に音声信号が存在して
いると判断し、判定信号９６を出力端子８７から外部機
器に出力する。この判定信号９６は時刻Ｔ１　　から時
刻Ｔ２　　でオン、すなわち音声信号が存在していると
判定しており、雑音下でも音声検出が正確に行なわれて
いることがわかる。The amplification factor of this threshold setting circuit 85 is set in advance so that the threshold signal 95 is always equal to or higher than the maximum value signal 93 in the noise period from time To to time T1 and from time T2 to time TS. That is, the ratio between the maximum value and the minimum value of the noise amplitude is assumed in advance, and the amplification rate is set in accordance with this ratio. Next, this threshold signal 95 and the maximum value signal 93 are compared in a comparison circuit 86, and the maximum value signal 93 is compared with the maximum value signal 93.
3 exceeds the threshold signal 95, it is determined that an audio signal is present, and a determination signal 96 is outputted from the output terminal 87 to an external device. This determination signal 96 is turned on from time T1 to time T2, that is, it is determined that a voice signal is present, and it can be seen that voice detection is performed accurately even under noise.

発明が解決しようとする問題点しかしながら、上記のような構成では、雑音振幅の変化
率の大きい場合には、この雑音を音声信号と誤って検出
するという問題点を有していた。Problems to be Solved by the Invention However, the above configuration has a problem in that when the rate of change in noise amplitude is large, this noise is mistakenly detected as a voice signal.

例えば空調のファンの不具合で、周期的に比較的高いレ
ベルの騒音を発生したり、プリンタ、あるいはタイプラ
イタかも断続的な掻音が発生している場合である。この
場合にはしきい値設定回路８５の増幅率を上昇させ、し
きい値を雑音振幅の極太値より高く設定しなければなら
ないが、従来の構成では増幅率を一定としており、この
処理は不可能であった。For example, a malfunctioning air conditioner fan may periodically generate a relatively high level of noise, or a printer or typewriter may produce intermittent scratching noises. In this case, it is necessary to increase the amplification factor of the threshold setting circuit 85 and set the threshold value higher than the extremely thick value of the noise amplitude, but in the conventional configuration, the amplification factor is kept constant, and this process is unnecessary. It was possible.

本発明は上記問題点に鑑み、振幅の時間的変化率が大き
い雑音が混入した信号からでも、正確に音声区間を検出
する音声検出装置を提供するものである。SUMMARY OF THE INVENTION In view of the above-mentioned problems, the present invention provides a speech detection device that accurately detects a speech section even from a signal mixed with noise having a large rate of change in amplitude over time.

問題点を解決するための手段この目的を達成するために本発明の音声検出装置は、雑
音信号が混入した音声信号を整流平滑し整流平滑信号を
得る整流平滑回路と、立上がりの時定数が短く、立下が
りの時定数が長く設定され、かつ、前記整流平滑信号の
極大値を求める極大値ホールド回路と、立下がりの時定
数が短く、立上がりの時定数が前記極大値ホールド回路
の立下がりの時定数よりも長く設定され、かつ前記極大
値信号の極小値を求める極小値ホールド回路と、この極
小値信号からしきい値信号を算出するしきい値設定回路
と、このしきい値信号と前記整流平滑信号とを比較して
、前記整流平滑信号が前記しきい値信号を協えた場合に
音声信号が存在していると判定する比較回路とから構成
されている。Means for Solving the Problems In order to achieve this object, the voice detection device of the present invention includes a rectifying and smoothing circuit that rectifies and smoothes a voice signal mixed with a noise signal to obtain a rectified and smoothed signal, and a rectifying and smoothing circuit that has a short rising time constant. , a local maximum value hold circuit having a long falling time constant and determining the local maximum value of the rectified smoothed signal, and a local maximum value holding circuit having a short falling time constant and a rising time constant of a minimum value hold circuit that is set longer than a time constant and that calculates the minimum value of the maximum value signal; a threshold setting circuit that calculates a threshold signal from this minimum value signal; and a comparison circuit that compares the rectified and smoothed signal with the rectified and smoothed signal and determines that an audio signal exists if the rectified and smoothed signal matches the threshold signal.

作用この構成によって極小値ホールド回路の出力として、雑
音の振幅の極大値を得ることができる。Effect: With this configuration, the maximum value of the noise amplitude can be obtained as the output of the minimum value hold circuit.

すなわち、極大値ホールド回路の出力は、音声の休止区
間では雑音振、＠の極大値を出力し、音声区間ではこの
出力は音声の振幅の最大値まで上昇する。したがってこ
の出力の極小値と求めれば雑音振幅の極大値を知ること
ができる。この雑音振幅の啄犬埴をしきい値設定回路で
少し増幅してしきい値とすることにより、雑音による誤
検出の無い音声検出装置が実現できることとなる。That is, the output of the local maximum value hold circuit outputs the maximum value of the noise amplitude and @ during the pause period of the voice, and this output increases to the maximum value of the amplitude of the voice during the voice period. Therefore, by finding the minimum value of this output, the maximum value of the noise amplitude can be found. By amplifying this noise amplitude a little in a threshold setting circuit and setting it as a threshold value, it is possible to realize a voice detection device that does not cause false detections due to noise.

実施例以下本発明の一実施例について、図面を参照しながら説
明する。EXAMPLE An example of the present invention will be described below with reference to the drawings.

第１図（ｄ本発明の一実１池例における音声検出装置の
ブロック図を示すものである。FIG. 1(d) shows a block diagram of a voice detection device in one embodiment of the present invention.

第１図において、１１は信号の入力端子、１２ばこの信
号を整流平滑する整流子１滑回路、１３は立上がりの時
定数が短く、立下がりの時定数つ長い極大（直ホールド
回路、１４は立上がりの時定数が長く、立下がりの時定
数が短く設定された極小値ホールド回路、１６ばこの咥
小値ホールド回路１４の出力信号からしきい値信号を算
出するしきい値設定回路、１６はこのしきい値信号と、
整流平滑回路１２で求めた整流子１脅信号とを比較して
、整流平滑信号がこのしきい値君号より大きい場合：・
て音声信号が存在すると判定する比較回路、１７：はこ
の比較回路１６の判定信号を出力する出力端子である。In Fig. 1, 11 is a signal input terminal, 12 is a commutator single-slip circuit for rectifying and smoothing the smoke signal, 13 is a local maximum (direct hold circuit) with a short rising time constant and one long falling time constant, and 14 is a direct hold circuit. A threshold value setting circuit 16 calculates a threshold signal from the output signal of the minimum value hold circuit 14, which has a long rising time constant and a short falling time constant. This threshold signal and
Comparing with the commutator 1 threat signal obtained by the rectification and smoothing circuit 12, if the rectification and smoothing signal is larger than this threshold value:・
17 is an output terminal for outputting a judgment signal from the comparison circuit 16.

以上のように構成された音声検出装置について、以下そ
の動作について説明する。The operation of the voice detection device configured as described above will be described below.

第２図は第１図の各部の信号波形を示すもので、この図
を利用して説明を行なう。第２図において、２１は第１
図のＡ点の入力信号の波形、２２は第１囚の８点の整流
平滑信号の波形、２３ば０点の翫犬値信号の波形、２４
はＤ点の極小値信号の波形、２５ばＥ屯のしきい値信号
の波形、２６４−ｉＦ点の判定信号の波形を示す。FIG. 2 shows signal waveforms at various parts in FIG. 1, and the explanation will be made using this diagram. In Figure 2, 21 is the first
In the figure, the waveform of the input signal at point A, 22 is the waveform of the rectified and smoothed signal at 8 points of the first prisoner, 23 is the waveform of the 0-point signal, 24
2 shows the waveform of the minimum value signal at point D, the waveform of the threshold signal at 25-iF, and the waveform of the determination signal at point 264-iF.

まず第１図の入力端子１１に第２図の入力信号２１が入
力される。この入力信号２１シておいて時刻Ｔｏから時
刻Ｔ１までと時刻Ｔ２から時刻Ｔ５までは雑音区間でち
り、音声信号は含まれていない。First, the input signal 21 shown in FIG. 2 is input to the input terminal 11 shown in FIG. The input signal 21 is a noise section from time To to time T1 and from time T2 to time T5, and contains no voice signal.

時刻Ｔ１から時刻Ｔ２までが音声信号区間である。The period from time T1 to time T2 is an audio signal section.

この入力信号２１は第１図の整流平滑回路１２で整流平
滑され第２図の整流平滑信号２２を得る。This input signal 21 is rectified and smoothed by the rectification and smoothing circuit 12 shown in FIG. 1 to obtain a rectified and smoothed signal 22 shown in FIG.

この整流平滑回路１２の平滑の時定数は数ｍ５６ｃ〜数
１０　ｍ　ｓｅｃの（直に設定し、音声の基本”［３ｍ
′Ｊ。The smoothing time constant of this rectifying and smoothing circuit 12 is several m56c to several 10 msec (directly set,
'J.

ンてよるリプルを除く、欠ンでこの整流平滑信号２２は
極大僅ホールド回路１３で処理され極大（値信号２３を
得る。この極大値ホールド回路１３の立下がりの時定数
を数秒に設定すれ；ブ時刻Ｔｏ　−Ｔ＋　Ｚ’）雑音区
間で雑音振幅の極大魚が得られる。こＤり大管信号２３
は音声が入力された時刻Ｔ、〜Ｔ２では上昇するが、音
声が無くなってから数秒〜ｆ１１０秒経過すれば、また
雑音振幅の極大噴？示すようになる。次にこの極大′直
信号２３は極小算ホールド回路１４に入力され極小値信
号２４として出力される。この極小實ホールド回路１４
の立上がりの時定数を数１０秒〜数分に設定すれば、極
大値信号２３の雑音振幅の極大値、すなわち時刻Ｔ。This rectified and smoothed signal 22 is processed by a maximum value hold circuit 13 to obtain a maximum value signal 23, excluding ripples caused by the input signal.The time constant of the fall of this maximum value hold circuit 13 is set to several seconds; The maximum noise amplitude is obtained in the noise section (time To -T+Z'). Main pipe signal 23
increases at time T, ~T2 when the voice is input, but after several seconds to f110 seconds have passed since the voice is no longer present, the noise amplitude peaks again? It comes to show. Next, this maximum' direct signal 23 is input to the minimum calculation hold circuit 14 and output as a minimum value signal 24. This extremely small hold circuit 14
If the time constant for the rise of is set to several tens of seconds to several minutes, the maximum value of the noise amplitude of the maximum value signal 23, that is, time T.

〜Ｔ１およびＴ３付近の値をホールドさせることができ
る。このようにして求めた極小値信号２４はしきい値設
定回路１６で少し増幅されしきい値信号２５となる。比
較回路１６では、このしきい値信号２５と整流平滑信号
２２を比較し、整流平滑信号２２がしきい値信号２５よ
り大きい場合に音声信号が存在していると判断し、判定
信号２６を出力端子１７から外部機器に出力する。この
判定信号２６は時刻Ｔ１〜Ｔ２でオン（０）Ｊ）すなわ
ち音声信号が存在していると判定しており、雑音下でも
正確に音声検出が行なわれていることを示している。It is possible to hold values near T1 and T3. The minimum value signal 24 obtained in this manner is slightly amplified by the threshold setting circuit 16 and becomes a threshold signal 25. The comparison circuit 16 compares this threshold signal 25 and the rectified smoothed signal 22, and if the rectified smoothed signal 22 is larger than the threshold signal 25, it determines that an audio signal exists, and outputs a determination signal 26. Output from terminal 17 to external equipment. This determination signal 26 is on (0) J) at time T1 to T2, that is, it is determined that a voice signal is present, indicating that voice detection is performed accurately even under noise.

以上のように、本実施例によれば、立上がりの時定数が
短く、立下がりの時定数が長く設定された極大値ホール
ド回路１３と、立下がりの時定数が短く、立上がりの時
定数が極犬瞳ホールド回路の立下がりの時定数よりも長
く設定された極小値ホールド回路１４を縦続接続するこ
とにより、雑音振幅の極大値を求めることができ、この
甑犬値よりもわずかに高い値にしきい値を設定すること
により、常に雑音振幅の極太値よりわずかに高いしきい
値を自動的に作や出すことができる。したがって振幅の
時間的変化の異なる種々の雑音にも自動的に適応し、常
に正確で高感度な音声検出の行なえる装置が実現できる
。As described above, according to this embodiment, the maximum value hold circuit 13 is set to have a short rising time constant and a long falling time constant, and a local maximum value hold circuit 13 having a short falling time constant and a maximum rising time constant. By cascade-connecting local minimum value hold circuits 14 that are set longer than the falling time constant of the dog-pupil hold circuit, the maximum value of the noise amplitude can be determined, and the value can be set to a value slightly higher than this dog-pupil value. By setting the threshold value, it is possible to automatically create a threshold value that is always slightly higher than the extreme value of the noise amplitude. Therefore, it is possible to realize a device that can automatically adapt to various types of noise whose amplitudes change over time and can always perform accurate and highly sensitive voice detection.

以下本発明の第２の実施例について図面を参照しながら
説明する。A second embodiment of the present invention will be described below with reference to the drawings.

第３図は本発明の第２の実施例における音声検出装置の
ブロック図を示すものである。第３図において、３１は
極小値信号によってその立下がりの時定数が別画される
極大値ホールド回路であり、他の構成は第１図の実施例
とまったく同じである。FIG. 3 shows a block diagram of a voice detection device according to a second embodiment of the present invention. In FIG. 3, numeral 31 is a local maximum value hold circuit whose fall time constant is determined differently depending on the local minimum value signal, and the other configurations are exactly the same as the embodiment shown in FIG.

以上のように構成された音声検出装置について、以下そ
の動作を説明する。The operation of the voice detection device configured as described above will be described below.

第４図は第３図に示す第２の実施例の各部の信号波形を
示すもので、この図を使用して説明を行なう。第４図に
おいて、４１は第３図の８点の整流平滑信号の波形、４
２は第３図の０点の陰犬値信号の波形、４３は第３図の
Ｄ点の極小値信号の波形、、４４．４５は参照のために
記した第１図の実施例の０点、Ｄ点の信号波形をそれぞ
れ示す。FIG. 4 shows signal waveforms at various parts of the second embodiment shown in FIG. 3, and the explanation will be given using this diagram. In FIG. 4, 41 is the waveform of the rectified and smoothed signal at 8 points in FIG.
2 is the waveform of the negative value signal at point 0 in FIG. 3, 43 is the waveform of the minimum value signal at point D in FIG. The signal waveforms at point and point D are shown, respectively.

第３図において、まず入力端子１１に信号が入力され整
流平滑回路１２で整流平滑され、整流平滑信号４１を得
る。この信号において時刻Ｔ、　−Ｔ２に３音節からな
る音声と、時刻Ｔ５〜Ｔ４に２音節からなる音声が存在
する。時刻Ｔｏ−Ｔ、では極大値信号４２および極小値
信号４３は雑音の唖大振幅値を示している。時刻で１　
において音声が入力されると極大値信号４２は急に上昇
する。この第１番目の音節が消えると、整流平滑信号４
１が雑音の振幅値まで減少するため、極大値信号４２は
減衰する。In FIG. 3, a signal is first input to the input terminal 11 and rectified and smoothed by the rectification and smoothing circuit 12 to obtain a rectified and smoothed signal 41. In this signal, there are voices consisting of three syllables at times T and -T2, and voices consisting of two syllables at times T5 to T4. At time To-T, the maximum value signal 42 and the minimum value signal 43 indicate the maximum amplitude value of the noise. 1 by time
When voice is input at , the maximum value signal 42 suddenly rises. When this first syllable disappears, the rectified and smoothed signal 4
1 decreases to the noise amplitude value, the local maximum signal 42 is attenuated.

本実施例の極大値ホールド回路３１は、この減衰すなわ
ち立下がりの時定数を極小値信号４３で制（財）してお
り、極大値信号４２が極小値信号４３よシ高ければ高い
ほどその立下がりの時定数を短くしている。したがって
本実施例における極大値信号４２は第１図に示す第１の
実施例における極大値信号４４よりも速く減衰すること
となる。この極大値信号４２は音節が近接している場合
には雑音の振幅値まで下がることはなく、このため極小
値信号４３は少しづつ増加する。The local maximum value hold circuit 31 of this embodiment controls the time constant of this attenuation, that is, falling, by the local minimum value signal 43, and the higher the local maximum value signal 42 is than the local minimum value signal 43, the higher the rise. The falling time constant is shortened. Therefore, the maximum value signal 42 in this embodiment attenuates faster than the maximum value signal 44 in the first embodiment shown in FIG. This maximum value signal 42 does not decrease to the amplitude value of noise when syllables are close to each other, and therefore the minimum value signal 43 increases little by little.

次に時刻で２〜Ｔ５の少し長い音声の休止区間にはいる
。この休止区間の中はどで、極大値信号４２は雑音の極
大振幅まで低下し、極小振＠４３の値を更新し、これを
低下させる。次に時刻で５〜で４でまた音声が入力され
、極大値信号４２が大きくなり、再び極小値信号４３が
上昇するが、時刻Ｔ４で音声が消えると時刻で５付近で
また極大値信号４２が雑音の極大値振幅まで低下し、極
小振＠４３のｒ直を更新し、これを低下させる。Next, the system enters a slightly longer audio pause section from time 2 to T5. During this rest period, the maximum value signal 42 drops to the maximum amplitude of the noise, updates the value of the minimum amplitude @43, and lowers it. Next, at time T4, the voice is input again at 4, the local maximum value signal 42 becomes large, and the local minimum value signal 43 rises again, but when the voice disappears at time T4, the local maximum value signal 42 increases again at around 5. decreases to the maximum amplitude of the noise, updates the r-direction of the minimum amplitude @43, and lowers it.

一方、第１の実施例では、極大値信号４４は立下がりの
時定数が一定であるため時刻で２〜Ｔ５に示すような短
い音声の休止区間では、雑音の極大振幅まで下がること
ができずに、極小値信号４５は増加しつづける。比較回
路１６のしきい値は、この極小値信号４２．４５から算
出されるため、本実施例ではしきい値の信頼法が第１の
実施例に比して啄めて向上する。すなわち、比較的短い
音声の休止区間でも、この休止区間を利用してしきい値
の更新が行なえる。したがって長い会話音声中でも、的
確にしきい値を自動調節することが可能となる。On the other hand, in the first embodiment, since the time constant of the fall of the maximum value signal 44 is constant, it cannot decrease to the maximum amplitude of the noise during a short speech pause period as shown at time 2 to T5. Meanwhile, the minimum value signal 45 continues to increase. Since the threshold value of the comparator circuit 16 is calculated from this minimum value signal 42.45, the reliability of the threshold value in this embodiment is greatly improved compared to the first embodiment. That is, even if the audio pause section is relatively short, the threshold value can be updated using this pause section. Therefore, it is possible to accurately and automatically adjust the threshold value even during long conversations.

以下本発明の第３の実施例について、図面を参照しなが
ら説明する。A third embodiment of the present invention will be described below with reference to the drawings.

第６図は本発明の第３の実施例における音声検出装置の
ブロック図を示すものである。第５図において、６１は
自己の出力信号によってその立ち上がりの時定数を制御
する極小値ホールド回路であり、他の構成は、第１図の
実施例とまったく同じである。FIG. 6 shows a block diagram of a voice detection device according to a third embodiment of the present invention. In FIG. 5, numeral 61 is a minimum value hold circuit which controls its rising time constant by its own output signal, and the other configurations are exactly the same as the embodiment shown in FIG.

第６図は第５図の実施グ］の谷部の信号波形を示してお
り、この図を使用して本実施例の動作の説明を行なう。FIG. 6 shows the signal waveform at the valley part of the embodiment shown in FIG. 5, and the operation of this embodiment will be explained using this diagram.

第６図において、６１は第６図のＢ点の整流平滑信号の
波形、６２は第５図の０点の極大値信号の波形、６３は
第５図のＤ点の極小値信号の波形、６４は参照のために
記１７た第１図の実施ダｊのＤ点の極小値信号の波形を
示すっ第６図において、入力端子１１に信号が入力され
、整流平滑信号６１と極大値信号６２が得られる。第２
の実施例の動作説明で述べたように、第１の実施例と同
じ極大値ホールド回路３１から出力される極大値信号６
２は時刻Ｔ１〜Ｔ５では雑音の極大ｍＩＩＩＭ値まで下
がることはない。したがって第１の実施例での極大値信
号は極大値信号６４のように単調に増加し、しきい値が
変化してしまうことになる。しかしながら、この第５図
に示す本実施例では、極小値ホールド回路１３の立上が
りの時定数を、自己の出力、すなわち極小（面信号６３
で制御している。すなわち、極大値信号６２が極小値信
号６３より高ければ高いほど、立上がりの時定数を長く
しており、このだめ、音声によって極大値信号６２の値
が上昇した時刻で１〜Ｔ５では極小値６３の増加スピー
ドは低く押えられ、長い会話音声中でもしきい値の上昇
を小さくすることができる。In FIG. 6, 61 is the waveform of the rectified and smoothed signal at point B in FIG. 6, 62 is the waveform of the maximum value signal at point 0 in FIG. 5, 63 is the waveform of the minimum value signal at point D in FIG. Reference numeral 64 indicates the waveform of the minimum value signal at point D of implementation daj in FIG. 1, which is noted in 17 for reference. In FIG. 62 is obtained. Second
As described in the operation description of the embodiment, the maximum value signal 6 output from the maximum value hold circuit 31, which is the same as that of the first embodiment,
2 does not fall to the maximum noise mIIIM value from time T1 to T5. Therefore, the maximum value signal in the first embodiment increases monotonically like the maximum value signal 64, and the threshold value changes. However, in this embodiment shown in FIG.
It is controlled by. That is, the higher the maximum value signal 62 is than the minimum value signal 63, the longer the time constant of the rise is made. The rate of increase in the threshold value can be kept low, and the increase in the threshold value can be kept small even during long conversations.

第７図は本発明の第４の実施例？示しており、この実施
例では、第３図の極大値ホールド回路３１と第５図の極
小値ホールド回路５１を組み合わせて、さらにしきい値
の侶頼注を高めたものである。Is FIG. 7 the fourth embodiment of the present invention? In this embodiment, the maximum value hold circuit 31 of FIG. 3 and the minimum value hold circuit 51 of FIG. 5 are combined to further increase the threshold value.

発明の効果本発明は整流平滑信号の極大値を極太値ホールド回路で
求め、次にこの極大値信号の極小値を極小値ホールド回
路で求めることにより、雑音の極大振幅を正確に求める
ことができる。このことにより、振、＠の時間的変化の
異なる種々の雑音にも自動的に適応し、常に正確で高感
度な音声検出の行なえる音声検出装置を実現できるもの
である。Effects of the Invention The present invention can accurately determine the maximum amplitude of noise by determining the maximum value of a rectified and smoothed signal using a maximum value hold circuit, and then determining the minimum value of this maximum value signal using a minimum value hold circuit. . As a result, it is possible to realize a voice detection device that can automatically adapt to various types of noise that have different temporal changes, and can always perform accurate and highly sensitive voice detection.

また、極大値ホールド回路の立下がりの時定数を極小値
信号で制（財）した９、極小値ホールド回路の立上がり
の時定数を自己の出力である極小値信号で制御すること
により長い会話音声中でも、正確でかつ高感度な音声検
出性能を維持する音声検出装置も実現できるものである
。In addition, by controlling the falling time constant of the local maximum value hold circuit with the local minimum value signal 9, and controlling the rising time constant of the local minimum value hold circuit with the local minimum value signal that is its own output, it is possible to produce long conversational sounds. Among these, it is possible to realize a voice detection device that maintains accurate and highly sensitive voice detection performance.

【図面の簡単な説明】[Brief explanation of drawings]

第１図は本発明の第１の実施例における音声検出装置の
ブロック図、第２図は第１図の各部における信号の波形
図、第３図は本発明の第２の実施例における音声検出装
置の１０２２図、第４図は第３図の各部における信号の
波形図、第６図は本発明の第３の実施例における音声検
出装置のブロック図、第６図は第６図の各部における信
号の波形図、第７図は本発明の第４の実施りｊにおける
音声検出装置のブ（１７り図、第８図は従来の音声検出
装置のプ０７り図、第９図は第８図の各部における信号
の波形図である。１２・・・・・・整流平滑回路、１３　、３１・・・・
極大値ホールド回路、１４．６１・・・・・極小値ホー
ルド回路、１５・・・・・・しきい値設定回路、１６・
・・・・比較回路。代理人の氏名　弁理士　中　尾　敏　男　ほか１名第２
図FIG. 1 is a block diagram of a voice detection device in a first embodiment of the present invention, FIG. 2 is a waveform diagram of signals in each part of FIG. 1, and FIG. 3 is a voice detection device in a second embodiment of the present invention. 1022 diagram of the device, FIG. 4 is a waveform diagram of the signal in each part of FIG. 3, FIG. 6 is a block diagram of the voice detection device in the third embodiment of the present invention, and FIG. The waveform diagram of the signal, FIG. It is a waveform diagram of signals in each part of the figure. 12... Rectifier smoothing circuit, 13, 31...
Maximum value hold circuit, 14.61... Minimum value hold circuit, 15... Threshold setting circuit, 16.
...Comparison circuit. Name of agent: Patent attorney Toshio Nakao and 1 other person 2nd
figure

Claims

【特許請求の範囲】[Claims]

（１）雑音信号が混入した音声信号を整流平滑する整流
平滑回路と、立上がりの時定数が短く、立下がりの時定
数が長く設定され、かつ前記整流平滑回路で得られた整
流平滑信号の極大値を求める極大値ホールド回路と、立
下がりの時定数が短く、立上がりの時定数が前記極大値
ホールド回路の立下がりの時定数よりも長く設定され、
かつ前記極大値ホールド回路で得られた極大値信号の極
小値を求める極小値ホールド回路と、この極小値ホール
ド回路で得られた極小値信号からしきい値信号を算出す
るしきい値設定回路と、このしきい値信号と前記整流平
滑信号とを比較して、前記整流平滑信号が前記しきい値
信号を超えた場合に音声信号が存在していると判定する
比較回路とを具備したことを特徴とする音声検出装置。(1) A rectifying and smoothing circuit that rectifies and smoothes an audio signal mixed with a noise signal, and a rectifying and smoothing circuit that has a short rising time constant and a long falling time constant, and the maximum of the rectifying and smoothing signal obtained by the rectifying and smoothing circuit. a local maximum value hold circuit for determining the value; a falling time constant is short and a rising time constant is set longer than a falling time constant of the local maximum value holding circuit;
and a minimum value hold circuit for determining the minimum value of the maximum value signal obtained by the maximum value hold circuit, and a threshold setting circuit for calculating a threshold signal from the minimum value signal obtained by the minimum value hold circuit. , a comparison circuit that compares the threshold signal and the rectified smoothed signal and determines that an audio signal is present when the rectified smoothed signal exceeds the threshold signal. Characteristic voice detection device.

（２）極大値ホールド回路は、極大値信号が極小値信号
より大きくなればなるほど立下がりの時定数を短くする
ように構成されていることを特徴とする特許請求の範囲
第１項記載の音声検出装置。(2) The audio according to claim 1, wherein the maximum value hold circuit is configured to shorten the falling time constant as the maximum value signal becomes larger than the minimum value signal. Detection device.

（３）極小値ホールド回路は、極大値信号が極小値信号
より大きくなればなるほど立上がりの時定数を長くする
ように構成されていることを特徴とする特許請求の範囲
第１項または第２項記載の音声検出装置。(3) The local minimum value hold circuit is configured to lengthen the rising time constant as the local maximum value signal becomes larger than the local minimum value signal. The voice detection device described.