JPH01183233A

JPH01183233A - Voice detector

Info

Publication number: JPH01183233A
Application number: JP677288A
Authority: JP
Inventors: Yuichi Shiraki; 白木　裕一; Osamu Noguchi; 修野口; Takao Suzuki; 孝夫鈴木; Yasuo Shoji; 庄司　保夫
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1988-01-18
Filing date: 1988-01-18
Publication date: 1989-07-21

Abstract

PURPOSE:To substantially eliminate a voice lack while appropriate activity is maintained by prolonging a hand-over time as an adaptive power threshold is enlarged. CONSTITUTION:The output Vout of a hangover control part 5 is inputted to an adaptive power threshold calculation part 6. Since a mean power P inputted from a power calculation part 3 is a background noise power, the threshold calculation part 6 calculates an adaptive power threshold AP and supplies the power threshold AP to a ringing tone/silence decision part 4 and the hangover control part 5 when Vout=0, for example. In such a case, the hangover control part 5 generates the hangover time which is changed in accordance with the size of the adaptive power threshold to protect the weak voice part subject to lack. Thus, activity is appropriately maintained irrespective of the height of a power level in background noise, and interruption of word and word cut can be prevented, whereby satisfactory voice quality can be realized.

Description

【発明の詳細な説明】（産業上の利用分野）本発明は、たとえばディジタル音声挿入システム、音声
／ｅチケット信システムなどのディジタル音声通信シス
テムに用いられる音声検出器に関する。DETAILED DESCRIPTION OF THE INVENTION (Field of Industrial Application) The present invention relates to a voice detector used in digital voice communication systems such as, for example, digital voice insertion systems and voice/e-ticket communication systems.

（従来の技術）たとえば加藤、西谷、下山、［適応閾値温音声検出器、
電子通信学会通信方式研究会資料Ｃ３８４−１８７に記
載の音声検出では、語頭と語尾の間に無音区間のある語
、例えば英単語の５ｕｃｈ　＋　ｅｎｄ、等の音声を検
出する場合、語尾を雑音とみなして検出しない語尾欠落
や、語尾の大部分が失われる語中脱落を改善する提案が
行われている。(Prior art) For example, Kato, Nishitani, Shimoyama, [Adaptive Threshold Temperature Speech Detector,
In the speech detection described in the Institute of Electronics and Communications Engineers Communication Systems Research Group Material C384-187, when detecting speech in words with a silent interval between the beginning and end of the word, such as an English word 5uch + end, the end of the word is considered to be noise. Proposals have been made to improve word endings, which are not detected by conventional methods, and word dropouts, in which most of the ending of a word is lost.

同文献の適応閾値塑音声検出器では、背景雑音のノソワ
ーレベルに比例して変動する適応パワー閾値を無音時に
高く、有音時に低く設定し、入力信号の平均ｉ４ワーを
適応Ａ？ワー閾値と比較して入力信号が大きいとき有音
と判定し、入力信号が小さいとき無音と判定する。そし
て有音から無音に変化したとき、有音判定の状態を所定
期間継続させるために、有音時間の長短により、ロング
ハングオーバ時間とジョートハングオーバ時間が付加さ
れる。これら２種類の）・ングオーパ時間は、適応パワ
ー閾値が大きい場合、仮りに蔗音と判定して直ちに無音
検出信号を出力したとすると弱電力の語尾がカットされ
てし１う音声脱落が起るのを防止するために設けられて
いる。In the adaptive threshold plastic speech detector of the same document, the adaptive power threshold, which varies in proportion to the noise level of background noise, is set high when there is no sound and low when there is sound, and the average i4 power of the input signal is set to the adaptive A? When the input signal is large compared to the power threshold, it is determined that there is a sound, and when the input signal is small, it is determined that there is no sound. When the sound state changes to silence, a long hangover time and a jolt hangover time are added depending on the length of the sound time in order to continue the state of sound presence determination for a predetermined period of time. When the adaptive power threshold of these two types is large, if a silence detection signal is immediately output after determining that it is a melody, the ending of a word with low power will be cut off, resulting in voice dropout. It is provided to prevent

次に、音声検出器が適応されるディジタル音声挿入シス
テム、音声ノクケット通信システムでは、音声のアクテ
ィビティ、すなわち有音時Ｗ（無音時間＋有音時間）の
百分率が４０％程度であり、残シの無音時間に他の情報
を伝送して伝送路の有効利用を図っている。前述の・・
ングオーパ時間は長すぎるとアクティビティを増加させ
るので、音声検出器としては、アクティビティを大きく
しない程度の長さのハングオーバ時間を付加する方法が
望ましい。Next, in the digital voice insertion system and the voice communication system to which the voice detector is applied, the voice activity, that is, the percentage of voice time W (silent time + voice time) is about 40%, and the percentage of voice activity is about 40%. Other information is transmitted during silent periods to make effective use of the transmission path. As mentioned above...
If the hangover time is too long, the activity will increase, so it is desirable for the voice detector to add a hangover time that is long enough not to increase the activity.

（発明が解決しようとする問題点）しかしながら、上記の方法では、有音時間の長短により
、２種類の固定長のロングノ−ングオーパ時間またはシ
ョートハングオーバ時間が付加されるので、次の問題点
があった。(Problems to be Solved by the Invention) However, in the above method, two types of fixed length long hangover time or short hangover time are added depending on the length of the active time, so the following problem occurs. there were.

まず、適応パワー閾値は、背景雑音の電力レベルに追従
して高低するので、適応Ａワー閾値以下の音声信号の弱
電力部分は、背景雑音が高いとき増加し、低いとき減少
する。したがって、単に有音区間の長短によって固定時
間長のハングオーバ時間を選択するだけであると、同一
有音時間であっても、背景雑音により音声欠落部分が異
カシ、ハングオーバ時間が音声欠落部分より長すぎたり
、短かすぎたシすることが起り得る。長ずざるとアクテ
ィビティが増加し、無音圧縮率を減少させ、短かすぎる
と語中、語尾切断が起こり、伝送品質を劣化させるとい
う問題が生ずる。First, the adaptive power threshold increases or decreases following the power level of the background noise, so the weak power portion of the audio signal below the adaptive A-power threshold increases when the background noise is high and decreases when the background noise is low. Therefore, if you simply select a hangover time of a fixed length based on the length of the voiced section, even if the voiced period is the same, the voice-missing portion will be different due to background noise, and the hangover time will be longer than the voice-missing portion. It may happen that it is too long or too short. If the length is not too long, the activity will increase and the silence compression rate will be reduced, and if the length is too short, mid-word and ending truncations will occur, leading to a problem of deterioration of transmission quality.

本発明は、上記の問題点を除去し、実質的に過不足のな
いハングオーバ時間を付加することができる音声検出器
を提供することを目的とする。SUMMARY OF THE INVENTION An object of the present invention is to provide a voice detector that can eliminate the above-mentioned problems and add substantially just the right amount of hangover time.

（問題点を解決するための手段）本発明は上述の問題を解決するために、適応／’Ｐワー
閾値が大きくなるのに従って長く可変するハングオーバ
時間を設定する。(Means for Solving the Problems) In order to solve the above-mentioned problems, the present invention sets a hangover time that varies as the adaptive/'P-war threshold increases.

本発明による音声検出器は、音声信号を含む入力信号の
電力を算出する電力算出手段と、入力信号の電力を閾値
と比較して、閾値に対して入力信号の電力が大きいとき
は有音状態をとり小さいときは無音状態をとる判定信号
を発生する比較判定手段と、判定信号を受けて判定信号
に応じた検出信号を出力し、判定信号が有音状態から無
音状態に変化したときは検出信号をハングオーバ時間だ
けさらに有音状態に維持するハングオーバ制御手段とを
有し、ハングオーバ制御手段は、電力算出手段で算出し
た電力に適応的に応動する閾値を比較判定手段に与え、
ハングオーバ時間を閾値に関連して可変とする。The audio detector according to the present invention includes a power calculation means for calculating the power of an input signal including an audio signal, and a sound state when the power of the input signal is compared with a threshold value and the power of the input signal is larger than the threshold value. a comparison/judgment means that generates a determination signal that assumes a silent state when it is small; and a comparison/judgment means that receives the determination signal and outputs a detection signal corresponding to the determination signal, and detects when the determination signal changes from a sound state to a silent state. and a hangover control means for further maintaining the signal in a sound state for a hangover time, the hangover control means providing the comparison determination means with a threshold value that adaptively responds to the power calculated by the power calculation means,
Make the hangover time variable in relation to the threshold.

（作用）適応ノクワー閾値が大きくなるに従って音声欠落部分が
多くなるはずであるが、本発明によれば、適応１４？ワ
ー閾値が大きくなるに従ってノ・ングオーパ時間を可変
的に長くする適切な関係を設定することにより、適切な
アクティビティを維持しながら実質的に音声欠落をなく
すことができる。(Effect) As the adaptive noise threshold increases, the number of voice missing parts should increase, but according to the present invention, the adaptive 14? By setting an appropriate relationship in which the output time is variably lengthened as the power threshold increases, voice dropouts can be substantially eliminated while maintaining appropriate activity.

（実施例）本発明の詳細な説明に先立って、第４図を参照して前記
文献に記載された従来廻の音声検出器を説明する。(Example) Prior to a detailed explanation of the present invention, a conventional voice detector described in the above-mentioned document will be explained with reference to FIG.

入力端子８より入力されるサンプル信号は、符号変換回
路１０によ多振幅の大きさのみを表わす絶対値信号に変
換されて・ぐワー算出部１１と閾値算出部１４に入力さ
れる。閾値算出部１４では、ハングオーバ制御部１３の
出力Ｖｏｕｔ−が入力され、有音検出信号Ｖｏｕｔが無
音を示しているとき、絶対値信号から背景雑音レベルを
算出する。そしてこの雑音レベルに応じた振幅閾値ＴＨ
Ｉ　、　ＴＨ２を算出してパワー算出部１１に出力する
。また有音・無音判定部１２に出力する判定閾値ＴＨ３
は、ハングオー・９制御部１３の出力Ｖｏｕｔが無音を
示している時は高い閾値に設定し、有音を示している時
は、低い閾値に設定し、かつ雑音レベルに応じて適応的
に変動させる。The sample signal inputted from the input terminal 8 is converted by the code conversion circuit 10 into an absolute value signal representing only the magnitude of the amplitude, and is inputted to the power calculation section 11 and the threshold value calculation section 14. The threshold calculation unit 14 receives the output Vout- of the hangover control unit 13, and calculates the background noise level from the absolute value signal when the sound detection signal Vout indicates silence. And the amplitude threshold TH according to this noise level
I and TH2 are calculated and output to the power calculation section 11. Also, the determination threshold value TH3 output to the sound/non-sound determination unit 12
When the output Vout of the Hang-O-9 control unit 13 indicates silence, a high threshold is set, and when it indicates sound, a low threshold is set, and the threshold is adaptively varied according to the noise level. let

パワー算出部１１では、入力される絶対値信号を振幅閾
値ＴＨＩ　、　ＴＨ２と比較し、その比較結果に従い比
較結果＋ｃ　、　＋ｌ）　、　−ａを出力する。The power calculation unit 11 compares the input absolute value signal with amplitude thresholds THI and TH2, and outputs comparison results +c, +l) and -a according to the comparison results.

これを累積回路（図示せず）で累積し、累積値として有
音・無音判定部１２へ出力する。有音・無音判定部１２
では、累積値と判定閾値ＴＨ３との比較が行われる。累
積値が閾値ＴＨ３より大きいと有音と判定され、出力Ｖ
が「１」になる。This is accumulated by an accumulation circuit (not shown) and outputted to the sound/silence determining section 12 as a cumulative value. Sound/silence determination unit 12
Then, the cumulative value is compared with the determination threshold TH3. If the cumulative value is larger than the threshold TH3, it is determined that there is a sound, and the output V
becomes "1".

無音と判定されるとｖ＝０が出力される。この出力Ｖは
ハングオーバ制御部１３に入力される。If it is determined that there is no sound, v=0 is output. This output V is input to the hangover control section 13.

ハングオーバ制御部１３では、Ｖ−１のとき、その出力
Ｖｏｕｔが「１」となる。入力Ｖが「１」から「０」に
変化すると、Ｖ−１のそれまでの継続時間に応じてロン
グハングオーバ時間トショートハングオーバ時間のいず
れかが選択的に付加される。ロングハングオーバのとき
、雑音等による短時間誤検出が生じてもショートハング
オーバに切シ替らないように構成されている。In the hangover control unit 13, its output Vout becomes "1" when the voltage is V-1. When the input V changes from "1" to "0", either a long hangover time or a short hangover time is selectively added depending on the duration of V-1 up to that point. During a long hangover, even if a short-time erroneous detection occurs due to noise or the like, it is configured not to switch to a short hangover.

ハングオーバ中の検出信号Ｖ　ｏｕｔは「１」を保って
いる。The detection signal V out remains at "1" during hangover.

次に、本発明による音声検出器の実施例を詳細に説明す
る。Next, an embodiment of the speech detector according to the present invention will be described in detail.

第１図はこの発明の一実施例を示す回路図である。入力
端子１より入力されるサンプル信号は・ぐワー算出部３
に入力される。パワー算出部３は、サンプル信号につい
てその所定の長さの期間すなわちブロック毎に平均・や
ワーＰを算出し、有音・無音判定部４および適応・ぐワ
ー閾値算出部６にこれを出力する。適応・ぞワー閾値算
出部６には、ハングオーバ制御部５よりその出力Ｖｏｕ
ｔが入力されている。たとえばＶ　ｏｕｔ−＝０のとき
、・クワ−算出部３より入力する平均パワーＰは背景雑
音パワーであるから、閾値算出部”　　６はこれに応じ
た適応・臂ワー閾値ＡＰ１Ｆｔ算出して有音・無音判定
部４およびノ・ングオーパ制御部５にパワー閾値ＡＰを
供給する。FIG. 1 is a circuit diagram showing one embodiment of the present invention. The sample signal input from the input terminal 1 is sent to the power calculation unit 3.
is input. The power calculation unit 3 calculates the average power P for each predetermined length period of the sample signal, that is, for each block, and outputs this to the sound/non-sound determination unit 4 and the adaptive power threshold calculation unit 6. . The adaptive threshold calculation unit 6 receives the output Vou from the hangover control unit 5.
t has been input. For example, when V out = 0, the average power P input from the noise calculation section 3 is the background noise power, so the threshold calculation section 6 calculates the adaptive arm threshold AP1Ft and detects the sound. - Supply the power threshold value AP to the silence determination section 4 and the non-verbal control section 5.

有音・無音判定部４は、パワー算出部３からの平均ノぐ
ワーＰが適応／ぐワー閾値ＡＰより大きいと有音、小さ
いと無音と判定する。この判定結果は有音・無音判定信
号Ｖとしてノ・ングオーパ制御部５に出力される。ノ・
ングオーパ制御部５は入力信号Ｖが有音を示していると
き出力端子２の検出信号Ｖｏｕｔを「工」とし、無音の
ときはＶｏｕｔｗＯを出力する。ハングオーバ制御部５
は塘た、入力信号Ｖが有音から無音になると、それまで
の有音時間の長短によりロングハングオーバ時間または
ショートハングオーバ時間ヲ択−的に付加し、その付加
されたハングオーバ時間中はＶｏｕｔｗｌを出力する。The sound/non-sound determination section 4 determines that there is a sound when the average sound P from the power calculation section 3 is larger than the adaptive/warr threshold value AP, and determines that there is no sound when it is smaller. This determination result is output as a sound/non-sound determination signal V to the non-verbal control section 5. of·
The operating control unit 5 sets the detection signal Vout at the output terminal 2 to "work" when the input signal V indicates the presence of sound, and outputs VoutwO when there is no sound. Hangover control unit 5
Then, when the input signal V changes from sound to silence, a long hangover time or a short hangover time is selectively added depending on the length of the sound time up to that point, and during the added hangover time, Voutwl Output.

次に、本音声検出器の各部４，５．６について詳説する
。Next, each section 4, 5.6 of this audio detector will be explained in detail.

有音・無音判定部４では、・ぐワー算出部１１で算出さ
れた所定長のブロックにおける平均パワーＰと適応・母
ワー閾値ＡＰとを用いて、下式（１）に従って有音・無
音を判定して有音・無音の判定信号Ｖを出力する。The voice/silence determination unit 4 determines voice/silence according to the following formula (1) using the average power P in a block of a predetermined length calculated by the voice calculation unit 11 and the adaptive power threshold AP. It makes a judgment and outputs a sound/non-sound judgment signal V.

ハングオーバ制御部５は、ハングオーバ制御手段の一部
をなし、有音・無音判定信号ＶがＶ−１からＶ−Ｏに変
化した時、ハングオーバ時間だけＶ　ｏｕｔ　ｗ＝　１
を出力する。即ち、ハングオーバ時間の決定は、次によ
る。The hangover control unit 5 constitutes a part of the hangover control means, and when the sound/no-sound determination signal V changes from V-1 to V-O, the hangover control unit 5 sets V out w= 1 for the hangover time.
Output. That is, the hangover time is determined as follows.

ｔｔ）ｖ−１のブロックが連続して所定数以上発生した
場合、ロングハングオーバ時間ＨＬとし、それ以外はシ
ョートハングオーバ時間Ｈ８とする。tt) If a predetermined number or more of v-1 blocks occur consecutively, the long hangover time is set as HL; otherwise, the short hangover time is set as H8.

（ｉｉ）　　（ｉ）の決定が行なわれた時に用いられた
適応ｉ４ワー閾値によってロングハングオーバ時間ＨＬ
、ショートハングオーバ時間Ｉｓの数値を決定する。(ii) the long hangover time HL depending on the adaptive i4war threshold used when the decision in (i) was made;
, determine the value of the short hangover time Is.

第２図に適応Ａ？クワ−値とハングオーバ時間の対応関
係を示す。適応パワー閾値が大きくなると、ロングハン
グオーバ時間ＨＬも大きな値となるような対応関係を持
たせる。一方シヨードハングオーバ時間Ｈ８は背景雑音
ノクワーに対する誤動作率を一定にするために、適応・
母ワー閾値に対して一定値としている。この対応関係を
保つかぎシ、具体的な設定値は任意でよい。Adapted to Figure 2 A? The relationship between the hangover time and the hangover time is shown. A correspondence relationship is established such that as the adaptive power threshold increases, the long hangover time HL also increases. On the other hand, the side hangover time H8 is adapted and
It is set as a constant value with respect to the mother power threshold. The key and specific setting values for maintaining this correspondence may be arbitrary.

以上の点が従来例のハングオーバととくに異なる。The above point is particularly different from the hangover of the conventional example.

適応ｉ４ワ１閾値算出部６は、やはシハングオーパ制御
手段の一部をなす。無音検出信号ＶｏｕｔがｒＯＪのと
き、その無音と判定されているかぎシ連続するブロック
の数をカウントする。その計数値が所定数に達した場合
、それら所定数のブロック全体の平均ノクワー下を求め
、次の（２）式の適応ノ４’ワー閾値ＡＰを算出する。The adaptive i4wa1 threshold value calculation unit 6 forms a part of the operating control means. When the silence detection signal Vout is rOJ, the number of consecutive blocks determined to be silent is counted. When the count value reaches a predetermined value, the average value of the entire predetermined number of blocks is determined, and the adaptive value threshold AP of the following equation (2) is calculated.

ＡＰ−α・Ｆ定数αは１より大きな値をとり、背景雑音のレベルに対
して音声検出器が誤って有音判定することがないように
設定される任意の値である。AP-α·F The constant α takes a value larger than 1 and is an arbitrary value set so that the voice detector does not erroneously determine the presence of a voice based on the level of background noise.

次に、本実、雄側の動作について説明する。本音声検出
器は基本的に、入力信号の所定の数のブロックの平均ノ
’ｅワーが適応ノヤワー閾値より大きいとき、そのブロ
ックを有音と判定する。−方、適応パワー閾値は背景雑
音に比例し、しかも背景雑音を誤って有音と判定するこ
とがないような値として計算されている。背景雑音の電
力が高くなると、適応パワー閾値も高くなるので、その
ままでは欠落する弱音声部分も増えるでちろう。しかし
本実施例では、この欠落するはずの弱音声部分を保護す
るために、ハングオーバ制御部５は、適応パワー閾値の
大きさに従って長く可変するハングオーバ時間を発生す
る。Next, the operation of the male side will be explained. Basically, the speech detector determines that a block of a predetermined number of blocks of the input signal is speech-active when the average noise of the block is greater than an adaptive noise threshold. On the other hand, the adaptive power threshold is calculated as a value that is proportional to the background noise and does not erroneously determine that the background noise is a voice. As the power of the background noise increases, the adaptive power threshold also increases, so if left as is, the number of weak voice parts that will be missed will increase. However, in this embodiment, in order to protect this weak audio portion that should be lost, the hangover control unit 5 generates a hangover time that varies depending on the magnitude of the adaptive power threshold.

適応ノクワー閾値が高くなると無音時間が増加するので
、適応パワー閾値の増加に応じてハングオーバ時間を長
く可変する好適な関係を設定すれば、アクティビティを
増加させることなく弱音声部分の欠落を防止できる。As the adaptive power threshold increases, the silent time increases, so by setting a suitable relationship in which the hangover time is lengthened and varied in accordance with the increase in the adaptive power threshold, it is possible to prevent the loss of weak audio parts without increasing activity.

次に、第３図の背景雑音レベルと音声検出器のアクティ
ビティの関係を用いて、本実施例の音声検出器と従来例
の音声検出器の効果の差異を説明する。第３図で実線Ａ
は本実施例で得られた特性である。第２図に示したよう
に本実施例でハ、５０ングハングオ一パ時間ＨＬ　ハ４
０ｍｇ〜１６０ｍ５ｔで可変され、ショートハングオー
バ時間Ｈ８は２０ｍ５に固定されている。実線Ｂ。Next, the difference in effectiveness between the voice detector of this embodiment and the conventional voice detector will be explained using the relationship between the background noise level and the activity of the voice detector shown in FIG. Solid line A in Figure 3
is the characteristic obtained in this example. As shown in FIG.
It is variable from 0mg to 160m5t, and the short hangover time H8 is fixed at 20m5. Solid line B.

Ｃは従来の音声検出器の特性で、曲線Ｂはロングハング
オーバ時間ＨＬを１６０ｍ５．ショートハングオーバ時
間Ｈ８を２０ｍ５に固定したもので、同Ｃはロングハン
グオーバ時間ＨＬを４０ｍ５　、ショートハングオーバ
時間Ｈ３を２０　ｍｓとしたものである。ただし、従来
の音声検出器の場合、有音・無音判定部および適応・ぐ
ワー閾値算出部は本実施例のものと同一のものを使用し
た。Curve C shows the characteristics of a conventional voice detector, and curve B shows the long hangover time HL of 160m5. The short hangover time H8 is fixed at 20 m5, and the case C has the long hangover time HL set at 40 m5 and the short hangover time H3 at 20 ms. However, in the case of the conventional speech detector, the speech presence/non-speech determination section and the adaptive voice threshold calculation section were the same as those of this embodiment.

特性Ｂでは、ロングハングオーバ時間ＨＬｋ比較的長い
値１６０ｍ５に固定しているので、背景雑音が小さいと
きは、欠落する弱音声部分よりハングオーパ時間ＨＬが
長くなってアクティビティが増加している。一方、特性
Ｃでは、ロングハングオーバ時間ＭＬが比較的短い値４
０ｍＢであるので、背景雑音レベルが高くなると欠落す
る弱音声部分が増加することにより、アクティビティが
減少している。これらに対して本実施例の特性Ａは、背
景雑音の高低に係わらずアクティビティが実質的に一定
となっているので、安定した音声品質を保゛っている。In characteristic B, the long hangover time HLk is fixed at a relatively long value of 160 m5, so when the background noise is small, the hangover time HL becomes longer than the missing weak audio portion, and the activity increases. On the other hand, in characteristic C, the long hangover time ML is a relatively short value 4.
Since it is 0 mB, as the background noise level increases, the number of weak voice parts that are missing increases, and the activity decreases. In contrast, characteristic A of this embodiment maintains stable voice quality because the activity is substantially constant regardless of the level of background noise.

（′発明の効果）以上説明したように本発明によれば、適応パワー閾値が
大きくなるに従い、ハングオーバ時間を長く可変するよ
うにしたので、背景雑音の電力レベルの高低にかかわら
ず、アクティビティが適切に維持され、語中、語切れを
防止し良好な音声品質を実現できる。('Effects of the Invention) As explained above, according to the present invention, the hangover time is made variable as the adaptive power threshold increases, so that the activity can be adjusted appropriately regardless of the high or low power level of the background noise. It is possible to maintain good speech quality by preventing mid-word and break-offs.

【図面の簡単な説明】[Brief explanation of the drawing]

第１図は本発明の音声検出器の一実施例を示すブロック
図、第２図は、第１図に示す実施例における音声検出器の背
景雑音の・ぐワーレベルとアクティビティの関係を示す
図、第３図は適応パワー閾値とハングオーバ時間の対応関係
を同実施例と従来例について示す説明図、第４図は従来例の音声検出器を示すブロック図である。主要部分の符号の説明１．２１・・・入力端子２．２２・・・出力端子３．１１・・・パワー算出部４．１２・・・有音・無音判定部５．１３・・・ハングオーバ制御部６・・・適応・やワー閾値算出部１０・・・符号変換回路１４・・・閾値算出部特許出願人　沖電気工業株式会社代　理　人　　香　　　取　　　孝　　　雑光　　　山
　　　隆　　　夫迦忘・１ぐワーｌＡ１ヒ八゛ノフ−Ｘ利く１藩（せ忘・
間係、の廿））著、２凹宅ｔｋ　自　レベ′ノソ　　（ｃｉＢｒｎｏン誓簀」上
白の電力レベルともｙ才今土路のア７カピ゛すイ偽閘係
伎条雫１の告ヤ蝉叱器幕４　都FIG. 1 is a block diagram showing an embodiment of the speech detector of the present invention; FIG. 2 is a diagram showing the relationship between the background noise level and activity of the speech detector in the embodiment shown in FIG. 1; FIG. 3 is an explanatory diagram showing the correspondence between adaptive power thresholds and hangover times for the same embodiment and a conventional example, and FIG. 4 is a block diagram showing a conventional audio detector. Explanation of symbols of main parts 1.21...Input terminal 2.22...Output terminal 3.11...Power calculation section 4.12...Sound/silence determination section 5.13...Hangover Control unit 6...adaptive/power threshold calculation unit 10...code conversion circuit 14...threshold calculation unit Patent applicant Oki Electric Industry Co., Ltd. Agent Takashi Katori Zamitsu Takashi Yama Fukanbo 1 Guwara1A1hi8nov -
Written by Shizuku Shizuku Kijo, a fake locksmith who has a power level of 200,000,000,000,000,000,000,000,000. Cicada Shoukimaku 4 Miyako

Claims

【特許請求の範囲】１、音声信号を含む入力信号の電力を算出する電力算出
手段と、前記入力信号の電力を閾値と比較して、該閾値に対して
該入力信号の電力が大きいときは有音状態をとり小さい
ときは無音状態をとる判定信号を発生する比較判定手段
と、該判定信号を受けて該判定信号に応じた検出信号を出力
し、該判定信号が有音状態から無音状態に変化したとき
は該検出信号をハングオーバ時間だけさらに有音状態に
維持するハングオーバ制御手段とを有し、該ハングオーバ制御手段は、前記電力算出手段で算出した電力に適応的に応動する閾
値を前記比較判定手段に与え、前記ハングオーバ時間を該閾値に関連して可変とするこ
とを特徴とする音声検出器。２、ハングオーバ制御手段には、第１のハングオーバ時
間、および第１のハングオーバ時間より長い第２のハン
グオーバ時間が設定され、第２のハングオーバ時間が前
記閾値に関連して可変であることを特徴とする請求項１
の音声検出器。３、電力算出手段は、前記入力信号について所定の長さ
のブロックごとに電力を算出し、前記ハングオーバ制御
手段は、前記検出信号が第２の状態をとっているときは
その間連続する前記ブロックについて平均の電力を算出
し、該算出された平均電力に関連して前記閾値を設定す
ることを特徴とする請求項１の音声検出器。[Claims] 1. A power calculation means for calculating the power of an input signal including an audio signal, and comparing the power of the input signal with a threshold, and when the power of the input signal is larger than the threshold, Comparison/judgment means for generating a determination signal for a sound state and a silent state when the sound is small; and hangover control means for further maintaining the detection signal in a sound state for a hangover time when the detection signal changes to , and the hangover control means is configured to set a threshold value adaptively responsive to the power calculated by the power calculation means. A voice detector, characterized in that the hangover time is given to a comparison/determination means, and the hangover time is made variable in relation to the threshold. 2. The hangover control means is set with a first hangover time and a second hangover time longer than the first hangover time, and the second hangover time is variable in relation to the threshold value. Claim 1
voice detector. 3. The power calculation means calculates the power for each block of a predetermined length with respect to the input signal, and the hangover control means calculates the power for the continuous blocks when the detection signal is in the second state. 2. The voice detector according to claim 1, wherein an average power is calculated and the threshold value is set in relation to the calculated average power.