JP4256631B2

JP4256631B2 - Auto gain control device

Info

Publication number: JP4256631B2
Application number: JP2002162029A
Authority: JP
Inventors: 良二阿部
Original assignee: Panasonic Corp; Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Corp; Panasonic Holdings Corp
Priority date: 2002-06-03
Filing date: 2002-06-03
Publication date: 2009-04-22
Anticipated expiration: 2022-06-03
Also published as: JP2004015125A

Description

【０００１】
【発明の属する技術分野】
本発明は、音声入力のレベルに依存せずに出力信号のレベルを一定にするオートゲインコントロール装置に関する。
【０００２】
【従来の技術】
従来、この種のオートゲインコントロール装置は、図８のように示される。このオートゲインコントロール装置では、音声入力端子１０１から入力された音声信号は、絶対値変換手段１０３により絶対値変換され、ゲイン補正手段１０４により、所定のゲインに正規化され、補正される。
【０００３】
補正されたゲインは、スレッショルドレベル格納手段１０５に予め格納されていたスレッショルドレベル係数と比較手段１０６により比較される。
【０００４】
ゲイン発生手段１０７は、ゲインがスレッショルドレベルよりも大きい場合は、１より小さいゲインを発生し、出力信号としての出力レベルは減衰される。
【０００５】
ゲインがスレッショルドレベルよりも小さい場合は、ゲイン発生手段１０７は、１より大きいゲインを発生し、出力信号としての出力レベルは増幅される。
【０００６】
なお、急激な入力信号の変化に対して不自然無く動作させるために、放絡線発生手段１０８は、ゲイン発生手段１０７により発生されるゲイン値の変動量を緩やかにし、ゲイン変化の不自然さを無くす。
【０００７】
出力用乗算手段１０９は、放絡線発生手段１０８により出力される時定数を持ったゲイン値と、音声入力端子１０１から入力された音声信号データとを乗算し、その出力値を出力信号として音声出力端子１０２から出力する。
【０００８】
このようにして、スレッショルドレベル付近で一定の出力信号レベルを保つことが可能となる。
【０００９】
【発明が解決しようとする課題】
しかし、このような従来のオートゲインコントロール装置では、入力音声信号の周波数特性が考慮されていないため、入力音声信号の周波数特性に依存して、オートゲインコントロール装置の動作がばらつくという問題があった。
【００１０】
例えば、高い周波数成分を持ったノイズが間違って入力された場合でも、通常の音声信号と同様にゲインがコントロールされてしまうことがある。暗騒音などの低域の騒音も誤動作の原因となる。
【００１１】
また、周波数特性が全く異なるアナウンスの音声信号と音楽ソースなどの音声信号では、オートゲインコントロールが動作するレベルが異なってしまい、オートゲインコントロールの動作が入力音声信号のソースに依存してしまう、という問題もあった。
【００１２】
本発明はこのような問題を解決するためになされたもので、所定の入力音声信号に対して良好な特性を実現することができるオートゲインコントロール装置を提供するものである。
【００１３】
【課題を解決するための手段】
本発明のオートゲインコントロール装置は、入力音声信号を複数の周波数帯域の異なる信号に分割する帯域分割フィルタ手段と、前記複数の周波数帯域の異なる信号それぞれを絶対値変換する絶対値変換手段と、前記絶対値変換手段で絶対値変換されたそれぞれの値に対する重み付けデータを格納する重み付けデータ格納手段と、前記絶対値変換手段で絶対値変換されたそれぞれの値と前記重み付けデータ格納手段に格納されている重み付けデータとを乗算する乗算手段と、前記乗算手段で乗算されたそれぞれの値を加算することにより前記周波数帯域ごとに重み付けされた音声信号の全周波数帯域に対するゲイン値を出力する加算手段と、前記加算手段の出力のゲイン値を補正するゲイン補正手段と、前記ゲイン補正手段により補正されたゲイン値とスレッショルドレベルとを比較する比較手段と、前記比較手段の比較結果に基づいてゲイン値を発生するゲイン発生手段とを備え、前記ゲイン発生手段から出力されるゲイン値に基づいて前記入力音声信号のゲイン値をコントロールする構成を有している。
【００１４】
この構成により、入力音声信号が所定の周波数帯域毎に重み付けされ、重み付けされた信号に基づいてオートゲインコントロールが行なわれることとなる。
【００１５】
ここで、前記重み付けデータ格納手段に複数組の重み付けデータを格納し、前記乗算手段に出力する前記重み付けデータを前記複数組の重み付けデータの中で切り換える重み付けデータ切換え手段を備える構成とした。
【００１６】
この構成により、入力音声信号の周波数帯域毎の重み付けが切り換えられることとなる。
【００１７】
また、前記ゲイン発生手段が発生するゲイン値を所定時間維持するホールド制御手段と、前記ホールド制御手段でゲイン値を維持する時間を複数格納するホールドタイムデータ格納手段とを備え、前記ホールドタイムデータ格納手段は、前記重み付けデータ切換え手段が切り換えた前記重み付けデータに対応する前記ゲイン値を維持する時間を前記ホールド制御手段に設定する構成とした。
【００１８】
この構成により、切り換えられた入力音声信号の周波数帯域毎の重み付けに対応して、発生したゲインの維持時間も切り換えられることとなる。
【００１９】
また、入力音声信号をＦＦＴ処理により周波数データに変換する変換処理手段を備え、前記重み付けデータ切換え手段は、前記周波数データに対応する重み付けデータに切り換える構成とした。
【００２０】
この構成により、入力音声信号の周波数特性に適する重み付けに自動的に切り換えられることとなる。
【００２１】
【発明の実施の形態】
以下、本発明の実施の形態について、図面を参照して説明する。
図１〜図４は本発明の一実施の形態のオートゲインコントロール装置を示す図である。
【００２２】
図１に示すように、本実施の形態のオートゲインコントロール装置は、音声信号が入力される音声入力端子１と、ゲインコントロールされた音声信号が出力される音声出力端子２と、予め設定された互いに異なった周波数帯域の信号のみ通過させる複数の帯域分割フィルタ手段３と、帯域分割フィルタ手段３それぞれの出力する信号を絶対値変換する絶対値変換手段４と、帯域分割フィルタ手段３で分割された周波数帯域毎に重み付けするデータを格納しておく重み付けデータ格納手段５と、絶対値変換された値に重み付けデータを乗算する乗算手段６と、重み付けした値を加算する加算手段７と、加算された値を補正するゲイン補正手段８と、補正された値と比較されるスレッショルドレベルデータを格納しておくスレッショルドレベル格納手段９と、補正された値とスレッショルドレベルデータを比較する比較手段１０と、比較手段１０の比較結果によりゲイン値を発生するゲイン発生手段１１と、ゲイン値の変動量を緩やかにする放絡線発生手段１２と、放絡線発生手段１２が出力するゲイン値を入力音声信号に乗算する出力信号用乗算手段１３とを備えている。
【００２３】
このようなオートゲインコントロール装置において、音声入力端子１に入力された音声信号は、複数の帯域分割フィルタ手段３にそれぞれ入力される。
【００２４】
各帯域分割フィルタ手段３は、帯域分割フィルタ手段３毎に予め設定されたそれぞれ異なる周波数帯域の信号のみを出力する。
【００２５】
例えば、想定する入力ソースの周波数特性が図２のような特性である場合、図３（ａ）から（ｄ）のような周波数特性を持った４個の帯域分割フィルタ手段３を使用する。
【００２６】
これら４個の帯域分割フィルタ手段３の加算特性は図４のようになり、この加算特性は、図２の想定される入力ソースの特性に近似させている。この帯域分割フィルタ手段３による分割の数を多くすれば入力ソースを忠実に近似することができ、入力ソースに合った良好なゲインコントロールを行なうことができる。
【００２７】
このような帯域分割フィルタ手段３の出力信号は、それぞれ絶対値変換手段４に入力され、各帯域ごとに絶対値変換される。
【００２８】
重み付けデータ格納手段５には、各帯域分割フィルタ手段３が出力した信号を絶対値変換したそれぞれの値に対する重み付けデータを格納しておく。
【００２９】
この重み付けデータは、想定される入力ソースの周波数帯域の特性を反映し、ゲインコントロールの中心となる周波数帯域の重み付けデータほど値を大きく設定する。
【００３０】
例えば、上述の図３のような周波数特性の４個の帯域分割フィルタ手段３を用いたとき、３番目のフィルタ出力（図３（ｃ）の特性を持つ帯域分割フィルタ手段３の出力）に対する重み付けを大きくすると、ゲインコントロールにおける３番目のフィルタ出力の帯域（図３（ｃ）の特性を持つ帯域分割フィルタ手段３で分割された周波数帯域）の寄与を大きくすることとなる。
【００３１】
重み付けデータ格納手段５に格納されている帯域分割フィルタ手段３で帯域分割された周波数帯域毎の重み付けデータは、それぞれ対応する帯域分割フィルタ手段３の出力が絶対値変換された値に対して、乗算手段６で乗算され、乗算手段で乗算されたすべての値は加算手段７で加算される。
【００３２】
このようにして加算手段７から出力される信号は、予め重み付けデータ格納手段５に格納された所定の帯域分割フィルタ手段３に対応した音声帯域モデルデータによる重み付けデータによって重み付けされた信号となる。
【００３３】
この加算手段７の出力（ゲイン値）は、音声帯域モデルデータの周波数特性を持った音声信号が入力された場合、所定のゲイン値に近づくことになる。
【００３４】
ゲイン補正手段８は、加算手段７の出力に予め設定された係数を乗算して補正する。このゲイン補正手段８により、ゲイン値の絶対値に関しては調整することができるので、重み付けデータは、周波数帯域毎の相対値のみを考慮すればよい。
【００３５】
ゲイン補正手段８により補正されたゲイン値は、比較手段１０において、スレッショルドレベル格納手段９に格納されているスレッショルドレベルデータと比較される。
【００３６】
ゲイン発生手段１１は、ゲイン値がスレッショルドレベルデータよりも大きい場合、予め設定された１より小さいゲイン値を発生する。このため、出力信号の出力レベルは減衰し、スレッショルドレベルに近づくことになる。
【００３７】
ゲイン値が、スレッショルドレベルよりも小さい場合、ゲイン発生手段１１は、予め設定された１より大きいゲイン値を発生する。このため、出力信号の出力レベルは増幅され、スレッショルドレベルに近づくことになる。
【００３８】
特定の帯域分割フィルタ手段３の出力に対する重み付けを大きくすると、この帯域分割フィルタ手段３が分割する周波数帯域の周波数成分を多く含んだ音源が入力されると、入力信号に伴った加算手段７の出力値が大きくなり、スレッショルドレベルを越えやすくなり、特定の周波数帯域に連動したオートゲインコントロールの動作特性を得ることができる。
【００３９】
ゲイン発生手段１１により発生されたゲイン値は、放絡線発生手段１２に入力され、予め決められた時定数を持つアタックタイムとリリースタイムを持つように制御される。
【００４０】
この場合の、アタックタイムはゲイン発生手段１１が発生するゲイン値が減少する場合のゲイン値を変化させる時定数であり、リリースタイムはゲイン発生手段１１が発生するゲイン値が増加する場合のゲイン値を変化させる時定数である。
【００４１】
このアタックタイムとリリースタイムの設定により、急激なレベルの変化への対応（急激なレベルの変化に追従させるか否か）を設定することができる。
【００４２】
出力信号用乗算手段１３は、放絡線発生手段１２により時定数を持ったゲイン値と、音声入力端子１から入力された音声信号とを乗算し、この出力信号用乗算手段１３による出力信号がオートゲインコントロール装置の出力信号となる。
【００４３】
このように本実施の形態においては、所定の音声の周波数特性を帯域分割フィルタ手段３で分割近似しているので、所定の音声信号を狙ったオートゲインコントロールを行なうことができる。
【００４４】
また、帯域分割フィルタ手段３で分割した帯域別に重み付けをしているため、想定していない帯域の音声信号が入力されたとしても、その帯域の重み付け係数は小さいので、ゲイン値の変化が少なくなり、ゲイン発生手段１１によるゲイン値変動がなくなり、音声出力端子２から出力される信号のレベル変動を無くすことができる。
【００４５】
また、重み付け係数による重み付けの量により、音声ソースごとの特徴のある周波数帯域のみを強調することができるため、それ以外の周波数特性を持った音声ソースに対しては、一定のゲインを保つことができ、狙った周波数特性を持った音声ソースのみに反応させることができる。
【００４６】
本実施の形態の第１の他の態様としては、図５に示すように、重み付けデータ格納手段５に、数種類の音声ソース別の各分割帯域毎の重み付けデータを格納しておく。
【００４７】
例えば、男性アナウンス、女性アナウンスの周波数特性の平均分布により、周波数特性が異なっているように、用途やアナウンス対象の特徴により、音声帯域別重み付けデータを用意する。これは、予め測定し統計的にまとめ、平均の周波数特性の分布を重み付けデータとすればよい。
【００４８】
重み付けデータ切換え手段１４は、使用者の操作により重み付けデータ格納手段５が乗算手段６に出力する重み付けデータを切換えさせる。
【００４９】
重み付けデータ切換え手段１４は、物理的なスイッチの構造でもよいし、パソコンのGUI（グラフィカル・ユーザー・インターフェイス）上のスイッチでも良い。
【００５０】
例えば、男性アナウンスと女性アナウンスの周波数特性に基づいて、それぞれに対応した重み付けデータを重み付けデータ格納手段５に格納しておき、男性アナウンスの音声信号に対してオートゲインコントロール装置を使用する場合は、重み付けデータ切換え手段１４により男性アナウンスの周波数特性に基づいた重み付けデータを乗算手段６に出力するようにする。
【００５１】
また、女性アナウンス音声信号に対してオートゲインコントロール装置を使用する場合は、重み付けデータ切換え手段１４により女性アナウンスの周波数特性に基づいた重み付けデータを乗算手段６に出力するように切換える。
【００５２】
また、特定の人物の周波数特性のデータを予め測定しておき、そのデータを音声帯域別データモデルとして重み付けデータ格納手段５に格納しておいてもよい。
【００５３】
このように構成することによって、重み付けデータ切換え手段１４により音声ソースに合った重み付けデータを選択することができ、音声ソース固有の周波数特性に対応したオートゲインコントロールを行なうことができる。
【００５４】
本実施の形態の第２の他の態様としては、図６に示すように、上述の第１の他の態様と同様に、重み付けデータ格納手段５に、数種類の音声ソース別の各分割帯域毎の重み付けデータを格納し、重み付けデータ格納手段５が乗算手段６に出力する重み付けデータを切換えさせる重み付けデータ切換え手段１４を備える。
【００５５】
また、比較手段１０の判定結果によりゲイン発生手段１１のゲインを一定時間だけホールドさせるホールド制御回路１５と、ホールド制御回路１５のホールド時間のデータ（ホールドタイムデータ）を複数格納し重み付けデータ切換え手段１４の切換え指示により格納しているホールドタイムデータから１つを選択してホールド制御回路に出力設定するホールドタイムデータ格納手段１６とを備えている。
【００５６】
このようなオートゲインコントロール装置において、使用者が重み付けデータ切換え手段１４を操作して１つの重み付けデータを選択すると、重み付けデータ格納手段５は選択された重み付けデータを乗算手段６に出力する。
【００５７】
また、ホールドタイムデータ格納手段１６は、重み付けデータ切換え手段１４で選択された重み付けデータに対応したホールドタイムデータをホールド制御回路１５に出力する。
【００５８】
比較手段１０は、ゲイン補正手段８によって補正されたゲイン値と、スレッショルドレベル格納手段９に格納されているスレッショルドレベルデータとを比較するとともに、ゲイン値とスレッショルドレベルデータとの差分の変動量が予め設定された値以上になるとホールド制御回路１５にトリガーを出力する。
【００５９】
ホールド制御回路１５は、トリガーが入力されると、内部のタイマーを起動し、ホールドタイムデータ格納手段１６が出力し設定したホールドタイムデータの時間だけゲイン発生手段１１で発生するゲイン値を持続させる。
【００６０】
このように構成することによって、重み付けデータの切換えと連動してゲインのホールドタイムも音声ソースに対応して切換えることができ、出力レベルの不自然なゲイン変動を音声入力ソース別に抑えることができる。
【００６１】
例えば、ゲイン変動量が大きい音声ソースの場合に、出来るだけ出力レベルの変動が不自然にならないようにするときは、この音声ソースに適用される重み付けデータに対応してホールドタイムデータ格納手段１６に格納するホールドタイムデータとして長めの時間を格納しておくとよい。
【００６２】
また、音声ソースのゲイン変動量が推測できず、出力レベルの追従レスポンスを速くしたい場合は、この音声ソースに適用される重み付けデータに対応してホールドタイムデータ格納手段１６に格納するホールドタイムデータとして短い時間を格納しておくとよい。
【００６３】
なお、ホールドタイムデータは、予め測定したデータに基づいて、音声ソース別に最適なゲイン変動をするように決めるとよい。
【００６４】
本実施の形態の第３の他の態様としては、図７に示すように、上述の第２の他の態様と同様に、重み付けデータ格納手段５に、数種類の音声ソース別の各分割帯域毎の重み付けデータを格納し、比較手段１０の判定結果によりゲイン発生手段１１のゲインを一定時間だけホールドさせるホールド制御回路１５と、格納しているホールドタイムデータから１つを選択してホールド制御回路に出力設定するホールドタイムデータ格納手段１６とを備える。
【００６５】
また、音声入力端子１に入力される音声信号に対して高速フーリエ変換（ＦＦＴ：Fast Fourier Transform）処理を行い、入力音声信号の周波数特性を分析する高速フーリエ変換（ＦＦＴ）処理手段１７と、高速フーリエ変換処理手段１７の分析結果に基づき登録されている数種類の音声帯域別モデルデータから最も近いモデルデータを選択するデータモデル判断手段１８とを備えている。
【００６６】
このようなオートゲインコントロール装置において、ＦＦＴ処理手段１７は、音声入力端子１に入力された音声信号に対してＦＦＴ処理を行い入力音声信号の周波数特性を分析し、分析結果情報をデータモデル判断手段１８に出力する。
【００６７】
データモデル判断手段１８は、ＦＦＴ処理手段１７の分析結果情報に基づいて、入力された音声信号の周波数特性と近い周波数特性を持つ音声帯域別モデルデータを選択し、選択した音声帯域別モデルデータに対応する重み付けデータを重み付けデータ格納手段５から選択する。
【００６８】
重み付けデータ格納手段５は、選択された重み付けデータを乗算手段６に出力する。
【００６９】
また、ホールドタイムデータ格納手段１６は、データモデル判断手段１８で選択された重み付けデータに対応したホールドタイムデータをホールド制御回路１５に出力する。
【００７０】
これ以外は上述の第２の他の態様と同様に、比較手段１０は、ゲイン値とスレッショルドレベルデータとの差分の変動量が予め設定された値以上になるとホールド制御回路１５にトリガーを出力する。
【００７１】
ホールド制御回路１５は、トリガーが入力されると、ホールドタイムデータ格納手段１６が設定したホールドタイムデータの時間だけゲイン発生手段１１で発生するゲイン値を持続させる。
【００７２】
なお、ＦＦＴ処理をＤＳＰ（Digital Signal Processor）等で処理させる場合は、ある程度の処理時間が必要となるため、この処理時間分の時間間隔でＦＦＴ処理を行なうようにする。
【００７３】
このように構成することによって、自動的に入力音声信号の周波数特性に合った音声帯域別データモデルに対応する重み付けデータ及びホールドタイムデータが選択され、入力音声信号に合ったオートゲインコントロールを自動的に行なうことができる。
【００７４】
【発明の効果】
以上説明したように、本発明によれば、入力音声信号を予め設定された周波数帯域毎に重み付けして、重み付けした信号に基づいてオートゲインコントロールすることにより、所定の周波数特性を持った入力音声信号のみに対して動作をし、想定する音声信号の周波数特性に追従することができる。
【図面の簡単な説明】
【図１】本発明の一実施の形態のオートゲインコントロール装置を示す概略ブロック図
【図２】本発明の一実施の形態のオートゲインコントロール装置の想定する入力ソースの周波数特性の例を示す図
【図３】本発明の一実施の形態のオートゲインコントロール装置の各帯域分割フィルタ手段の周波数特性の例を示す図
【図４】本発明の一実施の形態のオートゲインコントロール装置の帯域分割フィルタ手段の総合特性の例を示す図
【図５】本発明の一実施の形態の第１の他の態様のオートゲインコントロール装置を示す概略ブロック図
【図６】本発明の一実施の形態の第２の他の態様のオートゲインコントロール装置を示す概略ブロック図
【図７】本発明の一実施の形態の第３の他の態様のオートゲインコントロール装置を示す概略ブロック図
【図８】従来のオートゲインコントロール装置を示す概略ブロック図
【符号の説明】
１音声入力端子
２音声出力端子
３帯域分割フィルタ手段
４絶対値変換手段
５重み付けデータ格納手段
６乗算手段
７加算手段
８ゲイン補正手段
９スレッショルドレベル格納手段
１０比較手段
１１ゲイン発生手段
１２放絡線発生手段
１３出力信号用乗算手段
１４重み付けデータ切換え手段
１５ホールド制御回路
１６ホールドタイムデータ格納手段
１７高速フーリエ変換処理手段
１８データモデル判断手段
１０１音声入力端子
１０２音声出力端子
１０３絶対値変換手段
１０４ゲイン補正手段
１０５スレッショルドレベル格納手段
１０６比較手段
１０７ゲイン発生手段
１０８放絡線発生手段[0001]
BACKGROUND OF THE INVENTION
The present invention relates to an auto gain control device that makes an output signal level constant without depending on a sound input level.
[0002]
[Prior art]
Conventionally, this type of automatic gain control apparatus is shown in FIG. In this auto gain control device, the audio signal input from the audio input terminal 101 is subjected to absolute value conversion by the absolute value conversion means 103, and normalized and corrected to a predetermined gain by the gain correction means 104.
[0003]
The corrected gain is compared with the threshold level coefficient stored in advance in the threshold level storage means 105 by the comparison means 106.
[0004]
The gain generation means 107 generates a gain smaller than 1 when the gain is larger than the threshold level, and the output level as an output signal is attenuated.
[0005]
When the gain is smaller than the threshold level, the gain generation means 107 generates a gain larger than 1, and the output level as an output signal is amplified.
[0006]
Note that, in order to operate unnaturally with respect to a sudden change in input signal, the envelope generating means 108 moderates the amount of gain value fluctuation generated by the gain generating means 107, and the unnaturalness of the gain change. Is lost.
[0007]
The output multiplier 109 multiplies the gain value having the time constant output from the envelope generator 108 and the audio signal data input from the audio input terminal 101, and uses the output value as an output signal for audio. Output from the output terminal 102.
[0008]
In this way, a constant output signal level can be maintained near the threshold level.
[0009]
[Problems to be solved by the invention]
However, in such a conventional auto gain control device, since the frequency characteristic of the input audio signal is not considered, there is a problem that the operation of the auto gain control device varies depending on the frequency characteristic of the input audio signal. .
[0010]
For example, even when noise having a high frequency component is input by mistake, the gain may be controlled in the same manner as a normal audio signal. Low-frequency noise such as background noise can also cause malfunctions.
[0011]
Also, the level at which auto gain control operates is different between audio signals of announcements with completely different frequency characteristics and audio signals such as music sources, and the operation of auto gain control depends on the source of the input audio signal. There was also a problem.
[0012]
The present invention has been made to solve such a problem, and provides an auto gain control device capable of realizing good characteristics with respect to a predetermined input audio signal.
[0013]
[Means for Solving the Problems]
The auto gain control device of the present invention includes a band division filter unit that divides an input audio signal into signals having different frequency bands, an absolute value conversion unit that performs absolute value conversion on each of the signals having different frequency bands, and Weighted data storage means for storing weighted data for each value subjected to absolute value conversion by the absolute value conversion means, and each value subjected to absolute value conversion by the absolute value conversion means and stored in the weighted data storage means Multiplying means for multiplying weighted data, adding means for outputting gain values for all frequency bands of the audio signal weighted for each frequency band by adding the respective values multiplied by the multiplying means, and Gain correction means for correcting the gain value of the output of the addition means, and the gain correction means Comparing means for comparing an IN value and a threshold level, and gain generating means for generating a gain value based on a comparison result of the comparing means, and the input voice based on the gain value output from the gain generating means It has a configuration for controlling the gain value of the signal.
[0014]
With this configuration, the input audio signal is weighted for each predetermined frequency band, and auto gain control is performed based on the weighted signal.
[0015]
Here, a plurality of sets of weighting data are stored in the weighting data storage means, and weighting data switching means for switching the weighting data output to the multiplication means among the plurality of sets of weighting data is provided.
[0016]
With this configuration, the weighting for each frequency band of the input audio signal is switched.
[0017]
And a hold control means for maintaining a gain value generated by the gain generation means for a predetermined time; and a hold time data storage means for storing a plurality of times for maintaining the gain value by the hold control means. The means is configured to set a time for maintaining the gain value corresponding to the weighted data switched by the weighted data switching means in the hold control means.
[0018]
With this configuration, the generated gain maintenance time is also switched in accordance with the weighting of the switched input audio signal for each frequency band.
[0019]
In addition, conversion processing means for converting the input audio signal into frequency data by FFT processing is provided, and the weighting data switching means switches to weighting data corresponding to the frequency data.
[0020]
With this configuration, weighting suitable for the frequency characteristics of the input audio signal is automatically switched.
[0021]
DETAILED DESCRIPTION OF THE INVENTION
Embodiments of the present invention will be described below with reference to the drawings.
1 to 4 are diagrams showing an automatic gain control apparatus according to an embodiment of the present invention.
[0022]
As shown in FIG. 1, the auto gain control device according to the present embodiment has an audio input terminal 1 to which an audio signal is input, an audio output terminal 2 to which a gain-controlled audio signal is output, and preset. Divided by a plurality of band division filter means 3 for passing only signals in different frequency bands, an absolute value conversion means 4 for absolute value conversion of signals output from each of the band division filter means 3, and a band division filter means 3. Weighted data storage means 5 for storing data to be weighted for each frequency band, multiplication means 6 for multiplying the absolute value converted value by weighting data, addition means 7 for adding the weighted value, and addition Gain correction means 8 for correcting the value and threshold level data for storing threshold level data to be compared with the corrected value Means 9; comparison means 10 for comparing the corrected value and threshold level data; gain generation means 11 for generating a gain value based on the comparison result of comparison means 10; The generating means 12 and the output signal multiplying means 13 for multiplying the input audio signal by the gain value output from the envelope generating means 12 are provided.
[0023]
In such an automatic gain control device, the audio signal input to the audio input terminal 1 is input to the plurality of band division filter means 3, respectively.
[0024]
Each band division filter unit 3 outputs only signals in different frequency bands set in advance for each band division filter unit 3.
[0025]
For example, if the assumed frequency characteristics of the input source are as shown in FIG. 2, four band division filter means 3 having frequency characteristics as shown in FIGS. 3A to 3D are used.
[0026]
The addition characteristics of these four band division filter means 3 are as shown in FIG. 4, and this addition characteristic is approximated to the characteristics of the assumed input source in FIG. If the number of divisions by the band dividing filter means 3 is increased, the input source can be approximated faithfully, and good gain control suitable for the input source can be performed.
[0027]
Such output signals of the band division filter unit 3 are respectively input to the absolute value conversion unit 4 and are subjected to absolute value conversion for each band.
[0028]
The weighting data storage means 5 stores weighting data for each value obtained by performing absolute value conversion on the signal output from each band division filter means 3.
[0029]
This weighted data reflects the characteristics of the frequency band of the assumed input source, and the value is set larger as the weighted data in the frequency band that becomes the center of gain control.
[0030]
For example, when the four band division filter means 3 having the frequency characteristics as shown in FIG. 3 are used, the third filter output (the output of the band division filter means 3 having the characteristics shown in FIG. 3C) is weighted. Is increased, the contribution of the band of the third filter output in the gain control (the frequency band divided by the band dividing filter means 3 having the characteristics shown in FIG. 3C) is increased.
[0031]
The weighted data for each frequency band divided by the band division filter means 3 stored in the weighting data storage means 5 is multiplied by the absolute value converted output of the corresponding band division filter means 3. All values multiplied by the means 6 and multiplied by the multiplying means are added by the adding means 7.
[0032]
The signal output from the adding means 7 in this way is a signal weighted by the weighting data based on the voice band model data corresponding to the predetermined band division filter means 3 stored in the weighting data storage means 5 in advance.
[0033]
The output (gain value) of the adding means 7 approaches a predetermined gain value when an audio signal having the frequency characteristics of the audio band model data is input.
[0034]
The gain correcting unit 8 multiplies the output of the adding unit 7 by a preset coefficient for correction. Since the gain correction means 8 can adjust the absolute value of the gain value, the weighting data need only take into account the relative value for each frequency band.
[0035]
The gain value corrected by the gain correction means 8 is compared with the threshold level data stored in the threshold level storage means 9 in the comparison means 10.
[0036]
The gain generation means 11 generates a gain value smaller than a preset one when the gain value is larger than the threshold level data. For this reason, the output level of the output signal attenuates and approaches the threshold level.
[0037]
When the gain value is smaller than the threshold level, the gain generating means 11 generates a gain value larger than 1, which is set in advance. As a result, the output level of the output signal is amplified and approaches the threshold level.
[0038]
When the weighting on the output of the specific band division filter unit 3 is increased, when a sound source including a large number of frequency components of the frequency band divided by the band division filter unit 3 is input, the output of the addition unit 7 accompanying the input signal The value increases, the threshold level is easily exceeded, and the operating characteristics of auto gain control linked to a specific frequency band can be obtained.
[0039]
The gain value generated by the gain generation means 11 is input to the envelope generation means 12 and controlled so as to have an attack time and a release time having a predetermined time constant.
[0040]
In this case, the attack time is a time constant for changing the gain value when the gain value generated by the gain generating means 11 decreases, and the release time is the gain value when the gain value generated by the gain generating means 11 increases. Is a time constant that changes.
[0041]
By setting the attack time and the release time, it is possible to set a response to a sudden level change (whether to follow a sudden level change).
[0042]
The output signal multiplier 13 multiplies the gain value having a time constant by the envelope generator 12 and the audio signal input from the audio input terminal 1, and the output signal from the output signal multiplier 13 is This is the output signal of the auto gain control device.
[0043]
As described above, in the present embodiment, since the frequency characteristics of a predetermined sound are divided and approximated by the band division filter means 3, it is possible to perform auto gain control targeting a predetermined sound signal.
[0044]
Further, since weighting is performed for each band divided by the band dividing filter means 3, even if an audio signal of an unexpected band is input, the weighting coefficient of the band is small, so that the change in the gain value is reduced. Thus, the gain value fluctuation due to the gain generation means 11 is eliminated, and the level fluctuation of the signal output from the audio output terminal 2 can be eliminated.
[0045]
In addition, only the characteristic frequency band for each audio source can be emphasized by the amount of weighting by the weighting coefficient, so that a constant gain can be maintained for audio sources having other frequency characteristics. It can respond only to the audio source with the targeted frequency characteristics.
[0046]
As a first other aspect of the present embodiment, as shown in FIG. 5, weighting data for each divided band for each of several types of audio sources is stored in the weighting data storage means 5.
[0047]
For example, weighting data for each voice band is prepared depending on the use and the characteristics of the announcement target so that the frequency characteristics are different depending on the average distribution of the frequency characteristics of male announcements and female announcements. This may be measured in advance and statistically summarized, and an average frequency characteristic distribution may be used as weighting data.
[0048]
The weighting data switching unit 14 switches the weighting data output from the weighting data storage unit 5 to the multiplication unit 6 by a user operation.
[0049]
The weighting data switching means 14 may be a physical switch structure or a switch on a personal computer GUI (graphical user interface).
[0050]
For example, based on the frequency characteristics of a male announcement and a female announcement, weighting data corresponding to each is stored in the weighting data storage means 5, and when using an auto gain control device for a male announcement voice signal, Weighting data switching means 14 outputs weighting data based on the frequency characteristics of the male announcement to multiplication means 6.
[0051]
When the automatic gain control device is used for the female announcement voice signal, the weighting data switching means 14 is switched so as to output the weighting data based on the frequency characteristic of the female announcement to the multiplication means 6.
[0052]
Alternatively, frequency characteristic data of a specific person may be measured in advance, and the data may be stored in the weighted data storage unit 5 as a data model for each voice band.
[0053]
With this configuration, weighting data suitable for the audio source can be selected by the weighting data switching means 14, and automatic gain control corresponding to the frequency characteristics unique to the audio source can be performed.
[0054]
As the second other aspect of the present embodiment, as shown in FIG. 6, as in the first other aspect described above, the weighted data storage unit 5 stores each divided band for each of several types of audio sources. Weighting data switching means 14 for switching weighting data output from the weighting data storage means 5 to the multiplication means 6.
[0055]
Further, a hold control circuit 15 for holding the gain of the gain generation means 11 for a predetermined time based on the determination result of the comparison means 10 and a plurality of hold time data (hold time data) for the hold control circuit 15 are stored and the weighted data switching means 14 is stored. Hold time data storage means 16 for selecting one of the hold time data stored by the switching instruction and setting the output to the hold control circuit.
[0056]
In such an automatic gain control device, when the user operates the weighting data switching means 14 and selects one weighting data, the weighting data storage means 5 outputs the selected weighting data to the multiplication means 6.
[0057]
Further, the hold time data storage means 16 outputs hold time data corresponding to the weight data selected by the weight data switching means 14 to the hold control circuit 15.
[0058]
The comparison means 10 compares the gain value corrected by the gain correction means 8 with the threshold level data stored in the threshold level storage means 9, and the variation amount of the difference between the gain value and the threshold level data is previously determined. When the set value is exceeded, a trigger is output to the hold control circuit 15.
[0059]
When a trigger is input, the hold control circuit 15 starts an internal timer, and maintains the gain value generated by the gain generation means 11 for the time of the hold time data output and set by the hold time data storage means 16.
[0060]
With this configuration, the gain hold time can be switched corresponding to the audio source in conjunction with the switching of the weighting data, and unnatural gain fluctuations in the output level can be suppressed for each audio input source.
[0061]
For example, in the case of an audio source with a large amount of gain fluctuation, in order to prevent the output level fluctuation from becoming unnatural as much as possible, the hold time data storage means 16 corresponds to the weighting data applied to this audio source. A longer time may be stored as hold time data to be stored.
[0062]
Further, when the gain variation amount of the audio source cannot be estimated and it is desired to make the output level follow-up response faster, the hold time data stored in the hold time data storage means 16 corresponding to the weighting data applied to the audio source is used. It is good to store a short time.
[0063]
Note that the hold time data may be determined so as to change the gain appropriately for each audio source based on data measured in advance.
[0064]
As the third other aspect of the present embodiment, as shown in FIG. 7, as in the second other aspect described above, the weighted data storage unit 5 stores each divided band for each of several types of audio sources. Weight control data is stored, and the hold control circuit 15 for holding the gain of the gain generation means 11 for a predetermined time based on the determination result of the comparison means 10, and one of the stored hold time data is selected to the hold control circuit. Hold time data storage means 16 for setting the output.
[0065]
Also, fast Fourier transform (FFT) processing is performed on the sound signal input to the sound input terminal 1 to analyze the frequency characteristics of the input sound signal, and high-speed processing is performed. And a data model determination means 18 for selecting the closest model data from several types of model data classified by voice band based on the analysis result of the Fourier transform processing means 17.
[0066]
In such an automatic gain control apparatus, the FFT processing means 17 performs FFT processing on the audio signal input to the audio input terminal 1 to analyze the frequency characteristic of the input audio signal, and the analysis result information is used as data model determination means. 18 is output.
[0067]
Based on the analysis result information of the FFT processing unit 17, the data model determination unit 18 selects model data for each voice band having a frequency characteristic close to the frequency characteristic of the input voice signal, and selects the selected model data for each voice band. Corresponding weight data is selected from the weight data storage means 5.
[0068]
The weighting data storage unit 5 outputs the selected weighting data to the multiplication unit 6.
[0069]
The hold time data storage unit 16 outputs hold time data corresponding to the weighted data selected by the data model determination unit 18 to the hold control circuit 15.
[0070]
Except for this, the comparison means 10 outputs a trigger to the hold control circuit 15 when the variation amount of the difference between the gain value and the threshold level data is equal to or greater than a preset value, as in the second other aspect described above. .
[0071]
When the trigger is input, the hold control circuit 15 maintains the gain value generated by the gain generation unit 11 for the time of the hold time data set by the hold time data storage unit 16.
[0072]
Note that when the FFT processing is performed by a DSP (Digital Signal Processor) or the like, a certain amount of processing time is required, so the FFT processing is performed at a time interval corresponding to this processing time.
[0073]
With this configuration, weighting data and hold time data corresponding to the data model for each audio band that matches the frequency characteristics of the input audio signal are automatically selected, and auto gain control that matches the input audio signal is automatically selected. Can be done.
[0074]
【The invention's effect】
As described above, according to the present invention, an input sound signal having a predetermined frequency characteristic is obtained by weighting an input sound signal for each preset frequency band and performing auto gain control based on the weighted signal. It is possible to operate only on the signal and follow the assumed frequency characteristics of the audio signal.
[Brief description of the drawings]
FIG. 1 is a schematic block diagram illustrating an auto gain control device according to an embodiment of the present invention. FIG. 2 is a diagram illustrating an example of frequency characteristics of an input source assumed by the auto gain control device according to an embodiment of the present invention. FIG. 3 is a diagram illustrating an example of frequency characteristics of each band division filter unit of the auto gain control device according to the embodiment of the present invention. FIG. 4 is a band division filter of the auto gain control device according to the embodiment of the present invention. FIG. 5 is a schematic block diagram showing an auto gain control device according to another first aspect of an embodiment of the present invention. FIG. 6 is a schematic block diagram showing an example of the overall characteristics of the means. FIG. 7 is a schematic block diagram showing an auto gain control device according to another embodiment of the present invention. FIG. 7 is a schematic block diagram showing an auto gain control device according to a third other embodiment of the present invention. Figure a schematic block diagram showing the 8 conventional automatic gain control device [Description of symbols]
DESCRIPTION OF SYMBOLS 1 Audio | voice input terminal 2 Audio | voice output terminal 3 Band division | segmentation filter means 4 Absolute value conversion means 5 Weighting data storage means 6 Multiplication means 7 Addition means 8 Gain correction means 9 Threshold level storage means 10 Comparison means 11 Gain generation means 12 Envelope generation Means 13 Output signal multiplication means 14 Weighted data switching means 15 Hold control circuit 16 Hold time data storage means 17 Fast Fourier transform processing means 18 Data model judgment means 101 Audio input terminal 102 Audio output terminal 103 Absolute value conversion means 104 Gain correction means 105 threshold level storage means 106 comparison means 107 gain generation means 108 envelope generation means

Claims

入力音声信号を複数の周波数帯域の異なる信号に分割する帯域分割フィルタ手段と、前記複数の周波数帯域の異なる信号それぞれを絶対値変換する絶対値変換手段と、前記絶対値変換手段で絶対値変換されたそれぞれの値に対する重み付けデータを格納する重み付けデータ格納手段と、前記絶対値変換手段で絶対値変換されたそれぞれの値と前記重み付けデータ格納手段に格納されている重み付けデータとを乗算する乗算手段と、前記乗算手段で乗算されたそれぞれの値を加算することにより前記周波数帯域ごとに重み付けされた音声信号の全周波数帯域に対するゲイン値を出力する加算手段と、前記加算手段の出力のゲイン値を補正するゲイン補正手段と、前記ゲイン補正手段により補正されたゲイン値とスレッショルドレベルとを比較する比較手段と、前記比較手段の比較結果に基づいてゲイン値を発生するゲイン発生手段とを備え、
前記ゲイン発生手段から出力されるゲイン値に基づいて前記入力音声信号のゲイン値をコントロールすることを特徴とするオートゲインコントロール装置。Absolute value conversion is performed by the band division filter unit that divides the input audio signal into a plurality of signals having different frequency bands, the absolute value conversion unit that performs absolute value conversion on each of the signals having different frequency bands, and the absolute value conversion unit. Weighting data storing means for storing weighting data for each value, and multiplying means for multiplying each weight value converted by the absolute value converting means by the weighting data stored in the weighting data storing means, Adding means for outputting gain values for all frequency bands of the audio signal weighted for each frequency band by adding the values multiplied by the multiplication means, and correcting the gain value of the output of the adding means The gain correction means for comparing the gain value corrected by the gain correction means and the threshold level. Comprising a comparison unit, and a gain generating means for generating a gain value based on the comparison result of the comparing means,
An auto gain control apparatus for controlling a gain value of the input audio signal based on a gain value output from the gain generating means.

前記重み付けデータ格納手段に複数組の重み付けデータを格納し、前記乗算手段に出力する前記重み付けデータを前記複数組の重み付けデータの中で切り換える重み付けデータ切換え手段を備えることを特徴とする請求項１に記載のオートゲインコントロール装置。 The weighting data storage means stores a plurality of sets of weighting data, and comprises weighting data switching means for switching the weighting data output to the multiplication means among the plurality of sets of weighting data. The automatic gain control device described.

前記ゲイン発生手段が発生するゲイン値を所定時間維持するホールド制御手段と、前記ホールド制御手段でゲイン値を維持する時間を複数格納するホールドタイムデータ格納手段とを備え、前記ホールドタイムデータ格納手段は、前記重み付けデータ切換え手段が切り換えた前記重み付けデータに対応する前記ゲイン値を維持する時間を前記ホールド制御手段に設定することを特徴とする請求項２に記載のオートゲインコントロール装置。A hold control means for maintaining a gain value generated by the gain generation means for a predetermined time; and a hold time data storage means for storing a plurality of times for maintaining the gain value by the hold control means. 3. The automatic gain control device according to claim 2, wherein a time for maintaining the gain value corresponding to the weighted data switched by the weighted data switching means is set in the hold control means.

入力音声信号をＦＦＴ処理により周波数データに変換する変換処理手段を備え、前記重み付けデータ切換え手段は、前記周波数データに対応する重み付けデータに切り換えることを特徴とする請求項２または請求項３に記載のオートゲインコントロール装置。 4. The conversion processing means for converting an input audio signal into frequency data by FFT processing, wherein the weighting data switching means switches to weighting data corresponding to the frequency data. Auto gain control device.