JP3255077B2 - Telephone

Info

Publication number: JP3255077B2
Application number: JP10506697A
Authority: JP
Inventors: 政人久保
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1997-04-23
Filing date: 1997-04-23
Publication date: 2002-02-12
Anticipated expiration: 2017-04-23
Also published as: JPH10304023A

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical Field of the Invention] The present invention relates to a telephone, and more particularly to a digital telephone.

[0002]

[Related Art] Telephones, and in particular portable telephones whose chief feature is portability, are used mostly outdoors, so call quality is easily affected by ambient noise during a call. Telephones for cellular or car-phone service, which can be used while on the move, are even more susceptible to external noise.

[0003] In a digital mobile telephone, calls at a low transmission rate are achieved by applying speech coding with a high degree of information compression to the input speech signal.

[0004] In most cases, however, such a speech coding scheme works by extracting features of the human voice as parameters and transmitting them, so it is not suited to transmitting environmental noise and the like.

[0005] For this reason, some portable telephones detect the speech sections in an input signal that contains environmental noise, apply noise suppression such as a noise canceller to everything outside the speech sections, and thereby remove non-speech noise before speech coding.

[0006] The speech section detector shown in FIG. 4 is disclosed in Japanese Patent Application Laid-Open No. H4-251299, "Speech Detection System". It performs LPC spectrum analysis on the input speech in fixed time frames and judges a frame to be a speech section when the level difference between the spectral peak of the low-frequency band and the average spectrum of the high-frequency band exceeds a threshold. If the immediately preceding frame was judged not to be a speech section, a corrected frame is created by subtracting the spectrum of the preceding frame from the spectrum of the current frame, and the speech-section judgment is made using the spectrum of the corrected frame.

[0007] That is, the input speech signal is band-limited by a filter 401 and converted into a digital signal by an AD converter 402.

[0008] Next, the signal cut out in one-frame units in a DSP 405 is subjected to LPC spectrum analysis, and the result is stored in a spectrum memory 403. Two frames of spectra are stored: the spectrum of the immediately preceding frame and the spectrum of the current frame.

[0009] When the immediately preceding frame is judged not to be a speech section, a switch 404 is connected to side B. The spectrum of the preceding frame is subtracted, frequency by frequency, from the spectrum of the current frame, and the resulting corrected-frame spectrum is passed on.

[0010] When the immediately preceding frame is judged to be a speech section, the switch 404 is connected to side A and the spectrum of the current frame is passed on.

[0011] From the spectrum passed on, the spectral peak Ps of the low-frequency band and the spectral peak Pz of the high-frequency band are detected, and the speech-section decision is output by comparing the level of Ps minus Pz against a threshold. Another example of this kind of speech section detector is disclosed in Japanese Patent Application Laid-Open No. S61-65298. When the ambient noise level is low, it uses the speech data from low to high frequencies; when the ambient noise level is high, it creates a vowel-centered feature pattern and at the same time changes the threshold for detecting speech before performing the analysis.
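The prior-art decision rule of paragraphs [0006] to [0011] can be sketched as follows. This is only an illustrative sketch, not the implementation of the cited publication: the function name, the `split_bin` index separating the low and high bands, and the threshold value are assumptions, and the text does not say whether the spectra are subtracted on a linear or a logarithmic scale.

```python
def is_speech_frame(spectrum, prev_spectrum, prev_was_speech, split_bin, threshold):
    """Judge whether the current frame is a speech section.

    spectrum, prev_spectrum: per-bin spectral levels of the current and
    preceding frame (scale is an assumption; the source does not state it).
    split_bin: hypothetical index separating the low band from the high band.
    """
    if prev_was_speech:
        # Switch 404 on side A: use the current spectrum unchanged.
        used = list(spectrum)
    else:
        # Switch 404 on side B: subtract the preceding frame's spectrum
        # bin by bin to form the corrected-frame spectrum.
        used = [cur - prev for cur, prev in zip(spectrum, prev_spectrum)]
    ps = max(used[:split_bin])  # spectral peak of the low band (Ps)
    pz = max(used[split_bin:])  # spectral peak of the high band (Pz)
    return (ps - pz) > threshold


# Example: a frame with strong low-band energy after a noise-only frame.
print(is_speech_frame([30, 20, 10, 10], [5, 5, 5, 5], False, 2, 10))
```

The subtraction branch is what lets the onset of speech stand out against a stationary noise floor, which is the benefit paragraph [0012] attributes to this scheme.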

[0012] If the speech detection system of Japanese Patent Application Laid-Open No. H4-251299 described above is used in the speech detector of a portable telephone, the noise spectrum obtained in a noise-only section is subtracted from the spectrum of a speech section that contains noise, so the start of a speech section can be detected without being affected by noise. Noise outside the speech sections can then be removed accurately, which improves call quality regardless of environmental noise.

[0013] Thus, speech detection by threshold judgment of features such as the speech spectrum, performed on a signal processing chip such as a DSP, can be expected to give accurate decisions even in a noisy environment.

[0014] In the speech section detector of Japanese Patent Application Laid-Open No. S61-65298, a pattern can be created with the analysis best suited to the noise level, so accurate speech recognition is possible regardless of the noise level.

[0015]

[Problems to Be Solved by the Invention] In a digital mobile telephone, however, a noise remover (noise canceller) and a speech coder are used in combination with the speech detector. All of them are implemented as computation on a signal processing chip, which can become a factor that increases the amount of computation.

[0016] Furthermore, processing such as speech-spectrum extraction requires many operations, while simplified operations degrade performance. In the end, the increased amount of computation leads to increased power consumption.

[0017] Consequently, when such a detector is used in a digital mobile telephone, which already has a heavy computational load, the increased power consumption shortens talk time, which is important for this kind of telephone whose chief feature is portability.

[0018] An object of the present invention is therefore to provide a telephone that can prevent the increase in power consumption caused by adding arithmetic units.

[0019]

[Means for Solving the Problems] To solve the above problem, the present invention is a telephone having a transmitting unit that transmits speech after noise removal processing, comprising: digital conversion means for converting the speech into a digital signal; noise removal processing means for applying noise removal processing to the signal converted into the digital signal; encoding means for extracting speech features from the digital signal after the noise removal processing and encoding them; the transmitting unit, which transmits the encoded digital signal; and speech detection means for performing speech detection based on the speech features, wherein the noise removal processing means performs the noise removal processing based on the speech detection result of the speech detection means.

[0020] According to the present invention, speech detection is based on speech features obtained in the course of encoding the digital signal, so there is no need to provide separate arithmetic units for speech-spectrum extraction and the like. The increase in power consumption that such additional arithmetic units would cause can therefore be prevented.

[0021]

[Embodiments of the Invention] Embodiments of the present invention are described below with reference to the accompanying drawings.

[0022] FIG. 1 is a block diagram showing the best mode of the telephone of the present invention. FIG. 2 is a schematic diagram showing the operation of the frequency characteristic pattern determination unit and the filter coefficient storage unit of the telephone. FIG. 3 is a schematic diagram of an example in which the functions of the frequency characteristic pattern determination unit and the filter coefficient storage unit are realized by another embodiment.

[0023] The telephone 100 comprises a transmitting microphone 101, an AD converter 102 that converts an analog signal into a digital signal, a noise removal processing unit 103, a speech coding processing unit 104, a transmitter 105, an antenna 106, a spectrum information analysis processing unit 107, a pitch information analysis processing unit 108, a frequency characteristic pattern determination unit 109, a filter coefficient storage unit 110, a state estimation unit 111, and a pre-processing filter unit 112.

[0024] The telephone 100 shown in FIG. 1 is a portable telephone that transmits and receives via radio waves. For ease of explanation, only the configuration used during transmission is shown.

[0025] The transmit signal input through the transmitting microphone 101 is converted into a digital signal by the AD converter 102 and then input to the noise removal processing unit 103. The same signal is simultaneously input to the pre-processing filter unit 112.

[0026] The pre-processing filter unit 112 is a digital filter that performs filtering by arithmetic processing. The digital transmit signal is given a predetermined frequency characteristic and input to the state estimation unit 111, which judges whether the transmit signal for a predetermined time (one frame section) is a speech section or a non-speech section and controls the operation of the noise removal processing unit 103 through the noise removal control information output 113.

[0027] The state estimation unit 111 consists of speech detection and a state transition operation. For speech detection, one possible method is to reach an overall decision by combining threshold judgment of the average speech power, judgment of the level difference from past frames, threshold judgment of the autocorrelation, and so on.
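One way to combine the three cues named in paragraph [0027] is sketched below. The text does not say how the individual judgments are combined, so the majority vote, the function names, the thresholds, and the autocorrelation lag are all assumptions made for illustration.

```python
def average_power(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def normalized_autocorrelation(frame, lag):
    """Autocorrelation at the given lag, normalized by the frame energy."""
    energy = sum(x * x for x in frame)
    if energy == 0:
        return 0.0
    return sum(frame[i] * frame[i - lag] for i in range(lag, len(frame))) / energy


def detect_speech(frame, prev_power, power_thr, diff_thr, corr_thr, lag):
    """Return (is_speech, power) for one frame.

    Majority vote over three cues (an assumed combination rule):
    average power, power rise over the previous frame, autocorrelation.
    """
    power = average_power(frame)
    votes = [
        power > power_thr,
        (power - prev_power) > diff_thr,
        normalized_autocorrelation(frame, lag) > corr_thr,
    ]
    return sum(votes) >= 2, power


# Example: a periodic frame following silence.
frame = [1, 0, -1, 0] * 8
print(detect_speech(frame, 0.0, 0.1, 0.1, 0.5, 4))
```

Returning the frame power alongside the decision lets the caller feed it back as `prev_power` for the next frame.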

[0028] Alternatively, a speech detection method that varies the threshold to reduce false detection caused by noise, as shown in Japanese Examined Patent Publication No. H03-071719, may be used.

[0029] In the state transition operation, for example, when a frame is judged to be a speech section, the noise removal operation of the noise removal processing unit 103 is stopped or weakened starting from the current frame. However, when a frame that follows a speech section is judged to be a non-speech noise section, the noise removal operation is not started or strengthened from the current frame. Instead, the noise removal processing unit 103 is controlled so that the noise removal operation is started or strengthened only after non-speech noise sections have been counted for a predetermined number of consecutive frames.

[0030] By making the noise removal operation for the transmit signal of the current frame section transition in this way, loss of the transmitted speech can be prevented.
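The state transition of paragraphs [0029] and [0030] behaves like a hangover counter. The following is a minimal sketch under stated assumptions: the class name, the two-level on/off control (the text also allows "weaken" and "strengthen"), the initial state, and the frame count are illustrative and not taken from the source.

```python
class NoiseRemovalController:
    """Delays re-enabling noise removal until enough noise frames have passed."""

    def __init__(self, hangover_frames):
        self.hangover_frames = hangover_frames  # assumed "predetermined number"
        self.noise_run = 0                      # consecutive non-speech frames seen
        self.removal_on = True                  # assumed initial state: noise only

    def update(self, is_speech):
        if is_speech:
            # Speech frame: stop noise removal immediately, from this frame.
            self.noise_run = 0
            self.removal_on = False
        else:
            # Non-speech frame: only restart after a sustained run of noise.
            self.noise_run += 1
            if self.noise_run >= self.hangover_frames:
                self.removal_on = True
        return self.removal_on


controller = NoiseRemovalController(hangover_frames=3)
decisions = [controller.update(s) for s in [True, False, False, False, False, True, False]]
print(decisions)
```

The asymmetry (immediate release on speech, delayed engagement on noise) is what protects short pauses and word endings from being suppressed.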

[0031] Meanwhile, the transmit signal input to the noise removal processing unit 103 undergoes noise removal processing according to the operating state indicated by the noise removal control information output 113. For the noise removal processing, the most suitable of several levels of processing, such as no processing, noise suppression, and removal by filtering, is selected according to the state of the frame section.

[0032] The transmit signal after noise removal processing is encoded by the speech coding processing unit 104, and the encoded transmit signal is radiated into the air from the antenna 106 via the transmitter 105.

[0033] In the speech coding processing unit 104, the transmit signal is converted from a waveform signal into speech features such as pitch information and spectrum information, which are encoded and output. Here, pitch information means the fundamental frequency information of a voiced sound, and spectrum information means the frequency characteristic information of the speech. A speech coding scheme based on speech features in this way is unsuited to non-speech input and severely distorts noise signals such as environmental noise. It is well known that environmental noise therefore degrades call quality.

[0034] The pitch information obtained in the course of the encoding process is input to the pitch information analysis processing unit 108, converted into frequency characteristic data indicating the frequencies over which the pitch frequency is distributed, and input to the frequency characteristic pattern determination unit 109.

[0035] Meanwhile, the spectrum information obtained in the course of the encoding process is input to the spectrum information analysis processing unit 107, converted into frequency characteristic data indicating the distribution of the speech, and input to the frequency characteristic pattern determination unit 109.

[0036] As shown in FIG. 2, the frequency characteristic pattern determination unit 109 selects the frequency characteristic pattern (one of 109b to 109n in the figure) that most closely approximates the frequency characteristic 109a obtained by combining the two input sets of frequency characteristic data.

[0037] Several frequency characteristic patterns 109b to 109n that make speech easy to detect are prepared in advance by learning. They are provided to reflect differences between individuals and between men and women.

[0038] The selected frequency characteristic pattern is input to the filter coefficient storage unit 110 via the frequency characteristic pattern determination output 115. The filter coefficients associated in advance with that pattern (one of 110b to 110n in the figure) are retrieved and sent to the pre-processing filter unit 112 as the filter coefficient output 114.
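The selection and lookup of paragraphs [0036] to [0038] can be sketched as a nearest-pattern search followed by a table lookup. The source does not specify how the two frequency characteristics are combined or how closeness is measured, so the element-wise sum, the squared-error distance, and all pattern names and values below are assumptions for illustration only.

```python
def combine(pitch_curve, spectrum_curve):
    """Combine the two frequency characteristics (assumed: element-wise sum)."""
    return [p + s for p, s in zip(pitch_curve, spectrum_curve)]


def select_pattern(combined, patterns):
    """Return the key of the stored pattern closest to the combined curve."""
    def squared_error(name):
        return sum((a - b) ** 2 for a, b in zip(combined, patterns[name]))
    return min(patterns, key=squared_error)


# Hypothetical stored patterns (role of 109b..109n) and coefficients (110b..110n).
PATTERNS = {
    "low_voice": [1.0, 0.5, 0.0],
    "high_voice": [0.0, 0.5, 1.0],
}
COEFFICIENTS = {
    "low_voice": [0.5, 0.3, 0.2],
    "high_voice": [0.2, 0.3, 0.5],
}

combined = combine([0.6, 0.1, 0.0], [0.3, 0.3, 0.1])
chosen = select_pattern(combined, PATTERNS)
print(chosen, COEFFICIENTS[chosen])
```

Keeping patterns and coefficients in two tables keyed by the same name mirrors the separation between the determination unit 109 and the storage unit 110.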

[0039] The filter coefficient storage unit 110 is informed by the state estimation unit 111 of the state one frame earlier. When the current frame is not a speech section, it outputs the filter coefficients of one frame earlier, thereby eliminating the influence of noise sections.

[0040] This is because, if the current frame is a non-speech frame (that is, a noise frame), the pitch information analysis processing unit 108 and the spectrum information analysis processing unit 107 may be unable to obtain correct information, so that the frequency characteristic pattern determination unit 109 and the filter coefficient storage unit 110 may also be unable to obtain correct data. The state estimation unit 111 could then malfunction.

[0041] A further reason is that the speech coding processing unit 104 is intended for speech coding and has no capability to encode noise.
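The hold behaviour described in paragraphs [0039] and [0040] can be sketched as below. This is one possible reading, not the patented implementation: the class name and the initial coefficients are hypothetical, and it assumes that over a run of consecutive non-speech frames the coefficients from the last speech frame keep being reused.

```python
class CoefficientHolder:
    """Outputs fresh coefficients on speech frames, held ones otherwise."""

    def __init__(self, initial):
        self.last = list(initial)  # hypothetical start-up coefficients

    def output(self, candidate, frame_is_speech):
        if frame_is_speech:
            # Features from a speech frame are trusted: adopt the new coefficients.
            self.last = list(candidate)
        # On a non-speech frame the candidate is ignored and the
        # previous frame's coefficients are output again.
        return self.last


holder = CoefficientHolder([1.0, 0.0])
print(holder.output([0.5, 0.5], True))
print(holder.output([0.9, 0.1], False))
```

Ignoring the candidate on noise frames keeps unreliable pitch and spectrum data from steering the pre-processing filter.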

[0042] On the other hand, when the frame one earlier was a speech section, a frequency characteristic that makes speech easy to detect is given to the transmit signal before speech detection is performed, so speech is detected more accurately.

[0043] Based on this detection result, the noise removal processing unit 103 operates so as to remove non-speech signals more accurately. The speech coding processing unit 104 therefore encodes only the speech signal, and more accurate pitch information and spectrum information can be obtained.

[0044] As frame-by-frame processing continues in this way, noise removal becomes more accurate.

[0045] Next, a second embodiment of the present invention is described with reference to FIG. 3. FIG. 3 shows the frequency characteristic pattern determination unit 109 and the filter coefficient storage unit 110 of FIG. 2 realized by a different method.

[0046] The frequency characteristic impulse response conversion unit 116 obtains frequency characteristic data from the pitch information analysis processing unit 108 and the spectrum information analysis processing unit 107, and converts the combined frequency characteristic data 109a from frequency-axis data into time-axis data by means of an inverse Fourier transform calculation unit 118.

[0047] The converted time-axis data is obtained as impulse response data 119. This impulse response data is used as the filter coefficients in the pre-processing filter unit 112.
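The conversion of paragraphs [0046] and [0047] can be illustrated with a direct inverse discrete Fourier transform whose output is then used as FIR filter coefficients. This is only a sketch: it assumes the frequency characteristic is sampled at N uniformly spaced DFT bins and is conjugate-symmetric so that the impulse response is real, and the function names are hypothetical. The source does not specify the transform length or the filter structure.

```python
import cmath


def impulse_response(freq_response):
    """Inverse DFT of a sampled frequency response; returns the real part."""
    n = len(freq_response)
    h = []
    for t in range(n):
        acc = sum(
            freq_response[k] * cmath.exp(2j * cmath.pi * k * t / n)
            for k in range(n)
        )
        h.append((acc / n).real)
    return h


def fir_filter(signal, coeffs):
    """Direct-form FIR filter: out[i] = sum_j coeffs[j] * signal[i - j]."""
    out = []
    for i in range(len(signal)):
        out.append(
            sum(c * signal[i - j] for j, c in enumerate(coeffs) if i - j >= 0)
        )
    return out


# A flat frequency response corresponds to a unit impulse, so the
# filtered signal is (up to rounding) the input itself.
h = impulse_response([1.0, 1.0, 1.0, 1.0])
print(fir_filter([1.0, 2.0, 3.0], h))
```

A practical implementation would likely use a fast Fourier transform rather than this O(N^2) loop; the direct form is used here only to keep the example self-contained.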

[0048] With this configuration, the filter coefficients themselves are calculated from the frequency characteristic data, so the telephone no longer needs to hold several frequency characteristic patterns. In addition, filter coefficients can be obtained for frequency characteristic patterns reflecting any individual or gender difference.

[0049]

[Effects of the Invention] According to the present invention, a telephone having a transmitting unit that converts speech into a digital signal, applies noise removal processing to the digital signal, and encodes and transmits the digital signal after the noise removal processing is configured to include extracting means for extracting speech features obtained in the process of encoding the digital signal and speech detection means for performing speech detection based on the speech features extracted by the extracting means, and to perform the noise removal processing based on the speech detection result of the speech detection means. There is therefore no need to provide arithmetic units for speech-spectrum extraction and the like, and the increase in power consumption that such additional arithmetic units would cause can be prevented.

[Brief Description of the Drawings]

FIG. 1 is a block diagram of the best mode of the telephone according to the present invention.

FIG. 2 is a schematic diagram showing the operation of the frequency characteristic pattern determination unit and the filter coefficient storage unit of the telephone.

FIG. 3 is a schematic diagram of an example in which the functions of the frequency characteristic pattern determination unit and the filter coefficient storage unit are realized by another embodiment.

FIG. 4 is a block diagram of an example of a conventional speech section detector.

[Explanation of Symbols]

100 Telephone
102 AD converter
103 Noise removal processing unit
104 Speech coding processing unit
107 Spectrum information analysis processing unit
108 Pitch information analysis processing unit
109 Frequency characteristic pattern determination unit
110 Filter coefficient storage unit
111 State estimation unit
112 Pre-processing filter unit
116 Frequency characteristic impulse response conversion unit

Claims

(57) [Claims]

1. A telephone having a transmitting unit that transmits speech after noise removal processing, comprising: digital conversion means for converting the speech into a digital signal; noise removal processing means for applying noise removal processing to the signal converted into the digital signal; encoding means for extracting speech features from the digital signal after the noise removal processing and encoding them; the transmitting unit, which transmits the encoded digital signal; and speech detection means for performing speech detection based on the speech features, wherein the noise removal processing means performs the noise removal processing based on the speech detection result of the speech detection means.

2. The telephone according to claim 1, wherein the speech detection means includes: a frequency characteristic pattern generation unit that generates a frequency characteristic pattern of the speech corresponding to the speech features extracted by the encoding means; a filter coefficient generation unit that generates filter coefficients corresponding to the frequency characteristic pattern generated by the frequency characteristic pattern generation unit; a pre-processing filter unit that filters the digital signal, using the filter coefficients generated by the filter coefficient generation unit, to a characteristic matched to the distribution of the speech; and a speech detection unit that performs speech detection based on the signal output from the pre-processing filter unit.

3. The telephone according to claim 2, wherein the frequency characteristic pattern generation unit comprises a storage unit storing a plurality of frequency characteristic patterns that make speech easy to detect, and a selection unit that selects from the storage unit the frequency characteristic pattern that most closely approximates the speech features.

4. The telephone according to claim 2 or 3, wherein the filter coefficient generation unit comprises a storage unit storing a plurality of filter coefficients, and a selection unit that selects the filter coefficients corresponding to the frequency characteristic pattern generated by the frequency characteristic pattern generation unit.

5. The telephone according to any one of claims 2 to 4, wherein the filter coefficient generation unit outputs the filter coefficients of the immediately preceding frame when the current frame is judged to be a non-speech section.

6. The telephone according to claim 1, wherein the speech detection means includes: a calculation unit that calculates an impulse response from the speech features extracted by the encoding means; a pre-processing filter unit that filters the digital signal, using the impulse response calculated by the calculation unit, to a characteristic matched to the distribution of the speech; and a speech detection unit that performs speech detection based on the signal output from the pre-processing filter unit.

7. The telephone according to any one of claims 1 to 6, wherein the encoding means extracts pitch frequency characteristic data and spectral characteristic data of the speech in the process of encoding the digital signal.