JP2007336364A

JP2007336364A - Echo canceler

Info

Publication number: JP2007336364A
Application number: JP2006167604A
Authority: JP
Inventors: Shinsuke Takada; 真資高田
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 2006-06-16
Filing date: 2006-06-16
Publication date: 2007-12-27
Anticipated expiration: 2026-06-16
Also published as: JP4894368B2

Abstract

<P>PROBLEM TO BE SOLVED: To quickly follow up or cancel an echo against the delay variation depending on an echo route. <P>SOLUTION: The echo canceller comprises a means for generating an inaudible signal having inaudible time axis waveform components, a means for generating an output signal composed of a received signal plus the inaudible signal, a first generating means for generating a first delay estimation signal by applying a specified filter process to the inaudible signal, a second generating means for generating a second delay estimation signal by applying a specified filter process to a transmission signal, a calculating means for obtaining delay time information reflecting variable delay characteristics depending on an echo route, based on the cross correlation of the first delay estimation signal with the second one, and a delay adding means for delaying the received signal to delay the input of the received signal to an adaptive filter, based on the delay time information calculated by the delay time information calculating means. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

本発明は、エコーキャンセラに関し、例えば、エコー経路による遅延が変動する場合にも適正なエコー成分を送話信号から除去するエコーキャンセラに適用し得る。 The present invention relates to an echo canceller, and can be applied to, for example, an echo canceller that removes an appropriate echo component from a transmission signal even when a delay due to an echo path varies.

特許文献１には、ネットワークを介して音声データを送受信するハンズフリー通話装置に関する技術が開示されている。以下では、特許文献１に記載の従来のハンズフリー通話装置におけるエコーキャンセラの機能を図２を参照しながら説明する。 Patent Document 1 discloses a technique related to a hands-free communication device that transmits and receives audio data via a network. Hereinafter, the function of the echo canceller in the conventional hands-free communication device described in Patent Document 1 will be described with reference to FIG.

図２は、従来のハンズフリー通話装置の構成を示すブロック図であり、図２では、ネットワーク（例えば、インタネット１００）に接続可能なネットワーク端末装置（例えば、パーソナルコンピュータ等）１１５が、サウンドボード１０３を装備しているものとする。 FIG. 2 is a block diagram showing a configuration of a conventional hands-free call device. In FIG. 2, a network terminal device (for example, a personal computer) 115 that can be connected to a network (for example, the Internet 100) is connected to a sound board 103. Is equipped.

ネットワーク端末装置１１５は、インターネット１００に接続する回線インターフェース１１４、復号器１０１、音声出力ドライバー１０２、音声入力ドライバー１１０、符号化器１１３を有し、また、サウンドボード１０３は、途切れ緩衝用蓄積バッファ１０４及び１０９、ディジタル−アナログ変換器（以後、Ｄ／Ａという）１０５、スピーカ１０６と接続するスピーカ端子１１６、マイク１０７と接続するマイク端子１１７、アナログ−ディジタル変換器（以後、Ａ／Ｄという）１０８を有する。 The network terminal device 115 includes a line interface 114 connected to the Internet 100, a decoder 101, an audio output driver 102, an audio input driver 110, and an encoder 113, and the sound board 103 includes an interruption buffer storage buffer 104. 109, a digital-analog converter (hereinafter referred to as D / A) 105, a speaker terminal 116 connected to the speaker 106, a microphone terminal 117 connected to the microphone 107, and an analog-digital converter (hereinafter referred to as A / D) 108. Have

まず、図２において、図示しない遠端話者からの音声データは、ネットワーク１００を通じてネットワーク端末装置１１５内の回線インターフェース１１４に入力し、復号器１０１により音声波形信号（以後、音声信号という）に復号される。 First, in FIG. 2, voice data from a far-end speaker (not shown) is input to the line interface 114 in the network terminal device 115 through the network 100 and decoded into a voice waveform signal (hereinafter referred to as a voice signal) by the decoder 101. Is done.

音声信号は、復号器１０１から音声出力ドライバー１０２に与えられ、音声出力ドライバー１０２からネットワーク端末装置１１５に装備されたサウンドボード１０３の途切れ緩衝用蓄積バッファ１０４に与えられる。 The audio signal is supplied from the decoder 101 to the audio output driver 102, and is supplied from the audio output driver 102 to the interruption buffer storage buffer 104 of the sound board 103 provided in the network terminal device 115.

途切れ緩衝用蓄積バッファ１０４は過去に蓄積された音声信号から順にＤ／Ａ１０５に出力し、出力された音声信号は、Ｄ／Ａ１０５によりアナログ信号に変換され、スピーカ端子１１６を通じてスピーカ１０６とＡ／Ｄ１０８とに与えられる。 The interruption buffer accumulation buffer 104 outputs the audio signal accumulated in the past to the D / A 105 in order, and the outputted audio signal is converted into an analog signal by the D / A 105, and the speaker 106 and the A / D 108 through the speaker terminal 116. And given to.

このとき、スピーカ１０６から放射された音声信号がマイク１０７に回り込み、その音声信号がＡ／Ｄ１０８に入力する。 At this time, the audio signal radiated from the speaker 106 goes around the microphone 107 and the audio signal is input to the A / D 108.

Ａ／Ｄ１０８に入力された信号のうちＤ／Ａ１０５から直接入力された信号は、途切れ緩衝用蓄積バッファ１０９を経由して信号ｒ（ｋ）として、音声入力ドライバー１０９内の適応フィルタ１１１に入力される。 Of the signals input to the A / D 108, the signal directly input from the D / A 105 is input to the adaptive filter 111 in the audio input driver 109 as the signal r (k) via the interruption buffer storage buffer 109. The

一方、スピーカ１０６を経由してＡ／Ｄ１０８に入力された信号は、途切れ緩衝用畜積バッファ１０９を経由して加算器１１２に入力される。 On the other hand, the signal input to the A / D 108 via the speaker 106 is input to the adder 112 via the interrupt buffer stock buffer 109.

適応フィルタ１１１では信号ｒ（ｋ）と信号ｆ（ｋ）を入力として打ち消し信号ｓ（ｋ）を作成する。 The adaptive filter 111 receives the signal r (k) and the signal f (k) as inputs and creates a cancellation signal s (k).

ここで、信号ｓ（ｋ）の作成に関しては、信号ｆ（ｋ）を最小にする適応アルゴリズム、例えば既存技術であるＮＬＭＳアルゴリズムなどを適用して作成することができ、信号ｆ（ｋ）を最小にする適応アルゴリズムであれば種々の方法を適用することができ、特にここでは説明を省略する。 Here, regarding the generation of the signal s (k), it can be generated by applying an adaptive algorithm that minimizes the signal f (k), for example, the NLMS algorithm that is an existing technology, and the signal f (k) is minimized. Various methods can be applied as long as they are adaptive algorithms, and the description thereof is omitted here.

加算器１１２では、スピーカ１０６から入力された信号、すなわちエコー信号と信号ｓ（ｋ）を相殺し、信号ｆ（ｋ）を適応フィルタ１１１と符号化器１１３に出力する。 The adder 112 cancels the signal input from the speaker 106, that is, the echo signal and the signal s (k), and outputs the signal f (k) to the adaptive filter 111 and the encoder 113.

符号化器１１３では、音声信号をインターネット１に出力するため、音声データをパケット整形して変換し、回線インターフェース１１４を通じてインターネット１００に出力する。 In the encoder 113, in order to output the audio signal to the Internet 1, the audio data is packet-shaped and converted, and is output to the Internet 100 through the line interface 114.

ところで、ネットワーク端末装置１１５が、例えばＰＣのように通信処理の他にもいろいろな処理を実行する機能を有しているものである場合、特殊な事情が発生する。すなわち、通信のようにリアルタイム性を必要とする処理の他にも、いろいろなアプリケーションプログラムを実行させて、図示しない計算資源（ＣＰＵやメモリ）を使っていることが多い。 By the way, when the network terminal device 115 has a function of executing various processes in addition to the communication process such as a PC, a special situation occurs. That is, in addition to processing that requires real-time performance such as communication, various application programs are often executed to use calculation resources (CPU and memory) (not shown).

この場合、音声出力ドライバー１０２、音声入力ドライバー１１０は処理を一旦待機させられることがしばしば発生する。通常、音声処理の待機はそのまま音声信号の断絶につながる。 In this case, the audio output driver 102 and the audio input driver 110 often have to wait for processing once. Usually, waiting for voice processing directly leads to disconnection of voice signals.

そこで、従来技術では、途切れ緩衝用蓄積バッファ１０４、１０９にあらかじめ音声データを蓄積しておき、音声処理の待機が発生しても、その間、途切れ緩衝用蓄積バッファ１０４、１０９に蓄積済みの音声信号をＤ／Ａ１０５、Ａ／Ｄ１０８に出力し、音声の途切れを防ぐようになっている。 Therefore, in the prior art, audio data is stored in advance in the interruption buffer storage buffers 104 and 109, and the audio signal that has been stored in the interruption buffer storage buffers 104 and 109 during that time even if the standby of the audio processing occurs. Are output to the D / A 105 and A / D 108 to prevent voice interruption.

ところで、逆に音声出力ドライバー１０２、音声入力ドライバー１０９は動作しているのに途切れ緩衝用畜積バッファ１０９の出力が待機される場合は、途切れ緩衝用蓄積バッファ１０９にどんどんデータが流れ込み、途切れ緩衝用蓄積バッファ１０９のデータ格納量が飽和してしまう。また、飽和中に途切れ緩衝用蓄積バッファ１０９に入力しようとするデータは、途切れ緩衝用蓄積バッファ１０９に蓄積できずに破棄されてしまう。こうした、途切れ緩衝用蓄積バッファ１０９のデータの枯渇や飽和は、音声信号として双方とも音声の途切れという現象になって現れることとなる。 On the other hand, when the audio output driver 102 and the audio input driver 109 are operating but the output of the interrupt buffer storage buffer 109 is on standby, the data flows more and more into the interrupt buffer storage buffer 109, and the interrupt buffer The amount of data stored in the storage buffer 109 is saturated. Further, data to be input to the interrupt buffer storage buffer 109 during saturation cannot be stored in the interrupt buffer storage buffer 109 and is discarded. Such depletion or saturation of the data in the buffer for buffering interruption 109 appears as a sound interruption phenomenon as both audio signals.

この音声信号の途切れや途切れ緩衝用蓄積バッファの蓄積量変動は、適応フィルタ１１１を用いたエコー除去にとって性能劣化となることが広く知られている。 It is widely known that the interruption of the audio signal and the fluctuation in the accumulation amount of the interruption buffer accumulation buffer cause performance degradation for echo removal using the adaptive filter 111.

つまり、通常エコー除去のための適応フィルタ１１１（エコーキャンセラ）は、復号器１０１の出力と音声入力ドライバー１１０の出力とをそれぞれ適応フィルタ１１１の入力として動作させている。そのため、信号経路として音声出力ドライバー１０２から始まって、途切れ緩衝用蓄積バッファ１０４、スピーカ１０６、マイク１０７を経由し、途切れ緩衝用蓄積バッファ１０９、音声入力ドライバー１１０に至るまでの経路は、２つの途切れ緩衝用バッファ１０４及び１０９の蓄積量変化（遅延変化）と、音声の途切れ（バッファの枯渇、飽和）とがエコー経路の時変動と要因となる。 That is, the adaptive filter 111 (echo canceller) for normal echo removal operates using the output of the decoder 101 and the output of the audio input driver 110 as inputs of the adaptive filter 111, respectively. Therefore, the path from the audio output driver 102 as a signal path to the interruption buffer storage buffer 109 and the audio input driver 110 via the interruption buffer storage buffer 104, the speaker 106, and the microphone 107 is two interruptions. Changes in the accumulated amounts (delay changes) in the buffer buffers 104 and 109 and interruptions in the sound (buffer depletion and saturation) cause time fluctuations in the echo path.

公知のように適応フィルタ１１１は、エコー経路が時間不変である前提が成り立つとき効果を発揮するので、上記のようなエコー経路の時間変動があると性能が大きく劣化することになる。 As is well known, the adaptive filter 111 is effective when the premise that the echo path is time-invariant is satisfied. Therefore, if there is a time variation of the echo path as described above, the performance greatly deteriorates.

従来技術では、Ｄ／Ａ１０５の出力から直接Ａ／Ｄ１０８に信号を入力し、Ａ／Ｄ１０８からの出力を途切れ緩衝用蓄積バッファ１０９を経由して適応フィルタ１１１に入力するようにして、途切れ緩衝用蓄積バッファの途切れや飽和の影響を受けた後の信号を適応フィルタ１１１の参照入力信号ｒ（ｋ）として入力し、一方で、これもまた途切れ緩衝用蓄積バッファ１０９の途切れや飽和の影響が反映されているエコー信号を加算器１１２に入力し、適応フィルタ１１１の入力にも途切れ緩衝用蓄積バッファ１０９の影響を取り込むことで、見かけ上、送受２つの経路上の途切れ緩衝用蓄積バッファ１０９の影響を相殺して、適応フィルタ１１１にとってエコー経路の時変動をなくすことでエコー除去能力の劣化を防止している。 In the prior art, a signal is directly input to the A / D 108 from the output of the D / A 105, and the output from the A / D 108 is input to the adaptive filter 111 via the interruption buffer accumulation buffer 109, thereby The signal after being affected by the interruption and saturation of the storage buffer is input as the reference input signal r (k) of the adaptive filter 111, while this also reflects the influence of the interruption and saturation of the interruption buffer storage buffer 109. The input echo signal is input to the adder 112, and the influence of the interruption buffer accumulation buffer 109 is also taken into the input of the adaptive filter 111, so that the influence of the interruption buffer accumulation buffer 109 on the two paths of transmission and reception is apparent. Is eliminated, and the adaptive filter 111 eliminates the time fluctuation of the echo path, thereby preventing the deterioration of the echo removal capability.

特開２００４−４０５８９号公報JP 2004-40589 A 境久雄、中山剛著、”聴覚と音響心理”、株式会社コロナ社発行、１９７８年、Ｐ１０７〜Ｐ１１７Hisao Sakai and Takeshi Nakayama, “Hearing and Acoustic Psychology”, published by Corona Inc., 1978, P107-P117

しかし、従来技術では、エコー除去のため、スピーカ１０６からマイク１０７までを結線させる必要があり、専用サウンドボード１０３を用いる必要があり、装置として高価なものであった。あるいは、専用ボードを使わない場合でも、ステレオで動作する片方のチャンネルのスピーカ１０６用出力を電線で他方のチャンネルのマイク１０７端子に接続する必要があり、本来のステレオサウンドを用いることができなくなってしまうという欠点があった。 However, in the prior art, it is necessary to connect the speaker 106 to the microphone 107 in order to eliminate echo, and it is necessary to use the dedicated sound board 103, which is expensive as an apparatus. Alternatively, even when a dedicated board is not used, it is necessary to connect the output for the speaker 106 of one channel operating in stereo to the microphone 107 terminal of the other channel with an electric wire, and the original stereo sound cannot be used. There was a drawback of end.

そのため、サウンドボードは汎用のものを使い、特段のハードウエア的改造をすることなく、また、ステレオサウンドを犠牲にしてモノラルにすることなく、途切れ緩衝用バッファの状態によってエコーの遅延が変わっても速やかにエコーを追従し、除去できるエコーキャンセラが望まれている。 Therefore, even if the delay of the echo changes depending on the state of the buffer for interruption, the sound board uses a general-purpose one, without special hardware modification, and without sacrificing stereo sound to monaural. An echo canceller that can quickly follow and remove echoes is desired.

かかる課題を解決するために、第１の本発明のエコーキャンセラは、適応フィルタを有する擬似エコー生成手段が受信信号に基づいて生成した擬似エコー信号を、エコー除去手段が送信信号から減算してエコー成分を除去するエコーキャンセラにおいて、（１）非可聴な時間軸波形成分を有する非可聴信号を発生する非可聴信号発生手段と、（２）受信信号に非可聴信号を加算した出力信号を生成する出力信号生成手段と、（３）非可聴信号に対して所定のフィルタ処理を施して第１の遅延推定信号を生成する第１の遅延推定信号生成手段と、（４）送信信号に対して所定のフィルタ処理を施して第２の遅延推定信号を生成する第２の遅延推定信号生成手段と、（５）第１の遅延推定信号と第２の遅延推定信号との相互相関に基づいて、エコー経路による可変遅延特性を反映させた遅延時間情報を求める遅延時間情報算出手段と、（６）遅延時間情報算出手段が算出した遅延時間情報に基づいて上記適応フィルタへの受信信号の入力を遅延させる遅延付加手段とを備えることを特徴とする。 In order to solve this problem, the echo canceller according to the first aspect of the present invention includes an echo canceler that subtracts a pseudo echo signal generated based on a received signal by a pseudo echo generator having an adaptive filter from a transmission signal. In an echo canceller for removing components, (1) a non-audible signal generating means for generating a non-audible signal having a non-audible time axis waveform component, and (2) an output signal obtained by adding the non-audible signal to the received signal is generated. Output signal generation means; (3) first delay estimation signal generation means for generating a first delay estimation signal by applying a predetermined filter process to the inaudible signal; and (4) predetermined for the transmission signal. Second delay estimation signal generation means for generating a second delay estimation signal by performing the filtering process of (5), and (5) based on the cross-correlation between the first delay estimation signal and the second delay estimation signal, Delay time information calculating means for obtaining delay time information reflecting variable delay characteristics according to the path; and (6) delaying the input of the received signal to the adaptive filter based on the delay time information calculated by the delay time information calculating means. Delay adding means.

本発明のエコーキャンセラによれば、エコー経路の遅延が変動する場合にも速やかにエコーを追従し、エコー成分を除去することができる。 According to the echo canceller of the present invention, even when the delay of the echo path fluctuates, it is possible to quickly follow the echo and remove the echo component.

（Ａ）第１の実施形態
以下、本発明のエコーキャンセラの第１の実施形態を図面を参照しながら説明する。 (A) First Embodiment Hereinafter, a first embodiment of an echo canceller of the present invention will be described with reference to the drawings.

第１の実施形態は、途切れ緩衝用蓄積バッファを有するサウンドボードを装備したネットワーク端末装置がハンズフリー通話装置として機能する場合のエコーキャンセルに本発明のエコーキャンセルを適用した場合を説明する。 1st Embodiment demonstrates the case where the echo cancellation of this invention is applied to the echo cancellation in case the network terminal device equipped with the sound board which has the accumulation | storage buffer for interruption buffers functions as a hands-free call apparatus.

（Ａ−１）第１の実施形態の構成
図１は、第１の実施形態に係るハンズフリー通話装置として機能する装置の内部構成を示すブロック図である。 (A-1) Configuration of First Embodiment FIG. 1 is a block diagram showing an internal configuration of a device that functions as a hands-free call device according to the first embodiment.

図１において、第１の実施形態に係る装置は、インターネット１に接続可能なネットワーク端末装置２４が、サウンドボード８を装備しているものとする。 In FIG. 1, in the device according to the first embodiment, a network terminal device 24 that can be connected to the Internet 1 is equipped with a sound board 8.

ネットワーク端末装置２４は、回線インターフェース１９、復号器２、低周波発生器３、スイッチ２３、第１の帯域制限フィルタ（ＢＰＦ１）４、加算器２５、音声出力ドライバー５、音声入力ドライバー１５、第２の帯域制限フィルタ（ＢＰＦ２）１６、加算器１７、符号化器１８、初期遅延付加部２０、適応フィルタ２１、初期遅延判定部２２、第１の低域通過フィルタ（ＬＰＯ１）２６、第２の低域通過フィルタ（ＬＰＯ２）２７を有する。 The network terminal device 24 includes a line interface 19, a decoder 2, a low frequency generator 3, a switch 23, a first band limiting filter (BPF1) 4, an adder 25, an audio output driver 5, an audio input driver 15, and a second. Band limiting filter (BPF2) 16, adder 17, encoder 18, initial delay adding unit 20, adaptive filter 21, initial delay determining unit 22, first low-pass filter (LPO1) 26, second low A pass-pass filter (LPO2) 27 is provided.

ネットワーク端末装置２４は、例えば、パーソナルコンピュータ等のような通信処理機能を有する情報処理装置を適用することができる。 As the network terminal device 24, for example, an information processing device having a communication processing function such as a personal computer can be applied.

また、サウンドボード８は、途切れ緩衝用蓄積バッファ６、ディジタル−アナログ変換器（以後、Ｄ／Ａという）７、スピーカ１０と接続するスピーカ端子９、マイク１１と接続するマイク端子１２、アナログ‐ディジタル変換器（以後、Ａ／Ｄという）１３、途切れ緩衝用蓄積バッファ１４を有する。 Further, the sound board 8 includes an interruption buffer storage buffer 6, a digital-analog converter (hereinafter referred to as D / A) 7, a speaker terminal 9 connected to the speaker 10, a microphone terminal 12 connected to the microphone 11, an analog-digital. A converter (hereinafter referred to as A / D) 13 and an interruption buffer storage buffer 14 are provided.

サウンドボード８は、ネットワーク端末装置２４に組み込むことができるユニットであり、少なくとも途切れ緩衝用蓄積バッファ６及び１４を有していれば既存のものを適用することができる。 The sound board 8 is a unit that can be incorporated into the network terminal device 24, and an existing one can be applied as long as it has at least the interrupt buffer storage buffers 6 and 14.

以下では、ネットワーク端末装置２４及びサウンドボード８の内部構成を図１を参照しながら説明する。 Hereinafter, the internal configurations of the network terminal device 24 and the sound board 8 will be described with reference to FIG.

回線インターフェース１９は、インターネット１との間でデータを送受信するものであり、音声パケットデータを受信した場合、その音声パケットデータを復号器２に与え、又符号化器１８から音声パケットデータをインターネット１に送出するものである。 The line interface 19 transmits / receives data to / from the Internet 1. When receiving the voice packet data, the line interface 19 gives the voice packet data to the decoder 2, and sends the voice packet data from the encoder 18 to the Internet 1. To send to.

復号器２は、回線インターフェース１９からの音声パケットデータを所定の復号方式に従って復号し、復号した音声信号（音声波形信号）を第１の帯域制限フィルタ（ＢＰＦ１）４に与えるものである。 The decoder 2 decodes the voice packet data from the line interface 19 according to a predetermined decoding method, and gives the decoded voice signal (voice waveform signal) to the first band limiting filter (BPF1) 4.

第１の帯域制限フィルタ（ＢＰＦ１）４は、音声信号を所望の周波数帯域に帯域制限するものである。第１の実施形態では、ＢＰＦ４は、信号の周波数を電話帯域である３００〜３４００Ｈｚに帯域制限するものとし、その帯域制限した信号ｘ（ｋ）を初期遅延付加部２０及び第１の加算器２５に与える。ここで、信号ｘ（ｋ）についてｋはデータの順番を表す。 The first band limiting filter (BPF1) 4 limits the audio signal to a desired frequency band. In the first embodiment, the BPF 4 limits the frequency of the signal to the telephone band of 300 to 3400 Hz, and the band-limited signal x (k) is added to the initial delay adding unit 20 and the first adder 25. To give. Here, k represents the order of data for the signal x (k).

第１の加算器２５は、スイッチ２３を経由して、低周波発生器３からの低周波信号を入力として取り込み、ＢＰＦ４からの信号ｘ（ｋ）と低周波信号とを加算し、信号ｘ１（ｋ）を第１の低域通過フィルタ（ＬＰＯ１）２６及び音声出力ドライバー５に与えるものである。 The first adder 25 takes in the low frequency signal from the low frequency generator 3 as an input via the switch 23, adds the signal x (k) from the BPF 4 and the low frequency signal, and adds the signal x1 ( k) is provided to the first low-pass filter (LPO1) 26 and the audio output driver 5.

スイッチ２３は、初期状態は開放しており、ネットワーク端末装置２４の立ち上げ時（例えば電源投入時）に閉じられ、低周波発生器３が発生した低周波信号を加算器２５に出力するものである。なお、スイッチ２３は、図示しない装置操作者の操作により切替指示信号などによって、電源投入時以外にも、所望のタイミングで閉じるようにしてもよい。例えば、適応フィルタ２１をリセットしたいときなどにスイッチ２３を閉じるようにしてもよい。 The switch 23 is open in the initial state, is closed when the network terminal device 24 is started up (for example, when the power is turned on), and outputs the low frequency signal generated by the low frequency generator 3 to the adder 25. is there. The switch 23 may be closed at a desired timing other than when the power is turned on by an operation of a device operator (not shown) by a switching instruction signal or the like. For example, the switch 23 may be closed when it is desired to reset the adaptive filter 21.

低周波発生器３は、人間には聞こえにくい低い周波数の信号を発生するものである。第１の実施形態では、例えば２０Ｈｚのトーン信号を用いるものとするが、これに限定しない。すなわち、聴覚的に捕らえにくい周波数であれば、どのような低周波数を発声するものであってもよい。 The low frequency generator 3 generates a low frequency signal that is difficult for humans to hear. In the first embodiment, for example, a tone signal of 20 Hz is used, but the present invention is not limited to this. In other words, any low frequency may be uttered as long as it is a frequency that is difficult to catch audibly.

第１の低域通過フィルタ（ＬＰＯ１）２６は、加算器２５から信号ｘ１（ｋ）を受け取り、信号ｘ１（ｋ）に対して低域通過フィルタ処理を施し、信号ｘ２（ｋ）として初期遅延判定部２２に出力するものである。第１の実施形態では、ＬＰＯ２６は、例えば３０Ｈｚ未満を通過帯域とし、それ以外の帯域を阻止域としたが、通過帯域はこれに限定されない。 The first low-pass filter (LPO1) 26 receives the signal x1 (k) from the adder 25, performs low-pass filter processing on the signal x1 (k), and determines an initial delay as the signal x2 (k). This is output to the unit 22. In the first embodiment, the LPO 26 uses, for example, less than 30 Hz as a pass band and other bands as a stop band, but the pass band is not limited to this.

音声出力ドライバー５は、加算器２５からの出力信号を受け取り、受信データの途切れ緩衝用蓄積バッファ６に出力するものである。 The audio output driver 5 receives an output signal from the adder 25 and outputs it to the accumulation buffer 6 for interrupting received data.

途切れ緩衝用蓄積バッファ６は、音声出力ドライバー５からの制御信号を受けており、その制御信号による制御の下、音声出力ドライバー５からの出力データを一時的に蓄積しながら、古いデータから順番にＤ／Ａ７に与えるものである。 The interruption buffer accumulation buffer 6 receives a control signal from the audio output driver 5 and, under the control of the control signal, temporarily accumulates the output data from the audio output driver 5, while starting from the oldest data. This is given to D / A7.

ディジタル−アナログ変換器（Ｄ／Ａ）７は、途切れ緩衝用蓄積バッファ６から受け取ったディジタル音声信号をアナログ音声信号に変換し、そのアナログ変換信号をスピーカ端子９を通じてスピーカ１０に与えるものである。 The digital-analog converter (D / A) 7 converts the digital audio signal received from the interruption buffer storage buffer 6 into an analog audio signal, and applies the analog conversion signal to the speaker 10 through the speaker terminal 9.

スピーカ１０は、受信したアナログ音声信号を空間に出力するものである。また、マイク１１は、話者が発した音声を捕捉し、マイク端子１２を通じてアナログ−ディジタル変換器（Ａ／Ｄ）１３に与えるものである。 The speaker 10 outputs the received analog audio signal to space. Further, the microphone 11 captures the voice uttered by the speaker and supplies it to the analog-digital converter (A / D) 13 through the microphone terminal 12.

このとき、スピーカ１０から空間に放射された音声信号の一部はマイク１１に漏れこみ、この音声信号もマイク端子１２を経由してＡ／Ｄ１３に入力される。 At this time, part of the audio signal radiated from the speaker 10 into the space leaks into the microphone 11, and this audio signal is also input to the A / D 13 via the microphone terminal 12.

アナログ−ディジタル変換器（Ａ／Ｄ）１３は、マイク１１からのアナログ音声信号をディジタル音声信号に変換し、そのディジタル音声信号を送信データの途切れ緩衝用蓄積バッファ１４に与えるものである。このとき、スピーカ１０から漏れた音声信号もディジタル変換される。 The analog-to-digital converter (A / D) 13 converts an analog audio signal from the microphone 11 into a digital audio signal, and supplies the digital audio signal to the transmission buffer interruption buffer storage buffer 14. At this time, the audio signal leaked from the speaker 10 is also digitally converted.

途切れ緩衝用蓄積バッファ１４は、音声入力ドライバー１５からの制御信号を受けており、その制御信号による制御の下、Ａ／Ｄ１３から受信した音声信号を一時的に蓄積しながら、古いデータから順番に音声入力ドライバー１５に与えるものである。 The interruption buffer accumulation buffer 14 receives a control signal from the audio input driver 15 and, under the control of the control signal, temporarily accumulates the audio signal received from the A / D 13, while starting from the oldest data. This is given to the voice input driver 15.

音声入力ドライバー１５は、途切れ緩衝用蓄積バッファ１４から受け取った音声信号を、音声信号ｙ（ｋ）として第２の帯域制限フィルタ（ＢＰＦ２）１６及び第２の低域通過フィルタ（ＬＰＯ２）２７に与えるものである。 The audio input driver 15 supplies the audio signal received from the interruption buffer storage buffer 14 to the second band-limiting filter (BPF2) 16 and the second low-pass filter (LPO2) 27 as the audio signal y (k). Is.

第２の帯域制限フィルタ（ＢＰＦ２）１６は、音声入力ドライバー１５から信号ｙ（ｋ）を所望の周波数帯域に帯域制限し、加算器１７に与えるものである。第１の実施形態では、例えば３００〜３４００Ｈｚとするがこれに限定されない。 The second band limiting filter (BPF 2) 16 band-limits the signal y (k) from the voice input driver 15 to a desired frequency band and supplies it to the adder 17. In the first embodiment, for example, the frequency is set to 300 to 3400 Hz, but is not limited thereto.

第２の低域通過フィルタ（ＬＰＯ２）２７は、音声入力ドライバー１５からの信号ｙ（ｋ）に、低域通過フィルタ処理を施し、信号ｙ２（ｋ）として初期遅延判定部２２に出力するものである。ＬＰＯ２７の動作は、ＬＰＯ２６と同一であるのでここでは説明しない。 The second low-pass filter (LPO2) 27 performs low-pass filter processing on the signal y (k) from the audio input driver 15 and outputs the signal y2 (k) to the initial delay determination unit 22. is there. Since the operation of the LPO 27 is the same as that of the LPO 26, it will not be described here.

第２の加算器１７は、適応フィルタ２１からの出力信号ｙ’（ｋ）と、ＢＰＦ１６からの出力信号ｙ１（ｋ）とを受け取り、出力信号ｙ’（ｋ）と出力信号ｙ１（ｋ）との相殺処理を行なうものである。ここで、ｋはデータの順番を表す。また、第２の加算器１７は、出力信号ｅ（ｋ）を、適応フィルタ２１及び符号化器１８に与えるものである。 The second adder 17 receives the output signal y ′ (k) from the adaptive filter 21 and the output signal y1 (k) from the BPF 16, and outputs the output signal y ′ (k) and the output signal y1 (k). The offset process is performed. Here, k represents the order of data. The second adder 17 gives the output signal e (k) to the adaptive filter 21 and the encoder 18.

適応フィルタ２１は、初期遅延付加部２０の出力信号ｘ３（ｋ）と、第２の加算器１７の出力信号ｅ（ｋ）とを取り込み、逐次、信号ｅ（ｋ）のパワーが最小になるように、出力信号ｘ３（ｋ）と出力信号ｅ（ｋ）に基づいて擬似エコー信号ｙ’（ｋ）を作成し、その擬似エコー信号ｙ’（ｋ）を出力するものである。 The adaptive filter 21 takes in the output signal x3 (k) of the initial delay adding unit 20 and the output signal e (k) of the second adder 17 so that the power of the signal e (k) is sequentially minimized. In addition, a pseudo echo signal y ′ (k) is created based on the output signal x3 (k) and the output signal e (k), and the pseudo echo signal y ′ (k) is output.

ここで、適応フィルタ２１における擬似エコー信号ｙ’（ｋ）の作成は、既存技術である学習同定法等の係数更新アルゴリズムを適用して求めることができ、信号ｅ（ｋ）のパワーを最小にすることができるアルゴリズムであれば、どのようなものをアルゴリズムを適用してもよい。 Here, the generation of the pseudo echo signal y ′ (k) in the adaptive filter 21 can be obtained by applying a coefficient update algorithm such as a learning identification method which is an existing technique, and the power of the signal e (k) is minimized. Any algorithm can be applied as long as it can be applied.

符号化器１８は、第２の加算器１７の出力信号ｅ（ｋ）を受け取り、信号ｅ（ｋ）を符号化し、回線インターフェース１９を介してインターネット１に出力するものである。これにより、図示しない遠方話者に向かって出力される。 The encoder 18 receives the output signal e (k) of the second adder 17, encodes the signal e (k), and outputs it to the Internet 1 via the line interface 19. Thereby, it outputs toward a distant speaker (not shown).

初期遅延判定部２２は、ＬＰＯ２６の出力信号ｘ２（ｋ）と、ＬＰＯ２７の出力信号ｙ２（ｋ）とを受け取り、これら信号ｘ２（ｋ）及びｙ２（ｋ）に基づいて、エコー信号経路初期遅延を推定するものである。 The initial delay determination unit 22 receives the output signal x2 (k) of the LPO 26 and the output signal y2 (k) of the LPO 27, and determines the initial delay of the echo signal path based on these signals x2 (k) and y2 (k). To be estimated.

ここで、エコー信号経路初期遅延とは、第１の加算器２５を出発し、音声出力ドライバー５、受信データ途切れ緩衝用蓄積バッファ６、Ｄ／Ａ７、スピーカ１０、マイク１１、Ａ／Ｄ１３、送信データ途切れ緩衝用蓄積バッファ１４、音声入力ドライバーを経由して第２の加算器１７の入力に至るまでの経路遅延をいう。 Here, the initial delay of the echo signal path means that the first adder 25 is started, the audio output driver 5, the reception data interruption buffer storage buffer 6, D / A 7, the speaker 10, the microphone 11, the A / D 13, and the transmission This is a delay in the path from the data interruption buffer storage buffer 14 to the input of the second adder 17 via the audio input driver.

初期遅延付加部２０は、初期遅延判定部２２によりエコー信号経路初期遅延が判定されると、適応フィルタ２１の入力に対して時間τだけ遅延させた信号ｘ３（ｋ）を適用フィルタ２１に与えるものである。 When the initial delay determining unit 22 determines the initial delay of the echo signal path, the initial delay adding unit 20 gives the applied filter 21 a signal x3 (k) delayed by the time τ with respect to the input of the adaptive filter 21. It is.

（Ａ−２）第１の実施形態の動作
続いて、第１の実施形態に係る装置の動作を図１を参照しながら詳細に説明する。 (A-2) Operation of First Embodiment Subsequently, the operation of the apparatus according to the first embodiment will be described in detail with reference to FIG.

まず、図示しない遠方話者からの音声パケットデータは、インターネット１を経由して、ネットワーク端末２４内の回線インターフェース１９に与えられ、回線インターフェース１９から復号器２に与えられる。 First, voice packet data from a distant speaker (not shown) is given to the line interface 19 in the network terminal 24 via the Internet 1 and is given to the decoder 2 from the line interface 19.

復号器２において音声パケットデータは復号され、復号された音声波形信号(音声信号)は、ＢＰＦ４に与えられ、ＢＰＦ４により所望周波数帯域に制限され（ここでは、電話帯域である３００〜３４００Ｈｚに制限され）、帯域制限された音声信号は第１の加算器２５及び初期遅延付加部２０に与えられる。 The voice packet data is decoded by the decoder 2 and the decoded voice waveform signal (voice signal) is given to the BPF 4 and is limited to a desired frequency band by the BPF 4 (here, limited to a telephone band of 300 to 3400 Hz). ), The band-limited audio signal is supplied to the first adder 25 and the initial delay adding unit 20.

第１の加算器２５では、例えばネットワーク端末装置２４の電源投入により閉じたスイッチ２３を通じて、低周波発生器３から聴覚的に捕らえにくい低周波信号（例えば、２０Ｈｚのトーン信号）を入力している。 In the first adder 25, for example, a low frequency signal (for example, a 20 Hz tone signal) that is not easily audible is input from the low frequency generator 3 through a switch 23 that is closed when the network terminal device 24 is turned on. .

そして、ＢＰＦ４からの帯域制限された音声信号が加算器２５に与えられると、帯域制限された音声信号と２０Ｈｚのトーン信号（低周波信号）とは、加算器２５により加算され、信号ｘ１（ｋ）として音声出力ドライバー５及び低域通過フィルタ（ＬＰＯ１）２６に出力される。 When the band-limited audio signal from the BPF 4 is supplied to the adder 25, the band-limited audio signal and the 20 Hz tone signal (low frequency signal) are added by the adder 25, and the signal x1 (k ) To the audio output driver 5 and the low-pass filter (LPO1) 26.

信号ｘ１（ｋ）が低域通過フィルタ（ＬＰＯ１）２６に与えられると、信号ｘ１（ｋ）は低域通過フィルタ処理が施され（３０Ｈｚ未満）、低域通過フィルタ処理された信号ｘ２（ｋ）が初期遅延判定部２２に与えられる。 When the signal x1 (k) is given to the low-pass filter (LPO1) 26, the signal x1 (k) is subjected to low-pass filtering (less than 30 Hz), and the low-pass filtered signal x2 (k) Is provided to the initial delay determination unit 22.

加算器２５の出力信号ｘ１（ｋ）が音声出力ドライバー５に与えられると、受信データの途切れ緩衝用蓄積バッファ６に出力され、その後Ｄ／Ａ７に与えられる。そして、信号（ディジタル音声信号）は、Ｄ／Ａ７によりアナログ信号に変換されてスピーカ１０に与えられる。 When the output signal x1 (k) of the adder 25 is given to the audio output driver 5, it is outputted to the received data break buffer buffer 6 and then to the D / A 7. The signal (digital audio signal) is converted to an analog signal by the D / A 7 and applied to the speaker 10.

このとき、一般に、ネットワーク端末装置２４（例えばＰＣ）において、音声処理よりも優先度の高いプログラムが実行状態にあるときは、音声出力ドライバー２５は、途切れ緩衝用蓄積バッファ６に対しデータを出力できず、待機状態となる。 At this time, generally, in the network terminal device 24 (for example, PC), when a program having a higher priority than the audio processing is in an execution state, the audio output driver 25 can output data to the interruption buffer storage buffer 6. Instead, it enters a standby state.

この間、Ｄ／Ａ７には途切れ緩衝用蓄積バッファ６からあらかじめ蓄積されたデータが出力され、音声出力ドライバー５からのデータ出力がなくても、見かけ上、音声が欠けないようにＤ／Ａ７から音声信号がスピーカ１０に与えられ、スピーカ１０から音声が出力される。もちろん、途切れ緩衝用蓄積バッファ６に蓄積される音声データがなくなり、完全に枯渇すれば音声データはなくなってしまう。ただし、音声出力ドライバー５の待機状態が復帰さえすれば、音声は大きな遅延を受けながらも、やがてはスピーカ１０から音声が出力される。つまり現象としては”音声遅れ”、または”音声の間あき”になる。 During this time, the data accumulated in advance from the interruption buffer accumulation buffer 6 is output to the D / A 7, and even if there is no data output from the audio output driver 5, the audio from the D / A 7 does not appear to be missing. A signal is given to the speaker 10, and sound is output from the speaker 10. Of course, there is no audio data stored in the interruption buffer storage buffer 6, and there is no audio data if it is completely exhausted. However, as long as the standby state of the audio output driver 5 is recovered, the audio is eventually output from the speaker 10 while receiving a large delay. In other words, the phenomenon is “voice delay” or “voice gap”.

一方で、なんらかの都合で、Ｄ／Ａ７が出力待機状態となると、途切れ緩衝用蓄積バッファ６のデータが出力されず、逆に途切れ緩衝用蓄積バッファ６の蓄積容量のほうが足りなくなるため、音声出力ドライバー５の出力データは破棄されてしまい、もはやスピーカ１０から音声が出力されず、”音声欠け”となる。 On the other hand, if the D / A 7 is in an output standby state for some reason, the data in the interruption buffer storage buffer 6 is not output, and conversely, the storage capacity of the interruption buffer storage buffer 6 becomes insufficient. The output data of No. 5 is discarded, and no sound is output from the speaker 10, resulting in “sound missing”.

上述のように途切れ緩衝用蓄積バッファ６では、データの音声信号が極力途切れないようにデータの蓄積と出力を行なう。 As described above, the interruption buffer storage buffer 6 stores and outputs data so that the audio signal of the data is not interrupted as much as possible.

すなわち、途切れ緩衝用蓄積バッファ６は、音声出力ドライバー５からの制御信号に応じた蓄積量だけ一旦蓄積したあと、古い順番から順にＤ／Ａ７に音声信号を出力し、Ｄ／Ａ７は、ディジタル音声信号をアナログ音声信号に変換して、スピーカ端子９を経由してスピーカ１０に出力するようにする。 That is, the interruption buffer accumulation buffer 6 temporarily accumulates the accumulation amount corresponding to the control signal from the audio output driver 5 and then outputs the audio signal to the D / A 7 in order from the oldest. The D / A 7 The signal is converted into an analog audio signal and output to the speaker 10 via the speaker terminal 9.

スピーカ１０は音声を空間に出力すると、出力された音声の一部がマイク１１に漏れこみ、入力信号としてマイク端子１２を経由してＡ／Ｄ１３に入力され、ディジタル信号に変換される。そして、ディジタル信号に変換された信号は、途切れ緩衝用蓄積バッファ１４に出力される。 When the speaker 10 outputs sound to the space, part of the output sound leaks into the microphone 11 and is input as an input signal to the A / D 13 via the microphone terminal 12 and converted into a digital signal. Then, the signal converted into the digital signal is output to the interruption buffer storage buffer 14.

途切れ緩衝用蓄積バッファ１４は、データの音声信号が途切れないように、Ａ／Ｄ１３からの音声信号のデータを蓄積する。 The interruption buffer accumulation buffer 14 accumulates the audio signal data from the A / D 13 so that the data audio signal is not interrupted.

ここで、マイク１１側でもスピーカ１０側で説明した状態と同じような状態が発生する場合があるので、マイク１１側での動作を説明しておく。 Here, since a state similar to the state described on the speaker 10 side may occur on the microphone 11 side, the operation on the microphone 11 side will be described.

ネットワーク端末装置２４（例えばＰＣ）が音声処理よりも優先度の高いプログラムを実行しようとするとき、Ａ／Ｄ１３は、途切れ緩衝用蓄積バッファ１４に対しデータを出力できず、待機状態となる。 When the network terminal device 24 (for example, PC) tries to execute a program having a higher priority than the voice processing, the A / D 13 cannot output data to the interruption buffer storage buffer 14 and enters a standby state.

この間、音声入力ドライバー１５には途切れ緩衝用畜積バッファ１４からあらかじめ蓄積されたデータが出力され、Ａ／Ｄ１３からのデータ出力がなくても、見かけ上、音声が欠けないように音声入力ドライバー１５に対して音声データが出力される。もちろん、途切れ緩衝用蓄積バッファ１４が完全に枯渇すれば音声データはなくなってしまう。 During this time, the data stored in advance from the interrupt buffer stock buffer 14 is output to the audio input driver 15, and even if there is no data output from the A / D 13, the audio input driver 15 does not seem to lack audio. Audio data is output. Of course, if the interruption buffer storage buffer 14 is completely depleted, the audio data is lost.

ただし、音声入力ドライバー１５の待機状態が復帰さえすれば、音声は大きな遅延を受けながらも、やがては音声入力ドライバー１５から信号が出力される。つまり現象としては送信する音声の”音声遅れ”、または”音声の間あき”になる。 However, as long as the standby state of the voice input driver 15 is restored, a signal is output from the voice input driver 15 in the long run while the voice is subjected to a large delay. In other words, the phenomenon is “voice delay” or “between voice” of the voice to be transmitted.

一方で、なんらかの都合で、音声入力ドライバー１５のデータ出力が待機するときは、途切れ緩衝用蓄積バッファ１４は、音声入力ドライバー１５になにも出力できないから、逆に途切れ緩衝用蓄積バッファ１４の蓄積容量のほうが足りなくなり、新たなＡ／Ｄ１３の出力データは破棄されてしまい、もはや音声入力ドライバー１５に対して出力できず、”音声欠け”となる。 On the other hand, when the data output of the voice input driver 15 is on standby for some reason, the interruption buffer storage buffer 14 cannot output anything to the voice input driver 15. The capacity becomes insufficient, and the output data of the new A / D 13 is discarded, and it can no longer be output to the audio input driver 15, resulting in “audio missing”.

このような音声欠けや音声遅れが極力発生しないよう、送信データの途切れ緩衝用蓄積バッファ１４や受信データの途切れ緩衝用蓄積バッファ６の蓄積量は、あらかじめネットワーク端末設計者等によって適宜設定される。 The amount of transmission data stored in the interruption buffer storage buffer 14 and the reception data interruption buffer storage buffer 6 is appropriately set in advance by a network terminal designer or the like so as to prevent such a lack of voice and voice delay as much as possible.

上述したように、途切れ緩衝用蓄積バッファ１４は、音声欠けがないように、音声入力ドライバー１５からの制御信号に応じた蓄積分だけ蓄積したあと、古い蓄積順番から順に音声入力ドライバー１５に音声データを出力する。 As described above, the interruption buffer accumulation buffer 14 accumulates the accumulated amount corresponding to the control signal from the audio input driver 15 so that there is no missing audio, and then stores the audio data in the audio input driver 15 in order from the oldest accumulation order. Is output.

途切れ緩衝用蓄積バッファ１４からの音声データが音声入力ドライバー１５に与えられると、音声信号ｙ（ｋ）が、低域通過フィルタ（ＬＰＯ２）２７及び帯域制限フィルタ（ＢＰＦ２）１６に与えられる。 When the audio data from the interruption buffer storage buffer 14 is supplied to the audio input driver 15, the audio signal y (k) is supplied to the low-pass filter (LPO 2) 27 and the band limiting filter (BPF 2) 16.

音声入力ドライバー１５からの出力信号ｙ（ｋ）が帯域制限フィルタ（ＢＰＦ２）１６に与えられると、信号ｙ（ｋ）は帯域制限フィルタ処理が施され（例えば、３００〜３４００Ｈｚ）、信号ｙ１（ｋ）として第２の加算器１７に出力される。 When the output signal y (k) from the audio input driver 15 is given to the band limiting filter (BPF2) 16, the signal y (k) is subjected to band limiting filtering (for example, 300 to 3400 Hz), and the signal y1 (k Is output to the second adder 17.

加算器１７においては、帯域制限フィルタ（ＢＰＦ２）１６の信号ｙ１（ｋ）から、適応フィルタ２１で生成された擬似エコー信号ｙ’（ｋ）が相殺され、エコー成分が除去された信号ｅ（ｋ）が、符号化器１８に与えられて、符号化処理がなされて、音声パケットデータがインターネット１を通じて遠端話者に送信される。 The adder 17 cancels the pseudo echo signal y ′ (k) generated by the adaptive filter 21 from the signal y1 (k) of the band limiting filter (BPF2) 16 and the signal e (k) from which the echo component is removed. ) Is provided to the encoder 18 to perform encoding processing, and voice packet data is transmitted to the far-end speaker through the Internet 1.

このとき、加算器１７からの出力信号ｅ（ｋ）は適応フィルタ２１にも戻されて、適応フィルタ２１におけるフィルタ係数更新処理に寄与される。 At this time, the output signal e (k) from the adder 17 is also returned to the adaptive filter 21 and contributes to the filter coefficient updating process in the adaptive filter 21.

一方、音声入力ドライバー１５からの出力信号ｙ（ｋ）が低域通過フィルタ（ＬＰＯ２）２７に与えられると、信号ｙ（ｋ）は低域通過フィルタ処理が施され（例えば、３０Ｈｚ未満）、信号ｙ２（ｋ）として初期遅延判定部２２に出力される。 On the other hand, when the output signal y (k) from the audio input driver 15 is given to the low-pass filter (LPO2) 27, the signal y (k) is subjected to low-pass filter processing (for example, less than 30 Hz), and the signal This is output to the initial delay determination unit 22 as y2 (k).

初期遅延判定部２２では、第１の低域通過フィルタ（ＬＰＯ１）２６の出力信号ｘ２（ｋ）及び第２の低域通過フィルタ（ＬＰＯ２）２７の出力信号ｙ２（ｋ）が入力されており、信号ｘ２（ｋ）及び信号ｙ２（ｋ）に基づいてエコー信号経路初期遅延が推定される。 The initial delay determination unit 22 receives the output signal x2 (k) of the first low-pass filter (LPO1) 26 and the output signal y2 (k) of the second low-pass filter (LPO2) 27, An initial delay of the echo signal path is estimated based on the signal x2 (k) and the signal y2 (k).

ここでは、初期遅延判定部２２によるエコー信号経路初期遅延を推定する動作を説明する。 Here, an operation of estimating the initial delay of the echo signal path by the initial delay determination unit 22 will be described.

初期遅延判定部２２は、信号ｘ２（ｋ）と信号ｙ２（ｋ）を用いて、下記式（１）に示す相互相関関数を用いて、エコー信号経路初期遅延を推定する。

The initial delay determination unit 22 estimates the echo signal path initial delay using the signal x2 (k) and the signal y2 (k), using the cross-correlation function shown in the following equation (1).

第１の実施形態では、Ｎ＝３２００００とする。 In the first embodiment, N = 320000.

式（１）は、２つの信号の似通い度を調べるために用いられる一般的な関数（相互相関関数）であり、信号ｘ２（ｋ）と信号ｙ２（ｋ）が似ていれば大きな値をとり、似ていなければ小さな値をとるものである。 Expression (1) is a general function (cross-correlation function) used to examine the similarity between two signals. If the signal x2 (k) and the signal y2 (k) are similar, a large value is obtained. If it is not similar, it takes a small value.

信号ｘ２（ｋ）は、低周波信号の２０Ｈｚのトーン信号であるので、サンプリング周波数を決めれば、トーン信号は１周期サンプル数に１回、自分自身と同じ波形になるので、関数Ｒ（ｌ）は大きな値をとる。 Since the signal x2 (k) is a low-frequency 20 Hz tone signal, if the sampling frequency is determined, the tone signal has the same waveform as itself once per cycle sample number, so the function R (l) Takes a large value.

例えば、信号ｘ２（ｋ）とｙ２（ｋ）とが同じ信号であれば、ｙ２（ｋ）＝ｘ２（ｋ）となるから、ｌ＝０としたときと同じであり、上記式（１）に代入すると、式（２）より関数Ｒ（ｌ）の最大値は１／２となる。

For example, if the signals x2 (k) and y2 (k) are the same signal, y2 (k) = x2 (k), which is the same as when l = 0. When substituted, the maximum value of the function R (l) is ½ from equation (2).

また、信号ｙ２（ｋ）が反転していれば、関数Ｒ（ｌ）の最大値は−１／２をとる。さらに、まったく信号がない、あるいは信号同士が関係ないときには、関数Ｒ（ｌ）は０となる。 If the signal y2 (k) is inverted, the maximum value of the function R (l) takes −½. Furthermore, the function R (l) is 0 when there is no signal at all or the signals are not related to each other.

すなわち、関数Ｒ（ｌ）の絶対値｜Ｒ（ｌ）｜を計算すれば、信号の似通いの具合を知ることができ、値は０≦｜Ｒ（ｌ）｜≦１／２となる。 That is, if the absolute value | R (l) | of the function R (l) is calculated, the degree of similarity of the signals can be known, and the value is 0 ≦ | R (l) | ≦ 1/2.

信号ｙ２（ｋ）は、信号ｘ２（ｋ）に起因するエコー信号であるから、基本的には信号ｘ２（ｋ）と同様にトーン信号であるため、信号ｘ２（ｋ）と信号ｙ２（ｋ）とを用いて、上記式（１）で計算すると、本来は１周期毎に大きな関数値を示す。 Since the signal y2 (k) is an echo signal resulting from the signal x2 (k), it is basically a tone signal similarly to the signal x2 (k), and therefore the signal x2 (k) and the signal y2 (k) When the above equation (1) is used for calculation, a large function value is originally shown for each cycle.

しかし、信号ｙ２（ｋ）は、初期遅延のあとエコー信号が到来する信号なので、初期遅延の間はｙ２（ｋ）＝０となっており、相互相関関数値は小さな値しかとらない。その後、信号がはじめて大きな値をとるのは、相互相関関数のずらし量ｌが１周期に達したときか、エコー経路の初期遅延τに一致したときである。 However, since the signal y2 (k) is a signal in which an echo signal arrives after the initial delay, y2 (k) = 0 during the initial delay, and the cross-correlation function value takes only a small value. Thereafter, the signal takes a large value for the first time when the shift amount l of the cross-correlation function reaches one period or when it coincides with the initial delay τ of the echo path.

つまり、初期遅延判定部２２は、式（１）を用いて信号ｘ２（ｋ）の１周期分を計算し、１周期以内に最大を示したずらし量ｌがエコー経路の初期遅延τであると判定する。 That is, the initial delay determination unit 22 calculates one period of the signal x2 (k) using the equation (1), and the shift amount l that shows the maximum within one period is the initial delay τ of the echo path. judge.

初期遅延判定部２２は、推定した初期遅延τを、初期遅延付加部２０に出力し、初期遅延付加部２０は、適応フィルタ２１の入力を時間τだけ遅延させる。 The initial delay determining unit 22 outputs the estimated initial delay τ to the initial delay adding unit 20, and the initial delay adding unit 20 delays the input of the adaptive filter 21 by time τ.

適応フィルタ２１は、初期遅延付加部２０の出力ｘ３（ｋ）と加算器の出力ｅ（ｋ）を用いて擬似エコー信号を作成する。 The adaptive filter 21 creates a pseudo echo signal using the output x3 (k) of the initial delay adding unit 20 and the output e (k) of the adder.

本実施形態では、式（１）を用いて相互相関を計算するようにしたが、式（１）を更に信号のパワー

In the present embodiment, the cross-correlation is calculated using the equation (1), but the equation (1) is further converted to the signal power.

で正規化して０≦｜Ｒ（ｌ）｜≦１で信号の相関を判定するようにしてもよく、２つの信号の相関が始めて発生するタイミングを計算できる方法であればどのような方法であってもかまわない。 The correlation between signals may be determined by 0 ≦ | R (l) | ≦ 1, and any method can be used as long as the timing at which the correlation between two signals first occurs can be calculated. It doesn't matter.

また、適応フィルタ２１における擬似エコーｙ’（ｋ）作成にあたっては、公知の学習同定法などのアルゴリズムを用いればよく特に限定はしない。 In addition, the creation of the pseudo echo y ′ (k) in the adaptive filter 21 is not particularly limited as long as an algorithm such as a known learning identification method is used.

（Ａ−３）第１の実施形態の効果
第１の実施形態では、途切れ緩衝用蓄積バッファを有するサウンドボード８を装備するハンズフリー通話装置において、復号器２で復号した入力音声信号を、帯域制限フィルタ４で所望周波数帯に帯域制限し、帯域制限フィルタ４からの出力に対して初期遅延付加部２０が遅延を加えるようにし、初期遅延付加部２０が加える遅延は、初期遅延判定２２部が判定した遅延τになるようにし、加える遅延については、初期遅延判定部２２でスイッチ２３を経由した低周波発生器３の出力を低域通過フィルタ処理した信号ｘ２（ｋ）と、エコー経路を経由したエコー信号を低域通過フィルタ処理した信号ｙ２（ｋ）との相互相関を計算することにより求めるようにした。 (A-3) Effect of First Embodiment In the first embodiment, in a hands-free call device equipped with a sound board 8 having an interruption buffer storage buffer, an input voice signal decoded by the decoder 2 is converted into a band. The band is limited to a desired frequency band by the limiting filter 4, and the initial delay adding unit 20 adds a delay to the output from the band limiting filter 4. The delay added by the initial delay adding unit 20 is determined by the initial delay determination unit 22. Regarding the delay to be added so as to be the determined delay τ, the signal x2 (k) obtained by subjecting the output of the low-frequency generator 3 via the switch 23 to the low-pass filter processing in the initial delay determination unit 22 and the echo path The obtained echo signal is obtained by calculating the cross-correlation with the signal y2 (k) obtained by performing the low-pass filter processing.

また、初期遅延判定部２２による初期遅延の判定の際、２つの信号入力について、いずれか一方の信号に低域通過フィルタを用いることで初期遅延の推定値がくるうことがある。そこで、第１の実施形態では、初期遅延判定部２２の２つの入力信号に同じ低域通過フィルタを用いることで、初期遅延の推定値がくるわないようにした。 In addition, when the initial delay is determined by the initial delay determining unit 22, an estimated value of the initial delay may be obtained by using a low-pass filter for either one of the two signal inputs. Therefore, in the first embodiment, the same low-pass filter is used for the two input signals of the initial delay determination unit 22 so that the estimated value of the initial delay does not come.

また、初期遅延判定部２２は、人間の耳には聞こえずらい低周波信号を発生する低周波発生器３の出力を用いるようにしたので、初期遅延計算の信号をスピーカ経由で出力しても通話の妨げになることがなく、帯域制限フィルタ１６を用いて、低周波信号を除去して適応フィルタ２１を駆動するようにした。 Further, since the initial delay determination unit 22 uses the output of the low frequency generator 3 that generates a low frequency signal that is difficult to hear for human ears, the initial delay calculation signal may be output via a speaker. The adaptive filter 21 is driven by removing the low frequency signal by using the band limiting filter 16 without hindering the call.

以上のことから、第１の実施形態によれば、初期遅延計算用の低周波信号が適応フィルタ２１の動作を妨げることなく、かつ、スピーカ端子９とマイク端子１２の間を結線が必要でないものであっても、ハードウエアの改造が不要の状態で、エコー信号を精度よく推定し、エコーの除去を行なうことができる。 From the above, according to the first embodiment, the low-frequency signal for calculating the initial delay does not interfere with the operation of the adaptive filter 21, and no connection is required between the speaker terminal 9 and the microphone terminal 12. Even so, it is possible to accurately estimate the echo signal and eliminate the echo without requiring hardware modification.

（Ｂ）第２の実施形態
次に、本発明のエコーキャンセラの第２の実施形態を図面を参照しながら説明する。 (B) Second Embodiment Next, a second embodiment of the echo canceller of the present invention will be described with reference to the drawings.

図３は、第１の実施形態に係るハンズフリー通話装置として機能する装置の内部構成を示すブロック図である。 FIG. 3 is a block diagram showing an internal configuration of a device that functions as a hands-free call device according to the first embodiment.

第２の実施形態の構成が第１の実施形態の構成と異なる点は、低周波発生器３を高周波発生器２Ａに変更する点、低域通過フィルタ２６及び２７を高域通過フィルタ２Ｂ及び２Ｃに変更する点であり、それ以外の構成要素は図１に示す第１の実施形態の構成と対応である。 The configuration of the second embodiment is different from the configuration of the first embodiment in that the low frequency generator 3 is changed to the high frequency generator 2A, and the low pass filters 26 and 27 are replaced with the high pass filters 2B and 2C. The other components correspond to the configuration of the first embodiment shown in FIG.

そのため、以下では、第２の実施形態の特徴となる点を詳細に説明し、第１の実施形態で説明した構成要素の詳細な説明については省略する。なお、図３において図１に対応する構成要素については対応する符号を付している。 Therefore, below, the point used as the characteristic of 2nd Embodiment is demonstrated in detail, and detailed description of the component demonstrated in 1st Embodiment is abbreviate | omitted. In FIG. 3, the components corresponding to those in FIG.

一般に、適応フィルタを駆動するためのアルゴリズムは、低周波数成分、例えば直流成分（以後ＤＣ）が混入すると、収束性能が急激に劣化することが知られている。したがって、第１の実施形態において、このような低周波成分の影響を適応フィルタに及ぼさないようにするには、精密に低域周波数をカットする音声帯域フィルタ１６を具備する必要があり、このような音声帯域フィルタは高価なものとなり、装置としても高価な装置となる。 In general, it is known that an algorithm for driving an adaptive filter rapidly degrades convergence performance when a low frequency component, for example, a direct current component (hereinafter referred to as DC) is mixed. Therefore, in the first embodiment, in order to prevent the influence of such a low frequency component from affecting the adaptive filter, it is necessary to include the audio band filter 16 that precisely cuts the low frequency. A voice bandpass filter is expensive and expensive as a device.

そこで、第２の実施形態では、高域周波数成分を用いて初期遅延を求めるようにすることで、装置コストの低減を図るものである。 Therefore, in the second embodiment, the apparatus cost is reduced by obtaining the initial delay using the high frequency component.

高周波発生器２Ａは、例えば、電話帯域の上限帯域近くの周波数帯域、又はそれ以上の周波数帯域の信号を発生するものである。本実施形態では、高周波発生器２Ａは、７ｋＨｚのトーン信号を発生するものとするが、これに限定されない。 The high frequency generator 2A generates a signal in a frequency band near the upper limit band of the telephone band or a frequency band higher than that, for example. In the present embodiment, the high frequency generator 2A generates a 7 kHz tone signal, but is not limited thereto.

高域通過フィルタ（ＨＰＦ１）２Ｂは、加算器２５からの出力信号ｘ１（ｋ）を取り込み、所定の高域成分を通過させるフィルタであり、出力は初期遅延判定部２２に与えるものである。また、高域通過フィルタ（ＨＰＦ２）２Ｃは、音声入力ドライバー１５からの出力信号ｙ（ｋ）を取り込み、所定の高域成分を通過させるフィルタであり、出力は初期遅延判定部２２に与えるものである。本実施形態では、高域通過フィルタ（ＨＰＦ１）２Ｂ及び（ＨＰＦ２）２Ｃは、７ｋＨｚ以上の成分を通過させるものとする。 The high-pass filter (HPF1) 2B is a filter that takes in the output signal x1 (k) from the adder 25 and passes a predetermined high-frequency component. The output is given to the initial delay determination unit 22. The high-pass filter (HPF2) 2C is a filter that takes in the output signal y (k) from the audio input driver 15 and passes a predetermined high-frequency component. The output is given to the initial delay determination unit 22. is there. In the present embodiment, the high-pass filters (HPF1) 2B and (HPF2) 2C pass components of 7 kHz or higher.

次に、第２の実施形態に係るエコーキャンセラの動作を説明する。まず、高周波発生器２Ａからの高周波信号（７ｋＨｚのトーン信号）は、スイッチ２３を介して加算器２５に与えられる。なお、スイッチ２３は、第１の実施形態と同様に、初期状態は開放しており、例えば装置電源投入時等に閉鎖するものであり、運用者等による所望タイミングで閉鎖できるようにしてもよい。 Next, the operation of the echo canceller according to the second embodiment will be described. First, a high frequency signal (7 kHz tone signal) from the high frequency generator 2 </ b> A is given to the adder 25 via the switch 23. As in the first embodiment, the switch 23 is open in the initial state, and is closed when the apparatus is turned on, for example, and may be closed at a desired timing by an operator or the like. .

加算器２５において、帯域制限フィルタ（ＢＦＰ１）４からの出力信号ｘ（ｋ）と高周波信号とが加算され、その加算された信号ｘ１（ｋ）は高域通過フィルタ（ＨＰＦ１）２Ｂと音声出力ドライバー５に与えられる。 In the adder 25, the output signal x (k) from the band limiting filter (BFP1) 4 and the high-frequency signal are added, and the added signal x1 (k) is the high-pass filter (HPF1) 2B and the audio output driver. Is given to 5.

なお、音声出力ドライバー５からエコー経路を通って再び音声入力ドライバー１５に与えられるまでの動作は、第１の実施形態で説明した動作と同様であるから、ここでの詳細な説明は省略する。 Since the operation from the audio output driver 5 through the echo path to the audio input driver 15 is similar to the operation described in the first embodiment, detailed description thereof is omitted here.

音声入力ドライバー１５からの出力信号ｙ（ｋ）は、帯域制限フィルタ（ＢＰＦ２）１６と高域通過フィルタ（ＨＰＦ２）２Ｃとに与えられる。 The output signal y (k) from the audio input driver 15 is given to the band limiting filter (BPF2) 16 and the high-pass filter (HPF2) 2C.

加算器２５からの出力信号ｘ１（ｋ）が高域通過フィルタ（ＨＰＦ１）２Ｂに与えられると、信号ｘ１（ｋ）は高域通過フィルタ処理（７ｋＨｚ以上の成分通過処理）が施され、高域通過フィルタ処理された信号ｘ２（ｋ）が初期遅延判定部２２に与えられる。 When the output signal x1 (k) from the adder 25 is given to the high-pass filter (HPF1) 2B, the signal x1 (k) is subjected to high-pass filter processing (component pass processing of 7 kHz or more), The signal x2 (k) subjected to the pass filter processing is given to the initial delay determination unit 22.

また、音声入力ドライバー１５からの出力信号ｙ（ｋ）が高域通過フィルタ（ＨＰＦ２）２Ｃに与えられると、信号ｙ（ｋ）は高域通過フィルタ処理（７ｋＨｚ以上の成分通過処理）が施され、高域通過フィルタ処理された信号ｙ２（ｋ）が初期遅延判定部２２に与えられる。 When the output signal y (k) from the audio input driver 15 is given to the high-pass filter (HPF2) 2C, the signal y (k) is subjected to high-pass filter processing (component pass processing of 7 kHz or more). The high-pass filter processed signal y2 (k) is supplied to the initial delay determination unit 22.

なお、高域通過フィルタ（ＨＰＦ１）２Ｂ及び（ＨＰＦ２）２Ｃは共に、通常音声周波数帯域を阻止し、遅延推定信号を通過させる。また、本実施形態では、７ｋＨｚ以上の周波数成分だけを通過させるようにした。 Note that the high-pass filters (HPF1) 2B and (HPF2) 2C both block the normal audio frequency band and pass the delay estimation signal. In the present embodiment, only frequency components of 7 kHz or higher are allowed to pass.

初期遅延判定部２２における信号ｘ２（ｋ）と信号ｙ２（ｋ）とに基づく初期遅延の求める動作や適応フィルタ２１の動作は第１の実施形態と同様であるので、ここでの詳細な説明は省略する。 Since the initial delay determining unit 22 obtains the initial delay based on the signal x2 (k) and the signal y2 (k) and the operation of the adaptive filter 21 is the same as that of the first embodiment, the detailed description here is as follows. Omitted.

以上のように、第２の実施形態によれば、第１の実施形態と同様の効果を得ることができる。 As described above, according to the second embodiment, the same effect as that of the first embodiment can be obtained.

また、第２の実施形態によれば、高周波発生器２Ａ、高域通過フィルタ（ＨＰＦ１）２Ｂ及び（ＨＰＦ２）２Ｃを用いることとしたので、低周波周波数に対する対策が不要であるから、音声帯域フィルタに対する精密さが要求されず、結果としてフィルタ構成が簡単となり、装置コストを低減させることができる。 Further, according to the second embodiment, since the high frequency generator 2A, the high-pass filters (HPF1) 2B and (HPF2) 2C are used, it is not necessary to take measures against the low frequency frequencies. Is not required, and as a result, the filter configuration becomes simple and the apparatus cost can be reduced.

さらに、例えば図示しない遠方話者側装置が同一構成の装置であるときなどは、相手側にも音声帯域フィルタ４があることになり、適応アルゴリズムへの影響の観点からは、音声帯域フィルタ１６を省略することも可能となり、さらに装置の製造コストを小さくすることができる。 Further, for example, when the far-side device (not shown) has the same configuration, the other side also has the voice band filter 4. From the viewpoint of the influence on the adaptive algorithm, the voice band filter 16 is set. It can be omitted, and the manufacturing cost of the apparatus can be further reduced.

（Ｃ）第３の実施形態
次に、本発明のエコーキャンセラの第３の実施形態を図４のブロック図を参照しながら説明する。 (C) Third Embodiment Next, a third embodiment of the echo canceller of the present invention will be described with reference to the block diagram of FIG.

第３の実施形態に係るハンズフリー通話装置は、ステレオサウンドに対応可能なサウンドボード８を装備する場合である。また、第３の実施形態は、従来の技術では、通信装置がステレオサウンドにも対応する場合、１チャンネル分を制御用に結線することでステレオ性能を損なってしまうことを鑑みてなされたものである。 The hands-free communication device according to the third embodiment is a case where a sound board 8 capable of supporting stereo sound is provided. In addition, the third embodiment is made in view of the fact that in the conventional technology, when the communication device also supports stereo sound, stereo performance is impaired by connecting one channel for control. is there.

図４に示すように、第３の実施形態に係るネットワーク端末装置２４は、回線インターフェース１９、復号器２、低周波発生器３、高周波発生器２Ａ、スイッチ２３（２３−１、２３−２）、第１の帯域制限フィルタ（ＢＰＦ１）４、第１の高域通過フィルタ（ＨＰＦ１）２Ｂ、加算器２５（２５−１、２５−２）、音声出力ドライバー５、音声入力ドライバー１５、第２の帯域制限フィルタ（ＢＰＦ２）１６、加算器１７（１７−１、１７−２）、符号化器１８、初期遅延付加部２０、適応フィルタ２１（２１−１、２１−２）、初期遅延判定部２２、第１の低域通過フィルタ（ＬＰＯ１）２６、第２の低域通過フィルタ（ＬＰＯ２）２７、第２の高域通過フィルタ（ＨＰＦ２）２Ｃを有する。 As shown in FIG. 4, the network terminal device 24 according to the third embodiment includes a line interface 19, a decoder 2, a low frequency generator 3, a high frequency generator 2A, and a switch 23 (23-1, 23-2). , First band-limiting filter (BPF1) 4, first high-pass filter (HPF1) 2B, adder 25 (25-1, 25-2), audio output driver 5, audio input driver 15, second Band-limiting filter (BPF2) 16, adder 17 (17-1, 17-2), encoder 18, initial delay adding unit 20, adaptive filter 21 (21-1, 21-2), initial delay determining unit 22 , A first low-pass filter (LPO1) 26, a second low-pass filter (LPO2) 27, and a second high-pass filter (HPF2) 2C.

また、第３の実施形態に係るサウンドボード８は、ステレオサウンドに対応可能なものであり、途切れ緩衝用蓄積バッファ６、ディジタル−アナログ変換器（以後、Ｄ／Ａという）７、スピーカ１０（１０−１、１０−２）と接続するスピーカ端子９（９−１、９−２）、マイク１１（１１−１、１１−２）と接続するマイク端子１２（１２−１、１２−２）、アナログ‐ディジタル変換器（以後、Ａ／Ｄという）１３、途切れ緩衝用蓄積バッファ１４を有する。 Further, the sound board 8 according to the third embodiment is capable of dealing with stereo sound, and includes an interruption buffer storage buffer 6, a digital-analog converter (hereinafter referred to as D / A) 7, and a speaker 10 (10). -1, 10-2) connected to speaker terminals 9 (9-1, 9-2), microphone terminals 12 (12-1, 12-2) connected to microphones 11 (11-1, 11-2), An analog-digital converter (hereinafter referred to as A / D) 13 and an interruption buffer storage buffer 14 are provided.

なお、図４では、第１及び第２の実施形態の構成と対応する構成については図１及び図３で示した符号と対応する符号を付している。さらに、以下では、第３の実施形態に特徴的な構成について詳細に説明する。 In FIG. 4, components corresponding to those in the first and second embodiments are denoted by reference symbols corresponding to the reference symbols illustrated in FIGS. 1 and 3. Further, the configuration characteristic to the third embodiment will be described in detail below.

図４に示すように、第３の実施形態に係るネットワーク端末装置２４は、第１の実施形態で説明した低周波発生器３、スイッチ２３−２、加算器２５−２、低域通過フィルタ（ＬＰＯ１）２６及び（ＬＰＯ２）２７と、第２の実施形態で説明した高周波発生器２Ａ、スイッチ２３−１、加算器２５−１、高域通過フィルタ（ＨＰＦ１）２Ｂ及び（ＨＰＦ２）２Ｃとを備える。 As shown in FIG. 4, the network terminal device 24 according to the third embodiment includes the low-frequency generator 3, the switch 23-2, the adder 25-2, the low-pass filter (described in the first embodiment). LPO1) 26 and (LPO2) 27, and the high-frequency generator 2A, switch 23-1, adder 25-1, high-pass filter (HPF1) 2B and (HPF2) 2C described in the second embodiment. .

回線インターフェース１９は、インターネット１から入力された信号をステレオ信号に変換して、ステレオ信号を形成する２チャネルの信号を復号器２に与えるものである。また、符号化部１８により符号化された送信データとしての２チャンネルの信号を受け取ると、インターネッ卜１に出力するために適したデータサイズ、構成のパケットに整形し、図示しない相手側話者に出力するものである。 The line interface 19 converts a signal input from the Internet 1 into a stereo signal, and gives a 2-channel signal forming the stereo signal to the decoder 2. Also, when receiving a 2-channel signal as transmission data encoded by the encoding unit 18, it is shaped into a packet having a data size and configuration suitable for output to the Internet 1, and is sent to a partner speaker (not shown). Output.

なお、本実施形態では、回線インターフェース１９がステレオ信号への変換機能を有するものとして説明するが、復号器２がステレオ信号への変換機能を備えるものとしてもよい。 In the present embodiment, the line interface 19 is described as having a function of converting to a stereo signal, but the decoder 2 may have a function of converting to a stereo signal.

復号器２は、回線インターフェース１９からの２チャネルの信号のそれぞれに対して復号処理を施し、復号した２チャネルの信号を帯域制限フィルタ（ＢＰＦ１）４に与えるものである。 The decoder 2 performs a decoding process on each of the two-channel signals from the line interface 19 and supplies the decoded two-channel signals to the band limiting filter (BPF1) 4.

帯域制限フィルタ（ＢＰＦ１）４は、復号器２からの復号した２チャネルの信号それぞれに対して帯域制限のフィルタ処理を施し、各信号を加算器２５−１及び２５−２に与えるものである。 The band-limiting filter (BPF1) 4 performs band-limiting filter processing on each of the decoded 2-channel signals from the decoder 2 and gives the signals to the adders 25-1 and 25-2.

このとき、例えば、ステレオ信号のＲチャンネルの信号を低周波発生器３に接続可能な加算器２５−２に与え、又ステレオ信号のＬチャンネルの信号を高周波発生器２Ａに接続可能な加算器２５−１に与える。もちろん、それぞれのチャンネルの信号をいずれかの加算器に与えるものとできれば、これに限定されない。 At this time, for example, the R channel signal of the stereo signal is supplied to the adder 25-2 connectable to the low frequency generator 3, and the L channel signal of the stereo signal is added to the high frequency generator 2A. To -1. Of course, the present invention is not limited to this as long as the signal of each channel can be given to any adder.

加算器２５−１は、第２の実施形態と同様に、帯域制限フィルタ（ＢＰＦ１）４からの信号と高周波発生器２Ａからの出力を加算して、音声出力ドライバー５及び高域通過フィルタ（ＨＰＦ１）２Ｂに与える。 Similarly to the second embodiment, the adder 25-1 adds the signal from the band limiting filter (BPF1) 4 and the output from the high frequency generator 2A, and outputs the audio output driver 5 and the high-pass filter (HPF1). ) Give to 2B.

また、加算器２５−２は、第１の実施形態と同様に、帯域制限フィルタ（ＢＰＦ１）４からの信号と低周波発生器３からの出力を加算して、音声出力ドライバー５及び低域通過フィルタ（ＬＰＯ１）２６に与える。 Similarly to the first embodiment, the adder 25-2 adds the signal from the band limiting filter (BPF1) 4 and the output from the low frequency generator 3, and outputs the audio output driver 5 and the low-pass signal. The filter (LPO1) 26 is given.

音声入力ドライバー１５は、２チャンネルの出力信号を、帯域制限フィルタ（ＢＰＦ２）１６、高域通過フィルタ（ＨＰＦ２）２Ｃ及び低域通過フィルタ（ＬＰＯ２）２７に与える。 The audio input driver 15 supplies the output signals of the two channels to the band limiting filter (BPF2) 16, the high pass filter (HPF2) 2C, and the low pass filter (LPO2) 27.

帯域制限フィルタ（ＢＰＦ２）１６は、音声入力ドライバー１５からの２チャンネルの各信号を、加算器１７−１及び１７−２に与えるものである。 The band limiting filter (BPF2) 16 gives the two-channel signals from the audio input driver 15 to the adders 17-1 and 17-2.

加算器１７−１は、帯域制限フィルタ（ＢＰＦ２）１６から出力と適応フィルタ２１−１からの出力とに基づいて擬似エコー成分を減算し、出力を適応フィルタ２１−１及び符号化器１８に与える。 The adder 17-1 subtracts the pseudo echo component based on the output from the band limiting filter (BPF2) 16 and the output from the adaptive filter 21-1, and provides the output to the adaptive filter 21-1 and the encoder 18. .

また、加算器１７−２は、帯域制限フィルタ（ＢＰＦ２）１６から出力と適応フィルタ２１−２からの出力とに基づいて擬似エコー成分を減算し、出力を適応フィルタ２１−２及び符号化器１８に与える。 Further, the adder 17-2 subtracts the pseudo echo component based on the output from the band-limiting filter (BPF2) 16 and the output from the adaptive filter 21-2, and outputs the output to the adaptive filter 21-2 and the encoder 18. To give.

適応フィルタ２１−１及び２１−２は、初期遅延付加部２０から、帯域制限フィルタ４の２チャンネルの出力信号ｘ（ｋ）のそれぞれに対して初期遅延付加部２０により遅延を加えた信号が入力されている。 The adaptive filters 21-1 and 21-2 receive from the initial delay adding unit 20 a signal obtained by adding a delay by the initial delay adding unit 20 to each of the two-channel output signals x (k) of the band limiting filter 4. Has been.

初期遅延付加部２０は、帯域制限フィルタ４から２チャンネルの信号ｘ（ｋ）を取り込み、初期遅延判定部２２からの初期遅延制御信号に基づいてそれぞれの信号に遅延を加えて、それぞれの適応フィルタ２１−１及び２１−２に与えるものである。 The initial delay adding unit 20 takes in the two-channel signal x (k) from the band limiting filter 4, adds a delay to each signal based on the initial delay control signal from the initial delay determining unit 22, and sets each adaptive filter. 21-1 and 21-2.

初期遅延判定部２２は、低域通過フィルタ（ＬＰＯ１）２６及び（ＬＰＯ２）２７からの出力と、高域通過フィルタ（ＨＰＦ１）２Ｂ及び（ＨＰＦ２）２Ｃからの出力を取り込み、それぞれのチャンネルについての初期遅延を判定し、初期遅延付加部２０に与えるものである。 The initial delay determination unit 22 takes in the outputs from the low-pass filters (LPO1) 26 and (LPO2) 27 and the outputs from the high-pass filters (HPF1) 2B and (HPF2) 2C, and initializes each channel. The delay is determined and given to the initial delay adding unit 20.

以上のように、第３の実施形態によれば、第１及び第２の実施形態と同様の効果を得ることができる。 As described above, according to the third embodiment, the same effects as those of the first and second embodiments can be obtained.

また、第３の実施形態によれば、低周波発生器３側に係る経路を用いて例えばスピーカ１、マイク１に関わる経路をステレオのＲチヤンネル、高周波発生器２Ａ側に係る経路を用いて例えばスピーカ２、マイク２に関わる経路をステレオのＬチャンネルに割り当てることによって、ステレオサウンドを損なうことなく通信を実現する通信装置を提供できる。 In addition, according to the third embodiment, for example, a path related to the speaker 1 and the microphone 1 is used using a path related to the low frequency generator 3 side, and a path related to the stereo R channel and the high frequency generator 2A side is used, for example. By assigning a path related to the speaker 2 and the microphone 2 to the stereo L channel, it is possible to provide a communication device that realizes communication without impairing the stereo sound.

（Ｄ）第４の実施形態
次に、本発明のエコーキャンセラの第４の実施形態を図面を参照しながら説明する。 (D) Fourth Embodiment Next, a fourth embodiment of the echo canceller of the present invention will be described with reference to the drawings.

途切れ緩衝用蓄積バッファによる遅延の変化には２通りの種類があり、１つは初期遅延の量的な変動であり、もう１つは完全な信号の欠落である。 There are two types of delay changes caused by the interruption buffer accumulation buffer, one is a quantitative variation in the initial delay, and the other is a complete signal loss.

エコーを適切に打消すためには、適応フィルタがこれらの種類を区別することが望ましい。なぜなら、信号の欠落は途切れ緩衝用蓄積バッファの容量飽和に原因して発生するものであるから、欠落が発生している間は、欠落の量が不確定となるので、その間はエコー経路を一定のものと仮定したままで推定するのは不適切である。つまり、定常的な遅延となって、エコーに反映されるまでは追従しないほうがよい。 In order to properly cancel the echo, it is desirable for the adaptive filter to distinguish between these types. Because the signal loss occurs due to the capacity saturation of the accumulation buffer for the interruption buffer, the amount of the loss is indefinite while the loss is occurring, so the echo path is constant during that time. It is inappropriate to estimate with the assumption. In other words, it is better not to follow until a steady delay is reflected in the echo.

そこで、信号の欠落を区別できれば、その間は、初期遅延の推定を実行しないようにできる。 Therefore, if it is possible to distinguish the missing signal, the estimation of the initial delay can be prevented during that period.

図５は、第４の実施形態に係るハンズフリー通話装置の構成を示すブロック図である。 FIG. 5 is a block diagram showing a configuration of a hands-free call device according to the fourth embodiment.

図５において、第４の実施形態の構成が第１の実施形態の構成と異なる点は、低周波発生器３とスイッチ２３との間に位相反転器４１を備える点、タイマ４０を新たに備える点、初期遅延判定部２２が初期遅延判定部４２に変わった点である。 In FIG. 5, the configuration of the fourth embodiment is different from the configuration of the first embodiment in that a phase inverter 41 is provided between the low frequency generator 3 and the switch 23, and a timer 40 is newly provided. In other words, the initial delay determination unit 22 is changed to an initial delay determination unit 42.

なお、図５に示す第４の実施形態の構成要素のうち図１に示す第１の実施形態の構成要素と対応するものについては、図１と対応する符号を示してあり、これら構成要素についての詳細な説明は省略する。 In addition, about the component corresponding to the component of 1st Embodiment shown in FIG. 1 among the components of 4th Embodiment shown in FIG. 5, the code | symbol corresponding to FIG. 1 is shown, About these components The detailed description of is omitted.

タイマ４０は、スイッチ２３の閉鎖後の時間をカウントするものであり、例えば、低周波発生器３の１周期分の時間毎に位相反転器４１に対して位相反転タイミング信号ｒｅｖを出力するものである。 The timer 40 counts the time after the switch 23 is closed. For example, the timer 40 outputs a phase inversion timing signal rev to the phase inverter 41 every time of one cycle of the low frequency generator 3. is there.

位相反転器４１は、低周波発生器３が発生した低周波信号を取り込み、タイマ４０から位相反転タイミング信号ｒｅｖを受信するたびに、上記低周波信号を移送反転して出力するものである。 The phase inverter 41 takes in the low frequency signal generated by the low frequency generator 3 and transfers and inverts the low frequency signal every time the phase inversion timing signal rev is received from the timer 40.

初期遅延判定部４２は、低域通過フィルタ（ＬＰＯ１）２６からの出力信号ｘ２（ｋ）を取り込むと共に、低周波発生器３が発生した低周波信号を取り込む。そして、初期遅延判定部４２は、信号ｘ２（ｋ）と低周波信号とを加算し、加算結果が０になる周期、すなわち低周波信号の位相の反転周期ＲＥＶ＿Ｔを計算すると共に、低域通過フィルタ２６の出力においても、第１の実施形態で説明したように相関が位相の反転周期ＲＥＶ＿Ｔ毎に強い相関を表す値Ｃ０、例えば０．５（正規化をしたときは１．０）をとることを計算する。 The initial delay determination unit 42 takes in the output signal x2 (k) from the low-pass filter (LPO1) 26 and takes in the low-frequency signal generated by the low-frequency generator 3. Then, the initial delay determination unit 42 adds the signal x2 (k) and the low frequency signal, calculates a period when the addition result becomes 0, that is, calculates a phase inversion period REV_T of the low frequency signal, and a low-pass filter. Also in the output of 26, as described in the first embodiment, the correlation takes a value C0 indicating a strong correlation every phase inversion period REV_T, for example, 0.5 (1.0 when normalized). Calculate

相関が位相の反転周期ＲＥＶ＿Ｔ毎に所定の閾値Ｃ０を超えない場合は、初期遅延判定部４２は、初期遅延付加量の変更をしないままの値を初期遅延付加部２０に出力する。また、相関が位相の反転周期ＲＥＶ＿Ｔ毎に所定の閾値Ｃ０を超えた場合は、初期遅延判定部４２は、新たに初期遅延判定部４２で計算した初期遅延を付加するよう初期遅延付加部２０に初期遅延判定部４２の出力である遅延量を出力する。 When the correlation does not exceed the predetermined threshold value C0 for each phase inversion period REV_T, the initial delay determination unit 42 outputs a value without changing the initial delay addition amount to the initial delay addition unit 20. When the correlation exceeds a predetermined threshold value C0 for each phase inversion period REV_T, the initial delay determination unit 42 causes the initial delay addition unit 20 to newly add the initial delay calculated by the initial delay determination unit 42. A delay amount that is an output of the initial delay determination unit 42 is output.

ここで、初期遅延判定部４２による信号の欠落方法を説明する。信号の欠落が発生したときには信号の周期性が失われる。低周波発生器３からの低周波信号はトーン信号であるので、２周期目の信号は位相反転器４１により位相反転されており、次に大きな相関が発生するのは３周期目となる。 Here, a signal omission method by the initial delay determination unit 42 will be described. When signal loss occurs, signal periodicity is lost. Since the low-frequency signal from the low-frequency generator 3 is a tone signal, the signal in the second period is phase-inverted by the phase inverter 41, and the next largest correlation is generated in the third period.

従って、初期遅延付加部４２は、３周期目でちょうど強い相関を検知したとき、その間のエコー経路における信号の欠落は発生していないものと判定し、初期遅延の推定をしてもよいことと判断し、求めた遅延量を初期遅延付加部２０に与える。 Therefore, when the initial delay adding unit 42 detects a strong correlation in the third period, it determines that no signal loss has occurred in the echo path during that period, and may estimate the initial delay. The determined delay amount is given to the initial delay adding unit 20.

一方、３周期目以外の場所で強い相関を検知した場合、エコー経路において信号のどこかに欠落が生じているものと判定し、遅延をエコー除去に反映させないようにする。特に、信号の欠落があった場合は３周期目以後に強い相関が現れることが多い。 On the other hand, when a strong correlation is detected at a place other than the third period, it is determined that a signal is missing somewhere in the echo path, and the delay is not reflected in echo removal. In particular, when a signal is missing, a strong correlation often appears after the third period.

以上のように、第４の実施形態によれば、第１の実施形態と同様の効果を得ることができる。 As described above, according to the fourth embodiment, the same effect as that of the first embodiment can be obtained.

また、第４の実施形態によれば、初期遅延判定部４２が、加算器２５からの出力信号と低周波信号との相関をとることにより、途切れ緩衝用蓄積バッファによる遅延の変動が初期遅延の量的な変動によるものであるか又は信号欠落によるものであるかを判定することができ、その判定結果に応じて適切なエコーを都合よく推定し、通話品質を向上できる。 In addition, according to the fourth embodiment, the initial delay determination unit 42 correlates the output signal from the adder 25 and the low frequency signal, so that the delay variation due to the interruption buffer accumulation buffer is the initial delay. It is possible to determine whether it is due to quantitative fluctuation or signal loss, and it is possible to conveniently estimate an appropriate echo according to the determination result, thereby improving the call quality.

（Ｅ）第５の実施形態
次に、本発明のエコーキャンセラの第５の実施形態を図面を参照しながら説明する。 (E) Fifth Embodiment Next, a fifth embodiment of the echo canceller of the present invention will be described with reference to the drawings.

第５の実施形態は、本来、会話のためには余分な信号であるエコー経路推定のための遅延推定信号をできるだけ短く出力して済ますようにしたものである。 In the fifth embodiment, a delay estimation signal for echo path estimation, which is an extra signal for conversation, is output as short as possible.

非特許文献１（ｐ．１１５〜ｐ.１１７）では、人間の聴覚特性として、大きな音のあとに続く小さな音が感覚としてあたかも隠蔽され、聞き取れない現象があることを述べている（継時マスキング）。 Non-Patent Document 1 (p.115 to p.117) states that as a human auditory characteristic, there is a phenomenon that a small sound that follows a loud sound is concealed as a sensation and cannot be heard (continuous masking). ).

非可聴域の高周波は人間には聞こえないが、犬などの動物にとっては可聴域信号であることが知られており、人間にとっての非可聴域の高周波であっても長時間にわたって継続的にスピーカから出力することが望ましくない場合がある。例えば装置設置近辺に小動物、例えばペットなどがいるような場合には、当該不動物の異常挙動を誘発するおそれがある。このような場合には遅延推定信号である高周波の出力時間をできるだけ短くすることが望ましい。 High frequencies in the non-audible range cannot be heard by humans, but it is known to be an audible range signal for animals such as dogs. May not be desirable. For example, when there is a small animal such as a pet in the vicinity of the installation of the device, there is a risk of inducing abnormal behavior of the animal. In such a case, it is desirable to shorten the output time of the high frequency that is the delay estimation signal as much as possible.

そこで、第５の実施形態は、上記の点に鑑みて、本来、会話のためには余分な信号であるエコー経路推定のための遅延推定信号をできるだけ短く出力して済ますようにしたものである。 Therefore, in the fifth embodiment, in view of the above points, the delay estimation signal for estimating the echo path, which is an extra signal for conversation, can be output as short as possible. .

図６は、第５の実施形態に係るハンズフリー通話装置の構成を示すブロック図である。 FIG. 6 is a block diagram showing a configuration of a hands-free call device according to the fifth embodiment.

図６において、第５の実施形態の構成が第２の実施形態の構成と異なる点は、スイッチ２３がスイッチ５１に変更した点、加算器２５の前に音声レベル検出器５０を備える点である。 In FIG. 6, the configuration of the fifth embodiment is different from the configuration of the second embodiment in that the switch 23 is changed to the switch 51 and that an audio level detector 50 is provided in front of the adder 25. .

なお、図６に示す第５の実施形態の構成要素のうち図２に示す第２の実施形態の構成要素と対応するものについては、図２と対応する符号を示してあり、これら構成要素についての詳細な説明は省略する。 In addition, about the component corresponding to the component of 2nd Embodiment shown in FIG. 2 among the components of 5th Embodiment shown in FIG. 6, the code | symbol corresponding to FIG. 2 is shown, About these components The detailed description of is omitted.

スイッチ５１は、初期状態では開放であり、ネットワーク端末装置２４の立ち上げ時、例えば電源投入時に閉じられる。なお、スイッチ５１は、図示しない装置操作者の操作による指示信号などによって、電源投入時以外にも、所望のタイミングで閉じるようにしてもよい。例えば、適応フィルタ２１のリセットをしたいときなどにスイッチ５１を閉じるようにしてもよい。 The switch 51 is open in the initial state, and is closed when the network terminal device 24 is started up, for example, when the power is turned on. The switch 51 may be closed at a desired timing other than when the power is turned on by an instruction signal or the like by an operation of a device operator (not shown). For example, the switch 51 may be closed when it is desired to reset the adaptive filter 21.

音声レベル検出器５０は、帯域制限フィルタ４からの出力信号ｘ（ｋ）を取り込み、この信号ｘ（ｋ）の信号パワーを計算し、この信号ｘ（ｋ）のパワーが予め定められたレベル以上である場合に所定時間だけスイッチ５１を閉じるようにし、それ以外で開放させるものである。 The audio level detector 50 takes in the output signal x (k) from the band limiting filter 4, calculates the signal power of the signal x (k), and the power of the signal x (k) exceeds a predetermined level. In this case, the switch 51 is closed only for a predetermined time, and is opened otherwise.

ここで、本実施形態では、音声レベル検出器５０は、信号ｘ（ｋ）のパワーが継時マスキング効果を発生するのに十分なレベルとして、例えば３０ｄＢ以上のとき、その後１０ｍｓだけスイッチ５１を閉じるようにし、その時間経過後スイッチ５１を開放させるものとする。 Here, in the present embodiment, the audio level detector 50 closes the switch 51 for 10 ms after that when the power of the signal x (k) is sufficient to generate the continuous masking effect, for example, 30 dB or more. Thus, the switch 51 is opened after the lapse of time.

なお、音圧レベルの基準値である０ｄＢは公知のように、１０００Ｈｚ、２０μＰａを０ｄＢとすればよく、後述する維持マスキングに必要なレベルを計算できれば、どのような単位を用いてもかまわない。 As is well known, 0 dB which is a reference value of the sound pressure level may be set to 1000 Hz and 20 μPa to 0 dB, and any unit may be used as long as a level necessary for maintenance masking described later can be calculated.

このように、音声レベル検出器５０が、継時マスキング効果を発生するのに十分なレベルの信号パワーｘ（ｋ）を検出したときに、所定の時間だけ遅延推定信号を混ぜ込み、それ以外の時間には混ぜ込まないようにすることで、近辺事物に対する影響を最小化させることができる。 As described above, when the audio level detector 50 detects the signal power x (k) at a level sufficient to generate the continuous masking effect, the delay estimation signal is mixed for a predetermined time, By not mixing in time, the influence on nearby things can be minimized.

高周波発生器２Ａからの信号はスイッチ５１が閉じられている間だけ加算器２５に出力され、それ以外は加算器２５に出力されない。 The signal from the high frequency generator 2A is output to the adder 25 only while the switch 51 is closed, and is not output to the adder 25 otherwise.

以上のように、第５の実施形態によれば、第１及び第２の実施形態と同様の効果を得ることができる。 As described above, according to the fifth embodiment, the same effects as those of the first and second embodiments can be obtained.

また、第５の実施形態によれば、音声レベル検出器５０を設け、音声レベル検出器５０のレベル検出結果に応じて、大きなレベルの音声信号があるときにだけ、維持マスキングによって隠蔽されるレベルの遅延推定信号を、高周波発生器２Ａから短時間だけ出力するようにしたので、装置使用の話者など、装置の使用場所に不必要に長時間エコー遅延推定のための信号を出力することがなく、信号出力のための電力を節約すると共に、不可避的に装置近傍に位置して信号を受ける生物等に対し、与える影響を軽微にする安全な装置を実現することができる。 Further, according to the fifth embodiment, the audio level detector 50 is provided, and the level concealed by the maintenance masking only when there is a high level audio signal according to the level detection result of the audio level detector 50. The delay estimation signal is output from the high-frequency generator 2A for a short time, so that a signal for estimating the echo delay for a long time may be output unnecessarily to the place where the apparatus is used, such as a speaker using the apparatus. In addition, it is possible to realize a safe device that saves power for signal output and inevitably has a small influence on a living organism or the like that is positioned near the device and receives a signal.

（Ｆ）第６の実施形態
次に、本発明のエコーキャンセラの第６の実施形態を図面を参照しながら説明する。 (F) Sixth Embodiment Next, a sixth embodiment of the echo canceller of the present invention will be described with reference to the drawings.

非特許文献１（ｐ．１０９〜ｐ．１１３）では、人間の聴覚特性として、ある周波数を聞いている最中はその周波数の倍数に当たる周波数などの、特別な周波数に関しては、たとえ信号があっても聞き取り感度が鈍化する効果が示されている（以後、これをマスキング効果という）。 In Non-Patent Document 1 (p.109 to p.113), as a human auditory characteristic, there is a signal for a special frequency such as a frequency that is a multiple of the frequency while listening to a certain frequency. Has also been shown to have an effect of decreasing the listening sensitivity (hereinafter referred to as a masking effect).

これは、第５の実施形態における、大きな音に続いて小さな音があるときに小さな音に対する感度が鈍る「維持マスキング」と感度が鈍る点で性質が似ているが、マスクされる側とマスクする側に周波数成分として一定の関係があるという点で性質が異なっている。 This is similar in nature to the "maintenance masking" in which the sensitivity to a small sound is dull when there is a small sound following a loud sound in the fifth embodiment, but the sensitivity is dull. The nature is different in that there is a certain relationship as a frequency component on the side to be performed.

そこで、第６の実施形態では、このマスキング効果を利用して、遅延推定信号発生器の周波数帯域を低周波や高周波などの非可聴領域の信号としなくても同様の動作を実現することで、安全性を向上させ、装置の設計コストを軽減させることを図る。 Therefore, in the sixth embodiment, by using this masking effect, by realizing the same operation without using the frequency band of the delay estimation signal generator as a signal in a non-audible region such as a low frequency or a high frequency, To improve safety and reduce device design costs.

図７は、第６の実施形態に係るハンズフリー通話装置の構成を示すブロック図である。 FIG. 7 is a block diagram showing the configuration of the hands-free call device according to the sixth embodiment.

図７において、第６の実施形態の構成が第５の実施形態の構成と異なる点は、音声レベル検出器５０が音声周波数分析器６０に変更されている点、高周波発生器２Ａが被マスク信号発生器６２に変更されている点、スイッチ５１がスイッチ６１に変更されている点、高域通過フィルタ（ＨＰＦ１）２Ｂがなくなった点、高域通過フィルタ（ＨＰＦ２）２Ｃが周波数弁別器６３に変更されている点、帯域制限フィルタ（ＢＰＦ２）１６がなくなった点、初期遅延付加部２０の入力が加算器２５からの出力を取り入れる点である。 In FIG. 7, the configuration of the sixth embodiment is different from the configuration of the fifth embodiment in that the audio level detector 50 is changed to an audio frequency analyzer 60, and the high frequency generator 2A is a masked signal. Changed to generator 62, changed switch 51 to switch 61, removed high-pass filter (HPF1) 2B, changed high-pass filter (HPF2) 2C to frequency discriminator 63 The band limiting filter (BPF 2) 16 is eliminated, and the input of the initial delay adding unit 20 takes in the output from the adder 25.

なお、図７に示す第７の実施形態の構成要素のうち図６に示す第５の実施形態の構成要素と対応するものについては対応する符号を付し、これらの構成要素の詳細な説明は省略する。 Of the constituent elements of the seventh embodiment shown in FIG. 7, those corresponding to the constituent elements of the fifth embodiment shown in FIG. 6 are denoted by the corresponding reference numerals, and detailed descriptions of these constituent elements are as follows. Omitted.

音声周波数分析器６０は、帯域制限フィルタ（ＢＰＦ１）４からの出力を取り込み、入力音声信号を周波数成分に変換し、後述のように周波数成分に変換した信号のうち比較的周波数成分のパワーが弱い部分を少なくとも１以上求めるものである。 The audio frequency analyzer 60 takes in the output from the band limiting filter (BPF1) 4, converts the input audio signal into a frequency component, and the power of the frequency component is relatively weak among the signals converted into the frequency component as will be described later. At least one part is obtained.

また、音声周波数分析器６０は、上述した比較的周波数成分のパワーが弱い部分を求めると、その部分の周波数の半分に当たる周波数成分のパワーを求める。そして、音声周波数分析器６０は、当該部分の周波数成分のパワーと当該部分の周波数の半分に当たる周波数成分のパワーとに基づいて音声入力成分でない周波数を求めるものである。 In addition, when the voice frequency analyzer 60 obtains a portion where the power of the frequency component is relatively weak as described above, the voice frequency analyzer 60 obtains the power of the frequency component corresponding to half the frequency of the portion. The voice frequency analyzer 60 obtains a frequency that is not a voice input component based on the power of the frequency component of the part and the power of the frequency component corresponding to half the frequency of the part.

さらに、音声周波数分析器６０は、音声入力成分でない周波数を求めると、その音声入力成分でない周波数成分の信号を出力させるように、被マスク信号発生器６２、スイッチ６１及び周波数弁別器６３に指示するものである。 Furthermore, when the audio frequency analyzer 60 obtains a frequency that is not an audio input component, the audio frequency analyzer 60 instructs the masked signal generator 62, the switch 61, and the frequency discriminator 63 to output a signal having a frequency component that is not the audio input component. Is.

ここで、音声周波数分析器６０による周波数成分の変換方法は、時間軸信号を周波数成分に変更することができれば、特に限定されるものではないが、例えば、既存する高速フーリエ変換（以後ＦＦＴ）を適用して変換する方法を適用することができる。 Here, the method of converting the frequency component by the audio frequency analyzer 60 is not particularly limited as long as the time axis signal can be changed to the frequency component. For example, the existing fast Fourier transform (hereinafter referred to as FFT) is used. A method of applying and converting can be applied.

また、音声周波数分析器６０により比較的周波数成分のパワーが弱い部分を求める方法は、例えば、一旦ＦＦＴで求めたパワースペクトルを平滑化した包絡を計算し、その計算した包絡線の谷部分を、比較的周波数成分のパワーが弱い部分として求める。なお、音声周波数分析器６０が求める比較的周波数成分のパワーが弱い部分の数は、少なくとも１以上であればよいが、本実施形態では５箇所を求めるものとする。 In addition, the method for obtaining a portion where the power of the frequency component is relatively weak by the audio frequency analyzer 60 is, for example, calculating an envelope obtained by smoothing the power spectrum once obtained by FFT, and calculating the valley portion of the calculated envelope. It is determined as a portion where the power of the frequency component is relatively weak. Note that the number of portions with relatively weak power of the frequency component obtained by the audio frequency analyzer 60 may be at least one, but in this embodiment, five portions are obtained.

さらに、音声周波数分析器６０により音声入力成分でない周波数を求める方法は、例えば、パワースペクトル成分の谷の部分の周波数成分のパワーと、その谷の部分の周波数の半分に当たる周波数成分のパワーとの関係が、以下に示す条件１を満たしているか否かを判定する。 Further, a method for obtaining a frequency that is not an audio input component by the audio frequency analyzer 60 is, for example, the relationship between the power of the frequency component in the valley portion of the power spectrum component and the power of the frequency component corresponding to half the frequency of the valley portion. Determines whether or not the following condition 1 is satisfied.

条件１：半分に当たる周波数成分のパワー≧谷部分周波数のパワー＋δｄＢ。 Condition 1: Power of frequency component corresponding to half ≧ power of valley frequency + δ dB.

ここで、δは予め定めることができる値であり、本実施形態ではδ＝３０ｄＢとするが、この値は特に限定されない。 Here, δ is a value that can be determined in advance. In this embodiment, δ = 30 dB, but this value is not particularly limited.

例えば、音声周波数分析器６０は、パワースペクトル成分の谷の部分を５個求めると、それぞれの谷の部分の周波数成分のパワーと、それに対応する周波数が半分に当たる周波数成分のパワーとを比較して、周波数が半分に当たる周波数成分のパワーが、谷の部分の周波数成分のパワーより予め定めた値δより大きいから否かを、すべての谷の部分の周波数成分について判断する。 For example, when the voice frequency analyzer 60 obtains five valley portions of the power spectrum component, the audio frequency analyzer 60 compares the power of the frequency component of each valley portion with the power of the frequency component corresponding to half the frequency corresponding thereto. Whether or not the frequency component power corresponding to half of the frequency is larger than the power of the frequency component in the valley portion by a predetermined value δ is determined for the frequency components in all valley portions.

例えば、周波数２４００Ｈｚにおけるパワースペクトルが谷を示したとき、音声周波数分析器６０は、その周波数２４００Ｈｚの半分に当たる周波数１２００Ｈｚにおけるパワーを求める。 For example, when the power spectrum at a frequency of 2400 Hz indicates a valley, the audio frequency analyzer 60 obtains power at a frequency of 1200 Hz corresponding to half of the frequency of 2400 Hz.

そして、（１２００Ｈｚのパワー）≧（２４００Ｈｚのパワー）＋３０ｄＢの条件を満たしているか否かを判断し、この条件を満たしている「谷の部分」を複数個求めるようにする。 Then, it is determined whether or not the condition of (1200 Hz power) ≧ (2400 Hz power) +30 dB is satisfied, and a plurality of “valley portions” satisfying this condition are obtained.

なお、本実施形態では、谷の周波数と、その半分の周波数との比較を行なう場合を示すが、谷の周波数の半分の周波数でなく、１／Ｎ（Ｎは正の整数）の周波数としてもよい。 In the present embodiment, the case where the frequency of the valley is compared with the half of the frequency is shown. However, the frequency of 1 / N (N is a positive integer) may be used instead of the half of the frequency of the valley. Good.

次に、音声周波数分析器６０は、条件１を満たしている谷の部分を複数個求めると、その谷の部分の周波数成分のパワーと、半分に当たる周波数成分のパワーとの差が、最も大きいパワー差となる最大乖離谷周波数Ｆ１を１個求め、この最大乖離谷周波数Ｆ１（例えば、１２００Ｈｚ）を被マスク信号発生器６２及び周波数弁別器６３に出力する。 Next, when the voice frequency analyzer 60 obtains a plurality of valley portions satisfying the condition 1, the power having the largest difference between the power of the frequency component in the valley portion and the power of the frequency component corresponding to half of the valley portion. One maximum divergence valley frequency F1 as a difference is obtained, and this maximum divergence valley frequency F1 (for example, 1200 Hz) is output to the masked signal generator 62 and the frequency discriminator 63.

被マスク信号発生器６２は、音声周波数分析器６０から最大乖離谷周波数Ｆ１の入力を取り込むと、その最大乖離谷周波数Ｆ１を成分に有する時間軸信号を発生して、スイッチ６１に出力するものである。 When receiving the input of the maximum deviation valley frequency F1 from the audio frequency analyzer 60, the masked signal generator 62 generates a time axis signal having the maximum deviation valley frequency F1 as a component and outputs the time axis signal to the switch 61. is there.

スイッチ６１は、被マスク信号発生器６２からの出力信号を初期遅延判定部２２に出力するものである。 The switch 61 outputs the output signal from the masked signal generator 62 to the initial delay determination unit 22.

周波数弁別器６３は、音声周波数分析器６０から最大乖離谷周波数Ｆ１の入力を取り込み、その最大乖離谷周波数Ｆ１の周波数成分をもつ信号だけを初期遅延判定部２２に通すように周波数を弁別するものである。 The frequency discriminator 63 receives the input of the maximum deviation valley frequency F1 from the audio frequency analyzer 60, and discriminates the frequency so that only the signal having the frequency component of the maximum deviation valley frequency F1 is passed to the initial delay determination unit 22. It is.

ここで、周波数弁別器６３による弁別の方法は、例えば、一旦ＦＦＴで周波数成分に変換し、音声周波数分析器６０からの周波数成分以外をすべて０にしたあと、既存技術である逆高速フーリエ変換変換（以後ＩＦＦＴ）を施して時間波形に戻すなどすればよく、所望周波数の弁別ができればこの方法には限定しない。 Here, the discrimination method by the frequency discriminator 63 is, for example, once converted into frequency components by FFT, all other than the frequency components from the audio frequency analyzer 60 are set to 0, and then the inverse fast Fourier transform transform which is an existing technology. (Hereafter, IFFT) may be applied to return to the time waveform, and the method is not limited to this method as long as the desired frequency can be discriminated.

初期遅延判定部２２は、周波数弁別器６３とスイッチ６１を経由した被マスク信号発生器６２からの出力を用いて初期遅延量の推定を実施するものである。これにより、前述した実施例と同様の方法でエコー経路の初期遅延を求めるようにしたので、スピーカ側にいる装置の使用者にとってはマスキング効果によって、遅延推定信号が聞こえてしまう不快感がない。 The initial delay determination unit 22 estimates the initial delay amount using the output from the masked signal generator 62 via the frequency discriminator 63 and the switch 61. Thus, since the initial delay of the echo path is obtained by the same method as in the above-described embodiment, the user of the apparatus on the speaker side does not have the uncomfortable feeling that the delay estimation signal is heard due to the masking effect.

初期遅延付加部２０では、加算器２５の出力に対して初期遅延判定部２２から出力される遅延量を付加するものである。 The initial delay adding unit 20 adds the delay amount output from the initial delay determining unit 22 to the output of the adder 25.

以上のように、第６の実施形態によれば、マスキング効果を利用して、初期遅延付加部２０が付与する初期遅延量を推定する遅延推定信号を可聴域の信号としたことにより、周波数帯域を低周波や高周波などの非可聴領域の信号にしなくてもよくなり、安全性を向上させ、装置の設計コストを低減させることができる。 As described above, according to the sixth embodiment, the delay estimation signal for estimating the initial delay amount provided by the initial delay adding unit 20 using the masking effect is the signal in the audible range. Is not required to be a signal in a non-audible region such as a low frequency or a high frequency, thereby improving safety and reducing the design cost of the apparatus.

また、第６の実施形態では、音声周波数分析器６０を備えることで、入力音声信号のうち本来の入力音声成分のない周波数帯域がどこであるかを求めることができ、被マスク信号発生器６２に当該周波数成分の音声信号を発生させることができる。 Further, in the sixth embodiment, by providing the audio frequency analyzer 60, it is possible to determine where the frequency band having no original input audio component in the input audio signal is. An audio signal having the frequency component can be generated.

さらに、第６の実施形態によれば、被マスク信号発生器６２及び周波数弁別器６３を設けることで、入力音声成分のない周波数成分の信号だけを初期遅延判定部２２に与えるようにしたので、初期遅延判定部２２は、当該周波数成分の信号だけを用いて初期遅延を判定することができ、スピーカ側の使用者にマスキング効果による不快感のない遅延推定信号を出力できる。 Furthermore, according to the sixth embodiment, by providing the masked signal generator 62 and the frequency discriminator 63, only the signal of the frequency component without the input audio component is given to the initial delay determination unit 22, The initial delay determination unit 22 can determine the initial delay using only the signal of the frequency component, and can output a delay estimation signal free from discomfort due to the masking effect to the user on the speaker side.

また、第６の実施形態では、周波数弁別器６３を設け、エコー信号に関しても音声周波数分析器６０から出力された周波数成分の信号だけを初期遅延判定部２２に出力するようにしたので、周波数弁別器６３の出力信号である出力信号に関しても、エコー経路をとおった、ほぼ遅延推定信号成分のみで構成されていて、エコー推定に最も望ましい観測信号を入手でき、高周波などの非可聴領域の信号を用いず、加算器２５からの出力を初期遅延付加部２０を経由して適応フィルタ２１に入力するようにしたので、適応フィルタ２１に対して悪影響のある信号がはじめからないので、第２の音声帯域フィルタ１６等を用いて周波数帯域以外の制限処理をしなくても、初期遅延推定信号が混じったままで適応フィルタ２１のエコー経路推定に用いてよくなり、第５の実施例での音声入力ドライバー１５と適応フィルタ２１の間にある第２の帯域制限フィルタ１６等をなくすことができる。 In the sixth embodiment, since the frequency discriminator 63 is provided and only the signal of the frequency component output from the audio frequency analyzer 60 is output to the initial delay determination unit 22 with respect to the echo signal, the frequency discrimination is performed. The output signal that is the output signal of the device 63 is also composed of only the delay estimation signal component through the echo path, and the most desirable observation signal for echo estimation can be obtained. Since the output from the adder 25 is input to the adaptive filter 21 via the initial delay adding unit 20 without using the signal, there is no signal having an adverse effect on the adaptive filter 21. Even if the band filter 16 or the like is not used to perform a restriction process other than the frequency band, the initial delay estimation signal is mixed and used for the echo path estimation of the adaptive filter 21. Good will, can be eliminated, such as the second band-limiting filter 16 located between the adaptive filter 21 and the voice input driver 15 in the fifth embodiment.

その結果、装置構成を簡単にできるばかりでなく、第２の帯域制限フィルタ１６の帯域制限処理に伴う不可避的な処理遅延をなくすことができるので、遅延感のない、より自然な会話が可能になる。 As a result, not only the apparatus configuration can be simplified, but also the inevitable processing delay associated with the band limiting process of the second band limiting filter 16 can be eliminated, so that more natural conversation without a sense of delay is possible. Become.

さらに、高周波などの非可聴領域の信号を用いないので、信号を相当時間長時間にわたってスピーカから出力しても、装置使用者は会話の音声信号しか聞こえないので、聴感的に違和感なく、スピーカ近辺の事物に対しても、非可聴周波数、例えば高周波を聞かせることによる異常挙動を少なくすることができ、長時間通信を行なっても、安全で通話品質に優れた通信端末を実現化するエコーキャンセラを提供できるのである。 Furthermore, since a signal in a non-audible region such as a high frequency is not used, even if the signal is output from the speaker for a long time, the device user can only hear the voice signal of the conversation. Echo cancellers that can reduce the abnormal behavior caused by listening to non-audible frequencies, such as high frequencies, and realize a safe and excellent communication quality even for long-term communication Can be provided.

（Ｇ）他の実施形態
第４の実施形態では、初期遅延判定部４２が低周波発生器３の出力と加算器２５を経由した位相反転器の出力を比較して、位相反転のタイミングを計算するようにしたが、タイマ４０から直接位相反転のタイミング信号を入力し、入力周期をカウントして位相反転周期の判断を行なうようにしてもよい。 (G) Other Embodiments In the fourth embodiment, the initial delay determination unit 42 compares the output of the low frequency generator 3 and the output of the phase inverter via the adder 25 to calculate the phase inversion timing. However, the phase inversion timing signal may be directly input from the timer 40 and the input period may be counted to determine the phase inversion period.

第５の実施形態において、遅延推定のための信号発生器を高周波発生器としたが、第５の実施例のおもな効果は継時マスキングによって、遅延推定信号を話者に悟られることなくスピーカから出力してエコー経路の遅延を推定することであるので、維持マスキングの効果に重点をおき、可聴域の信号を発生する任意信号発生器を用いてもよい。 In the fifth embodiment, the signal generator for delay estimation is a high-frequency generator. The main effect of the fifth embodiment is that the delay estimation signal is not recognized by the speaker by successive masking. Since the delay of the echo path is estimated by outputting from the speaker, an arbitrary signal generator that generates an audible signal with emphasis on the effect of the maintenance masking may be used.

第６の実施形態において、周波数弁別器としてＦＦＴとＩＦＦＴを用いる例を示したが、フィルタバンク、すなわち複数の帯域通過フィルタを設けて、音声周波数分析器の出力に応じた通過帯域をもつ、フィルタバンクの中の１つのフィルタを選択して通過させるようにしてもよい。 In the sixth embodiment, an example in which FFT and IFFT are used as frequency discriminators has been described. However, a filter bank, that is, a filter having a plurality of band pass filters and having a pass band corresponding to the output of the audio frequency analyzer One filter in the bank may be selected and passed.

第１の実施形態の通話装置の内部構成を示すブロック図である。It is a block diagram which shows the internal structure of the communication apparatus of 1st Embodiment. 従来の通話装置の内部構成を示すブロック図である。It is a block diagram which shows the internal structure of the conventional telephone apparatus. 第２の実施形態の通話装置の内部構成を示すブロック図である。It is a block diagram which shows the internal structure of the communication apparatus of 2nd Embodiment. 第３の実施形態の通話装置の内部構成を示すブロック図である。It is a block diagram which shows the internal structure of the communication apparatus of 3rd Embodiment. 第４の実施形態の通話装置の内部構成を示すブロック図である。It is a block diagram which shows the internal structure of the communication apparatus of 4th Embodiment. 第５の実施形態の通話装置の内部構成を示すブロック図である。It is a block diagram which shows the internal structure of the communication apparatus of 5th Embodiment. 第６の実施形態の通話装置の内部構成を示すブロック図である。It is a block diagram which shows the internal structure of the communication apparatus of 6th Embodiment.

符号の説明Explanation of symbols

１…ネットワーク（インターネット）、２…復号器、３…低周波発生器、４…帯域制限フィルタ（ＢＰＦ１）、５…音声出力ドライバー、６…途切れ緩衝用蓄積バッファ、７…Ｄ／Ａ（ディジタル／アナログ変換器）、８…サウンドボード、９…スピーカ端子、１０…スピーカ、１１…マイク、１２…マイク端子、１３…Ａ／Ｄ（アナログ／ディジタル変換器）、１４…途切れ緩衝用蓄積バッファ、１５…音声入力ドライバー、１６…帯域制限フィルタ（ＢＰＦ２）、１７（１７−１、１７−２）…加算器、１８…符号化器、１９…回線インターフェース、２０…初期遅延付加部、２１（２１−１、２１−２）…適応フィルタ、２２…初期遅延判定部、２３（２３−１、２３−２）…スイッチ、２４…ネットワーク端末装置、２５（２５−１、２５−２）…加算器、２６…低域通過フィルタ（ＬＰＯ１）、２７…低域通過フィルタ（ＬＰＯ２）、２Ａ…高周波発生器、２Ｂ…高域通過フィルタ（ＨＰＦ１）、２Ｃ…高域通過フィルタ（ＨＰＦ２）、４０…タイマ、４１…位相反転器、４２…初期遅延判定部、５０…音声レベル検出器、５１…スイッチ、６０…音声周波数分析器、６１…スイッチ、６２…被マスク信号発生器、６３…周波数弁別器。
DESCRIPTION OF SYMBOLS 1 ... Network (Internet), 2 ... Decoder, 3 ... Low frequency generator, 4 ... Band-limiting filter (BPF1), 5 ... Audio | voice output driver, 6 ... Accumulation buffer storage buffer, 7 ... D / A (digital / (Analog converter), 8 ... sound board, 9 ... speaker terminal, 10 ... speaker, 11 ... microphone, 12 ... microphone terminal, 13 ... A / D (analog / digital converter), 14 ... storage buffer for interruption buffer, 15 ... voice input driver, 16 ... band limiting filter (BPF2), 17 (17-1, 17-2) ... adder, 18 ... encoder, 19 ... line interface, 20 ... initial delay adding unit, 21 (21- 1, 21-2) ... adaptive filter, 22 ... initial delay determination unit, 23 (23-1, 23-2) ... switch, 24 ... network terminal device, 25 (25-1) 25-2) ... Adder, 26 ... Low pass filter (LPO1), 27 ... Low pass filter (LPO2), 2A ... High frequency generator, 2B ... High pass filter (HPF1), 2C ... High pass filter (HPF2), 40 ... timer, 41 ... phase inverter, 42 ... initial delay determination unit, 50 ... audio level detector, 51 ... switch, 60 ... audio frequency analyzer, 61 ... switch, 62 ... masked signal generator 63: Frequency discriminator.

Claims

適応フィルタを有する擬似エコー生成手段が受信信号に基づいて生成した擬似エコー信号を、エコー除去手段が送信信号から減算してエコー成分を除去するエコーキャンセラにおいて、
非可聴な時間軸波形成分を有する非可聴信号を発生する非可聴信号発生手段と、
上記受信信号に上記非可聴信号を加算した出力信号を生成する出力信号生成手段と、
上記非可聴信号に対して所定のフィルタ処理を施して第１の遅延推定信号を生成する第１の遅延推定信号生成手段と、
上記送信信号に対して所定のフィルタ処理を施して第２の遅延推定信号を生成する第２の遅延推定信号生成手段と、
上記第１の遅延推定信号と上記第２の遅延推定信号との相互相関に基づいて、エコー経路による可変遅延特性を反映させた遅延時間情報を求める遅延時間情報算出手段と、
上記遅延時間情報算出手段が算出した上記遅延時間情報に基づいて上記適応フィルタへの受信信号の入力を遅延させる遅延付加手段と
を備えることを特徴とするエコーキャンセラ。 In an echo canceller in which a pseudo echo signal generated based on a received signal by a pseudo echo generating means having an adaptive filter is subtracted from a transmission signal by an echo removing means to remove an echo component,
A non-audible signal generating means for generating a non-audible signal having a non-audible time axis waveform component;
Output signal generating means for generating an output signal obtained by adding the inaudible signal to the received signal;
First delay estimation signal generation means for generating a first delay estimation signal by applying a predetermined filtering process to the inaudible signal;
Second delay estimation signal generation means for generating a second delay estimation signal by applying a predetermined filtering process to the transmission signal;
Delay time information calculating means for obtaining delay time information reflecting variable delay characteristics by an echo path based on a cross-correlation between the first delay estimation signal and the second delay estimation signal;
An echo canceller comprising: delay addition means for delaying input of a received signal to the adaptive filter based on the delay time information calculated by the delay time information calculation means.

上記非可聴信号発生手段が、非可聴な低周波信号を発生する低周波信号発生手段であることを特徴とする請求項１に記載のエコーキャンセラ。 2. The echo canceller according to claim 1, wherein the inaudible signal generating means is a low frequency signal generating means for generating an inaudible low frequency signal.

上記第１の遅延推定信号生成手段及び上記第２の遅延推定信号生成手段が、上記非可聴信号及び上記送信信号に対して所定の低域通過フィルタ処理を施すものであることを特徴とする請求項１又は２に記載のエコーキャンセラ。 The first delay estimation signal generation means and the second delay estimation signal generation means perform predetermined low-pass filter processing on the inaudible signal and the transmission signal. Item 3. The echo canceller according to Item 1 or 2.

上記非可聴信号発生手段が、非可聴な高周波信号を発生する高周波信号発生手段であることを特徴とする請求項１に記載のエコーキャンセラ。 The echo canceller according to claim 1, wherein the non-audible signal generating means is a high-frequency signal generating means for generating a non-audible high-frequency signal.

上記第１の遅延推定信号生成手段及び上記第２の遅延推定信号生成手段が、上記非可聴信号及び上記送信信号に対して所定の高域通過フィルタ処理を施すものであることを特徴とする請求項１又は４に記載のエコーキャンセラ。 The first delay estimation signal generation means and the second delay estimation signal generation means perform predetermined high-pass filter processing on the inaudible signal and the transmission signal. Item 5. The echo canceller according to Item 1 or 4.

上記非可聴信号発生手段が、２以上の非可聴信号を発生するものであり、
上記第１の遅延推定信号生成手段及び上記第２の遅延推定信号生成手段が、複数のチャンネルを有する受信信号の各チャンネルの信号に対してそれぞれ異なる非可聴信号を割り当てて、第１の遅延推定信号及び第２の遅延推定信号を生成する
ことを特徴とする請求項１に記載のエコーキャンセラ。 The non-audible signal generating means generates two or more non-audible signals;
The first delay estimation signal generation means and the second delay estimation signal generation means allocate different inaudible signals to the signals of the respective channels of the reception signal having a plurality of channels, and perform the first delay estimation. The echo canceller according to claim 1, wherein the signal and the second delay estimation signal are generated.

上記非可聴信号発生手段から出力される上記非可聴信号の位相を反転し、その位相反転した非可聴信号を、上記出力信号生成手段が上記受信信号に加算させる信号とする位相反転手段を備え、
上記遅延時間情報算出手段が、上記非可聴信号と、位相反転した非可聴信号が加算された第１の遅延推定信号及び第２の遅延推定信号との位相の一致を判定し、その判定結果に応じた遅延時間情報を出力する
ことを特徴とする請求項１〜６のいずれかに記載のエコーキャンセラ。 Inverting the phase of the non-audible signal output from the non-audible signal generating means, and comprising a phase inverting means for causing the output signal generating means to add the phase of the non-audible signal to the received signal.
The delay time information calculating means determines the coincidence of the phase between the non-audible signal and the first delay estimation signal and the second delay estimation signal to which the phase-inverted non-audible signal is added. The corresponding delay time information is output. The echo canceller according to any one of claims 1 to 6.

上記受信信号の音声レベルを監視するものであり、経時マスキング効果を利用するため、上記受信信号の音声レベルが所定レベルに達したときに、上記非可聴信号発生手段からの上記非可聴信号の出力を遅延推定信号として出力させる音声レベル検出手段を備えることを特徴とする請求項１〜７のいずれかに記載のエコーキャンセラ。 Since the sound level of the received signal is monitored and the temporal masking effect is used, when the sound level of the received signal reaches a predetermined level, the non-audible signal is output from the non-audible signal generating means. The echo canceller according to any one of claims 1 to 7, further comprising: a voice level detection unit that outputs a signal as a delay estimation signal.

周波数的マスキング効果を利用するため、上記受信信号の周波数成分を分析するものであり、上記受信信号の音声成分のうち、比較的パワーの小さい周波数成分の周波数を特定する周波数分析手段を備え、
上記非可聴信号発生手段が、上記周波数分析手段が特定した周波数の周波数信号を発生するものであり、
上記第２の遅延推定信号生成手段が、上記送信信号のうち上記周波数分析手段が特定した周波数の周波数成分信号を弁別すると共に、その弁別した周波数成分信号を上記遅延時間情報算出手段に与えるものであり、
上記遅延時間情報算出手段が、上記非可聴信号発生手段が出力した上記周波数信号と、上記第２の遅延推定信号生成手段が弁別した周波数成分信号とに基づいて、遅延時間情報を算出する
ことを特徴とする請求項１〜８のいずれかに記載のエコーキャンセラ。
In order to utilize the frequency masking effect, the frequency component of the received signal is analyzed, and the frequency component of the frequency component of relatively low power among the audio components of the received signal is specified,
The non-audible signal generating means generates a frequency signal having a frequency specified by the frequency analyzing means;
The second delay estimation signal generation means discriminates a frequency component signal having a frequency specified by the frequency analysis means from the transmission signal and supplies the discriminated frequency component signal to the delay time information calculation means. Yes,
The delay time information calculating means calculates delay time information based on the frequency signal output from the non-audible signal generating means and the frequency component signal discriminated by the second delay estimation signal generating means. The echo canceller according to any one of claims 1 to 8.