JPWO2017134798A1

JPWO2017134798A1 - Voice communication device

Info

Publication number: JPWO2017134798A1
Application number: JP2016544644A
Authority: JP
Inventors: 茂明鈴木; 訓古田; 智治粟野
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2016-02-04
Filing date: 2016-02-04
Publication date: 2018-02-08
Anticipated expiration: 2036-02-04
Also published as: JP6011751B1; CN108495803A; WO2017134798A1; CN108495803B

Abstract

この発明に係る音声通話装置は、エレベータ用の音声通話装置であり、信号のエコーの経路に依存するパラメータを保持するパラメータ保持部と、前記パラメータを用いて音声通話時の音声信号を処理する信号処理部と、自装置が設置されたエレベータ内の環境に応じて、前記音声通話に先立って前記パラメータの学習を行うことが可能か否かを判定する学習契機判定部と、前記学習契機判定部で学習を行うことが可能と判定されたとき、前記パラメータの学習を行う学習信号を発生する学習信号発生部と、を備えたことを特徴とする。この構成によって、ユーザに違和感を与えることなく適切なタイミングで自動的に初期学習を行うことができる。A voice call device according to the present invention is a voice call device for an elevator, a parameter holding unit for holding a parameter depending on a signal echo path, and a signal for processing a voice signal at the time of a voice call using the parameter A processing unit, a learning opportunity determination unit that determines whether or not the parameter can be learned prior to the voice call according to an environment in an elevator in which the device is installed, and the learning opportunity determination unit And a learning signal generator for generating a learning signal for learning the parameter when it is determined that learning is possible. With this configuration, initial learning can be automatically performed at an appropriate timing without causing the user to feel uncomfortable.

Description

この発明は、音声通話を行う際に用いられる音声通話装置に関する。 The present invention relates to a voice call device used when making a voice call.

スピーカホン電話などのハンズフリー通話が可能な音声通話装置においては、ハウリングやエコーを防止するためのエコーキャンセラが用いられている。エコーが発生する条件は、音声通話装置が設定される環境に依存し、一般に狭くて硬い壁に囲まれた部屋は残響が大きいため大きなエコーが発生し、逆に広い部屋であればエコーが比較的小さい場合もある。エコーキャンセラは、エコー経路の特性（インパルス応答など）を逐次学習する適応フィルタを備えており、適応フィルタは環境によって異なるエコー経路の特性を逐次学習してエコーを消去する。但し、設置環境が大きく異なる場合などに、この逐次学習が困難となる、あるいは学習に時間がかかるといった問題が生じるため、通話前に初期学習を行って予め学習したパラメータを与えておく技術が開示されている（例えば、特許文献１）。 2. Description of the Related Art An echo canceller for preventing howling and echo is used in a voice call device capable of hands-free calling such as a speakerphone phone. The conditions for echo generation depend on the environment in which the voice communication device is set. Generally, a room surrounded by a narrow and hard wall has a large reverberation, so a large echo is generated. Sometimes small. The echo canceller includes an adaptive filter that sequentially learns the characteristics (e.g., impulse response) of the echo path, and the adaptive filter sequentially learns the characteristics of the echo path that varies depending on the environment and cancels the echo. However, there is a problem that this sequential learning becomes difficult or takes a long time to learn when the installation environment is greatly different, etc. Therefore, a technique is disclosed in which initial learning is performed and a pre-learned parameter is given before a call. (For example, Patent Document 1).

特許文献１に開示される技術では、初期学習を行う学習モードにおいて、エコーキャンセラの校正音を発生して音声通話装置のスピーカに出力し、このときマイクロフォンに入力される信号を用いて初期学習を行う。この初期学習を行うタイミングについては、電源投入時に自動的に初期学習を行うか、または音声通話装置に初期学習スイッチを備えてユーザがそのスイッチを用いて指定したタイミングで初期学習を行う方法が開示されている。 In the technique disclosed in Patent Document 1, in the learning mode in which the initial learning is performed, the calibration sound of the echo canceller is generated and output to the speaker of the voice communication device. At this time, the initial learning is performed using the signal input to the microphone. Do. Regarding the timing for performing this initial learning, a method is disclosed in which initial learning is automatically performed when the power is turned on, or the initial learning switch is provided in the voice communication device and the initial learning is performed at the timing specified by the user using the switch. Has been.

特開２００５−３２３３０８号公報．JP-A-2005-323308.

エレベータの非常通報に用いられる音声通話装置は、エレベータ内に設置されたインターホンのマイクとスピーカによりハンズフリー通話を行う装置であり、エコーキャンセラが必要となる。このマイクとスピーカ特性のばらつきや、エレベータの大きさによって、エコー環境は大きく異なるため、従来技術で示される初期学習の導入は有効である。しかし、エレベータ非常通報用の音声通話装置は、エレベータ内での閉じ込めなどの事態に遭遇したエレベータ搭乗者が用いることになるため、閉じ込められた状況でユーザが初期学習の開始を指定することには困難が伴う。従って、従来技術をそのまま用いると、初期学習のタイミングとして適しているとは言えないという問題がある。 A voice communication device used for an emergency call of an elevator is a device that performs a hands-free call using a microphone and a speaker of an interphone installed in the elevator, and an echo canceller is required. Since the echo environment varies greatly depending on the variation in the microphone and speaker characteristics and the size of the elevator, the introduction of the initial learning shown in the prior art is effective. However, since an elevator passenger who encounters a situation such as confinement in an elevator is used for a voice call device for an emergency call of an elevator, the user may specify the start of initial learning in a confined situation. There are difficulties. Therefore, if the conventional technique is used as it is, there is a problem that it cannot be said that it is suitable as the timing of the initial learning.

この発明は上記のような問題点を解決するためになされたもので、ユーザに違和感を与えることなく適切な時間帯で自動的に初期学習を行うことが可能な音声通話装置を得ることを目的とする。 The present invention has been made to solve the above problems, and an object of the present invention is to provide a voice communication device that can automatically perform initial learning in an appropriate time zone without giving a sense of incongruity to a user. And

この発明に係る音声通話装置は、エレベータ用の音声通話装置であり、信号のエコーの経路に依存するパラメータを保持するパラメータ保持部と、前記パラメータを用いて音声通話時の音声信号を処理する信号処理部と、自装置が設置されたエレベータ内の環境に応じて、前記音声通話に先立って前記パラメータの学習を行うことが可能か否かを判定する学習契機判定部と、前記学習契機判定部で学習を行うことが可能と判定されたとき、前記パラメータの学習を行う学習信号を発生する学習信号発生部と、を備えたことを特徴とする。 A voice call device according to the present invention is a voice call device for an elevator, a parameter holding unit for holding a parameter depending on a signal echo path, and a signal for processing a voice signal at the time of a voice call using the parameter A processing unit, a learning opportunity determination unit that determines whether or not the parameter can be learned prior to the voice call according to an environment in an elevator in which the device is installed, and the learning opportunity determination unit And a learning signal generator for generating a learning signal for learning the parameter when it is determined that learning is possible.

この発明によれば、ユーザに違和感を与えることなく適切な時間帯で自動的に初期学習を行うことができる。 According to the present invention, it is possible to automatically perform initial learning in an appropriate time zone without giving the user a sense of incongruity.

実施の形態１に係る音声通話装置が用いられるエレベータ非常通報用システムの構成図。1 is a configuration diagram of an elevator emergency call system in which a voice communication device according to Embodiment 1 is used. 実施の形態１に係る音声通話装置４の内部構成。The internal structure of the voice call device 4 according to the first embodiment. 実施の形態１に係る音声通話装置４のハードウェア構成を示す図。2 is a diagram showing a hardware configuration of a voice call device 4 according to Embodiment 1. FIG. 実施の形態１に係る音声通話装置４の図３と異なるハードウェア構成を示す図。The figure which shows the hardware constitutions different from FIG. 3 of the voice call apparatus 4 which concerns on Embodiment 1. FIG. 実施の形態１に係る音声通話装置４の動作を示すフローチャート。4 is a flowchart showing the operation of the voice call device 4 according to the first embodiment. 実施の形態１に係る学習契機判定部４１の初期学習時間の判定フロー。6 is a flow for determining an initial learning time of the learning opportunity determination unit 41 according to the first embodiment. 実施の形態１に係る信号処理部４５の内部構成を示す図。FIG. 3 is a diagram showing an internal configuration of a signal processing unit 45 according to the first embodiment. 実施の形態２に係る学習契機判定部４１の初期学習時間の判定フロー。The determination flow of the initial learning time of the learning opportunity determination part 41 which concerns on Embodiment 2. FIG. 実施の形態３に係る学習契機判定部４１の初期学習時間の判定フロー。The determination flow of the initial learning time of the learning opportunity determination part 41 which concerns on Embodiment 3. FIG. 実施の形態５に係る音声通話装置４の動作を示すフローチャート。10 is a flowchart showing the operation of the voice call device 4 according to the fifth embodiment. 実施の形態６による信号処理部４５の内部構成図。FIG. 10 is an internal configuration diagram of a signal processing unit 45 according to a sixth embodiment. 実施の形態６による信号処理部４５の内部構成図。FIG. 10 is an internal configuration diagram of a signal processing unit 45 according to a sixth embodiment.

実施の形態１．
この発明の実施の形態１に係る音声通話装置について説明する。図１は、この発明による音声通話装置が用いられるエレベータ非常通報用システムの構成図である。図において、１はエレベータ、２はエレベータ１内に設置されるインターホン、３はエレベータ１の動作を制御するエレベータ運転制御部、４は音声通話装置、５は通信ネットワーク、６はエレベータ１の通話先となる監視センター、７は監視センター６内の電話端末である。Embodiment 1 FIG.
A voice communication apparatus according to Embodiment 1 of the present invention will be described. FIG. 1 is a block diagram of an elevator emergency call system in which a voice communication device according to the present invention is used. In the figure, 1 is an elevator, 2 is an intercom installed in the elevator 1, 3 is an elevator operation control unit that controls the operation of the elevator 1, 4 is a voice communication device, 5 is a communication network, and 6 is a destination of the elevator 1. The monitoring center 7 is a telephone terminal in the monitoring center 6.

このシステムは、例えば、エレベータ１が故障してエレベータ１内に搭乗者が閉じ込められた場合、監視センター６のオペレータを呼出して通話する場合に用いられる。このとき、インターホン２は、エレベータ１に閉じ込められた人の音声入出力に用いられ、音声通話装置４はインターホン２のアナログ音声と通信ネットワーク５上を伝送するデジタル音声との相互変換、エレベータ１内で発生するエコーの抑圧を行う。通信ネットワーク５は音声通話装置４と監視センター６との間の音声データを伝送する。電話端末７は監視センター６内のオペレータが通話に用いる。なお、エレベータ運転制御部３はエレベータ１内に設置され、エレベータ１の運転、すなわち昇降やドアの開閉の制御を行い、上記音声通話には直接関係しないが、本実施形態においては、音声通話装置４が初期学習の時間を判断するための情報を提供する。 This system is used, for example, when a call is made by calling an operator of the monitoring center 6 when the elevator 1 fails and a passenger is confined in the elevator 1. At this time, the interphone 2 is used for voice input / output of a person confined in the elevator 1, and the voice communication device 4 performs interconversion between the analog voice of the interphone 2 and the digital voice transmitted on the communication network 5, in the elevator 1. Suppresses echoes generated in. The communication network 5 transmits voice data between the voice communication device 4 and the monitoring center 6. The telephone terminal 7 is used for a telephone call by an operator in the monitoring center 6. The elevator operation control unit 3 is installed in the elevator 1 and controls the operation of the elevator 1, that is, controls the raising / lowering and opening / closing of the door, and is not directly related to the voice call, but in this embodiment, the voice call device 4 provides information for determining the time of initial learning.

図２には、音声通話装置４の内部構成を、同装置に接続されるインターホン２、エレベータ運転制御部３と共に示す。図において、４１は初期学習の契機を判定する学習契機判定部、４２は通話の開始・終了を制御する通話制御部、４３は学習用の信号を発生する学習信号発生部、４４は通信回線からの入力と学習信号との何れかを選択するスイッチ、４５はインターホン２のスピーカ２１から出力された音声がインターホン２のマイクに回り込んだ信号であるエコーを抑圧するエコーキャンセラ機能を有する信号処理部、４６はデジタル信号をアナログ信号に変換するＤ／Ａ変換器、４７はアナログ信号をデジタル信号に変換するＡ／Ｄ（Ａｎａｌｏｇ／Ｄｉｇｉｔａｌ）変換器、４８は初期学習によって得られたパラメータを保持するパラメータ保持部、４９は通信回線とデータの送受信を行う通信回線インターフェース、２１はインターホン２内部のスピーカ、２２はインターホン２内部のマイクロフォン（以下、マイクと呼ぶ）、２３はインターホン２による非常通話開始に用いられる非常通話ボタンである。 FIG. 2 shows the internal configuration of the voice communication device 4 together with the interphone 2 and the elevator operation control unit 3 connected to the device. In the figure, 41 is a learning trigger determination unit that determines the trigger of initial learning, 42 is a call control unit that controls the start / end of a call, 43 is a learning signal generation unit that generates a learning signal, and 44 is from a communication line. 45 is a signal processing unit having an echo canceller function for suppressing an echo that is a signal obtained by sneaking the sound output from the speaker 21 of the interphone 2 into the microphone of the interphone 2. , 46 is a D / A converter that converts a digital signal into an analog signal, 47 is an A / D (Analog / Digital) converter that converts an analog signal into a digital signal, and 48 holds parameters obtained by initial learning. The parameter holding unit 49 is a communication line interface for transmitting and receiving data to and from the communication line, and 21 is a switch inside the intercom 2. Over Ca, 22 intercom 2 internal microphones (hereinafter, referred to as microphone), 23 is very call button to be used in very call initiation by intercom 2.

図３は、この発明の実施の形態１における音声通話装置４のハードウェア構成を示す図である。学習契機判定部４１、通話制御部４２、学習信号発生部４３、スイッチ４４、信号処理部４５、パラメータ保持部４８は、メモリ４０２に記憶されたプログラムを実行するプロセッサ４０１によって実現される。なお、これは一例であって、これ以外の専用処理回路などを用いたハードウェア構成であっても構わない。 FIG. 3 is a diagram showing a hardware configuration of the voice call device 4 according to Embodiment 1 of the present invention. The learning opportunity determination unit 41, the call control unit 42, the learning signal generation unit 43, the switch 44, the signal processing unit 45, and the parameter holding unit 48 are realized by a processor 401 that executes a program stored in the memory 402. This is merely an example, and a hardware configuration using a dedicated processing circuit other than this may be used.

Ｄ／Ａ変換器４６、Ａ／Ｄ変換器４７は、Ａ／Ｄ、Ｄ／Ａ変換ＬＳＩ（ＬａｒｇｅＳｃａｌｅＩｎｔｅｇｒａｔｉｏｎ）４０３により実現される。なお、これは一例であって、例えばプロセッサ４０１が、Ａ／Ｄ、Ｄ／Ａ変換ＬＳＩ４０３と統合されたシステムＬＳＩ等であっても良い。 The D / A converter 46 and the A / D converter 47 are realized by an A / D, D / A conversion LSI (Large Scale Integration) 403. This is an example, and for example, the processor 401 may be a system LSI integrated with an A / D or D / A conversion LSI 403 or the like.

スイッチ開閉検出ＬＳＩ４０４は、非常通話ボタン２３のボタン押下状態を、プロセッサ４０１の特定ポートから参照できるように電気変換する。なお、これは一例であって、例えばプロセッサ４０１がスイッチ開閉検出ＬＳＩ４０４と統合されたシステムＬＳＩ等であっても良い。 The switch open / close detection LSI 404 performs electrical conversion so that the button pressing state of the emergency call button 23 can be referred to from a specific port of the processor 401. This is an example, and for example, a system LSI in which the processor 401 is integrated with the switch open / close detection LSI 404 may be used.

エレベータ運転制御部３は、学習契機判定部４１に出力する各種情報をネットワークフレームに乗せて出力し、ネットワークインターフェースＡ４０５がこれを受信する。また、ネットワークフレームに乗せられた通信回線側の入出力データは、ネットワークインターフェースＢ４０６が送受信する。すなわち、ネットワークインターフェースＢ４０６は図２における通信回線インターフェース４９に相当する。なお、これは一例であって、例えば、１つのネットワークインターフェースが、エレベータ運転制御部３からの情報の受信と、通信回線側の入出力データの送受信を行う構成でも良い。 The elevator operation control unit 3 outputs various information output to the learning opportunity determination unit 41 on a network frame, and the network interface A405 receives the information. The network interface B 406 transmits / receives the input / output data on the communication line side carried in the network frame. That is, the network interface B 406 corresponds to the communication line interface 49 in FIG. This is only an example, and for example, one network interface may receive information from the elevator operation control unit 3 and transmit / receive input / output data on the communication line side.

更に、図４に示す構成として、プロセッサ４０１が実現する処理を複数のプロセッサで実現しても良く、この図においては、信号処理部４５、学習信号発生部４３などの信号処理をデジタル信号処理プロセッサ４０７が実現する。 Furthermore, as shown in FIG. 4, the processing realized by the processor 401 may be realized by a plurality of processors. In this figure, the signal processing of the signal processing unit 45, the learning signal generation unit 43, etc. 407 is realized.

図５は音声通話装置４の動作を示すフローチャートであり、以下、図５を用いて音声通話装置４の動作を説明する。 FIG. 5 is a flowchart showing the operation of the voice call device 4. Hereinafter, the operation of the voice call device 4 will be described with reference to FIG.

まず、搭乗者がエレベータ１に閉じ込められる状況などが発生して通話が行われる場合について説明する。通話制御部４２は非常通話ボタン２３が押下されたかどうかを監視しており（ＳＴ１）、エレベータ１内に閉じ込められた搭乗者が非常通話ボタン２３を押下すると、これを契機に通話制御部４２が通信回線インターフェース４９を介して監視センター６と制御信号を送受信し、監視センター６を呼出して通信を確立する（ＳＴ２）。この通信確立は、例えば、ＩＥＴＦ（ＩｎｔｅｒｎｅｔＥｎｇｉｎｅｅｒｉｎｇＴａｓｋＦｏｒｃｅ）のＲＦＣ（ＲｅｑｕｅｓｔｆｏｒＣｏｍｍｅｎｔｓ）３２６１で規定されるＳＩＰ（ＳｅｓｓｉｏｎＩｎｉｔｉａｔｉｏｎＰｒｏｔｏｃｏｌ）によって行われる。また、通話制御部４２はその通信状態、すなわち、センター呼出し中か、通話中か、その何れでもないアイドル状態であるかを、学習契機判定部４１と信号処理部４５に出力する。監視センター６との通信が確立すると、信号処理部４５はパラメータ保持部４８に格納されたパラメータを入力し（ＳＴ３）、通話が始まる。通話中、スイッチ４４は通信回線インターフェース４９からの入力信号を選択して信号処理部４５に出力するため、通信回線インターフェース４９からの入力信号は信号処理部４５を経由してＤ／Ａ変換器４６に出力される（ＳＴ４）。そして、Ｄ／Ａ変換器４６でアナログ信号に変換されてインターホン２内のスピーカ２１から出力される。また、インターホン２内のマイク２２からのエレベータ１内登場者の音声は、Ａ／Ｄ変換器４７でデジタル信号に変換された後、信号処理部４５によってエコーが消去され、通信回線インターフェース４９を経て通信回線に出力される（ＳＴ５）。そして、通話が終了するまでこの動作を継続する（ＳＴ６）。 First, a case will be described in which a call is made when a passenger is trapped in the elevator 1 or the like. The call control unit 42 monitors whether or not the emergency call button 23 has been pressed (ST1), and when a passenger confined in the elevator 1 presses the emergency call button 23, the call control unit 42 is triggered by this. Control signals are transmitted to and received from the monitoring center 6 via the communication line interface 49, and the monitoring center 6 is called to establish communication (ST2). This communication establishment is performed by, for example, SIP (Session Initiation Protocol) defined by RFC (Request for Comments) 3261 of the Internet Engineering Task Force (IETF). Further, the call control unit 42 outputs to the learning opportunity determination unit 41 and the signal processing unit 45 whether the communication state, that is, the center call, the call, or any idle state. When communication with the monitoring center 6 is established, the signal processing unit 45 inputs the parameters stored in the parameter holding unit 48 (ST3), and a call is started. During a call, the switch 44 selects an input signal from the communication line interface 49 and outputs it to the signal processing unit 45, so that the input signal from the communication line interface 49 passes through the signal processing unit 45 to the D / A converter 46. (ST4). Then, it is converted into an analog signal by the D / A converter 46 and output from the speaker 21 in the intercom 2. In addition, the voice of the person in the elevator 1 from the microphone 22 in the interphone 2 is converted into a digital signal by the A / D converter 47, and then the echo is eliminated by the signal processing unit 45, via the communication line interface 49. It is output to the communication line (ST5). This operation is continued until the call is finished (ST6).

なお、スイッチ４４は、学習契機判定部４１で行われる初期学習が可能か否かの判定結果が初期学習が可能でない場合、上述の通りに選択する。この学習契機判定部４１で行われる初期学習が可能か否かの判定動作については後述する。 Note that the switch 44 selects as described above when the determination result of whether or not the initial learning performed by the learning opportunity determination unit 41 is possible is not possible. The operation for determining whether or not the initial learning performed by the learning opportunity determination unit 41 is possible will be described later.

続いて、通話制御部４２が非常通話ボタン２３を押下されていないと判定する環境において（ＳＴ１）、学習契機判定部４１が初期学習可能と判定した場合の動作を説明する。学習契機判定部４１が初期学習が可能な時間（初期学習時間）と判定すると（ＳＴ７）、学習処理が開始される。学習信号発生部４３はパラメータ初期学習用の信号として白色雑音を出力する（ＳＴ８）。スイッチ４４は、通信回線インターフェース４９ではなく、学習信号発生部４３からの入力を選択して信号処理部４５に出力するので、学習信号発生部４３の出力信号は、信号処理部４５を経てＤ／Ａ変換器４６に出力される（ＳＴ９）。そして、Ｄ／Ａ変換器４６でアナログ信号に変換されてインターホン２内のスピーカ２１から出力される。インターホン２内のマイク２２からのエレベータ１内登場者の音声は、Ａ／Ｄ変換器４７でデジタル信号に変換された後、信号処理部４５によってエコーが消去される（ＳＴ１０）。そして、初期学習時間が終了するまでこの動作を継続し（ＳＴ１１）、初期学習時間が終了すると、エコー消去動作に伴って信号処理部４５が学習したパラメータをパラメータ保持部４８に格納する（ＳＴ１２）。 Next, an operation when the learning opportunity determination unit 41 determines that initial learning is possible in an environment where the call control unit 42 determines that the emergency call button 23 has not been pressed (ST1) will be described. When the learning opportunity determination unit 41 determines that the initial learning is possible (initial learning time) (ST7), the learning process is started. The learning signal generator 43 outputs white noise as a signal for initial parameter learning (ST8). Since the switch 44 selects the input from the learning signal generation unit 43 instead of the communication line interface 49 and outputs it to the signal processing unit 45, the output signal of the learning signal generation unit 43 passes through the signal processing unit 45 to D / The data is output to the A converter 46 (ST9). Then, it is converted into an analog signal by the D / A converter 46 and output from the speaker 21 in the intercom 2. The voice of the person in the elevator 1 from the microphone 22 in the interphone 2 is converted into a digital signal by the A / D converter 47, and then the echo is eliminated by the signal processing unit 45 (ST10). Then, this operation is continued until the initial learning time ends (ST11). When the initial learning time ends, the parameters learned by the signal processing unit 45 along with the echo cancellation operation are stored in the parameter holding unit 48 (ST12). .

なお、パラメータ初期学習用の信号は、信号処理部４５によってエコーを消去できる信号であれば、白色雑音以外の信号であっても構わない。 Note that the parameter initial learning signal may be a signal other than white noise as long as the signal processing unit 45 can cancel the echo.

学習契機判定部４１は、初期学習が可能か否かの判定結果をスイッチ４４、信号処理部４５、学習信号発生部４３に出力する。以下、図５のフローにおいてＳＴ７に示した判定、すなわち学習契機判定部４１が初期学習が可能か否かを判定する動作について説明する。 The learning opportunity determination unit 41 outputs a determination result as to whether or not initial learning is possible to the switch 44, the signal processing unit 45, and the learning signal generation unit 43. In the following, the determination shown in ST7 in the flow of FIG. 5, that is, the operation for determining whether or not the learning opportunity determination unit 41 can perform initial learning will be described.

この判定は、エレベータ運転制御部３からの情報とＡ／Ｄ変換器４７からの信号を基に行われる。まず、エレベータ運転制御部３は、エレベータ１の昇降、ドアの開閉制御を行うものだが、そのため、エレベータ１が停止中か動作（上昇または下降）中かどうかを示すエレベータ動作・停止情報、ドアが開いているか閉じているかを示すドア開閉情報、エレベータ１内の行先ボタンやドア開閉ボタンを押下したかどうかを示すボタン押下情報を持っており、これらの情報を学習契機判定部４１に出力する。信号処理部４５の初期学習は、エレベータ１が停止、かつドアが閉じた状態で、エレベータ１内に搭乗者がおらず、騒音が少ない状態で行うことが望ましい。このような条件を満たす時間帯を判定するため、学習契機判定部４１は、エレベータ運転制御部３からの各情報、通話制御部４２からの情報、及び、Ａ／Ｄ変換器４７の出力信号を用いる。 This determination is made based on information from the elevator operation control unit 3 and a signal from the A / D converter 47. First, the elevator operation control unit 3 controls the raising / lowering of the elevator 1 and the opening / closing control of the door. Therefore, the elevator operation / stop information indicating whether the elevator 1 is stopped or operating (ascending or descending), It has door opening / closing information indicating whether it is open or closed, and button pressing information indicating whether a destination button or door opening / closing button in the elevator 1 has been pressed, and these information is output to the learning opportunity determination unit 41. The initial learning of the signal processing unit 45 is desirably performed in a state where the elevator 1 is stopped and the door is closed, there is no passenger in the elevator 1 and noise is low. In order to determine a time zone that satisfies such conditions, the learning opportunity determination unit 41 receives each information from the elevator operation control unit 3, information from the call control unit 42, and an output signal of the A / D converter 47. Use.

なお、従来の音声通話装置では、電源投入は通話装置の設置時に行われるが、運用前の状態であるため、例えばエレベータのドアが開いているなど通話時のエコー環境と異なる状態である可能性がある。これに対して、本実施の形態において、ドアが閉じた状態で、エレベータ１内に搭乗者がおらず、騒音が少ない状態で初期学習を行う場合には、通話時のエコー環境に近い環境で初期学習を行うことができる。 In the conventional voice communication device, the power is turned on at the time of installation of the communication device, but since it is in a state before operation, there is a possibility that it is in a state different from the echo environment at the time of the call, for example, the elevator door is open. There is. On the other hand, in this embodiment, when initial learning is performed in a state where the door is closed, there are no passengers in the elevator 1 and noise is low, the environment is close to the echo environment at the time of a call. Initial learning can be performed.

図６は、学習契機判定部４１での初期学習が可能か否かを判定する判定フローの一例である。以下、このフローを参照して説明する。初めは初期学習時間でないという判定状態からスタートする（ＳＴ１３）。そして、エレベータ運転制御部３からのエレベータ動作・停止情報によりエレベータ１が停止中であり（ＳＴ１４）、かつ、エレベータ運転制御部３からのドア開閉情報によりドアが閉じており（ＳＴ１５）、かつ、エレベータ運転制御部３からのボタン押下情報よりエレベータ１内のボタンが最後に押下されてから所定の時間以上が経過しており（ＳＴ１６）、かつ、通話制御部４２からの情報により通話のための呼出し中でも通話中でもなく（ＳＴ１７）、かつ、Ａ／Ｄ変換器４７の出力信号レベルが所定値以下（これはエレベータ１内の騒音が少ないことを意味する）である（ＳＴ１８）、という全ての条件が満たされると初期学習が可能な時間（初期学習時間）と判定する（ＳＴ１９）。そして、初期学習時間と判定したタイミングから一定時間、例えば１０秒間を初期学習の時間と判定し（ＳＴ２０）、この時間が経過すると初期学習時間でないという判定状態に戻る（ＳＴ１３）。 FIG. 6 is an example of a determination flow for determining whether or not initial learning is possible in the learning opportunity determination unit 41. Hereinafter, description will be given with reference to this flow. The process starts from a determination state that the initial learning time is not reached (ST13). The elevator 1 is stopped by the elevator operation / stop information from the elevator operation control unit 3 (ST14), the door is closed by the door opening / closing information from the elevator operation control unit 3 (ST15), and More than a predetermined time has passed since the button in the elevator 1 was last pressed from the button pressing information from the elevator operation control unit 3 (ST16), and the information for the call is determined by the information from the call control unit 42. All conditions that are not calling or talking (ST17) and that the output signal level of the A / D converter 47 is equal to or lower than a predetermined value (this means that the noise in the elevator 1 is low) (ST18). Is satisfied, it is determined that the initial learning is possible (initial learning time) (ST19). Then, a predetermined time, for example, 10 seconds is determined as the initial learning time from the timing determined as the initial learning time (ST20), and when this time elapses, the state returns to the determination state that it is not the initial learning time (ST13).

なお、エレベータ運転制御部３からのボタン押下情報よりエレベータ１内のボタンが最後に押下されてから所定の時間以上が経過しているという条件（ＳＴ１６）はエレベータ１に搭乗者がいないという条件に相当する。従って、エレベータ１に搭乗者がいないことを他の条件から識別する構成を用いることも可能である。また、図６に示す判断フローは一例であって、音声通話装置４の置かれた環境を認識し、その環境に応じて初期学習が可能か否かを判定する構成であれば、他の条件設定に基づく判断フローを用いることも可能である。 It should be noted that the condition (ST16) that a predetermined time or more has passed since the button in the elevator 1 was last pressed from the button pressing information from the elevator operation control unit 3 is the condition that there is no passenger in the elevator 1. Equivalent to. Therefore, it is also possible to use a configuration for identifying that there is no passenger in the elevator 1 from other conditions. Further, the determination flow shown in FIG. 6 is an example, and other conditions can be used as long as the environment in which the voice communication device 4 is placed is recognized and whether or not initial learning is possible according to the environment is determined. It is also possible to use a decision flow based on settings.

次に、信号処理部４５の動作を、その内部構成を示す図７を用いて説明する。図において、４５１はエコー経路のインパルス応答を推定して擬似エコー信号を生成する適応フィルタ、４５２はインターホン２側から入力する送話信号から擬似エコーを減算する減算器、４５３は送話側の信号と受話側の信号から適応フィルタ４５１のインパルス応答推定動作の可否を判定する適応動作可否判定部、４５４は減算器４５２によってエコー抑圧後の残留エコーを更に抑圧する残留エコー抑圧部である。 Next, the operation of the signal processing unit 45 will be described with reference to FIG. In the figure, 451 is an adaptive filter that estimates an impulse response of an echo path and generates a pseudo echo signal, 452 is a subtractor that subtracts the pseudo echo from a transmission signal input from the interphone 2 side, and 453 is a signal on the transmission side. And 454 is a residual echo suppression unit that further suppresses the residual echo after echo suppression by the subtracter 452.

まず、学習契機判定部４１での判定結果が初期学習時間ではなく、通話が行われる場合について説明する。適応フィルタ４５１は、通話開始のタイミングでパラメータ保持部４８に保持されたパラメータを読込む。ここで、パラメータ保持部４８より入力するパラメータはエコー経路のインパルス応答推定値である。その後、適応フィルタ４５１は動作を開始し、受話側の信号（通信回線側の入力信号）より適応フィルタ４５１を通して擬似エコー信号を生成して出力し、減算器４５２がこの擬似エコー信号を送話信号（インターホン２側入力信号）から減算して適応動作可否判定部４５３、残留エコー抑圧部４５４に出力すると共に、適応フィルタに４５１に出力する。適応フィルタ４５１は、減算器４５２からの信号を用いてエコー経路のインパルス応答推定を行う。エコー経路のインパルス応答推定には、例えば、ＬＭＳ（ＬｅａｓｔＭｅａｎＳｑｕａｒｅｄ）アルゴリズムを用いる。ここで、エコー経路のインパルス応答推定動作を行うかどうかは、適応動作可否判定部４５３の判定に従う。適応動作可否判定部４５３は、適応フィルタ４５１におけるエコー経路推定動作が可能かどうかを判定し、判定結果を適応フィルタ４５１に出力する。ここで、受話側に通話信号があり、送話側には通話信号がなく受話側の信号がエレベータ１内で回り込んだエコー信号のみがある状態がエコー経路推定動作に最も望ましい状態であり、適応動作可否判定部４５３は、受話側の信号（通信回線側の入力信号）と減算器４５２による擬似エコー減算後の送話側信号を入力そのパワーを比較し、受話側の信号のパワーが送話側の信号のパワーよりも一定しきい値以上大きい場合、エコー経路推定動作が可能と判定する。減算器４５２の出力信号は、適応フィルタ４５１で生成した擬似エコー信号を送話信号から減算したエコー消去後の信号であるが、一般にはエコー成分が残留する。これを抑圧するため、残留エコー抑圧部４５４は、減算器４５２の出力信号に損失を与える。 First, a case where the determination result in the learning opportunity determination unit 41 is not an initial learning time but a call is performed will be described. The adaptive filter 451 reads the parameter held in the parameter holding unit 48 at the start of the call. Here, the parameter input from the parameter holding unit 48 is an estimated echo response value of the echo path. After that, the adaptive filter 451 starts operation, generates a pseudo echo signal through the adaptive filter 451 from the signal on the reception side (input signal on the communication line side), and outputs the pseudo echo signal. The subtracter 452 transmits the pseudo echo signal to the transmission signal. It is subtracted from the (interphone 2 side input signal) and output to the adaptive operation availability determination unit 453 and the residual echo suppression unit 454 and to the adaptive filter 451. The adaptive filter 451 performs impulse response estimation of the echo path using the signal from the subtractor 452. For the impulse response estimation of the echo path, for example, a LMS (Least Mean Squared) algorithm is used. Here, whether or not to perform the impulse response estimation operation of the echo path depends on the determination of the adaptive operation availability determination unit 453. The adaptive operation availability determination unit 453 determines whether the echo path estimation operation in the adaptive filter 451 is possible, and outputs the determination result to the adaptive filter 451. Here, a state in which there is a call signal on the receiving side, there is no call signal on the transmitting side, and there is only an echo signal in which the signal on the receiving side wraps around in the elevator 1, is the most desirable state for the echo path estimation operation. The adaptive operation availability determination unit 453 compares the power of the receiving side signal (communication line side input signal) and the transmitting side signal after subtraction of the pseudo echo by the subtractor 452, and compares the power of the receiving side signal. If it is greater than a certain threshold value than the signal power of the talk side, it is determined that an echo path estimation operation is possible. The output signal of the subtracter 452 is a signal after echo cancellation obtained by subtracting the pseudo echo signal generated by the adaptive filter 451 from the transmission signal, but generally an echo component remains. In order to suppress this, the residual echo suppression unit 454 gives a loss to the output signal of the subtractor 452.

なお、上記説明した適応動作可否判定部４５３の動作や残留エコー抑圧部４５４の動作は一例であって、例えば適応動作可否判定部４５３は擬似エコー減算前の送話側信号を併用する判定方法を取っても良く、残留エコー抑圧部４５４を備えない構成であっても良い。 Note that the operation of the adaptive operation availability determination unit 453 and the operation of the residual echo suppression unit 454 described above are examples, and for example, the adaptive operation availability determination unit 453 uses a determination method in which the transmission side signal before pseudo echo subtraction is used together. Alternatively, the residual echo suppression unit 454 may be omitted.

次に、学習契機判定部４１が初期学習が可能と判定した場合の初期学習中の動作を説明する。適応フィルタ４５１は、初期学習開始のタイミングでエコー経路のインパルス応答推定値を初期化(全て“０”)し、その後学習時間完了まで、通話時と同様な処理を行う。すなわち、受話側の信号（通信回線側の入力信号）より適応フィルタ４５１を通して擬似エコー信号を生成して出力し、減算器４５２からの信号を用いてエコー経路のインパルス応答推定を行う。適応動作可否判定部４５３は通話時と同様の動作を行う。同様に、残留エコー抑圧部４５４も通話時と同様の動作を行うが、通話中でないため、その出力は通信回線に送信されることはなく使用されない。初期学習時間が完了すると、その完了のタイミングにおいて、初期学習時間中に推定したエコー経路のインパルス応答推定値をパラメータ保持部４８に出力し、これを格納させる。 Next, an operation during initial learning when the learning opportunity determination unit 41 determines that initial learning is possible will be described. The adaptive filter 451 initializes the echo path impulse response estimated value at the timing of the initial learning start (all “0”), and then performs the same processing as during a call until the learning time is completed. That is, a pseudo echo signal is generated and output from the signal on the receiving side (input signal on the communication line side) through the adaptive filter 451, and the impulse response of the echo path is estimated using the signal from the subtracter 452. The adaptive operation availability determination unit 453 performs the same operation as during a call. Similarly, the residual echo suppression unit 454 performs the same operation as during a call, but since the call is not in progress, its output is not transmitted to the communication line and is not used. When the initial learning time is completed, the estimated impulse response value of the echo path estimated during the initial learning time is output to the parameter holding unit 48 and stored at the completion timing.

以上のように、本実施の形態に係る発明によれば、学習契機判定部４１が、周辺の環境を認識し、その環境に応じて初期学習が可能か否かを判定する構成を用いることにより、ユーザに違和感を与えることなく自動的に適切な時間帯で初期学習を行うことが可能な音声通話装置４を得ることができる。 As described above, according to the invention according to the present embodiment, the learning opportunity determination unit 41 recognizes the surrounding environment and determines whether or not initial learning is possible according to the environment. Thus, it is possible to obtain the voice call device 4 that can automatically perform initial learning in an appropriate time zone without giving the user a sense of incongruity.

特に、その周辺の環境として、エレベータ１が停止中で、ドアが閉まっており、エレベータ１内の行先ボタンが押下されてから所定の時間以上が経過し、通話中でなく、マイク２２からの入力信号レベルが所定値以下の場合に初期学習可能と判断することにより、実際に音声通話装置４が用いられる環境に近い環境で、初期学習を行うことができ、実際に音声通話装置４が用いられる環境に適用することが可能となる。 In particular, as the surrounding environment, the elevator 1 is stopped, the door is closed, and a predetermined time has passed since the destination button in the elevator 1 was pressed. By determining that initial learning is possible when the signal level is equal to or lower than a predetermined value, initial learning can be performed in an environment close to the environment where the voice call device 4 is actually used, and the voice call device 4 is actually used. It can be applied to the environment.

すなわち、実施の形態１に係る音声通話装置４は、エレベータ用の音声通話装置であり、信号のエコーの経路に依存するパラメータを保持するパラメータ保持部４８と、前記パラメータを用いて音声通話時の音声信号を処理する信号処理部４５と、自装置が設置されたエレベータ１内の環境に応じて、音声通話に先立って前記パラメータの学習を行うことが可能か否かを判定する学習契機判定部４１と、前記学習契機判定部４１で学習を行うことが可能と判定されたとき、前記パラメータの学習を行う学習信号を発生する学習信号発生部４３と、を備えたことを特徴とする。この構成によって、ユーザに違和感を与えることなく自動的に適切なタイミングで初期学習を行うことが可能となる。 That is, the voice call device 4 according to the first embodiment is a voice call device for an elevator, and includes a parameter holding unit 48 that holds parameters depending on a signal echo path, and a voice call using the parameters. A signal processing unit 45 that processes voice signals and a learning opportunity determination unit that determines whether or not the parameter can be learned prior to a voice call according to the environment in the elevator 1 in which the device is installed 41 and a learning signal generation unit 43 that generates a learning signal for learning the parameter when it is determined that the learning opportunity determination unit 41 can perform learning. With this configuration, it is possible to automatically perform initial learning at an appropriate timing without causing the user to feel uncomfortable.

また、実施の形態１に係る音声通話装置４では、学習契機判定部４１は、エレベータ１内に人がいないときに、前記パラメータの学習を行うことが可能と判定することを特徴とする。この構成によって、音声通話装置４はエレベータ１内に人がいないタイミングで初期学習を行うことができる。 Further, the voice call device 4 according to Embodiment 1 is characterized in that the learning opportunity determination unit 41 determines that the parameter can be learned when there is no person in the elevator 1. With this configuration, the voice communication device 4 can perform initial learning at a timing when there is no person in the elevator 1.

また、実施の形態１に係る音声通話装置４では、学習契機判定部４１は、エレベータ１が停止中であり、前記エレベータ１のドアが閉まった状態であり、前記エレベータ１内のボタンが押下されてから所定の時間が経過したとき、前記エレベータ１内の環境が前記パラメータの学習が可能な環境であると判定し、前記パラメータの学習を行うタイミングを決定することを特徴とする。この構成によって、ユーザに違和感を与えることなく自動的に適切なタイミングで初期学習を行うことが可能となる。 In the voice call device 4 according to the first embodiment, the learning opportunity determination unit 41 is in a state where the elevator 1 is stopped and the door of the elevator 1 is closed, and a button in the elevator 1 is pressed. When a predetermined time elapses, it is determined that the environment in the elevator 1 is an environment in which the parameter can be learned, and the timing for learning the parameter is determined. With this configuration, it is possible to automatically perform initial learning at an appropriate timing without causing the user to feel uncomfortable.

また、実施の形態１に係る音声通話装置４では、パラメータ保持部４８で保持されるパラメータは学習信号が音声通話装置４から送信される時点から音声通話装置４で受信される時点までに前記学習信号が通る経路のインパルス応答の値である。この構成によって、音声通話装置４はパラメータ保持部４８で保持されたインパルス応答の値を用いて通話信号のエコーキャンセルを行うことができ、良好な通話信号の品質を確保することができる。 In the voice call device 4 according to the first embodiment, the parameters held by the parameter holding unit 48 are learned from the time when the learning signal is transmitted from the voice call device 4 to the time when the voice call device 4 receives the learning signal. The value of the impulse response of the path through which the signal passes. With this configuration, the voice call device 4 can perform echo cancellation of the call signal using the value of the impulse response held by the parameter holding unit 48, and can ensure good call signal quality.

実施の形態２．
実施の形態１では、学習契機判定部４１が図６に示すフローに基づいて初期学習が可能な時間か否かの判定が行われた。具体的には、エレベータ１に搭乗者がいないことをエレベータ運転制御部３からのボタン押下情報よりエレベータ１内のボタンが最後に押下されてから所定の時間以上が経過しているという条件（ＳＴ１６）によって初期学習時間を判定した。これに対して、本実施の形態では、エレベータ１に搭乗者がいないことを他の手段で判定する構成を示す。Embodiment 2. FIG.
In the first embodiment, it is determined whether or not it is time for the initial learning to be possible based on the flow shown in FIG. Specifically, the condition that there is no passenger in the elevator 1 is that a predetermined time or more has passed since the button in the elevator 1 was last pressed based on the button pressing information from the elevator operation control unit 3 (ST16). ) To determine the initial learning time. On the other hand, in this Embodiment, the structure which determines with the other means that there is no passenger in the elevator 1 is shown.

エレベータ１の中には、乗り過ぎを通知するブザーを備えたものがある。そのようなエレベータ１においては、エレベータ運転制御部３は学習契機判定部４１に対して、エレベータ１内の重量情報を出力することができる。学習契機判定部４１はこの情報を初期学習時間の判定に用いる。図８に初期学習時間の判定フローを示す。このフローでは、エレベータ１内のボタンが最後に押下されてから所定の時間以上が経過しているという条件に代わり、エレベータ１内の重量が一定値以下である（ＳＴ２１）という条件を判定に用いており、それ以外は図６と同様のフローである。エレベータ１内の重量が一定値以下であれば搭乗者はいないと言える。 Some elevators 1 are provided with a buzzer that notifies overriding. In such an elevator 1, the elevator operation control unit 3 can output weight information in the elevator 1 to the learning opportunity determination unit 41. The learning opportunity determination unit 41 uses this information for determination of the initial learning time. FIG. 8 shows an initial learning time determination flow. In this flow, instead of the condition that a predetermined time has passed since the button in the elevator 1 was last pressed, the condition that the weight in the elevator 1 is equal to or less than a certain value (ST21) is used for the determination. Otherwise, the flow is the same as in FIG. If the weight in the elevator 1 is not more than a certain value, it can be said that there is no passenger.

以上のように、本実施の形態に係る発明によれば、学習契機判定部４１がエレベータ１内の重量が所定値以下であることを判定することにより、搭乗者がいないことを判定し、学習可能な時間か否かを決定する。この構成によって、実施の形態１と同様に、ユーザに違和感を与えることなく自動的に適切な時間帯で初期学習を行うことが可能な音声通話装置を得ることができる。 As described above, according to the invention according to the present embodiment, the learning opportunity determination unit 41 determines that there is no passenger by determining that the weight in the elevator 1 is equal to or less than the predetermined value, and learning. Determine if it is possible time. With this configuration, as in the first embodiment, it is possible to obtain a voice communication device that can automatically perform initial learning in an appropriate time zone without giving a sense of incongruity to the user.

すなわち、実施の形態２に係る音声通話装置４では、学習契機判定部４１は、エレベータ１が停止中であり、前記エレベータ１のドアが閉まった状態であり、前記エレベータ１内の重量が所定値以下であるとき、前記エレベータ１内の環境が前記パラメータの学習が可能な環境であると判定し、前記パラメータの学習を行う時間帯を決定することを特徴とする。この構成によって、ユーザに違和感を与えることなく自動的に適切な時間帯で初期学習を行うことが可能となる。 That is, in the voice call device 4 according to Embodiment 2, the learning opportunity determination unit 41 is in a state where the elevator 1 is stopped, the door of the elevator 1 is closed, and the weight in the elevator 1 is a predetermined value. When it is below, it is determined that the environment in the elevator 1 is an environment in which the parameter can be learned, and a time zone in which the parameter is learned is determined. With this configuration, it is possible to automatically perform initial learning in an appropriate time zone without causing the user to feel uncomfortable.

実施の形態３．
本実施の形態では、エレベータ１内に搭乗者がいないという条件をカメラ画像により判定する構成を示す。この通話の際、音声通信だけでなく画像通信も同時に可能なエレベータもある。Embodiment 3 FIG.
In this Embodiment, the structure which determines the conditions that there is no passenger in the elevator 1 with a camera image is shown. Some elevators can perform not only voice communication but also image communication at the same time.

図９に学習契機判定部４１が初期学習時間を判定するフローを示す。このフローでは、エレベータ１内のボタンが最後に押下されてから所定の時間以上が経過しているという条件に代わり、カメラ画像内が無人である（ＳＴ２２）という条件を判定に用いており、それ以外は図６と同様のフローである。 FIG. 9 shows a flow in which the learning opportunity determination unit 41 determines the initial learning time. In this flow, instead of the condition that a predetermined time has passed since the button in the elevator 1 was last pressed, the condition that the camera image is unmanned (ST22) is used for the determination. The other flow is the same as in FIG.

以上のように、本実施の形態に係る発明によれば、学習契機判定部４１はカメラ画像内が無人であることを判定することにより、搭乗者がいないことを判定し、初期学習が可能か否かを判定する。この構成によって、実施の形態１、２と同様に、ユーザに違和感を与えることなく自動的に適切な時間帯で初期学習を行うことが可能な音声通話装置を得ることができる。 As described above, according to the invention according to the present embodiment, the learning opportunity determination unit 41 determines that there is no passenger by determining that the camera image is unmanned, and is initial learning possible? Determine whether or not. With this configuration, as in the first and second embodiments, it is possible to obtain a voice call device that can automatically perform initial learning in an appropriate time zone without giving a sense of incongruity to the user.

すなわち、実施の形態３に係る音声通話装置４では、学習契機判定部４１は、エレベータ１が停止中であり、前記エレベータ１のドアが閉まった状態であり、前記エレベータ１内に設置されたカメラの画像が無人であるとき、前記エレベータ１内の環境が前記パラメータの学習が可能な環境であると判定することを特徴とする。この構成によって、ユーザに違和感を与えることなく自動的に適切な時間帯で初期学習を行うことが可能となる。 That is, in the voice call device 4 according to Embodiment 3, the learning opportunity determination unit 41 is in a state where the elevator 1 is stopped, the door of the elevator 1 is closed, and the camera installed in the elevator 1 When the image is unattended, it is determined that the environment in the elevator 1 is an environment in which the parameter can be learned. With this configuration, it is possible to automatically perform initial learning in an appropriate time zone without causing the user to feel uncomfortable.

実施の形態４．
実施の形態１〜３では、信号処理部４５がパラメータ保持部４８に保持するパラメータをエコー経路のインパルス応答推定値とした。これに対して、本実施の形態では、このインパルス応答推定値の全てを保持するのではなく、所定の遅延時間内のインパルス応答推定値のみを保持する構成を示す。Embodiment 4 FIG.
In the first to third embodiments, the parameters that the signal processing unit 45 holds in the parameter holding unit 48 are the impulse response estimation values of the echo path. On the other hand, in the present embodiment, a configuration is shown in which not all of the impulse response estimated values are retained, but only the impulse response estimated values within a predetermined delay time are retained.

適応フィルタ４５１が備えるインパルス応答推定値がｎ個ある場合、本実施の形態ではこれを遅延時間の短い順に並べるとＨ０、Ｈ１、・・・Ｈｎ−１となるとすると、パラメータ保持部４８にはｍ個のインパルス応答Ｈ０、Ｈ１、・・・Ｈｍ−１（ｍ＜ｎ）のみを格納する。また、通話開始の際、信号処理部４５はパラメータ保持部４８からｍ個のインパルス応答のみを入力することになるため、残りｎ−ｍ個のインパルス応答Ｈｍ、Ｈｍ＋１、・・・Ｈｎ−１については０値を用いて動作を開始する。 When there are n impulse response estimation values provided in the adaptive filter 451, in the present embodiment, if these are arranged in order of increasing delay time, H0, H1,. Only the impulse responses H0, H1,... Hm−1 (m <n) are stored. In addition, since the signal processing unit 45 inputs only m impulse responses from the parameter holding unit 48 at the start of a call, the remaining n−m impulse responses Hm, Hm + 1,. Starts operation with a zero value.

インパルス応答は、エコー経路特性をインパルス入力時の応答信号で表現するものであり、遅延時間の短いインパルス応答はインターホン２のスピーカ２１から出力される音声が直接インターホン２のマイク２２に伝わる直接音によるエコーに相当し、遅延時間の長いインパルス応答はインターホン２のスピーカ２１から出力される音声がエレベータ１の壁、ドアなどで反射してからインターホン２のマイク２２に伝わる反射音によるエコーに相当する。反射音はエレベータ１内の搭乗者の人数や位置に依存して変化するが、直接音は変化しない。このため、通話開始の際に、遅延時間の長いインパルス応答推定値Ｈｍ、Ｈｍ＋１、・・・Ｈｎ−１について全て０の状態で通話が開始されたとしても、全インパルス応答推定値Ｈ０、Ｈ１、・・・Ｈｎ−１をパラメータ保持部４８から入力する場合とエコー消去性能に殆ど差異はなく、保持するインパルス応答推定値が少なくなることからパラメータ保持部４８に要するメモリ量、すなわちＨ／Ｗ（Ｈａｒｄｗａｒｅ）規模を削減することができる。 The impulse response expresses the echo path characteristic with a response signal at the time of impulse input, and the impulse response with a short delay time is due to the direct sound that the sound output from the speaker 21 of the interphone 2 is directly transmitted to the microphone 22 of the interphone 2. The impulse response with a long delay time corresponds to an echo due to a reflected sound transmitted from the speaker 21 of the interphone 2 to the microphone 22 of the interphone 2 after the sound output from the speaker 21 of the interphone 2 is reflected by the wall or door of the elevator 1. The reflected sound changes depending on the number and positions of passengers in the elevator 1, but the direct sound does not change. For this reason, even when a call is started in a state where all impulse response estimation values Hm, Hm + 1,..., Hn−1 having a long delay time are all 0 at the start of the call, all impulse response estimation values H0, H1,. ... There is almost no difference in echo canceling performance from the case where Hn-1 is input from the parameter holding unit 48, and since the estimated impulse response value to be held decreases, the amount of memory required for the parameter holding unit 48, that is, H / W ( Hardware) The scale can be reduced.

以上のように、本実施の形態に係る発明によれば、パラメータ保持部４８に所定の遅延時間内のインパルス応答推定値のみを保持するためＨ／Ｗ規模を削減が可能となる。また、ユーザに違和感を与えることなく自動的に適切な時間帯で初期学習を行うことが可能な音声通話装置を得ることができる。 As described above, according to the invention according to the present embodiment, only the impulse response estimated value within a predetermined delay time is held in the parameter holding unit 48, so that the H / W scale can be reduced. In addition, it is possible to obtain a voice communication device that can automatically perform initial learning in an appropriate time zone without giving a sense of incongruity to the user.

すなわち、実施の形態４に係る音声通話装置４では、パラメータ保持部４８で保持されるパラメータは学習信号が音声通話装置４から送信される時点から音声通話装置４で受信される時点までに前記学習信号が通る経路のインパルス応答の値の中で遅延時間の短い一部のインパルス応答の値である。この構成によって、保持するインパルス応答推定値が少なくなることからパラメータ保持部４８に要するメモリ量、すなわちＨ／Ｗ規模を削減することができる。 That is, in the voice call device 4 according to the fourth embodiment, the parameters held by the parameter holding unit 48 are learned from the time when the learning signal is transmitted from the voice call device 4 to the time when the voice call device 4 receives the learning signal. This is the value of a part of the impulse response with a short delay time among the impulse response values of the path through which the signal passes. With this configuration, since the estimated impulse response value to be held is reduced, the amount of memory required for the parameter holding unit 48, that is, the H / W scale can be reduced.

実施の形態５．
実施の形態１〜４においては、エレベータ搭乗者がなく、センターとの間の通話が行われていない間にパラメータの初期学習を行うことが可能と判定した。これに対して、本実施の形態では、エレベータの閉じ込めなどが発生して通話が行われる直前に初期学習を行う形態を示す。Embodiment 5. FIG.
In the first to fourth embodiments, it is determined that there is no elevator passenger and initial learning of parameters can be performed while a call with the center is not being performed. On the other hand, this embodiment shows a mode in which initial learning is performed immediately before a call is made due to an elevator confinement.

初期学習時には白色雑音などの学習信号をスピーカから出力する必要があるが、エレベータ内で緊急事態に遭遇したユーザに対して耳障りな学習信号を聞かせる運用は許容し難いという問題がある。そこで、本実施の形態では、緊急事態に遭遇したユーザに対して耳障りに感じない学習信号を用いる構成を示す。 During initial learning, a learning signal such as white noise needs to be output from the speaker, but there is a problem that it is difficult to allow an operation in which an harsh learning signal is heard to a user who encounters an emergency in an elevator. Therefore, in the present embodiment, a configuration using a learning signal that does not feel harsh to a user who encounters an emergency situation is shown.

図１０は音声通話装置４の動作を示すフローチャートであり、以下、この図を用いて音声通話装置４の動作を説明する。通話制御部４２は非常通話ボタン２３が押下されたかどうかを監視しており（ＳＴ１）、エレベータ内に閉じ込められた搭乗者が非常通話ボタン２３を押下すると、これを契機に通話制御部４２が通信回線インターフェース４９を介して監視センター６と制御信号を送受信し、監視センター呼出しを開始する（ＳＴ２３）。これと同時に、学習契機判定部４１は初期学習の時間と判定し（ＳＴ２４）、初期学習を開始する。 FIG. 10 is a flowchart showing the operation of the voice call device 4. Hereinafter, the operation of the voice call device 4 will be described with reference to FIG. The call control unit 42 monitors whether or not the emergency call button 23 is pressed (ST1), and when a passenger confined in the elevator presses the emergency call button 23, the call control unit 42 communicates with this as a trigger. A control signal is transmitted / received to / from the monitoring center 6 via the line interface 49, and a monitoring center call is started (ST23). At the same time, the learning opportunity determination unit 41 determines the initial learning time (ST24), and starts the initial learning.

初期学習が始まると、学習信号発生部４３はパラメータ初期学習用の信号としてチャープ信号を出力する（ＳＴ２５）。なお、チャープ信号の説明は後述する。スイッチ４４は、通信回線インターフェース４９ではなく、学習信号発生部４３からの入力を選択して信号処理部４５に出力するので、学習信号発生部４３の出力信号は、信号処理部４５を経てＤ／Ａ変換器４６に出力される（ＳＴ９）。そして、Ｄ／Ａ変換器４６でアナログ信号に変換されてインターホン２内のスピーカ２１から出力される。インターホン２内のマイク２２からのエレベータ内登場者の音声は、Ａ／Ｄ変換器４７でデジタル信号に変換された後、信号処理部４５によってエコーが消去される（ＳＴ１０）。学習契機判定部４１は、初期学習の開始から一定時間、例えば１０秒経過するまでを初期学習時間と判定し（ＳＴ２０）、その間、上記の学習動作（ＳＴ２５、ＳＴ９、ＳＴ１０）を継続する。初期学習時間が終了すると、エコー消去動作に伴って信号処理部４５が学習したパラメータをパラメータ保持部４８に格納する（ＳＴ１２）。 When the initial learning starts, the learning signal generation unit 43 outputs a chirp signal as a parameter initial learning signal (ST25). The chirp signal will be described later. Since the switch 44 selects the input from the learning signal generation unit 43 instead of the communication line interface 49 and outputs it to the signal processing unit 45, the output signal of the learning signal generation unit 43 passes through the signal processing unit 45 to D / The data is output to the A converter 46 (ST9). Then, it is converted into an analog signal by the D / A converter 46 and output from the speaker 21 in the intercom 2. The voice of the person in the elevator from the microphone 22 in the interphone 2 is converted into a digital signal by the A / D converter 47, and then the echo is eliminated by the signal processing unit 45 (ST10). The learning opportunity determination unit 41 determines that a predetermined time, for example, 10 seconds elapses from the start of initial learning, as the initial learning time (ST20), and continues the learning operation (ST25, ST9, ST10) during that time. When the initial learning time ends, the parameters learned by the signal processing unit 45 along with the echo cancellation operation are stored in the parameter holding unit 48 (ST12).

上記初期学習動作と、通話制御部４２による監視センター６への呼出しは並行して行われ、監視センター６との通信が確立して呼出しが完了すると（ＳＴ２６）、信号処理部４５はパラメータ保持部４８に格納されたパラメータを入力し（ＳＴ３）、通話が始まる。通話中、スイッチ４４は通信回線インターフェース４９からの入力信号を選択して信号処理部４５に出力するため、通信回線インターフェース４９からの入力信号は信号処理部４５を経由してＤ／Ａ変換器４６に出力される（ＳＴ４）。そして、Ｄ／Ａ変換器４６でアナログ信号に変換されてインターホン２内のスピーカ２１から出力される。また、インターホン２内のマイク２２からのエレベータ内登場者の音声は、Ａ／Ｄ変換器４７でデジタル信号に変換された後、信号処理部４５によってエコーが消去され、通信回線インターフェース４９を経て通信回線に出力される（ＳＴ５）。そして、通話が終了するまでこの動作を継続する（ＳＴ６）。 The initial learning operation and the call control unit 42 call to the monitoring center 6 are performed in parallel, and when communication with the monitoring center 6 is established and the call is completed (ST26), the signal processing unit 45 is a parameter holding unit. The parameters stored in 48 are input (ST3), and the call starts. During a call, the switch 44 selects an input signal from the communication line interface 49 and outputs it to the signal processing unit 45, so that the input signal from the communication line interface 49 passes through the signal processing unit 45 to the D / A converter 46. (ST4). Then, it is converted into an analog signal by the D / A converter 46 and output from the speaker 21 in the intercom 2. In addition, the voice of the person in the elevator from the microphone 22 in the interphone 2 is converted into a digital signal by the A / D converter 47, and then the echo is eliminated by the signal processing unit 45 and communicated via the communication line interface 49. It is output to the line (ST5). This operation is continued until the call is finished (ST6).

以下、初期学習において学習信号発生部４３が発生する信号について説明する。初期学習の時間はセンター呼出し中であることから、学習信号発生部４３は、学習信号として、断続する呼出し音を発生する。一般の電話通信で用いられる呼出し音は、４００Ｈｚのトーン信号を１６Ｈｚで振幅変調した信号であるが、このような周波数帯域の狭い信号は学習信号には適していない。そこで、チャープ信号を断続的に出力し、これを呼出し音とする。チャープ信号とは、周波数が時間とともに増加または下降する信号である。ある時間ｔ（０≦ｔ≦Ｔ）におけるチャープ信号をＣＨ（ｔ）とし、この信号が時間０からＴの間に、周波数Ｆ０からＦ１まで増加するものである場合、下式のように表すことができる。

上式において、Ａはチャープ信号の最大振幅である。チャープ信号は、虫や鳥の鳴き声（ｃｈｉｒｐ）に似た音であることから白色雑音のように耳障りでない。また断続音とすることで呼出し音として自然に感じられるため、エレベータ搭乗者に違和感を与えることがない。更に、広い周波数帯域を持つため適応フィルタ４５１の学習にも適している。Hereinafter, signals generated by the learning signal generator 43 in the initial learning will be described. Since the initial learning time is during the center call, the learning signal generator 43 generates intermittent ringing sounds as learning signals. A ringing tone used in general telephone communication is a signal obtained by amplitude-modulating a 400 Hz tone signal at 16 Hz, but such a narrow frequency band signal is not suitable as a learning signal. Therefore, a chirp signal is intermittently output and used as a ringing tone. A chirp signal is a signal whose frequency increases or decreases with time. When the chirp signal at a certain time t (0 ≦ t ≦ T) is CH (t) and this signal increases from the frequency F0 to F1 during the time 0 to T, it is expressed as follows: Can do.

In the above equation, A is the maximum amplitude of the chirp signal. The chirp signal is not harsh like white noise because it is a sound similar to a chirp of insects and birds. Moreover, since it is naturally felt as a ringing tone by using an intermittent tone, it does not give a feeling of strangeness to the elevator passenger. Furthermore, since it has a wide frequency band, it is suitable for learning of the adaptive filter 451.

以上のように、本発明によれば、学習契機判定部４１が、センターの呼出しが開始されてから一定時間と、呼出し開始から通話確立までの時間のうち、何れか長い方を初期学習可能と判断する。また、学習信号発生部４３は周波数帯域が広く断続する呼出し音を学習信号として発生する。この構成を用いることにより、ユーザに違和感を与えることなく自動的に適切な時間帯で初期学習を行うことが可能な音声通話装置を得ることができる。 As described above, according to the present invention, the learning opportunity determination unit 41 can initially learn whichever is longer between a fixed time from the start of the center call and the time from the start of the call to the establishment of the call. to decide. The learning signal generator 43 generates a ringing tone having a wide frequency band as a learning signal. By using this configuration, it is possible to obtain a voice call device that can automatically perform initial learning in an appropriate time zone without causing a user to feel uncomfortable.

また、実施の形態５において、センターの呼出しが開始されるのはエレベータ１内に人がいるときである。すなわち、実施の形態５に係る音声通話装置４では、エレベータ１内に人がいるときに、前記学習信号発生部はチャープ信号を前記学習信号として用いることを特徴とする。この構成によって、エレベータの閉じ込めなどが発生して通話が行われる直前にユーザの存在する環境で初期学習を行う場合でも、ユーザにとって耳障りでない環境を維持しつつ、初期学習を行うことができる。その結果、ユーザにとって、違和感を与えることなく、良好な通話環境を実現することができる。 In the fifth embodiment, the center call is started when there is a person in the elevator 1. That is, in the voice call device 4 according to the fifth embodiment, when there is a person in the elevator 1, the learning signal generation unit uses a chirp signal as the learning signal. With this configuration, even when the initial learning is performed in an environment where the user exists immediately before a call is made due to an elevator being confined, the initial learning can be performed while maintaining an environment that is not harsh to the user. As a result, it is possible to realize a favorable call environment without giving a sense of incongruity to the user.

実施の形態６．
実施の形態１〜５では、エコー経路のインパルス応答推定値を初期学習で学習するパラメータとしたのに対し、本実施の形態では、エコー経路のインパルス応答推定値以外のパラメータを初期学習で学習するパラメータとする構成を示す。Embodiment 6 FIG.
In the first to fifth embodiments, the impulse response estimated value of the echo path is a parameter learned by initial learning, whereas in this embodiment, parameters other than the echo response estimated value of the echo path are learned by initial learning. Indicates the configuration as a parameter.

図１１は、本実施の形態による信号処理部４５の内部構成図であり、図において、４５５は減算器４５２の前後の信号に与えるゲインまたはロスを算出するゲイン／ロス算出部、４５６及び４５７は信号にゲインまたはロスを与えるゲイン／ロス挿入部である。 FIG. 11 is an internal configuration diagram of the signal processing unit 45 according to the present embodiment. In the figure, reference numeral 455 denotes a gain / loss calculation unit that calculates a gain or loss applied to signals before and after the subtractor 452, and reference numerals 456 and 457 denote. It is a gain / loss insertion section that gives a gain or loss to a signal.

図において、適応動作可否判定部４５３と残留エコー抑圧部４５４の動作は、図７に示した信号処理部４５と同様である。適応フィルタ４５１の動作については、学習契機判定部４１が初期学習可能と判定した初期学習中の動作のみが図７に示した信号処理部４５と異なる。適応フィルタ４５１は、学習開始のタイミングから一定時間経過後、例えば１秒後にエコー経路のインパルス応答推定値を初期化(全て“０”)し、その後学習時間完了まで、通話時と同様な処理を行う。初期学習時間が完了すると、その完了のタイミングにおいて、初期学習時間中に推定したエコー経路のインパルス応答推定値をパラメータ保持部４８に出力し、これを格納させる。 In the figure, the operations of the adaptive operation availability determination unit 453 and the residual echo suppression unit 454 are the same as those of the signal processing unit 45 shown in FIG. The operation of the adaptive filter 451 is different from the signal processing unit 45 shown in FIG. 7 only in the operation during the initial learning that the learning opportunity determination unit 41 determines that the initial learning is possible. The adaptive filter 451 initializes the impulse response estimated value of the echo path after a certain time elapses from the learning start timing, for example, 1 second later (all “0”), and then performs the same processing as during a call until the learning time is completed. Do. When the initial learning time is completed, the estimated impulse response value of the echo path estimated during the initial learning time is output to the parameter holding unit 48 and stored at the completion timing.

そして、学習契機判定部４１が初期学習可能と判定したタイミングから適応フィルタ４５１が学習を開始するまでの間は、ゲイン／ロス算出部４５５がゲイン／ロスの学習を行う。この間、ゲイン／ロス算出部４５５は、インターホン２に出力される信号レベルＬ１とインターホン２から入力される信号レベルＬ２とを算出し、適応フィルタ４５１が学習を開始するタイミングで、ゲイン／ロス挿入部４５６に与えるゲイン／ロス値としてα×Ｌ２／Ｌ１を、ゲイン／ロス挿入部４５７にはその逆数であるＬ１／（α×Ｌ２）を与えると共に、ゲイン／ロス挿入部４５６用のゲイン／ロス値であるα×Ｌ２／Ｌ１とその逆数でありゲイン／ロス挿入部４５７用のゲイン／ロス値であるＬ１／（α×Ｌ２）とをパラメータ保持部４８に出力して格納させる。ここで、αは安全係数であり、１以下の固定値を用いる。また、ゲイン／ロス算出部４５５は、通話制御部４２からの情報により通話開始のタイミングを知り、この通話開始のタイミングでパラメータ保持部４８からゲイン／ロス挿入部４５６用のゲイン／ロス値とゲイン／ロス挿入部４５７用のゲイン／ロス値を取り出し、それぞれゲイン／ロス挿入部４５６及びゲイン／ロス挿入部４５７に設定する。 Then, the gain / loss calculation unit 455 performs gain / loss learning from the timing when the learning opportunity determination unit 41 determines that initial learning is possible until the adaptive filter 451 starts learning. During this time, the gain / loss calculation unit 455 calculates the signal level L1 output to the interphone 2 and the signal level L2 input from the interphone 2, and at the timing when the adaptive filter 451 starts learning, the gain / loss insertion unit Α × L2 / L1 is given as a gain / loss value to be given to 456, L1 / (α × L2) which is the reciprocal number is given to the gain / loss insertion unit 457, and a gain / loss value for the gain / loss insertion unit 456 is given. .Alpha..times.L2 / L1 and L1 / (. Alpha..times.L2), which is the reciprocal and gain / loss value for the gain / loss insertion unit 457, are output to the parameter holding unit 48 and stored. Here, α is a safety coefficient, and a fixed value of 1 or less is used. Further, the gain / loss calculation unit 455 knows the timing of the call start from the information from the call control unit 42, and the gain / loss value and gain for the gain / loss insertion unit 456 from the parameter holding unit 48 at the call start timing. The gain / loss value for the / loss insertion unit 457 is extracted and set in the gain / loss insertion unit 456 and the gain / loss insertion unit 457, respectively.

ゲイン／ロス挿入部４５６及びゲイン／ロス挿入部４５７は通話時、初期学習時ともに同じ動作を行い、ゲイン／ロス算出部４５５から設定されたゲイン／ロス値を、入力信号に対して乗算して出力する。 The gain / loss insertion unit 456 and the gain / loss insertion unit 457 perform the same operation during a call and during initial learning, and multiply the input signal by the gain / loss value set by the gain / loss calculation unit 455. Output.

上記のようなゲイン／ロスの挿入は、適応フィルタ４５１を固定小数点演算で実現する場合に以下のような効果がある。エコー経路のインパルス応答は、エコー経路のゲインが大きいと大きな値となり、逆に、エコー経路のゲインが小さいと小さな値となる。従って、上記のようなゲイン／ロスの挿入を行わない場合、適応フィルタ４５１内のエコー経路インパルス応答推定値Ｈ０、Ｈ１、・・・、Ｈｎ−１が固定小数点数であれば、数値のオーバーフローを防止するため想定される最大のエコー経路ゲインに応じて、予め小数点位置を定めておくことになる。このような方法を取ると、エコー経路のゲインが小さい場合には固定小数点数であるＨ０、Ｈ１、・・・、Ｈｎ−１の上位側のビットが余って有効ビット数が減ることになり、エコー経路インパルス応答の推定精度が低くなる。上記のようなゲイン／ロスの挿入を行うと、ゲイン／ロス挿入部４５６で挿入されるゲイン／ロスによって適応フィルタ４５１から見たエコー経路のゲインが常に同様になり、Ｈ０、Ｈ１、・・・、Ｈｎ−１の有効ビット減少を防止することが可能となる。また、ゲイン／ロス挿入部４５７では、ゲイン／ロス挿入部４５６で挿入するゲイン／ロスの逆数を乗算するため、その他の処理に影響を与えないことになる。 The gain / loss insertion as described above has the following effects when the adaptive filter 451 is realized by fixed-point arithmetic. The impulse response of the echo path has a large value when the gain of the echo path is large, and conversely has a small value when the gain of the echo path is small. Therefore, if the gain / loss insertion as described above is not performed, if the echo path impulse response estimated values H0, H1,..., Hn-1 in the adaptive filter 451 are fixed-point numbers, the numerical value overflows. The decimal point position is determined in advance according to the maximum echo path gain assumed for prevention. When such a method is adopted, when the gain of the echo path is small, the high-order bits of the fixed-point numbers H0, H1,. The estimation accuracy of the echo path impulse response is lowered. When the gain / loss insertion as described above is performed, the gain of the echo path viewed from the adaptive filter 451 is always the same by the gain / loss inserted by the gain / loss insertion unit 456, and H0, H1,. , Hn−1 effective bit reduction can be prevented. Further, the gain / loss insertion unit 457 multiplies the reciprocal of the gain / loss inserted by the gain / loss insertion unit 456, and thus does not affect other processes.

なお、図１１に示す構成に変わり、ゲイン／ロス挿入部４５６とゲイン／ロス挿入部４５７によるゲイン／ロスの挿入位置を図１２のようにしても全く同様な効果がある。 In place of the configuration shown in FIG. 11, even if the gain / loss insertion positions by the gain / loss insertion unit 456 and the gain / loss insertion unit 457 are as shown in FIG.

以上のように、本発明によれば、ゲイン／ロス算出部４５５がエコー経路のゲインに応じたゲイン／ロス値を求め、適応フィルタ４５１の入力、出力信号に求めたゲイン／ロス値とその逆数を乗算する。この構成によって、適応フィルタ４５１によるエコー経路インパルス応答の推定精度を高く保つことができる音声通話装置を得ることができる。 As described above, according to the present invention, the gain / loss calculation unit 455 obtains the gain / loss value corresponding to the gain of the echo path, and obtains the gain / loss value obtained from the input and output signals of the adaptive filter 451 and its inverse. Multiply With this configuration, it is possible to obtain a voice communication device that can maintain high estimation accuracy of the echo path impulse response by the adaptive filter 451.

すなわち、実施の形態６に係る音声通話装置４では、パラメータ保持部４８で保持されるパラメータは学習信号が音声通話装置４から送信される時点から音声通話装置４で受信される時点までに学習信号が通る経路のゲインであることを特徴とする。この構成によって、適応フィルタ４５１によるエコー経路インパルス応答の推定精度を高く保つことができる音声通話装置を得ることができる。 That is, in the voice call device 4 according to the sixth embodiment, the parameters held by the parameter holding unit 48 are the learning signals from the time when the learning signal is transmitted from the voice call device 4 to the time when the voice call device 4 receives it. It is the gain of the path which passes. With this configuration, it is possible to obtain a voice communication device that can maintain high estimation accuracy of the echo path impulse response by the adaptive filter 451.

なお、実施の形態１〜６では初期学習を扱ったが、初期学習は通話に先立って行われる学習を意味するものである。従って、初期学習を１回だけ行ってもよいし、初期学習を数回行っても構わない。 Although the first to sixth embodiments deal with initial learning, initial learning means learning performed prior to a call. Therefore, the initial learning may be performed only once, or the initial learning may be performed several times.

１：エレベータ、２：インターホン、３：エレベータ運転制御部、４：音声通話装置、５：通信ネットワーク、６：監視センター、７：電話端末、２１：スピーカ、２２：マイクロフォン、２３：非常通話ボタン、４１：学習契機判定部、４２：通話制御部、４３：学習信号発生部、４４：スイッチ、４５：信号処理部、４６：Ｄ／Ａ変換器、４７：Ａ／Ｄ変換器、４８：パラメータ保持部、４９：通信回線インターフェース、４０１：プロセッサ、４０２：メモリ、４０３：Ａ／Ｄ、Ｄ／Ａ変換ＬＳＩ、４０４：スイッチ開閉検出ＬＳＩ、４０５：ネットワークインターフェースＡ、４０６：ネットワークインターフェースＢ、４０７：デジタル信号処理プロセッサ、４５１：適応フィルタ、４５２：減算器、４５３：適応動作可否判定部、４５４：残留エコー抑圧部、４５５：ゲイン／ロス算出部、４５６、４５７：ゲイン／ロス挿入部 1: elevator, 2: intercom, 3: elevator operation control unit, 4: voice communication device, 5: communication network, 6: monitoring center, 7: telephone terminal, 21: speaker, 22: microphone, 23: emergency call button, 41: Learning opportunity determination unit, 42: Call control unit, 43: Learning signal generation unit, 44: Switch, 45: Signal processing unit, 46: D / A converter, 47: A / D converter, 48: Parameter holding 49: Communication line interface 401: Processor 402: Memory 403: A / D / D / A conversion LSI 404: Switch open / close detection LSI 405: Network interface A 406: Network interface B 407: Digital Signal processor, 451: adaptive filter, 452: subtractor, 453: adaptive operation availability determination unit 454: residual echo suppressor, 455: Gain / loss calculation unit, 456, 457: Gain / Loss insertion portion

この発明に係る音声通話装置は、エレベータ用の音声通話装置であり、信号のエコーの経路に依存するパラメータを保持するパラメータ保持部と、前記パラメータを用いて音声通話時の音声信号を処理する信号処理部と、自装置が設置されたエレベータ内の環境に応じて、前記音声通話に先立って前記パラメータの学習を行うことが可能か否かを判定する学習契機判定部と、前記学習契機判定部で学習を行うことが可能と判定されたとき、前記パラメータの学習を行う学習信号を発生する学習信号発生部と、通話の開始と終了を制御する通話制御部とを備え、前記学習契機判定部は、前記通話制御部が無通話状態であることを示し、前記エレベータが停止中であり、前記エレベータのドアが閉まった状態であり、前記エレベータ内のボタンが押下されてから所定の時間が経過したとき、前記エレベータ内の環境が前記パラメータの学習が可能と判定し、前記パラメータ保持部は、前記信号処理部が学習するエコー経路のインパルス応答のうち、所定の遅延時間内にある一部のインパルス応答値を保持することを特徴とする。 A voice call device according to the present invention is a voice call device for an elevator, a parameter holding unit for holding a parameter depending on a signal echo path, and a signal for processing a voice signal at the time of a voice call using the parameter A processing unit, a learning opportunity determination unit that determines whether or not the parameter can be learned prior to the voice call according to an environment in an elevator in which the device is installed, and the learning opportunity determination unit A learning signal generator for generating a learning signal for learning the parameter, and a call control unit for controlling the start and end of the call, when the learning is determined to be possible. Indicates that the call control unit is in a no-call state, the elevator is stopped, the elevator door is closed, and a button in the elevator is When a predetermined time has elapsed since the time when the signal is lowered, it is determined that the environment in the elevator is capable of learning the parameter, and the parameter holding unit is configured to select a predetermined value among the impulse responses of the echo path learned by the signal processing unit. It is characterized in that a part of impulse response values within the delay time are held .

この発明によれば、従来技術よりも小さなＨ／Ｗ（Ｈａｒｄｗａｒｅ）規模で、ユーザに違和感を与えることなく適切な時間帯で自動的に初期学習を行うことができる。 According to the present invention, it is possible to automatically perform initial learning in an appropriate time zone without giving a sense of incongruity to the user on a H / W (Hardware) scale smaller than that of the prior art .

Claims

信号のエコーの経路に依存するパラメータを保持するパラメータ保持部と、
前記パラメータを用いて音声通話時の音声信号を処理する信号処理部と、
自装置が設置されたエレベータ内の環境に応じて、前記音声通話に先立って前記パラメータの学習を行うことが可能か否かを判定する学習契機判定部と、
前記学習契機判定部で学習を行うことが可能と判定されたとき、前記パラメータの学習を行う学習信号を発生する学習信号発生部と、
を備えたことを特徴とするエレベータ用の音声通話装置。A parameter holding unit for holding parameters depending on the path of the echo of the signal;
A signal processing unit for processing an audio signal during a voice call using the parameters;
A learning opportunity determination unit that determines whether or not the parameter can be learned prior to the voice call according to an environment in an elevator in which the device is installed;
A learning signal generation unit that generates a learning signal for learning the parameter when it is determined that the learning opportunity determination unit can perform learning;
A voice communication device for an elevator, comprising:

前記学習契機判定部は、前記エレベータ内に人がいないときに、前記パラメータの学習を行うことが可能と判定すること
を特徴とする請求項１に記載の音声通話装置。The voice call device according to claim 1, wherein the learning opportunity determination unit determines that the parameter can be learned when there is no person in the elevator.

前記学習契機判定部は、
前記エレベータが停止中であり、
前記エレベータのドアが閉まった状態であり、
前記エレベータ内のボタンが押下されてから所定の時間が経過したとき、
前記エレベータ内の環境が前記パラメータの学習が可能と判定することを特徴とする請求項１または請求項２に記載の音声通話装置。The learning opportunity determination unit
The elevator is stopped,
The elevator door is closed,
When a predetermined time has elapsed since the button in the elevator was pressed,
The voice communication device according to claim 1, wherein the environment in the elevator determines that the parameter can be learned.

前記学習契機判定部は、
前記エレベータが停止中であり、
前記エレベータのドアが閉まった状態であり、
前記エレベータ内の重量が所定値以下であるとき、
前記エレベータ内の環境が前記パラメータの学習が可能と判定することを特徴とする請求項１または請求項２に記載の音声通話装置。The learning opportunity determination unit
The elevator is stopped,
The elevator door is closed,
When the weight in the elevator is below a predetermined value,
The voice communication device according to claim 1, wherein the environment in the elevator determines that the parameter can be learned.

前記学習契機判定部は、
前記エレベータが停止中であり、
前記エレベータのドアが閉まった状態であり、
前記エレベータ内に設置されたカメラの画像が無人であるとき、
前記エレベータ内の環境が前記パラメータの学習が可能と判定することを特徴とする請求項１または請求項２に記載の音声通話装置。The learning opportunity determination unit
The elevator is stopped,
The elevator door is closed,
When the image of the camera installed in the elevator is unattended,
The voice communication device according to claim 1, wherein the environment in the elevator determines that the parameter can be learned.

前記パラメータは前記学習信号が前記音声通話装置から送信される時点から前記音声通話装置で受信される時点までに前記学習信号が通る経路のインパルス応答の値である
ことを特徴とする請求項１乃至５のいずれか１項に記載の音声通話装置。The parameter is an impulse response value of a path through which the learning signal passes from a time point when the learning signal is transmitted from the voice call device to a time point when the learning signal is received by the voice call device. 6. The voice communication device according to any one of 5 above.

前記パラメータは、前記インパルス応答の値の中で遅延時間の短い一部のインパルス応答の値であることを特徴とする請求項６に記載の音声通話装置。 The voice communication device according to claim 6, wherein the parameter is a value of a part of impulse response having a short delay time among the values of the impulse response.

前記パラメータは前記学習信号が前記音声通話装置から送信される時点から前記音声通話装置で受信される時点までに前記学習信号が通る経路のゲインである
ことを特徴とする請求項１乃至５のいずれか１項に記載の音声通話装置。6. The parameter according to claim 1, wherein the parameter is a gain of a path through which the learning signal passes from a time when the learning signal is transmitted from the voice communication device to a time when the learning signal is received by the voice communication device. The voice communication device according to claim 1.

前記エレベータ内に人がいるときに、前記学習信号発生部はチャープ信号を前記学習信号として用いる
ことを特徴とする請求項１に記載の音声通話装置。The voice communication device according to claim 1, wherein when there is a person in the elevator, the learning signal generation unit uses a chirp signal as the learning signal.